sense disambiguation corpus: Topics by WorldWideScience.org

Sample records for sense disambiguation corpus

Word Domain Disambiguation via Word Sense Disambiguation

Energy Technology Data Exchange (ETDEWEB)

Sanfilippo, Antonio P.; Tratz, Stephen C.; Gregory, Michelle L.

2006-06-04

Word subject domains have been widely used to improve the perform-ance of word sense disambiguation al-gorithms. However, comparatively little effort has been devoted so far to the disambiguation of word subject do-mains. The few existing approaches have focused on the development of al-gorithms specific to word domain dis-ambiguation. In this paper we explore an alternative approach where word domain disambiguation is achieved via word sense disambiguation. Our study shows that this approach yields very strong results, suggesting that word domain disambiguation can be ad-dressed in terms of word sense disam-biguation with no need for special purpose algorithms.
Biomedical word sense disambiguation with ontologies and metadata: automation meets accuracy

Directory of Open Access Journals (Sweden)

Hakenberg Jörg

2009-01-01

Full Text Available Abstract Background Ontology term labels can be ambiguous and have multiple senses. While this is no problem for human annotators, it is a challenge to automated methods, which identify ontology terms in text. Classical approaches to word sense disambiguation use co-occurring words or terms. However, most treat ontologies as simple terminologies, without making use of the ontology structure or the semantic similarity between terms. Another useful source of information for disambiguation are metadata. Here, we systematically compare three approaches to word sense disambiguation, which use ontologies and metadata, respectively. Results The 'Closest Sense' method assumes that the ontology defines multiple senses of the term. It computes the shortest path of co-occurring terms in the document to one of these senses. The 'Term Cooc' method defines a log-odds ratio for co-occurring terms including co-occurrences inferred from the ontology structure. The 'MetaData' approach trains a classifier on metadata. It does not require any ontology, but requires training data, which the other methods do not. To evaluate these approaches we defined a manually curated training corpus of 2600 documents for seven ambiguous terms from the Gene Ontology and MeSH. All approaches over all conditions achieve 80% success rate on average. The 'MetaData' approach performed best with 96%, when trained on high-quality data. Its performance deteriorates as quality of the training data decreases. The 'Term Cooc' approach performs better on Gene Ontology (92% success than on MeSH (73% success as MeSH is not a strict is-a/part-of, but rather a loose is-related-to hierarchy. The 'Closest Sense' approach achieves on average 80% success rate. Conclusion Metadata is valuable for disambiguation, but requires high quality training data. Closest Sense requires no training, but a large, consistently modelled ontology, which are two opposing conditions. Term Cooc achieves greater 90
WORD SENSE DISAMBIGUATION FOR TAMIL LANGUAGE USING PART-OF-SPEECH AND CLUSTERING TECHNIQUE

Directory of Open Access Journals (Sweden)

P. ISWARYA

2017-09-01

Full Text Available Word sense disambiguation is an important task in Natural Language Processing (NLP, and this paper concentrates on the problem of target word selection in machine translation. The proposed method called enhanced Word Sense Disambiguation with Part-of-Speech and Clustering based Sensecollocation (WSDPCS consists of two steps namely (i Part-of-Speech (POS tagger in disambiguating word senses and (ii Enhanced with Clustering and Sense-collocation dictionary based disambiguation. In the first step an ambiguous Tamil words are disambiguated using Tamil and English POS Tagger. If it has same type of POS category labels, then it passes the word to the next step. In the second step ambiguity is resolved using sense-collocation dictionary. The experimental analysis shows that the accuracy of proposed WSDPCS method achieves 1.86% improvement over an existing method.
Word Sense Disambiguation Based on Large Scale Polish CLARIN Heterogeneous Lexical Resources

Directory of Open Access Journals (Sweden)

Paweł Kędzia

2015-12-01

Full Text Available Word Sense Disambiguation Based on Large Scale Polish CLARIN Heterogeneous Lexical Resources Lexical resources can be applied in many different Natural Language Engineering tasks, but the most fundamental task is the recognition of word senses used in text contexts. The problem is difficult, not yet fully solved and different lexical resources provided varied support for it. Polish CLARIN lexical semantic resources are based on the plWordNet — a very large wordnet for Polish — as a central structure which is a basis for linking together several resources of different types. In this paper, several Word Sense Disambiguation (henceforth WSD methods developed for Polish that utilise plWordNet are discussed. Textual sense descriptions in the traditional lexicon can be compared with text contexts using Lesk’s algorithm in order to find best matching senses. In the case of a wordnet, lexico-semantic relations provide the main description of word senses. Thus, first, we adapted and applied to Polish a WSD method based on the Page Rank. According to it, text words are mapped on their senses in the plWordNet graph and Page Rank algorithm is run to find senses with the highest scores. The method presents results lower but comparable to those reported for English. The error analysis showed that the main problems are: fine grained sense distinctions in plWordNet and limited number of connections between words of different parts of speech. In the second approach plWordNet expanded with the mapping onto the SUMO ontology concepts was used. Two scenarios for WSD were investigated: two step disambiguation and disambiguation based on combined networks of plWordNet and SUMO. In the former scenario, words are first assigned SUMO concepts and next plWordNet senses are disambiguated. In latter, plWordNet and SUMO are combined in one large network used next for the disambiguation of senses. The additional knowledge sources used in WSD improved the performance
A Semantic Lexicon-Based Approach for Sense Disambiguation and Its WWW Application

Science.gov (United States)

di Lecce, Vincenzo; Calabrese, Marco; Soldo, Domenico

This work proposes a basic framework for resolving sense disambiguation through the use of Semantic Lexicon, a machine readable dictionary managing both word senses and lexico-semantic relations. More specifically, polysemous ambiguity characterizing Web documents is discussed. The adopted Semantic Lexicon is WordNet, a lexical knowledge-base of English words widely adopted in many research studies referring to knowledge discovery. The proposed approach extends recent works on knowledge discovery by focusing on the sense disambiguation aspect. By exploiting the structure of WordNet database, lexico-semantic features are used to resolve the inherent sense ambiguity of written text with particular reference to HTML resources. The obtained results may be extended to generic hypertextual repositories as well. Experiments show that polysemy reduction can be used to hint about the meaning of specific senses in given contexts.
Knowledge-based biomedical word sense disambiguation: comparison of approaches

Directory of Open Access Journals (Sweden)

Aronson Alan R

2010-11-01

Full Text Available Abstract Background Word sense disambiguation (WSD algorithms attempt to select the proper sense of ambiguous terms in text. Resources like the UMLS provide a reference thesaurus to be used to annotate the biomedical literature. Statistical learning approaches have produced good results, but the size of the UMLS makes the production of training data infeasible to cover all the domain. Methods We present research on existing WSD approaches based on knowledge bases, which complement the studies performed on statistical learning. We compare four approaches which rely on the UMLS Metathesaurus as the source of knowledge. The first approach compares the overlap of the context of the ambiguous word to the candidate senses based on a representation built out of the definitions, synonyms and related terms. The second approach collects training data for each of the candidate senses to perform WSD based on queries built using monosemous synonyms and related terms. These queries are used to retrieve MEDLINE citations. Then, a machine learning approach is trained on this corpus. The third approach is a graph-based method which exploits the structure of the Metathesaurus network of relations to perform unsupervised WSD. This approach ranks nodes in the graph according to their relative structural importance. The last approach uses the semantic types assigned to the concepts in the Metathesaurus to perform WSD. The context of the ambiguous word and semantic types of the candidate concepts are mapped to Journal Descriptors. These mappings are compared to decide among the candidate concepts. Results are provided estimating accuracy of the different methods on the WSD test collection available from the NLM. Conclusions We have found that the last approach achieves better results compared to the other methods. The graph-based approach, using the structure of the Metathesaurus network to estimate the relevance of the Metathesaurus concepts, does not perform well
Applying active learning to supervised word sense disambiguation in MEDLINE.

Science.gov (United States)

Chen, Yukun; Cao, Hongxin; Mei, Qiaozhu; Zheng, Kai; Xu, Hua

2013-01-01

This study was to assess whether active learning strategies can be integrated with supervised word sense disambiguation (WSD) methods, thus reducing the number of annotated samples, while keeping or improving the quality of disambiguation models. We developed support vector machine (SVM) classifiers to disambiguate 197 ambiguous terms and abbreviations in the MSH WSD collection. Three different uncertainty sampling-based active learning algorithms were implemented with the SVM classifiers and were compared with a passive learner (PL) based on random sampling. For each ambiguous term and each learning algorithm, a learning curve that plots the accuracy computed from the test set as a function of the number of annotated samples used in the model was generated. The area under the learning curve (ALC) was used as the primary metric for evaluation. Our experiments demonstrated that active learners (ALs) significantly outperformed the PL, showing better performance for 177 out of 197 (89.8%) WSD tasks. Further analysis showed that to achieve an average accuracy of 90%, the PL needed 38 annotated samples, while the ALs needed only 24, a 37% reduction in annotation effort. Moreover, we analyzed cases where active learning algorithms did not achieve superior performance and identified three causes: (1) poor models in the early learning stage; (2) easy WSD cases; and (3) difficult WSD cases, which provide useful insight for future improvements. This study demonstrated that integrating active learning strategies with supervised WSD methods could effectively reduce annotation cost and improve the disambiguation models.
New Perspectives on Computational and Cognitive Strategies for Word Sense Disambiguation

CERN Document Server

Kwong, Oi Yee

2013-01-01

Cognitive and Computational Strategies for Word Sense Disambiguation examines cognitive strategies by humans and computational strategies by machines, for WSD in parallel. Focusing on a psychologically valid property of words and senses, author Oi Yee Kwong discusses their concreteness or abstractness and draws on psycholinguistic data to examine the extent to which existing lexical resources resemble the mental lexicon as far as the concreteness distinction is concerned. The text also investigates the contribution of different knowledge sources to WSD in relation to this very intrinsic nature of words and senses.
Lexicographic Approaches to Sense Disambiguation in Monolingual Dictionaries and Equivalent Differentiation in Bilingual Dictionaries

Directory of Open Access Journals (Sweden)

Marjeta Vrbinc

2011-05-01

Full Text Available The article discusses methods of sense disambiguation in monolingual dictionaries and equivalent differentiation in bilingual dictionaries. In current dictionaries, sense disambiguation and equivalent differentiation is presented in the form of specifiers or glosses, collocators or indications of context, (domain labels, metalinguistic and encyclopaedic information. Each method is presented and illustrated by actual samples of dictionary articles taken from mono and bilingual dictionaries. The last part of the article is devoted to equivalent differentiation in bilingual decoding dictionaries. In bilingual dictionaries, equivalent differentiation is often needed to describe the lack of agreement between the source language (SL and target language (TL. The article concludes by stating that equivalent differentiation should be written in the native language of the target audience and sense indicators in a monolingual learner’s dictionary should be words that the users are most familiar with.
Applying active learning to supervised word sense disambiguation in MEDLINE

Science.gov (United States)

Chen, Yukun; Cao, Hongxin; Mei, Qiaozhu; Zheng, Kai; Xu, Hua

2013-01-01

Objectives This study was to assess whether active learning strategies can be integrated with supervised word sense disambiguation (WSD) methods, thus reducing the number of annotated samples, while keeping or improving the quality of disambiguation models. Methods We developed support vector machine (SVM) classifiers to disambiguate 197 ambiguous terms and abbreviations in the MSH WSD collection. Three different uncertainty sampling-based active learning algorithms were implemented with the SVM classifiers and were compared with a passive learner (PL) based on random sampling. For each ambiguous term and each learning algorithm, a learning curve that plots the accuracy computed from the test set as a function of the number of annotated samples used in the model was generated. The area under the learning curve (ALC) was used as the primary metric for evaluation. Results Our experiments demonstrated that active learners (ALs) significantly outperformed the PL, showing better performance for 177 out of 197 (89.8%) WSD tasks. Further analysis showed that to achieve an average accuracy of 90%, the PL needed 38 annotated samples, while the ALs needed only 24, a 37% reduction in annotation effort. Moreover, we analyzed cases where active learning algorithms did not achieve superior performance and identified three causes: (1) poor models in the early learning stage; (2) easy WSD cases; and (3) difficult WSD cases, which provide useful insight for future improvements. Conclusions This study demonstrated that integrating active learning strategies with supervised WSD methods could effectively reduce annotation cost and improve the disambiguation models. PMID:23364851
Word Sense Disambiguation with LSTM : Do We Really Need 100 Billion Words?

NARCIS (Netherlands)

Le, Minh; Postma, Marten; Urbani, Jacopo

2017-01-01

Recently, Yuan et al. (2016) have shown the effectiveness of using Long Short-Term Memory (LSTM) for performing Word Sense Disambiguation (WSD). Their proposed technique outperformed the previous state-of-the-art with several benchmarks, but neither the training data nor the source code was
Word embeddings and recurrent neural networks based on Long-Short Term Memory nodes in supervised biomedical word sense disambiguation.

Science.gov (United States)

Jimeno Yepes, Antonio

2017-09-01

Word sense disambiguation helps identifying the proper sense of ambiguous words in text. With large terminologies such as the UMLS Metathesaurus ambiguities appear and highly effective disambiguation methods are required. Supervised learning algorithm methods are used as one of the approaches to perform disambiguation. Features extracted from the context of an ambiguous word are used to identify the proper sense of such a word. The type of features have an impact on machine learning methods, thus affect disambiguation performance. In this work, we have evaluated several types of features derived from the context of the ambiguous word and we have explored as well more global features derived from MEDLINE using word embeddings. Results show that word embeddings improve the performance of more traditional features and allow as well using recurrent neural network classifiers based on Long-Short Term Memory (LSTM) nodes. The combination of unigrams and word embeddings with an SVM sets a new state of the art performance with a macro accuracy of 95.97 in the MSH WSD data set. Copyright © 2017 Elsevier Inc. All rights reserved.
PNNL: A Supervised Maximum Entropy Approach to Word Sense Disambiguation

Energy Technology Data Exchange (ETDEWEB)

Tratz, Stephen C.; Sanfilippo, Antonio P.; Gregory, Michelle L.; Chappell, Alan R.; Posse, Christian; Whitney, Paul D.

2007-06-23

In this paper, we described the PNNL Word Sense Disambiguation system as applied to the English All-Word task in Se-mEval 2007. We use a supervised learning approach, employing a large number of features and using Information Gain for dimension reduction. Our Maximum Entropy approach combined with a rich set of features produced results that are significantly better than baseline and are the highest F-score for the fined-grained English All-Words subtask.
Co-occurrence graphs for word sense disambiguation in the biomedical domain.

Science.gov (United States)

Duque, Andres; Stevenson, Mark; Martinez-Romo, Juan; Araujo, Lourdes

2018-05-01

Word sense disambiguation is a key step for many natural language processing tasks (e.g. summarization, text classification, relation extraction) and presents a challenge to any system that aims to process documents from the biomedical domain. In this paper, we present a new graph-based unsupervised technique to address this problem. The knowledge base used in this work is a graph built with co-occurrence information from medical concepts found in scientific abstracts, and hence adapted to the specific domain. Unlike other unsupervised approaches based on static graphs such as UMLS, in this work the knowledge base takes the context of the ambiguous terms into account. Abstracts downloaded from PubMed are used for building the graph and disambiguation is performed using the personalized PageRank algorithm. Evaluation is carried out over two test datasets widely explored in the literature. Different parameters of the system are also evaluated to test robustness and scalability. Results show that the system is able to outperform state-of-the-art knowledge-based systems, obtaining more than 10% of accuracy improvement in some cases, while only requiring minimal external resources. Copyright © 2018 Elsevier B.V. All rights reserved.
Methods of defining ontologies, word disambiguation methods, computer systems, and articles of manufacture

Science.gov (United States)

Sanfilippo, Antonio P [Richland, WA; Tratz, Stephen C [Richland, WA; Gregory, Michelle L [Richland, WA; Chappell, Alan R [Seattle, WA; Whitney, Paul D [Richland, WA; Posse, Christian [Seattle, WA; Baddeley, Robert L [Richland, WA; Hohimer, Ryan E [West Richland, WA

2011-10-11

Methods of defining ontologies, word disambiguation methods, computer systems, and articles of manufacture are described according to some aspects. In one aspect, a word disambiguation method includes accessing textual content to be disambiguated, wherein the textual content comprises a plurality of words individually comprising a plurality of word senses, for an individual word of the textual content, identifying one of the word senses of the word as indicative of the meaning of the word in the textual content, for the individual word, selecting one of a plurality of event classes of a lexical database ontology using the identified word sense of the individual word, and for the individual word, associating the selected one of the event classes with the textual content to provide disambiguation of a meaning of the individual word in the textual content.
From Word Alignment to Word Senses, via Multilingual Wordnets

Directory of Open Access Journals (Sweden)

Dan Tufis

2006-05-01

Full Text Available Most of the successful commercial applications in language processing (text and/or speech dispense with any explicit concern on semantics, with the usual motivations stemming from the computational high costs required for dealing with semantics, in case of large volumes of data. With recent advances in corpus linguistics and statistical-based methods in NLP, revealing useful semantic features of linguistic data is becoming cheaper and cheaper and the accuracy of this process is steadily improving. Lately, there seems to be a growing acceptance of the idea that multilingual lexical ontologisms might be the key towards aligning different views on the semantic atomic units to be used in characterizing the general meaning of various and multilingual documents. Depending on the granularity at which semantic distinctions are necessary, the accuracy of the basic semantic processing (such as word sense disambiguation can be very high with relatively low complexity computing. The paper substantiates this statement by presenting a statistical/based system for word alignment and word sense disambiguation in parallel corpora. We describe a word alignment platform which ensures text pre-processing (tokenization, POS-tagging, lemmatization, chunking, sentence and word alignment as required by an accurate word sense disambiguation.
Semantic Drift in Espresso-style Bootstrapping: Graph-theoretic Analysis and Evaluation in Word Sense Disambiguation

Science.gov (United States)

Komachi, Mamoru; Kudo, Taku; Shimbo, Masashi; Matsumoto, Yuji

Bootstrapping has a tendency, called semantic drift, to select instances unrelated to the seed instances as the iteration proceeds. We demonstrate the semantic drift of Espresso-style bootstrapping has the same root as the topic drift of Kleinberg's HITS, using a simplified graph-based reformulation of bootstrapping. We confirm that two graph-based algorithms, the von Neumann kernels and the regularized Laplacian, can reduce the effect of semantic drift in the task of word sense disambiguation (WSD) on Senseval-3 English Lexical Sample Task. Proposed algorithms achieve superior performance to Espresso and previous graph-based WSD methods, even though the proposed algorithms have less parameters and are easy to calibrate.
Human-machine interaction to disambiguate entities in unstructured text and structured datasets

Science.gov (United States)

Ward, Kevin; Davenport, Jack

2017-05-01

Creating entity network graphs is a manual, time consuming process for an intelligence analyst. Beyond the traditional big data problems of information overload, individuals are often referred to by multiple names and shifting titles as they advance in their organizations over time which quickly makes simple string or phonetic alignment methods for entities insufficient. Conversely, automated methods for relationship extraction and entity disambiguation typically produce questionable results with no way for users to vet results, correct mistakes or influence the algorithm's future results. We present an entity disambiguation tool, DRADIS, which aims to bridge the gap between human-centric and machinecentric methods. DRADIS automatically extracts entities from multi-source datasets and models them as a complex set of attributes and relationships. Entities are disambiguated across the corpus using a hierarchical model executed in Spark allowing it to scale to operational sized data. Resolution results are presented to the analyst complete with sourcing information for each mention and relationship allowing analysts to quickly vet the correctness of results as well as correct mistakes. Corrected results are used by the system to refine the underlying model allowing analysts to optimize the general model to better deal with their operational data. Providing analysts with the ability to validate and correct the model to produce a system they can trust enables them to better focus their time on producing higher quality analysis products.
Disregarding the Corpus: Head-word and Sense Treatment in Shona Monolingual Lexicography*

Directory of Open Access Journals (Sweden)

Webster M. Mavhu

2011-10-01

Full Text Available
Abstract: With specific reference to Shona monolingual lexicography, this article discusses how corpus-based lexicographers might, in some instances, decide not strictly to adhere to the corpus when it comes to headword and sense treatment. The writer is a member of the African Languages Research Institute (ALRI, formerly known as the African Languages Lexical (ALLEX Project. ALRI is a nonfaculty interdisciplinary unit dedicated to research on and the development of African languages in Zimbabwe. The writer is part of the six-member team that compiled the now published Shona monolingual, synchronic, medium-sized and general-purpose dictionary Du-ramazwi Guru ReChiShona (2001. The article originates from the writer's experience of working on this dictionary. The article highlights the fact that being corpus-based does not necessarily imply being corpus-bound.
Keywords: CORPUS, CORPUS-BASED, FREQUENCY, HEADWORD, LEXICOGRAPHY, SENSE, SHONA, SLANG, SYNONYMS
Opsomming: Verontagsaming van die korpus: Trefwoord- en betekenisbe-handeling in die Sjona- eentalige leksikografie. Met spesifieke verwysing na die Sjona- eentalige leksikografie bespreek hierdie artikel hoe korpusgebaseerde leksikograwe in som-mige gevalle kan besluit om nie streng by die korpus te bly wanneer dit kom by trefwoord- en betekenisbehandeling nie. Die skrywer is 'n lid van die African Language Research Institute (AL-RI, vroeër bekend as die African Languages Lexical (ALLEX Project. ALRI is 'n niefakulteits- interdissiplinêre eenheid wat hom beywer vir navorsing oor en die ontwikkeling van die Afrikatale in Zimbabwe. Die skrywer is deel van 'n span van ses lede wat die reeds gepubliseerde Sjona- een-talige, sinchroniese, middelgroot en meerdoelige woordeboek Duramazwi Guru ReChiShona (2001 saamgestel het. Die artikel het uit die skrywer se ervaring van werk aan hierdie woordeboek ont-staan. Die artikel belig die feit dat korpusgebaseerdheid nie noodwendig
A link between visual disambiguation and visual memory.

Science.gov (United States)

Hegdé, Jay; Kersten, Daniel

2010-11-10

Sensory information in the retinal image is typically too ambiguous to support visual object recognition by itself. Theories of visual disambiguation posit that to disambiguate, and thus interpret, the incoming images, the visual system must integrate the sensory information with previous knowledge of the visual world. However, the underlying neural mechanisms remain unclear. Using functional magnetic resonance imaging (fMRI) of human subjects, we have found evidence for functional specialization for storing disambiguating information in memory versus interpreting incoming ambiguous images. Subjects viewed two-tone, "Mooney" images, which are typically ambiguous when seen for the first time but are quickly disambiguated after viewing the corresponding unambiguous color images. Activity in one set of regions, including a region in the medial parietal cortex previously reported to play a key role in Mooney image disambiguation, closely reflected memory for previously seen color images but not the subsequent disambiguation of Mooney images. A second set of regions, including the superior temporal sulcus, showed the opposite pattern, in that their responses closely reflected the subjects' percepts of the disambiguated Mooney images on a stimulus-to-stimulus basis but not the memory of the corresponding color images. Functional connectivity between the two sets of regions was stronger during those trials in which the disambiguated percept was stronger. This functional interaction between brain regions that specialize in storing disambiguating information in memory versus interpreting incoming ambiguous images may represent a general mechanism by which previous knowledge disambiguates visual sensory information.

Disambiguate: An open-source application for disambiguating two species in next generation sequencing data from grafted samples.

Science.gov (United States)

Ahdesmäki, Miika J; Gray, Simon R; Johnson, Justin H; Lai, Zhongwu

2016-01-01

Grafting of cell lines and primary tumours is a crucial step in the drug development process between cell line studies and clinical trials. Disambiguate is a program for computationally separating the sequencing reads of two species derived from grafted samples. Disambiguate operates on DNA or RNA-seq alignments to the two species and separates the components at very high sensitivity and specificity as illustrated in artificially mixed human-mouse samples. This allows for maximum recovery of data from target tumours for more accurate variant calling and gene expression quantification. Given that no general use open source algorithm accessible to the bioinformatics community exists for the purposes of separating the two species data, the proposed Disambiguate tool presents a novel approach and improvement to performing sequence analysis of grafted samples. Both Python and C++ implementations are available and they are integrated into several open and closed source pipelines. Disambiguate is open source and is freely available at https://github.com/AstraZeneca-NGS/disambiguate.
The C-Cat Wordnet Package: An Open Source Package for modifying andapplying Wordnet

Energy Technology Data Exchange (ETDEWEB)

Stevens, K; Huang, T; Buttler, D

2011-09-16

We present the C-Cat Wordnet package, an open source library for using and modifying Wordnet. The package includes four key features: an API for modifying Synsets; implementations of standard similarity metrics, implementations of well known Word Sense Disambiguation algorithms, and an implementation of the Castanet algorithm. The library is easily extendible and usable in many runtime environments. We demonstrate it's use on two standard Word Sense Disambiguation tasks and apply the Castanet algorithm to a corpus.
Using Distributed Representations to Disambiguate Biomedical and Clinical Concepts

OpenAIRE

Tulkens, Stéphan; Šuster, Simon; Daelemans, Walter

2016-01-01

In this paper, we report a knowledge-based method for Word Sense Disambiguation in the domains of biomedical and clinical text. We combine word representations created on large corpora with a small number of definitions from the UMLS to create concept representations, which we then compare to representations of the context of ambiguous terms. Using no relational information, we obtain comparable performance to previous approaches on the MSH-WSD dataset, which is a well-known dataset in the bi...
A hybrid approach for robust multilingual toponym extraction and disambiguation

NARCIS (Netherlands)

Habib, Mena Badieh; van Keulen, Maurice

Toponym extraction and disambiguation are key topics recently addressed by fields of Information Extraction and Geographical Information Retrieval. Toponym extraction and disambiguation are highly dependent processes. Not only toponym extraction effectiveness affects disambiguation, but also
Disambiguate: An open-source application for disambiguating two species in next generation sequencing data from grafted samples [version 2; referees: 3 approved

Directory of Open Access Journals (Sweden)

Miika J. Ahdesmäki

2017-01-01

Full Text Available Grafting of cell lines and primary tumours is a crucial step in the drug development process between cell line studies and clinical trials. Disambiguate is a program for computationally separating the sequencing reads of two species derived from grafted samples. Disambiguate operates on DNA or RNA-seq alignments to the two species and separates the components at very high sensitivity and specificity as illustrated in artificially mixed human-mouse samples. This allows for maximum recovery of data from target tumours for more accurate variant calling and gene expression quantification. Given that no general use open source algorithm accessible to the bioinformatics community exists for the purposes of separating the two species data, the proposed Disambiguate tool presents a novel approach and improvement to performing sequence analysis of grafted samples. Both Python and C++ implementations are available and they are integrated into several open and closed source pipelines. Disambiguate is open source and is freely available at https://github.com/AstraZeneca-NGS/disambiguate.
Disambiguate: An open-source application for disambiguating two species in next generation sequencing data from grafted samples [version 1; referees: 2 approved

Directory of Open Access Journals (Sweden)

Miika J. Ahdesmäki

2016-11-01

Full Text Available Grafting of cell lines and primary tumours is a crucial step in the drug development process between cell line studies and clinical trials. Disambiguate is a program for computationally separating the sequencing reads of two species derived from grafted samples. Disambiguate operates on alignments to the two species and separates the components at very high sensitivity and specificity as illustrated in artificially mixed human-mouse samples. This allows for maximum recovery of data from target tumours for more accurate variant calling and gene expression quantification. Given that no general use open source algorithm accessible to the bioinformatics community exists for the purposes of separating the two species data, the proposed Disambiguate tool presents a novel approach and improvement to performing sequence analysis of grafted samples. Both Python and C++ implementations are available and they are integrated into several open and closed source pipelines. Disambiguate is open source and is freely available at https://github.com/AstraZeneca-NGS/disambiguate.
Authormagic – An Approach to Author Disambiguation in Large-Scale Digital Libraries

CERN Document Server

Weiler, Henning; Mele, Salvatore

2011-01-01

A collaboration of leading research centers in the field of High Energy Physics (HEP) has built INSPIRE, a novel information infrastructure, which comprises the entire corpus of about one million documents produced within the discipline, including a rich set of metadata, citation information and half a million full-text documents, and offers a unique opportunity for author disambiguation strategies. The presented approach features extended metadata comparison metrics and a three-step unsupervised graph clustering technique. The algorithm aided in identifying 200'000 individuals from 6'500'000 author signatures. Preliminary tests based on knowledge of external experts and a pilot of a crowd-sourcing system show a success rate of more than 96% within the selected test cases. The obtained author clusters serve as a recommendation for INSPIRE users to further clean the publication list in a crowd-sourced approach.
Toponym Extraction and Disambiguation Enhancement Using Loops of Feedback

NARCIS (Netherlands)

Habib, Mena Badieh; van Keulen, Maurice; Fred, A.; Dietz, J.L.G.; Liu, K.; Filipe, J.

2013-01-01

Toponym extraction and disambiguation have received much attention in recent years. Typical fields addressing these topics are information retrieval, natural language processing, and semantic web. This paper addresses two problems with toponym extraction and disambiguation. First, almost no existing
Document clustering methods, document cluster label disambiguation methods, document clustering apparatuses, and articles of manufacture

Science.gov (United States)

Sanfilippo, Antonio [Richland, WA; Calapristi, Augustin J [West Richland, WA; Crow, Vernon L [Richland, WA; Hetzler, Elizabeth G [Kennewick, WA; Turner, Alan E [Kennewick, WA

2009-12-22

Document clustering methods, document cluster label disambiguation methods, document clustering apparatuses, and articles of manufacture are described. In one aspect, a document clustering method includes providing a document set comprising a plurality of documents, providing a cluster comprising a subset of the documents of the document set, using a plurality of terms of the documents, providing a cluster label indicative of subject matter content of the documents of the cluster, wherein the cluster label comprises a plurality of word senses, and selecting one of the word senses of the cluster label.
Within- and cross-modal distance information disambiguate visual size-change perception.

Directory of Open Access Journals (Sweden)

Peter W Battaglia

2010-03-01

Full Text Available Perception is fundamentally underconstrained because different combinations of object properties can generate the same sensory information. To disambiguate sensory information into estimates of scene properties, our brains incorporate prior knowledge and additional "auxiliary" (i.e., not directly relevant to desired scene property sensory information to constrain perceptual interpretations. For example, knowing the distance to an object helps in perceiving its size. The literature contains few demonstrations of the use of prior knowledge and auxiliary information in combined visual and haptic disambiguation and almost no examination of haptic disambiguation of vision beyond "bistable" stimuli. Previous studies have reported humans integrate multiple unambiguous sensations to perceive single, continuous object properties, like size or position. Here we test whether humans use visual and haptic information, individually and jointly, to disambiguate size from distance. We presented participants with a ball moving in depth with a changing diameter. Because no unambiguous distance information is available under monocular viewing, participants rely on prior assumptions about the ball's distance to disambiguate their -size percept. Presenting auxiliary binocular and/or haptic distance information augments participants' prior distance assumptions and improves their size judgment accuracy-though binocular cues were trusted more than haptic. Our results suggest both visual and haptic distance information disambiguate size perception, and we interpret these results in the context of probabilistic perceptual reasoning.
A generic open world named entity disambiguation approach for tweets

NARCIS (Netherlands)

Habib, Mena Badieh; van Keulen, Maurice

Social media is a rich source of information. To make use of this information, it is sometimes required to extract and disambiguate named entities. In this paper we focus on named entity disambiguation (NED) in twitter messages. NED in tweets is challenging in two ways. First, the limited length of
A human language corpus for interstellar message construction

Science.gov (United States)

Elliott, John

2011-02-01

The aim of HuLCC (the human language chorus corpus), is to provide a resource of sufficient size to facilitate inter-language analysis by incorporating languages from all the major language families: for the first time all aspects of typology will be incorporated within a single corpus, adhering to a consistent grammatical classification and granularity, which historically adopt a plethora of disparate schemes. An added feature will be the inclusion of a common text element, which will be translated across all languages, to provide a precise comparable thread for detailed linguistic analysis for translation strategies and a mechanism by which these mappings can be explicitly achieved. Methods developed to solve unambiguous mappings across these languages can then be adopted for any subsequent message authored by the SETI community. Initially, it is planned to provide at least 20,000 words for each chosen language, as this amount of text exceeds the point where randomly generated text can be disambiguated from natural language and is of sufficient size useful for message transmission [1] (Elliot, 2002). This paper details the design of this resource, which ultimately will be made available to SETI upon its completion, and discusses issues 'core' to any message construction.
Robust hybrid name disambiguation framework for large databases

KAUST Repository

Zhu, Jia

2013-10-26

In many databases, science bibliography database for example, name attribute is the most commonly chosen identifier to identify entities. However, names are often ambiguous and not always unique which cause problems in many fields. Name disambiguation is a non-trivial task in data management that aims to properly distinguish different entities which share the same name, particularly for large databases like digital libraries, as only limited information can be used to identify authors\\' name. In digital libraries, ambiguous author names occur due to the existence of multiple authors with the same name or different name variations for the same person. Also known as name disambiguation, most of the previous works to solve this issue often employ hierarchical clustering approaches based on information inside the citation records, e.g. co-authors and publication titles. In this paper, we focus on proposing a robust hybrid name disambiguation framework that is not only applicable for digital libraries but also can be easily extended to other application based on different data sources. We propose a web pages genre identification component to identify the genre of a web page, e.g. whether the page is a personal homepage. In addition, we propose a re-clustering model based on multidimensional scaling that can further improve the performance of name disambiguation. We evaluated our approach on known corpora, and the favorable experiment results indicated that our proposed framework is feasible. © 2013 Akadémiai Kiadó, Budapest, Hungary.
Robust hybrid name disambiguation framework for large databases

KAUST Repository

Zhu, Jia; Yang, Yi; Xie, Qing; Wang, Liwei; Hassan, Saeed-Ul

2013-01-01

In many databases, science bibliography database for example, name attribute is the most commonly chosen identifier to identify entities. However, names are often ambiguous and not always unique which cause problems in many fields. Name disambiguation is a non-trivial task in data management that aims to properly distinguish different entities which share the same name, particularly for large databases like digital libraries, as only limited information can be used to identify authors' name. In digital libraries, ambiguous author names occur due to the existence of multiple authors with the same name or different name variations for the same person. Also known as name disambiguation, most of the previous works to solve this issue often employ hierarchical clustering approaches based on information inside the citation records, e.g. co-authors and publication titles. In this paper, we focus on proposing a robust hybrid name disambiguation framework that is not only applicable for digital libraries but also can be easily extended to other application based on different data sources. We propose a web pages genre identification component to identify the genre of a web page, e.g. whether the page is a personal homepage. In addition, we propose a re-clustering model based on multidimensional scaling that can further improve the performance of name disambiguation. We evaluated our approach on known corpora, and the favorable experiment results indicated that our proposed framework is feasible. © 2013 Akadémiai Kiadó, Budapest, Hungary.
Gamification for Word Sense Labeling

NARCIS (Netherlands)

Venhuizen, Noortje; Basile, Valerio; Evang, Kilian; Bos, Johan; Erk, Kartin; Koller, Alexander

2013-01-01

Obtaining gold standard data for word sense disambiguation is important but costly. We show how it can be done using a “Game with a Purpose” (GWAP) called Wordrobe. This game consists of a large set of multiple-choice questions on word senses generated from the Groningen Meaning Bank. The players
The strength of co-authorship in gene name disambiguation

Directory of Open Access Journals (Sweden)

Farkas Richárd

2008-01-01

Full Text Available Abstract Background A biomedical entity mention in articles and other free texts is often ambiguous. For example, 13% of the gene names (aliases might refer to more than one gene. The task of Gene Symbol Disambiguation (GSD – a special case of Word Sense Disambiguation (WSD – is to assign a unique gene identifier for all identified gene name aliases in biology-related articles. Supervised and unsupervised machine learning WSD techniques have been applied in the biomedical field with promising results. We examine here the utilisation potential of the fact – one of the special features of biological articles – that the authors of the documents are known through graph-based semi-supervised methods for the GSD task. Results Our key hypothesis is that a biologist refers to each particular gene by a fixed gene alias and this holds for the co-authors as well. To make use of the co-authorship information we decided to build the inverse co-author graph on MedLine abstracts. The nodes of the inverse co-author graph are articles and there is an edge between two nodes if and only if the two articles have a mutual author. We introduce here two methods using distances (based on the graph of abstracts for the GSD task. We found that a disambiguation decision can be made in 85% of cases with an extremely high (99.5% precision rate just by using information obtained from the inverse co-author graph. We incorporated the co-authorship information into two GSD systems in order to attain full coverage and in experiments our procedure achieved precision of 94.3%, 98.85%, 96.05% and 99.63% on the human, mouse, fly and yeast GSD evaluation sets, respectively. Conclusion Based on the promising results obtained so far we suggest that the co-authorship information and the circumstances of the articles' release (like the title of the journal, the year of publication can be a crucial building block of any sophisticated similarity measure among biological articles
NEED4Tweet: A Twitterbot for Tweets Named Entity Extraction and Disambiguation

NARCIS (Netherlands)

Habib, Mena Badieh; van Keulen, Maurice

In this demo paper, we present NEED4Tweet, a Twitterbot for named entity extraction (NEE) and disambiguation (NED) for Tweets. The straightforward application of state-of-the-art extraction and disambiguation approaches on informal text widely used in Tweets, typically results in significantly
Named entity extraction and disambiguation: the missing link

NARCIS (Netherlands)

Habib, Mena Badieh; van Keulen, Maurice

2013-01-01

Named entity extraction (NEE) and disambiguation (NED) are two areas of research that are well covered in literature. Typical fields addressing these topics are information retrieval, natural language processing, and semantic web. Although these topics are highly dependent, almost no existing works
Test results for triple-modulation radar electronics with improved range disambiguation

Energy Technology Data Exchange (ETDEWEB)

Pollastrone, Fabio, E-mail: fabio.pollastrone@enea.it; Neri, Carlo

2015-10-15

Highlights: • A new digital radar electronic system based on triple-modulation has been developed. • The triple-modulation system uses an improved algorithm for the range-disambiguation. • The new radar electronics has been applied in the IVVS optical radar prototype for ITER. • The performances obtained with IVVS double and triple-modulation were compared. - Abstract: The In Vessel Viewing System (IVVS) is an optical radar with sub milimetrical resolution that will be used for imaging and metrology pourposes in ITER. The electronics of the system is based on a Digital Radar Electronics developed in ENEA Frascati laboratories during the past years. Until the present study, the system was based on amplitude modulation technique having double-modulation frequency. The power of the laser is sinusoidally modulated and the distance of the points scanned by the laser beam is obtained measuring the phase difference between outgoing and echo signals. Recently a triple-modulation radar electronics version and an algorithm able to solve the range disambiguation were developed. The aim of the upgrade was the increase of the robustness in the range disambiguation. The paper briefly describes the updates carried out on the Digital Radar Electronics and extensively the test results obtained by comparing the performance of the triple modulation versus the double modulation techniques.
Disambiguation of neutralized forms in two Croatian varieties

Directory of Open Access Journals (Sweden)

Višnja Josipović

1994-12-01

Full Text Available A type of northwestern Croatian pronunciation, also known as the Kajkavian accent, was compared with the standard Croatian pronunciation with respect to the strategies used to disambiguate neutralized final obstruents. The two varieties differ in that the former is characterized by the phonological rule of Final Devoicing, where word-final obstruents are realized as voiceles.

Sensing Strategies for Disambiguating among Multiple Objects in Known Poses.

Science.gov (United States)

1985-08-01

ELEMENT. PROIECT. TASK Artificial Inteligence Laboratory AE OKUI UBR 545 Technology Square Cambridge, MA 021.39 11. CONTROLLING OFFICE NAME AND ADDRESS 12...AD-Ali65 912 SENSING STRATEGIES FOR DISAMBIGURTING MONG MULTIPLE 1/1 OBJECTS IN KNOWN POSES(U) MASSACHUSETTS INST OF TECH CAMBRIDGE ARTIFICIAL ...or Dist Special 1 ’ MASSACHUSETTS INSTITUTE OF TECHNOLOGY ARTIFICIAL INTELLIGENCE LABORATORY A. I. Memo 855 August, 1985 Sensing Strategies for
Link-topic model for biomedical abbreviation disambiguation.

Science.gov (United States)

Kim, Seonho; Yoon, Juntae

2015-02-01

The ambiguity of biomedical abbreviations is one of the challenges in biomedical text mining systems. In particular, the handling of term variants and abbreviations without nearby definitions is a critical issue. In this study, we adopt the concepts of topic of document and word link to disambiguate biomedical abbreviations. We newly suggest the link topic model inspired by the latent Dirichlet allocation model, in which each document is perceived as a random mixture of topics, where each topic is characterized by a distribution over words. Thus, the most probable expansions with respect to abbreviations of a given abstract are determined by word-topic, document-topic, and word-link distributions estimated from a document collection through the link topic model. The model allows two distinct modes of word generation to incorporate semantic dependencies among words, particularly long form words of abbreviations and their sentential co-occurring words; a word can be generated either dependently on the long form of the abbreviation or independently. The semantic dependency between two words is defined as a link and a new random parameter for the link is assigned to each word as well as a topic parameter. Because the link status indicates whether the word constitutes a link with a given specific long form, it has the effect of determining whether a word forms a unigram or a skipping/consecutive bigram with respect to the long form. Furthermore, we place a constraint on the model so that a word has the same topic as a specific long form if it is generated in reference to the long form. Consequently, documents are generated from the two hidden parameters, i.e. topic and link, and the most probable expansion of a specific abbreviation is estimated from the parameters. Our model relaxes the bag-of-words assumption of the standard topic model in which the word order is neglected, and it captures a richer structure of text than does the standard topic model by considering
An Abstraction-Based Data Model for Information Retrieval

Science.gov (United States)

McAllister, Richard A.; Angryk, Rafal A.

Language ontologies provide an avenue for automated lexical analysis that may be used to supplement existing information retrieval methods. This paper presents a method of information retrieval that takes advantage of WordNet, a lexical database, to generate paths of abstraction, and uses them as the basis for an inverted index structure to be used in the retrieval of documents from an indexed corpus. We present this method as a entree to a line of research on using ontologies to perform word-sense disambiguation and improve the precision of existing information retrieval techniques.
Disambiguation of Necker cube rotation by monocular and binocular depth cues: relative effectiveness for establishing long-term bias.

Science.gov (United States)

Harrison, Sarah J; Backus, Benjamin T; Jain, Anshul

2011-05-11

The apparent direction of rotation of perceptually bistable wire-frame (Necker) cubes can be conditioned to depend on retinal location by interleaving their presentation with cubes that are disambiguated by depth cues (Haijiang, Saunders, Stone, & Backus, 2006; Harrison & Backus, 2010a). The long-term nature of the learned bias is demonstrated by resistance to counter-conditioning on a consecutive day. In previous work, either binocular disparity and occlusion, or a combination of monocular depth cues that included occlusion, internal occlusion, haze, and depth-from-shading, were used to control the rotation direction of disambiguated cubes. Here, we test the relative effectiveness of these two sets of depth cues in establishing the retinal location bias. Both cue sets were highly effective in establishing a perceptual bias on Day 1 as measured by the perceived rotation direction of ambiguous cubes. The effect of counter-conditioning on Day 2, on perceptual outcome for ambiguous cubes, was independent of whether the cue set was the same or different as Day 1. This invariance suggests that a common neural population instantiates the bias for rotation direction, regardless of the cue set used. However, in a further experiment where only disambiguated cubes were presented on Day 1, perceptual outcome of ambiguous cubes during Day 2 counter-conditioning showed that the monocular-only cue set was in fact more effective than disparity-plus-occlusion for causing long-term learning of the bias. These results can be reconciled if the conditioning effect of Day 1 ambiguous trials in the first experiment is taken into account (Harrison & Backus, 2010b). We suggest that monocular disambiguation leads to stronger bias either because it more strongly activates a single neural population that is necessary for perceiving rotation, or because ambiguous stimuli engage cortical areas that are also engaged by monocularly disambiguated stimuli but not by disparity-disambiguated stimuli
An odd couple – Corpus frequency and look-up frequency: what relationship?

Directory of Open Access Journals (Sweden)

Lars Trap-Jensen

2014-12-01

Full Text Available In this paper, we investigate the relationship between log file records and corpus frequency. The study was motivated by practical considerations of how best to keep an already existing corpus-based dictionary updated. Should the next word in the dictionary be the one that follows next on a list of declining corpus frequency? Or the one that users most frequently look up but don’t find? In order to establish manageable criteria, we analysed log files for The Danish Dictionary from 2009 to 2012 and compared the list of most popular words looked up by the users with the frequency of the same words in the corpus underlying The Danish Dictionary. The users’ actual search behaviour was analysed in order to find answers to questions such as these: Are there words which are never looked up? If so, can we say something meaningful about their corpus frequency patterns – do they belong to particular parts of speech, are they particularly frequent or infrequent, could it even be that the pattern is cumulative, in such a way that a particular threshold can be identified? Ultimately, the question is whether it makes sense to use corpus frequency as a criterion for lemma selection.
Old Persian corpus [Dataset

NARCIS (Netherlands)

Bavant, M.

2011-01-01

XML Old Persian corpus. The corpus is based on publicly available data on the Web. Those data can be traced back to the grammar of Old Persian by Kent (1950). The corpus contains those data and is arranged in a way suitable for corpus searches.
CREATING AND PROCESSING A CORPUS

Directory of Open Access Journals (Sweden)

Prihantoro

2015-05-01

Full Text Available This paper seeks to describe some crucial importance of corpus and text processing. Corpus is a projection of how language is used by its speakers. Technology support has improved corpus for easier maintenance, made it space-saving, and it may electronically structure its data. The latest offers much freedom for corpus users to access and exploit it for language teaching, analysis or other specified tasks. This paper will demonstrate how to use open-access corpus on internet such as Corpus of Contemporary American English (COCA and British National Corpus (BNC. Besides how to use a corpus, another crucial importance that this paper seeks to describe is how to build a corpus. In this paper, the writer will use UNITEX, a corpus (text-based processing software. This software will demonstrate steps of corpus building, ranging from text collection, annotation, electronic dictionary application to some natural language based operations ranging from pattern matching, concordance, to simple extraction. It will show how graph technology may outperform regular expression, a retrieval method exploited by other corpus processor, in terms of writing output.
Exploring theoretical functions of corpus data in teaching translation

Directory of Open Access Journals (Sweden)

Éric Poirier

2016-04-01

Full Text Available http://dx.doi.org/10.5007/2175-7968.2016v36nesp1p177 As language referential data banks, corpora are instrumental in the exploration of translation solutions in bilingual parallel texts or conventional usages of source or target language in monolingual general or specialized texts. These roles are firmly rooted in translation processes, from analysis and interpretation of source text to searching for an acceptable equivalent and integrating it into the production of the target text. Provided the creative and not the conservative way be taken, validation or adaptation of target text in accordance with conventional usages in the target language also benefits from corpora. Translation teaching is not exploiting this way of translating that is common practice in the professional translation markets around the world. Instead of showing what corpus tools can do to translation teaching, we start our analysis with a common issue within translation teaching and show how corpus data can help to resolve it in learning activities in translation courses. We suggest a corpus-driven model for the interpretation of ‘business’ as a term and as an item in complex terms based on source text pattern analysis. This methodology will make it possible for teachers to explain and justify interpretation rules that have been defined theoretically from corpus data. It will also help teachers to conceive and non-subjectively assess practical activities designed for learners of translation. Corpus data selected for the examples of rule-based interpretations provided in this paper have been compiled in a corpus-driven study (Poirier, 2015 on the translation of the noun ‘business’ in the field of specialized translation in business, economics, and finance from English to French. The corpus methodology and rule-based interpretation of senses can be generalized and applied in the definition of interpretation rules for other language pairs and other specialized simple and
Exploring theoretical functions of corpus data in teaching translation

Directory of Open Access Journals (Sweden)

Éric Poirier

2016-06-01

Full Text Available As language referential data banks, corpora are instrumental in the exploration of translation solutions in bilingual parallel texts or conventional usages of source or target language in monolingual general or specialized texts. These roles are firmly rooted in translation processes, from analysis and interpretation of source text to searching for an acceptable equivalent and integrating it into the production of the target text. Provided the creative and not the conservative way be taken, validation or adaptation of target text in accordance with conventional usages in the target language also benefits from corpora. Translation teaching is not exploiting this way of translating that is common practice in the professional translation markets around the world. Instead of showing what corpus tools can do to translation teaching, we start our analysis with a common issue within translation teaching and show how corpus data can help to resolve it in learning activities in translation courses. We suggest a corpus-driven model for the interpretation of ‘business’ as a term and as an item in complex terms based on source text pattern analysis. This methodology will make it possible for teachers to explain and justify interpretation rules that have been defined theoretically from corpus data. It will also help teachers to conceive and non-subjectively assess practical activities designed for learners of translation. Corpus data selected for the examples of rule-based interpretations provided in this paper have been compiled in a corpus-driven study (Poirier, 2015 on the translation of the noun ‘business’ in the field of specialized translation in business, economics, and finance from English to French. The corpus methodology and rule-based interpretation of senses can be generalized and applied in the definition of interpretation rules for other language pairs and other specialized simple and complex terms. These works will encourage the
New Advances in Corpus-based Lexicography*

Directory of Open Access Journals (Sweden)

Arvi Hurskainen

2011-10-01

Full Text Available
Abstract: This article presents various approaches used in corpus-based computational lexico-graphy. A claim is made that in order for computational lexicography to be efficient, precise and comprehensive, it should utilize the method where the corpus text is first analysed, and the results of this analysis is then processed further to meet the needs of a dictionary. This method has several advantages, including high precision and recall, as well as the possibility to automate the process much further than with more traditional computational methods. The frequency list obtained by using the lemma (the equivalent of the headword as basis helps in selecting the words to be in-cluded in the dictionary. The approach is demonstrated through various phases by applying SALAMA (the Swahili Language Manager to the process. Manual work will be needed in the phase when examples of use are selected from the corpus, and possibly modified. However, the list of examples of use, arranged alphabetically according to the corresponding headword, can also be produced automatically. Thus the alphabetical list of headwords with examples of use is the mate-rial on which the lexicographer works manually. The article deals with problems encountered in compiling traditional printed dictionaries, and it excludes electronic dictionaries and thesauri.
Keywords: LEXICOGRAPHY, DICTIONARY, LANGUAGE TECHNOLOGY, COMPUTA-TIONAL LINGUISTICS, AUTOMATIC COMPILATION, DICTIONARY TESTING, INFORMA-TION RETRIEVAL, MORPHOLOGICAL ANALYSIS, SEMANTIC ANALYSIS, DISAMBIGUA-TION, HEURISTICS
Opsomming: Nuwe ontwikkelinge in korpusgebaseerde leksikografie. Hier-die artikel beskryf verskillende benaderings wat in korpusgebaseerde rekenaarleksikografie ge-bruik word. Daar word aangevoer dat vir rekenaarleksikografie om doelmatig, noukeurig en omvattend te wees, dit die metode behoort te gebruik waarby die korpusteks eers ontleed word, en die resultaat van hierdie ontleding dan verder
Neuromorphic optical sensor chip with color change-intensity change disambiguation

Science.gov (United States)

Fu, ZhenHong; Mao, Rui; Cartwright, Alexander N.; Titus, Albert H.

2010-02-01

In this paper, we describe the development of a novel, retina-like neuromorphic chip that has an array of two types of retina 'cells' arranged to mimic the fovea structure in certain animals. One of the two retina cell types performs irradiance detection and the other can perform color detection. Together, via the two parallel pathways the retina chip can perform color change intensity change disambiguation (CCICD). The irradiance detection cell has a wide-dynamic detection range that spans almost 3 orders of magnitude. The color detection cell has a buried double junction (BDJ) photodiode as the photoreceptor followed by two parallel logarithmic I-V convertors. The output from this is a color response which has at least a 50nm resolution for wavelengths from 400nm to 900nm. With these two cells, the array can perform color change -intensity change disambiguation (CCICD) to determine if a change in the output of the irradiance pathway is because of irradiance change, color change, or both. This biological retina-like neuromorphic sensor array is implemented in ON-SEMI 0.5μm technology, a standard CMOS fabrication process available at MOSIS.
Tracking Anglicisms in Domains by the Corpus-Linguistic Method

DEFF Research Database (Denmark)

Mousten, Birthe; Laursen, Anne Lise

2015-01-01

Lay investors and semi-professionals lean on professional stock bloggers and stock analysts for advice on stock investments; semi-professionals and professionals write about investments globally, and stock information has to be available in many local markets. Using the correct terminology......’s critical sense is not enough to make the right choices. Our corpus-linguistic tool can be a help in this specialized field....
Towards an integrated corpus stylistics

Directory of Open Access Journals (Sweden)

McIntyre Dan

2015-12-01

Full Text Available Over recent years, the use of corpora in stylistic analysis has grown in popularity. However, questions still remain over the remit of corpus stylistics, its distinction from corpus linguistics generally and its capacity to explain complex stylistic effects. This article argues in favour of an integrated corpus stylistics; that is, an approach to corpus stylistics that integrates it with other stylistic methods and analytical frameworks. I suggest that this approach is needed for two main reasons: (i it is analytically necessary in order to fully explain stylistic effects in texts, and (ii integrating corpus methods with other stylistic tools is what will distinguish corpus stylistics from corpus linguistics. My argument is supported by reference to examples from Mark Haddon’s no vel The Curious Incident of the Dog in the Night-time and the HBO TV series Deadwood. Both these examples rely for their explanation on a combination of corpus stylistic analytical techniques and other stylistic methods of analysis.
TwitterNEED: a hybrid approach for named entity extraction and disambiguation for tweets

NARCIS (Netherlands)

Habib, Mena Badieh; van Keulen, Maurice

Twitter is a rich source of continuously and instantly updated information. Shortness and informality of tweets are challenges for Natural Language Processing tasks. In this paper, we present TwitterNEED, a hybrid approach for Named Entity Extraction and Named Entity Disambiguation for tweets. We
Authormagic in INSPIRE Author Disambiguation in Scholarly Communication

CERN Document Server

Brooks, Travis C; Dallmeier-Tiessen, Sünje; Mele, Salvatore; Weiler, Henning

2011-01-01

“Authormagic” is a system designed to solve the systemic challenge of the attribution of scholarly artifacts to unique authors in scientific digital libraries. It relies on the unique combination of machine-based knowledge retrieval and distributed knowledge of the users of the system themselves. Algorithmically computed lists of the authors’ publications, disambiguated through a (meta-) data mining approach, allow users to follow an intuitive procedure to validate and improve content to an author’s scholarly profile. This approach constitutes the core of a new paradigm for extended author-centric and user-centric services in large-scale scientific digital libraries.
Monte Carlo methods for top-k personalized PageRank lists and name disambiguation

NARCIS (Netherlands)

Avrachenkov, Konstatin; Litvak, Nelli; Nemirovsky, Danil; Smirnova, Elena; Sokol, Marina

We study a problem of quick detection of top-k Personalized PageRank lists. This problem has a number of important applications such as finding local cuts in large graphs, estimation of similarity distance and name disambiguation. In particular, we apply our results to construct efficient algorithms
Lancaster Summer School in Corpus Linguistics

Directory of Open Access Journals (Sweden)

Jaka Čibej

2016-11-01

Full Text Available Med 12. in 15. julijem je na Univerzi v Lancastru potekala poletna šola korpusnega jezikoslovja Lancaster Summer Schools in Corpus Linguistics and Other Digital Methods. Poletno šolo so organizirali UCREL (University Centre for Computer Corpus Research on Language, ERC (Evropski svet za raziskave – European Research Council, CASS (ESRC Centre for Corpus Approaches to Social Science in ESRC (Economic and Social Research Council, razdeljena pa je bila na šest programov, prilagojenih različnim področjem: Korpusno jezikoslovje za proučevanje jezikov (Corpus Linguistics for Language Studies, Korpusno jezikoslovje za družbene vede (Corpus Linguistics for Social Science, Korpusno jezikoslovje za humanistiko (Corpus Linguistics for Humanities, Statistika za korpusno jezikoslovje (Statistics for Corpus Linguistics, Geografski informacijski sistemi za digitalno humanistiko (Geographical Information Systems for the Digital Humanities in Korpusno podprta obdelava naravnih jezikov (Corpus-based Natural Language Processing.
Búsqueda de Colocaciones en la Web para Sinónimos de Wordnet Búsqueda de Colocaciones en la Web para Sinónimos de Wordnet

Directory of Open Access Journals (Sweden)

M. Montes -y-Gómez

2012-02-01

method that allows finding significant laying to the different senses attributable to a polysemic word. A collocation is an arbitrary and recurrent combination of words. The experiment results show a great potential of the proposed method to find collocations between words by using the Web as linguistic corpus, as well as the feasibility of incorporating the lexical patterns obtained in word sense disambiguation systems that can be used, for example, in translation machines or information recovery systems. by using the Web as a linguistic corpus, as well as the feasibility of incorporation the lexical patterns obtained in word sense disambiguation systems that can be used, for example, in machines translation or information recovery systems.
Translating children’s literature: some insights from corpus stylistics

Directory of Open Access Journals (Sweden)

Anna Čermáková

2018-01-01

Full Text Available In this paper I explore the potential of a corpus stylistic approach to the study of literary translation. The study focuses on translation of children’s literature with its specific constrains, and illustrates with two corpus linguistic techniques: keyword and cluster analysis — specific cases of repetition. So in a broader sense the paper discusses the phenomenon of repetition in different literary (stylistic traditions. These are illustrated by examples from two children’s classics aimed at two different age groups: the Harry Potter and the Winnie the Pooh books — and their translations into Czech. Various shifts in translation, especially in the translation of children’s literature, are often explained by the operation of so-called ‘translation universals’. Though ‘repetition’ as such does not belong to the commonly discussed set of translation universals, the stylistic norms opposing repetition seem to be a strong explanation for the translation shifts identified.
Conversation, characterisation and corpus linguistics: Dialogue in Jane Austen’s Sense and Sensibility

Directory of Open Access Journals (Sweden)

E.H. Hubbard

2002-08-01

Full Text Available This article reports on a corpus-based exploration of the role that fictional dialogue plays in characterisation. The focus is on the two main characters of Austen’s Sense and Sensibility and (a the extent to which certain features of their dialogue can be said to tie in with general perceptions that Elinor represents the “sense” and Marianne the “sensibility” of the novel’s title; and (b the extent to which Austen can be said to have exploited these features to enable the sisters to speak with subtly differing voices. The features themselves were drawn from two linguistic frameworks, namely cohesion in text linguistics (specifically, the category of conjunctive cohesion as originated by Halliday and Hasan (1976, and the category of “involvement” in register analysis (most prominently, Biber 1988. The density of these features in each dialogue was calculated, compared statistically and salient differences considered in relation to the focal issues of the study. Although two of the five hypotheses formulated were not supported, the results overall provided strong indications that Austen successfully distinguishes between the sisters through their dialogue, and often in ways that link with less subtle, more explicit cues to their character that are given in the text. The study thus reveals how certain text-linguistic and register features can underpin characterisation in fiction, and in so doing explicates aspects of what it is that readers and literary critics respond to when they comment on characterisation in a novel.

Cross-Linguistic Differences in Prosodic Cues to Syntactic Disambiguation in German and English

Science.gov (United States)

O'Brien, Mary Grantham; Jackson, Carrie N.; Gardner, Christine E.

2014-01-01

This study examined whether late-learning English-German second language (L2) learners and late-learning German-English L2 learners use prosodic cues to disambiguate temporarily ambiguous first language and L2 sentences during speech production. Experiments 1a and 1b showed that English-German L2 learners and German-English L2 learners used a…
Kinesthetic information disambiguates visual motion signals.

Science.gov (United States)

Hu, Bo; Knill, David C

2010-05-25

Numerous studies have shown that extra-retinal signals can disambiguate motion information created by movements of the eye or head. We report a new form of cross-modal sensory integration in which the kinesthetic information generated by active hand movements essentially captures ambiguous visual motion information. Several previous studies have shown that active movement can bias observers' percepts of bi-stable stimuli; however, these effects seem to be best explained by attentional mechanisms. We show that kinesthetic information can change an otherwise stable perception of motion, providing evidence of genuine fusion between visual and kinesthetic information. The experiments take advantage of the aperture problem, in which the motion of a one-dimensional grating pattern behind an aperture, while geometrically ambiguous, appears to move stably in the grating normal direction. When actively moving the pattern, however, the observer sees the motion to be in the hand movement direction. Copyright 2010 Elsevier Ltd. All rights reserved.
Bibliographic Entity Automatic Recognition and Disambiguation

CERN Document Server

AUTHOR|(SzGeCERN)766022

This master thesis reports an applied machine learning research internship done at digital library of the European Organization for Nuclear Research (CERN). The way an author’s name may vary in its representation across scientific publications creates ambiguity when it comes to uniquely identifying an author; In the database of any scientific digital library, the same full name variation can be used by more than one author. This may occur even between authors from the same research affiliation. In this work, we built a machine learning based author name disambiguation solution. The approach consists in learning a distance function from a ground-truth data, blocking publications of broadly similar author names, and clustering the publications using a semi-supervised strategy within each of the blocks. The main contributions of this work are twofold; first, improving the distance model by taking into account the (estimated) ethnicity of the author’s full name. Indeed, names from different ethnicities, for e...
Framing effects are robust to linguistic disambiguation: A critical test of contemporary theory.

Science.gov (United States)

Chick, Christina F; Reyna, Valerie F; Corbin, Jonathan C

2016-02-01

Theoretical accounts of risky choice framing effects assume that decision makers interpret framing options as extensionally equivalent, such that if 600 lives are at stake, saving 200 implies that 400 die. However, many scholars have argued that framing effects are caused, instead, by filling in pragmatically implied information. This linguistic ambiguity hypothesis is grounded in neo-Gricean pragmatics, information leakage, and schema theory. In 2 experiments, we conducted critical tests of the linguistic ambiguity hypothesis and its relation to framing. We controlled for this crucial implied information by disambiguating it using instructions and detailed examples, followed by multiple quizzes. After disambiguating missing information, we presented standard framing problems plus truncated versions, varying types of missing information. Truncations were also critical tests of prospect theory and fuzzy trace theory. Participants were not only college students, but also middle-age adults (who showed similar results). Contrary to the ambiguity hypothesis, participants who interpreted missing information as complementary to stated information nonetheless showed robust framing effects. Although adding words like "at least" can change interpretations of framing information, this form of linguistic ambiguity is not necessary to observe risky choice framing effects. (c) 2016 APA, all rights reserved).
FTA Corpus: a parallel corpus of English and Spanish Free Trade Agreements for the study of specialized collocations

Directory of Open Access Journals (Sweden)

Pedro Patiño García

2013-04-01

Full Text Available This paper describes the Corpus of Free Trade Agreements (henceforth FTA, a specialized parallel corpus in English and Spanish from Europe and America and a smaller subcorpus in English-Norwegian and Spanish-Norwegian that was prepared and then aligned with Translation Corpus Aligner 2 (Hofland & Johansson, 1998. The data was taken from Free Trade Agreements. These agreements are specialized texts officially signed and ratified by several countries and blocks of countries in the last twenty years. Thus, FTAs are a rich repository for terminology and phraseology that is used in different fields of business activity throughout the world. The corpus contains around 1.37 million words in the English section and 1.48 million words in its Spanish counterpart, plus 60,000 words each in the Spanish-Norwegian and English-Norwegian subcorpus. The corpus is being used primarily to study the terms and specialized collocations that include these terms in this kind of specialized texts.Keywords: specialized collocation, specialized parallel corpus, corpus linguistics, Free Trade Agreement
76 FR 18395 - Safety Zone; Naval Air Station Corpus Christi Air Show, Oso Bay, Corpus Christi, TX

Science.gov (United States)

2011-04-04

...-AA00 Safety Zone; Naval Air Station Corpus Christi Air Show, Oso Bay, Corpus Christi, TX AGENCY: Coast... zone on the navigable waters of Oso Bay in Corpus Christi, Texas in support of the 2011 Naval Air... entities and very few recreational fisherman utilize this section of Oso Bay, the restriction of vessel...
Enhancing Author Information for CERN Document Server: Creating an Author Collection and Using Author Disambiguation Methods

CERN Document Server

AUTHOR|(CDS)2096274; Witowski, Sebastian

Authors are a substantial part of queries in digital libraries, where the results are reflecting the service quality and success. Ambiguous author names can confuse users and cause an inaccurate relation between authorships and individual researchers. Providing a set of disambiguated authors is challenging and related to data integration, since this is done in several ways and by different systems, both manually and automatically. Many disambiguation algorithms have been proposed in the literature, where the most solutions are solving the ambiguities by applying machine learning techniques. However, such problems cannot be solved with an accuracy of 100%. Our contributions to the CERN Document Server presented in this work consists of two parts: first, we create and deploy an author knowledge data base (collection) and second, we link authors of bibliographic records back to their authority records. For the latter, we use a library providing machine learning tools for clustering (where we use trained data fro...
Corpus vitreum, retina og chorioidea biopsi

DEFF Research Database (Denmark)

Scherfig, Erik Christian Høegh

2002-01-01

oftalmology, biopsy, choroid, corpus vitreum, retina, malignant melanoma, biopsy technic, retinoblastoma......oftalmology, biopsy, choroid, corpus vitreum, retina, malignant melanoma, biopsy technic, retinoblastoma...
Author Disambiguation in PubMed: Evidence on the Precision and Recall of Author-ity among NIH-Funded Scientists.

Science.gov (United States)

Lerchenmueller, Marc J; Sorenson, Olav

2016-01-01

We examined the usefulness (precision) and completeness (recall) of the Author-ity author disambiguation for PubMed articles by associating articles with scientists funded by the National Institutes of Health (NIH). In doing so, we exploited established unique identifiers-Principal Investigator (PI) IDs-that the NIH assigns to funded scientists. Analyzing a set of 36,987 NIH scientists who received their first R01 grant between 1985 and 2009, we identified 355,921 articles appearing in PubMed that would allow us to evaluate the precision and recall of the Author-ity disambiguation. We found that Author-ity identified the NIH scientists with 99.51% precision across the articles. It had a corresponding recall of 99.64%. Precision and recall, moreover, appeared stable across common and uncommon last names, across ethnic backgrounds, and across levels of scientist productivity.
Holistic corpus-based dialectology Dialetologia holística baseada em corpus

Directory of Open Access Journals (Sweden)

Benedikt Szmrecsanyi

2011-01-01

Full Text Available This paper is concerned with sketching future directions for corpus-based dialectology. We advocate a holistic approach to the study of geographically conditioned linguistic variability, and we present a suitable methodology, 'corpusbased dialectometry', in exactly this spirit. Specifically, we argue that in order to live up to the potential of the corpus-based method, practitioners need to (i abandon their exclusive focus on individual linguistic features in favor of the study of feature aggregates, (ii draw on computationally advanced multivariate analysis techniques (such as multidimensional scaling, cluster analysis, and principal component analysis, and (iii aid interpretation of empirical results by marshalling state-of-the-art data visualization techniques. To exemplify this line of analysis, we present a case study which explores joint frequency variability of 57 morphosyntax features in 34 dialects all over Great Britain.Este artigo debruça-se sobre o esboço propositivo de futuras direções para a dialetologia baseada em corpus. Defendemos uma abordagem holística para o estudo da variabilidade linguística geograficamente condicionada, e apresentamos uma metodologia adequada para tal - a dialetometria baseada em corpus. Mais especificamente, defendemos que para que se obtenham todos os resultados esperados da metodologia de corpus, pesquisadores devem: (i abandonar seu foco exclusivo em traços linguísticos individuais em favor do estudo dos agregados de traços, (ii amparar-se em métodos computacionais avançados de técnicas de análise multivariada (tais como escalagem multidimensional, análise de clusters, e análise de componente principal, e (iii auxiliar a interpretação de resultados empíricos através da utilização do estado da arte em técnicas de visualização. A fim de exemplificarmos essa linha de análise, apresentamos um estudo de caso que explora a variabilidade da frequência agregada de 57 tra
Towards proper name generation : A corpus analysis

NARCIS (Netherlands)

Castro Ferreira, Thiago; Wubben, Sander; Krahmer, Emiel

We introduce a corpus for the study of proper name generation. The corpus consists of proper name references to people in webpages, extracted from the Wikilinks corpus. In our analyses, we aim to identify the different ways, in terms of length and form, in which a proper names are produced
KoralQuery -- A General Corpus Query Protocol

DEFF Research Database (Denmark)

Bingel, Joachim; Diewald, Nils

2015-01-01

. In this paper, we present KoralQuery, a JSON-LD based general corpus query protocol, aiming to be independent of particular QLs, tasks and corpus formats. In addition to describing the system of types and operations that KoralQuery is built on, we exemplify the representation of corpus queries in the serialized...
Corpus Approaches to Language Ideology

Science.gov (United States)

Vessey, Rachelle

2017-01-01

This paper outlines how corpus linguistics--and more specifically the corpus-assisted discourse studies approach--can add useful dimensions to studies of language ideology. First, it is argued that the identification of words of high, low, and statistically significant frequency can help in the identification and exploration of language ideologies…
Meaningful Clusters

Energy Technology Data Exchange (ETDEWEB)

Sanfilippo, Antonio P.; Calapristi, Augustin J.; Crow, Vernon L.; Hetzler, Elizabeth G.; Turner, Alan E.

2004-05-26

We present an approach to the disambiguation of cluster labels that capitalizes on the notion of semantic similarity to assign WordNet senses to cluster labels. The approach provides interesting insights on how document clustering can provide the basis for developing a novel approach to word sense disambiguation.
Contrast radiographic study of venous drainage of the corpus cavernosum and the corpus spongiosum of the cat penis.

Science.gov (United States)

Amiri, Ali Akbar; Gilanpour, Hassan; Veshkini, Abbas

2014-01-01

The aim of this study was to determine the drainage routes of the corpus cvernosum penis and the corpus spongiosum penis in the cat using contrast cavernosography. Five male cats, 1.5-2.5 years old, weighing between 4.5 and 5.5 kg were investigated. The cats were anesthetized and the root and the proximal part of the penis were exposed by an incision on the perineum reaching the scrotum. Each cat was radiographed in lateral and dorsal recumbency before and during injection of contrast medium into the erectile bodies. The corpus spongiosum penis was injected at the bulb of the penis and the corpus cavernosum penis at the root. Injection of contrast media into the cavernous bodies showed that both the external and internal iliac veins drain the erectile bodies into the caudal vena cava. Drainage from the corpus spongiosum penis was from the bulb for the proximal part and from the glans for the distal part. The corpus cavernosum penis was drained only proximally, from the crura. There was a network of veins above the pelvic symphysis and the drainage of erectile bodies where through various routes into the internal and external iliac veins.
The Ndebele Language Corpus: A Review of Some Factors Influencing the Content of the Corpus*

Directory of Open Access Journals (Sweden)

Samukele Hadebe

2011-10-01

Full Text Available
Abstract: The Ndebele language corpus described here is that compiled by the ALLEX Project (now ALRI at the University of Zimbabwe. It is intended to reflect as much as possible the Ndebele language as spoken in Zimbabwe. The Ndebele language corpus was built in order to provide much-needed material for the study of the Ndebele language with a special focus on dictionarymaking and research. Like most corpora, the Ndebele language corpus may in future be used for other purposes not thought of at the time of its inception. It has been designed to meet generally acceptable standards so that it can be adaptable to various possible uses by various researchers. The article wants to outline the building process of the Ndebele language corpus with special emphasis on the challenges that faced compilers, and possible solutions. It is assumed that some of these challenges might not be peculiar to Ndebele alone but could also affect related African languages in a more or less similar situation. The main focus of the discussion will be the composition of the Ndebele language corpus, i.e. the type of texts that constitute the corpus. The corpus is composed of published texts, unpublished texts and oral material gathered from Ndebele-speaking districts of Zimbabwe. It will be argued that the use of the corpus and its reliability for research depends among other factors on its contents. It will also be shown that the contents of a corpus depend on a number of factors, some of which include sociolinguistic, political and economic considerations. These considerations have implications on both the content and quality of published and oral texts that constitute the Ndebele language corpus.
Keywords: CORPUS, ORAL MATERIALS, CODE-MIXING, CODE-SWITCHING, MOTHER- TONGUE, NDEBELE
Opsomming: Die Ndebeletaalkorpus: 'n Oorsig van sommige faktore wat die inhoud van die korpus be?nvloed. Die Ndebeletaalkorpus wat hier beskryf word, is di? saamgestel deur die
Measurement of normal corpus callosum with MRI in Korean adults and morphological change of corpus callosum by grade of hydrocephalus

International Nuclear Information System (INIS)

Song, Dong Hoon; Chang, Seung Kuk; Kim, Jong Deok; Eun, Tchoong Kie; Park, Dong Woo

1995-01-01

To measure the size of normal corpus callosum in each portion using objective and reproducible method with MRI and evaluation of morphological change of corpus callosum by grade of hydrocephalus. Midsagittal T1-weighted MR imaging of the corpus callosum was investigated in 41 volunteers of normal Korean adults and 19 patients with hydrocephalus. Corpus callosum was measured for the anteroposterior length(A), height(B), and the thickness of genu(C), body(D), splenium(E), and the narrowest portion of body(F). And the analysis of morphology and signal intensity of the corpus callosum were also evaluated. Hydrocephalus was graded as mild, moderate, and severe, and comparison of thickness with normal corpus callosum in each portion was done. The mean length and height were 72.3 mm, 28.6 mm in male, and 70.7 mm, 28.9 mm in female. And the mean dimension for C, D, E and F were 13.1 mm, 8 mm, 13.2 mm, 5.2 mm in male, and 12.8 mm, 7.5 mm, 12.3 mm, 5 mm in female. The morphology of normal corpus callosum was 'hook' shaped on midline sagittal T1-weighted image. Narrowing at posterior third portion of body were present on 30 cases(73.2%) and even in thickness of the body in 11 cases(26.8%). The signal intensity of the corpus callosum on midsagittal T1-weighted spin echo image of normal cases was homogeneous hyperintense as compared with cerebral gray matter. In hydrocephalus, A and B were increased and other portions were decreased in thickness. Genu and the narrowest portion of body showed significant difference of thickness according to the grade of hydrocephalus. The mean dimension of all portion of corpus callosum were larger in male than female except for callosal height but not significant statistically with the exception of splenium. Hydrocephalus lead to morphological change of the corpus callosum. Among the portion of corpus callosum, genu and the narrowest portion of the body were thought to be the most sensitive indicators of degree in hydrocephalus
Automatic vs. manual curation of a multi-source chemical dictionary: the impact on text mining

Science.gov (United States)

2010-01-01

Background Previously, we developed a combined dictionary dubbed Chemlist for the identification of small molecules and drugs in text based on a number of publicly available databases and tested it on an annotated corpus. To achieve an acceptable recall and precision we used a number of automatic and semi-automatic processing steps together with disambiguation rules. However, it remained to be investigated which impact an extensive manual curation of a multi-source chemical dictionary would have on chemical term identification in text. ChemSpider is a chemical database that has undergone extensive manual curation aimed at establishing valid chemical name-to-structure relationships. Results We acquired the component of ChemSpider containing only manually curated names and synonyms. Rule-based term filtering, semi-automatic manual curation, and disambiguation rules were applied. We tested the dictionary from ChemSpider on an annotated corpus and compared the results with those for the Chemlist dictionary. The ChemSpider dictionary of ca. 80 k names was only a 1/3 to a 1/4 the size of Chemlist at around 300 k. The ChemSpider dictionary had a precision of 0.43 and a recall of 0.19 before the application of filtering and disambiguation and a precision of 0.87 and a recall of 0.19 after filtering and disambiguation. The Chemlist dictionary had a precision of 0.20 and a recall of 0.47 before the application of filtering and disambiguation and a precision of 0.67 and a recall of 0.40 after filtering and disambiguation. Conclusions We conclude the following: (1) The ChemSpider dictionary achieved the best precision but the Chemlist dictionary had a higher recall and the best F-score; (2) Rule-based filtering and disambiguation is necessary to achieve a high precision for both the automatically generated and the manually curated dictionary. ChemSpider is available as a web service at http://www.chemspider.com/ and the Chemlist dictionary is freely available as an XML file in
Rapid Exploitation and Analysis of Documents

Energy Technology Data Exchange (ETDEWEB)

Buttler, D J; Andrzejewski, D; Stevens, K D; Anastasiu, D; Gao, B

2011-11-28

Analysts are overwhelmed with information. They have large archives of historical data, both structured and unstructured, and continuous streams of relevant messages and documents that they need to match to current tasks, digest, and incorporate into their analysis. The purpose of the READ project is to develop technologies to make it easier to catalog, classify, and locate relevant information. We approached this task from multiple angles. First, we tackle the issue of processing large quantities of information in reasonable time. Second, we provide mechanisms that allow users to customize their queries based on latent topics exposed from corpus statistics. Third, we assist users in organizing query results, adding localized expert structure over results. Forth, we use word sense disambiguation techniques to increase the precision of matching user generated keyword lists with terms and concepts in the corpus. Fifth, we enhance co-occurrence statistics with latent topic attribution, to aid entity relationship discovery. Finally we quantitatively analyze the quality of three popular latent modeling techniques to examine under which circumstances each is useful.
Neuroanatomical Anomalies of Dyslexia: Disambiguating the Effects of Disorder, Performance, and Maturation

Science.gov (United States)

Xia, Zhichao; Hoeft, Fumiko; Zhang, Linjun; Shu, Hua

2016-01-01

An increasing body of studies has revealed neuroanatomical impairments in developmental dyslexia. However, whether these structural anomalies are driven by dyslexia (disorder-specific effects), absolute reading performance (performance-dependent effects), and/or further influenced by age (maturation-sensitive effects) remains elusive. To help disentangle these sources, the current study used a novel disorder (dyslexia vs. control) by maturation (younger vs. older) factorial design in 48 Chinese children who were carefully matched. This design not only allows for direct comparison between dyslexics versus controls matched for chronological age and reading ability, but also enables examination of the influence of maturation and its interaction with dyslexia. Voxel-based morphometry (VBM) showed that dyslexic children had reduced regional gray matter volume in the left temporo-parietal cortex (spanning over Heschl’s gyrus, planum temporale and supramarginal gyrus), middle frontal gyrus, superior occipital gyrus, and reduced regional white matter in bilateral parieto-occipital regions (left cuneus and right precuneus) compared with both age-matched and reading-level matched controls. Therefore, maturational stage-invariant neurobiological signatures of dyslexia were found in brain regions that have been associated with impairments in the auditory/phonological and attentional systems. On the other hand, maturational stage-dependent effects on dyslexia were observed in three regions (left ventral occipito-temporal cortex, left dorsal pars opercularis and genu of the corpus callosum), all of which were previously reported to be involved in fluent reading and its development. These striking dissociations collectively suggest potential atypical developmental trajectories of dyslexia, where underlying mechanisms are currently unknown but may be driven by interactions between genetic and/or environmental factors. In summary, this is the first study to disambiguate

[Behavioral and cognitive profile of corpus callosum agenesia - Review].

Science.gov (United States)

Lábadi, Beatrix; Beke, Anna Maria

2016-11-30

Agenesis of corpus callosum is a relatively frequent congenital cerebral malformation including dysplasia, total or partial absence of corpus callosum. The agenesis of corpus callosum can be occured in isolated form without accompanying somatic or central nervous system abnormalities and it can be associated with other central nervus system malformations. The behavioral and cognitive outcome is more favorable for patients with isolated agenesis of corpus callous than syndromic form of corpus callosum. The aim of this study is to review recent research on behavioral and social-cognitive functions in individuals with agenesis of corpus callosum. Developmental delay is common especially in higher-order cognitive and social functions. An internet database search was performed to identify publications on the subject. Fifty-five publications in English corresponded to the criteria. These studies reported deficits in language, social cognition and emotions in individuals with agenesis of corpus callosum which is known as primary corpus callous syndrome. The results indicate that individuals with agenesis of corpus callosum have deficiency in social-cognitive domain (recognition of emotions, weakness in paralinguistic aspects of language and mentalizing abilities). The impaired social cognition can be manifested in behavioral problems like autism and attention deficit hyperactivity disorder.
MR measurement of normal corpus callosum in children

International Nuclear Information System (INIS)

Kim, Hyoung Sub; Kim, Jong Chul; Kang, Yong Soo; Lee, Young Hwan; Kim, Young Wol

1997-01-01

To measure the mean size of the various portions of the corpus callosum in normal Korean children, using MR imaging. Our subjects were 166 children (male : female=100 : 66) aged under 15 whose findings on MR imaging and neurologic examination were normal. Using midsagittal T1-weighted imaging, we measured the length of the brain and corpus callosum, the height of the latter, and the thickness of its genu body, transitional zone and splenium. The measurements were statistically analysed according to age and sex. Brain length and the size of the various portions of the corpus callosum tended to increase relatively rapidly during the first three years of life, but the rate of growth tended to decrease according to age. The mean lenght of the brain and corpus callosum and the mean thickness of the splenium of the corpus callosum did not differ according to sex. The mean thickness of the genu, body and transitional zone of the corpus callosum was greater in males than in females. The ratio of the length of the corpus callosum to the anteroposterior diameter of the brain was significantly greater in females than in males (alpha=0.05). Using MR imaging, we measured the mean sizes of the various portions of the corpus callosum in normal children;these values may provide a useful basis for determing changes occurring in its structure
Orfismo en el Corpus Philostrateum

Directory of Open Access Journals (Sweden)

Susana M. Lizcano Rejano

2003-06-01

Full Text Available We search through the Corpus Philostrateum for the presence of connections between this literary production and Orphismus – its system of beliefs, its peculiar interpretation of the traditional Greek mythology, its proposal for a particular way of life. Also, we try to determine the relation, that we can find in this corpus between the ideology and customs that the Pythagoreans and Orphics supported.
Network-based stochastic competitive learning approach to disambiguation in collaborative networks

Science.gov (United States)

Christiano Silva, Thiago; Raphael Amancio, Diego

2013-03-01

Many patterns have been uncovered in complex systems through the application of concepts and methodologies of complex networks. Unfortunately, the validity and accuracy of the unveiled patterns are strongly dependent on the amount of unavoidable noise pervading the data, such as the presence of homonymous individuals in social networks. In the current paper, we investigate the problem of name disambiguation in collaborative networks, a task that plays a fundamental role on a myriad of scientific contexts. In special, we use an unsupervised technique which relies on a particle competition mechanism in a networked environment to detect the clusters. It has been shown that, in this kind of environment, the learning process can be improved because the network representation of data can capture topological features of the input data set. Specifically, in the proposed disambiguating model, a set of particles is randomly spawned into the nodes constituting the network. As time progresses, the particles employ a movement strategy composed of a probabilistic convex mixture of random and preferential walking policies. In the former, the walking rule exclusively depends on the topology of the network and is responsible for the exploratory behavior of the particles. In the latter, the walking rule depends both on the topology and the domination levels that the particles impose on the neighboring nodes. This type of behavior compels the particles to perform a defensive strategy, because it will force them to revisit nodes that are already dominated by them, rather than exploring rival territories. Computer simulations conducted on the networks extracted from the arXiv repository of preprint papers and also from other databases reveal the effectiveness of the model, which turned out to be more accurate than traditional clustering methods.
Divergent approaches to corpus processing: the need for ...

African Journals Online (AJOL)

With a good corpus, data can be provided giving an authoritative body of linguistic evidence which can support generalisations and against which hypotheses can be tested. As this proves the invaluable status of a corpus, the article assesses the processing of the Shona corpus and discusses how some aspects of the ...
Dictionary Writing System (DWS) + Corpus Query Package (CQP ...

African Journals Online (AJOL)

In this article the integrated corpus query functionality of the dictionary compilation software TshwaneLex is analysed. Attention is given to the handling of both raw corpus data and annotated corpus data. With regard to the latter it is shown how, with a minimum of human effort, machine learning techniques can be employed ...
The biomedical discourse relation bank

Directory of Open Access Journals (Sweden)

Joshi Aravind

2011-05-01

Full Text Available Abstract Background Identification of discourse relations, such as causal and contrastive relations, between situations mentioned in text is an important task for biomedical text-mining. A biomedical text corpus annotated with discourse relations would be very useful for developing and evaluating methods for biomedical discourse processing. However, little effort has been made to develop such an annotated resource. Results We have developed the Biomedical Discourse Relation Bank (BioDRB, in which we have annotated explicit and implicit discourse relations in 24 open-access full-text biomedical articles from the GENIA corpus. Guidelines for the annotation were adapted from the Penn Discourse TreeBank (PDTB, which has discourse relations annotated over open-domain news articles. We introduced new conventions and modifications to the sense classification. We report reliable inter-annotator agreement of over 80% for all sub-tasks. Experiments for identifying the sense of explicit discourse connectives show the connective itself as a highly reliable indicator for coarse sense classification (accuracy 90.9% and F1 score 0.89. These results are comparable to results obtained with the same classifier on the PDTB data. With more refined sense classification, there is degradation in performance (accuracy 69.2% and F1 score 0.28, mainly due to sparsity in the data. The size of the corpus was found to be sufficient for identifying the sense of explicit connectives, with classifier performance stabilizing at about 1900 training instances. Finally, the classifier performs poorly when trained on PDTB and tested on BioDRB (accuracy 54.5% and F1 score 0.57. Conclusion Our work shows that discourse relations can be reliably annotated in biomedical text. Coarse sense disambiguation of explicit connectives can be done with high reliability by using just the connective as a feature, but more refined sense classification requires either richer features or more
Corpus-Based Investigations of Language Use.

Science.gov (United States)

Biber, Douglas; And Others

1996-01-01

Examines a representative text corpus to gain insights into language structure and use and to open new areas of linguistic inquiry. Various illustrations are presented that provide a glimpse into the value of corpus-based investigations for increasing one's understanding of language use and imparting insights important for designing effective…
The Yale-Classical Archives Corpus

Directory of Open Access Journals (Sweden)

Christopher William White

2016-07-01

Full Text Available The Yale-Classical Archives Corpus (YCAC contains harmonic and rhythmic information for a dataset of Western European Classical art music. This corpus is based on data from classicalarchives.com, a repository of thousands of user-generated MIDI representations of pieces from several periods of Western European music history. The YCAC makes available metadata for each MIDI file, as well as a list of pitch simultaneities ("salami slices" in the MIDI file. Metadata include the piece's composer, the composer's country of origin, date of composition, genre (e.g., symphony, piano sonata, nocturne, etc., instrumentation, meter, and key. The processing step groups the file's pitches into vertical slices each time a pitch is added or subtracted from the texture, recording the slice's offset (measured in the number of quarter notes separating the event from the file's beginning, highest pitch, lowest pitch, prime form, scale-degrees in relation to the global key (as determined by experts, and local key information (as determined by a windowed key-profile analysis. The corpus contains 13,769 MIDI files by 571 composers yielding over 14,051,144 vertical slices. This paper outlines several properties of this corpus, along with a representative study using this dataset.
INVESTIGATING THE TRANSLATION OF ACRONYMS IN THE AREA OF REMOTE SENSING: A PROPOSAL FROM THE STANDPOINT OF PHRASEOLOGY

Directory of Open Access Journals (Sweden)

Dalila dos Santos Hasmann

2014-07-01

Full Text Available The aim is to analyze a corpus of remote sensing in order to identify acronyms in English and then search for their equivalents in Portuguese. The research is based on the approach of Corpus-Based Translation Studies (BAKER, 1995, Corpus Linguistics (BERBER SARDINHA, 2004, and Phraseology (PAVEL, 2003. The program WordSmith Tools version 6.0 is used. The results show that there is no standardization in these translations.
Disambiguating past events: accurate source memory for time and context depends on different retrieval processes

OpenAIRE

Persson, Bjorn Martin; Ainge, James Alexander; O'Connor, Akira Robert

2016-01-01

Participant payment was provided by the School of Psychology and Neuroscience ResPay scheme. Current animal models of episodic memory are usually based on demonstrating integrated memory for what happened, where it happened, and when an event took place. These models aim to capture the testable features of the definition of human episodic memory which stresses the temporal component of the memory as a unique piece of source information that allows us to disambiguate one memory from another...
Diffuse corpus callosum infarction - Rare vascular entity with differing etiology.

Science.gov (United States)

Mahale, Rohan; Mehta, Anish; Buddaraju, Kiran; John, Aju Abraham; Javali, Mahendra; Srinivasa, Rangasetty

2016-01-15

Infarctions of the corpus callosum are rare vascular events. It is relatively immune to vascular insult because of its rich vascular supply from anterior and posterior circulations of brain. Report of 3 patients with largely diffuse acute corpus callosum infarction. 3 patients with largely diffuse acute corpus callosum infarction were studied and each of these 3 patients had 3 different aetiologies. The 3 different aetiologies of largely diffuse acute corpus callosum infarction were cardioembolism, tuberculous arteritis and takayasu arteritis. Diffuse corpus callosum infarcts are rare events. This case series narrates the three different aetiologies of diffuse acute corpus callosum infarction which is a rare vascular event. Copyright © 2015 Elsevier B.V. All rights reserved.
A corpus and a concordancer of academic journal articles

Directory of Open Access Journals (Sweden)

Deny A. Kwary

2018-02-01

Full Text Available This data article presents a corpus (i.e. a selection of a big number of words in an electronic form and a concordancer (i.e. a tool to show the word in its context of use of academic journal articles. As the title suggests, the data were collected from research articles published in academic journals. The corpus contains 5,686,428 words selected from 895 journal articles published by Elsevier in 2011–2015. The corpus is classified into four subject areas: Health sciences, Life sciences, Physical Sciences, and Social Sciences, following the classifications of Scopus, which is the largest abstract and citation database of peer-reviewed scientific journals, books and conference proceedings. To ease the access and utilization of the corpus, a program to produce the key word in context (KWIC and word frequency was created and placed on the website: corpus.kwary.net. The corpus is a valuable resource for researchers, teachers, and translators working on academic English.
A corpus and a concordancer of academic journal articles.

Science.gov (United States)

Kwary, Deny A

2018-02-01

This data article presents a corpus (i.e. a selection of a big number of words in an electronic form) and a concordancer (i.e. a tool to show the word in its context of use) of academic journal articles. As the title suggests, the data were collected from research articles published in academic journals. The corpus contains 5,686,428 words selected from 895 journal articles published by Elsevier in 2011-2015. The corpus is classified into four subject areas: Health sciences, Life sciences, Physical Sciences, and Social Sciences, following the classifications of Scopus, which is the largest abstract and citation database of peer-reviewed scientific journals, books and conference proceedings. To ease the access and utilization of the corpus, a program to produce the key word in context (KWIC) and word frequency was created and placed on the website: corpus.kwary.net. The corpus is a valuable resource for researchers, teachers, and translators working on academic English.
Disparity Disambiguation by Fusion of Signal and Symbolic-Level Information

DEFF Research Database (Denmark)

Ralli, J.; Diaz, J.; Ros, E.

2012-01-01

We describe a method for resolving ambiguities in low-level disparity calculations in a stereo-vision scheme by using a recurrent mechanism that we call signal-symbol loop. Due to the local nature of low-level processing it is not always possible to estimate the correct disparity values produced...... at this level. Symbolic abstraction of the signal produces robust, high confidence, multimodal image features which can be used to interpret the scene more accurately and therefore disambiguate low-level interpretations by biasing the correct disparity. The fusion process is capable of producing more accurate...... dense disparity maps than the low- and symbolic-level algorithms can produce independently. Therefore we describe an efficient fusion scheme that allows symbolic- and low-level cues to complement each other, resulting in a more accurate and dense disparity representation of the scene....
GECO, un Gestor de Corpus colaborativo basado en web

Directory of Open Access Journals (Sweden)

Gerardo Sierra

2017-12-01

Full Text Available Este artículo presenta GEstor de COrpus (GECO, un software de gestión de corpus en línea que permite a los usuarios subir colecciones de documentos y volverlos corpus digitales. En el sistema, los corpus pueden ser procesados por otras aplicaciones, las cuales están implementadas como módulos integrados a la infraestructura de GECO. En este documento se describen a detalle sus características, así como la funcionalidad del generador de concordancias desarrollado en torno a él.
Corpus Based Authenicity Analysis of Language Teaching Course Books

Directory of Open Access Journals (Sweden)

Emrah PEKSOY

2017-12-01

Full Text Available In this study, the resemblance of the language learning course books used in Turkey to authentic language spoken by native speakers is explored by using a corpus-based approach. For this, the 10-million-word spoken part of the British National Corpus was selected as reference corpus. After that, all language learning course books used in high schools in Turkey were scanned and transferred to SketchEngine, an online corpus query tool. Lastly, certain grammar points were extracted first from British National Corpus and then from course books; similaritites and differences were compared. At the end of the study, it was found that the language learning course books have little similarity to authentic language in terms of certain grammatical items and frequency of their collocations. In this way, the points to be revised and changed were explored. In addition, this study emphasized the role of corpus approach as a material development and analysis tool; and tested the functionality of course books for writers and for Ministry of National Education.
Spontal-N: A Corpus of Interactional Spoken Norwegian

OpenAIRE

Sikveland, A.; Öttl, A.; Amdal, I.; Ernestus, M.; Svendsen, T.; Edlund, J.

2010-01-01

Spontal-N is a corpus of spontaneous, interactional Norwegian. To our knowledge, it is the first corpus of Norwegian in which the majority of speakers have spent significant parts of their lives in Sweden, and in which the recorded speech displays varying degrees of interference from Swedish. The corpus consists of studio quality audio- and video-recordings of four 30-minute free conversations between acquaintances, and a manual orthographic transcription of the entire material. On basis of t...
The Influence of Reference Corpus Size on Wordsmith Tools Keywords Extraction

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2012-05-01

Full Text Available A KeyWords analysis (using WordSmith Tools enables the discovery of lexical items which reveal the main lexical sets in a text or corpus. Such an analysis requires that a reference corpus be compared to the corpus the researcher intends to describe (the study corpus. This paper presents a mathematical method for finding out the influence of reference corpus size on the number of key words extracted by the program. The results reveal that a reference corpus that is at least five times as large as the study corpus allows for drawing an amount of key words that is statistically equivalent to larger reference corpora, thus suggesting five times (as larger as the study corpora as the minimum order of magnitude for reference corpora.
Metaphor and Corpus Linguistics Metáfora e linguística de corpus

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2011-01-01

Full Text Available In this paper, I look at four different aspects of metaphor research from a corpus linguistic perspective, namely: (1 the lexicogrammar of metaphors, which refers to the patterning of linguistic metaphor revealed by corpus analysis; (2 metaphor probabilities, which is a facet of metaphor that emerges from frequency-based studies of metaphor; (3 dimensions of metaphor variation, or the search for systematic parameters of variation in metaphor use across different registers; and (4 automated metaphor retrieval, which relates to the development of software to help identify metaphors in corpora. I argue that these four aspects are interrelated, and that advances in one of them can drive changes in the others.Neste artigo discuto quarto aspectos da pesquisa sobre metáfora do ponto de vista da linguística de corpus: (1 a lexicogramática das metáforas, que se refere aos padrões da metáfora linguística revelados pela análise de corpus; (2 probabilidades metafóricas, que é uma faceta da metáfora que emerge a partir dos estudos relacionados à freqüência de metáforas; (3 dimensões da variação de metáforas, ou a busca por parâmetros sistemáticos de variação de uso de metáfora em diferentes gêneros; e (4 captura automática de metáfora, que está relacionada ao desenvolvimento de softwares que auxiliam na identificação de metáforas em corpora. I defendo que esses quatro aspectos são interrelacionados, e que progressos em um deles podem acarretar mudanças nos outros.

ANALYSIS OF SPECIALISED COLLOCATIONS IN THE AREA OF REMOTE SENSING IN THE PERSPECTIVE OF PHRASEOLOGY

Directory of Open Access Journals (Sweden)

Diva Cardoso de CAMARGO

2013-12-01

Full Text Available The aim of this research is to build and analyze a parallel corpus in the field of remote sensing in order to identify, according to its frequency, specialized collocations in English and then search for their equivalents in Portuguese. The research is based on the interdisciplinary approach of Corpus-Based Translation Studies (BAKER, 1995; CAMARGO, 2007, Corpus Linguistics (BERBER SARDINHA, 2004; TOGNINI-BONELLI, 2001, Phraseology (ORENHA-OTTAIANO, 2009; PAVEL, 1993, and some principles of Terminology (BARROS, 2004. For manipulating the corpora, the program WordSmith Tools (SCOTT, 2012 version 6.0 is used. To support this study, two comparable corpora in English and Portuguese were also built from articles published in both national and international journals in remote sensing. The results show that the collocations in Portuguese seem to be still in the process of conventionalization, as the translators made use of greater variation in their translational options, which can be a way to make the text clearer for the reader.
The Nordic Dialect Corpus – a joint research infrastructure

Directory of Open Access Journals (Sweden)

Janne Bondi Johannessen

2011-06-01

Full Text Available The paper describes the Nordic Dialect Corpus as of June 2010. The corpus is a tool that combines a number of useful features that together makes it a unique and very advanced resource for researchers of many fields of language search. The corpus is web-based and features full audio-visual representation linked to transcriptions and translations.
Morphometric changes of the corpus callosum in congenital blindness

DEFF Research Database (Denmark)

Tomaiuolo, Francesco; Campana, Serena; Collins, D Louis

2014-01-01

We examined the effects of visual deprivation at birth on the development of the corpus callosum in a large group of congenitally blind individuals. We acquired high-resolution T1-weighted MRI scans in 28 congenitally blind and 28 normal sighted subjects matched for age and gender....... There was no overall group effect of visual deprivation on the total surface area of the corpus callosum. However, subdividing the corpus callosum into five subdivisions revealed significant regional changes in its three most posterior parts. Compared to the sighted controls, congenitally blind individuals showed a 12......% reduction in the splenium, and a 20% increase in the isthmus and the posterior part of the body. A shape analysis further revealed that the bending angle of the corpus callosum was more convex in congenitally blind compared to the sighted control subjects. The observed morphometric changes in the corpus...
arTenTen: Arabic Corpus and Word Sketches

Directory of Open Access Journals (Sweden)

Tressy Arts

2014-12-01

The article also presents the ‘sketch grammar’ (the basis for the word sketches in detail, describes the process of building and processing the corpus, and considers the role of the corpus in additional research on Arabic.
Building a comprehensive syntactic and semantic corpus of Chinese clinical texts.

Science.gov (United States)

He, Bin; Dong, Bin; Guan, Yi; Yang, Jinfeng; Jiang, Zhipeng; Yu, Qiubin; Cheng, Jianyi; Qu, Chunyan

2017-05-01

To build a comprehensive corpus covering syntactic and semantic annotations of Chinese clinical texts with corresponding annotation guidelines and methods as well as to develop tools trained on the annotated corpus, which supplies baselines for research on Chinese texts in the clinical domain. An iterative annotation method was proposed to train annotators and to develop annotation guidelines. Then, by using annotation quality assurance measures, a comprehensive corpus was built, containing annotations of part-of-speech (POS) tags, syntactic tags, entities, assertions, and relations. Inter-annotator agreement (IAA) was calculated to evaluate the annotation quality and a Chinese clinical text processing and information extraction system (CCTPIES) was developed based on our annotated corpus. The syntactic corpus consists of 138 Chinese clinical documents with 47,426 tokens and 2612 full parsing trees, while the semantic corpus includes 992 documents that annotated 39,511 entities with their assertions and 7693 relations. IAA evaluation shows that this comprehensive corpus is of good quality, and the system modules are effective. The annotated corpus makes a considerable contribution to natural language processing (NLP) research into Chinese texts in the clinical domain. However, this corpus has a number of limitations. Some additional types of clinical text should be introduced to improve corpus coverage and active learning methods should be utilized to promote annotation efficiency. In this study, several annotation guidelines and an annotation method for Chinese clinical texts were proposed, and a comprehensive corpus with its NLP modules were constructed, providing a foundation for further study of applying NLP techniques to Chinese texts in the clinical domain. Copyright © 2017. Published by Elsevier Inc.
Network Analysis with the Enron Email Corpus

Science.gov (United States)

Hardin, J. S.; Sarkis, G.; URC, P. .

2015-01-01

We use the Enron email corpus to study relationships in a network by applying six different measures of centrality. Our results came out of an in-semester undergraduate research seminar. The Enron corpus is well suited to statistical analyses at all levels of undergraduate education. Through this article's focus on centrality, students can explore…
Hemoperitoneum from corpus luteum rupture in patients with aplastic anemia.

Science.gov (United States)

Wang, Huaquan; Guo, Lifang; Shao, Zonghong

2015-01-01

Aplastic anemia is a rare hematopoietic stem-cell disorder that results in pancytopenia and hypocellular bone marrow. Women with aplastic anemia usually are at increased risk of corpus luteum rupture due to thrombocytopenia and infection. Here we report two cases had hemoperitoneum from corpus luteum rupture in patients with aplastic anemia in our center. Case 1 involved two episodes of hemoperitoneum resulting from rupture of the corpus luteum in a 23-year-old unmarried female with severe aplastic anemia. This patient was managed conservatively with platelet and packed red cell transfusion. Case 2 involved two episodes of hemoperitoneum resulting from rupture of the corpus luteum in a 33-year-old married patient with aplastic anemia. Emergency laparoscopy revealed massive hemoperitoneum. Bilateral salpingo-oophorectomy were performed successively with platelet and packed red cell transfusion. Hemoperitoneum resulting from a ruptured corpus luteum is a life-threatening condition in patients with aplastic anemia. Prompt and appropriate evaluation of corpus luteum rupture and emergent therapy are needed.
Web corpus construction

CERN Document Server

Schafer, Roland

2013-01-01

The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several adavantages of this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitudes the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and rem...
Divergent Approaches to Corpus Processing: The Need for ...

African Journals Online (AJOL)

Riette Ruthven

McEnery and Wilson (1996: 32) stress the importance of a corpus: 'As a stan- ... close to five million running words, and the Ndebele corpus at around three ... since their introduction and reinforcement through the second form of contact.
Clinical significance of the corpus callosum in cerebral palsy

International Nuclear Information System (INIS)

Lee, Eun Ja; Kim, Ji Chang; Kim, Jong Chul; And Others

2000-01-01

To evaluate, using magnetic resonance (MR) imaging, the clinal significance of the corpus callosum by measuring the size of various portions of the corpus callosum in children with cerebral palsy, and in paired controls. Fifty-two children (30 boys and 22 girls aged between six and 96 (median, 19) months) in whom cerebral palsy was clinically diagnosed underwent MR imaging. There were 23 term patients and 29 preterm, and the control group was selected by age and sex matching. Clinal subtypes of cerebral palsy were classified as hemiplegia (n=14), spastic diplegia (n=22), or spastic quadriplegia (n=16), and according to the severity of motor palsy, the condition was also classified as mild (n=26), moderate (n=13), or severe (n=13). In addition to the length and height of the corpus callosum, the thickness of its genu, body, transitional zone and splenium, as seen on midsagittal T1-weighted MR images, were also measured. Differences in the measured values of the two groups were statistically analysed and differences in the size of the corpus callosum according to the clinical severity and subtypes of cerebral palsy, and gestational age, were also assessed. Except for height, the measured values of the corpus callosum in patients with cerebral palsy were significantly less than those of the control group (p less than 0.05). Its size decreased according to the severity of motor palsy. Compared with term patients, the corpus callosum in preterm patients was considerably smaller (p less than 0.05). There was statistically significant correlation between the severity of motor palsy and the size of the corpus callosum. Quantitative evaluation of the corpus callosum might be a good indicator of neurologic prognosis, and a sensitive marker for assessing the extent of brain injury
Clinical significance of the corpus callosum in cerebral palsy

Energy Technology Data Exchange (ETDEWEB)

Lee, Eun Ja; Kim, Ji Chang [The Catholic University of Korea, Seoul (Korea, Republic of); Kim, Jong Chul [School of Medicine, Chungnam National University, Taejon (Korea, Republic of); And Others

2000-10-01

To evaluate, using magnetic resonance (MR) imaging, the clinal significance of the corpus callosum by measuring the size of various portions of the corpus callosum in children with cerebral palsy, and in paired controls. Fifty-two children (30 boys and 22 girls aged between six and 96 (median, 19) months) in whom cerebral palsy was clinically diagnosed underwent MR imaging. There were 23 term patients and 29 preterm, and the control group was selected by age and sex matching. Clinal subtypes of cerebral palsy were classified as hemiplegia (n=14), spastic diplegia (n=22), or spastic quadriplegia (n=16), and according to the severity of motor palsy, the condition was also classified as mild (n=26), moderate (n=13), or severe (n=13). In addition to the length and height of the corpus callosum, the thickness of its genu, body, transitional zone and splenium, as seen on midsagittal T1-weighted MR images, were also measured. Differences in the measured values of the two groups were statistically analysed and differences in the size of the corpus callosum according to the clinical severity and subtypes of cerebral palsy, and gestational age, were also assessed. Except for height, the measured values of the corpus callosum in patients with cerebral palsy were significantly less than those of the control group (p less than 0.05). Its size decreased according to the severity of motor palsy. Compared with term patients, the corpus callosum in preterm patients was considerably smaller (p less than 0.05). There was statistically significant correlation between the severity of motor palsy and the size of the corpus callosum. Quantitative evaluation of the corpus callosum might be a good indicator of neurologic prognosis, and a sensitive marker for assessing the extent of brain injury.
Comparative study on corpus development for Malay investment ...

African Journals Online (AJOL)

Comparative study on corpus development for Malay investment fraud detection in website. ... Journal of Fundamental and Applied Sciences ... The aim of this research is to develop a corpus for Malay investment fraud so that it can be used in ...
Concept annotation in the CRAFT corpus.

Science.gov (United States)

Bada, Michael; Eckert, Miriam; Evans, Donald; Garcia, Kristin; Shipley, Krista; Sitnikov, Dmitry; Baumgartner, William A; Cohen, K Bretonnel; Verspoor, Karin; Blake, Judith A; Hunter, Lawrence E

2012-07-09

Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml.
Corpus Linguistics, Network Analysis and Co-occurrence Matrices Corpus Linguistics, Network Analysis and Co-occurrence Matrices

Directory of Open Access Journals (Sweden)

Keith Stuart

2009-12-01

Full Text Available This article describes research undertaken in order to design a methodology for the reticular representation of knowledge of a specific discourse community. To achieve this goal, a representative corpus of the scientific production of the members of this discourse community (Universidad Politécnica de Valencia, UPV was created. The article presents the practical analysis (frequency, keyword, collocation and cluster analysis that was carried out in the initial phases of the study aimed at establishing the theoretical and practical background and framework for our matrix and network analysis of the scientific discourse of the UPV. In the methodology section, the processes that have allowed us to extract from the corpus the linguistic elements needed to develop co-occurrence matrices, as well as the computer tools used in the research, are described. From these co-occurrence matrices, semantic networks of subject and discipline knowledge were generated. Finally, based on the results obtained, we suggest that it may be viable to extract and to represent the intellectual capital of an academic institution using corpus linguistics methods in combination with the formulations of network theory.En este artículo describimos la investigación que se ha desarrollado en el diseño de una metodología para la representación reticular del conocimiento que se genera en el seno de una institución a partir de un corpus representativo de la producción científica de los integrantes de dicha comunidad discursiva, la Universidad Politécnica de Valencia.. Para ello, presentamos las acciones que se realizaron en las fases iniciales del estudio encaminadas a establecer el marco teórico y práctico en el que se inscribe nuestro análisis. En la sección de metodología se describen las herramientas informáticas utilizadas, así como los procesos que nos permitieron disponer de aquellos elementos presentes en el corpus, que nos llevarían al desarrollo de
A Balanced and Representative Corpus: The Effects of Strict Corpus ...

African Journals Online (AJOL)

Theoretically the Northern Sotho language is made up of almost 30 dialects while practically it is not so, because the standard language was formed from very few of its dialects. As a result, even today the language has no corpus which is balanced or representative owing to the fact that almost all of the available corpora ...
Learner corpus profiles the case of Romanian learner English

CERN Document Server

Chitez, Madalina

2014-01-01

The first three chapters of the book offer relevant information on the new methodological approach, learner corpus profiling, and the exemplifying case, Romanian Learner English. The description of the Romanian Corpus of Learner English is also given special attention. The following three chapters include corpus-based frequency analyses of selected grammatical categories (articles, prepositions, genitives), combined with error analyses. In the concluding discussion, the book summarizes the features compiled as lexico-grammatical profiles.
English Writing Teaching Model Dependent on Computer Network Corpus Drive Model

Directory of Open Access Journals (Sweden)

Shi Lei

2018-03-01

Full Text Available At present, the mainstream lexicalized English writing methods take only the corpus dependence between words into consideration, without introducing the corpus collocation and other issues. “Drive” is a relatively essential feature of words. And once the drive structure of a word is determined, it will be relatively clear what kinds of words to collocate with, hence the structure of the sentence can be derived relatively directly. In this paper, the English writing model that relies on the computer network corpus drive model is put forward. In this model, rich English corpus is introduced in the decomposition of the rules and the calculation of the probability, which includes not only the corpus dependence information, but also the drive structure and other corpus collocation information. Improved computer network corpus drive model is used to carry out the English writing teaching experiment. The experimental results show that the precision and the recall rate are 88.76% and 87.43%, respectively. The F value of the comprehensive index is improved by 6.65% compared with the Collins headword driven English modes of writing.
Using a Corpus in a 300-Level Spanish Grammar Course

Science.gov (United States)

Benavides, Carlos

2015-01-01

The present study examined the use and effectiveness of a large corpus--the Corpus del Español (Davies, 2002)--in a 300-level Spanish grammar university course. Students conducted hands-on corpus searches with the goal of finding concordances containing particular types of collocations (combinations of words that tend to co-occur) and tokens (any…
Insights from a Learner Corpus as Opposed to a Native Corpus about Cohesive Devices in an Academic Writing Context

Science.gov (United States)

Ersanli, Ceylan Yangin

2015-01-01

This study reports on the insights from an EFL learner corpora (a total of 151 essays and 49,690 words) generated from essays collected over the years in a Turkish state university from freshmen students enrolling in the Advanced Writing course. The comparison of cohesive devices in the non-native corpus (NNC) with those in a native corpus (NC)…
Processing and Representation of Ambiguous Words in Chinese Reading: Evidence from Eye Movements.

Science.gov (United States)

Shen, Wei; Li, Xingshan

2016-01-01

In the current study, we used eye tracking to investigate whether senses of polysemous words and meanings of homonymous words are represented and processed similarly or differently in Chinese reading. Readers read sentences containing target words which was either homonymous words or polysemous words. The contexts of text preceding the target words were manipulated to bias the participants toward reading the ambiguous words according to their dominant, subordinate, or neutral meanings. Similarly, disambiguating regions following the target words were also manipulated to favor either the dominant or subordinate meanings of ambiguous words. The results showed that there were similar eye movement patterns when Chinese participants read sentences containing homonymous and polysemous words. The study also found that participants took longer to read the target word and the disambiguating text following it when the prior context and disambiguating regions favored divergent meanings rather than the same meaning. These results suggested that homonymy and polysemy are represented similarly in the mental lexicon when a particular meaning (sense) is fully specified by disambiguating information. Furthermore, multiple meanings (senses) are represented as separate entries in the mental lexicon.

Computerized tomography of the traumatic hematoma in the corpus callosum

International Nuclear Information System (INIS)

Ogura, Koichiro; Yamamoto, Isao; Hara, Makoto; Suzuki, Yoshio; Nakane, Toshichi; Watanabe, Masao.

1982-01-01

The value of the computerized tomography (CT) in the diagnosis of the intracerebral hematoma has been well documented. However, there is little report about the CT findings of the hematoma of the corpus callosum. This report presents two cases of the traumatic hematoma in the corpus callosum and is discussed about their CT findings. Two patients, 52 year-old male and 40 year-old male, respectively, are the cases of blunt mechanical head trauma which were accompanied neither by skull fracture nor by scalp injury. In all these cases, the findings that hematoma occupied from the genu to the body of the corpus callosum were verified by surgery and the axial CT revealed the following two similar findings. First; the hematoma in the genu of the corpus callosum was shown as a cresent-shaped high density mass. This finding seems to be due to the following anatomical structure, that is, the genu of the corpus callosum is located just in front of the anterior horn of the lateral ventricles in the shape of the convex towards posteriorly. Second; as the midportion of the body of the corpus callosum tends to be appeared narrow in width between both lateral ventricles, the hematoma which extended from the genu towards the body of the corpus callosum was shown as a dumbbell-shaped high density mass. (author)
Experimental model of human corpus cavernosum smooth muscle relaxation

Directory of Open Access Journals (Sweden)

Rommel P. Regadas

2010-08-01

Full Text Available PURPOSE: To describe a technique for en bloc harvesting of the corpus cavernosum, cavernous artery and urethra from transplant organ donors and contraction-relaxation experiments with corpus cavernosum smooth muscle. MATERIALS AND METHODS: The corpus cavernosum was dissected to the point of attachment with the crus penis. A 3 cm segment (corpus cavernosum and urethra was isolated and placed in ice-cold sterile transportation buffer. Under magnification, the cavernous artery was dissected. Thus, 2 cm fragments of cavernous artery and corpus cavernosum were obtained. Strips measuring 3 x 3 x 8 mm3 were then mounted vertically in an isolated organ bath device. Contractions were measured isometrically with a Narco-Biosystems force displacement transducer (model F-60, Narco-Biosystems, Houston, TX, USA and recorded on a 4-channel Narco-Biosystems desk model polygraph. RESULTS: Phenylephrine (1µM was used to induce tonic contractions in the corpus cavernosum (3 - 5 g tension and cavernous artery (0.5 - 1g tension until reaching a plateau. After precontraction, smooth muscle relaxants were used to produce relaxation-response curves (10-12M to 10-4 M. Sodium nitroprusside was used as a relaxation control. CONCLUSION: The harvesting technique and the smooth muscle contraction-relaxation model described in this study were shown to be useful instruments in the search for new drugs for the treatment of human erectile dysfunction.
Corpus callosum dysgenesis and lipoma: embryologic and magnetic resonance imaging aspects

International Nuclear Information System (INIS)

Abreu Junior, Luiz de; Borri, Maria Lucia; Wolosker, Angela Maria Borri; Hartmann, Luiz Guilherme de Carvalho; Galvao Filho, Mario de Melo; D'Ippolito, Giuseppe

2005-01-01

The corpus callosum is the major system of association fibers that permits communication of both cerebral hemispheres. Magnetic resonance imaging has improved the study of brain malformations, including the corpus callosum dysgenesis. Lipoma is a common finding in the spectrum of corpus callosum dysgenesis. The purpose of these study was to review the embryologic events and the magnetic resonance imaging aspects related to the corpus callosum dysgenesis and to the formation of the related lipoma. (author)
The Shona Corpus and the Problem of Tagging?

Directory of Open Access Journals (Sweden)

Emmanuel Chabata

2011-10-01

Full Text Available
Abstract: In this paper the writer examines problems the African Languages Lexical (ALLEX Project (at present the African Languages Research Institute (ALRI? encountered while tagging the Shona corpus. The problems to be highlighted include general problems which apply to more than one language as well as problems peculiar to Shona. The paper was inspired by the challenges the writer encountered when he took part in building the Shona corpus. An analysis of the problems that most corpus builders face shows that more problems are likely to be encountered when dealing with spoken corpora than with written corpora. The paper demonstrates that tagging is an important component of corpus building as it makes it easier for a researcher to extract relevant data. To utilise the benefits of a tagged corpus, the tagging should be thorough and accurate. Wellinformed decisions form an integral part of the tagging process since the utility of a tagged corpus depends largely on the input of the tagging process. This paper shows the need to take the tagging process seriously.
Keywords: ALLEX PROJECT, COMPUTER, CORPUS, ENCODING, FOREIGN WORD, LEMMATIZATION, LEXICOGRAPHY, MONITOR CORPUS, PART OF SPEECH, SCANNING, SHONA, SLANG, TAGGING, TRANSCRIPTION, WORD
Opsomming: Die Shonakorpus en die probleem van etikettering, In hierdieartikel ondersoek die outeur probleme wat die African Languages Lexical (ALLEX Project (tansdie African Languages Research Institute (ALRI» teegekom het terwyl die Shonakorpus geetiketteeris. Die probleme wat bespreek word, sluit algemene probleme in wat van toepassing is opmeer as een taa, sowel as spesifieke probleme wat eie aan Shona is. Die artikel het sy ontstaan indie uitdagings wat die outeur teegekom het terwyl hy deel gehad het aan die opbou van die Shonakorpus.'n Ontieding van die probleme waarvoor die meeste korpusbouers te staan kom, toon datdaar waarskynlik meer probleme teegekom word wanneer daar met gesproke
Corpus-Based Research and Pedagogy in EAP: From Lexis to Genre

Science.gov (United States)

Flowerdew, Lynne

2015-01-01

This plenary paper showcases current corpus-based research on written academic English, illustrating the tight links that exist between corpus research and pedagogic applications. I first explicate Sinclair's concept of the "lexical approach", which underpins much corpus research and pedagogy. I then discuss studies which focus on…
Google and beyond : web-as-corpus methodologies for translators

OpenAIRE

Ferraresi, Adriano

2009-01-01

Aquest article fa un repàs als plantejaments actuals sobre l'ús del web com a corpus lingüístic i emfatitza els avantatges (així com els inevitables riscos) que aquests poden introduir en el treball del traductor. Per tal d'il•lustrar aquest punt, es mostra un exemple de les diferents maneres en què un corpus derivat del web es pot aplicar profitosament a una tasca de traducció especialitzada.. Este artículo estudia los planteamientos actuales sobre el uso de la web como corpus lingüístico...
The structure of an entry in the National corpus of Tuvan language

Directory of Open Access Journals (Sweden)

Mengi V. Ondar

2016-12-01

Full Text Available Contemporary information technologies and mathematical modelling has made creating corpora of natural languages significantly easier. A corpus is an information and reference system based on a collection of digitally processed texts. A corpus includes various written and oral texts in the given language, a set of dictionaries and markup – information on the properties of the text. It is the presence of the markup which distinguishes a corpus from an electronic library. At the moment, national corpora are being set up for many languages of the Russian Federation, including those of the Turkic peoples. Faculty members, postgraduate and undergraduate students at Tuvan State University and Siberian Federal University are working on the National corpus of Tuvan language. This article describes the structure of a dictionary entry in the National corpus of Tuvan language. The corpus database comprises the following tables: MAIN – the headword table, RUS, ENG, GER — translations of the headword into three languages, MORPHOLOGY — the table containing morphological data on the headword. The database is built in Microsoft Office Access. Working with the corpus dictionary includes the following functions: adding, editing and removing an entry, entry search (with transcription, setting and visualizing morphological features of a headword. The project allows us to view the corpus dictionary as a multi-structure entity with a complex hierarchical structure and a dictionary entry as its key component. The corpus dictionary we developed can be used for studying Tuvan language in its pronunciation, orthography and word analysis, as well as for searching for words and collocations in the texts included into the corpus.
Classification of acquired lesions of the corpus callosum with MRI

Energy Technology Data Exchange (ETDEWEB)

Friese, S.A.; Bitzer, M.; Voigt, K.; Kueker, W. [Tuebingen Univ. (Germany). Abt. fuer Neuroradiologie; Freudenstein, D. [Department of Neurosurgery, Eberhard-Karls-University Tuebingen (Germany)

2000-11-01

MRI has facilitated diagnostic assessment of the corpus callosum. Diagnostic classification of solitary or multiple lesions of the corpus callosum has not attracted much attention, although signal abnormalities are not uncommon. Our aim was to identify characteristic imaging features of lesions frequently encountered in practice. We reviewed the case histories of 59 patients with lesions shown on MRI. The nature of the lesions was based on clinical features and/or long term follow-up (ischaemic 20, Virchow-Robin spaces 3, diffuse axonal injury 7, multiple sclerosis 11, hydrocephalus 5, acute disseminated encephalomyelitis 5, Marchiafava-Bignami disease 4, lymphoma 2, glioblastoma hamartoma each 1). The location in the sagittal plane, the relationship to the borders of the corpus callosum and midline and the size were documented. The 20 ischaemic lesions were asymmetrical but adjacent to the midline; the latter was involved in new or large lesions. Diffuse axonal injury commonly resulted in large lesions, which tended to be asymmetrical; the midline and borders of the corpus callosum were always involved. Lesions in MS were small, at the lower border of the corpus callosum next to the septum pellucidum, and crossed the midline asymmetrically. Acute disseminated encephalomyelitis and the other perivenous inflammatory diseases caused relatively large, asymmetrical lesions. Hydrocephalus resulted in lesions of the upper part of the corpus callosum, and mostly in its posterior two thirds; they were found in the midline. Lesions in Marchiafava-Bignami disease were large, often symmetrically in the midline in the splenium and did not reach the edge of the corpus callosum. (orig.)
Use of "Google Scholar" in Corpus-Driven EAP Research

Science.gov (United States)

Brezina, Vaclav

2012-01-01

This primarily methodological article makes a proposition for linguistic exploration of textual resources available through the "Google Scholar" search engine. These resources ("Google Scholar virtual corpus") are significantly larger than any existing corpus of academic writing. "Google Scholar", however, was not designed for linguistic searches…
Role of the hippocampus and orbitofrontal cortex during the disambiguation of social cues in working memory

Science.gov (United States)

Ross, Robert S.; LoPresti, Matthew L.; Schon, Karin; Stern, Chantal E.

2013-01-01

Human social interactions are complex behaviors requiring the concerted effort of multiple neural systems to track and monitor the individuals around us. Cognitively, adjusting our behavior based on changing social cues such as facial expressions relies on working memory and the ability to disambiguate, or separate, representations of overlapping stimuli resulting from viewing the same individual with different facial expressions. We conducted an fMRI experiment examining brain regions contributing to the encoding, maintenance and retrieval of overlapping identity information during working memory using a delayed match-to-sample (DMS) task. In the overlapping condition, two faces from the same individual with different facial expressions were presented at sample. In the non-overlapping condition, the two sample faces were from two different individuals with different expressions. fMRI activity was assessed by contrasting the overlapping and non-overlapping condition at sample, delay, and test. The lateral orbitofrontal cortex showed increased fMRI signal in the overlapping condition in all three phases of the DMS task and increased functional connectivity with the hippocampus when encoding overlapping stimuli. The hippocampus showed increased fMRI signal at test. These data suggest lateral orbitofrontal cortex helps encode and maintain representations of overlapping stimuli in working memory while the orbitofrontal cortex and hippocampus contribute to the successful retrieval of overlapping stimuli. We suggest the lateral orbitofrontal cortex and hippocampus play a role in encoding, maintaining, and retrieving social cues, especially when multiple interactions with an individual need to be disambiguated in a rapidly changing social context in order to make appropriate social responses. PMID:23640112
DutchParl: A corpus of parliamentary documents in Dutch

NARCIS (Netherlands)

Marx, M.; Schuth, A.

2010-01-01

A corpus called DutchParl is created which aims to contain all digitally available parliamentary documents written in the Dutch language. The first version of DutchParl contains documents from the parliaments of The Netherlands, Flanders and Belgium. The corpus is divided along three dimensions: per
An annotated corpus with nanomedicine and pharmacokinetic parameters.

Science.gov (United States)

Lewinski, Nastassja A; Jimenez, Ivan; McInnes, Bridget T

2017-01-01

A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration's Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided.
Designing a Lexical Database for a Combined Use of Corpus Annotation and Dictionary Editing

DEFF Research Database (Denmark)

Kristoffersen, Jette Hedegaard; Troelsgård, Thomas; Langer, Gabriele

2016-01-01

In a combined corpus-dictionary project, you would need one lexical database that could serve as a shared “backbone” for both corpus annotation and dictionary editing, but it is not that easy to define a database structure that applies satisfactorily to both these purposes. In this paper, we...... will exemplify the problem and present ideas on how to model structures in a lexical database that facilitate corpus annotation as well as dictionary editing. The paper is a joint work between the DGS Corpus Project and the DTS Dictionary Project. The two projects come from opposite sides of the spectrum (one...... adjusting a lexical database grown from dictionary making for corpus annotating, one building a lexical database in parallel with corpus annotation and editing a corpus-based dictionary), and we will consider requirements and feasible structures for a database that can serve both corpus and dictionary....
The MR findings of the corpus callosum of normal young volunteers

International Nuclear Information System (INIS)

Okamoto, Kouichirou; Ito, Jusuke; Tokiguchi, Susumu

1990-01-01

The size and shape of the corpus callosum of twenty seven normal young volunteers (age 18-31 years, 17 men and 10 women) were investigated using a superconducting high field (1.5 T) MRI unit. The length of the corpus callosum was 71.1±5.1 mm (mean±S.D.) and the height was 24.9±2.1 mm. The length ratio of the corpus callosum to the brain was 43.9±2.3% with the ratio of the height 25.0±2.3%. The callosal index (height/length) was 35.4±2.9%. The area of the corpus callosum in the midsagittal plane was 681.4±93.6 mm 2 (min. 563 mm 2 to max. 902 mm 2 ). We divided the corpus callosum into three segments: rostrum and genu; anterior and posterior trunks; splenium. Each part accounts for one third of the total area of the corpus callosum. The genu and splenium were generally equal in thickness. The minimal thickness of the trunk was 3 mm with the maximal one 9 mm. The posterior trunk was never thicker than the anterior one. The posterior part of the posterior trunk showed thinning and concavity in almost all cases. So-called impressio corporis callosi was observed in 12 cases (44.4%). Thirteen cases (48.1%) showed a shallow concave configuration at the anterior dorsal surface of the corpus callosum. Six cases of these were thought to be due to compression by the pericallosal artery. This finding was not detected in the posterior portion of the corpus callosum. This concavity was also seen in infants. The thinning of the posterior part of the posterior trunk was seen after the development of the splenium, but the concave configuration at the anterior dorsal surface of the corpus callosum may be encountered before the full development of the genu and splenium. (author)
Language Planning: Corpus Planning.

Science.gov (United States)

Baldauf, Richard B., Jr.

1989-01-01

Focuses on the historical and sociolinguistic studies that illuminate corpus planning processes. These processes are broken down and discussed under two categories: those related to the establishment of norms, referred to as codification, and those related to the extension of the linguistic functions of language, referred to as elaboration. (60…
A corpus for plant-chemical relationships in the biomedical domain.

Science.gov (United States)

Choi, Wonjun; Kim, Baeksoo; Cho, Hyejin; Lee, Doheon; Lee, Hyunju

2016-09-20

Plants are natural products that humans consume in various ways including food and medicine. They have a long empirical history of treating diseases with relatively few side effects. Based on these strengths, many studies have been performed to verify the effectiveness of plants in treating diseases. It is crucial to understand the chemicals contained in plants because these chemicals can regulate activities of proteins that are key factors in causing diseases. With the accumulation of a large volume of biomedical literature in various databases such as PubMed, it is possible to automatically extract relationships between plants and chemicals in a large-scale way if we apply a text mining approach. A cornerstone of achieving this task is a corpus of relationships between plants and chemicals. In this study, we first constructed a corpus for plant and chemical entities and for the relationships between them. The corpus contains 267 plant entities, 475 chemical entities, and 1,007 plant-chemical relationships (550 and 457 positive and negative relationships, respectively), which are drawn from 377 sentences in 245 PubMed abstracts. Inter-annotator agreement scores for the corpus among three annotators were measured. The simple percent agreement scores for entities and trigger words for the relationships were 99.6 and 94.8 %, respectively, and the overall kappa score for the classification of positive and negative relationships was 79.8 %. We also developed a rule-based model to automatically extract such plant-chemical relationships. When we evaluated the rule-based model using the corpus and randomly selected biomedical articles, overall F-scores of 68.0 and 61.8 % were achieved, respectively. We expect that the corpus for plant-chemical relationships will be a useful resource for enhancing plant research. The corpus is available at http://combio.gist.ac.kr/plantchemicalcorpus .
Polyethylene glycol restores axonal conduction after corpus callosum transection

Directory of Open Access Journals (Sweden)

Ravinder Bamba

2017-01-01

Full Text Available Polyethylene glycol (PEG has been shown to restore axonal continuity after peripheral nerve transection in animal models. We hypothesized that PEG can also restore axonal continuity in the central nervous system. In this current experiment, coronal sectioning of the brains of Sprague-Dawley rats was performed after animal sacrifice. 3Brain high-resolution microelectrode arrays (MEA were used to measure mean firing rate (MFR and peak amplitude across the corpus callosum of the ex-vivo brain slices. The corpus callosum was subsequently transected and repeated measurements were performed. The cut ends of the corpus callosum were still apposite at this time. A PEG solution was applied to the injury site and repeated measurements were performed. MEA measurements showed that PEG was capable of restoring electrophysiology signaling after transection of central nerves. Before injury, the average MFRs at the ipsilateral, midline, and contralateral corpus callosum were 0.76, 0.66, and 0.65 spikes/second, respectively, and the average peak amplitudes were 69.79, 58.68, and 49.60 μV, respectively. After injury, the average MFRs were 0.71, 0.14, and 0.25 spikes/second, respectively and peak amplitudes were 52.11, 8.98, and 16.09 μV, respectively. After application of PEG, there were spikes in MFR and peak amplitude at the injury site and contralaterally. The average MFRs were 0.75, 0.55, and 0.47 spikes/second at the ipsilateral, midline, and contralateral corpus callosum, respectively and peak amplitudes were 59.44, 45.33, 40.02 μV, respectively. There were statistically differences in the average MFRs and peak amplitudes between the midline and non-midline corpus callosum groups (P < 0.01, P < 0.05. These findings suggest that PEG restores axonal conduction between severed central nerves, potentially representing axonal fusion.
Polyethylene glycol restores axonal conduction after corpus callosum transection.

Science.gov (United States)

Bamba, Ravinder; Riley, D Colton; Boyer, Richard B; Pollins, Alonda C; Shack, R Bruce; Thayer, Wesley P

2017-05-01

Polyethylene glycol (PEG) has been shown to restore axonal continuity after peripheral nerve transection in animal models. We hypothesized that PEG can also restore axonal continuity in the central nervous system. In this current experiment, coronal sectioning of the brains of Sprague-Dawley rats was performed after animal sacrifice. 3Brain high-resolution microelectrode arrays (MEA) were used to measure mean firing rate (MFR) and peak amplitude across the corpus callosum of the ex-vivo brain slices. The corpus callosum was subsequently transected and repeated measurements were performed. The cut ends of the corpus callosum were still apposite at this time. A PEG solution was applied to the injury site and repeated measurements were performed. MEA measurements showed that PEG was capable of restoring electrophysiology signaling after transection of central nerves. Before injury, the average MFRs at the ipsilateral, midline, and contralateral corpus callosum were 0.76, 0.66, and 0.65 spikes/second, respectively, and the average peak amplitudes were 69.79, 58.68, and 49.60 μV, respectively. After injury, the average MFRs were 0.71, 0.14, and 0.25 spikes/second, respectively and peak amplitudes were 52.11, 8.98, and 16.09 μV, respectively. After application of PEG, there were spikes in MFR and peak amplitude at the injury site and contralaterally. The average MFRs were 0.75, 0.55, and 0.47 spikes/second at the ipsilateral, midline, and contralateral corpus callosum, respectively and peak amplitudes were 59.44, 45.33, 40.02 μV, respectively. There were statistically differences in the average MFRs and peak amplitudes between the midline and non-midline corpus callosum groups ( P < 0.01, P < 0.05). These findings suggest that PEG restores axonal conduction between severed central nerves, potentially representing axonal fusion.
Parenting, corpus callosum, and executive function in preschool children.

Science.gov (United States)

Kok, Rianne; Lucassen, Nicole; Bakermans-Kranenburg, Marian J; van IJzendoorn, Marinus H; Ghassabian, Akhgar; Roza, Sabine J; Govaert, Paul; Jaddoe, Vincent W; Hofman, Albert; Verhulst, Frank C; Tiemeier, Henning

2014-01-01

In this longitudinal population-based study (N = 544), we investigated whether early parenting and corpus callosum length predict child executive function abilities at 4 years of age. The length of the corpus callosum in infancy was measured using postnatal cranial ultrasounds at 6 weeks of age. At 3 years, two aspects of parenting were observed: maternal sensitivity during a teaching task and maternal discipline style during a discipline task. Parents rated executive function problems at 4 years of age in five domains of inhibition, shifting, emotional control, working memory, and planning/organizing, using the Behavior Rating Inventory of Executive Function-Preschool Version. Maternal sensitivity predicted less executive function problems at preschool age. A significant interaction was found between corpus callosum length in infancy and maternal use of positive discipline to determine child inhibition problems: The association between a relatively shorter corpus callosum in infancy and child inhibition problems was reduced in children who experienced more positive discipline. Our results point to the buffering potential of positive parenting for children with biological vulnerability.
ANR Corpus architecturae religiosae europeae [CARE]saec. IV-X

Directory of Open Access Journals (Sweden)

Christian Sapin

2010-10-01

Full Text Available Le projet ANR «Corpus des monuments religieux antérieurs à l’an Mil» [Corpus architecturae religiosae europeae/CARE – IV-X saec.] a débuté en janvier 2008. Il représente l’apport de la France à un programme international, initié en 2002 par l’IRCLAMA de Zagreb (Croatie . Ce corpus a pour objectif de recenser les édifices religieux d’Europe entre le IVe siècle et le tout début du XIe siècle. Il regroupe déjà l’Italie, l’Espagne, la Croatie, l’Europe centrale et demain, probablement, l’Irlande...

Acute aortic dissection type A discloses Corpus alienum

Directory of Open Access Journals (Sweden)

Kolat Philipp

2009-01-01

Full Text Available Abstract We report an unusual case of an aortic type A dissection with a corpus alienum which compresses the right ventricle. The patient successfully underwent an aortic root replacement in deep hypothermia with re-implantation of the coronary arteries using a modified Bentall procedure and the resection of the corpus alienum. Intraoperative finding reveals 3 greatly adhered gauze compresses, which were most likely forgotten in the operation 34 years ago.
Corpus callosum thickness in children: an MR pattern-recognition approach on the midsagittal image

Energy Technology Data Exchange (ETDEWEB)

Andronikou, Savvas; Pillay, Tanyia; Gabuza, Lungile; Mahomed, Nasreen; Naidoo, Jaishree; Tebogo Hlabangana, Linda [University of the Witwatersrand, Radiology Department, Faculty of Health Sciences, Johannesburg (South Africa); Du Plessis, Vicci [University of KwaZulu-Natal, Radiology Department, Faculty of Health Sciences, Durban (South Africa); Prabhu, Sanjay P. [Harvard Medical School, Department of Radiology, Boston Children' s Hospital, Boston, MA (United States)

2014-08-31

Thickening of the corpus callosum is an important feature of development, whereas thinning of the corpus callosum can be the result of a number of diseases that affect development or cause destruction of the corpus callosum. Corpus callosum thickness reflects the volume of the hemispheres and responds to changes through direct effects or through Wallerian degeneration. It is therefore not only important to evaluate the morphology of the corpus callosum for congenital anomalies but also to evaluate the thickness of specific components or the whole corpus callosum in association with other findings. The goal of this pictorial review is raise awareness that the thickness of the corpus callosum can be a useful feature of pathology in pediatric central nervous system disease and must be considered in the context of the stage of development of a child. Thinning of the corpus callosum can be primary or secondary, and generalized or focal. Primary thinning is caused by abnormal or failed myelination related to the hypomyelinating leukoencephalopathies, metabolic disorders affecting white matter, and microcephaly. Secondary thinning of the corpus callosum can be caused by diffuse injury such as hypoxic-ischemic encephalopathy, human immunodeficiency virus (HIV) encephalopathy, hydrocephalus, dysmyelinating conditions and demyelinating conditions. Focal disturbance of formation or focal injury also causes localized thinning, e.g., callosal dysgenesis, metabolic disorders with localized effects, hypoglycemia, white matter injury of prematurity, HIV-related atrophy, infarction and vasculitis, trauma and toxins. The corpus callosum might be too thick because of a primary disorder in which the corpus callosum finding is essential to diagnosis; abnormal thickening can also be secondary to inflammation, infection and trauma. (orig.)
Corpus callosum thickness in children: an MR pattern-recognition approach on the midsagittal image

International Nuclear Information System (INIS)

Andronikou, Savvas; Pillay, Tanyia; Gabuza, Lungile; Mahomed, Nasreen; Naidoo, Jaishree; Tebogo Hlabangana, Linda; Du Plessis, Vicci; Prabhu, Sanjay P.

2015-01-01

Thickening of the corpus callosum is an important feature of development, whereas thinning of the corpus callosum can be the result of a number of diseases that affect development or cause destruction of the corpus callosum. Corpus callosum thickness reflects the volume of the hemispheres and responds to changes through direct effects or through Wallerian degeneration. It is therefore not only important to evaluate the morphology of the corpus callosum for congenital anomalies but also to evaluate the thickness of specific components or the whole corpus callosum in association with other findings. The goal of this pictorial review is raise awareness that the thickness of the corpus callosum can be a useful feature of pathology in pediatric central nervous system disease and must be considered in the context of the stage of development of a child. Thinning of the corpus callosum can be primary or secondary, and generalized or focal. Primary thinning is caused by abnormal or failed myelination related to the hypomyelinating leukoencephalopathies, metabolic disorders affecting white matter, and microcephaly. Secondary thinning of the corpus callosum can be caused by diffuse injury such as hypoxic-ischemic encephalopathy, human immunodeficiency virus (HIV) encephalopathy, hydrocephalus, dysmyelinating conditions and demyelinating conditions. Focal disturbance of formation or focal injury also causes localized thinning, e.g., callosal dysgenesis, metabolic disorders with localized effects, hypoglycemia, white matter injury of prematurity, HIV-related atrophy, infarction and vasculitis, trauma and toxins. The corpus callosum might be too thick because of a primary disorder in which the corpus callosum finding is essential to diagnosis; abnormal thickening can also be secondary to inflammation, infection and trauma. (orig.)
Diffusion tensor analysis of corpus callosum in progressive supranuclear palsy

International Nuclear Information System (INIS)

Ito, Shoichi; Makino, Takahiro; Shirai, Wakako; Hattori, Takamichi

2008-01-01

Progressive supranuclear palsy (PSP) is a neurodegenerative disease featuring parkinsonism, supranuclear ophthalmoplegia, dysphagia, and frontal lobe dysfunction. The corpus callosum which consists of many commissure fibers probably reflects cerebral cortical function. Several previous reports showed atrophy or diffusion abnormalities of anterior corpus callosum in PSP patients, but partitioning method used in these studies was based on data obtained in nonhuman primates. In this study, we performed a diffusion tensor analysis using a new partitioning method for the human corpus callosum. Seven consecutive patients with PSP were compared with 29 age-matched patients with Parkinson's Disease (PD) and 19 age-matched healthy control subjects. All subjects underwent diffusion tensor magnetic resonance imaging, and the corpus callosum was partitioned into five areas on the mid-sagittal plane according to a recently established topography of human corpus callosum (CC1-prefrontal area, CC2-premotor and supplementary motor area, CC3-motor area, CC4-sensory area, CC5-parietal, temporal, and occipital area). Fractional anisotropy (FA) and apparent diffusion coefficient (ADC) were measured in each area and differences between groups were analyzed. In the PSP group, FA values were significantly decreased in CC1 and CC2, and ADC values were significantly increased in CC1 and CC2. Receiver operating characteristic analysis showed excellent reliability of FA and ADC analyses of CC1 for differentiating PSP from PD. The anterior corpus callosum corresponding to the prefrontal, premotor, and supplementary motor cortices is affected in PSP patients. This analysis can be an additional test for further confirmation of the diagnosis of PSP
Diffusion tensor analysis of corpus callosum in progressive supranuclear palsy

Energy Technology Data Exchange (ETDEWEB)

Ito, Shoichi; Makino, Takahiro; Shirai, Wakako; Hattori, Takamichi [Department of Neurology, Graduate School of Medicine, Chiba University (Japan)

2008-11-15

Progressive supranuclear palsy (PSP) is a neurodegenerative disease featuring parkinsonism, supranuclear ophthalmoplegia, dysphagia, and frontal lobe dysfunction. The corpus callosum which consists of many commissure fibers probably reflects cerebral cortical function. Several previous reports showed atrophy or diffusion abnormalities of anterior corpus callosum in PSP patients, but partitioning method used in these studies was based on data obtained in nonhuman primates. In this study, we performed a diffusion tensor analysis using a new partitioning method for the human corpus callosum. Seven consecutive patients with PSP were compared with 29 age-matched patients with Parkinson's Disease (PD) and 19 age-matched healthy control subjects. All subjects underwent diffusion tensor magnetic resonance imaging, and the corpus callosum was partitioned into five areas on the mid-sagittal plane according to a recently established topography of human corpus callosum (CC1-prefrontal area, CC2-premotor and supplementary motor area, CC3-motor area, CC4-sensory area, CC5-parietal, temporal, and occipital area). Fractional anisotropy (FA) and apparent diffusion coefficient (ADC) were measured in each area and differences between groups were analyzed. In the PSP group, FA values were significantly decreased in CC1 and CC2, and ADC values were significantly increased in CC1 and CC2. Receiver operating characteristic analysis showed excellent reliability of FA and ADC analyses of CC1 for differentiating PSP from PD. The anterior corpus callosum corresponding to the prefrontal, premotor, and supplementary motor cortices is affected in PSP patients. This analysis can be an additional test for further confirmation of the diagnosis of PSP.
33 CFR 165.808 - Corpus Christi Ship Channel, Corpus Christi, TX, safety zone.

Science.gov (United States)

2010-07-01

... Petroleum Gas, the waters within a 500 yard radius of the LPG carrier while the vessel transits the Corpus Christi Ship Channel to the LPG receiving facility. The safety zone remains in effect until the LPG vessel is moored at the LPG receiving facility. (2) For outgoing tank vessels loaded with LPG, the waters...
Compiling a corpus-based dictionary grammar: an example for ...

African Journals Online (AJOL)

In this article it is shown how a corpus-based dictionary grammar may be compiled — that is, a mini-grammar fully based on corpus data and specifically written for use in and inte-grated with a dictionary. Such an effort is, to the best of our knowledge, a world's first. We exem-plify our approach for a Northern Sotho ...
The Corpus of Czech Verse

Czech Academy of Sciences Publication Activity Database

Plecháč, Petr; Kolár, Robert

2015-01-01

Roč. 2, č. 1 (2015), s. 107-118 ISSN 2346-6901 R&D Projects: GA ČR GAP406/11/1825 Institutional support: RVO:68378068 Keywords : Czech poetry * versification * corpus linguistics * theory of verse Subject RIV: AJ - Letters, Mass-media, Audiovision
Agenesis of the Corpus Callosum

Science.gov (United States)

... callosum, the structure that connects the two hemispheres (left and right) of the brain. In ACC the corpus callosum is partially or completely absent. It is caused by a disruption of brain cell migration during fetal development. ACC can occur as an isolated condition or ...
A massively parallel corpus: the Bible in 100 languages.

Science.gov (United States)

Christodouloupoulos, Christos; Steedman, Mark

We describe the creation of a massively parallel corpus based on 100 translations of the Bible. We discuss some of the difficulties in acquiring and processing the raw material as well as the potential of the Bible as a corpus for natural language processing. Finally we present a statistical analysis of the corpora collected and a detailed comparison between the English translation and other English corpora.
Lexical Properties of Slovene Sign Language: A Corpus-Based Study

Science.gov (United States)

Vintar, Špela

2015-01-01

Slovene Sign Language (SZJ) has as yet received little attention from linguists. This article presents some basic facts about SZJ, its history, current status, and a description of the Slovene Sign Language Corpus and Pilot Grammar (SIGNOR) project, which compiled and annotated a representative corpus of SZJ. Finally, selected quantitative data…
[Medicine and astrology in Arnau's corpus].

Science.gov (United States)

Giralt, Sebastià

2006-01-01

The role of astrology in Arnau de Vilanova's medical work is revisited with special attention to the problems of authorship posed by the astrological writings of Arnau's corpus and to their hypothetical chronology.
Developing a corpus of spoken language variability

Science.gov (United States)

Carmichael, Lesley; Wright, Richard; Wassink, Alicia Beckford

2003-10-01

We are developing a novel, searchable corpus as a research tool for investigating phonetic and phonological phenomena across various speech styles. Five speech styles have been well studied independently in previous work: reduced (casual), careful (hyperarticulated), citation (reading), Lombard effect (speech in noise), and ``motherese'' (child-directed speech). Few studies to date have collected a wide range of styles from a single set of speakers, and fewer yet have provided publicly available corpora. The pilot corpus includes recordings of (1) a set of speakers participating in a variety of tasks designed to elicit the five speech styles, and (2) casual peer conversations and wordlists to illustrate regional vowels. The data include high-quality recordings and time-aligned transcriptions linked to text files that can be queried. Initial measures drawn from the database provide comparison across speech styles along the following acoustic dimensions: MLU (changes in unit duration); relative intra-speaker intensity changes (mean and dynamic range); and intra-speaker pitch values (minimum, maximum, mean, range). The corpus design will allow for a variety of analyses requiring control of demographic and style factors, including hyperarticulation variety, disfluencies, intonation, discourse analysis, and detailed spectral measures.
The Shona Corpus and the Problem of Tagging | Chabata | Lexikos

African Journals Online (AJOL)

An analysis of the problems that most corpus builders face shows that more problems are likely to be encountered when dealing with spoken corpora than with written corpora. The paper demonstrates that tagging is an important component of corpus building as it makes it easier for a researcher to extract relevant data.
What Does Corpus Linguistics Have to Offer to Language Assessment?

Science.gov (United States)

Xi, Xiaoming

2017-01-01

In recent years, continuing advances in technology have increased the capacity to automate the extraction of a range of linguistic features of texts and thus have provided the impetus for the substantial growth of corpus linguistics. While corpus linguistic tools and methods have been used extensively in second language learning research, they…
Radiographic evaluation of 70 patients with absence of the corpus callosum

International Nuclear Information System (INIS)

Byrd, S.E.; Flannery, A.; Osborn, R.E.; Radkowski, M.A.; Naidich, T.P.; Bohan, T.P.

1987-01-01

Absence (agenesis) of the corpus callosum is one of the most common congenital malformations of the brain seen in the pediatric population. The authors used CT, MR imaging, or US to study 70 children with absence of the corpus callosum. Patients were divided into two groups; those with isolated absence of the corpus callosum, and those with other associated brain lesions. The associated brain lesions included interhemispheric arachnoid cyst, Dandy-Walker malformations, encephaloceles, and migrational disorders (heterotopias, schizencephaly, lissencaphaly, septo-optic dysplasia, lipoma, Chiari malformations, and holoprosenscephaly). The clinical presentations and radiologic findings are described
Bayesian stratified sampling to assess corpus utility

Energy Technology Data Exchange (ETDEWEB)

Hochberg, J.; Scovel, C.; Thomas, T.; Hall, S.

1998-12-01

This paper describes a method for asking statistical questions about a large text corpus. The authors exemplify the method by addressing the question, ``What percentage of Federal Register documents are real documents, of possible interest to a text researcher or analyst?`` They estimate an answer to this question by evaluating 200 documents selected from a corpus of 45,820 Federal Register documents. Bayesian analysis and stratified sampling are used to reduce the sampling uncertainty of the estimate from over 3,100 documents to fewer than 1,000. A possible application of the method is to establish baseline statistics used to estimate recall rates for information retrieval systems.
Corpus-based Studies on Nursing Textbooks

Directory of Open Access Journals (Sweden)

Alif Fairus Nor Mohamad

2013-07-01

Full Text Available English for Specific Purposes (ESP educators often face dilemma in deciding what lexical items to teach their students. In the field of English for Nursing Purposes (ENP, there is no exception on this issue as well. Only by analyzing the nursing corpus made up of essential core textbooks that can provide better insights and guide to both nursing students and educators. This research aims to highlight the 2,000 most frequently used nursing words across the core textbooks of nursing and to profile the types of ‘low frequency’ lexis which comprise the nursing corpus in terms of the General Service List (GSL and Academic Word List (AWL lexis coverage. By knowing the frequently used nursing words would further reduce students’ reading deficiency if the students use the 2000-word list.
An annotated corpus with nanomedicine and pharmacokinetic parameters

Directory of Open Access Journals (Sweden)

Lewinski NA

2017-10-01

Full Text Available Nastassja A Lewinski,1 Ivan Jimenez,1 Bridget T McInnes2 1Department of Chemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA, 2Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA Abstract: A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration’s Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided. Keywords: nanotechnology, informatics, natural language processing, text mining, corpora
Corpus callosum lipoma with frontal encephalocele

International Nuclear Information System (INIS)

Srinivasa Rao, A.; Rao, V.R.K.; Ravi Mandalam, K.; Gupta, A.K.; Kumar, S.; Joseph, S.; Unni, M.

1990-01-01

Computed tomographic and plain X-ray observations in a patient with corpus callosum lipoma associated with frontal encephalocele are reported. The rarity of the lesion and the specific diagnostic criteria on CT are emphasised. (orig.)

Sexual dimorphism of the human corpus callosum: Digital morphometric study

Directory of Open Access Journals (Sweden)

Spasojević Goran

2006-01-01

Full Text Available Background/Aim. Changes in the morphology and the size of the corpus callosum, are related to various pathological conditions. An analysis of these changes requires data about sexual dimorphism of the corpus callosum, which we tried to obtain in our study. We also investigated the method of digital morphometry and compared the obtained results with the results of other authors obtained by magnetic resonance imaging or by planimetry. Methods. A morphological research included 34 human brains (cadavers of both sexes − 19 female and 15 male aged 26−72 years. By digital morphometry using an AutoCAD software we performed measurements in the corpus callosum: the length (L, width in the half of its length (WW’, length of its cortical margin (LCM, area and perimeter of the anterior and posterior callosal segments, as well as the area and perimeter of the corpus callosum section area. The investigated parameters were analyzed and compared between the females and males. Results. There was not a statistically significant difference between the males and females in the investigated parameters of the corpus callosum (t test; p > 0.05, including the mean values of the two most important parameters, the surface of its midsagittal section area (males 654.11 mm2; females 677.40 mm2 and of its perimeter (males 19.61 cm; females 19.72 cm. The results obtained by digital morphometry were in the range of the results of other authors obtained by magnetic resonance and by planimetry. However, the value of Pearson coefficient of linear correlation between the section surface area and perimeter of the corpus callosum in the males was highly significant (rxy = 0.6943, p < 0.01, while in the females this value was statistically insignificant. Conclusion. Digital morphometry is accurate method in encephalometric investigations. Our results suggest that the problem of sexual dimorphism of the corpus callosum is very complex, because the identical variables (section
Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements

Directory of Open Access Journals (Sweden)

Danuta Roszko

2015-06-01

Full Text Available Experimental Polish-Lithuanian Corpus with the Semantic Annotation Elements In the article the authors present the experimental Polish-Lithuanian corpus (ECorpPL-LT formed for the idea of Polish-Lithuanian theoretical contrastive studies, a Polish-Lithuanian electronic dictionary, and as help for a sworn translator. The semantic annotation being brought into ECorpPL-LT is extremely useful in Polish-Lithuanian contrastive studies, and also proves helpful in translation work.
Textual, Genre and Social Features of Spoken Grammar: A Corpus-Based Approach

Directory of Open Access Journals (Sweden)

Carmen Pérez-Llantada

2009-02-01

Full Text Available This paper describes a corpus-based approach to teaching and learning spoken grammar for English for Academic Purposes with reference to Bhatia’s (2002 multi-perspective model for discourse analysis: a textual perspective, a genre perspective and a social perspective. From a textual perspective, corpus-informed instruction helps students identify grammar items through statistical frequencies, collocational patterns, context-sensitive meanings and discoursal uses of words. From a genre perspective, corpus observation provides students with exposure to recurrent lexico-grammatical patterns across different academic text types (genres. From a social perspective, corpus models can be used to raise learners’ awareness of how speakers’ different discourse roles, discourse privileges and power statuses are enacted in their grammar choices. The paper describes corpus-based instructional procedures, gives samples of learners’ linguistic output, and provides comments on the students’ response to this method of instruction. Data resulting from the assessment process and student production suggest that corpus-informed instruction grounded in Bhatia’s multi-perspective model can constitute a pedagogical approach in order to i obtain positive student responses from input and authentic samples of grammar use, ii help students identify and understand the textual, genre and social aspects of grammar in real contexts of use, and therefore iii help develop students’ ability to use grammar accurately and appropriately.
Transformation priming helps to disambiguate sudden changes of sensory inputs.

Science.gov (United States)

Pastukhov, Alexander; Vivian-Griffiths, Solveiga; Braun, Jochen

2015-11-01

Retinal input is riddled with abrupt transients due to self-motion, changes in illumination, object-motion, etc. Our visual system must correctly interpret each of these changes to keep visual perception consistent and sensitive. This poses an enormous challenge, as many transients are highly ambiguous in that they are consistent with many alternative physical transformations. Here we investigated inter-trial effects in three situations with sudden and ambiguous transients, each presenting two alternative appearances (rotation-reversing structure-from-motion, polarity-reversing shape-from-shading, and streaming-bouncing object collisions). In every situation, we observed priming of transformations as the outcome perceived in earlier trials tended to repeat in subsequent trials and this repetition was contingent on perceptual experience. The observed priming was specific to transformations and did not originate in priming of perceptual states preceding a transient. Moreover, transformation priming was independent of attention and specific to low level stimulus attributes. In summary, we show how "transformation priors" and experience-driven updating of such priors helps to disambiguate sudden changes of sensory inputs. We discuss how dynamic transformation priors can be instantiated as "transition energies" in an "energy landscape" model of the visual perception. Copyright © 2015 Elsevier Ltd. All rights reserved.
Copy Number Variations Found in Patients with a Corpus Callosum Abnormality and Intellectual Disability.

Science.gov (United States)

Heide, Solveig; Keren, Boris; Billette de Villemeur, Thierry; Chantot-Bastaraud, Sandra; Depienne, Christel; Nava, Caroline; Mignot, Cyril; Jacquette, Aurélia; Fonteneau, Eric; Lejeune, Elodie; Mach, Corinne; Marey, Isabelle; Whalen, Sandra; Lacombe, Didier; Naudion, Sophie; Rooryck, Caroline; Toutain, Annick; Caignec, Cédric Le; Haye, Damien; Olivier-Faivre, Laurence; Masurel-Paulet, Alice; Thauvin-Robinet, Christel; Lesne, Fabien; Faudet, Anne; Ville, Dorothée; des Portes, Vincent; Sanlaville, Damien; Siffroi, Jean-Pierre; Moutard, Marie-Laure; Héron, Delphine

2017-06-01

To evaluate the role that chromosomal micro-rearrangements play in patients with both corpus callosum abnormality and intellectual disability, we analyzed copy number variations (CNVs) in patients with corpus callosum abnormality/intellectual disability STUDY DESIGN: We screened 149 patients with corpus callosum abnormality/intellectual disability using Illumina SNP arrays. In 20 patients (13%), we have identified at least 1 CNV that likely contributes to corpus callosum abnormality/intellectual disability phenotype. We confirmed that the most common rearrangement in corpus callosum abnormality/intellectual disability is inverted duplication with terminal deletion of the 8p chromosome (3.2%). In addition to the identification of known recurrent CNVs, such as deletions 6qter, 18q21 (including TCF4), 1q43q44, 17p13.3, 14q12, 3q13, 3p26, and 3q26 (including SOX2), our analysis allowed us to refine the 2 known critical regions associated with 8q21.1 deletion and 19p13.1 duplication relevant for corpus callosum abnormality; report a novel 10p12 deletion including ZEB1 recently implicated in corpus callosum abnormality with corneal dystrophy; and) report a novel pathogenic 7q36 duplication encompassing SHH. In addition, 66 variants of unknown significance were identified in 57 patients encompassed candidate genes. Our results confirm the relevance of using microarray analysis as first line test in patients with corpus callosum abnormality/intellectual disability. Copyright © 2017 Elsevier Inc. All rights reserved.
Sirenomelia with agenesis of corpus callosum.

Science.gov (United States)

Shirani, Shapour; Rekabi, Vahab; Kamalian, Naser

2006-07-01

Sirenomelia is a very rare anomaly presented with fusion of the lower limbs. Genitourinary, neural tube, and vertebral anomalies are found in most cases. We report a case of sirenomelia with agenesis of corpus callosum, which has not been reported previously.
75 FR 31677 - Amendment of Class E Airspace; Corpus Christi, TX

Science.gov (United States)

2010-06-04

... Jose Island Airport, TX (Lat. 27[deg]56'40'' N., long. 96[deg]59'06'' W.) Rockport, Aransas County... Meacham Blvd., Fort Worth, TX 76137; telephone (817) 321- 7716. SUPPLEMENTARY INFORMATION: History On... Corpus Christi, TX [Amended] Corpus Christi International Airport, TX (Lat. 27[deg]46'13'' N., long. 97...
The Corpus of English as Lingua Franca in Academic Settings.

Science.gov (United States)

Mauranen, Anna

2003-01-01

Describes a project to make a corpus of English spoken as a lingua franca in university settings in Finland. This corpus is one of the first to address the need for corpora that show the target for English-as-a-Foreign-Language learners whose goal is not to speak with native speakers but to interact in communities where English is a lingua franca.…
Segmentation of corpus callosum using diffusion tensor imaging: validation in patients with glioblastoma

International Nuclear Information System (INIS)

Nazem-Zadeh, Mohammad-Reza; Saksena, Sona; Babajani-Fermi, Abbas; Jiang, Quan; Soltanian-Zadeh, Hamid; Rosenblum, Mark; Mikkelsen, Tom; Jain, Rajan

2012-01-01

This paper presents a three-dimensional (3D) method for segmenting corpus callosum in normal subjects and brain cancer patients with glioblastoma. Nineteen patients with histologically confirmed treatment naïve glioblastoma and eleven normal control subjects underwent DTI on a 3T scanner. Based on the information inherent in diffusion tensors, a similarity measure was proposed and used in the proposed algorithm. In this algorithm, diffusion pattern of corpus callosum was used as prior information. Subsequently, corpus callosum was automatically divided into Witelson subdivisions. We simulated the potential rotation of corpus callosum under tumor pressure and studied the reproducibility of the proposed segmentation method in such cases. Dice coefficients, estimated to compare automatic and manual segmentation results for Witelson subdivisions, ranged from 94% to 98% for control subjects and from 81% to 95% for tumor patients, illustrating closeness of automatic and manual segmentations. Studying the effect of corpus callosum rotation by different Euler angles showed that although segmentation results were more sensitive to azimuth and elevation than skew, rotations caused by brain tumors do not have major effects on the segmentation results. The proposed method and similarity measure segment corpus callosum by propagating a hyper-surface inside the structure (resulting in high sensitivity), without penetrating into neighboring fiber bundles (resulting in high specificity)
The Gutenberg English Poetry Corpus: Exemplary Quantitative Narrative Analyses

Directory of Open Access Journals (Sweden)

Arthur M. Jacobs

2018-04-01

Full Text Available This paper describes a corpus of about 3,000 English literary texts with about 250 million words extracted from the Gutenberg project that span a range of genres from both fiction and non-fiction written by more than 130 authors (e.g., Darwin, Dickens, Shakespeare. Quantitative narrative analysis (QNA is used to explore a cleaned subcorpus, the Gutenberg English Poetry Corpus (GEPC, which comprises over 100 poetic texts with around two million words from about 50 authors (e.g., Keats, Joyce, Wordsworth. Some exemplary QNA studies show author similarities based on latent semantic analysis, significant topics for each author or various text-analytic metrics for George Eliot’s poem “How Lisa Loved the King” and James Joyce’s “Chamber Music,” concerning, e.g., lexical diversity or sentiment analysis. The GEPC is particularly suited for research in Digital Humanities, Computational Stylistics, or Neurocognitive Poetics, e.g., as training and test corpus for stimulus development and control in empirical studies.
A Corpus-Based View of Lexical Gender in Written Business English

Science.gov (United States)

Fuertes-Olivera, Pedro A.

2007-01-01

This article investigates lexical gender in specialized communication. The key method of analysis is that of forms of address, professional titles, and "generic man" in a 10 million word corpus of written Business English. After a brief introduction and literature review on both gender in specialized communication and similar corpus-based views of…
Microstructural changes in thickened corpus callosum in children: contribution of magnetic resonance diffusion tensor imaging

Energy Technology Data Exchange (ETDEWEB)

Merlini, Laura; Anooshiravani, Mehrak; Kanavaki, Aikaterini; Hanquinet, Sylviane [University of Geneva Children' s Hospital, Pediatric Radiology Unit, Geneva (Switzerland)

2015-06-15

Thickened corpus callosum is a rare finding and its pathophysiology is not well known. An anomalous supracallosal bundle has been depicted by fiber tracking in some cases but no diffusion tensor imaging metrics of thickened corpus callosum have been reported. To use diffusion tensor imaging (DTI) in cases of thickened corpus callosum to help in understanding its clinical significance. During a 7-year period five children (ages 6 months to 15 years) with thickened corpus callosum were studied. We determined DTI metrics of fractional anisotropy (FA), mean diffusivity, and axial (λ1) and radial (λ2, λ3) diffusivity and performed 3-D fiber tracking reconstruction of the thickened corpus callosum. We compared our results with data from the literature and 24 age-matched controls. Brain abnormalities were seen in all cases. All children had at least three measurements of corpus callosum thickness above the 97th percentile according to age. In all children 3-D fiber tracking showed an anomalous supracallosal bundle and statistically significant decrease in FA (P = 0.003) and λ1 (P = 0.001) of the corpus callosum compared with controls, but no significant difference in mean diffusivity and radial diffusivity. Thickened corpus callosum was associated with abnormal bundles, suggesting underlying axonal guidance abnormality. DTI metrics suggested abnormal fiber compactness and density, which may be associated with alterations in cognition. (orig.)
A Corpus-based Study of English Vocabulary in Art Research Articles

Directory of Open Access Journals (Sweden)

Ping Wang

2017-09-01

Full Text Available The learning of English as a foreign language is an additional burden for art majors. This study aimed to examine high frequency words in art research articles to improve the efficiency of art majors’ English learning, especially their academic reading and writing. For this aim, the study built a corpus, analyzed data from art research articles and compared data with three base word lists. We found that the General Service List (GSL and the Academic Word List (AWL had a high coverage in our corpus, and there was a different high frequency word order in the Art Research Article Corpus (ARAC. These findings provide some implications for teaching English for art majors.
Estrogen and oxytocin receptors in the canine corpus luteum during pregnancy and parturition

Directory of Open Access Journals (Sweden)

Gisele Almeida Lima Veiga

2015-02-01

Full Text Available The expression of genes encoding the receptors for estrogen (ERαmRNA and oxytocin (OTRmRNA was studied in the corpus luteum during pregnancy and parturition in dogs. Real-time PCR was performed to quantify the levels of ERαmRNA and OTRmRNA in the corpus luteum of bitches during Early (up to 20 days of gestation, Mid (20 to 40 days and Late Pregnancy (40 to 60 days, and Parturition (first stage of labor. The corpus luteum expressed mRNA for OTR, however ERα mRNA was not detected. There was a reduction of OTR mRNA expression in the corpus luteum from gestational Day 20 onward, which suggests an important role of OTR mRNA in the mechanism of pregnancy recognition in dogs. We concluded that the expression of OTR mRNA in canine corpus luteum vary over time, which support the idea that the sensitivity and response to hormone therapy can vary along the course of pregnancy and labor. Moreover, the canine CL lacks ERα mRNA expression during pregnancy.
Determination of indices of the corpus callosum associated with normal aging in Japanese individuals

International Nuclear Information System (INIS)

Takeda, S.; Hirashima, Y.; Ikeda, H.; Yamamoto, H.; Endo, S.; Sugino, M.

2003-01-01

Indices of the corpus callosum with normal aging and their sex differences were elucidated using quantitative MRI. We studied 94 Japanese men (mean±SD 57.3±20.8 years, range 6-90 years) and 111 Japanese women (mean±SD 61.2±17.6 years, range 9-86 years) who had no intracranial lesions on MRI and no history of neurological illness. The widths of the rostrum, body and splenium, the anterior to posterior length, and the maximum height in the midsagittal image were selected for measurement. The Evans index, which is the relative ratio of lateral ventricle expansion, and the maximum width of the third ventricle in the axial image were also estimated for comparison. The widths of rostrum, body and splenium of the corpus callosum became thinner with age. Conversely, the anterior to posterior length and the maximum height of the corpus callosum increased with age. The ratio of the width of the body to the length of the corpus callosum and the ratio of the width of the body to the height of the corpus callosum are best correlated with age. No sex differences in regional size of corpus callosum, including these two ratios, were observed in any raw measures, although ventricular indices were larger in men than women. Evaluation of the ratio of the width of the body to its length and the ratio of the width of the body to its height may enable accurate estimation of normal or pathological changes of the corpus callosum. Aging and pathological atrophy of corpus callosum can be evaluated without any adjustment for gender. (orig.)
Determination of indices of the corpus callosum associated with normal aging in Japanese individuals

Energy Technology Data Exchange (ETDEWEB)

Takeda, S.; Hirashima, Y.; Ikeda, H.; Yamamoto, H.; Endo, S. [Department of Neurosurgery, Toyama Medical and Pharmaceutical University, Sugitani 2630, Toyama-shi, 930-0194, Toyama (Japan); Sugino, M. [Department of Neurosurgery, Sugino Hospital, Sengoku-cho 6-3-3, 930-0066, Toyama (Japan)

2003-08-01

Indices of the corpus callosum with normal aging and their sex differences were elucidated using quantitative MRI. We studied 94 Japanese men (mean{+-}SD 57.3{+-}20.8 years, range 6-90 years) and 111 Japanese women (mean{+-}SD 61.2{+-}17.6 years, range 9-86 years) who had no intracranial lesions on MRI and no history of neurological illness. The widths of the rostrum, body and splenium, the anterior to posterior length, and the maximum height in the midsagittal image were selected for measurement. The Evans index, which is the relative ratio of lateral ventricle expansion, and the maximum width of the third ventricle in the axial image were also estimated for comparison. The widths of rostrum, body and splenium of the corpus callosum became thinner with age. Conversely, the anterior to posterior length and the maximum height of the corpus callosum increased with age. The ratio of the width of the body to the length of the corpus callosum and the ratio of the width of the body to the height of the corpus callosum are best correlated with age. No sex differences in regional size of corpus callosum, including these two ratios, were observed in any raw measures, although ventricular indices were larger in men than women. Evaluation of the ratio of the width of the body to its length and the ratio of the width of the body to its height may enable accurate estimation of normal or pathological changes of the corpus callosum. Aging and pathological atrophy of corpus callosum can be evaluated without any adjustment for gender. (orig.)
Corpus callosum tissue loss and development of motor and global cognitive impairment

DEFF Research Database (Denmark)

Frederiksen, Kristian S; Garde, Ellen; Skimminge, Arnold

2011-01-01

To examine the impact of corpus callosum (CC) tissue loss on the development of global cognitive and motor impairment in the elderly.......To examine the impact of corpus callosum (CC) tissue loss on the development of global cognitive and motor impairment in the elderly....
Corpus Juris ja Eesti : [bakalaureusetöö] / Artur Kink ; Tartu Ülikool, õigusteaduskond ; juhendaja: Eerik Kergandberg

Index Scriptorium Estoniae

Kink, Artur

1999-01-01

Corpus Jurise taust ja areng - finantshuvide kaitse areng, Corpus Jurise ajalugu; Corpus Jurise õiguslik baas (Amsterdami leping), ülesehitus ja struktuur (üleeuroopalise territoriaalsuse printsiip, kohtuliku kontrolli printsiip, "vastuväitelise" protsessi printsiip, kohaliku õiguse subsidiaarsuse printsiip)
On immune responsiveness of the organism of patients with corpus uteri cancer

International Nuclear Information System (INIS)

Gorodilova, V.V.; Yatskovskaya, N.L.

1978-01-01

Studied were some immunological indices in patients with cancer of corpus uteri. An attempt was made to elucidate a possible dependence of immunological indices on the process propagation rate and treatment methods. Updated methods used for uteri corpus cancer treatment except for progestinotherapeutics promote the decrease of organism responsiveness. Radiation therapy applied with total therapeutic dose has especially pronounced immunodepressing effect. Progestine series preparations result in the differentiation effect on tumours in some patients with cancer of corpus uteri, which clinically manifests in decreasing the tumour and even complete elimination. Simultaneously immunological indices in such patients are improved
Topics in Corpus-Based Dutch Syntax

NARCIS (Netherlands)

Beek, Leonoor Johanneke van der

2005-01-01

In this dissertation, corpus data is applied in various kinds of linguistic analyses. The data serves as a source of examples and counterexamples in a theoretical linguistic analysis of the Dutch cleft construction, as the source of quantitative data in a probabilistic account of the dative

Dictionary Writing System (DWS + Corpus Query Package (CQP: The Case of TshwaneLex

Directory of Open Access Journals (Sweden)

Gilles-Maurice de Schryver

2011-10-01

Full Text Available
Abstract: In this article the integrated corpus query functionality of the dictionary compilation software TshwaneLex is analysed. Attention is given to the handling of both raw corpus data and annotated corpus data. With regard to the latter it is shown how, with a minimum of human effort, machine learning techniques can be employed to obtain part-of-speech tagged corpora that can be used for lexicographic purposes. All points are illustrated with data drawn from English and Northern Sotho. The tools and techniques themselves, however, are language-independent, and as such the encouraging outcomes of this study are far-reaching.
Keywords: LEXICOGRAPHY, DICTIONARY, SOFTWARE, DICTIONARY WRITING SYS-TEM (DWS, CORPUS QUERY PACKAGE (CQP, TSHWANELEX, CORPUS, CORPUS ANNO-TATION, PART-OF-SPEECH TAGGER (POS-TAGGER, MACHINE LEARNING, NORTHERN SOTHO (SESOTHO SA LEBOA
Samenvatting: Woordenboekaanmaaksysteem + corpusanalysepakket: een studie van TshwaneLex. In dit artikel wordt het geïntegreerde corpusanalysepakket van het woordenboekaanmaaksysteem TshwaneLex geanalyseerd. Aandacht gaat zowel naar het verwer-ken van onbewerkte corpusdata als naar geannoteerde corpusdata. Wat het laatste betreft wordt aangetoond hoe, met een minimum aan intellectuele arbeid, automatische leertechnieken met suc-ces kunnen worden ingezet om corpora voor lexicografische doeleinden aan te maken waarin de woordklassen expliciet worden vermeld. Alle stappen van de redenering worden geïllustreerd met gegevens uit het Engels en Noord-Sotho. De instrumenten en technieken zelf zijn echter allemaal taalonafhankelijk, waardoor de veelbelovende resultaten van deze studie verreikend zijn.
Sleutelwoorden: LEXICOGRAFIE, WOORDENBOEK, SOFTWARE, WOORDENBOEK-AANMAAKSYSTEEM, CORPUSANALYSEPAKKET, TSHWANELEX, CORPUS, CORPUSANNO-TATIE, WOORDKLASSETAGGER, AUTOMATISCHE LEERTECHNIEKEN, NOORD-SOTHO
Using text mining techniques to extract phenotypic information from the PhenoCHF corpus.

Science.gov (United States)

Alnazzawi, Noha; Thompson, Paul; Batista-Navarro, Riza; Ananiadou, Sophia

2015-01-01

Phenotypic information locked away in unstructured narrative text presents significant barriers to information accessibility, both for clinical practitioners and for computerised applications used for clinical research purposes. Text mining (TM) techniques have previously been applied successfully to extract different types of information from text in the biomedical domain. They have the potential to be extended to allow the extraction of information relating to phenotypes from free text. To stimulate the development of TM systems that are able to extract phenotypic information from text, we have created a new corpus (PhenoCHF) that is annotated by domain experts with several types of phenotypic information relating to congestive heart failure. To ensure that systems developed using the corpus are robust to multiple text types, it integrates text from heterogeneous sources, i.e., electronic health records (EHRs) and scientific articles from the literature. We have developed several different phenotype extraction methods to demonstrate the utility of the corpus, and tested these methods on a further corpus, i.e., ShARe/CLEF 2013. Evaluation of our automated methods showed that PhenoCHF can facilitate the training of reliable phenotype extraction systems, which are robust to variations in text type. These results have been reinforced by evaluating our trained systems on the ShARe/CLEF corpus, which contains clinical records of various types. Like other studies within the biomedical domain, we found that solutions based on conditional random fields produced the best results, when coupled with a rich feature set. PhenoCHF is the first annotated corpus aimed at encoding detailed phenotypic information. The unique heterogeneous composition of the corpus has been shown to be advantageous in the training of systems that can accurately extract phenotypic information from a range of different text types. Although the scope of our annotation is currently limited to a single
Rheumatic diseases in the Corpus Hippocraticum

Directory of Open Access Journals (Sweden)

G. Squillace

2011-09-01

Full Text Available Medecine of V and IV centuries B.C. attested in the Corpus Hippocraticum ascribes all diseases to the rheuma, i.e. the flux of humours into the body. This flux produces not only the rise of cold, hoarsness, cough, reddenings, dropsy, but also arthritis, sciatica, gout.
English Collocation Learning through Corpus Data: On-Line Concordance and Statistical Information

Science.gov (United States)

Ohtake, Hiroshi; Fujita, Nobuyuki; Kawamoto, Takeshi; Morren, Brian; Ugawa, Yoshihiro; Kaneko, Shuji

2012-01-01

We developed an English Collocations On Demand system offering on-line corpus and concordance information to help Japanese researchers acquire a better command of English collocation patterns. The Life Science Dictionary Corpus consists of approximately 90,000,000 words collected from life science related research papers published in academic…
A case of total agenesis of the corpus callosum

International Nuclear Information System (INIS)

Sakamoto, Masanobu; Takeda, Katsuhiko; Bandou, Mitsuaki; Murayama, Shigeo; Sakuta, Manabu

1985-01-01

We have reported a case of agenesis of the corpus callosum, in which NMR-CT revealed a complete defect of it, and have examined the localization of the speech center of this patient. The patient is a right-handed 26-year-old man who has complained of headache on the parietal region. His neurological examination revealed only a mild mental difficulty (IQ 77). X-ray CT showed the lateral ventricles to be separated widely and the posterior horns dilated, which were compatible with the agenesis of the corpus callosum. Further, NMR-CT has revealed a total agenesis of the corpus callosum. NMR-CT seems to be highly useful for the detection of the degree of the callosal defect. We have carried out the intracarotid amobarbital injection (Wada's test) for the determination of the lateralization of cerebral speech dominance. It had been reported by some authors that when it comes to the cerebral speech dominance, acallosal patients had no difference between each hemisphere. However, our results have demonstrated a left sided dominance. (author)
NCBI disease corpus: a resource for disease name recognition and concept normalization.

Science.gov (United States)

Doğan, Rezarta Islamaj; Leaman, Robert; Lu, Zhiyong

2014-02-01

Information encoded in natural language in biomedical literature publications is only useful if efficient and reliable ways of accessing and analyzing that information are available. Natural language processing and text mining tools are therefore essential for extracting valuable information, however, the development of powerful, highly effective tools to automatically detect central biomedical concepts such as diseases is conditional on the availability of annotated corpora. This paper presents the disease name and concept annotations of the NCBI disease corpus, a collection of 793 PubMed abstracts fully annotated at the mention and concept level to serve as a research resource for the biomedical natural language processing community. Each PubMed abstract was manually annotated by two annotators with disease mentions and their corresponding concepts in Medical Subject Headings (MeSH®) or Online Mendelian Inheritance in Man (OMIM®). Manual curation was performed using PubTator, which allowed the use of pre-annotations as a pre-step to manual annotations. Fourteen annotators were randomly paired and differing annotations were discussed for reaching a consensus in two annotation phases. In this setting, a high inter-annotator agreement was observed. Finally, all results were checked against annotations of the rest of the corpus to assure corpus-wide consistency. The public release of the NCBI disease corpus contains 6892 disease mentions, which are mapped to 790 unique disease concepts. Of these, 88% link to a MeSH identifier, while the rest contain an OMIM identifier. We were able to link 91% of the mentions to a single disease concept, while the rest are described as a combination of concepts. In order to help researchers use the corpus to design and test disease identification methods, we have prepared the corpus as training, testing and development sets. To demonstrate its utility, we conducted a benchmarking experiment where we compared three different
A 38 million words Dutch text corpus and its users | Kruyt | Lexikos

African Journals Online (AJOL)

In August 1996, the 38 Million Words Corpus was available for consultation by the international research community. The present paper reports on the characteristics of this corpus (design, text classification, linguistic annotation) and on its use, both in dictionary projects and in linguistic research. In spite of limitations with ...
Preparing an annotated gold standard corpus to share with extramural investigators for de-identification research.

Science.gov (United States)

Deleger, Louise; Lingren, Todd; Ni, Yizhao; Kaiser, Megan; Stoutenborough, Laura; Marsolo, Keith; Kouril, Michal; Molnar, Katalin; Solti, Imre

2014-08-01

The current study aims to fill the gap in available healthcare de-identification resources by creating a new sharable dataset with realistic Protected Health Information (PHI) without reducing the value of the data for de-identification research. By releasing the annotated gold standard corpus with Data Use Agreement we would like to encourage other Computational Linguists to experiment with our data and develop new machine learning models for de-identification. This paper describes: (1) the modifications required by the Institutional Review Board before sharing the de-identification gold standard corpus; (2) our efforts to keep the PHI as realistic as possible; (3) and the tests to show the effectiveness of these efforts in preserving the value of the modified data set for machine learning model development. In a previous study we built an original de-identification gold standard corpus annotated with true Protected Health Information (PHI) from 3503 randomly selected clinical notes for the 22 most frequent clinical note types of our institution. In the current study we modified the original gold standard corpus to make it suitable for external sharing by replacing HIPAA-specified PHI with newly generated realistic PHI. Finally, we evaluated the research value of this new dataset by comparing the performance of an existing published in-house de-identification system, when trained on the new de-identification gold standard corpus, with the performance of the same system, when trained on the original corpus. We assessed the potential benefits of using the new de-identification gold standard corpus to identify PHI in the i2b2 and PhysioNet datasets that were released by other groups for de-identification research. We also measured the effectiveness of the i2b2 and PhysioNet de-identification gold standard corpora in identifying PHI in our original clinical notes. Performance of the de-identification system using the new gold standard corpus as a training set was very
Magnetic resonance findings of the corpus callosum in canine and feline lysosomal storage diseases.

Science.gov (United States)

Hasegawa, Daisuke; Tamura, Shinji; Nakamoto, Yuya; Matsuki, Naoaki; Takahashi, Kimimasa; Fujita, Michio; Uchida, Kazuyuki; Yamato, Osamu

2013-01-01

Several reports have described magnetic resonance (MR) findings in canine and feline lysosomal storage diseases such as gangliosidoses and neuronal ceroid lipofuscinosis. Although most of those studies described the signal intensities of white matter in the cerebrum, findings of the corpus callosum were not described in detail. A retrospective study was conducted on MR findings of the corpus callosum as well as the rostral commissure and the fornix in 18 cases of canine and feline lysosomal storage diseases. This included 6 Shiba Inu dogs and 2 domestic shorthair cats with GM1 gangliosidosis; 2 domestic shorthair cats, 2 familial toy poodles, and a golden retriever with GM2 gangliosidosis; and 2 border collies and 3 chihuahuas with neuronal ceroid lipofuscinoses, to determine whether changes of the corpus callosum is an imaging indicator of those diseases. The corpus callosum and the rostral commissure were difficult to recognize in all cases of juvenile-onset gangliosidoses (GM1 gangliosidosis in Shiba Inu dogs and domestic shorthair cats and GM2 gangliosidosis in domestic shorthair cats) and GM2 gangliosidosis in toy poodles with late juvenile-onset. In contrast, the corpus callosum and the rostral commissure were confirmed in cases of GM2 gangliosidosis in a golden retriever and canine neuronal ceroid lipofuscinoses with late juvenile- to early adult-onset, but were extremely thin. Abnormal findings of the corpus callosum on midline sagittal images may be a useful imaging indicator for suspecting lysosomal storage diseases, especially hypoplasia (underdevelopment) of the corpus callosum in juvenile-onset gangliosidoses.
Magnetic resonance findings of the corpus callosum in canine and feline lysosomal storage diseases.

Directory of Open Access Journals (Sweden)

Daisuke Hasegawa

Full Text Available Several reports have described magnetic resonance (MR findings in canine and feline lysosomal storage diseases such as gangliosidoses and neuronal ceroid lipofuscinosis. Although most of those studies described the signal intensities of white matter in the cerebrum, findings of the corpus callosum were not described in detail. A retrospective study was conducted on MR findings of the corpus callosum as well as the rostral commissure and the fornix in 18 cases of canine and feline lysosomal storage diseases. This included 6 Shiba Inu dogs and 2 domestic shorthair cats with GM1 gangliosidosis; 2 domestic shorthair cats, 2 familial toy poodles, and a golden retriever with GM2 gangliosidosis; and 2 border collies and 3 chihuahuas with neuronal ceroid lipofuscinoses, to determine whether changes of the corpus callosum is an imaging indicator of those diseases. The corpus callosum and the rostral commissure were difficult to recognize in all cases of juvenile-onset gangliosidoses (GM1 gangliosidosis in Shiba Inu dogs and domestic shorthair cats and GM2 gangliosidosis in domestic shorthair cats and GM2 gangliosidosis in toy poodles with late juvenile-onset. In contrast, the corpus callosum and the rostral commissure were confirmed in cases of GM2 gangliosidosis in a golden retriever and canine neuronal ceroid lipofuscinoses with late juvenile- to early adult-onset, but were extremely thin. Abnormal findings of the corpus callosum on midline sagittal images may be a useful imaging indicator for suspecting lysosomal storage diseases, especially hypoplasia (underdevelopment of the corpus callosum in juvenile-onset gangliosidoses.
Magnetic Resonance Findings of the Corpus Callosum in Canine and Feline Lysosomal Storage Diseases

Science.gov (United States)

Hasegawa, Daisuke; Tamura, Shinji; Nakamoto, Yuya; Matsuki, Naoaki; Takahashi, Kimimasa; Fujita, Michio; Uchida, Kazuyuki; Yamato, Osamu

2013-01-01

Several reports have described magnetic resonance (MR) findings in canine and feline lysosomal storage diseases such as gangliosidoses and neuronal ceroid lipofuscinosis. Although most of those studies described the signal intensities of white matter in the cerebrum, findings of the corpus callosum were not described in detail. A retrospective study was conducted on MR findings of the corpus callosum as well as the rostral commissure and the fornix in 18 cases of canine and feline lysosomal storage diseases. This included 6 Shiba Inu dogs and 2 domestic shorthair cats with GM1 gangliosidosis; 2 domestic shorthair cats, 2 familial toy poodles, and a golden retriever with GM2 gangliosidosis; and 2 border collies and 3 chihuahuas with neuronal ceroid lipofuscinoses, to determine whether changes of the corpus callosum is an imaging indicator of those diseases. The corpus callosum and the rostral commissure were difficult to recognize in all cases of juvenile-onset gangliosidoses (GM1 gangliosidosis in Shiba Inu dogs and domestic shorthair cats and GM2 gangliosidosis in domestic shorthair cats) and GM2 gangliosidosis in toy poodles with late juvenile-onset. In contrast, the corpus callosum and the rostral commissure were confirmed in cases of GM2 gangliosidosis in a golden retriever and canine neuronal ceroid lipofuscinoses with late juvenile- to early adult-onset, but were extremely thin. Abnormal findings of the corpus callosum on midline sagittal images may be a useful imaging indicator for suspecting lysosomal storage diseases, especially hypoplasia (underdevelopment) of the corpus callosum in juvenile-onset gangliosidoses. PMID:24386203
Jointly learning word embeddings using a corpus and a knowledge base

Science.gov (United States)

Bollegala, Danushka; Maehara, Takanori; Kawarabayashi, Ken-ichi

2018-01-01

Methods for representing the meaning of words in vector spaces purely using the information distributed in text corpora have proved to be very valuable in various text mining and natural language processing (NLP) tasks. However, these methods still disregard the valuable semantic relational structure between words in co-occurring contexts. These beneficial semantic relational structures are contained in manually-created knowledge bases (KBs) such as ontologies and semantic lexicons, where the meanings of words are represented by defining the various relationships that exist among those words. We combine the knowledge in both a corpus and a KB to learn better word embeddings. Specifically, we propose a joint word representation learning method that uses the knowledge in the KBs, and simultaneously predicts the co-occurrences of two words in a corpus context. In particular, we use the corpus to define our objective function subject to the relational constrains derived from the KB. We further utilise the corpus co-occurrence statistics to propose two novel approaches, Nearest Neighbour Expansion (NNE) and Hedged Nearest Neighbour Expansion (HNE), that dynamically expand the KB and therefore derive more constraints that guide the optimisation process. Our experimental results over a wide-range of benchmark tasks demonstrate that the proposed method statistically significantly improves the accuracy of the word embeddings learnt. It outperforms a corpus-only baseline and reports an improvement of a number of previously proposed methods that incorporate corpora and KBs in both semantic similarity prediction and word analogy detection tasks. PMID:29529052
Mind-modelling with corpus stylistics in David Copperfield.

Science.gov (United States)

Stockwell, Peter; Mahlberg, Michaela

2015-05-01

We suggest an innovative approach to literary discourse by using corpus linguistic methods to address research questions from cognitive poetics. In this article, we focus on the way that readers engage in mind-modelling in the process of characterisation. The article sets out our cognitive poetic model of characterisation that emphasises the continuity between literary characterisation and real-life human relationships. The model also aims to deal with the modelling of the author's mind in line with the modelling of the minds of fictional characters. Crucially, our approach to mind-modelling is text-driven. Therefore we are able to employ corpus linguistic techniques systematically to identify textual patterns that function as cues triggering character information. In this article, we explore our understanding of mind-modelling through the characterisation of Mr. Dick from David Copperfield by Charles Dickens. Using the CLiC tool (Corpus Linguistics in Cheshire) developed for the exploration of 19th-century fiction, we investigate the textual traces in non-quotations around this character, in order to draw out the techniques of characterisation other than speech presentation. We show that Mr. Dick is a thematically and authorially significant character in the novel, and we move towards a rigorous account of the reader's modelling of authorial intention.
Handedness and corpus callosal morphology in Williams syndrome.

Science.gov (United States)

Martens, Marilee A; Wilson, Sarah J; Chen, Jian; Wood, Amanda G; Reutens, David C

2013-02-01

Williams syndrome is a neurodevelopmental genetic disorder caused by a hemizygous deletion on chromosome 7q11.23, resulting in atypical brain structure and function, including abnormal morphology of the corpus callosum. An influence of handedness on the size of the corpus callosum has been observed in studies of typical individuals, but handedness has not been taken into account in studies of callosal morphology in Williams syndrome. We hypothesized that callosal area is smaller and the size of the splenium and isthmus is reduced in individuals with Williams syndrome compared to healthy controls, and examined age, sex, and handedness effects on corpus callosal area. Structural magnetic resonance imaging scans were obtained on 25 individuals with Williams syndrome (18 right-handed, 7 left-handed) and 25 matched controls. We found that callosal thickness was significantly reduced in the splenium of Williams syndrome individuals compared to controls. We also found novel evidence that the callosal area was smaller in left-handed participants with Williams syndrome than their right-handed counterparts, with opposite findings observed in the control group. This novel finding may be associated with LIM-kinase hemizygosity, a characteristic of Williams syndrome. The findings may have significant clinical implications in future explorations of the Williams syndrome cognitive phenotype.
Working Together: Contributions of Corpus Analyses and Experimental Psycholinguistics to Understanding Conversation.

Science.gov (United States)

Meyer, Antje S; Alday, Phillip M; Decuyper, Caitlin; Knudsen, Birgit

2018-01-01

As conversation is the most important way of using language, linguists and psychologists should combine forces to investigate how interlocutors deal with the cognitive demands arising during conversation. Linguistic analyses of corpora of conversation are needed to understand the structure of conversations, and experimental work is indispensable for understanding the underlying cognitive processes. We argue that joint consideration of corpus and experimental data is most informative when the utterances elicited in a lab experiment match those extracted from a corpus in relevant ways. This requirement to compare like with like seems obvious but is not trivial to achieve. To illustrate this approach, we report two experiments where responses to polar (yes/no) questions were elicited in the lab and the response latencies were compared to gaps between polar questions and answers in a corpus of conversational speech. We found, as expected, that responses were given faster when they were easy to plan and planning could be initiated earlier than when they were harder to plan and planning was initiated later. Overall, in all but one condition, the latencies were longer than one would expect based on the analyses of corpus data. We discuss the implication of this partial match between the data sets and more generally how corpus and experimental data can best be combined in studies of conversation.
Quantification of structural changes in the corpus callosumin children with profound hypoxic-ischaemic brain injury

Energy Technology Data Exchange (ETDEWEB)

Stivaros, Stavros M. [Manchester Academic Health Science Centre, Academic Unit of Paediatric Radiology, Royal Manchester Children' s Hospital, Central Manchester University Hospitals NHS Foundation Trust, Manchester (United Kingdom); University of Manchester, Centre for Imaging Sciences, Institute of Population Health, Manchester (United Kingdom); Radon, Mark R. [The Walton Centre NHS Foundation Trust, Department of Neuroradiology, Liverpool (United Kingdom); Mileva, Reneta; Gledson, Ann; Keane, John A. [University of Manchester, School of Computer Science, Manchester (United Kingdom); Connolly, Daniel J.A.; Batty, Ruth [Sheffield Children' s Hospital NHS Foundation Trust, Department of Neuroradiology, Sheffield (United Kingdom); Cowell, Patricia E. [University of Sheffield, Department of Human Communication Sciences, Sheffield (United Kingdom); Hoggard, Nigel; Griffiths, Paul D. [University of Sheffield, Academic Unit of Radiology, Sheffield (United Kingdom); Wright, Neville B.; Tang, Vivian [Manchester Academic Health Science Centre, Academic Unit of Paediatric Radiology, Royal Manchester Children' s Hospital, Central Manchester University Hospitals NHS Foundation Trust, Manchester (United Kingdom)

2016-01-15

Birth-related acute profound hypoxic-ischaemic brain injury has specific patterns of damage including the paracentral lobules. To test the hypothesis that there is anatomically coherent regional volume loss of the corpus callosum as a result of this hemispheric abnormality. Study subjects included 13 children with proven acute profound hypoxic-ischaemic brain injury and 13 children with developmental delay but no brain abnormalities. A computerised system divided the corpus callosum into 100 segments, measuring each width. Principal component analysis grouped the widths into contiguous anatomical regions. We conducted analysis of variance of corpus callosum widths as well as support vector machine stratification into patient groups. There was statistically significant narrowing of the mid-posterior body and genu of the corpus callosum in children with hypoxic-ischaemic brain injury. Support vector machine analysis yielded over 95% accuracy in patient group stratification using the corpus callosum centile widths. Focal volume loss is seen in the corpus callosum of children with hypoxic-ischaemic brain injury secondary to loss of commissural fibres arising in the paracentral lobules. Support vector machine stratification into the hypoxic-ischaemic brain injury group or the control group on the basis of corpus callosum width is highly accurate and points towards rapid clinical translation of this technique as a potential biomarker of hypoxic-ischaemic brain injury. (orig.)
Quantification of structural changes in the corpus callosumin children with profound hypoxic-ischaemic brain injury

International Nuclear Information System (INIS)

Stivaros, Stavros M.; Radon, Mark R.; Mileva, Reneta; Gledson, Ann; Keane, John A.; Connolly, Daniel J.A.; Batty, Ruth; Cowell, Patricia E.; Hoggard, Nigel; Griffiths, Paul D.; Wright, Neville B.; Tang, Vivian

2016-01-01

Birth-related acute profound hypoxic-ischaemic brain injury has specific patterns of damage including the paracentral lobules. To test the hypothesis that there is anatomically coherent regional volume loss of the corpus callosum as a result of this hemispheric abnormality. Study subjects included 13 children with proven acute profound hypoxic-ischaemic brain injury and 13 children with developmental delay but no brain abnormalities. A computerised system divided the corpus callosum into 100 segments, measuring each width. Principal component analysis grouped the widths into contiguous anatomical regions. We conducted analysis of variance of corpus callosum widths as well as support vector machine stratification into patient groups. There was statistically significant narrowing of the mid-posterior body and genu of the corpus callosum in children with hypoxic-ischaemic brain injury. Support vector machine analysis yielded over 95% accuracy in patient group stratification using the corpus callosum centile widths. Focal volume loss is seen in the corpus callosum of children with hypoxic-ischaemic brain injury secondary to loss of commissural fibres arising in the paracentral lobules. Support vector machine stratification into the hypoxic-ischaemic brain injury group or the control group on the basis of corpus callosum width is highly accurate and points towards rapid clinical translation of this technique as a potential biomarker of hypoxic-ischaemic brain injury. (orig.)
Partial segmental thrombosis of the corpus cavernosum: imaging findings.

Science.gov (United States)

Moya-Sánchez, E; Medina-Benítez, A; Medina-Salas, V; Fernández-Navarro, L

2018-03-05

Partial segmental thrombosis of the corpus cavernosum is an unusual clinical condition of unknown origin that mainly affects young males, whose characteristic presentation is the appearance of unexplained perineal pain associated with a palpable perineal mass. This entity consists of thrombosis in the perineal portion of the corpus cavernosum, usually unilateral and it is associated with underlying malignant pathologies and predisposing factors such as microtrauma. After the adequate adherence to conservative treatment, the appearance of complications such as erectile dysfunction is very uncommon. Copyright © 2018 SERAM. Publicado por Elsevier España, S.L.U. All rights reserved.
Analysing Culture and Interculture in Saudi EFL Textbooks: A Corpus Linguistic Approach

Science.gov (United States)

Almujaiwel, Sultan

2018-01-01

This paper combines corpus processing tools to investigate the cultural elements of Saudi education of English as a foreign language (EFL). The latest Saudi EFL textbooks (2016 onwards) are available in researchable PDF formats. This helps process them through corpus search software tools. The method adopted is based on analysing 20 cultural…
MR measurement of normal corpus callosum: Age and sex differentiation

International Nuclear Information System (INIS)

Lee, Myung Seob; Kim, Myung Soon; Park, Hyun Ju

1992-01-01

Measurement of various portions of the corpus callosum was performed on magnetic resonance(MR) images of 114 subjects with no known or suspected corpus callosal disorders. Midsagittal T1-weighted images used for measurements and mean diameters of various portions in each age and sex group were obtained. Measures of five portions were made: (A) the anterio-posterior length, (B) the diameter of genu position, (C) the diameter of splenium, (D) the diameter of mid-body portion, (E) the diameter of a narrow portion at the body of corpus callosum. The mean diameter in each gender group for A, B, C, D and E were 68.8 mm, 12.1 mm, 12.3 mm, 6,9 mm, 4.1 mm in male and 69.9 mm, 12.0 mm, 12.1 mm, 6.4 mm, 4.1 mm in female, retrospectively. The groups of 0-9 years of both genders showed the minimum mean value in each portion

MR imaging of spastic diplegia. The importance of corpus callosum

International Nuclear Information System (INIS)

Hayakawa, K.; Kanda, T.; Hashimoto, K.; Okuno, Y.; Yamori, Y.; Yuge, M.; Ando, R.; Ozaki, N.; Tamamoto, A.

1996-01-01

Purpose: The MR findings in patients with spastic diplegia were investigated and the role of MR imaging in assessing the extent of brain injury was evaluated. Material and Methods: 39 male and 24 female patients (preterm/term 43/20) were imaged using a 0.5 T MR system. Results: The MR findings in term patients were quite different from those in preterm patients; 55% of the term patients showed normal and minimal changes on MR, whereas 90.7% of the 43 preterm children had periventricular leucomalacia. The deep cerebral white matter was the most frequently involved site. Objective measurements revealed significant reductions of the entire sagittal area of corpus callosum in diplegic patients in comparison with normal controls. The motor plasy severity correlated well with the extent of corpus callosum involvement. Conclusion: The corpus callosum appears to be a sensitive marker site for the assessment of the extent of white matter injury. (orig.)
MR measurement of normal corpus callosum: Age and sex differentiation

Energy Technology Data Exchange (ETDEWEB)

Lee, Myung Seob; Kim, Myung Soon; Park, Hyun Ju [Wonju College of Medicine, Yonsei University, Wonju (Korea, Republic of)

1992-07-15

Measurement of various portions of the corpus callosum was performed on magnetic resonance(MR) images of 114 subjects with no known or suspected corpus callosal disorders. Midsagittal T1-weighted images used for measurements and mean diameters of various portions in each age and sex group were obtained. Measures of five portions were made: (A) the anterio-posterior length, (B) the diameter of genu position, (C) the diameter of splenium, (D) the diameter of mid-body portion, (E) the diameter of a narrow portion at the body of corpus callosum. The mean diameter in each gender group for A, B, C, D and E were 68.8 mm, 12.1 mm, 12.3 mm, 6,9 mm, 4.1 mm in male and 69.9 mm, 12.0 mm, 12.1 mm, 6.4 mm, 4.1 mm in female, retrospectively. The groups of 0-9 years of both genders showed the minimum mean value in each portion.
Form of the male and female corpus callosum internal organization at the mature age

Directory of Open Access Journals (Sweden)

Юрий Петрович Костиленко

2016-04-01

Full Text Available Aim: to study the special features of the male and female corpus callosum internal organization at the mature age.Materials and methods: the total preparations of the male and female corpus callosum (10 preparation of each sex at 45–60 years old were used as the material. The given preparations were used to get from it the plate cuts in the two mutually perpendicular planes with 2 mm. thick. Then the received tissue plates of the corpus callosum underwent plastination in the epoxy. Then the preparations were extracted from the non-polymerized epoxy and placed on the polyethylene film that was covered with the other film of the same size. Further this stratified block was placed amid the two glasses of the equal size that shrunk together by placing the small load on it. After the complete polymerization the received epoxy plates with the corpus callosum tissue contained in it underwent the gentle grinding and the accurate polish and as the result was obtained the surface denudation of its tissue structures that were colored with the 1 % solution of blue methylene for 1% borax solution.Results of research: at the study of the corpus callosum plastinated cuts in saggital plane was revealed that the transverse platen-form elevations of its higher surface are the cord-form tenias standing out from within and going through the corpus callosum. At its studying in the transverse cut was established that in adults can be separated two types of corpus callosum by its density: the dense one and disperse one.At the large increases of the binocular loupe (microscope MBS-9 can be seen the gaps between the adjacent commissural cords. Within it can be detected the blood vessels. On the transverse cut of commissural cords in its depth are revealed the thinnest streaks which totality consists of the two alternate dark and light lines that form the layered striation. Among the series of the light lines are visible the interlayer that separate the whole depth of
Corpus multimedia VEIGA inglés-galego de subtitulación cinematográfica

Directory of Open Access Journals (Sweden)

Patricia Sotelo Dios

2012-01-01

Full Text Available Neste artigo presento un proxecto de investigación que consiste na compilación e na explotación do corpus Veiga, un corpus multimedia de subtítulos en inglés e en galego. Trátase dun proxecto en fase de desenvolvemento que pretende servir como ferramenta para o estudo e a investigación de certos aspectos relacionados coa práctica da subtitulación intralingüística en inglés e da subtitulación interlingüística do inglés cara ao galego. O Veiga, inda que forma parte do corpus paralelo CLUVI, transcende o plano textual propio dos demais subcorpus do CLUVI e permite observar os subtítulos no seu estado natural, isto é, como parte dun produto audiovisual. Amais de cuestións relacionadas coa construción do corpus e co sistema de buscas, mencionarei algunha das posibles utilidades deste corpus para a práctica, a investigación e a formación en subtitulación.
A Lingüística de corpus: história, problemas, legitimidade

Directory of Open Access Journals (Sweden)

Jacqueline Léon

2006-01-01

Full Text Available During the nineties, the accessibility of large corpora and the possibility of manipulation of enormous quantities of linguistic data was the origin of a renewal of interest in statistical and probability evidences that served to directly question linguistics about its objectives, methods and foundations. This interest gained increasing importance and became important currently under the name of corpus linguistics, a field of dominant research in language science. In this article we will show that the designation corpus linguistics covers considerably heterogeneous theoretical positions and research, topics. We show how corpus linguistics, originally of british origin, was later endowed with historical and theoretical legitimacy while at the same time intending to establish itself as a new paradigm in language science. Finally we distinguish two attitudes inside the british tradition: one, intending to build the studies on a corpus and in a new paradigm based on a retrospective construction of the critical works of chomsky during the years 1959 and 1960, which was intended to legitimize the studies; the other attitude involves the continuity of the tradition of british empirical linguistics.
Cholesterol transport and steroidogenesis by the corpus luteum

Directory of Open Access Journals (Sweden)

Christenson Lane K

2003-11-01

Full Text Available Abstract The synthesis of progesterone by the corpus luteum is essential for the establishment and maintenance of early pregnancy. Regulation of luteal steroidogenesis can be broken down into three major events; luteinization (i.e., conversion of an ovulatory follicle, luteal regression, and pregnancy induced luteal maintenance/rescue. While the factors that control these events and dictate the final steroid end products are widely varied among different species, the composition of the corpus luteum (luteinized thecal and granulosa cells and the enzymes and proteins involved in the steroidogenic pathway are relatively similar among all species. The key factors involved in luteal steroidogenesis and several new exciting observations regarding regulation of luteal steroidogenic function are discussed in this review.
Penile erection responses of Nigella sativa seed extract on isolated rat corpus cavernosum

Science.gov (United States)

Aminyoto, M.; Ismail, S.

2018-04-01

Nigella sativa L. (NS) from Ranunculaceae family is known as black cumin in Indonesia. The seed has been used as an aphrodisiac in ethnobotanical studies and reported to have pharmacological activities such as antihypertensive through the relaxant effect of vascular smooth muscles but the direct effect to the blood vessels of the corpus cavernosum is still unknown. The purpose of this study was to examine the response of NS seed extract on penile erection in vitro. NS seeds were macerated in ethanol solvent for three days in room temperature and repeated for two times. Penile erection responses was assessed using isolated rat corpus cavernosum in Krebs-Henseleit solution, temperature 37°C, pH 7.4, aerated with carbogen gas. After acclimation, corpus cavernosum was contracted with a phenylephrine solution. Ethanolic extract of NS seeds or control solution were given after reaching the plateu phase of the highest contraction. This study showed that the contraction response of the corpus cavernosum decreased after addition of NS extract and this action was increased with the addition of the extract concentration. This study concluded that NS seed ethanol extract affects the penile erection response directly through the relaxation of blood vessels in the corpus cavernosum.
Edition des Corpus areopagiticum slavicum

Directory of Open Access Journals (Sweden)

Dieter Fahl

2005-12-01

Full Text Available An Edition of the Corpus areopagiticum slavicum In the fourteenth century, the monk Isaiah of the holy Mount Athos translated the writings of pseudo-Dionysius the Areopagite (c. end of the 5th century, core texts for Eastern and Western European theological and philosophical thought, from Greek into Church Slavonic. This first Slavic translation of Dionysius’ oeuvre (“De Coelesti Hierarchia,” “De Ecclesiastica Hierarchia,” “De Divinis Nominibus,” “De Mystica Theologia,” the epistles and scholia, which played a significant role in the development of Slavic culture, Orthodox Slavic socio-political theory and praxis, is still central to the study of Slavia Orthodoxa. A working group of German and Russian scholars has completed an edition of the translator’s Church Slavonic autograph with an en face reconstruction of the Greek text used by the translator and philological commentary. A Church Slavonic-Greek and Greek-Church Slavonic dictionary of this edition, currently in preparation, plans to make the terminology used in this influential translation accessible to interdisciplinary researchers. For the first time, the Church Slavonic lexica of this corpus, a substantial part of which was coined by the translator, will be registered in an index of words and forms.
Corpus gastritis in patients with endoscopic diagnosis of reflux oesophagitis and Barrett's oesophagus.

NARCIS (Netherlands)

Laheij, R.J.F.; Rossum, L.G.M. van; Boer, W.A. de; Jansen, J.B.M.J.

2002-01-01

BACKGROUND: A high level of gastric acid secretion is considered to be a risk factor for reflux oesophagitis or Barrett's oesophagus. Corpus gastritis may have a protective effect on the oesophagus, because of decreased gastric acid output. AIM: To determine if corpus gastritis is associated with
TwiMed: Twitter and PubMed Comparable Corpus of Drugs, Diseases, Symptoms, and Their Relations.

Science.gov (United States)

Alvaro, Nestor; Miyao, Yusuke; Collier, Nigel

2017-05-03

Work on pharmacovigilance systems using texts from PubMed and Twitter typically target at different elements and use different annotation guidelines resulting in a scenario where there is no comparable set of documents from both Twitter and PubMed annotated in the same manner. This study aimed to provide a comparable corpus of texts from PubMed and Twitter that can be used to study drug reports from these two sources of information, allowing researchers in the area of pharmacovigilance using natural language processing (NLP) to perform experiments to better understand the similarities and differences between drug reports in Twitter and PubMed. We produced a corpus comprising 1000 tweets and 1000 PubMed sentences selected using the same strategy and annotated at entity level by the same experts (pharmacists) using the same set of guidelines. The resulting corpus, annotated by two pharmacists, comprises semantically correct annotations for a set of drugs, diseases, and symptoms. This corpus contains the annotations for 3144 entities, 2749 relations, and 5003 attributes. We present a corpus that is unique in its characteristics as this is the first corpus for pharmacovigilance curated from Twitter messages and PubMed sentences using the same data selection and annotation strategies. We believe this corpus will be of particular interest for researchers willing to compare results from pharmacovigilance systems (eg, classifiers and named entity recognition systems) when using data from Twitter and from PubMed. We hope that given the comprehensive set of drug names and the annotated entities and relations, this corpus becomes a standard resource to compare results from different pharmacovigilance studies in the area of NLP. ©Nestor Alvaro, Yusuke Miyao, Nigel Collier. Originally published in JMIR Public Health and Surveillance (http://publichealth.jmir.org), 03.05.2017.
Analysis of high signal intensities of nontumorous conditions of corpus callosum on magnetic resonance T2-weighted images

International Nuclear Information System (INIS)

Kang, Moo Song; Kim, Chul Min; Chung, Chun Phil

1995-01-01

To evaluate high signal intensity of nontumorous conditions of corpus callosum on T2-weighted MR images. Forty nine patients with nontumorous high signal intensities involving corpus callosum on sagittal T2-weighted image were retrospectively analyzed. Nontumorous condition of corpus callosum were diffuse axonal injury (DAI, 19 cases), cerebral infarctions (16 cases), multiple sclerosis (MS, 5 cases), Wilson's disease (2 cases) and hydrocephalus (7 cases) that were diagnosed by clinical and MR findings. Numbers, configuration, involved thickness and sites of high signal intensities of corpus callosum were analyzed. DAI and infarctions showed either single or multiple lesions. MS and hydrocephalus showed multiple lesions, but Wilson's diseases showed single lesion. In DAI, infarctions and MS the lesions involved any part of corpus callosum, splenium in Wilson's disease, and all parts of corpus callosum in hydrocephalus. Wilson's disease showed only partial thickness involvement, and others involved partial or full thickness of corpus callosum. Configuration of high signal intensity was linear in most cases of hydrocephalus, and oval in Wilson's disease, and oval and confluent in MS, and variable in DAI and infarctions. High signal intensities of nontumorous conditions of corpus callosum revealed variable findings, and therefore, analysis of nontumorous high signal intensities of corpus callosum is not made by only MR findings but by conjuction with clinical aspects
Macroscopic morphometry of the corpus luteum of pregnant and non-pregnant zebu cows in the Colombian tropics

Directory of Open Access Journals (Sweden)

Marco González T

2017-07-01

Full Text Available The objective of the study was to determine the volume, weight, measures, ovarian location and shape of the corpus luteum of pregnant and non - pregnant cows from zebu cows of the Colombian tropics. 528 reproductive tracts were collected; 264 pregnant and 264 non-pregnant of cows benefited at the local slaughterhouse in Monteria, Córdoba, Colombia. The period of collection of the samples was extended for three months. After collection of each reproductive tract, the ovaries were separated, identified as right and left, weighed and measured. Then the drawing of the location of the corpus luteum was performed on the ovary according to the anatomical planes previously established in the corresponding form. Subsequently the corpus luteum was removed to perform their measurements, weighings and visualization of their shape. There were statistical differences between the locations of the corpus luteum in the ovary: Anterior pole, posterior pole, free edge, upper face and lower face (p≤0.05. The weight and volume of gestational corpus luteum was greater by 30 and 27.9% than the corpus luteum of non-pregnant cows. The predominant form por shape of the corpus luteum in both pregnant and non-pregnant cows was oval, then pyramidal and finally rounded. No gestation was observed contralateral to the location of the corpus luteum.
The importance of the corpus callosum in the diagnosis of multiple sclerosis

International Nuclear Information System (INIS)

Goossens-Merkt, H.; Mueller-Jensen, M.; Zanella, F.D.

1991-01-01

Besides MS there are a lot of diseases with lesions of the white matter, especially vascular diseases. In quest of a specific pattern in MRI of MS especially in early diagnosis the corpus callosum in patients with MS and in other diseases was analyzed. The progressive atrophy of the corpus callosum in the course of multiple sclerosis is well known. A good correlation between atrophy of the corpus callosum in T1 weighted MRI and the severity of organic mental disorder has been demonstrated. Since Atrophy however is an unspecific sign while demyelinating lesions are much more specific for MS, a brain region in which vascular lesions are rare but demyelinating lesions are more frequent, has been studied. (author). 10 refs.; 2 figs.; 1 tab
FREEDOM OF COMBINATION AND HETEROGENEITY: A CORPUS LINGUIST’S LOOK AT TWO SAUSSUREAN INSIGHTS

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2014-06-01

Full Text Available This article offers a reexamination of two of Saussure’s insights from the point of view of corpus linguistics—namely, freedom of combination and heterogeneity in language in use. Regarding the first insight, an analysis of word combinations in a corpus of newspaper texts written in Brazilian Portuguese was carried out to determine how many of these combinations were actual collocations—that is, were used frequently enough in a very large reference corpus (the Brazilian corpus to warrant statistical significance. The results suggested that most word combinations are not free; rather, they follow previously established preferences among speakers. Regarding the second notion, that of heterogeneity, the collocations in the newspaper texts were tracked as they were deployed one after the other along each text, and this flow was visually depicted. The inspection of the charts revealed unique patterns of the distribution of collocation, thereby suggesting that the evidence supports the view of heterogeneity. A cluster analysis was later conducted on the amount of collocations in each text, revealing three basic collocation bands onto which all the texts can be fitted. This was interpreted as suggesting that heterogeneity, despite being present and noticeable, is constrained rather than limitless. The article concludes that the methods and techniques afforded by present-day corpus linguistics can shed light onto Saussure’s many valuable insights. ------------------------------------------------------------------------------ LIBERDADE DE COMBINAÇÃO E HETEROGENEIDADE: UM OLHAR DA LINGUÍSTICA DE CORPUS EM DOIS INSIGHTS SAUSSUREANOS O artigo reexamina dois dos insights de Saussure a partir da perspectiva da linguística de corpus, a saber a liberdade de combinação e a heterogeneidade no uso da língua. Com relação ao primeiro, foi feita uma análise de combinações de palavras em corpus de textos de jornais para determinar quantas eram
Corpus web 2.0 : quelques enjeux méthodologiques et épistémologiques

Directory of Open Access Journals (Sweden)

Sabrina Bevilacqua

2016-12-01

Full Text Available Nombre de défis d’ordre méthodologique et épistémologique s’imposent aujourd’hui à la recherche scientifique orientée vers le travail sur des corpus numériques. Certes, chaque plateforme virtuelle présente une écologie spécifique (Paveau, 2013a, 2013b qui oriente un rapprochement différent tant de l’objet que du corpus. Ainsi, l’environnement Facebook (FBK, une surface essentiellement multiforme suggère un regard qui puisse saisir son hétérogénéité sémiotique et énonciative. Dans ce travail, nous visons, d’abord, une redéfinition de la notion de corpus comme « matrice du sens » (Mayaffre, 2011 : 11 permettant de focaliser les enjeux scientifiques que la conception des corpus numériques issus du Web 2.0, notamment, de FBK, entraîne; ensuite, la description de certains concepts méthodologiques et épistémologiques fondamentaux — linéarité, technodiscours, sérialité, réticularité — aidant à l’élaboration ainsi qu’à la gestion de corpus FBK.
Lexicon and teaching: an analysis of the use of less frequent prepositions in a Spanish learners corpus

Directory of Open Access Journals (Sweden)

Jéssyca Camargo Cruz

2017-08-01

Full Text Available This article aims at presenting a quantitative research and the analysis of the use of prepositions that are less frequent (underused in a corpus of learners of Spanish as a foreign language. We have observed the use of contra, hacia, enfrente de, excepto and tras through Corpus Linguistics by contrasting this lexical set and a supplementary corpus, composed by normative and descriptive Spanish grammar and by an online reference corpus of Spanish (CREA. Therefore, we present analyses made on a corpus constituted by 276 writings (85.729 words, gathered from two groups of freshman Language/Letras students, from 2011 to 2013. The data were collected with the aid of the WordSmith Tools (version 6 software and its tools, WordList and Concord enabled us to extract the frequency list of the prepositions in the corpus of study, as well as to observe and analyse their respective uses based on the lines of concordance.
Using Google as a Super Corpus to Drive Written Language Learning: A Comparison with the British National Corpus

Science.gov (United States)

Sha, Guoquan

2010-01-01

Data-driven learning (DDL), or corpus-based language learning, involves the learner in an exploratory task to discover appropriate expressions or collocates regarding his writing. However, the problematic units of meaning in each learner's writing are so diverse that conventional corpora often prove futile. The search engine Google with the…
Human corpus luteum: presence of epidermal growth factor receptors and binding characteristics

International Nuclear Information System (INIS)

Ayyagari, R.R.; Khan-Dawood, F.S.

1987-01-01

Epidermal growth factor receptors are present in many reproductive tissues but have not been demonstrated in the human corpus luteum. To determine the presence of epidermal growth factor receptors and its binding characteristics, we carried out studies on the plasma cell membrane fraction of seven human corpora lutea (days 16 to 25) of the menstrual cycle. Specific epidermal growth factor receptors were present in human corpus luteum. Insulin, nerve growth factor, and human chorionic gonadotropin did not competitively displace epidermal growth factor binding. The optimal conditions for corpus luteum-epidermal growth factor receptor binding were found to be incubation for 2 hours at 4 degrees C with 500 micrograms plasma membrane protein and 140 femtomol 125 I-epidermal growth factor per incubate. The number (mean +/- SEM) of epidermal growth factor binding sites was 12.34 +/- 2.99 X 10(-19) mol/micrograms protein; the dissociation constant was 2.26 +/- 0.56 X 10(-9) mol/L; the association constant was 0.59 +/- 0.12 X 10(9) L/mol. In two regressing corpora lutea obtained on days 2 and 3 of the menstrual cycle, there was no detectable specific epidermal growth factor receptor binding activity. Similarly no epidermal growth factor receptor binding activity could be detected in ovarian stromal tissue. Our findings demonstrate that specific receptors for epidermal growth factor are present in the human corpus luteum. The physiologic significance of epidermal growth factor receptors in human corpus luteum is unknown, but epidermal growth factor may be involved in intragonadal regulation of luteal function
Applying Corpus-Based Findings to Form-Focused Instruction: The Case of Reported Speech

Science.gov (United States)

Barbieri, Federica; Eckhardt, Suzanne E. B.

2007-01-01

Arguing that the introduction of corpus linguistics in teaching materials and the language classroom should be informed by theories and principles of SLA, this paper presents a case study illustrating how corpus-based findings on reported speech can be integrated into a form-focused model of instruction. After overviewing previous work which…
[Case of suspected multiple sclerosis with transcallosal lesions involving the upper surface of the corpus callosum].

Science.gov (United States)

Shirafuji, Toshihiko; Oya, Yasushi; Nakamura, Harumasa; Ogata, Katsuhisa; Ogawa, Masafumi; Kawai, Mitsuru

2008-05-01

A 26-year-old woman noticed gradually progressive, right lower leg weakness over a 1.5-month period. Neurological examination revealed right hemiparesis with slightly increased deep tendon reflexes, Babinski's sign on the right side, loss of position sense in the right leg, and slight loss of superficial sensation in the right toes. MR FLAIR images showed a high intensity area measuring 5 x 2 x 3 cm in the left frontal lobe, extending to the outer surface of the body of the corpus callosum and the adjacent right cingulate gyrus. Gadolinium enhancement was seen along the cortex and the outer surface of the body of the corpus callosum. CSF findings showed no pleocytosis, a protein content of 32 mg/dl, a sugar level of 85 mg/dl, and an IgG index of 0.46. The biopsy specimen obtained from the superior frontal gyrus showed perivascular cuffing of T-lymphocytes and some B-lymphocytes, as well as multiple small foci of demyelination. Starting on the second day of admission, the patient was treated with methylprednisolone pulse therapy (1,000 mg/day for 3 days); she was then switched to oral prednisolone (20 mg/day). Thereafter, the patient had two clinical relapses: one was due to a lesion in the dorsal part of the medulla oblongata associated with a disturbance of deep sensation in both hands, and the other was due to a lesion involving the right internal capsule, the globus pallidus, and the caudate nucleus associated with left facial nerve palsy. Visual evoked potentials suggested a demyelinating lesion in the right optic nerve. We suspected a diagnosis of multiple sclerosis based on the presence of more than two clinical episodes of neurological deficits with identifiable lesions on MRI. Multiple sclerosis should be considered in the differential diagnosis of lesions located in the outer part of the corpus callosum and transcallosal bilateral hemispheres on MRI, even though inner callosal lesions are common in multiple sclerosis.

Cholinergic neurotransmission in human corpus cavernosum. II. Acetylcholine synthesis

International Nuclear Information System (INIS)

Blanco, R.; De Tejada, S.; Goldstein, I.; Krane, R.J.; Wotiz, H.H.; Cohen, R.A.

1988-01-01

Physiological and histochemical evidence indicates that cholinergic nerves may participate in mediating penile erection. Acetylcholine synthesis and release was studied in isolated human corporal tissue. Human corpus cavernosum incubated with [ 3 H]choline accumulated [ 3 H]choline and synthesized [ 3 H]acethylcholine in an concentration-dependent manner. [ 3 H]Acetylcholine accumulation by the tissue was inhibited by hemicholinium-3, a specific antagonist of the high-affinity choline transport in cholinergic nerves. Transmural electrical field stimulation caused release of [ 3 H]acetylcholine which was significantly diminished by inhibiting neurotransmission with calcium-free physiological salt solution or tetrodotoxin. These observations provide biochemical and physiological evidence for the existence of cholinergic innervation in human corpus cavernosum
A critical re-examination of sexual dimorphism in the corpus callosum microstructure

DEFF Research Database (Denmark)

Westerhausen, René; Kompus, Kristiina; Dramsdahl, Margaretha

2011-01-01

the diffusion parameters did not correlate with regional callosal size. The present results indicate a stronger inter-hemispheric connectivity between the frontal lobes in males than females, which might be related to sex differences in hemispheric asymmetry and brain size........ The objective of the present DTI study was to re-examine microstructural sex differences in the corpus callosum, while controlling for corpus callosum size differences between sexes. We compared 41 female and 34 male participants using regional tract-based spatial statistics (TBSS) analysis. Clusters...... of significantly higher fractional anisotropy (FA) and lower diffusion strength in males compared to females were detected in the genu and truncus of the corpus callosum. However, only the sex difference located in the anterior genu subregions could be unequivocally interpreted. This was the only cluster where...
Neural analysis of bovine ovaries ultrasound images in the identification process of the corpus luteum

Science.gov (United States)

Górna, K.; Jaśkowski, B. M.; Okoń, P.; Czechlowski, M.; Koszela, K.; Zaborowicz, M.; Idziaszek, P.

2017-07-01

The aim of the paper is to shown the neural image analysis as a method useful for identifying the development stage of the domestic bovine corpus luteum on digital USG (UltraSonoGraphy) images. Corpus luteum (CL) is a transient endocrine gland that develops after ovulation from the follicle secretory cells. The aim of CL is the production of progesterone, which regulates many reproductive functions. In the presented studies, identification of the corpus luteum was carried out on the basis of information contained in ultrasound digital images. Development stage of the corpus luteum was considered in two aspects: just before and middle of domination phase and luteolysis and degradation phase. Prior to the classification, the ultrasound images have been processed using a GLCM (Gray Level Co-occurence Matrix). To generate a classification model, a Neural Networks module implemented in the STATISTICA was used. Five representative parameters describing the ultrasound image were used as learner variables. On the output of the artificial neural network was generated information about the development stage of the corpus luteum. Results of this study indicate that neural image analysis combined with GLCM texture analysis may be a useful tool for identifying the bovine corpus luteum in the context of its development phase. Best-generated artificial neural network model was the structure of MLP (Multi Layer Perceptron) 5:5-17-1:1.
Tone realisation in a Yoruba speech recognition corpus

CSIR Research Space (South Africa)

Van Niekerk, D

2012-05-01

Full Text Available development. Extracted contours are processed and analysed statistically to describe acoustic properties in different tonal contexts. The authors demonstrate how features useful for tone recognition or synthesis can be successfully extracted from a corpus...
A unified approach for development of Urdu Corpus for OCR and demographic purpose

Science.gov (United States)

Choudhary, Prakash; Nain, Neeta; Ahmed, Mushtaq

2015-02-01

This paper presents a methodology for the development of an Urdu handwritten text image Corpus and application of Corpus linguistics in the field of OCR and information retrieval from handwritten document. Compared to other language scripts, Urdu script is little bit complicated for data entry. To enter a single character it requires a combination of multiple keys entry. Here, a mixed approach is proposed and demonstrated for building Urdu Corpus for OCR and Demographic data collection. Demographic part of database could be used to train a system to fetch the data automatically, which will be helpful to simplify existing manual data-processing task involved in the field of data collection such as input forms like Passport, Ration Card, Voting Card, AADHAR, Driving licence, Indian Railway Reservation, Census data etc. This would increase the participation of Urdu language community in understanding and taking benefit of the Government schemes. To make availability and applicability of database in a vast area of corpus linguistics, we propose a methodology for data collection, mark-up, digital transcription, and XML metadata information for benchmarking.
Angular analysis of corpus callosum in 18 patients with frontonasal dysplasia

Directory of Open Access Journals (Sweden)

Giffoni Silvyo David Araújo

2004-01-01

Full Text Available Considering the rarity of the frontonasal dysplasia (FD and the few reports about it in a large casuistry using magnetic resonance image (MRI, we describe the results of the angular analysis of the corpus callosum of 18 individuals with FD (7 male, 11 female, using an easily-reproductive method. Group I had 12 individuals with isolated form and Group II had 6 individuals with FD syndromic with unknown etiology. The results are presented in set. Comparing with the control group, patients with FD presented alpha angle increase and beta and gamma angles reduction (p<0.05. Alpha and gamma angles express the relationship between the anterior portion of corpus callosum and the floor of 4th ventricle. Considering the embryonary development, these findings would occur secondarily to failure during the development of nasal capsula. Thus, angular anomaly in corpus callosum would be a usual finding, and not fortuitous in patients with FD.
Publishing a Quality Context-aware Annotated Corpus and Lexicon for Harassment Research

OpenAIRE

Rezvan, Mohammadreza; Shekarpour, Saeedeh; Balasuriya, Lakshika; Thirunarayan, Krishnaprasad; Shalin, Valerie; Sheth, Amit

2018-01-01

Having a quality annotated corpus is essential especially for applied research. Despite the recent focus of Web science community on researching about cyberbullying, the community dose not still have standard benchmarks. In this paper, we publish first, a quality annotated corpus and second, an offensive words lexicon capturing different types type of harassment as (i) sexual harassment, (ii) racial harassment, (iii) appearance-related harassment, (iv) intellectual harassment, and (v) politic...
Morphometry of the corpus callosum in Chinese children: relationship with gender and academic performance

International Nuclear Information System (INIS)

Ng, Wing Hung Alex; Chan, Yu.Lung; Au, Kit Sum Agnes; Yeung, Ka Wai David; Kwan, Ting Fai; To, Cho Yee

2005-01-01

The corpus callosum has been widely studied, but no study has demonstrated whether its size and shape have any relationship with language and calculation performance. To examine the morphometry of the corpus callosum of normal Chinese children and its relationship with gender and academic performance. One hundred primary school children (63 boys, 37 girls; age 6.5-10 years) were randomly selected and the standardized academic performance for each was ascertained. On the mid-sagittal section of a brain MRI, the length, height and total area of the corpus callosum and its thickness at different sites were measured. These were correlated with sex and academic performance. Apart from the normal average dimension of the different parts of the corpus callosum, thickness at the body-splenium junction in the average-to-good performance group was significantly greater than the below-average performance group in Chinese language (P=0.005), English language (P=0.02) and mathematics (P=0.01). The remainder of the callosal thickness showed no significant relationship with academic performance. There was no significant sex difference in the thickness of any part of the corpus callosum. These findings raise the suggestion that language and mathematics proficiency may be related to the morphometry of the fibre connections in the posterior parietal lobes. (orig.)
Morphometry of the corpus callosum in Chinese children: relationship with gender and academic performance

Energy Technology Data Exchange (ETDEWEB)

Ng, Wing Hung Alex; Chan, Yu.Lung [Prince of Wales Hospital, Department of Diagnostic Radiology and Organ Imaging, Shatin, Hong Kong (Hong Kong); Au, Kit Sum Agnes [James Cook University, Department of Psychology, Townsville, Queensland (Australia); Yeung, Ka Wai David; Kwan, Ting Fai; To, Cho Yee

2005-06-01

The corpus callosum has been widely studied, but no study has demonstrated whether its size and shape have any relationship with language and calculation performance. To examine the morphometry of the corpus callosum of normal Chinese children and its relationship with gender and academic performance. One hundred primary school children (63 boys, 37 girls; age 6.5-10 years) were randomly selected and the standardized academic performance for each was ascertained. On the mid-sagittal section of a brain MRI, the length, height and total area of the corpus callosum and its thickness at different sites were measured. These were correlated with sex and academic performance. Apart from the normal average dimension of the different parts of the corpus callosum, thickness at the body-splenium junction in the average-to-good performance group was significantly greater than the below-average performance group in Chinese language (P=0.005), English language (P=0.02) and mathematics (P=0.01). The remainder of the callosal thickness showed no significant relationship with academic performance. There was no significant sex difference in the thickness of any part of the corpus callosum. These findings raise the suggestion that language and mathematics proficiency may be related to the morphometry of the fibre connections in the posterior parietal lobes. (orig.)
The WONP-NURT corpus as nuclear knowledge base for text mining in the INIS database

International Nuclear Information System (INIS)

Guerra Valdes, R.

2011-01-01

In the present work the WONP-NURT corpus is taken as knowledge base for text mining in the INIS database. Main components of the information processing system, as well as computational methods for content analysis of INIS database record files are described. Results of the content analysis of the WONP-NURT corpus are reported. Furthermore, results of two comparative text mining studies in the INIS database are also shown. The first one explores 10 research areas in the more familiar nearest range of WONP-NURT corpus, while the second one surveys 15 regions in the more exotic far range. The results provide new elements to asses the significance of the WONP-NURT corpus in the context of the current state of nuclear science and technology research areas. (Author)
EuroGOV: Engineering a Multilingual Web Corpus

NARCIS (Netherlands)

Sigurbjörnsson, B.; Kamps, J.; de Rijke, M.

2005-01-01

EuroGOV is a multilingual web corpus that was created to serve as the document collection for WebCLEF, the CLEF 2005 web retrieval task. EuroGOV is a collection of web pages crawled from the European Union portal, European Union member state governmental web sites, and Russian government web sites.
Architecture of the Corpus Spongiosum : An Anatomical Study

NARCIS (Netherlands)

Ottenhof, Sarah R; de Graaf, Petra; Soeterik, Timo F W; Neeter, Lidewij M F H; Zilverschoon, Marijn; Spinder, Matty; Bosch, J L H Ruud; Bleys, Ronald L A W; Heck-de Kort, Laetitia

PURPOSE: Urethral reconstruction is performed for urethral stricture or hypospadias correction. Research on urethral tissue engineering is increasing. Because the corpus spongiosum is important to support the urethra, urethral tissue engineering should ideally be combined with reconstruction of a
JaSlo: Integration of a Japanese-Slovene Bilingual Dictionary with a Corpus Search System

Directory of Open Access Journals (Sweden)

Kristina HMELJAK SANGAWA

2012-12-01

Full Text Available The paper presents a set of integrated on-line language resources targeted at Japanese language learners, primarily those whose mother tongue is Slovene. The resources consist of the on-line Japanese-Slovene learners’ dictionary jaSlo and two corpora, a 1 million word Japanese-Slovene parallel corpus and a 300 million word corpus of web pages, where each word and sentence is marked by its difficulty level; this corpus is furthermore available as a set of five distinct corpora, each one containing sentences of the particular level. The corpora are available for exploration through NoSketch Engine, the open source version of the commercial state-of-the-art corpus analysis software Sketch Engine. The dictionary is available for Web searching, and dictionary entries have direct links to examples from the corpora, thus offering a wider picture of a possible translations in concrete contextualised examples, and b monolingual Japanese usage examples of different difficulty levels to support language learning.
Inflation Metaphor in the TIME Magazine Corpus

Science.gov (United States)

Hu, Chunyu; Liu, Huijie

2016-01-01

A historical perspective on economy metaphor can shed new lights on economic thoughts. Based on the TIME Magazine Corpus (TMC), this paper investigates inflation metaphor over 83 years and compares findings against the economic data over the relatively corresponding period. The results show how inflation, an abstract concept and a normal economic…
Characterizing the Google Books Corpus: Strong Limits to Inferences of Socio-Cultural and Linguistic Evolution.

Science.gov (United States)

Pechenick, Eitan Adam; Danforth, Christopher M; Dodds, Peter Sheridan

2015-01-01

It is tempting to treat frequency trends from the Google Books data sets as indicators of the "true" popularity of various words and phrases. Doing so allows us to draw quantitatively strong conclusions about the evolution of cultural perception of a given topic, such as time or gender. However, the Google Books corpus suffers from a number of limitations which make it an obscure mask of cultural popularity. A primary issue is that the corpus is in effect a library, containing one of each book. A single, prolific author is thereby able to noticeably insert new phrases into the Google Books lexicon, whether the author is widely read or not. With this understood, the Google Books corpus remains an important data set to be considered more lexicon-like than text-like. Here, we show that a distinct problematic feature arises from the inclusion of scientific texts, which have become an increasingly substantive portion of the corpus throughout the 1900 s. The result is a surge of phrases typical to academic articles but less common in general, such as references to time in the form of citations. We use information theoretic methods to highlight these dynamics by examining and comparing major contributions via a divergence measure of English data sets between decades in the period 1800-2000. We find that only the English Fiction data set from the second version of the corpus is not heavily affected by professional texts. Overall, our findings call into question the vast majority of existing claims drawn from the Google Books corpus, and point to the need to fully characterize the dynamics of the corpus before using these data sets to draw broad conclusions about cultural and linguistic evolution.
Corpus-Based Rhythmic Pattern Analysis of Ragtime Syncopation

NARCIS (Netherlands)

Koops, Hendrik Vincent; Volk, A.; de Haas, W.B.

2015-01-01

This paper presents a corpus-based study on rhythmic patterns in the RAG-collection of approximately 11.000 symbolically encoded ragtime pieces. While characteristic musical features that define ragtime as a genre have been debated since its inception, musicologists argue that specific syncopation
I will proclaim myself what I am : corpus stylistics and the language of Shakespeare’s soliloquies

OpenAIRE

Murphy, Sean Edward

2015-01-01

This article reports on a corpus stylistic study of the language of soliloquies in Shakespeare’s plays. Literary corpus stylistics can use corpus linguistic methods to test claims made by literary critics and identify hitherto unnoticed features. Existing literary studies of soliloquies tend to define and classify them, to trace the history of the form or to offer literary appreciation; yet they pay surprisingly little attention to the language which characterises soliloquies. By creating a s...
Atrophy and magnetization transfer ratio of the corpus callosum in patients with Alzheimer's disease

International Nuclear Information System (INIS)

Imon, Yukari; Hanyu, Haruo; Iwamoto, Toshihiko; Takasaki, Masaru; Abe, Kimihiko

1998-01-01

We compared atrophy and magnetization transfer ratio (MTR) in the corpus callosum in patients with Alzheimer's disease and age-matched normal subjects. Fifteen patients with Alzheimer's disease and fourteen normal subjects received MRI. The corpus callosum was divided into three parts (anterior, middle, and posterior portions) on midsagittal slice, and their areas on T2-weighted reversed images and MTR on magnetization transfer contrast images in each portion were measured. The area and MTR decreased significantly in the posterior portion in patients with Alzheimer's disease. In the anterior portion, MTR decreased significantly, but although the area showed no significant change. In the middle portion, the area and MTR showed no significant change. MTR and the area was correlated in each portion in patients with Alzheimer's disease. The score of Hasegawa dementia scale-revised (HDS-R) and the area of the middle, posterior and total of corpus callosum were significantly related. The score of HDS-R and MTR in the anterior portion of corpus callosum were significantly related. The present study revealed decreases in MTR in the anterior portion of the corpus callosum of patients with Alzheimer's disease although the area showed no significant change, and this change suggests the increase in free water and/or the decrease in bound water in tissues, probably due to demyelination and axonal degeneration. (author)
Text collections for evaluation of Russian morphological taggers

Directory of Open Access Journals (Sweden)

Lyashevskaya Olga

2017-12-01

Full Text Available The paper describes the preparation and development of the text collections within the framework of MorphoRuEval-2017 shared task, an evaluation campaign designed to stimulate development of the automatic morphological processing technologies for Russian. The main challenge for the organizers was to standardize all available Russian corpora with the manually verified high-quality tagging to a single format (Universal Dependencies CONLL-U. The sources of the data were the disambiguated subcorpus of the Russian National Corpus, SynTagRus, OpenCorpora.org data and GICR corpus with the resolved homonymy, all exhibiting different tagsets, rules for lemmatization, pipeline architecture, technical solutions and error systematicity. The collections includes both normative texts (the news and modern literature and more informal discourse (social media and spoken data, the texts are available under CC BY-NC-SA 3.0 license.
An Analysis of Stative Verbs Used with the Progressive Aspect in Corpus-Informed Textbooks

Science.gov (United States)

Belli, Serap Atasever

2018-01-01

This study was designed to investigate whether contemporary corpus-informed grammar textbooks written for English language learners and teachers presented the progressive use of stative verbs and if yes, which stative verbs were presented to occur with the progressive aspect and for which functions they took this aspect. A corpus of six electronic…

US News Media Portrayal of Islam and Muslims: A Corpus-Assisted Critical Discourse Analysis

Science.gov (United States)

Samaie, Mahmoud; Malmir, Bahareh

2017-01-01

This article exploits the synergy of critical discourse studies and Corpus Linguistics to study the pervasive representation of Islam and Muslims in an approximate 670,000-word corpus of US news media stories published between 2001 and 2015. Following collocation and concordance analysis of the most frequent topics or categories which revolve…
Investigation of ground-water contamination at a drainage ditch, Installation Restoration Site 4, Naval Air Station Corpus Christi, Corpus Christi, Texas, 2005–06

Science.gov (United States)

Vroblesky, Don A.; Casey, Clifton C.

2007-01-01

The U.S. Geological Survey, in cooperation with the Naval Facilities Engineering Command Southeast, used newly developed sampling methods to investigate ground-water contamination by chlorobenzenes beneath a drainage ditch on the southwestern side of Installation Restoration Site 4, Naval Air Station Corpus Christi, Corpus Christi, Texas, during 2005-06. The drainage ditch, which is a potential receptor for ground-water contaminants from Installation Restoration Site 4, intermittently discharges water to Corpus Christi Bay. This report uses data from a new type of pore-water sampler developed for this investigation and other methods to examine the subsurface contamination beneath the drainage ditch. Analysis of ground water from the samplers indicated that chlorobenzenes (maximum detected concentration of 160 micrograms per liter) are present in the ground water beneath the ditch. The concentrations of dissolved oxygen in the samples (less than 0.05-0.4 milligram per liter) showed that the ground water beneath and near the ditch is anaerobic, indicating that substantial chlorobenzene biodegradation in the aquifer beneath the ditch is unlikely. Probable alternative mechanisms of chlorobenzene removal in the ground water beneath the drainage ditch include sorption onto the organic-rich sediment and contaminant depletion by cattails through uptake, sorption, and localized soil aeration.
Infarction of the entire corpus callosum as a complication in subarachnoid hemorrhage: A case report

Directory of Open Access Journals (Sweden)

Satoru Takahashi, M.D.

2017-03-01

Full Text Available The corpus callosum is the major commissural pathway connecting the cerebral hemispheres. This pathway receives its blood supply from anterior communicating artery, pericallosal artery, and posterior pericallosal artery. However, in some cases, the entire corpus callosum is supplied by median callosal artery; thus, occlusion of this artery can lead to infarction of the entire corpus callosum. Few reports have described this type of infarction, and no reports after subarachnoid hemorrhage (SAH exist. Here, we report on a 42-year-old female who was diagnosed with SAH after two aneurysms were discovered in bifurcation of left anterior cerebral artery (A1-A2. After successful clipping was performed, the patient was alert and had no neurological deficits; moreover, the computed tomography images that were acquired after the operation showed no evidence of infarction. Nine days after admittance to the hospital, drowsiness and weakness of the left limbs with brain swelling appeared and decompressive hemi-craniectomy was performed. Diagnostic cerebral angiography revealed vasospasms in both anterior and middle cerebral arteries, thus fasudil hydrochloride was administered intra-arterially. While blood flow in all arteries improved, diffusion-weighted magnetic resonance imaging detected infarction along the entire length of the corpus callosum and in the medial region of the right frontal lobe. We believe this infarction was due to secondary ischemia of median callosal artery. This case reminded us of the anatomical variation wherein median callosal artery is the sole blood supply line for the corpus callosum and demonstrated that infarction of the entire corpus callosum is possible.
Using Edit Distance to Analyse Errors in a Natural Language to Logic Translation Corpus

Science.gov (United States)

Barker-Plummer, Dave; Dale, Robert; Cox, Richard; Romanczuk, Alex

2012-01-01

We have assembled a large corpus of student submissions to an automatic grading system, where the subject matter involves the translation of natural language sentences into propositional logic. Of the 2.3 million translation instances in the corpus, 286,000 (approximately 12%) are categorized as being in error. We want to understand the nature of…
Data for lexicography The central role of the corpus

Directory of Open Access Journals (Sweden)

Allan F. Lauder

2010-10-01

Full Text Available This paper looks at the nature of data for lexicography and in particular on the central role that electronic corpora can play in providing it. Data has traditionally come from existing dictionaries, citations, and from the lexicographer’s own knowledge of words, through introspection. Each of these is examined and evaluated. Then the electronic corpus is considered. Different kinds of corpora are described and key design criteria are explained, in particular the size of corpus needed for lexicography as well as the issue of representativeness and sampling. The advantages and disadvantages of corpora are weighed and compared against the other types of data. While each of these has benefits, it is argued that corpora are a requirement, not an option, as data for dictionary making.
Interhemispheric functional disconnection because of abnormal corpus callosum integrity in bipolar disorder type II.

Science.gov (United States)

Yasuno, Fumihiko; Kudo, Takashi; Matsuoka, Kiwamu; Yamamoto, Akihide; Takahashi, Masato; Nakagawara, Jyoji; Nagatsuka, Kazuyuki; Iida, Hidehiro; Kishimoto, Toshifumi

2016-11-01

A significantly lower fractional anisotropy (FA) value has been shown in anterior parts of the corpus callosum in patients with bipolar disorder. We investigated the association between abnormal corpus callosum integrity and interhemispheric functional connectivity (IFC) in patients with bipolar disorder. We examined the association between FA values in the corpus callosum (CC-FA) and the IFC between homotopic regions in the anterior cortical structures of bipolar disorder ( n =16) and major depressive disorder ( n =22) patients with depressed or euthymic states. We found a positive correlation between the CC-FA and IFC values between homotopic regions of the ventral prefrontal cortex and insula cortex, and significantly lower IFC between these regions in bipolar disorder patients. The abnormal corpus callosum integrity in bipolar disorder patients is relevant to the IFC between homotopic regions, possibly disturbing the exchange of emotional information between the cerebral hemispheres resulting in emotional dysregulation. None. © The Royal College of Psychiatrists 2016. This is an open access article distributed under the terms of the Creative Commons Non-Commercial, No Derivatives (CC BY-NC-ND) license.
Le corpus lexicographique dans les langues à tradition orale: le cas du dialecte fang-mekè*

Directory of Open Access Journals (Sweden)

Nzang-Bié Yolande

2011-10-01

Full Text Available
Résumé: Les corpus sont à la base de la plupart des recherches en linguistique et particulièrementlexicographique. La compilation d'un corpus est une activité spécialisée dont dépend lerésultat de la recherche en question. Le sujet de cet article est la compilation du corpus lexicographiquedans les langues à tradition orale, et exige une démarche différente de celle ayant unelongue tradition écrite. De ce fait, ces dernières disposent d'une importante documentationpouvant servir comme base pour de nombreux sujets de recherche. L'auteur propose commeapproche une analyse qui permettrait de mieux rendre compte des spécificités lexicales etsémantiques des langues à tradition orale.Par le truchement de la production orale libre, l'auteur base ses hypothèses de recherche surune expérience en dialecte fang-mekè, une variante linguistique localisée au Gabon. Les résultatspermettent de mettre l'accent sur deux données essentielles du processus de compilation dans leslangues à tradition orale: les informateurs et la représentativité du corpus. Cette dernière, qui doits'exprimer à travers des champs lexicaux diversifiés mais également équilibrés, permettrait d'élaborerdes dictionnaires dans lesquels les locuteurs, qui en sont les premiers utilisateurs, doivent sereconnaître.
Mots-clés: CORPUS, LEXICOGRAPHIE, LANGUES À TRADITION ORALE, LANGUES ÀTRADITION ÉCRITE, INFORMATEURS, EXHAUSTIVITÉ, REPRÉSENTATIVITÉ, CHAMPSLEXICAUX, ORALITÉ, ÉCRITURE, MÉTHODE, DIALECTE FANG-MEKÈ, CORPUS ÉQUILIBRÉ.
Abstract: The Lexicographic Corpus in Languages with an Oral Tradition: The Case of the Dialect Fang-Mek?. Corpora form the basis of most linguistic and especially lexicographic research. The compilation of a corpus is a specialised activity on which depends the result of the research to be undertaken. The subject of this article is the compilation of a lexicographic corpus in languages with an oral tradition
CUILESS2016: a clinical corpus applying compositional normalization of text mentions.

Science.gov (United States)

Osborne, John D; Neu, Matthew B; Danila, Maria I; Solorio, Thamar; Bethard, Steven J

2018-01-10

Traditionally text mention normalization corpora have normalized concepts to single ontology identifiers ("pre-coordinated concepts"). Less frequently, normalization corpora have used concepts with multiple identifiers ("post-coordinated concepts") but the additional identifiers have been restricted to a defined set of relationships to the core concept. This approach limits the ability of the normalization process to express semantic meaning. We generated a freely available corpus using post-coordinated concepts without a defined set of relationships that we term "compositional concepts" to evaluate their use in clinical text. We annotated 5397 disorder mentions from the ShARe corpus to SNOMED CT that were previously normalized as "CUI-less" in the "SemEval-2015 Task 14" shared task because they lacked a pre-coordinated mapping. Unlike the previous normalization method, we do not restrict concept mappings to a particular set of the Unified Medical Language System (UMLS) semantic types and allow normalization to occur to multiple UMLS Concept Unique Identifiers (CUIs). We computed annotator agreement and assessed semantic coverage with this method. We generated the largest clinical text normalization corpus to date with mappings to multiple identifiers and made it freely available. All but 8 of the 5397 disorder mentions were normalized using this methodology. Annotator agreement ranged from 52.4% using the strictest metric (exact matching) to 78.2% using a hierarchical agreement that measures the overlap of shared ancestral nodes. Our results provide evidence that compositional concepts can increase semantic coverage in clinical text. To our knowledge we provide the first freely available corpus of compositional concept annotation in clinical text.
Open Corpus Adaptation++ in GALE : friend or foe?

NARCIS (Netherlands)

De Bra, P.M.E.; Smits, D.; Pechenizkiy, M.; Knutov, E.; Yudelson, M.; Abel, F.; Houben, G.J.P.M.; Herder, E.

2012-01-01

"Open" has quickly become the hottest topic in any field related to information, including open government data, open learning resources, open user models, … Open Corpus Adaptation has been defined as the ability to perform adaptation to resources located anywhere on the Web. This leaves the
The English Definite Article: What ESL/EFL Grammars Say and What Corpus Findings Show

Science.gov (United States)

WonHo Yoo, Isaiah

2009-01-01

To ascertain whether what ESL/EFL grammars say is informed by what scholars discuss in the literature and supported by what corpus findings actually show, this paper first presents a brief overview of the literature on the English definite article and then compares popular ESL/EFL grammars' coverage of "the" and corpus findings on definite article…
Emotional Intelligence in Agenesis of the Corpus Callosum.

Science.gov (United States)

Anderson, Luke B; Paul, Lynn K; Brown, Warren S

2017-05-01

People with agenesis of the corpus callosum (AgCC) with normal general intelligence have deficits in complex cognitive processing, as well as in social cognition. It is uncertain the extent to which impoverished processing of emotions may contribute to social processing deficiencies. We used the Mayer-Salovey-Caruso Emotional Intelligence Test to clarify the nature of emotional intelligence in 16 adults with AgCC. As hypothesized, persons with AgCC exhibited greater disparities from norms on tests involving more socially complex aspects of emotions. The AgCC group did not differ from norms on the Experiential subscale, but they were significantly below norms on the Strategic subscale. These findings suggest that the corpus callosum is not essential for experiencing and thinking about basic emotions in a "normal" way, but is necessary for more complex processes involving emotions in the context of social interactions. © The Author 2017. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
ORIGINAL ARTICLE Towards the Sense Disambiguation of Afan ...

African Journals Online (AJOL)

Based on this, the integration of deep linguistic knowledge with machine learning ... 1Jimma University, College of Natural Sciences, Department of Information Science, .... thesauri and ontology. .... For each context extracted, vector space.
Corpus linguistics and statistics with R introduction to quantitative methods in linguistics

CERN Document Server

Desagulier, Guillaume

2017-01-01

This textbook examines empirical linguistics from a theoretical linguist’s perspective. It provides both a theoretical discussion of what quantitative corpus linguistics entails and detailed, hands-on, step-by-step instructions to implement the techniques in the field. The statistical methodology and R-based coding from this book teach readers the basic and then more advanced skills to work with large data sets in their linguistics research and studies. Massive data sets are now more than ever the basis for work that ranges from usage-based linguistics to the far reaches of applied linguistics. This book presents much of the methodology in a corpus-based approach. However, the corpus-based methods in this book are also essential components of recent developments in sociolinguistics, historical linguistics, computational linguistics, and psycholinguistics. Material from the book will also be appealing to researchers in digital humanities and the many non-linguistic fields that use textual data analysis and t...
LEARNING SEMANTICS-ENHANCED LANGUAGE MODELS APPLIED TO UNSUEPRVISED WSD

Energy Technology Data Exchange (ETDEWEB)

VERSPOOR, KARIN [Los Alamos National Laboratory; LIN, SHOU-DE [Los Alamos National Laboratory

2007-01-29

An N-gram language model aims at capturing statistical syntactic word order information from corpora. Although the concept of language models has been applied extensively to handle a variety of NLP problems with reasonable success, the standard model does not incorporate semantic information, and consequently limits its applicability to semantic problems such as word sense disambiguation. We propose a framework that integrates semantic information into the language model schema, allowing a system to exploit both syntactic and semantic information to address NLP problems. Furthermore, acknowledging the limited availability of semantically annotated data, we discuss how the proposed model can be learned without annotated training examples. Finally, we report on a case study showing how the semantics-enhanced language model can be applied to unsupervised word sense disambiguation with promising results.
BrAgriNews: Um Corpus Temporal-Causal (Português-Brasileiro para a Agricultura

Directory of Open Access Journals (Sweden)

Brett Drury

2017-07-01

Full Text Available Recentemente tem havido um aumento no interesse, tanto no meio acadêmico quanto na indústria, em aplicações de aprendizagem de máquina e técnicas de inteligência artificial relacionadas com problemas agrícolas. Mineração de texto e técnicas relacionadas com o processamento da língua natural, raramente foram usadas para resolver problemas agrícolas, e muito menos para a língua portuguesa. É possível que um dos fatores que influenciam a escassez no uso técnicas de mineração de texto, para analisar textos em português e resolver problemas agrícolas, pode ser devido à falta de um corpus anotado livremente disponível. Para colmatar a falta de um corpus agrícola em língua portuguesa, estamos liberando um recurso em português-brasileiro voltado para agricultura, descrito neste artigo. O corpus abrange um período parcialmente contínuo de tempo entre 1996 e 2016, consistindo de notícias em português-brasileiro que foram anotadas com o seguinte tipo de informação: causal, sentimento, entidades nomeadas que incluem expressões temporais. O corpus tem recursos adicionais como: treebank, listas de termos frequentes (sem stop-words: unigramas, bigramas e trigramas, bem como palavras ou frases que foram identificados por jornalistas como de domínio específico. Espera-se que a liberação do corpus estimule a adoção da mineração de texto na agricultura na comunidade de pesquisa lusófona.
Agenesis of the corpus callosum and autism: a comprehensive comparison.

Science.gov (United States)

Paul, Lynn K; Corsello, Christina; Kennedy, Daniel P; Adolphs, Ralph

2014-06-01

The corpus callosum, with its ∼200 million axons, remains enigmatic in its contribution to cognition and behaviour. Agenesis of the corpus callosum is a congenital condition in which the corpus callosum fails to develop; such individuals exhibit localized deficits in non-literal language comprehension, humour, theory of mind and social reasoning. These findings together with parent reports suggest that behavioural and cognitive impairments in subjects with callosal agenesis may overlap with the profile of autism spectrum disorders, particularly with respect to impairments in social interaction and communication. To provide a comprehensive test of this hypothesis, we directly compared a group of 26 adults with callosal agenesis to a group of 28 adults with a diagnosis of autism spectrum disorder but no neurological abnormality. All participants had full-scale intelligence quotient scores >78 and groups were matched on age, handedness, and gender ratio. Using the Autism Diagnostic Observation Schedule together with current clinical presentation to assess autistic symptomatology, we found that 8/26 (about a third) of agenesis subjects presented with autism. However, more formal diagnosis additionally involving recollective parent-report measures regarding childhood behaviour showed that only 3/22 met complete formal criteria for an autism spectrum disorder (parent reports were unavailable for four subjects). We found no relationship between intelligence quotient and autism symptomatology in callosal agenesis, nor evidence that the presence of any residual corpus callosum differentiated those who exhibited current autism spectrum symptoms from those who did not. Relative to the autism spectrum comparison group, parent ratings of childhood behaviour indicated children with agenesis were less likely to meet diagnostic criteria for autism, even for those who met autism spectrum criteria as adults, and even though there was no group difference in parent report of current
Corpus callosum agenesis: Role of fetal magnetic resonance imaging

Directory of Open Access Journals (Sweden)

Achour Radhouane

2016-05-01

Full Text Available Corpus callosum agenesis (CCA was evaluated by ultrasound examination and magnetic resonance imaging (MRI with many studies. Ultrasonography was able to suspect CCA by indirect signs but a definitive diagnosis of CCA was achieved in rare cases. MRI was able to diagnose complete CCA in majority of cases. Additional neurological abnormalities including heterotopia, gyration anomaly, asymmetry of the cerebral hemispheres, and Dandy-Walker variant were documented, as well as an ocular anomaly which was described, by MRI examination. Prenatal counseling for fetal agenesis of the corpus callosum is difficult as the prognosis is uncertain. The association with other cerebral abnormalities increases the likelihood of a poor outcome and ultrasonographic assessment of the fetal brain is limited. We found MRI to be a safe and useful additional procedure to complement ultrasonographic diagnosis or suspicion of CCA.
Effect of an Ethanol Extract of Scutellaria baicalensis on Relaxation in Corpus Cavernosum Smooth Muscle

Directory of Open Access Journals (Sweden)

Xiang Li

2012-01-01

Full Text Available Aims of study. The aim of the present study was to investigate whether an ethanol extract of Scutellaria baicalensis (ESB relaxes penile corpus cavernosum muscle in organ bath experiments. Materials and methods. Changes in tension of cavernous smooth muscle strips were determined by penile strip chamber model and in penile perfusion model. Isolated endothelium-intact rabbit corpus cavernosum was precontracted with phenylephrine (PE and then treated with ESB. Results. ESB relaxed penile smooth muscle in a dose-dependent manner, and this was inhibited by pre-treatment with NG-nitro-l-arginine methyl ester (l-NAME, a nitric oxide (NO synthase inhibitor, and 1H-[1, 2, 4]-oxadiazolo-[4,3-α]-quinoxalin-1-one (ODQ, a soluble guanylyl cyclase (sGC inhibitor. ESB-induced relaxation was significantly attenuated by pretreatment with tetraethylammonium (TEA, a nonselective K+ channel blocker, and charybdotoxin, a selective Ca2+-dependent K+ channel inhibitor. ESB increased the cGMP levels of rabbit corpus cavernosum in a concentration-dependent manner without changes in cAMP levels. In a perfusion model of penile tissue, ESB also relaxed penile corpus cavernosum smooth muscle in a dose-dependent manner. Conclusion. Taken together, these results suggest that ESB relaxed rabbit cavernous smooth muscle via the NO/cGMP system and Ca2+-sensitive K+ channels in the corpus cavernosum.
Effect of hypothyroidism on the purinergic responses of corpus cavernosal smooth muscle in rabbits.

Science.gov (United States)

Yildirim, M K; Bagcivan, I; Sarac, B; Kilicarslan, H; Yildirim, S; Kaya, T

2008-01-01

Several studies have reported evidence of hormonal abnormalities in 25-35% of impotent men. Hypothyroidism has been reported to occur in 6% of impotent men. In the present study, we examined purinergic relaxation responses in hypothyroidism in an experimental rabbit model and compared them with controls to evaluate the possible involvement of the purinergic pathway. The study comprised 20 male New Zealand white rabbits. The rabbits were divided into two equal groups. We tested the effects of ATP, alpha beta ATP, and adenosine precontracted with phenylephrine on the isolated corpus cavernosum preparations from control and hypothyroid rabbits. We also evaluated the effects of ATP, alpha beta ATP, and adenosine on the cGMP levels in the isolated corpus cavernosum preparations from control and hypothyroid rabbits. T3, T4, and testosterone levels were significantly lower in hypothyroid rabbits. ATP, alpha beta ATP, carbachol, and electrical field stimulation (EFS)-induced frequency-dependent relaxation responses in the isolated rabbit corpus cavernosum strips precontracted with phenylephrine reduced significantly (Phypothyroid rabbits. Reduction of relaxation response in hypothyroid rabbits corpus cavernosum can depend on a decreased release of nitric oxide (NO) from nitrergic nerves and endothelium.
Reversible Restricted Diffusion in the Corpus Callosum in Various Pediatric Diseases

Energy Technology Data Exchange (ETDEWEB)

Kim, Won Kyung; Hong, Hyun Sook; Lee, A Leum; Cha, Jang Gyu; Lee, Hae Kyung [Dept. of Radiology, Soonchunhyang University Bucheon Hospital, Soonchunhyang University College of Medicine, Bucheon (Korea, Republic of); Bae, Won Kyung [Dept. of Radiology, Soonchunhyang University Cheonan Hospital, Soonchunhyang University College of Medicine, Cheonan (Korea, Republic of)

2012-04-15

To evaluate the reversible restricted diffusion in the corpus callosum in pediatric patients with clinical findings, and to discuss the possible pathogenesis of these lesions. Between 2007 and 2011, seven children with reversible signal abnormalities in the corpus callosum were identified and retrospectively reviewed. Diseases and conditions associated with lesions included: trauma (n = 3), neonatal seizure (n = 1), clinically suspected mild encephalopathy (n = 1), multiple sclerosis (n = 1), and seizure with subdural hygroma (n = 1). The callosal lesions were located in the splenium and the genu (n = 2), the splenium and the body (n = 1), and the splenium only (n 4). The shape of the lesions was round-to-ovoid (n = 4) or linear (n = 3). Follow-up MRI scans showed completely resolved (n = 6) or persistent (n = 1) signal abnormalities on diffusion-weighted imaging as well as apparent diffusion coefficient mapping. Clinical outcomes were good in six of the patents but poor in the seventh. Reversible restricted diffusion in the corpus callosum can develop in various diseases. Knowledge of the MRI findings and associated diseases might be helpful in predicting patients' conditions and clinical outcomes.

The Wildcat Corpus of Native- and Foreign-Accented English: Communicative Efficiency across Conversational Dyads with Varying Language Alignment Profiles

Science.gov (United States)

Van Engen, Kristin J.; Baese-Berk, Melissa; Baker, Rachel E.; Choi, Arim; Kim, Midam; Bradlow, Ann R.

2010-01-01

This paper describes the development of the Wildcat Corpus of native- and foreign-accented English, a corpus containing scripted and spontaneous speech recordings from 24 native speakers of American English and 52 non-native speakers of English. The core element of this corpus is a set of spontaneous speech recordings, for which a new method of…
Corpus Planning for the Southern Peruvian Quechua Language.

Science.gov (United States)

Coronel-Molina, Serafin M.

1997-01-01

The discussion of corpus planning for the Southern Quechua language variety of Peru examines issues of graphization, standardization, modernization, and renovation of Quechua in the face of increasing domination by the Spanish language. The efforts of three major groups of linguists and other scholars working on language planning in Peru, and the…
Corpus applications for the African languages, with special ...

African Journals Online (AJOL)

In order to illustrate the feasibility of corpus applications for the African languages at present, the article first considers 'fundamental linguistic research' in the fields of phonetics and question particles. It is shown how that research was boosted as a result of the utilisation of corpora. In a second section 'language teaching ...
Considering bilingual dictionaries against a corpus. Do English ...

African Journals Online (AJOL)

This article investigates the extent to which four representatives of the latest generation of English-French / French-English dictionaries present "real English", i.e. actually used meanings of actually used English word patterns. The findings of a corpus study of the verb CONSIDER are confronted with the entries for this verb ...
The NCHLT speech corpus of the South African languages

CSIR Research Space (South Africa)

Barnard, E

2014-05-01

Full Text Available The NCHLT speech corpus contains wide-band speech from approximately 200 speakers per language, in each of the eleven of cial languages of South Africa. We describe the design and development processes that were undertaken in order to develop...
UNITS OF MEASUREMENT: ORAL TRADITION, TRANSLATION STUDIES AND CORPUS LINGUISTICS

Directory of Open Access Journals (Sweden)

John ZEMKE

2017-06-01

Full Text Available The study of the world’s verbal arts offers an opportunity to consider ways that computational analysis and modeling of narratives may lead to new understandings of how they are constructed, their dynamics and relationships. Similarly, as corpus linguistics operations must define metrics, it offers an occasion to review basic interpretive concepts such as “units of analysis, context, and genre." My essay begins with an admittedly cursory overview from a novice perspective of what capabilities corpus linguistics currently possesses for the analysis and modeling of narratives. Consideration is given to the epistemological issue in the social sciences with the positivistic prescription or empiricist description of units of analysis and the potential pitfalls or advantages corpus linguistics encounters in searching for adequate equivalent terms. This review leads naturally to reflection on the crucial determinative action of context on meaning and the extent to which current computational interfaces are able to account for and integrate into global analysis of linguistic and performance dimensions such as performer, intonation, gesture, diction, idioms and figurative language, setting, audience, time, and occasion. As a tentative conclusion from this review, it can be stated that artificial intelligence for modeling narratives or devising narrative algorithms must develop capacities to account for performance dimensions in order to fulfill their analytical potential.
Techniques and Rules of Ineffability in the Dionysian Corpus

Directory of Open Access Journals (Sweden)

Knepper Timothy D.

2014-06-01

Full Text Available Is the Dionysian God, or an experience of the Dionysian God, absolutely ineffable? Does the Dionysian corpus assert or perform such ineffability? This paper will argue that the answer to each of these questions is no. The Dionysian God is known hyper-nous as the hyper-ousia cause of all. And the Dionysian corpus unambiguously refers to, asserts of, and metaphorizes about this God just so. In arguing these points, this paper will call upon both the speech act theory of John Searle and the metaphor theory of George Lakoff and Mark Johnson. More particularly, it will look to Searle’s rules of reference and predication and conditions of illocutionary acts, as well as Lakoff and Johnson’s schematization of metaphor gestalt and entailment to show how Dionysian expressions of inexpressibility are rule-governed and the Dionysian God is thereby (relatively effable.
Incidence and lifetime risk of uterine corpus cancer in Taiwanese women from 1991 to 2010

Directory of Open Access Journals (Sweden)

Jerry Cheng-Yen Lai

2017-02-01

Conclusion: According to the observed changes in incidence rate, the burden of uterine corpus cancer in the general female population is expected to increase in the near future. From a public-health perspective, care providers should develop strategies for the prevention, early detection, and intervention to reduce the rapidly increasing incidence of uterine corpus cancer in Taiwan.
Corpus callosotomy in a patient with startle epilepsy.

Science.gov (United States)

Gómez, Nicolás Garófalo; Hamad, Ana Paula; Marinho, Murilo; Tavares, Igor M; Carrete, Henrique; Caboclo, Luís Otávio; Yacubian, Elza Márcia; Centeno, Ricardo

2013-03-01

Startle epilepsy is a syndrome of reflex epilepsy in which the seizures are precipitated by a sudden and surprising, usually auditory, stimulus. We describe herein a girl who had been suffering with startle-induced seizures since 2 years of age. She had focal, tonic and tonic-clonic seizures, refractory to antiepileptic treatment. Daily tonic seizures led to very frequent falls and morbidity. Neurologically, she had no deficit. Interictal EEG showed slow waves and epileptiform discharges in central and fronto-central regions. Video-polygraphic recordings of seizures, triggered by stimuli, showed generalised symmetric tonic posturing with ictal EEG, characterised by an abrupt and diffuse electrodecremental pattern of fast activity, followed by alpha-theta rhythm superimposed by epileptic discharges predominantly over the vertex and anterior regions. Magnetic resonance imaging showed no abnormalities. Corpus callosotomy was performed when the patient was 17. Since surgery, the patient (one year follow-up) has remained seizure-free. Corpus callosotomy may be considered in patients with startle epilepsy and tonic seizures, in the absence of focal lesions amenable to surgery. [Published with video sequences].
Learner features in a New Corpus-based Swahili dictionary ...

African Journals Online (AJOL)

As far as traditionally published Swahili language dictionaries are concerned, throughout the long history of Swahili lexicography, most new dictionaries were based on their predecessors. Thus far the only innovative traditionally printed corpus-based dictionary has been published by Finnish scholars (Abdulla et al. 2002).
Hispanismos y canarismos en un corpus de textos ingleses sobre Canarias

Directory of Open Access Journals (Sweden)

María-Isabel González-Cruz

2013-12-01

Full Text Available ResumenLas Islas Canarias (España siempre han mantenido un estrecho contactocon el mundo anglosajón, lo que ha generado importantes consecuencias económicas, así como también socioculturales, lingüísticas y literarias. Un análisis de la bibliografía inglesa sobre Canarias revela, entre otros aspectos, la tendencia al uso de hispanismos y canarismos. Este trabajo ofrece el registro de esas voces que aparecen en un corpus de catorce obras tomadas de la extensa bibliografía anglocanaria. Tras revisar brevemente la relevancia del hispanismo inglés, nuestra recopilación intenta resaltar la contribución del español de Canarias al enriquecimiento del vocabulario de la lengua inglesa, constatando cuáles de los hispanismos de nuestro corpus que son canarismos han pasado al registro lexicográfico realizado por elShorter Oxford English Dictionary on Historical Principles (2007.AbstractThe Canary Islands (Spain have always been in close contact with the Anglo-Saxon world, which has had important consequences for the economy but also at the socio-cultural, linguistic and literary levels. A review of the English bibliography on the Canaries reveals, among other aspects, a tendency in most authors to use hispanicisms and canarianisms in their texts. This article offers a record of those words which appear in a corpus of fourteen works taken from this extensive bibliography. Apart from providing an overview of the studies on hispanicisms in English, this paper’s main aim is to highlight the contribution of Canarian Spanish to the enrichment of the vocabulary of English by checking which of the hispanicisms in our corpus, which are actually canarianisms, have been included in the lexical repertoire of the Shorter Oxford English Dictionaryon Historical Principles (2007.
You Should Have the Body: Understanding Habeas Corpus

Science.gov (United States)

Landman, James

2008-01-01

English legal commentator William Blackstone described the writ of habeas corpus as a second Magna Carta, and Supreme Court Chief Justice John Marshall called it the "great writ." It has been part of the Anglo-American common law tradition since the Middle Ages. In the United States, it has been a source of tension between state and…
Afasia fluente. Materiales para su estudio.(Volumen 01 del Corpus PerLA)

OpenAIRE

Gallardo-Paúls, Beatriz; Sanmartín Sáez, Julia

2005-01-01

El corpus PerLA (“Percepción, Lenguaje y Afasia”), surge en el área de Lingüística General de la Universitat de València como respuesta a la necesidad de integrar el estudio de las patologías lingüísticas en las tendencias actuales de la pragmática y la lingüística de corpus. Para satisfacer las exigencias de estas disciplinas y disponer de datos con validez ecológica, se han realizado grabaciones a diferentes hablantes con afasia, en un contexto que intenta alejarse de las convenciones propi...
Introducing the Geneva Multimodal expression corpus for experimental research on emotion perception.

Science.gov (United States)

Bänziger, Tanja; Mortillaro, Marcello; Scherer, Klaus R

2012-10-01

Research on the perception of emotional expressions in faces and voices is exploding in psychology, the neurosciences, and affective computing. This article provides an overview of some of the major emotion expression (EE) corpora currently available for empirical research and introduces a new, dynamic, multimodal corpus of emotion expressions, the Geneva Multimodal Emotion Portrayals Core Set (GEMEP-CS). The design features of the corpus are outlined and justified, and detailed validation data for the core set selection are presented and discussed. Finally, an associated database with microcoded facial, vocal, and body action elements, as well as observer ratings, is introduced.
A corpus of images and text in online news

NARCIS (Netherlands)

L. Hollink (Laura); A. Bedjeti (Adriatik); M. van Harmelen; D. Elliott (Desmond)

2016-01-01

htmlabstractIn recent years, several datasets have been released that include images and text, giving impulse to new methods that combine natural language processing and computer vision. However, there is a need for datasets of images in their natural textual context. The ION corpus contains 300K
CORPUS CHRISTI E A FOME NO MUNDO

Directory of Open Access Journals (Sweden)

Carlos Alberto dos Santos Dutra

1995-01-01

Full Text Available A festa religiosa de Corpus Christi, data da instituição da Eucaristia, neste ano foi comemorada no dia 15 de junho. Neste sacramento, entende a Igreja católica, o próprio Cristo se comunica para nutrir e salvar o homem. Expressão e síntese do cristianismo, é a identificação do sacrifício de Cristo com o sacrifício do homem.
Big Data, Big Questions: A Closer Look at the Yale–Classical Archives Corpus (c. 2015

Directory of Open Access Journals (Sweden)

Trevor deClerq

2016-07-01

Full Text Available This paper responds to the article by Christopher White and Ian Quinn, in which these authors introduce the Yale-Classical Archives Corpus (YCAC. I begin by making some general observations about the corpus, especially with regard to ramifications of the keyboard-performance origins of many pieces in the original MIDI collection. I then assess the accuracy of the scale-degree and local-key fields in the database, which were generated by the Bellman-Budge key-finding algorithm. I point out that some of the inaccuracies from the key-finding algorithm's output may influence the results we obtain from statistical studies of this corpus. I also offer an alternative analysis to the authors' finding that the ratio of V7 to V chords increases over time in common-practice music. Specifically, I conjecture that this finding may be the result of (or related to increasing instrumental resources over time. I close with some recommendations for future versions of the corpus, such as enabling end users to help repair transcription errors as well as offer ground truths for harmonic analyses and key area information.
Corpus callosum thickness on mid-sagittal MRI as a marker of brain volume: a pilot study in children with HIV-related brain disease and controls

Energy Technology Data Exchange (ETDEWEB)

Andronikou, Savvas [University of the Witwatersrand, Department of Radiology, Faculty of Health Sciences, Cape Town (South Africa); Ackermann, Christelle [University of Stellenbosch, Department of Radiology, Stellenbosch (South Africa); Laughton, Barbara; Cotton, Mark [Stellenbosch University and Tygerberg Children' s Hospital, Children' s Infectious Diseases Research Unit, Stellenbosch (South Africa); Tomazos, Nicollette [University of Cape Town, Faculty of Commerce, Department of Management Studies, Cape Town (South Africa); Spottiswoode, Bruce [University of Cape Town, MRC/UCT Medical Imaging Research Unit, Department of Human Biology, Cape Town (South Africa); Mauff, Katya [University of Cape Town, Department of Statistical Sciences, Cape Town (South Africa); Pettifor, John M. [University of the Witwatersrand, MRC/Wits Developmental Pathways for Health Research Unit, Department of Paediatrics, Faculty of Health Sciences, Witwatersrand (South Africa)

2015-07-15

Corpus callosum thickness measurement on mid-sagittal MRI may be a surrogate marker of brain volume. This is important for evaluation of diseases causing brain volume gain or loss, such as HIV-related brain disease and HIV encephalopathy. To determine if thickness of the corpus callosum on mid-sagittal MRI is a surrogate marker of brain volume in children with HIV-related brain disease and in controls without HIV. A retrospective MRI analysis in children (<5 years old) with HIV-related brain disease and controls used a custom-developed semi-automated tool, which divided the midline corpus callosum and measured its thickness in multiple locations. Brain volume was determined using volumetric analysis. Overall corpus callosum thickness and thickness of segments of the corpus callosum were correlated with overall and segmented (grey and white matter) brain volume. Forty-four children (33 HIV-infected patients and 11 controls) were included. Significant correlations included overall corpus callosum (mean) and total brain volume (P = 0.05); prefrontal corpus callosum maximum with white matter volume (P = 0.02); premotor corpus callosum mean with total brain volume (P = 0.04) and white matter volume (P = 0.02), premotor corpus callosum maximum with white matter volume (P = 0.02) and sensory corpus callosum mean with total brain volume (P = 0.02). Corpus callosum thickness correlates with brain volume both in HIV-infected patients and controls. (orig.)
Corpus callosum thickness on mid-sagittal MRI as a marker of brain volume: a pilot study in children with HIV-related brain disease and controls

International Nuclear Information System (INIS)

Andronikou, Savvas; Ackermann, Christelle; Laughton, Barbara; Cotton, Mark; Tomazos, Nicollette; Spottiswoode, Bruce; Mauff, Katya; Pettifor, John M.

2015-01-01

Corpus callosum thickness measurement on mid-sagittal MRI may be a surrogate marker of brain volume. This is important for evaluation of diseases causing brain volume gain or loss, such as HIV-related brain disease and HIV encephalopathy. To determine if thickness of the corpus callosum on mid-sagittal MRI is a surrogate marker of brain volume in children with HIV-related brain disease and in controls without HIV. A retrospective MRI analysis in children (<5 years old) with HIV-related brain disease and controls used a custom-developed semi-automated tool, which divided the midline corpus callosum and measured its thickness in multiple locations. Brain volume was determined using volumetric analysis. Overall corpus callosum thickness and thickness of segments of the corpus callosum were correlated with overall and segmented (grey and white matter) brain volume. Forty-four children (33 HIV-infected patients and 11 controls) were included. Significant correlations included overall corpus callosum (mean) and total brain volume (P = 0.05); prefrontal corpus callosum maximum with white matter volume (P = 0.02); premotor corpus callosum mean with total brain volume (P = 0.04) and white matter volume (P = 0.02), premotor corpus callosum maximum with white matter volume (P = 0.02) and sensory corpus callosum mean with total brain volume (P = 0.02). Corpus callosum thickness correlates with brain volume both in HIV-infected patients and controls. (orig.)
Annotated chemical patent corpus: a gold standard for text mining.

Directory of Open Access Journals (Sweden)

Saber A Akhondi

Full Text Available Exploring the chemical and biological space covered by patent applications is crucial in early-stage medicinal chemistry activities. Patent analysis can provide understanding of compound prior art, novelty checking, validation of biological assays, and identification of new starting points for chemical exploration. Extracting chemical and biological entities from patents through manual extraction by expert curators can take substantial amount of time and resources. Text mining methods can help to ease this process. To validate the performance of such methods, a manually annotated patent corpus is essential. In this study we have produced a large gold standard chemical patent corpus. We developed annotation guidelines and selected 200 full patents from the World Intellectual Property Organization, United States Patent and Trademark Office, and European Patent Office. The patents were pre-annotated automatically and made available to four independent annotator groups each consisting of two to ten annotators. The annotators marked chemicals in different subclasses, diseases, targets, and modes of action. Spelling mistakes and spurious line break due to optical character recognition errors were also annotated. A subset of 47 patents was annotated by at least three annotator groups, from which harmonized annotations and inter-annotator agreement scores were derived. One group annotated the full set. The patent corpus includes 400,125 annotations for the full set and 36,537 annotations for the harmonized set. All patents and annotated entities are publicly available at www.biosemantics.org.

A new English–Arabic parallel text corpus for lexicographic ...

African Journals Online (AJOL)

The chosen source texts deal with a variety of topics such as the environment, globalization, psychology, history, politics, drama, etc. Their Arabic translations were taken from The World of Knowledge series published by the National Council for Culture, Arts and Letters (NCCAL) in Kuwait. Keywords: parallel corpus ...
Enemy Combatant Detainees: Habeas Corpus Challenges in Federal Court

Science.gov (United States)

2006-09-26

Separation of Powers Issues . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 28 Eliminating Federal Court Jurisdiction Where There Is No State Court Review . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 29 Conclusion . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 30 1 542 U.S. 466 (2004). Enemy Combatant Detainees: Habeas Corpus Challenges in Federal Court In Rasul v. Bush,1 a divided Supreme Court declared that “a state
Understanding Depressive Symptoms and Psychosocial Stressors on Twitter: A Corpus-Based Study.

Science.gov (United States)

Mowery, Danielle; Smith, Hilary; Cheney, Tyler; Stoddard, Greg; Coppersmith, Glen; Bryan, Craig; Conway, Mike

2017-02-28

With a lifetime prevalence of 16.2%, major depressive disorder is the fifth biggest contributor to the disease burden in the United States. The aim of this study, building on previous work qualitatively analyzing depression-related Twitter data, was to describe the development of a comprehensive annotation scheme (ie, coding scheme) for manually annotating Twitter data with Diagnostic and Statistical Manual of Mental Disorders, Edition 5 (DSM 5) major depressive symptoms (eg, depressed mood, weight change, psychomotor agitation, or retardation) and Diagnostic and Statistical Manual of Mental Disorders, Edition IV (DSM-IV) psychosocial stressors (eg, educational problems, problems with primary support group, housing problems). Using this annotation scheme, we developed an annotated corpus, Depressive Symptom and Psychosocial Stressors Acquired Depression, the SAD corpus, consisting of 9300 tweets randomly sampled from the Twitter application programming interface (API) using depression-related keywords (eg, depressed, gloomy, grief). An analysis of our annotated corpus yielded several key results. First, 72.09% (6829/9473) of tweets containing relevant keywords were nonindicative of depressive symptoms (eg, "we're in for a new economic depression"). Second, the most prevalent symptoms in our dataset were depressed mood and fatigue or loss of energy. Third, less than 2% of tweets contained more than one depression related category (eg, diminished ability to think or concentrate, depressed mood). Finally, we found very high positive correlations between some depression-related symptoms in our annotated dataset (eg, fatigue or loss of energy and educational problems; educational problems and diminished ability to think). We successfully developed an annotation scheme and an annotated corpus, the SAD corpus, consisting of 9300 tweets randomly-selected from the Twitter application programming interface using depression-related keywords. Our analyses suggest that keyword
Annotation of Regular Polysemy

DEFF Research Database (Denmark)

Martinez Alonso, Hector

Regular polysemy has received a lot of attention from the theory of lexical semantics and from computational linguistics. However, there is no consensus on how to represent the sense of underspecified examples at the token level, namely when annotating or disambiguating senses of metonymic words...... and metonymic. We have conducted an analysis in English, Danish and Spanish. Later on, we have tried to replicate the human judgments by means of unsupervised and semi-supervised sense prediction. The automatic sense-prediction systems have been unable to find empiric evidence for the underspecified sense, even...
Comorbidity is an independent prognostic factor in women with uterine corpus cancer

DEFF Research Database (Denmark)

Noer, Mette C; Sperling, Cecilie; Christensen, Ib J

2014-01-01

OBJECTIVE: To determine whether comorbidity independently affects overall survival in women with uterine corpus cancer. DESIGN: Cohort study. SETTING: Denmark. STUDY POPULATION: A total of 4244 patients registered in the Danish Gynecologic Cancer database with uterine corpus cancer from 1 January....... RESULTS: Univariate survival analysis showed a significant (p independent prognostic factor with hazard ratios...... ranging from 1.27 to 1.42 in mild, 1.69 to 1.74 in moderate, and 1.72 to 2.48 in severe comorbidity. Performance status was independently associated to overall survival and was found to slightly reduce the prognostic impact of comorbidity. CONCLUSION: Comorbidity is an independent prognostic factor...
Corpus linguistics, systemic functional grammar and literary meaning: a critical analysis of harry potter and the philosopher’s stone Corpus linguistics, systemic functional grammar and literary meaning: a critical analysis of harry potter and the philosopher’s stone

Directory of Open Access Journals (Sweden)

Andrew Goatly

2008-04-01

Full Text Available The research reported in this paper has two aims. First, to show how corpus linguistics, using word frequency and concordance data, which is then analysed according to transitivity systems of systemic functional grammar (SFG, can be useful to the enterprise of critical linguistics. Second, to investigate to what extent this critical corpus linguistics (CCL gives a valid representation of the meanings and ideologies of a literary text. The hypothesis tested is that semiotic models of communication, in this case of popular children’s literature, with their emphasis on the encoding and decoding of meanings, lend themselves to a corpus linguistics approach. But that, in fact, these mutually reinforcing approaches (SFG and CCL with their reliance on what is encoded as text cannot entirely succeed in accounting for how literature, in particular, is understood and interpreted, and how ideology works within it and behind it. For a richer critical discourse analysis we need a pragmatic account, for example an analysis of presupposition, inference and propositional attitude. The issues here will be discussed in the light of recent debate between Michael Stubbs and Henry Widdowson on the strengths and limitations of corpus linguistics in critical discourse analysis. The research reported in this paper has two aims. First, to show how corpus linguistics, using word frequency and concordance data, which is then analysed according to transitivity systems of systemic functional grammar (SFG, can be useful to the enterprise of critical linguistics. Second, to investigate to what extent this critical corpus linguistics (CCL gives a valid representation of the meanings and ideologies of a literary text. The hypothesis tested is that semiotic models of communication, in this case of popular children’s literature, with their emphasis on the encoding and decoding of meanings, lend themselves to a corpus linguistics approach. But that, in fact, these
Revelando sentidos na prática docente: a abordagem de corpus na análise do discurso Uncovering meanings in pedagogical practice: the corpus approach in discourse analysis

Directory of Open Access Journals (Sweden)

Vander Viana

2011-01-01

Full Text Available Este artigo discute a viabilidade da utilização de ferramentas da Linguística de Corpus na análise do discurso pedagógico. Para tanto, são apresentados dois estudos de caso. O primeiro focaliza o discurso de professores de língua inglesa de um renomado curso de idiomas do Rio de Janeiro acerca da implementação de recursos tecnológicos na sala de aula. O segundo estudo, por sua vez, busca perceber qual é o posicionamento de professores universitários de literaturas em língua inglesa sobre literatura e seu ensino. Os resultados apontam para a riqueza dos dados contextuais que podem ser depreendidos a partir de uma análise linguística de base empírica. Em última análise, o artigo revela a importância e a flexibilidade da abordagem de corpus na análise do discurso, que pode ser aplicada a inúmeros contextos.This paper discusses the feasibility of using Corpus Linguistics tools in the analysis of pedagogic discourse. For doing this, two case studies are presented. The first one focuses on the discourse of English language teachers of a well-known languages course in Rio de Janeiro about the implementation of technological resources in the classroom. The second study, in its turn, seeks to realize the position held by university professors of literatures in English language with regard to literature and its teaching. The results point out to the richness of contextual data which can be inferred from a linguistic analysis with an empirical basis. All in all, the paper uncovers the importance and flexibility of the corpus approach in discourse analysis, which may be applied to several contexts.
Euphemism vs explicitness: A corpus-based analysis of translated ...

African Journals Online (AJOL)

This article examines the governing initial norms, namely explicitness and euphemism in English source texts and Ndebele translations, focusing on how these norms influenced the strategies chosen by the Ndebele translators in the translation of taboo terms. In the article, a corpus-based approach is used to identify head ...
[Structural change of the corpus callosum fibers in toddlers with autism spectrum disorder: two-year follow-up].

Science.gov (United States)

Chang, C; Qiu, N N; Xiao, T; Xiao, X; Chu, K K; Li, Y; Wu, Q R; Fang, H; Ke, X Y

2017-12-02

Objective: To conduct a follow-up investigation of structural changes of the corpus callosum fibers of toddlers (2 to 5 years of age) with autism spectrum disorder(ASD) and to explore the associations with clinical symptoms. Method: In this prospective randomized controlled study, ASD children who were diagnosed in the Child Mental Health Research Center, Nanjing Brain Hospital Affiliated to Nanjing Medical University from May 2011 to November 2012 were included in the ASD group, and developmentally delayed children were included in the control group (DD group). Diffusion tensor imaging (DTI) data from the two groups were obtained at two age levels: 2-3 years of age, and 4-5 years of age. Region of interest analysis was applied to assess characteristic values of total area and sub-regions of corpus callosum: the fraction anisotropy (FA), the mean diffusivity (MD), the radial diffusivity (RD) and the axial diffusivity (AD). All children were assessed using the Autism Diagnostic Interview-Revised (ADI-R) and Autism Treatment Evaluation Checklist (ATEC). The characteristic values of total area and sub-regions of corpus callosum of ASD group at two age levels were analyzed by paired sample t test; the characteristic values of total area and sub-regions of corpus callosum of ASD group and DD group were analyzed by independent-sample t test; the correlations between FA values of the total area and sub-regions of corpus callosum and ADI-R or ATEC scores were analyzed by Pearson correlation analysis. Result: Forty cases meeting inclusion criteria were enrolled in ASD group, and 31 eligible cases were enrolled in the control group. Four children in the ASD group were lost to follow-up, and 5 children in the control group were lost to follow-up. Longitudinal comparison between the two age subgroups of ASD patients showed that the FA values of the total corpus callosum increased (0.499 55±0.027 59 vs . 0.505 83±0.086 64, t= 4.88, P 0.05 for all comparisons); as compared
A cascade of morphogenic signaling initiated by the meninges controls corpus callosum formation.

Science.gov (United States)

Choe, Youngshik; Siegenthaler, Julie A; Pleasure, Samuel J

2012-02-23

The corpus callosum is the most prominent commissural connection between the cortical hemispheres, and numerous neurodevelopmental disorders are associated with callosal agenesis. By using mice either with meningeal overgrowth or selective loss of meninges, we have identified a cascade of morphogenic signals initiated by the meninges that regulates corpus callosum development. The meninges produce BMP7, an inhibitor of callosal axon outgrowth. This activity is overcome by the induction of expression of Wnt3 by the callosal pathfinding neurons, which antagonize the inhibitory effects of BMP7. Wnt3 expression in the cingulate callosal pathfinding axons is developmentally regulated by another BMP family member, GDF5, which is produced by the adjacent Cajal-Retzius neurons and turns on before outgrowth of the callosal axons. The effects of GDF5 are in turn under the control of a soluble GDF5 inhibitor, Dan, made by the meninges. Thus, the meninges and medial neocortex use a cascade of signals to regulate corpus callosum development. Copyright © 2012 Elsevier Inc. All rights reserved.
Identifying biological concepts from a protein-related corpus with a probabilistic topic model

Directory of Open Access Journals (Sweden)

Lu Xinghua

2006-02-01

Full Text Available Abstract Background Biomedical literature, e.g., MEDLINE, contains a wealth of knowledge regarding functions of proteins. Major recurring biological concepts within such text corpora represent the domains of this body of knowledge. The goal of this research is to identify the major biological topics/concepts from a corpus of protein-related MEDLINE© titles and abstracts by applying a probabilistic topic model. Results The latent Dirichlet allocation (LDA model was applied to the corpus. Based on the Bayesian model selection, 300 major topics were extracted from the corpus. The majority of identified topics/concepts was found to be semantically coherent and most represented biological objects or concepts. The identified topics/concepts were further mapped to the controlled vocabulary of the Gene Ontology (GO terms based on mutual information. Conclusion The major and recurring biological concepts within a collection of MEDLINE documents can be extracted by the LDA model. The identified topics/concepts provide parsimonious and semantically-enriched representation of the texts in a semantic space with reduced dimensionality and can be used to index text.
Assessment of disease named entity recognition on a corpus of annotated sentences.

Science.gov (United States)

Jimeno, Antonio; Jimenez-Ruiz, Ernesto; Lee, Vivian; Gaudan, Sylvain; Berlanga, Rafael; Rebholz-Schuhmann, Dietrich

2008-04-11

In recent years, the recognition of semantic types from the biomedical scientific literature has been focused on named entities like protein and gene names (PGNs) and gene ontology terms (GO terms). Other semantic types like diseases have not received the same level of attention. Different solutions have been proposed to identify disease named entities in the scientific literature. While matching the terminology with language patterns suffers from low recall (e.g., Whatizit) other solutions make use of morpho-syntactic features to better cover the full scope of terminological variability (e.g., MetaMap). Currently, MetaMap that is provided from the National Library of Medicine (NLM) is the state of the art solution for the annotation of concepts from UMLS (Unified Medical Language System) in the literature. Nonetheless, its performance has not yet been assessed on an annotated corpus. In addition, little effort has been invested so far to generate an annotated dataset that links disease entities in text to disease entries in a database, thesaurus or ontology and that could serve as a gold standard to benchmark text mining solutions. As part of our research work, we have taken a corpus that has been delivered in the past for the identification of associations of genes to diseases based on the UMLS Metathesaurus and we have reprocessed and re-annotated the corpus. We have gathered annotations for disease entities from two curators, analyzed their disagreement (0.51 in the kappa-statistic) and composed a single annotated corpus for public use. Thereafter, three solutions for disease named entity recognition including MetaMap have been applied to the corpus to automatically annotate it with UMLS Metathesaurus concepts. The resulting annotations have been benchmarked to compare their performance. The annotated corpus is publicly available at ftp://ftp.ebi.ac.uk/pub/software/textmining/corpora/diseases and can serve as a benchmark to other systems. In addition, we found
Light-controlled relaxation of the rat penile corpus cavernosum using NOBL-1, a novel nitric oxide releaser

Directory of Open Access Journals (Sweden)

Yuji Hotta

2016-05-01

Full Text Available Purpose: To investigate whether relaxation of the rat penile corpus cavernosum could be controlled with NOBL-1, a novel, lightcontrollable nitric oxide (NO releaser. Materials and Methods: Fifteen-week-old male Wistar-ST rats were used. The penile corpus cavernosum was prepared and used in an isometric tension study. After noradrenaline (10−5 M achieved precontraction, the penile corpus cavernosum was irradiated by light (470–500 nm with and without NOBL-1 (10−6 M. In addition, we noted rats’ responses to light with vardenafil (10−6 M, a phosphodiesterase-5 (PDE-5 inhibitor. Next, responses to light in the presence of a guanylate cyclase inhibitor, ODQ (1H-[1,2,4] oxadiazolo[4,3-a]quinoxalin-1-one (10−5 M, were measured. All measurements were performed in pretreated L-NAME (10−4 M conditions to inhibit endogenous NO production. Results: Corpus cavernosal smooth muscle, precontracted with noradrenaline, was unchanged by light irradiation in the absence of NOBL-1. However, in the presence of NOBL-1, corpus cavernosal smooth muscle, precontracted with noradrenaline, relaxed in response to light irradiation. After blue light irradiation ceased, tension returned. In addition, the light response was obviously enhanced in the presence of a PDE-5 inhibitor. Conclusions: This study showed that rat corpus cavernosal smooth muscle relaxation can be light-controlled using NOBL-1, a novel, light sensitive NO releaser. Though further in vivo studies are needed to investigate possible usefulness, NOBL-1 may be prove to be a useful tool for erectile dysfunction therapy, specifically in the field of penile rehabilitation.
A Corpus-Based Comparative Study of "Learn" and "Acquire"

Science.gov (United States)

Yang, Bei

2016-01-01

As an important yet intricate linguistic feature in English language, synonymy poses a great challenge for second language learners. Using the 100 million-word British National Corpus (BNC) as data and the software Sketch Engine (SkE) as an analyzing tool, this article compares the usage of "learn" and "acquire" used in natural…
The Dependency Structure of Coordinate Phrases: A Corpus Approach

Science.gov (United States)

Temperley, David

2005-01-01

Hudson (1990) proposes that each conjunct in a coordinate phrase forms dependency relations with heads or dependents outside the coordinate phrase (the "multi-head" view). This proposal is tested through corpus analysis of Wall Street Journal text. For right-branching constituents (such as direct-object NPs), a short-long preference for conjunct…
Rab proteins in the brain and corpus allatum of Bombyx mori.

Science.gov (United States)

Uno, Tomohide; Furutani, Masayuki; Watanabe, Chihiro; Sakamoto, Katsuhiko; Uno, Yuichi; Kanamaru, Kengo; Yamagata, Hiroshi; Mizoguchi, Akira; Takeda, Makio

2016-07-01

In eukaryotic cells, Rab guanosine triphosphate-ases serve as key regulators of membrane-trafficking events, such as exocytosis and endocytosis. Rab3, Rab6, and Rab27 control the regulatory secretory pathway of neuropeptides and neurotransmitters. The cDNAs of Rab3, Rab6, and Rab27 from B. mori were inserted into a plasmid, transformed into Escherichia coli, and then subsequently purified. We then produced antibodies against Rab3, Rab6, and Rab27 of Bombyx mori in rabbits and rats for use in western immunoblotting and immunohistochemistry. Western immunoblotting of brain tissue revealed a single band at approximately 26 kDa. Immunohistochemistry results revealed that Rab3, Rab6, and Rab27 expression was restricted to neurons in the pars intercerebralis and dorsolateral protocerebrum of the brain. Rab3 and Rab6 co-localized with bombyxin, an insect neuropeptide. However, there was no Rab that co-localized with prothoracicotropic hormone. The corpus allatum secretes neuropeptides synthesized in the brain into the hemolymph. Results showed that Rab3 and Rab6 co-localized with bombyxin in the corpus allatum. These findings suggest that Rab3 and Rab6 are involved in neurosecretion in B. mori. This study is the first to report a possible relationship between Rab and neurosecretion in the insect corpus allatum.
From Business Corpus to Business Lexicon*

Directory of Open Access Journals (Sweden)

Li Lan

2011-10-01

Full Text Available
Abstract: Language corpora are now indispensable to dictionary compilation. They help broaden the role of the dictionary from standardizing the vocabulary to recording a language. The trilingual corpus generated by the Hong Kong Polytechnic University gives a record of business languages used in Hong Kong. It differs from other corpora in that (1 it includes English, Chinese and Japanese; (2 it shows local characteristics; and (3 it focuses on a specific area (financial services, including banking, accounting, auditing, insurance and investment. The paper discusses various issues of setting up a tricorpus, and how to make full use of the data to generate a trilingual lexicon.
Keywords: MULTILINGUAL, SPECIAL PURPOSE, CORPUS, LEXICON
Opsomming: Van sakekorpus tot sakeleksikon. Taalkorpora is tans onontbeerlik virdie samestelling van woordeboeke. Hulle help om die rol van die woordeboek uit te brei vanaf diestandaardisering van die woordeskat tot die optekening van ‘n taal. Die drietalige korpus wat deurdie Hongkongse Politegniese Universiteit ontwikkel is, verskaf ‘n opgawe van die saketale wat inHongkong gebruik word. Dit verskil van ander korpora deurdat (1 dit Engels, Chinees and Japaneesinsluit; (2 dit plaaslike eienskappe vertoon; en (3 dit op 'n spesifieke gebied (finansiële dienste,insluitende bankwese, rekeningkunde, ouditering, versekering en belegging fokus. Die artikelbespreek verskillende aspekte van die totstandbrenging van 'n drietalige korpus, en hoe om vollegebruik te maak van die data om 'n drietalige leksikon te genereer.
Sleutelwoorde: MEERTALIG, SPESIALE DOEL, KORPUS, LEKSIKON
Touching the Void - Introducing CoST: Corpus of Social Touch

NARCIS (Netherlands)

Jung, Merel M.; Poppe, Ronald; Poel, Mannes; Heylen, Dirk K. J.

2014-01-01

Touch behavior is of great importance during social interaction. To transfer the tactile modality from interpersonal interaction to other areas such as Human-Robot Interaction (HRI) and remote communication automatic recognition of social touch is necessary. This paper introduces CoST: Corpus of
The Danish NOMCO Corpus Multimodal Interaction in First Acquaintance Conversations

DEFF Research Database (Denmark)

Paggio, Patrizia; Navarretta, Costanza

2016-01-01

, specifically head movements, facial expressions, and body posture. The corpus has served as the empirical basis for a number of studies of communication phenomena related to turn management, feedback exchange, information packaging and the expression of emotional attitudes. We describe the annotation scheme...
Abnormal white matter integrity in the corpus callosum among smokers: tract-based spatial statistics.

Directory of Open Access Journals (Sweden)

Wakako Umene-Nakano

Full Text Available In the present study, we aimed to investigate the difference in white matter between smokers and nonsmokers. In addition, we examined relationships between white matter integrity and nicotine dependence parameters in smoking subjects. Nineteen male smokers were enrolled in this study. Eighteen age-matched non-smokers with no current or past psychiatric history were included as controls. Diffusion tensor imaging scans were performed, and the analysis was conducted using a tract-based special statistics approach. Compared with nonsmokers, smokers exhibited a significant decrease in fractional anisotropy (FA throughout the whole corpus callosum. There were no significant differences in radial diffusivity or axial diffusivity between the two groups. There was a significant negative correlation between FA in the whole corpus callosum and the amount of tobacco use (cigarettes/day; R = - 0.580, p = 0.023. These results suggest that the corpus callosum may be one of the key areas influenced by chronic smoking.

Sense Meets Nonsense

DEFF Research Database (Denmark)

Christiansen, Thomas Ulrich; Henrichsen, Peter Juel

2012-01-01

for investigating the relationship between early stages of the speech perceptual process and later stages. We present our considerations involved in preparing the experimental set-up, producing the anechoic recordings, compiling the data, and exploring the materials in linguistic research. We report on a small......In this paper, we present the newly established Danish speech corpus PiTu. The corpus consists of recordings of 28 native Danish talkers (14 female and 14 male) each reproducing (i) a series of nonsense syllables, and (ii) a set of authentic natural language sentences. The speech corpus is tailored...... pilot experiment demonstrating how PiTu and similar speech corpora can be used in studies of prosody as a function of semantic content. The experiment addresses the issue of whether the governing principles of Danish prosody assignment is mainly talker-specific or mainly content-typical (under...
AUTOMATIC RETRIEVAL AND THE FORMALIZATION OF MULTI WORDS EXPRESSIONS WITH F-WORDS IN THE CORPUS OF CONTEMPORARY AMERICAN ENGLISH

Directory of Open Access Journals (Sweden)

Prihantoro Prihantoro

2016-01-01

Full Text Available The research problems in this research are 1 how lexicogrammar takes role in determining polarity of F-Word1 and 2 how to formalize it for corpus processing. The data is obtained from the Contemporary American English Corpus (COCA. In this corpus, F-word is proven to be highest in frequency as compared to its distribution across corpora. Corpus methodology is applied by sending queries to retrieve F-Words to COCA interface. Tokens combination surrounding F-words resulted in the phrase and clause unit accompanying F-words, which are significant cues to determine F-word polarity. The polarity is later proven to be not necessarily negative. I also designed a computational resource to allow the retrieval of F-words offline so that users might apply it to any digital text collections.
The VGLC: The Video Game Level Corpus

OpenAIRE

Summerville, Adam James; Snodgrass, Sam; Mateas, Michael; Ontañón, Santiago

2016-01-01

Levels are a key component of many different video games, and a large body of work has been produced on how to procedurally generate game levels. Recently, Machine Learning techniques have been applied to video game level generation towards the purpose of automatically generating levels that have the properties of the training corpus. Towards that end we have made available a corpora of video game levels in an easy to parse format ideal for different machine learning and other game AI researc...
Defining Formats and Corpus- based Examples in the General ...

African Journals Online (AJOL)

rbr

Institute, University of Zimbabwe, Harare, Zimbabwe (langa@arts.uz.ac.zw). Abstract: In this article the writer ... sentative" in terms of size in order to be appropriately used as basis for such corpus-based diction- aries, the ISN editors .... (e) the format should suggest a preference rather than a restriction. For COBUILD, a good ...
Hereditary motor and sensory neuropathy with agenesis of the corpus callosum.

Science.gov (United States)

Dupré, Nicolas; Howard, Heidi C; Mathieu, Jean; Karpati, George; Vanasse, Michel; Bouchard, Jean-Pierre; Carpenter, Stirling; Rouleau, Guy A

2003-07-01

Hereditary motor and sensory neuropathy associated with agenesis of the corpus callosum (OMIM 218000) is an autosomal recessive disease of early onset characterized by a delay in developmental milestones, a severe sensory-motor polyneuropathy with areflexia, a variable degree of agenesis of the corpus callosum, amyotrophy, hypotonia, and cognitive impairment. Although this disorder has rarely been reported worldwide, it has a high prevalence in the Saguenay-Lac-St-Jean region of the province of Quebec (Canada) predominantly because of a founder effect. The gene defect responsible for this disorder recently has been identified, and it is a protein-truncating mutation in the SLC12A6 gene, which codes for a cotransporter protein known as KCC3. Herein, we provide the first extensive review of this disorder, covering epidemiological, clinical, and molecular genetic studies.
Transpositions Within User-Posted YouTube Lyric Videos: A Corpus Study

Directory of Open Access Journals (Sweden)

Joseph Plazak

2016-07-01

Full Text Available There are many practical reasons why experiences of a given musical work tend to be heard repeatedly at the same pitch transposition level, especially recordings of musical works. Yet here, a corpus study is presented that challenges this very basic assumption of music perception. In 2011, an initial corpus of 100 user-posted YouTube videos was collected in order to investigate the prevalence of transposition and tempo alterations within these videos. Results found 42% of these videos contained nominal changes of pitch (36% and/or tempo (22%. Using the same methodology, a follow-up study was performed in 2015 and found only that 24% of user-posted videos contained these same alterations. Implications for these observations are discussed in light of musical communication models, YouTubeology, and absolute pitch memory.
Using the Corpus of Spoken Afrikaans to generate an Afrikaans ...

African Journals Online (AJOL)

This paper presents two chatbot systems, ALICE and. Elizabeth, illustrating the dialogue knowledge representation and pattern matching techniques of each. We discuss the problems which arise when using the. Corpus of Spoken Afrikaans (Korpus Gesproke Afrikaans) to retrain the ALICE chatbot system with human ...
Then and now: A reconsideration of the first corpus of scientific English

Directory of Open Access Journals (Sweden)

John M. Swales

2004-10-01

Full Text Available The subtitle of Huddleston (1971 reads A syntactic study based on an analysis of scientific texts; this volume thus represents the first carefully designed and substantial corpus of scientific English. In this paper I re-examine a selection of his findings based on the science and engineering half of Hyland's corpus of 240 research articles. Features selected were variation in the passivization of individual transitive verbs, the paucity of instances of V + V-ing structures like "He continued working", and the meaning of the modal must in research prose. In all three cases, Huddleston's findings were largely confirmed in a database constructed about 35 years later, thus suggesting that English research writing in the sciences is, at least in grammatical terms, fundamentally stable. In the closing section, I contrast this linguistic stability with the rapid technological development of corpus linguistics. I instance a recent co-taught experimental course in which international senior doctoral students from the health and social sciences were able, with relatively little training and guidance, to construct paired corpora of their own research writings and of published articles from their own specialities and then conduct precisely the kinds of analysis that only a highly professional linguist could, with considerable more labour, conduct nearly forty years ago.
Corpus callosum atrophy is associated with mental slowing and executive deficits in subjects with age-related white matter hyperintensities: the LADIS Study

DEFF Research Database (Denmark)

Jokinen, Hanna; Ryberg, Charlotte; Kalska, Hely

2007-01-01

BACKGROUND: Previous research has indicated that corpus callosum atrophy is associated with global cognitive decline in neurodegenerative diseases, but few studies have investigated specific cognitive functions. OBJECTIVE: To investigate the role of regional corpus callosum atrophy in mental speed...... of the total corpus callosum area and its subregions with cognitive performance were analysed using multiple linear regression, controlling for volume of WMH and other confounding factors. RESULTS: Atrophy of the total corpus callosum area was associated with poor performance in tests assessing speed of mental...... processing--namely, trail making A and Stroop test parts I and II. Anterior, but not posterior, corpus callosum atrophy was associated with deficits of attention and executive functions as reflected by the symbol digit modalities and digit cancellation tests, as well as by the subtraction scores in the trail...
Corpus callosum atrophy is associated with mental slowing and executive deficits in subjects with age-related white matter hyperintensities. The LADIS study

DEFF Research Database (Denmark)

Jokinen, Hanne; Ryberg, Charlotte; Stegmann, Mikkel Bille

2007-01-01

Background: Previous research has indicated that corpus callosum atrophy is associated with global cognitive decline in neurodegenerative diseases, but few studies have investigated specific cognitive functions. Objective: To investigate the role of regional corpus callosum atrophy in mental speed...... of the total corpus callosum area and its subregions with cognitive performance were analysed using multiple linear regression, controlling for volume of WMH and other confounding factors. Results: Atrophy of the total corpus callosum area was associated with poor performance in tests assessing speed of mental...... processing - namely, trail making A and Stroop test parts I and II. Anterior, but not posterior, corpus callosum atrophy was associated with deficits of attention and executive functions as reflected by the symbol digit modalities and digit cancellation tests, as well as by the subtraction scores...
Phosphodiesterase-9 (PDE9) inhibition with BAY 73-6691 increases corpus cavernosum relaxations mediated by nitric oxide-cyclic GMP pathway in mice.

Science.gov (United States)

da Silva, F H; Pereira, M N; Franco-Penteado, C F; De Nucci, G; Antunes, E; Claudino, M A

2013-01-01

Phosphodiesterase-9 (PDE9) specifically hydrolyzes cyclic GMP, and was detected in human corpus cavernosum. However, no previous studies explored the selective PDE9 inhibition with BAY 73-6691 in corpus cavernosum relaxations. Therefore, this study aimed to characterize the PDE9 mRNA expression in mice corpus cavernosum, and investigate the effects of BAY 73-6691 in endothelium-dependent and -independent relaxations, along with the nitrergic corpus cavernosum relaxations. Male mice received daily gavage of BAY 73-6691 (or dimethylsulfoxide) at 3 mg kg(-1) per day for 21 days. Relaxant responses to acetylcholine (ACh), nitric oxide (NO) (as acidified sodium nitrite; NaNO2 solution), sildenafil and electrical-field stimulation (EFS) were obtained in corpus cavernosum in control and BAY 73-6691-treated mice. BAY 73-6691 was also added in vitro 30 min before construction of concentration-responses and frequency curves. PDE9A and PDE5 mRNA expression was detected in the mice corpus cavernosum in a similar manner. In vitro addition of BAY 73-6691 neither itself relaxed mice corpus cavernosum nor changed the NaNO2, sildenafil and EFS-induced relaxations. However, in mice treated chronically with BAY 73-6691, the potency (pEC50) values for ACh, NaNO2 and sildenafil were significantly greater compared with control group. The maximal responses (Emax) to NaNO2 and sildenafil were also significantly greater in BAY 73-6691-treated mice. BAY 73-6691 treatment also significantly increased the magnitude and duration of the nitrergic corpus cavernosum relaxations (8-32 Hz). In conclusion, murine corpus cavernosum expresses PDE9 mRNA. Prolonged PDE9 inhibition with BAY 73-6691 amplifies the NO-cGMP-mediated cavernosal responses, and may be of therapeutic value for erectile dysfunction.
A corpus of full-text journal articles is a robust evaluation tool for revealing differences in performance of biomedical natural language processing tools.

Science.gov (United States)

Verspoor, Karin; Cohen, Kevin Bretonnel; Lanfranchi, Arrick; Warner, Colin; Johnson, Helen L; Roeder, Christophe; Choi, Jinho D; Funk, Christopher; Malenkiy, Yuriy; Eckert, Miriam; Xue, Nianwen; Baumgartner, William A; Bada, Michael; Palmer, Martha; Hunter, Lawrence E

2012-08-17

We introduce the linguistic annotation of a corpus of 97 full-text biomedical publications, known as the Colorado Richly Annotated Full Text (CRAFT) corpus. We further assess the performance of existing tools for performing sentence splitting, tokenization, syntactic parsing, and named entity recognition on this corpus. Many biomedical natural language processing systems demonstrated large differences between their previously published results and their performance on the CRAFT corpus when tested with the publicly available models or rule sets. Trainable systems differed widely with respect to their ability to build high-performing models based on this data. The finding that some systems were able to train high-performing models based on this corpus is additional evidence, beyond high inter-annotator agreement, that the quality of the CRAFT corpus is high. The overall poor performance of various systems indicates that considerable work needs to be done to enable natural language processing systems to work well when the input is full-text journal articles. The CRAFT corpus provides a valuable resource to the biomedical natural language processing community for evaluation and training of new models for biomedical full text publications.
Assessing the Lexico-Grammatical Characteristics of a Corpus of College-Level Statistics Textbooks: Implications for Instruction and Practice

Science.gov (United States)

Wagler, Amy E.; Lesser, Lawrence M.; González, Ariel I.; Leal, Luis

2015-01-01

A corpus of current editions of statistics textbooks was assessed to compare aspects and levels of readability for the topics of "measures of center," "line of fit," "regression analysis," and "regression inference." Analysis with lexical software of these text selections revealed that the large corpus can…
Floral foregrounding: A corpus-assisted, cognitive stylistic study of the foregrounding of flowers in Mrs Dalloway

DEFF Research Database (Denmark)

Jensen, Marie Møller; Lottrup, Katrine; Nordentoft, Signe

2018-01-01

The study reported here combines quantitative and qualitative methods from both cognitive stylistics and corpus stylistics to analyze the flower-motif in Virginia Woolf’s novel Mrs Dalloway. The quantitative analysis compared the frequency of flower lemmas in the novel to both a reference corpus...... consisting of Woolf’s other works as well as a general corpus (the BNC). The analysis found significant differences between the frequencies in the novel and both corpora. The qualitative analysis is based on in the statistically significant results and considers cognitive entrenchment and salience...... in relation to these. Furthermore, the analysis also links these two notions to different types of foregrounding as conceptualized in stylistics proper. Finally, aspects of repetition, parallelism and symbolism in relation to the flower-motif are considered. In conclusion, it is found that the flower...
Dans un corpus hybride : les messages twittés, l’intertextualité et la formule

Directory of Open Access Journals (Sweden)

Virone Daniela

2015-01-01

Full Text Available L’article propose une réflexion pratique et méthodologique sur l’exploitation d’un corpus de twittes, considéré comme un corpus complexe pour ses caractéristiques particulières (dont la présence des métadonnées et la possibilité de le mettre en relation avec des corpus plus traditionnels. Le modèle d’analyse quantitative et qualitative expérimenté sur le débat autour du mariage homosexuel en France en 2013 et en particulier sur la formule « mariage pour tous », ici mot-dièse et formule, veut poser les bases pour de nouvelles méthodes d’exploitation des données en analyse du discours.
MORPOHOLOGICAL POS TAGGING IN ORAL LANGUAGE CORPUS: CHALLENGES FOR AELIUS

Directory of Open Access Journals (Sweden)

Gabriel de Ávila Othero

2014-12-01

Full Text Available In this paper, we present the results of our work with automatic morphological annotation of excerpts from a corpus of spoken language – belonging to the VARSUL project – using the free morphosyntatic tagger Aelius. We present 20 texts containing 154,530 words, annotated automatically and corrected manually. This paper presents the tagger Aelius and our work of manual review of the texts, as well as our suggestions for improvements of the tool, concerning aspects of oral texts. We verify the performance of morphosyntactic tagging a spoken language corpus, an unprecedented challenge for the tagger. Based on the errors of the tagger, we try to infer certain patterns of annotation to overcome limitations presented by the program, and we propose suggestions for implementations in order to allow Aelius to tag spoken language corpora in a more effective way, specially treating cases such as interjections, apheresis, onomatopeia and conversational markers.
Variation in Citational Practice in a Corpus of Student Biology Papers: From Parenthetical Plonking to Intertextual Storytelling

Science.gov (United States)

Swales, John M.

2014-01-01

This is a corpus-based study of a key aspect of academic writing in one discipline (biology) by final-year undergraduates and first-, second-, and third-year graduate students. The papers come from the Michigan Corpus of Upper-level Student Papers, a freely available electronic database. The principal aim of the study is to examine the extent of…
Constitution d’un Corpus de Français Langue Etrangère destiné aux Apprenants Allemands

Directory of Open Access Journals (Sweden)

Fauth Camille

2014-07-01

Nationale de la Recherche et Deutsche Forschungsgemeinschaft attribué à l’équipe Parole du LORIA UMR 7503, Nancy – France et à l’Equipe de Linguistique Computationnelle et de Phonétique FR 4.7 de l’Université de la Sarre Sarrebruck – Allemagne dans lequel le français et l’allemand sont des langues cibles. Pour la paire allemand-français, peu de corpus parallèles sont disponibles. Nous présentons ici l’élaboration d’un corpus de productions orales de locuteurs natifs et non natifs pour la paire allemand-français. Notre corpus entend mettre au jour les déviations phonétiques et phonologiques que les locuteurs allemands produisent lorsqu’ils apprennent le français. Ce travail s’insère dans un projet plus global, Ce projet entend étudier les difficultés que les locuteurs français rencontrent lorsqu’ils apprennent l’allemand, et réciproquement. Aussi, cinquante locuteurs allemands seront recrutés dans des milieux universitaires et scolaires (niveau lycée en Allemagne et cinquante locuteurs français dans les mêmes milieux en France. Il s’agit pour les deux populations de produire d’une part le corpus en langue étrangère (en langue française pour les locuteurs allemands et en langue allemande pour les locuteurs français mais également le corpus en langue maternelle (en allemand pour les allemands et en français pour les français. Les corpus ainsi obtenus devraient nous permettre d’identifier les difficultés que les locuteurs allemands ou français rencontrent lorsqu’ils apprennent le français ou l’allemand. Les données de contrôle sont doubles puisque l’on pourra à la fois se référer aux productions des apprenants dans leur langue maternelle (ici l’allemand, mais également à celles de locuteurs natifs (ici germanophones. Nous ne présenterons ici que la constitution du corpus en français.
One hundred million years of interhemispheric communication: the history of the corpus callosum

Directory of Open Access Journals (Sweden)

Aboitiz F.

2003-01-01

Full Text Available Analysis of regional corpus callosum fiber composition reveals that callosal regions connecting primary and secondary sensory areas tend to have higher proportions of coarse-diameter, highly myelinated fibers than callosal regions connecting so-called higher-order areas. This suggests that in primary/secondary sensory areas there are strong timing constraints for interhemispheric communication, which may be related to the process of midline fusion of the two sensory hemifields across the hemispheres. We postulate that the evolutionary origin of the corpus callosum in placental mammals is related to the mechanism of midline fusion in the sensory cortices, which only in mammals receive a topographically organized representation of the sensory surfaces. The early corpus callosum may have also served as a substrate for growth of fibers connecting higher-order areas, which possibly participated in the propagation of neuronal ensembles of synchronized activity between the hemispheres. However, as brains became much larger, the increasingly longer interhemispheric distance may have worked as a constraint for efficient callosal transmission. Callosal fiber composition tends to be quite uniform across species with different brain sizes, suggesting that the delay in callosal transmission is longer in bigger brains. There is only a small subset of large-diameter callosal fibers whose size increases with increasing interhemispheric distance. These limitations in interhemispheric connectivity may have favored the development of brain lateralization in some species like humans. "...if the currently received statements are correct, the appearance of the corpus callosum in the placental mammals is the greatest and most sudden modification exhibited by the brain in the whole series of vertebrated animals..." T.H. Huxley (1.
Age-related signal intensity changes in the corpus callosum: assessment with three orthogonal FLAIR images

Energy Technology Data Exchange (ETDEWEB)

Yamamoto, Akira; Miki, Yukio; Kanagaki, Mitsunori; Takahashi, Takahiro; Fushimi, Yasutaka; Haque, Tabassum Laz; Togashi, Kaori [Kyoto University, Department of Nuclear Medicine and Diagnostic Imaging, Graduate School of Medicine, Kyoto (Japan); Tomimoto, Hidekazu [Kyoto University, Department of Neurology, Graduate School of Medicine, Kyoto (Japan); Konishi, Junya [Kobe University, Department of Radiology, Graduate School of Medicine, Kobe, Hyogo (Japan)

2005-11-01

The presence of age-related hyperintensities of the corpus callosum has not been thoroughly evaluated. Fifty-two patients of 50 years of age or older (mean, 71 years; range, 50-87 years) were included in this study. Fluid-attenuated inversion recovery images were obtained in three orthogonal planes. Periventricular hyperintensities (PVHs) and deep white matter hyperintensities (DWMHs) were graded according to Fazekas' rating scale. Correlations between the presence of hyperintensities in the corpus callosum and age, and the grade of PVH and DWMH were statistically analyzed. PVH was categorized as grade 0 (n=4), grade 1 (n=28), grade 2 (n=10), or grade 3 (n=10). DWMH was categorized as grade 0 (n=4), grade 1 (n=25), grade 2 (n=8), or grade 3 (n=15). Hyperintensity was considered present in the corpus callosum in 31 of the 52 patients (60%). In these 31 patients, PVH was categorized as grade 1 (n=16), grade 2 (n=7), or grade 3 (n=8), while DWMH was categorized as grade 0 (n=1), grade 1 (n=10), grade 2 (n=7), or grade 3 (n=13). The presence of callosal hyperintensities was significantly correlated with age (p=0.001), and with PVH (p=0.04) and DWMH grades (p=0.004). Hyperintensities may be present in the corpus callosum with aging, and are correlated with PVH and DWMH. (orig.)

Age-related signal intensity changes in the corpus callosum: assessment with three orthogonal FLAIR images

International Nuclear Information System (INIS)

Yamamoto, Akira; Miki, Yukio; Kanagaki, Mitsunori; Takahashi, Takahiro; Fushimi, Yasutaka; Haque, Tabassum Laz; Togashi, Kaori; Tomimoto, Hidekazu; Konishi, Junya

2005-01-01

The presence of age-related hyperintensities of the corpus callosum has not been thoroughly evaluated. Fifty-two patients of 50 years of age or older (mean, 71 years; range, 50-87 years) were included in this study. Fluid-attenuated inversion recovery images were obtained in three orthogonal planes. Periventricular hyperintensities (PVHs) and deep white matter hyperintensities (DWMHs) were graded according to Fazekas' rating scale. Correlations between the presence of hyperintensities in the corpus callosum and age, and the grade of PVH and DWMH were statistically analyzed. PVH was categorized as grade 0 (n=4), grade 1 (n=28), grade 2 (n=10), or grade 3 (n=10). DWMH was categorized as grade 0 (n=4), grade 1 (n=25), grade 2 (n=8), or grade 3 (n=15). Hyperintensity was considered present in the corpus callosum in 31 of the 52 patients (60%). In these 31 patients, PVH was categorized as grade 1 (n=16), grade 2 (n=7), or grade 3 (n=8), while DWMH was categorized as grade 0 (n=1), grade 1 (n=10), grade 2 (n=7), or grade 3 (n=13). The presence of callosal hyperintensities was significantly correlated with age (p=0.001), and with PVH (p=0.04) and DWMH grades (p=0.004). Hyperintensities may be present in the corpus callosum with aging, and are correlated with PVH and DWMH. (orig.)
MR measurement of normal brainstem cerebellum and corpus callosum on midsagittal section

International Nuclear Information System (INIS)

Kogame, Saeko; Sawa, S.; Inoue, Yuichi; Fukuda, Teruo; Tada, Takuji; Shakudo, Miyuki; Yahata, Kunifumi; Shimizu, Hiroshi; Onoyama, Yasuhito.

1989-01-01

The dimensions of the brainstem, cerebellum and corpus callosum were measured on magnetic resonance (MR) images with sagittal spin-echo sequence. Eighty-two normal adults (average 49.6 years old) were measured. The mesencephalic, pontine or cerebellar diamaters and lengths could be measured more accurately and reproducibly than medullary diameter and length. The anterio-posterior diameter of the pons and the cerebellum was 23.2±1.4 mm and 26.4±2.5 mm respectively. The length of the pons and the cerebellum was 27.8±2 mm and 45.8±3.5 mm respectively. We have observed focal thinning at the body of corpus callosum in 73%. This narrowing is almost unquestionably a normal variant. (author)
Ruptured corpus luteal cyst: Prediction of clinical outcomes with CT

Energy Technology Data Exchange (ETDEWEB)

Lee, Myoung Seok; Moon, Min Hoan; Woo, Hyun Sik; Sung, Chang Kyu; Jeon, Hye Won; Lee, Taek Sang [SMG-SNU Boramae Medical Center, Seoul National University College of Medicine, Seoul (Korea, Republic of)

2017-08-01

To evaluate the determinant pretreatment CT findings that can predict surgical intervention for patients suffering from corpus luteal cyst rupture with hemoperitoneum. From January 2009 to December 2014, a total of 106 female patients (mean age, 26.1 years; range, 17–44 years) who visited the emergency room of our institute for acute abdominal pain and were subsequently diagnosed with ruptured corpus luteal cyst with hemoperitoneum were included in the retrospective study. The analysis of CT findings included cyst size, cyst shape, sentinel clot sign, ring of fire sign, hemoperitoneum depth, active bleeding in portal phase and attenuation of hemoperitoneum. The comparison of CT findings between the surgery and conservative management groups was performed with the Mann-Whitney U test or chi-square test. Logistic regression analysis was used to determine significant CT findings in predicting surgical intervention for a ruptured cyst. Comparative analysis revealed that the presence of active bleeding and the hemoperitoneum depth were significantly different between the surgery and conservative management groups and were confirmed as significant CT findings for predicting surgery, with adjusted odds ratio (ORs) of 3.773 and 1.318, respectively (p < 0.01). On the receiver-operating characteristic curve analysis for hemoperitoneum depth, the optimal cut-off value was 5.8 cm with 73.7% sensitivity and 58.6% specificity (Az = 0.711, p = 0.004). In cases with a hemoperitoneum depth > 5.8 cm and concurrent active bleeding, the OR for surgery increased to 5.786. The presence of active bleeding and the hemoperitoneum depth on a pretreatment CT scan can be predictive warning signs of surgery for a patient with a ruptured corpus luteal cyst with hemoperitoneum.
Disambiguating Praxis from Practice in Natural Resource Management: A Practical Space for Enhancing Experiential Learning in the Eastern Coast of Tanzania

Directory of Open Access Journals (Sweden)

Sabai Daniel

2017-03-01

Full Text Available It is evident that practice and praxis have significantly contributed to knowledge generation in the Tanzanian coastal belt, especially where Integrated Coastal Management (ICM programmes have been adopted and practiced such as Tanga, Dar es Salaam, Mtwara, Lindi, and the Coastal region (KICAMP, 2001; NICEMS, 2003. In spite of such learning evidences, users of generated natural resource data in the coastal area tend to employ practice and praxis interchangeably, conflating the two concepts together; leading to a situation where one may hardly ascribe generated knowledge appropriately to contexts that favour occurrence of each of the two constructs. The paper adopts ethnographic approach in a defined coastal case study to examine contexts and situations that signals “conflationˮ and it employs examples that may help readers of the article to disambiguate praxis from practice.
Corpus callosum atrophy in patients with mild Alzheimer's disease

DEFF Research Database (Denmark)

Frederiksen, Kristian Steen; Garde, Ellen; Skimminge, Arnold

2011-01-01

Several studies have found atrophy of the corpus callosum (CC) in patients with Alzheimer's disease (AD). However, it remains unclear whether callosal atrophy is already present in the early stages of AD, and to what extent it may be associated with other structural changes in the brain......, such as age-related white matter changes (ARWMC) and progression of the disease....
Fatty acid composition of the postmortem corpus callosum of patients with schizophrenia, bipolar disorder, or major depressive disorder.

Science.gov (United States)

Hamazaki, K; Maekawa, M; Toyota, T; Dean, B; Hamazaki, T; Yoshikawa, T

2017-01-01

Studies investigating the relationship between n-3 polyunsaturated fatty acid (PUFA) levels and psychiatric disorders have thus far focused mainly on analyzing gray matter, rather than white matter, in the postmortem brain. In this study, we investigated whether PUFA levels showed abnormalities in the corpus callosum, the largest area of white matter, in the postmortem brain tissue of patients with schizophrenia, bipolar disorder, or major depressive disorder. Fatty acids in the phospholipids of the postmortem corpus callosum were evaluated by thin-layer chromatography and gas chromatography. Specimens were evaluated for patients with schizophrenia (n=15), bipolar disorder (n=15), or major depressive disorder (n=15) and compared with unaffected controls (n=15). In contrast to some previous studies, no significant differences were found in the levels of PUFAs or other fatty acids in the corpus callosum between patients and controls. A subanalysis by sex gave the same results. No significant differences were found in any PUFAs between suicide completers and non-suicide cases regardless of psychiatric disorder diagnosis. Patients with psychiatric disorders did not exhibit n-3 PUFAs deficits in the postmortem corpus callosum relative to the unaffected controls, and the corpus callosum might not be involved in abnormalities of PUFA metabolism. This area of research is still at an early stage and requires further investigation. Copyright © 2016 Elsevier Masson SAS. All rights reserved.
Corpus callosum demyelination associated with acquired stuttering.

Science.gov (United States)

Decker, Barbara McElwee; Guitar, Barry; Solomon, Andrew

2018-04-21

Compared with developmental stuttering, adult onset acquired stuttering is rare. However, several case reports describe acquired stuttering and an association with callosal pathology. Interestingly, these cases share a neuroanatomical localisation also demonstrated in developmental stuttering. We present a case of adult onset acquired stuttering associated with inflammatory demyelination within the corpus callosum. This patient's disfluency improved after the initiation of immunomodulatory therapy. © BMJ Publishing Group Ltd (unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.
Ethiopian Journal of Education and Sciences - Vol 12, No 1 (2016)

African Journals Online (AJOL)

An overview of agricultural producers' co-operatives (APCs) in Arssi, 1978-1991 · EMAIL FREE FULL TEXT EMAIL FREE FULL TEXT ... Towards the sense disambiguation of Afan Oromo words using hybrid approach (unsupervised machine learning and rule based) · EMAIL FREE FULL TEXT EMAIL FREE FULL TEXT
Marchiafava-Bignami disease: magnetic resonance imaging findings in corpus callosum and subcortical white matter

Energy Technology Data Exchange (ETDEWEB)

Kawarabuki, Kentaro E-mail: bukky@h2.dion.ne.jp; Sakakibara, Takehiko; Hirai, Makoto; Yoshioka, Yuji; Yamamoto, Yasumasa; Yamaki, Tarumi

2003-11-01

A case of Marchiafava-Bignami disease (MBD) is presented using magnetic resonance imaging (MRI). A patient with a long history of alcoholism developed a gait disturbance with involuntary movements at the lower extremities. MRI scans taken at the onset showed no particular abnormalities. He progressed to a coma 10 days later. MRI scans taken 20 days after the onset showed a focal lesion at the genu of the corpus callosum and he was diagnosed as having MBD. In addition, multiple lesions were observed in bilateral frontoparietal subcortical white matter. These lesions demonstrated similar intense MRI signals as the corpus callosum.
Corpus luteum blood flow in normal and abnormal early pregnancy: evaluation and analysis with transvaginal color and pulsed doppler sonography

International Nuclear Information System (INIS)

Tang Xiaoyi; Lin Meifang; Zheng Meirong; Liang Xiaoxian; Liu Jianfeng

2005-01-01

Objective: Detecting and assessment the corpus luteum blood flow in normal and abnormal early pregnancy. Methods: Using transvaginal color and pulse Doppler sonography, we detected 215 pregnant women including 150 normal intrauterine pregnancies, 25 abortion, 29 ectopic pregnancies, and then recorded corpus luteum blood flow feature and the blood flow indexes (Vmax, RI and PI). Results: 1) Corpus luteum was successfully identified in 148 cases out of 150 of normal early pregnancies, 25 cases out of 26 of threatened abortion; 22 cases out of 29 of ectopic pregnancy. 2) Three groups shared the same feature of Color Doppler imaging: a circumferential rim around the entire corpus luteum. 3) The flow index revealed mean PVS, RI and PI had no statistical difference in normal and abnormal early pregnancy; The mean PVS was lower in ectopic pregnancy than in normal pregnancy (P<0.05), while PI and PR had no characteristic in ectopic pregnancy group compared with the indexes obtained in normal pregnancy group. Conclusion: The corpus luteum can be precisely identified in most pregnancy using transvaginal color Doppler and manifests a characterized rim Doppler imaging. PVS may help in differentiating the ectopic pregnancy from normal early pregnancy. (authors)
The Use of Corpus Examples for Language Comprehension and Production

Science.gov (United States)

Frankenberg-Garcia, Ana

2014-01-01

One of the many new features of English language learners' dictionaries derived from the technological developments that have taken place over recent decades is the presence of corpus-based examples to illustrate the use of words in context. However, empirical studies have generally not been able to produce conclusive evidence about their…
Interaction as 'involvement' in writing for students: a corpus linguistic ...

African Journals Online (AJOL)

Interaction as 'involvement' in writing for students: a corpus linguistic analysis of a key readability feature. E Hilton Hubbard. Abstract. The rapid change in the demographics of South Africa's tertiary level student population over the last decade — and most specifically the huge increase in those who have to study at a ...
Chinese students' writing in English implications from a corpus-driven study

CERN Document Server

Leedham, Maria

2014-01-01

Chinese students are the largest international student group in UK universities today, yet little is known about their undergraduate writing and the challenges they face. Drawing on the British Academic Written English corpus - a large corpus of proficient undergraduate student writing collected in the UK in the early 2000s - this study explores Chinese students' written assignments in English in a range of university disciplines, contrasting these with assignments from British students. The study is supplemented by questionnaire and interview datasets with discipline lecturers, writing tutors and students, and provides a comprehensive picture of the Chinese student writer today. Theoretically framed through work within academic literacies and lexical priming, the author seeks to explore what we know about Chinese students' writing and to extend these findings to undergraduate writing more generally. In a globalized educational environment, it is important for educators to understand differences in writing st...
Vocabulary Practice and Media Representation: A Corpus-Assisted Study of Macroeconomic News

Directory of Open Access Journals (Sweden)

Win-Ping Kuo

2015-11-01

Full Text Available This Paper introduces corpus methods and its application to media text analysis. The researcher collect 1,363 macroeconomic reports from three major Taiwanese newspapers, including Apple Daily, The Liberty Times, and The United Daily as the copra. Research shows that corpus-assisted media text analysis enables researcher to calculate frequency of vocabulary and analyze lexical structure of the text via concordance and collocation. By using macroeconomic news as the study case, this paper also found that news reports tend to simplify GDP number as a mission, prefer attributing local economic performance as a systematic problem of global economy, and treat economy as a manageable task by attributing it to the government. All these ideologies and values are reflected on vocabularies and discursive practice of media.
Segmentation of the Canine Corpus Callosum using Diffusion Tensor Imaging Tractography

Science.gov (United States)

Pierce, T.T.; Calabrese, E.; White, L.E.; Chen, S.D.; Platt, S.R.; Provenzale, J.M.

2014-01-01

Background We set out to determine functional white matter (WM) connections passing through the canine corpus callosum useful for subsequent studies of canine brains that serve as models for human WM pathway disease. Based on prior studies, we anticipated that the anterior corpus callosum would send projections to the anterior cerebral cortex while progressively posterior segments would send projections to more posterior cortex. Methods A post mortem canine brain was imaged using a 7T MRI producing 100 micron isotropic resolution DTI analyzed by tractography. Using ROIs within cortical locations, which were confirmed by a Nissl stain that identified distinct cortical architecture, we successfully identified 6 important WM pathways. We also compared fractional anisotropy (FA), apparent diffusion coefficient (ADC), radial diffusivity (RD), and axial diffusivity (AD) in tracts passing through the genu and splenium. Results Callosal fibers were organized based upon cortical destination, i.e. fibers from the genu project to the frontal cortex. Histologic results identified the motor cortex based on cytoarchitectonic criteria that allowed placement of ROIs to discriminate between frontal and parietal lobes. We also identified cytoarchitecture typical of the orbital frontal, anterior frontal, and occipital regions and placed ROIs accordingly. FA, ADC, RD and AD values were all higher in posterior corpus callosum fiber tracts. Conclusions Using 6 cortical ROIs, we identified 6 major white matter tracts that reflect major functional divisions of the cerebral hemispheres and we derived quantitative values that can be used for study of canine models of human WM pathological states. PMID:24370161
Segmentation of the canine corpus callosum using diffusion-tensor imaging tractography.

Science.gov (United States)

Pierce, Theodore T; Calabrese, Evan; White, Leonard E; Chen, Steven D; Platt, Simon R; Provenzale, James M

2014-01-01

We set out to determine functional white matter (WM) connections passing through the canine corpus callosum; these WM connections would be useful for subsequent studies of canine brains that serve as models for human WM pathway disease. Based on prior studies, we anticipated that the anterior corpus callosum would send projections to the anterior cerebral cortex whereas progressively posterior segments would send projections to more posterior cortex. A postmortem canine brain was imaged using a 7-T MRI system producing 100-μm-isotropic-resolution diffusion-tensor imaging analyzed by tractography. Using regions of interest (ROIs) within cortical locations, which were confirmed by a Nissl stain that identified distinct cortical architecture, we successfully identified six important WM pathways. We also compared fractional anisotropy (FA), apparent diffusion coefficient (ADC), radial diffusivity, and axial diffusivity in tracts passing through the genu and splenium. Callosal fibers were organized on the basis of cortical destination (e.g., fibers from the genu project to the frontal cortex). Histologic results identified the motor cortex on the basis of cytoarchitectonic criteria that allowed placement of ROIs to discriminate between frontal and parietal lobes. We also identified cytoarchitecture typical of the orbital frontal, anterior frontal, and occipital regions and placed ROIs accordingly. FA, ADC, radial diffusivity, and axial diffusivity values were all higher in posterior corpus callosum fiber tracts. Using six cortical ROIs, we identified six major WM tracts that reflect major functional divisions of the cerebral hemispheres, and we derived quantitative values that can be used for study of canine models of human WM pathologic states.
Learning From Short Text Streams With Topic Drifts.

Science.gov (United States)

Li, Peipei; He, Lu; Wang, Haiyan; Hu, Xuegang; Zhang, Yuhong; Li, Lei; Wu, Xindong

2017-09-18

Short text streams such as search snippets and micro blogs have been popular on the Web with the emergence of social media. Unlike traditional normal text streams, these data present the characteristics of short length, weak signal, high volume, high velocity, topic drift, etc. Short text stream classification is hence a very challenging and significant task. However, this challenge has received little attention from the research community. Therefore, a new feature extension approach is proposed for short text stream classification with the help of a large-scale semantic network obtained from a Web corpus. It is built on an incremental ensemble classification model for efficiency. First, more semantic contexts based on the senses of terms in short texts are introduced to make up of the data sparsity using the open semantic network, in which all terms are disambiguated by their semantics to reduce the noise impact. Second, a concept cluster-based topic drifting detection method is proposed to effectively track hidden topic drifts. Finally, extensive studies demonstrate that as compared to several well-known concept drifting detection methods in data stream, our approach can detect topic drifts effectively, and it enables handling short text streams effectively while maintaining the efficiency as compared to several state-of-the-art short text classification approaches.
Gender-based differences in the shape of the human corpus callosum are associated with allometric variations

Science.gov (United States)

Bruner, Emiliano; de la Cuétara, José Manuel; Colom, Roberto; Martin-Loeches, Manuel

2012-01-01

The corpus callosum displays considerable morphological variability between individuals. Although some characteristics are thought to differ between male and female brains, there is no agreement regarding the source of this variation. Biomedical imaging and geometric morphometrics have provided tools to investigate shape and size variation in terms of integration and correlation. Here we analyze variations at the midsagittal outline of the corpus callosum in a sample of 102 young adults in order to describe and quantify the pattern of covariation associated with its morphology. Our results suggest that the shape of the corpus callosum is characterized by low levels of morphological integration, which explains the large variability. In larger brains, a minor allometric component involves a relative reduction of the splenium. Small differences between males and?females are associated with this allometric pattern, induced primarily by size variation rather than gender-specific characteristics. PMID:22296183
ContextD: an algorithm to identify contextual properties of medical terms in a Dutch clinical corpus.

Science.gov (United States)

Afzal, Zubair; Pons, Ewoud; Kang, Ning; Sturkenboom, Miriam C J M; Schuemie, Martijn J; Kors, Jan A

2014-11-29

In order to extract meaningful information from electronic medical records, such as signs and symptoms, diagnoses, and treatments, it is important to take into account the contextual properties of the identified information: negation, temporality, and experiencer. Most work on automatic identification of these contextual properties has been done on English clinical text. This study presents ContextD, an adaptation of the English ConText algorithm to the Dutch language, and a Dutch clinical corpus. We created a Dutch clinical corpus containing four types of anonymized clinical documents: entries from general practitioners, specialists' letters, radiology reports, and discharge letters. Using a Dutch list of medical terms extracted from the Unified Medical Language System, we identified medical terms in the corpus with exact matching. The identified terms were annotated for negation, temporality, and experiencer properties. To adapt the ConText algorithm, we translated English trigger terms to Dutch and added several general and document specific enhancements, such as negation rules for general practitioners' entries and a regular expression based temporality module. The ContextD algorithm utilized 41 unique triggers to identify the contextual properties in the clinical corpus. For the negation property, the algorithm obtained an F-score from 87% to 93% for the different document types. For the experiencer property, the F-score was 99% to 100%. For the historical and hypothetical values of the temporality property, F-scores ranged from 26% to 54% and from 13% to 44%, respectively. The ContextD showed good performance in identifying negation and experiencer property values across all Dutch clinical document types. Accurate identification of the temporality property proved to be difficult and requires further work. The anonymized and annotated Dutch clinical corpus can serve as a useful resource for further algorithm development.
Advantages and Disadvantages in the Use of Internet as a Corpus

DEFF Research Database (Denmark)

Tarp, Sven; Fuertes-Olivera, Pedro A.

2016-01-01

This paper initially discusses some of the consequences which the technological development has for lexicography, especially in terms of the different types of empirical basis which can be used in dictionary projects. The most important advantages and disadvantages of using the Internet as a corpus...

Methodological Flaws in Corpus-Based Studies on Malaysian ESL Textbooks

Science.gov (United States)

Zarifi, Abdolvahed; Mukundan, Jayakaran; Rezvani Kalajahi, Seyed Ali

2014-01-01

With the increasing interest among the pedagogy researchers in the use of corpus linguistics methodologies to study textbooks, there has emerged a similar enthusiasm among the materials developers to draw on empirical findings in the development of the state-of-the-art curricula and syllabi. In order for these research findings to have their…
The significance of estradiol metabolites in human corpus luteum physiology.

Science.gov (United States)

Devoto, Luigi; Henríquez, Soledad; Kohen, Paulina; Strauss, Jerome F

2017-07-01

The human corpus luteum (CL) is a temporary endocrine gland derived from the ovulated follicle. Its formation and limited lifespan is critical for steroid hormone production required to support menstrual cyclicity, endometrial receptivity for successful implantation, and the maintenance of early pregnancy. Endocrine and paracrine-autocrine molecular mechanisms associated with progesterone production throughout the luteal phase are critical for the development, maintenance, regression, and rescue by hCG which sustains CL function into early pregnancy. However, the signaling systems driving the regression of the primate corpus luteum in non-conception cycles are not well understood. Recently, there has been interest in the functional roles of estradiol metabolites (EMs), mostly in estrogen-producing tissues. The human CL produces a number of EMs, and it has been postulated that the EMs acting via paracrine-autocrine pathways affect angiogenesis or LH-mediated events. The present review describes advances in understanding the role of EMs in the functional lifespan and regression of the human CL in non-conception cycles. Copyright © 2017 Elsevier Inc. All rights reserved.
IL NOME IN LIS NEL SEGNATO DI ADULTI UDENTI: UNA INDAGINE PRELIMINARE SUL CORPUS LISAU

Directory of Open Access Journals (Sweden)

Matteo La Grassa

2016-09-01

Full Text Available L’indagine presenta i primi risultati emersi dall’analisi di una parte del corpus LISAU (LIS di Adulti Udenti sulla produzione segnata del sintagma nominale in LIS da parte di informanti udenti che hanno appreso la LIS come L2 in età adulta. Scopo dell’indagine è cominciare a tracciare una linea di ricerca nell’ambito della linguistica acquisizionale con riferimento all’acquisizione della LIS come L2 da parte di udenti. Il corpus LISAU include il segnato di 7 informanti udenti con livello di competenza omogenea che hanno terminato un corso di terzo livello presso la sede Ente Nazionale Sordi di Prato e di 2 informanti sordi segnanti nativi considerati come gruppo di controllo. L’analisi si è incentrata sulla realizzazione dei nomi di prima e di seconda classe rilevando anche forme non citazionali, sulla realizzazione di forme plurali e sulle modalità di accordo tra nomi e aggettivi. Dalla maggior parte dei dati analizzati si rileva la piena competenza degli informanti nella realizzazione del sintagma nominale. Nouns Signed by hearing adults in LIS: a preliminary survey on the LISAU corpus The results of an analysis concerning part of the LISAU (LIS of Hearing Adults corpus related to the production of the noun phrase in LIS by hearing informants who learned the LIS L2 in adulthood are presented. The purpose of the investigation was to outline the process with regard to the acquisition of LIS L2 by hearing adults. The LISAU corpus is composed of the sign language of 7 hearing informants with a homogeneous level of competence who completed a third-level course at the Ente Nazionale Sordi in Prato. LISAU also includes the sign language of 2 deaf native signers, considered the control group. The analysis focuses on the first and second-class nouns, including non-citation forms, plural forms and noun-adjective agreement. Most of the analyzed data reveals the informants’ full competence in creating noun phrases.
Uptake of 3H-choline and synthesis of 3H-acetylcholine by human penile corpus cavernosum

International Nuclear Information System (INIS)

Blanco, R.; Saenz de Tejada, I.; Azadzoi, K.; Goldstein, I.; Krane, R.J.; Wotiz, H.H.; Cohen, R.A.

1986-01-01

The neuroeffectors which relax penile smooth muscle and lead to erection are unknown; physiological studies of human corpus cavernosum, in vitro, have suggested a significant role of cholinergic neurotransmission. To further characterize the importance of cholinergic nerves, biopsies of human corpus cavernosum were obtained at the time of penile prosthesis implantation. Tissues were incubated in 3 H-choline (10 -5 M, 80 Ci/mmol) in oxygenated physiological salt solution at 37 0 C, pH 7.4 for 1 hour. Radiolabelled compounds were extracted with perchloric acid (0.4 M) and acetylcholine and choline were separated by HPLC; 14 C-acetylcholine was used as internal standard. 3 H-choline was accumulated by the tissues (20 +/- 1.9 fmol/mg), and 3 H-acetylcholine was synthesized (4.0 +/- 1.1 fmol/mg). In control experiments, heating of the tissue blocked synthesis of 3 H-acetylcholine. Inhibition of high affinity choline transport by hemicholinium-3 (10 -5 M) diminished tissue accumulation of 3 H-choline and significantly reduced the synthesis of 3 H-acetylcholine (0.5 +/ 0.2 fmol/mg, p < 0.05). These results provide direct evidence of neuronal accumulation of choline and enzymatic conversion to acetylcholine in human corpus cavernosum. Taken together with the physiological studies, it can be concluded that cholinergic neurotransmission in human corpus cavernosum plays a role in penile erection
Applying corpus linguistics methodology to psycholinguistics research Aplicando a metodologia da linguística de corpus à pesquisa psicolinguística

Directory of Open Access Journals (Sweden)

Luciane Corrêa Ferreira

2010-01-01

Full Text Available This study concerns the use of corpus linguistics methodology in psycholinguistics research. Ten linguistic metaphors were selected from English and American newspapers. After that, we identified the underlying conceptual metaphor based on the conceptual metaphor inventory by Lakoff and Johnson (1980, 1999. We seek to investigate what sort of knowledge EFL-learners use when trying to understand a linguistic metaphor. We examined how EFL-learners comprehend linguistic metaphors, firstly without using the context and then using the context. The sample comprised 221 Brazilian students and 16 American students at UCSC. We have also carried out an empirical research using WebCorp.Este estudo investiga o uso de metodologia de lingüística de corpus na pesquisa psicolingüística. Selecionaram-se dez metáforas em jornais ingleses e norte-americanos. Depois, identificou-se a metáfora conceptual subjacente com base no inventário de metáforas conceptuais apresentado por Lakoff e Johnson (1980, 1999. Buscou-se investigar que tipo de conhecimento os aprendizes de LE empregam ao tentar compreender metáforas. Analisou-se a maneira como os aprendizes de LE compreendem metáforas lingüísticas sem usar o contexto e, depois, utilizando o contexto. A amostra incluiu 221 estudantes brasileiros e 16 estudantes norte-americanos da UCSC.
On the Disambiguation of Passively Measured In-home Gait Velocities from Multi-person Smart Homes.

Science.gov (United States)

Austin, Daniel; Hayes, Tamara L; Kaye, Jeffrey; Mattek, Nora; Pavel, Misha

2011-01-01

In-home monitoring of gait velocity with passive PIR sensors in a smart home has been shown to be an effective method of continuously and unobtrusively measuring this important predictor of cognitive function and mobility. However, passive measurements of velocity are nonspecific with regard to who generated each measurement or walking event. As a result, this method is not suitable for multi-person homes without additional information to aid in the disambiguation of gait velocities. In this paper we propose a method based on Gaussian mixture models (GMMs) combined with infrequent clinical assessments of gait velocity to model in-home walking speeds of two or more residents. Modeling the gait parameters directly allows us to avoid the more difficult problem of assigning each measured velocity individually to the correct resident. We show that if the clinically measured gait velocities of residents are separated by at least 15 cm/s a GMM can be accurately fit to the in-home gait velocity data. We demonstrate the accuracy of this method by showing that the correlation between the means of the GMMs and the clinically measured gait velocities is 0.877 (p value < 0.0001) with bootstrapped 95% confidence intervals of (0.79, 0.94) for 54 measurements of 20 subjects living in multi-person homes. Example applications of using this method to track in-home mean velocities over time are also given.
Le corpus lexicographique dans les langues à tradition orale: le cas ...

African Journals Online (AJOL)

rbr

langues à tradition orale: les informateurs et la représentativité du corpus. Cette dernière, qui doit ..... les techniques, les instruments, la pêche par les hommes, la pêche par .... cet outil indispensable qu'est le dictionnaire. Je pense que ces ...
Como encontrar as palavras-chave mais importantes de um corpus com WordSmith tools How to find the most important keywords in a corpus with WordSmith tools

Directory of Open Access Journals (Sweden)

Tony Berber-Sardinha

2005-12-01

Full Text Available Um dos procedimentos mais delicados envolvidos numa análise de corpus via palavras-chave com WordSmith Tools KeyWords é a seleção de um sub-conjunto de palavras para serem investigadas em detalhe. A seleção se faz necessária, via de regra, porque o tamanho do léxico chave de um corpus de estudo é em geral muito grande, em geral em torno de 1500 palavras ou até mais. Uma maneira de fazer esse recorte consiste na extração de palavras-chave exclusivas. O léxico chave exclusivo é composto das palavras-chave que ocorrem somente no corpus de estudo em questão em comparação com palavras-chave de outros corpora de estudo. Contudo, comparar a lista de palavras-chave com várias outras é um procedimento custoso e complicado, que não pode ser exigido da maioria dos usuários de WordSmith Tools KeyWords. Uma alternativa para este cenário seria a aplicação de um ponto de corte generalizado baseado em tendências de retorno de palavras-chave observadas através da aplicação do banco de palavras-chave existente. Tal ponto de corte indicaria a região da lista de palavras-chave na qual há maior probabilidade de ocorrência do léxico chave exclusivo. Os resultados obtidos aqui indicam um ponto de corte entre 31% a 53% das palavras da lista, a partir da primeira de uma lista ordenada por chavicidade.One of the most sensitive issues surrounding a keywords analysis with WordSmith Tools is the selection of a subset of words in a corpus that deserve being looked at in greater detail. This selection is normally needed because the size of the key word list can reach several hundred, up to 1,500 or more. One way to extract a selection consists of the pulling out 'exclusive key words'. This key lexis is made up of keywords that only in a single corpus only, in comparison with a bank of keyword lists. Nevertheless, comparing several keyword lists together is a demanding task, which most users of WordSmith Tools are not expected to cope with. An
Partial segmental thrombosis of the corpus cavernosum presenting with perineal pain.

Science.gov (United States)

Christodoulidou, Michelle; Parnham, Arie; Ramachandran, Navin; Muneer, Asif

2016-11-22

We describe the case of a man aged 43 years who presented with a 2-week history of a palpable lump in the right proximal penile shaft. This was preceded by a 6-month history of perineal pain, accompanied by erectile dysfunction. An urgent MRI scan of his penis identified a thrombus within the right crus and corpus of the penis. His thrombophilia screen was normal. The patient was started on oral anticoagulation and a phosphodiesterase inhibitor (PDE-5i) to prevent thrombus progression and maintain erectile function. At 5 months, the patients' symptoms had resolved and an MRI showed a reduction in the thrombus size. MRI is a useful imaging modality to diagnose a thrombus within the corpus cavernosum in patients presenting with a history of penile and perineal pain together with a palpable lump. The non-enhancement of the lesion helps to differentiate this from alternative rare lesions within the penis and perineum. 2016 BMJ Publishing Group Ltd.
Perceptual evaluation of corpus-based speech synthesis techniques in under-resourced environments

CSIR Research Space (South Africa)

Van Niekerk, DR

2009-11-01

Full Text Available With the increasing prominence and maturity of corpus-based techniques for speech synthesis, the process of system development has in some ways been simplified considerably. However, the dependence on sufficient amounts of relevant speech data...
Male non-insulin users with type 2 diabetes mellitus are predisposed to gastric corpus-predominant inflammation after H. pylori infection.

Science.gov (United States)

Yang, Yao-Jong; Wu, Chung-Tai; Ou, Horng-Yih; Lin, Chin-Han; Cheng, Hsiu-Chi; Chang, Wei-Lun; Chen, Wei-Ying; Yang, Hsiao-Bai; Lu, Cheng-Chan; Sheu, Bor-Shyang

2017-10-30

Both H. pylori infection and diabetes increase the risk of gastric cancer. This study investigated whether patients with type 2 diabetes mellitus (T2DM) and H. pylori infection had more severe corpus gastric inflammation and higher prevalence of precancerous lesions than non-diabetic controls. A total of 797 patients with type 2 diabetes mellitus were screened for H. pylori, of whom 264 had H. pylori infection. Of these patients, 129 received esophagogastroduodenoscopy to obtain topographic gastric specimens for gastric histology according to the modified Updated Sydney System, corpus-predominant gastritis index (CGI), Operative Link on Gastritis Assessment, and Operative Link on Gastric Intestinal Metaplasia Assessment. Non-diabetic dyspeptic patients who had H. pylori infection confirmed by esophagogastroduodenoscopy were enrolled as controls. The male as well as total T2DM patients had higher acute/chronic inflammatory and lymphoid follicle scores in the corpus than non-diabetic controls (p H. pylori-infected patients with type 2 diabetes mellitus. Patients with type 2 diabetes mellitus and H. pylori infection had more severe corpus gastric inflammation than non-diabetic controls. Moreover, male gender and non-insulin users of T2DM patients were predisposed to have corpus-predominant gastritis after H. pylori infection. ClinicalTrial: NCT02466919 , retrospectively registered may 17, 2015.
Knowledge Guided Disambiguation for Large-Scale Scene Classification With Multi-Resolution CNNs

Science.gov (United States)

Wang, Limin; Guo, Sheng; Huang, Weilin; Xiong, Yuanjun; Qiao, Yu

2017-04-01

Convolutional Neural Networks (CNNs) have made remarkable progress on scene recognition, partially due to these recent large-scale scene datasets, such as the Places and Places2. Scene categories are often defined by multi-level information, including local objects, global layout, and background environment, thus leading to large intra-class variations. In addition, with the increasing number of scene categories, label ambiguity has become another crucial issue in large-scale classification. This paper focuses on large-scale scene recognition and makes two major contributions to tackle these issues. First, we propose a multi-resolution CNN architecture that captures visual content and structure at multiple levels. The multi-resolution CNNs are composed of coarse resolution CNNs and fine resolution CNNs, which are complementary to each other. Second, we design two knowledge guided disambiguation techniques to deal with the problem of label ambiguity. (i) We exploit the knowledge from the confusion matrix computed on validation data to merge ambiguous classes into a super category. (ii) We utilize the knowledge of extra networks to produce a soft label for each image. Then the super categories or soft labels are employed to guide CNN training on the Places2. We conduct extensive experiments on three large-scale image datasets (ImageNet, Places, and Places2), demonstrating the effectiveness of our approach. Furthermore, our method takes part in two major scene recognition challenges, and achieves the second place at the Places2 challenge in ILSVRC 2015, and the first place at the LSUN challenge in CVPR 2016. Finally, we directly test the learned representations on other scene benchmarks, and obtain the new state-of-the-art results on the MIT Indoor67 (86.7\\%) and SUN397 (72.0\\%). We release the code and models at~\\url{https://github.com/wanglimin/MRCNN-Scene-Recognition}.
THE CASE FOR VERB-ADJECTIVE COLLOCATIONS: CORPUS-BASED ANALYSIS AND LEXICOGRAPHICAL TREATMENT

Directory of Open Access Journals (Sweden)

Moisés Almela

2011-10-01

Full Text Available This article explores a type of co-occurrence pattern which cannot be adequately described by existing models of collocation, and for which combinatory dictionaries have yet failed to provide sufficient information. The phenomenon of “oblique inter-collocation”, as I propose to call it, is characterised by a concatenation of syntagmatic preferences which partially contravenes the habitual grammatical order of semantic selection. In particular, I will examine some of the effects which the verb cause exerts on the distribution of attributive adjectives in the context of specific noun classes. The procedure for detecting and describing patterns of oblique inter-collocation is illustrated by means of SketchEngine corpus query tools. Based on the data extracted from a large-scale corpus, this paper carries out a critical analysis of the micro-structure in Oxford Collocations Dictionary.
Author Details

African Journals Online (AJOL)

Tesema, Workineh. Vol 12, No 1 (2016) - Articles Towards the sense disambiguation of Afan Oromo words using hybrid approach (unsupervised machine learning and rule based) Abstract PDF. ISSN: 1998-8907. AJOL African Journals Online. HOW TO USE AJOL... for Researchers · for Librarians · for Authors · FAQ's · More ...
Involvement of corpus callosum in amyotrophic lateral sclerosis shown by MRI

Energy Technology Data Exchange (ETDEWEB)

Zandijcke, M. van [Dept. of Neurology, Bruges (Belgium); Casselman, J. [Dept. of Medical Imaging, Bruges (Belgium)

1995-05-01

Abnormal high signal in the corticospinal tracts on MRI has been described in amyotrophic lateral sclerosis. We report a case with further high signal in fibres of the corpus callosum on proton density and T2-weighted spin-echo images, closely matching findings of earlier pathological reports. (orig.)
Involvement of corpus callosum in amyotrophic lateral sclerosis shown by MRI

International Nuclear Information System (INIS)

Zandijcke, M. van; Casselman, J.

1995-01-01

Abnormal high signal in the corticospinal tracts on MRI has been described in amyotrophic lateral sclerosis. We report a case with further high signal in fibres of the corpus callosum on proton density and T2-weighted spin-echo images, closely matching findings of earlier pathological reports. (orig.)
Generation of a Skeleton Corpus of Digital Objects for the Validation and Evaluation of Format Identification Tools and Signatures

Directory of Open Access Journals (Sweden)

Ross Spencer

2013-06-01

Full Text Available To preserve digital information it is vital that the format of that information can be identified, in-perpetuity. This is the major focus of research within the field of Digital Preservation. The National Archives of the UK called for the Digital Preservation and Digital Curation communities to develop a test corpus of digital objects to help further develop tools to aid this purpose. Following that call, an attempt has been made to develop the suite.This paper initially outlines a methodology to generate a skeleton corpus using simple user-generated digital objects. It then explores the lessons learnt in the generation of a corpus using scripting language techniques from the file format signatures described in The National Archives PRONOM technical registry. It will also discuss the use of the digital signature for this purpose, the benefits of developing a test corpus using this technique. Finally, this paper will outline a methodology for future research before exploring how the community can best make use of the output of this project and how this project needs to be taken forward to completion.
Revisiting corpus creation and analysis tools for translation tasks

Directory of Open Access Journals (Sweden)

Claudio Fantinuoli

2016-06-01

Full Text Available Many translation scholars have proposed the use of corpora to allow professional translators to produce high quality texts which read like originals. Yet, the diffusion of this methodology has been modest, one reason being the fact that software for corpora analyses have been developed with the linguist in mind, which means that they are generally complex and cumbersome, offering many advanced features, but lacking the level of usability and the specific features that meet translators’ needs. To overcome this shortcoming, we have developed TranslatorBank, a free corpus creation and analysis tool designed for translation tasks. TranslatorBank supports the creation of specialized monolingual corpora from the web; it includes a concordancer with a query system similar to a search engine; it uses basic statistical measures to indicate the reliability of results; it accesses the original documents directly for more contextual information; it includes a statistical and linguistic terminology extraction utility to extract the relevant terminology of the domain and the typical collocations of a given term. Designed to be easy and intuitive to use, the tool may help translation students as well as professionals to increase their translation quality by adhering to the specific linguistic variety of the target text corpus.
Revisiting corpus creation and analysis tools for translation tasks

Directory of Open Access Journals (Sweden)

Claudio Fantinuoli

2016-04-01

Many translation scholars have proposed the use of corpora to allow professional translators to produce high quality texts which read like originals. Yet, the diffusion of this methodology has been modest, one reason being the fact that software for corpora analyses have been developed with the linguist in mind, which means that they are generally complex and cumbersome, offering many advanced features, but lacking the level of usability and the specific features that meet translators’ needs. To overcome this shortcoming, we have developed TranslatorBank, a free corpus creation and analysis tool designed for translation tasks. TranslatorBank supports the creation of specialized monolingual corpora from the web; it includes a concordancer with a query system similar to a search engine; it uses basic statistical measures to indicate the reliability of results; it accesses the original documents directly for more contextual information; it includes a statistical and linguistic terminology extraction utility to extract the relevant terminology of the domain and the typical collocations of a given term. Designed to be easy and intuitive to use, the tool may help translation students as well as professionals to increase their translation quality by adhering to the specific linguistic variety of the target text corpus.
Bollywood Movie Corpus for Text, Images and Videos

OpenAIRE

Madaan, Nishtha; Mehta, Sameep; Saxena, Mayank; Aggarwal, Aditi; Agrawaal, Taneea S; Malhotra, Vrinda

2017-01-01

In past few years, several data-sets have been released for text and images. We present an approach to create the data-set for use in detecting and removing gender bias from text. We also include a set of challenges we have faced while creating this corpora. In this work, we have worked with movie data from Wikipedia plots and movie trailers from YouTube. Our Bollywood Movie corpus contains 4000 movies extracted from Wikipedia and 880 trailers extracted from YouTube which were released from 1...

Quantitative analysis of the corpus callosum in children with cerebral palsy and developmental delay: correlation with cerebral white matter volume

International Nuclear Information System (INIS)

Panigrahy, Ashok; Barnes, Patrick D.; Robertson, Robert L.; Sleeper, Lynn A.; Sayre, James W.

2005-01-01

This study was conducted to quantitatively correlate the thickness of the corpus callosum with the volume of cerebral white matter in children with cerebral palsy and developmental delay. Material and methods: A clinical database of 70 children with cerebral palsy and developmental delay was established with children between the ages of 1 and 5 years. These children also demonstrated abnormal periventricular T2 hyperintensities associated with and without ventriculomegaly. Mid-sagittal T1-weighted images were used to measure the thickness (genu, mid-body, and splenium) and length of the corpus callosum. Volumes of interest were digitized based on gray-scale densities to define the hemispheric cerebral white matter on axial T2-weighted and FLAIR images. The thickness of the mid-body of the corpus callosum was correlated with cerebral white matter volume. Subgroup analysis was also performed to examine the relationship of this correlation with both gestational age and neuromotor outcome. Statistical analysis was performed using analysis of variance and Pearson correlation coefficients. There was a positive correlation between the thickness of the mid-body of the corpus callosum and the volume of cerebral white matter across all children studied (R=0.665, P=0.0001). This correlation was not dependent on gestational age. The thickness of the mid-body of the corpus callosum was decreased in the spastic diplegia group compared to the two other groups (hypotonia and developmental delay only; P<0.0001). Within each neuromotor subgroup, there was a positive correlation between thickness of the mid-body of the corpus callosum and volume of the cerebral white matter. (orig.)
Quantitative analysis of the corpus callosum in children with cerebral palsy and developmental delay: correlation with cerebral white matter volume

Energy Technology Data Exchange (ETDEWEB)

Panigrahy, Ashok [Childrens Hospital Los Angeles, Department of Radiology, Los Angeles, CA (United States); Barnes, Patrick D. [Stanford University Medical Center, Department of Radiology, Lucile Salter Packard Children' s Hospital, Palo Alto, CA (United States); Robertson, Robert L. [Children' s Hospital Boston, Department of Radiology, Boston, MA (United States); Sleeper, Lynn A. [New England Research Institute, Watertown, MA (United States); Sayre, James W. [UCLA Medical Center, Departments of Radiology and Biostatistics, Los Angeles, CA (United States)

2005-12-01

This study was conducted to quantitatively correlate the thickness of the corpus callosum with the volume of cerebral white matter in children with cerebral palsy and developmental delay. Material and methods: A clinical database of 70 children with cerebral palsy and developmental delay was established with children between the ages of 1 and 5 years. These children also demonstrated abnormal periventricular T2 hyperintensities associated with and without ventriculomegaly. Mid-sagittal T1-weighted images were used to measure the thickness (genu, mid-body, and splenium) and length of the corpus callosum. Volumes of interest were digitized based on gray-scale densities to define the hemispheric cerebral white matter on axial T2-weighted and FLAIR images. The thickness of the mid-body of the corpus callosum was correlated with cerebral white matter volume. Subgroup analysis was also performed to examine the relationship of this correlation with both gestational age and neuromotor outcome. Statistical analysis was performed using analysis of variance and Pearson correlation coefficients. There was a positive correlation between the thickness of the mid-body of the corpus callosum and the volume of cerebral white matter across all children studied (R=0.665, P=0.0001). This correlation was not dependent on gestational age. The thickness of the mid-body of the corpus callosum was decreased in the spastic diplegia group compared to the two other groups (hypotonia and developmental delay only; P<0.0001). Within each neuromotor subgroup, there was a positive correlation between thickness of the mid-body of the corpus callosum and volume of the cerebral white matter. (orig.)
Development and Use of a Corpus Tailored for Legal English Learning

Science.gov (United States)

Skier, Jason; Vibulphol, Jutarat

2016-01-01

While corpus linguistics has been applied towards many specific academic purposes, reports are few regarding its use to facilitate learning of legal English by non-native English speakers. Specialized corpora are required because legal English often differs significantly from ordinary usage, with words such as bar, motion, and hearing having…
Automatic recognition of touch gestures in the corpus of social touch

NARCIS (Netherlands)

Jung, Merel Madeleine; Poel, Mannes; Poppe, Ronald Walter; Heylen, Dirk K.J.

For an artifact such as a robot or a virtual agent to respond appropriately to human social touch behavior, it should be able to automatically detect and recognize touch. This paper describes the data collection of CoST: Corpus of Social Touch, a data set containing 7805 captures of 14 different
Advantages and disadvantages in the use of internet as a corpus ...

African Journals Online (AJOL)

This paper initially discusses some of the consequences which the technological development has for lexicography, especially in terms of the different types of empirical basis which can be used in dictionary projects. The most important advantages and disadvantages of using the Internet as a corpus are then listed and ...
Towards a corpus of South African English: corralling the sub-varieties

African Journals Online (AJOL)

Riette Ruthven

important step towards the creation of a truly representative large corpus of SAE and ... Census data which elicit information about home language do not tell .... ISAE has absorbed lexical items such as robot (traffic light), dagha (mud), baba- ..... used their access to existing social networks to identify other contributors to.
The subjective use of postural verbs in Afrikaans (II): a corpus ...

African Journals Online (AJOL)

Kate H

Whereas the first article focused on the development of the postular .... 25. 2. Corpus discussion. Unfortunately, there are few Afrikaans corpora available .... In the third instance, it is assumed that the original meaning of the word influences the .... the extent to which they have become grammaticalised in terms of each other.
The first Malay language storytelling text-to-speech (TTS) corpus for ...

African Journals Online (AJOL)

speech annotations are described in detail in accordance to baseline work. The stories were recorded in two speaking styles that are neutral and storytelling speaking style. The first. Malay language storytelling corpus is not only necessary for the development of a storytelling text-to-speech (TTS) synthesis. It is also ...
Effects of hypo- and hyperthyroidism on proliferation, angiogenesis, apoptosis and expression of COX-2 in the corpus luteum of female rats.

Science.gov (United States)

Silva, J F; Ocarino, N M; Vieira, A L S; Nascimento, E F; Serakides, R

2013-08-01

Although thyroid dysfunction occurs frequently in humans and some animal species, the mechanisms by which hypo- and hyperthyroidism affect the corpus luteum have not been thoroughly elucidated. This study evaluated the levels of proliferative activity, angiogenesis, apoptosis and expression of cyclooxygenase-2 in the corpus luteum of female rats with thyroid dysfunction. These processes may be important in understanding the reproductive changes caused by thyroid dysfunction. A total of 18 adult female rats were divided into three groups (control, hypothyroid and hyperthyroid) with six animals per group. Three months after treatment to induce thyroid dysfunction, the rats were euthanized in the dioestrus phase. The ovaries were collected and immunohistochemically analysed for expression of the cell proliferation marker CDC-47, vascular endothelial growth factor (VEGF), VEGF receptor Flk-1 and cyclooxygenase-2 (COX-2). Apoptosis was evaluated using the TUNEL assay. Hypothyroidism reduced the intensity and area of COX-2 expression in the corpus luteum (p hyperthyroidism did not alter COX-2 expression in the dioestrus phase. Hypothyroidism significantly reduced the expression of CDC-47 in endothelial cells and pericytes in the corpus luteum, whereas hyperthyroidism did not induce a detectable change in CDC-47 expression (p > 0.05). Hypothyroidism reduced the level of apoptosis in luteal cells (p hyperthyroidism increased the level of apoptosis in the corpus luteum (p < 0.05). In conclusion, thyroid dysfunction differentially affects the levels of proliferative activity, angiogenesis and apoptosis and COX-2 expression in the corpus luteum of female rats. © 2013 Blackwell Verlag GmbH.
Treatment of uterine corpus cancer 1/1 state of development. Pt. 3. Radiotherapy

International Nuclear Information System (INIS)

Zielinski, J.

1975-01-01

The efficacity of irradiation therapy, as a unique method, was evaluated in 50 women with diagnosed uterine corpus cancer in 1/1 stage of clinical development. These patients refused other forms of therapy (surgery). The results were compared with those observed in 232 women treated by complex technique (surgery and irradiation). The percent of 5 years survival rate in the subjects treated by irradiation only made 54%, while in the complex therapy group it made 81.1%. The difference was found significant. The complications resulting from the irradiation therapy were present in 13 women (26%), while in the complex therapy group these were met in 25 women (10.8%) which difference was also significant. These facts strongly advocate the superiority of complex treatment in recent cases of uterine corpus cancer. (author)
Effect of Vestibulo-Proprioceptive Stimulations in a Child with Agenesis of the Corpus Callosum

Directory of Open Access Journals (Sweden)

Hamid Dalvand

2010-06-01

Full Text Available Background and Aim: The purpose of the present study was to investigate the effect of vestibulo-proprioceptive stimulations of sensory integration theory on the development of gross and fine motor, language and personal-social functions in a child with agenesis of the corpus callosum.Case: We report a 10.5 month old boy with agenesis of the corpus callosum. The intervention was administered based on sensory integration theory an hour a week for 20 weeks. The exercise intervention consisted of proprioceptive and linear, sustained and low frequency vestibular stimulations on suspension device and physio roll. A Denver Developmental Screening- II and milestones skill testing was completed pre-intervention and monthly. Post-intervention, age of gross motor, fine motor adaptive, language, and personal-social functions significantly improved. Based on milestones skills, maintenance of gross motor functions (e.g. sitting and quadruped position improved. The child could roll from side to side and released objects voluntarily. The reaction time to auditory stimulations became less than 2 seconds.Conclusion: vestibulo-proprioceptive stimulations using the neuroplasticity ability of the central nervous system is effective for development of gross and fine motor, language, and personal-social functions. These exercises can be administered for a child with agenesis of the corpus callosum.
How textbooks (and learners get it wrong: A corpus study of modal auxiliary verbs

Directory of Open Access Journals (Sweden)

Hayo Reinders

2013-02-01

Full Text Available Many elements contribute to the relative difficulty in acquiring specific aspects of English as a foreign language (Goldschneider & DeKeyser, 2001. Modal auxiliary verbs (e.g. could, might, are examples of a structure that is difficult for many learners. Not only are they particularly complex semantically, but especially in the Malaysian context reported on in this paper, there is no direct equivalent in the studentsâ L1. In other words, they are a good example of a structure for which successful acquisition depends very much on the quality of the input and instruction students receive. This paper reports on analysis of a 230,000 word corpus of Malaysian English textbooks, in which it was found that the relative frequency of the modals did not match that found in native speaker corpora such as the BNC. We compared the textbook corpus with a learner corpus of Malaysian form 4 learners and found no direct relationship between frequency of presentation of target forms in the textbooks and their use by students in their writing. We also found a very large percentage of errors in studentsâ writing. We suggest a number of possible reasons for these findings and discuss the implications for materials developers and teachers.
Sentence‐Chain Based Seq2seq Model for Corpus Expansion

Directory of Open Access Journals (Sweden)

Euisok Chung

2017-08-01

Full Text Available This study focuses on a method for sequential data augmentation in order to alleviate data sparseness problems. Specifically, we present corpus expansion techniques for enhancing the coverage of a language model. Recent recurrent neural network studies show that a seq2seq model can be applied for addressing language generation issues; it has the ability to generate new sentences from given input sentences. We present a method of corpus expansion using a sentence‐chain based seq2seq model. For training the seq2seq model, sentence chains are used as triples. The first two sentences in a triple are used for the encoder of the seq2seq model, while the last sentence becomes a target sequence for the decoder. Using only internal resources, evaluation results show an improvement of approximately 7.6% relative perplexity over a baseline language model of Korean text. Additionally, from a comparison with a previous study, the sentence chain approach reduces the size of the training data by 38.4% while generating 1.4‐times the number of n‐grams with superior performance for English text.
A Pedagogic Corpus Analysis: Modal Auxiliary Verbs in Malaysian English Textbooks

Science.gov (United States)

Khojasteh, Laleh; Mukundan, Jayakaran

2011-01-01

Using corpus approach, over the past two decades, a growing number of researchers started to blame textbooks for neglecting important information on the use of grammatical structures in real language use and provided ample information about the mismatch between language used in textbooks and real language in use. Likewise, the prescribed Malaysian…
A Corpus-Based Evaluation of Metaphors in a Business English Textbook

Science.gov (United States)

Skorczynska Sznajder, Hanna

2010-01-01

This study aims to evaluate the selection of metaphors in a published business English textbook using findings from a specialised corpus of written business English. While most scholars agree that metaphors should be included in English for Specific Purposes (ESP) syllabuses as a potentially problematic area in successful language learning, it is…
Corpus callosum dysgenesis and lipoma: embryologic and magnetic resonance imaging aspects; Disgenesia do corpo caloso e lipoma: consideracoes embriologicas basicas e aspectos de imagem a ressonancia magnetica

Energy Technology Data Exchange (ETDEWEB)

Abreu Junior, Luiz de; Borri, Maria Lucia; Wolosker, Angela Maria Borri; Hartmann, Luiz Guilherme de Carvalho; Galvao Filho, Mario de Melo [Hospital e Maternidade Sao Luiz, Sao Paulo, SP (Brazil). Setor de Diagnostico por Imagem]. E-mail: abreujr@directnet.com.br; D' Ippolito, Giuseppe [Hospital e Maternidade Sao Luiz, Sao Paulo, SP (Brazil). Setor de US/TC/RMN

2005-07-15

The corpus callosum is the major system of association fibers that permits communication of both cerebral hemispheres. Magnetic resonance imaging has improved the study of brain malformations, including the corpus callosum dysgenesis. Lipoma is a common finding in the spectrum of corpus callosum dysgenesis. The purpose of these study was to review the embryologic events and the magnetic resonance imaging aspects related to the corpus callosum dysgenesis and to the formation of the related lipoma. (author)
Un caso de variación no galego moderno: posibles realizacións do complemento directo dos verbos oír e escoitar

Directory of Open Access Journals (Sweden)

María Pereira Maceda

2009-01-01

Full Text Available In this article we describe a typology of arguments in the direct object slot for the Galician auditory perception verbs oír and escoitar. We begin with an overview of possible argument types, presenting relevant data concerning frequencies in a modern Galician text corpus consisting of documents written between 1800 and 2000. Following this empirical survey of actual usage, we posit structural criteria to explain the preferred or even mandatory use of one or another type of complement in various contexts. The corpus is subjected to three complementary analytical approaches. The fi rst of these is based on coreference between the verb of perception and the verb in a direct-object complement clause. The second approach takes into account the gramatical consequences of different conceptualizations of what is perceived. Finally, we look at the use of certain case markers performing a disambiguation function with regard to the implicit transitivity structure of arguments: subject plus direct object. This interpretative proposal is enriched with pertinent references to sociolinguistic features affecting the development of the Galician language up to the present.
Automatic extraction of corpus callosum from midsagittal head MR image and examination of Alzheimer-type dementia objective diagnostic system in feature analysis

International Nuclear Information System (INIS)

Kaneko, Tomoyuki; Kodama, Naoki; Kaeriyama, Tomoharu; Fukumoto, Ichiro

2004-01-01

We studied the objective diagnosis of Alzheimer-type dementia based on changes in the corpus callosum. We examined midsagittal head MR images of 40 Alzheimer-type dementia patients (15 men and 25 women; mean age, 75.4±5.5 years) and 31 healthy elderly persons (10 men and 21 women; mean age, 73.4±7.5 years), 71 subjects altogether. First, the corpus callosum was automatically extracted from midsagittal head MR images. Next, Alzheimer-type dementia was compared with the healthy elderly individuals using the features of shape factor and six features of Co-occurrence Matrix from the corpus callosum. Automatic extraction of the corpus callosum succeeded in 64 of 71 individuals, for an extraction rate of 90.1%. A statistically significant difference was found in 7 of the 9 features between Alzheimer-type dementia patients and the healthy elderly adults. Discriminant analysis using the 7 features demonstrated a sensitivity rate of 82.4%, specificity of 89.3%, and overall accuracy of 85.5%. These results indicated the possibility of an objective diagnostic system for Alzheimer-type dementia using feature analysis based on change in the corpus callosum. (author)
Annotating longitudinal clinical narratives for de-identification: The 2014 i2b2/UTHealth corpus.

Science.gov (United States)

Stubbs, Amber; Uzuner, Özlem

2015-12-01

The 2014 i2b2/UTHealth natural language processing shared task featured a track focused on the de-identification of longitudinal medical records. For this track, we de-identified a set of 1304 longitudinal medical records describing 296 patients. This corpus was de-identified under a broad interpretation of the HIPAA guidelines using double-annotation followed by arbitration, rounds of sanity checking, and proof reading. The average token-based F1 measure for the annotators compared to the gold standard was 0.927. The resulting annotations were used both to de-identify the data and to set the gold standard for the de-identification track of the 2014 i2b2/UTHealth shared task. All annotated private health information were replaced with realistic surrogates automatically and then read over and corrected manually. The resulting corpus is the first of its kind made available for de-identification research. This corpus was first used for the 2014 i2b2/UTHealth shared task, during which the systems achieved a mean F-measure of 0.872 and a maximum F-measure of 0.964 using entity-based micro-averaged evaluations. Copyright © 2015 Elsevier Inc. All rights reserved.
A corpus-based approach to generalising a chatbot system

OpenAIRE

Abu Shawar, Bayan; Atwell, Eric

2003-01-01

International research in NLP is dominated by work on English. NLP techniques and systems can be ported to other natural languages, but this is generally a labour-intensive task, requiring scarce computational and linguistic expertise; hence minority languages are poorly represented in NLP technology. We present an automated approach to porting an NLP technology, the AIML-based chatbot, to new languages, by using a corpus in the target language to retrain the chatbot. We have s...

The CALBC Silver Standard Corpus for Biomedical Named Entities - A Study in Harmonizing the Contributions from Four Independent Named Entity Taggers

NARCIS (Netherlands)

D. Rebholz-Schuhmann (Dietrich); A.J. Jimeno-Yepes (Antonio José); E.M. van Mulligen (Erik); N. Kang (Ning); J.A. Kors (Jan); D. Milward (David); P. Corbett (Peter); E. Buyko (Ekaterina); Tomanek (Katrin); E. Beisswanger (Elena); U. Hahn (Udo)

2010-01-01

textabstractThe production of gold standard corpora is time-consuming and costly. We propose an alternative: the 'silver standard corpus' (SSC), a corpus that has been generated by the harmonisation of the annotations that have been delivered from a selection of annotation systems. The systems have
Midline corpus callosum is a neuroanatomical focus of fetal alcohol damage.

Science.gov (United States)

Bookstein, Fred L; Sampson, Paul D; Connor, Paul D; Streissguth, Ann P

2002-06-15

Prenatal exposure to high levels of alcohol often induces birth defects that combine morphological stigmata with neurological or neuropsychological deficits. But it has proved problematic to diagnose these syndromes in adolescents and adults, in whom the morphological signs are absent or attenuated, the behavioral deficits nonspecific, and the exposure history often difficult to reconstruct. Localizing the associated brain abnormalities might circumvent most of these difficulties. To this end, three-dimensional (3D) locations were recorded for 67 homologous points on or near the corpus callosum in magnetic resonance (MR) brain images from 60 adolescents and adults who were normal, 60 diagnosed with fetal alcohol syndrome, and 60 diagnosed with fetal alcohol effects. We combined the standard statistical approach to this type of geometric data, Procrustes analysis, with a multivariate strategy focusing on differences in variability. In this data set, the shape of the corpus callosum and its vicinity proves systematically much more variable in the alcohol-affected brains than in those of the normal subjects. From this excess variability follows a promising classification rule, having both high sensitivity (100 out of 117) and high specificity (49 out of 60) in this sample. The discrimination uses four landmark points and two summary scores of callosal outline shape. The information from the corpus callosum and vicinity, as viewed in MR brain images of full-grown subjects, may serve as a permanent record of the prenatal effects of alcohol, even in patients who are first suspected of these syndromes relatively late in life or who lack the facial signs of prenatal alcohol damage. The statistical pattern underlying the callosal diagnosis also leads to speculations on mechanisms of the prenatal damage. Copyright 2002 Wiley-Liss, Inc.
75 FR 66301 - Amendment of Class E Airspace; Corpus Christi, TX

Science.gov (United States)

2010-10-28

..., TX (Lat. 27[deg]48'43'' N., long. 97[deg]05'20'' W.) Rockport, San Jose Island Airport, TX (Lat. 27... Meacham Blvd., Fort Worth, TX 76137; telephone (817) 321- 7716. SUPPLEMENTARY INFORMATION: History On July... Christi International Airport, TX (Lat. 27[deg]46'13'' N., long. 97[deg]30'04'' W.) Corpus Christi NAS...
Slovene specialized text corpus of Library and Information Science – An advanced lexicographic tool for library terminology research

OpenAIRE

Kanič, Ivan

2013-01-01

To support the research in the field of library and information science terminology and dictionary construction in Slovene language a specialized text corpus has been designed and constructed. The corpus has reached 3,6 million words extracted from 625 Slovene technical and scientific texts of the field. It supports a variety of specialized search methods, display of search results, and their statistic computation. The web based application is in open public access.
Rhythm histograms and musical meter: A corpus study of Malian percussion music.

Science.gov (United States)

London, Justin; Polak, Rainer; Jacoby, Nori

2017-04-01

Studies of musical corpora have given empirical grounding to the various features that characterize particular musical styles and genres. Palmer & Krumhansl (1990) found that in Western classical music the likeliest places for a note to occur are the most strongly accented beats in a measure, and this was also found in subsequent studies using both Western classical and folk music corpora (Huron & Ommen, 2006; Temperley, 2010). We present a rhythmic analysis of a corpus of 15 performances of percussion music from Bamako, Mali. In our corpus, the relative frequency of note onsets in a given metrical position does not correspond to patterns of metrical accent, though there is a stable relationship between onset frequency and metrical position. The implications of this non-congruence between simple statistical likelihood and metrical structure for the ways in which meter and metrical accent may be learned and understood are discussed, along with importance of cross-cultural studies for psychological research.
La arquitectura del pleno barroco en Granada: el hospital del Corpus Christi

Directory of Open Access Journals (Sweden)

Barrios Rozúa, Juan Manuel

2011-03-01

Full Text Available The Corpus Christi hospital of Granada was a victim of prejudices against the baroque on the part of influential historians. Nevertheless, the building is an interesting example of hospital architecture with a frankly original temple. Thanks to the exhaustive analysis of the institution’s very complete archive, it can be determined that some thirty artists worked there, including Alonso Cano and his disciple Juan Luis de Ortega, whose architectural works are evaluated here.

El hospital del Corpus Christi de Granada fue víctima de los prejuicios contra el barroco de influyentes historiadores. Sin embargo, el edificio constituye un interesante ejemplo de arquitectura hospitalaria con un templo francamente original. Gracias al análisis exhaustivo de su completo archivo, puede detectarse la labor de una treintena de artífices, entre ellos Alonso Cano y su discípulo Juan Luis de Ortega, cuyas obras arquitectónicas son valoradas aquí.
Annotated corpus and the empirical evaluation of probability estimates of grammatical forms

Directory of Open Access Journals (Sweden)

Ševa Nada

2003-01-01

Full Text Available The aim of the present study is to demonstrate the usage of an annotated corpus in the field of experimental psycholinguistics. Specifically, we demonstrate how the manually annotated Corpus of Serbian Language (Kostić, Đ. 2001 can be used for probability estimates of grammatical forms, which allow the control of independent variables in psycholinguistic experiments. We address the issue of processing Serbian inflected forms within two subparadigms of feminine nouns. In regression analysis, almost all processing variability of inflected forms has been accounted for by the amount of information (i.e. bits carried by the presented forms. In spite of the fact that probability distributions of inflected forms for the two paradigms differ, it was shown that the best prediction of processing variability is obtained by the probabilities derived from the predominant subparadigm which encompasses about 80% of feminine nouns. The relevance of annotated corpora in experimental psycholinguistics is discussed more in detail .
Boomerang sign: Clinical significance of transient lesion in splenium of corpus callosum.

Science.gov (United States)

Malhotra, Hardeep Singh; Garg, Ravindra Kumar; Vidhate, Mukund R; Sharma, Pawan Kumar

2012-04-01

Transient signal abnormality in the splenium of corpus callosum on magnetic resonance imaging (MRI) is occasionally encountered in clinical practice. It has been reported in various clinical conditions apart from patients with epilepsy. We describe 4 patients with different etiologies presenting with signal changes in the splenium of corpus callosum. They were diagnosed as having progressive myoclonic epilepsy (case 1), localization-related epilepsy (case 2), hemicrania continua (case 3), and postinfectious parkinsonism (case 4). While three patients had complete involvement of the splenium on diffusion-weighted image ("boomerang sign"), the patient having hemicrania continua showed semilunar involvement ("mini-boomerang") on T2-weighted and FLAIR image. All the cases had noncontiguous involvement of the splenium. We herein, discuss these cases with transient splenial involvement and stress that such patients do not need aggressive diagnostic and therapeutic interventions. An attempt has been made to review the literature regarding the pathophysiology, etiology, and outcome of such lesions.
Boomerang sign: Clinical significance of transient lesion in splenium of corpus callosum

Directory of Open Access Journals (Sweden)

Hardeep Singh Malhotra

2012-01-01

Full Text Available Transient signal abnormality in the splenium of corpus callosum on magnetic resonance imaging (MRI is occasionally encountered in clinical practice. It has been reported in various clinical conditions apart from patients with epilepsy. We describe 4 patients with different etiologies presenting with signal changes in the splenium of corpus callosum. They were diagnosed as having progressive myoclonic epilepsy (case 1, localization-related epilepsy (case 2, hemicrania continua (case 3, and postinfectious parkinsonism (case 4. While three patients had complete involvement of the splenium on diffusion-weighted image ("boomerang sign", the patient having hemicrania continua showed semilunar involvement ("mini-boomerang" on T2-weighted and FLAIR image. All the cases had noncontiguous involvement of the splenium. We herein, discuss these cases with transient splenial involvement and stress that such patients do not need aggressive diagnostic and therapeutic interventions. An attempt has been made to review the literature regarding the pathophysiology, etiology, and outcome of such lesions.
A Novel DBN Feature Fusion Model for Cross-Corpus Speech Emotion Recognition

Directory of Open Access Journals (Sweden)

Zou Cairong

2016-01-01

Full Text Available The feature fusion from separate source is the current technical difficulties of cross-corpus speech emotion recognition. The purpose of this paper is to, based on Deep Belief Nets (DBN in Deep Learning, use the emotional information hiding in speech spectrum diagram (spectrogram as image features and then implement feature fusion with the traditional emotion features. First, based on the spectrogram analysis by STB/Itti model, the new spectrogram features are extracted from the color, the brightness, and the orientation, respectively; then using two alternative DBN models they fuse the traditional and the spectrogram features, which increase the scale of the feature subset and the characterization ability of emotion. Through the experiment on ABC database and Chinese corpora, the new feature subset compared with traditional speech emotion features, the recognition result on cross-corpus, distinctly advances by 8.8%. The method proposed provides a new idea for feature fusion of emotion recognition.
Supporting English-medium pedagogy through an online corpus of science and engineering lectures

Science.gov (United States)

Kunioshi, Nílson; Noguchi, Judy; Tojo, Kazuko; Hayashi, Hiroko

2016-05-01

As English-medium instruction (EMI) spreads around the world, university teachers and students who are non-native speakers of English (NNS) need to put much effort into the delivery or reception of content. Construction of scientific meaning in the process of learning is already complex when instruction is delivered in the first language of the teachers and students, and may become even more challenging in a second language, because science education depends greatly on language. In order to identify important pedagogical functions that teachers use to deliver content and to present different ways to realise each function, a corpus of lectures related to science and engineering courses was created and analysed. NNS teachers and students in science and engineering involved in EMI higher education can obtain insights for delivering and listening to lectures from the Online Corpus of Academic Lectures (OnCAL).
Lingüística de Corpus: histórico e problemática

Directory of Open Access Journals (Sweden)

SARDINHA Tony Berber

2000-01-01

Full Text Available O presente trabalho oferece uma retrospectiva da Lingüística de Corpus, uma área de pesquisa que tem experimentado um crescimento vertiginoso nos últimos anos e que tem tido um impacto considerável na lingüística. A retrospectiva inclui tanto um painel histórico quanto um posicionamento em relação aos debates correntes e desenvolvimentos futuros da área. Os conceitos principais em voga na área são apresentados e discutidos. O trabalho ainda comenta os fatos mais marcantes na Lingüística de Corpus em relação à teoria e à prática, elencando os principais corpora em existência bem como as mais importantes contribuições no campo de programas de computador para análise e exploração desses corpora.
Contribution to the radiotherapy of collum and corpus carcinoma (Marburg results 1960-1979)

International Nuclear Information System (INIS)

Kleinsorge, F.

1982-01-01

The subject of this work is the therapeutic success of radiation therapy, respectively operation, with uterine carcinoma. For the evaluation 938 patients of the Marburg University Radiation Clinic from the years 1960 to 1979 were included. The 5-year survival rate with collum carcinoma was 56.99% (stage I 79.62%, II 57.29%, III 31.13%, IV 5.88%). With patients with corpus carcinoma the 5-year survival rate was 74.27% (stage II 80.79%, I 77.50%, III 57.50%, IV 30.00%). The results of various radiotherapeutic methods ('Duesseldorf method', 'Marburg method') were discussed. With respect to the international results published in 'Annual Report' the Marburg results for the treatment of collum carcinoma (the 5-year survival rates) are at world level, and the successes in the treatment of corpus carcinoma are clearly better. (TRV) [de
Exploring the Potential of Corpus Use in Translation Training

DEFF Research Database (Denmark)

Laursen, Anne Lise; Pellón, Ismael Arinas

2014-01-01

translators only allows for teaching the students methods that they can apply systematically to several professional tasks. The chapter illustrates how the traditional translation training strategies can be combined with the use of concordancing software to cope with translations.......The chapter describes the corpus analysis strategies used with the translation master’s students at the Department of Business Communication at the Faculty of Business and Social Sciences (formerly Aarhus School of Business or ASB). The short time available for the training of specialized...
Coreference annotation and resolution in the Colorado Richly Annotated Full Text (CRAFT) corpus of biomedical journal articles.

Science.gov (United States)

Cohen, K Bretonnel; Lanfranchi, Arrick; Choi, Miji Joo-Young; Bada, Michael; Baumgartner, William A; Panteleyeva, Natalya; Verspoor, Karin; Palmer, Martha; Hunter, Lawrence E

2017-08-17

Coreference resolution is the task of finding strings in text that have the same referent as other strings. Failures of coreference resolution are a common cause of false negatives in information extraction from the scientific literature. In order to better understand the nature of the phenomenon of coreference in biomedical publications and to increase performance on the task, we annotated the Colorado Richly Annotated Full Text (CRAFT) corpus with coreference relations. The corpus was manually annotated with coreference relations, including identity and appositives for all coreferring base noun phrases. The OntoNotes annotation guidelines, with minor adaptations, were used. Interannotator agreement ranges from 0.480 (entity-based CEAF) to 0.858 (Class-B3), depending on the metric that is used to assess it. The resulting corpus adds nearly 30,000 annotations to the previous release of the CRAFT corpus. Differences from related projects include a much broader definition of markables, connection to extensive annotation of several domain-relevant semantic classes, and connection to complete syntactic annotation. Tool performance was benchmarked on the data. A publicly available out-of-the-box, general-domain coreference resolution system achieved an F-measure of 0.14 (B3), while a simple domain-adapted rule-based system achieved an F-measure of 0.42. An ensemble of the two reached F of 0.46. Following the IDENTITY chains in the data would add 106,263 additional named entities in the full 97-paper corpus, for an increase of 76% percent in the semantic classes of the eight ontologies that have been annotated in earlier versions of the CRAFT corpus. The project produced a large data set for further investigation of coreference and coreference resolution in the scientific literature. The work raised issues in the phenomenon of reference in this domain and genre, and the paper proposes that many mentions that would be considered generic in the general domain are not
Lexical Analysis of the Verb "COOK" and Learning Vocabulary: A Corpus Study

Science.gov (United States)

Priyono

2011-01-01

English verbs have built-in properties that determine how they behave syntactically and generate appropriate meaning associated. With these inherent properties some verbs can fill in only in certain syntactic structures and some in others. The observation of the verb "COOK" using English corpus has revealed its lexical properties…
A corpus-based study on the translation of “namorar” and “date” in literature texts = Um estudo baseado em corpus sobre a tradução de “namorar” e “date” em textos literários

Directory of Open Access Journals (Sweden)

Fleck, Regina Caballero

2012-01-01

Full Text Available Profissionais que trabalham com traduções e línguas em geral provavelmente já encontraram em textos palavras “intraduzíveis”, tais como “namorar” e “date”. O presente estudo busca difundir o uso de ferramentas baseadas em corpus entre tradutores literários. Nossas perguntas de pesquisa são: quais as soluções tradutórias encontradas no corpus? Como essas soluções estão relacionadas a fatores extralinguísticos? Os dados deste estudo foram retirados do Compara, corpus paralelo que está disponível online e que consiste em textos originais em português e inglês alinhados com suas respectivas traduções. A fim de analisarmos os exemplos, nossos parâmetros serão as definições dos dicionários Houaiss e Oxford. Ao fim deste estudo, podemos observar uma equivalência unilateral entre “namorar” e “date” e que esses termos evoluíram de maneira distinta nos dois idiomas
BC4GO: a full-text corpus for the BioCreative IV GO task.

Science.gov (United States)

Van Auken, Kimberly; Schaeffer, Mary L; McQuilton, Peter; Laulederkind, Stanley J F; Li, Donghui; Wang, Shur-Jen; Hayman, G Thomas; Tweedie, Susan; Arighi, Cecilia N; Done, James; Müller, Hans-Michael; Sternberg, Paul W; Mao, Yuqing; Wei, Chih-Hsuan; Lu, Zhiyong

2014-01-01

Gene function curation via Gene Ontology (GO) annotation is a common task among Model Organism Database groups. Owing to its manual nature, this task is considered one of the bottlenecks in literature curation. There have been many previous attempts at automatic identification of GO terms and supporting information from full text. However, few systems have delivered an accuracy that is comparable with humans. One recognized challenge in developing such systems is the lack of marked sentence-level evidence text that provides the basis for making GO annotations. We aim to create a corpus that includes the GO evidence text along with the three core elements of GO annotations: (i) a gene or gene product, (ii) a GO term and (iii) a GO evidence code. To ensure our results are consistent with real-life GO data, we recruited eight professional GO curators and asked them to follow their routine GO annotation protocols. Our annotators marked up more than 5000 text passages in 200 articles for 1356 distinct GO terms. For evidence sentence selection, the inter-annotator agreement (IAA) results are 9.3% (strict) and 42.7% (relaxed) in F1-measures. For GO term selection, the IAAs are 47% (strict) and 62.9% (hierarchical). Our corpus analysis further shows that abstracts contain ∼ 10% of relevant evidence sentences and 30% distinct GO terms, while the Results/Experiment section has nearly 60% relevant sentences and >70% GO terms. Further, of those evidence sentences found in abstracts, less than one-third contain enough experimental detail to fulfill the three core criteria of a GO annotation. This result demonstrates the need of using full-text articles for text mining GO annotations. Through its use at the BioCreative IV GO (BC4GO) task, we expect our corpus to become a valuable resource for the BioNLP research community. Database URL: http://www.biocreative.org/resources/corpora/bc-iv-go-task-corpus/. Published by Oxford University Press 2014. This work is written by US
Cervical carcinoma vs endometrial carcinoma, involving both corpus and cervix : comparison of growing pattern with MR imaging

International Nuclear Information System (INIS)

Kim, Byung Keuk; Lee, Jin Hee; Kim, Hong; Suh, Soo Ji; Kim, Jung Sik

2001-01-01

To evaluate the growth pattern depicted by MR imaging and used to differentiate between uterine cervical and endometrial carcinoma where the mass involves both the uterine corpus and cervix. The tumor growth pattern observed on MR images obtained between November 1989 and January in 1999 in 37 of 784 cervical carcinomas and 9 of 47 endometrial carcinomas in which the tumor involved both the uterine corpus and cervix was analysed. The histologic type was squamous (n=29), adenocarcinomatous (n=6) or adenosquamous (n=2) in cervical carcinoma, and carcinomatous (n=8) or adenosquamous (n=1) in endometrial carcinoma. A 1.5-T (Magnetom Vision, Siemens, Germany) and a 2.0-T unit (Spectro-20000, Goldstar, Korea) were used to obtain T1-and T2-weighted axial, T2-weighted sagittal and Gd-enhanced images. Tumor involvement of the uterine cervix was classified as Cp-n, Cp-x, or Cp-b according to involvement of the endocervix, exocervix or both. Tumors of the uterine corpus were classified as involving the mucosa(U-mu), myometrium(U-my) or serosa(U-se). In 37 cases of cervical carcinoma, all three involving the endocervix(Cp-n) invaded the endometrium(U-mu), three involving both the endo- and exocervix(Cp-b) invaded the endometrium(U-mu, 1 case), myometrium(U-my, 1 case), or serosa(U-se, 1 case), and 31 involving the full-thickness of the uterine cervix(Ct) invaded the endometrium (U-mu, 6 cases) or serosa(U-se, 25 cases). In nine cases of endometrial carcinoma, three involving the endometrium(U-mu) and five involving the myometrium(U-my) invaded the endocervix(Cp-n), and one involving the serosa(U-se) invaded the full-thickness of the uterine cervix(Ct). Cervical carcinoma tended to involve the entire cervix and the full thickness of the uterine corpus, but endometrial carcinoma tended to involve the endometrium or myometrium of the uterine corpus and endocervix
Aicardi's syndrome: (agenesis of the corpus callosum, infantile spasms, and ocular anomalies).

Science.gov (United States)

Dinani, S; Jancar, J

1984-06-01

A case of Aicardi's syndrome is reported. A 12-year-old mentally handicapped female has all the characteristics of the syndrome: agenesis of the Corpus callosum; female infant with mental handicap, epilepsy, characteristic eye lesions, vertebral anomalies and abnormal EEG pattern. The results of supporting examination and investigations are noted.

Therapy-related change of corpus callosum in a young patient with epilepsy

International Nuclear Information System (INIS)

Feitova, V.; Krupa, P.; Feit, J.

2002-01-01

Focal nonhemorrhagic lesion in the splenium of the corpus callosum in a patient with epilepsy treated with antiepileptic drugs was observed with MRI imaging. We have found only one such case during the past 2 years (series of MRI examinations of approximately 500 patients with various forms of epilepsy). (orig.)
Measures of speech rhythm and the role of corpus-based word frequency: a multifactorial comparison of Spanish(-English speakers

Directory of Open Access Journals (Sweden)

Michael J. Harris

2011-12-01

Full Text Available In this study, we address various measures that have been employed to distinguish between syllable and stress- timed languages. This study differs from all previous ones by (i exploring and comparing multiple metrics within a quantitative and multifactorial perspective and by (ii also documenting the impact of corpus-based word frequency. We begin with the basic distinctions of speech rhythms, dealing with the differences between syllable-timed languages and stress-timed languages and several methods that have been used to attempt to distinguish between the two. We then describe how these metrics were used in the current study comparing the speech rhythms of Mexican Spanish speakers and bilingual English/Spanish speakers (speakers born to Mexican parents in California. More specifically, we evaluate how well various metrics of vowel duration variability as well as the so far understudied factor of corpus-based frequency allow to classify speakers as monolingual or bilingual. A binary logistic regression identifies several main effects and interactions. Most importantly, our results call the utility of a particular rhythm metric, the PVI, into question and indicate that corpus data in the form of lemma frequencies interact with two metrics of durational variability, suggesting that durational variability metrics should ideally be studied in conjunction with corpus-based frequency data.
Hippocrates the otolaryngologist: an epidemiological analysis of ear-throat-nose diseases in the Corpus Hippocraticum.

Science.gov (United States)

Benmoussa, Nadia; Hansen, Kevin; Charlier, Philippe

2018-03-01

Hippocrates, a Greek physician during the fifth century BC., is often considered the father of medicine. The Corpus Hippocraticum comprising of 58 volumes was attributed to him alone for a long time. Nowadays, it is considered that several authors contributed to its creation between 450 and 150 BC., so over a period of 300 years. The objective of our study was to develop a nosological classification of all passages treating head and neck diseases. We read and analyzed all volumes of the Corpus Hippocraticum in French translation and extracted all passages dealing with oto-rhino-laryngological and maxillo-facial conditions (n = 65). We classified all pathologies into five distinctive nosological groups: traumatic, infectious, malformation, cancerous and inflammatory Results: Traumatic diseases represented 36.9% (n = 24), infectious 52.3% (n= 34), malformation 0% (n = 0), cancerous 11.5% (n = 1) and inflammatory 9.3% (n= 6). These results represent the living conditions of this era, during which diseases were mostly of infectious or traumatic nature (wars, physical labor and recreational sporting activity, living together on close quarters, etc.). The meticulously detailed observations of the corpus give us a precious insight into the early perception of diseases, their progression and early attempts of treatment.
Tango, senses and sensuality

Directory of Open Access Journals (Sweden)

María de los Angeles Montes

2014-11-01

Full Text Available One of the most important contributions of the Peircean paradigm to semiotics consists in its opening the sign to development and modification. Sense, meaning, is no longer a static and fixed property. The Peircean paradigm allows us to wonder about how signs are interpreted, how they make sense in actual reception practices. The purpose of this paper is to address the problem of the relationship between appropriation practices (Montes, 2011 and significance processes from the analysis of an empirical case, observing how signs of sensuality are produced in the ballroom tango dance. Tango has earned international reputation mainly as a sensuality dance thanks to its spectacularization and subsequent mediatization. However, as I expect to demonstrate, at the moment of reception, people put those discourses in interaction with specific appropriation practices that shape very special interpretive habits. I will address the issue from an empirical investigation, especially focused on the production of interpretants (emotional, energetic, and logical, that is to say, looking back to the sign reception from the body to the mind. From a corpus of 25 focused interviews with people who got to know tango through mass media but that afterwards learnt to dance it as a social dance, it is my intention to show what sensuality means to them today, and how that current practice interacts with other external and previous discourses to produce interpretive habits. Finally, I wish to offer a theoretical reflection about the relationship between these three types of interpretants, their interaction with the discourse of the mass media and the place corporality has in the reception processes.
Análisis jurídico a la ley estatutaria 1095 de 2006 de Habeas Corpus

Directory of Open Access Journals (Sweden)

María Cristina Patiño-González

2010-03-01

Full Text Available Luego de casi tres años y medio sin que existiera en Colombia un desarrollo legal del habeas corpus, el pasado 2 de noviembre de 2006 fue sancionada la Ley Estatutaria 1095, por la cual se reglamentó el artículo 30 de la Carta Política. Este cuerpo normativo dispuso que el habeas corpus tenía la naturaleza jurídica de un derecho fundamental y una acción constitucional que tutela la libertad personal cuando alguien es privado de dicha libertad con violación de las garantías constitucionales y legales. Sin embargo, en aplicación del bloque de constitucionalidad, el propio desarrollo de la Ley Estatutaria y la jurisprudencia de la Corte Constitucional, el habeas corpus también se erige como la garantía fundamental que protege los derechos funda mentales colaterales de los detenidos y ostenta la naturaleza de un recurso de amparo. El artículo ofrece un estudio sobre el desarrollo previsto por la Ley Estatutaria de Habeas Corpus en materia de definición, competencia, garantías para el ejercicio de la acción, contenido de la petición, su trámite, decisión y los medios de impugnación susceptibles de impetrar, y analiza críticamente la Sentencia C-187/06 de la Corte Constitucional que realizó el control previo de constitucionalidad; ofrece además una serie de aportes para una interpretación más garantista de la institución y se hacen observaciones de lege ferenda
LINNAEUS: A species name identification system for biomedical literature

Directory of Open Access Journals (Sweden)

Nenadic Goran

2010-02-01

Full Text Available Abstract Background The task of recognizing and identifying species names in biomedical literature has recently been regarded as critical for a number of applications in text and data mining, including gene name recognition, species-specific document retrieval, and semantic enrichment of biomedical articles. Results In this paper we describe an open-source species name recognition and normalization software system, LINNAEUS, and evaluate its performance relative to several automatically generated biomedical corpora, as well as a novel corpus of full-text documents manually annotated for species mentions. LINNAEUS uses a dictionary-based approach (implemented as an efficient deterministic finite-state automaton to identify species names and a set of heuristics to resolve ambiguous mentions. When compared against our manually annotated corpus, LINNAEUS performs with 94% recall and 97% precision at the mention level, and 98% recall and 90% precision at the document level. Our system successfully solves the problem of disambiguating uncertain species mentions, with 97% of all mentions in PubMed Central full-text documents resolved to unambiguous NCBI taxonomy identifiers. Conclusions LINNAEUS is an open source, stand-alone software system capable of recognizing and normalizing species name mentions with speed and accuracy, and can therefore be integrated into a range of bioinformatics and text-mining applications. The software and manually annotated corpus can be downloaded freely at http://linnaeus.sourceforge.net/.
Neuromyelitis optica with linear enhancement of corpus callosum in brain magnetic resonance imaging with contrast: a case report.

Science.gov (United States)

Sahraian, Mohammad Ali; Moghadasi, Abdorreza Naser; Owji, Mahsa; Naghshineh, Hoda; Minagar, Alireza

2015-06-10

Neuromyelitis optica is a demyelinating disease of the central nervous system with various patterns of brain lesions. Corpus callosum may be involved in both multiple sclerosis and neuromyelitis optica. Previous case reports have demonstrated that callosal lesions in neuromyelitis optica are usually large and edematous and have a heterogeneous intensity showing a "marbled pattern" in the acute phase. Their size and intensity may reduce with time or disappear in the chronic stages. In this report, we describe a case of a 25-year-old Caucasian man with neuromyelitis optica who presented clinically with optic neuritis and myelitis. His brain magnetic resonance imaging demonstrated linear enhancement of the corpus callosum. Brain images with contrast agent added also showed linear ependymal layer enhancement of the lateral ventricles, which has been reported in this disease previously. Linear enhancement of corpus callosum in magnetic resonance imaging with contrast agent could help in diagnosing neuromyelitis optica and differentiating it from other demyelinating disease, especially multiple sclerosis.
Applications of the Buckeye GTA Corpus for L2 teaching and research

DEFF Research Database (Denmark)

Hardman, Jocelyn; McCullough, Elizabeth

The Buckeye GTA Corpus contains 9,664 L1 and L2 sentence productions by 89 talkers (27 American English, 19 Hindi, 23 Mandarin, & 20 Korean). A total of 5,696 sentences were read in English, with each talker contributing 64 sentences. Hindi, Mandarin, and Korean talkers also read 64 sentences each...
El discurso de la ciencia y la tecnología en la prensa escrita chilena: aproximación al corpus DICIPE-2004 O Discurso da ciência e da tecnologia da imprensa escrita chilena: aproximação ao corpus DICIPE-2004 The discourse of science and technology in the chilean press: an approximation to the DICIPE-2004 corpus

Directory of Open Access Journals (Sweden)

Giovanni Parodi

2007-01-01

Full Text Available La comunicación de la ciencia y la tecnología (C&T ha cobrado gran relevancia en los últimos años, inicialmente a través de artículos científicos y actualmente através de los medios masivos de comunicación. En este contexto, los objetivos de este artículo son: a determinar y cuantificar, en términos comparativos, el espacio que un grupo de cinco periódicos chilenos destinan a la divulgación de temáticas de C&T; b determinar los tipos de textos periodísticos a través de los cuales se divulga la C&T en la prensa escrita; c identificar los macrotemas, subtemas y disciplinas presentes en el corpus. El corpus fue recolectado durante tres meses y quedó formado por 411 textos. Se calculó y normalizó la ocurrencia de textos y palabras, los tipos textuales, los macrotemas, temas y disciplinas en las cuales se inscribe cada texto. Los hallazgos muestran, entre otros, que la divulgación de la C&T ocupa en promedio un 1% de lo que se publica en estos cinco periódicos y que, mayoritariamente, predominan textos relacionados con ciencia médicas, astronomía y astrofísica y ciencias de la vida.A comunicação na ciência e na tecnologia (C&T assumiu grande relevância nos últimos anos, inicialmente através de artigos e posteriormente através dos meios de comunicação de massa. Neste contexto, os objetivos deste artigo são: a determinar e quantificar, em termos comparativos, o espaço que um grupo de jornais chilenos destina à divulgação de temas de C&T; b determinar os tipos de texto jornalpisticos através dos quais se divulga a C & T na imprensa escrita; c identificar os macrotemas, subtemas e disciplinas presentes no corpus. O corpus foi coletado durante três meses e é formado por 411 textos. Foram calculados e normatizados as ocorrências de textos e palavras, os tipo de textos, os macrotemas, temas e disciplinas nas quais se insere cada texto. Os achados mostram, entre outros, que a divulgação de C & T ocupa, em média, 1
Task Dependency Effects of Collaboration in Learners' Corpus Consultation: An Exploratory Case Study

Science.gov (United States)

Cho, Hyeyoung

2016-01-01

Collaborative learning has attracted attention as pedagogic mediation to assist learners' corpus consultation, but some studies have pointed to negative aspects of collaboration. Based on the two sides of collaboration in language learning, this study presents a qualitative investigation of different effects of collaboration depending on task…
Extracting (good) discourse examples from an oral specialised corpus of wine tasting interactions

DEFF Research Database (Denmark)

Leroyer, Patrick; Gautier, Laurent; Maazaoui, Hedi

This article explains how lexicographic examples have been extracted from a corpus of professional interactions on wine tasting for the purpose of including them i the Oenolex wine dictionary. Particularly, it explains how the SONAL software was used to analyse and process audio data and export...
Metáforas e Linguística de Corpus: metodologia de análise aplicada a um gênero de negócios Metaphors and Corpus Linguistics: a method for finding metaphors in a business genre

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2011-01-01

Full Text Available O presente trabalho visa a relatar o desenvolvimento de uma metodologia de identificação de metáforas em corpora eletrônicos. Como exemplo, foi tomado o gênero teleconferências de apresentação de resultados financeiros. A metodologia é do tipo "bottom-up" / "corpus-driven" e se baseia na identificação de palavras com frequência marcante (palavras-chave e de seus padrões de co-ocorrência, seguido do cálculo de similaridade semântica entre essas palavras. Com isso, chega-se a um conjunto de palavras que são então interpretadas em seu co-texto, por meio de concordâncias.This paper aims at reporting the development of a method for metaphor identification in computer corpora. The method was tested on a particular corpus, namely of investment conference calls, and comprises procedures that work from the bottom up, and rely on marked frequency, collocation and semantic similarity as signalling devices for metaphor. As such, the method is an example of corpus-driven research into metaphor. The application of these procedures yields a number of metaphor candidates, which are then checked manually through concordances.
Language configurations of degree-related denotations in the spoken production of a group of Colombian EFL university students: A corpus-based study

Directory of Open Access Journals (Sweden)

Wilder Yesid Escobar

2015-05-01

Full Text Available Recognizing that developing the competences needed to appropriately use linguistic resources according to contextual characteristics (pragmatics is as important as the cultural-imbedded linguistic knowledge itself (semantics and that both are equally essential to form competent speakers of English in foreign language contexts, we feel this research relies on corpus linguistics to analyze both the scope and the limitations of the sociolinguistic knowledge and the communicative skills of English students at the university level. To such end, a linguistic corpus was assembled, compared to an existing corpus of native speakers, and analyzed in terms of the frequency, overuse, underuse, misuse, ambiguity, success, and failure of the linguistic parameters used in speech acts. The findings herein describe the linguistic configurations employed to modify levels and degrees of descriptions (salient sematic theme exhibited in the EFL learners´ corpus appealing to the sociolinguistic principles governing meaning making and language use which are constructed under the social conditions of the environments where the language is naturally spoken for sociocultural exchange.
La enseñanza de la atenuación en E/LE a partir del análisis de un corpus real

Directory of Open Access Journals (Sweden)

Daniel Secchi

2017-11-01

Full Text Available Resumen: El presente trabajo busca evidenciar la potencialidad de los corpus discursivos orales reales como herramienta para la enseñanza/aprendizaje de la atenuación en E/LE, y también pretende demostrar cómo los estudiantes pueden mejorar sus habilidades pragmáticas relacionadas con la atenuación a través de un aprendizaje consciente que les permita desenvolverse en los diferentes contextos comunicativos, igual que los nativos. Palabras clave: corpus oral real, atenuación, español lengua extranjera (E/LE Abstract: The aim of the present investigation is to point out the potentiality of using real oral discourse corpus as a didactic resource in order to teach/learn mitigation in S/SL classes. Also, we want to highlight how students can improve their pragmatic and mitigation skills, thanks to a conscious learning of those strategies that help them to communicate in different communicative contexts as well as native speakers do. Keywords: real oral corpus, mitigation, Spanish as second language (S/SL
Un corpus DIY pour l’étude du roumain en diachronie. Stratégies de constitution et stratégies de recherche

Directory of Open Access Journals (Sweden)

Ana Zisman

2017-12-01

Full Text Available The present paper aims to provide an overview of some of the advantages of creating and working with a DIY corpus, i.e. a corpus compiled by the linguist, as groundwork for a PhD thesis. Collected in order to investigate the grammatical and pragmatical behavior in historical Romanian of some so-called parenthetical verbs: a zice/ a spune ‘to say’, a crede ‘to think’, a şti ‘to know’, within 5 types of texts from the 16th/17th to the 20th centuries, this DIY corpus represents a necessary alternative as a database of Romania texts. Although its creation demanded some additional steps (e.g. the selection of the texts, which is determined by various diachronical factors, such a corpus proves to be relevant for investigating parenthetical verbs in literary, historical and law texts, as well as in formal and informal letters. In order to do so, the paradigm of the afore-mentioned verbs has to be systematized in relation to a precise word frequency per text type.
The corpus-driven revolution in Polish Sign Language: the interview with Dr. Paweł Rutkowski

Directory of Open Access Journals (Sweden)

Iztok Kosem

2018-02-01

Full Text Available Dr. Paweł Rutkowski is head of the Section for Sign Linguistics at the University of Warsaw. He is a general linguist and a specialist in the field of syntax of natural languages, carrying out research on Polish Sign Language (polski język migowy — PJM. He has been awarded a number of prizes, grants and scholarships by such institutions as the Foundation for Polish Science, Polish Ministry of Science and Higher Education, National Science Centre, Poland, Polish–U.S. Fulbright Commission, Kosciuszko Foundation and DAAD. Dr. Rutkowski leads the team developing the Corpus of Polish Sign Language and the Corpus-based Dictionary of Polish Sign Language, the first dictionary of this language prepared in compliance with modern lexicographical standards. The dictionary is an open-access publication, available freely at the following address: http://www.slownikpjm.uw.edu.pl/en/. This interview took place at eLex 2017, a biennial conference on electronic lexicography, where Dr. Rutkowski was awarded the Adam Kilgarriff Prize and gave a keynote address entitled Sign language as a challenge to electronic lexicography: The Corpus-based Dictionary of Polish Sign Language and beyond. The interview was conducted by Dr. Victoria Nyst from Leiden University, Faculty of Humanities, and Dr. Iztok Kosem from the University of Ljubljana, Faculty of Arts.
Flow and Suspended Sediment Events in the Near-Coastal Zone off Corpus Christi, Texas

Science.gov (United States)

2003-09-30

redistribution of preexisting shelf sediments during storms and (2) transportation of suspended sediment from the adjacent bay- lagoon system. Snedden et al...and K.E. Schmedes. (1983). Submerged lands of Texas, Corpus Christi area: sediments, geochemistry, benthic macroinvertebrates and associated
Cluster Analysis of the Newcastle Electronic Corpus of Tyneside English: A Comparison of Methods

NARCIS (Netherlands)

Moisl, Hermann; Jones, Valerie M.

2005-01-01

This article examines the feasibility of an empirical approach to sociolinguistic analysis of the Newcastle Electronic Corpus of Tyneside English using exploratory multivariate methods. It addresses a known problem with one class of such methods, hierarchical cluster analysis—that different
Compiling an OPEC Word List: A Corpus-Informed Lexical Analysis

Directory of Open Access Journals (Sweden)

Ebtisam Saleh Aluthman

2017-01-01

Full Text Available The present study is conducted within the borders of lexicographic research, where corpora have increasingly become all-pervasive. The overall goal of this study is to compile an open-source OPEC[1] Word List (OWL that is available for lexicographic research and vocabulary learning related to English language learning for the purpose of oil marketing and oil industries. To achieve this goal, an OPEC Monthly Reports Corpus (OMRC comprising of 1,004,542 words was compiled. The OMRC consists of 40 OPEC monthly reports released between 2003 and 2015. Consideration was given to both range and frequency criteria when compiling the OWL which consists of 255 word types. Along with this basic goal, this study aims to investigate the coverage of the most well-recognised word lists, the General Service List of English Words (GSL (West ,1953 and the Academic Word List (AWL (Coxhead, 2000 in the OMRC corpus. The 255 word types included in the OWL are not overlapping with either the AWL or the GSL. Results suggest the necessity of making this discipline-specific word list for ESL students of oil marketing industries. The availability of the OWL has significant pedagogical contributions to curriculum design, learning activities and the overall process of vocabulary learning in the context of teaching English for specific purposes (ESP. OPEC stands for Organisation of Petroleum Exporting Countries.
Corpus-Aided Business English Collocation Pedagogy: An Empirical Study in Chinese EFL Learners

Science.gov (United States)

Chen, Lidan

2017-01-01

This study reports an empirical study of an explicit instruction of corpus-aided Business English collocations and verifies its effectiveness in improving learners' collocation awareness and learner autonomy, as a result of which is significant improvement of learners' collocation competence. An eight-week instruction in keywords' collocations,…

L'analyse des corpus multimodaux en ligne : état des lieux et perspectives

Directory of Open Access Journals (Sweden)

Develotte Christine

2012-07-01

Full Text Available Partant de l’ouvrage "Texte et ordinateur. L’écriture réinventée ?" de Jacques Anis (1998 nous cherchons à montrer la trajectoire des travaux français en sciences du langage sur les corpus médiés par la technologie jusqu’à aujourd’hui. La communication en ligne renvoie à des formes diverses selon qu’il s’agit de la production de textes fixes (par exemple, sites Web, courriels ou de formes plutôt centrées sur les processus d’interaction et de communication (par exemple, chat, visioconférence susceptibles donc d’être étudiées tant du point de vue de l’analyse du discours que de celui de l’analyse conversationnelle. Nous nous proposons de montrer, dans cet article, dans quelle mesure les deux traditions en sciences du langage ont trouvé matière à exploiter ces corpus en ligne en empiétant, parfois, l’une comme l’autre, sur leurs « territoires » respectifs. Dans cette perspective, nous commençons par mettre au jour l’apport des chercheurs revendiquant leur appartenance à l’analyse du discours, puis celui des chercheurs relevant de l’analyse des interactions et nous montrons les zones de tuilage entre les deux courants. Dans une dernière partie, nous nous intéressons aux défis juridiques, techniques et épistémologiques que doit relever le linguiste qui cherche à étudier des corpus multimodaux en ligne qui prennent des formes de plus en plus sophistiquées et complexes.
ABSTRACT NOUNS IN THE SPEECH OF THE EMGLISHMEN (BASED ON FICTION WORKS AND BRITISH NATIONAL CORPUS

Directory of Open Access Journals (Sweden)

Natalia Veniaminovna Khokhlova

2015-01-01

Full Text Available The research aimed at studying the use of abstract nouns in the Englishmen’s speech from the standpoint of sociolinguistics. The article introduces a new, sociolinguistic, approach to research of abstract nouns; it is also the first time they are studied in a language corpus. The first stage of the research was based on fiction literary works: abstract nouns were extracted of analysis from the statements of the characters belonging to the opposite social classes. Later, these data was compared with the results of the original corpus research based on the British national corpus: sentences with nouns were selected out of the conversational subcorpus of BNC and were further sorted into abstract, concrete and words denoting people. Then, their frequency and vocabulary was studied with regards to speakers’ age, gender and social standing. The results revealed that abstract words are used more often that concrete ones regardless of the speaker’s social characteristics, however, the size and content of vocabulary is different (it is generally more substantial in the speech of women and representatives of higher social classes. The results of this research can be used in elaborating a course of the English language or in teaching general linguistics, sociolinguistics and country studies.
Sobre la natura dels estats. Una revisió basada en corpus

Directory of Open Access Journals (Sweden)

Marta Coll-florit

2008-01-01

Full Text Available This paper aims to offer a new approach to the aspectual category of states based on Catalan data extracted from corpus. The goal is twofold: firstly, to point out that states constitute a gradual category; and secondly, to highlight that syntactic variability within the stative predicates category receives a more understandable and clear explanation if the different possibilities of situation conceptualization are taken into account.
Aristotle on Love and Friendship

Directory of Open Access Journals (Sweden)

Konstan, David

2008-06-01

Full Text Available David Konstan (Brown University, Providence argues that the term philia, in Aristotle, represents an elective, affective relationship, and not, as many scholars have maintained, a relation of mutual obligation, like that of kinship, with no necessary affective element; in addition, he disambiguates two senses of philia, one corresponding to “love”, the other designating the reciprocal affection characteristic of friendship.
Hominid mandibular corpus shape variation and its utility for recognizing species diversity within fossil Homo.

Science.gov (United States)

Lague, Michael R; Collard, Nicole J; Richmond, Brian G; Wood, Bernard A

2008-12-01

Mandibular corpora are well represented in the hominin fossil record, yet few studies have rigorously assessed the utility of mandibular corpus morphology for species recognition, particularly with respect to the linear dimensions that are most commonly available. In this study, we explored the extent to which commonly preserved mandibular corpus morphology can be used to: (i) discriminate among extant hominid taxa and (ii) support species designations among fossil specimens assigned to the genus Homo. In the first part of the study, discriminant analysis was used to test for significant differences in mandibular corpus shape at different taxonomic levels (genus, species and subspecies) among extant hominid taxa (i.e. Homo, Pan, Gorilla, Pongo). In the second part of the study, we examined shape variation among fossil mandibles assigned to Homo (including H. habilis sensu stricto, H. rudolfensis, early African H. erectus/H. ergaster, late African H. erectus, Asian H. erectus, H. heidelbergensis, H. neanderthalensis and H. sapiens). A novel randomization procedure designed for small samples (and using group 'distinctness values') was used to determine whether shape variation among the fossils is consistent with conventional taxonomy (or alternatively, whether a priori taxonomic groupings are completely random with respect to mandibular morphology). The randomization of 'distinctness values' was also used on the extant samples to assess the ability of the test to recognize known taxa. The discriminant analysis results demonstrated that, even for a relatively modest set of traditional mandibular corpus measurements, we can detect significant differences among extant hominids at the genus and species levels, and, in some cases, also at the subspecies level. Although the randomization of 'distinctness values' test is more conservative than discriminant analysis (based on comparisons with extant specimens), we were able to detect at least four distinct groups among the
Hominid mandibular corpus shape variation and its utility for recognizing species diversity within fossil Homo

Science.gov (United States)

Lague, Michael R; Collard, Nicole J; Richmond, Brian G; Wood, Bernard A

2008-01-01

Mandibular corpora are well represented in the hominin fossil record, yet few studies have rigorously assessed the utility of mandibular corpus morphology for species recognition, particularly with respect to the linear dimensions that are most commonly available. In this study, we explored the extent to which commonly preserved mandibular corpus morphology can be used to: (i) discriminate among extant hominid taxa and (ii) support species designations among fossil specimens assigned to the genus Homo. In the first part of the study, discriminant analysis was used to test for significant differences in mandibular corpus shape at different taxonomic levels (genus, species and subspecies) among extant hominid taxa (i.e. Homo, Pan, Gorilla, Pongo). In the second part of the study, we examined shape variation among fossil mandibles assigned to Homo(including H. habilis sensu stricto, H. rudolfensis, early African H. erectus/H. ergaster, late African H. erectus, Asian H. erectus, H. heidelbergensis, H. neanderthalensis and H. sapiens). A novel randomization procedure designed for small samples (and using group ‘distinctness values’) was used to determine whether shape variation among the fossils is consistent with conventional taxonomy (or alternatively, whether a priori taxonomic groupings are completely random with respect to mandibular morphology). The randomization of ‘distinctness values’ was also used on the extant samples to assess the ability of the test to recognize known taxa. The discriminant analysis results demonstrated that, even for a relatively modest set of traditional mandibular corpus measurements, we can detect significant differences among extant hominids at the genus and species levels, and, in some cases, also at the subspecies level. Although the randomization of ‘distinctness values’ test is more conservative than discriminant analysis (based on comparisons with extant specimens), we were able to detect at least four distinct groups
JaSlo: Integration of a Japanese-Slovene Bilingual Dictionary with a Corpus Search System

Directory of Open Access Journals (Sweden)

HMELJAK SANGAWA, Kristina

2012-12-01

Full Text Available The paper presents a set of integrated on-line language resources targeted at Japanese language learners, primarily those whose mother tongue is Slovene. The resources consist of the on-line Japanese-Slovene learners’ dictionary jaSlo and two corpora, a 1 million word Japanese-Slovene parallel corpus and a 300 million word corpus of web pages, where each word and sentence is marked by its difficulty level; this corpus is furthermore available as a set of five distinct corpora, each one containing sentences of the particular level. The corpora are available for exploration through NoSketch Engine, the open source version of the commercial state-of-the-art corpus analysis software Sketch Engine. The dictionary is available for Web searching, and dictionary entries have direct links to examples from the corpora, thus offering a wider picture of a possible translations in concrete contextualised examples, and b monolingual Japanese usage examples of different difficulty levels to support language learning.-----Članek predstavlja japonsko-slovenski slovar jaSlo, spletni slovar za slovensko govoreče učence japonščine, in vključitev primerov iz dveh korpusov s pomočjo odprto-kodnega korpusnega iskalnika NoSketch Engine. Korpusa sta jaSlo (milijon besed, vzporedni korpus japonskih in slovenskih besedil, ki je bil zgrajen za ta namen in vsebuje večinoma literarna, spletna in akademska besedila, ter JpWaC-L (300 milijonov besed, korpus spletnih besedil, razdeljenih v povedi, ki so rangirane po težavnostnih stopnjah. S pregledno povezavo korpusnih primerov in slovarskih iztočnic v dvojezičnem slovarju za učence japonščine kot tujega jezika, ponuja sistem uporabnikom prijazen dostop k slovarskim podatkom, tj. reprezentativnim prevodnim ustreznicam, in korpusnim podatkom, ki ponujajo a širšo sliko možnih prevodnih ustreznic v konkretnih primerih s sobesedilom in b enojezične primere rabe japonskih besed v povedih različnih te
Shape analysis of corpus callosum in phenylketonuria using a new 3D correspondence algorithm

Science.gov (United States)

He, Qing; Christ, Shawn E.; Karsch, Kevin; Peck, Dawn; Duan, Ye

2010-03-01

Statistical shape analysis of brain structures has gained increasing interest from neuroimaging community because it can precisely locate shape differences between healthy and pathological structures. The most difficult and crucial problem is establishing shape correspondence among individual 3D shapes. This paper proposes a new algorithm for 3D shape correspondence. A set of landmarks are sampled on a template shape, and initial correspondence is established between the template and the target shape based on the similarity of locations and normal directions. The landmarks on the target are then refined by iterative thin plate spline. The algorithm is simple and fast, and no spherical mapping is needed. We apply our method to the statistical shape analysis of the corpus callosum (CC) in phenylketonuria (PKU), and significant local shape differences between the patients and the controls are found in the most anterior and posterior aspects of the corpus callosum.
Pivotal role of hMT+ in long-range disambiguation of interhemispheric bistable surface motion.

Science.gov (United States)

Duarte, João Valente; Costa, Gabriel Nascimento; Martins, Ricardo; Castelo-Branco, Miguel

2017-10-01

It remains an open question whether long-range disambiguation of ambiguous surface motion can be achieved in early visual cortex or instead in higher level regions, which concerns object/surface segmentation/integration mechanisms. We used a bistable moving stimulus that can be perceived as a pattern comprehending both visual hemi-fields moving coherently downward or as two widely segregated nonoverlapping component objects (in each visual hemi-field) moving separately inward. This paradigm requires long-range integration across the vertical meridian leading to interhemispheric binding. Our fMRI study (n = 30) revealed a close relation between activity in hMT+ and perceptual switches involving interhemispheric segregation/integration of motion signals, crucially under nonlocal conditions where components do not overlap and belong to distinct hemispheres. Higher signal changes were found in hMT+ in response to spatially segregated component (incoherent) percepts than to pattern (coherent) percepts. This did not occur in early visual cortex, unlike apparent motion, which does not entail surface segmentation. We also identified a role for top-down mechanisms in state transitions. Deconvolution analysis of switch-related changes revealed prefrontal, insula, and cingulate areas, with the right superior parietal lobule (SPL) being particularly involved. We observed that directed influences could emerge either from left or right hMT+ during bistable motion integration/segregation. SPL also exhibited significant directed functional connectivity with hMT+, during perceptual state maintenance (Granger causality analysis). Our results suggest that long-range interhemispheric binding of ambiguous motion representations mainly reflect bottom-up processes from hMT+ during perceptual state maintenance. In contrast, state transitions maybe influenced by high-level regions such as the SPL. Hum Brain Mapp 38:4882-4897, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley
Features of natural and gonadotropin-releasing hormone antagonist-induced corpus luteum regression and effects of in vivo human chorionic gonadotropin.

Science.gov (United States)

Del Canto, Felipe; Sierralta, Walter; Kohen, Paulina; Muñoz, Alex; Strauss, Jerome F; Devoto, Luigi

2007-11-01

The natural process of luteolysis and luteal regression is induced by withdrawal of gonadotropin support. The objectives of this study were: 1) to compare the functional changes and apoptotic features of natural human luteal regression and induced luteal regression; 2) to define the ultrastructural characteristics of the corpus luteum at the time of natural luteal regression and induced luteal regression; and 3) to examine the effect of human chorionic gonadotropin (hCG) on the steroidogenic response and apoptotic markers within the regressing corpus luteum. Twenty-three women with normal menstrual cycles undergoing tubal ligation donated corpus luteum at specific stages in the luteal phase. Some women received a GnRH antagonist prior to collection of corpus luteum, others received an injection of hCG with or without prior treatment with a GnRH antagonist. Main outcome measures were plasma hormone levels and analysis of excised luteal tissue for markers of apoptosis, histology, and ultrastructure. The progesterone and estradiol levels, corpus luteum DNA, and protein contents in induced luteal regression resembled those of natural luteal regression. hCG treatment raised progesterone and estradiol in both natural luteal regression and induced luteal regression. The increase in apoptosis detected in induced luteal regression by cytochrome c in the cytosol, activated caspase-3, and nuclear DNA fragmentation, was similar to that observed in natural luteal regression. The antiapoptotic protein Bcl-2 was significantly lower during natural luteal regression. The proapoptotic proteins Bax and Bak were at a constant level. Apoptotic and nonapoptotic death of luteal cells was observed in natural luteal regression and induced luteal regression at the ultrastructural level. hCG prevented apoptotic cell death, but not autophagy. The low number of apoptotic cells disclosed and the frequent autophagocytic suggest that multiple mechanisms are involved in cell death at luteal
White matter changes contribute to corpus callosum atrophy in the elderly: The LADIS study

DEFF Research Database (Denmark)

Ryberg, C.; Rostrup, E.; Sjöstrand, Karl

2008-01-01

BACKGROUND AND PURPOSE: The corpus callosum (CC) is the most important structure involved in the transmission of interhemispheric information. The aim of this study was to investigate the potential correlation between regional age-related white matter changes (ARWMC) and atrophy of CC in elderly ...
Corpus callosum atrophy as a marker of clinically meaningful cognitive decline in secondary progressive multiple sclerosis. Impact on employment status.

Science.gov (United States)

Papathanasiou, Athanasios; Messinis, Lambros; Zampakis, Petros; Papathanasopoulos, Panagiotis

2017-09-01

Cognitive impairment in Multiple Sclerosis (MS) is more frequent and pronounced in secondary progressive MS (SPMS). Cognitive decline is an important predictor of employment status in patients with MS. Magnetic Resonance Imaging (MRI) markers have been used to associate tissue damage with cognitive dysfunction. The aim of the study was to designate the MRI marker that predicts cognitive decline in SPMS and explore its effect on employment status. 30 SPMS patients and 30 healthy participants underwent neuropsychological assessment using the Trail Making Test (TMT) parts A and B, semantic and phonological verbal fluency task and a computerized cognitive screening battery (Central Nervous System Vital Signs). Employment status was obtained as a quality of life measure. Brain MRI was performed in all participants. We measured total lesion volume, third ventricle width, thalamic and corpus callosum atrophy. The frequency of cognitive decline for our SPMS patients was 80%. SPMS patients differed significantly from controls in all neuropsychological measures. Corpus callosum area was correlated with cognitive flexibility, processing speed, composite memory, executive functions, psychomotor speed, reaction time and phonological verbal fluency task. Processing speed and composite memory were the most sensitive markers for predicting employment status. Corpus callosum area was the most sensitive MRI marker for memory and processing speed. Corpus callosum atrophy predicts a clinically meaningful cognitive decline, affecting employment status in our SPMS patients. Copyright © 2017 Elsevier Ltd. All rights reserved.
The usage of amount, quantity and body in a corpus of biology

Directory of Open Access Journals (Sweden)

Purificación Sánchez Hernández

2002-04-01

Full Text Available Grammars and dictionaries usually offer relevant and accurate information to students of a second language. However, the meaning of a textual element is often dynamic and that information is not always based on real usage patterns. New occurrences on the object level in new contexts can introduce novel semantic potentials, so that existing interpretations may be superseded by new ones. Concordancing has been shown to be one of the most important tools to facilitate the understanding of the usage patterns of a language. In this paper we examine the differences between amount, quantity and body as terms expressing magnitude, sum and size in a corpus of Biology. According to some popular dictionaries and grammars, the terms amount and quantity have always been considered synonymous terms for expressing magnitude, size and sum. We demonstrate that, according to our records, they cannot be always used as synonymous terms since they have different patterns of usage. On the other hand there are other forms, such as body, that appear in our Corpus, implying magnitude, size and sum, that are not usually described as having such meanings in dictionaries.
Hemoperitoneum from Corpus Luteal Cyst Rupture: A Practical Approach in Emergency Room

Directory of Open Access Journals (Sweden)

Valeria Fiaschetti

2014-01-01

Full Text Available Corpus luteum cyst rupture with consequent hemoperitoneum is a common disorder in women in their reproductive age. This condition should be promptly recognized and treated because a delayed diagnosis may significantly reduce women’s fertility and intra-abdominal bleeding may be life-threatening. Many imaging modalities play a key role in the diagnosis of acute pelvic pain from gynecological causes. Ultrasound study (USS is usually the first imaging technique for initial evaluation. USS is used to confirm or to exclude the presence of intraperitoneal fluid but it has some limitations in the identification of the bleeding source. Contrast-enhanced computed tomography (CT is the imaging modality which could be used in the acute setting in order to recognize gynecological emergencies and to establish a correct management. Magnetic resonance imaging (MRI nowadays is the most useful technique for studying the pelvis but its low availability and the long acquisition time of the images limit its usefulness in characterization of acute gynecological complications. We report a case of a young patient with hemoperitoneum from hemorrhagic corpus luteum correctly identified by transabdominal USS and contrast-enhanced CT.
In vivo high-resolution magnetic resonance elastography of the uterine corpus and cervix

International Nuclear Information System (INIS)

Jiang, Xuyuan; Asbach, Patrick; Streitberger, Kaspar-Josche; Hamm, Bernd; Sack, Ingolf; Guo, Jing; Thomas, Anke; Braun, Juergen

2014-01-01

To apply 3D multifrequency MR elastography (3DMMRE) to the uterus and analyse the viscoelasticity of the uterine tissue in healthy volunteers considering individual variations and variations over the menstrual cycle. Sixteen healthy volunteers participated in the study, one of whom was examined 12 times over two menstrual cycles. Pelvic 3DMMRE was performed on a 1.5-T scanner with seven vibration frequencies (30-60 Hz) using a piezoelectric driver. Two mechanical parameter maps were obtained corresponding to the magnitude (vertical stroke G* vertical stroke) and the phase angle (φ) of the complex shear modulus. On average, the uterine corpus had higher elasticity, but similar viscosity compared with the cervix, reflected by vertical stroke G* vertical stroke uterine corpus = 2.58 ± 0.52 kPa vs. vertical stroke G* vertical stroke cervix = 2.00 ± 0.34 kPa (p uterine corpus = 0.54 ± 0.08, φ cervix = 0.57 ± 0.12 (p = 0.428). With 2.23 ± 0.26 kPa, vertical stroke G* vertical stroke of the myometrium was lower in the secretory phase (SP) compared with that of the proliferative phase (PP, vertical stroke G* vertical stroke = 3.01 ± 0.26 kPa). For the endometrium, the value of vertical stroke G* vertical stroke in SP was 68 % lower than during PP (PP, vertical stroke G* vertical stroke = 3.34 ± 0.42 kPa; SP, vertical stroke G* vertical stroke = 1.97 ± 0.34 kPa; p = 0.0061). 3DMMRE produces high-resolution mechanical parameter maps of the uterus and cervix and shows sensitivity to structural and functional changes of the endometrium and myometrium during the menstrual cycle. (orig.)
In vivo high-resolution magnetic resonance elastography of the uterine corpus and cervix

Energy Technology Data Exchange (ETDEWEB)

Jiang, Xuyuan [The First Affiliated Hospital of China Medical University, Department of Radiology, Shenyang (China); Asbach, Patrick; Streitberger, Kaspar-Josche; Hamm, Bernd; Sack, Ingolf; Guo, Jing [Charite - Universitaetsmedizin Berlin, Department of Radiology, Berlin (Germany); Thomas, Anke [Charite - Universitaetsmedizin Berlin, Departments of Gynecology and Obstetrics, Berlin (Germany); Braun, Juergen [Charite - Universitaetsmedizin Berlin, Department of Medical Informatics, Berlin (Germany)

2014-12-15

To apply 3D multifrequency MR elastography (3DMMRE) to the uterus and analyse the viscoelasticity of the uterine tissue in healthy volunteers considering individual variations and variations over the menstrual cycle. Sixteen healthy volunteers participated in the study, one of whom was examined 12 times over two menstrual cycles. Pelvic 3DMMRE was performed on a 1.5-T scanner with seven vibration frequencies (30-60 Hz) using a piezoelectric driver. Two mechanical parameter maps were obtained corresponding to the magnitude (vertical stroke G* vertical stroke) and the phase angle (φ) of the complex shear modulus. On average, the uterine corpus had higher elasticity, but similar viscosity compared with the cervix, reflected by vertical stroke G* vertical stroke {sub uterine} {sub corpus} = 2.58 ± 0.52 kPa vs. vertical stroke G* vertical stroke {sub cervix} = 2.00 ± 0.34 kPa (p < 0.0001) and φ {sub uterine} {sub corpus} = 0.54 ± 0.08, φ {sub cervix} = 0.57 ± 0.12 (p = 0.428). With 2.23 ± 0.26 kPa, vertical stroke G* vertical stroke of the myometrium was lower in the secretory phase (SP) compared with that of the proliferative phase (PP, vertical stroke G* vertical stroke = 3.01 ± 0.26 kPa). For the endometrium, the value of vertical stroke G* vertical stroke in SP was 68 % lower than during PP (PP, vertical stroke G* vertical stroke = 3.34 ± 0.42 kPa; SP, vertical stroke G* vertical stroke = 1.97 ± 0.34 kPa; p = 0.0061). 3DMMRE produces high-resolution mechanical parameter maps of the uterus and cervix and shows sensitivity to structural and functional changes of the endometrium and myometrium during the menstrual cycle. (orig.)
Multimodal Corpus Analysis as a Method for Ensuring Cultural Usability of Embodied Conversational Agents

DEFF Research Database (Denmark)

Nakano, Yukiko; Rehm, Matthias

2009-01-01

In this paper we propose the method of multimodal corpus analysis to collect enough empirical data for modeling the behavior of embodied conversational agents. This is a prerequisite to ensure the usability of such complex interactive systems. So far, the development of embodied agents suffers fr...
A standardized procedure for using human corpus cavernosum strips to evaluate drug activity.

Science.gov (United States)

Mirone, V; Sorrentino, R; di Villa Bianca, R; Imbimbo, C; Palmieri, A; Fusco, F; Tajana, G; Cirino, G

2000-01-01

The main problem of using human corpus cavernosum (HCC) tissue to perform bioassay is linked to its limited availability further complicated by the heterogeneous source of the tissues used. Here, we show that gender reassignment is a reliable source of human tissue without major ethical problems. Indeed, the entire corpus cavernosum is obtained from the surgery procedure, which allows creating a standardized procedure to prepare HCC strip. In addition, human tissue, if kept in the fridge in the condition described, does not loose its ability to contract to phenylephrine (PE; alpha agonist), angiotensin II (AG II) and KCl up to 4 days. Furthermore, once contracted with PE, HCC relaxes to acetylcholine (endothelium-dependent mechanism); sodium nitroprusside (endothelium-independent mechanism); cromakalim (CRK), a K(ATP) channel opener; or alprostadil, a synthetic PGE2 (ALPR). In conclusion, we have standardized a procedure that allows the use of HCC strips to evaluate drug activity and/or to study pathophysiological mechanisms with an intact functional human tissue up to 4 days from the surgery procedure.
Organising white matter in a brain without corpus callosum fibres.

Science.gov (United States)

Bénézit, Audrey; Hertz-Pannier, Lucie; Dehaene-Lambertz, Ghislaine; Monzalvo, Karla; Germanaud, David; Duclap, Delphine; Guevara, Pamela; Mangin, Jean-François; Poupon, Cyril; Moutard, Marie-Laure; Dubois, Jessica

2015-02-01

Isolated corpus callosum dysgenesis (CCD) is a congenital malformation which occurs during early development of the brain. In this study, we aimed to identify and describe its consequences beyond the lack of callosal fibres, on the morphology, microstructure and asymmetries of the main white matter bundles with diffusion imaging and fibre tractography. Seven children aged between 9 and 13 years old and seven age- and gender-matched control children were studied. First, we focused on bundles within the mesial region of the cerebral hemispheres: the corpus callosum, Probst bundles and cingulum which were selected using a conventional region-based approach. We demonstrated that the Probst bundles have a wider connectivity than the previously described rostrocaudal direction, and a microstructure rather distinct from the cingulum but relatively close to callosal remnant fibres. A sigmoid bundle was found in two partial ageneses. Second, the corticospinal tract, thalamic radiations and association bundles were extracted automatically via an atlas of adult white matter bundles to overcome bias resulting from a priori knowledge of the bundles' anatomical morphology and trajectory. Despite the lack of callosal fibres and the colpocephaly observed in CCD, all major white matter bundles were identified with a relatively normal morphology, and preserved microstructure (i.e. fractional anisotropy, mean diffusivity) and asymmetries. Consequently the bundles' organisation seems well conserved in brains with CCD. These results await further investigations with functional imaging before apprehending the cognition variability in children with isolated dysgenesis. Copyright © 2014 Elsevier Ltd. All rights reserved.
Comparison of oxidative/antioxidative status of penile corpus cavernosum blood and peripheral venous blood.

Science.gov (United States)

Yeni, E; Gulum, M; Selek, S; Erel, O; Unal, D; Verit, A; Savas, M

2005-01-01

The aim of the study is to determine and to compare the oxidative and antioxidative status of penile corpus cavernosum and peripheral venous blood. A total of 28 adult healthy males were included in the study. Whole blood was simultaneously withdrawn from penile corpus cavernosum and the cubital vein and their plasma separated. Total antioxidant capacity (TAC), vitamin C, total protein, albumin, uric acid, bilirubin and total peroxide (TP) levels of both plasma samples were measured and compared. While TAC, total protein, albumin, bilirubin and uric acid levels were higher, vitamin C levels were lower in cavernosal blood than that of peripheral blood. On the other hand, TP level was found to be higher in penile blood samples than that of peripheral blood. We thought that the normal erectile process of the penile cavernosal body leads to increased production of oxidants as in the mechanism of ischaemia-reperfusion; however, the increase of TAC can prevent development of oxidative injury.

As metáforas do presidente lula na perspectiva da linguística de corpus: o caso do desenvolvimento President Lula's metaphors in a corpus linguistic perspective: The case of 'development'

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2010-01-01

Full Text Available Um dos grandes fenômenos linguísticos da vida política brasileira recente é o que a mídia vem chamando de 'metáforas do presidente Lula'. O ponto de partida deste trabalho é o fato de que deve haver muitas metáforas que passam despercebidas no discurso do presidente e que podem ser descobertas por meio de pesquisa com corpora eletrônicos. Investigamos a presença de metáforas conceptuais relacionadas a 'desenvolvimento' em um corpus composto por pronunciamentos emitidos ao longo de três anos pelo presidente Luís Inácio Lula da Silva. Os resultados indicam que há uso sistemático de três conceitos metafóricos que definem a noção de desenvolvimento do chefe de Estado: VIAGEM, CONSTRUÇÃO e ORGANISMO. Esses três conceitos, em geral, equacionam desenvolvimento com um processo longo, construído, planejado e gerado pelo governo.One of the main linguistic phenomena in recent Brazilian politics is what the media has called 'President Lula's metaphors'. The starting point for the present investigation is that there must be lots of metaphors that go unnoticed in the president's discourse and that these may be uncovered by corpus-based research. We looked at the presence of conceptual metaphors related to 'development' in a corpus of three years of official presidential speeches. The results indicated the systematic use of three metaphorical concepts that together define the notion of development for the head of State: JOURNEY, BUILDING and ORGANISM. These three concepts together equate development with a long process that is generated, planned and carried out by the government.
Penile fracture: MRI demonstration of a urethral tear associated with a rupture of the corpus cavernosum

International Nuclear Information System (INIS)

Maubon, A.J.; Roux, J.O.; Ferru, J.M.; Rouanet, J.P.; Faix, A.; Segui, B.

1998-01-01

We report a case of traumatic urethral tear associated with a rupture of the corpus cavernosum, demonstrated on MRI. We discuss the potential role of a non-invasive preoperative assessment by MRI. (orig.)
Cluster Analysis of the Newcastle Electronic Corpus of Tyneside English: In A Comparison of Methods

NARCIS (Netherlands)

Moisl, Hermann; Jones, Valerie M.

2005-01-01

This article examines the feasibility of an empirical approach to sociolinguistic analysis of the Newcastle Electronic Corpus of Tyneside English using exploratory multivariate methods. It addresses a known problem with one class of such methods, hierarchical cluster analysis—that different
Learning and memory in individuals with agenesis of the corpus callosum

OpenAIRE

Paul, Lynn K.; Erickson, Roger L.; Hartman, Jo Ann; Brown, Warren S.

2016-01-01

Damage to long white matter pathways in the cerebral cortex is known to affect memory capacity. However, the specific contribution of interhemispheric connectivity in memory functioning is only beginning to become understood. The present study examined verbal and visual memory processing in individuals with agenesis of the corpus callosum (AgCC) using the Wechsler Memory Scale-Third Edition (WMS-III; Wechsler, 1997b). Thirty participants with AgCC (FSIQ >78) were compared against 30 healthy a...
Functional Topography of Human Corpus Callosum: An fMRI Mapping Study

OpenAIRE

Fabri, Mara; Polonara, Gabriele

2013-01-01

The concept of a topographical map of the corpus callosum (CC) has emerged from human lesion studies and from electrophysiological and anatomical tracing investigations in other mammals. Over the last few years a rising number of researchers have been reporting functional magnetic resonance imaging (fMRI) activation in white matter, particularly the CC. In this study the scope for describing CC topography with fMRI was explored by evoking activation through simple sensory stimulation and moto...
When "Safe" Means "Dangerous": A Corpus Investigation of Risk Communication in the Media

Science.gov (United States)

Tang, Chris; Rundblad, Gabriella

2017-01-01

The mass media has an important role in informing the general public about emerging health risks. Content-based studies of risk communication in the media have revealed a tendency to exaggerate risks or simplify science, but linguistic studies in this area are still scarce. This paper outlines a corpus based investigation of media reporting on the…
INFECTIOUS DISEASES ARE SLEEPING MONSTERS: Conventional and culturally adapted new metaphors in a corpus of abstracts on immunology

Directory of Open Access Journals (Sweden)

Laura Hidalgo Downing

2009-04-01

Full Text Available In this paper we examine the role played by metaphor in a corpus of sixty abstracts on immunology from Scientific American. We focus on the distinction between conventional metaphors and culturally adapted new metaphors and discuss the role played by metaphor choice in the communicative purposes of the abstracts and their register features. We argue that one of the main strategies used to attract the reader‘s attention is the combination of highly conventionalized metaphors, which occur more frequently in the corpus, together with what we call “culturally adapted new metaphors”, which display different degrees of creativity and are less frequent in the corpus. Conventional metaphors typically reinforce the world view shared by the scientific community and introduce basic ideas on the subject of immunology. Culturally adapted new metaphors include a cline from slightly new perspectives of conventional models, to highly creative uses of metaphor. Culturally adapted new metaphors appeal primarily to a general readership and not to the scientific community, as they tap human emotions and mythic constructions. These play a crucial role in the abstracts, as they contribute to persuasive and didactic communicative functions in the text.
MRI Findings of Coexistence of Ectopic Neurohypophysis, Corpus Callosum Dysgenesis, and Periventricular Neuronal Heterotopia

Directory of Open Access Journals (Sweden)

Harun Arslan

2014-01-01

Full Text Available Ectopic neurohypophysis is a pituitary gland abnormality, which can accompany growth hormone deficiency associated with dwarfism. Here we present magnetic resonance imaging (MRI findings of a rare case of ectopic neurohypophysis, corpus callosum dysgenesis, and periventricular neuronal heterotopia coexisting, with a review of the literature.
Comparing a rule based vs. statistical system for automatic categorization of MEDLINE documents according to biomedical specialty

OpenAIRE

Humphrey, Susanne M.; Névéol, Aurélie; Browne, Allen; Gobeill, Julien; Ruch, Patrick; Darmoni, Stéfan J.

2010-01-01

Automatic document categorization is an important research problem in Information Science and Natural Language Processing. Many applications, including Word Sense Disambiguation and Information Retrieval in large collections, can benefit from such categorization. This paper focuses on automatic categorization of documents from the biomedical literature into broad discipline-based categories. Two different systems are described and contrasted: CISMeF, which uses rules based on human indexing o...
The Treatment of Word Sense Inventories in the ‘LACELL WSD Project’

Directory of Open Access Journals (Sweden)

Moisés Almela

2009-12-01

Full Text Available The WSD community has long debated whether the criteria for representing polysemy in general purpose dictionaries meet the specific demands of sense disambiguation tasks. Concern is growing that pre-defined sense inventories might not adjust well to the needs of WSD, because word occurrences can rarely be paired with rigid sense classes in a one-toone fashion. A second cause for concern is the level of sense granularity adopted in conventional dictionary entries.Fine-grained distinctions can be useful for a dictionary user but complicate the design and evaluation of WSD systems in a way that is often unnecessary. As a result of these objections, many experts have voiced the opinion that dictionaries are not adequate sources of sense inventories for WSD. However, the problem of word sense overlaps can also be resolved by modifying the way in which dictionary entries are processed by WSD programs. This is the solution applied in the LACELL WSD system. The algorithm selects simultaneously two or more dictionary senses if the context does not allow sufficient discrimination between/among them. This article explains the underpinnings of such proposal, as well as discussing some advantages and disadvantages.En el ámbito de la investigación sobre desambiguación léxica automática (WSD, se ha venido debatiendo largo tiempo acerca de la adecuación de los modelos de análisispolisémico empleados en los diccionarios de carácter general. En particular, existe una creciente preocupación en torno a los problemas generados por la utilización de inventarios de acepciones léxicas, ya que no es frecuente hallar correspondencias biunívocas entre los usos de una palabra en contextos específicos y las clases semánticas preestablecidas en la entrada léxica. Además, se duda de que el nivel de granularidad semántica aplicado en la lexicografía convencional sea el más adecuado para las necesidades específicas de la WSD. Como consecuencia de estas
Measuring the readability of sustainability reports: : A corpus-based analysis through standard formulae and NLP

NARCIS (Netherlands)

Smeuninx, N.; De Clerck, B.; Aerts, Walter

2016-01-01

This study characterises and problematises the language of corporate reporting along region, industry, genre, and content lines by applying readability formulae and more advanced natural language processing (NLP)–based analysis to a manually assembled 2.75-million-word corpus. Readability formulae
MORPHOMETRIC ANALYSIS OF CORPUS CALLOSUM- A STUDY IN CADAVER AND MRI

Directory of Open Access Journals (Sweden)

Ambili Puthanveetil

2017-07-01

Full Text Available BACKGROUND The Corpus Callosum (CC can best be seen in the mid-sagittal section of brain both in cadaver and MRI. The morphometric measurements of the same will be of use in neurosurgical procedures. Sexual dimorphism and the age-related changes in its measurements remained controversial. Till date, no studies have been done on corpus callosum in Kerala. MATERIALS AND METHODS Measurements of CC has been taken and studied in detail in 24 formalin fixed brains from the Department of Anatomy and 48 MR images from the Department of Radiology. The changes according to age and sex were analysed. RESULTS The mean length of CC in the cadaver was 7.24 cm, which was 3.38 cm posterior to frontal pole and 5.73 cm anterior to occipital pole. In MR images, the mean length was 7.10 in males and 6.76 in females. The difference we got was not statistically significant. The length increased with age. Thickness of genu and body decreased as the age advances, but the splenial thickness was found to be increasing with age. There was significant correlation between the thicknesses of various parts of CC. CONCLUSION The values were almost similar to those in the previous studies. Morphometrically, a significant gender difference was not identified in the present study. There were changes according to age both in males and females.
Large Sphenoethmoidal Encephalocele Associated with Agenesis of Corpus Callosum and Cleft Palate

Directory of Open Access Journals (Sweden)

Basir Hashemi

2010-06-01

Full Text Available AbstractBasal encephalocele is a rare craniofacial anomaly. In the presentpaper we report a 10-year-old boy presented with cleftpalate, congenital nystagmus, and hypertelorism. During preoperativeevaluation for cleft palate repair, a pulsatile masswas detected in the pharynx. Magnetic resonance imagingshowed sphenoethmoidal type of basal encephalocele andagenesis of corpus callosum. Neurosurgical consultation wasperformed for further evaluation and management.Iran J Med Sci 2010; 35(2: 154-156.
Corpus-aided language pedagogy : the use of concordance lines in vocabulary instruction

OpenAIRE

Kazaz, İlknur

2015-01-01

Ankara : The Program of Teaching English as a Foreign Language Bilkent University, 2015. Thesis (Master's) -- Bilkent University, 2015. Includes bibliographical references leaves 83-91. This study investigated the effectiveness of the use of a concordance software and concordance lines as a pedagogical tool to learn the target vocabulary of a text book. The purpose of the study was to compare the effects of corpus-aided vocabulary instruction with traditional vocabulary teac...
Linguistic Markers of Stance in Early and Advanced Academic Writing: A Corpus-Based Comparison

Science.gov (United States)

Aull, Laura L.; Lancaster, Zak

2014-01-01

This article uses corpus methods to examine linguistic expressions of stance in over 4,000 argumentative essays written by incoming first-year university students in comparison with the writing of upper-level undergraduate students and published academics. The findings reveal linguistic stance markers shared across the first-year essays despite…
Using a Corpus-Informed Pedagogical Intervention to Develop Language Awareness toward Appropriate Lexicogrammatical Choices

Science.gov (United States)

Fernandez, Julieta; Yuldashev, Aziz

2015-01-01

The corpus-informed pedagogical intervention described in this article was developed for an advanced English as a Second Language (ESL) course designed for prospective International Teaching Assistants (ITAs) and implemented over the course of two class periods. Its primary goal was to offer students opportunities to gain language awareness of…
Corpus Callosum Size is Linked to Dichotic Deafness and Hemisphericity, Not Sex or Handedness

Science.gov (United States)

Morton, Bruce E.; Rafto, Stein E.

2006-01-01

Individuals differ in the number of corpus callosum (CC) nerve fibers interconnecting their cerebral hemispheres by about threefold. Early reports suggested that males had smaller CCs than females. This was often interpreted to support the concept that the male brain is more "lateralized" or "specialized," thus accounting for presumed male…
Conjunctions in ELF academic discourse: a corpus-based analysis

Directory of Open Access Journals (Sweden)

Laura Centonze

2014-03-01

Full Text Available Abstract – Conjunctions as fundamental elements in the construction of discourse cohesion represent a relatively neglected research area, due to their complexity and the bewildering number of “conjunctive relations” (Halliday and Hasan 1976: 226 that they may express in context, as also highlighted in Christiansen (2011. In addition to this, there does not seem to be a shared view as far as the classification and denomination of the different kinds of conjunctions are concerned (cf. Halliday and Hasan 1976; Vande Kopple 1985; Martin and Rose 2003; Hyland 2005b. The selection of a specific type of conjunction acquires more importance because they are typically open to so many different interpretations, especially when the participants in the speech event come from diverse lingua-cultural backgrounds (cf. Guido 2007; Guido 2008; Cogo et al. 2011.Following the taxonomy provided by Halliday and Hasan (1976 for conjunctions, our study attempts to shed light on the usage of conjunctions by ELF speakers in specific contexts. We shall consider ten transcripts taken from the VOICE Corpus (Seidlhofer et. al 2013, namely five interviews and five conversations in multicultural academic contexts (approximately 4,000 words each, and analyze the number of instances for each type of conjunction (additive, adversative, clausal, temporal as well as continuatives in depth, by adopting a quantitative as well as a qualitative method and by using TextSTAT 2.9 (Huning 2012. We shall then move on to the analysis of conjunctions with respect to their internal properties/collocates and eventually see the occurrence of conjunctions by comparing them with the two different speech events which are chosen as the subject of our study, i.e. interviews and conversations. We shall see the extent to which certain conjunctions are more restricted than others in terms of usage (cf. Leung 2005 in both types of speech events, despite the great number of options available to the
Development of automatic extraction of the corpus callosum from magnetic resonance imaging of the head and examination of the early dementia objective diagnostic technique in feature analysis

International Nuclear Information System (INIS)

Kodama, Naoki; Kaneko, Tomoyuki

2005-01-01

We examined the objective diagnosis of dementia based on changes in the corpus callosum. We examined midsagittal head MR images of 17 early dementia patients (2 men and 15 women; mean age, 77.2±3.3 years) and 18 healthy elderly controls (2 men and 16 women; mean age, 73.8±6.5 years), 35 subjects altogether. First, the corpus callosum was automatically extracted from the MR images. Next, early dementia was compared with the healthy elderly individuals using 5 features of the straight-line methods, 5 features of the Run-Length Matrix, and 6 features of the Co-occurrence Matrix from the corpus callosum. Automatic extraction of the corpus callosum showed an accuracy rate of 84.1±3.7%. A statistically significant difference was found in 6 of the 16 features between early dementia patients and healthy elderly controls. Discriminant analysis using the 6 features demonstrated a sensitivity of 88.2% and specificity of 77.8%, with an overall accuracy of 82.9%. These results indicate that feature analysis based on changes in the corpus callosum can be used as an objective diagnostic technique for early dementia. (author)
The Sweet-Home speech and multimodal corpus for home automation interaction

OpenAIRE

Vacher , Michel; Lecouteux , Benjamin; Chahuara , Pedro; Portet , François; Meillon , Brigitte; Bonnefond , Nicolas

2014-01-01

International audience; Ambient Assisted Living aims at enhancing the quality of life of older and disabled people at home thanks to Smart Homes and Home Automation. However, many studies do not include tests in real settings, because data collection in this domain is very expensive and challenging and because of the few available data sets. The SWEET-H OME multimodal corpus is a dataset recorded in realistic conditions in D OMUS, a fully equipped Smart Home with microphones and home automati...

Cytokines and Angiogenesis in the Corpus Luteum

Directory of Open Access Journals (Sweden)

António M. Galvão

2013-01-01

Full Text Available In adults, physiological angiogenesis is a rare event, with few exceptions as the vasculogenesis needed for tissue growth and function in female reproductive organs. Particularly in the corpus luteum (CL, regulation of angiogenic process seems to be tightly controlled by opposite actions resultant from the balance between pro- and antiangiogenic factors. It is the extremely rapid sequence of events that determines the dramatic changes on vascular and nonvascular structures, qualifying the CL as a great model for angiogenesis studies. Using the mare CL as a model, reports on locally produced cytokines, such as tumor necrosis factor α (TNF, interferon gamma (IFNG, or Fas ligand (FASL, pointed out their role on angiogenic activity modulation throughout the luteal phase. Thus, the main purpose of this review is to highlight the interaction between immune, endothelial, and luteal steroidogenic cells, regarding vascular dynamics/changes during establishment and regression of the equine CL.
Subluxation and semantics: a corpus linguistics study.

Science.gov (United States)

Budgell, Brian

2016-06-01

The purpose of this study was to analyze the curriculum of one chiropractic college in order to discover if there were any implicit consensus definitions of the term subluxation. Using the software WordSmith Tools, the corpus of an undergraduate chiropractic curriculum was analyzed by reviewing collocated terms and through discourse analysis of text blocks containing words based on the root 'sublux.' It was possible to identify 3 distinct concepts which were each referred to as 'subluxation:' i) an acute or instantaneous injurious event; ii) a clinical syndrome which manifested post-injury; iii) a physical lesion, i.e. an anatomical or physiological derangement which in most instances acted as a pain generator. In fact, coherent implicit definitions of subluxation exist and may enjoy broad but subconscious acceptance. However, confusion likely arises from failure to distinguish which concept an author or speaker is referring to when they employ the term subluxation.
In vivo bone strain in the mandibular corpus of Sapajus during a range of oral food processing behaviors.

Science.gov (United States)

Ross, Callum F; Iriarte-Diaz, Jose; Reed, David A; Stewart, Thomas A; Taylor, Andrea B

2016-09-01

It has been hypothesized that mandibular corpus morphology of primates is related to the material properties of the foods that they chew. However, chewing foods with different material properties is accompanied by low levels of variation in mandibular strain patterns in macaques. We hypothesized that if variation in primate mandible form reflects adaptations to feeding on foods with different material and geometric properties, then this variation will be driven primarily by differences in oral food processing behavior rather than differences in chewing per se. To test this hypothesis, we recorded in vivo bone strain data from the lateral and medial surfaces of the mandibular corpus during complete feeding sequences in three adult male Sapajus as they fed on foods with a range of sizes and material properties. We assessed whether variation in mandibular corpus strain regimes is associated with variation in feeding behaviors and/or chewing on different foods, and we quantified the relative variation in mandibular corpus strain regimes associated with chewing on foods of different material properties versus a range of oral food processing behaviors (incisor, premolar, and molar biting; pulling on incisors; mastication). Feeding behavior had a significant effect on mandibular corpus strain regimes, as did chewing side and the cycle number in a feeding sequence. However, food type had weaker effects and usually only through interaction effects with chewing side and/or cycle type. Strain regimes varied most across different chew sides, then across different behaviors, and lastly between mastication cycles on different foods. Strain magnitudes associated with premolar, molar, and incisor biting were larger than those recorded during mastication. These data suggest that intra- and inter-specific variation in mandible morphology is a trade-off between performance requirements of different oral food processing behaviors and of variation in chewing side, with direct effects
Corpus-Based Websites to Promote Learner Autonomy in Correcting Writing Collocation Errors

Directory of Open Access Journals (Sweden)

Pham Thuy Dung

2016-12-01

Full Text Available The recent yet powerful emergence of E-learning and using online resources in learning EFL (English as a Foreign Language has helped promote learner autonomy in language acquisition including self-correcting their mistakes. This pilot study despite conducted on a modest sample of 25 second year students majoring in Business English at Hanoi Foreign Trade University is an initial attempt to investigate the feasibility of using corpus-based websites to promote learner autonomy in correcting collocation errors in EFL writing. The data is collected using a pre-questionnaire and a post-interview aiming to find out the participants’ change in belief and attitude toward learner autonomy in collocation errors in writing, the extent of their success in using the corpus-based websites to self-correct the errors and the change in their confidence in self-correcting the errors using the websites. The findings show that a significant majority of students have shifted their belief and attitude toward a more autonomous mode of learning, enjoyed a fair success of using the websites to self-correct the errors and become more confident. The study also yields an implication that a face-to-face training of how to use these online tools is vital to the later confidence and success of the learners
Disambiguating past events: Accurate source memory for time and context depends on different retrieval processes.

Science.gov (United States)

Persson, Bjorn M; Ainge, James A; O'Connor, Akira R

2016-07-01

Current animal models of episodic memory are usually based on demonstrating integrated memory for what happened, where it happened, and when an event took place. These models aim to capture the testable features of the definition of human episodic memory which stresses the temporal component of the memory as a unique piece of source information that allows us to disambiguate one memory from another. Recently though, it has been suggested that a more accurate model of human episodic memory would include contextual rather than temporal source information, as humans' memory for time is relatively poor. Here, two experiments were carried out investigating human memory for temporal and contextual source information, along with the underlying dual process retrieval processes, using an immersive virtual environment paired with a 'Remember-Know' memory task. Experiment 1 (n=28) showed that contextual information could only be retrieved accurately using recollection, while temporal information could be retrieved using either recollection or familiarity. Experiment 2 (n=24), which used a more difficult task, resulting in reduced item recognition rates and therefore less potential for contamination by ceiling effects, replicated the pattern of results from Experiment 1. Dual process theory predicts that it should only be possible to retrieve source context from an event using recollection, and our results are consistent with this prediction. That temporal information can be retrieved using familiarity alone suggests that it may be incorrect to view temporal context as analogous to other typically used source contexts. This latter finding supports the alternative proposal that time since presentation may simply be reflected in the strength of memory trace at retrieval - a measure ideally suited to trace strength interrogation using familiarity, as is typically conceptualised within the dual process framework. Copyright © 2016 Elsevier Inc. All rights reserved.
A New English?Arabic Parallel Text Corpus for Lexicographic Applications

Directory of Open Access Journals (Sweden)

Hashan Al-Ajmi

2011-10-01

Full Text Available
Abstract: Bilingual lexicographers, translation specialists and English teachers in the Arabworld do not have access to computerized corpora of parallel texts for the English–Arabic languagepair. This project has been carried out to meet this requirement by establishing the first generalparallel corpus of English texts and their Arabic translations. The first phase of the project involvedthe selection of general source texts having appropriate lexical and stylistic features. The chosensource texts deal with a variety of topics such as the environment, globalization, psychology, history,politics, drama, etc. Their Arabic translations were taken from The World of Knowledge seriespublished by the National Council for Culture, Arts and Letters (NCCAL in Kuwait.
Keywords: PARALLEL CORPUS, LEXICOGRAPHY, TRANSLATION, BILINGUAL DICTIONARY,COLLOCATIONS, ALIGNMENT, SYNONYMS, DERIVATIVES, ANTONYMS, GLOSSARY,FREQUENCY
Opsomming: 'n Nuwe Engels–Arabiese parallelletekskorpus vir leksikografiesetoepassings Tweetalige leksikograwe, vertaalkundiges en Engelsonderwysers in dieArabiese wêreld het nie toegang tot gerekenariseerde korpusse van parallelle tekste vir die Engels–Arabiese taalpaar nie. Hierdie projek is onderneem om in dié behoefte te voorsien deur die eerstealgemene parallelle korpus van Engelse tekste en hul Arabiese vertalings tot stand te bring. Dieeerste fase van die projek het die keuse van algemene brontekste behels wat geskikte leksikale enstilistiese eienskappe besit. Die gekose brontekste handel oor 'n verskeidenheid onderwerpe soosdie omgewing, globalisering, psigologie, geskiedenis, politiek, drama, ens. Hul Arabiese vertalingsis geneem uit The World of Knowledge-reeks gepubliseer deur die National Council for Culture, Artsand Letters (NCCAL in Koeweit.
Sleutelwoorde: PARALLELLE KORPUS, LEKSIKOGRAFIE, VERTALING, TWEETALIGEWOORDEBOEK, KOLLOKASIES, OOREENSTEMMING, SINONIEME, AFLEIDINGS, ANTONIEME
Field – Football Expressions Dictionary: a lexicographic resource based on the theoretical-methodological approach of frame semantics and corpus linguistics

Directory of Open Access Journals (Sweden)

Rove Luiza de Oliveira Chishman

2015-01-01

Full Text Available The present article aims at problematizing the relevance of Frame Semantics (Fillmore, 1982 in the development of Field – Dictionary of Football Expressions – which the configuration allows the access to football language through expressions or through scenarios – or semantic frames. Frame Semantics, a theory developed in the realm of Cognitive Linguistics, is based on empirical data collected from the analysis of electronic corpora. The extraction of the data presented in this study was done with the Sketch Engine concordance, while their analysis was relegated to Frame Semantics. Among the results, it is possible to point out at the manner in which Fillmore´s theory contributes to the analysis of polysemy, presenting the different senses of a lexical unit considering different situations – or different frames – in which they appear. This article also emphasizes the pertinence of corpus linguistics and the processing of corpora as resources that allow the analysis of linguistic constructs present in the texts. It is also important to emphasize the applicability of Frame Semantics to a resource devoted to a non-specialized public, once the theory makes the contextualization of language possible through the everyday routine of the speakers.
A Corpus Based Study on the Use of Preposition of Time "On" and "At" in Argumentative Essays of Form 4 and Form 5 Malaysian Students

Science.gov (United States)

Loke, Darina Lokeman; Ali, Juliana; Anthony, Norin Norain Zulkifli

2013-01-01

This article presents a corpus-based investigation on English prepositions of time presented in the argumentative essays of Form 4 and Form 5 Malaysian secondary students in the MCSAW corpus. The aims were to find out the distribution patterns and the common errors in the use of preposition of time, "on" and "at". This corpus…
Amphetamine-enhanced accumulation of [3H]-spiperone in mouse corpus striatum in vivo: Modification by other drugs

International Nuclear Information System (INIS)

Dorris, R.L.

1989-01-01

Other investigators have reported that amphetamine administered to rodents results in an increase in the in vivo accumulation of either the tritiated dopamine receptor ligand, spiperone or pimozide in the dopaminergic corpus striatum, (specific binding) while not altering that in the sparsely dopaminergically innervated cerebellum (non-specific binding). Experiments were undertaken to determine if the results could be replicated and if some other drugs would modify the effect. Male mice were injected with [ 3 H]-spiperone (20 μCi/Kg, 0.0003 mg/kg) s.c. and killed 2 hrs later for determination of radioactivity in corpus striatum and cerebellum. Amphetamine (20 mg/kg, i.p.) given 15 min before [ 3 H]-spiperone, increased accumulation in striatum but not cerebellum. The increase was inhibited by α - methyltyrosine (α-MT), haloperidol, reserpine or amantadine. It is suggested that the amphetamine-induced increase in accumulation of [ 3 H]-spiperone in corpus striatum (specific binding) depends on release of large amounts of dopamine, which then must be able to interact with the dopamine receptor. The antagonism of the effect by α-MT or reserpine can be explained by dopamine depletion, that of haloperidol by antagonism for binding at the receptor site. It is suggested that amantadine acts by a dual mechanism: (1) as a low efficacy agonist, it competes for binding to the receptor and (2) it has some ability to block dopamine release
Evaluating stance-annotated sentences from the Brexit Blog Corpus: A quantitative linguistic analysis

Directory of Open Access Journals (Sweden)

Simaki Vasiliki

2018-03-01

Full Text Available This paper offers a formally driven quantitative analysis of stance-annotated sentences in the Brexit Blog Corpus (BBC. Our goal is to identify features that determine the formal profiles of six stance categories (contrariety, hypotheticality, necessity, prediction, source of knowledge and uncertainty in a subset of the BBC. The study has two parts: firstly, it examines a large number of formal linguistic features, such as punctuation, words and grammatical categories that occur in the sentences in order to describe the specific characteristics of each category, and secondly, it compares characteristics in the entire data set in order to determine stance similarities in the data set. We show that among the six stance categories in the corpus, contrariety and necessity are the most discriminative ones, with the former using longer sentences, more conjunctions, more repetitions and shorter forms than the sentences expressing other stances. necessity has longer lexical forms but shorter sentences, which are syntactically more complex. We show that stance in our data set is expressed in sentences with around 21 words per sentence. The sentences consist mainly of alphabetical characters forming a varied vocabulary without special forms, such as digits or special characters.
Inner change in the Corpus Paulinum: pointers for pastoral counselling

Directory of Open Access Journals (Sweden)

Y. Campbell-Lane

2007-07-01

Full Text Available The aim of this article is to establish what perspectives exist on inner change within the “Corpus Paulinum” and how it should be applied in pastoral counselling. The Scriptural guidelines of change that will be examined for the purposes of this article, are found in the following references: Ephesians 4:22-24, Colos- sians 3:8-10, and Romans 12:1-2. The work of the Holy Spirit as “Agent of change” will also be discussed and finally some pointers on inner change and the implications for pastoral counselling will be proposed.
Microstructural damage of the posterior corpus callosum contributes to the clinical severity of neglect.

Directory of Open Access Journals (Sweden)

Marco Bozzali

Full Text Available One theory to account for neglect symptoms in patients with right focal damage invokes a release of inhibition of the right parietal cortex over the left parieto-frontal circuits, by disconnection mechanism. This theory is supported by transcranial magnetic stimulation studies showing the existence of asymmetric inhibitory interactions between the left and right posterior parietal cortex, with a right hemispheric advantage. These inhibitory mechanisms are mediated by direct transcallosal projections located in the posterior portions of the corpus callosum. The current study, using diffusion imaging and tract-based spatial statistics (TBSS, aims at assessing, in a data-driven fashion, the contribution of structural disconnection between hemispheres in determining the presence and severity of neglect. Eleven patients with right acute stroke and 11 healthy matched controls underwent MRI at 3T, including diffusion imaging, and T1-weighted volumes. TBSS was modified to account for the presence of the lesion and used to assess the presence and extension of changes in diffusion indices of microscopic white matter integrity in the left hemisphere of patients compared to controls, and to investigate, by correlation analysis, whether this damage might account for the presence and severity of patients' neglect, as assessed by the Behavioural Inattention Test (BIT. None of the patients had any macroscopic abnormality in the left hemisphere; however, 3 cases were discarded due to image artefacts in the MRI data. Conversely, TBSS analysis revealed widespread changes in diffusion indices in most of their left hemisphere tracts, with a predominant involvement of the corpus callosum and its projections on the parietal white matter. A region of association between patients' scores at BIT and brain FA values was found in the posterior part of the corpus callosum. This study strongly supports the hypothesis of a major role of structural disconnection between the
Analyzing Idioms and Their Frequency in Three Advanced ILI Textbooks: A Corpus-Based Study

Science.gov (United States)

Alavi, Sepideh; Rajabpoor, Aboozar

2015-01-01

The present study aimed at identifying and quantifying the idioms used in three ILI "Advanced" level textbooks based on three different English corpora; MICASE, BNC and the Brown Corpus, and comparing the frequencies of the idioms across the three corpora. The first step of the study involved searching the books to find multi-word…
A Corpus-based Study on the Use of Contractions by EFL Learners in Argumentative Essays

Directory of Open Access Journals (Sweden)

M. Pınar Babanoğlu

2017-01-01

Full Text Available Contraction forms in English are mostly occur in speech and informal writing and they are generally avoided in formal writing types such as academic prose, business reports and journal articles, therefore, most teachers discourage their use in academic essays (Biber, Johansonn, Leech, Conrad and Finegan 1999. Contractions in English have two types; negative contractions (isn’t, haven’t, doesn’t and verb contractions (I’m, they’ve, that’s. This corpus based study attempts to investigate contraction usage in learner and native English speaker essays. Major goal is to examine whether learners consider essay writing rules in respect of contractions which are accepted inappropriate for academic prose style. Five corpora, three learner and two native English, were utilized in order to analyze verb and not-contraction forms. Frequency calculations of contraction forms in each corpus compared via log-likelihood measurement for statistical significance. Results revealed that learners use considerably more contraction forms, especially negative ones, than native English students in their argumentative essays.
A case of the corpus callosum and alien hand syndrome from a discrete paracallosal lesion.

Science.gov (United States)

Faber, Raymond; Azad, Alvi; Reinsvold, Richard

2010-08-01

Here we present a patient with an isolated paracallosal brain lesion who exhibited behavioral changes associated with the corpus callosum syndrome (CCS) including features of the alien hand syndrome (AHS). The CCS is also known as the split-brain syndrome, the syndrome of hemisphere disconnection, the syndrome of brain bisection and the syndrome of the cerebral commissures. Because most reported cases of CCS were caused by tumors which extended beyond the corpus callosum (CC) and did not always induce a complete disconnection, there was much controversy about the role of the CC and the existence of a specific CCS. Aside from surgically based cases, the full complement of the CCS is infrequently clinically encountered. The patient described has a classic CCS from natural causes. This case report is unique in exhibiting a complete CCS with AHS secondary to an ischemic event affecting the left pericallosal region. To our knowledge this is the first case report of such a combination.
Enculturating Conversational Agents based on a Comparative Corpus Study

DEFF Research Database (Denmark)

Lipi, Afia Akhter; Yamaoka, Yuji; Rehm, Matthias

2008-01-01

When encountering people who have a different cultural background from our own, many of us feel uncomfortable because gestures and facial expressions may not be familiar to us. Thus, to enhance the believability of conversational agents, culture-specific nonverbal behaviors should be implemented ...... a more detailed analysis about posture shifts, and proposes a chat system with an embodied conversational agent (ECA) that can act as a language trainer....... into the agents. In our previous study, with the goal of building a user interface that incorporates a user’s cultural background, we have collected comparative conversation corpus in Germany and Japan, and investigated the differences in gestures and posture shifts between these two countries. This paper reports...
Chemokines in the corpus luteum: Implications of leukocyte chemotaxis

Directory of Open Access Journals (Sweden)

Liptak Amy R

2003-11-01

Full Text Available Abstract Chemokines are small molecular weight peptides responsible for adhesion, activation, and recruitment of leukocytes into tissues. Leukocytes are thought to influence follicular atresia, ovulation, and luteal function. Many studies in recent years have focused attention on the characterization of leukocyte populations within the ovary, the importance of leukocyte-ovarian cell interactions, and more recently, the mechanisms of ovarian leukocyte recruitment. Information about the role of chemokines and leukocyte trafficking (chemotaxis during ovarian function is important to understanding paracrine-autocrine relationships shared between reproductive and immune systems. Recent advances regarding chemokine expression and leukocyte accumulation within the ovulatory follicle and the corpus luteum are the subject of this mini-review.
ANR Corpus architecturae religiosae europeae [CARE], saec. IV-X

Directory of Open Access Journals (Sweden)

Christian Sapin

2008-07-01

Full Text Available À la fin de l’année 2007, le projet déposé auprès de l’Agence nationale de la recherche (ANR et consacré à la constitution d’un corpus des monuments religieux (CARE antérieurs à l’an Mil a été retenu. Il correspond au volet propre à la France. En effet, plusieurs pays, dont l’Italie, l’Espagne, la République Tchèque, la Slovaquie, la Pologne et la Croatie ont commencé depuis deux ans les travaux préparatoires à cette ambitieuse entreprise ; la Grèce est, depuis, intéressée, de même que l’Al...
Seimo posėdžių stenogramų tekstynas autorystės nustatymo bei autoriaus profilio sudarymo tyrimams | Corpus of transcribed parliamentary speeches for authorship attribution and author profiling tasks

Directory of Open Access Journals (Sweden)

Jurgita Kapočiūtė-Dzikienė

2014-12-01

Full Text Available In our paper we present a corpus of transcribed Lithuanian parliamentary speeches. The corpus is prepared in a specific format, appropriate for different authorship identification tasks. The corpus consists of approximately 111 thousand texts (24 million words. Each text matches one parliamentary speech produced during an ordinary session from the period of 7 parliamentary terms starting on March 10, 1990 and ending on December 23, 2013. The texts are grouped into 147 categories corresponding to individual authors, therefore they can be used for authorship attribution tasks; besides, these texts are also grouped according to age, gender and political views, therefore they are also suitable for author profiling tasks. Whereas short texts complicate recognition of author speaking style and are ambiguous in relation to the style of other authors, we incorporated only texts containing not less than 100 words into the corpus. In order to make each category as comprehensive and representative as possible, we included only those authors, who produced speeches at least 200 times. All the texts are lemmatized, morphologically and syntactically annotated, tokenized into the character n-grams. The statistical information of the corpus is also available. We have also demonstrated that the created corpus can be effectively used in authorship attribution and author profiling tasks with supervised machine learning methods. The corpus structure also allows using it with unsupervised machine learning methods and can be used for creation of rule-based methods, as well as in different linguistic analyses.
The incidence rate of corpus uteri cancer among females in Saudi Arabia: an observational descriptive epidemiological analysis of data from Saudi Cancer Registry 2001–2008

Directory of Open Access Journals (Sweden)

Alghamdi IG

2014-01-01

Full Text Available Ibrahim G Alghamdi,1 Issam I Hussain,1 Mohamed S Alghamdi,2 Mohamed A El-Sheemy1,3 1University of Lincoln, Brayford Pool, Lincoln, UK; 2Ministry of Health, General Directorate of Health Affairs, Al-Baha, Kingdom of Saudi Arabia; 3Research and Development, Lincoln Hospital, Lincolnshire Hospitals NHS Trust, Lincoln, UK Background: The present study reviews the epidemiological data on corpus uteri cancer among Saudi women, including its frequency, crude incidence rate, and age-standardized incidence rate (ASIR, adjusted by region and year of diagnosis. Methods: A retrospective, descriptive epidemiological analysis was conducted of all the corpus uteri cancer cases recorded in the Saudi Cancer Registry between January 2001 and December 2008. The statistical analyses were performed using descriptive statistics, analysis of variance, Poisson regression, and a simple linear model. Results: A total of 1,060 corpus uteri cancer cases were included. Women aged 60–74 years of age were most affected by the disease. The region of Riyadh in Saudi Arabia had the highest overall ASIR, at 4.4 cases per 100,000 female patients, followed by the eastern region, at 4.2, and Makkah, at 3.7. Jazan, Najran, and Qassim had the lowest average ASIRs, ranging from 0.8 to 1.4. A Poisson regression model using Jazan as the reference revealed that the corpus uteri cancer incidence rate ratio was significantly higher for the regions of Makkah, at 16.5 times (95% confidence interval [CI]: 8.0–23.0, followed by Riyadh, at 16.0 times (95% CI: 9.0–22.0, and the eastern region, at 9.9 times (95% CI: 5.6–17.6. The northern region experienced the highest changes in ASIRs of corpus uteri cancer among female Saudi patients between 2001 and 2008. Conclusion: There was a slight increase in the crude incidence rates and ASIRs for corpus uteri cancer in Saudi Arabia between 2001 and 2008. Older Saudi women were most affected by the disease. Riyadh, the eastern region, and Makkah

Illusory conjunctions in visual short-term memory: Individual differences in corpus callosum connectivity and splitting attention between the two hemifields.

Science.gov (United States)

Qin, Shuo; Ray, Nicholas R; Ramakrishnan, Nithya; Nashiro, Kaoru; O'Connell, Margaret A; Basak, Chandramallika

2016-11-01

Overloading the capacity of visual attention can result in mistakenly combining the various features of an object, that is, illusory conjunctions. We hypothesize that if the two hemispheres separately process visual information by splitting attention, connectivity of corpus callosum-a brain structure integrating the two hemispheres-would predict the degree of illusory conjunctions. In the current study, we assessed two types of illusory conjunctions using a memory-scanning paradigm; the features were either presented across the two opposite hemifields or within the same hemifield. Four objects, each with two visual features, were briefly presented together followed by a probe-recognition and a confidence rating for the recognition accuracy. MRI scans were also obtained. Results indicated that successful recollection during probe recognition was better for across hemifields conjunctions compared to within hemifield conjunctions, lending support to the bilateral advantage of the two hemispheres in visual short-term memory. Age-related differences regarding the underlying mechanisms of the bilateral advantage indicated greater reliance on recollection-based processing in young and on familiarity-based processing in old. Moreover, the integrity of the posterior corpus callosum was more predictive of opposite hemifield illusory conjunctions compared to within hemifield illusory conjunctions, even after controlling for age. That is, individuals with lesser posterior corpus callosum connectivity had better recognition for objects when their features were recombined from the opposite hemifields than from the same hemifield. This study is the first to investigate the role of the corpus callosum in splitting attention between versus within hemifields. © 2016 Society for Psychophysiological Research.
The relationship between early life stress and microstructural integrity of the corpus callosum in a non-clinical population

Directory of Open Access Journals (Sweden)

Robert Paul

2008-03-01

Full Text Available Robert Paul1, Lorrie Henry2, Stuart M Grieve3, Thomas J Guilmette2,4, Raymond Niaura4, Richard Bryant5, Steven Bruce1, Leanne M Williams3,6, Clark C Richard7, Ronald A Cohen4, Evian Gordon3,71University of Missouri, St. Louis, St. Louis, MO, USA; 2Providence College, Providence, RI, USA; 3The Brain Resource International Database, The Brain Resource Company, Ultimo, NSW, Australia; 4Brown Medical School, Department of Psychiatry, Providence, RI, USA; 5School of Psychology, University of New South Wales, Sydney, NSW, Australia; 6Brain Dynamics Centre, Westmead Millennium Institute, Westmead Hospital, Westmead, NSW, Australia; 7Cognitive Neuroscience Laboratory and School of Psychology, Flinders University, Adelaide, SA, AustraliaBackground: Previous studies have examined the impact of early life stress (ELS on the gross morphometry of brain regions, including the corpus callosum. However, studies have not examined the relationship between ELS and the microstructural integrity of the brain.Methods: In the present study we evaluated this relationship in healthy non-clinical participants using diffusion tensor imaging (DTI and self-reported history of ELS.Results: Regression analyses revealed significant reductions in fractional anisotropy (FA within the genu of the corpus callosum among those exposed to the greatest number of early life stressors, suggesting reduced microstructural integrity associated with increased ELS. These effects were most pronounced in the genu of the corpus callosum compared to the body and splenium, and were evident for females rather than males despite no differences in total ELS exposure between the sexes. In addition, a further comparison of those participants who were exposed to no ELS vs. three or more ELS events revealed lower FA in the genu of the corpus callosum among the ELS-exposed group, with trends of FA reduction in the body and the whole corpus callosum. By contrast, there were no relationships between ELS
Different mechanisms for lead acetate, aluminum and cadmium sulfate in rat corpus cavernosum

International Nuclear Information System (INIS)

Senbel, Amira M.; Saad, Evan I.; Taha, Safaa S.; Mohamed, Hosny F.

2016-01-01

Introduction: Some heavy metals show adverse vascular and neurological effects, however, their effect on erection is underestimated. This study aims to investigate the effect of Pb, Cd and Al on erectile function and their potential mechanism of action in rats. Methods: Measurement of intracavernosal pressure/mean arterial pressure (ICP/MAP) changes elicited by electrical stimulation of cavernous nerve in anesthetized rats treated with Pb-acetate, Al-sulfate, or Cd-sulfate acutely, and subacutely for 7 days. Serum creatinine, testosterone, TBARs, GSH levels and metal accumulation in corpus cavernosum were measured. Results: Pb, Al and Cd significantly reduced ICP/MAP in rats after acute (2,10–2,10 and 1,3 mg/kg respectively) and sub-acute (3, 3, and 1 mg/kg/day respectively) treatments. They selectively accumulated in the corpus cavernosum reaching 25.107 ± 2.081 μg/g wet weight for Pb, 1.029 ± 0.193 for Cd, 31.343 ± 1.991 for Al, compared to 7.084 ± 1.517, 0.296 ± 0.067, and 8.86 ± 1.115 as controls respectively. Serum creatinine levels were not altered. Cd and Al significantly reduced testosterone level to 0.483 ± 0.059 and 0.419 ± 0.037 ng/ml respectively compared to 0.927 ± 0.105 ng/ml as control. Aluminum elevated TBARs significantly by 27.843%. The acute anti-erectile action of Pb was blocked by non-selective NOS and GC inhibitors and potassium channel blocker. Lead also masked the potentiatory effect of L-arginine and diazoxide on ICP/MAP. No interaction with muscarinic or nicotinic modulators was observed. Conclusions: Pb, Cd and Al show anti-erectile effect independent on renal injury. They don not modulate cholinergic nor ganglionic transmission in corpus cavernosum. Pb may inhibit NO/cGMP/K + channel pathway. The effect of Cd and Al but not Pb seems to be hormonal dependent.
Murine Models of Gastric Corpus PreneoplasiaSummary

Directory of Open Access Journals (Sweden)

Christine P. Petersen

2017-01-01

Full Text Available Intestinal-type gastric adenocarcinoma evolves in a field of pre-existing metaplasia. Over the past 20 years, a number of murine models have been developed to address aspects of the physiology and pathophysiology of metaplasia induction. Although none of these models has achieved true recapitulation of the induction of adenocarcinoma, they have led to important insights into the factors that influence the induction and progression of metaplasia. Here, we review the pathologic definitions relevant to alterations in gastric corpus lineages and classification of metaplasia by specific lineage markers. In addition, we review present murine models of the induction and progression of spasmolytic polypeptide (TFF2âexpressing metaplasia, the predominant metaplastic lineage observed in murine models. These models provide a basis for the development of a broader understanding of the physiological and pathophysiological roles of metaplasia in the stomach. Keywords: SPEM, Intestinal Metaplasia, Gastric Cancer, TFF2, Chief Cell, Hyperplasia
A Corpus-based Study of EFL Learners’ Errors in IELTS Essay Writing

OpenAIRE

Hoda Divsar; Robab Heydari

2017-01-01

The present study analyzed different types of errors in the EFL learners’ IELTS essays. In order to determine the major types of errors, a corpus of 70 IELTS examinees’ writings were collected, and their errors were extracted and categorized qualitatively. Errors were categorized based on a researcher-developed error-coding scheme into 13 aspects. Based on the descriptive statistical analyses, the frequency of each error type was calculated and the commonest errors committed by the EFL learne...
Hereditary spastic paraplegia associated with thin corpus callosum Paraplegia espástica hereditária associada a hipoplasia de corpo caloso

Directory of Open Access Journals (Sweden)

Hélio A. Ghizoni Teive

2001-09-01

Full Text Available Autosomal recessive hereditary spastic paraplegia (AR-HSP associated with thin corpus callosum was recently described in Japan, and most families were linked to chromosome 15q13-15. We report two patients from two different Brazilian families with progressive gait disturbance starting at the second decade of life, spastic paraparesis, and mental deterioration. One patient presented cerebellar ataxia. Magnetic resonance imaging (MRI of the head of both patients showed a thin corpus callosum. AR-HSP with a thin corpus callosum is a rare disorder, mainly described in Japanese patients. We found only 4 Caucasian families with AR-HSP with thin corpus callosum described in the literature. Further studies including additional Caucasian families of AR-HSP with thin corpus callosum are required to delineate the genetic profile of this syndrome in occidental countries.A paraplegia espástica hereditária autossômica recessiva (PEH-AR associada com hipoplasia de corpo caloso foi inicialmente descrita no Japão. Estudos de ligação genética mostram que a maioria das famílias estão relacionadas ao cromossomo 15q13-15. Relatamos dois pacientes de famílias brasileiras, não relacionadas, com distúrbio de marcha com início na segunda década de vida, paraparesia espástica e comprometimento das funções cognitivas. Um dos pacientes apresentava ataxia cerebelar. A ressonância magnética de encéfalo de ambos os pacientes mostrou hipoplasia de corpo caloso. PEH-AR associada com hipoplasia de corpo caloso é uma condição rara, descrita principalmente em pacientes do Japão. Encontramos apenas 4 famílias caucasianas com PEH-AR e hipoplasia de corpo caloso. Mais estudos com famílias caucasianas são necessários para delinear o perfil genético dessa síndrome em países ocidentais.
DESIGNING EAP MATERIALS BASED ON INTERCULTURAL CORPUS ANALYSES: THE CASE OF LOGICAL MARKERS IN RESEARCH ARTICLES

Directory of Open Access Journals (Sweden)

Pilar Mur Dueñas

2009-10-01

Full Text Available The ultimate aim of intercultural analyses in English for Academic Purposes is to help non-native scholars function successfully in the international disciplinary community in English. The aim of this paper is to show how corpus-based intercultural analyses can be useful to design EAP materials on a particular metadiscourse category, logical markers, in research article writing. The paper first describes the analysis carried out of additive, contrastive and consecutive logical markers in a corpus of research articles in English and in Spanish in a particular discipline, Business Management. Differences were found in their frequency and also in the use of each of the sub-categories. Then, five activities designed on the basis of these results are presented. They are aimed at raising Spanish Business scholars' awareness of the specific uses and pragmatic function of frequent logical markers in international research articles in English.
Análisis de sentiminetos de un corpus de redes sociales.

OpenAIRE

Aguado de Cea, G.; Montiel-Ponsoda, Elena; Rodríguez Doncel, Víctor; Barrios, MA.; Bernardos Galindo, María del Socorro; Muñoz-García, O.; Rodriguez-Doncel, Victor

2012-01-01

El análisis de sentimientos de textos en las redes sociales se ha convertido en un área de investigación cada vez más relevante debido a la influencia que las opiniones expresadas tienen en potenciales usuarios. De acuerdo con una clasificación conceptual de sentimientos y basándonos en un corpus de diversos dominios comerciales, hemos trabajado en la confección de reglas que permitan la clasificación de dichos textos según el sentimiento expresado con respecto a una marca, empresa o producto...
Integrity of the corpus callosum in patients with periventricular nodular heterotopia related epilepsy by FLNA mutation.

Science.gov (United States)

Liu, Wenyu; An, Dongmei; Niu, Running; Gong, Qiyong; Zhou, Dong

2018-01-01

To investigate the quantitative diffusion properties of the corpus callosum (CC) in a large group of patients with periventricular nodular heterotopia (PNH) related epilepsy and to further investigate the effect of Filamin A ( FLNA ) mutation on these properties. Patients with PNH (n = 34), subdivided into FLNA -mutated (n = 11) and FLNA -nonmutated patients (n = 23) and healthy controls (n = 34), underwent 3.0 T structural MRI and diffusion imaging scan (64 direction). Fractional anisotropy (FA) and mean diffusivity (MD) were measured in the three major subdivisions of the CC (genu, body and splenium). Correlations between DTI metric changes and clinical parameters were also evaluated. Furthermore, the effect of FLNA mutation on structural integrity of the corpus callosum was examined. Patients with PNH and epilepsy had significant reductions in FA for the genu and splenium of the CC, accompanied by increases in MD for the splenium, as compared to healthy controls. There were no correlations between clinical parameters of epilepsy and MD. The FA value in the splenium negatively correlated with epilepsy duration. Interestingly, FLNA -mutated patients showed significantly decreased FA for all three major subdivisions of the CC, and increased MD for the genu and splenium, as compared to HCs and FLNA -nonmutated patients. These findings support the conclusion that patients with epilepsy secondary to PNH present widespread microstructural changes found in the corpus callosum that extend beyond the macroscopic MRI-visible lesions. This study also indicates that FLNA may affect white matter integrity in this disorder.
Integrity of the corpus callosum in patients with periventricular nodular heterotopia related epilepsy by FLNA mutation

Directory of Open Access Journals (Sweden)

Wenyu Liu

2018-01-01

Conclusions: These findings support the conclusion that patients with epilepsy secondary to PNH present widespread microstructural changes found in the corpus callosum that extend beyond the macroscopic MRI-visible lesions. This study also indicates that FLNA may affect white matter integrity in this disorder.
Cerebral Visual Impairment and Dysgenesis of Corpus Callosum in Multidisabled Children Aged 1 to 9 Years Old

Directory of Open Access Journals (Sweden)

Roxana CZIKER

2009-12-01

Full Text Available Aims: To emphasize the functional vision characteristics in visually impaired multiple disabled children (MDVI aged 2 to 9 years old related to brain damages on magnetic resonance imaging in different cortical and subcortical areas and in the corpus callosum region. Material and Method: 12 MDVI children with severe and mild neurological disorders were medically and neuropsychological assessed. The clinical - psychological, neurological and ophthalmological – and paraclinical methods – visual evoked potential (VEP and magnetic resonance imaging (MRI were carried out in order to outline the complete profile of each child. The assessment was completed by morphometric measurement of corpus callosum and brain. Results: 10 of infants with severe neurological disorders showed ocular disorders such as ocular motility and visual function abnormalities. Severe cognitive and psychomotor retardation were associated in visual disorders in MDVI children. Significant correlation between neurological disorders, neuropsychological [τ(12 = 0.783, p = 0.001] evaluation and visual acuity [τ(12 = 0.783, p = 0.001] were found in multiple disabled children. The significant difference of diameter [t(22 = -4.858, p = 0.000] and surface of corpus callosum [t(22 = -6.254, p = 0.000] in multiple disabled children compared with control group was found. Conclusion: The structured assessment of visually impaired children due to neurological disorders, as early as possible, is the remarkably key which reveals the functionality of child and outlines the appropriate developmental and educational rehabilitation.
Polysemous Verbs and Modality in Native and Non-Native Argumentative Writing: A Corpus-Based Study

Science.gov (United States)

Salazar, Danica; Verdaguer, Isabel

2009-01-01

The present study is a corpus-based analysis of a selection of polysemous lexical verbs used to express modality in student argumentative writing. Twenty-three lexical verbs were searched for in three 100,000-word corpora of argumentative essays written in English by American, Filipino and Spanish university students. Concordance lines were…
Tag Questions across Irish English and British English: A Corpus Analysis of Form and Function

Science.gov (United States)

Barron, Anne; Pandarova, Irina; Muderack, Karoline

2015-01-01

The present study, situated in the area of variational pragmatics, contrasts tag question (TQ) use in Ireland and Great Britain using spoken data from the Irish and British components of the International Corpus of English (ICE). Analysis is on the formal and functional level and also investigates form-functional relationships. Findings reveal…
The Brazilian Theory of Habeas Corpus for Great Apes

Directory of Open Access Journals (Sweden)

Heron José de Santana Gordilho

2016-06-01

Full Text Available This essay presents a comparison between human evolution and legal developments, trying to demonstrate how darwinian theory of evolution by natural selection has caused changes in the legal world, the bridge of today some lawyers using the recent discoveries about how similar genetic between man and great primates to claim extension of human rights for chimpanzees, bonobos, gorillas and orangs. It also that many activists for animal`s rights have considered the dispute an important strategy, whether to set new means for legal institutes such as the Habeas Corpus, hitherto used only to ensure human freedom, whether to increase the movement and increase the conscietization of the general population about the importance of the recognition of animals as holders of basic rights.
Embedding epistemic modals in English: A corpus-based study

Directory of Open Access Journals (Sweden)

Valentine Hacquard

2012-07-01

Full Text Available The question of whether epistemic modals contribute to the truth conditions of the sentences they appear in is a matter of active debate in the literature. Fueling this debate is the lack of consensus about the extent to which epistemics can appear in the scope of other operators. This corpus study investigates the distribution of epistemics in naturalistic data. Our results indicate that they do embed, supporting the view that they contribute semantic content. However, their distribution is limited, compared to that of other modals. This limited distribution seems to call for a nuanced account: while epistemics are semantically contentful, they may require special licensing conditions. http://dx.doi.org/10.3765/sp.5.4 BibTeX info
Are translations longer than source texts? A corpus-based study of explicitation

OpenAIRE

Frankenberg-Garcia, A

2009-01-01

Explicitation is the process of rendering information which is only implicit in the source text explicit in the target text, and is believed to be one of the universals of translation (Blum-Kulka 1986, Olohan and Baker 2000, Øverås 1998, Séguinot 1988, Vanderauwera 1985). The present study uses corpus technology to attempt to shed some light on the complex relationship between translation, text length and explicitation. An awareness of what makes translations longer (or shorter) and more expl...
A corpus-based analysis of textbooks used in the orientation course for immigrants in Germany: Ideological and pedagogic implications

Directory of Open Access Journals (Sweden)

Leung Ray C. H.

2016-09-01

Full Text Available Contextualized within immigrants’ acquisition of specialized knowledge about the host country at the institutional level, this article examines a 64295-word corpus of textbooks written for participants of the orientation course in German politics, history and culture. Corpus-based techniques (“keyness,” collocation and qualitative examination of concordance lines are deployed to explore the corpus. The findings reveal that the collocational patterns of the identified keywords construct particular world views vis-à-vis Germany. For instance, the keyword DDR [German Democratic Republic (GDR, aka East Germany] frequently co-occurs with negatively connoted lexis while collocates of the keywords denoting present-day Germany (e.g., Bundesrepublik Deutschland [Federal Republic of Germany] and Staat [nation, country, state] facilitate the portrayal of Germany as a nurturing welfare state that is popular among foreigners. It is argued that such discursively-construed opposition between the “bad” GDR and the “good” Federal Republic of Germany helps to legitimize the German reunification. Furthermore, it is found that certain keywords (e.g., Sie [you], Kurs [course, class] and z.B. [e.g.] are “metadiscourse resources” (Hyland, 2005. Their pedagogic effects are discussed in relation to the ideological implications of the research findings.
Informação como objeto para construção do corpus interdisciplinar entre Ciência da Informação e Ciência da Administração | Information from object to construction to interdisciplinary corpus between Information Science and Administrative Science

Directory of Open Access Journals (Sweden)

Joaquim Francisco Cavalcante de Oliveira

2011-03-01

Full Text Available Resumo Descrição e análise dos fundamentos teóricos e procedimentos metodológicos para a construção de um corpus interdisciplinar, compreendido como “o conjunto de categorias interdisciplinares afins e de significados próximos, valorizados por duas ou mais Ciências”, nesta pesquisa entre a Ciência da Informação e a Ciência da Administração, com base na informação e gestão da informação. As etapas de seu desenvolvimento incluem o referencial teórico em torno da interdisciplinaridade das áreas estudadas, especialmente Heckhausen; pesquisa documental para identificação dos autores de Ciência da Informação e de Ciência da Administração, identificados via Currículo Lattes e Diretório de Grupos de Pesquisa do CNPq e autores estrangeiros por meio do Google. A partir da análise de citação e a análise de conteúdo em revistas brasileiras de Administração e Ciência da Informação selecionadas e da identificação dos conceitos contidos em artigos, foi construído o corpus interdisciplinar. Esta metodologia tem aplicação mais ampla e pode ser adotada em pesquisas semelhantes de interdisciplinaridade de outros campos do conhecimento. Palavras-chave interdisciplinaridade; Ciência da Informação; Ciência da Administração; informação; metodologia da pesquisa; corpus interdisciplinar; gestão da informação. Abstract A description and analysis of the theoretical foundations and methodological procedures in the development of an interdisciplinary corpus, defined/described as “a group of related interdisciplinary categories and similar meanings, recognized by two or more Sciences”, the article presents results of research between Information Science and Administrative Science, based on information and information management. The stages of its development include the theoretical referential that surrounds the interdisciplinarity of the studied areas, especially Heckhausen: documental research for the
Draft Site Management and Monitoring Plan for Corpus Christi Maintenance and New Work Ocean Dredged Material Disposal Site

Science.gov (United States)

USEPA Region 6 and the US Army Corps of Engineers submit for public comment the Draft Site Management and Monitoring Plan for Corpus Christi Maintenance and New Work Ocean Dredged Material Disposal Site
Comment constituer et traiter des corpus issus des médias sociaux à l’aide des CAQDAS? NVivo : un instrument pour l’analyse des discours numériques

Directory of Open Access Journals (Sweden)

Ferrari Giovannipaolo

2015-01-01

Full Text Available Avec l’apparition des médias sociaux, le chercheur en Sciences Humaines et Sociales a vu s’ouvrir des possibilités de création de corpus à partir des données disponibles en ligne. Le risque dans ces modalités de création de corpus est de créer de grandes bases de données difficilement gérables avec les outils traditionnels et surtout dans le cadre d’une recherche qualitative. Pour cette raison, il semble important de mobiliser des outils numériques pour l’aide à la recherche. Ces logiciels sont appelés CAQDAS. Cette contribution montre comment les utiliser pour une enquête de terrain en ligne ou numérique afin d’organiser et d’analyser un corpus. Ce corpus est constitué en vue d’une analyse des discours produits dans le contexte professionnel de la radio utilisant les nouveaux médias.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.