folklore text corpora: Topics by WorldWideScience.org

Sample records for folklore text corpora

Two approaches to gathering text corpora from the WorldWideWeb

CSIR Research Space (South Africa)

Botha, G

2005-11-01

Full Text Available Many applications of pattern recognition to natural language processing require large text corpora in a specified language. For many of the languages of the world, such corpora are not readily available, but significant quantities of text...
Segmenting corpora of texts (Segmentação de corpora de textos)

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2002-01-01

Full Text Available The aim of the research presented here is to report on a corpus-based method for discourse analysis that is based on the notion of segmentation, or the division of texts into cohesive portions. For the purposes of this investigation, a segment is defined as a contiguous portion of written text consisting of at least two sentences. The segmentation procedure developed for the study is called LSM (link set median), which is based on the identification of lexical repetition in text. The data analysed in this investigation were three corpora of 100 texts each. Each corpus was composed of texts of one particular genre: research articles, annual business reports, and encyclopaedia entries. The three corpora totalled 1,262,710 words. The segments inserted in the texts by the LSM procedure were compared to the internal section divisions in the texts. The results obtained through the LSM procedure were then compared to segmentation carried out at random. The results indicated that the LSM procedure worked better than random, suggesting that lexical repetition accounts in part for the way texts are segmented into sections.
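
To illustrate segmentation driven by lexical repetition, the sketch below counts the word types shared across each sentence gap and splits the text at the gap with the fewest shared types. It is not the LSM procedure itself, whose details the abstract does not give. The function names, the crude content-word filter, and the single-split strategy are all assumptions made for illustration; only the two-sentence minimum segment length comes from the abstract.

```python
import re

def content_words(sentence):
    """Lowercased word types longer than 3 letters (a crude, assumed stand-in for content words)."""
    return {w for w in re.findall(r"[a-z]+", sentence.lower()) if len(w) > 3}

def boundary_scores(sentences):
    """For each gap between sentence i and i+1, count the word types
    that occur both before and after the gap (lexical links across it)."""
    words = [content_words(s) for s in sentences]
    scores = []
    for gap in range(1, len(sentences)):
        before = set().union(*words[:gap])
        after = set().union(*words[gap:])
        scores.append(len(before & after))
    return scores

def segment(sentences, min_len=2):
    """Split once at the weakest gap, keeping segments of at least min_len sentences."""
    scores = boundary_scores(sentences)
    candidates = list(range(min_len, len(sentences) - min_len + 1))
    if not candidates:
        return [sentences]
    best = min(candidates, key=lambda g: scores[g - 1])
    return [sentences[:best], sentences[best:]]

example = [
    "The corpus contains annual reports.",
    "Annual reports describe company profits.",
    "Folklore tales involve dragons.",
    "Dragons guard treasure in tales.",
]
print(boundary_scores(example))
print(segment(example))
```

A gap with few repeated words across it is treated as a likely section boundary, which is the intuition the abstract attributes to lexical repetition.
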
Folklore in Antiquity

Directory of Open Access Journals (Sweden)

Galit Hasan-Rokem

2018-05-01

Full Text Available Folklore exists in all human groups, small and big. Since early modernity, scholars have provided various definitions of the phenomenon, but earlier texts may also reveal awareness of and reflection on the specific character of folklore. In this short article, we wish to explore the various definitions and characterizations of folklore given by ancient writers from various times and cultures. We will try to draw a cultural map of awareness of the phenomenon of folklore in ancient Near-Eastern texts, Greco-Roman culture, the Hebrew Bible, Early Christianity and Rabbinic literature. The main questions we wish to deal with are whether and where we can find explicit mention of folklore, which folk genres are dominant in ancient writings, and what the social context of ancient folklore was. That is to say, how were those texts integrated in social frameworks, enabling their users to gain power or to undermine existing cultural, theological and social structures?
A linear-RBF multikernel SVM to classify big text corpora.

Science.gov (United States)

Romero, R; Iglesias, E L; Borrajo, L

2015-01-01

The support vector machine (SVM) is a powerful technique for classification. However, SVM is not suitable for classification of large datasets or text corpora, because the training complexity of SVMs is highly dependent on the input size. Recent developments in the literature on the SVM and other kernel methods emphasize the need to consider multiple kernels or parameterizations of kernels because they provide greater flexibility. This paper presents a multikernel SVM to manage high-dimensional data, providing an automatic parameterization with low computational cost and improving on the results of SVMs parameterized by brute-force search. The model consists of spreading the dataset into cohesive term slices (clusters) to construct a defined structure (multikernel). The new approach is tested on different text corpora. Experimental results show that the new classifier has good accuracy compared with the classic SVM, while the training is significantly faster than several other SVM classifiers.
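
A minimal sketch of how a linear and an RBF kernel can be combined over feature slices, assuming a fixed mixing weight and hand-chosen slice boundaries. The abstract does not specify how the paper builds or parameterizes its multikernel, so the names `multikernel`, `weight`, `gamma`, and `slices` are hypothetical. The sketch relies only on the standard fact that a non-negative weighted sum of valid kernels is itself a valid kernel.

```python
import math

def linear_kernel(x, y):
    """Dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(x, y))

def rbf_kernel(x, y, gamma=0.5):
    """Gaussian (RBF) kernel exp(-gamma * ||x - y||^2)."""
    sq_dist = sum((a - b) ** 2 for a, b in zip(x, y))
    return math.exp(-gamma * sq_dist)

def multikernel(x, y, slices, weight=0.5, gamma=0.5):
    """Average, over feature slices, of weight*linear + (1 - weight)*RBF.
    Each slice is a (start, stop) index pair into the feature vector."""
    total = 0.0
    for start, stop in slices:
        xs, ys = x[start:stop], y[start:stop]
        total += weight * linear_kernel(xs, ys) + (1 - weight) * rbf_kernel(xs, ys, gamma)
    return total / len(slices)

x = [1.0, 0.0, 2.0, 1.0]
y = [0.0, 1.0, 2.0, 0.0]
slices = [(0, 2), (2, 4)]  # two assumed term clusters
print(multikernel(x, x, slices), multikernel(x, y, slices))
```

In practice such a function would be evaluated on all pairs of training documents to fill a precomputed kernel matrix for an SVM solver.
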
Representativeness in corpora of literary texts: introducing the C18P project

Directory of Open Access Journals (Sweden)

Gemeinböck, Iris

2016-07-01

Full Text Available Currently there are very few specialised corpora of literary texts that are tailored to the needs of literary critics who are interested in corpus stylistic analyses of prose fiction. Many existing corpora including literary texts were compiled for linguistic research interests and are often unsuitable for corpus stylistic purposes. The paper addresses three of the main problems: the absence of labelling of the texts for literary genre, the use of extracts, and the prevalence of linguistic periodisation schemes. C18P is a corpus of prose fiction designed specifically to address these issues. It traces the early development of the novel from 1700 up until the Victorian era. It can, for instance, be used for an analysis of the characteristic linguistic features of individual literary genres and forms. The following paper introduces the design of the corpus as well as some of its potential uses.
Redundancy in electronic health record corpora: analysis, impact on text mining performance and mitigation strategies.

Science.gov (United States)

Cohen, Raphael; Elhadad, Michael; Elhadad, Noémie

2013-01-16

The increasing availability of Electronic Health Record (EHR) data and specifically free-text patient notes presents opportunities for phenotype extraction. Text-mining methods in particular can help disease modeling by mapping named-entity mentions to terminologies and clustering semantically related terms. EHR corpora, however, exhibit specific statistical and linguistic characteristics when compared with corpora in the biomedical literature domain. We focus on copy-and-paste redundancy: clinicians typically copy and paste information from previous notes when documenting a current patient encounter. Thus, within a longitudinal patient record, one expects to observe heavy redundancy. In this paper, we ask three research questions: (i) How can redundancy be quantified in large-scale text corpora? (ii) Conventional wisdom is that larger corpora yield better results in text mining. But how does the observed EHR redundancy affect text mining? Does such redundancy introduce a bias that distorts learned models? Or does the redundancy introduce benefits by highlighting stable and important subsets of the corpus? (iii) How can one mitigate the impact of redundancy on text mining? We analyze a large-scale EHR corpus and quantify redundancy both in terms of word and semantic concept repetition. We observe redundancy levels of about 30% and non-standard distribution of both words and concepts. We measure the impact of redundancy on two standard text-mining applications: collocation identification and topic modeling. We compare the results of these methods on synthetic data with controlled levels of redundancy and observe significant performance variation. Finally, we compare two mitigation strategies to avoid redundancy-induced bias: (i) a baseline strategy, keeping only the last note for each patient in the corpus; (ii) removing redundant notes with an efficient fingerprinting-based algorithm. (a) For text mining, preprocessing the EHR corpus with fingerprinting yields
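
The idea of fingerprint-based redundancy removal can be sketched as follows, assuming word 5-gram shingles hashed with SHA-256 and an arbitrary 0.5 cut-off. The abstract does not describe the authors' actual fingerprinting algorithm, so the function names, the shingle size, and the threshold are illustrative assumptions, not their method.

```python
import hashlib

def fingerprints(text, n=5):
    """Set of hashed word n-grams (shingles) for one note."""
    words = text.lower().split()
    grams = (" ".join(words[i:i + n]) for i in range(max(len(words) - n + 1, 1)))
    return {hashlib.sha256(g.encode("utf-8")).hexdigest() for g in grams}

def redundancy(note, earlier_notes, n=5):
    """Fraction of the note's fingerprints already seen in earlier notes."""
    seen = set()
    for prev in earlier_notes:
        seen |= fingerprints(prev, n)
    current = fingerprints(note, n)
    return len(current & seen) / len(current)

def drop_redundant(notes, threshold=0.5, n=5):
    """Keep a note only if its redundancy against already kept notes is below threshold."""
    kept = []
    for note in notes:
        if not kept or redundancy(note, kept, n) < threshold:
            kept.append(note)
    return kept

a = "patient reports mild chest pain after exercise no fever"
b = a  # a copy-and-pasted note
c = "follow up visit shows normal blood pressure and stable weight"
print(redundancy(b, [a]), redundancy(c, [a]))
print(drop_redundant([a, b, c]))
```

The same `redundancy` score also gives a simple way to quantify copy-and-paste overlap within one patient's record before any notes are removed.
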
Ontology-based retrieval of bio-medical information based on microarray text corpora

DEFF Research Database (Denmark)

Hansen, Kim Allan; Zambach, Sine; Have, Christian Theil

are exponentially growing, the text corpora are sparse and inconsistent in spite of attempts to standardize the format. Ordinary keyword search may in some cases be insufficient to find relevant information and the potential benefit of using a semantic approach in this context has only been investigated to a limited...
Using machine learning to disentangle homonyms in large text corpora.

Science.gov (United States)

Roll, Uri; Correia, Ricardo A; Berger-Tal, Oded

2018-06-01

Systematic reviews are an increasingly popular decision-making tool that provides an unbiased summary of evidence to support conservation action. These reviews bridge the gap between researchers and managers by presenting a comprehensive overview of all studies relating to a particular topic and identify specifically where and under which conditions an effect is present. However, several technical challenges can severely hinder the feasibility and applicability of systematic reviews, for example, homonyms (terms that share spelling but differ in meaning). Homonyms add noise to search results and cannot be easily identified or removed. We developed a semiautomated approach that can aid in the classification of homonyms among narratives. We used a combination of automated content analysis and artificial neural networks to quickly and accurately sift through large corpora of academic texts and classify them to distinct topics. As an example, we explored the use of the word reintroduction in academic texts. Reintroduction is used within the conservation context to indicate the release of organisms to their former native habitat; however, a Web of Science search for this word returned thousands of publications in which the term has other meanings and contexts. Using our method, we automatically classified a sample of 3000 of these publications with over 99% accuracy, relative to a manual classification. Our approach can be used easily with other homonyms and can greatly facilitate systematic reviews or similar work in which homonyms hinder the harnessing of large text corpora. Beyond homonyms we see great promise in combining automated content analysis and machine-learning methods to handle and screen big data for relevant information in conservation science. © 2017 Society for Conservation Biology.
Automatic extraction of property norm-like data from large text corpora.

Science.gov (United States)

Kelly, Colin; Devereux, Barry; Korhonen, Anna

2014-01-01

Traditional methods for deriving property-based representations of concepts from text have focused on either extracting only a subset of possible relation types, such as hyponymy/hypernymy (e.g., car is-a vehicle) or meronymy/metonymy (e.g., car has wheels), or unspecified relations (e.g., car--petrol). We propose a system for the challenging task of automatic, large-scale acquisition of unconstrained, human-like property norms from large text corpora, and discuss the theoretical implications of such a system. We employ syntactic, semantic, and encyclopedic information to guide our extraction, yielding concept-relation-feature triples (e.g., car be fast, car require petrol, car cause pollution), which approximate property-based conceptual representations. Our novel method extracts candidate triples from parsed corpora (Wikipedia and the British National Corpus) using syntactically and grammatically motivated rules, then reweights triples with a linear combination of their frequency and four statistical metrics. We assess our system output in three ways: lexical comparison with norms derived from human-generated property norm data, direct evaluation by four human judges, and a semantic distance comparison with both WordNet similarity data and human-judged concept similarity ratings. Our system offers a viable and performant method of plausible triple extraction: Our lexical comparison shows comparable performance to the current state-of-the-art, while subsequent evaluations exhibit the human-like character of our generated properties.
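
The reweighting step, a linear combination of a triple's frequency and four further metrics, can be sketched as below. The weights and metric values are invented for illustration; the abstract names neither the four metrics nor how the weights are set.

```python
def reweight(triples, weights):
    """Rank triples by a weighted sum of their features.
    triples: {triple: (frequency, m1, m2, m3, m4)}; weights: five floats."""
    scored = {
        triple: sum(w * v for w, v in zip(weights, feats))
        for triple, feats in triples.items()
    }
    return sorted(scored, key=scored.get, reverse=True)

# Hypothetical candidate triples with made-up feature values.
candidates = {
    ("car", "require", "petrol"): (10, 0.9, 0.8, 0.7, 0.6),
    ("car", "be", "idea"): (12, 0.1, 0.1, 0.1, 0.1),
}
weights = (0.1, 1.0, 1.0, 1.0, 1.0)  # assumed, not taken from the paper
print(reweight(candidates, weights))
```

With these assumed weights the more frequent but less plausible triple is ranked lower, which is the purpose of combining raw frequency with other evidence.
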
Corpora from a sociolinguistic perspective (Corpora sob uma perspectiva sociolinguística)

Directory of Open Access Journals (Sweden)

Tyler Kendall

2011-01-01

Full Text Available In this paper, I consider the use of corpora in sociolinguistic research and, more broadly, the relationships between corpus linguistics and sociolinguistics. I consider the distinction between "conventional" and "unconventional" corpora (Beal et al. 2007a, b) and assess why conventional corpora have not had more traction in sociolinguistics. I then discuss the potential utility of corpora for sociolinguistic study in terms of the recent trajectory of sociolinguistic research interests (Eckert, under review), acknowledging that, while many sociolinguists are increasingly using more advanced corpus-based techniques, many are, at the same time, moving away from corpus-like studies. I suggest two primary areas where corpus developers, both sociolinguistic and non-, could focus to develop more useful corpora: corpora containing a wider range of non-standard (spoken) varieties, and more flexible annotation and treatment of spoken language data.
The Challenge of Folklore to Medieval Studies

OpenAIRE

John Lindow

2018-01-01

When folklore began to emerge as a valid expression of a people during the early stages of national romanticism, it did so alongside texts and artifacts from the Middle Ages. The fields of folklore and medieval studies were hardly to be distinguished at that time, and it was only as folklore began to develop its own methodology (actually analogous to medieval textual studies) during the nineteenth century that the fields were distinguished. During the 1970s, however, folklore adopted a wholly...
The future of multimodal corpora (O futuro dos corpora modais)

Directory of Open Access Journals (Sweden)

Dawn Knight

2011-01-01

Full Text Available This paper takes stock of the current state of the art in multimodal corpus linguistics, and proposes some projections of future developments in this field. It provides a critical overview of key multimodal corpora that have been constructed over the past decade and presents a wish-list of future technological and methodological advancements that may help to increase the availability, utility and functionality of such corpora for linguistic research.
Electronic folklore among teenagers: SMS messages

Directory of Open Access Journals (Sweden)

Cvjetićanin Tijana

2006-01-01

Full Text Available The development of ICT media made way for a new form of folklore communication. Newly developed media, such as mobile phones, make it possible for their users to participate in electronically mediated communication, thus approaching the form of oral communication. The exchange of a special type of SMS text message represents a new way of transmitting short forms of contemporary folklore. These messages use poetic language, they have standard stylistic themes, patterns and formulas, and they form different genres and categories corresponding to already existing familiar folklore forms. The communication process that happens during the exchange of these messages also has the characteristics of folklore: it takes place within small groups, the communication is informal, the texts circulate in chain style and undergo different transformations, which generate variants, etc. This form of electronic folklore is especially popular among teenagers, where its social functions and meanings are also most emphasized. Within this population, it adds to an older tradition of children's written folklore, poetry albums. Like poetry albums, SMS exchange influences the development of girls' gender identity, also providing a socially defined channel for contacts between the sexes. It also functions as a mechanism of socialization and stratification within the group. At the same time, it creates a new field of meaning, which derives from the novelty and significance of the medium itself. In this sense, the exchange of SMS represents a symbolic act of acknowledging one's belonging to the group of mobile telephone users. In this way, a new phenomenon is being symbolically processed through a new form of folklore.
Proposed Framework for the Evaluation of Standalone Corpora Processing Systems: An Application to Arabic Corpora

Directory of Open Access Journals (Sweden)

Abdulmohsen Al-Thubaity

2014-01-01

Full Text Available Despite the accessibility of numerous online corpora, students and researchers engaged in the fields of Natural Language Processing (NLP), corpus linguistics, and language learning and teaching may encounter situations in which they need to develop their own corpora. Several commercial and free standalone corpora processing systems are available to process such corpora. In this study, we first propose a framework for the evaluation of standalone corpora processing systems and then use it to evaluate seven freely available systems. The proposed framework considers the usability, functionality, and performance of the evaluated systems while taking into consideration their suitability for Arabic corpora. While the results show that most of the evaluated systems exhibited comparable usability scores, the scores for functionality and performance were substantially different with respect to support for the Arabic language and N-gram profile generation. The results of our evaluation will help potential users of the evaluated systems to choose the system that best meets their needs. More importantly, the results will help the developers of the evaluated systems to enhance their systems, and will help developers of new corpora processing systems by providing them with a reference framework.
Overcoming Legal Limitations in Disseminating Slovene Web Corpora

Directory of Open Access Journals (Sweden)

Tomaž Erjavec

2016-09-01

Full Text Available Web texts are becoming increasingly relevant sources of information, with web corpora useful for corpus linguistic studies and development of language technologies. Even though web texts are directly accessible, which substantially simplifies the collection procedure, compilation of web corpora is still complex, time-consuming and expensive. It is crucial that similar endeavours are not repeated, which is why it is necessary to make the created corpora easily and widely accessible both to researchers and a wider audience. While this is logistically and technically a straightforward procedure, legal constraints, such as copyright, privacy and terms of use, severely hinder the dissemination of web corpora. This paper discusses legal conditions and actual practice in this area, gives an overview of current practices and proposes a range of mitigation measures, using the example of the Janes corpus of Slovene user-generated content, in order to ensure free and open dissemination of Slovene web corpora.
Automatic Dictionary Expansion Using Non-parallel Corpora

Science.gov (United States)

Rapp, Reinhard; Zock, Michael

Automatically generating bilingual dictionaries from parallel, manually translated texts is a well established technique that works well in practice. However, parallel texts are a scarce resource. Therefore, it is desirable also to be able to generate dictionaries from pairs of comparable monolingual corpora. For most languages, such corpora are much easier to acquire, and often in considerably larger quantities. In this paper we present the implementation of an algorithm which exploits such corpora with good success. Based on the assumption that the co-occurrence patterns between different languages are related, it expands a small base lexicon. For improved performance, it also realizes a novel interlingua approach. That is, if corpora of more than two languages are available, the translations from one language to another can be determined not only directly, but also indirectly via a pivot language.
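
The indirect route through a pivot language can be sketched with two toy lexicons. This shows only the pivot lookup, not the co-occurrence-based algorithm the abstract describes; the lexicon contents and the function name `translate_via_pivot` are hypothetical.

```python
def translate_via_pivot(word, src_to_pivot, pivot_to_tgt):
    """Collect target-language candidates reachable through any pivot translation,
    ranked by how many pivot words support each candidate (ties broken alphabetically)."""
    votes = {}
    for pivot_word in src_to_pivot.get(word, []):
        for candidate in pivot_to_tgt.get(pivot_word, []):
            votes[candidate] = votes.get(candidate, 0) + 1
    return sorted(votes, key=lambda c: (-votes[c], c))

# Toy German-English and English-French lexicons (illustrative only).
de_en = {"Haus": ["house", "home"]}
en_fr = {"house": ["maison"], "home": ["maison", "foyer"]}
print(translate_via_pivot("Haus", de_en, en_fr))
```

Candidates supported by several pivot words rank first, which is one simple way to let indirect evidence reinforce a direct translation guess.
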
Spoken corpora and pragmatics (Corpora orais e pragmática)

Directory of Open Access Journals (Sweden)

Massimo Moneglia

2011-01-01

Full Text Available The goal of this paper is to present arguments in favour of two points related to the study of oral corpora and pragmatics: (a) at the level of annotation, corpora must ensure the parsing of the speech flow into utterances on the basis of prosodic cues and provide easy access to the acoustic source; (b) at the level of sampling, corpora must ensure the maximum representation of context variation, rather than speaker variation. We will present the reasons which support the very basic prosodic annotation of speech (prosodic boundaries) as a means to obtain relevant data from the speech flow. Starting from our present knowledge about the distribution of speech act types in spoken corpora, we will present the reasons why building corpora in accordance with a context variation strategy should expand our knowledge of pragmatics. Additionally, we will claim that prosody is the necessary interface between locutive and illocutive acts and we will show that a deeper prosodic analysis is necessary to grasp unknown speech act types from language usage. Finally, we will briefly sketch the main assumptions of the Language into Act Theory (CRESTI, 2000), which is dedicated to the link between prosody and pragmatics and helps make explicit core aspects of pragmatic knowledge.
Building and using comparable corpora

CERN Document Server

Sharoff, Serge; Zweigenbaum, Pierre; Fung, Pascale

2013-01-01

The 1990s saw a paradigm change in the use of corpus-driven methods in NLP. In the field of multilingual NLP (such as machine translation and terminology mining) this implied the use of parallel corpora. However, parallel resources are relatively scarce: many more texts are produced daily by native speakers of any given language than translated. This situation resulted in a natural drive towards the use of comparable corpora, i.e. non-parallel texts in the same domain or genre. Nevertheless, this research direction has not produced a single authoritative source suitable for researchers and stu
Working with corpora in the translation classroom

Directory of Open Access Journals (Sweden)

Ralph Krüger

2012-10-01

Full Text Available This article sets out to illustrate possible applications of electronic corpora in the translation classroom. Starting with a survey of corpus use within corpus-based translation studies, the didactic value of corpora in the translation classroom and their epistemic value in translation teaching and practice will be elaborated. A typology of translation practice-oriented corpora will be presented, and the use of corpora in translation will be positioned within two general models of translation competence. Special consideration will then be given to the design and application of so-called Do-it-yourself (DIY) corpora, which are compiled ad hoc with the aim of completing a specific translation task. In this context, possible sources for retrieving corpus texts will be presented and evaluated and it will be argued that, owing to time and availability constraints in real-life translation, the Internet should be used as a major source of corpus data. After a brief discussion of possible Internet research techniques for targeted and quality-focused corpus compilation, the possible use of the Internet itself as a macro-corpus will be elaborated. The article concludes with a brief presentation of corpus use in translation teaching in the MA in Specialised Translation Programme offered at Cologne University of Applied Sciences, Germany.
An Evaluation of Folklore Events in Serbia in Terms of Tourism

Directory of Open Access Journals (Sweden)

Željko Bjeljac

2016-02-01

Full Text Available In Serbia there are many events based on tradition, folklore, old customs and traditional crafts and trades. Folklore events are the oldest elements in the development of tourism and provide a sufficient motive for tourist visits. On the basis of their program content, these events can be divided into folklore and folk music festivals, festivals of folk customs, and children's folklore festivals. This paper offers a categorization of folklore events according to economic and geographic criteria; particular attention has been given to events that already are, or have great potential for becoming, a major attraction of the tourist destination in question and can thus contribute to a faster and higher-quality development of tourism.

Using corpora in translator training (Uso de corpora na formação de tradutores)

Directory of Open Access Journals (Sweden)

Antonio P. Berber Sardinha

2003-01-01

Full Text Available This paper tackles the issue of using corpora in translator training, focussing more specifically on the question of awareness raising. The paper presents a discussion on the role of corpora in translation, their applicability in professional development, and their importance in leading to a better understanding of how language is constituted. Two example analyses are offered and detailed, so that they are applicable to contexts in which computational resources are scarce. The analyses center on the linguistic choices in a newspaper text translated into Portuguese and in the Brazilian version of a slogan from an American advertising campaign. It is suggested that these activities may be carried out with translation students, in such a way that they enable students, while they explore electronic corpora, to become aware of both the complexity and the specificity of the linguistic choices involved in the process of translation.
Corpora and historical linguistics (Corpora e linguística histórica)

Directory of Open Access Journals (Sweden)

Merja Kytö

2011-01-01

Full Text Available The present article aims to survey and assess the current state of electronic historical corpora and corpus methodology, and attempts to look into possible future developments. It highlights the fact that within the wide spectrum of corpus linguistic methodology, historical corpus linguistics has emerged as a vibrant field that has significantly added to the appeal felt for the study of language history and change. In fact, according to a historical linguist with more than fifty years of experience, "[w]e could even go as far as to say that without the support and new impetus provided by corpora, evidence-based historical linguistics would have been close to the end of its life-span in these days of rapid-changing life and research, increasing competition on the academic career track and the methodological attractions offered to young scholars" (RISSANEN, forthcoming). Historical corpora and other electronic resources have also made the study of language history attractive: working on them engages students in an individual and interactive way that they find appealing (CURZAN 2000, p. 81).
Folklore and its paradoxes (El folklore y sus paradojas)

Directory of Open Access Journals (Sweden)

HONORIO M. VELASCO MAILLO

1990-01-01

Full Text Available One of the most salient features of the history of folklore in Spain and other European nations is its paradoxes. First proposed as a science, it came to be clearly rejected by later scientific circles. A social history of folklore would be of interest. This article suggests that these paradoxes and contradictions are related to the scientific paradigm adopted by its promoters, cultural evolutionism, and to an idealized concept of "the people", which they helped to construct by presenting collections of materials. It also analyzes the different social functions that folkloric discourse has fulfilled.
Building Collections: Folklore

Science.gov (United States)

Krapp, JoAnn Vergona

2005-01-01

Folklore, the oldest form of storytelling, reflects the culture of a country, hence its nonfiction classification. Through these tales, one senses the values, the humor, and the lifestyles of its peoples. A powerful genre, folklore is the foundation on which high fantasy is created, epic films are produced, and a single story is passed from one…
THE COMPOSER AND FOLKLORE PROBLEM: FACTORS OF STYLISTIC STRUCTURE

Directory of Open Access Journals (Sweden)

COCEAROVA GALINA

2017-12-01

Full Text Available This paper continues the author’s earlier study of the Composer and Folklore problem from the stylistic point of view. It is noted that in academic music, where the attention is focused not only on the speech or text characteristics, but primarily on the linguistic and stylistic material of folklore, the appeal to folk sources leads to the emergence of a number of stylistic factors, both in the formation of the national style and in the field of ethnic culture as a whole and integral stable system. The research points to the role of folklore as the genetic code of ethnic culture, as well as to other factors acting at the level of musical discourse and musical language, contributing to the formation of “language flexibility” (A. Kolmogorov) and, as a result, “flexibility of style”.
Folklore in China: Past, Present, and Challenges

Directory of Open Access Journals (Sweden)

Juwen Zhang

2018-04-01

Full Text Available This article first outlines the long history of folklore collection in China, and then describes the disciplinary development in the 20th century. In Section 3, it presents the current situation in terms of disciplinary infrastructure, development, contribution, and challenge, with a focus on the recent practice of safeguarding Intangible Cultural Heritage. These accounts are largely based on the views of the Chinese folklorists. In the final section, this article discusses the issues of cultural continuity, integration, and self-healing mechanisms in Chinese culture by putting Chinese folkloristics in a historical and world perspective. This paper suggests that, to understand Chinese folklore and culture, one must be aware of the most basic differences between Chinese fundamental beliefs and values and those of the West, and that Chinese folklore and folkloristics present new challenges to the current paradigms put forward in the post-colonial, post-modern, and imperial ideologies.
Importancia del folklore musical como práctica educativa

Directory of Open Access Journals (Sweden)

Arévalo, Azahara

2009-06-01

Full Text Available Today's educational community should reflect on the importance of musical folklore as an educational practice. This paper offers a reflection on folklore and on different educational practices, taking as examples some musical pieces from the Jaén Song Book. Such practices are essential because they improve the quality of learning in general and of music learning in particular. Nowadays, the school is the unifying medium for the reappraisal, communication and transmission of the folklore of our culture. Recovering our folklore is a task that depends on every member of the community, and it becomes possible by adapting these musical pieces to new social changes and spreading them through the media. The Jaén Song Book may be a means of promoting the province's folklore among its students. Learning this repertoire may also open a door to the world, making known the work that is done in our schools. This paper tries to make teachers aware that the use of folk materials may improve the learning of music and may open a new path for future didactic, cultural and anthropological research.
Women and the Study of Folklore.

Science.gov (United States)

Jordan, Rosan A.; De Caro, F. A.

1986-01-01

Presents a critical overview of academic writing on women and folklore, organized in three categories: (1) literature on images of women in verbal folklore, and the role of negative images in shaping attitudes; (2) research on women's oral genres and performance and female use of folklore; and (3) studies of women as folk performers and artists.…
Proposed framework for the evaluation of standalone corpora processing systems: an application to Arabic corpora.

Science.gov (United States)

Al-Thubaity, Abdulmohsen; Al-Khalifa, Hend; Alqifari, Reem; Almazrua, Manal

2014-01-01

Despite the accessibility of numerous online corpora, students and researchers engaged in the fields of Natural Language Processing (NLP), corpus linguistics, and language learning and teaching may encounter situations in which they need to develop their own corpora. Several commercial and free standalone corpora processing systems are available to process such corpora. In this study, we first propose a framework for the evaluation of standalone corpora processing systems and then use it to evaluate seven freely available systems. The proposed framework considers the usability, functionality, and performance of the evaluated systems while taking into consideration their suitability for Arabic corpora. While the results show that most of the evaluated systems exhibited comparable usability scores, the scores for functionality and performance were substantially different with respect to support for the Arabic language and N-grams profile generation. The results of our evaluation will help potential users of the evaluated systems to choose the system that best meets their needs. More importantly, the results will help the developers of the evaluated systems to enhance their systems and developers of new corpora processing systems by providing them with a reference framework.
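As an illustration of the "N-grams profile generation" feature on which the evaluated systems are said to differ, the following is a minimal sketch only. It assumes whitespace tokenization, which is a simplification for Arabic and for most real corpora, and the function name `ngram_profile` is hypothetical rather than taken from any of the evaluated systems.

```python
from collections import Counter

def ngram_profile(tokens, n=2, top=5):
    """Return the `top` most frequent n-grams of a token list with their counts."""
    # Slide a window of width n over the tokens.
    grams = zip(*(tokens[i:] for i in range(n)))
    counts = Counter(grams)
    # Sort by descending count, then alphabetically, so ties are deterministic.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top]

# Toy input; a real profile would be built from a whole corpus.
tokens = "the cat sat on the mat and the cat slept".split()
profile = ngram_profile(tokens, n=2, top=3)
for gram, count in profile:
    print(" ".join(gram), count)
```

Because Python strings are Unicode, the same sketch runs on Arabic text, although proper Arabic tokenization and normalization would need a dedicated tool.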
Of Mermaids and Changelings: Human Rights, Folklore and Contemporary Irish Language Poetry

Directory of Open Access Journals (Sweden)

Rióna Ní Fhrighil

2017-10-01

Full Text Available This article investigates the intersection of human rights discourse, Irish folklore and contemporary Irish-language poetry. The author contends that contemporary Irish-language poets Louis de Paor and Nuala Ní Dhomhnaill exploit the multi-faceted nature of international folklore motifs, along with their local variants, to represent human rights violations in their poetry. Focusing specifically on the motif of the changeling in De Paor’s poetry and on the motif of the mermaid in Ní Dhomhnaill’s, the author traces how folklore material is reimagined in ways that eschew uncomplicated transnational solidarity but which engender empathetic settlement.
BENTUK KARAKTER ANAK MELALUI DOKUMENTASI FOLKLOR LISAN KEBUDAYAAN LOKAL

Directory of Open Access Journals (Sweden)

Ranggi Ramadhani Ilminisa

2016-06-01

Full Text Available This research aims to document myths, legends, and fairy tales in Jombang and to adapt this oral folklore into children's stories that carry character education. The method used is descriptive qualitative. The study yielded nine stories from several data collection sites covering north, west, south, and central Jombang. These nine stories are documented and described in the research findings. The oral folklore can thus serve as a reference for shaping children's character education.
Using corpora in scientific and technical translation training: resources to identify conventionality and promote creativity

Directory of Open Access Journals (Sweden)

Clara Inés López-Rodríguez

2016-06-01

Full Text Available Since the first Corpus Use and Learning to Translate (CULT) Conference in Bertinoro (Italy) in 1997, the usefulness of corpora for translators and trainee translators has been highlighted. From an initial approach where translators compiled ad hoc corpora on their hard drives for subsequent study with lexical analysis software, there emerged a new trend towards the use of the Internet as corpus. In this second approach, the Web is perceived as a huge corpus which is accessed by means of online tools which produce monolingual wordlists and concordances from texts available from the Internet or pre-existing corpora, or by means of bilingual or multilingual concordancers displaying aligned texts from international institutions' parallel corpora. Bilingual concordancers and translation memories are widely used by translators and trainee translators because of the immediate translation solutions they offer, but these tools can restrain creativity by offering conventional solutions and eliminating layout and multimodal elements in texts. The aim of this article is to describe the exploitation of quality corpora in a scientific and technical translation course, focusing on texts on health translated from English into Spanish, and on terminological variation as a reflection of creativity in language.
POLÍTICAS DE LA REPRESENTACIÓN DEL FOLKLORE EN LOS MUSEOS FOLKLÓRICOS/Folklore representation policies in folk museums

Directory of Open Access Journals (Sweden)

Ana María Dupey

2012-11-01

Full Text Available This work deals with the invention and reinvention of folklore museums. It analyzes the political purposes and the reasons put forward for their establishment, who the agents of these inventions and reinventions were, and whether they were the product of state institutions or arose from movements of elites or minority groups belonging to civil society. At the same time, it explains how representations of folklore are semanticized to represent the identities of local, regional, national and transnational collectives. It analyzes the current reorientations of these institutions brought about by (a) the processes of decolonization (external and internal), with their economic, political, social and cognitive consequences; (b) the critiques of the colonial and classist analyses developed in the past by Ethnology and Folklore, the disciplines that fed the respective museographic discourses; and (c) the revision of the definition of the museum as an institution.
Pendayagunaan Folklor Sebagai Sumber Ekonomi Kreatif Di Daerah Tujuan Wisata Bali

Directory of Open Access Journals (Sweden)

I Nyoman Suarka

2014-06-01

Full Text Available Tourism practitioners in Bali commonly do not have an adequate understanding of the local culture, so the service given to tourists is less than optimal. Therefore, efforts to delve into the original culture through scientific research are necessary, as a source of information and appreciation for developing the cultural outlook of tourism practitioners in Bali. This research aims to delve into, preserve and develop folklore with high cultural potential as a source of creative economy. This is a qualitative research with a morphology-ethnographic approach which attempts to describe the narrative elements of folklore as a unified whole by considering its history in the community and its supporting culture. That is, besides looking at the lore aspect through the analysis of folklore structure, it also considers the folk aspect through the analysis of function and significance. Furthermore, this research focuses on the opportunity to use folklore as a source of creative economy, in addition to strengthening local wisdom and preventing the cultural pollution that results from the negative aspects of tourism and globalization.
Folklore anecdote between memorata and fabulata: Field research of Serbs in Medina (Hungary)

Directory of Open Access Journals (Sweden)

Ilić Marija

2007-01-01

Full Text Available This work is based on folklore material gathered during ethnolinguistic field research on the Serbian traditional lexicon and spiritual culture in the village of Medina, Hungary, in 2002. The material consists of accounts by the informant Sava Sokic and can primarily be defined as a series of comic narrations. If we view these narrations as a genre of oral speech within the context of an ethnolinguistic interview, we notice the complex structure of this oral genre. On the one hand, it functions as a memorate, with typical openings and metatextual comments. On the other hand, it respects almost all the genre norms characteristic of the folklore anecdote. Therefore Sava Sokic's comic narrations, and this also holds for the folklore anecdote in general, can be classified as a borderline genre, between memorata and fabulata.
Nenets Folklore in Russian: The Movement of Culture in Forms and Languages

Directory of Open Access Journals (Sweden)

Karina Lukin

2008-09-01

Full Text Available This methodological article discusses the question of the authenticity of folklore material. It deals mainly with the research history of Nenets folklore studies and critically examines two of its paradigms, namely the so-called Finno-Ugric paradigm and the Soviet studies. It is argued that these paradigms contained biases that prevented students from studying certain kinds of folklore material. The biases were related to the language and the form of the material: because of them, folklore not performed in Nenets and not in forms defined as traditional was left outside collections and research. Furthermore, it is shown that Russian speech and narratives embedded in speech are part of Nenets everyday communication and thus also material worth studying and collecting. Instead of the criticised paradigms, Nenets discourse is examined within the notions of communication-centered studies that have gained attention since the 1980s.
Topic Modeling of Hierarchical Corpora

OpenAIRE

Kim, Do-kyum

2014-01-01

The sizes of modern digital libraries have grown beyond our capacity to comprehend manually. Thus we need new tools to help us in organizing and browsing large corpora of text that do not require manually examining each document. To this end, machine learning researchers have developed topic models, statistical learning algorithms for automatic comprehension of large collections of text. Topic models provide both global and local views of a corpus; they discover topics that run through the co...
Using corpora in scientific and technical translation training: resources to identify conventionality and promote creativity

Directory of Open Access Journals (Sweden)

Clara Inés López-Rodríguez

2016-04-01

Full Text Available http://dx.doi.org/10.5007/2175-7968.2016v36nesp1p88 Since the first Corpus Use and Learning to Translate (CULT) Conference in Bertinoro (Italy) in 1997, the usefulness of corpora for translators and trainee translators has been highlighted. From an initial approach where translators compiled ad hoc corpora on their hard drives for subsequent study with lexical analysis software, there emerged a new trend towards the use of the Internet as corpus. In this second approach, the Web is perceived as a huge corpus which is accessed by means of online tools which produce monolingual wordlists and concordances from texts available from the Internet or pre-existing corpora, or by means of bilingual or multilingual concordancers displaying aligned texts from international institutions' parallel corpora. Bilingual concordancers and translation memories are widely used by translators and trainee translators because of the immediate translation solutions they offer, but these tools can restrain creativity by offering conventional solutions and eliminating layout and multimodal elements in texts. The aim of this article is to describe the exploitation of quality corpora in a scientific and technical translation course, focusing on texts on health translated from English into Spanish, and on terminological variation as a reflection of creativity in language.
The use of corpora in English writing classes

Directory of Open Access Journals (Sweden)

Paula Pinto Paiva

2013-01-01

Full Text Available This study aims at discussing aspects related to learner corpora and linguistic features found in texts written by English learners based on the use of collocations in text production. For this research, we analyzed collocations with the verb “to have” and with the nouns “prejudice” and “regret”.
NETWORK FOLKLORE AND ITS ROLE IN THE FORMATION OF A COLLECTIVE COGNITIVE SPACE

Directory of Open Access Journals (Sweden)

Anastasija Belovodskaja

2014-04-01

Full Text Available The global implementation of information-communicative technologies into every sphere of human activity is being accompanied by the emergence of new forms of communication, leading to inevitable changes in the means of both the representation and reception of information. In this respect, the field of interest encompasses research into modern anonymous network creative writing, which, as a result of the technological qualities of the Internet space, produces such texts that require particular skills in both comprehension and reproduction. In turn, the products of network folklore, as they spontaneously spread on the Internet, acquire the status of particular signs of a precedent nature. At the same time, the very nature of anonymous network creative writing—amusing and colloquial—raises the attractiveness of such texts and facilitates their reception, allowing them to be used for manipulative aims. The fact that such network folklore can influence the process of idea-formation in society is predetermined by the fact that, by definition, it is the milieu where collective representations are condensed and transmitted. Thus, network folklore is in the focus of attention not only in folklore studies, but is extremely topical for research in such fields as cognitive science, linguistic-cultural studies, public relations, speech effect, and any others which take interest in the processes of keeping, receiving, and transmitting information.

KABA MALIN DEMAN: MENYIASATI DAMPAK DUA FALSAFAH MINANGKABAU DALAM FOLKLOR

Directory of Open Access Journals (Sweden)

Tienn Immerry

2017-11-01

Full Text Available Indonesian folktales are transmitted from one generation to the next by word of mouth. The change from oral form to written manuscript has in fact been a long process. Folktales carry the cultural values of the folk, that is, of a particular group of people. Research on folklore is one way to reveal the philosophy contained in the written manuscript. Two Minangkabau philosophies, the extinction philosophy and the marriage philosophy, are found in kaba Malin Deman; if an imbalance occurs, it creates problems in the society. Harmonization is the strategy for dealing with this imbalance and is also a function of folklore itself.
Building gold standard corpora for medical natural language processing tasks.

Science.gov (United States)

Deleger, Louise; Li, Qi; Lingren, Todd; Kaiser, Megan; Molnar, Katalin; Stoutenborough, Laura; Kouril, Michal; Marsolo, Keith; Solti, Imre

2012-01-01

We present the construction of three annotated corpora to serve as gold standards for medical natural language processing (NLP) tasks. Clinical notes from the medical record, clinical trial announcements, and FDA drug labels are annotated. We report high inter-annotator agreements (overall F-measures between 0.8467 and 0.9176) for the annotation of Personal Health Information (PHI) elements for a de-identification task and of medications, diseases/disorders, and signs/symptoms for an information extraction (IE) task. The annotated corpora of clinical trials and FDA labels will be publicly released, and to facilitate translational NLP tasks that require cross-corpora interoperability (e.g. clinical trial eligibility screening) their annotation schemas are aligned with a large-scale, NIH-funded clinical text annotation project.
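The abstract reports inter-annotator agreement as an F-measure. A minimal sketch of how such a figure can be computed for two annotators follows. It assumes annotations are (start, end, label) spans and that only exact matches count, which the abstract does not state; the span data and the name `f_measure` are invented for illustration.

```python
def f_measure(reference, candidate):
    """Precision, recall and F1 of candidate spans against reference spans.

    Assumption: a span counts as agreed only if start, end and label all match.
    """
    ref, cand = set(reference), set(candidate)
    tp = len(ref & cand)  # spans both annotators marked identically
    precision = tp / len(cand) if cand else 0.0
    recall = tp / len(ref) if ref else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return precision, recall, f1

# Hypothetical annotations of one document by two annotators.
annotator_a = {(0, 8, "MEDICATION"), (15, 22, "DISEASE"), (30, 41, "SYMPTOM")}
annotator_b = {(0, 8, "MEDICATION"), (15, 22, "DISEASE"),
               (45, 50, "SYMPTOM"), (60, 66, "DISEASE")}

precision, recall, f1 = f_measure(annotator_a, annotator_b)
print(f"P={precision:.3f} R={recall:.3f} F1={f1:.3f}")
```

Treating one annotator as the reference is a convention: swapping the two arguments swaps precision and recall but leaves F1 unchanged, which is one reason F1 is convenient as an agreement score.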
Folklore and the Internet: The Challenge of an Ephemeral Landscape

Directory of Open Access Journals (Sweden)

Trevor J. Blank

2018-05-01

Full Text Available Through the lens of memetic folk humor, this essay examines the slippery, ephemeral nature of hybridized forms of contemporary digital folklore. In doing so, it is argued that scholars should not be distracted by the breakneck speed at which expressive materials proliferate and then dissipate but should instead focus on the overarching ways that popular culture and current news events infiltrate digital folk culture in the formation of individuals' cultural inventories. The process of transmission and variation that shapes the resulting hybridized folklore requires greater scrutiny and contextualization.
Discovery learning in the language-for-translation classroom: corpora as learning aids

Directory of Open Access Journals (Sweden)

Silvia Bernardini

2016-06-01

Full Text Available This contribution reviews the idea of discovery learning with corpora, proposed in the 1990s, evaluating its potential and its implications with reference to the education of translators today. The rationale behind this approach to data-driven learning, combining project-based and form-focused instruction within a socio-constructivistically inspired environment, is discussed. Examples are also provided of authentic, open-ended learning experiences, thanks to which students of translation share responsibility over the development of corpora and their consultation, and teachers can abandon the challenging role of omniscient knowledge providers and wear the more honest hat of "learning experts". Adding to the more straightforward uses of corpora in courses that aim to develop thematic, technological and information mining competences (i.e., in which training is offered in the use of corpora as professional aids), attention is focused on foreign language teaching for translators and on corpora as learning aids, highlighting their potential for the development of the three other European Master's in Translation (EMT) competences (translation service provision, language and intercultural ones).
New approaches for development, analyzing and security of multimedia archive of folklore objects

Directory of Open Access Journals (Sweden)

Galina Bogdanova

2008-07-01

Full Text Available We present new approaches used in the development of the demo version of a web-based client/server system that contains an archive of folklore materials from the Folklore Institute at the Bulgarian Academy of Sciences (BAS). Some new methods for securing images and text by embedding watermarks in the system data are presented. A digital watermark is a visible or perfectly invisible identification code that is permanently embedded in the data and remains present within the data after any decryption process. We have also developed improved tools and algorithms for analyzing the database.
An Interpretation of Two Oromo Folklore Genres Integrated to ...

African Journals Online (AJOL)

The purpose of this study was to analyze and interpret the meanings of two selected folklore genres namely: riddle and pastoral song portrayed in primary Oromo language student text books integrated to enhance the language skills, knowledge, attitude and cultural values of the children. Qualitative method was employed ...
Semantics, contrastive linguistics and parallel corpora

Directory of Open Access Journals (Sweden)

Violetta Koseska

2014-09-01

Full Text Available In view of the ambiguity of the term “semantics”, the author shows the differences between traditional lexical semantics and contemporary semantics in the light of various semantic schools. She examines semantics differently in connection with contrastive studies, where the description must necessarily go from the meaning towards the linguistic form, whereas in traditional contrastive studies the description proceeded from the form towards the meaning. This requirement regarding theoretical contrastive studies necessitates the construction of a semantic interlanguage, rather than only singling out universal semantic categories expressed with various language means. Such studies can be strongly supported by parallel corpora. However, in order to make them useful for linguists in manual and computer translations, as well as in the development of dictionaries, including online ones, we need not only formal, often automatic, annotation of texts, but also semantic annotation, which is unfortunately manual. In the article we focus on semantic annotation concerning time, aspect and quantification of names and predicates in the whole semantic structure of the sentence, using the example of the “Polish-Bulgarian-Russian parallel corpus”.
LDC Arabic Treebanks and Associated Corpora: Data Divisions Manual

OpenAIRE

Diab, Mona; Habash, Nizar; Rambow, Owen; Roth, Ryan

2013-01-01

The Linguistic Data Consortium (LDC) has developed hundreds of data corpora for natural language processing (NLP) research. Among these are a number of annotated treebank corpora for Arabic. Typically, these corpora consist of a single collection of annotated documents. NLP research, however, usually requires multiple data sets for the purposes of training models, developing techniques, and final evaluation. Therefore it becomes necessary to divide the corpora used into the required data sets...
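The need to divide a corpus into training, development and evaluation sets can be sketched as follows. This is a generic seeded random split at document level, not the divisions defined in the manual; the 80/10/10 proportions, the document IDs and the name `split_corpus` are all assumptions made for illustration.

```python
import random

def split_corpus(doc_ids, dev_frac=0.1, test_frac=0.1, seed=0):
    """Split document IDs into (train, dev, test) lists, reproducibly."""
    ids = sorted(doc_ids)              # fixed starting order, independent of input order
    random.Random(seed).shuffle(ids)   # seeded shuffle: same split on every run
    n_dev = int(len(ids) * dev_frac)
    n_test = int(len(ids) * test_frac)
    dev = ids[:n_dev]
    test = ids[n_dev:n_dev + n_test]
    train = ids[n_dev + n_test:]
    return train, dev, test

# Hypothetical document IDs standing in for treebank files.
docs = [f"doc{i:03d}" for i in range(20)]
train, dev, test = split_corpus(docs)
print(len(train), len(dev), len(test))
```

Splitting by whole document rather than by sentence keeps text from one document out of both training and evaluation data, and publishing the split lets different groups compare results on the same sets.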
“Stories Like the Light of Stars”: Folklore and Narrative Strategies in the Fiction of Éilís Ní Dhuibhne

Directory of Open Access Journals (Sweden)

Giovanna Tallone

2017-10-01

Full Text Available Besides being one of Ireland’s best-known and eminent writers, Éilís Ní Dhuibhne is also a professional and recognised folklorist and researcher, whose work covers a diversity of topics and subjects, mostly in the area of the tradition of oral storytelling and urban folklore. Her background in folklore has a relevant impact on her fiction, which is marked by reinvention of folklore patterns and juxtaposition of ancient stories and their contemporary counterparts. The purpose of this essay is to shed light on the impact of folklore and folklore projects on the fiction of Éilís Ní Dhuibhne in terms of allusions, contents, discourse organization and narrative strategies. The tight link between folklore and storytelling in her writing is analysed taking into account her short stories vis-à-vis her academic work in folklore, focussing on Ní Dhuibhne’s awareness of the continuity of traditional narrative in time.
Folklore Music on Romanian TV. From State Socialist Television to Private Channels

Directory of Open Access Journals (Sweden)

Alexandra Urdea

2014-06-01

Full Text Available Music genres rooted in folklore have often been interpreted as ideological manoeuvres to forge a sense of national identity (Gordy, Mihailescu, Baker, Cash). This article explores formalized folklore performances of muzică populară as forms of ‘media rituals’ (Couldry), and focuses on the role that television has played in establishing the genre as we know it today. It analyses the link between muzică populară as rooted in mass participation activities during communism, and ‘media rituals’ as framed on television (Couldry), indiscriminately and democratically involving the entire population that it addresses (and is available beyond that).
06491 Summary -- Digital Historical Corpora- Architecture, Annotation, and Retrieval

OpenAIRE

Burnard, Lou; Dobreva, Milena; Fuhr, Norbert; Lüdeling, Anke

2007-01-01

The seminar "Digital Historical Corpora" brought together scholars from (historical) linguistics, (historical) philology, computational linguistics and computer science who work with collections of historical texts. The issues that were discussed include digitization, corpus design, corpus architecture, annotation, search, and retrieval.
Collocation lists as instruments for metaphor detection in corpora Listas de colocações como instrumentos para detecção de metáforas em corpora

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2006-01-01

Full Text Available This paper reports a study on the use of collocation lists as instruments for detecting metaphors in corpora. A collocation list contains the collocations for selected words in corpora together with concordances for those words. As corpora become more available to metaphor researchers, there is a growing need for developing ways to gain access to as much data as the corpus can offer. The research described here has hopefully come some way toward meeting the challenges of developing tools for metaphor corpus research. Results suggest that the collocation lists seem to be a good pre-processing instrument for corpus research of metaphor, despite accuracy problems.
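A collocation list as described above, pairing a word's collocates with its concordance lines, can be sketched in a few lines. This is an illustrative reconstruction under simple assumptions (whitespace tokens, a fixed window of two words on each side, raw frequency counts), not the tool used in the study; the name `collocation_list` and the sample sentence are invented.

```python
from collections import Counter

def collocation_list(tokens, node, window=2):
    """Collect collocate counts and concordance lines for one node word."""
    collocates = Counter()
    concordances = []
    for i, tok in enumerate(tokens):
        if tok != node:
            continue
        left = tokens[max(0, i - window):i]    # up to `window` words before the node
        right = tokens[i + 1:i + 1 + window]   # up to `window` words after the node
        collocates.update(left + right)
        concordances.append(" ".join(left + [f"[{tok}]"] + right))
    return collocates, concordances

text = "prices climbed a mountain of debt while hopes climbed a ladder of promises"
collocates, lines = collocation_list(text.split(), "climbed")
for line in lines:
    print(line)
print(collocates.most_common(3))
```

The output is only a candidate list: deciding whether "climbed a mountain of debt" is metaphorical remains a human judgment, which matches the abstract's description of the lists as a pre-processing instrument.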
Developing intonation corpora for isiXhosa and isiZulu

CSIR Research Space (South Africa)

Govender, N

2005-11-01

Full Text Available also show how those corpora can be used without further interpretation to gain insight into matters such as overall pitch contours and gender differences, and discuss the additional steps that will be required to create truly generative models from...
From Annotated Multimodal Corpora to Simulated Human-Like Behaviors

DEFF Research Database (Denmark)

Rehm, Matthias; André, Elisabeth

2008-01-01

Multimodal corpora prove useful at different stages of the development process of embodied conversational agents. Insights into human-human communicative behaviors can be drawn from such corpora. Rules for planning and generating such behavior in agents can be derived from this information....... And even the evaluation of human-agent interactions can rely on corpus data from human-human communication. In this paper, we exemplify how corpora can be exploited at the different development steps, starting with the question of how corpora are annotated and on what level of granularity. The corpus data...
Text Mining of Supreme Administrative Court Jurisdictions

OpenAIRE

Feinerer, Ingo; Hornik, Kurt

2007-01-01

Within the last decade text mining, i.e., extracting sensitive information from text corpora, has become a major factor in business intelligence. The automated textual analysis of law corpora is highly valuable because of its impact on a company's legal options and the raw amount of available jurisdiction. The study of supreme court jurisdiction and international law corpora is equally important due to its effects on business sectors. In this paper we use text mining methods to investigate Au...
"Haunting experiences: Ghosts in contemporary folklore," by Diane E. Goldstein et al.

Directory of Open Access Journals (Sweden)

Linda Levitt

2010-03-01

Full Text Available Diane E. Goldstein, Sylvia Ann Grider, and Jeannie Banks Thomas. Haunting experiences: Ghosts in contemporary folklore. Logan: Utah State University Press, 2007, paperback, $24.95 (272p) ISBN 978-0-87421-636-3.
Folklore and Folk Songs of Chittagong: A Critical Review

Directory of Open Access Journals (Sweden)

Amir Mohammad Khan

2017-04-01

Full Text Available Folk songs, which stem from folklore, are very rich in the southern region of Chittagong. In this part of the world folk songs play a pivotal role in people's lives, as a heartfelt connection exists between humans, nature and folk songs. Folk songs in this area are special because the theme of nature conservation appears in them. We took the southern part of Chittagong (Lohagara, Satkania, Chandanaish and Patiya) as our research area, selected one village, Chunati, by systematic sampling, and interviewed more than 100 people through focus group discussions and key informant interviews. A literature review was also carried out. People in this area love nature a great deal. From time to time musicians have emerged here who not only worked for the development of music but also raised people's awareness of the need to love and protect nature. We discuss the origin and patterns of folk songs to clarify the importance of the folk songs of Chittagong, both for their connection to folklore and for promoting the idea of nature conservation. This area of study deserves more research attention. The ultimate goal should be to conserve and promote the folk songs of Chittagong and their long heritage, which will in turn enrich folklore and nature conservation.
Use of English Corpora as a Primary Resource to Teach English to the Bengali Learners

Science.gov (United States)

Dash, Niladri Sekhar

2011-01-01

In this paper we argue in favour of teaching English as a second language to the Bengali learners with direct utilisation of English corpora. The proposed strategy is meant to be assisted with computer and is based on data, information, and examples retrieved from the present-day English corpora developed with various text samples composed by…
Folklore Epistemology: How Does Traditional Folklore Contribute to Children's Thinking and Concept Development?

Science.gov (United States)

Agbenyega, Joseph S.; Tamakloe, Deborah E.; Klibthong, Sunanta

2017-01-01

This research utilised a "stimulated recall" methodology [Calderhead, J. 1981. "Stimulated Recall: A Method for Research on Teaching." "British Journal of Educational Psychology" 51: 211-217] to explore the potential of African folklore, specifically Ghanaian folk stories in the development of children's reflective…
Corpora and corpus technology for translation purposes in professional and academic environments. Major achievements and new perspectives

Directory of Open Access Journals (Sweden)

Cécile Frérot

2016-06-01

Full Text Available The “use” of corpora and concordancers in translation teaching has grown increasingly attractive since the mid-1990s, with an abundant literature advocating their use and promoting their benefits in the translation classroom. In translator training, efforts are being made to incorporate the use of corpora and concordancers in master's programmes and to offer specific modules on corpora for translation, as the use of translation memory (TM) systems within Computer-Aided Translation (CAT) courses still dominates. In the translation profession, while TM systems are part of the everyday working environment, the same cannot be said of corpora and concordancers, even though the most recent surveys show that professional translators would like to learn more about the potential of corpora for translation. Overall, the “usefulness” of corpora and corpus technology at the different stages of the translation process remains poorly documented in translation, but a growing number of empirical studies has started to show concern, as it has now become of paramount importance to assess the extent to which corpora are of added value for translation quality in both professional and academic environments.

Using Monolingual and Bilingual Corpora in Lexicography

Science.gov (United States)

Miangah, Tayebeh Mosavi

2009-01-01

Constructing and exploiting different types of corpora are among computer applications exposed to the researchers in different branches of science including lexicography. In lexicography, different types of corpora may be of great help in finding the most appropriate uses of words and expressions by referring to numerous examples and citations.…
Working with Corpora in the Translation Classroom

Science.gov (United States)

Krüger, Ralph

2012-01-01

This article sets out to illustrate possible applications of electronic corpora in the translation classroom. Starting with a survey of corpus use within corpus-based translation studies, the didactic value of corpora in the translation classroom and their epistemic value in translation teaching and practice will be elaborated. A typology of…
Multilingual text induced spelling correction

NARCIS (Netherlands)

Reynaert, M.W.C.

2004-01-01

We present TISC, a multilingual, language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from raw text corpora, without supervision, and contains word unigrams
Electronic Corpora as Translation Tools

DEFF Research Database (Denmark)

Laursen, Anne Lise; Mousten, Birthe; Jensen, Vigdis

2012-01-01

translator who has to get a cross-linguistic overview of a new area or a new line of business. Relevant internet texts can be compiled ‘on the fly’, but internet data needs to be sorted and analyzed for rational use. Today, such sorting and analysis can be made by a low-tech, analytical software tool....... This article demonstrates how strategic steps of compiling and retrieving linguistic data by means of specific search strategies can be used to make electronic corpora an efficient tool in translators’ daily work with fields that involve new terminology, but where the skills requested to work correspond...
Russian Folklore as a Reflection of National Character in the Work of Boris Vysheslavtzev

Directory of Open Access Journals (Sweden)

Alex L. Nalepin

2016-09-01

Full Text Available The essay is focused on the spiritual crisis of Russian culture at the beginning of the 20th Century and on the search of philosophical alternatives to overcome the crisis within the framework of Russian philosophical thought. In particular, it highlights the work of Boris P. Vysheslavtzev, a major thinker among Russian immigrants and his studies in Russian folklore seen as reflection of Russian national character. The essay for the first time introduces new data concerning the specificity of the choice that was highly important for Russian literature and culture as it was for Russian folklore studies.
Folklore and Sociolinguistics

Directory of Open Access Journals (Sweden)

John Holmes McDowell

2018-01-01

Full Text Available Folklore and sociolinguistics exist in a symbiotic relationship; more than that, at points—in the ethnography of communication and in ethnopoetics, for example—they overlap and become indistinguishable. As part of a reaction to the formal rigor and social detachment of Chomsky’s theoretical linguistics, sociolinguistics emerges in the mid-twentieth century to assess the role of language in social life. Folklorists join the cause and bring to it a commitment to in-depth ethnography and a longstanding engagement with artistic communication. In this essay, I trace key phases in the development of this interdisciplinary movement, revolutionary in its reorientation of language study to the messy but fascinating realm of speech usage. I offer the concept of performative efficacy, the notion that expressive culture performances have the capacity to shape attitude and action and thereby transform perceived realities, as a means of capturing the continuing promise of a sociolinguistically informed folkloristics.
Primary diffuse large B-cell lymphoma of the corpora cavernosa presented as a perineal mass

Directory of Open Access Journals (Sweden)

González-Satué Carlos

2012-01-01

Full Text Available Primary male genital lymphomas may appear rarely in the testis, and exceptionally in the penis and prostate, but there is no previous evidence of a lymphoma arising from the corpora cavernosa. We report the first case in the literature of a primary diffuse large B-cell lymphoma of the corpora cavernosa presenting with lower urinary tract symptoms, perineal pain and a palpable mass. Diagnosis was based on Tru-Cut biopsy, histopathological studies and computed tomographic images.
Tula song folklore: genre-stylistic and dialectic peculiarities

Directory of Open Access Journals (Sweden)

Krasovskaya Nelli Alexandrovna

2016-06-01

Full Text Available The article analyzes works of Tula folklore recorded in the western part of the Tula region in terms of their genre, stylistic and linguistic features. The study is relevant because Tula folk songs have not been studied and the linguistic features of these works have not been subjected to serious analysis. The article describes the genre features of songs recorded in the Belevsky district of the Tula region, including ancient fortune-telling chants, wedding ceremony songs, romantic ballads, etc., and cites numerous examples from the lyrics that reflect dialectal features at the phonetic, grammatical and lexical levels. According to the authors, the modern folk song retains its genre diversity and is a kind of storehouse of priceless linguistic wealth. The analysis leads to the conclusion that South Russian dialect phonetic and grammatical features are present and well preserved in the recorded songs. So far there is no established typology of Tula dialects; therefore, according to the authors, recording folklore in the territories bordering on Tula dialects is very important and interesting for further descriptive and comparative work on identifying the eastern and south-south-west differences in Tula dialects.
Folkloric Art in Egyptian Schools.

Science.gov (United States)

Osman, Siham

1983-01-01

Theories in art education with a western origin have been applied in Egypt to support the revival of folkloric art. There are three important phases in the teaching of a unit on applique, a decorative craft dating back to the earliest Egyptian history. (AM)
The transformation of contemporary analyses of oral folklore: Fairy tale versus fantasy

OpenAIRE

Otčenášek Jaroslav

2010-01-01

The study focuses on contemporary forms of folklore and their relationship to literary forms like Fantasy, Sci-fi, Horror and Fantasy Game. The first problem is the specification of the terms and the classification of the internal structure of these terms. A typical structure of contemporary oral folklore, such as urban legends, is a combination of classical forms of folklore (subject matter from fairy tales, anecdotes etc.) and the influence of films, television and books. This contami...
Differences in motor abilities between dancers in professional and amateur folklore ansambles

Directory of Open Access Journals (Sweden)

Kocić Jadranka

2014-01-01

Full Text Available Differences in motor abilities between dancers in the Serbian professional folk dance and song ensemble 'Kolo' in Belgrade and the amateur folklore ensembles of the cultural-artistic societies 'Vila' and 'Sonja Marinković' from Novi Sad were tested on a sample of 47 members. Motor abilities were examined using the tests of the Provincial Government Institute for Sport in Novi Sad, yielding 9 variables: single movement speed, explosive strength of the lower extremities (legs), jumping endurance, absolute strength of the back flexor muscles, relative strength of the back flexor muscles, absolute strength of the back extensor muscles, relative strength of the back extensor muscles, absolute strength of the back flexor muscles, and relative strength of the back flexor muscles. Relative values were calculated from the absolute values. Multivariate analysis of variance (MANOVA) was used to determine differences between the folklore dancers across the whole system of variables. Differences in motor abilities between the sexes were also examined. Data were processed with the statistical package SPSS 10.0. The aim was to find significant differences in the nine variables between professional and amateur dancers and between the sexes. The results showed no significant differences between professional and amateur dancers. Between the sexes there were significant differences in favour of men, except in one variable, single movement speed. The conclusion is that, to achieve better, statistically significant results, professional dancers should broaden the content and increase the intensity of their training.
Childbirth in ancient Rome: from traditional folklore to obstetrics.

Science.gov (United States)

Todman, Donald

2007-04-01

In ancient Rome, childbirth was a hazardous event for both mother and child with high rates of infant and maternal mortality. Traditional Roman medicine centred on folklore and religious practices, but with the development of Hippocratic medicine came significant advances in the care of women during pregnancy and confinement. Midwives or obstetrices played an important role and applied rational scientific practices to improve outcomes. This evolution from folklore to obstetrics was a pivotal point in the history of childbirth.
Guidelines for normalising Early Modern English corpora: Decisions and justifications

Directory of Open Access Journals (Sweden)

Archer Dawn

2015-03-01

Full Text Available Corpora of Early Modern English have been collected and released for research for a number of years. With large scale digitisation activities gathering pace in the last decade, much more historical textual data is now available for research on numerous topics including historical linguistics and conceptual history. We summarise previous research which has shown that it is necessary to map historical spelling variants to modern equivalents in order to successfully apply natural language processing and corpus linguistics methods. Manual and semiautomatic methods have been devised to support this normalisation and standardisation process. We argue that it is important to develop a linguistically meaningful rationale to achieve good results from this process. In order to do so, we propose a number of guidelines for normalising corpora and show how these guidelines have been applied in the Corpus of English Dialogues.
"Early baby teeth": Folklore and facts

Directory of Open Access Journals (Sweden)

N Uma Maheswari

2012-01-01

Full Text Available Variations in the newborns' oral cavity have been an enduring interest to the pediatric dentist. The occurrence of natal and neonatal teeth is a rare anomaly, which for centuries has been associated with diverse superstitions among many different ethnic groups. Natal teeth are more frequent than neonatal teeth, the ratio being approximately 3:1. The purpose of this case report is to review the literature related to the natal teeth folklore and misconceptions and discuss their possible etiology and treatment.
Corpora and Language Assessment: The State of the Art

Science.gov (United States)

Park, Kwanghyun

2014-01-01

This article outlines the current state of and recent developments in the use of corpora for language assessment and considers future directions with a special focus on computational methodology. Because corpora began to make inroads into language assessment in the 1990s, test developers have increasingly used them as a reference resource to…
The Nearly Forgotten Malay Folklore: Shall We Start with the Software?

Science.gov (United States)

Abd Rahim, Normaliza

2014-01-01

The study focuses on the nearly forgotten Malay folklore in Malaysia. The objectives of the study were to identify and discuss the types of Malay folklore among primary school learners. The samples of the study were 100 male and female students at schools in Selangor. The samples were picked at random from several schools and they were given…
Linguistic Corpora and Lexicography.

Science.gov (United States)

Meijs, Willem

1996-01-01

Overviews the development of corpus linguistics, reviews the use of corpora in modern lexicography, and presents central issues in ongoing work aimed at broadening the scope of lexicographical use of corpus data. Focuses on how the field has developed in relation to the production of new monolingual English dictionaries by major British…
The Concept of Love in Lithuanian Folklore and Mythology

Directory of Open Access Journals (Sweden)

Doc dr. Daiva Šeškauskaitė

2013-06-01

Full Text Available Love is a reserved feeling, meaning amiability and complete mutual understanding. The concept of love has always been important for the human world outlook and attitude. Love can have different meanings and expressions: for some people it is a nice, warm feeling or a way of acting and behaving, for others it is nothing more than sexual attraction. Just as there are different perceptions of love, there are also a few common manifestations of it: love can be maternal, childish, juvenile or sexual. Love can also be felt for a homeland, one's own nation or home. Naturally, love can be expressed through particular rituals, symbols and signs. Folk songs introduce four main characteristics of a lover: beauty, sweetness, kindness and boon. The latter feature means that a girl or boy is supposed to be well-set and pleasant and comfortable to touch, which is very important when choosing a wife or a husband. Love in folklore is expressed through common metaphorical and allegorical symbols; it is not named by an explicit word but appears rather as a metaphor or epithet. Love in folklore can be perceived and felt very differently: love as an action, love as... a special person, an essential possession. Prime personal characteristics such as kindness, tenderness and humility are the ones that light the fire of love, while beauty, artfulness and eloquence also help. Love is supposed to lead to the sacred sacrament of marriage. Love, if real, is a serious subject. Love is worth dying for. Strong love leads to self-sacrifice. Fairy tales satirize infidelity, stressing that love is right only between a wife and a husband, while other options are considered inglorious and wrong. Love as incest is also common in our folklore.
FOLKLORE STUDIES AND NATIONALISM IN TURKEY TÜRKİYE’DE FOLKLOR ÇALIŞMALARI VE MİLLİYETÇİLİK

Directory of Open Access Journals (Sweden)

İlhan BAŞGÖZ

2011-09-01

Full Text Available Interest in folklore began in Turkey in the second half of the nineteenth century when the need was felt to forge a national language which could be understood by the majority. The Tanzimat reforms, which were introduced in 1839, inaugurated a functional change in Ottoman literature. A new generation of writers who were in contact with the West, especially France, and admired the economic, social, and educational institutions of Europe, soon realized that literature played an important role in the development of these institutions. The aim of creating a literature in the language of the "common people," which was pure Turkish and unspoiled by foreign influences, made the Tanzimat writers interested in folklore and folk literature. Many other poets, novelists, playwrights, and intellectuals joined the movement between 1860 and 1900. The emergence of Turkish nationalism marked a new era in the attitude of intellectuals toward folklore, and it was Boratav who introduced folklore to Turkey as an independent, scientific discipline. He enlarged the scope of folklore teaching and research to include verbal and nonverbal tradition.
Regulation of the corpora allata in male larvae of the cockroach Diploptera punctata

International Nuclear Information System (INIS)

Paulson, C.R.

1986-01-01

The regulation of corpora allata was studied in final instar males of Diploptera punctata. The glands were manipulated in vivo and removed to determine the effect by in vitro radiochemical assay for juvenile hormone synthesis. Corpora allata were also treated with putative regulatory factors in vitro. During the final stadium the corpora allata were inhibited both by nerves and by humoral factors. Neural inhibition was shown by an increase in juvenile hormone synthesis following denervation of the corpora allata. This operation elicited an extra larval instar. Humoral inhibition was shown by the decline in juvenile hormone synthesis of adult female corpora allata following transplantation into final instar larval hosts, and conversely the increase in juvenile hormone synthesis by larval corpora allata following implantation into adult females. Humoral inhibition was prevented by decapitation of larvae prior to the head critical period for molting and restored by implantation of a larval brain, showing that the brain is the source of this inhibition

From the Problems of Dictionaries and Multi-lingual Corpora

Directory of Open Access Journals (Sweden)

Violetta Koseska-Toszewa

2015-06-01

Full Text Available The article describes the work on a number of dictionaries being developed by the Corpus Linguistics and Semantics Group of the Institute of Slavic Studies PAS. They include “Contemporary Bulgarian-Polish Dictionary”, “Bulgarian-Polish Online Dictionary” and “Russian-Bulgarian-Polish Dictionary”. The dictionaries differ in the numbers of entries, as well as in the different degrees of their connection with parallel corpora being elaborated under the “Clarin” project. All the discussed dictionaries are similar with respect to their use of traditional, syntactic classifiers and of semantic classifiers, introduced for the first time in the existing lexicographical practice. Thanks to the “Polish-Bulgarian-Russian Corpus”, the Group has managed to verify the results of contrasting Polish and Bulgarian in the light of scope-based logical quantification. Thanks to the Russian material added to the trilingual corpus, the researchers have managed to confirm the fact that from the viewpoint of “incomplete quantification” Russian and Polish (synthetic languages) behave similarly, and are opposed to the analytic Bulgarian.
Pedagogical Application of Specialized Corpora in ESP Teaching: the case of the UVaSTECorpus

Directory of Open Access Journals (Sweden)

Pedro A. Fuertes-Olivera

2015-11-01

Full Text Available This article contributes to defining the concept of specialized corpora, reviews the rationale for using them instead of general corpora in teaching activities, and offers the state of art in both corpus-based and corpus-driven approaches to ESP teaching. It also explains some decisions taken regarding the compilation of the University of Valladolid Corpus of Written Scientific and Technical English and illustrates some uses of the corpus. In particular, it presents some tasks with concordances and defends that ESP students should be taught the niceties of lexical gender as it is a grammatical category with social and/or ideological implications.
Danish TV Christmas calendars: Folklore, myth and cultural history

DEFF Research Database (Denmark)

Agger, Gunhild

2013-01-01

in which this traditional genre has succeeded in renewing itself. The so-called Pyrus series, TV 2’s Christmas calendars during the mid-1990s, exhibited folklore, myth and cultural history in a combination of entertainment and information. They were succeeded by calendars such as Jul i Valhal......This article aims at characterizing the Danish Christmas calendar as a TV institution and a meeting place for the traditions of the almanac, folklore and the history of culture. Against the background of a brief outline of the history of Danish Christmas calendars, the article explores ways...
THE STRUCTURE OF POEM IN TALE KERINCI FOLKLORE

Directory of Open Access Journals (Sweden)

- Nazurty

2015-06-01

Full Text Available Tale is folklore in the form of a poem that is sung. This study aims to gain an in-depth understanding of the structure of the Tale poem performed at the send-off of the Kerinci pilgrims. This qualitative study employed content analysis as its method, with a structural approach. The results show that a Tale poem consists of a sampiran phrase, a rhyme/sound phrase, and content. It is composed of ten to twenty lines. It has an ab ab rhyme according to the sound phrase flanking each line. The sound expression serves to form rhyme and rhythm.
The Galileo Legend as Scientific Folklore.

Science.gov (United States)

Lessl, Thomas M.

1999-01-01

Examines the various ways in which the legend of Galileo's persecution by the Roman Catholic Church diverges from scholarly readings of the Galileo affair. Finds five distinct themes of scientific ideology in the 40 accounts examined. Assesses the part that folklore plays in building and sustaining a professional ideology for the modern scientific…
Discovery learning in the language-for-translation classroom: corpora as learning aids

Directory of Open Access Journals (Sweden)

Silvia Bernardini

2016-04-01

This contribution reviews the idea of discovery learning with corpora, proposed in the 1990s, evaluating its potential and its implications with reference to the education of translators today. The rationale behind this approach to data-driven learning, combining project-based and form-focused instruction within a socio-constructivistically inspired environment, is discussed. Examples are also provided of authentic, open-ended learning experiences, thanks to which students of translation share responsibility over the development of corpora and their consultation, and teachers can abandon the challenging role of omniscient knowledge providers and wear the more honest hat of "learning experts". Adding to the more straightforward uses of corpora in courses that aim to develop thematic, technological and information mining competences (i.e., in which training is offered in the use of corpora as professional aids), attention is focused on foreign language teaching for translators and on corpora as learning aids, highlighting their potential for the development of the three other European Master's in Translation (EMT) competences (translation service provision, language and intercultural ones).
When phonetics matters: creation and perception of female images in song folklore

Directory of Open Access Journals (Sweden)

Stashko Halyna

2017-06-01

Full Text Available This paper presents a stylistic analysis of female images in American song folklore in order to examine how sound symbolic language elements contribute to the construction of verbal images. The results obtained show the link between sound and meaning and how such phonetic means of stylistics as assonance, alliteration, and onomatopoeia function to reinforce the meanings of words or to set the mood typical of the characters. Their synergy helps create and interpret female images and provides relevant atmosphere and background to them in folk song texts.
Slavic Phraseology: A View Through Corpora

Directory of Open Access Journals (Sweden)

Zakharov Victor

2017-12-01

Full Text Available The study of word collocability is one of the main tasks of linguistics. The combinatory ability of language units, collocability, is one of the syntagmatic laws of language. This phenomenon is the main object of phraseology and lexicography. The article deals with set phrases of different types in Russian, Czech and Slovak from the point of view of their quantitative evaluation. Corpus linguistics understands set phrases as statistically determined units. This approach is the starting point for various automatic ways of extracting idioms and collocations. The paper describes experiments which show how text corpora and corpus methods and tools can be used to expand the entries in existing dictionaries and how set phrases can be evaluated quantitatively. It is shown and argued that corpus linguistics methods and tools make it possible to create dictionaries of a new type, which should include a larger number of set phrases and collocations than before.
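To make the idea of set phrases as statistically determined units more concrete, here is a minimal Python sketch that ranks adjacent word pairs by pointwise mutual information (PMI). PMI is one commonly used association measure and is chosen here only as an assumption; the abstract does not say which measures the authors applied. The function name `bigram_pmi`, the `min_count` threshold and the toy sentence are likewise hypothetical.

```python
import math
from collections import Counter


def bigram_pmi(tokens, min_count=2):
    """Rank adjacent word pairs by pointwise mutual information.

    PMI(x, y) = log2( P(x, y) / (P(x) * P(y)) ), with probabilities
    estimated from raw counts. Pairs seen fewer than `min_count`
    times are skipped, since PMI is unreliable for rare pairs.
    """
    unigrams = Counter(tokens)
    bigrams = Counter(zip(tokens, tokens[1:]))
    n_uni = len(tokens)
    n_bi = max(len(tokens) - 1, 1)
    scores = {}
    for (w1, w2), count in bigrams.items():
        if count < min_count:
            continue
        p_xy = count / n_bi
        p_x = unigrams[w1] / n_uni
        p_y = unigrams[w2] / n_uni
        scores[(w1, w2)] = math.log2(p_xy / (p_x * p_y))
    return sorted(scores.items(), key=lambda kv: kv[1], reverse=True)


# Toy text (an assumption, not corpus data from the study).
tokens = "kick the bucket he did kick the bucket and kick the ball".split()
ranked = bigram_pmi(tokens)
for pair, score in ranked:
    print(pair, round(score, 3))
```

On a real corpus the highest-scoring pairs would be candidates for dictionary entries, to be checked by a lexicographer; other measures such as t-score or log-likelihood could be substituted in the same loop.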
Mergelės Marijos ir akmens sąsajos lietuvių folklore

OpenAIRE

Kairaitytė, Aušra

2008-01-01

The object of this article is the relation between stone and the Blessed Virgin Mary. The aim is to define the functions of stone in narratives about the Mother of God in the Lithuanian folklore, revealing the place of stone during the advent of Mary and finding parallels in the tradition of different Catholic countries. The aim is achieved by applying text analysis and comparative methods. Lithuanian folk stories tell us about growing or walking stones. [...] The other group consists of storie...
“The Foresight to Become a Mermaid”: Folkloric Cyborg Women in Éilís Ní Dhuibhne’s Short Stories

Directory of Open Access Journals (Sweden)

Rebecca Graham

2017-10-01

Full Text Available Éilís Ní Dhuibhne is both a folklorist and a feminist, who “took an interest in rewriting or re-inventing women’s history, a history which had been largely unwritten” (Ní Dhuibhne, “Negotiating” 73). Folklore stories and motifs abound in her writing. Elke D’hoker argues that Ní Dhuibhne reimagines and rewrites folktales to “reflect and interpret the social values and attitudes of a postmodern society” (D’hoker 137). The repurposing of folklore allows Ní Dhuibhne to interrogate some of the complex and controversial ways that Irish society has attempted to represent and control women, entrenching taboos about female behaviours and sexualities. Using Donna Haraway’s cyborg feminism and Karen Barad’s deployment of Haraway’s theory of diffraction, this article focuses on issues of voice and orality, and the female body in “The Mermaid Legend”, “Midwife to the Fairies”, and “Holiday in the Land of Murdered Dreams”, to argue that Ní Dhuibhne’s repurposing of folklore is a radically feminist undertaking. All three short stories, which feature female protagonists, reveal diverse, transgressive, sexual mothers and maidens whose symbolic connections with folklore allow them to challenge the restrictive constructions of women in Irish society, creating spaces to explore alternative, heterogeneous, feminist re-conceptions of identity and belonging.
Visualizing the semantic content of large text databases using text maps

Science.gov (United States)

Combs, Nathan

1993-01-01

A methodology for generating text map representations of the semantic content of text databases is presented. Text maps provide a graphical metaphor for conceptualizing and visualizing the contents and data interrelationships of large text databases. Described are a set of experiments conducted against the TIPSTER corpora of Wall Street Journal articles. These experiments provide an introduction to current work in the representation and visualization of documents by way of their semantic content.
Corpora in Language Teaching and Learning

Science.gov (United States)

Boulton, Alex

2017-01-01

This timeline looks at explicit uses of corpora in foreign or second language (L2) teaching and learning, i.e. what happens when end-users explore corpus data, whether directly via concordancers or integrated into CALL programs, or indirectly with prepared printed materials. The underlying rationale is that such contact provides the massive…
Subdomain sensitive statistical parsing using raw corpora

NARCIS (Netherlands)

Plank, B.; Sima'an, K.

2008-01-01

Modern statistical parsers are trained on large annotated corpora (treebanks). These treebanks usually consist of sentences addressing different subdomains (e.g. sports, politics, music), which implies that the statistics gathered by current statistical parsers are mixtures of subdomains of language
The transformation of contemporary analyses of oral folklore: Fairy tale versus fantasy

Directory of Open Access Journals (Sweden)

Otčenášek Jaroslav

2010-01-01

Full Text Available The study focuses on contemporary forms of folklore and their relationship to literary forms such as fantasy, sci-fi, horror and fantasy games. The first problem is the specification of the terms and the classification of their internal structure. A typical structure of contemporary oral folklore, such as urban legends, is a combination of classical forms of folklore (subject matter from fairy tales, anecdotes etc.) and the influence of films, television and books. This contamination is typical of postmodern culture. Fantasy stories can be divided into five categories: 1. alternative history (variants of past history or future evolution); 2. classical fantasy (variants of mythology or classical fairy tales or legends); 3. parody of fantasy or humorous fantasy (the fantasy world is mostly only background); 4. urban fantasy (more or less a part of urban legend); 5. comics (the importance of graphic form: Superman, Batman etc.). Sci-fi and horror stories are mostly literary products influenced by classical legends or urban legends. Party games, especially “Dungeons & Dragons”, and their enactments by fans are a special part of the fantasy world. Ethnologists are faced with the questions of which method to use to carry out field research and what is actually relevant. Based on the first experiences, the standard methods can be used for research into this “new” field without problems. For a better understanding, however, researchers need to read fantasy, sci-fi and horror books, watch fantasy, sci-fi and horror movies, and get acquainted with websites related to fantasy or sci-fi content. For a good analysis of fantasy party games one needs to become a member of a gamers’ group. The use of modern recording equipment such as digital video cameras and still cameras is also very important.
Corpora and corpus technology for translation purposes in professional and academic environments. Major achievements and new perspectives

Directory of Open Access Journals (Sweden)

Cécile Frérot

2016-04-01

The “use” of corpora and concordancers in translation teaching has grown increasingly attractive since the mid-1990s, with an abundant literature advocating their use and promoting their benefits in the translation classroom. In translator training, efforts are being made to incorporate the use of corpora and concordancers in master’s programmes and to offer specific modules on corpora for translation, as the use of translation memory (TM) systems within Computer-Aided Translation (CAT) courses still dominates. In the translation profession, while TM systems are part of the everyday working environment, the same cannot be said of corpora and concordancers, even though the most recent surveys show that professional translators would like to learn more about the potential of corpora for translation. Overall, the “usefulness” of corpora and corpus technology at the different stages of the translation process remains poorly documented, but a growing number of empirical studies have started to address it, as it has become of paramount importance to assess the extent to which corpora add value to translation quality in both professional and academic environments.
“Not a Thing of the Past”, Zora Neale Hurston and the Living Legacy of Folklore « Not a Thing of the Past », Zora Neale Hurston et le legs vivant du folklore

Directory of Open Access Journals (Sweden)

Margaret Gillespie

2009-11-01

Full Text Available An important though atypical author of the Harlem Renaissance and the first African American anthropologist to study her own culture, Zora Neale Hurston is in many respects an exceptional writer. Unlike others, including Robert Wright and Alain Locke, Hurston in no way repudiates the cultural legacy of Black folklore, which she values according to her own criteria and which would shape both the form and the substance of her art. Trained as an anthropologist, Hurston nevertheless approaches the Black culture of the American South not as a relic of the past to be carefully preserved intact, but as an integral part of present-day lived experience. Through the vernacular oral discursive strategies that she adopts and adapts from the African American folk tradition, Hurston, as a pioneer, opens a path and gives a voice to the Black writers to come.
Text Induced Spelling Correction

NARCIS (Netherlands)

Reynaert, M.W.C.

2004-01-01

We present TISC, a language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from a very large corpus of raw text, without supervision, and contains word
The adversative connectives aber and but in conversational corpora.

Science.gov (United States)

Gülzow, Insa; Bartlitz, Victoria; Kuehnast, Milena; Golcher, Felix; Bittner, Dagmar

2018-03-09

We analyzed the conversational corpora of two German and two English children to investigate how the different use types of the adversative connectives aber and but influence the probability of monologically versus dialogically constructed utterances in the first year of use. Our findings show that children produce adversative connectives mainly in dialogic structures for illocutionary and theme-management purposes, but that the use types of adversative connectives lead to a different distribution of monologic and dialogic clause combinations. The results suggest that monologic and dialogic realizations as a function of text type must be considered when describing the developmental trajectory of the different use types of adversative connectives.
Developing resources for sentiment analysis of informal Arabic text in social media

OpenAIRE

Itani, Maher; Roast, Chris; Al-Khayatt, Samir

2017-01-01

Natural Language Processing (NLP) applications such as text categorization, machine translation, sentiment analysis, etc., need annotated corpora and lexicons to check quality and performance. This paper describes the development of resources for sentiment analysis specifically for Arabic text in social media. A distinctive feature of the corpora and lexicons developed is that they are derived from informal Arabic that does not conform to grammatical or spelling standards. We refer to Ara...
Use of monolingual and comparable corpora in the classroom to translate adverbial connectors

Directory of Open Access Journals (Sweden)

Beatriz Sánchez Cárdenas

2016-04-01

This research explored the reasons why certain adverbial discourse connectors, apparently easy to translate, are a source of translation problems that cannot be easily resolved with a bilingual dictionary. Moreover, this study analyzed the use of parallel corpora in the translation classroom and how it can increase the quality of text production. For this purpose, we compared student translations before and after receiving training on the use of corpus analysis tools.

Gonadotropin binding sites in human ovarian follicles and corpora lutea during the menstrual cycle

Energy Technology Data Exchange (ETDEWEB)

Shima, K.; Kitayama, S.; Nakano, R.

1987-05-01

Gonadotropin binding sites were localized by autoradiography after incubation of human ovarian sections with ¹²⁵I-labeled gonadotropins. The binding sites for ¹²⁵I-labeled human follicle-stimulating hormone (¹²⁵I-hFSH) were identified in the granulosa cells and in the newly formed corpora lutea. The ¹²⁵I-labeled human luteinizing hormone (¹²⁵I-hLH) binding to the thecal cells increased during follicular maturation, and a dramatic increase was preferentially observed in the granulosa cells of the large preovulatory follicle. In the corpora lutea, the binding of ¹²⁵I-hLH increased from the early luteal phase and decreased toward the late luteal phase. The changes in 3β-hydroxysteroid dehydrogenase activity in the corpora lutea corresponded to the ¹²⁵I-hLH binding. Thus, the changes in gonadotropin binding sites in the follicles and corpora lutea during the menstrual cycle may help in some important way to regulate human ovarian function.
Mining knowledge from text repositories using information extraction ...

Indian Academy of Sciences (India)

Information extraction (IE); text mining; text repositories; knowledge discovery from .... general purpose English words. However ... of precision and recall, as extensive experimentation is required due to lack of public tagged corpora. 4. Mining ...
"Sempre tivemos mulheres nos cantos e nas cordas": uma pesquisa sobre o lugar feminino nas corporações musicais

Directory of Open Access Journals (Sweden)

Mayara Pacheco Coelho

2014-04-01

Full Text Available This article is part of an intervention-research project on music and its identity articulations in the musical corporations (bands and orchestras) of the Campos das Vertentes region, especially São João del-Rei and neighbouring towns. In this region, music plays a significant role in shaping citizens' cultural identity and in the history of the municipalities. The present study investigates gender determinations, seeking to understand how women musicians participate in the region's bands and orchestras. To this end, archaeological discourse analysis was used to contrast the statements of women musicians with those of male musicians in the corporations, as well as with male voices in philosophy and with utopian discourse about women. Traditional gender differences were found to persist, concealed, in the everyday life of the musical corporations. However, women musicians are also beginning to be recognized in the corporations and, above all, to recognize themselves as capable of taking flight within them.
An analysis on the entity annotations in biological corpora [v1; ref status: indexed, http://f1000r.es/2o0]

Directory of Open Access Journals (Sweden)

Mariana Neves

2014-04-01

Full Text Available Collections of documents annotated with semantic entities and relationships are crucial resources to support development and evaluation of text mining solutions for the biomedical domain. Here I present an overview of 36 corpora and an analysis of the semantic annotations they contain. Annotations for entity types were classified into six semantic groups, and an overview of the semantic entities that can be found in each corpus is shown. Results show that while some semantic entities, such as genes, proteins and chemicals, are consistently annotated in many collections, corpora available for diseases, variations and mutations are still few, in spite of their importance in the biological domain.
Rileggendo “Folklore e profitto”. Patrimoni immateriali, mercati, turismo

Directory of Open Access Journals (Sweden)

Letizia Bindi

2014-04-01

Full Text Available Starting from the anticipatory notes of Luigi M. Lombardi Satriani’s Folklore e profitto [1973], the paper seeks to critically articulate the relation between cultural heritage, the capitalist market and mass media, extending the analysis to the most recent uses of media in promoting and valorizing such traditions. What emerges is a turn of cultural heritage toward consumerism that poses new challenges and questions to anthropologists and cultural heritage scholars and calls for a late-modern rethinking of critical categories such as commodification, alienation and fetishization. Finally, a central question arises about who and what should today be the social actors asked to decide about these processes of cultural manipulation in the new post-industrial and globalized scenario, characterized, inter alia, by a generalized economic crisis.
El narco-folklore: narrativas e historias de la droga en la frontera

Directory of Open Access Journals (Sweden)

Howard Campbell

2007-01-01

Full Text Available What the United States government has called the War on Drugs rests on the idea that drug use and trafficking are unequivocally harmful and dangerous activities that the country's population will fear and reject. Nevertheless, the results of ethnographic studies on the United States-Mexico border indicate that drug trafficking has become so common an activity that it has generated its own style of subculture, including music and folklore. To date, anthropological studies of narco-culture on the border have focused on narcocorridos, a genre of popular Mexican music that celebrates and narrates the drug trade and the lives of high-level traffickers. These studies provide valuable insight into the inner workings of drug organizations and the cultural context from which they emerge. However, most workers in the drug trade are not the superheroes or rich bandits portrayed in the narcocorridos. They are ordinary people whose main motivation for entering the drug world is economic survival. The image of a rich folklore of drug trafficking has become a common feature of the El Paso / Ciudad Juárez border region. This ethnographic study shows how this trade has become a normal part of daily life. The everyday folklore surrounding drug trafficking indicates the degree to which the drug trade affects border residents on multiple levels.
Folklore Traditions in Contemporary Everyday Life: Between Continuity and (Re)construction (based on two examples from the Czech Republic)

Czech Academy of Sciences Publication Activity Database

Uhlíková, Lucie; Pavlicová, M.

2014-01-01

Vol. 62, No. 2 (2014), pp. 163-181 ISSN 1335-1303 Institutional support: RVO:68378076 Keywords : folklore * folklorism * ethno-cultural tradition * social construction * everyday life * the Czech Republic Subject RIV: AC - Archeology, Anthropology, Ethnology
Combinatorial and compositional aspects of bilingual aligned corpora

NARCIS (Netherlands)

Martzoukos, S.

2016-01-01

The subject of investigation of this thesis is the building blocks of translation in Statistical Machine Translation (SMT). We find that these building blocks, namely phrase-level dictionary entries, which are extracted from bilingual aligned corpora (training data), admit richer structure than
Uma investigação dos sentidos de um phrasal verb por meio dos corpora e dicionários on-line

Directory of Open Access Journals (Sweden)

Emiliana Fernandes Bonalumi

2014-06-01

Full Text Available In this study we analyse the use of the phrasal verb throw up in two online corpora originally written in English, namely the British National Corpus (BNC) and the Corpus of Contemporary American English (COCA), as well as in the textbook adopted in the classroom, New English File Upper-Intermediate, with the support of the online dictionaries Cambridge Online Dictionary and Macmillan Dictionary. We aim to identify, classify and generalize the use and meanings of the selected phrasal verb in these online corpora in relation to its use and meaning in the textbook. Through online corpora and dictionaries, students will expand their knowledge of the use and meanings of a given phrasal verb, such as the one analysed in this investigation. Keywords: corpus linguistics; data-driven learning; phrasal verbs.
Text collections for evaluation of Russian morphological taggers

Directory of Open Access Journals (Sweden)

Lyashevskaya Olga

2017-12-01

Full Text Available The paper describes the preparation and development of the text collections within the framework of the MorphoRuEval-2017 shared task, an evaluation campaign designed to stimulate development of automatic morphological processing technologies for Russian. The main challenge for the organizers was to standardize all available Russian corpora with manually verified high-quality tagging to a single format (Universal Dependencies CoNLL-U). The sources of the data were the disambiguated subcorpus of the Russian National Corpus, SynTagRus, OpenCorpora.org data and the GICR corpus with resolved homonymy, all exhibiting different tagsets, rules for lemmatization, pipeline architecture, technical solutions and error systematicity. The collections include both normative texts (news and modern literature) and more informal discourse (social media and spoken data); the texts are available under the CC BY-NC-SA 3.0 license.
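For readers unfamiliar with the target format, the following is a minimal sketch of reading CoNLL-U, in which each token line carries ten tab-separated columns, comment lines start with `#`, and a blank line ends a sentence. The sample sentence, the `parse_conllu` function name and the `FIELDS` tuple are illustrative assumptions, not taken from the MorphoRuEval-2017 data or tooling.

```python
# Column names follow the Universal Dependencies CoNLL-U specification.
FIELDS = ("id", "form", "lemma", "upos", "xpos",
          "feats", "head", "deprel", "deps", "misc")


def parse_conllu(text):
    """Return a list of sentences; each sentence is a list of token dicts."""
    sentences, current = [], []
    for line in text.splitlines():
        if not line.strip():            # blank line closes the sentence
            if current:
                sentences.append(current)
                current = []
            continue
        if line.startswith("#"):        # sentence-level comment or metadata
            continue
        cols = line.split("\t")
        if len(cols) != len(FIELDS):
            raise ValueError(f"expected 10 columns, got {len(cols)}: {line!r}")
        token = dict(zip(FIELDS, cols))
        # FEATS is "_" when empty, otherwise "Name=Value|Name=Value".
        feats = token["feats"]
        token["feats"] = {} if feats == "_" else dict(
            pair.split("=", 1) for pair in feats.split("|"))
        # IDs stay strings so that ranges ("1-2") and empty nodes ("1.1") survive.
        current.append(token)
    if current:                         # file may lack a trailing blank line
        sentences.append(current)
    return sentences


# Hypothetical sample, written by hand for illustration only.
SAMPLE = "\n".join([
    "# text = Мама мыла раму.",
    "1\tМама\tмама\tNOUN\t_\tCase=Nom|Number=Sing\t2\tnsubj\t_\t_",
    "2\tмыла\tмыть\tVERB\t_\tTense=Past|VerbForm=Fin\t0\troot\t_\t_",
    "3\tраму\tрама\tNOUN\t_\tCase=Acc|Number=Sing\t2\tobj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
    "",
])

sentences = parse_conllu(SAMPLE)
```

Converting corpora with different tagsets to this layout then amounts to mapping each source tag onto the `upos` and `feats` columns, which is where the differences among the source corpora would have to be reconciled.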
Folklore information from Assam for family planning and birth control.

Science.gov (United States)

Tiwari, K C; Majumder, R; Bhattacharjee, S

1982-11-01

The author collected folklore information on herbal treatments to control fertility from different parts of Assam, India. Temporary methods of birth control include Cissampelos pareira L. in combination with Piper nigrum L., root of Mimosa pudica L. and Hibiscus rosa-sinensis L. Plants used for permanent sterilization include Plumbago zeylanica L., Heliotropium indicum L., Salmalia malabarica, Hibiscus rosa-sinensis L., Plumeria rubra L., Bambusa arundinacea. Abortion is achieved through use of Osbeckia nepalensis or Carica papaya L. in combination with resin from Ferula narthex Boiss. It is concluded that there is tremendous scope for the collection of folklore about medicine, family planning agents, and other treatments from Assam and surrounding areas. Such a project requires proper understanding between the survey team and local people, tactful behavior, and a significant amount of time. Monetary rewards can also be helpful for obtaining information from potential respondents.
Dissemination of Values and Culture through the E-Folklore

Science.gov (United States)

Rahim, Normaliza Abd; Affendi, Nik Rafidah Nik Muhammad; Pawi, Awang Azman Awang

2017-01-01

This study focuses on the values and culture in the e-folklore. The objectives of the study were to identify and discuss the values in the song lyric "The Stork and the Mouse Deer." The song was taken from phone application in the compilation of the "Kingfisher stories" copyrighted by Dewan Bahasa and Pustaka. The e-folklore…
Early Years Education and the Value for Money Folklore

Science.gov (United States)

Campbell-Barr, Verity

2012-01-01

This article is intended as a contribution to the debate on the role of human capital in determining value for money in early years education. The article explores how the idea that early years education offers value for money has become folklore amongst policymakers and more widely. However, drawing on both interview data and existing literature…
Human attitudes towards herpetofauna: the influence of folklore and negative values on the conservation of amphibians and reptiles in Portugal.

Science.gov (United States)

Ceríaco, Luis Mp

2012-02-08

Human values and folklore of wildlife strongly influence the effectiveness of conservation efforts. These values and folklore may also vary with certain demographic characteristics such as gender, age, or education. Reptiles and amphibians are among the least appreciated of vertebrates and are victims of many negative values and wrong ideas resulting from the direct interpretation of folklore. We try to demonstrate how these values and folklore can affect the way people relate to them and also the possible conservation impacts on these animals. A questionnaire survey distributed to 514 people in the district of Évora, Portugal, was used to obtain data regarding the hypothesis that the existence of wrong ideas and negative values contributes to the phenomenon of human-associated persecution of these animals. A structural equation model was specified in order to confirm the hypothesis about the possible relationships between the presence of perceptions and negative values about amphibians and reptiles and persecution and anti-conservation attitudes. Sociodemographic variables were also added. The results of the model suggest that the presence of folklore and negative values clearly predicts persecution and anti-conservation attitudes towards amphibians and reptiles. Also, the existence of folklore varies sociodemographically, but negative values concerning these animals are widespread in the population. With the use of structural equation models, this work is a contribution to the study of how certain ideas and values can directly influence human attitudes towards herpetofauna and how they can be a serious conservation issue.
Citation Matching in Sanskrit Corpora Using Local Alignment

Science.gov (United States)

Prasad, Abhinandan S.; Rao, Shrisha

Citation matching is the problem of finding which citation occurs in a given textual corpus. Most existing citation matching work is done on scientific literature. The goal of this paper is to present methods for performing citation matching on Sanskrit texts. Exact matching and approximate matching are the two methods for performing citation matching. The exact matching method checks for exact occurrence of the citation with respect to the textual corpus. Approximate matching is a fuzzy string-matching method which computes a similarity score between an individual line of the textual corpus and the citation. The Smith-Waterman-Gotoh algorithm for local alignment, which is generally used in bioinformatics, is used here for calculating the similarity score. This similarity score is a measure of the closeness between the text and the citation. The exact- and approximate-matching methods are evaluated and compared. The methods presented can be easily applied to corpora in other Indic languages like Kannada, Tamil, etc. The approximate-matching method can in particular be used in the compilation of critical editions and plagiarism detection in a literary work.
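The approximate-matching idea can be sketched as follows: a local alignment score with affine gap penalties (the Smith-Waterman recurrence with Gotoh's gap handling) is computed between a citation and one line of the corpus, then normalized to a similarity in [0, 1]. The scoring parameters, the normalization and the function names are assumptions chosen for illustration; the abstract does not state the values the authors used.

```python
def local_alignment_score(a, b, match=2, mismatch=-1,
                          gap_open=-2, gap_extend=-1):
    """Best local alignment score of strings a and b with affine gaps.

    A gap of length k costs gap_open + (k - 1) * gap_extend.
    All parameter values are illustrative assumptions.
    """
    neg_inf = float("-inf")
    cols = len(b) + 1
    h_prev = [0] * cols          # H: best score of an alignment ending at (i-1, j)
    f_prev = [neg_inf] * cols    # F: same, but ending in a gap that skips a[i-1]
    best = 0
    for i in range(1, len(a) + 1):
        h_cur = [0] * cols
        f_cur = [neg_inf] * cols
        e = neg_inf              # E: ending in a gap that skips b[j-1]
        for j in range(1, cols):
            e = max(e + gap_extend, h_cur[j - 1] + gap_open)
            f_cur[j] = max(f_prev[j] + gap_extend, h_prev[j] + gap_open)
            sub = match if a[i - 1] == b[j - 1] else mismatch
            # The 0 lets an alignment restart anywhere: this makes it local.
            h_cur[j] = max(0, h_prev[j - 1] + sub, e, f_cur[j])
            best = max(best, h_cur[j])
        h_prev, f_prev = h_cur, f_cur
    return best


def similarity(citation, line, match=2):
    """Normalize the score by the best possible score for the shorter string."""
    shortest = min(len(citation), len(line))
    if shortest == 0:
        return 0.0
    return local_alignment_score(citation, line, match=match) / (match * shortest)
```

With these settings, `similarity("dharma", "dharma")` is 1.0, and a citation that differs from a corpus line by a single inserted character still scores close to 1, which is the behaviour that makes the approach usable for variant readings.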
Promoting free dialog video corpora: the IFADV corpus example

NARCIS (Netherlands)

van Son, R.J.J.H.; Wesseling, W.; Sanders, E.; van den Heuvel, H.; Kipp, M.; Martin, J.C.; Paggio, P.; Heylen, D.

2009-01-01

Research into spoken language has become more visual over the years. Both fundamental and applied research have progressively included gestures, gaze, and facial expression. Corpora of multi-modal conversational speech are rare and frequently difficult to use due to privacy and copyright
American Folk Music and Folklore Recordings 1985: A Selected List.

Science.gov (United States)

Library of Congress, Washington, DC. American Folklife Center.

Thirty outstanding records and tapes of traditional music and folklore which were released in 1985 are described in this illustrated booklet. All of these recordings are annotated with liner notes or accompanying booklets relating the recordings to the performers, their communities, genres, styles, or other pertinent information. The items are…
Human attitudes towards herpetofauna: The influence of folklore and negative values on the conservation of amphibians and reptiles in Portugal

Science.gov (United States)

2012-01-01

Background: Human values and folklore of wildlife strongly influence the effectiveness of conservation efforts. These values and folklore may also vary with certain demographic characteristics such as gender, age, or education. Reptiles and amphibians are among the least appreciated of vertebrates and are victims of many negative values and wrong ideas resulting from the direct interpretation of folklore. We try to demonstrate how these values and folklore can affect the way people relate to them and also the possible conservation impacts on these animals. Methods: A questionnaire survey distributed to 514 people in the district of Évora, Portugal, was used to obtain data regarding the hypothesis that the existence of wrong ideas and negative values contributes to the phenomenon of human-associated persecution of these animals. A structural equation model was specified in order to confirm the hypothesis about the possible relationships between the presence of perceptions and negative values about amphibians and reptiles and persecution and anti-conservation attitudes. Sociodemographic variables were also added. Results: The results of the model suggest that the presence of folklore and negative values clearly predicts persecution and anti-conservation attitudes towards amphibians and reptiles. Also, the existence of folklore varies sociodemographically, but negative values concerning these animals are widespread in the population. Conclusions: With the use of structural equation models, this work is a contribution to the study of how certain ideas and values can directly influence human attitudes towards herpetofauna and how they can be a serious conservation issue. PMID:22316318
How Can We Use Corpus Wordlists for Language Learning? Interfaces between Computer Corpora and Expert Intervention

Science.gov (United States)

Chen, Yu-Hua; Bruncak, Radovan

2015-01-01

With the advances in technology, wordlists retrieved from computer corpora have become increasingly popular in recent years. The lexical items in those wordlists are usually selected, according to a set of robust frequency and dispersion criteria, from large corpora of authentic and naturally occurring language. Corpus wordlists are of great value…
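As a rough illustration of selecting items by frequency and dispersion, the sketch below keeps a word only if it occurs often enough overall and in enough different texts. Using range (the number of texts containing the word) as the dispersion measure, the thresholds and the tokenizer are all assumptions made for this example; the abstract does not specify the criteria.

```python
import re
from collections import Counter


def build_wordlist(texts, min_freq=3, min_range=2):
    """Return words meeting both thresholds, most frequent first.

    min_freq:  minimum total occurrences across all texts (assumed value).
    min_range: minimum number of texts containing the word (assumed value).
    """
    freq = Counter()        # total occurrences per word
    doc_range = Counter()   # number of texts in which each word appears
    for text in texts:
        tokens = re.findall(r"[a-z']+", text.lower())
        freq.update(tokens)
        doc_range.update(set(tokens))
    selected = [w for w in freq
                if freq[w] >= min_freq and doc_range[w] >= min_range]
    # Sort by descending frequency, then alphabetically for a stable order.
    return sorted(selected, key=lambda w: (-freq[w], w))


texts = ["the cat sat on the mat", "the dog sat", "a cat and the dog"]
wordlist = build_wordlist(texts, min_freq=2, min_range=2)
```

A word that is frequent in only one text fails the range test, which is the usual reason for pairing a frequency threshold with a dispersion threshold.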
Learner Corpora without Error Tagging

Directory of Open Access Journals (Sweden)

Rastelli, Stefano

2009-01-01

Full Text Available The article explores the possibility of adopting a form-to-function perspective when annotating learner corpora in order to gain deeper insights into systematic features of interlanguage. A split between forms and functions (or categories) is desirable in order to avoid the "comparative fallacy" and because, especially in basic varieties, forms may precede functions (e.g., what resembles a "noun" might have a different function) or a function may show up in unexpected forms. In the computer-aided error analysis tradition, all items produced by learners are traced to a grid of error tags which is based on the categories of the target language. By contrast, we believe it is possible to record and make retrievable both words and sequences of characters independently of their functional-grammatical label in the target language. For this purpose, at the University of Pavia we adapted a probabilistic POS tagger designed for L1 to L2 data. Despite the criticism that this operation may raise, we found that it is better to work with "virtual categories" than with errors. The article outlines the theoretical background of the project and gives some examples that illustrate the potential of SLA-oriented (non-error-based) tagging.

Folklore as historical and cultural legacy of the lower Volga region in the first third of the XXth century: B.S. Laschilin, A.M. Listopadov

Directory of Open Access Journals (Sweden)

Rodionova Olga Igorevna

2013-11-01

Full Text Available The present article considers the folklore phenomenon in the folk art of the Lower Volga Region in the first third of the 20th century. In the course of the research, particular emphasis was placed on Cossack subject matter. The role of B.S. Laschilin and A.M. Listopadov in collecting and publishing folk art, the folklore of the Don Cossacks, is revealed. Boris Stepanovitch Laschilin’s work had a great impact on the artistic life of our region. B.S. Laschilin’s books, which were published in Rostov-on-Don, Saratov and Stalingrad-Volgograd, contained tales, fairy tales, bylinas, legends, songs, ditties, proverbs, sayings, ancient dramas of the first Russian folk theatres, and exorcisms. Boris Stepanovitch kept selecting songs, ditties and chastooshkas for the Voronezh Folk Choir “Voronezh girls”, which are still in the repertoire of the Pyatnitsky Russian Folk Chorus. Folklorist and musician Alexander Mikhailovich Listopadov, who collected and studied folk songs from his youth and recorded them in the hamlets and Cossack villages of the Don Region, spent more than 50 years of his life researching the musical culture of the Don Cossacks. Alexander Mikhailovich Listopadov’s heritage made an important contribution to the study of native musical folklore. Folklore compositions are a unique source of knowledge of history, way of life, and moral and other national concepts, which allows us to reconstruct the linguistic personality of a particular historical epoch.
Studies on luteinizing hormone receptors of human corpora lutea during menstrual cycle and pregnancy

International Nuclear Information System (INIS)

Izumi, Yasushi

1982-01-01

With the purpose of explicating the lifespan of human corpora lutea, using human corpora lutea of the menstrual cycle and pregnancy, binding of ¹²⁵I-LH to the 20,000g cell membrane fraction was examined. 1) Specific bindings of ¹²⁵I-LH, ¹²⁵I-HCG were demonstrated in the 20,000g cell membrane fraction. Although LH and HCG were parallel in inhibiting ¹²⁵I-LH binding, HCG was found to be more effective. FSH did not inhibit binding. 2) Binding of ¹²⁵I-LH was dependent on time, temperature, ¹²⁵I-LH concentration, amount of the cell membrane fraction protein and pH. The highest binding was seen at pH 6.0 while incubating for 60 min at 37 °C. 3) The number of LH receptors in human corpora lutea of the menstrual cycle increased towards midluteal phase, especially on 5th day from ovulation, and decreased towards late luteal phase. LH receptor was not found in corpus albicans. The apparent dissociation constant of each corpus luteum did not change throughout the menstrual cycle. 4) Corpora lutea of pregnancy contained a few or no receptors which bound ¹²⁵I-LH specifically. These data suggest that LH receptor is an important factor regulating the lifespan of corpus luteum and exogenous HCG has effect on luteal insufficiency, but the effect of HCG on threatened abortion is uncertain. (author)
Correlation among foetal number, corpora lutea and plasma progesterone in Rockland-Swiss mice. [Progesterone determination by radioimmunoassay]

Energy Technology Data Exchange (ETDEWEB)

Simon, N G; Bridges, R S; Gandelmann, R (Rutgers - the State Univ., New Brunswick, NJ (USA). Dept. of Psychology; Rutgers - the State Univ., Newark, NJ (USA). Inst. of Animal Behavior)

1978-01-01

The relationship among plasma progesterone, number of corpora lutea, and foetal number was assessed in Rockland-Swiss albino mice. While number of corpora lutea and foetal number were significantly correlated, neither was related to plasma progesterone level. This finding in the mouse is similar to results reported in the rabbit.
Some Benefits of Corpora as a Language Learning Tool

Science.gov (United States)

Marjanovic, Tatjana

2012-01-01

What this paper is meant to do is share illustrations and insights into how English learners and teachers alike can benefit from using corpora in their work. Arguments are made for their multifaceted possibilities as grammatical, lexical and discourse pools suitable for discovering ways of the language, be they regularities or idiosyncrasies. The…
Studies on luteinizing hormone receptors of human corpora lutea during menstrual cycle and pregnancy

Energy Technology Data Exchange (ETDEWEB)

Izumi, Yasushi (Keio Univ., Tokyo (Japan). School of Medicine)

1982-10-01

With the purpose of explicating the lifespan of human corpora lutea, using human corpora lutea of the menstrual cycle and pregnancy, binding of 125I-LH to the 20,000g cell membrane fraction was examined. 1) Specific bindings of 125I-LH, 125I-HCG were demonstrated in the 20,000g cell membrane fraction. Although LH and HCG were parallel in inhibiting 125I-LH binding, HCG was found to be more effective. FSH did not inhibit binding. 2) Binding of 125I-LH was dependent on time, temperature, 125I-LH concentration, amount of the cell membrane fraction protein and pH. The highest binding was seen at pH 6.0 while incubating for 60 min at 37 °C. 3) The number of LH receptors in human corpora lutea of the menstrual cycle increased towards midluteal phase, especially on 5th day from ovulation, and decreased towards late luteal phase. LH receptor was not found in corpus albicans. The apparent dissociation constant of each corpus luteum did not change throughout the menstrual cycle. 4) Corpora lutea of pregnancy contained a few or no receptors which bound 125I-LH specifically. These data suggest that LH receptor is an important factor regulating the lifespan of corpus luteum and exogenous HCG has effect on luteal insufficiency, but the effect of HCG on threatened abortion is uncertain.
La aldea fantasma: Problemas en el estudio del folklore y la cultura popular contemporáneos

Directory of Open Access Journals (Sweden)

Díaz G. Viana, Luis

2003-06-01

Full Text Available The author analyzes the problems involved in the study of folklore and popular culture in a contemporary world that is transnational and hybrid, and apparently different from what the object/subject of study was supposed to be. Nevertheless, he argues that the type of urban legends we can gather today through the Internet does not differ from traditional materials such as legends, games or mores, since they talk (as those used to) about people trying to make sense of an ever-changing and mixed world.

El autor ofrece un análisis de la problemática relacionada con el estudio del folklore y la cultura popular en el mundo contemporáneo, transnacional e híbrido, aparentemente distinto de lo que se suponía que era el objeto/sujeto de estudio tradicional. Sin embargo, argumenta que el tipo de leyendas urbanas que podemos recopilar hoy a través de internet no es diferente de los materiales tradicionales, tales como leyendas, juegos o costumbres; ya que de lo que hablan éstos, al igual que aquéllos, es de las preocupaciones de las personas por dar sentido a un mundo siempre cambiante y siempre en contacto.
Effects of prostaglandin F2 alpha and a gonadotropin-releasing hormone agonist on inositol phospholipid metabolism in isolated rat corpora lutea of various ages

International Nuclear Information System (INIS)

Lahav, M.; West, L.A.; Davis, J.S.

1988-01-01

The sensitivity of rat corpora lutea to luteolytic agents increases with luteal age. We examined the effect of prostaglandin F2 alpha (PGF2 alpha) and [D-Ala6,Des-Gly10]GnRH ethylamide (GnRHa) on inositol phospholipid metabolism in day 2 and day 7 corpora lutea from PMSG-treated rats. Isolated corpora lutea were incubated with 32PO4 or [3H]inositol and were treated with LH, PGF2 alpha, or GnRHa. Phospholipids were purified by TLC, and the water-soluble products of phospholipase-C activity (inositol phosphates) were isolated by ion exchange chromatography. In day 2 corpora lutea, PGF2 alpha (10 microM) and GnRHa (100 ng/ml) significantly increased 32PO4 incorporation into phosphatidic acid (PA) and phosphatidylinositol (PI), but not into other fractions. LH provoked slight increases in PA. Results were similar with 30 min of prelabeling or simultaneous addition of 32PO4 and stimulants. In other experiments, PGF2 alpha and GnRHa provoked rapid increases (1-5 min) in the accumulation of inositol mono-, bis-, and trisphosphates. LH did not significantly increase inositol phosphate accumulation, but stimulated cAMP accumulation in 2-day-old corpora lutea. Inositol phospholipid metabolism was increased in day 7 corpora lutea compared to that in day 2 corpora lutea. This increase was associated with increased incorporation of 32PO4 into PA and PI and increased accumulation of [3H]inositol phosphates. In day 7 corpora lutea, which are very sensitive to the luteolytic effect of PGF2 alpha, the PG-induced increase in PA labeling was small and inconsistent, whereas PI labeling was unaffected in 30-min incubations. GnRHa was without effect in such corpora lutea. LH, PGF2 alpha, or GnRHa did not increase inositol phosphate accumulation in 7-day-old corpora lutea. These studies demonstrate that the transformation of young (day 2) to mature (day 7) corpora lutea is associated with an increase in luteal inositol phospholipid metabolism.
Lexical bundles in an advanced INTOCSU writing class and engineering texts: A functional analysis

Science.gov (United States)

Alquraishi, Mohammed Abdulrahman

The purpose of this study is to investigate the functions of lexical bundles in two corpora: a corpus of engineering academic texts and a corpus of IEP advanced writing class texts. This study is concerned with the nature of formulaic language in Pathway IEPs and engineering texts, and whether those types of texts show similar or distinctive formulaic functions. Moreover, the study looked into lexical bundles found in an engineering 1.26 million-word corpus and an ESL 65000-word corpus using a concordancing program. The study then analyzed the functions of those lexical bundles and compared them statistically using chi-square tests. Additionally, the results of this investigation showed 236 unique frequent lexical bundles in the engineering corpus and 37 bundles in the pathway corpus. Also, the study identified several differences between the density and functions of lexical bundles in the two corpora. These differences were evident in the distribution of functions of lexical bundles and the minimal overlap of lexical bundles found in the two corpora. The results of this study call for more attention to formulaic language at ESP and EAP programs.
Folklore, creativity, and cultural memory

DEFF Research Database (Denmark)

Glaveanu, Vlad Petre

This paper addresses the question of how folk art can be, simultaneously, a vehicle for cultural memory and cultural creativity. It takes the case of Romanian Easter egg decoration as a practice situated at the intersection between art, folklore, religion and a growing market, in order to unpack the role of tradition and creativity in the life of a rural community. Egg decoration is an old custom, with pre-Christian roots, practiced extensively in the historical region of Bucovina, and relying on a complex system of material artefacts and symbolic elements acquired and enacted by artisans usually... means the opposite of creativity but the actual vehicle of creative activity and its understanding as a stable cultural system ‘engraved’ in collective memory needs to be challenged. The tradition of egg decoration in Romania is a living and evolving social practice that engages the self and community...
Text mixing shapes the anatomy of rank-frequency distributions

Science.gov (United States)

Williams, Jake Ryland; Bagrow, James P.; Danforth, Christopher M.; Dodds, Peter Sheridan

2015-05-01

Natural languages are full of rules and exceptions. One of the most famous quantitative rules is Zipf's law, which states that the frequency of occurrence of a word is approximately inversely proportional to its rank. Though this "law" of ranks has been found to hold across disparate texts and forms of data, analyses of increasingly large corpora since the late 1990s have revealed the existence of two scaling regimes. These regimes have thus far been explained by a hypothesis suggesting a separability of languages into core and noncore lexica. Here we present and defend an alternative hypothesis that the two scaling regimes result from the act of aggregating texts. We observe that text mixing leads to an effective decay of word introduction, which we show provides accurate predictions of the location and severity of breaks in scaling. Upon examining large corpora from 10 languages in the Project Gutenberg eBooks collection, we find emphatic empirical support for the universality of our claim.
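As a rough illustration of the rank-frequency relation described above, the following Python sketch counts word frequencies in a text and pairs each word with its rank. The function name `rank_frequency`, the simple tokenizer and the toy sentence are assumptions made for this example and do not come from the paper. Under Zipf's law, the product of rank and frequency would stay roughly constant across ranks in a large corpus.

```python
from collections import Counter
import re


def rank_frequency(text):
    """Return (rank, word, count) tuples, most frequent word first."""
    # Lowercase and keep runs of letters/apostrophes; a deliberately simple tokenizer.
    words = re.findall(r"[a-z']+", text.lower())
    counts = Counter(words)
    # Sort by descending count, then alphabetically so ties are deterministic.
    ordered = sorted(counts.items(), key=lambda item: (-item[1], item[0]))
    return [(rank, word, count) for rank, (word, count) in enumerate(ordered, start=1)]


if __name__ == "__main__":
    sample = "the cat and the dog and the bird"
    for rank, word, count in rank_frequency(sample):
        # In a large corpus, rank * count would be roughly constant under Zipf's law.
        print(rank, word, count, rank * count)
```

A toy sentence is far too small to show the scaling regimes the abstract discusses; the sketch only shows how the ranked list is built before any fitting is done.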
Preceitos e normas internas (kakun) de casas comerciais japonesas: um estudo sobre a longevidade e a ética da corporação japonesa

Directory of Open Access Journals (Sweden)

Isao Yamamoto

Full Text Available The study of corporations from one of the world's largest economies is justified in the borderless world in which we live today, where cultural differences affect business relations. The aim is to explain how Japanese trading houses and other traditional Japanese corporations achieved their enormous longevity. The focus is on the role played in these corporations by the kakun, that is, a set of internal precepts and rules that emerged in the seventeenth and eighteenth centuries and remains in force to this day. The method chosen for the study was historiography, which seeks to recover events and human activities over time. We conclude that much of what the kakun preached is present today in studies of organizations and management and that, combined with ethical considerations, the kakun is largely responsible for the longevity of Japanese companies.
Benchmarking infrastructure for mutation text mining.

Science.gov (United States)

Klein, Artjom; Riazanov, Alexandre; Hindle, Matthew M; Baker, Christopher Jo

2014-02-25

Experimental research on the automatic extraction of information about mutations from texts is greatly hindered by the lack of consensus evaluation infrastructure for the testing and benchmarking of mutation text mining systems. We propose a community-oriented annotation and benchmarking infrastructure to support development, testing, benchmarking, and comparison of mutation text mining systems. The design is based on semantic standards, where RDF is used to represent annotations, an OWL ontology provides an extensible schema for the data and SPARQL is used to compute various performance metrics, so that in many cases no programming is needed to analyze results from a text mining system. While large benchmark corpora for biological entity and relation extraction are focused mostly on genes, proteins, diseases, and species, our benchmarking infrastructure fills the gap for mutation information. The core infrastructure comprises (1) an ontology for modelling annotations, (2) SPARQL queries for computing performance metrics, and (3) a sizeable collection of manually curated documents, that can support mutation grounding and mutation impact extraction experiments. We have developed the principal infrastructure for the benchmarking of mutation text mining tasks. The use of RDF and OWL as the representation for corpora ensures extensibility. The infrastructure is suitable for out-of-the-box use in several important scenarios and is ready, in its current state, for initial community adoption.
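The abstract does not say which performance metrics the SPARQL queries compute. Assuming they include the usual precision, recall and F1 over sets of annotations, the computation can be sketched in Python as below. The function `precision_recall_f1` and the mutation strings are hypothetical and are not part of the described infrastructure.

```python
def precision_recall_f1(gold, predicted):
    """Compare two collections of annotations and return (precision, recall, f1)."""
    gold, predicted = set(gold), set(predicted)
    true_positives = len(gold & predicted)
    # Guard against division by zero when either set is empty.
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    # Hypothetical mutation mentions: manually curated vs. system output.
    gold = {"A123T", "G45D", "L858R", "V600E"}
    predicted = {"A123T", "L858R", "K27M"}
    print(precision_recall_f1(gold, predicted))
```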
Benchmarking infrastructure for mutation text mining

Science.gov (United States)

2014-01-01

Background: Experimental research on the automatic extraction of information about mutations from texts is greatly hindered by the lack of consensus evaluation infrastructure for the testing and benchmarking of mutation text mining systems. Results: We propose a community-oriented annotation and benchmarking infrastructure to support development, testing, benchmarking, and comparison of mutation text mining systems. The design is based on semantic standards, where RDF is used to represent annotations, an OWL ontology provides an extensible schema for the data and SPARQL is used to compute various performance metrics, so that in many cases no programming is needed to analyze results from a text mining system. While large benchmark corpora for biological entity and relation extraction are focused mostly on genes, proteins, diseases, and species, our benchmarking infrastructure fills the gap for mutation information. The core infrastructure comprises (1) an ontology for modelling annotations, (2) SPARQL queries for computing performance metrics, and (3) a sizeable collection of manually curated documents, that can support mutation grounding and mutation impact extraction experiments. Conclusion: We have developed the principal infrastructure for the benchmarking of mutation text mining tasks. The use of RDF and OWL as the representation for corpora ensures extensibility. The infrastructure is suitable for out-of-the-box use in several important scenarios and is ready, in its current state, for initial community adoption. PMID:24568600
Pre-Modern Bosom Serpents and Hippocrates' Epidemiae 5: 86: A Comparative and Contextual Folklore Approach

Directory of Open Access Journals (Sweden)

Davide Ermacora

2016-03-01

Full Text Available A short Hippocratic passage (Epidemiae 5: 86) might constitute the earliest Western surviving variant of the well-known narrative and experiential theme of snakes or other animals getting into the human body (motif B784, tale-type ATU 285B*). This paper aims: 1) to throw light on this ancient passage through a comparative folkloric analysis and through a philological-contextual study, with reference to modern and contemporary interpretations; and 2) to offer an examination of previous scholarly enquiries on the fantastic intrusion of animals into the human body. In medieval and post-medieval folklore and medicine, sleeping out in the field was dangerous: snakes and similar animals could, it was believed, crawl into the sleeper’s body through the ears, eyes, mouth, nostrils, anus and vagina. Comparative material demonstrates, meanwhile, that the thirsty snake often entered the sleeper’s mouth because of its love of milk and wine. I will argue that while Epidemiae 5: 86 is modelled on this long-standing legendary pattern, for which many interesting literary pre-modern (and modern) parallels exist, its relatively precise historical and cultural framework can be efficiently analysed. The story is embedded in a broad set of Graeco-Roman ideas and practices surrounding ancient beliefs about snakes and attitudes to the drinking of unmixed wine.
Folklore and traditional ecological knowledge of geckos in Southern Portugal: implications for conservation and science

Directory of Open Access Journals (Sweden)

Vila-Viçosa Carlos M

2011-09-01

Full Text Available Traditional Ecological Knowledge (TEK) and folklore are repositories of large amounts of information about the natural world. Ideas, perceptions and empirical data held by human communities regarding local species are important sources which enable new scientific discoveries to be made, as well as offering the potential to solve a number of conservation problems. We documented the gecko-related folklore and TEK of the people of southern Portugal, with the particular aim of understanding the main ideas relating to gecko biology and ecology. Our results suggest that local knowledge of gecko ecology and biology is both accurate and relevant. As a result of information provided by local inhabitants, knowledge of the current geographic distribution of Hemidactylus turcicus was expanded, with its presence reported in nine new locations. It was also discovered that locals still have some misconceptions of geckos as poisonous and carriers of dermatological diseases. The presence of these ideas has led the population to a fear of and aversion to geckos, resulting in direct persecution being one of the major conservation problems facing these animals. It is essential, from both a scientific and conservationist perspective, to understand the knowledge and perceptions that people have towards the animals, since, only then, may hitherto unrecognized pertinent information and conservation problems be detected and resolved.
'DELİ DUMRUL' BY SUAT TAŞER, WITHIN THE SCOPE OF FOLKLORE - IDEOLOGY – LITERATURE FOLKLOR-İDEOLOJİ-EDEBİYAT ÜÇGENİNDE SUAT TAŞER’İN DELİ DUMRUL’U

Directory of Open Access Journals (Sweden)

Nezir TEMUR

2011-12-01

Full Text Available It is a sociologically inevitable phenomenon that the social and political changes occurring in societies are prominently reflected in their cultural productions. Since the 19th century, when national identities began to take form along with romantic nationalism, folkloric artifacts, which are significant conveyers of cultural recollections such as history and language, have confronted us as a field emphasized by ideological and literary movements, notably in the social sciences. In the world of the 20th century, when ideologies began to take form in a political sense, folkloric artifacts undertook significant functions in the culture policies envisaged by dominant ideologies for the new forms of society which they tried to build. The style of the folkloric artifacts, the cultural codes they conveyed, and their functionality have been active components in this approach. In this sense, the process of turning towards works of folk narration, folk poetry, and folk literature in Turkish literature, which begins after the 1930s, gradually intensifies after the 1940s, and this tendency becomes one of the significant sources fostering literature. At this point, substantial works of Turkish folklore such as epics, folktales, tales, and legends have been released to the public with new perspectives and techniques. It can be seen that new pursuits in expression and utterance have been embarked upon, as in 'Deli Dumrul - Ölüm ve Aşk' (Epic and Play) by Dede Korkut, which can be considered as the rewriting of one of his epics with a new understanding. This study aims to make a comparison between 'Deli Dumrul - Ölüm ve Aşk' (Epic - Play) by Suat Taşer and the original text of the Epic of Deli Dumrul and to examine how folkloric artifacts and the cultural values transmitted in those artifacts have been modernized, adapted to contemporary times and released; the parts where the writer digressed from the source text during the adaptation; to what extent the traditional context has been changed
Using corpora in scientific and technical translation training: resources to identify conventionality and promote creativity

OpenAIRE

Clara Inés López-Rodríguez

2016-01-01

http://dx.doi.org/10.5007/2175-7968.2016v36nesp1p88 Since the first Corpus Use and Learning to Translate (CULT) Conference in Bertinoro (Italy) in 1997, the usefulness of corpora for translators and trainee translators has been highlighted. From an initial approach where translators compiled ad hoc corpora in their hard drive for a subsequent study with lexical analysis software, there emerged a new trend towards the use of the Internet as corpus. In this second approach, the Web is perce...
Corporate Secretarial Bilingual Activity: An English Teaching Proposal Based on Corpora Directed to the Secretaries

Directory of Open Access Journals (Sweden)

José Roberto Lourenço

2015-07-01

Full Text Available This article presents part of the research conducted in the field of Corpus Linguistics on the use of corpora in English Language Teaching specifically directed to corporate secretarial activities. The study, developed at the doctoral level, had FATEC-SP students as voluntary respondents to a questionnaire on corporate secretarial activities; the responses identified "Reading, Preparation and Presentation of Administrative Report" as one of the most important and frequent secretarial activities. We present a model of practice in English Teaching with an initial focus on "Company History, Strategies and Structure".
Regeneration of rat corpora cavernosa tissue by transplantation of CD133+ cells derived from human bone marrow and placement of biodegradable gel sponge sheet

Directory of Open Access Journals (Sweden)

Shogo Inoue

2017-01-01

Full Text Available The objective is to develop an easier technique for regenerating corpora cavernosa tissue through transplantation of human bone marrow-derived CD133+ cells into a rat corpora cavernosa defect model. We excised 2 mm × 2 mm squares of the right corpora cavernosa of twenty-three 8-week-old male nude rats. Alginate gel sponge sheets supplemented with 1 × 10^4 CD133+ cells were then placed over the excised area of nine rats. Functional and histological evaluations were carried out 8 weeks later. The mean intracavernous pressure/mean arterial pressure ratio for the nine rats (0.34258 ± 0.0831) was significantly higher than that for eight rats with only the excision (0.0580 ± 0.0831, P = 0.0238) and similar to that for five rats for which the penis was exposed and there was no excision (0.37228 ± 0.1051, P = 0.8266). Immunohistochemical analysis revealed that the nine fully treated rats had venous sinus-like structures, and quantitative reverse transcription polymerase chain reaction analysis of extracts from their alginate gel sponge sheets revealed that the amounts of mRNA encoding the nerve growth factor (NGF) and vascular endothelial growth factor (VEGF) were significantly higher than those for rats treated with alginate gel sheets without cell supplementation (NGF: P = 0.0309; VEGF: P < 0.0001). These findings show that transplantation of CD133+ cells accelerates functional and histological recovery in the corpora cavernosa defect model.
A tm Plug-In for Distributed Text Mining in R

Directory of Open Access Journals (Sweden)

Stefan Theussl

2012-11-01

Full Text Available R has gained explicit text mining support with the tm package enabling statisticians to answer many interesting research questions via statistical analysis or modeling of (text) corpora. However, we typically face two challenges when analyzing large corpora: (1) the amount of data to be processed in a single machine is usually limited by the available main memory (i.e., RAM), and (2) the more data to be analyzed the higher the need for efficient procedures for calculating valuable results. Fortunately, adequate programming models like MapReduce facilitate parallelization of text mining tasks and allow for processing data sets beyond what would fit into memory by using a distributed file system possibly spanning over several machines, e.g., in a cluster of workstations. In this paper we present a plug-in package to tm called tm.plugin.dc implementing a distributed corpus class which can take advantage of the Hadoop MapReduce library for large scale text mining tasks. We show on the basis of an application in culturomics that we can efficiently handle data sets of significant size.
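To make the MapReduce idea concrete, here is a minimal single-machine sketch in Python (not R, and not the tm.plugin.dc API): each document is mapped to its own term counts independently, and the partial counts are then reduced into one total. The function names `map_document`, `reduce_counts` and `term_frequencies` are assumptions for illustration. In a real Hadoop job the map calls would run on different machines over chunks of a distributed file system.

```python
from collections import Counter
from functools import reduce


def map_document(document):
    """Map step: count terms in one document, independently of all others."""
    return Counter(document.lower().split())


def reduce_counts(left, right):
    """Reduce step: merge two partial term counts into one."""
    return left + right


def term_frequencies(corpus):
    """Run the map step per document, then fold the partial results together."""
    partial_counts = map(map_document, corpus)  # lazy: one document at a time
    return reduce(reduce_counts, partial_counts, Counter())


if __name__ == "__main__":
    corpus = ["text mining in R", "distributed text mining", "mining large corpora"]
    print(term_frequencies(corpus).most_common(3))
```

Because the map step never looks at more than one document, the corpus does not need to fit in memory at once, which is the property the abstract relies on.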

Conservation Implications of the Prevalence and Representation of Locally Extinct Mammals in the Folklore of Native Americans

Directory of Open Access Journals (Sweden)

Preston Matthew

2009-01-01

Full Text Available Many rationales for wildlife conservation have been suggested. One rationale not often mentioned is the impact of extinctions on the traditions of local people, and conservationists' subsequent need to strongly consider culturally based reasons for conservation. As a first step in strengthening the case for this rationale, we quantitatively examined the presence and representation of eight potentially extinct mammals in folklore of 48 Native American tribes that live/lived near to 11 national parks in the United States. We aimed to confirm if these extinct animals were traditionally important species for Native Americans. At least one-third of the tribes included the extinct mammals in their folklore (N=45 of 124) and about half of these accounts featured the extinct species with positive and respectful attitudes, especially the carnivores. This research has shown that mammals that might have gone locally extinct have been prevalent and important in Native American traditions. Research is now needed to investigate if there indeed has been or might be any effects on traditions due to these extinctions. Regardless, due to even the possibility that the traditions of local people might be adversely affected by the loss of species, conservationists might need to consider not only all the biological reasons to conserve, but also cultural ones.
Opinion Mining in Latvian Text Using Semantic Polarity Analysis and Machine Learning Approach

Directory of Open Access Journals (Sweden)

Gatis Špats

2016-07-01

Full Text Available In this paper we demonstrate approaches for opinion mining in Latvian text. The authors have applied, combined and extended the results of several previous studies and public resources to perform opinion mining in Latvian text using two approaches, namely, semantic polarity analysis and machine learning. One of the most significant constraints that makes opinion mining for written content classification in Latvian text challenging is the limited number of publicly available text corpora for classifier training. We have joined several sources and created a publicly available extended lexicon. Our results are comparable to or outperform current achievements in opinion mining in Latvian. Experiments show that lexicon-based methods provide more accurate opinion mining than the application of a Naive Bayes machine learning classifier on Latvian tweets. Methods used during this study could be further extended using human annotators, unsupervised machine learning and bootstrapping to create larger corpora of classified text.
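A lexicon-based polarity classifier of the general kind compared here against Naive Bayes can be sketched as follows. The tiny English lexicon, the `polarity` function and the rule of summing word scores are assumptions made for illustration; the authors' Latvian lexicon and exact scoring method are not described in the abstract.

```python
# Hypothetical toy lexicon; a real one would map many Latvian words to polarities.
LEXICON = {"good": 1, "great": 1, "love": 1, "bad": -1, "awful": -1, "hate": -1}


def polarity(text, lexicon=LEXICON):
    """Sum word polarities and return 'positive', 'negative' or 'neutral'."""
    tokens = text.lower().split()
    # Strip surrounding punctuation so "good!" still matches "good".
    score = sum(lexicon.get(token.strip(".,!?;:"), 0) for token in tokens)
    if score > 0:
        return "positive"
    if score < 0:
        return "negative"
    return "neutral"


if __name__ == "__main__":
    for tweet in ["Great concert, love it!", "Awful traffic today", "Just a tweet"]:
        print(tweet, "->", polarity(tweet))
```

Unlike a trained classifier, this approach needs no labelled corpus, which matches the constraint the abstract names: limited publicly available training text.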
Laughter annotations in conversational speech corpora - possibilities and limitations for phonetic analysis

NARCIS (Netherlands)

Truong, Khiet Phuong; Trouvain, Jürgen

Existing laughter annotations provided with several publicly available conversational speech corpora (both multiparty and dyadic conversations) were investigated and compared. We discuss the possibilities and limitations of these rather coarse and shallow laughter annotations. There are definition
Vodú Chic: Haitian Religion and the Folkloric Imaginary in Socialist Cuba

Directory of Open Access Journals (Sweden)

Grete Viddal

2012-12-01

Full Text Available During the first three decades of the twentieth century, hundreds of thousands of Haitian agricultural laborers arrived in Cuba seeking employment in the expanding sugar industry. Historically, Haitian cane cutters were marginal and occupied the lowest socio-economic status in Cuban society. Until relatively recently, the maintenance of Haitian spiritual beliefs, music, dance, and language in Cuba were associated with rural isolation and poverty. Today however, the continuation of Haitian customs is no longer linked with isolation, but exactly the opposite: performance troupes, heritage festivals, art exhibitions, the circulation of religious specialists, collaborations with research centers and academia, endorsement by music promoters, and the tourism industry. Cubans of Haitian heritage have found innovative ways to transform the abject into the exotic, and are currently gaining a public voice in cultural production, particularly through folkloric performance.
When “She” Is Not Maud: An Esoteric Foundation and Subtext for Irish Folklore in the Works of W.B. Yeats

Directory of Open Access Journals (Sweden)

C. Nicholas Serra

2017-10-01

Full Text Available This article examines Yeats’s broad use of Irish folklore between 1888 and 1938, and attempts to find a justification for his contention that his own unique metaphysical system expressed in both editions of A Vision, itself an outgrowth of his three decades of ritual practice as an initiate in the Hermetic Order of the Golden Dawn, could somehow function as both an interpretation and enlargement of “the folk-lore of the villages”. Beyond treating Irish fairy stories as a way for Yeats to establish his own Irishness, capture what remained of “reckless Ireland” in its twilight, or create a political counter-discourse set against English hegemony, the immutability and immortality of the sídhe are considered in light of the assertions of several minor lectures from the Golden Dawn. This connection sheds new light on Yeats’s ideas about Unity of Being, and hypothesizes a possible esoteric path to “escape” from his system of phases so as to resolve the body-soul dilemma evident in his poetry.
Use of monolingual and comparable corpora in the classroom to translate adverbial connectors

Directory of Open Access Journals (Sweden)

Beatriz Sánchez Cárdenas

2016-06-01

Full Text Available Research in terminology has traditionally focused on nouns. Considerably less attention has been paid to other grammatical categories such as adverbs. However, these words can also be problematic for the novice translator, who tends to use the translation correspondences in bilingual dictionaries without realizing that formal equivalence is not necessarily the same as textual equivalence. However, semantic values, acquired in context, go far beyond dictionary meaning and are related to phenomena such as semantic prosody and preferences of lexical selection that can vary, depending on text type and specialized domain. This research explored the reasons why certain adverbial discourse connectors, apparently easy to translate, are a source of translation problems that cannot be easily resolved with a bilingual dictionary. Moreover, this study analyzed the use of parallel corpora in the translation classroom and how it can increase the quality of text production. For this purpose, we compared student translations before and after receiving training on the use of corpus analysis tools
Using Corpora in EFL Classrooms: The Case Study of IELTS Preparation

Science.gov (United States)

Smirnova, Elizaveta A.

2017-01-01

This article describes the gathered experience in using corpora in an IELTS preparation course. The practice demonstrates an attempt to reduce negative washback effects occurring when preparation courses just concentrate on the test format neglecting the importance of development of learners' language skills and general study skills. Some…
ROMANIAN FOLKLORE MOTIFS IN FASHION DESIGN

Directory of Open Access Journals (Sweden)

MOCENCO Alexandra

2014-05-01

Full Text Available The traditional Romanian costume, like the whole of popular art (architecture, woodcarvings, pottery, etc.), was born and has lasted in our country since ancient times. Closely related to human existence, the traditional costume has reflected over the years, as it does nowadays, the mentality and artistic conception of the people. Today the traditional Romanian costume has become a source of inspiration for designers in the wholesale fashion production industry, both Romanian and international. Although contemporary designers work in accordance with a vision, using a wide range of styles, methods and current technology, they usually return to traditional techniques and ethnic folklore motifs, which they convert and resize, integrating them into their contemporary space. Adrian Oianu is a highly appreciated Romanian designer who launched two collections inspired by his native country's traditional costumes: “Suflecata pan’ la brau” (“Turned up ‘til the belt”) and “Bucurie” (“Joy”). Dorin Negrau took as inspiration for his “Lost” collection the traditional costume from the Bihor region. Yves Saint Laurent had a collection inspired by the Romanian traditional flax blouses called “La blouse roumaine”. The paper presents traditional Romanian values through fashion collections. The research activity will create innovative concepts to support the garment industry in developing its own brand and in bringing design activities in Romania to an international level. The research was conducted during the initial stage of a project, financed through national funds, consisting of a documentary study on the ethnographic characteristics of the popular costume from different regions of the country.
Using Small Parallel Corpora to Develop Collocation-Centred Activities in Specialized Translation Classes

Directory of Open Access Journals (Sweden)

Postolea Sorina

2016-12-01

Full Text Available Research devoted to special languages, as well as the activities carried out in specialized translation classes, tends to focus primarily on one-word or multi-word terminological units. However, a very important part in the making of specialist registers and texts is played by specialised collocations, i.e. relatively stable word combinations that do not designate concepts but are nevertheless in frequent use in a given field of activity. This is why helping students acquire competences in identifying and processing collocations should become an important objective in specialised translation classes. Corpora and corpus analysis tools, whose usefulness in translator training has been highlighted by numerous studies, are an easily accessible and dependable resource for this purpose. This article proposes a series of practical, task-based activities, developed with the help of a small parallel corpus of specialised texts, that aim to raise translation trainees' awareness of the collocations present in specialised texts and to provide suggestions about their processing in translation.
Induction of canine deciduoma in some reproductive stages with the different condition of corpora lutea.

Science.gov (United States)

Nomura, K

1997-03-01

Bitches were examined to see whether canine deciduoma could be induced at some reproductive stages with the different conditions of corpora lutea by inserting a silk suture into the uterine lumen. The bitches stimulated in the early and middle stages of diestrus or in unilateral pregnancy corresponding to these diestrous stages formed deciduoma at a high induction rate, however, no difference in the strength of decidual reaction between the pregnant and diestrous stages was recognized. On the other hand, no reaction could be seen in bitches in late diestrus, the late stage of unilateral pregnancy or the post partum repair phase in which stromal decidual cells similar to those of the rodentia can be seen. In already implanted uteri, however, no deciduoma was formed in the interplacental areas. Even though the corpora lutea were functional, new additional stimulations were not accepted at the interplacental area in which the uterine horn had already been influenced by fertilized ova. From these results, it was suggested that in the dog as well as the rodentia, the endometrium has to be under the influence of functional corpora lutea in order to form deciduoma.
Combining Language Corpora with Experimental and Computational Approaches for Language Acquisition Research

Science.gov (United States)

Monaghan, Padraic; Rowland, Caroline F.

2017-01-01

Historically, first language acquisition research was a painstaking process of observation, requiring the laborious hand coding of children's linguistic productions, followed by the generation of abstract theoretical proposals for how the developmental process unfolds. Recently, the ability to collect large-scale corpora of children's language…
Chinese legal texts – Quantitative Description

Directory of Open Access Journals (Sweden)

Ľuboš GAJDOŠ

2017-06-01

Full Text Available The aim of the paper is to provide a quantitative description of legal Chinese. The study adopts a corpus-based approach and presents basic statistical parameters of legal texts in Chinese, namely sentence length, the proportions of parts of speech, etc. The research is conducted on the Chinese monolingual corpus Hanku. The paper also discusses issues of statistical data processing across various corpora, e.g. tokenisation and part-of-speech tagging, and their relevance to the study of register variation.
A Set of Annotation Interfaces for Alignment of Parallel Corpora

Directory of Open Access Journals (Sweden)

Singh Anil Kumar

2014-09-01

Full Text Available Annotation interfaces for parallel corpora which fit in well with other tools can be very useful. We describe a set of annotation interfaces which fulfill this criterion. This set includes a sentence alignment interface, two different word or word group alignment interfaces and an initial version of a parallel syntactic annotation alignment interface. These tools can be used for manual alignment, or they can be used to correct automatic alignments. Manual alignment can be performed in combination with certain kinds of linguistic annotation. Most of these interfaces use a representation called the Shakti Standard Format that has been found to be very robust and has been used for large and successful projects. It ties together the different interfaces, so that the data created by them is portable across all tools which support this representation. The existence of a query language for data stored in this representation makes it possible to build tools that allow easy search and modification of annotated parallel data.
The Corpora of China English: Implications for an EFL Dictionary for ...

African Journals Online (AJOL)

The localization of the English language in China has brought about a distinctive English variety which has come to be known as China English. Recently, several corpora of China English have been or are being built; these will help us to identify the established linguistic features of this variety, and should greatly facilitate ...
[Folklore and popular medicine in the Amazon].

Science.gov (United States)

Henrique, Márcio Couto

2009-01-01

This discussion of the relations between folklore and popular medicine in the Amazon takes Canuto Azevedo's story "Filhos do boto" (Children of the porpoise) as an analytical reference point. Replete with elements of cultural reality, folk tales can serve as historical testimonies expressing clashes between different traditions. Folk records are fruit of what is often a quarrelsome dialogue between folklorists, social scientists, physicians, and pajés and their followers, and their analysis should take into account the conditions under which they were produced. Based on the imaginary attached to the figure of the porpoise--a seductive creature with healing powers--the article explores how we might expand knowledge of popular medicine as practiced in the Amazon, where the shamanistic rite known as pajelança cabocla has a strong presence.
Pathway computation in models derived from bio-science text sources

DEFF Research Database (Denmark)

Andreasen, Troels; Bulskov, Henrik; Jensen, Per Anker

2017-01-01

This paper outlines a system, OntoScape, serving to accomplish complex inference tasks on knowledge bases and bio-models derived from life-science text corpora. The system applies so-called natural logic, a form of logic which is readable for humans. This logic affords ontological representations...
Dynamics of extracellular matrix in ovarian follicles and corpora lutea of mice

DEFF Research Database (Denmark)

Irving-Rodgers, Helen F; Hummitzsch, Katja; Murdiyarso, Lydia S

2009-01-01

Despite the mouse being an important laboratory species, little is known about changes in its extracellular matrix (ECM) during follicle and corpora lutea formation and regression. Follicle development was induced in mice (29 days of age/experimental day 0) by injections of pregnant mare's serum...... and antral follicles. The focimatrix, a specialised matrix of the membrana granulosa, contained collagen type IV alpha1 and alpha2, laminin alpha1, beta1 and gamma1 chains, nidogens 1 and 2, perlecan and collagen type XVIII. In the corpora lutea, staining was restricted to capillary sub-endothelial basal...... gonadotrophin on days 0 and 1 and ovulation was induced by injection of human chorionic gonadotrophin on day 2. Ovaries were collected for immunohistochemistry (n=10 per group) on days 0, 2 and 5. Another group was mated and ovaries were examined on day 11 (n=7). Collagen type IV alpha1 and alpha2, laminin...
Contribuições das teorias institucionais para o estudo de subsidiárias de corporações multinacionais

Directory of Open Access Journals (Sweden)

Takeyoshi Imasato

Full Text Available This essay first highlights the contributions of Organization Studies to the understanding of multinational corporations. Because of their capacity to influence other actors at the local, national, regional, international and transnational levels, multinationals challenge the traditional organization-studies approaches followed by researchers in the field of International Management. The essay then explores the possibilities and limits of institutional theory approaches for understanding the subsidiaries of multinational corporations. This theoretical framework can assist both in the study of these firms and in the study of the nature of the differences between institutions in the various countries of operation, since it allows multiple institutional contexts to be analysed simultaneously. As a result, the essay contributes to the theoretical development of the interfaces between the fields of Organization Studies and International Management, particularly with regard to investigations that emphasise the strategic role of subsidiaries.
Deleterious effects of progestagen treatment in VEGF expression in corpora lutea of pregnant ewes.

Science.gov (United States)

Letelier, C A; Sanchez, M A; Garcia-Fernandez, R A; Sanchez, B; Garcia-Palencia, P; Gonzalez-Bulnes, A; Flores, J M

2011-06-01

The aim of the current study was to determine the possible effects of progestagen oestrous synchronization on vascular endothelial growth factor (VEGF) expression during sheep luteogenesis and the peri-implantation period, and the relationship with luteal function. At days 9, 11, 13, 15, 17 and 21 of pregnancy, the ovaries from 30 progestagen-treated ewes and 30 ewes cycling after cloprostenol injection were evaluated by ultrasonography and, thereafter, collected and processed for immunohistochemical evaluation of VEGF; blood samples were drawn for evaluating plasma progesterone. The progestagen-treated group showed smaller corpora lutea than the cloprostenol-treated group and lower progesterone secretion. The expression of VEGF in the luteal cells increased with time in the cloprostenol group, but not in the progestagen-treated group, which even showed a decrease between days 11 and 13. In progestagen-treated sheep, VEGF expression in granulosa-derived parenchymal lobule capillaries was correlated with the size of the luteal tissue: larger corpora lutea had higher expression and tended to have higher progesterone secretion. In conclusion, the current study indicates the existence of deleterious effects from exogenous progestagen treatments on progesterone secretion from induced corpora lutea, which correlate with alterations in the expression of VEGF in the luteal tissue and thus, presumably, in the processes of neoangiogenesis and luteogenesis. © 2010 Blackwell Verlag GmbH.
Luteinizing hormone receptors in human ovarian follicles and corpora lutea during the menstrual cycle

International Nuclear Information System (INIS)

Yamoto, M.; Nakano, R.; Iwasaki, M.; Ikoma, H.; Furukawa, K.

1986-01-01

The binding of ¹²⁵I-labeled human luteinizing hormone (hLH) to the 2000-g fraction of human ovarian follicles and corpora lutea during the entire menstrual cycle was examined. Specific high affinity, low capacity receptors for hLH were demonstrated in the 2000-g fraction of both follicles and corpora lutea. Specific binding of ¹²⁵I-labeled hLH to follicular tissue increased from the early follicular phase to the ovulatory phase. Specific binding of ¹²⁵I-labeled hLH to luteal tissue increased from the early luteal phase to the midluteal phase and decreased towards the late luteal phase. The results of the present study indicate that the increase and decrease in receptors for hLH during the menstrual cycle might play an important role in the regulation of the ovarian cycle.

IDEOLOGICAL APPROACHES OF FOLKLORE STUDIES IN KYRGYZSTAN ON THE SOVIET UNION PERIOD: ERSOLTONOY EPIC EXAMPLE SOVYETLER BİRLİĞİ DÖNEMİNDE KIRGIZİSTAN’DA FOLKLOR ÇALIŞMALARINDA İDEOLOJİK YAKLAŞIMLAR: ER SOLTONOY DESTANI ÖRNEĞİ

Directory of Open Access Journals (Sweden)

Mehmet ÇERİBAŞ

2012-01-01

Full Text Available Folklore, which emerged in the 19th century with the Romantic movement as a tool of nation-building, served as a shield against divisive outside currents in countries that had not been able to achieve political unity. Political movements such as socialism, Nazism and communism, which rest on a single-party system and allow no freedom of expression, sought to use every channel of communication for propaganda purposes. These movements assigned important tasks to folklore, which was regarded as a means of communication and interaction. One of these tasks was to understand the people's value judgements and to develop policies based on them; the other was to ensure harmony between the regime and the people or, put more plainly, to shape the people to the regime's purposes. Socialism, one of the movements that used folklore for ideological purposes, drew on folklore to bring the peoples of countries occupied for imperialist ends into line. The epic genre, adorned with romantic and nationalist elements, was used among the Turkic tribes, where oral culture is dominant, to heighten national feeling in wartime, while in ordinary times it took on the role of spokesman for the proletariat. This practice is examined here for the Kyrgyz Turks, nomadic horsemen with a strong interest in the epic genre, taking the Kyrgyz epic Er Soltonoy as the example. 19. yüzyılda ortaya çıkan romantizm hareketiyle uluslaşmanın bir aracı olarak görülen folklor ürünleri, siyasi birliğini sağlayamamış ülkeler tarafından dıştan gelecek ayrıştırıcı akımlara karşı kalkan görevini görmüştür. Nazizm, Sosyalizm ve Komünizm gibi tek parti sistemine dayanan ve ifade özgürlüğünün olmadığı siyasi akımlar ise halka ulaşabilecekleri bütün iletişim kanallarından propaganda amacıyla yararlanmak istemişler; bu akımlar dönemin iletişim araçlarından sayılan folklor ürünlerine de bu bağlamda önemli görevler yüklemişlerdir. Bu görevlerden biri, halkın değer yarg
Application of Learner Corpora to Second Language Learning and Teaching: An Overview

Science.gov (United States)

Xu, Qi

2016-01-01

The paper gives an overview of learner corpora and their application to second language learning and teaching. It is proposed that there are four core components in learner corpus research, namely, corpus linguistics expertise, a good background in linguistic theory, knowledge of SLA theory, and a good understanding of foreign language teaching…
Polish Phoneme Statistics Obtained On Large Set Of Written Texts

Directory of Open Access Journals (Sweden)

Bartosz Ziółko

2009-01-01

Full Text Available The phonetic statistics were collected from several Polish corpora. The paper is a summary of the data, which are phoneme n-grams, and of some phenomena in the statistics. Triphone statistics apply to context-dependent speech units, which play an important role in speech recognition systems and had never been calculated for a large set of Polish written texts. The standard phonetic alphabet for Polish, SAMPA, and methods of providing phonetic transcriptions are described.
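As an illustration of what collecting phoneme n-gram statistics involves, the following is a minimal sketch, not the authors' tooling. It assumes each word or utterance has already been transcribed into a list of SAMPA-style symbols; the function names `phoneme_ngrams` and `ngram_counts` and the toy transcriptions are hypothetical.

```python
from collections import Counter

def phoneme_ngrams(phonemes, n):
    """Yield every contiguous n-gram (as a tuple) from one phoneme sequence."""
    for i in range(len(phonemes) - n + 1):
        yield tuple(phonemes[i:i + n])

def ngram_counts(transcriptions, n):
    """Count n-grams over many transcriptions.

    Each transcription is handled separately, so no n-gram spans
    the boundary between two transcriptions.
    """
    counts = Counter()
    for phonemes in transcriptions:
        counts.update(phoneme_ngrams(phonemes, n))
    return counts

# Hypothetical toy input: one list of phoneme symbols per word.
sample = [["k", "o", "t"], ["k", "o", "t", "a"], ["o", "k", "o"]]
triphones = ngram_counts(sample, 3)
```

With `n=3` the counter holds triphone frequencies; the same function gives monophone or diphone counts for `n=1` or `n=2`.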
Microsyntactic Annotation of Corpora and its Use in Computational Linguistics Tasks

Directory of Open Access Journals (Sweden)

Iomdin Leonid

2017-12-01

Full Text Available Microsyntax is a linguistic discipline dealing with idiomatic elements whose important properties are strongly related to syntax. In a way, these elements may be viewed as transitional entities between the lexicon and the grammar, which explains why they are often underrepresented in both of these resource types: the lexicographer fails to see such elements as full-fledged lexical units, while the grammarian finds them too specific to justify the creation of individual well-developed rules. As a result, such elements are poorly covered by linguistic models used in advanced modern computational linguistic tasks like high-quality machine translation or deep semantic analysis. A possible way to mend the situation and improve the coverage and adequate treatment of microsyntactic units in linguistic resources is to develop corpora with microsyntactic annotation, closely linked to specially designed lexicons. The paper shows how this task is solved in the deeply annotated corpus of Russian, SynTagRus.
A practical application of text mining to literature on cognitive rehabilitation and enhancement through neurostimulation

Directory of Open Access Journals (Sweden)

Puiu F Balan

2014-09-01

Full Text Available The exponential growth in publications represents a major challenge for researchers. Many scientific domains, including neuroscience, are not yet fully engaged in exploiting large bodies of publications. In this paper, we promote the idea to partially automate the processing of scientific documents, specifically using text mining (TM), to efficiently review big corpora of publications. The cognitive advantage given by TM is mainly related to the automatic extraction of relevant trends from corpora of literature, otherwise impossible to analyze in short periods of time. Specifically, the benefits of TM are increased speed, quality and reproducibility of text processing, boosted by rapid updates of the results. First, we selected a set of TM-tools that allow user-friendly approaches of the scientific literature, and which could serve as a guide for researchers willing to incorporate TM in their work. Second, we used these TM-tools to obtain basic insights into the relevant literature on cognitive rehabilitation (CR) and cognitive enhancement (CE) using transcranial magnetic stimulation (TMS). TM readily extracted the diversity of TMS applications in CR and CE from vast corpora of publications, automatically retrieving trends already described in published reviews. TMS emerged as one of the important non-invasive tools that can both improve cognitive and motor functions in numerous neurological diseases and induce modulations/enhancements of many fundamental brain functions. TM also revealed trends in big corpora of publications by extracting occurrence frequency and relationships of particular subtopics. Moreover, we showed that CR and CE share research topics, both aiming to increase the brain’s capacity to process information, thus supporting their integration in a larger perspective. Methodologically, despite limitations of a simple user-friendly approach, TM served well the reviewing process.
Luteinizing hormone receptors in human ovarian follicles and corpora lutea during the menstrual cycle

Energy Technology Data Exchange (ETDEWEB)

Yamoto, M.; Nakano, R.; Iwasaki, M.; Ikoma, H.; Furukawa, K.

1986-08-01

The binding of ¹²⁵I-labeled human luteinizing hormone (hLH) to the 2000-g fraction of human ovarian follicles and corpora lutea during the entire menstrual cycle was examined. Specific high affinity, low capacity receptors for hLH were demonstrated in the 2000-g fraction of both follicles and corpora lutea. Specific binding of ¹²⁵I-labeled hLH to follicular tissue increased from the early follicular phase to the ovulatory phase. Specific binding of ¹²⁵I-labeled hLH to luteal tissue increased from the early luteal phase to the midluteal phase and decreased towards the late luteal phase. The results of the present study indicate that the increase and decrease in receptors for hLH during the menstrual cycle might play an important role in the regulation of the ovarian cycle.
The specificity of folklore and mythological motifs in the novel “Tsar Maiden” by Vsevolod Solovyov

Directory of Open Access Journals (Sweden)

Lyapina Svetlana Mitrofanovna

2014-12-01

Full Text Available The article deals with folklore motifs in Vsevolod Solovyov's novel “Tsar-maiden” and reveals the link between this work and the magic tale. The author concludes that the appeal to the image of the Tsar-maiden stems from the writer's desire to show the irrational spirit of pre-Petrine Russia and the people's judgment of the holders of imperial power. In the popular mind, a woman becoming monarch was beyond comprehension and was regarded as a miracle akin to a fairy tale. Therefore, from Vsevolod Solovyov’s viewpoint, the fabulous image of the Tsar-maiden coincided in the minds of the people with the image of Princess Sophia.
Statistical modeling of biomedical corpora: mining the Caenorhabditis Genetic Center Bibliography for genes related to life span

Directory of Open Access Journals (Sweden)

Jordan MI

2006-05-01

Full Text Available Background: The statistical modeling of biomedical corpora could yield integrated, coarse-to-fine views of biological phenomena that complement discoveries made from analysis of molecular sequence and profiling data. Here, the potential of such modeling is demonstrated by examining the 5,225 free-text items in the Caenorhabditis Genetic Center (CGC) Bibliography using techniques from statistical information retrieval. Items in the CGC biomedical text corpus were modeled using the Latent Dirichlet Allocation (LDA) model. LDA is a hierarchical Bayesian model which represents a document as a random mixture over latent topics; each topic is characterized by a distribution over words. Results: An LDA model estimated from CGC items had better predictive performance than two standard models (unigram and mixture of unigrams) trained using the same data. To illustrate the practical utility of LDA models of biomedical corpora, a trained CGC LDA model was used for a retrospective study of nematode genes known to be associated with life span modification. Corpus-, document-, and word-level LDA parameters were combined with terms from the Gene Ontology to enhance the explanatory value of the CGC LDA model, and to suggest additional candidates for age-related genes. A novel, pairwise document similarity measure based on the posterior distribution on the topic simplex was formulated and used to search the CGC database for "homologs" of a "query" document discussing the life span-modifying clk-2 gene. Inspection of these document homologs enabled and facilitated the production of hypotheses about the function and role of clk-2. Conclusion: Like other graphical models for genetic, genomic and other types of biological data, LDA provides a method for extracting unanticipated insights and generating predictions amenable to subsequent experimental validation.
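The description of LDA above (a document as a random mixture over latent topics, each topic a distribution over words) can be made concrete with a small sketch of the generative story. This is an illustration only: it samples a synthetic document rather than estimating a model from data as the study did, and the topic word lists, the `alpha` values and the function names are all made-up assumptions.

```python
import random

def sample_dirichlet(alpha, rng):
    """Draw one point from a Dirichlet distribution via normalised Gamma draws."""
    draws = [rng.gammavariate(a, 1.0) for a in alpha]
    total = sum(draws)
    return [d / total for d in draws]

def generate_document(topics, alpha, length, rng):
    """Generate one document under the LDA generative story.

    topics: list of dicts mapping word -> probability (one dict per topic).
    alpha:  Dirichlet concentration parameters, one per topic.
    Returns the document's topic mixture and its list of words.
    """
    theta = sample_dirichlet(alpha, rng)  # per-document mixture over topics
    words = []
    for _ in range(length):
        # Pick a topic for this word position according to the mixture.
        z = rng.choices(range(len(topics)), weights=theta)[0]
        # Pick a word from the chosen topic's word distribution.
        vocab, probs = zip(*topics[z].items())
        words.append(rng.choices(vocab, weights=probs)[0])
    return theta, words

# Hypothetical toy topics; not taken from the CGC corpus.
rng = random.Random(0)
toy_topics = [
    {"lifespan": 0.5, "aging": 0.3, "gene": 0.2},
    {"neuron": 0.6, "gene": 0.3, "behavior": 0.1},
]
theta, doc = generate_document(toy_topics, alpha=[0.5, 0.5], length=20, rng=rng)
```

Fitting an LDA model runs this story in reverse: given only the words, it infers the topic distributions and each document's mixture `theta`.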
Bollywood Movie Corpus for Text, Images and Videos

OpenAIRE

Madaan, Nishtha; Mehta, Sameep; Saxena, Mayank; Aggarwal, Aditi; Agrawaal, Taneea S; Malhotra, Vrinda

2017-01-01

In the past few years, several data-sets have been released for text and images. We present an approach to create a data-set for use in detecting and removing gender bias from text. We also describe a set of challenges we faced while creating this corpus. In this work, we have worked with movie data from Wikipedia plots and movie trailers from YouTube. Our Bollywood Movie corpus contains 4000 movies extracted from Wikipedia and 880 trailers extracted from YouTube which were released from 1...
WARCProcessor: An Integrative Tool for Building and Management of Web Spam Corpora

Directory of Open Access Journals (Sweden)

Miguel Callón

2017-12-01

Full Text Available In this work we present the design and implementation of WARCProcessor, a novel multiplatform integrative tool aimed at building scientific datasets to facilitate experimentation in web spam research. The application allows the user to specify multiple criteria that change the way in which new corpora are generated, whilst reducing the number of repetitive and error-prone tasks related to existing corpus maintenance. To this end, WARCProcessor supports up to six commonly used data sources for web spam research and can store the output corpus in standard WARC format together with complementary metadata files. Additionally, the application facilitates the automatic and concurrent download of web sites from the Internet, with the option of configuring the depth of the links to be followed as well as the behaviour when redirected URLs appear. WARCProcessor supports both an interactive GUI and a command line utility that can be run in the background.
Juvenile hormone biosynthesis gene expression in the corpora allata of honey bee (Apis mellifera L.) female castes.

Directory of Open Access Journals (Sweden)

Ana Durvalina Bomtorin

Full Text Available Juvenile hormone (JH) controls key events in the honey bee life cycle, viz. caste development and age polyethism. We quantified transcript abundance of 24 genes involved in the JH biosynthetic pathway in the corpora allata-corpora cardiaca (CA-CC) complex. The expression of six of these genes showing relatively high transcript abundance was contrasted with CA size, hemolymph JH titer, as well as JH degradation rates and JH esterase (jhe) transcript levels. Gene expression did not match the contrasting JH titers in queen and worker fourth instar larvae, but jhe transcript abundance and JH degradation rates were significantly lower in queen larvae. Consequently, transcriptional control of JHE is of importance in regulating larval JH titers and caste development. In contrast, the same analyses applied to adult worker bees allowed us to infer that the high JH levels in foragers are due to increased JH synthesis. Upon RNAi-mediated silencing of the methyl farnesoate epoxidase gene (mfe), encoding the enzyme that catalyzes methyl farnesoate-to-JH conversion, the JH titer was decreased, thus corroborating that JH titer regulation in adult honey bees depends on this final JH biosynthesis step. The molecular pathway differences underlying JH titer regulation in larval caste development versus adult age polyethism lead us to propose that mfe and jhe genes be assayed when addressing questions on the role(s) of JH in social evolution.
[Single and combining effects of Calculus Bovis and zolpidem on inhibitive neurotransmitter of rat striatum corpora].

Science.gov (United States)

Liu, Ping; He, Xinrong; Guo, Mei

2010-04-01

To investigate the correlation between single or combined administration of Calculus Bovis or zolpidem and changes in inhibitory neurotransmitters in rat striatum corpora. Sampling from rat striatum corpora was carried out through microdialysis. The content of two inhibitory neurotransmitters in rat corpus striatum, glycine (Gly) and gamma-aminobutyric acid (GABA), was determined by HPLC, which involved pre-column derivatization with orthophthalaldehyde, reversed-phase gradient elution and fluorescence detection. GABA content of rat striatum corpora in Calculus Bovis group was significantly increased compared with saline group (P Calculus Bovis plus zolpidem group were increased largely compared with saline group as well (P Calculus Bovis group was higher than combination group (P Calculus Bovis or zolpidem group was markedly increased compared with saline group or combination group (P Calculus Bovis group, zolpidem group and combination group. The magnitude of increase was lower in the combination group than in the Calculus Bovis group and the zolpidem group, suggesting that the encephalon inhibition promoted by Calculus Bovis is more powerful than that promoted by zolpidem. The increase in the two inhibitory neurotransmitters did not show a reinforcing effect in the combination group, suggesting that Calculus Bovis and zolpidem may compete for the same receptors. Therefore, the combination of Calculus Bovis-containing drugs and zolpidem has no clinical significance. Calculus Bovis, as an aperture-opening drug, should not be used for resuscitation therapy.
A new universality class in corpus of texts; A statistical physics study

Science.gov (United States)

Najafi, Elham; Darooneh, Amir H.

2018-05-01

Text can be regarded as a complex system. There are some methods in statistical physics which can be used to study this system. In this work, by means of statistical physics methods, we reveal new universal behaviors of texts associated with the fractality values of words in a text. The fractality measure indicates the importance of words in a text by considering the distribution pattern of words throughout the text. We observed a power law relation between fractality of text and vocabulary size for texts and corpora. We also observed this behavior in studying biological data.
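A power-law relation of the form y = c · x^k is commonly checked by fitting a straight line on log-log axes. The sketch below shows that generic procedure only; it is not the authors' method, the function name `fit_power_law` is hypothetical, and the numbers are synthetic values constructed to follow an exact power law rather than measurements from the paper.

```python
import math

def fit_power_law(xs, ys):
    """Least-squares fit of y = c * x**k on log-log axes; returns (k, c)."""
    lx = [math.log(x) for x in xs]
    ly = [math.log(y) for y in ys]
    n = len(lx)
    mean_x, mean_y = sum(lx) / n, sum(ly) / n
    sxx = sum((a - mean_x) ** 2 for a in lx)
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(lx, ly))
    k = sxy / sxx                        # slope of the log-log line = exponent
    c = math.exp(mean_y - k * mean_x)    # intercept of the log-log line = log(c)
    return k, c

# Synthetic data following y = 3 * x**0.5 exactly (made-up values).
vocab_sizes = [100, 1000, 10000, 100000]
fractality = [3.0 * v ** 0.5 for v in vocab_sizes]
k, c = fit_power_law(vocab_sizes, fractality)
```

On real data a log-log regression is only a first check, since noise and a limited range of x can make other relations look like power laws.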
U-Compare: share and compare text mining tools with UIMA

Science.gov (United States)

Kano, Yoshinobu; Baumgartner, William A.; McCrohon, Luke; Ananiadou, Sophia; Cohen, K. Bretonnel; Hunter, Lawrence; Tsujii, Jun'ichi

2009-01-01

Summary: Due to the increasing number of text mining resources (tools and corpora) available to biologists, interoperability issues between these resources are becoming significant obstacles to using them effectively. UIMA, the Unstructured Information Management Architecture, is an open framework designed to aid in the construction of more interoperable tools. U-Compare is built on top of the UIMA framework, and provides both a concrete framework for out-of-the-box text mining and a sophisticated evaluation platform allowing users to run specific tools on any target text, generating both detailed statistics and instance-based visualizations of outputs. U-Compare is a joint project, providing the world's largest, and still growing, collection of UIMA-compatible resources. These resources, originally developed by different groups for a variety of domains, include many famous tools and corpora. U-Compare can be launched straight from the web, without needing to be manually installed. All U-Compare components are provided ready-to-use and can be combined easily via a drag-and-drop interface without any programming. External UIMA components can also simply be mixed with U-Compare components, without distinguishing between locally and remotely deployed resources. Availability: http://u-compare.org/ Contact: kano@is.s.u-tokyo.ac.jp PMID:19414535
Corpora amylacea in temporal lobe epilepsy associated with hippocampal sclerosis

Directory of Open Access Journals (Sweden)

Ribeiro Marlise de Castro

2003-01-01

Full Text Available Hippocampal sclerosis (HS) is the commonest pathology in epileptic patients undergoing temporal lobe epilepsy surgery. In addition, an increased density of corpora amylacea (CA) is found in 6 to 63% of those cases. OBJECTIVE: To verify the presence of CA and the clinical correlates of their occurrence in a consecutive series of patients undergoing temporal surgery with a diagnosis of HS. METHOD: We reviewed 72 hippocampus specimens from January 1997 to July 2000. Student's t test for independent samples, ANOVA and the Tukey test were performed for statistical analysis. RESULTS: CA were found in 35 patients (49%), whose mean epilepsy duration (28.7 years) was significantly longer than that of the group of patients without CA (19.5 years, p= 0.001). In addition, when CA were found, duration was also significantly correlated with their distribution within the hippocampus: 28.7 years with diffuse distribution of CA, 15.4 years with exclusively subpial distribution and 17.4 years with subpial plus perivascular distribution (p= 0.001). CONCLUSION: Our findings corroborate the presence of CA in patients with HS and suggest that a longer duration of epilepsy correlates with a more diffuse distribution of CA in the hippocampus.
A practical application of text mining to literature on cognitive rehabilitation and enhancement through neurostimulation.

Science.gov (United States)

Balan, Puiu F; Gerits, Annelies; Vanduffel, Wim

2014-01-01

The exponential growth in publications represents a major challenge for researchers. Many scientific domains, including neuroscience, are not yet fully engaged in exploiting large bodies of publications. In this paper, we promote the idea of partially automating the processing of scientific documents, specifically using text mining (TM), to efficiently review big corpora of publications. The "cognitive advantage" given by TM is mainly related to the automatic extraction of relevant trends from corpora of literature, otherwise impossible to analyze in short periods of time. Specifically, the benefits of TM are increased speed, quality and reproducibility of text processing, boosted by rapid updates of the results. First, we selected a set of TM-tools that allow user-friendly approaches to the scientific literature, and which could serve as a guide for researchers willing to incorporate TM in their work. Second, we used these TM-tools to obtain basic insights into the relevant literature on cognitive rehabilitation (CR) and cognitive enhancement (CE) using transcranial magnetic stimulation (TMS). TM readily extracted the diversity of TMS applications in CR and CE from vast corpora of publications, automatically retrieving trends already described in published reviews. TMS emerged as one of the important non-invasive tools that can both improve cognitive and motor functions in numerous neurological diseases and induce modulations/enhancements of many fundamental brain functions. TM also revealed trends in big corpora of publications by extracting occurrence frequency and relationships of particular subtopics. Moreover, we showed that CR and CE share research topics, both aiming to increase the brain's capacity to process information, thus supporting their integration in a larger perspective. Methodologically, despite limitations of a simple user-friendly approach, TM served the reviewing process well.
GOSPEL TEXT IN SCIENCE FICTION NOVELETTES BY V. P. KRAPIVIN (THE CYCLE "IN THE HEART OF THE GREAT CRYSTAL")

Directory of Open Access Journals (Sweden)

Velikanova E. A.

2011-11-01

Full Text Available The article analyses Gospel motifs and images in the cycle of science fiction stories In the Heart of the Great Crystal by Vladislav Krapivin. References to the Gospel text, together with folklore and literary elements, shape the modern moral content of the writer's books, which are addressed to teenage readers.
Contrasting Specific English Corpora: Language Variation

Directory of Open Access Journals (Sweden)

María Luisa Carrió Pastor

2009-12-01

Full Text Available The scientific community has traditionally considered technical English as neutral and objective, able to transmit ideas and research in simple sentences and specialized vocabulary. Nevertheless, global communication and intense information delivery have produced a range of different ways of knowledge transmission. Although technical English is considered an objective way to transmit science, writers of academic papers use some words or structures with different frequency in the same genre. As a consequence of this, contrastive studies about the use of second languages have been increasingly attracting scholarly attention. In this research, we show that variation in language production is a reality and can be demonstrated by contrasting corpora written by native writers of English and by non-native writers of English. The objectives of this paper are first to detect language variation in a technical English corpus; second, to identify the parts of the sentence that are more sensitive to variation; finally, to show the non-standardisation of technical English. In order to fulfil these objectives, we analysed a corpus of fifty scientific articles written by native speakers of English and fifty scientific articles written by non-native speakers of English. The occurrences were classified and counted in order to detect the most common variations. Further analysis indicated that the variations were caused by mother tongue interference in virtually all cases, although meaning was only very rarely obscured. These findings suggest that the use of certain patterns and expressions originating from L1 interference should be considered as correct as standard English.
Progestogen treatments for cycle management in a sheep model of assisted conception affect the growth patterns, the expression of luteinizing hormone receptors, and the progesterone secretion of induced corpora lutea.

Science.gov (United States)

Letelier, Claudia; García-Fernández, Rosa Ana; Contreras-Solis, Ignacio; Sanchez, María Angeles; Garcia-Palencia, Pilar; Sanchez, Belen; Gonzalez-Bulnes, Antonio; Flores, Juana María

2010-03-01

To determine, in a sheep model, the effect of a short-term progestative treatment on growth dynamics and functionality of induced corpora lutea. Observational, model study. Public university. Sixty adult female sheep. Synchronization and induction of ovulation with progestogens and prostaglandin analogues; ovarian ultrasonography, blood sampling, and ovariectomy. Determination of pituitary function and morphologic characteristics, expression of luteinizing hormone (LH) receptors, and progesterone secretion of corpora lutea. The use of progestative pretreatments for assisted conception affects the growth patterns, the expression of LH receptors, and the progesterone secretion of induced corpora lutea. The current study indicates, in a sheep model, the existence of deleterious effects from progestogens on functionality of induced corpora lutea. Copyright 2010 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.
Charles Dickens’s Use of Folklore: A Study of Elements in Bleak House

Science.gov (United States)

1981-04-21

asserts the association between death and blackness by misquoting Shakespeare; in Hamlet Shakespeare refers to the "fell sergeant, Death," and Dickens... One can find many allusions to works by Shakespeare in Dickens’s novels. The relevance of Shakespeare as a source is a field that awaits extensive... Shakespeare Land (London: Mitchell Hughes and Clarke, 1929), p.41. Cora Linn Daniels, ed. Encyclopedia of Superstitions, Folklore and the Occult Sciences of

Blood transfusion and resuscitation using penile corpora: an experimental study.

Science.gov (United States)

Abolyosr, Ahmad; Sayed, M A; Elanany, Fathy; Smeika, M A; Shaker, S E

2005-10-01

To test the feasibility of using the penile corpora cavernosa for blood transfusion and resuscitation purposes. Three male donkeys were used for autologous blood transfusion into the corpus cavernosum during three sessions with a 1-week interval between each. Two blood units (450 mL each) were transfused per session to each donkey. Moreover, three dogs were bled up until a state of shock was produced. The mean arterial blood pressure decreased to 60 mm Hg. The withdrawn blood (mean volume 396.3 mL) was transfused back into their corpora cavernosa under 150 mm Hg pressure. Different transfusion parameters were assessed. The Assiut faculty of medicine ethical committee approved the study before its initiation. For the donkey model, the mean time of blood collection was 12 minutes. The mean time needed to establish corporal access was 22 seconds. The mean time of blood transfusion was 14.2 minutes. The mean rate of blood transfusion was 31.7 mL/min. Mild penile elongation with or without mild penile tumescence was observed on four occasions. All penile shafts returned spontaneously to their pretransfusion state at a maximum of 5 minutes after cessation of blood transfusion. No extravasation, hematoma formation, or color changes occurred. Regarding the dog model, the mean rate of transfusion was 35.2 mL/min. All dogs were resuscitated at the end of the transfusion. The corpus cavernosum is a feasible, simple, rapid, and effective alternative route for blood transfusion and venous access. It can be resorted to whenever necessary. It is a reliable means for volume replacement and resuscitation in males.
Folklore and traditional ecological knowledge of geckos in Southern Portugal: implications for conservation and science

Science.gov (United States)

2011-01-01

Traditional Ecological Knowledge (TEK) and folklore are repositories of large amounts of information about the natural world. Ideas, perceptions and empirical data held by human communities regarding local species are important sources which enable new scientific discoveries to be made, as well as offering the potential to solve a number of conservation problems. We documented the gecko-related folklore and TEK of the people of southern Portugal, with the particular aim of understanding the main ideas relating to gecko biology and ecology. Our results suggest that local knowledge of gecko ecology and biology is both accurate and relevant. As a result of information provided by local inhabitants, knowledge of the current geographic distribution of Hemidactylus turcicus was expanded, with its presence reported in nine new locations. It was also discovered that locals still have some misconceptions of geckos as poisonous and carriers of dermatological diseases. The presence of these ideas has led the population to a fear of and aversion to geckos, resulting in direct persecution being one of the major conservation problems facing these animals. It is essential, from both a scientific and conservationist perspective, to understand the knowledge and perceptions that people have towards the animals, since, only then, may hitherto unrecognized pertinent information and conservation problems be detected and resolved. PMID:21892925
Computing Pathways in Bio-Models Derived from Bio-Science Text Sources

DEFF Research Database (Denmark)

Andreasen, Troels; Bulskov, Henrik; Nilsson, Jørgen Fischer

2015-01-01

This paper outlines a system, OntoScape, serving to accomplish complex inference tasks on knowledge bases and bio-models derived from life-science text corpora. The system applies so-called natural logic, a form of logic which is readable for humans. This logic affords ontological representations...... of complex terms appearing in the text sources. Along with logical propositions, the system applies a semantic graph representation facilitating calculation of bio-pathways. More generally, the system affords means of query answering appealing to general and domain specific inference rules....
ONTOGRABBING: Extracting Information from Texts Using Generative Ontologies

DEFF Research Database (Denmark)

Nilsson, Jørgen Fischer; Szymczak, Bartlomiej Antoni; Jensen, P.A.

2009-01-01

We describe principles for extracting information from texts using a so-called generative ontology in combination with syntactic analysis. Generative ontologies are introduced as semantic domains for natural language phrases. Generative ontologies extend ordinary finite ontologies with rules...... for producing recursively shaped terms representing the ontological content (ontological semantics) of NL noun phrases and other phrases. We focus here on achieving a robust, often only partial, ontology-driven parsing of and ascription of semantics to a sentence in the text corpus. The aim of the ontological...... analysis is primarily to identify paraphrases, thereby achieving a search functionality beyond mere keyword search with synsets. We further envisage use of the generative ontology as a phrase-based rather than word-based browser into text corpora....
CUILESS2016: a clinical corpus applying compositional normalization of text mentions.

Science.gov (United States)

Osborne, John D; Neu, Matthew B; Danila, Maria I; Solorio, Thamar; Bethard, Steven J

2018-01-10

Traditionally text mention normalization corpora have normalized concepts to single ontology identifiers ("pre-coordinated concepts"). Less frequently, normalization corpora have used concepts with multiple identifiers ("post-coordinated concepts") but the additional identifiers have been restricted to a defined set of relationships to the core concept. This approach limits the ability of the normalization process to express semantic meaning. We generated a freely available corpus using post-coordinated concepts without a defined set of relationships that we term "compositional concepts" to evaluate their use in clinical text. We annotated 5397 disorder mentions from the ShARe corpus to SNOMED CT that were previously normalized as "CUI-less" in the "SemEval-2015 Task 14" shared task because they lacked a pre-coordinated mapping. Unlike the previous normalization method, we do not restrict concept mappings to a particular set of the Unified Medical Language System (UMLS) semantic types and allow normalization to occur to multiple UMLS Concept Unique Identifiers (CUIs). We computed annotator agreement and assessed semantic coverage with this method. We generated the largest clinical text normalization corpus to date with mappings to multiple identifiers and made it freely available. All but 8 of the 5397 disorder mentions were normalized using this methodology. Annotator agreement ranged from 52.4% using the strictest metric (exact matching) to 78.2% using a hierarchical agreement that measures the overlap of shared ancestral nodes. Our results provide evidence that compositional concepts can increase semantic coverage in clinical text. To our knowledge we provide the first freely available corpus of compositional concept annotation in clinical text.
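The two agreement figures in this abstract (exact matching versus a hierarchical measure based on shared ancestral nodes) can be illustrated with a minimal sketch. It is not the authors' metric: the toy hierarchy, the concept names, and the choice of Jaccard overlap of ancestor sets are all assumptions made here for illustration.

```python
from typing import Dict, Set

# Hypothetical toy hierarchy (child -> parent); these are NOT real SNOMED CT codes.
PARENT: Dict[str, str] = {
    "viral_pneumonia": "pneumonia",
    "bacterial_pneumonia": "pneumonia",
    "pneumonia": "lung_disease",
    "lung_disease": "disorder",
    "fracture": "disorder",
}


def ancestors(concept: str) -> Set[str]:
    """Return the concept itself plus every ancestor up to the root."""
    seen = {concept}
    while concept in PARENT:
        concept = PARENT[concept]
        seen.add(concept)
    return seen


def exact_agreement(a: str, b: str) -> float:
    """Strict metric: annotators agree only if they chose the same concept."""
    return 1.0 if a == b else 0.0


def hierarchical_agreement(a: str, b: str) -> float:
    """Assumed metric: Jaccard overlap of the two ancestor sets."""
    sa, sb = ancestors(a), ancestors(b)
    return len(sa & sb) / len(sa | sb)


if __name__ == "__main__":
    pair = ("viral_pneumonia", "bacterial_pneumonia")
    print(exact_agreement(*pair), hierarchical_agreement(*pair))
```

Two annotators who pick sibling concepts score zero under the exact metric but receive partial credit under the overlap metric, which is one way a hierarchical figure can come out higher than an exact-match figure on the same annotations.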
Plant derived substances with anti-cancer activity: from folklore to practice

Directory of Open Access Journals (Sweden)

Marcelo Fridlender

2015-10-01

Full Text Available Plants have had an essential role in the folklore of ancient cultures. In addition to the use as food and spices, plants have also been utilized as medicines for over 5000 years. It is estimated that 70-95% of the population in developing countries continues to use traditional medicines even today. A new trend, involving the isolation of plant active compounds, began during the early 19th century. This trend led to the discovery of different active compounds that are derived from plants. In the last decades, more and more new materials derived from plants have been authorized and subscribed as medicines, including those with anti-cancer activity. Cancer is among the leading causes of morbidity and mortality worldwide. The number of new cases is expected to rise by about 70% over the next 2 decades. Thus, there is a real need for new efficient anti-cancer drugs with reduced side effects, and plants are a promising source for such entities. Here we focus on some plant-derived substances exhibiting anti-cancer and chemoprevention activity, their mode of action and bioavailability. These include paclitaxel, curcumin and cannabinoids. In addition, development and use of their synthetic analogs, and those of strigolactones, are discussed. Also discussed are commercial considerations and future prospects for development of plant derived substances with anti-cancer activity.
Biomechanically Preferred Consonant-Vowel Combinations Fail to Appear in Adult Spoken Corpora

Science.gov (United States)

Whalen, D. H.; Giulivi, Sara; Nam, Hosung; Levitt, Andrea G.; Hallé, Pierre; Goldstein, Louis M.

2012-01-01

Certain consonant/vowel (CV) combinations are more frequent than would be expected from the individual C and V frequencies alone, both in babbling and, to a lesser extent, in adult language, based on dictionary counts: Labial consonants co-occur with central vowels more often than chance would dictate; coronals co-occur with front vowels, and velars with back vowels (Davis & MacNeilage, 1994). Plausible biomechanical explanations have been proposed, but it is also possible that infants are mirroring the frequency of the CVs that they hear. As noted, previous assessments of adult language were based on dictionaries; these “type” counts are incommensurate with the babbling measures, which are necessarily “token” counts. We analyzed the tokens in two spoken corpora for English, two for French and one for Mandarin. We found that the adult spoken CV preferences correlated with the type counts for Mandarin and French, not for English. Correlations between the adult spoken corpora and the babbling results had all three possible outcomes: significantly positive (French), uncorrelated (Mandarin), and significantly negative (English). There were no correlations of the dictionary data with the babbling results when we consider all nine combinations of consonants and vowels. The results indicate that spoken frequencies of CV combinations can differ from dictionary (type) counts and that the CV preferences apparent in babbling are biomechanically driven and can ignore the frequencies of CVs in the ambient spoken language. PMID:23420980
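The comparison against what "would be expected from the individual C and V frequencies alone" can be sketched as an observed-to-expected ratio computed over tokens. This is a minimal illustration under stated assumptions: the function name and the toy token list are hypothetical, and the abstract does not state that the authors computed the statistic in exactly this form.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple

CV = Tuple[str, str]  # (consonant class, vowel class)


def observed_expected(tokens: Iterable[CV]) -> Dict[CV, float]:
    """Ratio of observed CV token counts to counts expected under independence."""
    pairs = Counter(tokens)
    total = sum(pairs.values())
    c_counts: Counter = Counter()
    v_counts: Counter = Counter()
    for (c, v), n in pairs.items():
        c_counts[c] += n
        v_counts[v] += n
    # Expected count of (c, v) if C and V were independent: count(c) * count(v) / total.
    return {
        (c, v): n / (c_counts[c] * v_counts[v] / total)
        for (c, v), n in pairs.items()
    }


if __name__ == "__main__":
    # Hypothetical token counts, for illustration only.
    toy = (
        [("labial", "central")] * 6
        + [("labial", "front")] * 2
        + [("coronal", "front")] * 6
        + [("coronal", "central")] * 2
    )
    for pair, ratio in sorted(observed_expected(toy).items()):
        print(pair, round(ratio, 2))
```

A ratio above 1 marks a combination that occurs more often than the individual frequencies predict. Running the same function over a list of dictionary entries instead of running speech would give the "type" version of the statistic rather than the "token" version.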
The interpretation of dream meaning: Resolving ambiguity using Latent Semantic Analysis in a small corpus of text.

Science.gov (United States)

Altszyler, Edgar; Ribeiro, Sidarta; Sigman, Mariano; Fernández Slezak, Diego

2017-11-01

Computer-based dream content analysis relies on word frequencies within predefined categories in order to identify different elements in text. As a complementary approach, we explored the capabilities and limitations of word-embedding techniques to identify word usage patterns among dream reports. These tools allow us to quantify word associations in text and to identify the meaning of target words. Word-embeddings have been extensively studied in large datasets, but only a few studies analyze semantic representations in small corpora. To fill this gap, we compared Skip-gram and Latent Semantic Analysis (LSA) capabilities to extract semantic associations from dream reports. LSA showed better performance than Skip-gram in small size corpora in two tests. Furthermore, LSA captured relevant word associations in dream collection, even in cases with low-frequency words or small numbers of dreams. Word associations in dream reports can thus be quantified by LSA, which opens new avenues for dream interpretation and decoding. Copyright © 2017 Elsevier Inc. All rights reserved.
(Text) Mining the LANDscape: Themes and Trends over 40 years of Landscape and Urban Planning

Science.gov (United States)

Paul H. Gobster

2014-01-01

In commemoration of the journal's 40th anniversary, the co-editor explores themes and trends covered by Landscape and Urban Planning and its parent journals through a qualitative comparison of co-occurrence term maps generated from the text corpora of its abstracts across the four decadal periods of publication. Cluster maps generated from the...
The European Circulation of Nordic Texts in the Romantic Period

DEFF Research Database (Denmark)

Jensen-Rix, Robert William

2017-01-01

history of rediscovering Old Norse texts (i.e., poetry and prose written in the North Germanic language until the 14th century, known primarily from Icelandic manuscripts) and medieval Nordic folklore (found in medieval ballads, sagas, and heroic legends) differed in various European countries......, there was also a remarkable sense of common aim and purpose in the reception history as it developed during the Romantic period. This was because European scholars and writers had come to see medieval Nordic texts as epitomizing the manners and literature of a common Germanic past. In particular, Old Norse texts...... from Icelandic manuscripts were believed to preserve the pre-Christian religion, as this was once shared by Scandinavians, Anglo-Saxons, Germans, and the Franks. Thus, interest in such texts circulated with particular intensity between Scandinavia, Germany, and Britain, as well as, to a lesser degree...
From Folklore to Scientific Evidence: Breast-Feeding and Wet-Nursing in Islam and the Case of Non-Puerperal Lactation

Science.gov (United States)

Moran, Lia; Gilad, Jacob

2007-01-01

Breast-feeding practice has an important medical and socio-cultural role. It has many anthropological aspects concerning the “power structures” that find their expression in breast-feeding and the practices that formed around it, both socially, scientifically, and legally-speaking. Breast-feeding has been given much attention by religions, and taboos, folklore, and misconceptions abound around it, making it a topic of genuine curiosity. This paper aims at expanding the spectrum of folklore associated with breast-feeding. The paper deals with historical, religious, and folkloristic aspects of breast-feeding, especially wet-nursing, in Islam and focuses on an intriguing Islamic tale on breast-feeding - lactation by non-pregnant women (or non-puerperal lactation). Apparently, accounts of non-puerperal lactation are not restricted to Islam but have been documented in various societies and religions throughout centuries. Two medical situations - hyperprolactinemia and induced lactation, appear as possible explanations for this phenomenon. This serves as an excellent example for the value of utilizing contemporary scientific knowledge in order to elucidate the origin, anthropology and evolvement of ancient myth and superstition. PMID:23675050
Adapting computational text analysis to social science (and vice versa)

Directory of Open Access Journals (Sweden)

Paul DiMaggio

2015-11-01

Full Text Available Social scientists and computer scientists are divided by small differences in perspective and not by any significant disciplinary divide. In the field of text analysis, several such differences are noted: social scientists often use unsupervised models to explore corpora, whereas many computer scientists employ supervised models to train data; social scientists hold to more conventional causal notions than do most computer scientists, and often favor intense exploitation of existing algorithms, whereas computer scientists focus more on developing new models; and computer scientists tend to trust human judgment more than social scientists do. These differences have implications that potentially can improve the practice of social science.
Creazione e sviluppo di corpora multimediali. Nuove metodologie di ricerca nella traduzione audiovisiva

OpenAIRE

Valentini, Cristina

2009-01-01

The construction and use of multimedia corpora has been advocated for a while in the literature as one of the expected future application fields of Corpus Linguistics. This research project represents a pioneering experience aimed at applying a data-driven methodology to the study of the field of AVT, similarly to what has been done in the last few decades in the macro-field of Translation Studies. This research was based on the experience of Forlixt 1, the Forlì Corpus of Screen Translation,...
Specification of Drosophila corpora cardiaca neuroendocrine cells from mesoderm is regulated by Notch signaling.

Directory of Open Access Journals (Sweden)

Sangbin Park

2011-08-01

Full Text Available Drosophila neuroendocrine cells comprising the corpora cardiaca (CC) are essential for systemic glucose regulation and represent functional orthologues of vertebrate pancreatic α-cells. Although Drosophila CC cells have been regarded as developmental orthologues of pituitary gland, the genetic regulation of CC development is poorly understood. From a genetic screen, we identified multiple novel regulators of CC development, including Notch signaling factors. Our studies demonstrate that the disruption of Notch signaling can lead to the expansion of CC cells. Live imaging demonstrates localized emergence of extra precursor cells as the basis of CC expansion in Notch mutants. Contrary to a recent report, we unexpectedly found that CC cells originate from head mesoderm. We show that Tinman expression in head mesoderm is regulated by Notch signaling and that the combination of Daughterless and Tinman is sufficient for ectopic CC specification in mesoderm. Understanding the cellular, genetic, signaling, and transcriptional basis of CC cell specification and expansion should accelerate discovery of molecular mechanisms regulating ontogeny of organs that control metabolism.
Conservation Implications of the Prevalence and Representation of Locally Extinct Mammals in the Folklore of Native Americans

OpenAIRE

Preston Matthew; Harcourt Alexander

2009-01-01

Many rationales for wildlife conservation have been suggested. One rationale not often mentioned is the impact of extinctions on the traditions of local people, and conservationists' subsequent need to strongly consider culturally based reasons for conservation. As a first step in strengthening the case for this rationale, we quantitatively examined the presence and representation of eight potentially extinct mammals in folklore of 48 Native American tribes that live/lived near to 11 n...
The development of folklore, arts and crafts in Ukrainian ethnic minorities: trends (1990s–2000s)

OpenAIRE

V. M. Pekarchuk

2014-01-01

Drawing on a wide range of historical facts, analytical works, and scientific documents, the article attempts to reconstruct the place and role of folklore, arts, and crafts in the cultures of Ukrainian ethnic minorities in the 1990s and 2000s. The importance of this problem stems, first of all, from the need for a clear understanding of the mechanisms by which an independent state addresses interethnic relations. It was found that during the study years in Ukraine,...
The friends that game together: A folkloric expansion of textual poaching to genre farming for socialization in tabletop role-playing games

Directory of Open Access Journals (Sweden)

Michael Robert Underwood

2009-03-01

Full Text Available Tabletop role-playing games (RPGs) are a folkloric form for creating and reaffirming community bonds and performing identity. Gaming is used to communicate and perform cultural capital and identity through fictional narratives, functioning as a form of community building and/or personal expression. With quotations from ethnographic research over the course of 2 years, including interviews with several groups of gamers and participant observation, I examine the ways that players create and affirm social bonds. I return to Michel De Certeau's idea of textual poaching, as adapted by Henry Jenkins, to contrast with it a new concept of genre farming. As both platform for and object of genre farming, RPGs allow players to display cultural competence, create and reaffirm social ties, and seek entertainment in a collaborative fashion.
Blending research methods: Qualitative and quantitative approaches to researching computer corpora for language learning.

OpenAIRE

Boulton, Alex

2011-01-01

This paper outlines how corpora (in printed, electronic or multi-modal form) can be used in language learning, an area often referred to as "data-driven learning" or DDL (Johns 1991). The alleged advantages are numerous, but are in need of empirical support which is frequently claimed to be lacking in the field. However, over 80 studies have so far attempted to evaluate some aspect of corpus use by non-native speakers (Boulton 2010): these are briefly reviewed as a who...
Designing and Implementing a Cross-Language Information Retrieval System Using Linguistic Corpora

Directory of Open Access Journals (Sweden)

Amin Nezarat

2012-03-01

Full Text Available Information retrieval (IR) is a crucial area of natural language processing (NLP) and can be defined as finding documents whose content is relevant to the query need of a user. Cross-language information retrieval (CLIR) refers to a kind of information retrieval in which the language of the query and that of the searched document are different. In fact, it is a retrieval process where the user presents queries in one language to retrieve documents in another language. This paper tried to construct a bilingual lexicon of parallel chunks of English and Persian from two very large monolingual corpora and an English-Persian parallel corpus which could be directly applied to cross-language information retrieval tasks. For this purpose, a statistical measure known as Association Score (AS) was used to compute the association value between every two corresponding chunks in the corpus using a couple of complicated algorithms. Once the CLIR system was developed using this bilingual lexicon, an experiment was performed on a set of one hundred English and Persian phrases and collocations to see to what extent this system was effective in helping users find the most relevant and suitable equivalents of their queries in either language.
Russian Folk Culture in the 20th Century: Oral Evidence of the Villagers (On the Materials of Folklore Expeditions)
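The abstract does not define the Association Score, so the sketch below swaps in pointwise mutual information (PMI) as a stand-in association measure between chunks that co-occur in aligned sentence pairs. The function names, the transliterated Persian chunks, and the toy alignment are all hypothetical and serve only to show how such a score could rank candidate equivalents.

```python
import math
from collections import Counter
from itertools import product
from typing import Dict, List, Set, Tuple

Aligned = List[Tuple[Set[str], Set[str]]]  # (source chunks, target chunks) per sentence pair

# Hypothetical aligned sentence pairs; Persian chunks are transliterated for illustration.
TOY_ALIGNED: Aligned = [
    ({"good morning"}, {"sobh bekheyr"}),
    ({"good morning", "thank you"}, {"sobh bekheyr", "mamnun"}),
    ({"thank you"}, {"mamnun"}),
    ({"thank you"}, {"mamnun"}),
]


def pmi_table(aligned: Aligned) -> Dict[Tuple[str, str], float]:
    """PMI (base 2) for every chunk pair that co-occurs in an aligned sentence pair."""
    n = len(aligned)
    src: Counter = Counter()
    tgt: Counter = Counter()
    joint: Counter = Counter()
    for s_chunks, t_chunks in aligned:
        src.update(s_chunks)
        tgt.update(t_chunks)
        joint.update(product(s_chunks, t_chunks))
    return {
        (s, t): math.log2((c / n) / ((src[s] / n) * (tgt[t] / n)))
        for (s, t), c in joint.items()
    }


def best_equivalent(chunk: str, table: Dict[Tuple[str, str], float]) -> str:
    """Return the target chunk with the highest score for a given source chunk."""
    candidates = {t: score for (s, t), score in table.items() if s == chunk}
    return max(candidates, key=candidates.get)


if __name__ == "__main__":
    table = pmi_table(TOY_ALIGNED)
    print(best_equivalent("good morning", table))
```

A bilingual lexicon built this way would keep, for each source chunk, the highest-scoring target chunk (or the top few), which is the kind of lookup a CLIR system needs at query time.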

Directory of Open Access Journals (Sweden)

Ekaterina A. Dorokhova

2017-12-01

Full Text Available Folk culture is capable of developing certain adaptation mechanisms that help it promptly react to the changing conditions of natural, socio-political, and economic environment. This is evidenced by the stories of the villagers recorded during folklore expeditions to different regions of Russia. The article highlights changes that took place in the traditional Russian culture under the influence of collectivization in the 1920s–1930s, the collapse of kolkhozes in the 1990s, the development of the rural club amateur performances in the Soviet time, the events of World War II, modern military conflicts, and the Chernobyl ecological catastrophe. The authors come to the conclusion that representatives of traditional culture flexibly adapt to their new living conditions, while extreme conditions such as wars and ecological catastrophes often contribute to the actualization of folk culture and enable the return of its certain aspects to living practice.

1970 MLA Abstracts of Articles in Scholarly Journals, Volume I: General, English, American, Medieval and Neo-Latin, Celtic Literatures; and Folklore.

Science.gov (United States)

Fisher, John H., Comp.; Achtert, Walter S., Comp.

The first volume of an annual series following the arrangement of the "MLA International Bibliography" includes sections on General, English, American, Medieval and Neo-Latin, Celtic literatures, and Folklore. A classified collection of 1,744 brief abstracts of journal articles on the modern languages and literatures to be used in conjunction with…
Mining consumer health vocabulary from community-generated text.

Science.gov (United States)

Vydiswaran, V G Vinod; Mei, Qiaozhu; Hanauer, David A; Zheng, Kai

2014-01-01

Community-generated text corpora can be a valuable resource to extract consumer health vocabulary (CHV) and link them to professional terminologies and alternative variants. In this research, we propose a pattern-based text-mining approach to identify pairs of CHV and professional terms from Wikipedia, a large text corpus created and maintained by the community. A novel measure, leveraging the ratio of frequency of occurrence, was used to differentiate consumer terms from professional terms. We empirically evaluated the applicability of this approach using a large data sample consisting of MedLine abstracts and all posts from an online health forum, MedHelp. The results show that the proposed approach is able to identify synonymous pairs and label the terms as either consumer or professional term with high accuracy. We conclude that the proposed approach provides great potential to produce a high quality CHV to improve the performance of computational applications in processing consumer-generated health text.
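The abstract says only that the measure leverages "the ratio of frequency of occurrence"; one plausible reading is sketched below, comparing a term's relative frequency in consumer-written text with its relative frequency in professional text. The function names, the smoothing constant, and the toy token lists are assumptions introduced here, not details taken from the paper.

```python
from collections import Counter
from typing import Dict, List


def frequency_ratio(
    term: str,
    consumer_tokens: List[str],
    professional_tokens: List[str],
    eps: float = 1e-9,  # assumed smoothing to avoid division by zero
) -> float:
    """Relative frequency in consumer text divided by that in professional text."""
    consumer_rate = Counter(consumer_tokens)[term] / len(consumer_tokens)
    professional_rate = Counter(professional_tokens)[term] / len(professional_tokens)
    return (consumer_rate + eps) / (professional_rate + eps)


def label_pair(
    term_a: str,
    term_b: str,
    consumer_tokens: List[str],
    professional_tokens: List[str],
) -> Dict[str, str]:
    """Label the term with the higher ratio as the consumer variant of a synonym pair."""
    ratio_a = frequency_ratio(term_a, consumer_tokens, professional_tokens)
    ratio_b = frequency_ratio(term_b, consumer_tokens, professional_tokens)
    if ratio_a >= ratio_b:
        return {term_a: "consumer", term_b: "professional"}
    return {term_a: "professional", term_b: "consumer"}


if __name__ == "__main__":
    # Hypothetical token lists standing in for forum posts and abstracts.
    forum = ["nosebleed", "nosebleed", "nosebleed", "epistaxis", "help", "pain"]
    abstracts = ["epistaxis", "epistaxis", "epistaxis", "nosebleed", "patient", "study"]
    print(label_pair("nosebleed", "epistaxis", forum, abstracts))
```

In this reading, a term that is proportionally more common in forum posts than in abstracts is treated as consumer vocabulary, and its synonym as the professional term.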
From university research to innovation: Detecting knowledge transfer via text mining

Energy Technology Data Exchange (ETDEWEB)

Woltmann, S.; Clemmensen, L.; Alkærsig, L

2016-07-01

Knowledge transfer by universities is a top priority in innovation policy and a primary purpose for public research funding, due to being an important driver of technical change and innovation. Current empirical research on the impact of university research relies mainly on formal databases and indicators such as patents, collaborative publications and license agreements, to assess the contribution to the socioeconomic surrounding of universities. In this study, we present an extension of the current empirical framework by applying new computational methods, namely text mining and pattern recognition. Text samples for this purpose can include files containing social media contents, company websites and annual reports. The empirical focus in the present study is on the technical sciences and in particular on the case of the Technical University of Denmark (DTU). We generated two independent text collections (corpora) to identify correlations of university publications and company webpages. One corpus represents the company sites, serving as a sample of the private economy, and a second corpus, containing relevant publications, provides the reference to the university research. We associated the former with the latter to obtain insights into possible text and semantic relatedness. The text mining methods extrapolate the correlations, semantic patterns and content comparison of the two corpora to define the document relatedness. We expect the development of a novel tool using contemporary techniques for the measurement of public research impact. The approach aims to be applicable across universities and thus enable a more holistic comparable assessment. It relies less on formal databases, which is certainly beneficial in terms of data reliability. We seek to provide a supplementary perspective for the detection of the dissemination of university research and hereby enable policy makers to gain additional insights of (informal) contributions of knowledge
Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.

Directory of Open Access Journals (Sweden)

Anika Oellrich

Full Text Available Electronic health records and scientific articles possess differing linguistic characteristics that may impact the performance of natural language processing tools developed for one or the other. In this paper, we investigate the performance of four extant concept recognition tools: the clinical Text Analysis and Knowledge Extraction System (cTAKES), the National Center for Biomedical Ontology (NCBO) Annotator, the Biomedical Concept Annotation System (BeCAS) and MetaMap. Each of the four concept recognition systems is applied to four different corpora: the i2b2 corpus of clinical documents, a PubMed corpus of Medline abstracts, a clinical trials corpus and the ShARe/CLEF corpus. In addition, we assess the individual system performances with respect to one gold standard annotation set, available for the ShARe/CLEF corpus. Furthermore, we built a silver standard annotation set from the individual systems' output and assess the quality as well as the contribution of individual systems to the quality of the silver standard. Our results demonstrate that mainly the NCBO annotator and cTAKES contribute to the silver standard corpora (F1-measures in the range of 21% to 74%) and their quality (best F1-measure of 33%), independent from the type of text investigated. While BeCAS and MetaMap can contribute to the precision of silver standard annotations (precision of up to 42%), the F1-measure drops when combined with NCBO Annotator and cTAKES due to a low recall. In conclusion, the performances of individual systems need to be improved independently from the text types, and the leveraging strategies to best take advantage of individual systems' annotations need to be revised. The textual content of the PubMed corpus, accession numbers for the clinical trials corpus, and assigned annotations of the four concept recognition systems as well as the generated silver standard annotation sets are available from http://purl.org/phenotype/resources. The textual content
PEDANT: Parallel Texts in Göteborg

Directory of Open Access Journals (Sweden)

Daniel Ridings

2012-09-01

Full Text Available
The article presents the status of the PEDANT project with parallel corpora at the Language Bank at Göteborg University. The solutions for access to the corpus data are presented. Access is provided by way of the internet, standard applications and SGML-aware programming tools. The SGML format for encoding translation pairs is also outlined. The methods allow working with everything from plain text to texts densely encoded with linguistic information.
The Application of Hermeneutical Analysis to Research on the Cold War in Soviet Animation Media Texts from the Second Half of the 1940s

Science.gov (United States)

Fedorov, A. V.

2015-01-01

The Cold War era, which spawned a mutual ideological confrontation between communist and capitalist countries, left its mark on all categories of media texts, including cartoons and animations. Cartoons were used by the authorities as tools for delivering the necessary confrontational ideological content in an attractive folkloric, fairy-tale…
From university research to innovation Detecting knowledge transfer via text mining

DEFF Research Database (Denmark)

Woltmann, Sabrina; Clemmensen, Line Katrine Harder; Alkærsig, Lars

2016-01-01

and indicators such as patents, collaborative publications and license agreements, to assess the contribution to the socioeconomic surrounding of universities. In this study, we present an extension of the current empirical framework by applying new computational methods, namely text mining and pattern...... associated the former with the latter to obtain insights into possible text and semantic relatedness. The text mining methods are extrapolating the correlations, semantic patterns and content comparison of the two corpora to define the document relatedness. We expect the development of a novel tool using...... recognition. Text samples for this purpose can include files containing social media contents, company websites and annual reports. The empirical focus in the present study is on the technical sciences and in particular on the case of the Technical University of Denmark (DTU). We generated two independent...
“Tá cuid de na mná blasta/Some Women Are Sweet Talkers”: Representations of Women in Seán Ó hEochaidh’s Field Diaries for the Irish Folklore Commission

Directory of Open Access Journals (Sweden)

Lillis Ó Laoire

2017-10-01

Full Text Available This article discusses representations of women in diaries written by Seán Ó hEochaidh as part of his work as a field collector for the Irish Folklore Commission (1935-1971). Focusing on a number of well-described events and characters, the article reveals the collector’s attitude to women as they emerge from his writing. It also shows how women could help or hinder his collecting work. The disparities of the lives of a number of working women from Donegal during the period are also highlighted.
Comparative metabolism of branched-chain amino acids to precursors of juvenile hormone biogenesis in corpora allata of lepidopterous versus nonlepidopterous insects

Energy Technology Data Exchange (ETDEWEB)

Brindle, P.A.; Schooley, D.A.; Tsai, L.W.; Baker, F.C.

1988-08-05

Comparative studies were performed on the role of branched-chain amino acids (BCAA) in juvenile hormone (JH) biosynthesis using several lepidopterous and nonlepidopterous insects. Corpora cardiaca-corpora allata complexes (CC-CA, the corpora allata being the organ of JH biogenesis) were maintained in culture medium containing a uniformly ¹⁴C-labeled BCAA, together with [methyl-³H]methionine as mass marker for JH quantification. BCAA catabolism was quantified by directly analyzing the medium for the presence of ¹⁴C-labeled propionate and/or acetate, while JHs were extracted, purified by liquid chromatography, and subjected to double-label liquid scintillation counting. Our results indicate that active BCAA catabolism occurs within the CC-CA of lepidopterans, and this efficiently provides propionyl-CoA (from isoleucine or valine) for the biosynthesis of the ethyl branches of JH I and II. Acetyl-CoA, formed from isoleucine or leucine catabolism, is also utilized by lepidopteran CC-CA for biosynthesizing JH III and the acetate-derived portions of the ethyl-branched JHs. In contrast, CC-CA of nonlepidopterans fail to catabolize BCAA. Consequently, exogenous isoleucine or leucine does not serve as a carbon source for the biosynthesis of JH III by these glands, and no propionyl-CoA is produced for genesis of ethyl-branched JHs. This is the first observation of a tissue-specific metabolic difference which in part explains why these novel homosesquiterpenoids exist in lepidopterans, but not in nonlepidopterans.
Comparative metabolism of branched-chain amino acids to precursors of juvenile hormone biogenesis in corpora allata of lepidopterous versus nonlepidopterous insects

International Nuclear Information System (INIS)

Brindle, P.A.; Schooley, D.A.; Tsai, L.W.; Baker, F.C.

1988-01-01

Comparative studies were performed on the role of branched-chain amino acids (BCAA) in juvenile hormone (JH) biosynthesis using several lepidopterous and nonlepidopterous insects. Corpora cardiaca-corpora allata complexes (CC-CA, the corpora allata being the organ of JH biogenesis) were maintained in culture medium containing a uniformly ¹⁴C-labeled BCAA, together with [methyl-³H]methionine as mass marker for JH quantification. BCAA catabolism was quantified by directly analyzing the medium for the presence of ¹⁴C-labeled propionate and/or acetate, while JHs were extracted, purified by liquid chromatography, and subjected to double-label liquid scintillation counting. Our results indicate that active BCAA catabolism occurs within the CC-CA of lepidopterans, and this efficiently provides propionyl-CoA (from isoleucine or valine) for the biosynthesis of the ethyl branches of JH I and II. Acetyl-CoA, formed from isoleucine or leucine catabolism, is also utilized by lepidopteran CC-CA for biosynthesizing JH III and the acetate-derived portions of the ethyl-branched JHs. In contrast, CC-CA of nonlepidopterans fail to catabolize BCAA. Consequently, exogenous isoleucine or leucine does not serve as a carbon source for the biosynthesis of JH III by these glands, and no propionyl-CoA is produced for genesis of ethyl-branched JHs. This is the first observation of a tissue-specific metabolic difference which in part explains why these novel homosesquiterpenoids exist in lepidopterans, but not in nonlepidopterans.
Power and group identity: a study of musical corporations in the Vertentes region

Directory of Open Access Journals (Sweden)

Marcos Vieira-Silva

2013-01-01

Full Text Available This study sought to understand the historical constitution of identity formations and their links with power relations in the everyday activities of three musical corporations in Minas Gerais. The musicians' identity process was found to be permeated by the prestige and value that the musical tradition confers in the region. Differences in the production of individual and collective identities can influence inter- and intragroup power relations. Likewise, the various ways in which power relations are established among the agents influence the development of the group process and the musical activity. This activity legitimizes both collective and individual identities, keeping the musical life of the Vertentes region alive and intense through the ages.
Symbolic Machine Learning: A Different Answer to the Problem of the Acquisition of Lexical Knowledge from Corpora

Directory of Open Access Journals (Sweden)

Pascale Sébillot

2008-07-01

Full Text Available One relevant way to structure the domain of lexical knowledge (e.g. relations between lexical units) acquisition from corpora is to oppose numerical versus symbolic techniques. Numerical approaches to acquisition exploit the frequency aspect of data, have been widely used, and produce portable systems, but give poor explanations of their results. Symbolic approaches exploit the structural aspect of data. Among them, symbolic machine learning (ML) techniques can infer efficient and expressive patterns of a target relation from examples of elements that verify this relation. These methods are, however, far less known, and the aim of this paper is to point out their interest through the description of one precise experiment. To remove their supervised characteristic, and instead of opposing them to numerical approaches, we finally show that it is possible to combine one symbolic ML technique with one numerical one, and keep the advantages of both (meaningful patterns, efficient extraction, portability).
DEEP LEARNING MODEL FOR BILINGUAL SENTIMENT CLASSIFICATION OF SHORT TEXTS

Directory of Open Access Journals (Sweden)

Y. B. Abdullin

2017-01-01

Full Text Available Sentiment analysis of short texts such as Twitter messages and comments in news portals is challenging due to the lack of contextual information. We propose a deep neural network model that uses bilingual word embeddings to effectively solve the sentiment classification problem for a given pair of languages. We apply our approach to two corpora of two different language pairs: English-Russian and Russian-Kazakh. We show how to train a classifier in one language and predict in another. Our approach achieves 73% accuracy for English and 74% accuracy for Russian. For Kazakh sentiment analysis, we propose a baseline method that achieves 60% accuracy, and a method to learn bilingual embeddings from a large unlabeled corpus using bilingual word pairs.
Generation of silver standard concept annotations from biomedical texts with special relevance to phenotypes.

Science.gov (United States)

Oellrich, Anika; Collier, Nigel; Smedley, Damian; Groza, Tudor

2015-01-01

Electronic health records and scientific articles possess differing linguistic characteristics that may impact the performance of natural language processing tools developed for one or the other. In this paper, we investigate the performance of four extant concept recognition tools: the clinical Text Analysis and Knowledge Extraction System (cTAKES), the National Center for Biomedical Ontology (NCBO) Annotator, the Biomedical Concept Annotation System (BeCAS) and MetaMap. Each of the four concept recognition systems is applied to four different corpora: the i2b2 corpus of clinical documents, a PubMed corpus of Medline abstracts, a clinical trials corpus and the ShARe/CLEF corpus. In addition, we assess the individual system performances with respect to one gold standard annotation set, available for the ShARe/CLEF corpus. Furthermore, we built a silver standard annotation set from the individual systems' output and assess the quality as well as the contribution of individual systems to the quality of the silver standard. Our results demonstrate that mainly the NCBO annotator and cTAKES contribute to the silver standard corpora (F1-measures in the range of 21% to 74%) and their quality (best F1-measure of 33%), independent from the type of text investigated. While BeCAS and MetaMap can contribute to the precision of silver standard annotations (precision of up to 42%), the F1-measure drops when combined with NCBO Annotator and cTAKES due to a low recall. In conclusion, the performances of individual systems need to be improved independently from the text types, and the leveraging strategies to best take advantage of individual systems' annotations need to be revised. The textual content of the PubMed corpus, accession numbers for the clinical trials corpus, and assigned annotations of the four concept recognition systems as well as the generated silver standard annotation sets are available from http://purl.org/phenotype/resources. The textual content of the Sh
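As a rough illustration of the silver-standard idea in the record above, the following Python sketch merges the outputs of several annotators by simple voting and scores the result against a gold set. The abstract does not say how the systems' outputs were actually combined, so the voting rule, the `min_votes` threshold, the (start, end, concept) tuple format, and both function names are assumptions made for illustration only.

```python
from collections import Counter


def silver_standard(system_outputs, min_votes=2):
    """Keep every annotation proposed by at least `min_votes` systems.

    `system_outputs` maps a system name to a set of annotations; each
    annotation is a hashable item such as (start, end, concept_id).
    The voting rule is an assumed, illustrative merging strategy.
    """
    votes = Counter()
    for annotations in system_outputs.values():
        votes.update(set(annotations))  # one vote per system per annotation
    return {ann for ann, count in votes.items() if count >= min_votes}


def precision_recall_f1(predicted, gold):
    """Exact-match precision, recall and F1 of `predicted` against `gold`."""
    true_positives = len(predicted & gold)
    precision = true_positives / len(predicted) if predicted else 0.0
    recall = true_positives / len(gold) if gold else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1
```

Raising `min_votes` would be expected to trade recall for precision, which is one way to read the precision and recall trade-off the abstract reports when systems are combined.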
Fast and Effective Approximations for Summarization and Categorization of Very Large Text Corpora

OpenAIRE

Godbehere, Andrew B.

2015-01-01

Given the overwhelming quantities of data generated every day, there is a pressing need for tools that can extract valuable and timely information. Vast reams of text data are now published daily, containing information of interest to those in social science, marketing, finance, and public policy, to name a few. Consider the case of the micro-blogging website Twitter, which in May 2013 was estimated to contain 58 million messages per day: in a single day, Twitter generates a greater volume of...
Transformation of folklore tradition in the poem by M.I. Tsvetaeva “From the Sea”

Directory of Open Access Journals (Sweden)

Galieva Marianna Andreevna

2015-06-01

Full Text Available The paper studies how folk tradition functions in the poetics of M.I. Tsvetaeva. The object of research is the 1926 poem “From the Sea”. Scholars have carefully studied the motivic structure of the poem, but little attention has been paid to its folk elements. Special attention is paid here to the motif of travel to “the other world”, which is semantically correlated with the motif of sleep. The folklorism of Tsvetaeva’s work has been studied extensively, but there is still a need to identify the implicit forms of folk tradition present in her poetics. Our work concerns the breaking of the folk tradition, its inner form. The connection to the archetypal models of poetry (the ship) is traced through pre-genre formations. The appeal to the fairy-tale tradition and to the motif of travel to “the other world” reveals the archetypal, rather than the typical, in the poetry of the early 20th century. The historical and typological method is applied; Tsvetaeva’s metaphor is genetically traced to ritual reality as expressed in the plot structure of the ship and the eidology of the “other kingdom”. Historical poetics allows the poem “From the Sea” to be viewed differently.
The Folklore - Nationalism Relationship in the Balkans. Case Study “Whose Is This Song?” by Adela Peeva

Directory of Open Access Journals (Sweden)

Elena-Lorena Nedelcu

2016-06-01

Full Text Available This article analyses a 2003 documentary titled “Whose Is This Song?” by Bulgarian movie director Adela Peeva, with the aim of understanding the relationship between folklore and nationalism in the Balkans. The theme of the documentary is the director’s quest to trace the roots of a folk song that, since her childhood, she had thought was 100 percent Bulgarian. The documentary follows Peeva’s journey with a camera in hand around Turkey, Greece, Macedonia, Albania, Bosnia and Herzegovina, Serbia and Bulgaria, where she discovers that the song is sung by all of these nations. The documentary can be interpreted as showing how an ordinary song can become an instrument of fanatical nationalism, revealing mutual strife instead of Balkan unity. In a region defined by ethnic hatred and war, what begins as a simple investigation of the true origins of a song ends as a sociological and historical exploration of the deep misunderstandings between the people of the Balkans.
Chapter 16: text mining for translational bioinformatics.

Science.gov (United States)

Cohen, K Bretonnel; Hunter, Lawrence E

2013-04-01

Text mining for translational bioinformatics is a new field with tremendous research potential. It is a subfield of biomedical natural language processing that concerns itself directly with the problem of relating basic biomedical research to clinical practice, and vice versa. Applications of text mining fall both into the category of T1 translational research (translating basic science results into new interventions) and T2 translational research, or translational research for public health. Potential use cases include better phenotyping of research subjects, and pharmacogenomic research. A variety of methods for evaluating text mining applications exist, including corpora, structured test suites, and post hoc judging. Two basic principles of linguistic structure are relevant for building text mining applications. One is that linguistic structure consists of multiple levels. The other is that every level of linguistic structure is characterized by ambiguity. There are two basic approaches to text mining: rule-based, also known as knowledge-based; and machine-learning-based, also known as statistical. Many systems are hybrids of the two approaches. Shared tasks have had a strong effect on the direction of the field. Like all translational bioinformatics software, text mining software for translational bioinformatics can be considered health-critical and should be subject to the strictest standards of quality assurance and software testing.
Text mining, a race against time? An attempt to quantify possible variations in text corpora of medical publications throughout the years.

Science.gov (United States)

Wagner, Mathias; Vicinus, Benjamin; Muthra, Sherieda T; Richards, Tereza A; Linder, Roland; Frick, Vilma Oliveira; Groh, Andreas; Rubie, Claudia; Weichert, Frank

2016-06-01

The continuous growth of medical sciences literature indicates the need for automated text analysis. Scientific writing which is neither unitary, transcending social situation nor defined by a timeless idea is subject to constant change as it develops in response to evolving knowledge, aims at different goals, and embodies different assumptions about nature and communication. The objective of this study was to evaluate whether publication dates should be considered when performing text mining. A search of PUBMED for combined references to chemokine identifiers and particular cancer related terms was conducted to detect changes over the past 36 years. Text analyses were performed using freeware available from the World Wide Web. TOEFL Scores of territories hosting institutional affiliations as well as various readability indices were investigated. Further assessment was conducted using Principal Component Analysis. Laboratory examination was performed to evaluate the quality of attempts to extract content from the examined linguistic features. The PUBMED search yielded a total of 14,420 abstracts (3,190,219 words). The range of findings in laboratory experimentation were coherent with the variability of the results described in the analyzed body of literature. Increased concurrence of chemokine identifiers together with cancer related terms was found at the abstract and sentence level, whereas complexity of sentences remained fairly stable. The findings of the present study indicate that concurrent references to chemokines and cancer increased over time whereas text complexity remained stable. Copyright © 2016 Elsevier Ltd. All rights reserved.
Entropy Rate Estimates for Natural Language—A New Extrapolation of Compressed Large-Scale Corpora

Directory of Open Access Journals (Sweden)

Ryosuke Takahira

2016-10-01

Full Text Available One of the fundamental questions about human language is whether its entropy rate is positive. The entropy rate measures the average amount of information communicated per unit time. The question about the entropy of language dates back to experiments by Shannon in 1951, but in 1990 Hilberg raised doubt regarding a correct interpretation of these experiments. This article provides an in-depth empirical analysis, using 20 corpora of up to 7.8 gigabytes across six languages (English, French, Russian, Korean, Chinese, and Japanese), to conclude that the entropy rate is positive. To obtain the estimates for data length tending to infinity, we use an extrapolation function given by an ansatz. Whereas some ansatzes were proposed previously, here we use a new stretched exponential extrapolation function that has a smaller error of fit. Thus, we conclude that the entropy rates of human languages are positive but approximately 20% smaller than without extrapolation. Although the entropy rate estimates depend on the script kind, the exponent of the ansatz function turns out to be constant across different languages and governs the complexity of natural language in general. In other words, in spite of typological differences, all languages seem equally hard to learn, which partly confirms Hilberg’s hypothesis.
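The compression-based estimate underlying the record above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the abstract does not name the compressor or give the form of the stretched exponential ansatz, so `zlib` stands in as a generic compressor, the extrapolation step is omitted, and the function names `compression_rate` and `rate_curve` are hypothetical. Rates are reported in bits per byte of UTF-8 text, which is not the same as bits per character for non-Latin scripts.

```python
import zlib


def compression_rate(text: str, level: int = 9) -> float:
    """Upper-bound estimate of the entropy rate, in bits per input byte.

    zlib is an assumed stand-in compressor; a stronger compressor
    would give a tighter (lower) bound.
    """
    data = text.encode("utf-8")
    if not data:
        raise ValueError("text must be non-empty")
    return 8 * len(zlib.compress(data, level)) / len(data)


def rate_curve(text: str, lengths):
    """Compression rate on growing prefixes of the corpus.

    Returns (prefix_length_in_bytes, bits_per_byte) pairs; lengths
    outside 1..len(data) are skipped. Fitting an extrapolation
    function to this curve is the step the abstract describes and
    this sketch leaves out.
    """
    data = text.encode("utf-8")
    curve = []
    for n in lengths:
        if 0 < n <= len(data):
            curve.append((n, 8 * len(zlib.compress(data[:n], 9)) / n))
    return curve
```

Plotting the pairs returned by `rate_curve` for increasing prefix lengths shows how the estimate typically keeps falling as more data is seen, which is why an extrapolation to infinite length is needed at all.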

The Relationship of Nutritional Status and Physical Fitness Level to Work Productivity among Female Workers in Spinning Unit 1, Winding Section, PT. Apac Inti Corpora Bawen

Directory of Open Access Journals (Sweden)

Sri Rahayu Utami

2014-10-01

Full Text Available The purpose of this research was to determine the relationship of nutritional status and level of physical fitness to work productivity among female workers in Spinning Unit 1, Winding Section, at PT. Apac Inti Corpora Bawen. The study was explanatory research with a cross-sectional approach. The population comprised 73 workers, of whom 45 were sampled by simple random sampling. The instruments were a weight scale and height measure, a Harvard bench, a metronome, a stopwatch and a productivity data sheet. Data were analyzed using the Chi-square test with α = 0.05. The results showed a relationship between work productivity and both nutritional status (p = 0.005) and level of physical fitness (p = 0.001). Based on this research, workers are encouraged to consume foods that provide balanced nutrition and to exercise to improve their physical fitness.
Overfitting Reduction of Text Classification Based on AdaBELM

Directory of Open Access Journals (Sweden)

Xiaoyue Feng

2017-07-01

Full Text Available Overfitting is an important problem in machine learning. Several algorithms, such as the extreme learning machine (ELM), suffer from this issue when facing high-dimensional sparse data, e.g., in text classification. One common issue is that the extent of overfitting is not well quantified. In this paper, we propose a quantitative measure of overfitting referred to as the rate of overfitting (RO) and a novel model, named AdaBELM, to reduce the overfitting. With RO, the overfitting problem can be quantitatively measured and identified. The newly proposed model can achieve high performance on multi-class text classification. To evaluate the generalizability of the new model, we designed experiments based on three datasets, i.e., the 20 Newsgroups, Reuters-21578, and BioMed corpora, which represent balanced, unbalanced, and real application data, respectively. Experiment results demonstrate that AdaBELM can reduce overfitting and outperform classical ELM, decision tree, random forests, and AdaBoost on all three text-classification datasets; for example, it can achieve 62.2% higher accuracy than ELM. Therefore, the proposed model has good generalizability.
Benefits of Sharing Corpora when Analyzing Online Interactions: an Example of Methodology Related to a Databank of Learning and Teaching Corpora.

Directory of Open Access Journals (Sweden)

Maud Ciekanski

2010-12-01

Full Text Available Research on online interactions in learning situations still too rarely offers access to the data from which researchers developed the analyses presented in publications. This restricts understanding of the phenomena studied and prevents any replication for the purposes of comparison or of cumulative or contrastive analysis. In the Mulce project, we defend the following methodological position: to enable an analysis of situated interactions, the various data from online courses should be linked together to build an object of analysis that different teams and disciplines can use. At present, data are often decontextualized, fragmentary, or simply inaccessible to the research community. We therefore propose structuring the data into learning and teaching corpora (Letec) so that they can be exchanged and analyses can be capitalized on. Their components are the research protocol, the pedagogical scenario, the interactions, productions and traces, the licenses, and the reusable analyses. This article first presents the theoretical, technical, and methodological questions raised by the design of such a project. We then illustrate our approach with examples from the Simuligne and Copéas courses, indicating the simple processes for transforming the Mulce format into the formats required by two analysis-support tools (one for forums, the other for aligning video and transcription). We particularly stress the value of these tools for analyzing the phenomena of polyfocalization and multimodal writing in multimodal interactions, which are characteristic of online learning environments. We will conclude our
Juvenile hormone biosynthesis gene expression in the corpora allata of honey bee (Apis mellifera L.) female castes.

Science.gov (United States)

Bomtorin, Ana Durvalina; Mackert, Aline; Rosa, Gustavo Conrado Couto; Moda, Livia Maria; Martins, Juliana Ramos; Bitondi, Márcia Maria Gentile; Hartfelder, Klaus; Simões, Zilá Luz Paulino

2014-01-01

Juvenile hormone (JH) controls key events in the honey bee life cycle, viz. caste development and age polyethism. We quantified transcript abundance of 24 genes involved in the JH biosynthetic pathway in the corpora allata-corpora cardiaca (CA-CC) complex. The expression of six of these genes showing relatively high transcript abundance was contrasted with CA size, hemolymph JH titer, as well as JH degradation rates and JH esterase (jhe) transcript levels. Gene expression did not match the contrasting JH titers in queen and worker fourth instar larvae, but jhe transcript abundance and JH degradation rates were significantly lower in queen larvae. Consequently, transcriptional control of JHE is of importance in regulating larval JH titers and caste development. In contrast, the same analyses applied to adult worker bees allowed us to infer that the high JH levels in foragers are due to increased JH synthesis. Upon RNAi-mediated silencing of the methyl farnesoate epoxidase gene (mfe) encoding the enzyme that catalyzes methyl farnesoate-to-JH conversion, the JH titer was decreased, thus corroborating that JH titer regulation in adult honey bees depends on this final JH biosynthesis step. The molecular pathway differences underlying JH titer regulation in larval caste development versus adult age polyethism lead us to propose that mfe and jhe genes be assayed when addressing questions on the role(s) of JH in social evolution.
Helios: Understanding Solar Evolution Through Text Analytics

Energy Technology Data Exchange (ETDEWEB)

Randazzese, Lucien [SRI International, Menlo Park, CA (United States)

2016-12-02

This proof-of-concept project focused on developing, testing, and validating a range of bibliometric, text analytic, and machine-learning based methods to explore the evolution of three photovoltaic (PV) technologies: Cadmium Telluride (CdTe), Dye-Sensitized solar cells (DSSC), and Multi-junction solar cells. The analytical approach to the work was inspired by previous work by the same team to measure and predict the scientific prominence of terms and entities within specific research domains. The goal was to create tools that could assist domain-knowledgeable analysts in investigating the history and path of technological developments in general, with a focus on analyzing step-function changes in performance, or “breakthroughs,” in particular. The text-analytics platform developed during this project was dubbed Helios. The project relied on computational methods for analyzing large corpora of technical documents. For this project we ingested technical documents from the following sources into Helios: Thomson Scientific Web of Science (papers), the U.S. Patent & Trademark Office (patents), the U.S. Department of Energy (technical documents), the U.S. National Science Foundation (project funding summaries), and a hand curated set of full-text documents from Thomson Scientific and other sources.
Comparative Evaluation of the Aphrodisiac efficacy of Sildenafil and Carpolobia lutea Root Extract in Male Rabbits

Directory of Open Access Journals (Sweden)

Ayobami Dare

2015-12-01

Full Text Available Aims: In spite of folkloric use of the root of Carpolobia lutea as a sexual stimulant in man, there has been limited scientific proof of its efficacy. This study evaluates the efficacy of methanol extract of Carpolobia lutea root (MECLR) on sexual activity of male rabbits. Methods: Twenty adult male rabbits were divided into four groups of five rabbits each. Groups 1-4 were treated orally for 28 days with 2ml/kg 1% tween 20 (vehicle), 40mg/kg MECLR, 80mg/kg MECLR, and 0.5mg/kg sildenafil citrate respectively. Sexual activity of males from each group was assessed by cohabiting them with a sexually receptive female at estrus on days 0, 1, 3 and 5 using a digital camera mounted on the mating arena. Serum testosterone and nitric oxide concentration of the corpora cavernosa homogenates were also determined. Results: MECLR caused a dose-dependent significant increase in mount frequency, intromission frequency and ejaculatory latency, while it reduced mount latency, intromission latency and post-ejaculatory latency (similar to sildenafil citrate) when compared with the control. MECLR also caused a significant increase in nitric oxide concentration in corpora cavernosa but no change in serum testosterone concentration. Conclusions: Results suggest that MECLR enhances male sexual activity possibly by augmenting nitric oxide concentration. This study thus provides novel scientific rationale for the use of Carpolobia lutea in the management of penile erectile dysfunction and impaired libido. [J Intercult Ethnopharmacol 2015; 4(4): 302-307]
Building an ontology of pulmonary diseases with natural language processing tools using textual corpora.

Science.gov (United States)

Baneyx, Audrey; Charlet, Jean; Jaulent, Marie-Christine

2007-01-01

Pathologies and acts are classified in thesauri to help physicians to code their activity. In practice, the use of thesauri is not sufficient to reduce variability in coding and thesauri are not suitable for computer processing. We think the automation of the coding task requires a conceptual modeling of medical items: an ontology. Our task is to help lung specialists code acts and diagnoses with software that represents medical knowledge of this concerned specialty by an ontology. The objective of the reported work was to build an ontology of pulmonary diseases dedicated to the coding process. To carry out this objective, we develop a precise methodological process for the knowledge engineer in order to build various types of medical ontologies. This process is based on the need to express precisely in natural language the meaning of each concept using differential semantics principles. A differential ontology is a hierarchy of concepts and relationships organized according to their similarities and differences. Our main research hypothesis is to apply natural language processing tools to corpora to develop the resources needed to build the ontology. We consider two corpora, one composed of patient discharge summaries and the other being a teaching book. We propose to combine two approaches to enrich the ontology building: (i) a method which consists of building terminological resources through distributional analysis and (ii) a method based on the observation of corpus sequences in order to reveal semantic relationships. Our ontology currently includes 1550 concepts and the software implementing the coding process is still under development. Results show that the proposed approach is operational and indicates that the combination of these methods and the comparison of the resulting terminological structures give interesting clues to a knowledge engineer for the building of an ontology.
Aspects of Text Mining From Computational Semiotics to Systemic Functional Hypertexts

Directory of Open Access Journals (Sweden)

Alexander Mehler

2001-05-01

Full Text Available The significance of natural language texts as the prime information structure for the management and dissemination of knowledge in organisations is still increasing. Making relevant documents available depending on varying tasks in different contexts is of primary importance for any efficient task completion. Implementing this demand requires the content-based processing of texts, which makes it possible to reconstruct or, if necessary, to explore the relationship of task, context and document. Text mining is a technology that is suitable for solving problems of this kind. In the following, semiotic aspects of text mining are investigated. Based on the primary object of text mining, natural language lexis, the specific complexity of this class of signs is outlined and requirements for the implementation of text mining procedures are derived. This is done with reference to text linkage introduced as a special task in text mining. Text linkage refers to the exploration of implicit, content-based relations of texts (and their annotation as typed links) in corpora possibly organised as hypertexts. In this context, the term systemic functional hypertext is introduced, which distinguishes genre and register layers for the management of links in a poly-level hypertext system.
Transylvanianism, Nationalism, Folklore: The Academic Career of Olga Nagy in the Light of her Posthumous Book, Vallomások (2010)

Directory of Open Access Journals (Sweden)

Kata Zsófia Vincze

2016-01-01

Full Text Available The volume Vallomások [‘Testimony’], published posthumously in 2010, is the last book of the folklorist Olga Nagy (1921-2006). In this paper I will analyze Nagy’s academic significance in the light of her own last self-reflection presented in Vallomások. This volume provides an exciting overview of the internal dynamics of East-Central European culture and interethnic relations. While I examine Nagy’s life work, especially her academic work on rural women and her new ideas regarding living folklore, I will also reflect on the ideology of so-called Transylvanianism that constitutes the framework of many Hungarian writings from Romania. Transylvanianism is a complex ideology rooted in the Hungarian national movement of the nineteenth century, one that later turned into a complex manifestation of the Hungarian minorities in Romania through literature, culture, politics and self-definition. Elaborated by writers, historians and journalists, Transylvanianism after 1918, and even more vehemently after 1947, aimed to preserve and reinforce Hungarian national pride and identity in the region through cultural activities, education and political action.
Glandectomy with preservation of corpora cavernosa in the treatment of penile carcinoma

Directory of Open Access Journals (Sweden)

Fonseca Aluizio G. da

2003-01-01

Full Text Available INTRODUCTION: The objective of this work is to describe a conservative surgical technique as an alternative to classic penile amputations, aiming at local control of the disease while trying to preserve the patient's sexual function. SURGICAL TECHNIQUE: After a circular incision of the skin around the penis, the subfascial plane is developed until the base of the organ. The dorsal neurovascular bundle and the urethra are isolated in their distal extremities. The neurovascular bundle is sectioned distally. A retrocoronal dissection plane is developed between the glans and the corpora cavernosa. When this stage is complete, the glans is fixed only to the urethra, which is distally sectioned as well. The neurovascular bundle is fixed to the dorsal albuginea. Following the spatulation of the urethra, a neomeatus is created using the overlying skin of the penis. Between January 2001 and July 2002, we employed this technique in 6 patients who had epidermoid carcinoma of the penis that was limited to the glans, superficial, well or moderately differentiated and measuring up to 3 cm. COMMENTS: Several conservative surgical methods for treatment of carcinoma of the penis aim at preserving the organ in an attempt to improve the patients' quality of life; however, the rates of local recurrence and failure in disease control are significant. The described technique proved to be safe and effective for disease control, in addition to preserving sexual function in all patients who were treated, and thus represents an appealing conservative surgical alternative in selected cases.
Sparse Machine Learning Methods for Understanding Large Text Corpora

Data.gov (United States)

National Aeronautics and Space Administration — Sparse machine learning has recently emerged as powerful tool to obtain models of high-dimensional data with high degree of interpretability, at low computational...
tagtog: interactive and text-mining-assisted annotation of gene mentions in PLOS full-text articles.

Science.gov (United States)

Cejuela, Juan Miguel; McQuilton, Peter; Ponting, Laura; Marygold, Steven J; Stefancsik, Raymund; Millburn, Gillian H; Rost, Burkhard

2014-01-01

The breadth and depth of biomedical literature are increasing year upon year. To keep abreast of these increases, FlyBase, a database for Drosophila genomic and genetic information, is constantly exploring new ways to mine the published literature to increase the efficiency and accuracy of manual curation and to automate some aspects, such as triaging and entity extraction. Toward this end, we present the 'tagtog' system, a web-based annotation framework that can be used to mark up biological entities (such as genes) and concepts (such as Gene Ontology terms) in full-text articles. tagtog leverages manual user annotation in combination with automatic machine-learned annotation to provide accurate identification of gene symbols and gene names. As part of the BioCreative IV Interactive Annotation Task, FlyBase has used tagtog to identify and extract mentions of Drosophila melanogaster gene symbols and names in full-text biomedical articles from the PLOS stable of journals. We show here the results of three experiments with different sized corpora and assess gene recognition performance and curation speed. We conclude that tagtog-named entity recognition improves with a larger corpus and that tagtog-assisted curation is quicker than manual curation. DATABASE URL: www.tagtog.net, www.flybase.org.
WARCProcessor: An Integrative Tool for Building and Management of Web Spam Corpora.

Science.gov (United States)

Callón, Miguel; Fdez-Glez, Jorge; Ruano-Ordás, David; Laza, Rosalía; Pavón, Reyes; Fdez-Riverola, Florentino; Méndez, Jose Ramón

2017-12-22

In this work we present the design and implementation of WARCProcessor, a novel multiplatform integrative tool aimed at building scientific datasets to facilitate experimentation in web spam research. The developed application allows the user to specify multiple criteria that change the way in which new corpora are generated whilst reducing the number of repetitive and error-prone tasks related to existing corpus maintenance. For this goal, WARCProcessor supports up to six commonly used data sources for web spam research and is able to store the output corpus in standard WARC format together with complementary metadata files. Additionally, the application facilitates the automatic and concurrent download of web sites from the Internet, giving the possibility of configuring the depth of the links to be followed as well as the behaviour when redirected URLs appear. WARCProcessor supports both an interactive GUI and a command line utility that can be executed in the background.
1970 MLA International Bibliography of Books and Articles on the Modern Languages and Literatures, Volume I: General, English, American, Medieval and Neo-Latin, Celtic Literatures; and Folklore.

Science.gov (United States)

Meserole, Harrison T., Comp.

Volume 1 of the four-volume, international bibliography contains over 11,140 entries referring to books, Festschriften, analyzed collections, and articles which focus on General, English, American, medieval and neo-Latin, and Celtic literatures. A section of folklore is also included. The section on general literature includes: (1) aesthetics, (2)…
A 38 Million Words Dutch Text Corpus and its Users

African Journals Online (AJOL)

part of speech, was made accessible via Internet (Kruyt 1995a, b). A 27 Million ..... corpora yet, and that 16 user accounts are reserved for students of the Free ... are from Norway, Denmark, Austria, Slovenia, Latvia, Malaysia and Korea.
A Linguistic Inquiry and Word Count Analysis of the Adult Attachment Interview in Two Large Corpora.

Science.gov (United States)

Waters, Theodore E A; Steele, Ryan D; Roisman, Glenn I; Haydon, Katherine C; Booth-LaForce, Cathryn

2016-01-01

An emerging literature suggests that variation in Adult Attachment Interview (AAI; George, Kaplan, & Main, 1985) states of mind about childhood experiences with primary caregivers is reflected in specific linguistic features captured by the Linguistic Inquiry Word Count automated text analysis program (LIWC; Pennebaker, Booth, & Francis, 2007). The current report addressed limitations of prior studies in this literature by using two large AAI corpora (Ns = 826 and 857) and a broader range of linguistic variables, as well as examining associations of LIWC-derived AAI dimensions with key developmental antecedents. First, regression analyses revealed that dismissing states of mind were associated with transcripts that were more truncated and deemphasized discussion of the attachment relationship whereas preoccupied states of mind were associated with longer, more conflicted, and angry narratives. Second, in aggregate, LIWC variables accounted for over a third of the variation in AAI dismissing and preoccupied states of mind, with regression weights cross-validating across samples. Third, LIWC-derived dismissing and preoccupied state of mind dimensions were associated with direct observations of maternal and paternal sensitivity as well as infant attachment security in childhood, replicating the pattern of results reported in Haydon, Roisman, Owen, Booth-LaForce, and Cox (2014) using coder-derived dismissing and preoccupation scores in the same sample.
Sleep paralysis in Brazilian folklore and other cultures: a brief review

Directory of Open Access Journals (Sweden)

José Felipe Rodriguez de Sá

2016-09-01

Full Text Available Sleep paralysis (SP) is a dissociative state that occurs mainly during awakening. SP is characterized by altered motor, perceptual, emotional and cognitive functions, such as inability to perform voluntary movements, visual hallucinations, feelings of chest pressure, delusions about a frightening presence and, in some cases, fear of impending death. Most people experience SP rarely, but typically when sleeping in supine position; however, SP is considered a disease (parasomnia) when recurrent and/or associated with emotional burden. Interestingly, throughout human history, different peoples interpreted SP under a supernatural view. For example, Canadian Eskimos attribute SP to spells of shamans, who hinder the ability to move, and provoke hallucinations of a shapeless presence. In the Japanese tradition, SP is due to a vengeful spirit who suffocates his enemies while sleeping. In Nigerian culture, a female demon attacks during dreaming and provokes paralysis. A modern manifestation of SP is the report of alien abductions, experienced as inability to move during awakening associated with visual hallucinations of aliens. Furthermore, SP is a significant example of how a specific biological phenomenon can be interpreted and shaped by different cultural contexts. In order to further explore the ethnopsychology of SP, the Pisadeira, a character of Brazilian folklore that originated in the country’s Southeast but is also found in other regions with variant names, has been reviewed. Pisadeira is described as a crone with long fingernails who lurks on roofs at night and tramples on the chest of those who sleep on a full stomach with the belly up. This legend is mentioned in many anthropological accounts; however, we found no comprehensive reference on the Pisadeira from the perspective of sleep science. Here we aim to fill this gap. We first review the neuropsychological aspects of SP, and then present the folk tale of the Pisadeira. Finally, we summarize the
Korpusy jako zdroje dat pro úpravy nástrojů automatické morfologické analýzy (Slovotvorné varianty adjektiv na [(ou|í]cí z hlediska morfologického značkování : Corpora as Data Sources for the Up-Grading of Morphological Tagging

Directory of Open Access Journals (Sweden)

Osolsobě, Klára

2015-10-01

Full Text Available Adjectives ending with -oucí/-ící are regularly derived from verbs and hence are not usually listed in any of the Czech monolingual dictionaries. On the level of automatic morphological analysis (the dictionary of Czech) they should be generated from verbal roots and tagged as verbal adjectives (pos tag: AG.*). The data from Czech corpora prove (a) inconsistencies in tagging and (b) gaps in the dictionary. The main cause of both kinds of insufficiency is the existence of variants on the level of verbal forms from which the verbal adjectives are potentially derived. Consequently, text corpora are a significant source of knowledge about the formation and use of adjectives with endings -oucí/-ící that can be important for both (a) automatic morphological analysis of Czech and (b) theoretical description of Czech grammar (derivational morphology). Our goal is to present a corpus-based study of the Czech gerund, i.e. verbal adjectives with -oucí/-ící. The link between the inflected and the word-formation variants will be demonstrated using material from the SYN corpus (2,6 billion tokens of written Czech) and the large web corpus czTenTen12 (5,2 billion tokens of Czech text from the Internet, cleaned and deduplicated).
Dose-Volume Parameters of the Corpora Cavernosa Do Not Correlate With Erectile Dysfunction After External Beam Radiotherapy for Prostate Cancer: Results From a Dose-Escalation Trial

International Nuclear Information System (INIS)

Wielen, Gerard J. van der; Hoogeman, Mischa S.; Dohle, Gert R.; Putten, Wim L.J. van; Incrocci, Luca

2008-01-01

Purpose: To analyze the correlation between dose-volume parameters of the corpora cavernosa and erectile dysfunction (ED) after external beam radiotherapy (EBRT) for prostate cancer. Methods and Materials: Between June 1997 and February 2003, a randomized dose-escalation trial comparing 68 Gy and 78 Gy was conducted. Patients at our institute were asked to participate in an additional part of the trial evaluating sexual function. After exclusion of patients with less than 2 years of follow-up, ED at baseline, or treatment with hormonal therapy, 96 patients were eligible. The proximal corpora cavernosa (crura), the superiormost 1-cm segment of the crura, and the penile bulb were contoured on the planning computed tomography scan and dose-volume parameters were calculated. Results: Two years after EBRT, 35 of the 96 patients had developed ED. No statistically significant correlations between ED 2 years after EBRT and dose-volume parameters of the crura, the superiormost 1-cm segment of the crura, or the penile bulb were found. The few patients using potency aids typically reported having ED. Conclusion: No correlation was found between ED after EBRT for prostate cancer and radiation dose to the crura or penile bulb. The present study is the largest study evaluating the correlation between ED and radiation dose to the corpora cavernosa after EBRT for prostate cancer. Until there is clear evidence that sparing the penile bulb or crura will reduce ED after EBRT, we advise caution in sparing these structures, especially when this involves reducing treatment margins.
Dynamic Penile Corpora Cavernosa Reconstruction Using Bilateral Innervated Gracilis Muscles: A Preclinical Investigation.

Science.gov (United States)

Yin, Zhuming; Liu, Liqiang; Xue, Bingjian; Fan, Jincai; Chen, Wenlin; Liu, Zheng

2018-03-07

investigation proves that corpora cavernosa reconstruction using bilateral innervated gracilis muscles is technically feasible and functionally efficacious. Yin Z, Liu L, Xue B, et al. Dynamic Penile Corpora Cavernosa Reconstruction Using Bilateral Innervated Gracilis Muscles: A Preclinical Investigation. Sex Med 2018;XX:XXX-XXX. Copyright © 2018 The Authors. Published by Elsevier Inc. All rights reserved.

Who's Afraid of the Big, Bad Folk?

Czech Academy of Sciences Publication Activity Database

Feinberg, Joseph Grim

2013-01-01

Roč. 61, č. 5 (2013), s. 548-560 ISSN 1335-1303 Institutional support: RVO:67985955 Keywords : tradition * folklore * folklore studies * reconceptualization * authenticity * the politics of folklore Subject RIV: AJ - Letters, Mass-media, Audiovision http://www.uet.sav.sk/files/etno5-2013-text-web.pdf
Gigafida and slWaC: topic comparison

Directory of Open Access Journals (Sweden)

Nataša Logar Berginc

2013-05-01

Full Text Available In the article, the following two issues are analyzed: (a) incorporation of texts from the Internet into existing reference corpora and comparison with the existence of web corpora, and (b) the latest two corpora of Slovenian language texts: the Gigafida corpus consisting mainly of printed texts and to a lesser extent also web texts, and the slWaC corpus which is entirely compiled from web texts. First, similarities and differences between the two corpora are identified using the topic modelling method, and then the same method is applied to the individual taxonomic categories of the Gigafida corpus. The first part of the analysis showed that the work of reference corpus compilers is currently still incoherent with regard to the incorporation of Internet texts into corpora which should reveal the overall picture of a certain language. In case compilers decide to incorporate web texts, the range of included genres is generally broad. The second part of the analysis showed a significant thematic variation between the Gigafida and slWaC corpora, and pointed out the most typical themes covered by each of the six Gigafida corpus parts.
Reshaping Text Data for Efficient Processing on Amazon EC2

Directory of Open Access Journals (Sweden)

Gabriela Turcu

2011-01-01

Full Text Available Text analysis tools are nowadays required to process increasingly large corpora which are often organized as small files (abstracts, news articles, etc.). Cloud computing offers a convenient, on-demand, pay-as-you-go computing environment for solving such problems. We investigate provisioning on the Amazon EC2 cloud from the user perspective, attempting to provide a scheduling strategy that is both timely and cost effective. We derive an execution plan using an empirically determined application performance model. A first goal of our performance measurements is to determine an optimal file size for our application to consume. Using the subset-sum first fit heuristic we reshape the input data by merging files in order to match as closely as possible the desired file size. This also speeds up the task of retrieving the results of our application, by having the output be less segmented. Using predictions of the performance of our application based on measurements on small data sets, we devise an execution plan that meets a user specified deadline while minimizing cost.
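The merging step described in this abstract can be illustrated with a minimal sketch. It assumes a plain first-fit grouping in which each file goes into the first group that still has room under the target size; the function name `first_fit_merge`, the file names and the sizes are hypothetical, and the paper's actual subset-sum variant may differ in its details.

```python
def first_fit_merge(file_sizes, target_size):
    """Group (name, size) pairs so that each group's total stays within target_size.

    Hypothetical illustration of a first-fit heuristic, not the authors' code.
    A file larger than target_size ends up alone in its own group.
    """
    groups = []  # each group: {"total": combined size, "files": [names]}
    for name, size in file_sizes:
        for group in groups:
            # Place the file in the first group that can still hold it.
            if group["total"] + size <= target_size:
                group["files"].append(name)
                group["total"] += size
                break
        else:
            # No existing group has room, so open a new one.
            groups.append({"total": size, "files": [name]})
    return groups


# Made-up sizes (for example in KB) and a made-up target of 100.
example = [("a.txt", 40), ("b.txt", 70), ("c.txt", 30), ("d.txt", 60), ("e.txt", 20)]
merged = first_fit_merge(example, target_size=100)
```

Each resulting group would then be concatenated into one input file, so the application sees fewer, larger files close to the desired size.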
Folklore motives in the early compositions of Nikola Borota - Radovan

Directory of Open Access Journals (Sweden)

Jovanović Jelena

2014-01-01

Full Text Available The creative work of Nikola Borota - Radovan (musician, composer, lyricist, arranger and record producer, based in New Zealand, formerly from Yugoslavia) held a specific place in the development of world music (polygenre) in his native homeland in the early 1970s. This study focuses on his creative principles, applied to works published between the years 1970 and 1975 (while the role of these works in the social, cultural and political context of the time and place will be elaborated in another study, see Jovanović 2014). The platform established to present this unique musical approach authentically was called kamen na kamen (a studio and stage outfit that has included a number of collaborations over many years). Based on the musical models and aesthetics of the folk revival and created under the influence of The Beatles, in addition to many other popular music production directions of the era, Borota’s works reveal significant musical, performance and production qualities, innovative expression and musical solutions that need to be perceived from the contemporary (ethnomusicological) point of view. Despite the fact that many prominent creative Yugoslav musicians of the time also worked within a similar framework, I would argue that Mr. Borota’s creative outcome was significantly different from other Yugoslav popular music creative efforts. This is particularly noticeable in the author’s unique treatment of South-European and other folklore motives, which is the main topic of this study. Folk (ethnic) idioms exploited by Mr. Borota in his compositions originate from the rural traditions of western Dinaric regions. This is especially true for the rhythmic formations of deaf or silent dance; for the semi-urban and urban tradition of the Balkans and the Mediterranean; Middle European traditions; traditions from non-European peoples; elements of Italian Renaissance; and international (mostly Anglo-American) musical models. Compositions are analysed partly in
Menzerath-Altmann law for distinct word distribution analysis in a large text

Science.gov (United States)

Eroglu, Sertac

2013-06-01

The empirical law uncovered by Menzerath and formulated by Altmann, known as the Menzerath-Altmann law (henceforth the MA law), reveals the statistical distribution behavior of human language in various organizational levels. Building on previous studies relating organizational regularities in a language, we propose that the distribution of distinct (or different) words in a large text can effectively be described by the MA law. The validity of the proposition is demonstrated by examining two text corpora written in different languages not belonging to the same language family (English and Turkish). The results show not only that distinct word distribution behavior can accurately be predicted by the MA law, but that this result appears to be language-independent. This result is important not only for quantitative linguistic studies, but also may have significance for other naturally occurring organizations that display analogous organizational behavior. We also deliberately demonstrate that the MA law is a special case of the probability function of the generalized gamma distribution.
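The relationship stated in the last sentence of this abstract can be written out explicitly. The following is a sketch using one common parametrization of the MA law and the standard three-parameter generalized gamma density; the symbols $a$, $b$, $c$, $\alpha$, $d$ and $p$ are chosen here for illustration, and the paper's own notation and sign conventions may differ.

```latex
% One common form of the Menzerath-Altmann law (parametrization assumed here)
y(x) = a\,x^{b}\,e^{-c x}

% Generalized gamma density with scale \alpha > 0 and shape parameters d, p > 0
f(x;\alpha,d,p) = \frac{p}{\alpha^{d}\,\Gamma(d/p)}\; x^{d-1}\, e^{-(x/\alpha)^{p}}, \qquad x > 0

% Setting p = 1 yields the same functional form as the MA law
f(x;\alpha,d,1) = \frac{1}{\alpha^{d}\,\Gamma(d)}\; x^{d-1}\, e^{-x/\alpha}
\quad\Longrightarrow\quad b = d - 1, \qquad c = \frac{1}{\alpha}, \qquad a \propto \frac{1}{\alpha^{d}\,\Gamma(d)}
```

Under these assumptions the MA curve is, up to its multiplicative constant, the $p = 1$ member of the generalized gamma family.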
A Text-Independent Speaker Authentication System for Mobile Devices

Directory of Open Access Journals (Sweden)

Florentin Thullier

2017-09-01

Full Text Available This paper presents a text-independent speaker authentication method adapted to mobile devices. Special attention was placed on delivering a fully operational application that offers a sufficient level of reliability and runs efficiently. To this end, we have excluded the need for any network communication. Hence, we opted for the completion of both the training and the identification processes directly on the mobile device through the extraction of linear prediction cepstral coefficients and the naive Bayes algorithm as the classifier. Furthermore, the authentication decision is enhanced to overcome misidentification through access privileges that the user should attribute to each application beforehand. To evaluate the proposed authentication system, eleven participants were involved in the experiment, conducted in quiet and noisy environments. Public speech corpora were also employed to compare this implementation to existing methods. Results were efficient regarding mobile resources’ consumption. The overall classification performance obtained was accurate with a small number of samples. It therefore appears that our authentication system might be used as a first security layer, but also as part of a multilayer authentication, or as a fall-back mechanism.
FacetGist: Collective Extraction of Document Facets in Large Technical Corpora.

Science.gov (United States)

Siddiqui, Tarique; Ren, Xiang; Parameswaran, Aditya; Han, Jiawei

2016-10-01

Given the large volume of technical documents available, it is crucial to automatically organize and categorize these documents to be able to understand and extract value from them. Towards this end, we introduce a new research problem called Facet Extraction. Given a collection of technical documents, the goal of Facet Extraction is to automatically label each document with a set of concepts for the key facets (e.g., application, technique, evaluation metrics, and dataset) that people may be interested in. Facet Extraction has numerous applications, including document summarization, literature search, patent search and business intelligence. The major challenge in performing Facet Extraction arises from multiple sources: concept extraction, concept to facet matching, and facet disambiguation. To tackle these challenges, we develop FacetGist, a framework for facet extraction. Facet Extraction involves constructing a graph-based heterogeneous network to capture information available across multiple local sentence-level features, as well as global context features. We then formulate a joint optimization problem, and propose an efficient algorithm for graph-based label propagation to estimate the facet of each concept mention. Experimental results on technical corpora from two domains demonstrate that Facet Extraction can lead to an improvement of over 25% in both precision and recall over competing schemes.
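To make the idea of graph-based label propagation concrete, here is a minimal generic sketch. It is not FacetGist's joint optimization: it assumes a small undirected weighted graph whose nodes stand for concept mentions, a few seed nodes with known facets, and a simple iterative averaging rule with clamped seeds. The function name, the graph and the facet indices are all hypothetical.

```python
def propagate_labels(edges, seeds, n_nodes, n_classes, n_iter=50):
    """Spread seed labels over an undirected weighted graph.

    edges: iterable of (u, v, weight); seeds: {node: class_index}.
    Generic illustration only, not the algorithm from the paper.
    """
    neighbors = [[] for _ in range(n_nodes)]
    for u, v, w in edges:
        neighbors[u].append((v, w))
        neighbors[v].append((u, w))

    # Start from uniform class scores, then fix the seed nodes to their labels.
    scores = [[1.0 / n_classes] * n_classes for _ in range(n_nodes)]
    for node, label in seeds.items():
        scores[node] = [1.0 if c == label else 0.0 for c in range(n_classes)]

    for _ in range(n_iter):
        updated = []
        for node in range(n_nodes):
            # Seeds stay clamped; isolated nodes keep their current scores.
            if node in seeds or not neighbors[node]:
                updated.append(scores[node])
                continue
            total = sum(w for _, w in neighbors[node])
            # Each unlabeled node takes the weighted average of its neighbors.
            updated.append([
                sum(w * scores[nb][c] for nb, w in neighbors[node]) / total
                for c in range(n_classes)
            ])
        scores = updated

    # Report the highest-scoring class for every node.
    return [max(range(n_classes), key=lambda c: row[c]) for row in scores]


# Four mentions in a chain; mention 0 is seeded as facet 0, mention 3 as facet 1.
chain = [(0, 1, 1.0), (1, 2, 1.0), (2, 3, 1.0)]
facets = propagate_labels(chain, seeds={0: 0, 3: 1}, n_nodes=4, n_classes=2)
```

In this toy chain each unlabeled mention ends up with the facet of the seed it is closest to, which is the intuition behind propagating facet labels along sentence-level and context links.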
If only Derrida missed that flight... About the assessment of the "academic achievements" of the so-called "American Anthropology" by Belgrade Structural-semiotic School of Folklore

Directory of Open Access Journals (Sweden)

Miloš Milenković

2016-02-01

Full Text Available Taking into account recent critiques of "underdevelopment", "positivism", "methodological backwardness" and other failings attributed to so-called "American Anthropology" by some of the authors from the Belgrade Structural-semiotic School of Anthropology of Folklore, I analyse the context in which colleagues and students may be tempted to explain the common sense political connection between polyphone ethnography, neo-romanticism and nationalism as a counter-intuitive history of the discipline. I have already pointed out that the important transformative differences in the attitudes towards structuralism between European anthropologists, especially the Belgrade Structural-semiotic School of Anthropology of Folklore, and so-called "American Anthropology" are the consequence of a pure coincidence: the fact that French structuralism and French poststructuralism were launched simultaneously on the American interdisciplinary intellectual scene ("Theory") at the same conference. This ironic concurrence would not be much more than one entertaining episode for students, historians of anthropology and historians of ideas, if there were no attempts (more and more frequent and increasingly fluently articulated) to compare different intellectual traditions as if they were elements of the same unilineal evolution of the discipline. The Belgrade Structural-semiotic School (further called only SS) and especially its spiritus movens and most prominent representative Prof. Kovačević started in recent years to criticise some "American Anthropology", measuring its academic "achievement" (the author’s term) in comparative perspective and taking as an analytical unit uncritically generalized traditions marked with a single term of "postmodern anthropology" on the one hand, and "anthropology" on the other. The Belgrade SS School did develop a globally original, although badly promoted and never fully used, battery for the synchronic analysis of folklore phenomena, but this was done only after
The Food Code in the Yakut Culture: Semantics and Functions

Science.gov (United States)

Gabysheva, Luiza Lvovna

2016-01-01

The relevance of researching the specific cultural meaning of a word in a folklore text stems from the issue being insufficiently studied and from its importance for solving the problem of the semantic features of folklore language. Yakut nominations for dairy products, which are the key words in the language of the Sakha people's folklore,…
Argumentation Within Language as Subsidy for the Evaluation of Reading Practices and Production of Argumentative Texts

Directory of Open Access Journals (Sweden)

Lauro Gomes

2016-12-01

Full Text Available This paper presents a proposal for evaluating performance in reading and writing dissertative-argumentative texts, based on principles and concepts from the theory of Argumentation in Language created by Jean-Claude Anscombre and Oswald Ducrot, especially the version known as the Theory of the Semantic Blocks and the works inspired by it. The goal is to create criteria that make judging performance in reading and writing dissertative-argumentative texts less intuitive. The analysis of the corpora (the Enem 2011 composition proposal and 50 (fifty) texts written by students) and the test of the reading and writing evaluation criteria in this work showed that the criteria are functional and efficient in practice. The results allow these criteria to be applied in any process of evaluating dissertative-argumentative texts. Finally, the paper offers theoretical and methodological subsidies that can help teachers and professors improve their teaching of reading and writing and their evaluation of students' texts.
The search for novel anticancer agents: a differentiation-based assay and analysis of a folklore product.

Science.gov (United States)

Dinnen, R D; Ebisuzaki, K

1997-01-01

One alternative approach to the current use of cytotoxic anticancer drugs involves the use of differentiation-inducing agents. However, a wider application of this strategy would require the development of assays to search for new differentiation-inducing agents. In this report we describe an in vitro assay using the murine erythroleukemia (clone 3-1) cells. Tests for the efficacy of this assay for the analysis of antineoplastic activity in natural products led to studies on pau d'arco, a South American folklore product used in the treatment of cancer. Purification of the activity in aqueous extracts by solvent partition and thin layer chromatography (TLC) indicated the presence of two activities, one of which was identified as lapachol. The activity in the pau d'arco extracts and of lapachol was inhibited by vitamin K1. As a vitamin K antagonist, lapachol might target such vitamin K-dependent reactions as the activation of a ligand for the Axl receptor tyrosine kinase.
Declaraciones patrimoniales, turismo y conocimientos locales: Posibilidades de los estudios del folklore para el caso de las ferias en la quebrada de Humahuaca (Jujuy-Argentina) Patrimony Statements, Tourism and Local Knowledge: Folklore Studies Possibilities in Quebrada de Humahuaca Fairs Case (Jujuy - Argentina)

Directory of Open Access Journals (Sweden)

Liliana Bergesio

2010-12-01

Full Text Available Quebrada de Humahuaca lies in the central part of the Province of Jujuy (in the northwest of the Argentine Republic) and has been inhabited for about 11,000 years. In 2003 the United Nations Educational, Scientific and Cultural Organization (UNESCO) declared the region "Cultural and Natural Heritage of Humanity". Since then, the development of adventure and cultural tourism circuits has increased. The declaration gave Quebrada de Humahuaca new impetus in the national and international tourism market, and the resulting tourism boom led each town to seek its own ways of attracting visitors. Among the most common strategies are fairs and festivals that highlight particular local characteristics. This paper analyses the case of the locality of Coctaca (Department of Humahuaca) and an event held there in February, which includes the fair "Los Sabores de la Historia", the "Encuentro de Mujeres Andinas" and the "Serenata a los Andenes de Cultivo". The aim is to set out the possibilities that folklore studies offer for bringing together, in the analysis, themes such as the local and the global; the cultural and the economic; producers and their products; and tourism with its demands and expectations.
Influence of communal and private folklore on bringing meaning to the experience of persistent pain.

Science.gov (United States)

Hendricks, Joyce Marie

2015-11-01

To provide an overview of the relevance and strengths of using the literary folkloristic methodology to explore the ways in which people with persistent pain relate to and make sense of their experiences through narrative accounts. Storytelling is a conversation with a purpose. The reciprocal bond between researcher and storyteller enables the examination of the meaning of experiences. Life narratives, in the context of wider traditional and communal folklore, can be analysed to discover how people make sense of their circumstances. This paper draws from the experience of the author, who has previously used this narrative approach. It is a reflection of how the approach may be used to understand those experiencing persistent pain without a consensual diagnosis. Using an integrative method, peer-reviewed research and discussion papers published between January 1990 and December 2014 and listed in the CINAHL, Science Direct, PsycINFO and Google Scholar databases were reviewed. In addition, texts that addressed research methodologies such as literary folkloristic methodology and Marxist literary theory were used. The unique role that nurses play in managing pain is couched in the historical and cultural context of nursing. Literary folkloristic methodology offers an opportunity to gain a better understanding and appreciation of how the experience of pain is constructed and to connect with sufferers. Literary folkloristic methodology reveals that those with persistent pain are often rendered powerless to live their lives. Increasing awareness of how this experience is constructed and maintained also allows an understanding of societal influences on nursing practice. Nurse researchers try to understand experiences in light of specific situations. Literary folkloristic methodology can enable them to understand the inter-relationship between people in persistent pain and how they construct their experiences.
Folclore e medicina popular na Amazônia Folklore and popular medicine in the Amazon

Directory of Open Access Journals (Sweden)

Márcio Couto Henrique

2009-12-01

Full Text Available This discussion of the relations between folklore and popular medicine in the Amazon takes Canuto Azevedo's story "Filhos do boto" (Children of the porpoise) as an analytical reference point. Replete with elements of cultural reality, folk tales can serve as historical testimonies expressing clashes between different traditions. Folk records are the fruit of what is often a quarrelsome dialogue between folklorists, social scientists, physicians, and pajés and their followers, and their analysis should take into account the conditions under which they were produced. Based on the imaginary attached to the figure of the porpoise, a seductive creature with healing powers, the article explores how we might expand knowledge of popular medicine as practiced in the Amazon, where the shamanistic rite known as pajelança cabocla has a strong presence.
A religião como meio de inclusão e de exclusão nas corporações de ofício de Estrasburgo (1681-1789) Religion as a means of inclusion and exclusion in the craft guilds of Strasbourg (1681-1789)

Directory of Open Access Journals (Sweden)

Hanna Sonkajärvi

Full Text Available The article analyses the dynamics of inclusion and exclusion built on religious, or confessional, affiliation in the craft guilds of Strasbourg in the eighteenth century. In Ancien Régime society, religion was, along with social status, family ties, gender, patronage and financial means, language, and burgher rights, one of the decisive factors in including foreigners in, or excluding them from, access to the economic, political, or social resources of a locality. The construction and preservation of religious boundaries are examined through the example of the joiners and the boatmen in the multi-confessional city of Strasbourg.
Ande-Ande Lumut: Adaptasi Folklor ke Teater Epik Brecht Ande-Ande Lumut: Adapting Folklore to Brecht's Epic Theater

Directory of Open Access Journals (Sweden)

Philipus Nugroho Hari Wibowo

2013-11-01

and Japan. Adaptation theory is developing well; anything can be used as an object of adaptation: poems, novels, dramas, paintings, dances, and video games. Kemuning is staged using the performance concept of Brecht's epic theater, in an effort to find a new way of reading the Ande-Ande Lumut story. Epic theater opposes one of the main elements of Aristotelian drama as developed in Stanislavsky's method, namely that there should be empathy in every aspect of a performance. According to Brecht, this process produces an effect that should be avoided because it makes the audience passive. He therefore developed a theory of destroying the illusion, of interruption, and of controlling emotion. Brecht's characteristic works focus on social themes, especially those that show poor people suffering under the policies of the authorities. The common problems between master and worker are reflected in his stories. The Kemuning performance tries to show the lives of prostitutes, which are closely associated with negative things. In fact, society still needs them. Unfortunately, they are sometimes made scapegoats for any trouble and are always blamed. Implicitly, this performance aims to fight for the prostitutes' lives. The audience is invited to see other points of view on their lives, which people often regard as negative. Moreover, Brecht said that the good theater demanded by the modern era is one that can arouse the audience's critical thinking. This performance is therefore meant to motivate arts lovers to analyse social issues critically and to create new movements toward significant change in society. Keywords: Folklore, Ande-Ande Lumut, Adaptation, Brecht's Epic Theater
"Old Oxen Cannot Plow": Stereotype Themes of Older Adults in Turkish Folklore.

Science.gov (United States)

Marcus, Justin; Sabuncu, Neslihan

2016-12-01

Although much research has established the nature of attitudes and stereotypes toward older adults, there are conflicting explanations for the root cause of ageism, including the sociocultural view and interpersonal views, that age bias against older adults is uniquely a product of modernity and occurs through social interactions, and the evolutionary view and intraindividual views, that age bias against older adults is rooted in our naturally occurring and individually held fear of death. We make initial investigations into resolving this conflict, by analyzing literature from a society predating the Industrial Revolution, the society of Ottoman Turks. Using Grounded Theory, we analyzed 1,555 Turkish fairy tales of the most well-known older adult in Turkish folklore, Nasreddin Hoca, for stereotype themes of older adults. Using the same method, we then analyzed 22,000+ Turkish sayings and proverbs for the same themes. Results indicated older adults to be viewed both positively and negatively. Positive stereotypes included wisdom, warmth, deserving of respect, and retirement. Negative stereotypes included incompetence, inadaptability, and frailty/nearing of death. Older females were viewed more negatively relative to older males. Results indicated views of older adults to parallel those found in contemporary research. Results have implications for the design of interventions to reduce ageism and on the cross-cultural generalizability of age-based stereotypes. © The Author 2015. Published by Oxford University Press on behalf of The Gerontological Society of America. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
GURU MENDONGENG KEARIFAN LOKAL BANYUMASAN Teachers Telling Stories of Banyumasan Local Wisdom

Directory of Open Access Journals (Sweden)

Sugeng Priyadi

2015-09-01

Full Text Available Teachers' skills in collecting and writing folklore need to be improved so that the cultural heritage of the ancestors can be preserved. Furthermore, teachers develop a learning model based on storytelling, so that the virtues of folklore can be absorbed by students. The mythlogos-ethos learning model can explain the message contained in folklore. The message is a form of local wisdom conveyed through character education. Keywords: folktale, local wisdom
Mesures de comparabilité pour la construction assistée de corpus comparables bilingues thématiques Comparability measures for the assisted construction of thematic bilingual comparable corpora

OpenAIRE

Ke, Guiyao

2014-01-01

Thematic comparable corpora group texts on the same topic written in several languages; the texts are highly similar but are not translations of one another. Compared with parallel corpora, which group pairs of translations, comparable corpora have three advantages: first, they are rich and large resources, both in volume and in the period covered; second, they provide original-language and thematic resources; finally, they are less expensive to develop than parallel corpora. With the consider...
The relevance of folkloric usage of plant galls as medicines: Finding the scientific rationale.

Science.gov (United States)

Patel, Seema; Rauf, Abdur; Khan, Haroon

2018-01-01

Galls, the abnormal growths in plants induced by viruses, bacteria, fungi, nematodes, arthropods, or even other plants, are akin to cancers in fauna. Galls, which occur in a myriad of forms, are phytochemically distinct from normal plant tissues, for they are the sites of a tug-of-war, just like granulomas in animals. To counter the stressors, in the form of the effector proteins of the invaders, the host plants elaborate a large repertoire of metabolites that they would not normally produce. Perturbation of the jasmonic acid pathway and the overexpression of auxin and cytokinin promote tissue proliferation and the resultant galls. Though the plant family characteristics and the attackers determine the gall biochemistry, most galls are rich in bioactive phytochemicals such as phenolic acids, anthocyanins, purpurogallin, flavonoids, tannins, steroids, triterpenes, alkaloids, lipophilic components (tanshinone) etc. Throughout the long trajectory of evolution, humans have learned to use galls as therapeutics, much like other plant parts. In diverse cultures, evidence of the folkloric usage of galls abounds. Among others, galls from plant genera such as Rhus, Pistacia, Quercus, and Terminalia are popular as ethnomedicine. This review mines the literature on galling agents and the medicinal relevance of galls. Copyright © 2017 Elsevier Masson SAS. All rights reserved.

Sentiment analysis methods for understanding large-scale texts: a case for using continuum-scored words and word shift graphs

Directory of Open Access Journals (Sweden)

Andrew J Reagan

2017-10-01

Full Text Available The emergence and global adoption of social media has rendered possible the real-time estimation of population-scale sentiment, an extraordinary capacity which has profound implications for our understanding of human behavior. Given the growing assortment of sentiment-measuring instruments, it is imperative to understand which aspects of sentiment dictionaries contribute to both their classification accuracy and their ability to provide richer understanding of texts. Here, we perform detailed, quantitative tests and qualitative assessments of 6 dictionary-based methods applied to 4 different corpora, and briefly examine a further 20 methods. We show that while inappropriate for sentences, dictionary-based methods are generally robust in their classification accuracy for longer texts. Most importantly they can aid understanding of texts with reliable and meaningful word shift graphs if (1) the dictionary covers a sufficiently large portion of a given text's lexicon when weighted by word usage frequency; and (2) words are scored on a continuous scale.
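The two conditions named in this abstract (frequency-weighted coverage and continuous word scores) can be illustrated with a minimal Python sketch. It is not the authors' code: the function name `score_text`, the toy dictionary, and its 1-9 scores are invented here purely for illustration.

```python
from collections import Counter


def score_text(tokens, dictionary):
    """Return (frequency-weighted mean score, coverage) for a list of tokens.

    coverage is the fraction of all token occurrences whose word has a score,
    i.e. lexicon coverage weighted by word usage frequency.
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    scored = {w: n for w, n in counts.items() if w in dictionary}
    covered = sum(scored.values())
    if covered == 0:
        return None, 0.0
    mean = sum(dictionary[w] * n for w, n in scored.items()) / covered
    return mean, covered / total


# Hypothetical continuous scores on a 1-9 scale, invented for this example.
toy_dictionary = {"happy": 8.0, "sad": 2.0, "day": 6.0}
tokens = "a happy happy day and a sad day".split()
mean_score, coverage = score_text(tokens, toy_dictionary)
print(mean_score, coverage)
```

A low coverage value signals that the mean rests on only a small share of the text, which is the situation the first condition warns against.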
Text-Mining Applications for Creation of Biofilm Literature Database

Directory of Open Access Journals (Sweden)

Kanika Gupta

2017-10-01

In the present research, a published corpus of 34306 documents on biofilms was collected from the PubMed database along with non-indexed resources such as books, conference papers, and newspaper articles. The documents were divided into five categories: classification, growth and development, physiology, drug effects, and radiation effects. Each category was further divided into three parts (Journal Title, Abstract Title, and Abstract Text) to make indexing highly specific. Text processing was done using Rapid Miner_v5.3, which tokenizes the entire text into words and provides the frequency of each word within the document. The resulting words were normalized using the Remove Stop and Stem Word command of Rapid Miner_v5.3, which removes stop words and reduces the remaining words to their stems. The words were stored in MS-Excel 2007 and sorted in decreasing order of frequency using its Sort & Filter command. The words were visualized as networks using Cytoscape_v2.7.0. The resulting words were highly specific to biofilms, yielding a controlled biofilm vocabulary that could be used for indexing biofilm articles (similar to the MeSH database, which indexes articles for PubMed). The keyword information was stored in a relational database hosted locally on the WAMP_v2.4 (Windows, Apache, MySQL, PHP) server. The biofilm vocabulary should be useful to researchers studying the biofilm literature, making their searches easier and more efficient.
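The processing chain described in this record (tokenize, drop stop words, stem, count, sort by decreasing frequency) can be sketched in a few lines of Python. This is only an approximation under stated assumptions: the study used RapidMiner and MS-Excel, whereas the stop-word list, the `crude_stem` suffix stripper, and the sample sentence below are hypothetical stand-ins.

```python
import re
from collections import Counter

# Tiny illustrative stop-word list (an assumption, not RapidMiner's list).
STOP_WORDS = {"the", "of", "and", "in", "a", "on", "is", "are"}


def crude_stem(word):
    """Naive suffix stripping; a stand-in for a real stemmer such as Porter's."""
    for suffix in ("ing", "es", "s"):
        if word.endswith(suffix) and len(word) - len(suffix) >= 3:
            return word[: -len(suffix)]
    return word


def term_frequencies(text):
    """Tokenize, remove stop words, stem, and rank stems by frequency."""
    tokens = re.findall(r"[a-z]+", text.lower())
    stems = [crude_stem(t) for t in tokens if t not in STOP_WORDS]
    # most_common() returns (stem, count) pairs in decreasing order of count.
    return Counter(stems).most_common()


sample = "Biofilms form on surfaces. The biofilm matrix protects the cells in biofilms."
ranked = term_frequencies(sample)
print(ranked[:3])
```

The top-ranked stems are the candidates for a controlled vocabulary; a production pipeline would replace `crude_stem` with an established stemmer.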
A New English–Arabic Parallel Text Corpus for Lexicographic Applications

Directory of Open Access Journals (Sweden)

Hashan Al-Ajmi

2011-10-01

Full Text Available
Abstract: Bilingual lexicographers, translation specialists and English teachers in the Arab world do not have access to computerized corpora of parallel texts for the English–Arabic language pair. This project has been carried out to meet this requirement by establishing the first general parallel corpus of English texts and their Arabic translations. The first phase of the project involved the selection of general source texts having appropriate lexical and stylistic features. The chosen source texts deal with a variety of topics such as the environment, globalization, psychology, history, politics, drama, etc. Their Arabic translations were taken from The World of Knowledge series published by the National Council for Culture, Arts and Letters (NCCAL) in Kuwait.
Keywords: PARALLEL CORPUS, LEXICOGRAPHY, TRANSLATION, BILINGUAL DICTIONARY, COLLOCATIONS, ALIGNMENT, SYNONYMS, DERIVATIVES, ANTONYMS, GLOSSARY, FREQUENCY
Challenges for automatically extracting molecular interactions from full-text articles.

Science.gov (United States)

McIntosh, Tara; Curran, James R

2009-09-24

The increasing availability of full-text biomedical articles will allow more biomedical knowledge to be extracted automatically with greater reliability. However, most Information Retrieval (IR) and Extraction (IE) tools currently process only abstracts. The lack of corpora has limited the development of tools that are capable of exploiting the knowledge in full-text articles. As a result, there has been little investigation into the advantages of full-text document structure, and the challenges developers will face in processing full-text articles. We manually annotated passages from full-text articles that describe interactions summarised in a Molecular Interaction Map (MIM). Our corpus tracks the process of identifying facts to form the MIM summaries and captures any factual dependencies that must be resolved to extract the fact completely. For example, a fact in the results section may require a synonym defined in the introduction. The passages are also annotated with negated and coreference expressions that must be resolved. We describe the guidelines for identifying relevant passages and possible dependencies. The corpus includes 2162 sentences from 78 full-text articles. Our corpus analysis demonstrates the necessity of full-text processing; identifies the article sections where interactions are most commonly stated; and quantifies the proportion of interaction statements requiring coherent dependencies. Further, it allows us to report on the relative importance of identifying synonyms and resolving negated expressions. We also experiment with an oracle sentence retrieval system using the corpus as a gold-standard evaluation set. We introduce the MIM corpus, a unique resource that maps interaction facts in a MIM to annotated passages within full-text articles. It is an invaluable case study providing guidance to developers of biomedical IR and IE systems, and can be used as a gold-standard evaluation set for full-text IR tasks.
Probing the statistical properties of unknown texts: application to the Voynich Manuscript.

Science.gov (United States)

Amancio, Diego R; Altmann, Eduardo G; Rybski, Diego; Oliveira, Osvaldo N; Costa, Luciano da F

2013-01-01

While the use of statistical physics methods to analyze large corpora has been useful to unveil many patterns in texts, no comprehensive investigation has been performed on the interdependence between syntactic and semantic factors. In this study we propose a framework for determining whether a text (e.g., written in an unknown alphabet) is compatible with a natural language and to which language it could belong. The approach is based on three types of statistical measurements, i.e. obtained from first-order statistics of word properties in a text, from the topology of complex networks representing texts, and from intermittency concepts where text is treated as a time series. Comparative experiments were performed with the New Testament in 15 different languages and with distinct books in English and Portuguese in order to quantify the dependency of the different measurements on the language and on the story being told in the book. The metrics found to be informative in distinguishing real texts from their shuffled versions include assortativity, degree and selectivity of words. As an illustration, we analyze an undeciphered medieval manuscript known as the Voynich Manuscript. We show that it is mostly compatible with natural languages and incompatible with random texts. We also obtain candidates for keywords of the Voynich Manuscript which could be helpful in the effort of deciphering it. Because we were able to identify statistical measurements that are more dependent on the syntax than on the semantics, the framework may also serve for text analysis in language-dependent applications.
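The idea of treating a text as a time series can be made concrete with a small Python sketch. It assumes one common way of quantifying intermittency, the coefficient of variation of the gaps between successive occurrences of a word; the abstract does not give the exact definition used in the study, so the formula and the function names here are assumptions.

```python
from statistics import mean, pstdev


def recurrence_gaps(tokens, word):
    """Distances, in tokens, between successive occurrences of `word`."""
    positions = [i for i, t in enumerate(tokens) if t == word]
    return [b - a for a, b in zip(positions, positions[1:])]


def intermittency(tokens, word):
    """Coefficient of variation of the recurrence gaps (assumed measure).

    Returns 0.0 for a perfectly regular word, larger values for bursty words,
    and None when the word occurs too rarely to estimate anything.
    """
    gaps = recurrence_gaps(tokens, word)
    if len(gaps) < 2:
        return None
    return pstdev(gaps) / mean(gaps)


regular = ["x", "a", "b"] * 4             # "x" every third token
bursty = ["x", "x", "x"] + ["a"] * 9 + ["x"]  # "x" clustered, then one late return
print(intermittency(regular, "x"), intermittency(bursty, "x"))
```

Comparing such values for a text and for a shuffled copy of the same tokens is one way to see whether word placement carries structure beyond word frequency.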
O papel do folclore na motivação para atividades físicas de idosas The role of folklore in the motivation for physical activity of elderly

Directory of Open Access Journals (Sweden)

Berta Leni Costa Cardoso

2011-03-01

Full Text Available There are many reports on the biological benefits of physical activity for older adults, yet the number of those who practise it is still unsatisfactory. This article takes up that controversial point. It investigates the use of local folklore as an educational and motivational mechanism for increasing physical activity among elderly women. Elderly women from the Clube da Amizade in Caetité, Bahia (Brazil), who were motivated and stimulated through dance, were interviewed. The article also draws on the reflections of Paulo Freire, who holds that culture and the context of a person's life are the most important means of motivation and education. The results indicated that this motivational process has a positive effect in stimulating elderly women in their physical education classes. They reported feeling highly motivated during dance classes, where they can listen to music that reminds them of their past, culture, and moral values.
Game Edukasi Pengenalan Cerita Rakyat Lampung Pada Platform Android Educational Game Introducing Lampung Folklore on the Android Platform

Directory of Open Access Journals (Sweden)

Ardi Zulkarnais

2018-01-01

Full Text Available Folklore is a body of orally transmitted stories passed down from generation to generation within a society. Today, however, folklore is less popular than foreign films, which are packaged in an attractive form. In the Lampung region, many people do not know the folklore of Lampung, even though folklore carries moral values and is part of the region's cultural heritage. The purpose of this research is to design and build an educational game application about Lampung folklore in order to increase the interest of children and the wider public in knowing and reading Lampung folklore, a cultural heritage that must be preserved. The educational folklore game application was developed for web and mobile platforms. Testing covered usability, functionality, portability, and efficiency. Usability testing with 5th and 6th grade elementary students using a questionnaire yielded a score of 92.44%; functionality, tested by 2 experts in software engineering, yielded 100%; portability, tested on Android smartphones from Gingerbread through Marshmallow, yielded 80%; and efficiency testing with Testdroid showed an average CPU usage of 15% and an average memory usage of 175 MB.
SAIL: Summation-bAsed Incremental Learning for Information-Theoretic Text Clustering.

Science.gov (United States)

Cao, Jie; Wu, Zhiang; Wu, Junjie; Xiong, Hui

2013-04-01

Information-theoretic clustering aims to exploit information-theoretic measures as the clustering criteria. A common practice on this topic is the so-called Info-Kmeans, which performs K-means clustering with KL-divergence as the proximity function. While expert efforts on Info-Kmeans have shown promising results, a remaining challenge is to deal with high-dimensional sparse data such as text corpora. Indeed, it is possible that the centroids contain many zero-value features for high-dimensional text vectors, which leads to infinite KL-divergence values and creates a dilemma in assigning objects to centroids during the iteration process of Info-Kmeans. To meet this challenge, in this paper, we propose a Summation-bAsed Incremental Learning (SAIL) algorithm for Info-Kmeans clustering. Specifically, by using an equivalent objective function, SAIL replaces the computation of KL-divergence by the incremental computation of Shannon entropy. This can avoid the zero-feature dilemma caused by the use of KL-divergence. To improve the clustering quality, we further introduce the variable neighborhood search scheme and propose the V-SAIL algorithm, which is then accelerated by a multithreaded scheme in PV-SAIL. Our experimental results on various real-world text collections have shown that, with SAIL as a booster, the clustering performance of Info-Kmeans can be significantly improved. Also, V-SAIL and PV-SAIL indeed help improve the clustering quality at a lower cost of computation.
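The zero-feature dilemma and the entropy-based reformulation mentioned in this abstract can be checked numerically with a short Python sketch. It is not the SAIL algorithm itself: it only demonstrates, on invented toy vectors and with equal weights (both assumptions), that KL-divergence to a centroid with a zero feature is infinite, and that the within-cluster sum of KL-divergences equals n·H(centroid) minus the sum of member entropies.

```python
import math


def kl(p, q):
    """KL-divergence D(p || q) for discrete distributions given as lists."""
    total = 0.0
    for pi, qi in zip(p, q):
        if pi == 0.0:
            continue          # 0 * log(0 / q) is taken as 0
        if qi == 0.0:
            return math.inf   # p has mass where q has none
        total += pi * math.log(pi / qi)
    return total


def entropy(p):
    """Shannon entropy in nats."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0.0)


def centroid(members):
    """Equal-weight arithmetic mean of the member distributions."""
    n = len(members)
    return [sum(col) / n for col in zip(*members)]


# Zero-feature dilemma: the document uses feature 0, the candidate centroid does not.
doc = [0.5, 0.5, 0.0]
other_centroid = [0.0, 0.5, 0.5]
print(kl(doc, other_centroid))  # infinite

# Within one cluster, the KL objective can be rewritten with entropies only.
cluster = [[0.5, 0.5, 0.0], [0.0, 0.5, 0.5], [1.0, 0.0, 0.0]]
m = centroid(cluster)
kl_sum = sum(kl(x, m) for x in cluster)
entropy_form = len(cluster) * entropy(m) - sum(entropy(x) for x in cluster)
print(kl_sum, entropy_form)
```

Because the entropy form never divides by a centroid feature, it stays finite for sparse vectors, which is the property the abstract attributes to the equivalent objective.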
Knowledge based word-concept model estimation and refinement for biomedical text mining.

Science.gov (United States)

Jimeno Yepes, Antonio; Berlanga, Rafael

2015-02-01

Text mining of scientific literature has been essential for setting up large public biomedical databases, which are being widely used by the research community. In the biomedical domain, the existence of a large number of terminological resources and knowledge bases (KB) has enabled a myriad of machine learning methods for different text mining related tasks. Unfortunately, KBs have not been devised for text mining tasks but for human interpretation, thus performance of KB-based methods is usually lower when compared to supervised machine learning methods. The disadvantage of supervised methods, though, is that they require labeled training data and are therefore not useful for large scale biomedical text mining systems. KB-based methods do not have this limitation. In this paper, we describe a novel method to generate word-concept probabilities from a KB, which can serve as a basis for several text mining tasks. This method not only takes into account the underlying patterns within the descriptions contained in the KB but also those in texts available from large unlabeled corpora such as MEDLINE. The parameters of the model have been estimated without training data. Patterns from MEDLINE have been built using MetaMap for entity recognition and related using co-occurrences. The word-concept probabilities were evaluated on the task of word sense disambiguation (WSD). The results showed that our method obtained a higher degree of accuracy than other state-of-the-art approaches when evaluated on the MSH WSD data set. We also evaluated our method on the task of document ranking using MEDLINE citations. These results also showed an increase in performance over existing baseline retrieval approaches. Copyright © 2014 Elsevier Inc. All rights reserved.
Event-based text mining for biology and functional genomics

Science.gov (United States)

Thompson, Paul; Nawaz, Raheel; McNaught, John; Kell, Douglas B.

2015-01-01

The assessment of genome function requires a mapping between genome-derived entities and biochemical reactions, and the biomedical literature represents a rich source of information about reactions between biological components. However, the increasingly rapid growth in the volume of literature provides both a challenge and an opportunity for researchers to isolate information about reactions of interest in a timely and efficient manner. In response, recent text mining research in the biology domain has been largely focused on the identification and extraction of ‘events’, i.e. categorised, structured representations of relationships between biochemical entities, from the literature. Functional genomics analyses necessarily encompass events as so defined. Automatic event extraction systems facilitate the development of sophisticated semantic search applications, allowing researchers to formulate structured queries over extracted events, so as to specify the exact types of reactions to be retrieved. This article provides an overview of recent research into event extraction. We cover annotated corpora on which systems are trained, systems that achieve state-of-the-art performance and details of the community shared tasks that have been instrumental in increasing the quality, coverage and scalability of recent systems. Finally, several concrete applications of event extraction are covered, together with emerging directions of research. PMID:24907365
PENENTUAN FAKTOR DAN TARAF FAKTOR DALAM PENGENDALIAN KUALITAS PRODUKSI BENANG PCM DI PT APAC INTI CORPORA DENGAN METODE DESAIN EKSPERIMEN

Directory of Open Access Journals (Sweden)

Darminto Pujotomo

2012-02-01

Full Text Available PT. APAC Inti Corpora is one of the largest textile companies in Southeast Asia, and one of its products is PCM yarn, produced by the spinning 4 department. The problem is that defective end products exceed the company's target of 0.8% of total production, while the company is required to keep defects to a minimum. The problem arises because many defects still occur in PCM yarn, dominated by crossing defects (24.67%), ring cone defects (21.98%), tailless defects (16.02%) and contamination (12.50%). This study aims to assess the process and, if the process proves to be out of control, to identify and analyse the factors that significantly influence the occurrence of crossing defects in PCM yarn. The method used to assess the operating process is statistical process control, while the method used to analyse the factors that influence defects in PCM yarn is factorial experimental design. The control charts and the process capability determination show that the operating process is out of control because it produces a considerable number of defective products. The factors studied are yarn size, machine age and machine speed, each with 2 levels. Yarn size is either thin or thick. Machine age is either old or new. Machine speed is either 900 MPM or 1000 MPM. Based on the analysis of variance (ANAVA) and hypothesis tests, the factors that significantly cause crossing defects are yarn size and machine age. Keywords: crossing defects, quality control, ANAVA PT.APAC Inti Corpora is the largest textile
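The three two-level factors named in the abstract above (yarn size, machine age, machine speed) define a 2 x 2 x 2 full factorial design. A minimal sketch of enumerating its runs follows; the dictionary keys and the function name `full_factorial` are labels chosen here for illustration, and the abstract does not say how the authors ordered or replicated their runs.

```python
from itertools import product

# Factor levels as listed in the abstract; key names are illustrative.
factors = {
    "yarn_size": ["thin", "thick"],
    "machine_age": ["old", "new"],
    "machine_speed_mpm": [900, 1000],
}

def full_factorial(factors):
    """Return every combination of factor levels as a list of dicts."""
    names = list(factors)
    level_lists = [factors[name] for name in names]
    return [dict(zip(names, combo)) for combo in product(*level_lists)]

runs = full_factorial(factors)
for number, run in enumerate(runs, start=1):
    print(number, run)
```

With two levels per factor the design has 2^3 = 8 treatment combinations, which is the table an analysis of variance would then be run on.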
Level set segmentation of bovine corpora lutea in ex situ ovarian ultrasound images

Directory of Open Access Journals (Sweden)

Adams Gregg P

2008-08-01

Full Text Available Abstract Background: The objective of this study was to investigate the viability of level set image segmentation methods for the detection of corpora lutea (corpus luteum, CL) boundaries in ultrasonographic ovarian images. It was hypothesized that bovine CL boundaries could be located within 1–2 mm by a level set image segmentation methodology. Methods: Level set methods embed a 2D contour in a 3D surface and evolve that surface over time according to an image-dependent speed function. A speed function suitable for segmentation of CLs in ovarian ultrasound images was developed. An initial contour was manually placed and contour evolution was allowed to proceed until the rate of change of the area was sufficiently small. The method was tested on ovarian ultrasonographic images (n = 8) obtained ex situ. An expert in ovarian ultrasound interpretation delineated CL boundaries manually to serve as a "ground truth". Accuracy of the level set segmentation algorithm was determined by comparing semi-automatically determined contours with ground truth contours using the mean absolute difference (MAD), root mean squared difference (RMSD), Hausdorff distance (HD), sensitivity, and specificity metrics. Results and discussion: The mean MAD was 0.87 mm (sigma = 0.36 mm), RMSD was 1.1 mm (sigma = 0.47 mm), and HD was 3.4 mm (sigma = 2.0 mm), indicating that, on average, boundaries were accurate within 1–2 mm; however, deviations in excess of 3 mm from the ground truth were observed, indicating under- or over-expansion of the contour. Mean sensitivity and specificity were 0.814 (sigma = 0.171) and 0.990 (sigma = 0.00786), respectively, indicating that CLs were consistently undersegmented but rarely did the contour interior include pixels that were judged by the human expert not to be part of the CL. It was observed that in localities where gradient magnitudes within the CL were strong due to high contrast speckle, contour expansion stopped too early. Conclusion: The
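The contour-comparison metrics named in the abstract above (MAD, RMSD, HD) can be sketched as follows. This is an illustration under an assumption: distances are taken from each contour point to the nearest point on the other contour, pooled over both directions. The abstract does not give the exact definitions the authors used, and the function names and sample points are invented for the example.

```python
import math

def nearest_distances(a, b):
    """Distance from each point in contour a to its nearest point in contour b."""
    return [min(math.dist(p, q) for q in b) for p in a]

def contour_metrics(auto, truth):
    """Return (MAD, RMSD, Hausdorff distance) between two point contours."""
    d_auto = nearest_distances(auto, truth)
    d_truth = nearest_distances(truth, auto)
    pooled = d_auto + d_truth
    mad = sum(pooled) / len(pooled)
    rmsd = math.sqrt(sum(d * d for d in pooled) / len(pooled))
    hd = max(max(d_auto), max(d_truth))  # worst-case boundary deviation
    return mad, rmsd, hd

# Two hypothetical contours in millimetres, one unit apart everywhere.
auto_contour = [(0.0, 0.0), (1.0, 0.0)]
truth_contour = [(0.0, 1.0), (1.0, 1.0)]
mad, rmsd, hd = contour_metrics(auto_contour, truth_contour)
print(mad, rmsd, hd)
```

Because HD is a maximum while MAD and RMSD are averages, a contour can be accurate on average and still show a large HD, which matches the pattern the abstract reports.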
Language and folklore in Hamid Mosaddeq’s poem

Directory of Open Access Journals (Sweden)

IRAN

2016-02-01

Full Text Available Abstract: "Standard language", "sub-standard language" and "meta-standard language" are types of language, each with many varieties. The use of sub-standard language in poetry, known as “stylistic deviation”, is one way of foregrounding poetic language. In the contemporary period, Nima paid particular attention to this technique. Nima believed that all words have the potential to enter the realm of poetry. No word is essentially poetic or non-poetic; the way the poet uses a word determines its poetic value. By using sub-standard language elements, Hamid Mosaddeq not only enriched his poems but also brought them closer to the mind, language and life of the people. The folkloric elements of Mosaddeq’s poems were divided into seven groups: 1) slang words, 2) common and spoken vocabulary, 3) irony and proverbs, 4) popular pronunciations, 5) allusions to folk tales, 6) folk beliefs and customs, 7) local vocabulary. Slang words in Mosaddeq’s poems were examined in the categories of "verb" and "noun". Many folk verbs such as "Shangidan" and "gap zadan" (to chat) are used in Mosaddeq’s poems. Some folk verbs in his poems are not immediately understandable. These verbs have several meanings, one or more of which are slang; for example, the verb "gereftan" (to get) has a slang sense when it means "to grow the root of the plant". Folk nouns are used abundantly in Mosaddeq’s poems. Some of the nouns used, considering their figurative meanings, can be treated as folk nouns, like "foot" in the figurative sense of "will". Colloquial and current words are among the most frequent folk elements in Mosaddeq’s poetry. These words can be analyzed in the categories of "nouns" and "verbs". Lexical verbs such as "to hip" and "Perfume of Moskow" are of this kind. "Irony and proverbs" are the other folk elements of Mosaddeq’s poetry. "till eye can see
Context-dependent modelling of English vowels in Sepedi code-switched speech

CSIR Research Space (South Africa)

Modipa, TI

2012-11-01

Full Text Available multilingual systems (combining dictionaries, language and/or acoustic models from multiple languages) or by running more than one monolingual system in parallel, switching from the one to the other [2], [3]. We are interested in the first approach.... DATA In this section we describe the data used during experiments: the audio corpora, phone sets and dictionaries. A. Audio corpora We use two different audio corpora for the experiments: a general Sepedi corpus (NCHLT [8]) and a custom...
Reduction Corporoplasty

Directory of Open Access Journals (Sweden)

Tariq S. Hakky

2015-04-01

Full Text Available Objective: Here we present the first video demonstration of reduction corporoplasty in the management of phallic disfigurement in a 17 year old man with a history of sickle cell disease and priapism. Introduction: Surgical management of aneurysmal dilation of the corpora has yet to be defined in the literature. Materials and Methods: We performed bilateral elliptical incisions over the lateral corpora as management of aneurysmal dilation of the corpora to correct phallic disfigurement. Results: The patient tolerated the procedure well and has resolution of his corporal disfigurement. Conclusions: Reduction corporoplasty using bilateral lateral elliptical incisions in the management of aneurysmal dilation of the corpora is a safe and feasible operation in the management of phallic disfigurement.
Viana, V.; Tagnin, S. E. O. (orgs.). Corpora no ensino de línguas estrangeiras DOI: 10.5007/2175-7968.2011v1n27p294

Directory of Open Access Journals (Sweden)

Leticia Rebollo Couto

2011-11-01

Full Text Available The works gathered in this volume explore, through the lens of Corpus Linguistics, applications for the teaching of languages and of translation, and also offer theoretical grounding and reflections on this emerging subfield of linguistic studies. Corpora no Ensino de Línguas Estrangeiras is the first volume of its kind in the Brazilian publishing market. It is innovative in its theme and in bringing together experienced researchers and language teachers, who jointly offer readers material to sharpen their curiosity and to put into practice, in their own classrooms, some of the suggestions made by the authors. Besides establishing more firmly the profile of Corpus Linguistics research and applications in Brazil, the book is of interest to language teachers, translators, linguists and other professionals in language and literature, who will certainly find in it a foundation for developing their skills in the methodologies and applications of this stimulating field of knowledge.
PEMANFAATAN CERITA RAKYAT SEBAGAI PENANAMAN ETIKA UNTUK MEMBENTUK PENDIDIKAN KARAKTER BANGSA

Directory of Open Access Journals (Sweden)

M. Kristanto

2014-08-01

Full Text Available The folklore found in various regions of Indonesia carries ethical and moral values that are beneficial to the formation of Indonesia's golden generation. Folklore that is passed on to or instilled in children early will support students' motor and psychomotor development, especially in building their character and a strong personality from an early age. Instilling ethics is intended to form a character that leads to positive things. Instilling good ethics can build character, attitudes, and behaviors that reinforce soft skills and instill good habits. Using existing folklore is a very effective way to teach ethics and good morals. Through the characters in a story, attitudes, behaviors, and speech that reflect character and moral ethics can be conveyed. The stories reflect noble values, among others honesty, cooperation, hard work, responsibility, and religion. These values can be used as a means of character education. Keywords: folklore, values, ethics, character education.
Entity recognition from clinical texts via recurrent neural network.

Science.gov (United States)

Liu, Zengjian; Yang, Ming; Wang, Xiaolong; Chen, Qingcai; Tang, Buzhou; Wang, Zhe; Xu, Hua

2017-07-05

Entity recognition is one of the most basic steps in text analysis and has long attracted considerable attention from researchers. In the clinical domain, various types of entities, such as clinical entities and protected health information (PHI), widely exist in clinical texts. Recognizing these entities has become a hot topic in clinical natural language processing (NLP), and a large number of traditional machine learning methods, such as support vector machine and conditional random field, have been deployed to recognize entities from clinical texts in the past few years. In recent years, the recurrent neural network (RNN), a deep learning method that has shown great potential on many problems including named entity recognition, has also gradually been used for entity recognition from clinical texts. In this paper, we comprehensively investigate the performance of LSTM (long short-term memory), a representative variant of RNN, on clinical entity recognition and protected health information recognition. The LSTM model consists of three layers: input layer - generates representation of each word of a sentence; LSTM layer - outputs another word representation sequence that captures the context information of each word in this sentence; Inference layer - makes tagging decisions according to the output of LSTM layer, that is, outputting a label sequence. Experiments conducted on corpora of the 2010, 2012 and 2014 i2b2 NLP challenges show that LSTM achieves highest micro-average F1-scores of 85.81% on the 2010 i2b2 medical concept extraction, 92.29% on the 2012 i2b2 clinical event detection, and 94.37% on the 2014 i2b2 de-identification, which are competitive with other state-of-the-art systems. LSTM, which requires no hand-crafted features, has great potential for entity recognition from clinical texts. It outperforms traditional machine learning methods that suffer from fussy feature engineering. A possible future direction is how to integrate knowledge
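The micro-average F1-score reported in the abstract above pools true positives, false positives and false negatives over all entity types before computing precision and recall. A minimal sketch follows; the entity types and counts are invented for illustration and are not taken from the i2b2 results, and the function name `micro_f1` is an assumption.

```python
def micro_f1(counts):
    """Micro-averaged F1 from per-type dicts with 'tp', 'fp' and 'fn' counts."""
    tp = sum(c["tp"] for c in counts.values())
    fp = sum(c["fp"] for c in counts.values())
    fn = sum(c["fn"] for c in counts.values())
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision + recall == 0.0:
        return 0.0
    return 2 * precision * recall / (precision + recall)

# Hypothetical per-type counts, not real evaluation data.
counts = {
    "problem": {"tp": 80, "fp": 10, "fn": 20},
    "treatment": {"tp": 40, "fp": 10, "fn": 0},
}
print(micro_f1(counts))
```

Micro-averaging weights every entity mention equally, so frequent entity types dominate the score; a macro-average would instead average per-type F1 values.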
Online úskalia folkloristického výskumu

Directory of Open Access Journals (Sweden)

Eva Šipöczová

2012-12-01

Full Text Available The problematic aspects of internet research are slowly becoming manifest in all the Humanities. The primary concern of ethnology and folkloristics is contemporary folklore – anecdotes, urban legends, rumours, conspiracy theories, all of which are abundant on the internet. Nevertheless, the nature of virtual reality has given rise to new methodological problems and uncertainties. How should we collect folklore material on the internet? A conflict rages between classic face-to-face research and the physically distanced research of the virtual space. Virtual space has different rules and principles of communication; it functions in a different way. Who is our informer? Who is the proprietor of folklore on the internet? And how can we give relevant context to collected materials? The next pitfall is the fact that the internet creates an inexhaustible quantity of folklore material which lives in databases, discussion forums, emails, chatrooms, social networks, etc. How can we use this huge database? This contribution does not profess to provide a correct methodology for conducting qualitative folklore research on the internet. Its purpose is to point out the problems and thus join the discussion already taking place around the world, and increasingly in our region.
Exogenous estradiol enhances apoptosis in regressing post-partum rat corpora lutea possibly mediated by prolactin

Directory of Open Access Journals (Sweden)

Telleria Carlos M

2005-08-01

Full Text Available Abstract Background: In pregnant rats, structural luteal regression takes place after parturition and is associated with cell death by apoptosis. We have recently shown that the hormonal environment is responsible for the fate of the corpora lutea (CL). Changing the levels of circulating hormones in post-partum rats, either by injecting androgen or progesterone, or by allowing dams to suckle, was coupled with a delay in the onset of apoptosis in the CL. The objectives of the present investigation were: (i) to examine the effect of exogenous estradiol on apoptosis of the rat CL during post-partum luteal regression; and (ii) to evaluate the post-partum luteal expression of the estrogen receptor (ER) genes. Methods: In a first experiment, rats after parturition were separated from their pups and injected daily with vehicle or estradiol benzoate for 4 days. On day 4 post-partum, animals were sacrificed, blood samples were taken to determine serum concentrations of hormones, and the ovaries were isolated to study apoptosis in situ. In a second experiment, non-lactating rats after parturition received vehicle, estradiol benzoate or estradiol benzoate plus bromoergocryptine for 4 days, and their CL were isolated and used to study apoptosis ex vivo. In a third experiment, we obtained CL from rats on day 15 of pregnancy and from non-lactating rats on day 4 post-partum, and studied the expression of the messenger RNAs (mRNAs) encoding the ERalpha and ERbeta genes. Results: Exogenous administration of estradiol benzoate induced an increase in the number of apoptotic cells within the CL on day 4 post-partum when compared with animals receiving vehicle alone. Animals treated with the estrogen had higher serum prolactin and progesterone concentrations, with no changes in serum androstenedione. Administration of bromoergocryptine blocked the increase in serum prolactin and progesterone concentrations, and DNA fragmentation induced by the estrogen treatment. ERalpha and

Data for lexicography The central role of the corpus

Directory of Open Access Journals (Sweden)

Allan F. Lauder

2010-10-01

Full Text Available This paper looks at the nature of data for lexicography and in particular at the central role that electronic corpora can play in providing it. Data has traditionally come from existing dictionaries, citations, and from the lexicographer’s own knowledge of words, through introspection. Each of these is examined and evaluated. Then the electronic corpus is considered. Different kinds of corpora are described and key design criteria are explained, in particular the size of corpus needed for lexicography as well as the issue of representativeness and sampling. The advantages and disadvantages of corpora are weighed and compared against the other types of data. While each of these has benefits, it is argued that corpora are a requirement, not an option, as data for dictionary making.
MACUNAÍMA, O HERÓI SEM NENHUM CARÁTER

Directory of Open Access Journals (Sweden)

Prof. Ms. Maria Teresa Hellmeister Fornaciari

2011-05-01

Full Text Available According to an unpublished preface, Mário de Andrade himself said that Macunaíma is an anthology of Brazilian folklore. This text aims to analyse some aspects of this folklore throughout the pages of the work, which is seen as one of the major pillars of Brazilian literature. In addition, the fiction not only mirrors what the Brazilian man was, what he is and what he will be (?) but also focuses on the Brazilian writer and his deep and indispensable roots, so that he can be born and flourish with resourcefulness and talent. This text also aims to highlight the importance of using these genuine values in artistic compositions, as was accomplished in the remarkable text of Mário de Andrade.
The Influence of Reference Corpus Size on Wordsmith Tools Keywords Extraction

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2012-05-01

Full Text Available A KeyWords analysis (using WordSmith Tools) enables the discovery of lexical items which reveal the main lexical sets in a text or corpus. Such an analysis requires that a reference corpus be compared to the corpus the researcher intends to describe (the study corpus). This paper presents a mathematical method for finding out the influence of reference corpus size on the number of key words extracted by the program. The results reveal that a reference corpus that is at least five times as large as the study corpus yields a number of key words that is statistically equivalent to that obtained with larger reference corpora, thus suggesting five times (the size of the study corpus) as the minimum order of magnitude for reference corpora.
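Keyness is usually scored by comparing a word's frequency in the study corpus against the reference corpus. The sketch below uses the log-likelihood statistic, a common choice for this comparison; the abstract does not say which statistic or settings the study used, so this is an assumption, and the frequencies are invented.

```python
import math

def log_likelihood(freq_study, size_study, freq_ref, size_ref):
    """Log-likelihood keyness of one word given its counts and the corpus sizes."""
    combined = freq_study + freq_ref
    total = size_study + size_ref
    expected_study = size_study * combined / total
    expected_ref = size_ref * combined / total
    score = 0.0
    for observed, expected in ((freq_study, expected_study), (freq_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            score += observed * math.log(observed / expected)
    return 2 * score

# Hypothetical word: 50 hits in a 1,000-word study corpus,
# 50 hits in a reference corpus five times as large.
print(log_likelihood(50, 1000, 50, 5000))
# Same relative frequency in both corpora gives a score of zero.
print(log_likelihood(10, 1000, 50, 5000))
```

Because the expected frequencies depend on both corpus sizes, the size of the reference corpus directly affects which words cross a keyness threshold, which is the effect the paper sets out to measure.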
Combining machine learning, crowdsourcing and expert knowledge to detect chemical-induced diseases in text.

Science.gov (United States)

Bravo, Àlex; Li, Tong Shu; Su, Andrew I; Good, Benjamin M; Furlong, Laura I

2016-01-01

Drug toxicity is a major concern for both regulatory agencies and the pharmaceutical industry. In this context, text-mining methods for the identification of drug side effects from free text are key for the development of up-to-date knowledge sources on drug adverse reactions. We present a new system for identification of drug side effects from the literature that combines three approaches: machine learning, rule- and knowledge-based approaches. This system has been developed to address the Task 3.B of Biocreative V challenge (BC5) dealing with Chemical-induced Disease (CID) relations. The first two approaches focus on identifying relations at the sentence-level, while the knowledge-based approach is applied both at sentence and abstract levels. The machine learning method is based on the BeFree system using two corpora as training data: the annotated data provided by the CID task organizers and a new CID corpus developed by crowdsourcing. Different combinations of results from the three strategies were selected for each run of the challenge. In the final evaluation setting, the system achieved the highest Recall of the challenge (63%). By performing an error analysis, we identified the main causes of misclassifications and areas for improving our system, and highlighted the need for consistent gold standard data sets for advancing the state of the art in text mining of drug side effects. Database URL: https://zenodo.org/record/29887?ln=en#.VsL3yDLWR_V. © The Author(s) 2016. Published by Oxford University Press.
Recent Periodicals: Local History, Family and Community History, Cultural Heritage, Folk Studies, Anthropology - A Review (2016)

Directory of Open Access Journals (Sweden)

R. Vladova

2017-12-01

Full Text Available An annual bibliography of papers published in 2016 in the fields of local history, family and community history, cultural heritage, folk studies and anthropology has been compiled. The journals surveyed are: Bulgarian Journal of Science and Education Policy, Chemistry: Bulgarian Journal of Science Education, Current Anthropology, Family and Community History, Folklore, History and Memory, Journal of Family History, Journal of Folklore Research, Past & Present, Winterthur Portfolio. Many of these journals are available to us by subscription.
Os regimentos das corporações dos ofícios mecânicos: O caso do Retábulo-mor da Sé de Lamego (1506-1511) do pintor português Vasco Fernandes

Directory of Open Access Journals (Sweden)

Joana Salgueiro

2010-08-01

Full Text Available The ensemble under study, the main altarpiece of Lamego Cathedral (1506-1511), a work of unquestionable historical and artistic importance by the sixteenth-century painter Vasco Fernandes, “Grão Vasco”, is valuably documented by its works contract, which has survived to the present day. It is known, however, that data perceived empirically, or even recorded in the notarial acts concerning the making of the altarpiece, for numerous reasons do not always correspond fully to reality. The aim of the present work is to cross-reference technical and material knowledge of the supports of these paintings with the data analysed in the regulations (regimentos) of the guilds of the mechanical woodworking trades: carpenters, joinery carpenters, cabinetmakers and woodcarvers (and, by comparison, painters). The goal is to determine, from the methods used to examine apprentices of the trades and from the other rules, the techniques and materials of execution that were required in the historical context of the Portuguese Renaissance.
Improving Feature Representation Based on a Neural Network for Author Profiling in Social Media Texts.

Science.gov (United States)

Gómez-Adorno, Helena; Markov, Ilia; Sidorov, Grigori; Posadas-Durán, Juan-Pablo; Sanchez-Perez, Miguel A; Chanona-Hernandez, Liliana

2016-01-01

We introduce a lexical resource for preprocessing social media data. We show that a neural network-based feature representation is enhanced by using this resource. We conducted experiments on the PAN 2015 and PAN 2016 author profiling corpora and obtained better results when performing the data preprocessing using the developed lexical resource. The resource includes dictionaries of slang words, contractions, abbreviations, and emoticons commonly used in social media. Each of the dictionaries was built for the English, Spanish, Dutch, and Italian languages. The resource is freely available.
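The kind of dictionary-based preprocessing described above can be sketched as a token lookup. The dictionary entries, placeholder tags and the function name `normalize` below are invented for illustration; they are not taken from the published resource, whose format the abstract does not describe.

```python
# Tiny illustrative dictionaries; the real resource covers four languages.
SLANG = {"gr8": "great", "u": "you"}
CONTRACTIONS = {"can't": "cannot", "i'm": "i am"}
ABBREVIATIONS = {"btw": "by the way"}
EMOTICONS = {":)": "<smile>", ":(": "<sad>"}

def normalize(text):
    """Lowercase the text and replace tokens found in any dictionary."""
    normalized = []
    for token in text.lower().split():
        for table in (EMOTICONS, CONTRACTIONS, ABBREVIATIONS, SLANG):
            if token in table:
                token = table[token]
                break  # first matching dictionary wins
        normalized.append(token)
    return " ".join(normalized)

print(normalize("I'm sure u can't miss this :)"))
```

Whitespace tokenization is a simplification: punctuation attached to a token (for example "u," with a trailing comma) would not match, so a real pipeline would need a tokenizer suited to social media text.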
Linguistic positivity in historical texts reflects dynamic environmental and psychological factors.

Science.gov (United States)

Iliev, Rumen; Hoover, Joe; Dehghani, Morteza; Axelrod, Robert

2016-12-06

People use more positive words than negative words. Referred to as "linguistic positivity bias" (LPB), this effect has been found across cultures and languages, prompting the conclusion that it is a panhuman tendency. However, although multiple competing explanations of LPB have been proposed, there is still no consensus on what mechanism(s) generate LPB or even on whether it is driven primarily by universal cognitive features or by environmental factors. In this work we propose that LPB has remained unresolved because previous research has neglected an essential dimension of language: time. In four studies conducted with two independent, time-stamped text corpora (Google books Ngrams and the New York Times), we found that LPB in American English has decreased during the last two centuries. We also observed dynamic fluctuations in LPB that were predicted by changes in objective environment, i.e., war and economic hardships, and by changes in national subjective happiness. In addition to providing evidence that LPB is a dynamic phenomenon, these results suggest that cognitive mechanisms alone cannot account for the observed dynamic fluctuations in LPB. At the least, LPB likely arises from multiple interacting mechanisms involving subjective, objective, and societal factors. In addition to having theoretical significance, our results demonstrate the value of newly available data sources in addressing long-standing scientific questions.
POLTERGEIST PHENOMENA IN CONTEMPORARY FOLKLORE

Directory of Open Access Journals (Sweden)

Oana VOICHICI

2017-05-01

Full Text Available The article deals with instances of the supernatural in Romanian urban legends, namely what we call the strigoi, or poltergeist. Usually, folklorists tend to exclude the supernatural from the category of urban legends; however, we have decided to take these accounts into consideration because the transmitters, the narrators, do not distinguish between these elements and the rest of contemporary legends, and today’s popular culture abounds in such accounts.
FOLKLORE ELEMENTS IN BEDRİ RAHMİ EYUBOGLU’S POEMS BEDRİ RAHMİ EYÜBOĞLU’NUN ŞİİRLERİNDE HALK BİLİMİ UNSURLARI

Directory of Open Access Journals (Sweden)

Bahar DOĞAN

2012-01-01

Full Text Available The aim of this study is to identify the folklore elements in Eyuboglu’s poems. To this end, his poetry books Dol Karabakır Dol and Karadut were examined. The survey model was used in the study. The findings were interpreted using the 25 items under which Örnek classifies the subjects of folklore studies in his book “Türk Halk Bilimi”. The examples in Eyuboglu’s poems include village, town and city life; folk architecture; vehicles and transportation techniques; types of economy; folk economy; nutrition, cuisine and pantry; methods of measuring, weighing and calculating; folk arts and handicrafts; folk knowledge; folk beliefs, customs and traditions; transition periods; stereotyped behaviours and expressions; folk literature; folk dance; folk music and folk musical instruments. The poet gives a great deal of space in his poems to folk songs and folk arts. The poet, who says “Whenever I hear a village song, I feel ashamed of my poetry”, also gives space to the beauty of his country. His occasional use of local dialect in his poems makes him one of the people. According to this study, including Eyuboglu’s poems in textbooks can be an important step in raising individuals with versatile personalities.
An annotated corpus with nanomedicine and pharmacokinetic parameters

Directory of Open Access Journals (Sweden)

Lewinski NA

2017-10-01

Full Text Available Nastassja A Lewinski,1 Ivan Jimenez,1 Bridget T McInnes2; 1Department of Chemical and Life Science Engineering, Virginia Commonwealth University, Richmond, VA; 2Department of Computer Science, Virginia Commonwealth University, Richmond, VA, USA. Abstract: A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration’s Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided. Keywords: nanotechnology, informatics, natural language processing, text mining, corpora
BC4GO: a full-text corpus for the BioCreative IV GO task.

Science.gov (United States)

Van Auken, Kimberly; Schaeffer, Mary L; McQuilton, Peter; Laulederkind, Stanley J F; Li, Donghui; Wang, Shur-Jen; Hayman, G Thomas; Tweedie, Susan; Arighi, Cecilia N; Done, James; Müller, Hans-Michael; Sternberg, Paul W; Mao, Yuqing; Wei, Chih-Hsuan; Lu, Zhiyong

2014-01-01

Gene function curation via Gene Ontology (GO) annotation is a common task among Model Organism Database groups. Owing to its manual nature, this task is considered one of the bottlenecks in literature curation. There have been many previous attempts at automatic identification of GO terms and supporting information from full text. However, few systems have delivered an accuracy that is comparable with humans. One recognized challenge in developing such systems is the lack of marked sentence-level evidence text that provides the basis for making GO annotations. We aim to create a corpus that includes the GO evidence text along with the three core elements of GO annotations: (i) a gene or gene product, (ii) a GO term and (iii) a GO evidence code. To ensure our results are consistent with real-life GO data, we recruited eight professional GO curators and asked them to follow their routine GO annotation protocols. Our annotators marked up more than 5000 text passages in 200 articles for 1356 distinct GO terms. For evidence sentence selection, the inter-annotator agreement (IAA) results are 9.3% (strict) and 42.7% (relaxed) in F1-measures. For GO term selection, the IAAs are 47% (strict) and 62.9% (hierarchical). Our corpus analysis further shows that abstracts contain ~10% of relevant evidence sentences and 30% of distinct GO terms, while the Results/Experiment section has nearly 60% of relevant sentences and >70% of GO terms. Further, of those evidence sentences found in abstracts, less than one-third contain enough experimental detail to fulfill the three core criteria of a GO annotation. This result demonstrates the need to use full-text articles for text mining GO annotations. Through its use at the BioCreative IV GO (BC4GO) task, we expect our corpus to become a valuable resource for the BioNLP research community. Database URL: http://www.biocreative.org/resources/corpora/bc-iv-go-task-corpus/. Published by Oxford University Press 2014. This work is written by US
Semantic tagging of the National Corpus of the Chuvash language СЕМАНТИЧЕСКАЯ РАЗМЕТКА НАЦИОНАЛЬНОГО КОРПУСА ЧУВАШСКОГО ЯЗЫКА

Directory of Open Access Journals (Sweden)

Zheltov, P.V.

2016-09-01

Full Text Available The paper describes a system of semantic tags ready for use in the National Corpus of the Chuvash language. The approach is based on a semantic classification of the lexicon and turns out to be universal and applicable to other languages. The practical benefit of tagging the vocabulary and the text corpora is better search result quality and extended user facilities. The tagging and the semantic classification must be oriented towards some programming paradigm; we have chosen the functional paradigm.
Cell line name recognition in support of the identification of synthetic lethality in cancer from text

Science.gov (United States)

Kaewphan, Suwisa; Van Landeghem, Sofie; Ohta, Tomoko; Van de Peer, Yves; Ginter, Filip; Pyysalo, Sampo

2016-01-01

Motivation: The recognition and normalization of cell line names in text is an important task in biomedical text mining research, facilitating for instance the identification of synthetically lethal genes from the literature. While several tools have previously been developed to address cell line recognition, it is unclear whether available systems can perform sufficiently well in realistic and broad-coverage applications such as extracting synthetically lethal genes from the cancer literature. In this study, we revisit the cell line name recognition task, evaluating both available systems and newly introduced methods on various resources to obtain a reliable tagger not tied to any specific subdomain. In support of this task, we introduce two text collections manually annotated for cell line names: the broad-coverage corpus Gellus and CLL, a focused target domain corpus. Results: We find that the best performance is achieved using NERsuite, a machine learning system based on Conditional Random Fields, trained on the Gellus corpus and supported with a dictionary of cell line names. The system achieves an F-score of 88.46% on the test set of Gellus and 85.98% on the independently annotated CLL corpus. It was further applied at large scale to 24 302 102 unannotated articles, resulting in the identification of 5 181 342 cell line mentions, normalized to 11 755 unique cell line database identifiers. Availability and implementation: The manually annotated datasets, the cell line dictionary, derived corpora, NERsuite models and the results of the large-scale run on unannotated texts are available under open licenses at http://turkunlp.github.io/Cell-line-recognition/. Contact: sukaew@utu.fi PMID:26428294
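The dictionary support mentioned in this abstract can be pictured as a longest-match lookup whose output is passed to the sequence tagger as an extra feature. The following is a minimal sketch under that assumption, not NERsuite's actual implementation; the function name `dictionary_tags`, the toy dictionary and the B/I/O tag scheme are hypothetical and chosen only for illustration.

```python
def dictionary_tags(tokens, dictionary, max_len=4):
    """Tag tokens with B/I/O using greedy longest-match dictionary lookup."""
    # Normalise dictionary entries to lower-cased token tuples.
    entries = {tuple(name.lower().split()) for name in dictionary}
    tags = ["O"] * len(tokens)
    i = 0
    while i < len(tokens):
        matched = 0
        # Try the longest candidate span first, then shorter ones.
        for n in range(min(max_len, len(tokens) - i), 0, -1):
            if tuple(t.lower() for t in tokens[i:i + n]) in entries:
                matched = n
                break
        if matched:
            tags[i] = "B"
            for j in range(i + 1, i + matched):
                tags[j] = "I"
            i += matched
        else:
            i += 1
    return tags


# Toy example: two hypothetical dictionary entries.
tokens = "HeLa and MCF 7 cells were cultured".split()
print(dictionary_tags(tokens, ["HeLa", "MCF 7"]))
```

In a real tagger these tags would be one feature among many, so a dictionary miss does not prevent the statistical model from recognising an unseen name.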
Lexikos - Vol 6 (1996)

African Journals Online (AJOL)

Using Learner Corpora for L2 Lexicography: Information on Collocational Errors for EFL learners. Yukio Tono, 116-132 ...
JaSlo: Integration of a Japanese-Slovene Bilingual Dictionary with a Corpus Search System

Directory of Open Access Journals (Sweden)

Kristina HMELJAK SANGAWA

2012-12-01

Full Text Available The paper presents a set of integrated on-line language resources targeted at Japanese language learners, primarily those whose mother tongue is Slovene. The resources consist of the on-line Japanese-Slovene learners’ dictionary jaSlo and two corpora, a 1 million word Japanese-Slovene parallel corpus and a 300 million word corpus of web pages, where each word and sentence is marked by its difficulty level; this corpus is furthermore available as a set of five distinct corpora, each one containing sentences of one particular level. The corpora are available for exploration through NoSketch Engine, the open source version of the commercial state-of-the-art corpus analysis software Sketch Engine. The dictionary is available for Web searching, and dictionary entries have direct links to examples from the corpora, thus offering a wider picture of (a) possible translations in concrete contextualised examples, and (b) monolingual Japanese usage examples of different difficulty levels to support language learning.
BioC-compatible full-text passage detection for protein-protein interactions using extended dependency graph.

Science.gov (United States)

Peng, Yifan; Arighi, Cecilia; Wu, Cathy H; Vijay-Shanker, K

2016-01-01

There has been a large growth in the number of biomedical publications that report experimental results. Many of these results concern detection of protein-protein interactions (PPI). In BioCreative V, we participated in the BioC task and developed a PPI system to detect text passages with PPIs in full-text articles. By adopting the BioC format, the output of the system can be added to the biocuration pipeline with little effort required for system integration. A distinctive feature of our PPI system is that it utilizes an extended dependency graph, an intermediate level of representation that attempts to abstract away syntactic variations in text. As a result, we are able to use only a limited set of rules to extract PPI pairs in the sentences, and additional rules to detect additional passages for PPI pairs. For evaluation, we used the 95 articles that were provided for the BioC annotation task. We retrieved the unique PPIs from the BioGRID database for these articles and show that our system achieves a recall of 83.5%. In order to evaluate the detection of passages with PPIs, we further annotated Abstract and Results sections of 20 documents from the dataset and show that an f-value of 80.5% was obtained. To evaluate the generalizability of the system, we also conducted experiments on AIMed, a well-known PPI corpus. We achieved an f-value of 76.1% for sentence detection and an f-value of 64.7% for unique PPI detection. Database URL: http://proteininformationresource.org/iprolink/corpora. © The Author(s) 2016. Published by Oxford University Press.
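The recall and f-value figures reported in this abstract follow the usual set-based definitions. A minimal sketch of how recall over unique PPIs might be computed is given below, assuming that an interaction is an unordered pair of protein identifiers; the function names `ppi_recall` and `f_value` and the toy pairs are hypothetical and are not taken from the paper.

```python
def ppi_recall(predicted, gold):
    """Recall of unique, unordered protein pairs against a gold set."""
    # frozenset makes ("A", "B") and ("B", "A") the same interaction.
    gold_set = {frozenset(pair) for pair in gold}
    pred_set = {frozenset(pair) for pair in predicted}
    if not gold_set:
        return 0.0
    return len(pred_set & gold_set) / len(gold_set)


def f_value(precision, recall):
    """Harmonic mean of precision and recall (balanced F-measure)."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Toy data: three of the four gold pairs are recovered.
gold = [("A", "B"), ("C", "D"), ("E", "F"), ("G", "H")]
predicted = [("B", "A"), ("C", "D"), ("E", "F"), ("X", "Y")]
print(ppi_recall(predicted, gold))
```

Treating pairs as unordered is a design choice made for this sketch; a directed notion of interaction would use tuples instead of `frozenset`.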
Between Folk and Lore: Performing, Textualising and (mis)Interpreting the Irish Oral Tradition

Directory of Open Access Journals (Sweden)

Vito Carrassi

2017-10-01

Full Text Available Folklore, as a historical and cultural process producing and transmitting beliefs, stories, customs, and practices, has always thrived and evolved in the broader context of history and culture. Consequently, tradition and modernity have long coexisted and influenced one another, in particular in the world of folk narratives, orality and literature, storytellers and writers. Since the nineteenth century, folklorists (a category including a variety of figures) have collected, transcribed and published pieces of oral tradition, thus giving folklore a textual form and nature. However, folk narratives continue to be also a living and performed experience for the tradition bearers, a process giving rise to ever new and different expressions, according to the changing historical, social, cultural, and economic conditions. To be sure, folklore – and folk narrative – needs to be constantly lived and performed to remain something actually pertinent and significant, and not only within the oral and traditional contexts. Interestingly, between the nineteenth and the twentieth centuries, folklore increasingly came to be regarded as and transformed into an inheritance, a valuable, national heritage particularly fitting for those countries, such as Ireland, in search of a strong, national identity. In this light, folklore and folk narratives, beside their routine existence within their original contexts, were consciously “performed” by the official culture, which employed them in politics, education, literature, etc. In the process, it could happen that folk materials were dehistoricised and idealised, “embalmed” according to Máirtin Ó Cadhain, and even trivialised. This situation was turned into a fruitful and significant source of inspiration for the literary parody of Myles na gCopaleen (Flann O’Brien) who, in his Gaelic novel, An Béal Bocht, revealed the funny yet distressing truth of the Irish folklore being misunderstood and betrayed by
Analysis of Influence of Different Relations Types on the Quality of Thesaurus Application to Text Classification Problems

Directory of Open Access Journals (Sweden)

Nadezhda S. Lagutina

2017-01-01

Full Text Available The main purpose of the article is to analyze how effectively different types of thesaurus relations can be used to solve text classification tasks. The basis of the study is an automatically generated thesaurus of a subject area that contains three types of relations: synonymous, hierarchical and associative. To generate the thesaurus, the authors use a hybrid method based on several linguistic and statistical algorithms for the extraction of semantic relations. The method makes it possible to create a thesaurus with a sufficiently large number of terms and relations among them. The authors consider two problems: topical text classification and sentiment classification of large newspaper articles. To solve them, the authors developed two approaches that complement standard algorithms with a procedure that takes thesaurus relations into account to determine the semantic features of texts. The approach to topical classification combines the standard unsupervised BM25 algorithm with a procedure that takes into account the synonymous and hierarchical relations of the subject-area thesaurus. The approach to sentiment classification consists of two steps. In the first step, a thesaurus is created in which the polarity weights of terms are calculated from the term occurrences in the training set or from the weights of related thesaurus terms. In the second step, the thesaurus is used to compute the features of words from texts and to classify texts with the SVM or Naive Bayes algorithm. In experiments with the text corpora BBCSport, Reuters, PubMed and a corpus of articles about American immigrants, the authors varied the types of thesaurus relations involved in the classification and the degree of their use. The results of the experiments make it possible to evaluate the efficiency of applying thesaurus relations to the classification of raw texts and to determine under what conditions certain relations have a greater or lesser effect. In particular, the
Mobile-based folklore for elementary school children CERITA RAKYAT BERBASIS MOBILE UNTUK ANAK SEKOLAH DASAR

Directory of Open Access Journals (Sweden)

I Nyoman Laba Jayanta

2017-11-01

Full Text Available This research aimed at developing a mobile-based folklore application, with local wisdom inserted, for elementary school students, using the Balinese language. The research applied the System Development Life Cycle (SDLC) method and went through five steps: Analysis, Design, Implementation, Testing, and Evaluation. The first step was a needs analysis covering the application's content and its development. The next step was application design, including flowchart design and storyboarding. The implementation step followed the application design plan and resulted in a Balinese folklore application with local wisdom inserted. To test the functions of the application, it was tested with the black box method. The evaluation was conducted with the participation of teaching staff and elementary school students at SDN 3 Banyuning. It showed that 20 out of 25 students liked this folklore application with local wisdom inserted.

In Search of a National Epic: The use of Old Norse myths in Tolkien's vision of Middle-earth

Directory of Open Access Journals (Sweden)

Tommy Kuusela

2014-05-01

Full Text Available In this article some aspects of Tolkien’s work with regard to his relationship to folklore and nationalism are presented. It is also argued, contrary to Lauri Honko’s view of literary epics, that pre-literary sources constitute a problem for the creators of literary epics and that their elements can direct the choice of plot and form. Tolkien felt that there was a British – but no English – mythology comparable to the Greek, Finnish or Norse ones. He tried to reconstruct the ‘lost mythology’ with building blocks from existing mythologies, and dedicated his work to the English people. In this, he saw himself as a compiler of old source material. This article considers his use of Old Norse sources. With Honko’s notion of the second life of folklore it is argued that Tolkien managed to popularise folklore material while his efforts to make his work exclusively English failed; for a contemporary audience it is rather cross-cultural.
Parsing with Subdomain Instance Weighting from Raw Corpora

NARCIS (Netherlands)

Plank, Barbara; Sima'an, Khalil

2008-01-01

The treebanks that are used for training statistical parsers consist of hand-parsed sentences from a single source/domain like newspaper text. However, newspaper text concerns different subdomains of language use (e.g. finance, sports, politics, music), which implies that the statistics gathered by
The language of poetic texts in contemporary Tuvan pop songs

Directory of Open Access Journals (Sweden)

Oyumaa M. Saaya

2017-06-01

Full Text Available The article presents a linguistic analysis of the lyrics of modern Tuvan pop songs. Studying them is important for understanding contemporary songwriting in Tuva, and it is also necessary to discover what linguistic means, functional styles and vocabulary are used by modern authors of popular lyrics. The study can also help identify how contemporary global trends influence songwriting in linguistic terms. Three groups of songs can be defined in Tuvan pop music. The first comprises songs written by both professional poets and amateurs with good writing skills. Their texts have a homogeneous literary style and are intended for a general audience (rather than specific groups of listeners). They do not feature any jargon or youth slang. The second group consists of “songs of the people” which are still popular and relevant, but not classified as folklore. This group also contains songs previously banned by censorship, and those written by ex-convicts. Their lyrics differ in style, and the vocabulary is also heterogeneous: they can include slang and contain vernacular language. The third group includes songs following popular global and Russian trends, which triggered rapid evolution in Tuvan songwriting. There is a significant number of authors, or even creative unions, who write both lyrics and music. These songs are stylistically uneven and contain many neologisms, borrowed vocabulary, slang and jargon words, and sometimes even macaronic (mixed) language. The author provides a more in-depth analysis of lyrics belonging to the third group of songs. They can be divided into 6 thematic subgroups which vary greatly in lexical content and the use of tropes. The lyrics of contemporary Tuvan songs are quite close to the everyday language young people use. Active employment of jargon in the language of young and middle-aged people, especially in lyrics of modern songs, steadily erodes the literary norms of Tuvan language. The author emphasizes that
The Category of Time in Fairy Tales: Searching for Folk Calendar Time in the Estonian Fairy Tale Corpus

Directory of Open Access Journals (Sweden)

Mairi Kaasik

2011-03-01

Full Text Available The article examines how folk calendar holidays are represented in Estonian fairy tales. It introduces some views presented in folklore studies about the concept of time in fairy tales and finds parallels with them in the Estonian context. The analysis relies on the digital corpus of Estonian fairy tales (5400 variants), created from the texts found in the Estonian Folklore Archives by the Fairy Tale Project of the Department of Estonian and Comparative Folklore, University of Tartu. Folk calendar holidays occur in Estonian fairy tales relatively seldom; most often these are holidays that occupy a significant place in the Estonian folk calendar (Christmas, St. John’s Day, Easter, St. George’s Day). Calendar holidays are notably mentioned more often in tale types which remain on the borderline between the fairy tale and the legend or the fairy tale and the religious tale. In Estonian fairy tales, calendar holidays are used on three levels of meaning: (1) the holiday is organically associated with the tale type; it has an essential role in the plot of the tale; (2) to a certain extent, the holiday could be replaced by another holiday having an analogous meaning; (3) the holiday forms an unimportant or occasional addition to the tale.
Revitalising and Innovating Tradition: The Individual Motivations behind New Songs in the Slovácko Region

Czech Academy of Sciences Publication Activity Database

Uhlíková, Lucie

2017-01-01

Vol. 65, No. 2 (2017), pp. 289-303. ISSN 0350-0861. Institutional support: RVO:68378076. Keywords: folklore and creativity * song composing * innovation and revitalization of tradition * cultural heritage * folklore movement * folk revival movement. Subject RIV: AC - Archeology, Anthropology, Ethnology. OECD field: Folklore studies
Tall Tales: The Simpsons deconstructing the American myth

Directory of Open Access Journals (Sweden)

Hanna Betina Götz

2014-07-01

Full Text Available This article aims to analyze the episode Tall Tales from the series The Simpsons, which revisits legends of American folklore. The TV series pays homage both to the time of the pioneers in their travels to the Far West in the nineteenth century and to one of the most iconic folk characters of American culture: the Hobo, a beggar and a figure of American folklore during the Great Depression. It is also interesting to focus on the American imaginary in order to understand how the authors of The Simpsons perform these recreations in contemporary times.
Automatically Extracting Typical Syntactic Differences from Corpora

NARCIS (Netherlands)

Wiersma, Wybo; Nerbonne, John; Lauttamus, Timo

We develop an aggregate measure of syntactic difference for automatically finding common syntactic differences between collections of text. With the use of this measure, it is possible to mine for differences between, for example, the English of learners and natives, or between related dialects. If
Language and folklore in Hamid Mosaddeq’s poem

Directory of Open Access Journals (Sweden)

نداسادات IRAN

2016-01-01

Full Text Available Abstract: "Standard language", "sub-standard language" and "meta-standard language" are the language types of many varieties. The use of sub-standard language in poetry, known as “stylistic deviation”, is one of the ways of highlighting poetic language. In the contemporary period, Nima paid closer attention to this technique. Nima believed that all words have the potential to enter the realm of poetry: no word is essentially poetic or non-poetic; rather, the way the poet uses a word determines its poetic value. By using sub-standard language elements, Hamid Mosaddeq not only enriched his poems but also brought them closer to the mind, language and life of the people. The folkloric elements of Mosaddeq’s poems are divided into seven groups: (1) slang words, (2) common and spoken vocabulary, (3) irony and proverbs, (4) popular pronunciations, (5) allusions to folk tales, (6) folk beliefs and customs, and (7) local vocabulary. Slang words in Mosaddeq’s poems are examined in the categories of "verb" and "noun". Many folk verbs, such as "Shangidan" and "gap zadan" (to chat), are used in Mosaddeq’s poems. Some folk verbs in his poems are such that their point cannot be understood at first: these verbs have several meanings, one or more of which are slang, like the verb "gereftan" (to get), which in the sense "to grow the root of the plant" is slang. Folk nouns are abundant in Mosaddeq’s poems. Some of the nouns he uses can, considering their figurative meanings, be placed in the group of folk nouns, like "foot" in the figurative sense of "will". Colloquial and current words are among the most frequent folk elements in Mosaddeq’s poetry. These words can be analyzed in the categories of "nouns" and "verbs". Lexical verbs such as "to hip" and "Perfume of Moskow" are of this kind. "Irony and Proverbs" are the other folk elements of the poetry of Mosaddeq
A comprehensive benchmark of kernel methods to extract protein-protein interactions from literature.

Directory of Open Access Journals (Sweden)

Domonkos Tikk

Full Text Available The most important way of conveying new findings in biomedical research is scientific publication. Extraction of protein-protein interactions (PPIs) reported in scientific publications is one of the core topics of text mining in the life sciences. Recently, a new class of such methods has been proposed - convolution kernels that identify PPIs using deep parses of sentences. However, comparing published results of different PPI extraction methods is impossible due to the use of different evaluation corpora, different evaluation metrics, different tuning procedures, etc. In this paper, we study whether the reported performance metrics are robust across different corpora and learning settings and whether the use of deep parsing actually leads to an increase in extraction quality. Our ultimate goal is to identify the one method that performs best in real-life scenarios, where information extraction is performed on unseen text and not on specifically prepared evaluation data. We performed a comprehensive benchmarking of nine different methods for PPI extraction that use convolution kernels on rich linguistic information. Methods were evaluated on five different public corpora using cross-validation, cross-learning, and cross-corpus evaluation. Our study confirms that kernels using dependency trees generally outperform kernels based on syntax trees. However, our study also shows that only the best kernel methods can compete with a simple rule-based approach when the evaluation prevents information leakage between training and test corpora. Our results further reveal that the F-score of many approaches drops significantly if no corpus-specific parameter optimization is applied and that methods reaching a good AUC score often perform much worse in terms of F-score. We conclude that for most kernels no sensible estimation of PPI extraction performance on new text is possible, given the current heterogeneity in evaluation data. Nevertheless, our study
Concept annotation in the CRAFT corpus.

Science.gov (United States)

Bada, Michael; Eckert, Miriam; Evans, Donald; Garcia, Kristin; Shipley, Krista; Sitnikov, Dmitry; Baumgartner, William A; Cohen, K Bretonnel; Verspoor, Karin; Blake, Judith A; Hunter, Lawrence E

2012-07-09

Manually annotated corpora are critical for the training and evaluation of automated methods to identify concepts in biomedical text. This paper presents the concept annotations of the Colorado Richly Annotated Full-Text (CRAFT) Corpus, a collection of 97 full-length, open-access biomedical journal articles that have been annotated both semantically and syntactically to serve as a research resource for the biomedical natural-language-processing (NLP) community. CRAFT identifies all mentions of nearly all concepts from nine prominent biomedical ontologies and terminologies: the Cell Type Ontology, the Chemical Entities of Biological Interest ontology, the NCBI Taxonomy, the Protein Ontology, the Sequence Ontology, the entries of the Entrez Gene database, and the three subontologies of the Gene Ontology. The first public release includes the annotations for 67 of the 97 articles, reserving two sets of 15 articles for future text-mining competitions (after which these too will be released). Concept annotations were created based on a single set of guidelines, which has enabled us to achieve consistently high interannotator agreement. As the initial 67-article release contains more than 560,000 tokens (and the full set more than 790,000 tokens), our corpus is among the largest gold-standard annotated biomedical corpora. Unlike most others, the journal articles that comprise the corpus are drawn from diverse biomedical disciplines and are marked up in their entirety. Additionally, with a concept-annotation count of nearly 100,000 in the 67-article subset (and more than 140,000 in the full collection), the scale of conceptual markup is also among the largest of comparable corpora. The concept annotations of the CRAFT Corpus have the potential to significantly advance biomedical text mining by providing a high-quality gold standard for NLP systems. 
The corpus, annotation guidelines, and other associated resources are freely available at http://bionlp-corpora.sourceforge.net/CRAFT/index.shtml.
Books received by the Rijksherbarium library

NARCIS (Netherlands)

NN,

1977-01-01

This richly illustrated book shows many of the most common and most spectacular species of fungi found in Western Europe. The text is very popular and touches on such topics as poisonousness, edibility, culinary instructions, prejudices, and folklore.
Evaluating topic models with stability

CSIR Research Space (South Africa)

De Waal, A

2008-11-01

Full Text Available Topic models are unsupervised techniques that extract likely topics from text corpora, by creating probabilistic word-topic and topic-document associations. Evaluation of topic models is a challenge because (a) topic models are often employed...
The Value in Verifying Medical Folklore

Directory of Open Access Journals (Sweden)

Dennis J. Baumgardner

2017-08-01

Full Text Available Citing a related article published within this issue of the Journal of Patient-Centered Research and Reviews, the author opines on why traditional ideas regarding human health can persist over decades, and even centuries, despite a lack of scientifically accumulated evidence. It is important to keep in mind that some commonly accepted truths are supported by little to no factual data, and that occasionally patients may benefit from clarification on what is (or, often, is not) actually known about longstanding “rules of thumb” (eg, certain home remedies, disease-prevention measures or behavioral concerns). On the flip side, traditions that are shown to be not harmful, like drinking chicken soup to relieve cold symptoms, may be safely indulged regardless of effectiveness.
VITA-6.2: Advanced visual tool for information management

International Nuclear Information System (INIS)

Jacobson, Z.; Truong, Q.S.; Houston, B.; Taylor, V.; Herber, N.; El Gebaly, A.

2007-01-01

Visual Interface for Text Analysis (VITA), our combined user interface and meta-search engine software application, improves the quality and speed at which intelligence analysts can explore novel massive text corpora via innovations that facilitate user contextual awareness. (author)
Visual communication design of the short animated film “Sitiha dan Sisiti” Perancangan Komunikasi Visual Film Animasi Pendek “Sitiha dan Sisiti”

Directory of Open Access Journals (Sweden)

Fenny Wijaya

2010-10-01

Full Text Available The purpose of this research is to acquire, collect and analyze the data needed to design a short 3D animated film with a folklore theme, presented with enough visual appeal to interest spectators, especially children, so that its moral message can be conveyed. The research method consisted of direct field surveys at the Indonesian cultural center TMII, a playground and a library, supplemented by literature such as books, magazines and journals and by internet references relating to the topic. The intended result is that the moral message conveyed in this animated folklore film can be received and understood by the audience, especially children. The conclusion is that visual communication media such as movies and television shows are currently very popular among children. By using the medium of animated film, children will therefore be more interested in, and may come to like, local folklore again, since local productions are not of lesser quality than foreign ones.
Revisiting corpus creation and analysis tools for translation tasks

Directory of Open Access Journals (Sweden)

Claudio Fantinuoli

2016-06-01

Full Text Available Many translation scholars have proposed the use of corpora to allow professional translators to produce high quality texts which read like originals. Yet, the diffusion of this methodology has been modest, one reason being that software for corpus analysis has been developed with the linguist in mind, which means that it is generally complex and cumbersome, offering many advanced features but lacking the level of usability and the specific features that meet translators’ needs. To overcome this shortcoming, we have developed TranslatorBank, a free corpus creation and analysis tool designed for translation tasks. TranslatorBank supports the creation of specialized monolingual corpora from the web; it includes a concordancer with a query system similar to a search engine; it uses basic statistical measures to indicate the reliability of results; it accesses the original documents directly for more contextual information; it includes a statistical and linguistic terminology extraction utility to extract the relevant terminology of the domain and the typical collocations of a given term. Designed to be easy and intuitive to use, the tool may help translation students as well as professionals to increase their translation quality by adhering to the specific linguistic variety of the target text corpus.
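A concordancer of the kind mentioned in this abstract typically shows each hit of a query with some context on either side (keyword in context, KWIC). The sketch below illustrates that general idea only; it is not TranslatorBank's implementation, and the function name `kwic`, the `width` parameter and the sample text are hypothetical.

```python
import re


def kwic(text, query, width=30):
    """Return one keyword-in-context line per whole-word match of query."""
    pattern = re.compile(r"\b" + re.escape(query) + r"\b", re.IGNORECASE)
    lines = []
    for m in pattern.finditer(text):
        # Take up to `width` characters of context on each side of the hit.
        left = text[max(0, m.start() - width):m.start()]
        right = text[m.end():m.end() + width]
        # Right-align the left context so the keywords line up in a column.
        lines.append(f"{left:>{width}} [{m.group(0)}] {right}")
    return lines


sample = "The corpus is small. A larger corpus helps translators."
for line in kwic(sample, "corpus", width=15):
    print(line)
```

Aligning the keyword in a fixed column is the conventional KWIC layout because it lets the reader scan the left and right collocates of a term at a glance.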
The role and importance of Tuvan literature and teaching it in schools in the preservation and development of Tuvan language

Directory of Open Access Journals (Sweden)

Lidiia Kh. Oorzhak

2018-03-01

Full Text Available The authors of the article start off by expressing their concern about the level of command of Tuvan by the younger generation, especially children. Preserving and developing the Tuvan language is impossible without literature in Tuvan and without teaching it in schools and other educational institutions. The article deals with the issues of teaching Tuvan literature in secondary comprehensive schools of the Republic of Tuva. The authors also provide an overview of textbooks of Tuvan literature compiled at the laboratory of Tuvan philology, Institute for the Development of National Schools of the Republic of Tuva, in compliance with the Federal educational standards of the Russian Federation. The textbooks provide the mandatory minimum of the standard-provided content of general education and guarantee the required quality of knowledge for school graduates. In 2013-2017, textbooks titled «Tөreen chogaal» (Literature in the Native Tongue) were compiled and published for Grades 5-9, as well as two accompanying textbooks for Grades 5 and 6. The textbooks rely on the methodological principles of the study program «Tyva aas chogaaly bolgash literatura. Niiti өөredilge cherleriniң 5-11 klasstarynga chizhek programma» (Tuvan folklore and literature. Sample Study Program for Grades 5-11 of Comprehensive Schools). In comparison to the previous generation of textbooks, these have been largely updated both in their structure and in the scope of their content. The texts were grouped in the following categories: “Folklore, the nation’s boundless treasury”, “From folklore to literary genres”, “The world of childhood”, “The world of wonders”, “Holy places”, “The Stars of Victory” and “Animal world”. They prominently feature folklore texts, including shaman songs; tests and creative tasks have also been developed. In terms of their content and methodology, the textbooks intend to familiarize students with the spiritual, moral and aesthetic values of
Serum progesterone levels for diagnosing pregnancy and monitoring corpora lutea function during different reproductive stages in hormonally-treated heat synchronized female damascus goats

International Nuclear Information System (INIS)

Zakawi, M.

2003-01-01

An experiment was conducted on female Damascus goats during the breeding season to diagnose pregnancy on days 21-22 and 40-44 after mating and to monitor corpora lutea function during different reproductive stages by measuring serum progesterone levels using radioimmunoassay. A total of 75 intact female Damascus goats were divided into 3 equal groups: S, P and C. Females in group S were fitted with sponges containing 60 mg of medroxyprogesterone acetate (MAP) for 14 days and injected, at sponge withdrawal, with pregnant mare serum gonadotrophin (PMSG). Females in group P were injected twice with prostaglandin F2α at an 11-day interval. Females in group C (control) received no treatment. The results indicated that the accuracy of positive pregnancy diagnosis on days 21-22 and 40-44 was 90.5% and 94.4%, respectively, and it was 100% for detecting non-pregnancy. There was no significant difference (p>0.05) among the 3 groups in serum progesterone level between days 21-22 and 40-44 after mating. However, there were significant (p -1 at mating, during pregnancy and at kidding. The triplet-carrying goats had a significantly (p -1, respectively. There was no significant difference in serum progesterone levels between the single- and twin-carrying goats.
AHP 10: Folklore: Bear and Rabbit (I)

Directory of Open Access Journals (Sweden)

G.yu lha གཡུ་ལྷ།

2011-06-01

Full Text Available G.yu lha writes: I recorded this folktale from Thub bstan (b. 1936), the reincarnate lama in Siyuewu Village (Puxi Township, Rangtang County, Aba Tibetan and Qiang Autonomous Prefecture, Sichuan Province), when I visited him in the winter of 2009-2010. Thub bstan learned this folktale from his mother. I heard this tale when I was around six years old from my great grandfather when my family was having dinner near the stove one evening.

Avoid violence, rioting, and outrage; approach celebration, delight, and strength: Using large text corpora to compute valence, arousal, and the basic emotions.

Science.gov (United States)

Westbury, Chris; Keith, Jeff; Briesemeister, Benny B; Hofmann, Markus J; Jacobs, Arthur M

2015-01-01

Ever since Aristotle discussed the issue in Book II of his Rhetoric, humans have attempted to identify a set of "basic emotion labels". In this paper we propose an algorithmic method for evaluating sets of basic emotion labels that relies upon computed co-occurrence distances between words in a 12.7-billion-word corpus of unselected text from USENET discussion groups. Our method uses the relationship between human arousal and valence ratings collected for a large list of words, and the co-occurrence similarity between each word and emotion labels. We assess how well the words in each of 12 emotion label sets (proposed by various researchers over the past 118 years) predict the arousal and valence ratings on a test and validation dataset, each consisting of over 5970 items. We also assess how well these emotion labels predict lexical decision residuals (LDRTs), after co-varying out the effects attributable to basic lexical predictors. We then demonstrate a generalization of our method to determine the most predictive "basic" emotion labels from among all of the putative models of basic emotion that we considered. As well as contributing empirical data towards the development of a more rigorous definition of basic emotions, our method makes it possible to derive principled computational estimates of emotionality (specifically, of arousal and valence) for all words in the language.
A unified framework for evaluating the risk of re-identification of text de-identification tools.

Science.gov (United States)

Scaiano, Martin; Middleton, Grant; Arbuckle, Luk; Kolhatkar, Varada; Peyton, Liam; Dowling, Moira; Gipson, Debbie S; El Emam, Khaled

2016-10-01

It has become regular practice to de-identify unstructured medical text for use in research using automatic methods, the goal of which is to remove patient identifying information to minimize re-identification risk. The metrics commonly used to determine if these systems are performing well do not accurately reflect the risk of a patient being re-identified. We therefore developed a framework for measuring the risk of re-identification associated with textual data releases. We apply the proposed evaluation framework to a data set from the University of Michigan Medical School. Our risk assessment results are then compared with those that would be obtained using a typical contemporary micro-average evaluation of recall in order to illustrate the difference between the proposed evaluation framework and the current baseline method. We demonstrate how this framework compares against common measures of the re-identification risk associated with an automated text de-identification process. For the probability of re-identification using our evaluation framework we obtained a mean value for direct identifiers of 0.0074 and a mean value for quasi-identifiers of 0.0022. The 95% confidence intervals for these estimates were below the relevant thresholds. The threshold for direct identifier risk was based on previously used approaches in the literature. The threshold for quasi-identifiers was determined based on the context of the data release following commonly used de-identification criteria for structured data. Our framework attempts to correct for poorly distributed evaluation corpora, accounts for the data release context, and avoids the often optimistic assumptions that are made using the more traditional evaluation approach. It therefore provides a more realistic estimate of the true probability of re-identification. This framework should be used as a basis for computing re-identification risk in order to more realistically evaluate future text de-identification tools.
An annotated corpus with nanomedicine and pharmacokinetic parameters.

Science.gov (United States)

Lewinski, Nastassja A; Jimenez, Ivan; McInnes, Bridget T

2017-01-01

A vast amount of data on nanomedicines is being generated and published, and natural language processing (NLP) approaches can automate the extraction of unstructured text-based data. Annotated corpora are a key resource for NLP and information extraction methods which employ machine learning. Although corpora are available for pharmaceuticals, resources for nanomedicines and nanotechnology are still limited. To foster nanotechnology text mining (NanoNLP) efforts, we have constructed a corpus of annotated drug product inserts taken from the US Food and Drug Administration's Drugs@FDA online database. In this work, we present the development of the Engineered Nanomedicine Database corpus to support the evaluation of nanomedicine entity extraction. The data were manually annotated for 21 entity mentions consisting of nanomedicine physicochemical characterization, exposure, and biologic response information of 41 Food and Drug Administration-approved nanomedicines. We evaluate the reliability of the manual annotations and demonstrate the use of the corpus by evaluating two state-of-the-art named entity extraction systems, OpenNLP and Stanford NER. The annotated corpus is available open source and, based on these results, guidelines and suggestions for future development of additional nanomedicine corpora are provided.
CROATIAN ADULT SPOKEN LANGUAGE CORPUS (HrAL)

Directory of Open Access Journals (Sweden)

Jelena Kuvač Kraljević

2016-01-01

Full Text Available Interest in spoken-language corpora has increased over the past two decades, leading to the development of new corpora and the discovery of new facets of spoken language. These types of corpora represent the most comprehensive data source about the language of ordinary speakers. Such corpora are based on spontaneous, unscripted speech defined by a variety of styles, registers and dialects. The aim of this paper is to present the Croatian Adult Spoken Language Corpus (HrAL), its structure and its possible applications in different linguistic subfields. HrAL was built by sampling spontaneous conversations among 617 speakers from all Croatian counties, and it comprises more than 250,000 tokens and more than 100,000 types. Data were collected during three time slots: from 2010 to 2012, from 2014 to 2015 and during 2016. HrAL is today available within TalkBank, a large database of spoken-language corpora covering different languages (https://talkbank.org), in the Conversational Analyses corpora within the subsection titled Conversational Banks. Data were transcribed, coded and segmented using the transcription format Codes for Human Analysis of Transcripts (CHAT) and the Computerised Language Analysis (CLAN) suite of programmes within the TalkBank toolkit. Speech streams were segmented into communication units (C-units) based on syntactic criteria. Most transcripts were linked to their source audios. TalkBank is public and free, i.e. all data stored in it can be shared by the wider community in accordance with the basic rules of TalkBank. HrAL provides information about spoken grammar and lexicon, discourse skills, error production and productivity in general. It may be useful for sociolinguistic research and studies of synchronic language changes in Croatian.
The Role of the Repeat in the Bear Feast in Traditional Khanty Culture

Directory of Open Access Journals (Sweden)

Anna A. Grinevich (Zorkoltseva)

2012-09-01

Full Text Available This paper is devoted to the role of the repeat in Khanty folklore. Songs of the bear feast served as the source material for the research. The author traces the role of the repeat at different text levels: structure, lexical level, and plot. The repeat is proposed as a fundamental method of traditional Khanty arts.
EL MOVIMIENTO INSTITUCIONALIZADO: DANZAS FOLKLÓRICAS ARGENTINAS, LA PROFESIONALIZACIÓN DE SU ENSEÑANZA / Institutionalized movement: professional education of argentine folk dances

Directory of Open Access Journals (Sweden)

María Belén Hirose

2010-12-01

Full Text Available In 1948 the National School of Folk Dances (Escuela Nacional de Danzas Folklóricas) was created as part of the Five-Year Plan of Juan D. Perón's first government (1946-1952). This began the professionalization of the transmission and dissemination of folk dances as national dances, a task that would be completed with the training of a body of national dance teachers. The process entailed establishing criteria for selecting and transforming those dances considered suitable for giving material form, through choreography, music, costume and events, to the feeling of nationality. Academic folklore studies, then consolidating as a scientific discipline, also served the national project by providing the criteria for creating the repertoire of dances that would be taught in Buenos Aires and later carried to the provinces. In this article we set out to describe the historical development that made possible the institutionalization of folk dance teaching in Argentina, and the effects of that institutionalization. We explore the role that various groups or individuals from the political, cultural and/or academic spheres assigned to the teaching of folk dances at the different stages of the construction and strengthening of the Argentine nation-state. Keywords: dance; folklore; Argentina; teaching; Peronism. Abstract: In 1948 Argentina’s National School of Folkloric Dances was created as a part of Juan D. Peron’s “Quinquennial Plan”, launched during his first administration. Thus, the transmission and diffusion of folkloric dances as national symbols began to be professionalized, the development of which was accomplished by the instruction of a troupe of national dance teachers. This process required a repertoire based on the selection and transformation of those dances considered to be adequate expressions of Argentine
Phrasing history: Selecting sources in digital repositories

NARCIS (Netherlands)

Huistra, Hieke; Mellink, Bram

2016-01-01

In recent years, mass digitization has opened up voluminous text corpora to human interpretation. Full-text search lets historians now find new sources that can change their understanding of thoroughly studied historical episodes. At the same time, it forces scholars to access historical sources in
Chemical Topic Modeling: Exploring Molecular Data Sets Using a Common Text-Mining Approach.

Science.gov (United States)

Schneider, Nadine; Fechner, Nikolas; Landrum, Gregory A; Stiefl, Nikolaus

2017-08-28

Big data is one of the key transformative factors which increasingly influences all aspects of modern life. Although this transformation brings vast opportunities it also generates novel challenges, not the least of which is organizing and searching this data deluge. The field of medicinal chemistry is not different: more and more data are being generated, for instance, by technologies such as DNA encoded libraries, peptide libraries, text mining of large literature corpora, and new in silico enumeration methods. Handling those huge sets of molecules effectively is quite challenging and requires compromises that often come at the expense of the interpretability of the results. In order to find an intuitive and meaningful approach to organizing large molecular data sets, we adopted a probabilistic framework called "topic modeling" from the text-mining field. Here we present the first chemistry-related implementation of this method, which allows large molecule sets to be assigned to "chemical topics" and the relationships between those topics to be investigated. In this first study, we thoroughly evaluate this novel method in different experiments and discuss both its disadvantages and advantages. We show very promising results in reproducing human-assigned concepts using the approach to identify and retrieve chemical series from sets of molecules. We have also created an intuitive visualization of the chemical topics output by the algorithm. This is a huge benefit compared to other unsupervised machine-learning methods, like clustering, which are commonly used to group sets of molecules. Finally, we applied the new method to the 1.6 million molecules of the ChEMBL22 data set to test its robustness and efficiency. In about 1 h we built a 100-topic model of this large data set in which we could identify interesting topics like "proteins", "DNA", or "steroids". Along with this publication we provide our data sets and an open-source implementation of the new method (CheTo) which
Web corpus construction

CERN Document Server

Schafer, Roland

2013-01-01

The World Wide Web constitutes the largest existing source of texts written in a great variety of languages. A feasible and sound way of exploiting this data for linguistic research is to compile a static corpus for a given language. There are several advantages of this approach: (i) Working with such corpora obviates the problems encountered when using Internet search engines in quantitative linguistic research (such as non-transparent ranking algorithms). (ii) Creating a corpus from web data is virtually free. (iii) The size of corpora compiled from the WWW may exceed by several orders of magnitude the size of language resources offered elsewhere. (iv) The data is locally available to the user, and it can be linguistically post-processed and queried with the tools preferred by her/him. This book addresses the main practical tasks in the creation of web corpora up to giga-token size. Among these tasks are the sampling process (i.e., web crawling) and the usual cleanups including boilerplate removal and rem...
PESQUISA EM EDUCAÇÃO: O WORDSMITH COMO FERRAMENTA DE EXPLORAÇÃO DE CORPORA

Directory of Open Access Journals (Sweden)

Maria Zuleide da Costa Pereira; Samara Wanderley Xavier Barbosa

2014-09-01

Full Text Available This text arises from the implementation of a project of the Institutional Scientific Initiation Scholarship Program (PIBIC) at UFPB, entitled “Os Sentidos do Currículo nas Escolas da Rede Municipal de Ensino de João Pessoa/PB” (The Meanings of Curriculum in the Schools of the Municipal Education Network of João Pessoa/PB) and carried out from 2013 to 2014. The aim of the plan/project is to highlight the role of the WordSmith Tools 6 software, as a corpus analysis tool, in exploring the meanings of education, curriculum and teaching in the curriculum documents analysed. These are the national curriculum policy documents (Law of Guidelines and Bases no. 9394/96, National Curriculum Parameters for Grades 1 to 4, General Curriculum Guidelines for Basic Education, and Curriculum Guidelines for Nine-Year Elementary Education) and the local ones (the Political-Pedagogical Projects of nine schools of the Municipal Education Network). We thus show the resources of the set of tools used, illustrating how they contributed to a more exact and reliable document analysis than other approaches to linguistic analysis would allow. Indeed, while facilitating the possible articulations among education, curriculum and teaching, the set of tools in question, as Sardinha (2004) argues, allowed us to analyse various aspects of language, such as lexical composition, the theme of the selected texts, and the rhetorical and compositional organization of discourse genres. Methodologically, we organized the documents under analysis into a set of computerized texts, so that they became suitable for the researcher to analyse, always bearing in mind the authenticity, readability and length of the texts, and the careful selection of the utterances that would make up the corpus. To undertake the analysis proper, we decided to use WordSmith Tools 6 and its three tools, Wordlist, Concord and Keywords, each
Del mito de “la silla peligrosa” a la leyenda urbana de la aguja escondida y el contagio del SIDA

Directory of Open Access Journals (Sweden)

Ángel J. Gonzalo Tobajas

2014-11-01

Full Text Available Although urban legends are considered paradigms of modern folklore, their origins can be traced to different cultures and different times. This article begins with different versions of a rumor that was spread through the internet and warned about possible AIDS infections from needles hidden in movie theater seats, and concludes by showing that this is a common motif in world literature: Greek mythology offers the versions of Theseus and Pirithous trapped in Hades or the Procrustean bed, and Arthurian literature the “Siege Perilous” of Galahad, for example. Furthermore, folklores as different as the Mayan of Yucatan or the Spanish of the village of Luciana (Ciudad Real) give due consideration to this topic.
Latent semantics of action verbs reflect phonetic parameters of intensity and emotional content

DEFF Research Database (Denmark)

Petersen, Michael Kai

2015-01-01

already in toddlers, this study explores whether articulatory and acoustic parameters may likewise differentiate the latent semantics of action verbs. Selecting 3 X 20 emotion, face, and hand related verbs known to activate premotor areas in the brain, their mutual cosine similarities were computed using...... latent semantic analysis LSA, and the resulting adjacency matrices were compared based on two different large scale text corpora; HAWIK and TASA. Applying hierarchical clustering to identify common structures across the two text corpora, the verbs largely divide into combined mouth and hand movements...... versus emotional expressions. Transforming the verbs into their constituent phonemes, and projecting them into an articulatory space framed by tongue height and formant frequencies, the clustered small and large size movements appear differentiated by front versus back vowels corresponding to increasing...
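The mutual cosine similarities mentioned in this record can be illustrated with a minimal sketch. The three-dimensional vectors below are made-up placeholders, not values from the study or from an actual LSA space, and the function names `cosine_similarity` and `similarity_matrix` are hypothetical.

```python
import math

def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    if len(u) != len(v):
        raise ValueError("vectors must have the same length")
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        raise ValueError("cosine similarity is undefined for zero vectors")
    return dot / (norm_u * norm_v)

# Hypothetical latent-space vectors for three verbs (illustration only).
verbs = {
    "smile": [0.9, 0.1, 0.2],
    "grin": [0.8, 0.2, 0.1],
    "grasp": [0.1, 0.9, 0.3],
}

def similarity_matrix(vectors):
    """Return the sorted word list and the matrix of pairwise cosine similarities."""
    names = sorted(vectors)
    matrix = [[cosine_similarity(vectors[a], vectors[b]) for b in names]
              for a in names]
    return names, matrix
```

A matrix of this kind is the usual input to hierarchical clustering; the clustering step itself is not shown here.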
Deconstruction the end of writing: 'Everything is a text, there is nothing outside context'

Directory of Open Access Journals (Sweden)

Gavin P. Hendricks

2016-03-01

Language is a constant movement of differences and everything acquires the instability and ambiguity inherent in language (Callinicos 2004). The implications of Derrida's reading based on his work Of Grammatology (1976) have impacted everything in the humanities and social sciences, including law, anthropology, linguistics and gender studies, as the meaning of the text is not only inscribed in the sign (signifier and the signified), but everything is a 'text' and meaning and representation are how we interpret it. Intradisciplinary and/or interdisciplinary implications: Derrida sought to subvert the 'sign' in structuralism, as it opens the door to dialogue with the socially constructed 'Other' in relation to the 'sign' and the false consciousness construction of the text by the West. This challenges the existing interpretive paradigm and opens oral and written dialogue of the text for the 'other' in terms of the meaning and representation of the oral text, the oral archival memory of the other, indigenous knowledge systems, African rituals, folklore, storytelling and verbal arts.
Film Animasi Malaysia: Narasi Verbal ke Visual

Directory of Open Access Journals (Sweden)

Ahmad Nizam bin Othman

2009-03-01

Full Text Available This paper looks into the approaches and educational problems of verbal narration in traditional society, and into how legends and folklore are constructed and preserved in animated form. It identifies the icons, structures and methods used in animated film to convey a cultural understanding of legends and folklore and, drawing on culture theory, the forms, icons and meanings attached to local folklore and translated into animation. This case study collects data through documentation, including interviews, observation and visual analysis of an animation collection. The data analysis concentrates on technique and the overall animation process, focusing on its impact on culture and local society. Legends and folklore are part of traditional Malay culture that has been spread and handed down from one generation to another. Fairy tales, myths, and extraordinary and miraculous stories from the early centuries of human evolution are now represented in a new form that involves the visual and other human senses. The paper discusses whether the understanding and imagination of the new generation remain the same as in earlier centuries regarding the beauty of Mahsuri, the extraordinary and spirited Hang Tuah, and the kindheartedness of Bawang Putih, and how people accept and preserve traditions and cultural customs while adapting and updating them in keeping with the maturity of civilization. The outcome is expected to contribute to preserving culture and tradition.
Inquérito sôbre a incidência da esquistossomose mansônica entre indivíduos interessados em ingressar em corporação militar do Estado de São Paulo: considerações sôbre a referida verminose como causa de rejeição de candidatos a empregos

Directory of Open Access Journals (Sweden)

Vicente Amato Neto

1970-10-01

Full Text Available In several regions of Brazil, various institutions reject job applicants who have schistosomiasis mansoni without taking into account the stage of the infection. Concerned with this issue, and in order to gather, by way of example, objective information on a practical aspect of it, the authors conducted a survey among 601 people interested in joining a military corporation in the city of São Paulo, based on the intradermal test for diagnosing the helminthiasis. They recorded a positivity rate of 13.3%, which they consider highly significant and reflective of a concrete situation that deserves emphatic consideration in view of its many implications, among them social, economic and medical ones.
Resurrection, revenance, and exhumation: the problematics of the dead body in songs and laments

Directory of Open Access Journals (Sweden)

Madis Arukask

2011-01-01

Full Text Available Different types of folklore texts differ from each other by their function. We can distinguish between genres meant to be believed (like legend) and genres recognized in advance as fiction (fairy-tale). At the same time, textual fiction may also have served practical purposes, such as the telling of fairy-tales during the late autumn and early winter for purposes of fertility magic, as used to be the case in the Estonian folk tradition. There are folklore genres that have functioned, among other things, as an accompaniment, comment on, or support to rituals or practices being carried out: for instance, an incantation during a cure, or a lament in death-related procedures, when a person must be separated from his familiar environment. The same textual formulae fulfil different tasks in different genres, which means that they also carry a different meaning. The present paper considers some themes related to the bodily aspect of humanity in various genres of folklore, particularly in songs and laments, as well as in practices related to death and commemoration. As expected, the problems connected with the human body have in these genres undergone transformations of meaning, the understanding and interpretation of which may vary considerably. The material discussed in the article derives mainly from the Balto-Finnic and north Russian cultural area, partly from the author's own experience during his field trips.
Revisiting corpus creation and analysis tools for translation tasks

Directory of Open Access Journals (Sweden)

Claudio Fantinuoli

2016-04-01

Many translation scholars have proposed the use of corpora to allow professional translators to produce high quality texts which read like originals. Yet, the diffusion of this methodology has been modest, one reason being that software for corpus analysis has been developed with the linguist in mind, which means that it is generally complex and cumbersome, offering many advanced features, but lacking the level of usability and the specific features that meet translators’ needs. To overcome this shortcoming, we have developed TranslatorBank, a free corpus creation and analysis tool designed for translation tasks. TranslatorBank supports the creation of specialized monolingual corpora from the web; it includes a concordancer with a query system similar to a search engine; it uses basic statistical measures to indicate the reliability of results; it accesses the original documents directly for more contextual information; it includes a statistical and linguistic terminology extraction utility to extract the relevant terminology of the domain and the typical collocations of a given term. Designed to be easy and intuitive to use, the tool may help translation students as well as professionals to increase their translation quality by adhering to the specific linguistic variety of the target text corpus.
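As a rough illustration of what a concordancer does, the sketch below builds keyword-in-context (KWIC) lines from a plain string. It is not TranslatorBank's implementation: the function name `kwic`, the `width` parameter and the simple lowercase word tokenization are all assumptions made for this example.

```python
import re

def kwic(text, keyword, width=3):
    """Return (left context, match, right context) for each occurrence of keyword.

    Tokens are lowercased word characters; `width` is the number of
    context tokens kept on each side of the match.
    """
    tokens = re.findall(r"\w+", text.lower())
    target = keyword.lower()
    lines = []
    for i, tok in enumerate(tokens):
        if tok == target:
            left = " ".join(tokens[max(0, i - width):i])
            right = " ".join(tokens[i + 1:i + 1 + width])
            lines.append((left, tok, right))
    return lines

# Print every occurrence of "corpus" with two tokens of context per side.
for left, match, right in kwic("The corpus is large. A corpus helps translators.",
                               "corpus", width=2):
    print(f"{left:>20} [{match}] {right}")
```

A real tool would add lemmatization, sorting by context and links back to the source documents, as the abstract describes.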
The tradition of intertextuality in the author's tale of the material of the Italian language

Directory of Open Access Journals (Sweden)

Маргарита Евгеньевна Каскова

2012-03-01

Full Text Available The article discusses the borrowing and interpretation of folklore stories by Italian authors. The tradition of intertextuality in the author's tale can be traced to ancient times and is examined using the works of the writers Fedro, S. Venni, C. Collodi and G. Rodari as examples.
Teaching Spanish Pragmatics Through Colloquial Conversations

Directory of Open Access Journals (Sweden)

Albelda Marco, Marta

2017-11-01

Full Text Available This paper focuses on the advantages of teaching and learning a foreign language with and through spoken discursive corpora, especially colloquial and conversational ones. The benefits of developing oral competence and communicative skills in language learners using colloquial conversations are presented and discussed. First, we characterise the colloquial conversation and the features that define this register and discursive genre. As the most natural and original way for human beings to communicate, the colloquial conversation is the most common means of communication, and this genre should therefore have a greater presence in foreign-language classrooms. Secondly, we expound on the advantages of teaching with corpora of colloquial conversations, particularly those resulting from their contextualisation (the linguistic input is learnt in its real and authentic context) and from their oral and conversational features (prosodic elements and interactional mechanisms). Thirdly, the paper provides a list of corpora of colloquial conversations that are available in Spanish, focusing on the Val.Es.Co. colloquial corpus (peninsular Spanish oral corpus, Briz et al., 2002; Cabedo & Pons online, www.valesco.es). Finally, a set of pragmatic applications of corpora in the foreign-language classroom is offered, in particular using the Val.Es.Co. colloquial corpus: functions of discourse markers and interjections (whose meanings change depending on the context), turn-taking strategies, ways of introducing new topics in dialogues, mechanisms for keeping or “stealing” the turn, devices for introducing direct speech, attitudes expressed by falling and rising intonation, hedges and intensifiers, and so on. In general, this paper aims to offer ideas, resources and materials to make students more competent in communication using authentic discursive oral corpora.

Penile alterations at early stage of type 1 diabetes in rats

Directory of Open Access Journals (Sweden)

Mingfang Tao

Full Text Available ABSTRACT Objective: Diabetes affects erectile function significantly. However, penile alterations in the early stage of diabetes in experimental animal models have not been well studied. We examined the changes of the penis and its main erectile components in diabetic rats. Materials and methods: Male Sprague-Dawley rats were divided into 2 groups: streptozotocin (STZ)-induced diabetics and age-matched controls. Three or nine weeks after diabetes induction, the penis was removed for immunohistochemical staining of smooth muscle and neuronal nitric oxide synthase (nNOS) in midshaft penile tissues. The cross-sectional areas of the whole midshaft penis and the corpora cavernosa were quantified. The smooth muscle in the corpora cavernosa and nNOS in the dorsal nerves were quantified. Results: The weight, but not the length, of the penis was lower in diabetics. The cross-sectional areas of the total midshaft penis and the corpora cavernosa were lower in diabetic rats than in controls 9 weeks, but not 3 weeks, after diabetes induction. The cross-sectional area of smooth muscle in the corpora cavernosa, as a percentage of the overall area of the corpora cavernosa, was lower in diabetic rats than in controls 9 weeks, but not 3 weeks, after diabetes induction. The percentage change of nNOS in the dorsal nerves was similar at 3 weeks and showed a decreasing trend at 9 weeks in diabetic rats compared with controls. Conclusions: Diabetes causes temporal alterations in the penis, and the significant changes in the STZ rat model begin 3-9 weeks after induction. Further studies on the reversibility of the observed changes are warranted.
Comment exploiter les 'corpus-surprise' ?

Directory of Open Access Journals (Sweden)

Rittaud-Hutinet, Chantal

2009-01-01

Full Text Available To what extent can non-recorded oral corpora constitute objects of analysis of pragmatic meaning? These corpora are heard by chance: on the radio, on television, in the street, in a shop, on a means of transport or, more generally, in any conversational interaction in which the linguist participates but which he had not previously planned to record for his research. The problem of using these corpora in linguistics is all the more crucial since the aim, in phonopragmatics, is to discover the functions and significations of their phonic part. I shall attempt to answer the following questions: (1) The accuracy of the transcription with respect to the original. To what extent can we ignore our own phonological code, our regional variants, and styles of speech that we have mastered or only partly know? (2) The reliability of the oral reproduction carried out by the linguist, for example during a talk at a conference. What is his capacity for deferred mimicry? (3) The relation between a significant discrepancy and the elocutionary habits of the speaker. (4) The relation between the comprehension of the external auditors and the effect produced on the 'real' person addressed. Considering that transparency is (sometimes? often?) an illusion, I shall also examine what precautions should be taken so that these corpora offer guarantees as to their veracity.
Directed Activities Related to Text: Text Analysis and Text Reconstruction.

Science.gov (United States)

Davies, Florence; Greene, Terry

This paper describes Directed Activities Related to Text (DART), procedures that were developed and are used in the Reading for Learning Project at the University of Nottingham (England) to enhance learning from texts and that fall into two broad categories: (1) text analysis procedures, which require students to engage in some form of analysis of…
FORGOTTEN PAGES FROM ION MACOVEI’S CREATION

Directory of Open Access Journals (Sweden)

COCEAROVA GALINA

2015-09-01

Full Text Available This article is devoted to Ion Macovei’s creation, namely, to his Symphonieta Florile dalbe (Lily-White Flowers) in the version for string orchestra realized by the conductor Dumitru Goia. The author presents some information about the history of this creation and analyses the six-part cycle of the Symphonieta, where every movement has its own title in Latin. The character of these miniatures, which are closely connected with Moldovan calendar folklore and, first of all, with the carol genre, determined the use of both archetypes of the monodic melody and of the early heterophony principles, as well as the mode systems characteristic of regional musical folklore. There is also an analysis of the methods of orchestration used by Dumitru Goia in the score of the Symphonieta.
Implications of rural tourism and agritourism in sustainable rural development

Directory of Open Access Journals (Sweden)

Flavia-Lorena Cut-Lupulescu

2014-10-01

Full Text Available Romania offers a variety of historical and cultural values (folk art, ethnography, folklore, traditions, historical artifacts) harmoniously combined with a varied and picturesque natural landscape. All these are facets of Romanian rural tourism in particular. Having emerged and developed across various landforms since Thracian-Dacian times, Romanian rural settlements have kept, and to a large extent still keep, ancient customs and traditions, a rich and varied folklore, and original ethnographic and folk elements that can be exploited for tourism in a strategy for the organization and development of rural tourism. Rural tourism in our country has always been practised, but in a spontaneous, sporadic, random and mostly unorganized form; its beginnings date from the 1920s and 1930s, with the occasional accommodation of visitors by the inhabitants of rural settlements.
Katalog démonologických pověstí, žánr a strukturální naratologie: literárněteoretické poznámky k folkloristickému dílu

Czech Academy of Sciences Publication Activity Database

Šidák, Pavel

2016-01-01

Roč. 64, č. 3 (2016), s. 408-418 ISSN 0009-0468 Institutional support: RVO:68378068 Keywords : folklore studies * literary studies * methodology * catalogue of folklore material Subject RIV: AJ - Letters, Mass-media, Audiovision
Indian English Evolution and Focusing Visible Through Power Laws

Directory of Open Access Journals (Sweden)

Vineeta Chand

2017-11-01

Full Text Available New dialect emergence and focusing in language contact settings is difficult to capture and date in terms of global structural dialect stabilization. This paper explores whether diachronic power law frequency distributions can provide evidence of dialect evolution and new dialect focusing, by considering the quantitative frequency characteristics of three diachronic Indian English (IE) corpora (1970s–2008). The results demonstrate that IE consistently follows power law frequency distributions and that the corpora are each best fit by Mandelbrot’s Law. Diachronic changes in the constants are interpreted as evidence of lexical and syntactic collocational focusing within the process of new dialect formation. Evidence of new dialect focusing is also visible through apparent time comparison of spoken and written data. Age and gender-separated sub-corpora of the most recent corpus show minimal deviation, providing apparent time evidence for emerging IE dialect stability. From these findings, we extend the interpretation of diachronic changes in the β coefficient (as indicative of changes in the degree of synthetic/analytic structure) so that β is also sensitive to grammaticalization and changes in collocational patterns.
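As an illustration of the kind of fit this abstract describes, Mandelbrot's generalization of Zipf's law is commonly written f(r) = C / (r + β)^α, where f(r) is the frequency of the word at rank r. The sketch below is not the authors' procedure: the parameter names, the grid search over β and the log-space least-squares fit are all assumptions chosen to keep the example short and dependency-free.

```python
import math
from collections import Counter


def rank_frequencies(tokens):
    """Return token frequencies sorted from most to least frequent."""
    return sorted(Counter(tokens).values(), reverse=True)


def fit_zipf_mandelbrot(freqs, betas=None):
    """Fit log f = log C - alpha * log(rank + beta) to ranked frequencies.

    Assumed approach (not taken from the paper): for each candidate beta on a
    grid, alpha and log C are obtained by ordinary least squares in log space;
    the beta giving the smallest squared error is returned.
    Requires at least two frequencies, all of them positive.
    """
    if betas is None:
        betas = [i * 0.1 for i in range(0, 101)]  # hypothetical grid 0.0 .. 10.0
    ys = [math.log(f) for f in freqs]
    n = len(ys)
    y_mean = sum(ys) / n
    best = None
    for beta in betas:
        xs = [math.log(rank + beta) for rank in range(1, n + 1)]
        x_mean = sum(xs) / n
        sxx = sum((x - x_mean) ** 2 for x in xs)
        sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
        slope = sxy / sxx
        intercept = y_mean - slope * x_mean
        sse = sum((y - (intercept + slope * x)) ** 2 for x, y in zip(xs, ys))
        if best is None or sse < best[0]:
            best = (sse, -slope, beta, math.exp(intercept))
    _, alpha, beta, c = best
    return alpha, beta, c
```

Calling `fit_zipf_mandelbrot(rank_frequencies(tokens))` on the token list of each corpus would give one (α, β, C) triple per period, which could then be compared diachronically. A log-space fit weights rare words heavily, so a maximum-likelihood fit may be preferable for real corpora.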
Lidový hudebně-taneční projev jako scénický tvar - folklorní soubory v českých zemích v péči jedné instituce

Czech Academy of Sciences Publication Activity Database

Stavělová, Daniela

2015-01-01

Roč. 25, č. 4 (2015), s. 292-306 ISSN 0862-8351 Institutional support: RVO:68378076 Keywords : folk dance * folk music * stage * show * institution * folklore ensemble * folklore stylization Subject RIV: AC - Archeology, Anthropology, Ethnology
Když o víně, tak povinně!

Czech Academy of Sciences Publication Activity Database

Tyllner, Lubomír

2012-01-01

Roč. 23, č. 11 (2012), s. 32 ISSN 1210-7972 Institutional support: RVO:68378076 Keywords : folklore * folklorism * folk music * folk dance * festival of folk music * Competition festival Subject RIV: AC - Archeology, Anthropology, Ethnology
La revista Demófilo y la antropología cultural en Andalucía

Directory of Open Access Journals (Sweden)

Rodríguez Becerra, Salvador

2002-06-01

Full Text Available Published since 1987 by the Machado Foundation, of Seville, Demófilo represents the resumption of the scientific folklore studies of the late 19th century championed by Antonio Machado y Álvarez and published in the journal El Folk-lore Andaluz. The basic objectives of Demófilo are «to rescue, analyze and divulge Andalusia's traditional culture.» The journal is oriented toward high schools, colleges and universities in Andalusia as well as any institution interested in the traditional culture of the area, such as museums and organizations supporting Andalusia's cultural heritage. The authors also include in this article a summary of the journal's monographic issues published to date.

Publicación de la Fundación Machado, de Sevilla, y nacida en 1987, Demófilo enlaza con su predecesora en el siglo XIX, El Folk-lore Andaluz, del movimiento sevillano de estudio científico del folklore que animara sobre todo Antonio Machado y Álvarez. Demófilo tiene como objetivos básicos «rescatar, analizar y difundir la cultura tradicional andaluza» y va dirigida especialmente a centros de enseñanza media y universidades de Andalucía, así como a instituciones interesadas en la cultura tradicional de la región, como museos y asociaciones de defensa del patrimonio cultural. Los autores incluyen también en este artículo un resumen de los números monográficos de la revista publicados hasta la fecha.
Spoken language identification system adaptation in under-resourced environments

CSIR Research Space (South Africa)

Kleynhans, N

2013-12-01

Full Text Available Speech Recognition (ASR) systems in the developing world is severely inhibited. Given that few task-specific corpora exist and speech technology systems perform poorly when deployed in a new environment, we investigate the use of acoustic model adaptation...
Transformace folklorní látky v umělecké literatuře : případ démonologické pověsti a díla K. V. Raise

Czech Academy of Sciences Publication Activity Database

Šidák, Pavel

2015-01-01

Roč. 62, č. 5 (2015), s. 707-723 ISSN 0009-0468 Institutional support: RVO:68378068 Keywords : folklore * demonological tale * transformation of folklore material * intertextuality * poetics * K. V. Rais Subject RIV: AJ - Letters, Mass-media, Audiovision
Challenges to Issues of Balance and Representativeness in African Lexicography

Directory of Open Access Journals (Sweden)

Thapelo Joseph Otlogetswe

2011-10-01

Full Text Available
Abstract: Modern dictionaries depend on corpora of different sizes and types for frequency listings, concordances and collocations, illustrative sentences and grammatical information. With the help of computer software, retrieving such information has increasingly become relatively easy. However, the quality of retrieved information for lexicographic purposes depends on the information input at the stage of corpus construction. If corpora are not representative of the different language usages of a speech community, they may prove to be unreliable sources of lexicographic information. There are, however, issues in African languages which make many African corpora questionable. These issues include a lack of texts of different genres, the unavailability of balanced and representative written texts, a complete absence of spoken texts as well as literacy problems in African societies. This article therefore explores the different challenges to the construction of reliable corpora in African languages. It argues that African languages face peculiar challenges and corpus research may require a different treatment compared to European and American corpus research. It finally concludes that issues of balance and representativeness appear theoretically impossible when looking at the results of sociolinguistic research on the different existing language varieties which are difficult to represent accurately in a corpus.
Keywords: AFRICAN LANGUAGES, BALANCE, BANK OF ENGLISH, BORROWING, BRITISH NATIONAL CORPUS, COBUILD, CODE-SWITCHING, COMPUTERS, CORPORA, DIALECT, DICTIONARIES, FREQUENCY, LANGUAGE VARIETY, REPRESENTATIVENESS, SETSWANA, SOCIOLINGUISTICS, SPEECH, TEXT
Opsomming: Uitdagings betreffende kwessies van balans en verteenwoordigendheid in Afrikaleksikografie. Moderne woordeboeke steun op korpusse van verskillende groottes en soorte vir frekwensielyste, konkordansies en kollokasies, voorbeeldsinne en taalkundige inligting. Met die hulp van
Proper Names and Named Entities Recognition in the Automatic Text Processing. Review of the book: Nouvel, D., Ehrmann, M., & Rosset, S. (2016). Named Entities for Computational Linguistics. London; Hoboken: ISTE Ltd; John Wiley & Sons, Inc., 2016.

Directory of Open Access Journals (Sweden)

Daria M. Golikova

2018-03-01

Full Text Available The reviewed book by Damien Nouvel, Maud Ehrmann, and Sophie Rosset, Named Entities for Computational Linguistics, deals with the automatic processing of texts written in a natural language and with named entity recognition, aimed at extracting the most important information in these texts. The notion of named entities here extends to the entire set of linguistic units referring to an object. The researchers consider the concept of named entities in detail, juxtaposing this category with that of proper names and comparing their definitions, and describe all the stages of creating and implementing automatic text annotation algorithms, as well as different ways of evaluating their performance. Proper names, in this context, are seen as a particular instance of named entities, one of the typical sources of reference to real objects to be electronically recognized in the text. The book provides a detailed overview and analysis of previous studies in the same field, based mainly on English language data. It presents the instruments and resources required to create and implement the algorithms in question; these may include typologies, knowledge bases or databases, and various types of corpora. The theoretical considerations proposed by the authors are supported by a significant number of example cases, with the algorithms' operating principles presented in charts. The reviewed book gives quite a comprehensive picture of modern computational linguistic studies focused on named entity recognition and indicates some problems which are as yet unresolved.
When POS datasets don’t add up: Combatting sample bias

DEFF Research Database (Denmark)

Hovy, Dirk; Plank, Barbara; Søgaard, Anders

2014-01-01

Several works in Natural Language Processing have recently looked into part-of-speech (POS) annotation of Twitter data and typically used their own data sets. Since conventions on Twitter change rapidly, models often show sample bias. Training on a combination of the existing data sets should help...... overcome this bias and produce more robust models than any trained on the individual corpora. Unfortunately, combining the existing corpora proves difficult: many of the corpora use proprietary tag sets that have little or no overlap. Even when mapped to a common tag set, the different corpora...
A Customizable Text Classifier for Text Mining

Directory of Open Access Journals (Sweden)

Yun-liang Zhang

2007-12-01

Full Text Available Text mining deals with complex and unstructured texts. Usually a particular collection of texts that is specific to one or more domains is necessary. We have developed a customizable text classifier for users to mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by the user. The user can also control the number of domains chosen and decide the standard by which to choose the texts, based on demand and the abundance of materials. The performance of the classifier varies with the user's choice.
Scandinavian belief in fate

Directory of Open Access Journals (Sweden)

Åke Ström

1967-02-01

Full Text Available In point of principle, Christianity does not give room for any belief in fate. Astrology, horoscopes, divination, etc., are strictly rejected. Belief in fate never disappeared in Christian countries, nor did it in Scandinavia in Christian times. Especially in folklore we can find it at any period: People believed in an implacable fate. All folklore is filled up with this belief in destiny. Nobody can escape his fate. The future lies in the hands of fate, and the time to come takes its form according to inscrutable laws. The pre-Christian period in Scandinavia, dominated by pagan Norse religion, and the secularized epoch of the 20th century, however, show more distinctive and more widespread beliefs in fate than does the Christian period. The present paper makes a comparison between these forms of belief.
Pastourelle et folklore

OpenAIRE

Dumas, René

2014-01-01

In the state in which it has come down to us, Béroul's Tristan offers only a few evocations of the town and of the dwelling in which the various characters move. Yet a good number of the key moments of the romance take place precisely in an urban setting: the scene of Iseut's walk to her execution, for example, or that of the feast celebrating her return to King Marc. We shall therefore try to specify the living environment in which the adventures of Tristan and Iseut are set and to investigate l'ordre urb...
Folklorismus v historických souvislostech let 1945-1989 (na příkladu folklorního hnutí v České republice)

Czech Academy of Sciences Publication Activity Database

Pavlicová, M.; Uhlíková, Lucie

2008-01-01

Roč. 18, č. 4 (2008), s. 187-197 ISSN 0862-8351 Institutional research plan: CEZ:AV0Z90580513 Keywords : Folklorism * Folklore Movement * Real Socialism * New Songs * Ideology * Censorship * Self-censorship * Personal Motivation Subject RIV: AC - Archeology, Anthropology, Ethnology
ДО ІСТОРІЇ ЗБЕРЕЖЕННЯ ТВОРЧОЇ СПАДЩИНИ М. В. ЛИСЕНКА (музейний аспект)

Directory of Open Access Journals (Sweden)

І. П. Якубовський

2011-10-01

Full Text Available Drawing on museum and archive sources, exhibits and expositions, the author of this article aims to show the history of the creative heritage of Mykola Vitalijovych Lysenko, a unique personality of genius, the founder of Ukrainian classical music and an artist who throughout his life collected and compiled Ukrainian folklore.

EXPLORING THEORETICAL FUNCTIONS OF CORPUS DATA IN TEACHING TRANSLATION

OpenAIRE

Poirier, Éric

2016-01-01

Abstract As language referential data banks, corpora are instrumental in the exploration of translation solutions in bilingual parallel texts or conventional usages of source or target language in monolingual general or specialized texts. These roles are firmly rooted in translation processes, from analysis and interpretation of source text to searching for an acceptable equivalent and integrating it into the production of the target text. Provided the creative and not the conservative way be...
Naming Disney's Dwarfs.

Science.gov (United States)

Sidwell, Robert T.

1980-01-01

Discusses Disney's version of the folkloric dwarfs in his production of "Snow White" and weighs the Disney rendition of the dwarf figure against the corpus of traits and behaviors pertaining to dwarfs in traditional folklore. Concludes that Disney's dwarfs are "anthropologically true." (HOD)
Texting while driving: is speech-based text entry less risky than handheld text entry?

Science.gov (United States)

He, J; Chaparro, A; Nguyen, B; Burge, R J; Crandall, J; Chaparro, B; Ni, R; Cao, S

2014-11-01

Research indicates that using a cell phone to talk or text while maneuvering a vehicle impairs driving performance. However, few published studies directly compare the distracting effects of texting using a hands-free (i.e., speech-based interface) versus handheld cell phone, which is an important issue for legislation, automotive interface design and driving safety training. This study compared the effect of speech-based versus handheld text entries on simulated driving performance by asking participants to perform a car following task while controlling the duration of a secondary text-entry task. Results showed that both speech-based and handheld text entries impaired driving performance relative to the drive-only condition by causing more variation in speed and lane position. Handheld text entry also increased the brake response time and increased variation in headway distance. Text entry using a speech-based cell phone was less detrimental to driving performance than handheld text entry. Nevertheless, the speech-based text entry task still significantly impaired driving compared to the drive-only condition. These results suggest that speech-based text entry disrupts driving, but reduces the level of performance interference compared to text entry with a handheld device. In addition, the difference in the distraction effect caused by speech-based and handheld text entry is not simply due to the difference in task duration. Copyright © 2014 Elsevier Ltd. All rights reserved.
Profiling vocabulary in psychology journal abstracts: A comparison between Iranian and Anglo-American journals

Directory of Open Access Journals (Sweden)

Is’haaq Akbarian

2017-01-01

Full Text Available Lexical profiling has yielded fruitful results for language description and pedagogy (Liu, 2014), and has particularly highlighted the significance of academic vocabulary for EFL learners in this process. This investigation, likewise, attempts to comparatively profile the vocabulary, more particularly the academic vocabulary, in the ‘abstract’ section of scholarly articles in Iranian and Anglo-American refereed journals in psychology. The Iranian journals under study publish articles in Persian but also include an English abstract, whereas the Anglo-American journals publish papers in English. For this purpose, a corpus (consisting of 307,126 words, with two sub-corpora of almost similar size and characteristics) was collected from Iranian and Anglo-American journals and analyzed with the software Range. The analyses show a coverage of over 15 percent and the use of over 500 words of the Academic Word List (AWL) in both the Iranian and the Anglo-American sub-corpora. However, there are variations in academic and nonacademic vocabulary use in abstracts across the two sub-corpora. Most of the academic words used belong to the beginning AWL sub-lists. Pedagogical implications are drawn for reading and writing, particularly in EAP contexts.
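The two figures reported in this abstract, the percentage of running words covered by a word list and the number of distinct list words used, can be illustrated with a minimal sketch. It is not the Range program: the tokenizer, the function names and the tiny word list in the usage note are hypothetical, and the sketch matches exact word forms only, whereas word-list profiling usually groups inflected and derived forms into word families.

```python
import re
from collections import Counter


def tokenize(text):
    """Lowercase the text and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+(?:'[a-z]+)?", text.lower())


def coverage(tokens, word_list):
    """Return (percent of running words in word_list, distinct list words used).

    word_list is any set of lowercase word forms; a real study would load the
    full list from a file rather than hard-code it.
    """
    counts = Counter(tokens)
    total = sum(counts.values())
    if total == 0:
        return 0.0, 0
    hits = {word: n for word, n in counts.items() if word in word_list}
    return 100.0 * sum(hits.values()) / total, len(hits)
```

For example, `coverage(tokenize(abstract_text), {"analysis", "data", "research"})` would return the coverage percentage and the number of distinct matching forms for one abstract; running it separately on each sub-corpus would give the kind of comparison the study reports.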
Human language reveals a universal positivity bias.

Science.gov (United States)

Dodds, Peter Sheridan; Clark, Eric M; Desu, Suma; Frank, Morgan R; Reagan, Andrew J; Williams, Jake Ryland; Mitchell, Lewis; Harris, Kameron Decker; Kloumann, Isabel M; Bagrow, James P; Megerdoomian, Karine; McMahon, Matthew T; Tivnan, Brian F; Danforth, Christopher M

2015-02-24

Using human evaluation of 100,000 words spread across 24 corpora in 10 languages diverse in origin and culture, we present evidence of a deep imprint of human sociality in language, observing that (i) the words of natural human language possess a universal positivity bias, (ii) the estimated emotional content of words is consistent between languages under translation, and (iii) this positivity bias is strongly independent of frequency of word use. Alongside these general regularities, we describe interlanguage variations in the emotional spectrum of languages that allow us to rank corpora. We also show how our word evaluations can be used to construct physical-like instruments for both real-time and offline measurement of the emotional content of large-scale texts.
Tunical Outer Layer Plays an Essential Role in Penile Veno-occlusive Mechanism Evidenced from Electrocautery Effects to the Corpora Cavernosa in Defrosted Human Cadavers.

Science.gov (United States)

Hsieh, Cheng-Hsing; Huang, Yi-Ping; Tsai, Mang-Hung; Chen, Heng-Shen; Huang, Po-Cheng; Lin, Chung-Wu; Hsu, Geng-Long

2015-12-01

To determine the exact anatomical structure for establishing penile veno-occlusive function, we sought to conduct a hemodynamic study on defrosted human cadavers. Thirteen penises were used for this experiment, and 11 intact penises were allocated into the electrocautery group (EG, n = 6) and the ligation group (LG, n = 5). A circumcision was made on the penis to access the veins. Two #19 scalp needles were fixed in the 3 and 9 o'clock positions in the distal penis for colloid infusion and intracavernous pressure (ICP) monitoring, respectively. For the EG, the deep dorsal vein and cavernosal vein trunks were freed for 3-5 cm where at least 3 emissary veins were identified via opening Buck's fascia; these veins underwent electrocautery at 45 watts, while the ICP was maintained at 0, 50, 75, 100, 125, and 150 mmHg, respectively. For control, venous ligation was made but at the ICP of 150 mmHg. A tissue block including the emissary vein was then obtained for histological analysis. Except all in the EG and those whose ICP exceed 125 mmHg in the EG, the sinusoids of the corpora cavernosa sustained varied fulgurated fibrosis in every specimen and the severity appeared reversely commensurate with the ICP regarding sinusoidal clumping and darkish bands (P electrocautery damage to intracavernous sinusoids once the ICP reached a level corresponding to a rigid erection. The outer tunica plays an essential role in fulfilling the veno-occlusive mechanism. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.
Efficient data selection for ASR

CSIR Research Space (South Africa)

Kleynhans, NT

2014-10-01

Full Text Available the deployment of ASR systems in the developing world is severely inhibited. One approach to assist with resource-scarce ASR system development, is to select ‘‘useful’’ training samples which could reduce the resources needed to collect new corpora. In this work...
ANALYSIS OF SPECIALISED COLLOCATIONS IN THE AREA OF REMOTE SENSING IN THE PERSPECTIVE OF PHRASEOLOGY

Directory of Open Access Journals (Sweden)

Diva Cardoso de CAMARGO

2013-12-01

Full Text Available The aim of this research is to build and analyze a parallel corpus in the field of remote sensing in order to identify, according to their frequency, specialized collocations in English and then search for their equivalents in Portuguese. The research is based on the interdisciplinary approach of Corpus-Based Translation Studies (BAKER, 1995; CAMARGO, 2007), Corpus Linguistics (BERBER SARDINHA, 2004; TOGNINI-BONELLI, 2001), Phraseology (ORENHA-OTTAIANO, 2009; PAVEL, 1993), and some principles of Terminology (BARROS, 2004). For manipulating the corpora, the program WordSmith Tools (SCOTT, 2012), version 6.0, is used. To support this study, two comparable corpora in English and Portuguese were also built from articles published in both national and international journals in remote sensing. The results show that the collocations in Portuguese seem to be still in the process of conventionalization, as the translators made use of greater variation in their translational options, which can be a way to make the text clearer for the reader.
Important Text Characteristics for Early-Grades Text Complexity

Science.gov (United States)

Fitzgerald, Jill; Elmore, Jeff; Koons, Heather; Hiebert, Elfrieda H.; Bowen, Kimberly; Sanford-Moore, Eleanor E.; Stenner, A. Jackson

2015-01-01

The Common Core set a standard for all children to read increasingly complex texts throughout schooling. The purpose of the present study was to explore text characteristics specifically in relation to early-grades text complexity. Three hundred fifty primary-grades texts were selected and digitized. Twenty-two text characteristics were identified…
Frank Wollman v kontextu strukturální teorie a terénních výzkumů slovesného folkloru (Příspěvek k nálezu tzv. wollmanovského moravského sběru)

Czech Academy of Sciences Publication Activity Database

Zelenková, Anna

2017-01-01

Roč. 104, č. 4 (2017), s. 493-509 ISSN 0009-0794 Institutional support: RVO:68378017 Keywords : Wollman, Frank * Czech and Slovak folklore studies * the structural theory of folklore * Moravian collection of folk verbal art Subject RIV: AJ - Letters, Mass-media, Audiovision OBOR OECD: Literary theory
Reduction corporoplasty.

Science.gov (United States)

Hakky, Tariq S; Martinez, Daniel; Yang, Christopher; Carrion, Rafael E

2015-01-01

Here we present the first video demonstration of reduction corporoplasty in the management of phallic disfigurement in a 17-year-old man with a history of sickle cell disease and priapism. Surgical management of aneurysmal dilation of the corpora has yet to be defined in the literature. We performed bilateral elliptical incisions over the lateral corpora as management of aneurysmal dilation of the corpora to correct phallic disfigurement. The patient tolerated the procedure well and has resolution of his corporal disfigurement. Reduction corporoplasty using bilateral lateral elliptical incisions in the management of aneurysmal dilation of the corpora is a safe and feasible operation in the management of phallic disfigurement.
Exploring theoretical functions of corpus data in teaching translation

OpenAIRE

Éric Poirier

2016-01-01

http://dx.doi.org/10.5007/2175-7968.2016v36nesp1p177 As language referential data banks, corpora are instrumental in the exploration of translation solutions in bilingual parallel texts or conventional usages of source or target language in monolingual general or specialized texts. These roles are firmly rooted in translation processes, from analysis and interpretation of source text to searching for an acceptable equivalent and integrating it into the production of the target text. Provi...
ANÁLISE SOBRE AS NORMAS E DOS INDICADORES DE SUSTENTABILIDADE E A SUA INTEGRAÇÃO PARA GESTÃO CORPORATIVA

Directory of Open Access Journals (Sweden)

Alexandre André Feil

2013-09-01

Full Text Available The concept of sustainability and quality standards challenge corporations and researchers to create production models that address sustainability in its environmental, social and economic aspects, grounded in the triple bottom line theory. This article aims to relate quality standards (certifications) and corporate sustainability methods by comparing the integration of environmental management and managerial management systems. A qualitative methodology with a bibliographic and descriptive approach was used, seeking out the main quality standards used in global corporations and the sustainability measurement models as conceived by leading scientists and researchers. It was found that quality standards and corporate sustainability are closely aligned, with both addressing the link between environmental, social and economic aspects. However, the parties diverge as to a consensus on a sustainability model for global use. It is suggested that corporate managers integrate the management of quality standards with sustainability and, as a consequence, reduce costs, labour and time, in order to bring greater efficiency to the corporations' internal controls and to the monitoring of stakeholders.
King David and the Frog

Directory of Open Access Journals (Sweden)

Marina Ritzarev

2015-04-01

Full Text Available The interrelations between the liturgical and paraliturgical genres of sacred music in both live practice and in historiography are explored. Parallels are found between eighteenth-century Russian and modern Hebrew religious music. The author's theory of the vernacular in music is applied to explain the stylistic openness in paraliturgical music (as a parallel to onto-vernacular folklore).
Zítra se bude tančit všude, aneb jak jsme se protancovali ke svobodě. Dichotomie tzv. folklorního hnutí druhé poloviny 20. století

Czech Academy of Sciences Publication Activity Database

Stavělová, Daniela

2017-01-01

Roč. 104, č. 4 (2017), s. 411-432 ISSN 0009-0794 R&D Projects: GA ČR(CZ) GA17-26672S Institutional support: RVO:68378076 Keywords : folklore revival movement * folklorism * folk ensembles * oral history * narratives * Czech Republic Subject RIV: AC - Archeology, Anthropology, Ethnology OBOR OECD: Antropology, ethnology
A cohesive page ranking and depth-first crawling scheme for ...

African Journals Online (AJOL)

Documents or corpora with known measures of query types, recall and precision from the Text Retrieval Conference (TREC), the Initiative for the Evaluation of XML Retrieval (INEX) and the Reuters collection were used as a workbench for evaluating the system. The results obtained showed significant improvement from results if ...
Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

Science.gov (United States)

White, Sheida

2012-01-01

This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…
The Anti-Diarrhea Properties Of Zingiber Officinale | Nwoko ...

African Journals Online (AJOL)

Introduction: The crude extract of the plant Zingiber officinale has a high folkloric reputation for anti-diarrhea activity. This study investigated the scientific basis of this folkloric claim. Materials and Methods: Diarrhea was induced in albino mice and albino Wistar rats using castor oil. The animals (mice) were offered the ...
Literatura Oral Hispanica (Hispanic Oral Literature).

Science.gov (United States)

McAlpine, Dave

As part of a class in Hispanic Oral Literature, students collected pieces of folklore from various Hispanic residents in the region known as "Siouxland" in Iowa. Consisting of some of the folklore recorded from the residents, this paper includes 18 "cuentos y leyendas" (tales and legends), 48 "refranes" (proverbs), 17…
Control of corpus allatum activity in the adult Colorado potato beetle

International Nuclear Information System (INIS)

Khan, M.A.

1983-01-01

Assay conditions for the short-term, radiochemical, in vitro determination of the spontaneous rate of juvenile hormone biosynthesis by isolated corpora allata from Leptinotarsa decemlineata have been further improved permitting the measurement of juvenile hormone biosynthesis by individual pairs of corpora allata. Using the new assay conditions, the activities of adult corpora allata during maturation were found to be significantly higher in reproductive, long-day animals than in pre-diapause, short-day beetles. During diapause no activity was detectable, whereas corpora allata from post-diapause beetles were reactivated totally after 5 days. Simultaneous determination of the in vitro rates of juvenile hormone biosynthesis and corpus allatum volumes revealed no clear correlation. (Auth.)

L2 writing assistants and context-aware dictionaries: New ...

African Journals Online (AJOL)

Dictionaries are increasingly integrated into other tools designed to assist the reading, writing and translation of texts. Write Assistant is a newly developed tool aimed at assisting people writing in a second language. It feeds on big data taken in from corpora and digital dictionaries. The paper discusses the philosophy ...
Daži semantiski atšķirīgi homoģenētiski krāsu nosaukumi latviešu un lietuviešu valodā (zils : žilas, ruds : rudas)

Directory of Open Access Journals (Sweden)

Anta Trumpa

2011-12-01

Full Text Available SOME SEMANTICALLY DIFFERENT HOMOGENETIC COLOUR TERMS IN LATVIAN AND LITHUANIAN (zils : žìlas, ruds : rùdas). Summary: The article analyses two formally corresponding but semantically different word pairs in Latvian and Lithuanian: Latv. zils ‘blue’ : Lith. žìlas ‘grey’, and Latv. ruds ‘reddish, red-brown’ : Lith. rùdas ‘brown’. On the basis of dialect data, etymological studies, old dictionaries and texts, and folklore, an attempt was made to determine which of the two languages has preserved the more archaic meaning and which semantic processes have taken place in the development of these colour terms. The analysis also allowed some general conclusions to be drawn: (1) the perception of colours is subjective, so the scope for semantic differentiation is greater for colour terms than for adjectives as a whole; (2) evidently, some of the colour terms arose independently in each language at a relatively late period; (3) old colour terms have often been preserved in folklore, especially in folk songs.
Song forms from Kustilj and neighbouring villages

Directory of Open Access Journals (Sweden)

Kristina PLANJANIN SIMIC

2011-01-01

Full Text Available Song-forms constitute one of the four sub-categories of folklore within the classification of children’s folklore. The song-forms reflect children's responses to nature. They are dedicated to animals that children find interesting and dear. In the distant past, they were performed at fixed hours and days, in certain places and with a set number of repetitions, but over the past centuries they lost their initial position and became a motive for children's play and recreation. In the examples collected for this paper, one can observe and single out a few basic melodic and rhythmic motifs that also occur in children's songs around the world, the connection between children's rhythm and the text, the simplicity and syllabic character of the melody, and the fact that the tone of these songs often relates to archaic diatonic infra-pentatonic series. In addition to educational and entertainment features, these songs reveal the mentality, way of thinking, creativity and spiritual development of a generation that will grow up at the beginning of the 21st century.
Against Her Kind: The Phenomenon of Women against Women in Ovia Cult Worship

Science.gov (United States)

Yakubu, Anthonia Makwemoisa

2014-01-01

This paper addresses the incidence of 'Women against Women' in Nigerian folklore. Much has been written on Nigerian folklore, but mainly from within the mortal axis, as reflected in many folktales that cut across different communities in Nigeria. However, it has been observed that this gender phenomenon extends to the supernatural realm, where…
CzEngClass – Towards a Lexicon of Verb Synonyms with Valency Linked to Semantic Roles

Directory of Open Access Journals (Sweden)

Urešová Zdeňka

2017-12-01

Full Text Available In this paper, we introduce our ongoing project about synonymy in bilingual context. This project aims at exploring semantic ‘equivalence’ of verb senses of generally different verbal lexemes in a bilingual (Czech-English) setting. Specifically, it focuses on their valency behavior within such equivalence groups. We believe that using bilingual context (translation) as an important factor in the delimitation of classes of synonymous lexical units (verbs, in our case) may help to specify the verb senses, also with regard to the relation of their (semantic) roles to other verb senses and to the roles of their arguments, more precisely than when using monolingual corpora. In our project, we work “bottom-up”, i.e., from evidence as recorded in our corpora, and not “top-down”, from a predefined set of semantic classes.
A Corpus of Annotated Irish Traditional Dance Music Recordings: Design and Benchmark Evaluations

OpenAIRE

Beauguitte, Pierre; Duggan, Bryan; Kelleher, John

2016-01-01

An emerging trend in music information retrieval (MIR) is the use of supervised machine learning to train automatic music transcription models. A prerequisite of adopting a machine learning methodology is the availability of annotated corpora. However, different genres of music have different characteristics and modelling these characteristics is an important part of creating state of the art MIR systems. Consequently, although some music corpora are available the use of these corpora is tied...
Text and ideology: text-oriented discourse analysis

Directory of Open Access Journals (Sweden)

Maria Eduarda Gonçalves Peixoto

2018-04-01

Full Text Available The article aims to contribute to the understanding of the connection between text and ideology articulated by the text-oriented analysis of discourse (ADTO). Based on the reflections of Fairclough (1989, 2001, 2003) and Fairclough and Chouliaraki (1999), the debate presents the social ontology that ADTO uses to base its conception of social life as an open system and textually mediated; the article then explains the chronological-narrative development of the main critical theories of ideology, by virtue of which ADTO organizes the assumptions that underpin the particular use it makes of the term. Finally, the discussion presents the main aspects of the connection between text and ideology, offering a conceptual framework that can contribute to the domain of the theme according to a critical discourse analysis approach.
Partition of Ni between olivine and sulfide: the effect of temperature, $f_{\text{O}_2}$ and $f_{\text{S}_2}$

Science.gov (United States)

Fleet, M. E.; Macrae, N. D.

1987-03-01

The experimental distribution coefficient for Ni/Fe exchange between olivine and monosulfide ($K_{D3}$) is 35.6±1.1 at 1385 °C, $f_{\text{O}_2} = 10^{-8.87}$, $f_{\text{S}_2} = 10^{-1.02}$, and olivine of composition Fo96 to Fo92. These are the physicochemical conditions appropriate to hypothesized sulfur-saturated komatiite magma. The present experiments equilibrated natural olivine grains with sulfide-oxide liquid in the presence of a (Mg, Fe)-alumino-silicate melt. By a variety of different experimental procedures, $K_{D3}$ is shown to be essentially constant at about 30 to 35 in the temperature range 900 to 1400 °C, for olivine of composition Fo97 to Fo0, monosulfide composition with up to 70 mol. % NiS, and a wide range of $f_{\text{O}_2}$ and $f_{\text{S}_2}$.
PERAN DONGENG BAGI PERKEMBANGAN DAN PEMBENTUKAN KEPRIBADIAN ANAK

Directory of Open Access Journals (Sweden)

Ipriansyah Ipriansyah

2011-06-01

Full Text Available Abstract: Various forms of folklore such as fairy tales are almost extinct because they are less popular than some TV shows. Although only a few forms of folklore remain, they teach positive values that are useful for children's development. Tales, for example, are believed to play an important role in a child's cognitive development (language and thought) and socioemotional development (emotions and personality). It is therefore reasonable to regard fairy tales as important for the development of children. Development is a pattern of change resulting from biological, cognitive, and socioemotional processes, which begins at conception and continues for the rest of a lifetime. Among the periods of human development there is a storytelling phase, when a child is 5 to 8 years old. Keywords: fairy tales/stories, child development, child's personality
Environmental Hermeneutics: Ethnic and Ecological Traditions in Aesthetic Dialogue with Nature

Directory of Open Access Journals (Sweden)

Boldonova Irina

2016-01-01

Full Text Available The article presents a dialogic attitude towards nature and focuses on the aesthetic form of interaction with the environment via folklore and imaginative writing. The article analyzes the development of scientific thought from human ecology to environmental hermeneutics. Hermeneutic methodology is used in the field of “aesthetics of nature”; therefore, the author applies hermeneutic categories such as tradition, historically effective consciousness, hermeneutic circle, and application to the cultural heritage of one of Siberia’s native peoples and demonstrates the advantages and heuristic value of these categories in analyzing dialogue with nature. Aesthetic dialogue with nature is studied through the example of the ethnic and ecological traditions of the Buryat nomads, who historically migrated across Central Asia and nowadays live around Lake Baikal. The author argues that revitalizing ethnic and ecological traditions in folklore and contemporary national literature presents a hermeneutic dialogue with nature and considers it a valuable resource for ethical assumptions and ecological education for sustainable development.
African Oral Literature and the Humanities: Challenges and Prospects

Directory of Open Access Journals (Sweden)

Enongene Mirabeau Sone

2018-03-01

Full Text Available This paper examines the origin, evolution and emergence of folklore (oral literature) as an academic discipline in Africa and its place in the humanities. It draws attention to the richness of indigenous knowledge contained in oral literature and demonstrates how the ethical and moral gap in the existing educational system can be filled by the moral precepts embedded in oral literature. The paper argues that African oral literature has not received the attention it deserves among other disciplines of the humanities in institutions of higher learning in Africa. It concludes that any discussion on African literature will be incomplete, and indeed irrelevant, if it does not equally give adequate attention to the oral literature of the African people. As a result, a new curriculum and pedagogy must be designed to give pride of place to folklore and oral literature as the best repository of our cultural norms and values especially in African tertiary institutions.
Christianity and Community development in Igboland, 1960-2000

African Journals Online (AJOL)

FEN

In the words of Benjamin Botkin, folklore is a body of traditional beliefs, customs, and ... artists weave in their works in order to give it a true touch of beauty and glamour ... Achebe had a profound influence on many other Nigerian novelists ... culture and the folklore of her people which unconsciously shaped the context ...
POMEGRANATE IN WRITTEN AND SPOKEN LITERATURE / AZERBAYCAN SÖZLÜ VE YAZILI EDEBIYATINDA NAR

Directory of Open Access Journals (Sweden)

Dr. Mehmet İSMAİL

2008-10-01

Full Text Available According to scientists, the native land of Punica is Azerbaijan. This fruit grows in many countries all over the world, and there are many varieties of it in Azerbaijan. This is a survey of the pomegranate as it features in the folklore (tale, legend, myth, proverb, riddle, curse, praise, expression, folk song) of Azerbaijan. Some examples of folk literature in which Punica appears are also included.
THE PROBLEM OF THE PLOT AND THE GENRE IN N. A. RADISHCHEV’S “CHURILA PLENKOVICH, BOGATYR SONGWRITING”

Directory of Open Access Journals (Sweden)

Olga V. Zakharova

2017-03-01

Full Text Available In the 18th–19th centuries, attempts were made to appropriate folklore heroes and genres in literature. One such literary work was N. A. Radishchev’s poem (1801), in which the epic hero Churila Plenkovich became a character. Imitating the folkloric and literary tradition (Homer, Virgil, Ariosto, Voltaire, Wieland, V. A. Levshin, I. P. Bogdanovich), the author combined in his work the images and motifs of such genres as the bylina (Russian epic song), the fairy and literary tale, and the heroic and comic poem. Using the storyline and retaining the main motifs of Levshin’s tale, the poet added mythological and fairy-tale images (Zmey Gorynych (dragon), Yaga, Lel’ (Lel’o), Lada) to his writing, supplemented the narrative with new motifs, and gave justifications for the heroes’ actions. Radishchev created literary heroes using fairytale types; he showed their sufferings, emotions and feelings. Radishchev’s Churila lost all features of the bogatyr. He is a literary hero: a handsome, ardent and sensitive young man. Prelepa and Yaga have fallen in love with him. His feats represent a plot of romance and adventure novel; his heroic deeds are inspired by Lel’o, a god of love who gives the bogatyr strength. As a result of these transformations, an original fabulous plot and genre of the literary work appeared. The writer is precise in his definition: his Churila is a character of the epic story in verse based on folkloric and literary tale.
Text-Fabric

NARCIS (Netherlands)

Roorda, Dirk

2016-01-01

Text-Fabric is a Python3 package for Text plus Annotations. It provides a data model, a text file format, and a binary format for (ancient) text plus (linguistic) annotations. The emphasis of this all is on: data processing; sharing data; and contributing modules. A defining characteristic is that
Aportes materiales y psicoafectivos del negro en el folklore colombiano

Directory of Open Access Journals (Sweden)

Manuel Zapata Olivella

1967-06-01

Full Text Available Most Latin American countries have been shaped by the basic contributions of the indigenous, Hispanic and African cultures. The degree of this mixing (mestizaje) varies from one country to another, according to the importance of the ethnic groups. In Colombia, the cultural balance does not always correspond to the mixture of races.
E-text

DEFF Research Database (Denmark)

Finnemann, Niels Ole

2018-01-01

text can be defined by taking as point of departure the digital format in which everything is represented in the binary alphabet. While the notion of text, in most cases, lends itself to be independent of medium and embodiment, it is also often tacitly assumed that it is, in fact, modeled around...... the print medium, rather than written text or speech. In late 20th century, the notion of text was subject to increasing criticism as in the question raised within literary text theory: is there a text in this class? At the same time, the notion was expanded by including extra linguistic sign modalities...
Elements of characterology in folklore music of Dinaric area

Directory of Open Access Journals (Sweden)

Kenjalović Milorad

2012-01-01

Full Text Available The Dinaric type of man, with all its anthropological, genetic and psychological characteristics, presents an orthodox example of patriarchal upbringing and tradition. Regardless of their patriarchalism and apparent insensitivity to other people, almost every element of their intellectual work (music, dance, sayings, etc.) expresses the fleshly and instinctive, which had to be satisfied regardless of all bans and restraints; the message confirms that Dinaric man did live in accordance with his instincts while at the same time having to respect the criteria of patriarchal morality. In this work the authors cite several songs from this area and analyze them from the perspective of psychology and characterology, finding elements of love, joy and sorrow, cure, passion, women's shyness, etc.
New Sources for Janáček's Essay Brezovská píseň and His Notation of Long-Drawn-Out Folksongs

Czech Academy of Sciences Publication Activity Database

Procházková, Jarmila

2018-01-01

Vol. 55, No. 1 (2018), pp. 41-55 ISSN 0018-7003 Institutional support: RVO:68378076 Keywords: Leoš Janáček (1854-1928) * folklore studies * Janáček's literary work * Janáček's collection of folk music Subject RIV: AL - Art, Architecture, Cultural Heritage OECD field: Folklore studies
English word frequency and recognition in bilinguals: Inter-corpus comparison and error analysis.

Science.gov (United States)

Shi, Lu-Feng

2015-01-01

This study is the second of a two-part investigation on lexical effects on bilinguals' performance on a clinical English word recognition test. Focus is on word-frequency effects using counts provided by four corpora. Frequency of occurrence was obtained for 200 NU-6 words from the Hoosier mental lexicon (HML) and three contemporary corpora, American National Corpora, Hyperspace analogue to language (HAL), and SUBTLEX(US). Correlation analysis was performed between word frequency and error rate. Ten monolinguals and 30 bilinguals participated. Bilinguals were further grouped according to their age of English acquisition and length of schooling/working in English. Word frequency significantly affected word recognition in bilinguals who acquired English late and had limited schooling/working in English. When making errors, bilinguals tended to replace the target word with a word of a higher frequency. Overall, the newer corpora outperformed the HML in predicting error rate. Frequency counts provided by contemporary corpora predict bilinguals' recognition of English monosyllabic words. Word frequency also helps explain top replacement words for misrecognized targets. Word-frequency effects are especially prominent for bilinguals foreign born and educated.

Znaczenie dwujęzycznych korpusów w polsko‑litewskich badaniach konfrontatywnych

Directory of Open Access Journals (Sweden)

Roman Roszko

2015-07-01

Full Text Available The meaning of bilingual corpora in Polish-Lithuanian comparative studies. In this article, the author compares and contrasts the results of his own research on hypothetical modality in Polish and Lithuanian: (a) carried out together with Danuta Roszko using the traditional method (without the use of bilingual corpora) in the 1990s; (b) carried out with the use of parallel Polish-Lithuanian corpus resources. In contrasting the two methods, special attention is paid to the lexical exponents singled out. With the corpus resources, the number of exponents of hypothetical modality singled out in the two languages rose slightly. Moreover, the borders between the corresponding groups of exponents became more distinct. The possibility of using the corresponding groups of exponents to express the meanings of adjacent groups was confirmed. The conclusion is that this phenomenon is as evident now as was expected earlier (in studies without the use of bilingual corpora). Analysing the corpus resources separately, divided into (a) mutual Polish-Lithuanian translations (i.e. from Polish into Lithuanian and vice versa) and (b) translations into Polish and Lithuanian from third languages (here: German, English or Russian), does not significantly influence the number and diversity of the lexical exponents applied in the two languages. This fact proves the high competence of the translators. The formal resemblance of some Polish and Lithuanian exponents does not significantly influence which form is chosen in the target language. In translations from Polish into Lithuanian, some lexical exponents are conveyed with morphological exponents (which Polish lacks). Hypothetical modality that is understated in Polish is sometimes clarified in translations into Lithuanian with the help of morphological forms. In some translations from Lithuanian
I La Galigo Folklore Illustration on Textile Media

Directory of Open Access Journals (Sweden)

Yosepin Sri Ningsih

2014-01-01

Full Text Available This project was an effort in conserving the I La Galigo epic story while at the same time adding value to silk, the famous textile product from South Sulawesi and the origin of I La Galigo. As a work of literature, I La Galigo is categorized as an epic. It is known locally as Sureq Galigo in Bugis. It is divided in a number of episodes or tereng. The most well-known tereng is the one which describes the relationship between Sawerigading and a princess called I We Cudai. From that relationship I La Galigo, the central character of this epic is born. For this project six of the most well-known episodes were selected because of the amount of available supporting data, both theoretical and visual. The selected episodes were translated from their original narrative form into visual language or images. The illustration technique used in this project was STP (Space Time Plane. With this technique every object is drawn from varying viewpoints in one frame, both in space and time. Hand embroidery was added to the painted images. The silk painting can be used as an interior element with value added by the I La Galigo illustrations. Keywords: I La Galigo; epic; Sulawesi; illustration; silk painting; space time plane.
Folklore in bureaucracy code: Running a music event

Directory of Open Access Journals (Sweden)

Krstanović-Lukić Miroslava

2004-01-01

Full Text Available A folk-created piece of music is a construction expressed as a paradigm, part of a set in the bureaucratic system and the public arena. Such a work is a mechanical concept, which defines inheritance as a construction of authenticity saturated with elements of folk, national culture. It is also subject to certain conventions in the system of regulations; namely, it is part of the administrative code. The use of the folk-created work as a paradigm and as legislation is realized through an organizational apparatus; that is, it becomes entertainment, a spectacle. This paper analyzes the functioning of the organizational machinery of a folk spectacle, starting with the government authorities, local self-management and the spectacle's administrative committees. To illustrate this phenomenon, the paper presents the development of a trumpet playing festival in Dragačevo. This particular festival establishes a cultural, economic and political order with a clearly defined division of power. The analysis shows that the folk event in question, through its programs and activities, represents a scene and arena of individual and group interests. Organizational interactions are recognized in binary oppositions: sovereignty/dependency, official/unofficial, dominance/subordination, innovative/inherited, common/different, needed/useful, original/copy, one's own/belonging to someone else.
Monitoring interaction and collective text production through text mining

Directory of Open Access Journals (Sweden)

Macedo, Alexandra Lorandi

2014-04-01

Full Text Available This article presents the Concepts Network tool, developed using text mining technology. The main objective of this tool is to extract and relate terms of greatest incidence from a text and exhibit the results in the form of a graph. The Network was implemented in the Collective Text Editor (CTE), which is an online tool that allows the production of texts in synchronized or non-synchronized forms. This article describes the application of the Network both in texts produced collectively and texts produced in a forum. The purpose of the tool is to offer support to the teacher in managing the high volume of data generated in the process of interaction amongst students and in the construction of the text. Specifically, the aim is to facilitate the teacher’s job by allowing him/her to process data in a shorter time than is currently demanded. The results suggest that the Concepts Network can aid the teacher, as it provides indicators of the quality of the text produced. Moreover, messages posted in forums can be analyzed without their content necessarily having to be pre-read.
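The idea of extracting the most frequent terms from a text and relating them in a graph can be illustrated with a minimal sketch. This is not the Concepts Network implementation, which the abstract does not describe in detail; it assumes that "terms of greatest incidence" means the most frequent non-stop-word tokens and that two terms are related when they share a sentence. The names `STOPWORDS`, `tokenize` and `concept_graph` are hypothetical.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical, deliberately tiny stop-word list (an assumption for illustration).
STOPWORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "are", "by"}


def tokenize(sentence):
    """Lowercase a sentence and keep alphabetic tokens that are not stop words."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower()) if w not in STOPWORDS]


def concept_graph(text, top_n=5):
    """Return (terms, edges) for the `top_n` most frequent terms in `text`.

    `terms` maps each selected term to its frequency. `edges` maps an
    alphabetically ordered pair of selected terms to the number of
    sentences in which both terms occur.
    """
    sentences = [tokenize(s) for s in re.split(r"[.!?]+", text) if s.strip()]
    counts = Counter(word for sent in sentences for word in sent)
    terms = dict(counts.most_common(top_n))
    edges = Counter()
    for sent in sentences:
        present = sorted(set(sent) & terms.keys())
        for a, b in combinations(present, 2):
            edges[(a, b)] += 1
    return terms, edges
```

Calling `concept_graph(text, top_n=2)` on a short passage returns the two most frequent terms with their counts and the weight of the edge between them; a teacher-facing tool would then draw these as nodes and links.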
Teaching Text Structure: Examining the Affordances of Children's Informational Texts

Science.gov (United States)

Jones, Cindy D.; Clark, Sarah K.; Reutzel, D. Ray

2016-01-01

This study investigated the affordances of informational texts to serve as model texts for teaching text structure to elementary school children. Content analysis of a random sampling of children's informational texts from top publishers was conducted on text structure organization and on the inclusion of text features as signals of text…
Text Analysis: Critical Component of Planning for Text-Based Discussion Focused on Comprehension of Informational Texts

Science.gov (United States)

Kucan, Linda; Palincsar, Annemarie Sullivan

2018-01-01

This investigation focuses on a tool used in a reading methods course to introduce reading specialist candidates to text analysis as a critical component of planning for text-based discussions. Unlike planning that focuses mainly on important text content or information, a text analysis approach focuses both on content and how that content is…
Text Maps: Helping Students Navigate Informational Texts.

Science.gov (United States)

Spencer, Brenda H.

2003-01-01

Notes that a text map is an instructional approach designed to help students gain fluency in reading content area materials. Discusses how the goal is to teach students about the important features of the material and how the maps can be used to build new understandings. Presents the procedures for preparing and using a text map. (SG)
Birgit Steinbügl: Deutsch-englische Kollokationen: Erfassung in zweisprachigen Wörterbüchern und Grenzen der korpusbasierten Analyse

Directory of Open Access Journals (Sweden)

Maria Smit

2011-10-01

Full Text Available This study investigates the role of collocations in dictionary use, and the extent to which users' needs are taken into account in the process of dictionary writing. Steinbügl decided to concentrate on bilingual dictionaries, because this type of dictionary is relatively less well explored in metalexicographical literature. German-English examples are analysed and evaluated. Instead of selecting examples randomly, she uses a comparative corpus of 200 collocations she put together herself in accordance with scientific reasons explained in detail. She questions the selection of collocations from existing corpora for her purposes, because these corpora are based on competing collocational theories. In order to come to meaningful conclusions, she prefers to delineate her own research approach, also however investigating the structures of bilingual dictionaries and dictionary articles, as well as situations of dictionary use.
Lingüística de Corpus: histórico e problemática

Directory of Open Access Journals (Sweden)

SARDINHA Tony Berber

2000-01-01

Full Text Available This paper offers a retrospective of Corpus Linguistics, a research area that has grown dramatically in recent years and has had a considerable impact on linguistics. The retrospective includes both a historical overview and a position on current debates and future developments in the area. The main concepts in use in the area are presented and discussed. The paper also comments on the most notable developments in Corpus Linguistics with regard to theory and practice, listing the main corpora in existence as well as the most important contributions in the field of computer programs for analysing and exploring these corpora.
The Only Safe SMS Texting Is No SMS Texting.

Science.gov (United States)

Toth, Cheryl; Sacopulos, Michael J

2015-01-01

Many physicians and practice staff use short messaging service (SMS) text messaging to communicate with patients. But SMS text messaging is unencrypted, insecure, and does not meet HIPAA requirements. In addition, the short and abbreviated nature of text messages creates opportunities for misinterpretation, and can negatively impact patient safety and care. Until recently, asking patients to sign a statement that they understand and accept these risks--as well as having policies, device encryption, and cyber insurance in place--would have been enough to mitigate the risk of using SMS text in a medical practice. But new trends and policies have made SMS text messaging unsafe under any circumstance. This article explains these trends and policies, as well as why only secure texting or secure messaging should be used for physician-patient communication.
Non pigmenting mucosal fixed drug eruption due to tadalafil: A report of two cases

Directory of Open Access Journals (Sweden)

Sudip Das

2014-01-01

Full Text Available Various 'sex-stimulant' medicines with fancy names and attractive packaging are available over the counter. Most contain phosphodiesterase 5 inhibitors in various strengths, often with herbal additions. These drugs are used erratically by the lay public, driven by folklore that such usage leads to increase in the length, girth or firmness of the penis. Such indiscriminate use by an otherwise healthy population leads to undue side effects.
Collecting and evaluating speech recognition corpora for nine Southern Bantu languages

CSIR Research Space (South Africa)

Badenhorst, JAC

2009-03-01

Full Text Available The authors describe the Lwazi corpus for automatic speech recognition (ASR), a new telephone speech corpus which includes data from nine Southern Bantu languages. Because of practical constraints, the amount of speech per language is relatively...
Modal Auxiliary Verbs in Prescribed Malaysian English Textbooks

Science.gov (United States)

Mukundan, Jayakaran; Khojasteh, Laleh

2011-01-01

The use of corpus-based findings to inform L2 teaching materials has been emphasized by many researchers, because studies of authentic texts have revealed inconsistencies between the use of grammatical structures in corpora and those found in language textbooks that are based purely on hunch. Therefore, by…
Text Mining.

Science.gov (United States)

Trybula, Walter J.

1999-01-01

Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…
SparkText: Biomedical Text Mining on Big Data Framework.

Directory of Open Access Journals (Sweden)

Zhan Ye

Full Text Available Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.
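The classification step described in this abstract can be illustrated with a small sketch. This is not the SparkText code and does not use Apache Spark or Cassandra; it is a minimal pure-Python multinomial Naive Bayes (one of the three classifiers the abstract names) with add-one smoothing. The class name `NaiveBayesTextClassifier`, the `tokenize` helper, and the toy training snippets are all hypothetical and made up for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace; a real pipeline would do far more."""
    return text.lower().split()


class NaiveBayesTextClassifier:
    """Multinomial Naive Bayes with add-one (Laplace) smoothing (illustrative only)."""

    def fit(self, documents, labels):
        self.class_doc_counts = Counter(labels)   # documents per class
        self.word_counts = defaultdict(Counter)   # per-class token counts
        self.vocabulary = set()
        for text, label in zip(documents, labels):
            tokens = tokenize(text)
            self.word_counts[label].update(tokens)
            self.vocabulary.update(tokens)
        self.total_docs = len(labels)
        return self

    def predict(self, text):
        # Tokens never seen in training carry no information and are skipped.
        tokens = [t for t in tokenize(text) if t in self.vocabulary]
        vocab_size = len(self.vocabulary)
        best_label, best_score = None, -math.inf
        for label, doc_count in self.class_doc_counts.items():
            # Log prior: fraction of training documents with this label.
            score = math.log(doc_count / self.total_docs)
            total_words = sum(self.word_counts[label].values())
            for token in tokens:
                count = self.word_counts[label][token]
                # Smoothed log likelihood of the token given the class.
                score += math.log((count + 1) / (total_words + vocab_size))
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Toy, made-up training snippets (assumption: not taken from PubMed).
train_docs = [
    "brca1 mutation mammography tumor",
    "mammography screening brca2 carrier",
    "psa level prostatectomy gleason score",
    "androgen therapy psa gleason",
    "smoking nodule bronchoscopy egfr",
    "egfr inhibitor smoking history nodule",
]
train_labels = ["breast", "breast", "prostate", "prostate", "lung", "lung"]

model = NaiveBayesTextClassifier().fit(train_docs, train_labels)
print(model.predict("elevated psa and high gleason score"))
```

Work is done in log space so that products of many small probabilities do not underflow. A framework working at the scale the abstract reports would presumably distribute the counting step across workers, which this single-process sketch does not attempt.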
Production of [Formula: see text] and [Formula: see text] in p-Pb collisions at [Formula: see text] TeV.

Science.gov (United States)

Adamová, D; Aggarwal, M M; Aglieri Rinella, G; Agnello, M; Agrawal, N; Ahammed, Z; Ahmad, S; Ahn, S U; Aiola, S; Akindinov, A; Alam, S N; Albuquerque, D S D; Aleksandrov, D; Alessandro, B; Alexandre, D; Alfaro Molina, R; Alici, A; Alkin, A; Alme, J; Alt, T; Altinpinar, S; Altsybeev, I; Alves Garcia Prado, C; An, M; Andrei, C; Andrews, H A; Andronic, A; Anguelov, V; Anson, C; Antičić, T; Antinori, F; Antonioli, P; Anwar, R; Aphecetche, L; Appelshäuser, H; Arcelli, S; Arnaldi, R; Arnold, O W; Arsene, I C; Arslandok, M; Audurier, B; Augustinus, A; Averbeck, R; Azmi, M D; Badalà, A; Baek, Y W; Bagnasco, S; Bailhache, R; Bala, R; Baldisseri, A; Ball, M; Baral, R C; Barbano, A M; Barbera, R; Barile, F; Barioglio, L; Barnaföldi, G G; Barnby, L S; Barret, V; Bartalini, P; Barth, K; Bartke, J; Bartsch, E; Basile, M; Bastid, N; Basu, S; Bathen, B; Batigne, G; Batista Camejo, A; Batyunya, B; Batzing, P C; Bearden, I G; Beck, H; Bedda, C; Behera, N K; Belikov, I; Bellini, F; Bello Martinez, H; Bellwied, R; Beltran, L G E; Belyaev, V; Bencedi, G; Beole, S; Bercuci, A; Berdnikov, Y; Berenyi, D; Bertens, R A; Berzano, D; Betev, L; Bhasin, A; Bhat, I R; Bhati, A K; Bhattacharjee, B; Bhom, J; Bianchi, L; Bianchi, N; Bianchin, C; Bielčík, J; Bielčíková, J; Bilandzic, A; Biro, G; Biswas, R; Biswas, S; Blair, J T; Blau, D; Blume, C; Boca, G; Bock, F; Bogdanov, A; Boldizsár, L; Bombara, M; Bonomi, G; Bonora, M; Book, J; Borel, H; Borissov, A; Borri, M; Botta, E; Bourjau, C; Braun-Munzinger, P; Bregant, M; Broker, T A; Browning, T A; Broz, M; Brucken, E J; Bruna, E; Bruno, G E; Budnikov, D; Buesching, H; Bufalino, S; Buhler, P; Buitron, S A I; Buncic, P; Busch, O; Buthelezi, Z; Butt, J B; Buxton, J T; Cabala, J; Caffarri, D; Caines, H; Caliva, A; Calvo Villar, E; Camerini, P; Capon, A A; Carena, F; Carena, W; Carnesecchi, F; Castillo Castellanos, J; Castro, A J; Casula, E A R; Ceballos Sanchez, C; Cerello, P; Chang, B; Chapeland, S; Chartier, M; Charvet, J L; Chattopadhyay, S; 
Chattopadhyay, S; Chauvin, A; Cherney, M; Cheshkov, C; Cheynis, B; Chibante Barroso, V; Chinellato, D D; Cho, S; Chochula, P; Choi, K; Chojnacki, M; Choudhury, S; Christakoglou, P; Christensen, C H; Christiansen, P; Chujo, T; Chung, S U; Cicalo, C; Cifarelli, L; Cindolo, F; Cleymans, J; Colamaria, F; Colella, D; Collu, A; Colocci, M; Conesa Balbastre, G; Conesa Del Valle, Z; Connors, M E; Contreras, J G; Cormier, T M; Corrales Morales, Y; Cortés Maldonado, I; Cortese, P; Cosentino, M R; Costa, F; Costanza, S; Crkovská, J; Crochet, P; Cuautle, E; Cunqueiro, L; Dahms, T; Dainese, A; Danisch, M C; Danu, A; Das, D; Das, I; Das, S; Dash, A; Dash, S; De, S; De Caro, A; de Cataldo, G; de Conti, C; de Cuveland, J; De Falco, A; De Gruttola, D; De Marco, N; De Pasquale, S; De Souza, R D; Degenhardt, H F; Deisting, A; Deloff, A; Deplano, C; Dhankher, P; Di Bari, D; Di Mauro, A; Di Nezza, P; Di Ruzza, B; Diaz Corchero, M A; Dietel, T; Dillenseger, P; Divià, R; Djuvsland, Ø; Dobrin, A; Domenicis Gimenez, D; Dönigus, B; Dordic, O; Drozhzhova, T; Dubey, A K; Dubla, A; Ducroux, L; Duggal, A K; Dupieux, P; Ehlers, R J; Elia, D; Endress, E; Engel, H; Epple, E; Erazmus, B; Erhardt, F; Espagnon, B; Esumi, S; Eulisse, G; Eum, J; Evans, D; Evdokimov, S; Fabbietti, L; Fabris, D; Faivre, J; Fantoni, A; Fasel, M; Feldkamp, L; Feliciello, A; Feofilov, G; Ferencei, J; Fernández Téllez, A; Ferreiro, E G; Ferretti, A; Festanti, A; Feuillard, V J G; Figiel, J; Figueredo, M A S; Filchagin, S; Finogeev, D; Fionda, F M; Fiore, E M; Floris, M; Foertsch, S; Foka, P; Fokin, S; Fragiacomo, E; Francescon, A; Francisco, A; Frankenfeld, U; Fronze, G G; Fuchs, U; Furget, C; Furs, A; Fusco Girard, M; Gaardhøje, J J; Gagliardi, M; Gago, A M; Gajdosova, K; Gallio, M; Galvan, C D; Gangadharan, D R; Ganoti, P; Gao, C; Garabatos, C; Garcia-Solis, E; Garg, K; Garg, P; Gargiulo, C; Gasik, P; Gauger, E F; Gay Ducati, M B; Germain, M; Ghosh, P; Ghosh, S K; Gianotti, P; Giubellino, P; Giubilato, P; Gladysz-Dziadus, 
E; Glässel, P; Goméz Coral, D M; Gomez Ramirez, A; Gonzalez, A S; Gonzalez, V; González-Zamora, P; Gorbunov, S; Görlich, L; Gotovac, S; Grabski, V; Graczykowski, L K; Graham, K L; Greiner, L; Grelli, A; Grigoras, C; Grigoriev, V; Grigoryan, A; Grigoryan, S; Grion, N; Gronefeld, J M; Grosa, F; Grosse-Oetringhaus, J F; Grosso, R; Gruber, L; Grull, F R; Guber, F; Guernane, R; Guerzoni, B; Gulbrandsen, K; Gunji, T; Gupta, A; Gupta, R; Guzman, I B; Haake, R; Hadjidakis, C; Hamagaki, H; Hamar, G; Hamon, J C; Harris, J W; Harton, A; Hatzifotiadou, D; Hayashi, S; Heckel, S T; Hellbär, E; Helstrup, H; Herghelegiu, A; Herrera Corral, G; Herrmann, F; Hess, B A; Hetland, K F; Hillemanns, H; Hippolyte, B; Hladky, J; Horak, D; Hosokawa, R; Hristov, P; Hughes, C; Humanic, T J; Hussain, N; Hussain, T; Hutter, D; Hwang, D S; Ilkaev, R; Inaba, M; Ippolitov, M; Irfan, M; Isakov, V; Islam, M S; Ivanov, M; Ivanov, V; Izucheev, V; Jacak, B; Jacazio, N; Jacobs, P M; Jadhav, M B; Jadlovska, S; Jadlovsky, J; Jahnke, C; Jakubowska, M J; Janik, M A; Jayarathna, P H S Y; Jena, C; Jena, S; Jercic, M; Jimenez Bustamante, R T; Jones, P G; Jusko, A; Kalinak, P; Kalweit, A; Kang, J H; Kaplin, V; Kar, S; Karasu Uysal, A; Karavichev, O; Karavicheva, T; Karayan, L; Karpechev, E; Kebschull, U; Keidel, R; Keijdener, D L D; Keil, M; Ketzer, B; Mohisin Khan, M; Khan, P; Khan, S A; Khanzadeev, A; Kharlov, Y; Khatun, A; Khuntia, A; Kielbowicz, M M; Kileng, B; Kim, D W; Kim, D J; Kim, D; Kim, H; Kim, J S; Kim, J; Kim, M; Kim, M; Kim, S; Kim, T; Kirsch, S; Kisel, I; Kiselev, S; Kisiel, A; Kiss, G; Klay, J L; Klein, C; Klein, J; Klein-Bösing, C; Klewin, S; Kluge, A; Knichel, M L; Knospe, A G; Kobdaj, C; Kofarago, M; Kollegger, T; Kolojvari, A; Kondratiev, V; Kondratyeva, N; Kondratyuk, E; Konevskikh, A; Kopcik, M; Kour, M; Kouzinopoulos, C; Kovalenko, O; Kovalenko, V; Kowalski, M; Koyithatta Meethaleveedu, G; Králik, I; Kravčáková, A; Krivda, M; Krizek, F; Kryshen, E; Krzewicki, M; Kubera, A M; Kučera, V; 
Kuhn, C; Kuijer, P G; Kumar, A; Kumar, J; Kumar, L; Kumar, S; Kundu, S; Kurashvili, P; Kurepin, A; Kurepin, A B; Kuryakin, A; Kushpil, S; Kweon, M J; Kwon, Y; La Pointe, S L; La Rocca, P; Lagana Fernandes, C; Lakomov, I; Langoy, R; Lapidus, K; Lara, C; Lardeux, A; Lattuca, A; Laudi, E; Lavicka, R; Lazaridis, L; Lea, R; Leardini, L; Lee, S; Lehas, F; Lehner, S; Lehrbach, J; Lemmon, R C; Lenti, V; Leogrande, E; León Monzón, I; Lévai, P; Li, S; Li, X; Lien, J; Lietava, R; Lindal, S; Lindenstruth, V; Lippmann, C; Lisa, M A; Litichevskyi, V; Ljunggren, H M; Llope, W J; Lodato, D F; Loenne, P I; Loginov, V; Loizides, C; Loncar, P; Lopez, X; López Torres, E; Lowe, A; Luettig, P; Lunardon, M; Luparello, G; Lupi, M; Lutz, T H; Maevskaya, A; Mager, M; Mahajan, S; Mahmood, S M; Maire, A; Majka, R D; Malaev, M; Maldonado Cervantes, I; Malinina, L; Mal'Kevich, D; Malzacher, P; Mamonov, A; Manko, V; Manso, F; Manzari, V; Mao, Y; Marchisone, M; Mareš, J; Margagliotti, G V; Margotti, A; Margutti, J; Marín, A; Markert, C; Marquard, M; Martin, N A; Martinengo, P; Martinez, J A L; Martínez, M I; Martínez García, G; Martinez Pedreira, M; Mas, A; Masciocchi, S; Masera, M; Masoni, A; Mastroserio, A; Mathis, A M; Matyja, A; Mayer, C; Mazer, J; Mazzilli, M; Mazzoni, M A; Meddi, F; Melikyan, Y; Menchaca-Rocha, A; Meninno, E; Mercado Pérez, J; Meres, M; Mhlanga, S; Miake, Y; Mieskolainen, M M; Mihaylov, D; Mikhaylov, K; Milano, L; Milosevic, J; Mischke, A; Mishra, A N; Miśkowiec, D; Mitra, J; Mitu, C M; Mohammadi, N; Mohanty, B; Montes, E; Moreira De Godoy, D A; Moreno, L A P; Moretto, S; Morreale, A; Morsch, A; Muccifora, V; Mudnic, E; Mühlheim, D; Muhuri, S; Mukherjee, M; Mulligan, J D; Munhoz, M G; Münning, K; Munzer, R H; Murakami, H; Murray, S; Musa, L; Musinsky, J; Myers, C J; Naik, B; Nair, R; Nandi, B K; Nania, R; Nappi, E; Naru, M U; Natal da Luz, H; Nattrass, C; Navarro, S R; Nayak, K; Nayak, R; Nayak, T K; Nazarenko, S; Nedosekin, A; Negrao De Oliveira, R A; Nellen, L; Nesbo, S 
V; Ng, F; Nicassio, M; Niculescu, M; Niedziela, J; Nielsen, B S; Nikolaev, S; Nikulin, S; Nikulin, V; Noferini, F; Nomokonov, P; Nooren, G; Noris, J C C; Norman, J; Nyanin, A; Nystrand, J; Oeschler, H; Oh, S; Ohlson, A; Okubo, T; Olah, L; Oleniacz, J; Oliveira Da Silva, A C; Oliver, M H; Onderwaater, J; Oppedisano, C; Orava, R; Oravec, M; Ortiz Velasquez, A; Oskarsson, A; Otwinowski, J; Oyama, K; Ozdemir, M; Pachmayer, Y; Pacik, V; Pagano, D; Pagano, P; Paić, G; Pal, S K; Palni, P; Pan, J; Pandey, A K; Panebianco, S; Papikyan, V; Pappalardo, G S; Pareek, P; Park, J; Park, W J; Parmar, S; Passfeld, A; Pathak, S P; Paticchio, V; Patra, R N; Paul, B; Pei, H; Peitzmann, T; Peng, X; Pereira, L G; Pereira Da Costa, H; Peresunko, D; Perez Lezama, E; Peskov, V; Pestov, Y; Petráček, V; Petrov, V; Petrovici, M; Petta, C; Pezzi, R P; Piano, S; Pikna, M; Pillot, P; Pimentel, L O D L; Pinazza, O; Pinsky, L; Piyarathna, D B; Płoskoń, M; Planinic, M; Pluta, J; Pochybova, S; Podesta-Lerma, P L M; Poghosyan, M G; Polichtchouk, B; Poljak, N; Poonsawat, W; Pop, A; Poppenborg, H; Porteboeuf-Houssais, S; Porter, J; Pospisil, J; Pozdniakov, V; Prasad, S K; Preghenella, R; Prino, F; Pruneau, C A; Pshenichnov, I; Puccio, M; Puddu, G; Pujahari, P; Punin, V; Putschke, J; Qvigstad, H; Rachevski, A; Raha, S; Rajput, S; Rak, J; Rakotozafindrabe, A; Ramello, L; Rami, F; Rana, D B; Raniwala, R; Raniwala, S; Räsänen, S S; Rascanu, B T; Rathee, D; Ratza, V; Ravasenga, I; Read, K F; Redlich, K; Rehman, A; Reichelt, P; Reidt, F; Ren, X; Renfordt, R; Reolon, A R; Reshetin, A; Reygers, K; Riabov, V; Ricci, R A; Richert, T; Richter, M; Riedler, P; Riegler, W; Riggi, F; Ristea, C; Rodríguez Cahuantzi, M; Røed, K; Rogochaya, E; Rohr, D; Röhrich, D; Rokita, P S; Ronchetti, F; Ronflette, L; Rosnet, P; Rossi, A; Rotondi, A; Roukoutakis, F; Roy, A; Roy, C; Roy, P; Rubio Montero, A J; Rui, R; Russo, R; Rustamov, A; Ryabinkin, E; Ryabov, Y; Rybicki, A; Saarinen, S; Sadhu, S; Sadovsky, S; Šafařík, K; Saha, S K; 
Sahlmuller, B; Sahoo, B; Sahoo, P; Sahoo, R; Sahoo, S; Sahu, P K; Saini, J; Sakai, S; Saleh, M A; Salzwedel, J; Sambyal, S; Samsonov, V; Sandoval, A; Sarkar, D; Sarkar, N; Sarma, P; Sas, M H P; Scapparone, E; Scarlassara, F; Scharenberg, R P; Scheid, H S; Schiaua, C; Schicker, R; Schmidt, C; Schmidt, H R; Schmidt, M O; Schmidt, M; Schukraft, J; Schutz, Y; Schwarz, K; Schweda, K; Scioli, G; Scomparin, E; Scott, R; Šefčík, M; Seger, J E; Sekiguchi, Y; Sekihata, D; Selyuzhenkov, I; Senosi, K; Senyukov, S; Serradilla, E; Sett, P; Sevcenco, A; Shabanov, A; Shabetai, A; Shadura, O; Shahoyan, R; Shangaraev, A; Sharma, A; Sharma, A; Sharma, M; Sharma, M; Sharma, N; Sheikh, A I; Shigaki, K; Shou, Q; Shtejer, K; Sibiriak, Y; Siddhanta, S; Sielewicz, K M; Siemiarczuk, T; Silvermyr, D; Silvestre, C; Simatovic, G; Simonetti, G; Singaraju, R; Singh, R; Singhal, V; Sinha, T; Sitar, B; Sitta, M; Skaali, T B; Slupecki, M; Smirnov, N; Snellings, R J M; Snellman, T W; Song, J; Song, M; Soramel, F; Sorensen, S; Sozzi, F; Spiriti, E; Sputowska, I; Srivastava, B K; Stachel, J; Stan, I; Stankus, P; Stenlund, E; Stiller, J H; Stocco, D; Strmen, P; Suaide, A A P; Sugitate, T; Suire, C; Suleymanov, M; Suljic, M; Sultanov, R; Šumbera, M; Sumowidagdo, S; Suzuki, K; Swain, S; Szabo, A; Szarka, I; Szczepankiewicz, A; Szymanski, M; Tabassam, U; Takahashi, J; Tambave, G J; Tanaka, N; Tarhini, M; Tariq, M; Tarzila, M G; Tauro, A; Tejeda Muñoz, G; Telesca, A; Terasaki, K; Terrevoli, C; Teyssier, B; Thakur, D; Thakur, S; Thomas, D; Tieulent, R; Tikhonov, A; Timmins, A R; Toia, A; Tripathy, S; Trogolo, S; Trombetta, G; Trubnikov, V; Trzaska, W H; Trzeciak, B A; Tsuji, T; Tumkin, A; Turrisi, R; Tveter, T S; Ullaland, K; Umaka, E N; Uras, A; Usai, G L; Utrobicic, A; Vala, M; Van Der Maarel, J; Van Hoorne, J W; van Leeuwen, M; Vanat, T; Vande Vyvre, P; Varga, D; Vargas, A; Vargyas, M; Varma, R; Vasileiou, M; Vasiliev, A; Vauthier, A; Vázquez Doce, O; Vechernin, V; Veen, A M; Velure, A; Vercellin, E; 
Vergara Limón, S; Vernet, R; Vértesi, R; Vickovic, L; Vigolo, S; Viinikainen, J; Vilakazi, Z; Villalobos Baillie, O; Villatoro Tello, A; Vinogradov, A; Vinogradov, L; Virgili, T; Vislavicius, V; Vodopyanov, A; Völkl, M A; Voloshin, K; Voloshin, S A; Volpe, G; von Haller, B; Vorobyev, I; Voscek, D; Vranic, D; Vrláková, J; Wagner, B; Wagner, J; Wang, H; Wang, M; Watanabe, D; Watanabe, Y; Weber, M; Weber, S G; Weiser, D F; Wessels, J P; Westerhoff, U; Whitehead, A M; Wiechula, J; Wikne, J; Wilk, G; Wilkinson, J; Willems, G A; Williams, M C S; Windelband, B; Witt, W E; Yalcin, S; Yang, P; Yano, S; Yin, Z; Yokoyama, H; Yoo, I-K; Yoon, J H; Yurchenko, V; Zaccolo, V; Zaman, A; Zampolli, C; Zanoli, H J C; Zaporozhets, S; Zardoshti, N; Zarochentsev, A; Závada, P; Zaviyalov, N; Zbroszczyk, H; Zhalov, M; Zhang, H; Zhang, X; Zhang, Y; Zhang, C; Zhang, Z; Zhao, C; Zhigareva, N; Zhou, D; Zhou, Y; Zhou, Z; Zhu, H; Zhu, J; Zhu, X; Zichichi, A; Zimmermann, A; Zimmermann, M B; Zimmermann, S; Zinovjev, G; Zmeskal, J

2017-01-01

The transverse momentum distributions of the strange and double-strange hyperon resonances ([Formula: see text], [Formula: see text]) produced in p-Pb collisions at [Formula: see text] TeV were measured in the rapidity range [Formula: see text] for event classes corresponding to different charged-particle multiplicity densities, [Formula: see text]d[Formula: see text]/d[Formula: see text]. The mean transverse momentum values are presented as a function of [Formula: see text]d[Formula: see text]/d[Formula: see text], as well as a function of the particle masses and compared with previous results on hyperon production. The integrated yield ratios of excited to ground-state hyperons are constant as a function of [Formula: see text]d[Formula: see text]/d[Formula: see text]. The equivalent ratios to pions exhibit an increase with [Formula: see text]d[Formula: see text]/d[Formula: see text], depending on their strangeness content.
The nuclear modification of charged particles in Pb-Pb at $\sqrt{\text{s}_\text{NN}} = \text{5.02}\,\text{TeV}$ measured with ALICE

CERN Document Server

Gronefeld, Julius

2016-09-21

The study of inclusive charged-particle production in heavy-ion collisions provides insights into the density of the medium and the energy-loss mechanisms. The observed suppression of high-$\textit{p}_\text{T}$ yield is generally attributed to energy loss of partons as they propagate through a deconfined state of quarks and gluons, the Quark-Gluon Plasma (QGP), predicted by QCD. Such measurements allow the characterization of the QGP by comparison with models. In these proceedings, results on high-$\textit{p}_\text{T}$ particle production measured by ALICE in Pb-Pb collisions at $\sqrt{\text{s}_\text{NN}}\, = 5.02\ \rm{TeV}$ as well as in pp at $\sqrt{\text{s}}\,=5.02\ \rm{TeV}$ are presented for the first time. The nuclear modification factors ($\text{R}_\text{AA}$) in Pb-Pb collisions are presented and compared with model calculations.
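The abstract does not spell out how the nuclear modification factor is defined. As a reminder, the conventional form compares the Pb-Pb yield with the pp yield scaled by the average number of binary nucleon-nucleon collisions; the notation below ($N_\text{AA}$, $N_\text{pp}$, $\langle N_\text{coll} \rangle$) is the usual textbook one and is an assumption, not quoted from the proceedings.

```latex
R_\text{AA}(p_\text{T}) =
  \frac{\mathrm{d}N_\text{AA}/\mathrm{d}p_\text{T}}
       {\langle N_\text{coll} \rangle \, \mathrm{d}N_\text{pp}/\mathrm{d}p_\text{T}}
```

With this convention, $R_\text{AA} = 1$ means the heavy-ion collision behaves like an incoherent superposition of nucleon-nucleon collisions, and $R_\text{AA} < 1$ at high $p_\text{T}$ corresponds to the suppression mentioned above.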
Observation of [Formula: see text] and [Formula: see text] decays.

Science.gov (United States)

Aaij, R; Adeva, B; Adinolfi, M; Ajaltouni, Z; Akar, S; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Alvarez Cartelle, P; Alves, A A; Amato, S; Amerio, S; Amhis, Y; An, L; Anderlini, L; Andreassi, G; Andreotti, M; Andrews, J E; Appleby, R B; Archilli, F; d'Argent, P; Arnau Romeu, J; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Babuschkin, I; Bachmann, S; Back, J J; Badalov, A; Baesso, C; Baker, S; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Baszczyk, M; Batozskaya, V; Batsukh, B; Battista, V; Bay, A; Beaucourt, L; Beddow, J; Bedeschi, F; Bediaga, I; Bel, L J; Bellee, V; Belloli, N; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bertolin, A; Betancourt, C; Betti, F; Bettler, M-O; van Beuzekom, M; Bezshyiko, Ia; Bifani, S; Billoir, P; Bird, T; Birnkraut, A; Bitadze, A; Bizzeti, A; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Boettcher, T; Bondar, A; Bondar, N; Bonivento, W; Bordyuzhin, I; Borgheresi, A; Borghi, S; Borisyak, M; Borsato, M; Bossu, F; Boubdir, M; Bowcock, T J V; Bowen, E; Bozzi, C; Braun, S; Britsch, M; Britton, T; Brodzicka, J; Buchanan, E; Burr, C; Bursche, A; Buytaert, J; Cadeddu, S; Calabrese, R; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D H; Capriotti, L; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carniti, P; Carson, L; Carvalho Akiba, K; Casse, G; Cassina, L; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cavallero, G; Cenci, R; Charles, M; Charpentier, Ph; Chatzikonstantinidis, G; Chefdeville, M; Chen, S; Cheung, S-F; Chobanova, V; Chrzaszcz, M; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coco, V; Cogan, J; Cogneras, E; Cogoni, V; Cojocariu, L; Collazuol, G; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombs, G; Coquereau, S; Corti, G; Corvo, M; Costa Sobral, C M; Couturier, B; Cowan, G A; Craik, D C; Crocombe, A; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; Da 
Cunha Marinho, F; Dall'Occo, E; Dalseno, J; David, P N Y; Davis, A; De Aguiar Francisco, O; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Serio, M; De Simone, P; Dean, C-T; Decamp, D; Deckenhoff, M; Del Buono, L; Demmer, M; Dendek, A; Derkach, D; Deschamps, O; Dettori, F; Dey, B; Di Canto, A; Dijkstra, H; Dordei, F; Dorigo, M; Dosil Suárez, A; Dovbnya, A; Dreimanis, K; Dufour, L; Dujany, G; Dungs, K; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Déléage, N; Easo, S; Ebert, M; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; Ely, S; Esen, S; Evans, H M; Evans, T; Falabella, A; Farley, N; Farry, S; Fay, R; Fazzini, D; Ferguson, D; Fernandez Prieto, A; Ferrari, F; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fini, R A; Fiore, M; Fiorini, M; Firlej, M; Fitzpatrick, C; Fiutowski, T; Fleuret, F; Fohl, K; Fontana, M; Fontanelli, F; Forshaw, D C; Forty, R; Franco Lima, V; Frank, M; Frei, C; Fu, J; Furfaro, E; Färber, C; Gallas Torreira, A; Galli, D; Gallorini, S; Gambetta, S; Gandelman, M; Gandini, P; Gao, Y; Garcia Martin, L M; García Pardiñas, J; Garra Tico, J; Garrido, L; Garsed, P J; Gascon, D; Gaspar, C; Gavardi, L; Gazzoni, G; Gerick, D; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianì, S; Gibson, V; Girard, O G; Giubega, L; Gizdov, K; Gligorov, V V; Golubkov, D; Golutvin, A; Gomes, A; Gorelov, I V; Gotti, C; Govorkova, E; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graverini, E; Graziani, G; Grecu, A; Griffith, P; Grillo, L; Gruberg Cazon, B R; Grünberg, O; Gushchin, E; Guz, Yu; Gys, T; Göbel, C; Hadavizadeh, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Haines, S C; Hall, S; Hamilton, B; Han, X; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hatch, M; He, J; Head, T; Heister, A; Hennessy, K; Henrard, P; Henry, L; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hombach, C; Hopchev, H; Hulsbergen, W; Humair, T; Hushchyn, M; 
Hussain, N; Hutchcroft, D; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jalocha, J; Jans, E; Jawahery, A; Jiang, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kandybei, S; Kanso, W; Karacson, M; Kariuki, J M; Karodia, S; Kecke, M; Kelsey, M; Kenyon, I R; Kenzie, M; Ketel, T; Khairullin, E; Khanji, B; Khurewathanakul, C; Kirn, T; Klaver, S; Klimaszewski, K; Koliiev, S; Kolpin, M; Komarov, I; Koopman, R F; Koppenburg, P; Kosmyntseva, A; Kozachuk, A; Kozeiha, M; Kravchuk, L; Kreplin, K; Kreps, M; Krokovny, P; Kruse, F; Krzemien, W; Kucewicz, W; Kucharczyk, M; Kudryavtsev, V; Kuonen, A K; Kurek, K; Kvaratskheliya, T; Lacarrere, D; Lafferty, G; Lai, A; Lanfranchi, G; Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Leflat, A; Lefrançois, J; Lefèvre, R; Lemaitre, F; Lemos Cid, E; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Likhomanenko, T; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, X; Loh, D; Longstaff, I; Lopes, J H; Lucchesi, D; Lucio Martinez, M; Luo, H; Lupato, A; Luppi, E; Lupton, O; Lusiani, A; Lyu, X; Machefert, F; Maciuc, F; Maev, O; Maguire, K; Malde, S; Malinin, A; Maltsev, T; Manca, G; Mancinelli, G; Manning, P; Maratas, J; Marchand, J F; Marconi, U; Marin Benito, C; Marino, P; Marks, J; Martellotti, G; Martin, M; Martinelli, M; Martinez Santos, D; Martinez Vidal, F; Martins Tostes, D; Massacrier, L M; Massafferri, A; Matev, R; Mathad, A; Mathe, Z; Matteuzzi, C; Mauri, A; Maurin, B; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; Meadows, B; Meier, F; Meissner, M; Melnychuk, D; Merk, M; Merli, A; Michielin, E; Milanes, D A; Minard, M-N; Mitzel, D S; Mogini, A; Molina Rodriguez, J; Monroy, I A; Monteil, S; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Moron, J; Morris, A B; Mountain, R; Muheim, F; Mulder, M; Mussini, M; Müller, D; Müller, J; Müller, K; Müller, V; Naik, P; Nakada, T; Nandakumar, R; Nandi, A; Nasteva, I; Needham, M; Neri, N; Neubert, S; Neufeld, N; Neuner, M; Nguyen, A D; Nguyen, 
T D; Nguyen-Mau, C; Nieswand, S; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; O'Hanlon, D P; Oblakowska-Mucha, A; Obraztsov, V; Ogilvy, S; Oldeman, R; Onderwater, C J G; Otalora Goicochea, J M; Otto, A; Owen, P; Oyanguren, A; Pais, P R; Palano, A; Palombo, F; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Pappalardo, L L; Parker, W; Parkes, C; Passaleva, G; Pastore, A; Patel, G D; Patel, M; Patrignani, C; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perret, P; Pescatore, L; Petridis, K; Petrolini, A; Petrov, A; Petruzzo, M; Picatoste Olloqui, E; Pietrzyk, B; Pikies, M; Pinci, D; Pistone, A; Piucci, A; Playfer, S; Plo Casasus, M; Poikela, T; Polci, F; Poluektov, A; Polyakov, I; Polycarpo, E; Pomery, G J; Popov, A; Popov, D; Popovici, B; Poslavskii, S; Potterat, C; Price, E; Price, J D; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; Quagliani, R; Rachwal, B; Rademacker, J H; Rama, M; Ramos Pernas, M; Rangel, M S; Raniuk, I; Ratnikov, F; Raven, G; Redi, F; Reichert, S; Dos Reis, A C; Remon Alepuz, C; Renaudin, V; Ricciardi, S; Richards, S; Rihl, M; Rinnert, K; Rives Molina, V; Robbe, P; Rodrigues, A B; Rodrigues, E; Rodriguez Lopez, J A; Rodriguez Perez, P; Rogozhnikov, A; Roiser, S; Rollings, A; Romanovskiy, V; Romero Vidal, A; Ronayne, J W; Rotondo, M; Rudolph, M S; Ruf, T; Ruiz Valls, P; Saborido Silva, J J; Sadykhov, E; Sagidova, N; Saitta, B; Salustino Guimaraes, V; Sanchez Mayordomo, C; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santimaria, M; Santovetti, E; Sarti, A; Satriano, C; Satta, A; Saunders, D M; Savrina, D; Schael, S; Schellenberg, M; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmelzer, T; Schmidt, B; Schneider, O; Schopper, A; Schubert, K; Schubiger, M; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Semennikov, A; Sergi, A; Serra, N; Serrano, J; Sestini, L; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; Shevchenko, V; 
Siddi, B G; Silva Coutinho, R; Silva de Oliveira, L; Simi, G; Simone, S; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, E; Smith, I T; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Souza De Paula, B; Spaan, B; Spradlin, P; Sridharan, S; Stagni, F; Stahl, M; Stahl, S; Stefko, P; Stefkova, S; Steinkamp, O; Stemmle, S; Stenyakin, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Sun, L; Sutcliffe, W; Swientek, K; Syropoulos, V; Szczekowski, M; Szumlak, T; T'Jampens, S; Tayduganov, A; Tekampe, T; Tellarini, G; Teubert, F; Thomas, E; van Tilburg, J; Tilley, M J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Toriello, F; Tournefier, E; Tourneur, S; Trabelsi, K; Traill, M; Tran, M T; Tresch, M; Trisovic, A; Tsaregorodtsev, A; Tsopelas, P; Tully, A; Tuning, N; Ukleja, A; Ustyuzhanin, A; Uwer, U; Vacca, C; Vagnoni, V; Valassi, A; Valat, S; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vecchi, S; van Veghel, M; Velthuis, J J; Veltri, M; Veneziano, G; Venkateswaran, A; Vernet, M; Vesterinen, M; Viaud, B; Vieira, D; Vieites Diaz, M; Viemann, H; Vilasis-Cardona, X; Vitti, M; Volkov, V; Vollhardt, A; Voneki, B; Vorobyev, A; Vorobyev, V; Voß, C; de Vries, J A; Vázquez Sierra, C; Waldi, R; Wallace, C; Wallace, R; Walsh, J; Wang, J; Ward, D R; Wark, H M; Watson, N K; Websdale, D; Weiden, A; Whitehead, M; Wicht, J; Wilkinson, G; Wilkinson, M; Williams, M; Williams, M P; Williams, M; Williams, T; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wraight, K; Wyllie, K; Xie, Y; Xing, Z; Xu, Z; Yang, Z; Yin, H; Yu, J; Yuan, X; Yushchenko, O; Zarebski, K A; Zavertyaev, M; Zhang, L; Zhang, Y; Zhang, Y; Zhelezov, A; Zheng, Y; Zhokhov, A; Zhu, X; Zhukov, V; Zucchelli, S

2017-01-01

The decays [Formula: see text] and [Formula: see text] are observed for the first time using a data sample corresponding to an integrated luminosity of 3.0 fb[Formula: see text], collected by the LHCb experiment in proton-proton collisions at the centre-of-mass energies of 7 and 8[Formula: see text]. The branching fractions relative to that of [Formula: see text] are measured to be [Formula: see text], where the first uncertainties are statistical and the second are systematic.
THE CATEGORY „HOME” IN THE ANTROPOLOGICAL SPACE OF CULTURE

Directory of Open Access Journals (Sweden)

COMENDANT TATIANA

2015-09-01

Full Text Available The present work considers the category „Home” in the anthropological space of culture. The authors analyze the typology of human nature within the cultural-historical space. The article also underlines the semantic recurrence of the archetypes of the category „Home” in the archaic forms of social conscience such as: mythology, folklore etc. Special attention is given to the treatment of this category in holy religious texts. Emphasis is laid on the characteristic features of the process of modifying the category „Home” in contemporary reality.
SparkText: Biomedical Text Mining on Big Data Framework.

Science.gov (United States)

Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M

Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

Text analysis methods, text analysis apparatuses, and articles of manufacture

Science.gov (United States)

Whitney, Paul D; Willse, Alan R; Lopresti, Charles A; White, Amanda M

2014-10-28

Text analysis methods, text analysis apparatuses, and articles of manufacture are described according to some aspects. In one aspect, a text analysis method includes accessing information indicative of data content of a collection of text comprising a plurality of different topics, using a computing device, analyzing the information indicative of the data content, and using results of the analysis, identifying a presence of a new topic in the collection of text.
Measurement of [Formula: see text] polarisation in [Formula: see text] collisions at [Formula: see text] = 7 TeV.

Science.gov (United States)

Aaij, R; Adeva, B; Adinolfi, M; Affolder, A; Ajaltouni, Z; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Alvarez Cartelle, P; Alves, A A; Amato, S; Amerio, S; Amhis, Y; An, L; Anderlini, L; Anderson, J; Andreassen, R; Andreotti, M; Andrews, J E; Appleby, R B; Aquines Gutierrez, O; Archilli, F; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Bachmann, S; Back, J J; Badalov, A; Balagura, V; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Batozskaya, V; Bauer, Th; Bay, A; Beddow, J; Bedeschi, F; Bediaga, I; Belogurov, S; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bettler, M-O; van Beuzekom, M; Bien, A; Bifani, S; Bird, T; Bizzeti, A; Bjørnstad, P M; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Bondar, A; Bondar, N; Bonivento, W; Borghi, S; Borgia, A; Borsato, M; Bowcock, T J V; Bowen, E; Bozzi, C; Brambach, T; van den Brand, J; Bressieux, J; Brett, D; Britsch, M; Britton, T; Brook, N H; Brown, H; Bursche, A; Busetto, G; Buytaert, J; Cadeddu, S; Calabrese, R; Callot, O; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carranza-Mejia, H; Carson, L; Carvalho Akiba, K; Casse, G; Cassina, L; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cenci, R; Charles, M; Charpentier, Ph; Cheung, S-F; Chiapolini, N; Chrzaszcz, M; Ciba, K; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coca, C; Coco, V; Cogan, J; Cogneras, E; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombes, M; Coquereau, S; Corti, G; Corvo, M; Counts, I; Couturier, B; Cowan, G A; Craik, D C; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; Dalseno, J; David, P; David, P N Y; Davis, A; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Silva, W; De Simone, P; Decamp, D; Deckenhoff, M; Del Buono, L; Déléage, N; Derkach, D; Deschamps, O; Dettori, F; Di Canto, A; Dijkstra, H; Donleavy, S; 
Dordei, F; Dorigo, M; Dosil Suárez, A; Dossett, D; Dovbnya, A; Dupertuis, F; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Easo, S; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; El Rifai, I; Elsasser, Ch; Esen, S; Evans, T; Falabella, A; Färber, C; Farinelli, C; Farry, S; Ferguson, D; Fernandez Albor, V; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fiore, M; Fiorini, M; Firlej, M; Fitzpatrick, C; Fiutowski, T; Fontana, M; Fontanelli, F; Forty, R; Francisco, O; Frank, M; Frei, C; Frosini, M; Fu, J; Furfaro, E; Gallas Torreira, A; Galli, D; Gandelman, M; Gandini, P; Gao, Y; Garofoli, J; Garra Tico, J; Garrido, L; Gaspar, C; Gauld, R; Gavardi, L; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianelle, A; Giani, S; Gibson, V; Giubega, L; Gligorov, V V; Göbel, C; Golubkov, D; Golutvin, A; Gomes, A; Gordon, H; Gotti, C; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graziani, G; Grecu, A; Greening, E; Gregson, S; Griffith, P; Grillo, L; Grünberg, O; Gui, B; Gushchin, E; Guz, Yu; Gys, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Haines, S C; Hall, S; Hamilton, B; Hampson, T; Han, X; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hartmann, T; He, J; Head, T; Heijne, V; Hennessy, K; Henrard, P; Henry, L; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hoballah, M; Hombach, C; Hulsbergen, W; Hunt, P; Hussain, N; Hutchcroft, D; Hynds, D; Iakovenko, V; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jalocha, J; Jans, E; Jaton, P; Jawahery, A; Jezabek, M; Jing, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kaballo, M; Kandybei, S; Kanso, W; Karacson, M; Karbach, T M; Kelsey, M; Kenyon, I R; Ketel, T; Khanji, B; Khurewathanakul, C; Klaver, S; Kochebina, O; Kolpin, M; Komarov, I; Koopman, R F; Koppenburg, P; Korolev, M; Kozlinskiy, A; Kravchuk, L; Kreplin, K; Kreps, M; Krocker, G; Krokovny, P; Kruse, F; Kucharczyk, M; Kudryavtsev, V; Kurek, K; 
Kvaratskheliya, T; La Thi, V N; Lacarrere, D; Lafferty, G; Lai, A; Lambert, D; Lambert, R W; Lanciotti, E; Lanfranchi, G; Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Lefèvre, R; Leflat, A; Lefrançois, J; Leo, S; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Liles, M; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, G; Lohn, S; Longstaff, I; Longstaff, I; Lopes, J H; Lopez-March, N; Lowdon, P; Lu, H; Lucchesi, D; Luisier, J; Luo, H; Lupato, A; Luppi, E; Lupton, O; Machefert, F; Machikhiliyan, I V; Maciuc, F; Maev, O; Malde, S; Manca, G; Mancinelli, G; Manzali, M; Maratas, J; Marchand, J F; Marconi, U; Marino, P; Märki, R; Marks, J; Martellotti, G; Martens, A; Martín Sánchez, A; Martinelli, M; Martinez Santos, D; Martinez Vidal, F; Martins Tostes, D; Massafferri, A; Matev, R; Mathe, Z; Matteuzzi, C; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; McSkelly, B; Meadows, B; Meier, F; Meissner, M; Merk, M; Milanes, D A; Minard, M-N; Molina Rodriguez, J; Monteil, S; Moran, D; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Moron, J; Mountain, R; Muheim, F; Müller, K; Muresan, R; Muster, B; Naik, P; Nakada, T; Nandakumar, R; Nasteva, I; Needham, M; Neri, N; Neubert, S; Neufeld, N; Neuner, M; Nguyen, A D; Nguyen, T D; Nguyen-Mau, C; Nicol, M; Niess, V; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; Oblakowska-Mucha, A; Obraztsov, V; Oggero, S; Ogilvy, S; Okhrimenko, O; Oldeman, R; Onderwater, G; Orlandea, M; Otalora Goicochea, J M; Owen, P; Oyanguren, A; Pal, B K; Palano, A; Palombo, F; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Parkes, C; Parkinson, C J; Passaleva, G; Patel, G D; Patel, M; Patrignani, C; Pazos Alvarez, A; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perez Trigo, E; Perret, P; Perrin-Terrin, M; Pescatore, L; Pesen, E; Petridis, K; Petrolini, A; Picatoste Olloqui, E; Pietrzyk, B; Pilař, T; Pinci, D; Pistone, A; Playfer, S; Plo Casasus, M; Polci, F; Polok, G; Poluektov, A; Polycarpo, 
E; Popov, A; Popov, D; Popovici, B; Potterat, C; Powell, A; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; Rachwal, B; Rademacker, J H; Rakotomiaramanana, B; Rama, M; Rangel, M S; Raniuk, I; Rauschmayr, N; Raven, G; Redford, S; Reichert, S; Reid, M M; Dos Reis, A C; Ricciardi, S; Richards, A; Rinnert, K; Rives Molina, V; Roa Romero, D A; Robbe, P; Rodrigues, A B; Rodrigues, E; Rodriguez Perez, P; Roiser, S; Romanovsky, V; Romero Vidal, A; Rotondo, M; Rouvinet, J; Ruf, T; Ruffini, F; Ruiz, H; Ruiz Valls, P; Sabatino, G; Saborido Silva, J J; Sagidova, N; Sail, P; Saitta, B; Salustino Guimaraes, V; Sanchez Mayordomo, C; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santovetti, E; Sapunov, M; Sarti, A; Satriano, C; Satta, A; Savrie, M; Savrina, D; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmidt, B; Schneider, O; Schopper, A; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Seco, M; Semennikov, A; Senderowska, K; Sepp, I; Serra, N; Serrano, J; Sestini, L; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; Shevchenko, V; Shires, A; Silva Coutinho, R; Simi, G; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, N A; Smith, E; Smith, E; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Soomro, F; Souza, D; Souza De Paula, B; Spaan, B; Sparkes, A; Spinella, F; Spradlin, P; Stagni, F; Stahl, S; Steinkamp, O; Stenyakin, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Stroili, R; Subbiah, V K; Sun, L; Sutcliffe, W; Swientek, K; Swientek, S; Syropoulos, V; Szczekowski, M; Szczypka, P; Szilard, D; Szumlak, T; T'Jampens, S; Teklishyn, M; Tellarini, G; Teodorescu, E; Teubert, F; Thomas, C; Thomas, E; van Tilburg, J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Torr, N; Tournefier, E; Tourneur, S; Tran, M T; Tresch, M; Tsaregorodtsev, A; Tsopelas, P; Tuning, N; Ubeda Garcia, M; Ukleja, A; Ustyuzhanin, A; 
Uwer, U; Vagnoni, V; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vázquez Sierra, C; Vecchi, S; Velthuis, J J; Veltri, M; Veneziano, G; Vesterinen, M; Viaud, B; Vieira, D; Vieites Diaz, M; Vilasis-Cardona, X; Vollhardt, A; Volyanskyy, D; Voong, D; Vorobyev, A; Vorobyev, V; Voß, C; Voss, H; de Vries, J A; Waldi, R; Wallace, C; Wallace, R; Walsh, J; Wandernoth, S; Wang, J; Ward, D R; Watson, N K; Webber, A D; Websdale, D; Whitehead, M; Wicht, J; Wiedner, D; Wiggers, L; Wilkinson, G; Williams, M P; Williams, M; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wright, S; Wu, S; Wyllie, K; Xie, Y; Xing, Z; Xu, Z; Yang, Z; Yuan, X; Yushchenko, O; Zangoli, M; Zavertyaev, M; Zhang, F; Zhang, L; Zhang, W C; Zhang, Y; Zhelezov, A; Zhokhov, A; Zhong, L; Zvyagin, A

The polarisation of prompt [Formula: see text] mesons is measured by performing an angular analysis of [Formula: see text] decays using proton-proton collision data, corresponding to an integrated luminosity of 1.0[Formula: see text], collected by the LHCb detector at a centre-of-mass energy of 7 TeV. The polarisation is measured in bins of transverse momentum [Formula: see text] and rapidity [Formula: see text] in the kinematic region [Formula: see text] and [Formula: see text], and is compared to theoretical models. No significant polarisation is observed.
Crowdfunding: entre as Multidões e as Corporações

Directory of Open Access Journals (Sweden)

Erick Felinto

2013-01-01

Full Text Available This article examines the practices of crowdfunding and crowdsourcing in the context of so-called Web 2.0. Through a philosophical and sociological exploration of the notions of crowd and individual, we investigate the ideological tensions surrounding these practices, which are seen sometimes as libertarian and sometimes as conservative. The article discusses case studies that help illustrate the contradictory aspects of crowdfunding.
Corpora and Cultural Cognition

DEFF Research Database (Denmark)

Jensen, Kim Ebensgaard

2017-01-01

Cultural cognition is, to a great extent, transmitted through language and, consequently, reflected and replicated in language use. Cultural cognition may be instantiated in various patterns of language use, such as the discursive behavior of constructions. Very often, such instantiations can be ...... is addressed. In the third part of the chapter, three case studies are presented – one from Danish and two from English – to illustrate the analysis of cultural conceptualization via corpus-linguistic techniques....
Production of K[Formula: see text](892)[Formula: see text] and [Formula: see text](1020) in p-Pb collisions at [Formula: see text] = 5.02 TeV.

Science.gov (United States)

Adam, J; Adamová, D; Aggarwal, M M; Aglieri Rinella, G; Agnello, M; Agrawal, N; Ahammed, Z; Ahmad, S; Ahn, S U; Aiola, S; Akindinov, A; Alam, S N; Aleksandrov, D; Alessandro, B; Alexandre, D; Alfaro Molina, R; Alici, A; Alkin, A; Almaraz, J R M; Alme, J; Alt, T; Altinpinar, S; Altsybeev, I; Alves Garcia Prado, C; Andrei, C; Andronic, A; Anguelov, V; Antičić, T; Antinori, F; Antonioli, P; Aphecetche, L; Appelshäuser, H; Arcelli, S; Arnaldi, R; Arnold, O W; Arsene, I C; Arslandok, M; Audurier, B; Augustinus, A; Averbeck, R; Azmi, M D; Badalà, A; Baek, Y W; Bagnasco, S; Bailhache, R; Bala, R; Balasubramanian, S; Baldisseri, A; Baral, R C; Barbano, A M; Barbera, R; Barile, F; Barnaföldi, G G; Barnby, L S; Barret, V; Bartalini, P; Barth, K; Bartke, J; Bartsch, E; Basile, M; Bastid, N; Basu, S; Bathen, B; Batigne, G; Batista Camejo, A; Batyunya, B; Batzing, P C; Bearden, I G; Beck, H; Bedda, C; Behera, N K; Belikov, I; Bellini, F; Bello Martinez, H; Bellwied, R; Belmont, R; Belmont-Moreno, E; Belyaev, V; Benacek, P; Bencedi, G; Beole, S; Berceanu, I; Bercuci, A; Berdnikov, Y; Berenyi, D; Bertens, R A; Berzano, D; Betev, L; Bhasin, A; Bhat, I R; Bhati, A K; Bhattacharjee, B; Bhom, J; Bianchi, L; Bianchi, N; Bianchin, C; Bielčík, J; Bielčíková, J; Bilandzic, A; Biro, G; Biswas, R; Biswas, S; Bjelogrlic, S; Blair, J T; Blau, D; Blume, C; Bock, F; Bogdanov, A; Bøggild, H; Boldizsár, L; Bombara, M; Book, J; Borel, H; Borissov, A; Borri, M; Bossú, F; Botta, E; Bourjau, C; Braun-Munzinger, P; Bregant, M; Breitner, T; Broker, T A; Browning, T A; Broz, M; Brucken, E J; Bruna, E; Bruno, G E; Budnikov, D; Buesching, H; Bufalino, S; Buncic, P; Busch, O; Buthelezi, Z; Butt, J B; Buxton, J T; Caffarri, D; Cai, X; Caines, H; Calero Diaz, L; Caliva, A; Calvo Villar, E; Camerini, P; Carena, F; Carena, W; Carnesecchi, F; Castillo Castellanos, J; Castro, A J; Casula, E A R; Ceballos Sanchez, C; Cerello, P; Cerkala, J; Chang, B; Chapeland, S; Chartier, M; Charvet, J L; Chattopadhyay, S; 
Chattopadhyay, S; Chauvin, A; Chelnokov, V; Cherney, M; Cheshkov, C; Cheynis, B; Chibante Barroso, V; Chinellato, D D; Cho, S; Chochula, P; Choi, K; Chojnacki, M; Choudhury, S; Christakoglou, P; Christensen, C H; Christiansen, P; Chujo, T; Chung, S U; Cicalo, C; Cifarelli, L; Cindolo, F; Cleymans, J; Colamaria, F; Colella, D; Collu, A; Colocci, M; Conesa Balbastre, G; Conesa Del Valle, Z; Connors, M E; Contreras, J G; Cormier, T M; Corrales Morales, Y; Cortés Maldonado, I; Cortese, P; Cosentino, M R; Costa, F; Crochet, P; Cruz Albino, R; Cuautle, E; Cunqueiro, L; Dahms, T; Dainese, A; Danisch, M C; Danu, A; Das, D; Das, I; Das, S; Dash, A; Dash, S; De, S; De Caro, A; de Cataldo, G; de Conti, C; de Cuveland, J; De Falco, A; De Gruttola, D; De Marco, N; De Pasquale, S; Deisting, A; Deloff, A; Dénes, E; Deplano, C; Dhankher, P; Di Bari, D; Di Mauro, A; Di Nezza, P; Diaz Corchero, M A; Dietel, T; Dillenseger, P; Divià, R; Djuvsland, Ø; Dobrin, A; Domenicis Gimenez, D; Dönigus, B; Dordic, O; Drozhzhova, T; Dubey, A K; Dubla, A; Ducroux, L; Dupieux, P; Ehlers, R J; Elia, D; Endress, E; Engel, H; Epple, E; Erazmus, B; Erdemir, I; Erhardt, F; Espagnon, B; Estienne, M; Esumi, S; Eum, J; Evans, D; Evdokimov, S; Eyyubova, G; Fabbietti, L; Fabris, D; Faivre, J; Fantoni, A; Fasel, M; Feldkamp, L; Feliciello, A; Feofilov, G; Ferencei, J; Fernández Téllez, A; Ferreiro, E G; Ferretti, A; Festanti, A; Feuillard, V J G; Figiel, J; Figueredo, M A S; Filchagin, S; Finogeev, D; Fionda, F M; Fiore, E M; Fleck, M G; Floris, M; Foertsch, S; Foka, P; Fokin, S; Fragiacomo, E; Francescon, A; Frankenfeld, U; Fronze, G G; Fuchs, U; Furget, C; Furs, A; Fusco Girard, M; Gaardhøje, J J; Gagliardi, M; Gago, A M; Gallio, M; Gangadharan, D R; Ganoti, P; Gao, C; Garabatos, C; Garcia-Solis, E; Gargiulo, C; Gasik, P; Gauger, E F; Germain, M; Gheata, A; Gheata, M; Ghosh, P; Ghosh, S K; Gianotti, P; Giubellino, P; Giubilato, P; Gladysz-Dziadus, E; Glässel, P; Goméz Coral, D M; Gomez Ramirez, A; Gonzalez, 
V; González-Zamora, P; Gorbunov, S; Görlich, L; Gotovac, S; Grabski, V; Grachov, O A; Graczykowski, L K; Graham, K L; Grelli, A; Grigoras, A; Grigoras, C; Grigoriev, V; Grigoryan, A; Grigoryan, S; Grinyov, B; Grion, N; Gronefeld, J M; Grosse-Oetringhaus, J F; Grossiord, J-Y; Grosso, R; Guber, F; Guernane, R; Guerzoni, B; Gulbrandsen, K; Gunji, T; Gupta, A; Gupta, R; Haake, R; Haaland, Ø; Hadjidakis, C; Haiduc, M; Hamagaki, H; Hamar, G; Hamon, J C; Harris, J W; Harton, A; Hatzifotiadou, D; Hayashi, S; Heckel, S T; Hellbär, E; Helstrup, H; Herghelegiu, A; Herrera Corral, G; Hess, B A; Hetland, K F; Hillemanns, H; Hippolyte, B; Horak, D; Hosokawa, R; Hristov, P; Huang, M; Humanic, T J; Hussain, N; Hussain, T; Hutter, D; Hwang, D S; Ilkaev, R; Inaba, M; Incani, E; Ippolitov, M; Irfan, M; Ivanov, M; Ivanov, V; Izucheev, V; Jacazio, N; Jacobs, P M; Jadhav, M B; Jadlovska, S; Jadlovsky, J; Jahnke, C; Jakubowska, M J; Jang, H J; Janik, M A; Jayarathna, P H S Y; Jena, C; Jena, S; Jimenez Bustamante, R T; Jones, P G; Jusko, A; Kalinak, P; Kalweit, A; Kamin, J; Kang, J H; Kaplin, V; Kar, S; Karasu Uysal, A; Karavichev, O; Karavicheva, T; Karayan, L; Karpechev, E; Kebschull, U; Keidel, R; Keijdener, D L D; Keil, M; Mohisin Khan, M; Khan, P; Khan, S A; Khanzadeev, A; Kharlov, Y; Kileng, B; Kim, D W; Kim, D J; Kim, D; Kim, H; Kim, J S; Kim, M; Kim, M; Kim, S; Kim, T; Kirsch, S; Kisel, I; Kiselev, S; Kisiel, A; Kiss, G; Klay, J L; Klein, C; Klein, J; Klein-Bösing, C; Klewin, S; Kluge, A; Knichel, M L; Knospe, A G; Kobdaj, C; Kofarago, M; Kollegger, T; Kolojvari, A; Kondratiev, V; Kondratyeva, N; Kondratyuk, E; Konevskikh, A; Kopcik, M; Kostarakis, P; Kour, M; Kouzinopoulos, C; Kovalenko, O; Kovalenko, V; Kowalski, M; Koyithatta Meethaleveedu, G; Králik, I; Kravčáková, A; Kretz, M; Krivda, M; Krizek, F; Kryshen, E; Krzewicki, M; Kubera, A M; Kučera, V; Kuhn, C; Kuijer, P G; Kumar, A; Kumar, J; Kumar, L; Kumar, S; Kurashvili, P; Kurepin, A; Kurepin, A B; Kuryakin, A; Kweon, M J; 
Kwon, Y; La Pointe, S L; La Rocca, P; Ladron de Guevara, P; Lagana Fernandes, C; Lakomov, I; Langoy, R; Lara, C; Lardeux, A; Lattuca, A; Laudi, E; Lea, R; Leardini, L; Lee, G R; Lee, S; Lehas, F; Lemmon, R C; Lenti, V; Leogrande, E; León Monzón, I; León Vargas, H; Leoncino, M; Lévai, P; Li, S; Li, X; Lien, J; Lietava, R; Lindal, S; Lindenstruth, V; Lippmann, C; Lisa, M A; Ljunggren, H M; Lodato, D F; Loenne, P I; Loginov, V; Loizides, C; Lopez, X; López Torres, E; Lowe, A; Luettig, P; Lunardon, M; Luparello, G; Lutz, T H; Maevskaya, A; Mager, M; Mahajan, S; Mahmood, S M; Maire, A; Majka, R D; Malaev, M; Maldonado Cervantes, I; Malinina, L; Mal'Kevich, D; Malzacher, P; Mamonov, A; Manko, V; Manso, F; Manzari, V; Marchisone, M; Mareš, J; Margagliotti, G V; Margotti, A; Margutti, J; Marín, A; Markert, C; Marquard, M; Martin, N A; Martin Blanco, J; Martinengo, P; Martínez, M I; Martínez García, G; Martinez Pedreira, M; Mas, A; Masciocchi, S; Masera, M; Masoni, A; Massacrier, L; Mastroserio, A; Matyja, A; Mayer, C; Mazer, J; Mazzoni, M A; Mcdonald, D; Meddi, F; Melikyan, Y; Menchaca-Rocha, A; Meninno, E; Mercado Pérez, J; Meres, M; Miake, Y; Mieskolainen, M M; Mikhaylov, K; Milano, L; Milosevic, J; Minervini, L M; Mischke, A; Mishra, A N; Miśkowiec, D; Mitra, J; Mitu, C M; Mohammadi, N; Mohanty, B; Molnar, L; Montaño Zetina, L; Montes, E; Moreira De Godoy, D A; Moreno, L A P; Moretto, S; Morreale, A; Morsch, A; Muccifora, V; Mudnic, E; Mühlheim, D; Muhuri, S; Mukherjee, M; Mulligan, J D; Munhoz, M G; Munzer, R H; Murakami, H; Murray, S; Musa, L; Musinsky, J; Naik, B; Nair, R; Nandi, B K; Nania, R; Nappi, E; Naru, M U; Natal da Luz, H; Nattrass, C; Navarro, S R; Nayak, K; Nayak, R; Nayak, T K; Nazarenko, S; Nedosekin, A; Nellen, L; Ng, F; Nicassio, M; Niculescu, M; Niedziela, J; Nielsen, B S; Nikolaev, S; Nikulin, S; Nikulin, V; Noferini, F; Nomokonov, P; Nooren, G; Noris, J C C; Norman, J; Nyanin, A; Nystrand, J; Oeschler, H; Oh, S; Oh, S K; Ohlson, A; Okatan, A; Okubo, 
T; Olah, L; Oleniacz, J; Oliveira Da Silva, A C; Oliver, M H; Onderwaater, J; Oppedisano, C; Orava, R; Ortiz Velasquez, A; Oskarsson, A; Otwinowski, J; Oyama, K; Ozdemir, M; Pachmayer, Y; Pagano, P; Paić, G; Pal, S K; Pan, J; Pandey, A K; Papikyan, V; Pappalardo, G S; Pareek, P; Park, W J; Parmar, S; Passfeld, A; Paticchio, V; Patra, R N; Paul, B; Pei, H; Peitzmann, T; Pereira Da Costa, H; Peresunko, D; Pérez Lara, C E; Perez Lezama, E; Peskov, V; Pestov, Y; Petráček, V; Petrov, V; Petrovici, M; Petta, C; Piano, S; Pikna, M; Pillot, P; Pimentel, L O D L; Pinazza, O; Pinsky, L; Piyarathna, D B; Płoskoń, M; Planinic, M; Pluta, J; Pochybova, S; Podesta-Lerma, P L M; Poghosyan, M G; Polichtchouk, B; Poljak, N; Poonsawat, W; Pop, A; Porteboeuf-Houssais, S; Porter, J; Pospisil, J; Prasad, S K; Preghenella, R; Prino, F; Pruneau, C A; Pshenichnov, I; Puccio, M; Puddu, G; Pujahari, P; Punin, V; Putschke, J; Qvigstad, H; Rachevski, A; Raha, S; Rajput, S; Rak, J; Rakotozafindrabe, A; Ramello, L; Rami, F; Raniwala, R; Raniwala, S; Räsänen, S S; Rascanu, B T; Rathee, D; Read, K F; Redlich, K; Reed, R J; Rehman, A; Reichelt, P; Reidt, F; Ren, X; Renfordt, R; Reolon, A R; Reshetin, A; Revol, J-P; Reygers, K; Riabov, V; Ricci, R A; Richert, T; Richter, M; Riedler, P; Riegler, W; Riggi, F; Ristea, C; Rocco, E; Rodríguez Cahuantzi, M; Rodriguez Manso, A; Røed, K; Rogochaya, E; Rohr, D; Röhrich, D; Romita, R; Ronchetti, F; Ronflette, L; Rosnet, P; Rossi, A; Roukoutakis, F; Roy, A; Roy, C; Roy, P; Rubio Montero, A J; Rui, R; Russo, R; Ryabinkin, E; Ryabov, Y; Rybicki, A; Sadovsky, S; Šafařík, K; Sahlmuller, B; Sahoo, P; Sahoo, R; Sahoo, S; Sahu, P K; Saini, J; Sakai, S; Saleh, M A; Salzwedel, J; Sambyal, S; Samsonov, V; Šándor, L; Sandoval, A; Sano, M; Sarkar, D; Sarma, P; Scapparone, E; Scarlassara, F; Schiaua, C; Schicker, R; Schmidt, C; Schmidt, H R; Schuchmann, S; Schukraft, J; Schulc, M; Schuster, T; Schutz, Y; Schwarz, K; Schweda, K; Scioli, G; Scomparin, E; Scott, R; Šefčík, M; 
Seger, J E; Sekiguchi, Y; Sekihata, D; Selyuzhenkov, I; Senosi, K; Senyukov, S; Serradilla, E; Sevcenco, A; Shabanov, A; Shabetai, A; Shadura, O; Shahoyan, R; Shangaraev, A; Sharma, A; Sharma, M; Sharma, M; Sharma, N; Shigaki, K; Shtejer, K; Sibiriak, Y; Siddhanta, S; Sielewicz, K M; Siemiarczuk, T; Silvermyr, D; Silvestre, C; Simatovic, G; Simonetti, G; Singaraju, R; Singh, R; Singha, S; Singhal, V; Sinha, B C; Sinha, T; Sitar, B; Sitta, M; Skaali, T B; Slupecki, M; Smirnov, N; Snellings, R J M; Snellman, T W; Søgaard, C; Song, J; Song, M; Song, Z; Soramel, F; Sorensen, S; Souza, R D de; Sozzi, F; Spacek, M; Spiriti, E; Sputowska, I; Spyropoulou-Stassinaki, M; Stachel, J; Stan, I; Stankus, P; Stefanek, G; Stenlund, E; Steyn, G; Stiller, J H; Stocco, D; Strmen, P; Suaide, A A P; Sugitate, T; Suire, C; Suleymanov, M; Suljic, M; Sultanov, R; Šumbera, M; Szabo, A; Szanto de Toledo, A; Szarka, I; Szczepankiewicz, A; Szymanski, M; Tabassam, U; Takahashi, J; Tambave, G J; Tanaka, N; Tangaro, M A; Tarhini, M; Tariq, M; Tarzila, M G; Tauro, A; Tejeda Muñoz, G; Telesca, A; Terasaki, K; Terrevoli, C; Teyssier, B; Thäder, J; Thomas, D; Tieulent, R; Timmins, A R; Toia, A; Trogolo, S; Trombetta, G; Trubnikov, V; Trzaska, W H; Tsuji, T; Tumkin, A; Turrisi, R; Tveter, T S; Ullaland, K; Uras, A; Usai, G L; Utrobicic, A; Vajzer, M; Vala, M; Valencia Palomo, L; Vallero, S; Van Der Maarel, J; Van Hoorne, J W; van Leeuwen, M; Vanat, T; Vande Vyvre, P; Varga, D; Vargas, A; Vargyas, M; Varma, R; Vasileiou, M; Vasiliev, A; Vauthier, A; Vechernin, V; Veen, A M; Veldhoen, M; Velure, A; Venaruzzo, M; Vercellin, E; Vergara Limón, S; Vernet, R; Verweij, M; Vickovic, L; Viesti, G; Viinikainen, J; Vilakazi, Z; Villalobos Baillie, O; Villatoro Tello, A; Vinogradov, A; Vinogradov, L; Vinogradov, Y; Virgili, T; Vislavicius, V; Viyogi, Y P; Vodopyanov, A; Völkl, M A; Voloshin, K; Voloshin, S A; Volpe, G; von Haller, B; Vorobyev, I; Vranic, D; Vrláková, J; Vulpescu, B; Wagner, B; Wagner, J; Wang, H; 
Wang, M; Watanabe, D; Watanabe, Y; Weber, M; Weber, S G; Weiser, D F; Wessels, J P; Westerhoff, U; Whitehead, A M; Wiechula, J; Wikne, J; Wilk, G; Wilkinson, J; Williams, M C S; Windelband, B; Winn, M; Yang, H; Yang, P; Yano, S; Yasar, C; Yin, Z; Yokoyama, H; Yoo, I-K; Yoon, J H; Yurchenko, V; Yushmanov, I; Zaborowska, A; Zaccolo, V; Zaman, A; Zampolli, C; Zanoli, H J C; Zaporozhets, S; Zardoshti, N; Zarochentsev, A; Závada, P; Zaviyalov, N; Zbroszczyk, H; Zgura, I S; Zhalov, M; Zhang, H; Zhang, X; Zhang, Y; Zhang, C; Zhang, Z; Zhao, C; Zhigareva, N; Zhou, D; Zhou, Y; Zhou, Z; Zhu, H; Zhu, J; Zichichi, A; Zimmermann, A; Zimmermann, M B; Zinovjev, G; Zyzak, M

The production of K[Formula: see text](892)[Formula: see text] and [Formula: see text](1020) mesons has been measured in p-Pb collisions at [Formula: see text][Formula: see text] 5.02 TeV. K[Formula: see text] and [Formula: see text] are reconstructed via their decay into charged hadrons with the ALICE detector in the rapidity range [Formula: see text]. The transverse momentum spectra, measured as a function of the multiplicity, have a p[Formula: see text] range from 0 to 15 GeV/c for K[Formula: see text] and from 0.3 to 21 GeV/c for [Formula: see text]. Integrated yields, mean transverse momenta and particle ratios are reported and compared with results in pp collisions at [Formula: see text][Formula: see text] 7 TeV and Pb-Pb collisions at [Formula: see text][Formula: see text] 2.76 TeV. In Pb-Pb and p-Pb collisions, K[Formula: see text] and [Formula: see text] probe the hadronic phase of the system and contribute to the study of particle formation mechanisms by comparison with other identified hadrons. For this purpose, the mean transverse momenta and the differential proton-to-[Formula: see text] ratio are discussed as a function of the multiplicity of the event. The short-lived K[Formula: see text] is measured to investigate re-scattering effects, believed to be related to the size of the system and to the lifetime of the hadronic phase.
Computer Learner Corpora: Analysing Interlanguage Errors in Synchronous and Asynchronous Communication

Science.gov (United States)

MacDonald, Penny; Garcia-Carbonell, Amparo; Carot, Sierra, Jose Miguel

2013-01-01

This study focuses on the computer-aided analysis of interlanguage errors made by the participants in the telematic simulation IDEELS (Intercultural Dynamics in European Education through on-Line Simulation). The synchronous and asynchronous communication analysed was part of the MiLC Corpus, a multilingual learner corpus of texts written by…
SparkText: Biomedical Text Mining on Big Data Framework

Science.gov (United States)

He, Karen Y.; Wang, Kai

2016-01-01

Background: Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. Results: In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. Conclusions: This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research. PMID:27685652
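The classification step described in this abstract can be illustrated with a minimal sketch of one of the three named methods, multinomial Naïve Bayes with add-one smoothing. This is not SparkText's code and does not use Apache Spark or Cassandra; the function names and the six toy "abstracts" are invented purely for illustration.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace; a real pipeline would do far more."""
    return text.lower().split()


def train_naive_bayes(labelled_docs):
    """Collect per-class word counts and document counts from (text, label) pairs."""
    word_counts = defaultdict(Counter)
    doc_counts = Counter()
    vocabulary = set()
    for text, label in labelled_docs:
        tokens = tokenize(text)
        word_counts[label].update(tokens)
        doc_counts[label] += 1
        vocabulary.update(tokens)
    return word_counts, doc_counts, vocabulary


def predict(model, text):
    """Return the class with the highest log-posterior under add-one smoothing."""
    word_counts, doc_counts, vocabulary = model
    total_docs = sum(doc_counts.values())
    best_label, best_score = None, -math.inf
    for label in doc_counts:
        score = math.log(doc_counts[label] / total_docs)  # log prior
        class_total = sum(word_counts[label].values())
        for token in tokenize(text):
            if token not in vocabulary:
                continue  # ignore words never seen in training
            count = word_counts[label][token]
            score += math.log((count + 1) / (class_total + len(vocabulary)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Hypothetical toy data, not taken from PubMed or from the SparkText study.
TOY_ABSTRACTS = [
    ("brca1 mutation mammary tumor screening", "breast"),
    ("mammography screening brca2 mutation", "breast"),
    ("psa level prostate biopsy androgen", "prostate"),
    ("androgen deprivation prostate tumor", "prostate"),
    ("smoking lung nodule egfr mutation", "lung"),
    ("egfr inhibitor lung adenocarcinoma smoking", "lung"),
]

model = train_naive_bayes(TOY_ABSTRACTS)
print(predict(model, "egfr mutation in smoking patients"))
```

At the scale reported in the abstract (tens of thousands of full-text articles), the counting and scoring would be distributed across a cluster; the sketch only shows the per-document arithmetic.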
Text against Text: Counterbalancing the Hegemony of Assessment.

Science.gov (United States)

Cosgrove, Cornelius

A study examined whether composition specialists can counterbalance the potential privileging of the assessment perspective, or of self-appointed interpreters of that perspective, through the study of assessment discourse as text. Fourteen assessment texts were examined, most of them journal articles and most of them featuring the common…
Predicting Prosody from Text for Text-to-Speech Synthesis

CERN Document Server

Rao, K Sreenivasa

2012-01-01

Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications. Author K. Sreenivasa Rao discusses proposed methods along with state-of-the-art techniques for the acquisition and incorporation of prosodic knowledge for developing speech systems. Positional, contextual and phonological features are proposed for representing the linguistic and production constraints of the sound units present in the text. This book is intended for graduate students and researchers working in the area of speech processing.
Molecular Imaging on the Cerebral Pathological Damage Target of Ketamine Dependence

Directory of Open Access Journals (Sweden)

YANG Hong-jie; HU Shu; JIA Shao-wei; GAO Zhou; WANG Tong; ZHAO Zheng-qin

2014-02-01

Full Text Available To study the cerebral pathological damage target resulting from ketamine abuse using molecular imaging techniques, 20 ketamine-dependent patients seeking treatment at the Peking University Shenzhen Hospital and 31 healthy volunteers were included in this study; all underwent brain SPECT DAT imaging. The results were analyzed with SPSS 16.0. In healthy volunteers, the bilateral caudate nucleus and putamen were roughly equal in size, and the radioactive distribution of DAT was uniform and symmetrical. The bilateral corpora striata showed the typical “panda eyes” pattern. In ketamine-dependent patients, by contrast, the bilateral corpora striata were smaller and disordered in pattern, and the radioactive distribution of DAT was reduced, defective or even disturbed, with much more non-specific radioactivity. The V, m and Ra of the bilateral corpora striata in ketamine-dependent patients were (21.03±3.15) cm³, (22.08±3.31) g and (5.37±1.08)%, respectively, significantly lower than in the healthy volunteers (p<0.01). The cerebral pathological damage target resulting from ketamine abuse was similar to those of dependence on compound codeine phosphate antitussive solution, heroin and MDMA; all of these psychoactive substances damaged the function of DAT.
Penile Embryology and Anatomy

Directory of Open Access Journals (Sweden)

Jenny H. Yiee

2010-01-01

Full Text Available Knowledge of penile embryology and anatomy is essential to any pediatric urologist in order to fully understand and treat congenital anomalies. Sex differentiation of the external genitalia occurs between the 7th and 17th weeks of gestation. The Y chromosome initiates male differentiation through the SRY gene, which triggers testicular development. Under the influence of androgens produced by the testes, external genitalia then develop into the penis and scrotum. Dorsal nerves supply penile skin sensation and lie within Buck's fascia. These nerves are notably absent at the 12 o'clock position. Perineal nerves supply skin sensation to the ventral shaft skin and frenulum. Cavernosal nerves lie within the corpora cavernosa and are responsible for sexual function. Paired cavernosal, dorsal, and bulbourethral arteries have extensive anastomotic connections. During erection, the cavernosal artery causes engorgement of the cavernosa, while the deep dorsal artery leads to glans enlargement. The majority of venous drainage occurs through a single, deep dorsal vein into which multiple emissary veins from the corpora and circumflex veins from the spongiosum drain. The corpora cavernosa and spongiosum are all made of spongy erectile tissue. Buck's fascia circumferentially envelops all three structures, splitting into two leaves ventrally at the spongiosum. The male urethra is composed of six parts: bladder neck, prostatic, membranous, bulbous, penile, and fossa navicularis. The urethra receives its blood supply from both proximal and distal directions.
VisualUrText: A Text Analytics Tool for Unstructured Textual Data

Science.gov (United States)

Zainol, Zuraini; Jaymes, Mohd T. H.; Nohuddin, Puteri N. E.

2018-05-01

The amount of unstructured text on the Internet is growing tremendously. Text repositories come from Web 2.0, business intelligence and social networking applications. It is also believed that 80-90% of future data growth will take the form of unstructured text databases that may contain interesting patterns and trends. Text mining is a well-known technique for discovering interesting patterns and trends, that is, non-trivial knowledge, in massive unstructured text data. Text mining spans multidisciplinary fields including information retrieval (IR), text analysis, natural language processing (NLP), data mining, machine learning, statistics and computational linguistics. This paper discusses the development of a text analytics tool that extracts, processes and analyzes unstructured text data and visualizes the cleaned text data in multiple forms such as Document Term Matrix (DTM), Frequency Graph, Network Analysis Graph, Word Cloud and Dendrogram. This tool, VisualUrText, is developed to assist students and researchers in extracting interesting patterns and trends in document analyses.
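As a rough illustration of the first output form listed in this abstract, a Document Term Matrix can be built with a few lines of standard-library Python. This is a sketch under simple assumptions (whitespace tokenization, raw counts), not VisualUrText's implementation; `build_dtm` and the sample sentences are hypothetical.

```python
from collections import Counter


def build_dtm(documents):
    """Return (sorted vocabulary, matrix) where matrix[i][j] counts term j in document i."""
    counts = [Counter(doc.lower().split()) for doc in documents]
    vocabulary = sorted(set().union(*counts))
    matrix = [[c[term] for term in vocabulary] for c in counts]
    return vocabulary, matrix


# Hypothetical sample documents.
docs = [
    "text mining finds patterns in text",
    "word clouds show frequent words",
    "mining unstructured text data",
]
vocabulary, dtm = build_dtm(docs)

# Column sums give the corpus-wide term frequencies that a frequency
# graph or word cloud would typically be drawn from.
term_totals = {term: sum(row[j] for row in dtm) for j, term in enumerate(vocabulary)}
print(term_totals["text"], term_totals["mining"])
```

Each row is one document and each column one term, so the same matrix can feed the other visualizations the abstract mentions.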
Layout-aware text extraction from full-text PDF of scientific articles

Directory of Open Access Journals (Sweden)

Ramakrishnan Cartic

2012-05-01

Full Text Available Abstract Background The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the ‘Layout-Aware PDF Text Extraction’ (LA-PDFText) system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Results Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1) Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2) Classifying text blocks into rhetorical categories using a rule-based method and (3) Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision = 0.96% Recall = 0.89% and F1 = 0.91%. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, commonly used to extract text from PDF
Working with text tools, techniques and approaches for text mining

CERN Document Server

Tourte, Gregory J L

2016-01-01

Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTem (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...
Apresentação dos recursos linguísticos para a língua portuguesa e procedimentos da análise de corpus do léxico português

Directory of Open Access Journals (Sweden)

Blažka Müller Pograjc

2007-12-01

Full Text Available There is currently, throughout the world, a growing interest in the creation of linguistic resources, namely large corpora and lexicons, which has been made possible by the extraordinary development of computing and of computer power. These language-specific resources, combined with technologies suited to the extraction of data and knowledge, are indispensable prerequisites for a large body of research work. Corpora offer new ways of studying languages, yielding descriptions, generalizations and theoretical hypotheses of great consistency because they are grounded in empirical data.
Korpora og korpusprogammel i opbygningen af fagordbøger

DEFF Research Database (Denmark)

Weilgaard Christensen, Lotte

1996-01-01

As the use of corpora for terminological purposes has so far received very little attention, my purpose is to present important concepts in corpus linguistics and to discuss their relevance for special language corpora intended for terminology-related data retrieval. Further, some tools for extracting data for terminological purposes are presented. Existing tools seem to be prototypes or they do not meet the requirements which such a terminological tool ought to meet. Nevertheless, having in mind the latest developments in the field, we shall probably before long be presented with tools which do meet those requirements to a much higher degree than the tools which are in the market at present. In my article, I list the requirements which should be kept in mind when building corpora and corpora tools for terminological purposes. At the end of the article, I present some terminological...
Layout-aware text extraction from full-text PDF of scientific articles.

Science.gov (United States)

Ramakrishnan, Cartic; Patnia, Abhishek; Hovy, Eduard; Burns, Gully Apc

2012-05-28

The Portable Document Format (PDF) is the most commonly used file format for online scientific publications. The absence of effective means to extract text from these PDF files in a layout-aware manner presents a significant challenge for developers of biomedical text mining or biocuration informatics systems that use published literature as an information source. In this paper we introduce the 'Layout-Aware PDF Text Extraction' (LA-PDFText) system to facilitate accurate extraction of text from PDF files of research articles for use in text mining applications. Our paper describes the construction and performance of an open source system that extracts text blocks from PDF-formatted full-text research articles and classifies them into logical units based on rules that characterize specific sections. The LA-PDFText system focuses only on the textual content of the research articles and is meant as a baseline for further experiments into more advanced extraction methods that handle multi-modal content, such as images and graphs. The system works in a three-stage process: (1) Detecting contiguous text blocks using spatial layout processing to locate and identify blocks of contiguous text, (2) Classifying text blocks into rhetorical categories using a rule-based method and (3) Stitching classified text blocks together in the correct order resulting in the extraction of text from section-wise grouped blocks. We show that our system can identify text blocks and classify them into rhetorical categories with Precision = 0.96%, Recall = 0.89% and F1 = 0.91%. We also present an evaluation of the accuracy of the block detection algorithm used in step 2. Additionally, we have compared the accuracy of the text extracted by LA-PDFText to the text from the Open Access subset of PubMed Central. We then compared this accuracy with that of the text extracted by the PDF2Text system, commonly used to extract text from PDF. Finally, we discuss preliminary error analysis for
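The three-stage process described in this abstract can be illustrated with a very small sketch. This is not the LA-PDFText implementation: the `Line` and `Block` types, the `max_gap` threshold, and the `SECTION_RULES` table are all hypothetical stand-ins, and block detection is reduced here to merging lines that sit close together vertically on the same page.

```python
from dataclasses import dataclass


@dataclass
class Line:
    page: int
    top: float   # distance from the top of the page (hypothetical units)
    text: str


@dataclass
class Block:
    page: int
    top: float
    text: str
    label: str = "unknown"


# Hypothetical heading-to-category rules; a real rule set would be richer.
SECTION_RULES = {
    "abstract": "abstract",
    "background": "introduction",
    "introduction": "introduction",
    "methods": "methods",
    "results": "results",
    "discussion": "discussion",
}


def detect_blocks(lines, max_gap=14.0):
    """Stage 1: merge vertically adjacent lines on one page into blocks."""
    blocks = []
    prev = None
    for line in sorted(lines, key=lambda l: (l.page, l.top)):
        same_block = (
            prev is not None
            and line.page == prev.page
            and line.top - prev.top <= max_gap
        )
        if same_block:
            blocks[-1].text += " " + line.text
        else:
            blocks.append(Block(line.page, line.top, line.text))
        prev = line
    return blocks


def classify_blocks(blocks):
    """Stage 2: a heading block sets the category of the blocks after it."""
    current = "unknown"
    for block in blocks:
        key = block.text.strip().lower()
        if key in SECTION_RULES:
            current = SECTION_RULES[key]
            block.label = "heading"
        else:
            block.label = current
    return blocks


def stitch(blocks):
    """Stage 3: join body blocks per category, preserving reading order."""
    sections = {}
    for block in blocks:
        if block.label != "heading":
            sections.setdefault(block.label, []).append(block.text)
    return {label: " ".join(parts) for label, parts in sections.items()}


lines = [
    Line(1, 10, "Methods"),
    Line(1, 40, "We parsed"),
    Line(1, 52, "the files."),
    Line(1, 90, "Results"),
    Line(1, 120, "It worked."),
]
sections = stitch(classify_blocks(detect_blocks(lines)))
```

The point of the sketch is the separation of concerns: layout decides what counts as a block, rules decide what a block is, and only then is text concatenated, so a change to the rules never alters block boundaries.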
Folklore Studies on Birth Related Customs within the Banat Community

Directory of Open Access Journals (Sweden)

Alexin Otilia Daniela

2017-12-01

Full Text Available Birth is perceived as a threshold, a milestone, and is best described as passing from one stage to another and from one status to another. This article aims to present the customs regarding the birth of a child, as they were preserved in the Banat folk mentality: the origin of the midwife and her role as mediator, the belief in the unfailing destiny foreseen by the book of fate, the rite of the first bath having a huge importance for the future of the child and a series of magic and religious acts meant to ward off the Evil forces that intend to harm the child and to restore the balance.
Text mining from ontology learning to automated text processing applications

CERN Document Server

Biemann, Chris

2014-01-01

This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects
A Comparison of English and Japanese Proverbs Using Natural Semantic Metalanguage

Directory of Open Access Journals (Sweden)

Miles Neale

2015-06-01

Full Text Available This investigation examines the meaning of semantically similar English and Japanese proverbs. It uses textual data sourced from online corpora to highlight and compare the different cultural and conceptual elements embedded within these proverbs. The findings of this investigation demonstrate that matching proverbs from different languages is a potentially problematic exercise, both in dictionaries and in the second-language classroom.

Cognitive “Boy stories”: urban folklore and urban topographies

Directory of Open Access Journals (Sweden)

Bojan Žikić

2016-02-01

Full Text Available The culturally cognitive perception of Belgrade’s topographies is considered through its deployment, symbolic use and narrative foundation. The explanatory material considered comprises one football-media incident, the use of certain areas of the city in a spectacle-ceremonial manner, knowledge and lore of certain elements of the Belgrade topographies, and the organization of «the football Belgrade». The attitude is taken that the topography of a city is a multifaceted cultural constituent, whose structure of particular meaning, as a part of cultural communication, is determined by the very fact it is an urban space. Physical aspects of spatial-ness are reduced to relationism, i.e. they have meaning for cultural communication only when the elements of urban topographies are brought into correlation. Other characteristics of physical spatial-ness are irrelevant for such communication. Meaning relations in which elements of urban topographies exist are formed on the very fact of them being urban; that is, the aforementioned denotation that is ascribed to space stems from those cultural features and artifacts that are associated in a given milieu with certain concrete elements of urban topographies.
A comparison of the chemical constituents of Barbadian medicinal plants within their respective plant families with established drug compounds and phytochemicals used to treat communicable and non-communicable diseases.

Science.gov (United States)

Cohall, D; Carrington, S

2012-01-01

Barbados has a strong base in the practice of folklore botanical medicines. Consistent with the rest of the Caribbean region, the practice is criticized due to lack of evidence on the efficacy and safety testing. The objectives of this review article are i) to categorize and identify plants by their possible indications and their scientific classification and ii) to determine if the chemical constituents of the plants will be able to provide some insight into their possible uses in folklore medicine based on existing scientific research on their chemical constituents and also by their classification. A review of the folklore botanical medicines of Barbados was done. Plants were primarily grouped based on their use to treat particular communicable and non-communicable diseases. Plants were then secondarily grouped based on their families. The chemical profiles of the plants were then compared to established drug compounds currently approved for the conventional treatment of illnesses and also to established phytochemicals. The extensive literature review identified phytochemical compounds in particular plants used in Barbadian folklore medicine. Sixty-six per cent of reputed medicinal plants contain pharmacologically active phytochemicals; fifty-one per cent of these medicinal plants contain phytochemicals with activities consistent with their reported use. Folklore botanical medicine is well grounded on investigation of the scientific rationale. The research showed that fifty-one per cent of the identified medicinal plants have chemical compounds which have been identified to be responsible for its associated medicinal activity. To a lesser extent, approved drug compounds from drug regulatory bodies with similar chemical structure to the bioactive compounds in the plants proved to validate the use of some of these plants to treat illnesses.
From Text to Political Positions: Text analysis across disciplines

NARCIS (Netherlands)

Kaal, A.R.; Maks, I.; van Elfrinkhof, A.M.E.

2014-01-01

ABSTRACT From Text to Political Positions addresses cross-disciplinary innovation in political text analysis for party positioning. Drawing on political science, computational methods and discourse analysis, it presents a diverse collection of analytical models including pure quantitative and
LocText

DEFF Research Database (Denmark)

Cejuela, Juan Miguel; Vinchurkar, Shrikant; Goldberg, Tatyana

2018-01-01

trees and was trained and evaluated on a newly improved LocTextCorpus. Combined with an automatic named-entity recognizer, LocText achieved high precision (P = 86%±4). After completing development, we mined the latest research publications for three organisms: human (Homo sapiens), budding yeast...
Contextual Text Mining

Science.gov (United States)

Mei, Qiaozhu

2009-01-01

With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…
The Keyword Bank as a tool for finding exclusive keywords in WordSmith Tools

Directory of Open Access Journals (Sweden)

Tony Berber Sardinha

2008-12-01

Full Text Available KeyWords is a very useful program for computer text analysis found in WordSmith Tools. A problem with KeyWords, though, is the large number of keywords returned by the program, which can be at least 500. This paper proposes a procedure for making reductions in lists of keywords based on the concept of exclusive keywords. These are words that are key in the study corpus only, in comparison to lots of others. This procedure draws on the existence of a keyword bank, which is a collection of keywords from several corpora. When contrasted to a study corpus, the keyword bank brings up keywords that are found in the study corpus only, leaving out those that are key in other corpora. This enables the researcher to focus on words that are most typical of his/her own corpus. The analysis reported here, carried out with a large multi-register keyword bank, suggests that the keyword bank achieved its goal, by allowing for a 77% reduction in the total keywords, and by selecting keywords that are most representative of the study corpus in question.
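The exclusive-keyword idea in this abstract amounts to a set difference, which a short sketch can make concrete. The function names, the sample words, and the dictionary layout of the keyword bank are assumptions made for illustration; this is not how WordSmith Tools stores or compares keyword lists.

```python
def exclusive_keywords(study_keywords, keyword_bank):
    """Return the study-corpus keywords that are key in no bank corpus.

    study_keywords: list of keywords from the study corpus.
    keyword_bank:   dict mapping a corpus name to its set of keywords
                    (hypothetical layout).
    """
    banked = set()
    for corpus_keywords in keyword_bank.values():
        banked.update(corpus_keywords)
    # Preserve the original ordering of the study keyword list.
    return [word for word in study_keywords if word not in banked]


def reduction(before, after):
    """Fraction of keywords removed by the filtering step."""
    return 1 - len(after) / len(before)


study = ["the", "goal", "offside", "said"]
bank = {"news": {"said", "the"}, "fiction": {"the"}}

kept = exclusive_keywords(study, bank)
share_removed = reduction(study, kept)
```

With these made-up lists, two of the four study keywords also appear in the bank, so half of the list is removed; the 77% figure reported in the abstract would correspond to the same ratio computed on the real keyword lists.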
Field – Football Expressions Dictionary: a lexicographic resource based on the theoretical-methodological approach of frame semantics and corpus linguistics

Directory of Open Access Journals (Sweden)

Rove Luiza de Oliveira Chishman

2015-01-01

Full Text Available The present article aims at problematizing the relevance of Frame Semantics (Fillmore, 1982) in the development of Field – Dictionary of Football Expressions – whose configuration allows access to football language through expressions or through scenarios – or semantic frames. Frame Semantics, a theory developed in the realm of Cognitive Linguistics, is based on empirical data collected from the analysis of electronic corpora. The extraction of the data presented in this study was done with the Sketch Engine concordancer, while their analysis was carried out within Frame Semantics. Among the results, it is possible to point out the manner in which Fillmore's theory contributes to the analysis of polysemy, presenting the different senses of a lexical unit considering different situations – or different frames – in which they appear. This article also emphasizes the pertinence of corpus linguistics and the processing of corpora as resources that allow the analysis of linguistic constructs present in the texts. It is also important to emphasize the applicability of Frame Semantics to a resource devoted to a non-specialized public, since the theory makes the contextualization of language possible through the everyday routine of the speakers.
Systematic characterizations of text similarity in full text biomedical publications.

Science.gov (United States)

Sun, Zhaohui; Errami, Mounir; Long, Tara; Renard, Chris; Choradia, Nishant; Garner, Harold

2010-09-15

Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text articles are becoming increasingly available, yet the similarities among them have not been systematically studied. Here, we quantitatively investigated the full text similarity of biomedical publications in PubMed Central. 72,011 full text articles from PubMed Central (PMC) were parsed to generate three different datasets: full texts, sections, and paragraphs. Text similarity comparisons were performed on these datasets using the text similarity algorithm eTBLAST. We measured the frequency of similar text pairs and compared it among different datasets. We found that high abstract similarity can be used to predict high full text similarity with a specificity of 20.1% (95% CI [17.3%, 23.1%]) and sensitivity of 99.999%. Abstract similarity and full text similarity have a moderate correlation (Pearson correlation coefficient: -0.423) when the similarity ratio is above 0.4. Among pairs of articles in PMC, method sections are found to be the most repetitive (frequency of similar pairs, methods: 0.029, introduction: 0.0076, results: 0.0043). In contrast, among a set of manually verified duplicate articles, results are the most repetitive sections (frequency of similar pairs, results: 0.94, methods: 0.89, introduction: 0.82). Repetition of introduction and methods sections is more likely to be committed by the same authors (odds of a highly similar pair having at least one shared author, introduction: 2.31, methods: 1.83, results: 1.03). There is also significantly more similarity in pairs of review articles than in pairs containing one review and one nonreview paper (frequency of similar pairs: 0.0167 and 0.0023, respectively). While quantifying abstract similarity is an effective approach for finding duplicate citations, a comprehensive full text analysis is necessary to uncover all potential duplicate citations in the scientific literature and is helpful when
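The sensitivity and specificity figures quoted in this abstract follow the standard definitions over a confusion matrix. The sketch below shows only those definitions; the counts are invented for illustration and are not taken from the study.

```python
def sensitivity(true_pos, false_neg):
    """Share of truly similar pairs that the predictor flags."""
    return true_pos / (true_pos + false_neg)


def specificity(true_neg, false_pos):
    """Share of truly dissimilar pairs that the predictor leaves unflagged."""
    return true_neg / (true_neg + false_pos)


# Hypothetical counts: abstract similarity used to predict full-text similarity.
sens = sensitivity(true_pos=90, false_neg=10)
spec = specificity(true_neg=20, false_pos=80)
```

A predictor with high sensitivity and low specificity, as in this made-up example, misses few similar pairs but flags many dissimilar ones, which is why a follow-up full-text comparison remains necessary.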
Aukštaitijos ir Žemaitijos paribio dainuojamojo folkloro ypatumai

OpenAIRE

Račiūnaitė-Vyčinienė, Daiva

2007-01-01

The article discusses the subject, which has remained almost unstudied by ethnomusicologists, i. e. the peculiarities of the traditional sung folklore of the region of the borderline between the Upper Lithuania and Samogitia. Most attention is dedicated to the “Samogitian” character of the peripheral Upper Lithuania. The article emphasizes that the folklore singing traditions of the peripheral Upper Lithuania (Radviliškis, Šiauliai, Pakruojis and adjacent areas) may be characterized by peculi...
Production of [Formula: see text] and [Formula: see text] in proton-proton collisions at [Formula: see text] 7 TeV.

Science.gov (United States)

Abelev, B; Adam, J; Adamová, D; Aggarwal, M M; Rinella, G Aglieri; Agnello, M; Agostinelli, A; Agrawal, N; Ahammed, Z; Ahmad, N; Ahmed, I; Ahn, S U; Ahn, S A; Aimo, I; Aiola, S; Ajaz, M; Akindinov, A; Alam, S N; Aleksandrov, D; Alessandro, B; Alexandre, D; Alici, A; Alkin, A; Alme, J; Alt, T; Altinpinar, S; Altsybeev, I; Alves Garcia Prado, C; Andrei, C; Andronic, A; Anguelov, V; Anielski, J; Antičić, T; Antinori, F; Antonioli, P; Aphecetche, L; Appelshäuser, H; Arcelli, S; Armesto, N; Arnaldi, R; Aronsson, T; Arsene, I C; Arslandok, M; Augustinus, A; Averbeck, R; Awes, T C; Azmi, M D; Bach, M; Badalà, A; Baek, Y W; Bagnasco, S; Bailhache, R; Bala, R; Baldisseri, A; Baltasar Dos Santos Pedrosa, F; Baral, R C; Barbera, R; Barile, F; Barnaföldi, G G; Barnby, L S; Barret, V; Bartke, J; Basile, M; Bastid, N; Basu, S; Bathen, B; Batigne, G; Batista Camejo, A; Batyunya, B; Batzing, P C; Baumann, C; Bearden, I G; Beck, H; Bedda, C; Behera, N K; Belikov, I; Bellini, F; Bellwied, R; Belmont-Moreno, E; Belmont, R; Belyaev, V; Bencedi, G; Beole, S; Berceanu, I; Bercuci, A; Berdnikov, Y; Berenyi, D; Berger, M E; Bertens, R A; Berzano, D; Betev, L; Bhasin, A; Bhat, I R; Bhati, A K; Bhattacharjee, B; Bhom, J; Bianchi, L; Bianchi, N; Bianchin, C; Bielčík, J; Bielčíková, J; Bilandzic, A; Bjelogrlic, S; Blanco, F; Blau, D; Blume, C; Bock, F; Bogdanov, A; Bøggild, H; Bogolyubsky, M; Böhmer, F V; Boldizsár, L; Bombara, M; Book, J; Borel, H; Borissov, A; Bossú, F; Botje, M; Botta, E; Böttger, S; Braun-Munzinger, P; Bregant, M; Breitner, T; Broker, T A; Browning, T A; Broz, M; Bruna, E; Bruno, G E; Budnikov, D; Buesching, H; Bufalino, S; Buncic, P; Busch, O; Buthelezi, Z; Caffarri, D; Cai, X; Caines, H; Calero Diaz, L; Caliva, A; Calvo Villar, E; Camerini, P; Carena, F; Carena, W; Castillo Castellanos, J; Casula, E A R; Catanescu, V; Cavicchioli, C; Ceballos Sanchez, C; Cepila, J; Cerello, P; Chang, B; Chapeland, S; Charvet, J L; Chattopadhyay, S; Chattopadhyay, S; Chelnokov, V; 
Cherney, M; Cheshkov, C; Cheynis, B; Chibante Barroso, V; Chinellato, D D; Chochula, P; Chojnacki, M; Choudhury, S; Christakoglou, P; Christensen, C H; Christiansen, P; Chujo, T; Chung, S U; Cicalo, C; Cifarelli, L; Cindolo, F; Cleymans, J; Colamaria, F; Colella, D; Collu, A; Colocci, M; Conesa Balbastre, G; Conesa Del Valle, Z; Connors, M E; Contreras, J G; Cormier, T M; Corrales Morales, Y; Cortese, P; Cortés Maldonado, I; Cosentino, M R; Costa, F; Crochet, P; Cruz Albino, R; Cuautle, E; Cunqueiro, L; Dainese, A; Dang, R; Danu, A; Das, D; Das, I; Das, K; Das, S; Dash, A; Dash, S; De, S; Delagrange, H; Deloff, A; Dénes, E; D'Erasmo, G; De Caro, A; de Cataldo, G; de Cuveland, J; De Falco, A; De Gruttola, D; De Marco, N; De Pasquale, S; de Rooij, R; Diaz Corchero, M A; Dietel, T; Dillenseger, P; Divià, R; Di Bari, D; Di Liberto, S; Di Mauro, A; Di Nezza, P; Djuvsland, Ø; Dobrin, A; Dobrowolski, T; Domenicis Gimenez, D; Dönigus, B; Dordic, O; Dørheim, S; Dubey, A K; Dubla, A; Ducroux, L; Dupieux, P; Dutta Majumdar, A K; Hilden, T E; Ehlers, R J; Elia, D; Engel, H; Erazmus, B; Erdal, H A; Eschweiler, D; Espagnon, B; Esposito, M; Estienne, M; Esumi, S; Evans, D; Evdokimov, S; Fabris, D; Faivre, J; Falchieri, D; Fantoni, A; Fasel, M; Fehlker, D; Feldkamp, L; Felea, D; Feliciello, A; Feofilov, G; Ferencei, J; Fernández Téllez, A; Ferreiro, E G; Ferretti, A; Festanti, A; Figiel, J; Figueredo, M A S; Filchagin, S; Finogeev, D; Fionda, F M; Fiore, E M; Floratos, E; Floris, M; Foertsch, S; Foka, P; Fokin, S; Fragiacomo, E; Francescon, A; Frankenfeld, U; Fuchs, U; Furget, C; Furs, A; Fusco Girard, M; Gaardhøje, J J; Gagliardi, M; Gago, A M; Gallio, M; Gangadharan, D R; Ganoti, P; Gao, C; Garabatos, C; Garcia-Solis, E; Gargiulo, C; Garishvili, I; Gerhard, J; Germain, M; Gheata, A; Gheata, M; Ghidini, B; Ghosh, P; Ghosh, S K; Gianotti, P; Giubellino, P; Gladysz-Dziadus, E; Glässel, P; Gomez Ramirez, A; González-Zamora, P; Gorbunov, S; Görlich, L; Gotovac, S; Graczykowski, L K; 
Grelli, A; Grigoras, A; Grigoras, C; Grigoriev, V; Grigoryan, A; Grigoryan, S; Grinyov, B; Grion, N; Grosse-Oetringhaus, J F; Grossiord, J-Y; Grosso, R; Guber, F; Guernane, R; Guerzoni, B; Guilbaud, M; Gulbrandsen, K; Gulkanyan, H; Gumbo, M; Gunji, T; Gupta, A; Gupta, R; Khan, K H; Haake, R; Haaland, Ø; Hadjidakis, C; Haiduc, M; Hamagaki, H; Hamar, G; Hanratty, L D; Hansen, A; Harris, J W; Hartmann, H; Harton, A; Hatzifotiadou, D; Hayashi, S; Heckel, S T; Heide, M; Helstrup, H; Herghelegiu, A; Herrera Corral, G; Hess, B A; Hetland, K F; Hippolyte, B; Hladky, J; Hristov, P; Huang, M; Humanic, T J; Hussain, N; Hussain, T; Hutter, D; Hwang, D S; Ilkaev, R; Ilkiv, I; Inaba, M; Innocenti, G M; Ionita, C; Ippolitov, M; Irfan, M; Ivanov, M; Ivanov, V; Jachołkowski, A; Jacobs, P M; Jahnke, C; Jang, H J; Janik, M A; Jayarathna, P H S Y; Jena, C; Jena, S; Jimenez Bustamante, R T; Jones, P G; Jung, H; Jusko, A; Kadyshevskiy, V; Kalinak, P; Kalweit, A; Kamin, J; Kang, J H; Kaplin, V; Kar, S; Karasu Uysal, A; Karavichev, O; Karavicheva, T; Karpechev, E; Kebschull, U; Keidel, R; Keijdener, D L D; Svn, M Keil; Khan, M M; Khan, P; Khan, S A; Khanzadeev, A; Kharlov, Y; Kileng, B; Kim, B; Kim, D W; Kim, D J; Kim, J S; Kim, M; Kim, M; Kim, S; Kim, T; Kirsch, S; Kisel, I; Kiselev, S; Kisiel, A; Kiss, G; Klay, J L; Klein, J; Klein-Bösing, C; Kluge, A; Knichel, M L; Knospe, A G; Kobdaj, C; Kofarago, M; Köhler, M K; Kollegger, T; Kolojvari, A; Kondratiev, V; Kondratyeva, N; Konevskikh, A; Kovalenko, V; Kowalski, M; Kox, S; Koyithatta Meethaleveedu, G; Kral, J; Králik, I; Kravčáková, A; Krelina, M; Kretz, M; Krivda, M; Krizek, F; Kryshen, E; Krzewicki, M; Kučera, V; Kucheriaev, Y; Kugathasan, T; Kuhn, C; Kuijer, P G; Kulakov, I; Kumar, J; Kurashvili, P; Kurepin, A; Kurepin, A B; Kuryakin, A; Kushpil, S; Kweon, M J; Kwon, Y; Ladron de Guevara, P; Lagana Fernandes, C; Lakomov, I; Langoy, R; Lara, C; Lardeux, A; Lattuca, A; La Pointe, S L; La Rocca, P; Lea, R; Leardini, L; Lee, G R; Legrand, 
I; Lehnert, J; Lemmon, R C; Lenti, V; Leogrande, E; Leoncino, M; León Monzón, I; Lévai, P; Li, S; Lien, J; Lietava, R; Lindal, S; Lindenstruth, V; Lippmann, C; Lisa, M A; Ljunggren, H M; Lodato, D F; Loenne, P I; Loggins, V R; Loginov, V; Lohner, D; Loizides, C; Lopez, X; López Torres, E; Lu, X-G; Luettig, P; Lunardon, M; Luparello, G; Ma, R; Maevskaya, A; Mager, M; Mahapatra, D P; Mahmood, S M; Maire, A; Majka, R D; Malaev, M; Maldonado Cervantes, I; Malinina, L; Mal'Kevich, D; Malzacher, P; Mamonov, A; Manceau, L; Manko, V; Manso, F; Manzari, V; Marchisone, M; Mareš, J; Margagliotti, G V; Margotti, A; Marín, A; Markert, C; Marquard, M; Martashvili, I; Martin, N A; Martinengo, P; Martínez, M I; Martínez García, G; Martin Blanco, J; Martynov, Y; Mas, A; Masciocchi, S; Masera, M; Masoni, A; Massacrier, L; Mastroserio, A; Matyja, A; Mayer, C; Mazer, J; Mazzoni, M A; Meddi, F; Menchaca-Rocha, A; Meninno, E; Mercado Pérez, J; Meres, M; Miake, Y; Mikhaylov, K; Milano, L; Milosevic, J; Mischke, A; Mishra, A N; Miśkowiec, D; Mitra, J; Mitu, C M; Mlynarz, J; Mohammadi, N; Mohanty, B; Molnar, L; Montaño Zetina, L; Montes, E; Morando, M; Moreira De Godoy, D A; Moretto, S; Morreale, A; Morsch, A; Muccifora, V; Mudnic, E; Mühlheim, D; Muhuri, S; Mukherjee, M; Müller, H; Munhoz, M G; Murray, S; Musa, L; Musinsky, J; Nandi, B K; Nania, R; Nappi, E; Nattrass, C; Nayak, K; Nayak, T K; Nazarenko, S; Nedosekin, A; Nicassio, M; Niculescu, M; Niedziela, J; Nielsen, B S; Nikolaev, S; Nikulin, S; Nikulin, V; Nilsen, B S; Noferini, F; Nomokonov, P; Nooren, G; Norman, J; Nyanin, A; Nystrand, J; Oeschler, H; Oh, S; Oh, S K; Okatan, A; Okubo, T; Olah, L; Oleniacz, J; Oliveira Da Silva, A C; Onderwaater, J; Oppedisano, C; Ortiz Velasquez, A; Oskarsson, A; Otwinowski, J; Oyama, K; Ozdemir, M; Sahoo, P; Pachmayer, Y; Pachr, M; Pagano, P; Paić, G; Pajares, C; Pal, S K; Palmeri, A; Pant, D; Papikyan, V; Pappalardo, G S; Pareek, P; Park, W J; Parmar, S; Passfeld, A; Patalakha, D I; Paticchio, V; 
Paul, B; Pawlak, T; Peitzmann, T; Pereira Da Costa, H; Pereira De Oliveira Filho, E; Peresunko, D; Pérez Lara, C E; Pesci, A; Peskov, V; Pestov, Y; Petráček, V; Petran, M; Petris, M; Petrovici, M; Petta, C; Piano, S; Pikna, M; Pillot, P; Pinazza, O; Pinsky, L; Piyarathna, D B; Płoskoń, M; Planinic, M; Pluta, J; Pochybova, S; Podesta-Lerma, P L M; Poghosyan, M G; Pohjoisaho, E H O; Polichtchouk, B; Poljak, N; Pop, A; Porteboeuf-Houssais, S; Porter, J; Potukuchi, B; Prasad, S K; Preghenella, R; Prino, F; Pruneau, C A; Pshenichnov, I; Puccio, M; Puddu, G; Pujahari, P; Punin, V; Putschke, J; Qvigstad, H; Rachevski, A; Raha, S; Rajput, S; Rak, J; Rakotozafindrabe, A; Ramello, L; Raniwala, R; Raniwala, S; Räsänen, S S; Rascanu, B T; Rathee, D; Rauf, A W; Razazi, V; Read, K F; Real, J S; Redlich, K; Reed, R J; Rehman, A; Reichelt, P; Reicher, M; Reidt, F; Renfordt, R; Reolon, A R; Reshetin, A; Rettig, F; Revol, J-P; Reygers, K; Riabov, V; Ricci, R A; Richert, T; Richter, M; Riedler, P; Riegler, W; Riggi, F; Rivetti, A; Rocco, E; Rodríguez Cahuantzi, M; Rodriguez Manso, A; Røed, K; Rogochaya, E; Rohni, S; Rohr, D; Röhrich, D; Romita, R; Ronchetti, F; Ronflette, L; Rosnet, P; Rossi, A; Roukoutakis, F; Roy, A; Roy, C; Roy, P; Rubio Montero, A J; Rui, R; Russo, R; Ryabinkin, E; Ryabov, Y; Rybicki, A; Sadovsky, S; Šafařík, K; Sahlmuller, B; Sahoo, R; Sahu, P K; Saini, J; Sakai, S; Salgado, C A; Salzwedel, J; Sambyal, S; Samsonov, V; Sanchez Castro, X; Sánchez Rodríguez, F J; Šándor, L; Sandoval, A; Sano, M; Santagati, G; Sarkar, D; Scapparone, E; Scarlassara, F; Scharenberg, R P; Schiaua, C; Schicker, R; Schmidt, C; Schmidt, H R; Schuchmann, S; Schukraft, J; Schulc, M; Schuster, T; Schutz, Y; Schwarz, K; Schweda, K; Scioli, G; Scomparin, E; Scott, R; Segato, G; Seger, J E; Sekiguchi, Y; Selyuzhenkov, I; Senosi, K; Seo, J; Serradilla, E; Sevcenco, A; Shabetai, A; Shabratova, G; Shahoyan, R; Shangaraev, A; Sharma, A; Sharma, N; Sharma, S; Shigaki, K; Shtejer, K; Sibiriak, Y; 
Siddhanta, S; Siemiarczuk, T; Silvermyr, D; Silvestre, C; Simatovic, G; Singaraju, R; Singh, R; Singha, S; Singhal, V; Sinha, B C; Sinha, T; Sitar, B; Sitta, M; Skaali, T B; Skjerdal, K; Slupecki, M; Smirnov, N; Snellings, R J M; Søgaard, C; Soltz, R; Song, J; Song, M; Soramel, F; Sorensen, S; Spacek, M; Spiriti, E; Sputowska, I; Spyropoulou-Stassinaki, M; Srivastava, B K; Stachel, J; Stan, I; Stefanek, G; Steinpreis, M; Stenlund, E; Steyn, G; Stiller, J H; Stocco, D; Stolpovskiy, M; Strmen, P; Suaide, A A P; Sugitate, T; Suire, C; Suleymanov, M; Sultanov, R; Šumbera, M; Symons, T J M; Szabo, A; Szanto de Toledo, A; Szarka, I; Szczepankiewicz, A; Szymanski, M; Takahashi, J; Tangaro, M A; Tapia Takaki, J D; Tarantola Peloni, A; Tarazona Martinez, A; Tariq, M; Tarzila, M G; Tauro, A; Tejeda Muñoz, G; Telesca, A; Terasaki, K; Terrevoli, C; Thäder, J; Thomas, D; Tieulent, R; Timmins, A R; Toia, A; Trubnikov, V; Trzaska, W H; Tsuji, T; Tumkin, A; Turrisi, R; Tveter, T S; Ullaland, K; Uras, A; Usai, G L; Vajzer, M; Vala, M; Valencia Palomo, L; Vallero, S; Vande Vyvre, P; Van Der Maarel, J; Van Hoorne, J W; van Leeuwen, M; Vargas, A; Vargyas, M; Varma, R; Vasileiou, M; Vasiliev, A; Vechernin, V; Veldhoen, M; Velure, A; Venaruzzo, M; Vercellin, E; Vergara Limón, S; Vernet, R; Verweij, M; Vickovic, L; Viesti, G; Viinikainen, J; Vilakazi, Z; Villalobos Baillie, O; Vinogradov, A; Vinogradov, L; Vinogradov, Y; Virgili, T; Vislavicius, V; Viyogi, Y P; Vodopyanov, A; Völkl, M A; Voloshin, K; Voloshin, S A; Volpe, G; von Haller, B; Vorobyev, I; Vranic, D; Vrláková, J; Vulpescu, B; Vyushin, A; Wagner, B; Wagner, J; Wagner, V; Wang, M; Wang, Y; Watanabe, D; Weber, M; Weber, S G; Wessels, J P; Westerhoff, U; Wiechula, J; Wikne, J; Wilde, M; Wilk, G; Wilkinson, J; Williams, M C S; Windelband, B; Winn, M; Yaldo, C G; Yamaguchi, Y; Yang, H; Yang, P; Yang, S; Yano, S; Yasnopolskiy, S; Yi, J; Yin, Z; Yoo, I-K; Yushmanov, I; Zaccolo, V; Zach, C; Zaman, A; Zampolli, C; Zaporozhets, S; 
Zarochentsev, A; Závada, P; Zaviyalov, N; Zbroszczyk, H; Zgura, I S; Zhalov, M; Zhang, H; Zhang, X; Zhang, Y; Zhao, C; Zhigareva, N; Zhou, D; Zhou, F; Zhou, Y; Zhuo, Zhou; Zhu, H; Zhu, J; Zhu, X; Zichichi, A; Zimmermann, A; Zimmermann, M B; Zinovjev, G; Zoccarato, Y; Zyzak, M

The production of the strange and double-strange baryon resonances ([Formula: see text], [Formula: see text]) has been measured at mid-rapidity ([Formula: see text][Formula: see text]) in proton-proton collisions at [Formula: see text] [Formula: see text] 7 TeV with the ALICE detector at the LHC. Transverse momentum spectra for inelastic collisions are compared to QCD-inspired models, which in general underpredict the data. A search for the [Formula: see text] pentaquark, decaying in the [Formula: see text] channel, has been carried out but no evidence is seen.
Dažas problēmas latviešu literārās valodas izveides procesā

Directory of Open Access Journals (Sweden)

Jānis Rozenbergs

2011-12-01

Full Text Available SOME PROBLEMS OF THE FORMATION PROCESS OF THE LATVIAN LITERARY LANGUAGE. Summary. 0. Literary language is the most complete variety of language, manifesting its functions in the best way possible and unifying the nation, as well as representing national mentality among other nations and their languages. 0.1. When speaking about the Latvian literary language and its formation, it seems useful to acknowledge that the literary language is (1) the language of the entire nation, (2) is being consciously cultivated and (3) has written form. 0.2. When dealing with the Latvian literary language formation processes, one should take into consideration (a) the specific external (sociopolitical) conditions, (b) the sources of the literary language, (c) aspects of language development and its research. 1. Development of the Latvian language has been affected by external sociopolitical factors and factors of migration of representatives of various cultural layers; these factors have both stimulated and hampered the overall formation of the language both in space and time. 2. Research of the Latvian literary language is complicated, because the most reliable proofs of this process are written texts, which in Latvian appeared only in the 16th century. Therefore both folk-lore and the spoken language can be used as sources. 2.1. From the 16th to the 19th century written texts were mainly produced by German clergymen who in the beginning (in the 16th century) had a poor knowledge of Latvian. Therefore these texts must be properly handled by differentiating the sociopolitical and philological activities of the Germans. Beginning with the 17th century a normative approach has been consciously applied to the language and thus a common variety of the language is being created by maximally keeping aloof of various patois forms. 2.2. The source of analysis of the literary language and the process of its formation is the abundant Latvian folk-lore and especially the folk-songs (dainas
Tourist souvenir of Serbia

Directory of Open Access Journals (Sweden)

Vlastelica Radomir

2002-01-01

Full Text Available If national customs and ceremonies did not comply with the deeper needs and laws of human life and society, the human forces behind them, which have lasted for centuries, could not be so strong. The depth, variety and combination of folk heritage and tourist needs were first noticed only in the 20th century. Whenever it was necessary to say something about social problems in economic-tourist presentations, the phenomenon of folklore supplied the most suitable information.
The Interethnic and Interreligious Values in Turkish and Crimean Legends

OpenAIRE

Anastasiia Zherdieva

2014-01-01

The present paper examines interethnic and interreligious values in Turkish and Crimean folk legends. The folklore of both Crimea and Turkey has a multicultural background, which makes both corpuses of texts suitable for research. In the course of the study, a wide range of published Turkish and Crimean legends were reviewed and analysed. There are two deeply-rooted tendencies in the studied legends. First of all, the interethnic and interreligious relationships can be described as ghastly an...
Fósseis: Mitos e Folclore

Directory of Open Access Journals (Sweden)

Antonio Carlos Sequeira Fernandes

2005-11-01

Full Text Available Fossils have been familiar objects to man since prehistoric times, with striking connotations in the folklore of several cultures. They were used as decorative elements in necklaces, regarded as heroes or giants in classical Greek and Roman times, interpreted as teeth and bones of dragons, used as amulets against the bites and poisons of snakes, and as medicines for the treatment of several disorders. This article describes some of these examples.
"Městské legendy" na pomezí oborů: antropologický, folkloristický nebo sociologický diskurs?

Directory of Open Access Journals (Sweden)

Petr Janeček

2009-04-01

Full Text Available Theoretical approaches to contemporary oral narratives such as "urban legends", rumour or gossip have always been afflicted by artificial gaps between various academic fields, most notably between the social sciences (social anthropology, sociology) and the humanities (folkloristics, literary science). The present article briefly summarizes some of the most interesting main theories, but also deficiencies, of contemporary folklore studies in the fields of folkloristics, literary theory, social anthropology and sociology, and calls for multidisciplinary analysis of this phenomenon.
Megalourethra as a rare cause for erectile dysfunction

Directory of Open Access Journals (Sweden)

Robert Pallas, MD, Bch

2015-01-01

Full Text Available MRI findings of megalourethra have not previously been reported. We present a case of an adult presenting with lifelong erectile dysfunction secondary to poor development of the corpus spongiosum and corpora cavernosa. The pathogenesis, typical presentation, and treatment of megalourethra, as well as the use of modern imaging techniques to aid in the diagnosis and treatment of this disease are discussed.
Magnetic Resonance Imaging: An accurate diagnostic tool in the precise localization of penile fracture

Directory of Open Access Journals (Sweden)

Mujeeb M Rahiman

2013-01-01

Full Text Available An 18-year-old male presented with history and clinical findings suggestive of penile fracture. An MRI demonstrated disruption of the tunica albuginea and corpora cavernosa on the left dorso-lateral aspect, mid-shaft of penis with adjacent hematoma, and subcutaneous edema. At surgery, imaging findings were found to be accurate, and the penis was successfully repaired with minimal postoperative morbidity.
Speech and Language and Language Translation (SALT)

Science.gov (United States)

2012-12-01

Humayoun was conducted, along with a review of proposed Pashto rules as described in academic papers. In particular, Zuhra and Khan 2009 [11...French paraphrases, using an external Berkeley parser trained for French. Paraphrasing and Plagiarism : A review was made of literature on the use of...paraphrasing and comparable text, and of literature on the related field of plagiarism detection. Metrics and corpora for plagiarism detection were
Pertumbuhan Prenatal dalam Kandungan Kambing Melalui Superovulasi

Directory of Open Access Journals (Sweden)

ADRIANI

2007-06-01

Full Text Available Thirty-six Etawah-grade does (BW 20.4-44.2 kg, age 2.5-7 years) were used to study the efficacy of increasing secretion of endogenous hormones of pregnancy by superovulation of does to stimulate prenatal growth in the uterus. The does were injected with pregnant mare serum gonadotrophin (PMSG) at 0 IU/kg BW [grouped into nonsuperovulation-NSO] or 15 IU/kg BW [grouped into Superovulation-SO]. An intravaginal sponge (60 mg medroxyprogesterone acetate) was applied for 14 days to synchronize the estrus cycle. Twenty-four hours prior to sponge removal, PMSG was injected to stimulate superovulation. After sponge removal, five experimental does were mixed with one buck for natural mating. Superovulation prior to mating increased the number of corpora lutea, mean maternal serum estradiol concentration, progesterone concentration, litter size, average birth weight and average milk yield by 112, 67, 42, 27, 32, and 35%, respectively. Those were correlated with the increase of uterine, corpora lutea, and individual birth weight.
An Analysis of The Oxford Guide to Practical Lexicography (Atkins and Rundell 2008)

Directory of Open Access Journals (Sweden)

Gilles-Maurice de Schryver

2011-10-01

Full Text Available
Abstract: For at least a decade, the lexicographic community at large has been demanding that a modern textbook be designed, one that would place corpora at the centre of the lexicographic enterprise. Written by two of the most respected practising lexicographers, this book has finally arrived, and delivers on very many levels. This review article presents a critical analysis of its features.
Keywords: LEXICOGRAPHY, LEARNERS' DICTIONARY, MONOLINGUAL, BILINGUAL, CORPUS, FRAME SEMANTICS, ENGLISH, FRENCH, TEXTBOOK
Summary (Dutch, translated): An analysis of The Oxford Guide to Practical Lexicography (Atkins and Rundell 2008). For at least ten years the entire lexicographic community has been demanding that a modern textbook be designed, one that would place corpora at the centre of lexicographic attention. Written by two of the most respected practising lexicographers, this book has now finally arrived, and it does not disappoint. This review article critically analyses its features.
Keywords (Dutch, translated): LEXICOGRAPHY, LEARNERS' DICTIONARY, EXPLANATORY (MONOLINGUAL), TRANSLATING (BILINGUAL), CORPUS, FRAME SEMANTICS, ENGLISH, FRENCH, TEXTBOOK

Web Resources and Tools for Slovenian with a Focus on the Slovenian-English Language Infrastructure: Dictionaries in the Digital Age

Directory of Open Access Journals (Sweden)

Mojca Šorli

2017-12-01

Full Text Available The article begins with a presentation of a selection of electronic monolingual and bi/multilingual lexicographic resources and corpora available today to contemporary users of Slovene. The focus is on works combined with English and designed for translation purposes which provide information on the meaning of words and wider lexical units, i.e., e-dictionaries, lexical databases, web translation tools and various corpora. In a separate sub-section the most common translation technologies are presented, together with an evaluation of their role in the modern translation process. Sections 2 and 3 provide a brief outline of the changes that have affected classical dictionary planning, compilation and use in the new digital environment, as well as of the relationship between dictionaries and related resources, such as lexical databases. Some stereotypes regarding dictionary use are identified and, in conclusion, the existing corpus-based databases for the Slovenian-English pair are presented, with a view to determining priorities for the future interlingual infrastructure action plans in Slovenia.
Using Artificial Intelligence Techniques to Implement a Multifactor Authentication System

Directory of Open Access Journals (Sweden)

Jackson Phiri

2011-08-01

Full Text Available The recent years have seen a rise in the number of cases of cyber-crime committed through identity theft and fraud. To address this problem, this paper uses adaptive neural-fuzzy inference system, fuzzy logic and artificial neural network to implement a multifactor authentication system through a technique of information fusion. To begin with, the identity attributes are mined using the three corpora from three major sources namely the social networks, a set of questionnaires and application forms from the various services offered both in the real and cyberspace. The statistical information generated by the corpora is then used to compose an identity attribute metric model. The composed identity attributes metrics values classified as biometrics, device metrics and pseudo metrics are then fused at the score level through a technique of information fusion in a multifactor authentication system by using each of the above artificial intelligence technologies and the results compared.
An insight into Twitter: a corpus based contrastive study in English and Spanish.

Directory of Open Access Journals (Sweden)

Irina Argüelles Álvarez

2012-07-01

Full Text Available The aim of this paper is to study the use of Spanish and English in the micro-blogging social network Twitter from a contrastive point of view. A quantitative research methodology is applied in order firstly, to identify specific common characteristics of language, organization and content in the medium and secondly, to find eventual differences in the use of a particular language. To carry out the experiment, two corpora were constructed using language data from Twitter, one in Spanish with a total number of 4,027,746 words and another with similar characteristics in English with a total number of 4,655,992 words. From the results obtained, the conclusion is that there are a number of very general discourse and organizational features common to the two corpora under study. It is also concluded that there are some particular characteristics which differentiate the use of English and Spanish in the medium.
Finding Translation Examples for Under-Resourced Language Pairs or for Narrow Domains; the Case for Machine Translation

Directory of Open Access Journals (Sweden)

Dan Tufis

2012-07-01

Full Text Available Cyberspace is populated with valuable information sources, expressed in about 1500 different languages and dialects. Yet, for the vast majority of Web surfers this wealth of information is practically inaccessible or meaningless. Recent advancements in cross-lingual information retrieval, multilingual summarization, cross-lingual question answering and machine translation promise to narrow the linguistic gaps and lower the communication barriers between humans and/or software agents. Most of these language technologies are based on statistical machine learning techniques which require large volumes of cross-lingual data. The most adequate type of cross-lingual data is represented by parallel corpora, collections of reciprocal translations. However, it is not easy to find enough parallel data for any language pair that might be of interest. When the required parallel data refers to specialized (narrow) domains, the scarcity of data becomes even more acute. Intelligent information extraction techniques from comparable corpora provide one of the possible answers to this lack of translation data.
On the development of a tagset for Northern Sotho with special reference to the issue of standardisation

Directory of Open Access Journals (Sweden)

E. Taljard

2008-07-01

Full Text Available Working with corpora in the South African Bantu languages has up till now been limited to the utilisation of raw corpora. Such corpora, however, have limited functionality. Thus the next logical step in any NLP application is the development of software for automatic tagging of electronic texts. The development of a tagset is one of the first steps in corpus annotation. The authors of this article argue that the design of a tagset cannot be isolated from the purpose of the tagset, or from the place of the tagset and its design within the bigger picture of the architecture of corpus annotation. Usage-related aspects therefore feature prominently in the design of the tagset for Northern Sotho. It is explained why the proposed tagset is biased towards human readability, rather than machine readability; the choice of a stochastic tagger is motivated, and the relationship between tokenising, tagging, morphological analysis and parsing is discussed. In order to account at least to some extent for the morphological complexity of Northern Sotho at the tagging level, a multilevel annotation is opted for: the first level comprising obligatory information and the second optional and recommended information. Finally, aspects of standardisation are considered against the background of reuse, of sharing of resources, and of possible adaptation for use by other disjunctively written South African Bantu languages. It is not the aim of this article to evaluate the results of any tagging procedure using the proposed tagset. It only describes the design and motivates the choices made with regard to the tagset design. However, an evaluation is in process and results will be published in the near future (cf. Faaß et al., s.a.).
Texting Styles and Information Change of SMS Text Messages in Filipino

Science.gov (United States)

Cabatbat, Josephine Jill T.; Tapang, Giovanni A.

2013-02-01

We identify the different styles of texting in Filipino short message service (SMS) texts and analyze the change in unigram and bigram frequencies due to these styles. Style preference vectors for sample texts were calculated and used to identify the style combination used by an average individual. The change in Shannon entropy of the SMS text is explained in light of a coding process.
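The entropy comparison described in this abstract can be illustrated with a minimal sketch. It assumes character-level unigrams and bigrams (the abstract does not say whether characters or words were counted), and the function names and the shortened sample message are hypothetical, not taken from the paper.

```python
import math
from collections import Counter


def ngram_counts(text, n):
    """Count character n-grams in text (n=1 for unigrams, n=2 for bigrams)."""
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def shannon_entropy(counts):
    """Shannon entropy, in bits, of the empirical distribution given by counts."""
    total = sum(counts.values())
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


# Hypothetical example pair: a plain message and a shortened "texting style" form.
plain = "see you later"
texted = "c u l8r"

for label, message in (("plain", plain), ("texted", texted)):
    h1 = shannon_entropy(ngram_counts(message, 1))
    h2 = shannon_entropy(ngram_counts(message, 2))
    print(f"{label}: unigram entropy {h1:.3f} bits, bigram entropy {h2:.3f} bits")
```

Comparing the two entropies for the same message before and after a style is applied gives a rough picture of the kind of change the abstract refers to; a real study would estimate the frequencies from a large corpus rather than a single message.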
Conflictos identitarios del Pueblo Negro en el Valle del Chota-Provincia de Imbabura: entre la folklorización y la descolonización.

OpenAIRE

León Bernardo, Alexandra Natali

2017-01-01

Chota-Imbabura between the folklorization and the decolonization have been the principal motivation to be able to realize this investigation, since it is necessary to recognize colonial processes that the people should identify as such and since inside his context such certain conceptualizations are re-meant as: identity, culture, cultural inheritance, African decent black like point of item the decolonization and the folklorization. In such a way that the black people of the Valley of the Ch...
Relational Data Modelling of Textual Corpora: The Skaldic Project and its Extensions

DEFF Research Database (Denmark)

Wills, Tarrin Jon

2015-01-01

Skaldic poetry is a highly complex textual phenomenon both in terms of the intricacy of the poetry and its contextual environment. Extensible Markup Language (XML) applications such as that of the Text Encoding Initiative provide a means of semantic representation of some of these complexities. XML...
Collecting and evaluating speech recognition corpora for 11 South African languages

CSIR Research Space (South Africa)

Badenhorst, J

2011-08-01

Full Text Available . In addition, speech-based access to information may empower illiterate or semi-literate people, 98% of whom live in the developing world. SDSs can play a useful role in a wide range of applications. Of particular importance in Africa are applications... speech (i.e. appropriate for the recognition task in terms of the language used, the profile of the speakers, speaking style, etc.) This speech generally needs to be curated and transcribed prior to the development of ASR systems, and for most...
Computing meaning v.4

CERN Document Server

Bunt, Harry; Pulman, Stephen

2013-01-01

This book is a collection of papers by leading researchers in computational semantics. It presents a state-of-the-art overview of recent and current research in computational semantics, including descriptions of new methods for constructing and improving resources for semantic computation, such as WordNet, VerbNet, and semantically annotated corpora. It also presents new statistical methods in semantic computation, such as the application of distributional semantics in the compositional calculation of sentence meanings. Computing the meaning of sentences, texts, and spoken or texted dialogue i
Developing a broadband automatic speech recognition system for Afrikaans

CSIR Research Space (South Africa)

De Wet, Febe

2011-08-01

Full Text Available baseline transcription for the news data. The match between a baseline transcription and its corresponding audio can be evaluated automatically using an ASR system in forced alignment mode. Only those bulletins for which a bad match is indicated... Component Index for data [3]. occurrence of Afrikaans words3. Other text corpora that are currently under construction include daily downloads of the scripts of news bulletins that are read on an Afrikaans radio station as well as transcripts of par...
English Metafunction Analysis in Chemistry Text: Characterization of Scientific Text

Directory of Open Access Journals (Sweden)

Ahmad Amin Dalimunte, M.Hum

2013-09-01

Full Text Available The objectives of this research are to identify what Metafunctions are applied in chemistry text and how they characterize a scientific text. It was conducted by applying content analysis. The data for this research was a twelve-paragraph chemistry text. The data were collected by applying a documentary technique. The document was read and analyzed to find out the Metafunction. The data were analyzed by the following procedures: identifying the types of process, counting the number of processes, categorizing and counting the cohesion devices, classifying the types of modulation and determining modality value, and finally counting the number of sentences and clauses, then scoring the grammatical intricacy index. The findings of the research show that Material process (71 of 100) is mostly used, and circumstance of spatial location (26 of 56) is more dominant than the others. Modality (5) is less used in order to avoid subjectivity. Impersonality is implied through less use of reference, either pronouns (7) or demonstratives (7); conjunctions (60) are applied to develop ideas; and the total number of clauses (109) is much greater than the total number of sentences (40), which results in a high grammatical intricacy index. The Metafunctions found indicate that the chemistry text has fulfilled the characteristics of scientific or academic text which truly reflects it as a natural science.
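As a worked illustration of the counts reported in this abstract, the sketch below computes a grammatical intricacy index under the assumption that the index is the ratio of clauses to sentences (a common definition in systemic functional linguistics; the abstract itself does not state the formula). The function name is hypothetical.

```python
def grammatical_intricacy(clauses, sentences):
    """Ratio of clauses to sentences (assumed definition of the index)."""
    if sentences <= 0:
        raise ValueError("sentences must be a positive count")
    return clauses / sentences


# Counts reported in the abstract: 109 clauses in 40 sentences.
index = grammatical_intricacy(109, 40)
# Share of Material processes among all processes: 71 of 100.
material_share = 71 / 100
# Share of spatial-location circumstances: 26 of 56.
spatial_share = 26 / 56

print(f"intricacy index: {index:.3f}")
print(f"material processes: {material_share:.0%}, spatial location: {spatial_share:.1%}")
```

Under this assumed definition the reported counts give roughly 2.7 clauses per sentence, which is the sense in which the abstract calls the index high.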
“Girls Text Really Weird”: Gender, Texting and Identity Among Teens

DEFF Research Database (Denmark)

Ling, Richard; Baron, Naomi; Lenhart, Amanda

2014-01-01

This article examines the strategies used by teenagers for interacting with members of the opposite sex when texting. This article uses material from a series of nine focus groups from 2009 in four US cities. It reports on the strategies they use and the problems they encounter as they negotiate...... this portion of their lives. Texting is a direct, person-to-person venue where they can develop their gendered identity and also investigate romantic interaction. In this activity, both genders show the ability to make fine-grained interpretations of texts, often interpreting the meaning of punctuation...... and other paralinguistic devices. In addition, they use texts to characterize the opposite sex. Teen boys' texts are seen as short and perhaps brisk when viewed by girls. Boys see teen girls' texts as being overly long, prying and containing unneeded elements. The discussion of these practices shows how...
The Shona Corpus and the Problem of Tagging | Chabata | Lexikos

African Journals Online (AJOL)

An analysis of the problems that most corpus builders face shows that more problems are likely to be encountered when dealing with spoken corpora than with written corpora. The paper demonstrates that tagging is an important component of corpus building as it makes it easier for a researcher to extract relevant data.
Evaluating Bilingual and Monolingual Dictionaries for L2 Learners.

Science.gov (United States)

Hunt, Alan

1997-01-01

A discussion of dictionaries and their use for second language (L2) learning suggests that lack of computerized modern language corpora can adversely affect bilingual dictionaries, commonly used by L2 learners, and shows how use of such corpora has benefitted two contemporary monolingual L2 learner dictionaries (1995 editions of the Longman…
Learning from Learners: A Non-Standard Direct Approach to the Teaching of Writing Skills in EFL in a University Context

Science.gov (United States)

Fuster-Márquez, Miguel; Gregori-Signes, Carmen

2018-01-01

Corpora have been used in English as a foreign language materials for decades, and native corpora have been present in the classroom by means of direct approaches such as Data-Driven Learning (Johns, T., and P. King 1991. "'Should you be Persuaded'- Two Samples of Data-Driven Learning Materials." In "Classroom Concordancing,"…
Menstruation in Ulysses.

Science.gov (United States)

Mullin, Katherine

2008-01-01

This article investigates James Joyce's fascination with a wide variety of medical texts, sexual folklores, religious beliefs, and persistent superstitions about menstruation. That fascination finds its way into Ulysses, which draws upon a number of intertexts to inform a curiosity about the female body most strikingly articulated by Bloom, Molly, and Gerty MacDowell. These intertexts are not simply imported into the novel but are dismantled and interrogated, as Joyce exposes, rather than endorses, clichés of essential femininity.
Problems and progress in nature conservation in Rhodesia

Directory of Open Access Journals (Sweden)

G.F.T. Child

1977-12-01

Full Text Available The conflicting emotions generated around the aesthetic qualities of wildlife and its pragmatic use as a resource are a feature of human societies stretching into antiquity. On the one hand it has been, and remains, the subject of much folklore and art in societies extending from the Stone Age to the Technological Age. On the other, hunting for the necessities of life, and more recently for recreation, goes very deep into the history of the human race.
Sonidos de un Chile profundo: Hacia un análisis crítico del Archivo Sonoro de Música Tradicional Chilena en relación a la conformación del folclore en Chile Sounds from the Depth of Chile: Toward a Critical Analysis of the Sound Archive of Chilean Traditional Music as regards the establishing of Folklore in Chile

Directory of Open Access Journals (Sweden)

Mariana León Villagra

2011-06-01

Full Text Available The process of rescuing from destruction the patrimonial legacy contained in the Sound Archive of Chilean Traditional Music serves as a basis for a critical review of the concepts of identity and patrimony within the current situation of traditional and local musics in the worldwide context of digital technologies. According to this perspective, it is necessary to review the history of the Chilean cultural development of the 40's and 50's, focusing on the construction of a national identity based on the concept of folklore. If the value of these traditional musics and sounds is brought to light it is possible to strengthen the presence of local identities in the Chilean culture, thus emphasizing their importance for cultural state policies aiming to be truly democratic.
Analyzing Idioms and Their Frequency in Three Advanced ILI Textbooks: A Corpus-Based Study

Science.gov (United States)

Alavi, Sepideh; Rajabpoor, Aboozar

2015-01-01

The present study aimed at identifying and quantifying the idioms used in three ILI "Advanced" level textbooks based on three different English corpora (MICASE, BNC and the Brown Corpus), and comparing the frequencies of the idioms across the three corpora. The first step of the study involved searching the books to find multi-word…

Anterior Urethral Advancement in Repair of Hypospadias: A ...

African Journals Online (AJOL)

xp

meticulous dissection was performed to free the two penile skin flaps from the spongy urethra which was then dissected and mobilized from the groove formed by the two corpora cavernosa of the penis starting at the midpenile area. Special care should be taken during the dissection to avoid injury to corpora cavernosa, that ...
[Erectile function and ablative surgery of penile tumors].

Science.gov (United States)

Pisani, E; Austoni, E; Trinchieri, A; Ceresoli, A; Mantovani, F; Colombo, F; Mastromarino, G; Vecchio, D; Canclini, L; Fenice, O

1994-02-01

The Authors try to show that radical excision can be combined with minimal invasiveness in the surgery of penile cancer. The focal point of every therapeutic decision is correct clinical staging. Unfortunately there is some confusion in the two international staging systems (TNM and Jackson's classification). In fact, the anatomical difference between epithelioma of the glans infiltrating the corpus spongiosum and subcoronary epithelioma of the shaft infiltrating the corpora cavernosa is not clear. It is obvious that infiltration of the corpora cavernosa is a far more aggressive oncological manifestation than a tumour infiltrating the corpus spongiosum. So we consider Jackson's classification more congenial. In terms of surgery this anatomical independence makes it easy to consider the corpora cavernosa as a distinct entity, so they remain perfectly functional when separated from the glandulo-spongio-urethral unit with its vasculo-nervous bundle. This makes conservation of the erectile function possible when clinical staging shows that the tumour is not infiltrating the corpora cavernosa. The Authors show their results, which seem to be rather good.
L2 write assistants and context-aware dictionaries: New challenges to lexicography

DEFF Research Database (Denmark)

Tarp, Sven; Fisker, Kasper; Sepstrup, Peter

2017-01-01

Dictionaries are increasingly integrated into other tools designed to assist the reading, writing and translation of texts. Write Assistant is a newly developed tool aimed at assisting people writing in a second language. It feeds on big data taken in from corpora and digital dictionaries...... dictionaries need to be conceptually adapted to the specific tool in order to optimize the service. All this poses new challenges to lexicography....
INVECTIVES AS ANTHROP METAPHORS (ON THE EXAMPLE OF THE LEXEME "CUNT")

Directory of Open Access Journals (Sweden)

GOLODNAYA V.N.

2015-01-01

Full Text Available The article is devoted to some formats and contexts of using the word "cunt" as an anthrop metaphor within the corpora approach. The metaphor gradation takes place in the framework of a binary opposition "We/Ours" - "They/Others", in which target group referents are presented in a negative/positive way. The anthrop metaphor "cunt" is hypothesized to appear as a result of its emotive meaning's reconsideration.
Different Senses of Entropy—Implications for Education

Directory of Open Access Journals (Sweden)

Helge Strömdahl

2010-03-01

Full Text Available A challenge in the teaching of entropy is that the word has several different senses, which may provide an obstacle for communication. This study identifies five distinct senses of the word ‘entropy’, using the Principled Polysemy approach from the field of linguistics. A semantic network is developed of how the senses are related, using text excerpts from dictionaries, textbooks and text corpora. Educational challenges such as the existence of several formal senses of entropy and the intermediary position of entropy as disorder along the formal/non-formal scale are presented using a two-Dimensional Semiotic/semantic Analysing Schema (2-D SAS).
POLTERGEIST PHENOMENA IN CONTEMPORARY FOLKLORE

OpenAIRE

Oana VOICHICI

2017-01-01

The article deals with instances of the supernatural in Romanian urban legends, namely what we call the strigoi, or poltergeist. Usually, folklorists tend to exclude the supernatural from the category of urban legends; however, we have decided to take these accounts into consideration based on the fact that the transmitters, the narrators, do not distinguish between these elements and the rest of contemporary legends, and today's popular culture abounds in such accounts.
Discovering Folklore Through Community Resources.

Science.gov (United States)

Sumpter, Magdalena Benavides, Ed.

The folkways and cultural heritage of the Mexican Americans of South Texas are explored in this volume which is designed to provide the student with the opportunity for cultural enrichment, oral language development, and vocabulary expansion. The first chapter deals with "Creencias" which are common beliefs handed down from generation to…
Computational text analysis and reading comprehension exam complexity towards automatic text classification

CERN Document Server

Liontou, Trisevgeni

2014-01-01

This book delineates a range of linguistic features that characterise the reading texts used at the B2 (Independent User) and C1 (Proficient User) levels of the Greek State Certificate of English Language Proficiency exams in order to help define text difficulty per level of competence. In addition, it examines whether specific reader variables influence test takers' perceptions of reading comprehension difficulty. The end product is a Text Classification Profile per level of competence and a formula for automatically estimating text difficulty and assigning levels to texts consistently and re
Quantification of total tannin, flavonoid contents and pharmacognostic study of the seeds of Swietenia mahagoni (Linn.)

Directory of Open Access Journals (Sweden)

Tasmia Rahman

2016-11-01

Full Text Available Objective: To justify the folkloric use of Swietenia mahagoni (S. mahagoni) seeds, the 90% ethanolic extract and its aqueous and organic partitioned fractions were evaluated for their possible antidiarrhoeal and antimicrobial potential in vivo. Methods: The crude ethanolic extract of S. mahagoni seeds was partitioned into fractions using solvents of different polarity. Antimicrobial and antidiarrheal activities were evaluated and the outcomes were subsequently compared with conventional standard drugs. Results: The antidiarrheal activity was assessed using a mouse model, where the unfractionated ethanolic extract significantly reduced the number, onset, rate and weight of diarrheal episodes. This extract showed a limitation in the number of defecation episodes of 27.0% and 40.9% at doses of 200 mg/kg and 400 mg/kg body weight respectively, and the reference drug, loperamide, showed 53% at a dose of 50 mg/kg. All extract fractions exhibited significant potential to kill or suppress the growth of known Gram-positive and Gram-negative bacteria. Conclusions: The ethanolic extract and its aqueous and organic fractions revealed that the seeds of S. mahagoni (Linn.) have the potential to be used as a remedy for diarrhea and known pathogenic microbes, which supports the folkloric use of the seeds of S. mahagoni.
Sequence spaces [Formula: see text] and [Formula: see text] with application in clustering.

Science.gov (United States)

Khan, Mohd Shoaib; Alamri, Badriah As; Mursaleen, M; Lohani, Qm Danish

2017-01-01

Distance measures play a central role in the development of clustering techniques. Owing to the rich mathematical background and natural implementation of [Formula: see text] distance measures, researchers have been motivated to use them in almost every clustering process. Besides [Formula: see text] distance measures, several other distance measures exist. Sargent introduced a special type of distance measures, [Formula: see text] and [Formula: see text], which is closely related to [Formula: see text]. In this paper, we generalize the Sargent sequence spaces by introducing the [Formula: see text] and [Formula: see text] sequence spaces. Moreover, it is shown that both spaces are BK-spaces, and that one is the dual of the other. Further, we cluster the two-moon dataset using an induced [Formula: see text]-distance measure (induced by the Sargent sequence space [Formula: see text]) in the k-means clustering algorithm. The clustering result establishes the efficacy of replacing the Euclidean distance measure with the [Formula: see text]-distance measure in the k-means algorithm.
Vocabulary Constraint on Texts

Directory of Open Access Journals (Sweden)

C. Sutarsyah

2008-01-01

Full Text Available This case study was carried out in the English Education Department of State University of Malang. The aim of the study was to identify and describe the vocabulary in the reading text and to determine whether the text is useful for reading skill development. A descriptive qualitative design was applied to obtain the data. For this purpose, some available computer programs were used to describe the vocabulary in the texts. It was found that the 20 texts, containing 7,945 words, are dominated by low-frequency words, which account for 16.97% of the words in the texts. The high-frequency words occurring in the texts were dominated by function words. In terms of word levels, the texts have a very limited number of words from the GSL (General Service List of English Words; West, 1953). The first 1,000 words of the GSL account for only 44.6%. The data also show that the texts contain too large a proportion of words that are not in the three levels (the first 2,000 words and the UWL). These words account for 26.44% of the running words in the texts. It is believed that the constraints are due to the selection of the texts, which consist of a series of short, unrelated texts. This kind of text is prone to accumulating low-frequency words, especially content words, and to containing a limited number of words from the GSL. It could also hinder the development of students' reading skills and vocabulary enrichment.
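The coverage figures in this abstract (for example, the share of running words that fall inside a given word list) can be illustrated with a short computation. The sketch below is only an assumption about how such a figure might be obtained: the tokeniser, the function name `coverage`, and the miniature word list are hypothetical stand-ins, not the programs or the GSL/UWL lists used in the study.

```python
import re


def coverage(text, word_list):
    """Return the fraction of running words (tokens) in `text` found in `word_list`."""
    # Simplistic tokeniser (an assumption): lowercase alphabetic runs only.
    tokens = re.findall(r"[a-z]+", text.lower())
    if not tokens:
        return 0.0
    known = sum(1 for token in tokens if token in word_list)
    return known / len(tokens)


# Hypothetical miniature word list standing in for a real frequency list.
toy_list = {"the", "a", "of", "is", "cat", "on"}
sample = "The cat is on the mat of the hall"

share = coverage(sample, toy_list)
print(f"{share:.1%} of running words are in the list")
```

The same function could be called once per frequency band to obtain a per-level profile; words matched by none of the lists would make up the off-list proportion.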
Stemming Malay Text and Its Application in Automatic Text Categorization

Science.gov (United States)

Yasukawa, Michiko; Lim, Hui Tian; Yokoo, Hidetoshi

In the Malay language, there are no conjugations or declensions, and affixes have important grammatical functions. In Malay, the same word may function as a noun, an adjective, an adverb, or a verb, depending on its position in the sentence. Although simple root words are used extensively in informal conversations, it is essential to use the precise words in formal speech or written texts. In Malay, derivative words are used to make sentences clear. Derivation is achieved mainly by the use of affixes. There are approximately a hundred possible derivative forms of a root word in the written language of educated Malay speakers. Therefore, the composition of Malay words may be complicated. Although several types of stemming algorithms are available for text processing in English and some other languages, they cannot be used to overcome the difficulties in Malay word stemming. Stemming is the process of reducing various words to their root forms in order to improve the effectiveness of text processing in information systems. It is essential to avoid both over-stemming and under-stemming errors. We have developed a new Malay stemmer (stemming algorithm) for removing inflectional and derivational affixes. Our stemmer uses a set of affix rules and two types of dictionaries: a root-word dictionary and a derivative-word dictionary. The set of rules is aimed at reducing the occurrence of under-stemming errors, while the dictionaries are believed to reduce the occurrence of over-stemming errors. We performed an experiment to evaluate the application of our stemmer in text mining software. For the experiment, the text data used were actual web pages collected from the World Wide Web to demonstrate the effectiveness of our Malay stemming algorithm. The experimental results showed that our stemmer can effectively increase the precision of the extracted Boolean expressions for text categorization.
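The combination of affix rules with a root-word dictionary and a derivative-word dictionary can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' stemmer: the affix lists, the lookup order, and the tiny dictionaries are hypothetical and far smaller than anything usable in practice.

```python
# Hypothetical, heavily reduced affix rules (assumptions for illustration).
PREFIXES = ("mem", "men", "me", "ber", "pe", "di")
SUFFIXES = ("kan", "an", "i")


def stem(word, roots, derivatives):
    """Reduce `word` to a root using dictionaries first, then affix rules."""
    word = word.lower()
    # Root-word dictionary: a known root is never stripped (guards against over-stemming).
    if word in roots:
        return word
    # Derivative-word dictionary: irregular forms map directly to their root.
    if word in derivatives:
        return derivatives[word]
    # Affix rules: try every prefix/suffix combination, keep results that are known roots.
    candidates = []
    for prefix in ("",) + PREFIXES:
        if not word.startswith(prefix):
            continue
        rest = word[len(prefix):]
        for suffix in ("",) + SUFFIXES:
            if suffix and not rest.endswith(suffix):
                continue
            candidate = rest[: len(rest) - len(suffix)] if suffix else rest
            if candidate in roots:
                candidates.append(candidate)
    # Prefer the longest validated root; leave unknown words unchanged.
    return max(candidates, key=len) if candidates else word


roots = {"baca", "makan", "berat", "ajar"}
derivatives = {"belajar": "ajar"}

for w in ("membaca", "bacaan", "dimakan", "berat", "belajar"):
    print(w, "->", stem(w, roots, derivatives))
```

In this sketch, "berat" is returned unchanged because it is itself a listed root, even though it begins with the letters of the prefix "ber"; that is the kind of over-stemming error a root-word dictionary is meant to prevent.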
Dangers of the vagina.

Science.gov (United States)

Beit-Hallahmi, B

1985-12-01

Beliefs, myths, and literary expressions of men's fear of female genitals are reviewed. Both clinical evidence and folklore provide evidence that men imagine female genitals not only as a source of pleasure and attraction, but also as a source of danger in a very physical sense. The vagina dentata myth has many versions, including some modern ones, and its message is always the same: an awesome danger emanating from a woman's body. The prevalence of such feelings in folklore and in literature is noted.
Criteria of ‘authenticity’ in traditional Georgian musical performance

Directory of Open Access Journals (Sweden)

Gabisonia Tamaz

2014-01-01

Full Text Available Today we often use the term ‘authentic’ in relation to different manifestations of Georgian folk music. Along with the unambiguous meaning ‘real’, this term also has other meanings: ‘ethnic’, ‘rural’, ‘old’, ‘function of usual environment’, ‘traditional-stylistic’, ‘authoritative’, or ‘reproductive’. In spite of some interconnections that arise from the term ‘authentic’ and its other meanings, the most relevant way to apply this popular term for performers and audiences of ‘real folklore’ is traditionality. This factor is manifested in the following contexts: (a) performer (receiver and distributor of tradition, unobtrusively and orally); (b) motivation/function (representative and spontaneous function; hereditary, utilitarian and aesthetic-daily motivation); (c) repertoire (compliance of the sample of musical and verbal text with its social function, eluding canonized versions); (d) expression (adequate articulation, performing regulation which is not determined by the stage, traditional instrument, etc.). The problem of authenticity is more successfully regulated in traditional Georgian church music than in folk music. For the latter, a special difficulty in this regard is the identification of modern trends that contain folk motifs. The most popular among them is distinctive, with its stylistic reminiscent layer from the Eastern Georgian Mountains, which we refer to as ‘para-folklore’. Notwithstanding the fact that Georgian folklore is not centrally authorized, modernization of folklore samples and also those manifestations of post-folklore that are further away from the traditional motifs attract a wide range of listeners. Essentially, the meaning of ‘authentic’ in the Georgian ethno-musical context is presented as performance of the traditional rural repertoire with traditional articulation. However, we think that it is convenient for the criteria of traditional, usual environment to be added to this
Automated Analysis of Corpora Callosa

DEFF Research Database (Denmark)

Stegmann, Mikkel Bille; Davies, Rhodri H.

2003-01-01

This report describes and evaluates the steps needed to perform modern model-based interpretation of the corpus callosum in MRI. The process is discussed from the initial landmark-free contours to full-fledged statistical models based on the Active Appearance Models framework. Topics treated include landmark placement, background modelling and multi-resolution analysis. Preliminary quantitative and qualitative validation in a cross-sectional study shows that fully automated analysis and segmentation of the corpus callosum are feasible.
Linguistic Corpora and Language Teaching.

Science.gov (United States)

Murison-Bowie, Simon

1996-01-01

Examines issues raised by corpus linguistics concerning the description of language. The article argues that it is necessary to start from correct descriptions of linguistic units and the contexts in which they occur. Corpus linguistics has joined with language teaching by sharing a recognition of the importance of a larger, schematic view of…
XML and Free Text.

Science.gov (United States)

Riggs, Ken Roger

2002-01-01

Discusses problems with marking free text, text that is either natural language or semigrammatical but unstructured, that prevent well-formed XML from marking text for readily available meaning. Proposes a solution to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML. (Author/LRW)
Text File Comparator

Science.gov (United States)

Kotler, R. S.

1983-01-01

The File Comparator program, IFCOMP, is a text file comparator for IBM OS/VS-compatible systems. IFCOMP accepts two text files as input and produces a listing of their differences in pseudo-update form. IFCOMP is very useful for monitoring changes made to software at the source code level.
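The general idea of comparing two text files and listing their differences can be sketched with Python's standard `difflib` module. This is only an illustration of the technique: IFCOMP's pseudo-update output format is not described here, so the unified diff format below is a stand-in, and the function name `compare_texts` is hypothetical.

```python
import difflib


def compare_texts(old_text, new_text, old_name="old", new_name="new"):
    """Return the differences between two texts as a list of unified-diff lines."""
    old_lines = old_text.splitlines()
    new_lines = new_text.splitlines()
    # lineterm="" keeps the header lines free of trailing newlines,
    # matching the newline-free input lines produced by splitlines().
    return list(
        difflib.unified_diff(
            old_lines, new_lines, fromfile=old_name, tofile=new_name, lineterm=""
        )
    )


before = "line one\nline two\nline three"
after = "line one\nline 2\nline three"

for line in compare_texts(before, after):
    print(line)
```

Lines prefixed with `-` appear only in the first text and lines prefixed with `+` only in the second; identical inputs produce an empty listing.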
Resisting neocolonial development: the struggle of the people of Andalgalá against mega-mining projects

Directory of Open Access Journals (Sweden)

Maria Ceci Misoczky

Full Text Available Latin America has been experiencing a new era of governments' declared faith in the myth of development, in conjunction with the expansion of export-oriented extractivist policies in a context of renewed dependency. The most dramatic face of extractivism in the region has been the growing presence of transnational mining corporations, supported by national and regional governments and by international financial and development-support institutions, and intensely resisted by popular social movements. In this article we present the case of Andalgalá (a small town in the Province of Catamarca, Argentina) and the people's struggles against transnational mining corporations and their allies. In the tradition of the Philosophy of Liberation and Dussel's ana-dialectical method, we engage with what have been called the "Argentine communities of NO", which express their opposition to neocolonial forms of development and management. In this article we are specifically interested in understanding how two managerialist devices used by mining corporations, corporate social responsibility (CSR) and governance pacts, affect the people's struggle. Above all, this article offers snapshots of battles on the front line of extractivism. We hope to have given voice to people who are not usually heard, creating a space for their views on a different kind of development.
Measurement of the [Formula: see text] meson lifetime using [Formula: see text] decays.

Science.gov (United States)

Aaij, R; Adeva, B; Adinolfi, M; Affolder, A; Ajaltouni, Z; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Cartelle, P Alvarez; Alves, A A; Amato, S; Amerio, S; Amhis, Y; Anderlini, L; Anderson, J; Andreassen, R; Andreotti, M; Andrews, J E; Appleby, R B; Gutierrez, O Aquines; Archilli, F; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Bachmann, S; Back, J J; Badalov, A; Balagura, V; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Batozskaya, V; Bauer, Th; Bay, A; Beddow, J; Bedeschi, F; Bediaga, I; Belogurov, S; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bettler, M-O; van Beuzekom, M; Bien, A; Bifani, S; Bird, T; Bizzeti, A; Bjørnstad, P M; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Bondar, A; Bondar, N; Bonivento, W; Borghi, S; Borgia, A; Borsato, M; Bowcock, T J V; Bowen, E; Bozzi, C; Brambach, T; van den Brand, J; Bressieux, J; Brett, D; Britsch, M; Britton, T; Brook, N H; Brown, H; Bursche, A; Busetto, G; Buytaert, J; Cadeddu, S; Calabrese, R; Callot, O; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carranza-Mejia, H; Carson, L; Carvalho Akiba, K; Casse, G; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cenci, R; Charles, M; Charpentier, Ph; Cheung, S-F; Chiapolini, N; Chrzaszcz, M; Ciba, K; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coca, C; Coco, V; Cogan, J; Cogneras, E; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombes, M; Coquereau, S; Corti, G; Counts, I; Couturier, B; Cowan, G A; Craik, D C; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; Dalseno, J; David, P; David, P N Y; Davis, A; De Bonis, I; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Silva, W; De Simone, P; Decamp, D; Deckenhoff, M; Del Buono, L; Déléage, N; Derkach, D; Deschamps, O; Dettori, F; Di Canto, A; Dijkstra, H; Donleavy, S; Dordei, F; 
Dorigo, M; Dorosz, P; Dosil Suárez, A; Dossett, D; Dovbnya, A; Dupertuis, F; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Easo, S; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; El Rifai, I; Elsasser, Ch; Falabella, A; Färber, C; Farinelli, C; Farry, S; Ferguson, D; Fernandez Albor, V; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fiore, M; Fiorini, M; Fitzpatrick, C; Fontana, M; Fontanelli, F; Forty, R; Francisco, O; Frank, M; Frei, C; Frosini, M; Furfaro, E; Gallas Torreira, A; Galli, D; Gandelman, M; Gandini, P; Gao, Y; Garofoli, J; Garra Tico, J; Garrido, L; Gaspar, C; Gauld, R; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianelle, A; Gibson, V; Giubega, L; Gligorov, V V; Göbel, C; Golubkov, D; Golutvin, A; Gomes, A; Gordon, H; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graziani, G; Grecu, A; Greening, E; Gregson, S; Griffith, P; Grillo, L; Grünberg, O; Gui, B; Gushchin, E; Guz, Yu; Gys, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Hafkenscheid, T W; Haines, S C; Hall, S; Hamilton, B; Hampson, T; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hartmann, T; He, J; Head, T; Heijne, V; Hennessy, K; Henrard, P; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hoballah, M; Hombach, C; Hulsbergen, W; Hunt, P; Huse, T; Hussain, N; Hutchcroft, D; Hynds, D; Iakovenko, V; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jans, E; Jaton, P; Jawahery, A; Jing, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kaballo, M; Kandybei, S; Kanso, W; Karacson, M; Karbach, T M; Kenyon, I R; Ketel, T; Khanji, B; Khurewathanakul, C; Klaver, S; Kochebina, O; Komarov, I; Koopman, R F; Koppenburg, P; Korolev, M; Kozlinskiy, A; Kravchuk, L; Kreplin, K; Kreps, M; Krocker, G; Krokovny, P; Kruse, F; Kucharczyk, M; Kudryavtsev, V; Kurek, K; Kvaratskheliya, T; La Thi, V N; Lacarrere, D; Lafferty, G; Lai, A; Lambert, D; Lambert, R W; Lanciotti, E; Lanfranchi, G; 
Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Lefèvre, R; Leflat, A; Lefrançois, J; Leo, S; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Liles, M; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, G; Lohn, S; Longstaff, I; Lopes, J H; Lopez-March, N; Lowdon, P; Lu, H; Lucchesi, D; Luisier, J; Luo, H; Luppi, E; Lupton, O; Machefert, F; Machikhiliyan, I V; Maciuc, F; Maev, O; Malde, S; Manca, G; Mancinelli, G; Manzali, M; Maratas, J; Marconi, U; Marino, P; Märki, R; Marks, J; Martellotti, G; Martens, A; Martín Sánchez, A; Martinelli, M; Martinez Santos, D; Martins Tostes, D; Massafferri, A; Matev, R; Mathe, Z; Matteuzzi, C; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; McSkelly, B; Meadows, B; Meier, F; Meissner, M; Merk, M; Milanes, D A; Minard, M-N; Molina Rodriguez, J; Monteil, S; Moran, D; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Mountain, R; Mous, I; Muheim, F; Müller, K; Muresan, R; Muryn, B; Muster, B; Naik, P; Nakada, T; Nandakumar, R; Nasteva, I; Needham, M; Neubert, S; Neufeld, N; Nguyen, A D; Nguyen, T D; Nguyen-Mau, C; Nicol, M; Niess, V; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; Oblakowska-Mucha, A; Obraztsov, V; Oggero, S; Ogilvy, S; Okhrimenko, O; Oldeman, R; Onderwater, G; Orlandea, M; Otalora Goicochea, J M; Owen, P; Oyanguren, A; Pal, B K; Palano, A; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Pappalardo, L; Parkes, C; Parkinson, C J; Passaleva, G; Patel, G D; Patel, M; Patrignani, C; Pavel-Nicorescu, C; Pazos Alvarez, A; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perez Trigo, E; Perret, P; Perrin-Terrin, M; Pescatore, L; Pesen, E; Pessina, G; Petridis, K; Petrolini, A; Picatoste Olloqui, E; Pietrzyk, B; Pilař, T; Pinci, D; Pistone, A; Playfer, S; Plo Casasus, M; Polci, F; Polok, G; Poluektov, A; Polycarpo, E; Popov, A; Popov, D; Popovici, B; Potterat, C; Powell, A; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; 
Rachwal, B; Rademacker, J H; Rakotomiaramanana, B; Rama, M; Rangel, M S; Raniuk, I; Rauschmayr, N; Raven, G; Redford, S; Reichert, S; Reid, M M; Dos Reis, A C; Ricciardi, S; Richards, A; Rinnert, K; Rives Molina, V; Roa Romero, D A; Robbe, P; Roberts, D A; Rodrigues, A B; Rodrigues, E; Rodriguez Perez, P; Roiser, S; Romanovsky, V; Romero Vidal, A; Rotondo, M; Rouvinet, J; Ruf, T; Ruffini, F; Ruiz, H; Ruiz Valls, P; Sabatino, G; Saborido Silva, J J; Sagidova, N; Sail, P; Saitta, B; Salustino Guimaraes, V; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santovetti, E; Sapunov, M; Sarti, A; Satriano, C; Satta, A; Savrie, M; Savrina, D; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmidt, B; Schneider, O; Schopper, A; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Seco, M; Semennikov, A; Senderowska, K; Sepp, I; Serra, N; Serrano, J; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; Shevchenko, O; Shevchenko, V; Shires, A; Silva Coutinho, R; Simi, G; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, N A; Smith, E; Smith, E; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Soomro, F; Souza, D; Souza De Paula, B; Spaan, B; Sparkes, A; Spinella, F; Spradlin, P; Stagni, F; Stahl, S; Steinkamp, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Stroili, R; Subbiah, V K; Sun, L; Sutcliffe, W; Swientek, S; Syropoulos, V; Szczekowski, M; Szczypka, P; Szilard, D; Szumlak, T; T'Jampens, S; Teklishyn, M; Tellarini, G; Teodorescu, E; Teubert, F; Thomas, C; Thomas, E; van Tilburg, J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Torr, N; Tournefier, E; Tourneur, S; Tran, M T; Tresch, M; Tsaregorodtsev, A; Tsopelas, P; Tuning, N; Ubeda Garcia, M; Ukleja, A; Ustyuzhanin, A; Uwer, U; Vagnoni, V; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vázquez Sierra, C; Vecchi, S; Velthuis, J J; Veltri, M; Veneziano, G; Vesterinen, M; Viaud, B; 
Vieira, D; Vilasis-Cardona, X; Vollhardt, A; Volyanskyy, D; Voong, D; Vorobyev, A; Vorobyev, V; Voß, C; Voss, H; de Vries, J A; Waldi, R; Wallace, C; Wallace, R; Wandernoth, S; Wang, J; Ward, D R; Watson, N K; Webber, A D; Websdale, D; Whitehead, M; Wicht, J; Wiechczynski, J; Wiedner, D; Wiggers, L; Wilkinson, G; Williams, M P; Williams, M; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wright, S; Wu, S; Wyllie, K; Xie, Y; Xing, Z; Yang, Z; Yuan, X; Yushchenko, O; Zangoli, M; Zavertyaev, M; Zhang, F; Zhang, L; Zhang, W C; Zhang, Y; Zhelezov, A; Zhokhov, A; Zhong, L; Zvyagin, A

The lifetime of the [Formula: see text] meson is measured using semileptonic decays having a [Formula: see text] meson and a muon in the final state. The data, corresponding to an integrated luminosity of [Formula: see text], are collected by the LHCb detector in [Formula: see text] collisions at a centre-of-mass energy of 8 TeV. The measured lifetime is [Formula: see text], where the first uncertainty is statistical and the second is systematic.
