WorldWideScience

Sample records for stylistic text classification

  1. A stylistic classification of Russian-language texts based on the random walk model

    Science.gov (United States)

    Kramarenko, A. A.; Nekrasov, K. A.; Filimonov, V. V.; Zhivoderov, A. A.; Amieva, A. A.

    2017-09-01

    A formal approach to text analysis is suggested that is based on the random walk model. The frequencies and reciprocal positions of the vowel letters are matched up by a process of quasi-particle migration. Statistically significant difference in the migration parameters for the texts of different functional styles is found. Thus, a possibility of classification of texts using the suggested method is demonstrated. Five groups of the texts are singled out that can be distinguished from one another by the parameters of the quasi-particle migration process.

  2. Stylistic devices in French advertising texts

    Directory of Open Access Journals (Sweden)

    А С Борисова

    2009-09-01

    Full Text Available This article deals with stylistic devices at the lexical level, which are widely applied in modern French advertising texts as a means of linguistically manipulating mass consciousness.

  3. A Pragmatic-Stylistic Approach to the Study of Dramatic Texts

    Directory of Open Access Journals (Sweden)

    Khamail Ali Waheeb

    2017-08-01

    Full Text Available The present paper investigates the role that the interaction between stylistics and pragmatics, as two linguistic approaches, plays in the interpretation of literary texts. Stylistics is thought to be a multidisciplinary field that interacts with other linguistic fields and branches for the analysis of literary and non-literary texts. The language of dramatic texts is seen to be similar to that of everyday usage; therefore, pragmatic theories are used to interpret it. Stylistics applies these theories for stylistic effects. Consequently, the paper intends to examine the role which these two fields play in the analysis of literature and its importance

  4. Stylistic Fronting in the Ormulum - Scandinavian Syntactic Phenomena in Early Middle English Texts

    Directory of Open Access Journals (Sweden)

    Carola Trips

    2004-01-01

    Full Text Available This paper deals with Scandinavian influence in Early Middle English texts and especially with one syntactic phenomenon, stylistic fronting. It is claimed here that the OV/VO word order change in Early Middle English was triggered by language contact with Scandinavian (Kroch & Taylor, 1997) and that the occurrence of syntactic phenomena like stylistic fronting is evidence for the heavy impact on the English language that led to this change. The focus of the paper lies on the findings from one Early Middle English poetic text, the Ormulum, which shows non-syntactic as well as syntactic evidence for Scandinavian influence. It is shown that the orders that seem to reflect the fronting operation are indeed true instantiations of stylistic fronting. Moreover, in this text stylistic fronting is a phonological phenomenon, because it is used by Orm, its author, whenever the metrical pattern would otherwise be violated. Thus, it was part of Orm's grammar and he could use it for phonological reasons. The fact that the fronting operation is metrically driven supports Holmberg's (2000) analysis of stylistic fronting as a PF-operation.

  5. STYLISTICS OF GRAMMAR IN POETIC TEXTS (ANALYZING POETRY BY B. OKUDZHAVA)

    Directory of Open Access Journals (Sweden)

    E. N. Nikitina

    2014-01-01

    Full Text Available The paper is concerned with some syntactic constructions producing specific stylistic effects in poetic texts. It is argued that the violation of syntactic norm by some constructions can be interpreted with respect to the poet’s esthetic goals.

  6. Speech Act Classification of German Advertising Texts

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2015-12-01

    Full Text Available This paper uses the theory of speech acts and the underlying concept of pragmalinguistics to determine the types of speech acts and their classification in German printed advertising texts. We ascertain that the advertising of cars and accessories, household appliances and computer equipment, watches, fancy goods, food, pharmaceuticals, and financial, insurance and legal services, as well as airline advertising, is dominated by a pragmatic principle, which is based on demonstrating information about the benefits of a product/service. This influences the frequent usage of certain speech acts. The dominant form of exposure is to inform the recipient-user about the characteristics of the advertised product. This information is foregrounded by means of stylistic and syntactic constructions specific to the advertisement (participial constructions, appositional constructions), which help to emphasize certain notional components within the framework of the advertising text. Stylistic and syntactic devices of reduction (parceling constructions) convey the author's idea. Other means such as repetitions, enumerations, etc. are used by the advertiser to strengthen his selling power. The advertiser focuses the attention of the consumer on the characteristics of the product, seeking to convince him of the utility of the product and to influence his/her buying behavior.

  7. SAW Classification Algorithm for Chinese Text Classification

    Directory of Open Access Journals (Sweden)

    Xiaoli Guo

    2015-02-01

    Full Text Available With the explosive growth of data, the increasing volume of text places higher demands on the performance of text categorization, demands that existing classification methods cannot satisfy. Based on a study of existing text classification technology and semantics, this paper puts forward SAW (Structural Auxiliary Word), an algorithm oriented to Chinese text classification. The algorithm uses the special space effect of Chinese text, where words have an implied correlation between text information mining and text categorization, for high-correlation matching. Experiments show that the SAW classification algorithm, while ensuring precision in classification, significantly improves classification precision and recall, clearly improving the performance of information retrieval and providing an effective means of using data for information extraction in the era of big data.

  8. DYNAMIC FEATURE SELECTION FOR WEB USER IDENTIFICATION ON LINGUISTIC AND STYLISTIC FEATURES OF ONLINE TEXTS

    Directory of Open Access Journals (Sweden)

    A. A. Vorobeva

    2017-01-01

    Full Text Available The paper deals with identification and authentication of web users participating in Internet information processes (based on features of online texts). In digital forensics, web user identification based on various linguistic features can be used to discover the identity of individuals, criminals or terrorists using the Internet to commit cybercrimes. The Internet can be used as a tool in different types of cybercrimes (fraud and identity theft, harassment and anonymous threats, terrorist or extremist statements, distribution of illegal content and information warfare). Linguistic identification of web users is a kind of biometric identification; it can be used to narrow down the suspects, identify a criminal and prosecute him. The feature set includes various linguistic and stylistic features extracted from online texts. We propose dynamic feature selection for each web user identification task. Selection is based on calculating the Manhattan distance to the k nearest neighbors (Relief-f algorithm). This approach improves the identification accuracy and minimizes the number of features. Experiments were carried out on several datasets with different levels of class imbalance. Experimental results showed that feature relevance varies across different sets of web users (probable authors of some text); feature selection for each set of web users improves identification accuracy by 4% on average, which is approximately 1% higher than with the use of a static set of features. The proposed approach is most effective for a small number of training samples (messages per user).
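
    The abstract above names Relief-f with Manhattan distance to the k nearest neighbors as the basis for feature selection. The following is only a minimal sketch of a simplified Relief-F style weighting, not the authors' implementation: the function names, the range normalisation, and the toy data are assumptions made for illustration, and features are assumed to be numeric.

```python
from collections import Counter


def manhattan(a, b):
    """Manhattan (L1) distance between two equal-length numeric vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def relieff_weights(X, y, k=1):
    """Simplified Relief-F: reward features that differ on nearest misses
    (other classes) and penalise features that differ on nearest hits
    (same class). Returns one weight per feature."""
    n, d = len(X), len(X[0])
    # Per-feature value range, used to normalise differences into [0, 1].
    spans = []
    for f in range(d):
        col = [row[f] for row in X]
        spans.append((max(col) - min(col)) or 1.0)
    priors = {c: cnt / n for c, cnt in Counter(y).items()}
    weights = [0.0] * d
    for i in range(n):
        # Rank all other samples by Manhattan distance to sample i.
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: manhattan(X[i], X[j]))
        for c, prior in priors.items():
            neighbours = [j for j in others if y[j] == c][:k]
            if c == y[i]:
                scale = -1.0  # hits lower the weight
            else:
                # Misses raise it, weighted by the prior of their class.
                scale = prior / (1.0 - priors[y[i]])
            for j in neighbours:
                for f in range(d):
                    diff = abs(X[i][f] - X[j][f]) / spans[f]
                    weights[f] += scale * diff / (n * k)
    return weights


# Hypothetical toy data: feature 0 separates the two authors, feature 1 is noise.
X = [[0.0, 0.3], [0.1, 0.9], [0.2, 0.5], [0.9, 0.4], [1.0, 0.8], [0.8, 0.6]]
y = ["a", "a", "a", "b", "b", "b"]
print(relieff_weights(X, y, k=1))
```

    Features with the highest weights would be kept for a given set of candidate authors; how many to keep, and how the weights are recomputed per identification task, is not stated in the abstract and is left open here.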

  9. A contrastive-stylistic study into the tense distribution in English and Slovene fictional texts

    Directory of Open Access Journals (Sweden)

    Silvana Orel Kos

    2008-12-01

    Full Text Available The article addresses contrastive and narratological issues of the unity vs. diversity of temporal spheres in fictional texts. It focuses on the presentation of mimetic discourse within the past time-sphere narrative, trying to establish the narrative or stylistic functions of the present and past time-sphere verb actions with respect to the role of the narrator or that of the character. The diegetic and mimetic functions of verb actions in certain temporal spheres, i.e. tense usage in (free) indirect discourse and (free) direct discourse, will be contrastively studied in original fictional texts and their translations, in both directions between English and Slovene. The character’s mimetic discourse may be presented through different narrative forms, spanning the report-control cline from the forms “in total control” of the character, i.e. free direct discourse, to that “apparently in total control” of the narrator, i.e. speech act and thought act report (cf. Leech and Short 1981: 324). In addition to the character’s verbal and mental responses, the study includes mediated instances of the character’s sensory responses, the basic formula thus being: He said that/thought that/saw that. Our contrastive analysis considers only fictional texts whose diegesis is rendered in the narrative past tenses, as the English language system observes the sequence of tenses, while the Slovene language does not. The diegesis of a fictional text may be completely located in the present time-sphere, yet such texts do not present any major issues in terms of contrastive relevance for the studied language pair.

  10. Cluster Based Text Classification Model

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    We propose a cluster based classification model for suspicious email detection and other text classification tasks. The text classification tasks comprise many training examples that require a complex classification model. Using clusters for classification makes the model simpler and increases...... the accuracy at the same time. The test example is classified using simpler and smaller model. The training examples in a particular cluster share the common vocabulary. At the time of clustering, we do not take into account the labels of the training examples. After the clusters have been created......, the classifier is trained on each cluster having reduced dimensionality and less number of examples. The experimental results show that the proposed model outperforms the existing classification models for the task of suspicious email detection and topic categorization on the Reuters-21578 and 20 Newsgroups...
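
    The procedure outlined in this abstract (cluster the training examples without looking at their labels, train a classifier per cluster, then classify a test example with the model of its cluster) can be sketched as follows. This is an illustrative sketch under stated assumptions, not the authors' model: plain k-means and a nearest-class-centroid classifier stand in for the unspecified clustering and classification methods, and all names and data are hypothetical.

```python
from collections import defaultdict


def sq_dist(a, b):
    """Squared Euclidean distance."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(vectors):
    """Component-wise mean of a non-empty list of vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def kmeans(X, k, iters=20):
    """Plain k-means with a deterministic, evenly spaced initialisation."""
    centres = [list(X[(i * len(X)) // k]) for i in range(k)]
    for _ in range(iters):
        groups = defaultdict(list)
        for x in X:
            groups[min(range(k), key=lambda i: sq_dist(x, centres[i]))].append(x)
        centres = [mean(groups[i]) if groups[i] else centres[i] for i in range(k)]
    return centres


def fit(X, y, k):
    """Cluster without labels, then build one small model per cluster."""
    centres = kmeans(X, k)  # labels are ignored at clustering time
    per_cluster = defaultdict(lambda: defaultdict(list))
    for x, label in zip(X, y):
        c = min(range(k), key=lambda i: sq_dist(x, centres[i]))
        per_cluster[c][label].append(x)
    # Stand-in classifier: one centroid per class inside each cluster.
    models = {c: {label: mean(rows) for label, rows in by_label.items()}
              for c, by_label in per_cluster.items()}
    return centres, models


def predict(model, x):
    """Route x to its nearest non-empty cluster, then use that cluster's model."""
    centres, models = model
    c = min(models, key=lambda i: sq_dist(x, centres[i]))
    class_centres = models[c]
    return min(class_centres, key=lambda label: sq_dist(x, class_centres[label]))


# Hypothetical toy data: two clusters, each with its own local class boundary.
X = [[0, 0], [0, 1], [1, 0], [1, 1], [10, 10], [10, 11], [11, 10], [11, 11]]
y = ["spam", "spam", "ham", "ham", "ham", "ham", "spam", "spam"]
model = fit(X, y, k=2)
print(predict(model, [0.2, 0.4]), predict(model, [10.1, 10.9]))
```

    In the toy data the class boundary runs in opposite directions in the two clusters, so a single global centroid model could not separate the classes, while each per-cluster model only has to solve a small local problem.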

  11. The Role of Stylistic Approach in Teaching Contemporary Adabi Texts: The Case of Elaik ya Valadi of Saad-e-Sabbah

    Directory of Open Access Journals (Sweden)

    Nouroddin Parvin

    2014-01-01

    In this research, we will analyze the "eleik Ya Valadi" elegy of Saad e Sabbah, one of the pioneers of contemporary Arabic Rassa poetry at the morphological and syntactic levels with discretional-analytical methods and in stylistic context to analyze the role of stylistics in teaching Arabic texts. One of the most important results of this research is that it is one of the best methods in education of literacy, because of the fact that it considers the consistency and coordination between method of teaching and literacy, and also increases the students’ motivation for understanding and communication. Keywords: Contemporary literacy, teaching literacy, stylistics, Saad e sabbah

  12. STYLISTICS IN BRAZIL

    Directory of Open Access Journals (Sweden)

    Carlos Eduardo Falcão Uchôa

    2015-11-01

    Full Text Available This article aims at pointing out the origins and the evolution of Stylistics in Brazil. After an introduction about the emergence, in Europe, of Stylistics as a discipline in the field of language studies, the article concerns itself with the contributions of Philology, Linguistics, the Theory of Literature and Grammar to the study of the most different stylistic devices employed by writers, particularly by Brazilian ones.

  13. THE ROLE OF STYLISTIC MEANS IN CREATING PARADOX

    Directory of Open Access Journals (Sweden)

    Marina Vladimirovna Lekova

    2017-03-01

    Full Text Available Purpose. Despite the existence of massive scientific literature devoted to paradox this phenomenon does not seem to have received sufficient linguistic coverage. The purpose of the article is to describe stylistic mechanisms underlying paradoxical statements. The investigation was carried out on the material collected from the fairy tales by O. Wilde, the distinguished master of paradoxes. Methodology. The investigation is based on the methods of stylistic identification, classification and description, as well as contextual analysis. Results. The article treats paradox not only as a logical, but also as a linguistic category. Among stylistic markers of paradoxes personification, hyperbole, irony, antithesis and contrast are analyzed. Attempts are made to prove that these stylistic devices are paradoxical by their nature. The article considers varieties of paradoxical statements spawned by each of them in the fairy tales by O. Wilde. Practical applications. The results of the investigation can be employed at lectures and practical classes in Theory of literature and English stylistics.

  14. Free-Text Disease Classification

    Science.gov (United States)

    2011-09-01

    1-59593-597-7. http://doi.acm.org/10.1145/1277741.1277889. [9] Ingo Feinerer. tm: Text Mining Package, 2011. http://tm.r-forge.r-project.org/. R...package version 0.5-6. [10] Duncan Temple Lang. r-cran-xml, 2011. [11] Ingo Feinerer, Kurt Hornik, and David Meyer. Text mining infrastructure in R

  15. CCM: A Text Classification Method by Clustering

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    In this paper, a new Cluster based Classification Model (CCM) for suspicious email detection and other text classification tasks, is presented. Comparative experiments of the proposed model against traditional classification models and the boosting algorithm are also discussed. Experimental results...... show that the CCM outperforms traditional classification models as well as the boosting algorithm for the task of suspicious email detection on terrorism domain email dataset and topic categorization on the Reuters-21578 and 20 Newsgroups datasets. The overall finding is that applying a cluster based...

  16. About the role of stylistic and syntactic devices of expansion in the informational complex of dicteme of a German advertising text

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2012-12-01

    Full Text Available The article highlights stylistic and syntactic devices of expansion, which act as compositional means, vary normative syntactic structure of an advertising text, contribute to sense formation, creating conditions for the purpose of advertiser’s intent. By means of these language elements expressing invariant tactic sense the advertiser consciously expands and/or complicates the informative complex of dicteme, an acting text unit, transmitting superfluous impressive information together with factual one. Combination of factual and impressive items of information activates both rational and emotional perceptional channels of prospective consumer, intensifies the positioning process of an advertised article.

  17. Radical stylistics: yet another epithet?

    Directory of Open Access Journals (Sweden)

    Sônia Zyngier

    2008-04-01

    Full Text Available An investigation into the history of stylistics will reveal that definitions have had short lives, mostly because stylistics has tended to be narrowly defined. Very few studies have attempted to integrate all the multiple approaches into one large and flexible model (cf. Carter, 1989). As a result, stylistics has been sitting uncomfortably between linguistics and literature, and questions about its validity remain on every theoretician’s agenda. These questions are not new. Twenty one years ago Fish (1973) asked what stylistics was and why people were saying such terrible things about it. Here he maintained that stylisticians could collect relevant data but were still unable to justify the interpretation of these data. The scientific method proposed failed when it came to interpretation, which remained arbitrary.

  18. Using Genetic Algorithms for Texts Classification Problems

    Directory of Open Access Journals (Sweden)

    A. A. Shumeyko

    2009-01-01

    Full Text Available The avalanche of information produced by mankind has led to the concept of automated knowledge extraction, Data Mining ([1]). This field covers a wide spectrum of problems, from recognition of fuzzy sets to the creation of search engines. An important component of Data Mining is the processing of text information. Such problems rest on the concepts of classification and clustering ([2]). Classification consists in determining the membership of some element (text) in one of a set of classes created in advance. Clustering means splitting a set of elements (texts) into clusters whose number is determined by the localization of the elements of the given set in the vicinities of the natural centers of these clusters. Solving a classification problem should initially rest on given postulates, the basic ones being a priori information on the primary set of texts and a measure of affinity between elements and classes.

  19. Arabic Text Classification Using Support Vector Machines

    NARCIS (Netherlands)

    Gharib, Tarek F.; Habib, Mena B.; Fayed, Zaki T.

    2009-01-01

    Text classification (TC) is the process of classifying documents into a predefined set of categories based on their content. Arabic language is highly inflectional and derivational language which makes text mining a complex task. In this paper we applied the Support Vector Machines (SVM) model in

  20. Clustering Analysis within Text Classification Techniques

    Directory of Open Access Journals (Sweden)

    Madalina ZURINI

    2011-01-01

    Full Text Available The paper represents a personal approach upon the main applications of classification which are presented in the area of knowledge based society by means of methods and techniques widely spread in the literature. Text classification is underlined in chapter two where the main techniques used are described, along with an integrated taxonomy. The transition is made through the concept of spatial representation. Having the elementary elements of geometry and the artificial intelligence analysis, spatial representation models are presented. Using a parallel approach, spatial dimension is introduced in the process of classification. The main clustering methods are described in an aggregated taxonomy. For an example, spam and ham words are clustered and spatial represented, when the concepts of spam, ham and common and linkage word are presented and explained in the xOy space representation.

  1. Grammatical-stylistic Adaptation of Journalistic Text in Accordance with the Communicative-pragmatic Orientation to the Addressee (Based on Articles by V.G. Korolenko)

    Directory of Open Access Journals (Sweden)

    Mahmood Hemin Tahir Mahmood

    2016-12-01

    Full Text Available A publicist is able to convey the essence and emotional aspect of certain phenomena of reality to the audience (he/she is a symbolic figure of any era). During periods of radical historical and socio-political change, the personality of the journalist (publicist) becomes socially and psychologically conditioned, as they are in charge of the formation of public views. V.G. Korolenko was one of those publicists whose observations and reflections remain relevant to date. Studying the linguistic specificity of articles by the outstanding publicist reveals the canonical features of his idiostyle and priority stylistic devices, such as dialogization, metaphorization, and the use of antithesis. The article explains the transformation of the publicist’s traditional voice manner in a text addressed not to a prepared intelligent reader but to the simple peasant, who has no experience of rational analysis of socially meaningful text. Comparative analysis of the articles «domestic phenomenon» and «The Fall of royal power» can detect the start of the dialogue leveling and the priority of grading and epithet as stylistic devices in the post-revolutionary journalism of V.G. Korolenko.

  2. Accurate stemming of Dutch for text classification

    NARCIS (Netherlands)

    Gaustad, T; Bouma, G; Theune, M; Nijholt, A; Hondorp, H

    2002-01-01

    This paper investigates the use of stemming for classification of Dutch (email) texts. We introduce a stemmer, which combines dictionary lookup (implemented efficiently as a finite state automaton) with a rule-based backup strategy, and show that it outperforms the Dutch Porter stemmer in terms of

  3. Information fusion for automatic text classification

    Energy Technology Data Exchange (ETDEWEB)

    Dasigi, V. [Department of Computer Science and Information Technology, Sacred Heart University, Fairfield, CT (United States); Mann, R.C.; Protopopescu, V.A. [Computer and Mathematics Division, Oak Ridge National Laboratory, Oak Ridge, TN (United States)

    1996-08-01

    Analysis and classification of free text documents encompass decision-making processes that rely on several clues derived from text and other contextual information. When using multiple clues, it is generally not known a priori how these should be integrated into a decision. An algorithmic sensor based on Latent Semantic Indexing (LSI) (a recent successful method for text retrieval rather than classification) is the primary sensor used in our work, but its utility is limited by the reference library of documents. Thus, there is an important need to complement or at least supplement this sensor. We have developed a system that uses a neural network to integrate the LSI-based sensor with other clues derived from the text. This approach allows for systematic fusion of several information sources in order to determine a combined best decision about the category to which a document belongs.

  4. Toward text understanding: classification of text documents by word map

    Science.gov (United States)

    Visa, Ari J. E.; Toivanen, Jarmo; Back, Barbro; Vanharanta, Hannu

    2000-04-01

    In many fields, for example in business, engineering, and law, there is interest in the search and classification of text documents in large databases. Methods exist for information retrieval purposes; they are mainly based on keywords. In cases where keywords are lacking, information retrieval is problematic. One approach is to use the whole text document as a search key. Neural networks offer an adaptive tool for this purpose. This paper suggests a new adaptive approach to the problem of clustering and search in large text document databases. The approach is a multilevel one based on word, sentence, and paragraph level maps. Here only the word map level is reported. The reported approach is based on smart encoding, on Self-Organizing Maps, and on document histograms. The results are very promising.

  5. Speech Act Classification of German Advertising Texts

    OpenAIRE

    Артур Нарманович Мамедов

    2015-01-01

    This paper uses the theory of speech acts and the underlying concept of pragmalinguistics to determine the types of speech acts and their classification in the German advertising printed texts. We ascertain that the advertising of cars and accessories, household appliances and computer equipment, watches, fancy goods, food, pharmaceuticals, and financial, insurance, legal services and also airline advertising is dominated by a pragmatic principle, which is based on demonstrating information a...

  6. STYLISTICS AND CONTEMPORARY LITERARY DISCOURSE

    Directory of Open Access Journals (Sweden)

    Elis de Almeida Cardoso

    2015-11-01

    Full Text Available For a long time, Stylistics was considered a discipline akin to Rhetoric, being responsible for the study of figures of speech existing in literary texts. This narrow idea has been extended because, today, style is considered to comprise forms of expression in different enunciative contexts. Hence its importance for discourse studies, since each genre has specific linguistic usages that generate different meaning effects. In the case of literary discourse, researchers are faced with the question of form, of expressiveness and of individuation of speech, so they try to investigate the expressive effects obtained through stylistic uses. If the study of the style of a particular period in literature became important because there were groups of writers with very similar stylistic features that revealed their artistic ideals, currently, there is a great stylistic diversity, which makes it difficult to work with the style of a period and “opens the door” to a more plural analysis, which tries to identify the stylistic uses of each author, of the various groups that appear simultaneously in postmodernity with distinct objectives. Thus, this article aims to investigate, drawing on Stylistics, how it is possible to study the meaning effects obtained in contemporary literary discourse.

  7. Research on Classification of Chinese Text Data Based on SVM

    Science.gov (United States)

    Lin, Yuan; Yu, Hongzhi; Wan, Fucheng; Xu, Tao

    2017-09-01

    Data Mining has important application value in today’s industry and academia. Text classification is a very important technology in data mining. At present, there are many mature algorithms for text classification. KNN, NB, AB, SVM, decision tree and other classification methods all show good classification performance. The Support Vector Machine (SVM) classification method is a good classifier in machine learning research. This paper studies the classification performance of the SVM method on Chinese text data and uses the support vector machine method to classify Chinese text, with the aim of combining academic research and practical application.

  8. Automatic Amharic text news classification: A neural networks ...

    African Journals Online (AJOL)

    The study is on classification of Amharic news automatically using neural networks approach. Learning Vector Quantization (LVQ) algorithm is employed to classify new instance of Amharic news based on classifier developed using training dataset. Two weighting schemes, Term Frequency (TF) and Term Frequency by ...

  9. Functional Stylistics and Peripeteic Texts

    DEFF Research Database (Denmark)

    Borchmann, Simon

    2008-01-01

    Using a pragmatically based linguistic description apparatus on literary use of language is not unproblematic. Observations show that literary use of language violates the norms contained by this apparatus. With this paper I suggest how we can deal with this problem by setting up a frame for the ...

  10. The stylistic idiosyncrasy of Almodóvar’s speech and its influence on the target text. Analysis of some translation strategies for rendering social dialects in the Polish version of the film "Volver" (2006)

    Directory of Open Access Journals (Sweden)

    Leticia Santamaría Ciordia

    2012-01-01

    Full Text Available The identification of an original and idiosyncratic style in the work of the Spanish filmmaker Pedro Almodóvar provides the tone of the speech with a semantic function: linguistic variation and specific registers help to transmit the message and intention of the author. As a consequence of this, the possibilities for the translation being carried out in similar linguistic terms to those of the original text will be higher than in stylistically unmarked texts or texts without linguistic peculiarities. The purpose of this paper is to reflect on the influence exerted by the original author on the reception of his work abroad and, consequently, on its translation. In this sense, we will analyse the informative value of certain linguistic uses as well as the importance of stylistically enriching the subtitles in these connotative texts, through the example of Almodóvar’s film "Volver" and its translation into Polish (2006).

  11. A ROUGH GUIDE TO DOING CORPUS STYLISTICS

    Directory of Open Access Journals (Sweden)

    Tania M. G. Shepherd

    2015-11-01

    Full Text Available This article has two main purposes. The first is to provide a short panorama of existing trends within computer-assisted stylistics. The second is to analyse a prize-winning novel by English writer Julian Barnes, by resorting to the tenets and working tools of one of the newest branches of Stylistics, the so-called Corpus stylistics. To this end, the article starts by looking at various attempts at defining what style is and their implications for the definition of the discipline known as Stylistics. Then the paper presents recent work within the field of Corpus stylistics, as it describes the uses of computational tools as part of the stylistician's tool kit. Finally, the paper provides a variety of ways in which a literary work may be approached digitally, with a view to showing how computational tools can aid the stylistician in acts of interpretation.

  12. A text classification algorithm based on feature weighting

    Science.gov (United States)

    Yang, Han; Cui, Honggang; Tang, Hao

    2017-08-01

    Text classification comes down to matching the data to be classified according to certain characteristics. Of course, a complete match is not possible, so the optimal matching result must be selected to complete the classification. To address the shortcomings of the traditional KNN text classification algorithm, a KNN text classification algorithm based on feature weighting is proposed. The algorithm considers the contribution of each dimension to the classification, assigns different weights to different features, strengthens the role of important features, and improves the classification accuracy of the algorithm.
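
    A KNN classifier with per-feature weights, as this abstract describes in outline, might look like the sketch below. It is an assumption-laden illustration rather than the published algorithm: the abstract does not say how the weights are derived or which distance is used, so the weights are simply passed in and a weighted Euclidean distance is assumed; all names and data are hypothetical.

```python
import math
from collections import Counter


def weighted_distance(a, b, weights):
    """Euclidean distance in which each dimension is scaled by its weight."""
    return math.sqrt(sum(w * (x - y) ** 2 for w, x, y in zip(weights, a, b)))


def knn_predict(X, y, query, weights, k=3):
    """Majority vote among the k training samples nearest to the query."""
    ranked = sorted(range(len(X)),
                    key=lambda i: weighted_distance(X[i], query, weights))
    votes = Counter(y[i] for i in ranked[:k])
    return votes.most_common(1)[0][0]


# Hypothetical toy data: feature 0 carries the class signal, feature 1 is noisy.
X = [[0.0, 5.0], [0.1, 1.0], [0.2, 9.0], [1.0, 4.8], [0.9, 0.2], [1.1, 9.1]]
y = ["sport", "sport", "sport", "politics", "politics", "politics"]
query = [0.3, 0.2]

print(knn_predict(X, y, query, weights=[1.0, 1.0], k=1))  # uniform weights
print(knn_predict(X, y, query, weights=[1.0, 0.0], k=1))  # noisy feature muted
```

    With uniform weights the noisy second dimension dominates the distance; down-weighting it lets the informative dimension decide, which is the effect the abstract attributes to feature weighting.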

  13. Automatic Amharic text news classification: A neural networks ...

    African Journals Online (AJOL)

    Ethiopian Journal of Science and Technology, Vol 6, No 2 (2013).

  14. An Information Text Classification Algorithm Based on DBN

    Directory of Open Access Journals (Sweden)

    LU Shu-bao

    2017-04-01

    Full Text Available To address the low categorization accuracy and uneven distribution of traditional text classification algorithms, a text classification algorithm based on deep learning is put forward. Deep belief networks have a very strong feature learning ability and can extract features from the high-dimensional original feature space, so that they can be used not only to represent the text but also to train the classification model. The TF-IDF formula is used to compute text feature values, and deep belief networks are used to construct the classifier. The experimental results show that, compared with commonly used classification algorithms such as the support vector machine, neural network and extreme learning machine, the algorithm has higher accuracy and practicability, and it opens up new ideas for research on text classification.
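
    The TF-IDF step mentioned in this abstract can be sketched as follows. This uses one common variant (term frequency normalized by document length, inverse document frequency as log(N / df)); the abstract does not state which variant the author uses, and the three toy documents are assumptions.

```python
import math
from collections import Counter

def tf_idf(docs):
    """docs: list of token lists. Returns one {term: weight} dict per document."""
    n_docs = len(docs)
    # Document frequency: number of documents that contain each term.
    df = Counter(term for doc in docs for term in set(doc))
    vectors = []
    for doc in docs:
        counts = Counter(doc)
        vectors.append({
            term: (count / len(doc)) * math.log(n_docs / df[term])
            for term, count in counts.items()
        })
    return vectors

# Toy corpus (assumption): "text" occurs in every document.
docs = [
    "deep belief network text".split(),
    "support vector machine text".split(),
    "extreme learning machine text".split(),
]
vectors = tf_idf(docs)
print(vectors[0])
```

    A term that occurs in every document, such as "text" here, receives weight zero, while a term confined to one document receives the largest weight. The resulting vectors would then be the input to whatever classifier is trained on them.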

  15. Medical Text Classification using Convolutional Neural Networks

    OpenAIRE

    Hughes, Mark; Li, Irene; Kotoulas, Spyros; Suzumura, Toyotaro

    2017-01-01

    We present an approach to automatically classify clinical text at a sentence level. We are using deep convolutional neural networks to represent complex features. We train the network on a dataset providing a broad categorization of health information. Through a detailed evaluation, we demonstrate that our method outperforms several approaches widely used in natural language processing tasks by about 15%.

  16. Medical Text Classification Using Convolutional Neural Networks.

    Science.gov (United States)

    Hughes, Mark; Li, Irene; Kotoulas, Spyros; Suzumura, Toyotaro

    2017-01-01

    We present an approach to automatically classify clinical text at a sentence level. We are using deep convolutional neural networks to represent complex features. We train the network on a dataset providing a broad categorization of health information. Through a detailed evaluation, we demonstrate that our method outperforms several approaches widely used in natural language processing tasks by about 15%.

  17. FROM STYLISTICS TO STYLOMETRY IN LITERATURE

    Directory of Open Access Journals (Sweden)

    Camillo Cavalcanti

    2015-11-01

    Full Text Available This article aims to contribute to the recent efforts to recover Stylistics from the downfall promoted by Literary Studies in Brazil by way of introducing an account of the history of Stylistics. The paper intends to show that Stylistics has used the stylometric method – which is a qualitative analysis of quantitative pieces – since its foundation, which means that this method represents its essential feature. We propose a new use of Stylometry, specifically to Literature, able to change the simple construction of mathematical values to a complex approach of hermeneutic contents.

  18. Text Classification by Combining Different Distance Functions with Weights

    Science.gov (United States)

    Yamada, Takahiro; Ishii, Naohiro; Nakashima, Toyoshiro

    Text classification is an important subject in data mining. Several methods have been developed for it so far, such as nearest neighbor analysis and latent semantic analysis. The k-nearest neighbor (kNN) classification is a well-known, simple and effective method for classifying data in many domains. In kNN, the distance function is important for measuring the distance and similarity between data. To improve the performance of the kNN classifier, a new approach that combines multiple distance functions is proposed here. The weighting factors of the elements in the distance function are computed by GA to make the measurement more effective. Further, ensemble processing was developed to improve classification accuracy. Finally, experiments show that the methods developed here are effective in text classification.
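
    As a rough illustration of combining several distance functions in kNN, the sketch below takes a weighted sum of three standard distances. The choice of distances, the fixed weights and the toy data are assumptions; in the abstract the weighting factors are computed by GA, which is not reproduced here.

```python
import math
from collections import Counter

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def cosine_distance(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm if norm else 1.0

def combined_distance(a, b, functions, weights):
    """Weighted sum of several distance functions."""
    return sum(w * f(a, b) for f, w in zip(functions, weights))

def knn_predict(train, query, functions, weights, k=3):
    """train: list of (vector, label) pairs; majority vote among the k nearest."""
    ranked = sorted(train, key=lambda item: combined_distance(item[0], query, functions, weights))
    return Counter(label for _, label in ranked[:k]).most_common(1)[0][0]

# Hypothetical setup: three distances with hand-picked weights.
functions = [euclidean, manhattan, cosine_distance]
weights = [0.5, 0.3, 0.2]
train = [
    ([1.0, 0.0], "a"),
    ([2.0, 0.1], "a"),
    ([0.0, 1.0], "b"),
    ([0.1, 2.0], "b"),
]
prediction = knn_predict(train, [3.0, 0.2], functions, weights, k=3)
print(prediction)
```

    Replacing the fixed `weights` list with values found by a search procedure would bring the sketch closer to the approach the abstract describes.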

  19. Categorization and Pathology of Persian Stylistic Researches

    Directory of Open Access Journals (Sweden)

    Maryam Dorpar

    2014-07-01

    Full Text Available Abstract: In this article, surveys and studies of Persian style are categorized into two branches, historical and formalistic stylistics. Mohammad Taghi Bahar founded stylistics as an autonomous field of knowledge by publishing his book, History of the Evolution of Persian Prose (1331), for teaching at the University of Tehran. This book, which was influenced by the oral teachings of Qajar-dynasty scholars, set the path that researchers in Persian stylistics have generally followed up to now. However, researchers and critics have introduced various theories and approaches during the last four decades. Stagnation in Persian stylistic research is the main problem considered in this article. The main questions are: “Into which branch of stylistics should the Persian stylistic research performed so far be placed?”, “What are the weak points of these surveys?” and “What should be done to get rid of this stagnation?” The main objective of this article is to take steps toward removing the stagnation from Persian stylistics. Malek osh-Shoara Bahar used periodization in studying prose styles and analyzed the evolution of Persian prose in its lexical aspect (obsolete words, Arabic words, synonyms, word repetition), morphological aspect (verbal prefixes, comparative adjective suffixes), syntactic aspect (precedence of the verb over its dependents, omission of verbs) and rhetorical aspect (simile and allegory, metonymy and metaphor, prolixity and periphrasis, rhymed prose and harmony). In fact, he tried to show both the periods of health and strength and the periods of laxity and corruption in prose. We call Bahar's stylistics, and all research done in his manner, historical stylistics with a traditional attitude. In this method, the consistency and evolution of styles through history are studied and the periodization of styles is taken into account. These studies periodized styles by finding formal similarities and differences. Since, neglecting meaning and text

  20. Categorization and Pathology of Persian Stylistic Researches

    Directory of Open Access Journals (Sweden)

    Maryam Dorpar

    2014-08-01

    Full Text Available Abstract: In this article, surveys and studies of Persian style are categorized into two branches, historical and formalistic stylistics. Mohammad Taghi Bahar founded stylistics as an autonomous field of knowledge by publishing his book, History of the Evolution of Persian Prose (1331), for teaching at the University of Tehran. This book, which was influenced by the oral teachings of Qajar-dynasty scholars, set the path that researchers in Persian stylistics have generally followed up to now. However, researchers and critics have introduced various theories and approaches during the last four decades. Stagnation in Persian stylistic research is the main problem considered in this article. The main questions are: “Into which branch of stylistics should the Persian stylistic research performed so far be placed?”, “What are the weak points of these surveys?” and “What should be done to get rid of this stagnation?” The main objective of this article is to take steps toward removing the stagnation from Persian stylistics. Malek osh-Shoara Bahar used periodization in studying prose styles and analyzed the evolution of Persian prose in its lexical aspect (obsolete words, Arabic words, synonyms, word repetition), morphological aspect (verbal prefixes, comparative adjective suffixes), syntactic aspect (precedence of the verb over its dependents, omission of verbs) and rhetorical aspect (simile and allegory, metonymy and metaphor, prolixity and periphrasis, rhymed prose and harmony). In fact, he tried to show both the periods of health and strength and the periods of laxity and corruption in prose. We call Bahar's stylistics, and all research done in his manner, historical stylistics with a traditional attitude. In this method, the consistency and evolution of styles through history are studied and the periodization of styles is taken into account. These studies periodized styles by finding formal similarities and differences. Since, neglecting meaning and text functionality they have only paid

  1. Classification process in a text document recommender system

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2005-12-01

    Full Text Available This paper presents the classification process in a recommender system used for textual documents taken especially from the web. In the classification process, the system uses a combination of content filters, event filters and collaborative filters, and it uses implicit and explicit feedback to evaluate documents.

  2. Hot complaint intelligent classification based on text mining

    Directory of Open Access Journals (Sweden)

    XIA Haifeng

    2013-10-01

    Full Text Available The complaint recognizer system plays an important role in ensuring the correct classification of hot complaints and in improving the service quality of the telecommunications industry. Customer complaints in the telecommunications industry are special in that they must be handled within a limited time, which causes errors in the classification of hot complaints. The paper presents a model for intelligent classification of hot complaints based on text mining, which can classify a hot complaint into the correct level of the complaint navigation. The examples show that the model classifies complaint text efficiently.

  3. Semi-supervised Text Classification Using RBF Networks

    Science.gov (United States)

    Jiang, Eric P.

    Semi-supervised text classification has numerous applications and is particularly applicable to the problems where large quantities of unlabeled data are readily available while only a small number of labeled training samples are accessible. The paper proposes a semi-supervised classifier that integrates a clustering based Expectation Maximization (EM) algorithm into radial basis function (RBF) neural networks and can learn for classification from a very small number of labeled training samples and a large pool of unlabeled data effectively. A generalized centroid clustering algorithm is also investigated in this work to balance predictive values between labeled and unlabeled training data and to improve classification accuracy. Experimental results with three popular text classification corpora show that the proper use of additional unlabeled data in this semi-supervised approach can reduce classification errors by up to 26%.

  4. Deep Learning for text data mining: Solving spreadsheet data classification.

    OpenAIRE

    Kimashev, Aleksandr

    2017-01-01

    Master's thesis in Computer science. This project was developed for the Avito LOOPS company. The research goals were to investigate existing algorithms and implementations of Deep Learning, to understand their applicability to text mining, to design a solution that incorporates theoretical and practical aspects, and to run classification experiments on different data sets so that the pros and cons of different techniques can be understood. Classification of the text was necessary for the spreadsheet co...

  5. Text Classification: Classifying Plain Source Files with Neural Network

    Directory of Open Access Journals (Sweden)

    Jaromir Veber

    2010-10-01

    Full Text Available Automated text file categorization has an important place in computer engineering, particularly in the process called data management automation. A lot has been written about text classification, and the methods allowing classification of these files are well known. Unfortunately, most studies are theoretical, and more research is needed for practical implementation. I decided to contribute research focused on creating a classifier for different kinds of programs (source files, scripts…). This paper describes a practical implementation of the classifier for text files depending on file content.

  6. Using LSI and its variants in Text Classification

    Science.gov (United States)

    Batra, Shalini; Bawa, Seema

    Latent Semantic Indexing (LSI), a well-known technique in Information Retrieval, has been partially successful in text retrieval, but no major breakthrough has been achieved in text classification as yet. A significant step forward in this regard was made by Hofmann [3], who presented the probabilistic LSI (PLSI) model as an alternative to LSI. PLSI is not successful if we wish to consider exchangeable representations for documents and words, which further led to the Latent Dirichlet Allocation (LDA) model [4]. A new local Latent Semantic Indexing method called "Local Relevancy Ladder-Weighted LSI" (LRLW-LSI) has been proposed by some authors to improve text classification [5]. In this paper we study LSI and its variants in detail, analyze the role played by them in text classification and conclude with future directions in this area.

  7. Improving classification in protein structure databases using text mining

    Directory of Open Access Journals (Sweden)

    Jones David T

    2009-05-01

    Full Text Available Abstract Background: The classification of protein domains in the CATH resource is primarily based on structural comparisons, sequence similarity and manual analysis. One of the main bottlenecks in the processing of new entries is the evaluation of 'borderline' cases by human curators with reference to the literature, and better tools for helping both expert and non-expert users quickly identify relevant functional information from text are urgently needed. A text based method for protein classification is presented, which complements the existing sequence and structure-based approaches, especially in cases exhibiting low similarity to existing members and requiring manual intervention. The method is based on the assumption that textual similarity between sets of documents relating to proteins reflects biological function similarities and can be exploited to make classification decisions. Results: An optimal strategy for the text comparisons was identified by using an established gold standard enzyme dataset. Filtering of the abstracts using a machine learning approach to discriminate sentences containing functional, structural and classification information that are relevant to the protein classification task improved performance. Testing this classification scheme on a dataset of 'borderline' protein domains that lack significant sequence or structure similarity to classified proteins showed that although, as expected, the structural similarity classifiers perform better on average, there is a significant benefit in incorporating text similarity in logistic regression models, indicating significant orthogonality in this additional information. Coverage was significantly increased especially at low error rates, which is important for routine classification tasks: 15.3% for the combined structure and text classifier compared to 10% for the structural classifier alone, at a 10^-3 error rate. Finally when only the highest scoring predictions were used

  8. A HMM text classification model with learning capacity

    Directory of Open Access Journals (Sweden)

    Eva L. IGLESIAS

    2015-05-01

    Full Text Available In this paper a method of classifying biomedical text documents based on the Hidden Markov Model is proposed and evaluated. The method is integrated into a framework named BioClass. BioClass is composed of intelligent text classification tools and facilitates comparison between them because it offers several views of the results. The main goal is to propose a more effective content-based classifier than current methods in this environment. To test the effectiveness of the classifier presented, a set of experiments performed on the OHSUMED corpus is presented. Our model is tested with and without learning capacity, and it is compared with other classification techniques. The results suggest that the adaptive HMM model is indeed more suitable for document classification.

  9. Rational kernels for Arabic Root Extraction and Text Classification

    Directory of Open Access Journals (Sweden)

    Attia Nehar

    2016-04-01

    Full Text Available In this paper, we address the problems of Arabic Text Classification and root extraction using transducers and rational kernels. We introduce a new root extraction approach based on the use of Arabic patterns (Pattern Based Stemmer). Transducers are used to model these patterns, and root extraction is done without relying on any dictionary. Using transducers for extracting roots, documents are transformed into finite state transducers. This document representation allows us to use and explore rational kernels as a framework for Arabic Text Classification. Root extraction experiments are conducted on three word collections and yield 75.6% accuracy. Classification experiments are done on the Saudi Press Agency dataset, and N-gram kernels are tested with different values of N. Accuracy and F1 reach 90.79% and 62.93% respectively. These results show that our approach, when compared with other approaches, is promising, especially in terms of accuracy and F1.

  10. A Chinese text classification system based on Naive Bayes algorithm

    Directory of Open Access Journals (Sweden)

    Cui Wei

    2016-01-01

    Full Text Available In this paper, aiming at the characteristics of Chinese text classification, we use ICTCLAS (the Chinese lexical analysis system of the Chinese Academy of Sciences) for document segmentation, clean the data and filter stop words, and use the information gain and document frequency feature selection algorithms to select document features. On this basis, we implement a text classifier based on the Naive Bayes algorithm, and carry out experiments and analysis on the system using the Chinese corpus of Fudan University.
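
    The information gain criterion named in this abstract can be sketched with the standard definition for a binary "term present / term absent" feature: the class entropy minus the entropy that remains after splitting the documents on the term. The toy documents and labels are assumptions, and the sketch is not claimed to match the authors' implementation.

```python
import math

def entropy(counts):
    """Shannon entropy (in bits) of a list of class counts."""
    total = sum(counts)
    return -sum((c / total) * math.log2(c / total) for c in counts if c)

def information_gain(docs, labels, term):
    """docs: list of token sets; labels: class label of each document."""
    classes = sorted(set(labels))

    def class_counts(indices):
        return [sum(1 for i in indices if labels[i] == c) for c in classes]

    with_term = [i for i, doc in enumerate(docs) if term in doc]
    without_term = [i for i, doc in enumerate(docs) if term not in doc]
    n = len(docs)
    gain = entropy(class_counts(range(n)))
    # Subtract the entropy that remains inside each side of the split.
    for subset in (with_term, without_term):
        if subset:
            gain -= len(subset) / n * entropy(class_counts(subset))
    return gain

# Toy data (assumption): "goal" separates the classes, "match" does not.
docs = [{"goal", "match"}, {"goal", "team"}, {"stock", "market"}, {"stock", "match"}]
labels = ["sports", "sports", "finance", "finance"]
ig_goal = information_gain(docs, labels, "goal")
ig_match = information_gain(docs, labels, "match")
print(ig_goal, ig_match)
```

    Terms would then be ranked by their gain and only the top-scoring ones kept as features for the classifier.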

  11. Categorizing Children: Automated Text Classification of CHILDES files

    NARCIS (Netherlands)

    Opsomer, Rob; Knoth, Peter; Wiering, Marco; van Polen, Freek; Trapman, Jantine

    2008-01-01

    In this paper we present the application of machine learning text classification methods to two tasks: categorization of children’s speech in the CHILDES Database according to gender and age. Both tasks are binary. For age, we distinguish two age groups between the age of 1.9 and 3.0 years old. The

  12. The ‘indisciplinarity’ of stylistics

    Directory of Open Access Journals (Sweden)

    Sorlin Sandrine

    2014-12-01

    Full Text Available This paper aims at showing why the stylistician can be construed as a prolific “impostor” in a most positive sense: pledged to no specific linguistic prophet, she can opt for different theoretical linguistic tools (in the sphere of pragmatics, critical discourse analysis, cognitive grammar, etc.) depending on her object of study and what her research question is. The liberty claimed by the stylistician explains why stylistics is the “undisciplined” child of linguistics, shirking any clear definition of its boundaries. It will be argued that stylistics can only exist as a cross-disciplinary field given its conception of language as fundamentally contextualized. If it was a discipline determined by clear-cut pre-established boundaries, stylistics would be far more “disciplined” but would run the risk of serving only itself. The broad goal of this paper is thus to evince that the “indisciplinarity” of stylistics constitutes its very defining essence. With this aim in mind, it will demonstrate what stylistics owes to other disciplines, what it shares with similar language-based disciplines and what it can offer to other fields or practices of knowledge.

  13. PDF text classification to leverage information extraction from publication reports.

    Science.gov (United States)

    Bui, Duy Duc An; Del Fiol, Guilherme; Jonnalagadda, Siddhartha

    2016-06-01

    Data extraction from original study reports is a time-consuming, error-prone process in systematic review development. Information extraction (IE) systems have the potential to assist humans in the extraction task; however, the majority of IE systems were not designed to work on Portable Document Format (PDF) documents, an important and common extraction source for systematic reviews. In a PDF document, narrative content is often mixed with publication metadata or semi-structured text, which adds challenges for the underlying natural language processing algorithm. Our goal is to categorize PDF texts for strategic use by IE systems. We used an open-source tool to extract raw texts from a PDF document and developed a text classification algorithm that follows a multi-pass sieve framework to automatically classify PDF text snippets (for brevity, texts) into TITLE, ABSTRACT, BODYTEXT, SEMISTRUCTURE, and METADATA categories. To validate the algorithm, we developed a gold standard of PDF reports that were included in the development of previous systematic reviews by the Cochrane Collaboration. In a two-step procedure, we evaluated (1) classification performance, and compared it with a machine learning classifier, and (2) the effects of the algorithm on an IE system that extracts clinical outcome mentions. The multi-pass sieve algorithm achieved an accuracy of 92.6%, which was 9.7% (pPDF documents. Text classification is an important prerequisite step to leverage information extraction from PDF documents. Copyright © 2016 Elsevier Inc. All rights reserved.
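
    The general shape of a multi-pass sieve can be sketched as an ordered list of rule passes, where each pass labels only the snippets that earlier passes left unlabeled. The five category names come from the abstract; every rule below (the regular expression, the digit ratio, the word limit) and the pass order are hypothetical stand-ins, not the authors' rules.

```python
import re

def metadata_pass(text):
    # Hypothetical rule: copyright lines, DOIs and e-mail addresses are metadata.
    if re.search(r"(©|copyright|doi:|@)", text, re.IGNORECASE):
        return "METADATA"
    return None

def semistructure_pass(text):
    # Hypothetical rule: digits make up at least 40% of the non-space characters.
    chars = [c for c in text if not c.isspace()]
    if chars and sum(c.isdigit() for c in chars) / len(chars) >= 0.4:
        return "SEMISTRUCTURE"
    return None

def abstract_pass(text):
    # Hypothetical rule: the snippet opens with the word "abstract".
    if text.lower().startswith("abstract"):
        return "ABSTRACT"
    return None

def title_pass(text):
    # Hypothetical rule: a short snippet that does not end with a period.
    if len(text.split()) <= 15 and not text.rstrip().endswith("."):
        return "TITLE"
    return None

# Passes run in this order; the ordering itself is an assumption.
SIEVE = [metadata_pass, semistructure_pass, abstract_pass, title_pass]

def classify(snippets, default="BODYTEXT"):
    labels = [None] * len(snippets)
    for sieve_pass in SIEVE:
        for i, text in enumerate(snippets):
            if labels[i] is None:  # later passes never override earlier ones
                labels[i] = sieve_pass(text)
    return [label or default for label in labels]

snippets = [
    "PDF text classification to leverage information extraction",
    "Abstract Data extraction from original study reports is a time-consuming process.",
    "Copyright © 2016 Elsevier Inc. All rights reserved.",
    "12 34.5 67 8.9 10",
    "In a PDF document, narrative content is often mixed with publication metadata or semi-structured text.",
]
labels = classify(snippets)
print(labels)
```

    Anything that no pass claims falls through to the default category, so adding or reordering passes changes the outcome without touching the other rules.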

  14. Comparative Analysis of Document level Text Classification Algorithms using R

    Science.gov (United States)

    Syamala, Maganti; Nalini, N. J., Dr; Maguluri, Lakshamanaphaneendra; Ragupathy, R., Dr.

    2017-08-01

    Over the past few decades, tremendous volumes of data have become available on the Internet in structured or unstructured form. With the exponential growth of information on the Internet, there is an emergent need for text classifiers. Text mining is an interdisciplinary field that draws on information retrieval, data mining, machine learning, statistics and computational linguistics. To handle this situation, a wide range of supervised learning algorithms has been introduced. Among these, K-Nearest Neighbor (KNN) is an efficient and the simplest classifier in the text classification family. But KNN suffers from imbalanced class distribution and noisy term features. To cope with this challenge, we use document-based centroid dimensionality reduction (CentroidDR) using R programming. By combining these two text classification techniques, KNN and Centroid classifiers, we propose a scalable and effective flat classifier, called MCenKNN, which works substantially better than CenKNN.

  15. Cancer Hallmark Text Classification Using Convolutional Neural Networks

    OpenAIRE

    Baker, Simon; Korhonen, Anna-Leena; Pyysalo, S

    2017-01-01

    Methods based on deep learning approaches have recently achieved state-of-the-art performance in a range of machine learning tasks and are increasingly applied to natural language processing (NLP). Despite strong results in various established NLP tasks involving general domain texts, there is only limited work applying these models to biomedical NLP. In this paper, we consider a Convolutional Neural Network (CNN) approach to biomedical text classification. Evaluation using a recently intr...

  16. Enhanced root extraction and document classification algorithm for Arabic text

    OpenAIRE

    Alsaad, Amal

    2016-01-01

    This thesis was submitted for the award of Doctor of Philosophy and was awarded by Brunel University London. Many text extraction and classification systems have been developed for English and other international languages; most of these languages are based on Roman letters. However, Arabic is one of the difficult languages, with special rules and morphology. Not many systems have been developed for Arabic text categorization. Arabic language is one of the Semitic languages with...

  17. An Enhanced Probabilistic Neural Network Approach Applied to Text Classification

    Science.gov (United States)

    Marques Ciarelli, Patrick; Oliveira, Elias

    Text classification is still quite a difficult problem for both academia and industry. On top of that, the importance of aggregating sets of related text documents is steadily growing these days. The presence of multi-labeled texts and a great quantity of classes makes this problem even more challenging. In this article we present an enhanced version of the Probabilistic Neural Network that uses centroids to tackle the multi-label classification problem. We carried out experiments comparing our proposed classifier against other well-known classifiers in the literature that were specially designed to treat this type of problem. From the results achieved, we observed that our novel approach was superior to the other classifiers and faster than the Probabilistic Neural Network without the use of centroids.

  18. An Introduction to Literary Quaranic Stylistics

    Science.gov (United States)

    Almenoar, Lubna

    2010-01-01

    A stylistic analysis is one approach of analyzing a literary text using literary descriptions. The use of literary texts in the literature classroom has been limited to mostly Western sources. This paper is an attempt to create an awareness of the linguistic features present in the English language translations of the meaning of the Quran. The…

  19. Social Media Text Classification by Enhancing Well-Formed Text Trained Model

    Directory of Open Access Journals (Sweden)

    Phat Jotikabukkana

    2016-09-01

    Full Text Available Social media are a powerful communication tool in our era of digital information. The large amount of user-generated data is a useful novel source of data, even though it is not easy to extract the treasures from this vast and noisy trove. Since classification is an important part of text mining, many techniques have been proposed to classify this kind of information. We developed an effective technique of social media text classification by semi-supervised learning utilizing an online news source consisting of well-formed text. The computer first automatically extracts news categories, well-categorized by publishers, as classes for topic classification. A bag of words taken from news articles provides the initial keywords related to their category in the form of word vectors. The principal task is to retrieve a set of new productive keywords. Term Frequency-Inverse Document Frequency weighting (TF-IDF) and Word Article Matrix (WAM) are used as main methods. A modification of WAM is recomputed until it becomes the most effective model for social media text classification. The key success factor was enhancing our model with effective keywords from social media. A promising result of 99.50% accuracy was achieved, with more than 98.5% of Precision, Recall, and F-measure after updating the model three times.

  20. Overfitting Reduction of Text Classification Based on AdaBELM

    Directory of Open Access Journals (Sweden)

    Xiaoyue Feng

    2017-07-01

    Full Text Available Overfitting is an important problem in machine learning. Several algorithms, such as the extreme learning machine (ELM), suffer from this issue when facing high-dimensional sparse data, e.g., in text classification. One common issue is that the extent of overfitting is not well quantified. In this paper, we propose a quantitative measure of overfitting referred to as the rate of overfitting (RO) and a novel model, named AdaBELM, to reduce the overfitting. With RO, the overfitting problem can be quantitatively measured and identified. The newly proposed model can achieve high performance on multi-class text classification. To evaluate the generalizability of the new model, we designed experiments based on three datasets, i.e., the 20 Newsgroups, Reuters-21578, and BioMed corpora, which represent balanced, unbalanced, and real application data, respectively. Experiment results demonstrate that AdaBELM can reduce overfitting and outperform classical ELM, decision tree, random forests, and AdaBoost on all three text-classification datasets; for example, it can achieve 62.2% higher accuracy than ELM. Therefore, the proposed model has a good generalizability.

  1. Computational text analysis and reading comprehension exam complexity towards automatic text classification

    CERN Document Server

    Liontou, Trisevgeni

    2014-01-01

    This book delineates a range of linguistic features that characterise the reading texts used at the B2 (Independent User) and C1 (Proficient User) levels of the Greek State Certificate of English Language Proficiency exams in order to help define text difficulty per level of competence. In addition, it examines whether specific reader variables influence test takers' perceptions of reading comprehension difficulty. The end product is a Text Classification Profile per level of competence and a formula for automatically estimating text difficulty and assigning levels to texts consistently and re

  2. Overfitting Reduction of Text Classification Based on AdaBELM

    OpenAIRE

    Xiaoyue Feng; Yanchun Liang; Xiaohu Shi; Dong Xu; Xu Wang; Renchu Guan

    2017-01-01

    Overfitting is an important problem in machine learning. Several algorithms, such as the extreme learning machine (ELM), suffer from this issue when facing high-dimensional sparse data, e.g., in text classification. One common issue is that the extent of overfitting is not well quantified. In this paper, we propose a quantitative measure of overfitting referred to as the rate of overfitting (RO) and a novel model, named AdaBELM, to reduce the overfitting. With RO, the overfitting problem can ...

  3. On Extending Neural Networks with Loss Ensembles for Text Classification

    OpenAIRE

    Hajiabadi, Hamideh; Molla-Aliod, Diego; Monsefi, Reza

    2017-01-01

    Ensemble techniques are powerful approaches that combine several weak learners to build a stronger one. As a meta learning framework, ensemble techniques can easily be applied to many machine learning techniques. In this paper we propose a neural network extended with an ensemble loss function for text classification. The weight of each weak loss function is tuned within the training phase through the gradient propagation optimization method of the neural network. The approach is evaluated on...

  4. Literature and development: perspectives from stylistics | Obobolo ...

    African Journals Online (AJOL)

    This study seeks solution to the problem of underdevelopment. It examines how indigenous narrative knowledge and literary information could bring development to post-colonial societies and the human society. It focuses on the role of mediatory/ternary force in stylistic analysis of texts, and it attempts an analysis of ...

  5. Rationale-Augmented Convolutional Neural Networks for Text Classification.

    Science.gov (United States)

    Zhang, Ye; Marshall, Iain; Wallace, Byron C

    2016-11-01

    We present a new Convolutional Neural Network (CNN) model for text classification that jointly exploits labels on documents and their constituent sentences. Specifically, we consider scenarios in which annotators explicitly mark sentences (or snippets) that support their overall document categorization, i.e., they provide rationales. Our model exploits such supervision via a hierarchical approach in which each document is represented by a linear combination of the vector representations of its component sentences. We propose a sentence-level convolutional model that estimates the probability that a given sentence is a rationale, and we then scale the contribution of each sentence to the aggregate document representation in proportion to these estimates. Experiments on five classification datasets that have document labels and associated rationales demonstrate that our approach consistently outperforms strong baselines. Moreover, our model naturally provides explanations for its predictions.
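
    The aggregation step described in this abstract, a linear combination of sentence vectors scaled by rationale probabilities, can be sketched in isolation. The sentence vectors and probabilities below are made-up inputs; in the paper they would come from the sentence-level convolutional model. Normalizing the probabilities so the weights sum to one, and falling back to a plain average when all are zero, are assumptions of this sketch.

```python
def document_vector(sentence_vectors, rationale_probs):
    """Weighted sum of sentence vectors, weights proportional to rationale probabilities."""
    total = sum(rationale_probs)
    if total == 0:
        # No sentence looks like a rationale: fall back to a plain average (assumption).
        weights = [1.0 / len(sentence_vectors)] * len(sentence_vectors)
    else:
        weights = [p / total for p in rationale_probs]
    dim = len(sentence_vectors[0])
    return [
        sum(w * vec[d] for w, vec in zip(weights, sentence_vectors))
        for d in range(dim)
    ]

# Made-up inputs: two sentences in a two-dimensional embedding space.
sentence_vectors = [[1.0, 0.0], [0.0, 1.0]]
doc = document_vector(sentence_vectors, rationale_probs=[0.75, 0.25])
fallback = document_vector(sentence_vectors, rationale_probs=[0.0, 0.0])
print(doc, fallback)
```

    The first sentence, judged three times as likely to be a rationale, contributes three times as much to the document representation; the weights also indicate which sentences drove a prediction.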

  6. MakeUP Stylist

    OpenAIRE

    SAHUQUILLO FALAGUERA, SILVIA; FORTUNY ELVIRA, OSCAR; SABIC, NEDIM

    2012-01-01

    This project is a desktop application that simulates the effect of applying a given makeup, or an accessory such as glasses, to a face. That is why we named the project MakeUP Stylist, roughly "makeup stylist". As for its technical characteristics, it is programmed in C#, with an interface design based on WPF technology. To detect the faces to be made up, or their elements (eyes and mouth), we have used...

  7. DEEP LEARNING MODEL FOR BILINGUAL SENTIMENT CLASSIFICATION OF SHORT TEXTS

    Directory of Open Access Journals (Sweden)

    Y. B. Abdullin

    2017-01-01

    Sentiment analysis of short texts such as Twitter messages and comments on news portals is challenging due to the lack of contextual information. We propose a deep neural network model that uses bilingual word embeddings to solve the sentiment classification problem for a given pair of languages. We apply our approach to two corpora of two different language pairs: English-Russian and Russian-Kazakh. We show how to train a classifier in one language and predict in another. Our approach achieves 73% accuracy for English and 74% accuracy for Russian. For Kazakh sentiment analysis, we propose a baseline method that achieves 60% accuracy, and a method to learn bilingual embeddings from a large unlabeled corpus using bilingual word pairs.

  8. A journey through the stylistics of poetry

    DEFF Research Database (Denmark)

    Jensen, Kim Ebensgaard

    2015-01-01

    Review of Peter Verdonk, The Stylistics of Poetry: Context, Cognition, Discourse, History. (Series: Advances in Stylistics). London: Bloomsbury, 2013, xi + 198 pp., ISBN 978-1-4411-5878-9.

  9. Study the Stylistics of "Ashbah" sermon of "Nahjol Balaqeh"

    Directory of Open Access Journals (Sweden)

    Atefeh Rahmani

    2017-02-01

    Stylistics studies the linguistic features of literary texts and uses the tools of linguistics to reveal what makes a literary work beautiful and what its general properties are. Such studies look for features common to all of an author's works, which can be identified by studying a part of them, and sometimes use stylistic tools to explore stylistic effects on a particular theme. Because stylistic research often follows arbitrary and relative rules, study in this area depends on the nature and taste of the critic. This study investigates the coordination of word choice with sound and sentence structure, the eloquence of the vocabulary, and the coordination of sentence structure with content, in order to show that all three levels (phonology, syntax and lexicon) are used in this sermon to match words to meaning.

  10. Construction accident narrative classification: An evaluation of text mining techniques.

    Science.gov (United States)

    Goh, Yang Miang; Ubeynarayana, C U

    2017-11-01

    Learning from past accidents is fundamental to accident prevention. Thus, accident and near miss reporting are encouraged by organizations and regulators. However, for organizations managing large safety databases, the time taken to accurately classify accident and near miss narratives will be very significant. This study aims to evaluate the utility of various text mining classification techniques in classifying 1000 publicly available construction accident narratives obtained from the US OSHA website. The study evaluated six machine learning algorithms, including support vector machine (SVM), linear regression (LR), random forest (RF), k-nearest neighbor (KNN), decision tree (DT) and Naive Bayes (NB), and found that SVM produced the best performance in classifying the test set of 251 cases. Further experimentation with tokenization of the processed text and non-linear SVM were also conducted. In addition, a grid search was conducted on the hyperparameters of the SVM models. It was found that the best performing classifiers were linear SVM with unigram tokenization and radial basis function (RBF) SVM with uni-gram tokenization. In view of its relative simplicity, the linear SVM is recommended. Across the 11 labels of accident causes or types, the precision of the linear SVM ranged from 0.5 to 1, recall ranged from 0.36 to 0.9 and F1 score was between 0.45 and 0.92. The reasons for misclassification were discussed and suggestions on ways to improve the performance were provided. Copyright © 2017 Elsevier Ltd. All rights reserved.
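
    The per-label precision, recall and F1 figures reported above can be computed as in the following minimal sketch. The label names and predictions are made-up toy data, not taken from the OSHA narratives, and `per_label_scores` is a hypothetical helper name.

```python
from collections import Counter


def per_label_scores(y_true, y_pred):
    """Return {label: (precision, recall, f1)} for single-label predictions."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted label was wrong
            fn[truth] += 1  # true label was missed
    scores = {}
    for label in sorted(set(y_true) | set(y_pred)):
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        f1 = (2 * precision * recall / (precision + recall)
              if precision + recall else 0.0)
        scores[label] = (precision, recall, f1)
    return scores


# Hypothetical accident-type labels for five narratives.
y_true = ["fall", "fall", "fire", "fire", "struck"]
y_pred = ["fall", "fire", "fire", "fire", "struck"]
scores = per_label_scores(y_true, y_pred)
```

    Reporting the scores per label, rather than a single accuracy figure, shows which accident types a classifier handles poorly, which is how the range of values across the 11 labels arises.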

  11. Classification process in a text document recommender system

    OpenAIRE

    Dan MUNTEANU; Severin BUMBARU

    2005-01-01

    This paper presents the classification process in a recommender system used for textual documents taken especially from web. The system uses in the classification process a combination of content filters, event filters and collaborative filters and it uses implicit and explicit feedback for evaluating documents.

  12. Naive Bayes and Text Classification I - Introduction and Theory

    OpenAIRE

    Raschka, Sebastian

    2014-01-01

    Naive Bayes classifiers, a family of classifiers that are based on the popular Bayes' probability theorem, are known for creating simple yet well performing models, especially in the fields of document classification and disease prediction. In this article, we will look at the main concepts of naive Bayes classification in the context of document categorization.
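
    As a rough illustration of naive Bayes document categorization, the sketch below implements one common variant: a multinomial model over word counts with add-one (Laplace) smoothing. The choice of variant, the class name `NaiveBayesText`, whitespace tokenisation and the toy documents are all assumptions made for the example, not details from the article.

```python
import math
from collections import Counter, defaultdict


class NaiveBayesText:
    """Multinomial naive Bayes over whitespace tokens (illustrative only)."""

    def __init__(self, alpha=1.0):
        self.alpha = alpha                       # smoothing constant
        self.class_docs = Counter()              # documents per class
        self.word_counts = defaultdict(Counter)  # word counts per class
        self.vocab = set()

    def fit(self, docs, labels):
        for text, label in zip(docs, labels):
            self.class_docs[label] += 1
            for word in text.lower().split():
                self.word_counts[label][word] += 1
                self.vocab.add(word)
        return self

    def log_scores(self, text):
        """Unnormalised log P(class) + sum of log P(word | class)."""
        n_docs = sum(self.class_docs.values())
        vocab_size = len(self.vocab)
        scores = {}
        for label, n in self.class_docs.items():
            total = sum(self.word_counts[label].values())
            score = math.log(n / n_docs)
            for word in text.lower().split():
                if word not in self.vocab:
                    continue  # words never seen in training are ignored
                count = self.word_counts[label][word]
                score += math.log((count + self.alpha) /
                                  (total + self.alpha * vocab_size))
            scores[label] = score
        return scores

    def predict(self, text):
        scores = self.log_scores(text)
        return max(scores, key=scores.get)


docs = ["cheap pills buy now", "meeting agenda attached",
        "buy cheap watches", "project meeting notes"]
labels = ["spam", "ham", "spam", "ham"]
model = NaiveBayesText().fit(docs, labels)
```

    Working in log space avoids numerical underflow when many small word probabilities are multiplied, and the "naive" part is the assumption that words are conditionally independent given the class.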

  13. MODERN LINGUODIDACTIC ASPECTS OF COGNITIVE APPROACH REALIZATION IN TEACHING STYLISTICS OF THE UKRAINIAN LANGUAGE TO STUDENTS

    Directory of Open Access Journals (Sweden)

    Anzhelika Popovych

    2017-09-01

    An approach to teaching stylistics is a fundamental methodological category that defines the system of studying the discipline, the ways of organizing the teaching material and the peculiarities of the interaction of all components of the educational process: principles, methods, ways of teaching. The linguocognitive approach in the study of stylistics aims at identifying aspects of the speech world picture, interpreting texts from the standpoint of cognitive processes, and forming the cognitive and linguistic culture of students and the corresponding way of linguistic expression. Three levels of the linguocognitive approach to the study of stylistics in higher education are distinguished: the knowledge, practical and educational levels. The knowledge level involves students studying the foundations of cognitive linguistics and cognitive stylistics, systematically considering cognitive structures and processes, understanding the meaning of «concept» and interpreting the linguistic and aesthetic signs of national culture. The perception of the text, its decoding and its production are realized on the practical level. The educational level is aimed at forming national linguistic and speech consciousness, respect for Ukrainian language traditions, education in speech culture, and the desire to follow the aesthetic and ethical norms of communication. In line with contemporary developments in linguistic and linguo-stylistic science, what is relevant is not only clarifying the structural-level stylistic features of texts and the presence of tropes and stylistic figures, but also identifying aspects of the linguistic picture of the world and the linguistic and aesthetic signs of national culture. Therefore, cognitive-stylistic analysis of the text is appropriate for lessons in stylistics. The linguocognitive approach to the study of the stylistics of the Ukrainian language is extremely

  14. 77 FR 60475 - Draft of SWGDOC Standard Classification of Typewritten Text

    Science.gov (United States)

    2012-10-03

    ... of Justice Programs Draft of SWGDOC Standard Classification of Typewritten Text AGENCY: National... general public a draft document entitled, ``SWGDOC Standard Classification of Typewritten Text''. The... agencies, organizations, and all other stakeholders and interested parties. Those individuals wishing to...

  15. On Dataless Hierarchical Text Classification (Author’s Manuscript)

    Science.gov (United States)

    2014-07-27

    top-down and bottom-up. Our bottom line results are summarised in Table 1, indicating that dataless classification is competitive with supervised...datasets we use to do the evaluation to compare our framework with existing work. 20Newsgroups Data (20NG) The 20 newsgroups data (Lang 1995) is... microsoft windows windows dos microsoft ms driver drivers card printer rec.autos cars car ford auto toyota honda nissan bmw rec.motorcycles motorcycles

  16. Multimodal Stylistics: The Happy Marriage of Stylistics and Semiotics

    DEFF Research Database (Denmark)

    Nørgaard, Nina

    2010-01-01

    , doctor-patient discourse, academic writing, etc. While forceful in its rigour and systematism, the traditional stylistic approach (whether of a formalist, functionalist, cognitive or other orientation) has until recently largely failed to embrace meanings which are created by semiotic systems other than...

  17. Pseudo-Label Generation for Multi-Label Text Classification

    Data.gov (United States)

    National Aeronautics and Space Administration — With the advent and expansion of social networking, the amount of generated text data has seen a sharp increase. In order to handle such a huge volume of text data,...

  18. Words Matter: Scene Text for Image Classification and Retrieval

    NARCIS (Netherlands)

    Karaoglu, S.; Tao, R.; Gevers, T.; Smeulders, A.W.M.

    Text in natural images typically adds meaning to an object or scene. In particular, text specifies which business places serve drinks (e.g., cafe, teahouse) or food (e.g., restaurant, pizzeria), and what kind of service is provided (e.g., massage, repair). The mere presence of text, its words, and

  19. Improvement of Automatic Chinese Text Classification by Combining Character-based and Word-based Approaches

    OpenAIRE

    駱, 曦

    2015-01-01

    Automatic text classification (ATC) is the task of automatically assigning one or more appropriate categories for a document according to its content or topic. Traditionally, text classification is carried out by human experts as it requires a certain level of vocabulary recognition and knowledge processing. With the rapid explosion of texts in digital form and growth of online information, text classification has become an important research area owing to the need to automatically handle and...

  20. Two Different Faces of Cavafy in English: A Corpus-Assisted Approach to Translational Stylistics

    Science.gov (United States)

    Pantopoulos, Iraklis

    2012-01-01

    A translator is seen to leave a personal mark on the text through their stylistic choices and the patterns formed by these choices. This article comprises a case study that uses a specialized comparative corpus containing translations of C.P. Cavafy's canon in order to explore the distinctive stylistic features of Rae Dalven and of Edmund…

  1. Features selection for text classification based on constraints for term weights

    OpenAIRE

    Sergienko, R.; SHAN UR REHMAN M.; Khan, A.; Gasanova, T.; Minker, W.

    2015-01-01

    Text classification is an important data analysis problem which can be applied in different domains including airspace industry. In this paper different text classification problems such as opinion mining and topic categorization are considered. Different text preprocessing techniques (TF-IDF, ConfWeight, and the Novel TW) and machine learning algorithms for classification (Bayes classifier, k-NN, SVM, and artificial neural network) are applied. The main goal of the presented investigations i...

  2. Personality Types Classification for Indonesian Text in Partners Searching Website Using Naive Bayes Methods

    OpenAIRE

    Ni Made Ari Lestari; Dr. I Ketut Gede Darma Putra, S.Kom., Mt; Aa Ketut Agung Cahyawan, St., Mt

    2013-01-01

    Digital text information has been growing fast, but most digital text is in unstructured form. Text mining analysis is needed to deal with such unstructured text. One of the important activities in text mining is text classification or categorization. Text categorization itself currently has a variety of approaches, such as probabilistic approaches, support vector machines, and artificial neural network or decision tree classification. Naive Bayes probabilistic method ...

  3. Some Stylistic Aspects of Social Advertising in Russia

    Directory of Open Access Journals (Sweden)

    Aigul F. Khanova

    2017-10-01

    The article considers some stylistic aspects of advertising database in Russia. It examines the linguistic and stylistic properties and peculiarities of social advertising and the impact it has on public consciousness. It finds that social advertisements in Russia are characterized by vocabulary belonging to low language norms, which reflects the cultural and ethical context. Figurative language and stylistic devices aim to appeal to emotions and make the advertisement more memorable. The authors deem it necessary to create a common database on social advertising in Russia in order to facilitate the analysis of economic impact, evaluate the capacity to affect a mainstream audience, and determine strategies for building advertising campaigns.

  4. Translating children’s literature: some insights from corpus stylistics

    Directory of Open Access Journals (Sweden)

    Anna Čermáková

    2018-01-01

    In this paper I explore the potential of a corpus stylistic approach to the study of literary translation. The study focuses on the translation of children’s literature with its specific constraints, and illustrates the approach with two corpus linguistic techniques, keyword and cluster analysis, both of which are specific cases of repetition. In a broader sense, the paper therefore discusses the phenomenon of repetition in different literary (stylistic) traditions. These are illustrated by examples from two children’s classics aimed at two different age groups, the Harry Potter and the Winnie the Pooh books, and their translations into Czech. Various shifts in translation, especially in the translation of children’s literature, are often explained by the operation of so-called ‘translation universals’. Though ‘repetition’ as such does not belong to the commonly discussed set of translation universals, the stylistic norms opposing repetition seem to be a strong explanation for the translation shifts identified.

  5. TEXT CLASSIFICATION USING NAIVE BAYES UPDATEABLE ALGORITHM IN SBMPTN TEST QUESTIONS

    Directory of Open Access Journals (Sweden)

    Ristu Saptono

    2017-01-01

    Document classification is of growing interest in text mining research. Classification can be done based on topic, language, and so on. This study was conducted to determine how Naive Bayes Updateable performs in classifying SBMPTN exam questions by theme. Naive Bayes Updateable is an incremental version of the Naive Bayes classifier, an algorithm often used in text classification; it can learn from new data introduced to the system even after the classifier has been built from existing data. The Naive Bayes classifier classifies the exam questions by the theme of the field of study, analyzing the keywords that appear in the questions. The DF-Thresholding feature selection method is applied to improve classification performance. Evaluation of the classification with the Naive Bayes classifier algorithm yields 84.61% accuracy.
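
    The DF-Thresholding step mentioned above can be sketched as follows. This is a minimal illustration that assumes the usual form of the method, in which terms appearing in fewer than `min_df` documents are discarded; the threshold value, the function name `df_threshold` and the English toy questions are assumptions, since the abstract does not give them.

```python
from collections import Counter


def df_threshold(docs, min_df=2):
    """Keep terms whose document frequency is at least min_df."""
    df = Counter()
    for text in docs:
        # set() so that a term is counted once per document
        df.update(set(text.lower().split()))
    return sorted(term for term, n in df.items() if n >= min_df)


# Hypothetical exam questions, already lower-cased.
questions = [
    "solve the quadratic equation",
    "the cell membrane controls transport",
    "solve the linear equation",
]
kept = df_threshold(questions, min_df=2)
```

    Only the kept terms would then be used as features by the classifier, which shrinks the vocabulary before training.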

  6. METHODS OF TEXT INFORMATION CLASSIFICATION ON THE BASIS OF ARTIFICIAL NEURAL AND SEMANTIC NETWORKS

    OpenAIRE

    L. V. Serebryanaya; V. V. Potaraev

    2016-01-01

    The article covers the use of a perceptron, a Hopfield artificial neural network and a semantic network for the classification of text information. Network training algorithms are studied. The error backpropagation algorithm for the perceptron network and a convergence algorithm for the Hopfield network are implemented. On the basis of the proposed models and algorithms, automatic text classification software is developed and its results are evaluated.

  7. AN IMPLEMENTATION OF EIS-SVM CLASSIFIER USING RESEARCH ARTICLES FOR TEXT CLASSIFICATION

    Directory of Open Access Journals (Sweden)

    B Ramesh

    2016-04-01

    Automatic text classification is a prominent research topic in text mining. Text pre-processing plays a major role in a text classifier, and efficient pre-processing techniques improve classifier performance. In this paper, we implement the ECAS stemmer, Efficient Instance Selection and a Pre-computed Kernel Support Vector Machine for text classification using recent research articles. We use the ECAS stemmer to find root words, Efficient Instance Selection for dimensionality reduction of the text data, and the Pre-computed Kernel Support Vector Machine for classification of the selected instances. Experiments were performed on 750 research articles in three classes: engineering, medical and educational articles. The EIS-SVM classifier provides better performance in real-time research article classification.

  8. Phonetic and stylistics database of OF Smerdyakov's discourse

    OpenAIRE

    Е Г Луговская

    2008-01-01

    Various peculiarities of phonetic analysis in communicative stylistics of text are disclosed in the present report. The subject of the report is the experience of the examination of the oral speech of Smerdyakov. The article contains the results of research of phonetic database of Smerdyakov along with the influence of his social status.

  9. Stylistic Study of Purple Hibiscus | Chukwukaelo | AFRREV IJAH: An ...

    African Journals Online (AJOL)

    This paper did a linguistic-stylistic study of the novel with emphasis on determining how the author manipulated language to create aesthetic patterns and values. Using the notion projection method in selecting data from the texts, a linguistic model like the Transformational Generative Grammar (TGG) was applied in the ...

  10. Early Medieval stylistic rhetoric

    Directory of Open Access Journals (Sweden)

    A. G.P. van der Walt

    1981-05-01

    According to the well-known expert on medieval rhetoric, James J. Murphy, the three typical medieval forms of rhetoric are the art of letter writing, the art of preaching and the art of poetry (Murphy, 1971, p. xv). In this paper we are concerned only with the second of these arts, namely, the rhetoric of preaching. Though the perceptive treatises on the rhetoric of preaching, the so-called artes praedicandi, did not originate before the thirteenth century, pulpit rhetoric was very much alive in the earlier part of the Middle Ages and fine examples of this kind of eloquence can be quoted.

  11. Language and Literature in Tertiary Education: The Case for Stylistics.

    Science.gov (United States)

    Buckledee, Steve

    2002-01-01

    Advocates the use of stylistics for teaching English-as-a-Foreign-Language at the tertiary level. Describes stylistics, discusses discourse conventions and grammatical structure, and examines stylistic analysis of a Shakespearian sonnet and a poem. (Author/VWL)

  12. METHODS OF TEXT INFORMATION CLASSIFICATION ON THE BASIS OF ARTIFICIAL NEURAL AND SEMANTIC NETWORKS

    Directory of Open Access Journals (Sweden)

    L. V. Serebryanaya

    2016-01-01

    The article covers the use of a perceptron, a Hopfield artificial neural network and a semantic network for the classification of text information. Network training algorithms are studied. The error backpropagation algorithm for the perceptron network and a convergence algorithm for the Hopfield network are implemented. On the basis of the proposed models and algorithms, automatic text classification software is developed and its results are evaluated.

  13. The Problems of the Literary-historical Classification of Kafka's Texts

    Directory of Open Access Journals (Sweden)

    Mateja Clara Jelenčič

    2015-12-01

    Attempts at the literary classification of Kafka's texts often prove to be problematic. Most authors are at a loss, so they propose more than one classification. The problem of the literary classification of Kafka's texts is treated only in a few German literary encyclopedias and in interpretations and analyses of his texts. In the interpretations and literary analyses Kafka is mostly classified as a Modernist, but some authors see him as a Postmodernist. Most German literary historians and encyclopedia writers are caught in a dilemma on this subject and so propose more than one literary classification: Kafka is assigned to Expressionism and also proclaimed a representative of Existentialism, Surrealism or Magic Realism. However, most German literary historians often assign Kafka to Expressionism on historical grounds.

  14. Character-level Convolutional Network for Text Classification Applied to Chinese Corpus

    OpenAIRE

    Huang, Weijie; Wang, Jun

    2016-01-01

    This article explores the use of a character-level convolutional neural network for text classification on a Chinese corpus. We constructed a large-scale Chinese-language dataset, and the results show that the character-level convolutional neural network works better on the Chinese corpus than on the corresponding pinyin-format dataset. This is the first time that a character-level convolutional neural network has been applied to this text classification problem.

  15. Large-scale Multi-label Text Classification - Revisiting Neural Networks

    OpenAIRE

    Nam, Jinseok; Kim, Jungi; Mencía, Eneldo Loza; Gurevych, Iryna; Fürnkranz, Johannes

    2013-01-01

    Neural networks have recently been proposed for multi-label classification because they are able to capture and model label dependencies in the output layer. In this work, we investigate limitations of BP-MLL, a neural network (NN) architecture that aims at minimizing pairwise ranking error. Instead, we propose to use a comparably simple NN approach with recently proposed learning techniques for large-scale multi-label text classification tasks. In particular, we show that BP-MLL's ranking lo...

  16. ARABIC TEXT CLASSIFICATION USING NEW STEMMER FOR FEATURE SELECTION AND DECISION TREES

    Directory of Open Access Journals (Sweden)

    SAID BAHASSINE

    2017-06-01

    Text classification is the process of assigning unclassified texts to appropriate classes based on their content. The most prevalent representation for text classification is the bag-of-words vector. In this representation, the words that appear in documents often have multiple morphological structures and grammatical forms. In most cases, these morphological variants of a word belong to the same category. In the first part of this paper, a new stemming algorithm is developed in which each term of a given document is represented by its root. In the second part, a comparative study is conducted of the impact of two stemming algorithms, namely Khoja’s stemmer and our new stemmer (referred to hereafter as origin-stemmer), on Arabic text classification. This investigation was carried out using chi-square for feature selection to reduce the dimensionality of the feature space, together with a decision tree classifier. In order to evaluate the performance of the classifier, this study used a corpus of 5070 documents independently classified into six categories (sport, entertainment, business, Middle East, switch and world) with the WEKA toolkit. The recall, f-measure and precision measures are used to compare the performance of the obtained models. The experimental results show that text classification using the root stemmer outperforms classification using Khoja’s stemmer. The f-measure was 92.9% in the sport category and 89.1% in the business category.
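
    Chi-square feature selection, as used above, scores how strongly a term is associated with a class. The sketch below uses the standard 2x2 contingency-table formula; the function name `chi_square`, whitespace tokenisation and the small English toy corpus are assumptions for illustration (the study itself works on stemmed Arabic text in WEKA).

```python
def chi_square(term, target, docs, labels):
    """Chi-square statistic for the association between a term and a class."""
    a = b = c = d = 0
    for text, label in zip(docs, labels):
        has_term = term in text.lower().split()
        in_class = label == target
        if has_term and in_class:
            a += 1      # term present, document in class
        elif has_term:
            b += 1      # term present, document in another class
        elif in_class:
            c += 1      # term absent, document in class
        else:
            d += 1      # term absent, document in another class
    n = a + b + c + d
    denom = (a + c) * (b + d) * (a + b) * (c + d)
    return n * (a * d - c * b) ** 2 / denom if denom else 0.0


docs = ["goal match team", "team wins match",
        "market shares fall", "bank market profit"]
labels = ["sport", "sport", "business", "business"]

score_match = chi_square("match", "sport", docs, labels)
score_goal = chi_square("goal", "sport", docs, labels)
```

    Terms are then ranked by score and only the highest-scoring ones are kept, which is what reduces the dimensionality of the feature space before the decision tree is trained.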

  17. The speech of Agrippa during the War against Jews in Flavius Josephus: notes for a stylistic analysis

    Directory of Open Access Journals (Sweden)

    Miguel Ángel Rodríguez Horrillo

    2016-08-01

    Agrippa’s speech in Josephus’ Jewish War is examined from a stylistic perspective. The speech is an example of literary μελετή in which the historian uses stylistic resources and reminiscences in order to create cultural connections between Classical Greece and the present.

  18. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification

    Directory of Open Access Journals (Sweden)

    Yin Wang

    2016-01-01

    Background. Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Results. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. Conclusions. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

  19. Automatic topic identification of health-related messages in online health community using text classification.

    Science.gov (United States)

    Lu, Yingjie

    2013-01-01

    To facilitate patient involvement in online health communities and help patients obtain the informational and emotional support they need, this paper proposes an approach for automatically identifying the topics of health-related messages in an online health community, thus helping patients reach the messages most relevant to their queries efficiently. A feature-based classification framework was used for automatic topic identification in our study. We first collected the messages related to some predefined topics in an online health community. Then we combined three different types of features (n-gram-based features, domain-specific features and sentiment features) to build four feature sets for health-related text representation. Finally, three different text classification techniques, C4.5, Naïve Bayes and SVM, were adopted to evaluate our topic classification model. By comparing different feature sets and different classification techniques, we found that n-gram-based features, domain-specific features and sentiment features were all effective in distinguishing different types of health-related topics. In addition, a feature reduction technique based on information gain was also effective in improving topic classification performance. In terms of classification techniques, SVM outperformed C4.5 and Naïve Bayes significantly. The experimental results demonstrated that the proposed approach could identify the topics of online health-related messages efficiently.
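
    The information-gain criterion used for feature reduction above can be sketched as follows: the gain of a term is the drop in class entropy once documents are split by whether they contain the term. The function names, whitespace tokenisation and the toy messages and topics are illustrative assumptions, not data from the study.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    if n == 0:
        return 0.0
    return -sum((k / n) * math.log2(k / n) for k in Counter(labels).values())


def information_gain(term, docs, labels):
    """Entropy reduction from splitting documents on presence of `term`."""
    with_term, without_term = [], []
    for text, label in zip(docs, labels):
        if term in text.lower().split():
            with_term.append(label)
        else:
            without_term.append(label)
    n = len(labels)
    return (entropy(labels)
            - len(with_term) / n * entropy(with_term)
            - len(without_term) / n * entropy(without_term))


# Hypothetical messages and topic labels.
messages = ["insulin dose question", "insulin pump advice",
            "feeling anxious today", "anxious about results"]
topics = ["treatment", "treatment", "emotion", "emotion"]

ig_insulin = information_gain("insulin", messages, topics)
ig_question = information_gain("question", messages, topics)
```

    A term such as "insulin", which separates the two toy topics perfectly, receives the maximum gain of one bit, while a term seen in a single message receives much less; low-gain terms are the ones a feature reduction step would drop.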

  20. Sequential Short-Text Classification with Recurrent and Convolutional Neural Networks

    OpenAIRE

    Lee, Ji Young; Dernoncourt, Franck

    2016-01-01

    Recent approaches based on artificial neural networks (ANNs) have shown promising results for short-text classification. However, many short texts occur in sequences (e.g., sentences in a document or utterances in a dialog), and most existing ANN-based systems do not leverage the preceding short texts when classifying a subsequent one. In this work, we present a model based on recurrent neural networks and convolutional neural networks that incorporates the preceding short texts. Our model ac...

  1. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.

    Science.gov (United States)

    Wang, Yin; Li, Rudong; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei

    2016-01-01

    Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

  2. On stylistic automatization of lexical units in various types of contexts

    Directory of Open Access Journals (Sweden)

    В В Зуева

    2009-12-01

    Stylistic automatization of lexical units in various types of contexts is investigated in this article. Following the works of Bohuslav Havránek and other linguists of the Prague Linguistic School, automatization is treated as a contextual narrowing of the meaning of a lexical unit to the level of its complete predictability in situational contexts and the lack of stylistic contradiction with other lexical units in speech.

  3. Enhancing navigation in biomedical databases by community voting and database-driven text classification

    Directory of Open Access Journals (Sweden)

    Guettler Daniel

    2009-10-01

    Background. The breadth of biological databases and their information content continues to increase exponentially. Unfortunately, our ability to query such sources is still often suboptimal. Here, we introduce and apply community voting, database-driven text classification, and visual aids as a means to incorporate distributed expert knowledge, to automatically classify database entries and to efficiently retrieve them. Results. Using a previously developed peptide database as an example, we compared several machine learning algorithms in their ability to classify abstracts of published literature results into categories relevant to peptide research, such as related or not related to cancer, angiogenesis, molecular imaging, etc. Ensembles of bagged decision trees met the requirements of our application best. No other algorithm consistently performed better in comparative testing. Moreover, we show that the algorithm produces meaningful class probability estimates, which can be used to visualize the confidence of automatic classification during the retrieval process. To allow viewing long lists of search results enriched by automatic classifications, we added a dynamic heat map to the web interface. We take advantage of community knowledge by enabling users to cast votes in Web 2.0 style in order to correct automated classification errors, which triggers reclassification of all entries. We used a novel framework in which the database "drives" the entire vote aggregation and reclassification process to increase speed while conserving computational resources and keeping the method scalable. In our experiments, we simulate community voting by adding various levels of noise to nearly perfectly labelled instances, and show that, under such conditions, classification can be improved significantly. Conclusion. Using PepBank as a model database, we show how to build a classification-aided retrieval system that gathers training data from the

  4. Toward an enhanced Arabic text classification using cosine similarity and Latent Semantic

    Directory of Open Access Journals (Sweden)

    Fawaz S. Al-Anzi

    2017-04-01

    Full Text Available Cosine similarity is one of the most popular distance measures in text classification problems. In this paper, we used this important measure to investigate the performance of Arabic language text classification. For textual features, the vector space model (VSM) is generally used as a model to represent textual information as numerical vectors. However, Latent Semantic Indexing (LSI) is a better textual representation technique as it maintains semantic information between the words. Hence, we used the singular value decomposition (SVD) method to extract textual features based on LSI. In our experiments, we conducted a comparison between some of the well-known classification methods such as Naïve Bayes, k-Nearest Neighbors, Neural Network, Random Forest, Support Vector Machine, and classification tree. We used a corpus that contains 4,000 documents of ten topics (400 documents for each topic). The corpus contains 2,127,197 words with about 139,168 unique words. The testing set contains 400 documents, 40 documents for each topic. As a weighting scheme, we used Term Frequency.Inverse Document Frequency (TF.IDF). This study reveals that the classification methods that use LSI features significantly outperform the TF.IDF-based methods. It also reveals that k-Nearest Neighbors (based on cosine measure) and support vector machine are the best performing classifiers.
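
    The TF.IDF weighting and the cosine-based k-Nearest Neighbors classifier named in this abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the LSI/SVD step is omitted, the toy English corpus, the smoothed IDF variant and all function names are assumptions made for illustration.

```python
import math
from collections import Counter

def tfidf_vectors(docs):
    """Return (list of sparse TF-IDF dicts, idf table) for tokenized documents."""
    n = len(docs)
    df = Counter(term for doc in docs for term in set(doc))
    # Assumed IDF variant: log(n / df) + 1, so terms found in every document keep some weight.
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    vectors = [{t: c * idf[t] for t, c in Counter(doc).items()} for doc in docs]
    return vectors, idf

def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    nu = math.sqrt(sum(w * w for w in u.values()))
    nv = math.sqrt(sum(w * w for w in v.values()))
    return dot / (nu * nv) if nu and nv else 0.0

def knn_predict(query, train_vecs, labels, idf, k=3):
    """Label a tokenized query by majority vote among its k most similar training documents."""
    q = {t: c * idf[t] for t, c in Counter(query).items() if t in idf}
    ranked = sorted(zip(train_vecs, labels), key=lambda p: cosine(q, p[0]), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]

# Hypothetical toy corpus with two topics.
train = [
    "goal match team score".split(),
    "team win league match".split(),
    "market stock price bank".split(),
    "bank rate price inflation".split(),
]
labels = ["sport", "sport", "economy", "economy"]
vecs, idf = tfidf_vectors(train)
prediction = knn_predict("team score goal".split(), vecs, labels, idf, k=3)
```

    In the study itself the TF.IDF vectors would first be projected into an LSI space via SVD before similarities are computed; the sketch stops at the raw TF.IDF stage.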

  5. Categorization and Pathology of Persian Stylistic Researches

    OpenAIRE

    Maryam Dorpar

    2014-01-01

    Abstract In the following article, surveys and research on Persian style are categorized into two branches, historical and formalistic. Mohammad Taghi Bahar founded stylistics as an autonomous field of knowledge by publishing his book, History of the evolution of Persian prose (1331), for teaching at the University of Tehran. This book, which was influenced by the verbal instructions of the Qajar dynasty's scholars, paved the way that has generally been followed by researchers in Persian stylistics up ...

  6. DEVELOPMENT OF FOREIGN LANGUAGE STYLISTIC COMPETENCE OF FUTURE PHILOLOGISTS: GRAMMATICAL ASPECT

    Directory of Open Access Journals (Sweden)

    Олена Вовк

    2015-05-01

    Full Text Available The article studies a grammatical aspect of developing stylistic competence of students of linguistic departments. Particularly, the stylistic competence which is defined as a capacity to create adequate utterances under natural conditions of communication according to a concrete situation is characterized. To highlight the importance of acquiring stylistic competence the levels of speech development of an individual are identified and the stages of teaching grammar are differentiated. The approaches to teaching stylistic grammar are characterized within a communicative framework and relevant skills are elucidated. The role of functional styles in teaching a foreign language is clarified. The idea of teaching students to make register shifts and to mix speech registers in the process of acquiring foreign language competence is highlighted. The theoretical principles are illustrated with appropriate examples of exercises.

  7. Stylistic Analysis of the Short Story ‘The Last Word’ by Dr. A. R. Tabassum

    Directory of Open Access Journals (Sweden)

    Abdul Bari Khan

    2015-06-01

    Full Text Available In this article a stylistic analysis of the short story ‘The Last Word’ by Dr. A. R. Tabassum is performed. The formative elements of the story, such as point of view, characters and allegorical element, are discussed in detail so as to give a better insight into the story. The story is analyzed stylistically in terms of figures of speech where grammatical, lexical and phonological schemes are considered, following the checklist of linguistic and stylistic categories proposed by Leech and Short. Features of repetition, parallelism, alliteration, consonance, assonance and rhyme are focused on. Finally, the findings and conclusion are given to sum up the discussion. Keywords: stylistics, analysis, short story, last word, allegory, Tabassum

  8. Enhancing navigation in biomedical databases by community voting and database-driven text classification.

    Science.gov (United States)

    Duchrow, Timo; Shtatland, Timur; Guettler, Daniel; Pivovarov, Misha; Kramer, Stefan; Weissleder, Ralph

    2009-10-03

    The breadth of biological databases and their information content continues to increase exponentially. Unfortunately, our ability to query such sources is still often suboptimal. Here, we introduce and apply community voting, database-driven text classification, and visual aids as a means to incorporate distributed expert knowledge, to automatically classify database entries and to efficiently retrieve them. Using a previously developed peptide database as an example, we compared several machine learning algorithms in their ability to classify abstracts of published literature results into categories relevant to peptide research, such as related or not related to cancer, angiogenesis, molecular imaging, etc. Ensembles of bagged decision trees met the requirements of our application best. No other algorithm consistently performed better in comparative testing. Moreover, we show that the algorithm produces meaningful class probability estimates, which can be used to visualize the confidence of automatic classification during the retrieval process. To allow viewing long lists of search results enriched by automatic classifications, we added a dynamic heat map to the web interface. We take advantage of community knowledge by enabling users to cast votes in Web 2.0 style in order to correct automated classification errors, which triggers reclassification of all entries. We used a novel framework in which the database "drives" the entire vote aggregation and reclassification process to increase speed while conserving computational resources and keeping the method scalable. In our experiments, we simulate community voting by adding various levels of noise to nearly perfectly labelled instances, and show that, under such conditions, classification can be improved significantly. 
    Using PepBank as a model database, we show how to build a classification-aided retrieval system that gathers training data from the community, is completely controlled by the database, scales well
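
    The simulation idea in this abstract (noise added to nearly perfectly labelled instances, then aggregated as community votes) can be sketched roughly as follows. This is only an illustration under assumptions: binary labels, independent voters, a plain majority vote, and hypothetical function names. It does not reproduce the database-driven framework or the bagged decision trees the authors describe.

```python
import random
from collections import Counter

def noisy_votes(true_labels, noise, rng):
    """Simulate one voter: each binary label is flipped with probability `noise`."""
    return [(1 - y) if rng.random() < noise else y for y in true_labels]

def aggregate(vote_lists):
    """Majority vote per entry across all voters."""
    return [Counter(column).most_common(1)[0][0] for column in zip(*vote_lists)]

rng = random.Random(0)                      # fixed seed so the run is repeatable
truth = [1, 0, 1, 1, 0, 0, 1, 0]            # hypothetical "nearly perfect" labels
clean = noisy_votes(truth, 0.0, rng)        # zero noise leaves the labels unchanged
ballots = [noisy_votes(truth, 0.2, rng) for _ in range(5)]
consensus = aggregate(ballots)
agreement = sum(c == t for c, t in zip(consensus, truth)) / len(truth)
```

    Raising the noise level or lowering the number of simulated voters gives a rough feel for how much label noise an aggregation step can absorb before the consensus labels degrade.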

  9. Portable automatic text classification for adverse drug reaction detection via multi-corpus training.

    Science.gov (United States)

    Sarker, Abeed; Gonzalez, Graciela

    2015-02-01

    Automatic detection of adverse drug reaction (ADR) mentions from text has recently received significant interest in pharmacovigilance research. Current research focuses on various sources of text-based information, including social media, where enormous amounts of user-posted data are available, which have the potential for use in pharmacovigilance if collected and filtered accurately. The aims of this study are: (i) to explore natural language processing (NLP) approaches for generating useful features from text, and utilizing them in optimized machine learning algorithms for automatic classification of ADR assertive text segments; (ii) to present two data sets that we prepared for the task of ADR detection from user-posted internet data; and (iii) to investigate if combining training data from distinct corpora can improve automatic classification accuracies. One of our three data sets contains annotated sentences from clinical reports, and the two other data sets, built in-house, consist of annotated posts from social media. Our text classification approach relies on generating a large set of features, representing semantic properties (e.g., sentiment, polarity, and topic), from short text nuggets. Importantly, using our expanded feature sets, we combine training data from different corpora in attempts to boost classification accuracies. Our feature-rich classification approach performs significantly better than previously published approaches with ADR class F-scores of 0.812 (previously reported best: 0.770), 0.538 and 0.678 for the three data sets. Combining training data from multiple compatible corpora further improves the ADR F-scores for the in-house data sets to 0.597 (improvement of 5.9 units) and 0.704 (improvement of 2.6 units) respectively. Our research results indicate that using advanced NLP techniques for generating information rich features from text can significantly improve classification accuracies over existing benchmarks. Our experiments
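
    The F-scores quoted here follow the usual definition from precision and recall, and the "units" of improvement appear to be F-score differences multiplied by 100. The sketch below shows both; the confusion counts in the example are hypothetical, and only the four F-scores are taken from the abstract.

```python
def f_score(tp, fp, fn, beta=1.0):
    """F-beta score from true positives, false positives and false negatives."""
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    if precision == 0.0 and recall == 0.0:
        return 0.0
    b2 = beta * beta
    return (1 + b2) * precision * recall / (b2 * precision + recall)

# Hypothetical counts: precision = recall = 0.8, so F1 = 0.8.
example = f_score(tp=80, fp=20, fn=20)

# Improvements reported for the two in-house data sets, expressed in F-score units (x100).
gain_first = round((0.597 - 0.538) * 100, 1)
gain_second = round((0.704 - 0.678) * 100, 1)
```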

  10. CLASSIFICATION OF TRAFFIC RELATED SHORT TEXTS TO ANALYSE ROAD PROBLEMS IN URBAN AREAS

    Directory of Open Access Journals (Sweden)

    A. M. M. Saldana-Perez

    2017-09-01

    Full Text Available The Volunteer Geographic Information (VGI) can be used to understand the urban dynamics. In the classification of traffic related short texts to analyze road problems in urban areas, a VGI data analysis is done over a social media’s publications, in order to classify traffic events at big cities that modify the movement of vehicles and people through the roads, such as car accidents, traffic and closures. The classification of traffic events described in short texts is done by applying a supervised machine learning algorithm. In the approach users are considered as sensors which describe their surroundings and provide their geographic position at the social network. The posts are treated by a text mining process and classified into five groups. Finally, the classified events are grouped in a data corpus and geo-visualized in the study area, to detect the places with more vehicular problems.

  11. Natural Language Processing Based Instrument for Classification of Free Text Medical Records

    Directory of Open Access Journals (Sweden)

    Manana Khachidze

    2016-01-01

    Full Text Available According to the Ministry of Labor, Health and Social Affairs of Georgia a new health management system has to be introduced in the nearest future. In this context arises the problem of structuring and classifying documents containing all the history of medical services provided. The present work introduces the instrument for classification of medical records based on the Georgian language. It is the first attempt at such classification of Georgian-language medical records. In total, 24,855 examination records have been studied. The documents were classified into three main groups (ultrasonography, endoscopy, and X-ray) and 13 subgroups using two well-known methods: Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). The results obtained demonstrated that both machine learning methods performed successfully, with a slight advantage for SVM. In the process of classification a “shrink” method, based on feature selection, was introduced and applied. At the first stage of classification the results of the “shrink” case were better; however, at the second stage of classification into subclasses 23% of all documents could not be linked to only one definite individual subclass (liver or binary system) due to common features characterizing these subclasses. The overall results of the study were successful.

  12. LEXICO-STYLISTIC CHOICES AND MEDIA IDEOLOGY IN NEWSPAPER REPORTS ON NIGER DELTA CONFLICTS

    Directory of Open Access Journals (Sweden)

    Chuka Fred Ononye

    2017-05-01

    Full Text Available Media reports on Niger Delta (henceforth, ND) conflicts have reflected a relationship between lexico-stylistic choices and media ideologies. The existing media studies on the discourse have predominantly utilised pragmatic, stylistic and discourse analytical tools in presenting and labelling discourse participants and/or their ideologies, but neglected how media ideologies can be revealed through lexico-stylistic choices made in the reports. This paper therefore examines the lexico-stylistic choices in the reports in order to establish their link to specific ideological goals of the newspapers in relaying the conflict news. Forty reports on ND conflicts, published between 2003 and 2007, sampled from two ND-based (The Tide and Pioneer) and two national (The Punch and THISDAY) newspapers, were subjected to stylistic and critical analyses, with insights from structural (relational) semantics and aspects of stylistics discourse. Two broad lexical stylistic choices are identified: paradigmatic (61.8%, indexed by synonymous, antonymous, hyponymous, colloquial, and register items, and coinages) and syntagmatic (38.2%, marked by collocations, metaphors, pleonasms, and lexical fields) features. The features are utilised for three ideological ends; namely, picking out and framing participants as perpetrators of the violence in the discourse, evaluating specific entities and their roles in the conflicts, and reducing the impact of the activities of the news actors. Although there are overlaps, the evaluative ideology is largely associated with the national newspapers, the impact reduction ideology with the ND-based newspapers, while the framist ideology is observed in the two sets of newspapers. With these findings the study has added the lexical stylistics angle to the existing scholarship on ND conflict news discourse. Thus, the newspaper reports on ND conflicts are motivated by their ideological goals to change the reader’s outlook on

  13. Classification of Traffic Related Short Texts to Analyse Road Problems in Urban Areas

    Science.gov (United States)

    Saldana-Perez, A. M. M.; Moreno-Ibarra, M.; Tores-Ruiz, M.

    2017-09-01

    The Volunteer Geographic Information (VGI) can be used to understand the urban dynamics. In the classification of traffic related short texts to analyze road problems in urban areas, a VGI data analysis is done over a social media's publications, in order to classify traffic events at big cities that modify the movement of vehicles and people through the roads, such as car accidents, traffic and closures. The classification of traffic events described in short texts is done by applying a supervised machine learning algorithm. In the approach users are considered as sensors which describe their surroundings and provide their geographic position at the social network. The posts are treated by a text mining process and classified into five groups. Finally, the classified events are grouped in a data corpus and geo-visualized in the study area, to detect the places with more vehicular problems.

  14. Comparisons and Selections of Features and Classifiers for Short Text Classification

    Science.gov (United States)

    Wang, Ye; Zhou, Zhi; Jin, Shan; Liu, Debin; Lu, Mi

    2017-10-01

    Short text is considerably different from traditional long text documents due to its shortness and conciseness, which somehow hinders the applications of conventional machine learning and data mining algorithms in short text classification. According to traditional artificial intelligence methods, we divide short text classification into three steps, namely preprocessing, feature selection and classifier comparison. In this paper, we have illustrated step-by-step how we approach our goals. Specifically, in feature selection, we compared the performance and robustness of the four methods of one-hot encoding, tf-idf weighting, word2vec and paragraph2vec, and in the classification part, we deliberately chose and compared Naive Bayes, Logistic Regression, Support Vector Machine, K-nearest Neighbor and Decision Tree as our classifiers. Then, we compared and analysed the classifiers horizontally with each other and vertically with feature selections. Regarding the datasets, we crawled more than 400,000 short text files from Shanghai and Shenzhen Stock Exchanges and manually labeled them into two classes, the big and the small. There are eight labels in the big class, and 59 labels in the small class.

  15. A CMAC-based scheme for determining membership with classification of text strings.

    Science.gov (United States)

    Ma, Heng; Tseng, Ying-Chih; Chen, Lu-I

    Membership determination of text strings has been an important procedure for analyzing textual data of a tremendous amount, especially when time is a crucial factor. Bloom filter has been a well-known approach for dealing with such a problem because of its succinct structure and simple determination procedure. As determination of membership with classification is becoming increasingly desirable, parallel Bloom filters are often implemented for facilitating the additional classification requirement. The parallel Bloom filters, however, tend to produce additional false-positive errors since membership determination must be performed on each of the parallel layers. We propose a scheme based on CMAC, a neural network mapping, which only requires a single-layer calculation to simultaneously obtain information of both the membership and classification. A hash function specifically designed for text strings is also proposed. The proposed scheme could effectively reduce false-positive errors by converging the range of membership acceptance to the minimum for each class during the neural network mapping. Simulation results show that the proposed scheme committed significantly fewer errors than the benchmark, parallel Bloom filters, with limited and identical memory usage at different classification levels.
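
    The benchmark mentioned in this abstract, parallel Bloom filters with one filter per class, can be sketched as below. This illustrates the baseline only, not the proposed CMAC scheme or its purpose-built hash function; the filter size, the number of hashes, the SHA-256-based hashing and the class names are all assumptions.

```python
import hashlib

class BloomFilter:
    """Minimal Bloom filter: no false negatives, occasional false positives."""

    def __init__(self, n_bits=1024, n_hashes=3):
        self.n_bits = n_bits
        self.n_hashes = n_hashes
        self.bits = bytearray(n_bits)  # one byte per bit, kept simple for readability

    def _positions(self, item):
        # Derive n_hashes positions by salting a SHA-256 digest with a seed.
        for seed in range(self.n_hashes):
            digest = hashlib.sha256(f"{seed}:{item}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.n_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos] = 1

    def __contains__(self, item):
        return all(self.bits[pos] for pos in self._positions(item))

def classify(item, filters):
    """Query every parallel layer; each layer can contribute its own false positive."""
    return [name for name, bf in filters.items() if item in bf]

# Hypothetical classes and members.
filters = {"fruit": BloomFilter(), "metal": BloomFilter()}
for word in ("apple", "pear"):
    filters["fruit"].add(word)
for word in ("iron", "zinc"):
    filters["metal"].add(word)
hits = classify("apple", filters)
```

    Because every class has its own filter, a query touches all layers, which is the source of the accumulated false positives the abstract refers to.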

  16. Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models

    Directory of Open Access Journals (Sweden)

    Jin Dai

    2014-01-01

    Full Text Available The similarity between objects is the core research area of data mining. In order to reduce the interference of the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification research. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently accomplish conversion between qualitative concept and quantitative data. Through the conversion from text set to text information table based on the VSM model, the text qualitative concept, which is extracted from the same category, is jumped up as a whole category concept. According to the cloud similarity between the test text and each category concept, the test text is assigned to the most similar category. A comparison among different text classifiers on different feature selection sets shows that CCJU-TC not only adapts well to different text features but also achieves better classification performance than the traditional classifiers.

  17. Texting

    Science.gov (United States)

    Tilley, Carol L.

    2009-01-01

    With the increasing ranks of cell phone ownership is an increase in text messaging, or texting. During 2008, more than 2.5 trillion text messages were sent worldwide--that's an average of more than 400 messages for every person on the planet. Although many of the messages teenagers text each day are perhaps nothing more than "how r u?" or "c u…

  18. Evaluation and Classification of Syntax Usage in Determining Short-Text Semantic Similarity

    Directory of Open Access Journals (Sweden)

    V. Batanović

    2014-06-01

    Full Text Available This paper outlines and categorizes ways of using syntactic information in a number of algorithms for determining the semantic similarity of short texts. We consider the use of word order information, part-of-speech tagging, parsing and semantic role labeling. We analyze and evaluate the effects of syntax usage on algorithm performance by utilizing the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. We also propose a new classification of algorithms based on their applicability to languages with scarce natural language processing tools.

  19. Lexico-stylistic peculiarities of the English blog discourse (on the material of the author’s blog)

    Directory of Open Access Journals (Sweden)

    Morgunova Marina Nikolaevna

    2015-06-01

    Full Text Available The article studies the lexical-stylistic features of English-language blogging discourse. The analysis was conducted on the material of authors' blog texts by well-known media people, journalists and bloggers on various issues of the environmental, social, cultural, political and economic spheres of society, selected from the most discussed topics of the last two years (2014-2015). A comparative analysis of the blog with other forms of Internet communication (email, chat, twitter) is carried out. It is argued that blogs have a socio-transforming, democratizing potential and offer a special way of self-expression, self-improvement and personal statement of uniqueness. Extralinguistic features of the blog largely dictate its linguistic and stylistic characteristics. It is argued that the author’s blog is a kind of creative space for the personal development of the author’s ideas, which determines the structure, design and content of the blog, as well as the linguistic means and stylistic devices that most clearly reflect the purpose and theme of the blog. The selection of the lexical and stylistic means of the blog is determined by a combination of various general and specific determinants. The study reveals the use of Internet language, professionally-oriented vocabulary of various functional and stylistic status, and stylistically marked vocabulary. Among the stylistic figures, the most common are metaphor, simile, hyperbole and epithet.

  20. Toward a multi-sensor-based approach to automatic text classification

    Energy Technology Data Exchange (ETDEWEB)

    Dasigi, V.R. [Sacred Heart Univ., Fairfield, CT (United States)]; Mann, R.C. [Oak Ridge National Lab., TN (United States)]

    1995-10-01

    Many automatic text indexing and retrieval methods use a term-document matrix that is automatically derived from the text in question. Latent Semantic Indexing is a method, recently proposed in the Information Retrieval (IR) literature, for approximating a large and sparse term-document matrix with a relatively small number of factors, and is based on a solid mathematical foundation. LSI appears to be quite useful in the problem of text information retrieval, rather than text classification. In this report, we outline a method that attempts to combine the strength of the LSI method with that of neural networks, in addressing the problem of text classification. In doing so, we also indicate ways to improve performance by adding "logical sensors" to the neural network, something that is hard to do with the LSI method when employed by itself. The various programs that can be used in testing the system with TIPSTER data set are described. Preliminary results are summarized, but much work remains to be done.

  1. Stylistics and the Metaphysics of Poetry

    Science.gov (United States)

    Anderson, Neil

    2007-01-01

    In order to better understand the worth of aesthetic experience in encountering poetry, fresh perspectives are helpful. This paper introduces the reader to modern stylistics: that is linguistic examinations of "the speaker's meaning" in literature and notes such "scientific" approaches to poetry do find common metaphysical ground with leading…

  2. Scene text detection via extremal region based double threshold convolutional network classification.

    Directory of Open Access Journals (Sweden)

    Wei Zhu

    Full Text Available In this paper, we present a robust text detection approach in natural images which is based on region proposal mechanism. A powerful low-level detector named saliency enhanced-MSER extended from the widely-used MSER is proposed by incorporating saliency detection methods, which ensures a high recall rate. Given a natural image, character candidates are extracted from three channels in a perception-based illumination invariant color space by saliency-enhanced MSER algorithm. A discriminative convolutional neural network (CNN) is jointly trained with multi-level information including pixel-level and character-level information as character candidate classifier. Each image patch is classified as strong text, weak text and non-text by double threshold filtering instead of conventional one-step classification, leveraging confident scores obtained via CNN. To further prune non-text regions, we develop a recursive neighborhood search algorithm to track credible texts from weak text set. Finally, characters are grouped into text lines using heuristic features such as spatial location, size, color, and stroke width. We compare our approach with several state-of-the-art methods, and experiments show that our method achieves competitive performance on public datasets ICDAR 2011 and ICDAR 2013.

  3. Scene text detection via extremal region based double threshold convolutional network classification.

    Science.gov (United States)

    Zhu, Wei; Lou, Jing; Chen, Longtao; Xia, Qingyuan; Ren, Mingwu

    2017-01-01

    In this paper, we present a robust text detection approach in natural images which is based on region proposal mechanism. A powerful low-level detector named saliency enhanced-MSER extended from the widely-used MSER is proposed by incorporating saliency detection methods, which ensures a high recall rate. Given a natural image, character candidates are extracted from three channels in a perception-based illumination invariant color space by saliency-enhanced MSER algorithm. A discriminative convolutional neural network (CNN) is jointly trained with multi-level information including pixel-level and character-level information as character candidate classifier. Each image patch is classified as strong text, weak text and non-text by double threshold filtering instead of conventional one-step classification, leveraging confident scores obtained via CNN. To further prune non-text regions, we develop a recursive neighborhood search algorithm to track credible texts from weak text set. Finally, characters are grouped into text lines using heuristic features such as spatial location, size, color, and stroke width. We compare our approach with several state-of-the-art methods, and experiments show that our method achieves competitive performance on public datasets ICDAR 2011 and ICDAR 2013.
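
    The double threshold filtering and neighborhood search described in this abstract resemble hysteresis thresholding, and a rough sketch under that reading is given below. The threshold values, the distance-based neighborhood test, the search radius and the function names are assumptions; the authors' actual criteria for tracking weak text are not stated in the abstract.

```python
from collections import deque

def double_threshold(scores, low=0.3, high=0.8):
    """Split candidate indices into strong (>= high) and weak (low <= s < high); the rest is non-text."""
    strong = [i for i, s in enumerate(scores) if s >= high]
    weak = [i for i, s in enumerate(scores) if low <= s < high]
    return strong, weak

def track_weak(centers, strong, weak, radius=1.5):
    """Keep weak candidates reachable from a strong one through chains of nearby neighbours."""
    accepted = set(strong)
    pending = set(weak)
    queue = deque(strong)
    while queue:
        i = queue.popleft()
        xi, yi = centers[i]
        near = [j for j in pending
                if (centers[j][0] - xi) ** 2 + (centers[j][1] - yi) ** 2 <= radius ** 2]
        for j in near:
            pending.discard(j)
            accepted.add(j)
            queue.append(j)  # a promoted weak candidate can promote its own neighbours
    return sorted(accepted)

# Hypothetical classifier scores and patch centres for five character candidates.
scores = [0.95, 0.50, 0.45, 0.10, 0.55]
centers = [(0, 0), (1, 0), (2, 0), (3, 0), (10, 0)]
strong, weak = double_threshold(scores)
kept = track_weak(centers, strong, weak)
```

    In this toy run candidates 1 and 2 are kept because they chain back to the strong candidate 0, candidate 3 is rejected outright, and candidate 4 is dropped because no accepted candidate lies within the radius. The queue-based loop is an iterative stand-in for the recursive search.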

  4. A Stylistic Analysis of Complexity in William Faulkner's "A Rose for Emily"

    Science.gov (United States)

    Abdurrahman, Israa' Burhanuddin

    2016-01-01

    Applying a stylistic analysis on certain texts refers to the identification of patterns of usage in writing. However, such an analysis is not restricted just to the description of the formal characteristics of texts, but it also tries to elucidate their functional importance for the interpretation of the text. This paper highlights complexity as a…

  5. Stylistic Analysis of Roald Dahl’s Cinderella

    Directory of Open Access Journals (Sweden)

    Henni Henni

    2010-01-01

    Full Text Available The paper presents a stylistic analysis of a rhyme, Cinderella, taken from Dahl’s rhyme collection, Revolting Rhymes. Roald Dahl is famous for his ability in creating extraordinary stories, in which linguistic elements, such as sounds and words, are manipulated to create an amusing story that has an unpredictable plot. The discussion covers an analysis of the narrative structure and the linguistic style applied in the rhyme, together with the discussion of the author’s purpose of applying such style. From the analysis it is found out that the style Dahl applies in the rhyme is especially useful for foregrounding.

  6. A linguistic stylistic study of Wole Soyinka's Night and Death in the ...

    African Journals Online (AJOL)

    Linguistic stylistics explores the linguistic features of a text; it is primarily concerned with the use of language and its effect in a text. This study is aimed at analyzing the language structure/system of Wole Soyinka's 'Night' and 'Death in the Dawn' to render a linguistic description, that is, identifying the linguistic deviant ...

  7. A Feature Selection Method Based on Fisher's Discriminant Ratio for Text Sentiment Classification

    Science.gov (United States)

    Wang, Suge; Li, Deyu; Wei, Yingjie; Li, Hongxia

    With the rapid growth of e-commerce, product reviews on the Web have become an important information source for customers' decision making when they intend to buy some product. As the reviews are often too many for customers to go through, how to automatically classify them into different sentiment orientation categories (i.e. positive/negative) has become a research problem. In this paper, based on Fisher's discriminant ratio, an effective feature selection method is proposed for product review text sentiment classification. To validate the proposed method, we compared it with other methods based on information gain and mutual information respectively, with support vector machine adopted as the classifier. In this paper, 6 subexperiments are conducted by combining different feature selection methods with 2 kinds of candidate feature sets. On 1006 review documents of cars, the experimental results indicate that the Fisher's discriminant ratio based on word frequency estimation has the best performance with F value 83.3% while the candidate features are the words which appear in both positive and negative texts.
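
    A common textbook form of Fisher's discriminant ratio for a single feature is the squared difference of the class means divided by the sum of the class variances. The sketch below ranks terms with that form, using per-document word counts as the feature values. Whether this matches the exact estimator used in the paper is an assumption, and the toy reviews and function names are hypothetical.

```python
def fisher_ratio(pos_values, neg_values):
    """(mean_pos - mean_neg)^2 / (var_pos + var_neg) for one feature."""
    def mean(xs):
        return sum(xs) / len(xs)

    def variance(xs, m):
        return sum((x - m) ** 2 for x in xs) / len(xs)

    mp, mn = mean(pos_values), mean(neg_values)
    denom = variance(pos_values, mp) + variance(neg_values, mn)
    if denom == 0.0:
        # Zero variance in both classes: perfectly separating if the means differ.
        return float("inf") if mp != mn else 0.0
    return (mp - mn) ** 2 / denom

def rank_features(pos_docs, neg_docs, vocabulary):
    """Order terms from most to least discriminative by their Fisher ratio."""
    scores = {}
    for term in vocabulary:
        pos = [doc.count(term) for doc in pos_docs]
        neg = [doc.count(term) for doc in neg_docs]
        scores[term] = fisher_ratio(pos, neg)
    return sorted(vocabulary, key=lambda t: scores[t], reverse=True)

# Hypothetical tokenized car reviews.
pos_docs = [["great", "car"], ["great", "engine"], ["great", "car"]]
neg_docs = [["poor", "car"], ["poor", "engine"], ["poor", "car"]]
ranked = rank_features(pos_docs, neg_docs, ["car", "great", "poor", "engine"])
```

    Terms such as "car" that occur equally often in both classes score zero and fall to the bottom of the ranking, while sentiment-bearing terms rise to the top.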

  8. Analysis of Influence of Different Relations Types on the Quality of Thesaurus Application to Text Classification Problems

    Directory of Open Access Journals (Sweden)

    Nadezhda S. Lagutina

    2017-01-01

    Full Text Available The main purpose of the article is to analyze how effectively different types of thesaurus relations can be used for solutions of text classification tasks. The basis of the study is an automatically generated thesaurus of a subject area that contains three types of relations: synonymous, hierarchical and associative. To generate the thesaurus the authors use a hybrid method based on several linguistic and statistical algorithms for extraction of semantic relations. The method makes it possible to create a thesaurus with a sufficiently large number of terms and relations among them. The authors consider two problems: topical text classification and sentiment classification of large newspaper articles. To solve them, the authors developed two approaches that complement standard algorithms with a procedure that takes into account thesaurus relations to determine semantic features of texts. The approach to topical classification includes the standard unsupervised BM25 algorithm and a procedure that takes into account synonymous and hierarchical relations of the thesaurus of the subject area. The approach to sentiment classification consists of two steps. At the first step, a thesaurus is created, whose term weight polarities are calculated depending on the term occurrences in the training set or on the weights of related thesaurus terms. At the second step, the thesaurus is used to compute the features of words from texts and to classify texts by the SVM or Naive Bayes algorithm. In experiments with the text corpora BBCSport, Reuters, PubMed and the corpus of articles about American immigrants, the authors varied the types of thesaurus relations that are involved in the classification and the degree of their use. The results of the experiments make it possible to evaluate the efficiency of the application of thesaurus relations for classification of raw texts and to determine under what conditions certain relations have more or less effect.
In particular, the
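
    The topical classification approach in this abstract combines BM25 with thesaurus relations. A minimal sketch of that combination follows, assuming the standard Okapi BM25 formula with a non-negative IDF variant, a hypothetical synonym table standing in for the thesaurus, and topic assignment by highest score. The authors' actual procedure, including hierarchical relations, is not reproduced.

```python
import math
from collections import Counter

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Okapi BM25 score of every tokenized document against one query."""
    n = len(docs)
    avgdl = sum(len(d) for d in docs) / n
    df = Counter(t for d in docs for t in set(d))
    scores = []
    for d in docs:
        tf = Counter(d)
        s = 0.0
        for t in set(query):
            if t not in tf:
                continue
            # Assumed IDF variant: log((N - n_t + 0.5) / (n_t + 0.5) + 1), always positive.
            idf = math.log((n - df[t] + 0.5) / (df[t] + 0.5) + 1.0)
            s += idf * tf[t] * (k1 + 1) / (tf[t] + k1 * (1 - b + b * len(d) / avgdl))
        scores.append(s)
    return scores

def expand(terms, synonyms):
    """Add thesaurus synonyms to a topic's seed terms."""
    out = list(terms)
    for t in terms:
        out.extend(synonyms.get(t, []))
    return out

# Hypothetical documents, topic seed terms and synonym relations.
docs = [
    "striker scores late goal".split(),
    "central bank raises rates".split(),
    "keeper saves penalty".split(),
]
topics = {"sport": ["goal", "penalty"], "economy": ["bank"]}
synonyms = {"goal": ["scores"], "bank": ["rates"]}

per_topic = {name: bm25_scores(expand(terms, synonyms), docs) for name, terms in topics.items()}
assigned = [max(per_topic, key=lambda name: per_topic[name][i]) for i in range(len(docs))]
```

    Varying which relation types feed `expand` is one simple way to mimic the kind of comparison the abstract describes.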

  9. The Stylistics of Selected American, Italian and Polish Challenge Vlogs

    Directory of Open Access Journals (Sweden)

    Ryszard Kurpiel

    2017-06-01

    Full Text Available The emergence of a new mode of communication called vlogging allows the spoken variety of language to manifest itself in new settings. Vloggers appear to draw both from traditional television aesthetics and from the videos of other vloggers. The content which circulates on the website globally is often recontextualised (cf. Rymes, 2012). The plethora of different types of vlogs and the sheer number of videos created daily call for an attempt to find a common stylistic denominator within specific types of vlogs. The chapter discusses characteristic stylistic features of a selected type of such videos, namely, challenge videos in American English, Italian and Polish. The chosen videos form the basis for an analysis in search of similarities and differences in style. The findings suggest that the analysed challenge videos have many common features, and if differences ensue, they are not generally found within specific national or linguistic groups but in mixed sets of videos in the three languages of interest. The author seeks to find explanations for the similarities.

  10. Mathematics Subject Classification (MSC 2010): guia ràpida [text]. Curs 2017-18

    OpenAIRE

    Universitat de Barcelona. CRAI

    2017-01-01

    The Mathematics Subject Classification 2010 (MSC 2010) is a revision of the Mathematics Subject Classification 2000 (MSC 2000), the result of a collaboration between the editors of MathSciNet and Zentralblatt MATH to update the classification they share.

  11. A Socio-Stylistic Analysis of Some Selected Bette Dirges | Ashipu ...

    African Journals Online (AJOL)

    A Socio-Stylistic Analysis of Some Selected Bette Dirges. ... investigation has shown that both the language and its literature are encapsulated in the socio-linguistic activities of the folk.

  12. Stylistics of the Russian language as a study discipline at a non-linguistic university

    Directory of Open Access Journals (Sweden)

    О Н Лапшина

    2008-03-01

    Full Text Available The article highlights basic aspects of Russian stylistics as an integral part of linguistic training of university students of non-linguistic specialties. It also underlines the significance of this academic discipline, its role and place in the system of communicative and professional demands of future specialists in the field of service.

  13. A Stylistic Analysis of Complexity in William Faulkner's "A Rose for Emily"

    Directory of Open Access Journals (Sweden)

    Israa' Burhanuddin Abdurrahman

    2016-08-01

    Full Text Available Applying a stylistic analysis to certain texts refers to the identification of patterns of usage in writing. However, such an analysis is not restricted just to the description of the formal characteristics of texts; it also tries to elucidate their functional importance for the interpretation of the text. This paper highlights complexity as a hallmark of a stylistic analysis in A Rose for Emily, a short story by William Faulkner (1897-1962). The analysis is done by adopting Halliday's (1985) approach to analyzing complexity in sentence structure, and Lauer et al.'s (2008) approach to analyzing narrative from a macro perspective in relation to the story acts. The analysis rests upon the assumption that since form conveys meaning, Faulkner's multilayer usage of complexity is extremely functional. This paper also tries to show that stylistic complexity is manipulated to convey the main themes and events, and that it helps identify the distinctive structure of this story. Keywords: Style, Stylistic Complexity, Hypotactic, Paratactic, Functional

  14. Stylistics of “Tarikh i Balami”

    Directory of Open Access Journals (Sweden)

    Ali Mohammadi asiyabadi

    2016-05-01

    Full Text Available Abstract "Tarikh i Balami", written by Abu Ali Mohammad Ibn Abdollah Balami, a minister of the Samanis in the fourth century, is one of the oldest books of the Islamic period; it covers history from the first step of creation to the Arab raid, including the life of the Prophet and of the kings. Amir Mansur ibn Noah Samani ordered his minister, Balami, to translate Tabari's book “Tarikh al-Omam va al-Muluk”. In his translation, however, Balami on several occasions summarized the original book, drew on other sources and omitted some material. A comparison of eleven versions of the book shows many differences among them; no two are alike. The copyists' replacement of older words with newer ones, and of Arabic words with Persian words of the same meaning, is one of the problems they have caused for the stylistics of this book. "Tarikh i Balami" belongs to the first period of Persian prose in Iran. During this time, which spans the Samani and Tahiri periods, authors wrote in a simple style. This style of writing is also called the Khurasani style, because the authors lived in Khorasan, and the Balami style, after the author of the book, who was one of the most prominent writers of this genre. A comparison between this book and other books of the period, such as mughaddamih Shahnameh abu Mansuri, the translation of Tafsir i Tabari, Hudud al-alam min al-Mashriq ela al-Maqrib, Tafsir i Pak and others, shows that its most important stylistic features at the level of language fall into several areas. One is that the author tried to use Persian words, although shorter and more familiar Arabic words were sometimes substituted. Another is that some verbs are used in specific ways; for example, the old prefixes “فرا”, “فرو”, “باز”,… are used with verbs.

  15. Detecting Family Resemblance: Automated Genre Classification

    Directory of Open Access Journals (Sweden)

    Yunhyong Kim

    2007-03-01

    Full Text Available This paper presents results in automated genre classification of digital documents in PDF format. It describes genre classification as an important ingredient in contextualising scientific data and in retrieving targeted material for improving research. The current paper compares the role of visual layout, stylistic features, and language model features in clustering documents and presents results in retrieving five selected genres (Scientific Article, Thesis, Periodicals, Business Report, and Form) from a pool of materials populated with documents of the nineteen most popular genres found in our experimental data set.

  16. Anecdote as a Persuasive Stylistic Technique in Pusonnam Yiri's ...

    African Journals Online (AJOL)

    The paper explores anecdotes as a stylistic technique in the language of persuasion using samples from Yiri's Blindness of the Mind (2011). It attempts a textual analysis of some extracts from the novella using a Functional Stylistics Framework of M. A. K. Halliday. It analyses the writer's ability to manipulate situations with ...

  17. Stylistic Variation In Three English Translations Of The Dead Sea ...

    African Journals Online (AJOL)

    Since the discovery of the Dead Sea Scrolls in 1947 different English translations were published. In this article the stylistic variation of three of these translations are analysed. It is suggested that the issue of stylistic variation boils down to linguistically inscribed preference in the choice and construction of discourses in the ...

  18. Chiasmus as a Stylistic Device in Donne's and Vaughan's Poetry

    Science.gov (United States)

    I'jam, Dunya Muhammad Miqdad; Fadhil, Zahraa Adnan

    2016-01-01

    This study investigates chiasmus as a stylistic device in ten metaphysical poems (five for John Donne and five for Henry Vaughan). It aims at showing how both, Donne and Vaughan, utilize chiasmus at the different linguistic levels as a stylistic device in their poetry. Thus, to achieve this aim, it is hypothesized that chiasmus as used by Donne…

  19. Stylistic Patterns in Language Teaching Research Articles: A Multidimensional Analysis

    Science.gov (United States)

    Kitjaroenpaiboon, Woravit; Getkham, Kanyarat

    2016-01-01

    This paper presents the results of a multidimensional analysis to investigate stylistic patterns and their communicative functions in language teaching research articles. The findings were that language teaching research articles contained six stylistic patterns and communicative functions. Pattern I consisted of seven salient positive features…

  20. John McGahern's stylistic and narratological art

    DEFF Research Database (Denmark)

    Nørgaard, Nina; Busse, Beatrix; McIntyre, Dan

    2010-01-01

    The Irish writer John McGahern is acknowledged as a master stylist in both his novels and his short stories. In this article we discuss the stylistic and narratological factors that contribute to the artistry in McGahern’s short story writing. We analyse three stories: “A Slip-up”, “All Sorts...

  1. The Stylist: A Pascal Program for Analyzing Prose Style

    Science.gov (United States)

    1987-06-01

    Fiction was represented by long passages from two of my novels and 1000-word excerpts from the novels of Ernest Hemingway, Raymond Chandler and Kurt...Stylist. Thankfully, The Stylist was able to distinguish between the novels of Ernest Hemingway and the ramblings of computer science graduates. The

  2. Stylistic analysis of the poem "Ambulance" by Philip Larkin

    Directory of Open Access Journals (Sweden)

    Muhammad Yasir Khan

    2016-09-01

    Full Text Available Philip Larkin is known as a Movement poet because many of his poems were published in New Lines, the book of the poetic movement. This movement was started in 1950 as a reaction against the unreasonable, inflated and extortionate romanticism of the 1930s and 1940s. The common features of Movement poetry are: the use of irony and wit, an element of satire, the loneliness of modern man, a pessimistic outlook on life, realism, the use of ordinary diction, the use of symbols, and the loss of religious faith. Robert Conquest is known as the representative poet of this movement. The poets of this movement believed that the language of poetry should be simple, lucid and intelligible. The poem Ambulance displays the important features of Movement poetry. Through a stylistic study of this poem the researchers show the loneliness of modern man and his concept of death.

  3. IMPACT OF STYLISTIC FEATURES, ARCHITECTURAL AND URBAN RULES OF THE ALGIERS ARCHITECTURAL HERITAGE DATING BETWEEN 1830 AND 1930 ON THE STRENGTH OF ITS BUILDINGS DURING THE EARTHQUAKE

    Directory of Open Access Journals (Sweden)

    M. A. Souami

    2013-07-01

    Full Text Available In another work, we highlighted, from a theoretical point of view, that there is a relation between earthquake-resistant architectural design codes and the urban and stylistic characteristics of buildings and urban forms of the Algiers architectural heritage dating between 1830 and 1930. Following this, we hypothesized that its various stylistic and urban characteristics have a direct impact on the resilience of buildings to earthquakes. The purpose of this article is to test the validity of this hypothesis through computer simulation of examples of some stylistic and urban characteristics.

  4. Corpus stylistics: speech, writing and thought presentation in a corpus of English writing.

    OpenAIRE

    Semino, Elena; Short, M. H.

    2004-01-01

    This book represents a new direction at the interface between the fields of stylistics and corpus linguistics, namely the use of a corpus methodology to investigate the ways in which people's words and thoughts are presented in written narratives. A 260,000 word electronic corpus of late 20th century written texts, including fiction, news reports and (auto)biographies is analysed by the authors, providing a detailed account of new theoretical insights, comparisons between different text types...

  5. O stylist como bricoleur no design de moda

    OpenAIRE

    Mostaro, Marcelo Lopardi

    2012-01-01

    The present study addresses the craft of the fashion stylist and how it correlates design and fashion. To that end, we single out some relations between design and fashion and address how both relate to the fashion stylist. For our scope, we focus on the professional who practises styling for fashion editorials. The creative diversity proposed by the stylist leads to a striking construction of looks, providing non-verbal readings. We use the concept of the Bricoleur, from the...

  6. A Stylistic Analysis of D.H. Lawrence’s ‘Sons and Lovers’

    Directory of Open Access Journals (Sweden)

    Nozar Niazi

    2013-05-01

    Full Text Available This paper aims at analyzing D.H. Lawrence’s ‘Sons and Lovers’ using a stylistic approach. Stylistics is a study of the amalgamation of form with content. The stylistic analysis of a novel goes beyond the traditional, intuitive interpretation, because it combines intuition and detailed linguistic analysis of the text. The defining elements of modern language are within the text itself, not prescribed from outside. With modernist texts, usually understanding comes from close study of the language system defined within the text itself. Form, technique and style are considered not as a mere vehicle of the content of the story, but an integral part of the work’s meaning and value. In our analysis of ‘Sons and Lovers’ the resources of language: lexis, syntax, phonology, figurative language, cohesion and coherence, are discussed in relation to the style of discourse in order to explore hidden meanings in the text. The resources of language are shown to be an essential part of the meaning of the novel.

  7. Natural language text classification and filtering with trigrams and evolutionary nearest neighbour classifiers

    NARCIS (Netherlands)

    W.B. Langdon

    2000-01-01

    N-grams offer fast, language-independent multi-class text categorization. Text is reduced in a single pass to n-gram vectors. These are assigned to one of several classes by a) nearest neighbour (KNN) and b) a genetic algorithm operating on weights in a nearest neighbour classifier. 91
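
    The first variant named in this record, reducing text to trigram vectors and assigning a class by nearest neighbour, might look roughly like the sketch below. It covers only the plain nearest-neighbour case, not the genetic algorithm that tunes weights. Character trigrams, cosine similarity as the distance measure, and the names `trigram_vector`, `cosine` and `knn_classify` are assumptions made for illustration; the record does not specify them.

```python
import math
from collections import Counter


def trigram_vector(text):
    """Reduce text to a bag of character trigrams in a single pass."""
    text = text.lower()
    return Counter(text[i:i + 3] for i in range(len(text) - 2))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_classify(text, training, k=1):
    """Label `text` by majority vote among its k most similar training texts."""
    query = trigram_vector(text)
    ranked = sorted(
        training,
        key=lambda item: cosine(query, trigram_vector(item[0])),
        reverse=True,
    )
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Toy training set, invented for illustration only.
training = [
    ("the cat sat on the mat", "en"),
    ("le chat est sur le tapis", "fr"),
]
print(knn_classify("the dog sat on the rug", training))
```

    Because the features are raw character sequences, the same code works without a tokenizer or a language-specific word list, which is the sense in which such methods are language independent.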

  8. A (FORENSIC) STYLISTIC ANALYSIS OF ADVERBIALS OF ATTITUDE AND EMPHASIS IN SUPREME COURT DECISIONS IN PHILIPPINE ENGLISH

    Directory of Open Access Journals (Sweden)

    Hjalmar Punla Hernandez

    2017-09-01

    Full Text Available Stylistics today has developed into a multiplicity of branches, one of which is forensic stylistics. As a powerful form of legal written discourse, Supreme Court decisions are a rich corpus in which the linguistic and stylistic choices of Court justices can be examined. This study is a humble attempt at stylistically analyzing Supreme Court decisions in Philippine English (PhE) drafted by two Filipino justices. Specifically, it sought to investigate the classes, placements, and environments of adverbials of attitude and emphasis employed by the two justices, and drew implications for teaching and learning English for Legal Purposes (ELP). Using the frameworks of McMenamin (2012), Quirk, Greenbaum, Leech, and Svartvik (1985), and Dita (2011), 54 randomly selected Supreme Court decisions were analyzed as primary sources of legal language. The results are as follows. Firstly, the classes of adverbials of attitude used by the two justices in Supreme Court decisions in PhE were evaluation of the subject of the clause, judgment of the whole clause, and evaluation of an action performed by the subject of the clause, while the adverbials of emphasis were adverbials of conviction and doubt. Secondly, both kinds of adverbials were placed frequently in medial and less often in initial position in their sentences. Thirdly, the two justices put their adverbials within two principal environments, i.e. within functor and before/after the verb, among others. In these regards, legal and stylistic explanations for these recurrent linguistic features in the two justices’ Court decisions were revealed. Implications of the study for ELP are explained. Lastly, trajectories for future (forensic) stylistic analyses are recommended.

  9. Recurrent Neural Network for Text Classification with Multi-Task Learning

    OpenAIRE

    Liu, Pengfei; Qiu, Xipeng; Huang, Xuanjing

    2016-01-01

    Neural network based methods have obtained great progress on a variety of natural language processing tasks. However, in most previous works, the models are learned based on single-task supervised objectives, which often suffer from insufficient training data. In this paper, we use the multi-task learning framework to jointly learn across multiple related tasks. Based on recurrent neural network, we propose three different mechanisms of sharing information to model text with task-specific and...

  10. Facility Detection and Popularity Assessment from Text Classification of Social Media and Crowdsourced Data

    Energy Technology Data Exchange (ETDEWEB)

    Sparks, Kevin A [ORNL; Li, Roger G [ORNL; Thakur, Gautam [ORNL; Stewart, Robert N [ORNL; Urban, Marie L [ORNL

    2016-01-01

    Advances in technology have continually advanced our understanding of where people are, how they use the environment around them, and why they are at their current location. Having a better knowledge of when various locations become popular through space and time could have large impacts on research fields like urban dynamics and energy consumption. In this paper, we discuss the ability to identify and locate various facility types (e.g. restaurant, airport, stadiums) using social media, and assess methods in determining when these facilities become popular over time. We use natural language processing tools and machine learning classifiers to interpret geotagged Twitter text and determine if a user is seemingly at a location of interest when the tweet was sent. On average our classifiers are approximately 85% accurate varying across multiple facility types, with a peak precision of 98%. By using these methods to classify unstructured text, geotagged social media data can be an extremely useful tool for better understanding the composition of places and how and when people use them.
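
    The record reports both an accuracy and a precision figure, and the two measure different things. As a small worked illustration (not the authors' evaluation code, and with invented labels), accuracy is the share of all predictions that are correct, while precision is the share of positive predictions ("user is at the facility") that are correct. The function name `accuracy_and_precision` is hypothetical.

```python
def accuracy_and_precision(y_true, y_pred, positive=True):
    """Return (accuracy, precision) for paired true and predicted labels."""
    pairs = list(zip(y_true, y_pred))
    correct = sum(t == p for t, p in pairs)
    predicted_pos = sum(p == positive for _, p in pairs)
    true_pos = sum(t == positive and p == positive for t, p in pairs)
    accuracy = correct / len(pairs)
    # Precision is undefined with no positive predictions; report 0.0 here.
    precision = true_pos / predicted_pos if predicted_pos else 0.0
    return accuracy, precision


# Invented labels: True means "the tweet was sent from the facility".
y_true = [True, True, False, False]
y_pred = [True, False, False, False]
print(accuracy_and_precision(y_true, y_pred))  # (0.75, 1.0)
```

    A classifier that predicts the positive class rarely but reliably can therefore show a precision well above its overall accuracy.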

  11. Beyond the Law of Transitivity: A Functional Stylistic Study of Maya Angelou's I Know Why the Caged Bird Sings

    Directory of Open Access Journals (Sweden)

    Muthanna Makki Muhammed

    2017-03-01

    Full Text Available The dominant critical focus on Maya Angelou’s writings has been on the thematic features of her texts. Linguistic and stylistic appraisals of her works are generally sparse. This paper is a stylistic study of Maya Angelou’s autobiographical novel I Know Why the Caged Bird Sings. It aims at examining the stylistic features of the text vis-à-vis the semantic Law of Transitivity so as to investigate the features that contribute to the discourse’s passing from the sphere of informing into the sphere of interaction and influence. The paper starts with brief notes on stylistics in relation to semantics. This is followed by a discussion of the Law of Transitivity, with frequent references to John R. Searle’s patterns of metaphor. The varied forms of the relations between the signified or the source (the vehicle) and the signifier or the target (the tenor) in relation to the sign (the common ground) are discussed in the light of the figurative devices employed by the author and the functions achieved in revealing the ideological issues of race and gender in the book. The study also attempts to position the formal and psychological elements within a sociocultural context in order to promote the reader’s understanding of the purposes and functions for which certain linguistic choices are made.

  12. Security Classification Using Automated Learning (SCALE): Optimizing Statistical Natural Language Processing Techniques to Assign Security Labels to Unstructured Text

    Science.gov (United States)

    2010-12-01

    statistical natural language processing and machine learning techniques to automatically assign a security classification to...statistical natural language processing (TLNS) and machine learning when assessing the sensitivity of unstructured text. We carried out a...results demonstrate that statistical natural language processing combined with machine learning is an effective means that makes it possible

  13. Tula song folklore: genre-stylistic and dialectic peculiarities

    Directory of Open Access Journals (Sweden)

    Krasovskaya Nelli Alexandrovna

    2016-06-01

    Full Text Available The article analyzes works of Tula folklore recorded in the western part of the Tula region in terms of their genre, stylistic and linguistic features. The study is relevant because Tula folk songs have not been studied and the linguistic features of these works have not been subjected to serious analysis. The article describes the genre features of songs recorded in the Belevsky district of the Tula region, including ancient fortune-telling chants, wedding ceremony songs, romantic ballads and others, and cites numerous examples from the lyrics that reflect dialectal features at the phonetic, grammatical and lexical levels. According to the authors, the modern folk song retains its genre diversity and is a kind of storeroom containing priceless linguistic wealth. The analysis allows conclusions to be drawn about the presence and good preservation of South Russian dialectal phonetic and grammatical features in the recorded songs. So far there is no established typology of Tula dialects; therefore, according to the authors, recording folklore in the territories bordering on the Tula dialects is very important and interesting for further descriptive and comparative work on identifying the eastern and south-south-west differences in Tula dialects.

  14. Classification with Multiple Classes using Naïve Bayes and Text Generation with a Small Data Set using a Recurrent Neural Network

    OpenAIRE

    Reiten, Tore Elias Gjervik

    2017-01-01

    Master's thesis, Information and communication technology, IKT590, University of Agder, 2017. In this thesis, text classification and text generation are explored using only a small data set and many classes. This thesis experiments with text classification and shows how it is able to find the most similar output compared to the input even with thousands of classes. Furthermore, text generation is explored on a small data set to create a unique output. By using Naïve Bayes text ...
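
    Multi-class text classification with Naïve Bayes, the technique named in this record, can be sketched as follows. This is a generic multinomial Naïve Bayes with add-one (Laplace) smoothing, not the thesis's implementation; the function names `train_nb` and `predict_nb` and the three-class toy data are assumptions for illustration. The same code handles any number of classes, since prediction simply takes the class with the highest log-probability.

```python
import math
from collections import Counter, defaultdict


def train_nb(examples):
    """Count classes and per-class word frequencies from (tokens, label) pairs."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in examples:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def predict_nb(model, tokens):
    """Return the label with the highest posterior log-probability."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_logp = None, -math.inf
    for label, count in class_counts.items():
        logp = math.log(count / total_docs)  # class prior
        denom = sum(word_counts[label].values()) + len(vocab)
        for token in tokens:
            if token in vocab:  # words never seen in training are ignored
                logp += math.log((word_counts[label][token] + 1) / denom)
        if logp > best_logp:
            best_label, best_logp = label, logp
    return best_label


# Toy examples, invented for illustration only.
examples = [
    (["goal", "match"], "sport"),
    (["election", "vote"], "politics"),
    (["stock", "market"], "finance"),
]
model = train_nb(examples)
print(predict_nb(model, ["vote", "election"]))  # politics
```

    The smoothing term matters most in the small-data setting the record describes, because most words will have zero counts in most classes.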

  15. "If You Have to Ask, You'll Never Know": Effects of Specialised Stylistic Expertise on Predictive Processing of Music.

    Directory of Open Access Journals (Sweden)

    Niels Chr Hansen

    Full Text Available Musical expertise entails meticulous stylistic specialisation and enculturation. Even so, research on musical training effects has focused on generalised comparisons between musicians and non-musicians, and cross-cultural work addressing specialised expertise has traded cultural specificity and sensitivity for other methodological limitations. This study aimed to experimentally dissociate the effects of specialised stylistic training and general musical expertise on the perception of melodies. Non-musicians and professional musicians specialising in classical music or jazz listened to sampled renditions of saxophone solos improvised by Charlie Parker in the bebop style. Ratings of explicit uncertainty and expectedness for different continuations of each melodic excerpt were collected. An information-theoretic model of expectation enabled selection of stimuli affording highly certain continuations in the bebop style, but highly uncertain continuations in the context of general tonal expectations, and vice versa. The results showed that expert musicians have acquired probabilistic characteristics of music influencing their experience of expectedness and predictive uncertainty. While classical musicians had internalised key aspects of the bebop style implicitly, only jazz musicians' explicit uncertainty ratings reflected the computational estimates, and jazz-specific expertise modulated the relationship between explicit and inferred uncertainty data. In spite of this, there was no evidence that non-musicians and classical musicians used a stylistically irrelevant cognitive model of general tonal music providing support for the theory of cognitive firewalls between stylistic models in predictive processing of music.

  16. Linguistic – Stylistic Technique and the Effective Teaching and ...

    African Journals Online (AJOL)

    Nekky Umera

    Abstract. This study investigated the effect of linguistic-stylistic technique on the effective teaching and learning of poetry in Nigerian senior secondary school. The sample of the study comprised of 310 senior secondary two (SS II) students from six schools randomly selected from Itu Local Government Area of Akwa Ibom ...

  17. Linguistic – Stylistic Technique and the Effective Teaching and ...

    African Journals Online (AJOL)

    Nekky Umera

    between the achievement of male and female students in poetry taught with linguistic-stylistic technique. ..... Hence, it is intrinsically motivating and enhances retention. This is in line with the findings of Ebam and Ada (1998) that ... Carefully reading the poems that are to be taught in the classroom. •. Extracting the linguistic ...

  18. Stylistics in Teacher Training: Research Programs and Future Prospects

    Science.gov (United States)

    Ventura, Ana Clara

    2016-01-01

    The aim of this research is to analyse and systematize the conceptual and empirical bases of the available literature on research approaches, objects of study, and future prospects in the field of stylistics, in order to encourage best practice in teacher training. Three research approaches are presented: the empiricist-behaviorist approach, the…

  19. Stylistic Devices in Ben Okri's The Famished Road | Ikechi ...

    African Journals Online (AJOL)

    This paper discusses stylistic devices in Ben Okri's The Famished Road. In the presentation of his story, the novelist makes use of literary devices which enrich readers' understanding and enjoyment of his subject matter. Satire, register, cliché, pidgin and proverbs are some of the devices. Others include: figurative language ...

  20. Stylistic Analysis of Robert Browning's Poem "Patriot into Traitor"

    Science.gov (United States)

    Ahmed, Mumtaz; Irshad, Ayesha

    2015-01-01

    The stylistic analysis of Robert Browning's poem "Patriot into Traitor" is done by using graphological, phonological, morphological and lexico-syntactic patterns. This analysis is helpful in decoding the underlying meanings of the poem. It clearly brings to surface what the poet really wants to impart.

  1. A Stylistic Analysis of Isidore Okpewho's The Last Duty | Nnaji ...

    African Journals Online (AJOL)

    This paper exposes the stylistic idiosyncrasies embraced by Okpewho in passing his message. The writer uses individualistic narrative style. Dash and ellipsis are punctuation marks also used in special ways. Figures of speech like anaphora, litotes, and rhetorical questions among others are used to beautify the work.

  2. Linguistic – Stylistic Technique and the Effective Teaching and ...

    African Journals Online (AJOL)

    This study investigated the effect of linguistic-stylistic technique on the effective teaching and learning of poetry in Nigerian senior secondary school. The sample of the study comprised of 310 senior secondary two (SS II) students from six schools randomly selected from Itu Local Government Area of Akwa Ibom State, ...

  3. Preparing Stylistically Challenging Contemporary Classical Repertoire for Performance: Interpreting "Kumari"

    Science.gov (United States)

    Viney, Liam; Blom, Diana

    2015-01-01

    Research involving the learning processes of musicians seldom examines specific pieces of music, and limited attention has been devoted to the earliest stages of learning a stylistically challenging or new piece of 20th-/21st-century art music. This article describes the processes by which two pianists (the authors) learned Ross Edwards's…

  4. Classification

    Science.gov (United States)

    Clary, Renee; Wandersee, James

    2013-01-01

    In this article, Renee Clary and James Wandersee describe the beginnings of "Classification," which lies at the very heart of science and depends upon pattern recognition. Clary and Wandersee approach patterns by first telling the story of the "Linnaean classification system," introduced by Carl Linnaeus (1707-1778), who is…

  5. Classification

    DEFF Research Database (Denmark)

    Hjørland, Birger

    2017-01-01

    This article presents and discusses definitions of the term “classification” and the related concepts “concept/conceptualization,” “categorization,” “ordering,” “taxonomy” and “typology.” It further presents and discusses theories of classification including the influences of Aristotle... and Wittgenstein. It presents different views on forming classes, including logical division, numerical taxonomy, historical classification, hermeneutical and pragmatic/critical views. Finally, issues related to artificial versus natural classification and taxonomic monism versus taxonomic pluralism are briefly...

  6. Comparison of Aminpour’s Qhazal and Qhazve‘s Qhazal Based on Structural Stylistics

    Directory of Open Access Journals (Sweden)

    Ahmad Rezae

    2014-12-01

    Full Text Available Abstract The importance of stylistics in examining texts has resulted in the burgeoning of various stylistic schools, with their differing methodologies. Among them, structural stylistics, which is the more polished version of formal stylistics, looks over the structure of words, sounds and syntax in the text and then focuses on their relation to the content. In fact, structural stylistics, drawing upon the structuralists' views, is formed on the basis of structural linguistics, and works to analyze a literary text with regard to its organic unity and the inter-relationship of the parts to the whole. In other words, the main purpose is to approach the content of a work through its form and structure. In this method, stylistic features of the work are recognized through understanding the structural proportions between sounds, words and syntax. Accordingly, the structuralists regard style as the manner of deviation and extra-regularity and the frequency of its occurrence in a particular era. The present article first refers to the definitions of style and stylistic schools and enumerates the features of structural stylistics, and then sets out to study and compare two ghazals on the subject of the Holy Defense, by Gheisar Aminpour ("Taghvimha" or "Calendars") and Alireza Ghazve ("Ghesmat" or "Destiny"), in the light of structural stylistics. Through analyzing the different parts of the texts, with regard to balances and deviation, we will deal with the relatedness and proportion of these parts to the content. "Calendars" is among the best-known ghazals of Aminpour. It contains the issues of feeling ashamed before the martyrs, lamenting over our negligence, and feeling left behind by the martyrs. A special sense of grief and sadness, hidden in the particular rhythm and cadence of the words and combinations, helps the poet to express his feelings and thoughts. The ghazal "Destiny" deals also with the distress and exhaustion the poet feels in this

  7. The primitives of Santa Clara of Ubeda: stylistic and iconographic study, critical appraisals and vicissitudes of a dispersed heritage

    Directory of Open Access Journals (Sweden)

    Clara Beltrán Catalán

    2016-12-01

    Full Text Available The authors study the stylistic and iconographic aspects of a collection of paintings on wood dating from the 15th and 16th centuries, originally in the Royal Monastery of Santa Clara at Úbeda. This collection was sold in the 1920s with the participation of the antique dealer Celestino Dupont. The research is complemented by an analysis of the critical appraisals given to these works and their history since their introduction into the art market.

  8. Stylistic features of narrative procedure in a psychological short story in the context of teaching interpretation

    Directory of Open Access Journals (Sweden)

    Stakić Mirjana M.

    2016-01-01

    Full Text Available The paper investigates the stylistic features of narrative procedure in a psychological short story in the context of its interpretation in the teaching of the Serbian language and literature. The narrative procedure in a psychological short story is characterized by first-person narration (the I-form), interior monologue and direct interior monologue, dreams, oversights and introspection. It is also characterized by a particular sentence structure, often incomplete and elliptical in form, used to express the conflicts going on in the characters' inner sphere and the verbal interaction between the characters. The narrative procedure applied in a psychological short story indicates that its plot is subordinated to the internal psychological experiences. During the interpretation of a psychological short story, students are directed and encouraged, through the interpretation of stylistic and narrative procedures, to discover complex and often hidden psychological mechanisms which spur the characters to act and which influence their behavior, verbal expression and mutual relations. The interpretation of language signs which may have psychological and semantic potential leads to the revealing of unconscious internal psychological processes and mechanisms which take place within a literary character.

  9. Psychological landscape films: narrative and stylistic approaches

    Directory of Open Access Journals (Sweden)

    David Melbye

    2017-01-01

    Full Text Available This article serves to broadly address formalist approaches to rendering natural settings psychological in cinema, or, in other words, mobilizing landscapes to function beyond their usual function as narrative backdrop. By “formalist” it should be understood that such an approach to visual representation is both inherently aggressive and experimental, where one implies the other, especially when compared to more realist modes of expression. Instead of human characters portrayed within an authenticating wilderness, as in the typical classical Hollywood Western, the precise topography of the landscape becomes reflective of a particular human consciousness.

  10. Aesthetic Proximity: the Role of Stylistic Programme Elements in Format Localisation

    Directory of Open Access Journals (Sweden)

    Jolien van Keulen

    2016-08-01

    Full Text Available Implications of the transnationalisation of television are often studied by focusing on the localisation of the content of formatted programmes. Although television is essentially an audio-visual medium, little attention has been paid to the aesthetic aspects of television texts in relation to transnationalisation and formatting. Transnationalisation of production practices, such as through formatting, implies a transnational aesthetic. At the same time, aspects of style are specific to place, culture or audience. In this article, the localisation of stylistic programme elements is explored using a comparison of two reality format adaptations. It is argued that style plays an important role in the expression of the local in a transnational industry.

  11. Stylistics in the Southeast Asian ESL or EFL Classroom: A Collection of Potential Teaching Activities

    Science.gov (United States)

    Gonzales, Wilkinson Daniel Wong; Flores, Eden R.

    2016-01-01

    For the past few decades, stylistics has emerged as a discipline that encompasses both literary criticism and linguistics. The integration of both disciplines opened many opportunities for English literature and language teachers to get creative in their teaching--by introducing the stylistic approach in their classrooms. However, in a typical…

  12. Stylistic Analysis of the Short Story "The Last Word" by Dr. A. R. Tabassum

    Science.gov (United States)

    Bari Khan, Abdul; Ahmad, Madiha; Ahmad, Sofia; Ijaz, Nida

    2015-01-01

    In this article stylistic analysis of short story "The Last Word" by Dr. A. R. Tabassum is performed. The formative elements of the story, such as point of view, characters and allegorical element, are discussed in detail so as to give a better insight of the story. The story is analyzed stylistically in terms of figures of speech where…

  13. Stylistic Analysis of Ola Rotimi's Hopes of the Living Dead | Lere ...

    African Journals Online (AJOL)

    This paper explores the writer's choice of language and linguistic techniques which are effective in passing across his message. It discusses the stylistic rudiments Rotimi uses in realising the play. The stylistic features include: diction, sentence types, code-mixing, Nigerianisms, punctuation marks, figures of speech, use of ...

  14. Simile: the most salient stylistic feature in Kelile and Demne

    Directory of Open Access Journals (Sweden)

    Maryam Mahmoodi

    2014-12-01

    Full Text Available Kelile and Demne is one of the most salient samples of Persian technical prose. Rhetorical and semantic figures and figures of speech, namely simile, metaphor, metonymy and irony, are among the stylistic features of this book. Among these, simile, as the most influential imagination tool, plays a dominant role in the illustrations of the book. In this article, simile has been analyzed and investigated in all its variations in Kelile and Demne. In this book, simile appears from its most laconic form (eloquent simile) to its most extensive form. But the major features of these similes are their outspokenness, explicitness and sometimes their novelty. Among the likening components, the range of image vocabulary is one of the likening features in this book. Also, the point of similarity has usually been abstracted from man's states, shape, place, space, volume and, generally, matters concerning the visual and tactile senses, so its perception is not too difficult. The variety and extension of likening vehicles in this work are worth contemplating. In the analysis of simile on the credit of both parties, we can conclude that ratio-emotional similes are among the most frequent kinds of simile. Nasrollah Monshi has extended the field of emotional similes and has manipulated the relations between objects in a novel way. Allegoric simile has been used abundantly in Kelile and Demne. It justifies the didactic function of this text. Allegory approaches its main role in this book, i.e. arguing and convincing. The contents of allegories in this book are moral and political and, in terms of form, they are anecdotes of animals and human beings. The types of similes on the credit of form, namely equalization similes, implied comparative similes and subtrahend similes, have also been used. Among the salient features of this book, several images together or in interference with each other have been used in one word or sentence. Sometimes similes accompany other

  15. Simile: the most salient stylistic feature in Kelile and Demne

    Directory of Open Access Journals (Sweden)

    Maryam Mahmoodi

    2014-11-01

    Full Text Available Kelile and Demne is one of the most salient samples of Persian technical prose. Rhetorical and semantic figures and figures of speech, namely simile, metaphor, metonymy and irony, are among the stylistic features of this book. Among these, simile, as the most influential imagination tool, plays a dominant role in the illustrations of the book. In this article, simile has been analyzed and investigated in all its variations in Kelile and Demne. In this book, simile appears from its most laconic form (eloquent simile) to its most extensive form. But the major features of these similes are their outspokenness, explicitness and sometimes their novelty. Among the likening components, the range of image vocabulary is one of the likening features in this book. Also, the point of similarity has usually been abstracted from man's states, shape, place, space, volume and, generally, matters concerning the visual and tactile senses, so its perception is not too difficult. The variety and extension of likening vehicles in this work are worth contemplating. In the analysis of simile on the credit of both parties, we can conclude that ratio-emotional similes are among the most frequent kinds of simile. Nasrollah Monshi has extended the field of emotional similes and has manipulated the relations between objects in a novel way. Allegoric simile has been used abundantly in Kelile and Demne. It justifies the didactic function of this text. Allegory approaches its main role in this book, i.e. arguing and convincing. The contents of allegories in this book are moral and political and, in terms of form, they are anecdotes of animals and human beings. The types of similes on the credit of form, namely equalization similes, implied comparative similes and subtrahend similes, have also been used. Among the salient features of this book, several images together or in interference with each other have been used in one word or sentence. Sometimes similes accompany other

  16. Stylistics of Nafthat ol-Masdur by Zeidari Nasvi

    Directory of Open Access Journals (Sweden)

    Fereydon Tahmasbi

    2016-05-01

    disrupt the sequence of the events. The author, in accordance with the society and the available prose, adopted the mix of Arabic and Persian prose and this is one of the effects of socio-political structure on his prose. The presence of different social classes in his book reflects the interaction between literature and society.

  17. A Stylistic Analysis of Linguistic Patterns in Chichamanda Ngozi Adichie’s Purple Hibiscus

    Directory of Open Access Journals (Sweden)

    Muchamad Sholakhuddin Al Fajri

    2017-06-01

    Full Text Available This study aims to carry out a detailed and systematic stylistic analysis of linguistic patterns in the novel Purple Hibiscus by Chichamanda Ngozi Adichie. It analyses a specific extract of the novel in terms of narration and point of view, conversational analysis, speech and thought presentation and mind style, and examines how these linguistic devices and patterns are employed by the author to shape the characters’ personalities and the relationships between them in the reader’s mind. The result appears to suggest that the author successfully represents the protagonist, Kambili, as an obedient and salient daughter who deeply respects her father, while her father, Eugene, is constructed as a strict and religious father who imposes absolute control on his daughter.

  18. Socio-stylistic aspects of linguistic variation: schooling and monitoring effects

    Directory of Open Access Journals (Sweden)

    Raquel Meister Ko. Freitag

    2015-07-01

    Full Text Available Alveolar stops following palatal glide in Brazilian Portuguese are investigated, with focus on constraints for the choice between occlusive or affricate variants. Data was collected in the community of practice of Praesidium Mãe da Divina Graça da Legião de Maria, from a sample belonging to the ‘Falares Sergipanos’ database, and submitted to GoldVarbX analysis. Seven hundred seventy-five variable contexts were identified, in which 550 occurrences corresponded to the occlusive variant. Results show that the phenomenon is constrained by linguistic, social and stylistic factors, and highlight the importance of a detailed control of the context in the analysis of variable phenomena at the phonetic-phonological level.

  19. On the Russian Expression SESTʹ V LUZHU: Semantic and Stylistic Peculiarities

    Directory of Open Access Journals (Sweden)

    Юлия Борисовна Камчатнова

    2012-12-01

    Full Text Available The Russian expression сесть в лужу (literally, to sit into a puddle) stands apart from similar expressions with the same image in other languages, both semantically and stylistically. Whereas the expressions to find oneself in the mire, in der Patsche sitzen, etc. mean ‘to get into a difficult situation’ or ‘get tangled,’ the saying сесть в лужу means ‘to get into an awkward or funny situation, to fail because of being unable to live up to one’s expectations, to shame oneself.’ Perhaps, here the verb to sit, which in most cases implies a sitting position, changed the meaning of the expression, and it approached the offensive colloquial наложить в штаны ‘to crap one’s pants,’ which has the literal sense ‘to shit or wet oneself from fear,’ and the figurative sense ‘to shame oneself through unredeemed promises.’ The expression сесть в лужу in current usage describes a person who is so scared that he has wet or soiled his pants. Judging by the fact that this expression cannot be found in Dahl’s dictionary and that it appeared in Russian literature at the beginning of the 20th century, it is obvious that before that time, it had been considered vulgar, substandard and unprintable. Nowadays, the expression is widespread, and though its origin has been forgotten, it has not become stylistically neutral: in dictionaries it is tagged as colloquial, informal, pejorative, ironic, and humorous.

  20. Latin Polyphony in the Early Spanish Baroque: Suggestions for Stylistic Criteria

    Directory of Open Access Journals (Sweden)

    Etzion, Judith

    2001-12-01

    Full Text Available This article focuses on the principal stylistic features in selected Latin works by Mateo Romero (Maestro Capitán), in order to suggest some basic guidelines for the generic and stylistic diversity of the Latin corpus in the early Spanish baroque. It essentially demonstrates how, on the one hand, Romero's contrapuntal works in the supposedly 'estilo antiguo' are substantially modified, and how, on the other, traditional features are retained within his works in the 'new style'. A special emphasis is given to Romero's polychoral works, which exemplify the most 'progressive' polychoral trend in the early Spanish baroque.

  1. Diagnostic investigations and historical-stylistic evaluation on the oil painting: "reading man by oil lamp light"

    Directory of Open Access Journals (Sweden)

    Salvatore Lorusso

    2006-02-01

    Full Text Available This investigation aims to verify the attribution of the oil painting (70 x 50.5 cm), portraying a man reading by oil lamp light, to Gerrit van Hontorst. The note draws not only on a stylistic and art-historical evaluation but also on diagnostic techniques, applied to characterize the constituent materials, the execution technique and the state of preservation. The investigation rejects the attribution to the painter Gerrit van Hontorst, but it does not exclude a dating within the 17th century.

  2. The Contribution of the Vaccine Adverse Event Text Mining System to the Classification of Possible Guillain-Barré Syndrome Reports

    Science.gov (United States)

    Botsis, T.; Woo, E. J.; Ball, R.

    2013-01-01

    Background We previously demonstrated that a general purpose text mining system, the Vaccine adverse event Text Mining (VaeTM) system, could be used to automatically classify reports of anaphylaxis for post-marketing safety surveillance of vaccines. Objective To evaluate the ability of VaeTM to classify reports to the Vaccine Adverse Event Reporting System (VAERS) of possible Guillain-Barré Syndrome (GBS). Methods We used VaeTM to extract the key diagnostic features from the text of reports in VAERS. Then, we applied the Brighton Collaboration (BC) case definition for GBS, and an information retrieval strategy (i.e. the vector space model) to quantify the specific information that is included in the key features extracted by VaeTM and compared it with the encoded information that is already stored in VAERS as Medical Dictionary for Regulatory Activities (MedDRA) Preferred Terms (PTs). We also evaluated the contribution of the primary (diagnosis and cause of death) and secondary (second level diagnosis and symptoms) diagnostic VaeTM-based features to the total VaeTM-based information. Results MedDRA captured more information and better supported the classification of reports for GBS than VaeTM (AUC: 0.904 vs. 0.777); the lower performance of VaeTM is likely due to the lack of extraction by VaeTM of specific laboratory results that are included in the BC criteria for GBS. On the other hand, the VaeTM-based classification exhibited greater specificity than the MedDRA-based approach (94.96% vs. 87.65%). Most of the VaeTM-based information was contained in the secondary diagnostic features. Conclusion For GBS, clinical signs and symptoms alone are not sufficient to match MedDRA coding for purposes of case classification, but are preferred if specificity is the priority. PMID:23650490
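
    The abstract above mentions a vector space model and reports specificity. A minimal sketch of both ideas follows, assuming raw term-frequency weights and cosine similarity; the abstract does not state the weighting scheme actually used, and the function names and the example terms are hypothetical, invented purely for illustration.

```python
import math
from collections import Counter


def cosine_similarity(doc_terms, query_terms):
    """Cosine similarity between two bags of terms (raw term-frequency weights)."""
    a, b = Counter(doc_terms), Counter(query_terms)
    # Only terms present in both bags contribute to the dot product.
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def specificity(true_negatives, false_positives):
    """Share of truly negative reports that were classified as negative."""
    return true_negatives / (true_negatives + false_positives)


# Hypothetical feature terms (not taken from VAERS or the BC case definition).
report = ["weakness", "areflexia", "paresthesia", "weakness"]
case_definition = ["weakness", "areflexia", "csf", "electrophysiology"]
score = cosine_similarity(report, case_definition)
```

    A report whose extracted terms overlap more with the case-definition terms receives a higher score, which can then be thresholded to classify it.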

  3. Stylistic analysis of headlines in science journalism: A case study of New Scientist.

    Science.gov (United States)

    Molek-Kozakowska, Katarzyna

    2017-11-01

    This article explores science journalism in the context of the media competition for readers' attention. It offers a qualitative stylistic perspective on how popular journalism colonizes science communication. It examines a sample of 400 headlines collected over the period of 15 months from the ranking of five 'most-read' articles on the website of the international magazine New Scientist. Dominant lexical properties of the sample are first identified through frequency and keyness survey and then analysed qualitatively from the perspective of the stylistic projection of newsworthiness. The analysis illustrates various degrees of stylistic 'hybridity' in online popularization of scientific research. Stylistic patterns that celebrate, domesticate or personalize science coverage (characteristic of popular journalism) are intertwined with devices that foreground tentativeness, precision and informativeness (characteristic of science communication). The article reflects on the implications of including various proportions of academic and popular styles in science journalism.
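
    The abstract above refers to a "keyness" survey without naming the statistic. One common choice in corpus linguistics is the log-likelihood (G2) score, which compares a word's frequency in a study corpus with its frequency in a reference corpus. The sketch below assumes that statistic; it is not confirmed as the one used in the article, and the function name is hypothetical.

```python
import math


def log_likelihood_keyness(freq_study, size_study, freq_ref, size_ref):
    """Log-likelihood (G2) keyness of one word across two corpora.

    freq_*: occurrences of the word; size_*: total tokens in each corpus.
    """
    total = size_study + size_ref
    combined = freq_study + freq_ref
    # Expected counts if the word were equally frequent in both corpora.
    expected_study = size_study * combined / total
    expected_ref = size_ref * combined / total
    g2 = 0.0
    if freq_study > 0:
        g2 += freq_study * math.log(freq_study / expected_study)
    if freq_ref > 0:
        g2 += freq_ref * math.log(freq_ref / expected_ref)
    return 2.0 * g2
```

    A word with the same relative frequency in both corpora scores zero; the larger the score, the more strongly the word characterizes one corpus relative to the other.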

  4. Development of Feature Set, Classification Implementation and Applications for Vowel Migration/Modification in Sung Filipino (Tagalog) Texts and Perceived Intelligibility

    Directory of Open Access Journals (Sweden)

    Virginia B. Bustos

    2009-12-01

    Full Text Available With the emergence of research on real-time visual feedback to supplement vocal pedagogy, the utilization of technology in the world of music is now seen to accelerate skills learning and enhance cognitive development. The researchers of this project aim to further analyze vowel intelligibility and develop software applications intended to be used not only by professional singers but also by individuals who wish to improve their singing capability. Data in the form of sung vowels and song pieces were obtained from 46 singers. A Listening Test was then conducted on these samples to obtain the ground truth for vowel classification based on human perception. Simulation of the human auditory perception of sung Filipino vowels was performed using formant frequencies and Mel-frequency cepstral coefficients as feature vector inputs to a two-stage Discriminant Analysis classifier. The setup resulted in an over-all Training Set accuracy of 89.4% and an over-all Test Set accuracy of 90.9%. The accuracy of the classifier, measured in terms of the correspondence of vowel classifications obtained from the classifier with the results of the Listening Test, reached 92.3%. Using information obtained from the classifier, offline and online/real-time software applications were developed. The main application features include the display of the spectral envelope and spectrogram, pitch and vibrato analysis and direct feedback on the classification of the sung vowel. These features were recommended by singers who were surveyed and were incorporated in the applications to aid singers to adjust formant locations, directly determine listener’s perception of sung vowels, perform modeling effectively and carry out vowel migration.
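
    To illustrate how classifier output can be compared against listening-test labels, the following sketch uses a nearest-centroid classifier on (F1, F2) formant pairs. This is a deliberately simplified stand-in, not the two-stage Discriminant Analysis classifier described above, and it omits Mel-frequency cepstral coefficients entirely. All formant values and labels are toy numbers invented for the example.

```python
import math


def centroid(points):
    """Mean point of a list of equal-length numeric tuples."""
    n = len(points)
    return tuple(sum(p[i] for p in points) / n for i in range(len(points[0])))


def train(samples):
    """samples: dict mapping vowel label -> list of (F1, F2) tuples in Hz."""
    return {label: centroid(pts) for label, pts in samples.items()}


def classify(model, point):
    """Return the label whose centroid is closest in Euclidean distance."""
    return min(model, key=lambda label: math.dist(model[label], point))


def agreement(predicted, reference):
    """Fraction of items where the classifier matches the reference labels."""
    matches = sum(p == r for p, r in zip(predicted, reference))
    return matches / len(reference)


# Toy formant values (Hz), invented for illustration only.
training = {
    "a": [(750, 1300), (800, 1250)],
    "i": [(300, 2300), (320, 2200)],
    "u": [(320, 800), (350, 850)],
}
model = train(training)
test_points = [(780, 1280), (310, 2250), (340, 820), (330, 1500)]
listener_labels = ["a", "i", "u", "u"]  # hypothetical listening-test result
predicted = [classify(model, p) for p in test_points]
print(agreement(predicted, listener_labels))
```

    The last toy point lies between categories and is assigned a label that differs from the hypothetical listener judgment, which is the kind of disagreement an agreement score is meant to expose.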

  5. Language Personality of the Publicist: Rhetorical and Stylistic Canon (Yu. Senkevich “To “Ra” Across the Atlantic”)

    Directory of Open Access Journals (Sweden)

    Olga V. Shatalova

    2017-10-01

    Full Text Available The article identifies the communicative and linguistic parameters of the speech of the 20th-century publicist Yu. Senkevich, which are presented as a model for forming a language personality in the developing modern information and communication environment. An analysis of the book «On “Ra” through Atlantic» shows that Yu. N. Senkevich’s work meets the main criteria of popular-science journalism: a high degree of reliability and the authority of the publicist; dynamism, drama and intelligence of exposition. Unobtrusive enlightenment based on making scientific knowledge relevant to the addressee and on the dialogization of publicistic discourse, a high level of psychological and philosophical generalization, and the declaration of humanistic values form a specific rhetoric that is supported by the formal and grammatical organization of the publicist's speech. The predominance of syntactic constructions of a certain type and of stylistic devices and figures determines the dynamism of the exposition and the scale on which the material is presented. Humour and light self-irony, as significant characteristics of the language personality, form the basis of the rhetorical and stylistic canon realized in Yu. N. Senkevich’s publicistic works, «the human view of people and society», which becomes a necessary reference point in the modern information and communication environment.

  6. Clipped Wings and the Great Abyss: Cognitive Stylistics and Implicatures in Abiezer Coppe’s ‘Prophetic’ Recantation

    Directory of Open Access Journals (Sweden)

    Borgogni Daniele

    2017-03-01

    Full Text Available In this article, two major paradigms within cognitive stylistics, the Conceptual Metaphor Theory (CMT) and the Conceptual Integration Theory (CIT), are applied as largely complementary approaches to discuss the scope and implicatures of the central metaphorical image of Coppe’s Return to the wayes of Truth (1651), a text written by one of the most famous radical preachers of the Civil War period as a plea to be released from prison. The article will focus on how the linguistic and cultural contexts of Coppe’s prophetic writing, in their interaction with the dynamic conceptual relationships of a conceptual integration network, open up new possibilities of perspectivizing and insinuating radically different meanings and implicatures: the use of blends in Coppe’s text has a direct effect on the structure of the analogies that can be made between mental spaces, thereby triggering new meaning effects, supplementary symbolizing patterns, and unpredictable perlocutionary effects.

  7. A Stylistic and Proficiency-based Approach to EFL Learners’ Performance Inconsistency

    Directory of Open Access Journals (Sweden)

    Gholam-Reza Abbasian

    2013-11-01

    Full Text Available Performance deficiencies and inconsistencies among SLA or FL learners can be attributed to a variety of sources, including both systemic (i.e., language) issues and individual variables. Despite a rich background, the literature still suffers from a gap as far as examining the issue from the perspective of language proficiency and learning style is concerned. To fill the gap, this study addressed EFL learners’ interlanguage performance (i.e., error types) in the light of their learning styles and language proficiency levels. Participants were 73 Iranian graduate EFL learners, who received the Michigan proficiency tests along with Reid’s Perceptual Learning-Style Preference Questionnaire in order to measure their language proficiency level and learning style types. For the purpose of the study they were divided into pre-intermediate, intermediate and upper-intermediate levels. To explore their performance inconsistencies, both oral data from a semi-structured (SST) interview and written data based on an on-the-spot piece of writing were used and then classified based on Corder’s (1971) scheme into pre-systematic, systematic and post-systematic errors. The results revealed significant relationships among the target variables, if not predictor-predicted relationships. The non-linear relationship among the variables underscores the significance of an integrative approach to EFL learners’ performance inconsistencies and the importance of stylistic instruction in EFL contexts.

  8. MASCULINE LANGUAGE IN INDONESIAN NOVELS: A FEMINIST STYLISTIC APPROACH ON BELENGGU AND PENGAKUAN PARIYEM

    Directory of Open Access Journals (Sweden)

    Supriyadi .

    2014-06-01

    Full Text Available Belenggu is a novel written by Armijn Pane in 1938, whereas Pengakuan Pariyem is a lyrical novel written by Linus Suryadi AG and published in 1980. Both are of interest for analysis from a linguistic perspective, especially in relation to gender and patriarchal issues. For this purpose, the appropriate approach is Sara Mills’s feminist stylistics, since it analyzes literary works from their linguistic aspects and then extends the analysis to the context in which the work was published. The results show that Belenggu basically uses masculine language at the level of words, phrases, clauses, sentences and discourse when related to its context. Contextually, Belenggu represents the author’s response to the conditions of his society, in which women tried to assert their rights to equality with men. It also represents the author’s criticism of women, since it holds that it is better for women to keep working domestically and supporting their husbands. Meanwhile, Pengakuan Pariyem is a lyrical novel that considers men and women to have mutual relationships, although women still work domestically and men work outside.

  9. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text

    Science.gov (United States)

    2011-01-01

    Background Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. Results A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthew's Correlation Coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were
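
    The abstract above reports both a Matthews correlation coefficient (MCC) and an accuracy for the ACT runs. As a brief illustration of why the two can differ on an imbalanced relevant/non-relevant task, the sketch below computes both from confusion-matrix counts using the standard MCC formula. The counts are hypothetical and are not taken from the BioCreative III evaluation.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient from binary confusion-matrix counts."""
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        # Convention: return 0 when any marginal total is empty.
        return 0.0
    return (tp * tn - fp * fn) / denom


def accuracy(tp, tn, fp, fn):
    """Fraction of all items classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


# Hypothetical counts for an imbalanced task: few relevant abstracts.
tp, tn, fp, fn = 5, 90, 2, 3
print(accuracy(tp, tn, fp, fn), matthews_corrcoef(tp, tn, fp, fn))
```

    With few positive items, a classifier can reach high accuracy largely by getting the majority class right, while MCC stays lower because it also penalizes errors on the minority class.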

  10. The Speak UP! Salon Project: Using Hair Stylists as Lay Health Educators About Unintended Pregnancy.

    Science.gov (United States)

    Roberts-Dobie, Susan; Rasmusson, Addie; Losch, Mary E

    2018-01-01

    The Speak UP! Salon Project trained 126 hair stylists from 60 salons as lay health educators to increase knowledge about contraceptives and inform clients about financial support for long-acting reversible contraceptives at local family planning clinics. Stylists' informal social support system, combined with the rapport built with clients over time, make them excellent candidates for lay health educator projects. This 3-year salon-based intervention was implemented in nine counties in a Midwestern state. Results from a subsample of participants who completed an online questionnaire (n = 177) indicate that hair stylists are a feasible method to link target populations to health information and to the health care system. The benefits and challenges of collecting data in a salon environment are also discussed.

  11. Magical cooking: Some stylistic characteristics of the novel Like water for chocolate

    Directory of Open Access Journals (Sweden)

    Uršula Kastelic Vukadinović

    2013-12-01

    Full Text Available In the paper we analyze some of the stylistic characteristics of the novel Like Water for Chocolate from the perspective of a non-Mexican reader. The narrator of the story interlaces the typical linguistic structures specific to cooking recipes and the story of a forbidden love between Tita and Pedro. As a good cook she gives advice and describes the procedures for the preparation of the food, all of which she intertwines with the story as once told to her, which is explicitly expressed by using reporting verbs. We found that one of the characteristics of the novel is the visualization of the story. The author herself says that her literary creation is based on visual images and that afterwards she turns them into a story. In the novel this is reflected in Tita's intense response to sensory stimuli. In the article, we highlight examples from the text, structured according to the predominant senses involved. Through the food that Tita is preparing, Tita and Pedro establish an unusual, passionate and sensual relationship. Among the ingredients for the dishes that are connected to their love story, the reader encounters many unknown Mexican cultural expressions, which do not hinder the understanding of the story. The text shows that we are dealing with a dish or an ingredient that is mentioned at the beginning as a part of the recipe or presented in such a context. Therefore, the coherence of the text is maintained. The reader takes pleasure in the reading that transports him to the distant worlds of magical realism.

  12. Mapping the stylistic affiliations of Le Corbusier's work

    NARCIS (Netherlands)

    Panigyrakis, P.I.

    2015-01-01

    The paper deals with Le Corbusier's connection to the term "style". A classification of his work in specific architectural styles is discussed, followed by a description of the procedure through which the style of the man Le Corbusier was constructed; in an attempt to search meaning in his continual

  13. A Critical-Stylistic Study Of Mind Style And Characterisation In Buchi ...

    African Journals Online (AJOL)

    Although much research has gone into Buchi Emecheta's literary art, hardly have any statements been made about the situation of mind style in her works. This study, however, considers the critical-stylistic value of mind style in the characterisation of Nnu Ego in Emecheta's The Joys of Motherhood. It reveals the author's ...

  14. A Linguistic Stylistic Study of Wole Soyinka's Night and Death in ...

    African Journals Online (AJOL)

    to the literary aspects, but with the evolution of time, style can be studied through a linguistic perspective and this is what this work tries to achieve. Linguistic Stylistics .... relationship between the moisture of the earth and that of a dog's nose. (4) is a lexical item present in the lexicon of the language, but it violates the class to ...

  15. A Pragma-Stylistic Analysis of President Goodluck Ebele Jonathan Inaugural Speech

    Science.gov (United States)

    Abuya, Eromosele John

    2012-01-01

    The study was an examination through the pragma-stylistic approach to meaning of the linguistic acts that manifest in the Inaugural Speech of Goodluck Ebele Jonathan as the democratically elected president in May 2011 General Elections in Nigeria. Hence, the study focused on speech acts type of locution, illocutionary and perlocutionary in the…

  16. Multidimensionality of Teachers' Graded Responses for Preschoolers' Stylistic Learning Behavior: The Learning-to-Learn Scales

    Science.gov (United States)

    McDermott, Paul A.; Fantuzzo, John W.; Warley, Heather P.; Waterman, Clare; Angelo, Lauren E.; Gadsden, Vivian L.; Sekino, Yumiko

    2011-01-01

    Assessment of preschool learning behavior has become very popular as a mechanism to inform cognitive development and promote successful interventions. The most widely used measures offer sound predictions but distinguish only a few types of stylistic learning and lack sensitive growth detection. The Learning-to-Learn Scales was designed to…

  17. Stylistic Variation in French: Bridging the Gap between Research and Textbooks

    Science.gov (United States)

    Etienne, Corinne; Sax, Kelly

    2009-01-01

    One fundamental aspect of all educated native speakers' language competence is stylistic variation, or the ability to adjust speech register to situational formality (Labov, 1972). In this article, we investigate how current introductory and intermediate college French textbooks develop students' sociolinguistic competence and particularly the…

  18. A Thematic and Stylistic Analysis of Aidoo's Novels and Short Stories

    African Journals Online (AJOL)

    The study also reveals that Aidoo exploits stylistic reversals in her fiction where new shades of meanings are attached to old terms and the female character's change from the language of acquiescence to that of revolt and self-assertion, thus reflecting her growth from docility and conformity to liberation. The journey motif ...

  19. N-gram support vector machines for scalable procedure and diagnosis classification, with applications to clinical free text data from the intensive care unit.

    Science.gov (United States)

    Marafino, Ben J; Davies, Jason M; Bardach, Naomi S; Dean, Mitzi L; Dudley, R Adams

    2014-01-01

    Existing risk adjustment models for intensive care unit (ICU) outcomes rely on manual abstraction of patient-level predictors from medical charts. Developing an automated method for abstracting these data from free text might reduce cost and data collection times. The objective was to develop a support vector machine (SVM) classifier capable of identifying a range of procedures and diagnoses in ICU clinical notes for use in risk adjustment. We selected notes from 2001-2008 for 4191 neonatal ICU (NICU) and 2198 adult ICU patients from the MIMIC-II database from the Beth Israel Deaconess Medical Center. Using these notes, we developed an implementation of the SVM classifier to identify procedures (mechanical ventilation and phototherapy in NICU notes) and diagnoses (jaundice in NICU and intracranial hemorrhage (ICH) in adult ICU). On the jaundice classification task, we also compared classifier performance using n-gram features to unigrams with application of a negation algorithm (NegEx). Our classifier accurately identified mechanical ventilation (accuracy=0.982, F1=0.954) and phototherapy use (accuracy=0.940, F1=0.912), as well as jaundice (accuracy=0.898, F1=0.884) and ICH diagnoses (accuracy=0.938, F1=0.943). Including bigram features improved performance on the jaundice (accuracy=0.898 vs 0.865) and ICH (0.938 vs 0.927) tasks, and outperformed NegEx-derived unigram features (accuracy=0.898 vs 0.863) on the jaundice task. Overall, a classifier using n-gram support vectors displayed excellent performance characteristics. The classifier generalizes to diverse patient populations, diagnoses, and procedures. SVM-based classifiers can accurately identify procedure status and diagnoses among ICU patients, and including n-gram features improves performance, compared to existing methods. Published by the BMJ Publishing Group Limited.
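
    The n-gram features and the two reported metrics (accuracy and F1) can be illustrated with a minimal sketch. This is not the authors' implementation: the whitespace tokenizer, the function names `ngram_features` and `accuracy_and_f1`, and the toy note and labels are all assumptions made for illustration, and the SVM training step itself is omitted.

```python
from collections import Counter


def ngram_features(text, n_max=2):
    """Count word n-grams (n = 1..n_max) in a lowercased note.

    Tokenization by whitespace is an assumption; the paper's own
    preprocessing is not described in the abstract.
    """
    tokens = text.lower().split()
    feats = Counter()
    for n in range(1, n_max + 1):
        for i in range(len(tokens) - n + 1):
            feats[" ".join(tokens[i:i + n])] += 1
    return feats


def accuracy_and_f1(y_true, y_pred):
    """Accuracy and F1 for binary labels (1 = finding present)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    accuracy = sum(1 for t, p in pairs if t == p) / len(pairs)
    denom = 2 * tp + fp + fn
    f1 = 2 * tp / denom if denom else 0.0
    return accuracy, f1


# Hypothetical note: the bigram "no jaundice" keeps the negation
# attached to the finding, which a unigram model cannot do.
features = ngram_features("no jaundice noted today")
accuracy, f1 = accuracy_and_f1([1, 0, 1, 1], [1, 0, 0, 1])
```

    In this toy case the bigram `no jaundice` appears as its own feature alongside the unigram `jaundice`, which is one plausible reading of why bigram features could compete with NegEx-derived unigrams on the jaundice task.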

  20. The Linguistic Technique of Parallelism in Al-Ahwas Al-Ansari’s Poetry: A Stylistic Study

    Directory of Open Access Journals (Sweden)

    Mah’d Ahmad Al-halhooli

    2016-11-01

    Full Text Available Parallelism is one of the most conspicuous stylistic techniques that are marked by the receiver's ear, and it also has an excellent musical rhythm. Al-Ahwas [1] thoroughly familiarizes readers with an entire host of parallelistic arrangements in his poetry, which makes the semantic units a rich material for aesthetic study. The present study presents the most beautiful uses of all varieties of parallelism in a corpus of Al-Ahwas's poetry, as the reader of his poetry would never miss observing the many images of parallelism based on semantic concordance, as well as the parallelism that appears in one line, or in a stanza, based on structural concordance, in addition to the parallelism formed by the morphological rhythm which is based on the repetition of morphological derivative formulae. Al-Ahwas did not confine himself to similar morphological formulae to achieve rhythm, but also aimed at the parallelism achieved by repetition, which falls into several patterns. They can be included in one verse line, or within a number of lines conforming to one idea, but with different contexts. The second pattern of repetition in terms of rhythm and structure is Epanaphora, or Epanalepsis, which is a repetitive structure based on inserting a word at the beginning of the speech and then repeating the same word at the end of the speech. Al-Ahwas achieved a high rhythmical harmony using parallelism, besides a semantic goal, by deliberately intensifying parallel words and functioning units in a network revolving around the dominant idea. This study hopes to pave the way for future avenues of studies in poetry under the category of stylistics. [1] The poet Al-Ahwas Al-Ansari is Abdullah bin Mohammed bin Abdullah bin Asim bin Thabit al-Ansari, one of the Umayyad poets, who died in Damascus in 105 AH / 723 AD. He was called Al-Ahwas for a tightness in his eye. He is an Islamic satire poet.

  1. A STYLISTIC ANALYSIS OF “THE RIME OF THE ANCIENT MARINER”

    Directory of Open Access Journals (Sweden)

    Shaukat Khan

    2016-12-01

    Full Text Available If a specimen of literary art is seen as a fine tapestry of words made by the skilled seamstress, the poet, then the lexis and structure of a language are the raw materials (the fabric and the thread), and by weaving them into specific patterns the finished product is achieved. The choice of materials and their arrangement into unique patterns always bear an image of their creator, or the artist; thus, a close view of them reveals the artist's identity and brings out the meaningful message that underlies the ornate running threads. Mostly, the students of literary studies cannot appreciate the beauty of the literary classics on their own. Consequently, they simply mimic the ideas, and sometimes even the words, of famous professional critics when asked to give their own critical judgment on the aesthetic merit or the thematic quality of a literary work in the shape of a home assignment, classroom presentation or an annual assessment test. The researcher drew the inspiration for carrying out this study from an idea expounded in Widdowson (1975) that this mimicry can be replaced by genuine individual opinion if the students, or even those people who have non-academic concerns with literature, are brought to a standpoint from where they can have a closer view of the raw materials, the language resources, which are involved in the making of a literary product. And, if the product in its finished form cannot elicit a desired response from them, then making them sensitive to the process of its making can be quite effective in this regard. Through the present study, an attempt has been made to show an easy access to the outlandish world of verse by means of the linguistic route which is laid with the familiar flagstones of grammar and vocabulary. That is, in this study the elusiveness of poetry will be treated with the precision of a social scientist, the linguist. The approach which serves as the basis of this study is not an invention

  2. Comparison of Aminpour’s Qhazal and Qhazve‘s Qhazal Based on Structural Stylistics

    Directory of Open Access Journals (Sweden)

    Somayye Khorshidi

    2014-11-01

    Full Text Available The importance of stylistics in examining texts has resulted in the burgeoning of various stylistic schools, with their differing methodologies. Among them, structural stylistics, which is the more polished version of formal stylistics, looks over the structure of words, sounds and syntax in the text and then focuses on their relation to the content. In fact, structural stylistics, drawing upon the structuralists' views, is formed on the basis of structural linguistics, and works to analyze a literary text with regard to its organic unity and the inter-relationship of the parts to the whole. In other words, the main purpose is to approach the content of a work through its form and structure. In this method, stylistic features of the work are recognized through understanding the structural proportions between sounds, words and syntax. Accordingly, the structuralists regard the style as the manner of deviation and extra-regularity and the frequency of its occurrence in a particular era. The present article, first, refers to the definitions of style and stylistic schools and enumerates the features of structural stylistics and then sets out to study and compare two ghazals on the subject of the Holy Defense by Gheisar Aminpour ("Taghvimha" or "Calendars") and Alireza Ghazve ("Ghesmat" or "Destiny"), in the light of structural stylistics. Through analyzing the different parts of the texts, with regard to balances and deviation, we will deal with the relatedness and proportion of these parts to the content. "Calendars" is among the best-known ghazals of Aminpour. It contains the issues of feeling ashamed of martyrs, lamenting over our negligence, and feeling left away from the martyrs. A special sense of grief and sadness, hidden in the particular rhythm and cadence of the words and combinations, helps the poet to express his feelings and thoughts. The ghazal "Destiny" deals also with the distress and exhaustion the poet feels

  3. Probing the topological properties of complex networks modeling short written texts.

    Directory of Open Access Journals (Sweden)

    Diego R Amancio

    Full Text Available In recent years, graph theory has been widely employed to probe several language properties. More specifically, the so-called word adjacency model has been proven useful for tackling several practical problems, especially those relying on textual stylistic analysis. The most common approach to treat texts as networks has simply considered either large pieces of texts or entire books. This approach has certainly worked well (many informative discoveries have been made this way), but it raises an uncomfortable question: could there be important topological patterns in small pieces of texts? To address this problem, the topological properties of subtexts sampled from entire books were probed. Statistical analyses performed on a dataset comprising 50 novels revealed that most of the traditional topological measurements are stable for short subtexts. When the performance of the authorship recognition task was analyzed, it was found that a proper sampling yields a discriminability similar to the one found with full texts. Surprisingly, the support vector machine classification based on the characterization of short texts outperformed the one performed with entire books. These findings suggest that a local topological analysis of large documents might improve its global characterization. Most importantly, it was verified, as a proof of principle, that short texts can be analyzed with the methods and concepts of complex networks. As a consequence, the techniques described here can be extended in a straightforward fashion to analyze texts as time-varying complex networks.
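
    The word adjacency model and the sampling of short subtexts can be sketched as follows. This is an illustrative simplification, not the paper's procedure: edges are treated as undirected and unweighted, tokens are split on whitespace with no stopword removal or lemmatization, and the function names (`word_adjacency_network`, `clustering_coefficient`, `sample_subtexts`) are hypothetical.

```python
from collections import defaultdict


def word_adjacency_network(tokens):
    """Link every pair of words that occur next to each other.

    Returns a dict mapping each word to the set of its neighbors
    (undirected, unweighted: an assumption made for brevity).
    """
    graph = defaultdict(set)
    for a, b in zip(tokens, tokens[1:]):
        if a != b:  # ignore self-loops from immediate repetitions
            graph[a].add(b)
            graph[b].add(a)
    return dict(graph)


def clustering_coefficient(graph, node):
    """Fraction of a node's neighbor pairs that are themselves linked."""
    neighbors = graph[node]
    k = len(neighbors)
    if k < 2:
        return 0.0
    links = sum(1 for u in neighbors for v in neighbors
                if u < v and v in graph[u])
    return 2.0 * links / (k * (k - 1))


def sample_subtexts(tokens, size):
    """Cut a token list into consecutive, non-overlapping subtexts."""
    return [tokens[i:i + size]
            for i in range(0, len(tokens) - size + 1, size)]


tokens = "the cat saw the dog and the cat ran".split()
graph = word_adjacency_network(tokens)
degree_the = len(graph["the"])
cc_the = clustering_coefficient(graph, "the")
```

    Measurements such as degree and clustering coefficient, computed per subtext returned by `sample_subtexts`, could then serve as the feature vector for an authorship classifier; which measurements and which sampling scheme the study actually used is not stated in the abstract.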

  4. A Stylistic Analysis of English Abstracts in Academic Papers from CNKI

    Institute of Scientific and Technical Information of China (English)

    Hou Lixiang; Zhang Liping; Gao Qiuju

    2016-01-01

    English abstracts, as an essential part of academic papers, play an important role in knowledge dissemination and academic exchange. A well-written abstract can not only attract readers, but also increase the chances of the paper being indexed and cited. Meanwhile, a concise abstract may save readers' time and help them extract the necessary information efficiently. The study of English abstracts is therefore quite rewarding. Based on 30 English abstracts selected at random from CNKI, this paper attempts a stylistic analysis of the English abstracts of academic papers written by Chinese research article writers. By analyzing the stylistic features of the English abstract, it is hoped that this paper will help Chinese research writers, including undergraduates and postgraduates, to overcome the difficulties in understanding and writing abstracts.

  5. Culture of teacher’s work with scientific communication text

    Directory of Open Access Journals (Sweden)

    Елена Николаевна Семеног

    2015-10-01

    Full Text Available The article focuses on the features of a teacher's culture of research work with scientific communication texts as an integral part of scientific and methodical work in a secondary school. It characterizes some aspects of the formation of the teacher's textual, genre, stylistic, terminological, lexicographical and bibliographical culture, which contributes to the development of the personal position and speaking individuality of the author of a scientific product.

  6. Probing the Topological Properties of Complex Networks Modeling Short Written Texts

    Science.gov (United States)

    Amancio, Diego R.

    2015-01-01

    In recent years, graph theory has been widely employed to probe several language properties. More specifically, the so-called word adjacency model has been proven useful for tackling several practical problems, especially those relying on textual stylistic analysis. The most common approach to treat texts as networks has simply considered either large pieces of texts or entire books. This approach has certainly worked well (many informative discoveries have been made this way), but it raises an uncomfortable question: could there be important topological patterns in small pieces of texts? To address this problem, the topological properties of subtexts sampled from entire books were probed. Statistical analyses performed on a dataset comprising 50 novels revealed that most of the traditional topological measurements are stable for short subtexts. When the performance of the authorship recognition task was analyzed, it was found that a proper sampling yields a discriminability similar to the one found with full texts. Surprisingly, the support vector machine classification based on the characterization of short texts outperformed the one performed with entire books. These findings suggest that a local topological analysis of large documents might improve its global characterization. Most importantly, it was verified, as a proof of principle, that short texts can be analyzed with the methods and concepts of complex networks. As a consequence, the techniques described here can be extended in a straightforward fashion to analyze texts as time-varying complex networks. PMID:25719799

  7. A CORPUS-BASED STYLISTIC ANALYSIS OF SELECTED RADIO ...

    African Journals Online (AJOL)

    NGOZI

    kidnapping, abduction, prostitution, and cultism… (Text B,. 402). In the sentence, the lexemes armed robbery, kidnapping, abduction, prostitution, and cultism are the hyponyms specifying the super- ... commentaries. Simile: Most family units have failed…as the children behave like sheep without a shepherd (Text A, 362).

  8. English for Science and Technology - Stylistics and Methods

    DEFF Research Database (Denmark)

    Mousten, Birthe

    The book covers basic methods for summarizing and editing of EST writing (English for Science and Technology). In addition, translation of basically technically oriented texts is covered with a view to an evaluation of formality, complexity and audience recognition in connection with different text...

  9. A Corpus-based Stylistic Analysis of Body-Soul and Heaviness-Lightness Metaphors in Kundera's Novel The Unbearable Lightness of Being

    Directory of Open Access Journals (Sweden)

    Khalid Shakir Hussein

    2015-10-01

    Full Text Available This paper represents an attempt to conduct a corpus-based stylistic analysis of  two conceptual metaphors in The Unbearable Lightness of Being, which is a novel written by Milan Kundera. Soul-body and lightness-heaviness metaphors are foregrounded as being central themes all through the novel. The way such metaphors are used in the novel indicates an insightful employment of metaphor as a cognitive tool which empowers language users with a capacity of conceptualizing different experiences. The researcher adopts conceptual metaphor theory to produce a sort of conceptual analysis incorporating Leech's semantic componential analysis within the overall analytic procedure. Different techniques are figured out in relation to the creative ways of manipulating the cognitive level of language, such as conceptual switching, conceptual extension, and conceptual fusion. These creative techniques are carefully used in the novel under investigation with different ranges of metaphorical creativity. Conceptual switching might be simple but very active in deviating from the conventional conceptual system. Conceptual extension marks certain minute elaborations conventional metaphors undergo extending the limits of cognitive conceptualization. As for conceptual fusion, it proves to be interestingly powerful in producing certain aggregations of metaphorical mappings. Keywords: Conceptual Metaphor Theory, Metaphorical Creativity, Metaphorical Mappings, Corpus Stylistics

  10. Review of Cervantes’ Poetry: Cervantes’ Metric and Stylistic Defects

    Directory of Open Access Journals (Sweden)

    Rodrigo Olay Valdés

    2013-12-01

    Full Text Available Cervantes has been considered as a technically clumsy poet. In three classic texts still very influential, Ricardo Rojas, Francisco Rodriguez Marin and Gerardo Diego referred to the poor versification of Cervantes and the metric limitations of his poetry: these accusations have become common places of criticism about Cervantes. Recent attempts to revalue the poetry of Cervantes have ignored those last series of statements. Today examined, many of the alleged technical defects that have traditionally been attributed to Cervantes did not deserve to be considered errors. It’s time to check if Cervantes was a little technically gifted poet.

  11. Stylistics of Nafthat ol-Masdur by Zeidari Nasvi

    OpenAIRE

    Fereydon Tahmasbi; Solmaz Mozaffari

    2016-01-01

    Structuralism theory is accepted as one of the most effective theories in the current century. This theory's final goal is to discover the hidden secret and linguistic aspects of a literary work via structural elements of a work based on this theory. In structural analysis of a work, implications beyond the text are not important and just the text is considered, and also the meaning is wanted from the structure. According to this theory, the current essay tries to study the struct...

  12. Virginia Woolf's "Mrs. Dalloway": A Suggested Experiment in Classroom Stylistics.

    Science.gov (United States)

    Patil, P. B.

    One of the central problems in the study of fictional prose is that of distinguishing between mere linguistic regularity, which in itself is of no interest to literary studies, and regularity which is significant for the text in which it is found. General criteria for determining whether any particular instance of linguistic prominence is likely…

  13. Politeness Principle as a Pragma-Stylistic Device in Chimamanda ...

    African Journals Online (AJOL)

    This study investigates how the politeness principle is employed in the regulation of discourse in literary texts. It also evaluates the applicability of the politeness principle, which is a pragmatic principle designed to regulate spoken discourse, to literary discourse. Excerpts from Chimamanda Ngozi Adichie's Purple Hibiscus ...

  14. The stylistic ordering of English modifiers | Sam | International ...

    African Journals Online (AJOL)

    International Journal of Pedagogy, Policy and ICT in Education, Vol 4, No 1 (2014).

  15. LANGUAGE STYLE OF HABIBURRAHMAN EL-SHIRAZY IN THE DWILOGY OF AYAT-AYAT CINTA: A STYLISTIC STUDY

    Directory of Open Access Journals (Sweden)

    Aflahah Aflahah

    2017-05-01

    Full Text Available The novel is an art form that closely relates to human life and is considered a representation of the human life journey. Language style in a novel is the embodiment of an author's use of language to express ideas, emotions and opinions, and to produce a certain effect. The main problem discussed in this study is the language style found in the novels AAC 1 and 2. The study concerns individual language style, that of an author of best-selling novels, Habiburrahman El-Shirazy (HES). The approach considered most appropriate for understanding the language use of HES in the Dwilogy of Ayat-Ayat Cinta (DAAC) is stylistic study. A stylistic study reveals what language style HES uses and what effects result. The research is descriptive and qualitative. The descriptive method is used to describe linguistic facts such as language style based on lexical choice, sentence structure, and direct or indirect meaning. The use of language style based on lexical choice (diction) in DAAC shows a language style typical of HES and also shows his ability as a Da’i and man of letters. The analysis of language style based on sentence structure and direct/indirect meaning illustrates the descriptive style of HES. Whatever he describes, he describes very accurately. He gives very detailed information about the object being spoken of, whether the background of the story or the characterization. Through language style, HES describes the emotions experienced by the characters aptly, so that readers are able to empathize. The data obtained reveal that HES uses language style to describe characters or characterization (physical condition, character, and characteristics), to describe the background, to tell the plot, and to convey the message. The results showed the lexical choice of scientific words and religious words are typical of HES’s language style in his DAAC, the

  16. Toni Morrison’s Beloved: A New Stylistic Narrative Form

    Directory of Open Access Journals (Sweden)

    Sayed Mohammad Anoosheh

    2016-12-01

    Full Text Available Toni Morrison faces a great challenge in representing the Atlantic slave trade. In contemporary narrative form, the novel, Beloved, portrays the devastating effects of forced transnational migration. She confronts conventional silences surrounding the aspect of slavery by presenting displaced Africans on their way to America. Her text defines both black literature of the late twentieth century and troubles the status quo as an experiment in aesthetic expression which demonstrates the legacy of trauma fabricated in American culture. She stylizes her narrative form of language for particular effect by using direct references and subtle allusions to the aspect of slavery. Drawing on the coded discourse of oral history and slave narrative as fashion of writerly texts, Morrison takes her readers as participants in the construction of cultural memory. The present article takes up the formalistic and cultural approach to critique the aesthetic means by which Morrison’s verbal style signifies the content of her story, Beloved, which results in a new genre as African-American literature that establishes itself in the late twentieth and the present century world literature.

  17. Slogans and meanings – a grammatical, semantic and stylistic analysis

    Directory of Open Access Journals (Sweden)

    Marinela Doina NISTEA

    2012-01-01

    Full Text Available According to DEX, explanatory dictionary of the Romanian language, published by the Institute of Linguistics of the Romanian Academy, a slogan is “an eminent, compelling phrasing expressing in a lapidary manner the political or economic aims of groups, organizations, etc., a topical issue” or “a battle cry of old Scottish clans”. Extrapolating, one may say that, nowadays, a product’s slogan is equivalent to the old battle cry: it is defined by concision and memorability, it has a precise purpose and emphasizes the offerer’s personality. Its efficiency for advertising is undeniable. The role of a slogan is to evoke the message in the most focused possible wording, meant to draw the addressee’s attention, even to surprise him/her, getting them to remember the idea suggested, regardless of whether it concerns a certain product offered for sale, or it regards a candidate, a political program or an ideology.

  18. Study children's literature by comparative stylistics approach (Poetry book in Ahmed Shoghi, and Mohammed Alhravy and Abbas Yamini Sharif model)

    Directory of Open Access Journals (Sweden)

    salah addin abdi

    2014-12-01

    Full Text Available Stylistic is disclosure laws creativity in literary discourse structure. And case under limitation in the idea of interdependence between the texts and to look at the texts in interdependence to only Angle comparison especially as were texts between different languages and what it was limited in literary texts turned out to be working round will be the technical side of any aesthetic. This literature poets any three Ahmed Shawki and Mohamed Hrawi Egyptians and Iranian poet Abbas yamini Sharif appear in children's literature be texture , beautiful word baptism imagination and purpose of the enjoyment of the small receiver and educated and refined . This literature poets look alike poets sing one topic in their hair for children is " Alktab " models of pedagogy in their poets be of standard noodles in a level voice , coordination and harmony between the internal and external music And role of repetition with different check rhythm music and show a sense of psychological and the emphasis on meaning. Be guaranteed their poets raising a child and learned, passion is pride and love of science and learning. The imagination in the poet be kind of yamini imagination Altaleva in while an innovative Shawki’s imagination and fantasy Hrawi graph. The study of each relying on poetic texts for all three of them and the energy of poetic language and its technical and creative aesthetic and stylistic comparison methodology which is lean on comparison mainly her and emerged from the comparison in two languages literary two would different Comparative Literature but of its focus on language and style .

  19. Prominent stylistic aspects in music of Nāser Khosrow's poetry

    Directory of Open Access Journals (Sweden)

    Morteza Mohseni

    2017-04-01

    Full Text Available It is axiomatic for those who are stylistically a little familiar with the periods of Persian poetry that the quasidas of Nāser-e Khosrow are totally different from those of the poets of the fifth lunar century, both in terms of their contents and technically. This difference is seen even in those areas of his poetry in which the poet is confided to make innovations. This paper investigates Nāser-e Khosrow's style in the field of the music of poetry. It also aims to identify his stylistic differences compared with those of his most famous contemporaries (Onsori, Farrokhi and Manoochehri) in the external, lateral and internal areas. As the first step, all of Nāser Khosrow's poems, except for the additions section, were taken as the scope of this study, accompanied by the main parts of the other three cited poets' divans (poetical works), which consist of almost 23600 distiches. For the second stage, each poet's divan was separately scrutinized in the three fields of the music of poetry. The frequency of each case was recorded and, after comparing the statistics related to Nāser-e Khosrow's poems with those of the other three poets, the stylistic characteristics of his poetry were explored. Statistical information related to the poets was generally recorded in a table and some parts of it were shown in a bar graph. It seems necessary to note that the researcher considered the two items of innovation and frequency in all phases of the study. Most of the studies done in the field of Nāser-e Khosrow's music of poetry investigated the prosody and meter of his poetry. Most of these studies considered difficulty and relevancy as the important prosodic characteristics of his poetry. Regarding the lateral and internal areas of Nāser-e Khosrow's poetry, there are not many arguments proposed except for using difficult rhymes and nominal radifs ( for lateral music of poetry and attending to figures of

  20. Stylistics of Khaje Abd ol-Lah Ansari’s Epistles

    Directory of Open Access Journals (Sweden)

    Azadeh Poode

    2014-07-01

    Full Text Available Abstract Stylistics is a field of knowledge to which writers and speakers have paid particular attention. Every writer's literary style shows how he expresses his thought, and it is the key to how his speech conveys meaning to the addressee's mind. The types of words, structures and sentences, and the way meaning is interpreted, are elements of literary style. The influence of Khaje Abd ol-Lah Ansari's words on Sufi didactic literature is well known among literary scholars. The integration of deeply mystical concepts with an eloquent style in song has made Khaje's works eternal, so stylistic research on them can bring out their literary and aesthetic merits. Besides Al-Sufi categories, his five epistles are the most preferred among his works, and they are described in this study. Khaje Abd ol-Lah Ansari is among the celebrated figures of mysticism and Persian literature; besides his numerous writings, he deserves analysis on this subject in order to study the methods he used in writing his Sufi didactic works, especially the five epistles Kanz ol-Salekin, Vareda'at, Del va Jan, Haft Hesar and Ghalandar nama, and the pattern they provided for later works. At the lexical level, Khaje Abd ol-Lah's style has no manifest feature. His only lexical feature is repetition, which is seen at the three levels of letter, word and sentence. Sometimes he repeats a word in several consecutive sentences, and even across several pages. Old words and sounds are very few in the epistles; this simplicity of language relates to the addressees, who are common people, and to the didactic nature of the works. At the linguistic and literary level, the epistles are closer to the second period of Persian prose than to the first, and there is no sign of archaism in them. Arabic words are used moderately, and most of them are used in their modern sense. At the syntactic level, he has used prefix verbs, especially "Dar" and

  1. Stylistics of Khaje Abd ol-Lah Ansari’s Epistles

    Directory of Open Access Journals (Sweden)

    Taghi Ejiye

    2014-08-01

    Full Text Available Abstract Stylistics is a field of knowledge to which writers and speakers have paid particular attention. Every writer's literary style shows how he expresses his thought, and it is the key to how his speech conveys meaning to the addressee's mind. The types of words, structures and sentences, and the way meaning is interpreted, are elements of literary style. The influence of Khaje Abd ol-Lah Ansari's words on Sufi didactic literature is well known among literary scholars. The integration of deeply mystical concepts with an eloquent style in song has made Khaje's works eternal, so stylistic research on them can bring out their literary and aesthetic merits. Besides Al-Sufi categories, his five epistles are the most preferred among his works, and they are described in this study. Khaje Abd ol-Lah Ansari is among the celebrated figures of mysticism and Persian literature; besides his numerous writings, he deserves analysis on this subject in order to study the methods he used in writing his Sufi didactic works, especially the five epistles Kanz ol-Salekin, Vareda'at, Del va Jan, Haft Hesar and Ghalandar nama, and the pattern they provided for later works. At the lexical level, Khaje Abd ol-Lah's style has no manifest feature. His only lexical feature is repetition, which is seen at the three levels of letter, word and sentence. Sometimes he repeats a word in several consecutive sentences, and even across several pages. Old words and sounds are very few in the epistles; this simplicity of language relates to the addressees, who are common people, and to the didactic nature of the works. At the linguistic and literary level, the epistles are closer to the second period of Persian prose than to the first, and there is no sign of archaism in them. Arabic words are used moderately, and most of them are used in their modern sense. At the syntactic level, he has used prefix verbs, especially "Dar" and sometimes "Bar"; he also uses

  2. Magical cooking: Some stylistic characteristics of the novel Like water for chocolate

    OpenAIRE

    Uršula Kastelic Vukadinović

    2013-01-01

    In the paper we analyze some of the stylistic characteristics of the novel Like Water for Chocolate from the perspective of a non-Mexican reader. The narrator of the story interlaces the typical linguistic structures specific to cooking recipes and the story of a forbidden love between Tita and Pedro. As a good cook she gives advice, describes the procedures for the preparation of the food, all of which she intertwines with the story as once told to her, which is explicitly expressed by using...

  3. Aesthetic assessment of the music composition from different stylistic periods

    Directory of Open Access Journals (Sweden)

    Pejić Biljana

    2016-01-01

    Full Text Available This paper investigates the aesthetic assessment of music compositions from the Renaissance, Baroque and Romanticism periods along the aesthetic dimensions of beauty, harmony, decoration and semantic depth. The research was carried out on 15 students of Psychology. The stimuli were 54 music compositions from the Renaissance, Baroque and Romanticism (18 for each period). The participants were asked to assess each stimulus on 13 scales (1-7 Semantic Differential scales), using the dimensions beautiful-ugly, harmony, decoration and semantic depth. The results indicate that there are differences in the assessment of music compositions from different art periods on the given dimensions beautiful-ugly, decoration and semantic depth. The compositions of the Romanticism period received the highest aesthetic evaluations and those of the Renaissance the lowest. The music of the Renaissance is rated highest in terms of harmony and semantic depth, but lowest in terms of decoration. The music of the Baroque and Romanticism is not differentiated by these dimensions. The results indicate that the aesthetic evaluation of musical works increases with the development of music periods. Decoration appears to be a more important characteristic in the aesthetic differentiation of music than harmony.

  4. Full text

    African Journals Online (AJOL)

    IndexCopernicus Portal System

    tyrosine kinase [1]. The literature about GISTs remains confusing because tumor classification and terminology are being continually refined. Furthermore, the exact definition of GISTs varies among authors. Some use the term to describe any GI submucosal mesenchymal tumor that is not myogenic. (e.g, leiomyosarcoma).

  5. Formal speech stylistics and type A behavior in 38 subjects during nonstress interviews.

    Science.gov (United States)

    Guggisberg, R; Laederach, K; Adler, R

    1981-01-01

    Open-ended interviews of 90 min length of 38 patients were analyzed with respect to speech stylistics, shown by Schucker and Jacobs to differentiate individuals with type A personality features from those with type B. In our patients, Type A/B had been assessed by the Bortner Personality Inventory. The stylistics studied were: repeated words, swallowed words, interruptions, simultaneous speech, silence latency (between question and answer) (SL), speed of speech, uneven speed of speech (USS), explosive words (PW), uneven speech volume (USV), and speech volume. Correlations between both raters for all speech categories were high. Positive correlations between extent of type A and SL (r = 0.33; p = 0.022), USS (r = 0.51; p = 0.002), PW (r = 0.46; p = 0.003) and USV (r = 0.39; p = 0.012) were found. Our results indicate that the speech in nonstress open-ended interviews of type A individuals tends to show a higher emotional tension (positive correlations for USS, PW and USV) and is more controlled in conversation (positive correlation for SL).

  6. Kırgız Şiirinin Stilistik Hususiyetleri Üzerine Stylistic Traits Of Kyrgyz Poetry

    Directory of Open Access Journals (Sweden)

    Osman ARICAN

    2013-07-01

    Full Text Available The words a poet uses reflect not only the poet's view of a situation but also the outlook of the society in which the poet lives. Thus, a versatile link can be established between the poet and the society. This fundamentally influences the stylistics of the poems and also shapes their vocabulary. This article consists of two main sections and a conclusion. We tried to determine the features of the vocabulary and themes of Kyrgyz poems. Poems by the leading poets were examined, and the similarities and differences among them were determined. Stylistics in the wider sense relates to an era, a country, a nation, an opinion or a feeling; in the narrow sense it belongs to literature, language and linguistics. Important stylistic issues have been identified in our research. When discussing Kyrgyz literature, and poems in particular, their major elements should be taken into consideration and conclusions should be based on them. This matter is discussed in seven chapters. In the second part, some notable features of Kyrgyz songs are determined and analyzed in five chapters. Samples taken from the works of Kyrgyz poets are given and examined in detail. Finally, it has become clear that the life style of a society, its historical background and its political periods should be known when researching the characteristics of a nation. It has been determined that stylistic research can be done if these are evaluated together.

  7. Diachronic changes in subjectivity and stance: A corpus linguistic study of Dutch news texts

    NARCIS (Netherlands)

    Vis, K.; Sanders, J.M.; Spooren, W.P.M.S.

    2012-01-01

    In several studies of English data, researchers have observed a trend of 'informalization': a shift of stylistic preferences in public written discourse, such as journalistic texts, towards a more conversational, or oral, style. In this paper, we aim to contribute to this issue by empirically

  8. Linguistic and Cognitive Characteristics of the Composition of the Text of J. K. Rowling's English Tales

    Science.gov (United States)

    Solodova, Elena

    2015-01-01

    This article focuses on linguistic and cognitive characteristics inherent in the composition of the English postmodern tales written by J.K. Rowling. The composition of the text is viewed as linguistic and cognitive construal that integrates compositional plot structure, compositional meaning structure, linguistic and stylistic means of their…

  9. Cognitive Stylistics

    DEFF Research Database (Denmark)

    Hansen, Thomas Illum

    2012-01-01

    An introduction to how cognitive linguistics can be turned into a stylistically sensitive analysis of literary language use, with Virginia Woolf's novel Mrs. Dalloway as the object of analysis.

  10. Media stylistics

    OpenAIRE

    Lambrou, Marina; Durant, Alan

    2014-01-01

    In this chapter we review the concept of ‘media stylistics’. In particular, we disentangle the polysemy of these two terms which, when combined, describe but can also obscure work in this area; and we discuss key themes and concerns which emerge. Through analysis of two short extracts of media discourse in English, we elaborate a distinction between two alternative emphases: study of media language as concerned with the capabilities associated with changing technologies for conveying linguist...

  11. Classification of remotely sensed images

    CSIR Research Space (South Africa)

    Dudeni, N

    2008-10-01

    Full Text Available ... of remotely sensed images. N. Dudeni, P. Debba. Introduction to Remote Sensing; Introduction to Image Classification; Objective of the study; Classification algorithms by group; Unsupervised algorithms; Supervised classification algorithms; Spatial...

  12. Creative and Stylistic Devices Employed by Children during a Storybook Narrative Task: A Cross-Cultural Study

    Science.gov (United States)

    Gorman, Brenda K.; Fiestas, Christine E.; Pena, Elizabeth D.; Clark, Maya Reynolds

    2011-01-01

    Purpose: The purpose of this study was to analyze the effects of culture on the creative and stylistic features children employ when producing narratives based on wordless picture books. Method: Participants included 60 first- and second-grade African American, Latino American, and Caucasian children. A subset of narratives based on wordless…

  13. Responses to Bakhtin's Dialogic Origins and Dialogic Pedagogy of Grammar: Stylistics as Part of Russian Language Instruction in Secondary Schools

    Science.gov (United States)

    Bazerman, Charles; Farmer, Frank; Halasek, Kay; Williams, Joseph

    2005-01-01

    The three authors writing on Bakhtin's essay, "Dialogic Origin and Dialogic Pedagogy of Grammar" -- Farmer, Halasek, and Williams -- respond to one another, and Bazerman provides a summative comment in the paragraphs that follow. The responses explore further some of Bakhtin's thoughts concerning rhetoric and its relation to stylistics and his use…

  14. Effects of Sociolinguistic Awareness on French L2 Learners' Planned and Unplanned Oral Production of Stylistic Variation

    Science.gov (United States)

    French, Leif M.; Beaulieu, Suzie

    2016-01-01

    The present study evaluated the effects of explicit instruction on advanced French second language (L2) learners' (n = 18) contextually appropriate use of two stylistic variables (/l/ deletion vs. retention and "ne" deletion vs. retention) using two different task types. A planned and unplanned oral task was used to assess students'…

  15. "If You Have to Ask, You'll Never Know": Effects of Specialised Stylistic Expertise on Predictive Processing of Music.

    Science.gov (United States)

    Hansen, Niels Chr; Vuust, Peter; Pearce, Marcus

    2016-01-01

    Musical expertise entails meticulous stylistic specialisation and enculturation. Even so, research on musical training effects has focused on generalised comparisons between musicians and non-musicians, and cross-cultural work addressing specialised expertise has traded cultural specificity and sensitivity for other methodological limitations. This study aimed to experimentally dissociate the effects of specialised stylistic training and general musical expertise on the perception of melodies. Non-musicians and professional musicians specialising in classical music or jazz listened to sampled renditions of saxophone solos improvised by Charlie Parker in the bebop style. Ratings of explicit uncertainty and expectedness for different continuations of each melodic excerpt were collected. An information-theoretic model of expectation enabled selection of stimuli affording highly certain continuations in the bebop style, but highly uncertain continuations in the context of general tonal expectations, and vice versa. The results showed that expert musicians have acquired probabilistic characteristics of music influencing their experience of expectedness and predictive uncertainty. While classical musicians had internalised key aspects of the bebop style implicitly, only jazz musicians' explicit uncertainty ratings reflected the computational estimates, and jazz-specific expertise modulated the relationship between explicit and inferred uncertainty data. In spite of this, there was no evidence that non-musicians and classical musicians used a stylistically irrelevant cognitive model of general tonal music providing support for the theory of cognitive firewalls between stylistic models in predictive processing of music.

  16. "If You Have to Ask, You'll Never Know": Effects of Specialised Stylistic Expertise on Predictive Processing of Music

    Science.gov (United States)

    Vuust, Peter; Pearce, Marcus

    2016-01-01

    Musical expertise entails meticulous stylistic specialisation and enculturation. Even so, research on musical training effects has focused on generalised comparisons between musicians and non-musicians, and cross-cultural work addressing specialised expertise has traded cultural specificity and sensitivity for other methodological limitations. This study aimed to experimentally dissociate the effects of specialised stylistic training and general musical expertise on the perception of melodies. Non-musicians and professional musicians specialising in classical music or jazz listened to sampled renditions of saxophone solos improvised by Charlie Parker in the bebop style. Ratings of explicit uncertainty and expectedness for different continuations of each melodic excerpt were collected. An information-theoretic model of expectation enabled selection of stimuli affording highly certain continuations in the bebop style, but highly uncertain continuations in the context of general tonal expectations, and vice versa. The results showed that expert musicians have acquired probabilistic characteristics of music influencing their experience of expectedness and predictive uncertainty. While classical musicians had internalised key aspects of the bebop style implicitly, only jazz musicians’ explicit uncertainty ratings reflected the computational estimates, and jazz-specific expertise modulated the relationship between explicit and inferred uncertainty data. In spite of this, there was no evidence that non-musicians and classical musicians used a stylistically irrelevant cognitive model of general tonal music providing support for the theory of cognitive firewalls between stylistic models in predictive processing of music. PMID:27732612

  17. Stylistic features of case reports as a genre of medical discourse.

    Science.gov (United States)

    Lysanets, Yuliia; Morokhovets, Halyna; Bieliaieva, Olena

    2017-03-13

    The present paper discusses the lexical and grammatical peculiarities of English language medical case reports, taking into account their communicative purposes and intentions. The objective of the research is to clarify the principal mechanisms of producing an effective English language medical case report and thus to provide recommendations and guidelines for medical professionals who will deal with this genre. The analysis of medical case reports will largely focus on the most significant linguistic peculiarities, such as the use of active and passive voice, the choice of particular verb tenses, and pronouns. The selected medical case reports will be considered using methods of lexico-grammatical analysis, quantitative examination, and contextual, structural, narrative, and stylistic analyses. The research revealed a range of important stylistic features of medical case reports which markedly distinguish them from other genres of medical scientific writing: educational and instructive intentions, conciseness and brevity, direct and personal tone, and material presented in a narrative style. The present research has shown that the communicative strategies of the analyzed discourse, mentioned immediately above, are effectively implemented by means of specific lexical units and grammatical structures: the dominance of active voice sentences, past simple tense, personal pronouns, and modal verbs. The research has also detected the occasional use of the present perfect, present simple, and future simple tenses and passive voice which also serve particular communicative purposes of medical case reports. Medical case reports possess a range of unique characteristics which differ from those of research articles and other scientific genres within the framework of written medical discourse. 
    It is to be emphasized that it is highly important for medical professionals to master the major stylistic principles and communicative intentions of medical case report as a genre in

  18. Monolingual accounting dictionaries for EFL text production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2006-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL... that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  19. Monolingual accounting dictionaries for EFL text production

    DEFF Research Database (Denmark)

    Nielsen, Sandro

    2006-01-01

    Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types... text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  20. Automatic Amharic text news classification: A neural networks ...

    African Journals Online (AJOL)

    Tewodros Hailemeskel,. 2003; Ethiopia, 2002; Zelalem, 2001). For example, the word 'ሰምቶአል' ('he hears') can be written in. Amharic as ሰምቶአል, ሰምቷል, ሰምትዋል, etc ...

  1. Structure-semantics interplay in complex networks and its effects on the predictability of similarity in texts

    Science.gov (United States)

    Amancio, Diego R.; Oliveira, Osvaldo N., Jr.; Costa, Luciano da F.

    2012-09-01

    The classification of texts has become a major endeavor with so much electronic material available, for it is an essential task in several applications, including search engines and information retrieval. There are different ways to define similarity for grouping similar texts into clusters, as the concept of similarity may depend on the purpose of the task. For instance, in topic extraction similar texts mean those within the same semantic field, whereas in author recognition stylistic features should be considered. In this study, we introduce ways to classify texts employing concepts of complex networks, which may be able to capture syntactic, semantic and even pragmatic features. The interplay between various metrics of the complex networks is analyzed with three applications, namely identification of machine translation (MT) systems, evaluation of quality of machine translated texts and authorship recognition. We shall show that topological features of the networks representing texts can enhance the ability to identify MT systems in particular cases. For evaluating the quality of MT texts, on the other hand, high correlation was obtained with methods capable of capturing the semantics. This was expected because the golden standards used are themselves based on word co-occurrence. Notwithstanding, the Katz similarity, which involves semantic and structure in the comparison of texts, achieved the highest correlation with the NIST measurement, indicating that in some cases the combination of both approaches can improve the ability to quantify quality in MT. In authorship recognition, again the topological features were relevant in some contexts, though for the books and authors analyzed good results were obtained with semantic features as well. Because hybrid approaches encompassing semantic and topological features have not been extensively used, we believe that the methodology proposed here may be useful to enhance text classification considerably, as it

  2. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  3. Title-node as a stylo-semantic device in the written text

    African Journals Online (AJOL)

    This paper presents a study of the stylistic and semantic relationships that develop from the title-nodes of written texts e.g. newspaper editorials and essays. Node is used in this paper to represent that major word or phrase in a text or an editorial title to which other words or phrases in the body of the text can be related in ...

  4. "If You Have to Ask, You'll Never Know": Effects of Specialised Stylistic Expertise on Predictive Processing of Music

    DEFF Research Database (Denmark)

    Hansen, Niels Christian; Vuust, Peter; Pearce, Marcus

    2016-01-01

    Musical expertise entails meticulous stylistic specialisation and enculturation. Even so, research on musical training effects has focused on generalised comparisons between musicians and non-musicians, and cross-cultural work addressing specialised expertise has traded cultural specificity and sensitivity for other methodological limitations. This study aimed to experimentally dissociate the effects of specialised stylistic training and general musical expertise on the perception of melodies. Non-musicians and professional musicians specialising in classical music or jazz listened to sampled... in the bebop style, but highly uncertain continuations in the context of general tonal expectations, and vice versa. The results showed that expert musicians have acquired probabilistic characteristics of music influencing their experience of expectedness and predictive uncertainty. While classical musicians...

  5. TRANSLATION STRATEGIES FOR TOURISM DISCOURSE TEXTS (THROUGH THE EXAMPLE OF A HOTEL WEBSITE)

    Directory of Open Access Journals (Sweden)

    Anna Alexeevna Novozhilova

    2015-10-01

    Full Text Available The article deals with the study of linguistic and cultural aspects of tourism discourse, a topic that has become highly relevant with the development of the tourist industry. International tourism has become widespread, resulting in the need for related translations. Purpose. The study aims at defining discourse-conditioned translation strategies for tourism discourse texts. Materials. The author used texts in German and their translations from the website of one of the premium hotels in Germany as the materials of the study. Methods. The methods of analysis used include stylistic analysis, the comparative method, the descriptive method, and semantic and structural analysis. Results. Translation mistakes and errors have been identified, and translation strategies typical for tourism discourse texts have been offered, among them recipient orientation, minimizing key information losses, and the choice of lexical and stylistic translation equivalents. The results of the study can be used in translation studies and translation practice.

  6. Analysis of Tense Interferential of Verbs in Old Narrative Texts

    OpenAIRE

    Amir Zeighami; Mahmood Barati khansari; Hassan Ziyari

    2014-01-01

    Abstract One of the admirable methods of composing stories in Persian verse and prose is the use of present tense verbs with past tense meaning. This grammatical point has escaped the attention of grammarians and stylists, although it occurs repeatedly in the texts; it is not mentioned in grammar books, but some investigators and literati have pointed it out in their works of textual correction. We cite their remarks: firstly, Allame Qazvini, doub...

  7. Retrogressive harmonic motion as structural and stylistic characteristic of pop-rock music

    Science.gov (United States)

    Carter, Paul S.

    The central issue addressed in this dissertation is that of progressive and retrogressive harmonic motion as it is utilized in the repertoire of pop-rock music. I believe that analysis in these terms may prove to be a valuable tool for the understanding of the structure, style and perception of this music. Throughout my study of this music, various patterns of progressive and retrogressive harmonic motions within a piece reveal a kind of musical character about it, a character on which much of a work's style, organization and extramusical nature often depends. Several influential theorists, especially Jean-Philippe Rameau, Hugo Riemann, and Arnold Schoenberg, have addressed the issues of functional harmony and the nature of the motion between chords of a tonal harmonic space. After assessing these views, I have found that it is possible to differentiate between two fundamental types of harmonic motions. This difference, one that I believe is instrumental in characterizing pop-rock music, is the basis for the analytical perspective I wish to embrace. After establishing a method of evaluating tonal harmonic root motions in these terms, I wish to examine a corpus of this music in order to discover what a characterization of its harmonic motion may reveal about each piece. Determining this harmonic character may help to establish structural and stylistic traits for that piece, its genre, composer, period, or even its sociological purpose. Conclusions may then be drawn regarding the role these patterns play in defining musical style traits of pop-rock. Partly as a tool for serving the study mentioned above I develop a graphical method of accounting for root motion I name the tonal "Space-Plot". This apparatus allows the analyst to measure several facets about the harmonic motion of the music, and to see a wide scope of relations in and around a diatonic key.

  8. Word and text processing in acquired prosopagnosia.

    Science.gov (United States)

    Hills, Charlotte S; Pancaroglu, Raika; Duchaine, Brad; Barton, Jason J S

    2015-08-01

    A novel hypothesis of object recognition asserts that multiple regions are engaged in processing an object type, and that cerebral regions participate in processing multiple types of objects. In particular, for high-level expert processing, it proposes shared rather than dedicated resources for word and face perception, and predicts that prosopagnosic subjects would have minor deficits in visual word processing, and alexic subjects would have subtle impairments in face perception. In this study, we evaluated whether prosopagnosic subjects had deficits in processing either the word content or the style of visual text. Eleven prosopagnosic subjects, 6 with unilateral right lesions and 5 with bilateral lesions, participated. In the first study, we evaluated their word length effect in reading single words. In the second study, we assessed their time and accuracy for sorting text by word content independent of style, and for sorting text by handwriting or font style independent of word content. Only subjects with bilateral lesions showed mildly elevated word length effects. Subjects were not slowed in sorting text by word content, but were nearly uniformly impaired in accuracy for sorting text by style. Our results show that prosopagnosic subjects are impaired not only in face recognition but also in perceiving stylistic aspects of text. This supports a modified version of the many-to-many hypothesis that incorporates hemispheric specialization for processing different aspects of visual text. © 2015 American Neurological Association.

  9. Introducing Kholasat ol-Maqamat and its stylistic and lingual characteristics

    Directory of Open Access Journals (Sweden)

    Zohreh Niksiyar

    2014-12-01

    Moreover, he drew on Khaje Mohammad Heisam’s History, Jame’ ol-Osul, some of the Sihahe Sitte and Masanid Arba’e, apparently the complete works of several poets including Sheikh Attar Neishaburi, and also some works of Sheikh Jam (Ons ot-Taebin, Seraj os-Sa’erin, Rawzat ol-Moznebin, Konuz ol-Hekme, Miftah on-Nijat, Bihar ol-Haqiqe and a collection of Sheikh Jam’s poems). Additionally, this book is of mystical, geographical and historical value and has notable stylistic and linguistic characteristics. This manuscript is distinguished for several reasons, such as the author's use of other manuscripts titled maqamat, all of which were written in Jam's time by his cohorts. Each of them has great value, since that time is probably the beginning of maqamat writing. According to the notes that remain, Ala’ol-Molk used four maqamats, named Imam Razi od-Din Ilias Taibadi, Sheikh Ahmad Tarkhestani, Darvish Ali Buzjani, about which there is now no information. This point increases the value of Kholasat ol-Maqamat. He also had some of his ancestor's works, which contain more accurate and detailed subjects, as well as more complete information about the family and descendants of the Sheikh, than Mohammad Ghaznavi's Maqamat. Three manuscripts of Kholasat ol-Maqamat are available, in different styles of handwriting: one in the Ganj Bakhsh library of Lahur, one in the theology faculty of the University of Mashhad, and one in the digital library of parliament. Finally, it should be said that this manuscript records the Sheikh's spiritual stations as they occurred and as they were written down by his cohorts at that time. Since those papers, except Mohammad Ghaznavi's Maqamat, have disappeared, Kholasat ol-Maqamat, which is an extract of those maqamats, can be regarded as the most important and exquisite manuscript about the Sheikh's spiritual stations and Keramat. Indeed, the recognition, introduction and emendation of this manuscript will be an effective and fruitful step to know more about

  10. Introducing Kholasat ol-Maqamat and its stylistic and lingual characteristics

    Directory of Open Access Journals (Sweden)

    S.abbas Mohammad zade rezaee

    2014-11-01

    Moreover, he drew on Khaje Mohammad Heisam’s History, Jame’ ol-Osul, some of the Sihahe Sitte and Masanid Arba’e, apparently the complete works of several poets including Sheikh Attar Neishaburi, and also some works of Sheikh Jam (Ons ot-Taebin, Seraj os-Sa’erin, Rawzat ol-Moznebin, Konuz ol-Hekme, Miftah on-Nijat, Bihar ol-Haqiqe and a collection of Sheikh Jam’s poems). Additionally, this book is of mystical, geographical and historical value and has notable stylistic and linguistic characteristics. This manuscript is distinguished for several reasons, such as the author's use of other manuscripts titled maqamat, all of which were written in Jam's time by his cohorts. Each of them has great value, since that time is probably the beginning of maqamat writing. According to the notes that remain, Ala’ol-Molk used four maqamats, named Imam Razi od-Din Ilias Taibadi, Sheikh Ahmad Tarkhestani, Darvish Ali Buzjani, about which there is now no information. This point increases the value of Kholasat ol-Maqamat. He also had some of his ancestor's works, which contain more accurate and detailed subjects, as well as more complete information about the family and descendants of the Sheikh, than Mohammad Ghaznavi's Maqamat. Three manuscripts of Kholasat ol-Maqamat are available, in different styles of handwriting: one in the Ganj Bakhsh library of Lahur, one in the theology faculty of the University of Mashhad, and one in the digital library of parliament. Finally, it should be said that this manuscript records the Sheikh's spiritual stations as they occurred and as they were written down by his cohorts at that time. Since those papers, except Mohammad Ghaznavi's Maqamat, have disappeared, Kholasat ol-Maqamat, which is an extract of those maqamats, can be regarded as the most important and exquisite manuscript about the Sheikh's spiritual stations and Keramat. Indeed, the recognition, introduction and emendation of this manuscript will be an

  11. Agreement text contrastive analysis: English vs Russian

    OpenAIRE

    STEPANOVA V.V.; KISELEVA L.A.

    2014-01-01

    The article studies the incongruity of the structure and stylistic organization of an agreement as one of the types of legal discourse in English and Russian. The comparative analysis of certain stylistic differences connected with the structure and content of agreement and the tips for translation suggested in this concise work are aimed at better understanding and interpreting the legal document of the given format.

  12. Science and Technology Text Mining Basic Concepts

    National Research Council Canada - National Science Library

    Losiewicz, Paul

    2003-01-01

    ...). It then presents some of the most widely used data and text mining techniques, including clustering and classification methods, such as nearest neighbor, relational learning models, and genetic...

  13. Creative and stylistic devices employed by children during a storybook narrative task: a cross-cultural study.

    Science.gov (United States)

    Gorman, Brenda K; Fiestas, Christine E; Peña, Elizabeth D; Clark, Maya Reynolds

    2011-04-01

    The purpose of this study was to analyze the effects of culture on the creative and stylistic features children employ when producing narratives based on wordless picture books. Participants included 60 first- and second-grade African American, Latino American, and Caucasian children. A subset of narratives based on wordless picture books collected as part of a larger study was coded and analyzed for the following creative and stylistic conventions: organizational style (topic centered, linear, cyclical), dialogue (direct, indirect), reference to character relationships (nature, naming, conduct), embellishment (fantasy, suspense, conflict), and paralinguistic devices (expressive sounds, exclamatory utterances). Many similarities and differences between ethnic groups were found. No significant differences were found between ethnic groups in organizational style or use of paralinguistic devices. African American children included more fantasy in their stories, Latino children named their characters more often, and Caucasian children made more references to the nature of character relationships. Even within the context of a highly structured narrative task based on wordless picture books, culture influences children's production of narratives. Enhanced understanding of narrative structure, creativity, and style is necessary to provide ecologically valid narrative assessment and intervention for children from diverse cultural backgrounds.

  14. Towards secondary fingerprint classification

    CSIR Research Space (South Africa)

    Msiza, IS

    2011-07-01

    Full Text Available fingerprints that have a ?x that is greater than 30 pixels, and the same reasoning can be attributed to the mis-classification of some of the RL class fingerprints. Possible 2011 International Conference on Computer Engineering and Applications (ICCEA 2011...]. Even though the concept of sample classification applies to systems that use almost any biometric modality, this manuscript focuses on fingerprint classification, with immediate application to an automated fingerprint recognition system...

  15. A Corpus-Based Stylistic Analysis of "Body-Soul" and "Heaviness-Lightness" Metaphors in Kundera's Novel "The Unbearable Lightness of Being"

    Science.gov (United States)

    Hussein, Khalid Shakir; Ameer, May Ali Abdul

    2015-01-01

    This paper represents an attempt to conduct a corpus-based stylistic analysis of two conceptual metaphors in "The Unbearable Lightness of Being," which is a novel written by Milan Kundera. "Soul-body" and "lightness-heaviness" metaphors are foregrounded as being central themes all through the novel. The way such…

  16. Managing interactions between technological and stylistic innovation in the media industries, insights from the introduction of ebook technology in the publishing industry

    NARCIS (Netherlands)

    T.S. Schweizer (Sophie)

    2002-01-01

    The mainstream of innovation research pays a lot of attention to technological innovation, but has neglected its interaction with another type of innovation, which is particularly important in sectors like the furniture, fashion and the media content industries: stylistic innovation.

  17. AHP 45: REVIEW: TIBETAN LITERARY GENRES, TEXTS, AND TEXT TYPES

    Directory of Open Access Journals (Sweden)

    Tricia Kehoe

    2017-03-01

    Full Text Available Intended as a follow-up to Cabezón and Jackson's groundbreaking Tibetan Literature: Studies in Genre (1996), Tibetan Literary Genres, Texts, and Text Types: From Genre Classification to Transformation aims to deepen our understandings of Tibetan literature by approaching Tibetan text types from systematic and historical perspectives. Growing out of a conference panel at the twelfth Tibetan Studies seminar, the book explores both pre-modern and contemporary genres, as well as issues of classification and methodologies. In doing so, this collection of essays edited by Jim Rheingans covers a great deal of new ground in terms of discussions of terminology, definitions, and the theoretical landscape pertaining to literature, genre, text boundaries, and typologies in the field of Tibetan literature. ...

  18. Classification of Flotation Frothers

    Directory of Open Access Journals (Sweden)

    Jan Drzymala

    2018-02-01

    Full Text Available In this paper, a scheme for classifying flotation frothers is presented. The scheme first indicates the physical system in which a frother is present; four such systems are distinguished, i.e., pure state, aqueous solution, aqueous solution/gas system and aqueous solution/gas/solid system. As a result, there are numerous classifications of flotation frothers. The classifications can be organized into a scheme described in detail in this paper. The paper shows that a meaningful classification of frothers relies on choosing the physical system and then the feature, trend, parameter or parameters according to which the classification is performed. The proposed classification can play a useful role in the characterization and evaluation of flotation frothers.

  19. Text location in color documents

    Science.gov (United States)

    Jain, Anil K.; Namboodiri, Anoop M.; Jung, Keechul

    2003-01-01

    Many document images contain both text and non-text (images, line drawings, etc.) regions. An automatic segmentation of such an image into text and non-text regions is extremely useful in a variety of applications. Identification of text regions helps in text recognition applications, while the classification of an image into text and non-text regions helps in processing the individual regions differently in applications like page reproduction and printing. One of the main approaches to text detection is based on modeling the text as a texture. We present a method based on a combination of neural networks (texture-based) and connected component analysis to detect text in color documents with busy foreground and background. The proposed method achieves an accuracy of 96% (by area) on a test set of 40 documents.

  20. Information gathering for CLP classification

    Directory of Open Access Journals (Sweden)

    Ida Marcello

    2011-01-01

    Full Text Available Regulation 1272/2008 includes provisions for two types of classification: harmonised classification and self-classification. The harmonised classification of substances is decided at Community level and a list of harmonised classifications is included in Annex VI of the classification, labelling and packaging Regulation (CLP). If a chemical substance is not included in the harmonised classification list it must be self-classified, based on available information, according to the requirements of Annex I of the CLP Regulation. CLP stipulates that harmonised classification will be performed for carcinogenic, mutagenic or toxic to reproduction substances (CMR substances) and for respiratory sensitisers category 1, and for other hazard classes on a case-by-case basis. The first step of classification is the gathering of available and relevant information. This paper presents the procedure for gathering information and obtaining data. The data quality is also discussed.

  1. Pragmatic Classification of the Texts in Turkish Course Books (Türkçe Ders Kitaplarında Yer Alan Metinlerin Edimsel Sınıflandırılması)

    Directory of Open Access Journals (Sweden)

    İ. Seçkin AYDIN

    2013-03-01

    Full Text Available Globally, one of the most discussed problems in education is the issue of teaching in mother tongue and in a foreign language. Accordingly, various methods and techniques have been tried. In any case, using a text is indispensable. Especially listening, reading, speaking, writing and grammar drills are conducted in the classroom environment. Therefore, text preference is an important element in language teaching. These texts are determined according to certain criteria. The texts determined upon criteria are used to improve the language skills of students. Certain linguistic criteria have to be determined for text choice in language teaching. In this context, functional display in the texts chosen is crucial for students to grasp the potential of language. The aim of this research is to assess the texts in 8th Grade Turkish Course Books in a pragmatic perspective. Survey method was used in the research. The statements in the texts were classified according to “speech acts theory” of Searle. In the research, the texts in the common theme, “Social Life”, were analyzed. The texts assessed in three course books were limited with a theme and prose types. The criteria were grouped under the following headings: commissives, declarations, directives, expressives and assertives. It was determined that most of the expressions in all the course books were descriptive and expressive while the others were insufficient. Using various expressions in texts contributes to perceptive and expressive skills of students. It was suggested in the research that expression types diverse in pragmatic aspect should be used in the texts for students.

  2. DESCRIPTION OF CLASSIFICATION SYSTEMS OF LIBRARY CATALOGUES

    Directory of Open Access Journals (Sweden)

    Zoya V. Savchenko

    2010-08-01

    Full Text Available The article analyses the most widespread universal library classifications: the Dewey Decimal Classification (DKD), the Universal Decimal Classification (UDK), the Library of Congress Classification (KBK), and the Library-Bibliographic Classification (BBK). It presents the history of these systems' development and the features of their use in modern information services, as well as an example of constructing electronic library classifiers on the basis of the DKD and KBK library classifications.

  3. Text World Theory and real world readers: From literature to life in a Belfast prison

    OpenAIRE

    Canning, Patricia

    2017-01-01

    Cognitive stylistics offers a range of frameworks for understanding (amongst other things) what producers of literary texts ‘do’ with language and how they ‘do’ it. Less prevalent, however, is an understanding of the ways in which these same frameworks offer insights into what readers ‘do’ (and how they ‘do’ it). Text World Theory (Werth, 1999; Gavins, 2007; Whiteley, 2011) has proved useful for understanding how and why readers construct mental representations engendered by the act of readin...

  4. Making School Development Credible. Text, Context, Irony

    Directory of Open Access Journals (Sweden)

    Mats Börjesson

    2012-01-01

    Full Text Available

    The article argues for the importance of an open, reflexive-methodological approach when switching between studying text, context and researcher activity. Close linguistic analysis can benefit from being linked with the researcher’s contextualisation of his empirical material as well as with more distanced readings. The more specific starting point for this article is that school development, like other similar terms such as school improvement and the like, makes use of linguistic building blocks with which whole narratives about today’s and tomorrow’s schools can be constructed. The subject of the study is a short text issued by the Swedish Schools Inspectorate (Skolinspektionen). Government language changes according to the authorities’ role in society and their own definitions of their functions, and an important aspect here is the legitimacy of the authorities’ texts. By means of various kinds of close linguistic analysis, the above-mentioned text is studied with regard to choice of categories, hierarchies of modalisation and the rhetorical effects of different types of formulations in a broader political-social landscape. The article concludes with a reflective discussion on the relationship between government language and irony as a stylistic device – a device that is based on the results of the close empirical analysis.[i]



    [i] The article is part of the project ”School  Development as Narrative”, funded by the Swedish Research Council. The author would like to thank the two reviewers for very valuable comments.

  5. Text Mining the Biomedical Literature

    Science.gov (United States)

    2007-11-05

    LECTURE NOTES IN COMPUTER SCIENCE Gelbukh, A; Sidorov, G; Guzman-Arenas, A. 1999. Use of a weighted topic hierarchy for document classification...matrix decomposition. ACM TRANSACTIONS ON MATHEMATICAL SOFTWARE 26 (3): 415-435. Kongovi, M; Guzman, JC; Dasigi, V. 2002. Text categorization: An...RECOGNITION, SPEECH AND IMAGE ANALYSIS 2905: 596-603. LECTURE NOTES IN COMPUTER SCIENCE Porter, AL; Kongthon, A; Lui, JC. 2002. Research profiling

  6. Active Discriminative Text Representation Learning

    OpenAIRE

    Zhang, Ye; Lease, Matthew; Wallace, Byron C.

    2016-01-01

    We propose a new active learning (AL) method for text classification with convolutional neural networks (CNNs). In AL, one selects the instances to be manually labeled with the aim of maximizing model performance with minimal effort. Neural models capitalize on word embeddings as representations (features), tuning these to the task at hand. We argue that AL strategies for multi-layered neural models should focus on selecting instances that most affect the embedding space (i.e., induce discrim...
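
    The selection step that the abstract describes can be illustrated with a generic pool-based baseline. The sketch below uses entropy-based uncertainty sampling, which is a swapped-in, simpler strategy and not the embedding-focused method the authors propose; the function names and the toy probabilities are assumptions made for illustration only.

```python
import math

def entropy(probs):
    """Shannon entropy (in nats) of one predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0.0)

def select_for_labeling(pool_probs, k):
    """Return the indices of the k unlabeled instances whose predictions
    are most uncertain (highest entropy). Hypothetical helper, not the
    paper's selection criterion."""
    ranked = sorted(range(len(pool_probs)),
                    key=lambda i: entropy(pool_probs[i]),
                    reverse=True)
    return ranked[:k]

# Toy pool: model-predicted class probabilities for three unlabeled texts.
pool = [[0.98, 0.02], [0.55, 0.45], [0.80, 0.20]]
print(select_for_labeling(pool, 2))  # the two least confident predictions
```

    In an active learning loop, the selected instances would be sent to an annotator, added to the training set, and the model retrained before the next round of selection.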

  7. Expected Classification Accuracy

    Directory of Open Access Journals (Sweden)

    Lawrence M. Rudner

    2005-08-01

    Full Text Available Every time we make a classification based on a test score, we should expect some number of misclassifications. Some examinees whose true ability is within a score range will have observed scores outside of that range. A procedure for providing a classification table of true and expected scores is developed for polytomously scored items under item response theory and applied to state assessment data. A simplified procedure for estimating the table entries is also presented.
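
    The idea of expected misclassification can be sketched numerically. The example below assumes that observed scores are normally distributed around the true score with a known standard error of measurement; this is a simplifying assumption for illustration and is not the polytomous item response theory procedure the paper develops. The function names, cut scores and numbers are hypothetical.

```python
import math

def normal_cdf(x, mean, sd):
    """Cumulative distribution function of a normal distribution."""
    return 0.5 * (1.0 + math.erf((x - mean) / (sd * math.sqrt(2.0))))

def classification_probs(true_score, sem, cuts):
    """Probability that the observed score falls in each category.

    true_score: the examinee's true score (assumed known here)
    sem:        standard error of measurement (assumed normal error)
    cuts:       ascending cut scores separating the categories
    """
    bounds = [-math.inf] + list(cuts) + [math.inf]
    probs = []
    for lo, hi in zip(bounds[:-1], bounds[1:]):
        p_hi = 1.0 if hi == math.inf else normal_cdf(hi, true_score, sem)
        p_lo = 0.0 if lo == -math.inf else normal_cdf(lo, true_score, sem)
        probs.append(p_hi - p_lo)
    return probs

# An examinee whose true score sits exactly on a single cut score is
# expected to be placed on either side of it with equal probability.
print(classification_probs(50.0, 5.0, [50.0]))
```

    Repeating this calculation for each true-score level gives one row of a classification table; the off-diagonal entries are the expected misclassifications.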

  8. A Customizable Text Classifier for Text Mining

    Directory of Open Access Journals (Sweden)

    Yun-liang Zhang

    2007-12-01

    Full Text Available Text mining deals with complex and unstructured texts. Usually a particular collection of texts that is specified to one or more domains is necessary. We have developed a customizable text classifier for users to mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by user. The user can also control the number of domains chosen and decide the standard with which to choose the texts based on demand and abundance of materials. The performance of the classifier varies with the user's choice.

  9. Text-Attentional Convolutional Neural Network for Scene Text Detection.

    Science.gov (United States)

    He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

    2016-06-01

    Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature globally computed from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this paper, we present a new system for scene text detection by proposing a novel text-attentional convolutional neural network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with multi-level and rich supervised information, including text region mask, character label, and binary text/non-text information. The rich supervision information enables the Text-CNN with a strong capability for discriminating ambiguous texts, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where low-level supervised information greatly facilitates the main task of text/non-text classification. In addition, a powerful low-level detector called contrast-enhancement maximally stable extremal regions (MSERs) is developed, which extends the widely used MSERs by enhancing intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in a higher recall. Our approach achieved promising results on the ICDAR 2013 data set, with an F-measure of 0.82, substantially improving the state-of-the-art results.
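
    The F-measure reported above combines precision and recall into a single score. The sketch below assumes the common balanced form (the harmonic mean of precision and recall); the abstract does not state the precision and recall behind the 0.82 figure, so the input values in the example are purely hypothetical and are not the paper's results.

```python
def f_measure(precision, recall):
    """Balanced F-measure: harmonic mean of precision and recall."""
    if precision + recall == 0.0:
        return 0.0  # convention when both precision and recall are zero
    return 2.0 * precision * recall / (precision + recall)

# Hypothetical precision/recall pair, chosen only to show the arithmetic.
print(round(f_measure(0.90, 0.75), 4))
```

    Because the harmonic mean is pulled toward the smaller of the two values, a detector cannot reach a high F-measure by improving recall alone at the cost of precision, or the reverse.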

  10. The stylistic significance of focus constructions in ofò* | Ajayi ...

    African Journals Online (AJOL)

    This study discusses focus constructions in Yoruba ofò. Across ten categories of ofò the frequency of these constructions is very high, and it can be argued that they are structurally important to each text. We suggest that this pattern helps to reduce ambiguity and also to create parallelism between verse lines, thereby ...

  11. A Stylistic Study on the Linguistic Deviations in E. E. Cummings' Poetry

    Science.gov (United States)

    Li, Xin; Shi, Mengchen

    2015-01-01

    Regarded as the pioneer of experimental poetry, E. E. Cummings' unconventional treatment of poetic language has reached an unprecedented acme, which has intrigued and baffled numerous scholars, researchers and readers alike. Nevertheless, the very existence of poetry, like other types of literary texts, demonstrates the significance and value of…

  12. ISSVA classification.

    Science.gov (United States)

    Dasgupta, Roshni; Fishman, Steven J

    2014-08-01

    In 1982, Mulliken and Glowacki created a classification system that divided vascular anomalies into tumors and malformations, which provided the framework for great advances in the management of these patients. This classification system was recently expanded at the 2014 ISSVA workshop in Melbourne. This revision again provides much greater detail, including newly named anomalies and identified genes, to account for recent advances in knowledge and clinical associations. Copyright © 2014. Published by Elsevier Inc.

  13. Text-Attentional Convolutional Neural Networks for Scene Text Detection.

    Science.gov (United States)

    He, Tong; Huang, Weilin; Qiao, Yu; Yao, Jian

    2016-03-28

    Recent deep learning models have demonstrated strong capabilities for classifying text and non-text components in natural images. They extract a high-level feature computed globally from a whole image component (patch), where the cluttered background information may dominate true text features in the deep representation. This leads to less discriminative power and poorer robustness. In this work, we present a new system for scene text detection by proposing a novel Text-Attentional Convolutional Neural Network (Text-CNN) that particularly focuses on extracting text-related regions and features from the image components. We develop a new learning mechanism to train the Text-CNN with multi-level and rich supervised information, including text region mask, character label, and binary text/non-text information. The rich supervision information enables the Text-CNN with a strong capability for discriminating ambiguous texts, and also increases its robustness against complicated background components. The training process is formulated as a multi-task learning problem, where low-level supervised information greatly facilitates the main task of text/non-text classification. In addition, a powerful low-level detector called Contrast-Enhancement Maximally Stable Extremal Regions (CE-MSERs) is developed, which extends the widely used MSERs by enhancing intensity contrast between text patterns and background. This allows it to detect highly challenging text patterns, resulting in a higher recall. Our approach achieved promising results on the ICDAR 2013 dataset, with an F-measure of 0.82, improving the state-of-the-art results substantially.

  14. Federico Federici. Translation as Stylistic Evolution: Italo Calvino Creative Translator of Raymond Queneau

    Directory of Open Access Journals (Sweden)

    Tânia Mara Moysés

    2012-07-01

    Full Text Available http://dx.doi.org/10.5007/2175-7968.2012v1n29p183   The review deals with Federico Federici's analysis of Italo Calvino's translation strategies in the translation, titled I fiori blu, of Raymond Queneau's novel Les Fleurs bleues,

  15. The stylistic significance of focus constructions in ofò* | Ajayi ...

    African Journals Online (AJOL)

    ... frequencies are very important to each text. From this perspective, we propose that this pattern helps to minimize ambiguity and to create parallelism between verse lines. In doing so, the comprehension and effectiveness of these kinds of magical utterances improve. Institute of African Studies: Research Review No.

  16. A Discourse-Based View in Interdisciplinary Approaches to Fictional Text Analysis

    Directory of Open Access Journals (Sweden)

    Альсина Соуса

    2017-12-01

    Full Text Available As patterns of communication change in a globalized society, literacy in foreign languages, especially English, becomes an issue of ever growing relevance to all those involved in the educational system, not to mention those who are to learn all their life long. As such, the goal of this article is to discuss how EFLit (English as a Foreign Literature) students can gain in both linguistic competence and critical awareness thereof, should their teachers/lecturers abide by a discourse-based view on (literary) language and approach the selected texts by following a pedagogical stylistics orientation also drawing eclectically on pragmatics and other areas of knowledge within the broader domain of applied linguistics. Here under focus will be a discussion of the topics on which literary and linguistic studies show greatest potential for (theoretical) convergence and, above all, combined applications in lecture setting. Crucially, it will be argued that a pedagogical stylistics approach to EFLit teaching/learning both develops students’ linguistic competence and raises their awareness as to the meaning making potential of language in use in the texts at hand as well as in their larger historical and sociocultural settings. This will be illustrated by highlighting some textual features within a short extract of Fred D’Aguiar’s The Longest Memory (1995) and the linguistic competence that its comprehension would demand from students.

  17. Adaptation Technique in the Translation of Literary Texts (as Exemplified in the Translation of French and English Literary Texts

    Directory of Open Access Journals (Sweden)

    Полина Сергеевна Тасенко

    2016-12-01

    Full Text Available This article describes some particularities of the translation of literary texts from French and English into Russian, as exemplified in the novel “The Lady of the Camellias” by A. Dumas and the novel “Wuthering Heights” by E. Brontë. The article examines in particular the use of the adaptation technique in order to confirm the hypothesis that this translation transformation is connected with conveying the original's stylistic features. This transformation is applied in the translation of some phenomena and phrases in the original text that can barely be translated because of their national singularity, a significant time gap, or peculiarity for other culture bearers. The theoretical foundation for the article is research on literary style [Barkhudarov, 1975; Komissarov, 1990; Chaikovskij, 1997; Chukovskij, 2014; Baker, 2011], as well as research on the use of adaptation in the translation of literary texts [Alekseeva, 2004; Vlakhov, Florin, 1980; Shchetinkin, 1987; Clifford, Landers, 2001; Newmark, 1988]. The research methods used in the study are the following: relevant scientific literature analysis, statistical approach, comparative study of literary texts and their translations. A thorough analysis of the nature of adaptation enabled us to specify a range of translation transformations which allow adaptation to be applied in the translation of literary texts. According to the results of the research, the informative as well as the aesthetic function plays a very important role in the translation of literary texts.

  18. The Form is the Substance: Classification of Genres in Text

    Science.gov (United States)

    2001-01-01

    Mean value with one standard deviation error margin.) 4.2 Discussion Use of the presentation feature set yields a significant advantage over use of word...number of genres and increased corpus size. 6 References Biber, D. (1995) "Variation Across Speech and Writing", Cambridge University Press, New York...Support Vector Machines", Cambridge University Press, . Flesch, R. (1974) "The Art of Readable Writing", Harper and Row, New York. Illouz, G., B.Habert

  19. Context-Sensitive Sentiment Classification of Short Colloquial Text

    NARCIS (Netherlands)

    Blenn, N.; Charalampidou, K.; Doerr, C.

    The wide-spread popularity of online social networks and the resulting availability of data to researchers has enabled the investigation of new research questions, such as the analysis of information diffusion and how individuals are influencing opinion formation in groups. Many of these new

  20. Machine learning versus knowledge based classification of legal texts

    NARCIS (Netherlands)

    de Maat, E.; Krabben, K.; Winkels, R.

    2010-01-01

    This paper presents results of an experiment in which we used machine learning (ML) techniques to classify sentences in Dutch legislation. These results are compared to the results of a pattern-based classifier. Overall, the ML classifier performs as accurately (>90%) as the pattern-based one, but

  1. Investigation into Text Classification With Kernel Based Schemes

    Science.gov (United States)

    2010-03-01

    began with scientific publications and library records but quickly spread to other forms of content, particularly those of information professionals...the user has no a-priori knowledge about the association between specific data and desired classes. In such problems, the learner only has access to... [table residue: numbered three-letter codes such as nec, tfx, bfx, lfx, afx, nfx, tfc, bfc, lfc, afc, nfc, tgx, bgx, lgx, agx, ngx, tgc]

  2. Isolation and Communication A Stylistic Analysis of Thought Presentation in Mrs. Dalloway

    Directory of Open Access Journals (Sweden)

    Hua Guo

    2017-02-01

    Full Text Available Mrs. Dalloway by Virginia Woolf is well-acclaimed for its almost non-intrusive portrayal of characters’ state of mind. Many studies approach it from biographical, socio-historical, philosophical, and other non-linguistic perspectives, and most linguistic investigations deal with illustrative examples of a single linguistic device in this novel. Few are concerned with the presence of particular linguistic patterns that explain how the intricate flow of thought is successfully depicted. This paper offers a detailed elaboration on the criteria for categorizing thought presentation in Leech & Short’s model and distinguishes cases of ambiguity. A case study of Mrs. Dalloway’s flower purchase scene illustrates how different types of thought presentation along with different reporting clauses are used to convey the variation in the character’s mental state and the negotiation between her inner voice and the outside world.

  3. Machine learning approach for text and document mining

    OpenAIRE

    Bijalwan, Vishwanath; Kumari, Pinki; Pascual, Jordan; Semwal, Vijay Bhaskar

    2014-01-01

    Text Categorization (TC), also known as Text Classification, is the task of automatically classifying a set of text documents into different categories from a predefined set. If a document belongs to exactly one of the categories, it is a single-label classification task; otherwise, it is a multi-label classification task. TC uses several tools from Information Retrieval (IR) and Machine Learning (ML) and has received much attention in recent years from researchers in both academia and ...
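
    The single-label versus multi-label distinction described above can be sketched as follows. This is only an illustration: the categories, keyword sets, and the `threshold` parameter are hypothetical and do not come from the record, and a real system would learn its model from labelled data rather than use hand-written keywords.

```python
from collections import Counter

# Hypothetical categories and keywords, chosen only for illustration.
CATEGORY_KEYWORDS = {
    "sports": {"match", "goal", "team"},
    "finance": {"market", "stock", "bank"},
    "health": {"doctor", "vaccine", "hospital"},
}


def score(text):
    """Count keyword occurrences per category in a whitespace-tokenized text."""
    words = Counter(text.lower().split())
    return {cat: sum(words[w] for w in kws) for cat, kws in CATEGORY_KEYWORDS.items()}


def classify_single_label(text):
    """Single-label task: return exactly one category, the highest-scoring one."""
    scores = score(text)
    return max(scores, key=scores.get)


def classify_multi_label(text, threshold=1):
    """Multi-label task: return every category whose score reaches the threshold."""
    scores = score(text)
    return sorted(cat for cat, s in scores.items() if s >= threshold)
```

    The two functions share the same scoring step; only the decision rule differs, which is the point of the distinction.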

  4. The linguistic and stylistic content in teaching Spanish in Gabon: attitudes and teaching practices

    Directory of Open Access Journals (Sweden)

    Jeanne Berthe MAKAYA MAVOUNGOU

    2016-07-01

    Full Text Available We have long been puzzled by the position of most Gabonese teachers on whether the linguistic and semantic aspects of a document should be studied, in part or in full, when it is explained and commented on. Teaching Spanish language and literature requires the teacher to know and take into account the synergy between the different genres, aesthetic expression, semantic content, and pragmatic use. For this reason, the teacher is expected to draw on the contributions of related disciplines such as linguistics, aesthetics, sociology, and psychology, which continue to suggest to pedagogy and didactics other ways of treating content suited to the Spanish language classroom. This study contributes mainly to the analysis of teaching practices and of what they gain from semantic and linguistic approaches.

  5. Tissue Classification

    DEFF Research Database (Denmark)

    Van Leemput, Koen; Puonti, Oula

    2015-01-01

    Computational methods for automatically segmenting magnetic resonance images of the brain have seen tremendous advances in recent years. So-called tissue classification techniques, aimed at extracting the three main brain tissue classes (white matter, gray matter, and cerebrospinal fluid), are now...... well established. In their simplest form, these methods classify voxels independently based on their intensity alone, although much more sophisticated models are typically used in practice. This article aims to give an overview of often-used computational techniques for brain tissue classification...

  6. Framework for Urdu News Headlines Classification

    Directory of Open Access Journals (Sweden)

    Kashif AHMED

    2016-04-01

    Full Text Available Automatic text classification has great significance in the field of text mining and plays a pivotal role in areas such as spam filtering, news classification, and noise reduction. The literature shows ample research on classifying text documents (e.g., English news classification, Persian text classification), but little work on short Urdu text or Urdu news headline classification. Therefore, after examining various existing news classification methodologies, we propose an SVM-based framework for classifying Urdu news headlines. This approach classifies Urdu news headlines into their pre-defined categories by utilizing the maximum indexes of their feature vectors. The proposed system is compared with existing state-of-the-art techniques.

  7. Two Aspects of Language and a Poem: One Stylistics Analysis of "Água Forte"

    Directory of Open Access Journals (Sweden)

    Isabel Cristina Ferreira Teixeira

    2016-07-01

    Full Text Available This study is organized around a classroom experience, specifically in Linguistic Studies, a course taken by students in the first semesters of the undergraduate teaching degree in Letters (Licenciatura em Letras). In it we begin by reflecting on how structuralist ideas constituted linguistics as a discipline. Jakobson soon appears with his founding articles on the axes of language organization associated with aphasia and on the functions of language, characterizing the poetic function as the one that “projects the principle of equivalence from the axis of selection onto the axis of combination” (1985, p. 130). But how are we to understand these concepts, which describe the expressiveness of linguistic elements? The idea is to apply them to language in its different uses. The poem is one of these possibilities. We then present the study, carried out for the students and with the students, which analyzes the workings of the two polar figures of style, metaphor and metonymy, as applied to the poem “Água-forte” by Manuel Bandeira (1986). Based on the structuralist principles of Saussure (1974) and Jakobson (1985), we relate the axes of language organization to these figures. This analysis favors the discovery of meaning, produces possibilities for reading and, above all, allows the observation of discursive effects related to knowledge of the expressiveness of linguistic elements.

  8. An Intelligent System For Arabic Text Categorization

    NARCIS (Netherlands)

    Syiam, M.M.; Tolba, Mohamed F.; Fayed, Z.T.; Abdel-Wahab, Mohamed S.; Ghoniemy, Said A.; Habib, Mena Badieh

    Text Categorization (classification) is the process of classifying documents into a predefined set of categories based on their content. In this paper, an intelligent Arabic text categorization system is presented. Machine learning algorithms are used in this system. Many algorithms for stemming and

  9. A quick survey of text categorization algorithms

    OpenAIRE

    Dan MUNTEANU

    2007-01-01

    This paper contains an overview of basic formulations and approaches to text classification. This paper surveys the algorithms used in text categorization: handcrafted rules, decision trees, decision rules, on-line learning, linear classifier, Rocchio’s algorithm, k Nearest Neighbor (kNN), Support Vector Machines (SVM).
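
    Of the algorithms listed above, k Nearest Neighbor is perhaps the easiest to sketch. The version below is a minimal illustration, not the survey's own formulation: it assumes raw term counts as features, cosine similarity as the distance measure, and a tiny made-up training set (`TRAINING`).

```python
import math
from collections import Counter


def vectorize(text):
    """Bag-of-words vector: raw term counts of a whitespace-tokenized text."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two term-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def knn_classify(text, training, k=3):
    """Label a text by majority vote among its k most similar training documents."""
    query = vectorize(text)
    ranked = sorted(training, key=lambda ex: cosine(query, vectorize(ex[0])), reverse=True)
    votes = Counter(label for _, label in ranked[:k])
    return votes.most_common(1)[0][0]


# Made-up training documents, for illustration only.
TRAINING = [
    ("stock market falls sharply", "finance"),
    ("bank raises interest rates", "finance"),
    ("central bank watches the market", "finance"),
    ("team wins the final match", "sports"),
    ("striker scores late goal", "sports"),
    ("coach praises the team", "sports"),
]
```

    In practice, stop-word removal and tf-idf weighting usually matter a great deal for kNN on text; both are omitted here to keep the sketch short.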

  10. Xenolog classification.

    Science.gov (United States)

    Darby, Charlotte A; Stolzer, Maureen; Ropp, Patrick J; Barker, Daniel; Durand, Dannie

    2017-03-01

    Orthology analysis is a fundamental tool in comparative genomics. Sophisticated methods have been developed to distinguish between orthologs and paralogs and to classify paralogs into subtypes depending on the duplication mechanism and timing, relative to speciation. However, no comparable framework exists for xenologs: gene pairs whose history, since their divergence, includes a horizontal transfer. Further, the diversity of gene pairs that meet this broad definition calls for classification of xenologs with similar properties into subtypes. We present a xenolog classification that uses phylogenetic reconciliation to assign each pair of genes to a class based on the event responsible for their divergence and the historical association between genes and species. Our classes distinguish between genes related through transfer alone and genes related through duplication and transfer. Further, they separate closely-related genes in distantly-related species from distantly-related genes in closely-related species. We present formal rules that assign gene pairs to specific xenolog classes, given a reconciled gene tree with an arbitrary number of duplications and transfers. These xenology classification rules have been implemented in software and tested on a collection of ∼13 000 prokaryotic gene families. In addition, we present a case study demonstrating the connection between xenolog classification and gene function prediction. The xenolog classification rules have been implemented in Notung 2.9, a freely available phylogenetic reconciliation software package (http://www.cs.cmu.edu/~durand/Notung). Gene trees are available at http://dx.doi.org/10.7488/ds/1503. Contact: durand@cmu.edu. Supplementary data are available at Bioinformatics online.

  11. Transporter Classification Database (TCDB)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Transporter Classification Database details a comprehensive classification system for membrane transport proteins known as the Transporter Classification (TC)...

  12. Medical imbalanced data classification

    Directory of Open Access Journals (Sweden)

    Sara Belarouci

    2017-04-01

    Full Text Available Imbalanced datasets are a common problem in health applications. In medical data classification, the number of samples per class is often imbalanced, with at least one class constituting only a very small minority of the data. At the same time, this imbalance poses a difficult problem for most machine learning algorithms. Many works have dealt with the classification of imbalanced datasets. In this paper, we propose a learning method based on a cost-sensitive extension of the Least Mean Square (LMS) algorithm that penalizes errors of different samples with different weights, together with some rules of thumb to determine those weights. After the balancing phase, we apply different techniques (Support Vector Machine [SVM], K-Nearest Neighbor [K-NN], and Multilayer Perceptron [MLP]) to the balanced datasets. We also compare the results obtained before and after balancing. Our results are the best compared with the literature, with a classification accuracy of 100%.
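
    The idea of penalizing errors with per-sample weights can be sketched as a weighted LMS update. The code below is an illustration under stated assumptions, not the authors' method: the weighting rule (inverse class frequency), the learning rate, the epoch count, and the toy data are all hypothetical, since the abstract does not give the actual rules of thumb.

```python
def class_costs(labels):
    """Assumed rule of thumb: cost = n / (number of classes * class count)."""
    n = len(labels)
    counts = {c: labels.count(c) for c in set(labels)}
    return {c: n / (len(counts) * counts[c]) for c in counts}


def train_cost_sensitive_lms(samples, labels, lr=0.05, epochs=200):
    """LMS (Widrow-Hoff) training where each error is scaled by its class cost."""
    costs = class_costs(labels)
    w = [0.0] * len(samples[0])
    b = 0.0
    for _ in range(epochs):
        for x, y in zip(samples, labels):
            err = y - (sum(wi * xi for wi, xi in zip(w, x)) + b)
            step = lr * costs[y] * err  # minority-class errors move the weights more
            w = [wi + step * xi for wi, xi in zip(w, x)]
            b += step
    return w, b


def predict(w, b, x):
    """Threshold the linear output at zero; labels are +1 and -1."""
    return 1 if sum(wi * xi for wi, xi in zip(w, x)) + b >= 0 else -1


# Toy one-dimensional data: eight majority (-1) samples, two minority (+1) samples.
SAMPLES = [[0.0], [0.1], [0.2], [0.3], [0.4], [0.5], [0.6], [0.7], [1.5], [1.7]]
LABELS = [-1, -1, -1, -1, -1, -1, -1, -1, 1, 1]
```

    With these weights each class contributes the same total cost, so the fitted boundary is not pulled toward the minority class simply because it has fewer samples.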

  13. Stylistics and comics

    NARCIS (Netherlands)

    Forceville, C.; El Refaie, E.; Meesters, G.; Burke, M.

    2014-01-01

    Comics is rapidly developing into a scholarly discipline in its own right, with a growing output of books, journals, and conferences. Two practitioners have been crucial influences. Eisner (1985) and McCloud (1993, 2000, 2006) have done much to aid the understanding of how comics create meaning. But

  14. FACET CLASSIFICATIONS OF E-LEARNING TOOLS

    Directory of Open Access Journals (Sweden)

    Olena Yu. Balalaieva

    2013-12-01

    Full Text Available The article deals with the classification of e-learning tools based on the facet method, which separates a set of objects in parallel into independent classification groups. The method assumes neither a rigid classification structure nor pre-built finite groups; instead, classification groups are formed by combining values taken from the relevant facets. For the first time, an attempt is made to systematize the existing classifications of e-learning tools from the standpoint of classification theory. Modern Ukrainian and foreign facet classifications of e-learning tools are described, and their positive and negative features compared with classifications based on a hierarchical method are analyzed. The author's original facet classification of e-learning tools is proposed.

  15. Classification of Cortical Brain Malformations

    Directory of Open Access Journals (Sweden)

    J Gordon Millichap

    2008-03-01

    Full Text Available Clinical, radiological, and genetic classifications of 113 cases of malformations of cortical development (MCD) were evaluated at the Erasmus Medical Center-Sophia Children's Hospital, Rotterdam, the Netherlands.

  16. APPROACHES TO VISUALIZATION OF LARGE VOLUMETRIC TEXT DOCUMENTS

    Directory of Open Access Journals (Sweden)

    О Ю Заславская

    2016-12-01

    Full Text Available The article discusses the relevance of data visualization as a way to improve the perception of material and so support sound professional decisions. The author describes the steps for creating verbal and graphic materials from multipage text documents, using as an example the analysis and structuring of the information in the Development Strategy of the State Autonomous Educational Institution “Moscow City Pedagogical University” for the period until 2020. The steps include: setting a goal, defining the main directions of the objects to be rendered, studying the verbal material, becoming familiar with previously created visualization examples, creating schematic drawings, searching for and selecting resources, creating the visualization, and analyzing the work. The need for each stage is justified, and recommendations are given for implementing the visualized ideas. Particular attention is paid to isolating and mapping key items such as the logo of the educational institution, its full name, and its infrastructure. The paper sets out the basic approaches to visualizing documents that contain large amounts of text data: purposeful, analytical, creative, search, implementation-creative, and reflective. The multi-page University Development Strategy document is visualized as three logically and stylistically related illustrations, accompanied by a description of the objects they contain.

  17. PRIMERA SISTEMATIZACIÓN DE LAS CARACTERÍSTICAS ESTILÍSTICAS DE LA ALFARERÍA FINA DEL SITIO SORIA 2 (VALLE DE YOCAVIL, NOROESTE ARGENTINO / First systematization of stylistic characters of fine pottery from Soria 2 site (Yocavil, Northwestern Argentina

    Directory of Open Access Journals (Sweden)

    Romina Clara Spano

    2011-12-01

    Full Text Available This paper presents a first systematization of the features of the early pottery found at the site Soria 2 (Yocavil Valley, Northwestern Argentina), focusing the analysis on specimens belonging to the so-called fine pottery group. It aims to characterize a sample of the abundant ceramic material found in a primary context, for which there is a radiocarbon date from the beginning of the Christian era. The material is classified using the category of style, understood here as the integration of morphological, technological, and decorative aspects that converge in the “ways of doing” current during the occupation of the site. The analytical variables used are detailed: shape, manufacturing technique, paste, firing, surface treatment, and decoration. The conjunction of these variables is the basis for proposing stylistic modalities. In addition, the paper refers briefly to the practices in which the vessels were involved, taking into account the find contexts (domestic and funerary). The analysis suggests that some specimens in the studied sample show affinities with surrounding areas. Keywords: pottery; stylistic modalities; primary context; Formative; Yocavil Valley.

  18. XML and Free Text.

    Science.gov (United States)

    Riggs, Ken Roger

    2002-01-01

    Discusses problems with marking free text, text that is either natural language or semigrammatical but unstructured, that prevent well-formed XML from marking text for readily available meaning. Proposes a solution to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML. (Author/LRW)

  19. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  20. Simple Open Stance Classification for Rumour Analysis

    OpenAIRE

    Aker, Ahmet; Derczynski, Leon; Bontcheva, Kalina

    2017-01-01

    Stance classification determines the attitude, or stance, in a (typically short) text. The task has powerful applications, such as the detection of fake news or the automatic extraction of attitudes toward entities or events in the media. This paper describes a surprisingly simple and efficient classification approach to open stance classification in Twitter, for rumour and veracity classification. The approach profits from a novel set of automatically identifiable problem-specific features, ...

  1. E-text

    DEFF Research Database (Denmark)

    Finnemann, Niels Ole

    2018-01-01

    the print medium, rather than written text or speech. In the late 20th century, the notion of text was subject to increasing criticism, as in the question raised within literary text theory: is there a text in this class? At the same time, the notion was expanded by including extralinguistic sign modalities...... (images, videos). Thus, a basic question is this: should electronic text be included in the expanded notion of text as a new digital sign modality added to the repertoire of modalities, or should it be included as a sign modality, which is both an independent modality and a container in which other...

  2. Pragmatic Aspects of Quotation in Spanish Media Texts

    Directory of Open Access Journals (Sweden)

    Екатерина Дмитриевна Терентьева

    2016-12-01

    Full Text Available This paper examines the pragmatic functions of citation specific to Spanish media texts. The texts for analysis were chosen from the so-called national quality press, including El País, El Mundo, and ABC (2014-2016). The study combined interpretive text analysis with elements of comparative and contextual analysis. The role of mass media in molding public opinion, and as one of the most powerful instruments of social control, cannot be overemphasized. In addition to its awareness-raising and influence functions, citation is widely used in media texts to bring more credibility and expressiveness to the transmitted information. Presumably, all texts cited in the media can be divided into two categories: texts that are socially significant and relevant at a specific point in time, and so-called precedent texts repeatedly reproduced within a given community. Based on the analyzed material, three key pragmatic functions can be distinguished: substitution, illustration, and reasoning. This makes both direct and indirect quotation, which differ graphically in the presence or absence of quotation marks, the object of study. Characteristics specific to implicit citation are treated separately; implicit citation covers the citation of precedent texts used primarily for language play, and thus performs stylistic and advertising functions. The analysis of the pragmatic aspects of citation in the Spanish press clarifies some of the mechanisms used to influence the reader and shape public opinion.

  3. Searching for text documents

    NARCIS (Netherlands)

    Hiemstra, Djoerd; Blanken, Henk; de Vries, A.P.; Blok, H.E.; Feng, L.

    2007-01-01

    Many documents contain, besides text, also images, tables, and so on. This chapter concentrates on the text part only. Traditionally, systems handling text documents are called information storage and retrieval systems. Before the World-Wide Web emerged, such systems were almost exclusively used by

  4. Classification and regression trees

    CERN Document Server

    Breiman, Leo; Olshen, Richard A; Stone, Charles J

    1984-01-01

    The methodology used to construct tree structured rules is the focus of this monograph. Unlike many other statistical procedures, which moved from pencil and paper to calculators, this text's use of trees was unthinkable before computers. Both the practical and theoretical sides have been developed in the authors' study of tree methods. Classification and Regression Trees reflects these two sides, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.
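
    Constructing tree-structured rules comes down to repeatedly choosing the split that best separates the classes. The sketch below shows one such step for a single numeric feature. It assumes the Gini impurity as the splitting criterion and midpoints between distinct values as candidate thresholds; the function names and data layout are illustrative, not taken from the monograph.

```python
def gini(labels):
    """Gini impurity of a list of class labels (0.0 for a pure or empty node)."""
    n = len(labels)
    if n == 0:
        return 0.0
    return 1.0 - sum((labels.count(c) / n) ** 2 for c in set(labels))


def best_split(values, labels):
    """Return (threshold, weighted impurity) of the best rule 'value <= threshold'."""
    best = (None, float("inf"))
    distinct = sorted(set(values))
    # Candidate thresholds are midpoints between consecutive distinct values.
    for lo, hi in zip(distinct, distinct[1:]):
        threshold = (lo + hi) / 2
        left = [y for v, y in zip(values, labels) if v <= threshold]
        right = [y for v, y in zip(values, labels) if v > threshold]
        impurity = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
        if impurity < best[1]:
            best = (threshold, impurity)
    return best
```

    A full tree would apply `best_split` recursively to the left and right subsets over every feature, then prune; only the single-node step is shown.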

  5. Vocabulary Constraint on Texts

    Directory of Open Access Journals (Sweden)

    C. Sutarsyah

    2008-01-01

    Full Text Available This case study was carried out in the English Education Department of the State University of Malang. The aim of the study was to identify and describe the vocabulary in the reading texts and to determine whether the texts are useful for reading skill development. A descriptive qualitative design was applied to obtain the data. For this purpose, some available computer programs were used to describe the vocabulary in the texts. It was found that the 20 texts, containing 7,945 words, are dominated by low frequency words, which account for 16.97% of the words in the texts. The high frequency words occurring in the texts were dominated by function words. In terms of word levels, the texts have a very limited number of words from the GSL (General Service List of English Words; West, 1953). The first 1,000 words of the GSL account for only 44.6%. The data also show that the texts contain too large a proportion of words that are not in the three levels (the first 2,000 words and the UWL). These words account for 26.44% of the running words in the texts. It is believed that these constraints are due to the selection of the texts, which consist of a series of short, unrelated texts. This kind of text is subject to the accumulation of low frequency words, especially content words, and to a limited number of words from the GSL. It could also hinder the development of students' reading skills and vocabulary enrichment.
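
    Percentages such as those reported above are coverage figures: the share of running words (tokens) that belong to a given word list. A minimal sketch of that calculation follows. The word list here is a hypothetical stand-in, not the GSL or UWL, and the study's own programs are not named in the abstract.

```python
def coverage(tokens, word_list):
    """Percentage of running words (tokens) that belong to word_list."""
    if not tokens:
        return 0.0
    hits = sum(1 for t in tokens if t.lower() in word_list)
    return 100.0 * hits / len(tokens)


# Hypothetical stand-in for a high-frequency word list.
HIGH_FREQUENCY = {"the", "of", "and", "a", "to", "in", "is"}

tokens = "The study of the vocabulary in a text is useful".split()
print(f"{coverage(tokens, HIGH_FREQUENCY):.1f}% of running words are in the list")
```

    Real vocabulary profilers also group inflected forms into word families before counting, which this sketch does not attempt.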

  6. EEG Signal Classification: Introduction to the Problem

    Directory of Open Access Journals (Sweden)

    A. Stancak

    2003-09-01

    Full Text Available The contribution describes the design, optimization, and verification of an off-line single-trial movement classification system. Four types of movements are used for the classification: right index finger extension vs. flexion, as well as right shoulder (proximal) vs. right index finger (distal) movement. The classification system utilizes hidden information stored in the characteristic shapes of human brain activity (EEG signal). The great variability of EEG potentials requires the use of context information and hence a classifier based on Hidden Markov Models (HMM). A suitable parameterization, model structure, and training and classification process are suggested on the basis of spectral analysis results and experience with speech recognition. Training and classification are performed with disjoint sets of EEG realizations. Classification experiments are performed with 10 randomly chosen sets of EEG realizations. The final average score of the distal/proximal movement classification is 80%; the standard deviation of the classification results is 9%. The classification of extension/flexion gives comparable results.
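
    HMM-based classification of this kind typically trains one model per class and assigns a new sequence to the model under which it is most likely. The sketch below illustrates only that decision step, using the forward algorithm on a discrete HMM. It is a simplification under stated assumptions: the two models and their probabilities are invented, and observations are symbols 0 and 1 rather than the continuous EEG features the paper would use.

```python
def forward_likelihood(obs, start, trans, emit):
    """P(obs | model) for a discrete HMM, computed with the forward algorithm."""
    n = len(start)
    alpha = [start[s] * emit[s][obs[0]] for s in range(n)]
    for o in obs[1:]:
        alpha = [
            sum(alpha[p] * trans[p][s] for p in range(n)) * emit[s][o]
            for s in range(n)
        ]
    return sum(alpha)


def classify(obs, models):
    """Pick the class whose HMM gives the observation sequence the highest likelihood."""
    return max(models, key=lambda name: forward_likelihood(obs, *models[name]))


# Invented two-state models: (start probabilities, transition matrix, emission matrix).
MODELS = {
    "distal": ([0.6, 0.4], [[0.7, 0.3], [0.4, 0.6]], [[0.9, 0.1], [0.6, 0.4]]),
    "proximal": ([0.6, 0.4], [[0.7, 0.3], [0.4, 0.6]], [[0.1, 0.9], [0.4, 0.6]]),
}
```

    For long sequences the raw probabilities underflow, so practical implementations rescale `alpha` at each step or work with log-probabilities.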

  7. The Vicissitudes of Text

    Directory of Open Access Journals (Sweden)

    Jonathan CULLER

    2003-06-01

    Full Text Available The concept of text, which has been central to literary studies, has undergone many mutations, as it has traveled from the work of classical philologists, for whom it was and is the object of a powerful disciplinary formation, to postmodern theorists of the text, for whom, the concept might be summed up by the title of a fine book by John Mowatt: Text: the Genealogy of an Antidisciplinary Object. Of course, the interesting thing about a travelling concept is not that it travels — travelers, t...

  8. Instant Sublime Text starter

    CERN Document Server

    Haughee, Eric

    2013-01-01

    A starter which teaches the basic tasks to be performed with Sublime Text with the necessary practical examples and screenshots. This book requires only basic knowledge of the Internet and basic familiarity with any one of the three major operating systems, Windows, Linux, or Mac OS X. However, as Sublime Text 2 is primarily a text editor for writing software, many of the topics discussed will be specifically relevant to software development. That being said, the Sublime Text 2 Starter is also suitable for someone without a programming background who may be looking to learn one of the tools of

  9. E-text

    DEFF Research Database (Denmark)

    Finnemann, Niels Ole

    2018-01-01

    the print medium, rather than written text or speech. In the late 20th century, the notion of text was subject to increasing criticism, as in the question raised within literary text theory: is there a text in this class? At the same time, the notion was expanded by including extralinguistic sign modalities....... This wider notion would include, for instance, all sorts of scanning results, whether of the outer cosmos or the inner geographies of our bodies, and of digital traces of other processes in between these (machine readings included). Since alphabets, like the genetic alphabet, and all sorts of images may...

  10. Systematic text condensation

    DEFF Research Database (Denmark)

    Malterud, Kirsti

    2012-01-01

    To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies.

  11. Linguistics in Text Interpretation

    DEFF Research Database (Denmark)

    Togeby, Ole

    2011-01-01

    A model for how text interpretation proceeds from what is pronounced, through what is said, to what is communicated, and a definition of the concepts 'presupposition' and 'implicature'.

  12. YORUBA, INTERMEDIATE TEXTS.

    Science.gov (United States)

    MCCLURE, H. DAVID; OYEWALE, JOHN O.

    This course is based on a series of brief monologues recorded by a Western-educated native speaker of Yoruba from the Oyo area. The tapes constitute the central part of the course, with the text intended as supplementary and auxiliary material. The text topics were chosen for their special relevance to Peace Corps volunteers who expect to use…

  13. Making Sense of Texts

    Science.gov (United States)

    Harper, Rebecca G.

    2014-01-01

    This article addresses the triadic nature regarding meaning construction of texts. Grounded in Rosenblatt's (1995; 1998; 2004) Transactional Theory, research conducted in an undergraduate Language Arts curriculum course revealed that when presented with unfamiliar texts, students used prior experiences, social interactions, and literary strategies…

  14. Differential Classification of Dementia

    Directory of Open Access Journals (Sweden)

    E. Mohr

    1995-01-01

    Full Text Available In the absence of biological markers, dementia classification remains complex both in terms of characterization as well as early detection of the presence or absence of dementing symptoms, particularly in diseases with possible secondary dementia. An empirical, statistical approach using neuropsychological measures was therefore developed to distinguish demented from non-demented patients and to identify differential patterns of cognitive dysfunction in neurodegenerative disease. Age-scaled neurobehavioral test results (Wechsler Adult Intelligence Scale—Revised and Wechsler Memory Scale) from Alzheimer's (AD) and Huntington's (HD) patients, matched for intellectual disability, as well as normal controls were used to derive a classification formula. Stepwise discriminant analysis accurately (99% correct) distinguished controls from demented patients, and separated the two patient groups (79% correct). Variables discriminating between HD and AD patient groups consisted of complex psychomotor tasks, visuospatial function, attention and memory. The reliability of the classification formula was demonstrated with a new, independent sample of AD and HD patients which yielded virtually identical results (classification accuracy for dementia: 96%; AD versus HD: 78%). To validate the formula, the discriminant function was applied to Parkinson's (PD) patients, 38% of whom were classified as demented. The validity of the classification was demonstrated by significant PD subgroup differences on measures of dementia not included in the discriminant function. Moreover, a majority of demented PD patients (65%) were classified as having an HD-like pattern of cognitive deficits, in line with previous reports of the subcortical nature of PD dementia. This approach may thus be useful in classifying presence or absence of dementia and in discriminating between dementia subtypes in cases of secondary or coincidental dementia.

  15. Classification in Australia.

    Science.gov (United States)

    McKinlay, John

    Despite some inroads by the Library of Congress Classification and short-lived experimentation with Universal Decimal Classification and Bliss Classification, Dewey Decimal Classification, with its ability in recent editions to be hospitable to local needs, remains the most widely used classification system in Australia. Although supplemented at…

  16. The Vicissitudes of Text

    OpenAIRE

    Culler, Jonathan

    2011-01-01

    The concept of text, which has been central to literary studies, has undergone many mutations, as it has traveled from the work of classical philologists, for whom it was and is the object of a powerful disciplinary formation, to postmodern theorists of the text, for whom, the concept might be summed up by the title of a fine book by John Mowatt: Text: the Genealogy of an Antidisciplinary Object. Of course, the interesting thing about a travelling concept is not that it travels — travelers, t...

  17. The Vicissitudes of Text

    OpenAIRE

    Culler, Jonathan

    2003-01-01

    The concept of text, which has been central to literary studies, has undergone many mutations, as it has traveled from the work of classical philologists, for whom it was and is the object of a powerful disciplinary formation, to postmodern theorists of the text, for whom, the concept might be summed up by the title of a fine book by John Mowatt: Text: the Genealogy of an Antidisciplinary Object. Of course, the interesting thing about a travelling concept is not that it travels — travelers, t...

  18. Text, Hypertext, and Hyperfiction

    Directory of Open Access Journals (Sweden)

    Ladan Modir

    2014-03-01

    Full Text Available This article briefly surveys the changing theoretical perspectives on text from structuralism to poststructuralism and how hypertext theorists have subsequently drawn on them to comprehend the emerging genre called hypertext fiction. Some theoretical issues concerning the reading of this genre will also be discussed. The purpose of this study is to illustrate that the radical promises and challenges that digital novels present to readers prove that the reading and interpretation of conventional texts are far more participatory. This will be accomplished by tracing the evolution of poststructuralists’ concepts of intertextuality, multivocality, decentering, multilinearity, disorientation, and interactivity to find a way out of fixed notions of conventional principles of reading.

  19. Texting on the Move

    Science.gov (United States)

    ... about when and where we text. What's the Big Deal? The problem is multitasking. No matter how ... person again. Reviewed by: Mary L. Gavin, MD. Date reviewed: October 2013

  20. Machine Translation from Text

    Science.gov (United States)

    Habash, Nizar; Olive, Joseph; Christianson, Caitlin; McCary, John

    Machine translation (MT) from text, the topic of this chapter, is perhaps the heart of the GALE project. Beyond being a well-defined application that stands on its own, MT from text is the link between the automatic speech recognition component and the distillation component. The focus of MT in GALE is on translating from Arabic or Chinese to English. The three languages represent a wide range of linguistic diversity and make the GALE MT task rather challenging and exciting.

  1. Plagiarism in Academic Texts

    Directory of Open Access Journals (Sweden)

    Marta Eugenia Rojas-Porras

    2012-08-01

    Full Text Available The ethical and social responsibility of citing the sources in a scientific or artistic work is undeniable. This paper explores, in a preliminary way, academic plagiarism in its various forms. It includes findings based on a forensic analysis. The purpose of this paper is to raise awareness of the importance of considering these details when writing and publishing a text. Hopefully, this analysis will open the issue to discussion.

  2. Psychologically Motivated Text Mining

    OpenAIRE

    Shutova, Ekaterina; Lichtenstein, Patricia

    2016-01-01

    Natural language processing techniques are increasingly applied to identify social trends and predict behavior based on large text collections. Existing methods typically rely on surface lexical and syntactic information. Yet, research in psychology shows that patterns of human conceptualisation, such as metaphorical framing, are reliable predictors of human expectations and decisions. In this paper, we present a method to learn patterns of metaphorical framing from large text collections, us...

  3. Translation of Quantum Texts

    OpenAIRE

    Espinoza, Randall; Imbo, Tom; Lopata, Paul

    2004-01-01

    In the companion to this paper, we described a generalization of the deterministic quantum cloning process, called enscription, which utilizes entanglement in order to achieve the "copying" of (certain) sets of distinct quantum states which are not orthogonal, called texts. Here we provide a further generalization, called translation, which allows us to completely determine all translatable texts, and which displays an intimate relationship to the mathematical theory of graphs.

  4. VERNACULAR VOCABULARY SUFFIXED WORD-FORMATION MODELS OF TEXTS OF THE TOBOLSK ABALAK MONASTERY AT THE SECOND HALF OF THE XVIII CENTURY

    Directory of Open Access Journals (Sweden)

    Margarita Stepanovna Vykhrystyuk

    2016-12-01

    Full Text Available The article analyzes suffixed word-formation models in the vernacular vocabulary of shorthand business texts of the Tobolsk Abalak monastery from the second half of the XVIII century. The subject of research is the word-formation models of the vernacular vocabulary in these texts. The objective is to carry out a primary stylistic analysis of popular-colloquial elements at the word-formation level. The study is novel in two respects: first, its object, handwritten monuments stored in the local archive and belonging to the period of formation of the Russian national language, has never been studied before; second, it focuses on word-formation oppositions of the vernacular type of language found in the business writing of the ancient capital of Siberia. The methodological basis of the research is formed by the principles of the scientific-research approach to the text, i.e. systematicity and historicism. These principles determine the attitude to the text as a source of information of various levels in terms of the unity of content and form. Accordingly, the following methods were used: purposive sampling, description, and comparison. The linguistic analysis led to the conclusion that the genetic correlation of words with vernacular word-formation features, which was considered a stylistic marker at earlier stages of the history of the language, becomes neutralized in the monastic business writing of the XVIII century. The results can be used in further study of historical grammar and the history of records management in Russia, and in special courses on linguistic issues of local history, source studies, and historical stylistics.

  5. Remote Sensing Information Classification

    Science.gov (United States)

    Rickman, Douglas L.

    2008-01-01

    This viewgraph presentation reviews the classification of Remote Sensing data in relation to epidemiology. Classification is a way to reduce the dimensionality and precision to something a human can understand. Classification changes SCALAR data into NOMINAL data.
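    The step from scalar to nominal data described in this record can be illustrated with a minimal sketch. The thresholds and class labels below are hypothetical and are not taken from the presentation; the only point is that a continuous measurement is replaced by a class name, which reduces precision on purpose.

```python
from bisect import bisect_right

def classify_scalar(value, thresholds, labels):
    """Map a scalar to a nominal label using sorted class boundaries."""
    if len(labels) != len(thresholds) + 1:
        raise ValueError("need exactly one more label than thresholds")
    # bisect_right returns how many boundaries are <= value,
    # which is the index of the class the value falls into.
    return labels[bisect_right(thresholds, value)]

# Hypothetical boundaries for a vegetation-index-like quantity (illustrative only).
THRESHOLDS = [0.2, 0.5]
LABELS = ["bare", "sparse", "dense"]

pixels = [0.05, 0.2, 0.49, 0.73]
classes = [classify_scalar(p, THRESHOLDS, LABELS) for p in pixels]
print(classes)
```

    After this mapping the values can only be compared for equality; ordering and arithmetic on the original scalars are no longer available.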

  6. Classification in context

    DEFF Research Database (Denmark)

    Mai, Jens Erik

    2004-01-01

    This paper surveys classification research literature, discusses various classification theories, and shows that the focus has traditionally been on establishing a scientific foundation for classification research. This paper argues that a shift has taken place, and suggests that contemporary...

  7. The earliest medical texts.

    Science.gov (United States)

    Frey, E F

    The first civilization known to have had an extensive study of medicine and to leave written records of its practices and procedures was that of ancient Egypt. The oldest extant Egyptian medical texts are six papyri from the period between 2000 B.C. and 1500 B.C.: the Kahun Medical Papyrus, the Ramesseum IV and Ramesseum V Papyri, the Edwin Smith Surgical Papyrus, the Ebers Medical Papyrus and the Hearst Medical Papyrus. These texts, most of them based on older texts dating possibly from 3000 B.C., are comparatively free of the magician's approach to treating illness. Egyptian medicine influenced the medicine of neighboring cultures, including the culture of ancient Greece. From Greece, its influence spread onward, thereby affecting Western civilization significantly.

  8. New mathematical cuneiform texts

    CERN Document Server

    Friberg, Jöran

    2016-01-01

    This monograph presents in great detail a large number of both unpublished and previously published Babylonian mathematical texts in the cuneiform script. It is a continuation of the work A Remarkable Collection of Babylonian Mathematical Texts (Springer 2007) written by Jöran Friberg, the leading expert on Babylonian mathematics. Focussing on the big picture, Friberg explores in this book several Late Babylonian arithmetical and metro-mathematical table texts from the sites of Babylon, Uruk and Sippar, collections of mathematical exercises from four Old Babylonian sites, as well as a new text from Early Dynastic/Early Sargonic Umma, which is the oldest known collection of mathematical exercises. A table of reciprocals from the end of the third millennium BC, differing radically from well-documented but younger tables of reciprocals from the Neo-Sumerian and Old-Babylonian periods, as well as a fragment of a Neo-Sumerian clay tablet showing a new type of a labyrinth are also discussed. The material is presen...

  9. Text analysis in R

    NARCIS (Netherlands)

    Welbers, K.; van Atteveldt, W.H.; Benoit, K.

    2017-01-01

    Computational text analysis has become an exciting research field with many applications in communication research. It can be a difficult method to apply, however, because it requires knowledge of various techniques, and the software required to perform most of these techniques is not readily

  10. Text Induced Spelling Correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from a very large corpus of raw text, without supervision, and contains word

  11. Texts On-Line.

    Science.gov (United States)

    Thomas, Jean-Jacques

    1993-01-01

    Maintains that the study of signs is divided between those scholars who use the Saussurian binary sign (semiology) and those who prefer the Peirce tripartite sign (semiotics). Concludes that neither the Saussurian nor Peircian analysis methods can produce a semiotic interpretation based on a hierarchy of the text's various components. (CFR)

  12. Dictionaries for text production

    DEFF Research Database (Denmark)

    Fuertes-Olivera, Pedro; Bergenholtz, Henning

    2018-01-01

    and free online dictionaries. The Diccionario español para la producción de textos is an example of a general text production dictionary that makes use of internet technologies, is based on a lexicographic theory, contains all the lexicographic data that users need in a production situation, and aims...

  13. Content Based Text Handling.

    Science.gov (United States)

    Schwarz, Christoph

    1990-01-01

    Gives an overview of various linguistic software tools in the field of intelligent text handling that are being developed in Germany utilizing artificial intelligence techniques in the field of natural language processing. Syntactical analysis of documents is described and application areas are discussed. (10 references) (LRW)

  14. Text, Hypertext, and Hyperfiction

    OpenAIRE

    Ladan Modir; Ling C Guan; Sohaimi Bin Abdul Aziz

    2014-01-01

    This article briefly surveys the changing theoretical perspectives on text from structuralism to poststructuralism and how they are subsequently accounted for by hypertext theorists to comprehend the emerging genre called hypertext fiction. Some theoretical issues concerning the reading of this genre also will be discussed. The purpose of this study is to illustrate that the radical promises and challenges of digital n...

  15. E-text

    DEFF Research Database (Denmark)

    Finnemann, Niels Ole

    2017-01-01

    of “text” or “printed text” as the point of departure. On the other hand, electronic text can be defined by taking as point of departure the digital format in which everything is represented in the binary alphabet. While the notion of text, in most cases, lends itself to be independent of medium......) processing rules as binary sequences manifested in the binary alphabet. This wider notion would include, for instance, all sorts of scanning results, whether of the outer cosmos or the inner geographies of our bodies, and of digital traces of other processes in between these (machine readings included......). Since alphabets, like the genetic alphabet, and all sorts of images may be represented in the binary alphabet, such materials will also belong to the textual universe within this definition. A more intriguing implication is that digital born materials may also include scripts and interactive features...

  16. Strategy as Texts

    DEFF Research Database (Denmark)

    Obed Madsen, Søren

    This article shows empirically how managers translate a strategy plan at an individual level. By analysing how managers in three organizations translate strategies, it identifies that the translation happens in two steps: First, the managers decipher the strategy by coding the different parts of the strategy into four categories. Second, the managers produce new texts based on the original strategy document by using four different ways of translation models. The study’s findings contribute to three areas. Firstly, it shows that translation is more than a sociological process. It is also a craftsmanship that requires knowledge and skills, which unfortunately seems to be overlooked in both the literature and in practice. Secondly, it shows that even though a strategy text is in singular, the translation makes strategy plural. Thirdly, the article proposes a way to open up the black box of what...

  17. Wisdom Texts and Philosophy

    Directory of Open Access Journals (Sweden)

    Anthony Preus

    2013-11-01

    Full Text Available The last essay of this issue concerns a more "technical" subject: in many ancient cultures, literary monuments are mainly "wisdom literature". In these early works, philosophy and literature are more closely related than in many contemporary approaches. The author here tries to sketch the relationships between the ancient wisdom literatures of Egypt, Greece and Israel, and to show how this literary genre precedes "philosophy".

  18. Knowledge Based Text Generation

    Science.gov (United States)

    1989-08-01

    from data bases, so Kukich [1984] developed a system, ANA, which generates stock reports from a knowledge base of daily trading on the Dow Jones stock ... MACHIAVELLI (topic organization and phraseology), CICERO (realization), FREUD (monitoring the origins of rhetorical plans), and LEIBNITZ (a "concept...

  19. Weaving with text

    DEFF Research Database (Denmark)

    Hagedorn-Rasmussen, Peter

    This paper explores how a school principal, by means of practical authorship, creates reservoirs of language that provide a possible context for collective sensemaking. The paper draws upon a field study in which a school principal and his managerial team were shadowed in a period of intensive changes. The paper explores how the manager weaves with text, extracted from stakeholders, administration, politicians, employees, public discourse etc., as a means of creating a new fabric, a texture, of diverse perspectives that aims for collective sensemaking.

  20. Ways students read texts

    Science.gov (United States)

    Wandersee, James H.

    College students responding to the Preferred Method of Study (PMOS) questionnaire explained how they approach reading a new textbook chapter for comprehension. Results indicated that a significant positive correlation exists between the number of passes a student makes at new textbook material and his/her college grade-point average. Women showed a significant preference for adopting a single method of study. Less than half of the students queried construct organizational tools such as outlines or diagrams as they study a textbook. Students said they would alter their textbook strategies in response to the type of test they expected significantly more often than they would for the type of subject matter being studied. Only 6% of the students said they make a conscious effort to link the new concepts in the text to prior knowledge. There was no discernable relationship between the study strategies undergraduate college students employ and their college grade level (freshman through senior).

  1. SPORT FOOD ADDITIVE CLASSIFICATION

    Directory of Open Access Journals (Sweden)

    I. P. Prokopenko

    2015-01-01

    Full Text Available Correctly organized nutritional and pharmacological support is an important component of an athlete's preparation for competition, maintenance of optimal form, and fast recovery and rehabilitation after injury and fatigue. Special products of enhanced biological value (BAS) for athletes' nutrition are used for this purpose. Easy-to-use energy sources, yielded materials, and biologically active substances are administered to the athlete's organism; these regulate and activate the exchange reactions that proceed with difficulty during certain kinds of physical training. The article presents a classification of sport supplements that can be used before warm-up and training, after training, and during breaks in competition.

  2. Classification of the web

    DEFF Research Database (Denmark)

    Mai, Jens Erik

    2004-01-01

    This paper discusses the challenges faced by investigations into the classification of the Web and outlines inquiries that are needed to use principles for bibliographic classification to construct classifications of the Web. This paper suggests that the classification of the Web meets challenges...

  3. Technological and stylistic evaluation of the Early Bronze Age pottery at Tarsus-Gozlukule, Turkey: Pottery production and its interaction with economic, social, and cultural spheres

    Science.gov (United States)

    Unlu, Elif

    This dissertation presents a technological and stylistic assessment of Early Bronze Age pottery production at Tarsus-Gozlukule, a multi-period mound settlement located in the Cilician Plain in southern Turkey. Pottery production, like all other man-made objects, is firstly a technological act. This dissertation maintains that material style (involving formal, technical, and decorative choices expressed by the artisan) of an artifact should be investigated as a whole as such an integrative study would be the most adequate way of understanding economic circumstances, social representation, and cultural boundaries. To facilitate this integrative investigation, seventy-two samples of Early Bronze Age pottery excavated from Tarsus-Gozlukule in the 1930s and 1940s were selected for mineralogical, morphological, and chemical analyses. Petrographic and powder X-Ray Diffraction analyses were performed to determine the mineralogical makeup, Environmental Scanning Electron Microscope imagery was used to determine the morphology of these samples, and semi-quantitative Energy Dispersive X-Ray Spectroscopy analysis was performed on some samples to determine chemical properties of the clays. As a result of these scientific analyses various fabric groups were established. Afterwards formal shape and stylistic analysis was performed where shapes and surface treatments of the samples were analyzed and compared to the known local and non-local examples. Such an integrative approach to pottery production facilitates a better definition of the local pottery production process and enables an assessment of the technological know-how of the local pottery producers, their labor organization and its role within the operating markets, their function within the sociopolitical structure, and how such issues relate to the cultural boundaries within the community. Defining the paradigm of the local pottery production process leads to a broader investigation of issues related to the technological

  4. Clustering and classification of email contents

    Directory of Open Access Journals (Sweden)

    Izzat Alsmadi

    2015-01-01

    Full Text Available Information users depend heavily on email systems as one of their major means of communication. The importance and usage of email are continuously growing despite the evolution of mobile applications, social networks, etc. Emails are used on both the personal and professional levels. They can be considered official documents in communication among users. Email data mining and analysis can be conducted for several purposes, such as spam detection and classification, subject classification, etc. In this paper, a large set of personal emails is used for the purpose of folder and subject classification. Algorithms are developed to perform clustering and classification for this large text collection. Classification based on NGram is shown to be the best for such a large text collection, especially as the text is bilingual (i.e. with English and Arabic content).
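    As an illustration of n-gram based classification in general, and not the algorithm used in the paper (which the abstract does not specify), a minimal sketch can build one character-trigram profile per folder and assign a new message to the folder with the most similar profile. The folder names and messages are hypothetical. Character n-grams need no language-specific tokenizer, which is one reason they are often tried on mixed-language text.

```python
from collections import Counter
from math import sqrt

def char_ngrams(text, n=3):
    """Count overlapping character n-grams after normalising case and spaces."""
    text = " ".join(text.lower().split())
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))

def cosine(a, b):
    """Cosine similarity between two n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0

def train(labelled):
    """Merge the n-gram counts of all messages that share a label."""
    profiles = {}
    for label, text in labelled:
        profiles.setdefault(label, Counter()).update(char_ngrams(text))
    return profiles

def classify(profiles, text):
    """Return the label whose profile is closest to the message."""
    grams = char_ngrams(text)
    return max(profiles, key=lambda label: cosine(grams, profiles[label]))

# Hypothetical training messages, for illustration only.
emails = [
    ("work", "meeting agenda for the project review on monday"),
    ("work", "please send the quarterly project report"),
    ("personal", "family dinner at home this weekend"),
    ("personal", "happy birthday see you at the party"),
]
profiles = train(emails)
print(classify(profiles, "project meeting report"))
print(classify(profiles, "birthday party at home"))
```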

  5. This article presents a tool for visual discourse analysis that allows stylistic research indicators to be systematised. The tool, called Matrix for Documentary Analysis, is based on the theoretical aspects of visual semiotics. It can be used to describe

    OpenAIRE

    Bartomeu Magaña, Elena; EINA

    2016-01-01

    This article presents a tool for visual discourse analysis that allows stylistic research indicators to be systematised. The tool, called Matrix for Documentary Analysis, is based on the theoretical aspects of visual semiotics. It can be used to describe or forecast the scope of trends in graphic design and visual communication. This article also presents a test carried out on a trend in logotype design in Spain during the first decade of the 21st century known as Pollination. The results con...

  6. Interconnectedness und digitale Texte

    Directory of Open Access Journals (Sweden)

    Detlev Doherr

    2013-04-01

    Full Text Available Summary Multimedia information services on the Internet are becoming ever more extensive and comprehensive, and libraries are also digitizing documents that exist only in print and putting them online. These documents can be found through online document management systems or search engines and then provided in common formats such as PDF. This article examines how the Humboldt Digital Library (HDL) works, which for more than ten years has made documents by Alexander von Humboldt freely available on the Web in English translation. Unlike a digital library, however, it does not merely provide digitized documents as scans or PDFs, but makes the text itself available in interlinked form. The system therefore resembles an information system more than a digital library, which is also reflected in the available functions for locating texts in different versions and translations, comparing paragraphs from different documents, or displaying images in their context. The development of dynamic hyperlinks based on the individual text paragraphs of Humboldt's works, in the form of media assets, makes it possible to use the Google Maps programming interface for both geographic and text-content navigation. Going beyond the service of a digital library, the HDL offers the prototype of a multidimensional information system that works with dynamic structures and enables extensive thematic analyses and comparisons.

  7. The book classification of William Torrey Harris: influences of Bacon and Hegel in library classification

    Directory of Open Access Journals (Sweden)

    Rodrigo de Sales

    2017-09-01

    Full Text Available Studies of library classification generally interact with the historical contextualization approach and with the classification ideas typical of Philosophy. In the 19th century, the North-American philosopher and educator William Torrey Harris developed a book classification at the St. Louis Public School, based on Francis Bacon and Georg Wilhelm Friedrich Hegel. The objective of this essay is to analyze Harris’s classification, reflecting upon his theoretical and philosophical backgrounds. To achieve this objective, the essay adopts a critical-descriptive approach to analysis. Results show some influences of Bacon and Hegel in Harris’s classification.

  8. Radar transmitter classification using non-stationary signal classifier

    CSIR Research Space (South Africa)

    Du Plessis, MC

    2009-07-01

    Full Text Available support vector machine which is applied to the radar pulse's time-frequency representation. The time-frequency representation is refined using particle swarm optimization to increase the classification accuracy. The classification accuracy is tested...

  9. The role of subject classification in terminological studies

    Directory of Open Access Journals (Sweden)

    Ж Багана

    2010-09-01

    Full Text Available This article describes the role of subject classification in terminology; defines its specific peculiarities and justifies the choice of the given classification to analyze the specific terminology.

  10. The Role of Text Mining in Export Control

    Energy Technology Data Exchange (ETDEWEB)

    Tae, Jae-woong; Son, Choul-woong; Shin, Dong-hoon [Korea Institute of Nuclear Nonproliferation and Control, Daejeon (Korea, Republic of)

    2015-10-15

    The Korean government provides classification services to exporters. Technology such as documents and drawings is simple to copy, and new technology is easily derived from existing technology. The diversity of technology makes classification difficult because the boundary between strategic and nonstrategic technology is unclear and ambiguous. Reviewers should give sufficient consideration to previous classification cases, but the growing number of cases hinders consistent classification. This made another innovative and effective approach necessary. IXCRS (Intelligent Export Control Review System) is proposed to meet these demands. IXCRS consists of an expert system, a semantic searching system, a full text retrieval system, an image retrieval system and a document retrieval system. The aim of the present paper is to examine the document retrieval system based on text mining and to discuss how to utilize the system. This study demonstrates how text mining techniques can be applied to export control. The document retrieval system helps reviewers handle previous classification cases effectively. In particular, it is highly probable that similarity data will contribute to specifying classification criteria. However, an analysis of the system revealed a number of problems that remain to be explored, such as a multilanguage problem and an inclusion relationship problem. Further research should be directed at solving these problems and applying more data mining techniques so that the system can serve as a useful tool for export control.
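    The abstract does not say how the document retrieval system computes similarity. As one common text mining approach, and purely as an assumption for illustration, previous cases can be ranked against a new request by cosine similarity of TF-IDF vectors. The case texts, the query, and the function names below are hypothetical.

```python
from collections import Counter
from math import log, sqrt

def tfidf_vectors(docs):
    """Return one TF-IDF vector (dict) per document plus the IDF table."""
    tokenised = [doc.lower().split() for doc in docs]
    df = Counter(term for tokens in tokenised for term in set(tokens))
    n = len(docs)
    # Smoothed IDF so that terms present in every document keep a small weight.
    idf = {term: log((1 + n) / (1 + count)) + 1 for term, count in df.items()}
    vectors = [{t: c * idf[t] for t, c in Counter(tokens).items()}
               for tokens in tokenised]
    return vectors, idf

def cosine(a, b):
    """Cosine similarity between two sparse vectors stored as dicts."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = sqrt(sum(w * w for w in a.values()))
    nb = sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def rank_cases(query, cases):
    """Return (score, case index) pairs, most similar case first."""
    vectors, idf = tfidf_vectors(cases)
    counts = Counter(query.lower().split())
    q = {t: c * idf.get(t, 0.0) for t, c in counts.items()}
    return sorted(((cosine(q, v), i) for i, v in enumerate(vectors)), reverse=True)

# Hypothetical previous classification cases, for illustration only.
cases = [
    "centrifuge rotor design drawing for uranium enrichment",
    "commercial pump maintenance manual",
    "high strength aluminium tube specification for centrifuge rotor",
]
ranking = rank_cases("centrifuge rotor tube", cases)
for score, index in ranking:
    print(f"{score:.3f}  {cases[index]}")
```

    A reviewer-facing tool would presumably show the top-ranked cases together with their earlier classification decisions; handling several languages, which the abstract lists as an open problem, is outside this sketch.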

  11. Information Gain Based Dimensionality Selection for Classifying Text Documents

    Energy Technology Data Exchange (ETDEWEB)

    Dumidu Wijayasekara; Milos Manic; Miles McQueen

    2013-06-01

    Selecting the optimal dimensions for various knowledge extraction applications is an essential component of data mining. Dimensionality selection techniques are utilized in classification applications to increase the classification accuracy and reduce the computational complexity. In text classification, where the dimensionality of the dataset is extremely high, dimensionality selection is even more important. This paper presents a novel genetic-algorithm-based methodology for dimensionality selection in text mining applications that utilizes information gain. The presented methodology uses the information gain of each dimension to change the mutation probability of chromosomes dynamically. Since the information gain is calculated a priori, the computational complexity is not affected. The presented method was tested on a specific text classification problem and compared with conventional genetic-algorithm-based dimensionality selection. The results show an improvement of 3% in the true positives and 1.6% in the true negatives over conventional dimensionality selection methods.
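    The information gain of a dimension is the entropy of the class labels minus the expected entropy after splitting on that dimension. The sketch below computes it for discrete features and then derives a per-dimension mutation probability. The scaling rule in `mutation_probabilities` is an assumption made for illustration; the abstract states only that information gain changes the mutation probability, not how. The toy data and names are hypothetical.

```python
from collections import Counter
from math import log2

def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    return -sum((c / total) * log2(c / total) for c in Counter(labels).values())

def information_gain(feature, labels):
    """Entropy of the labels minus the expected entropy after splitting on the feature."""
    total = len(labels)
    remainder = 0.0
    for value in set(feature):
        subset = [y for x, y in zip(feature, labels) if x == value]
        remainder += len(subset) / total * entropy(subset)
    return entropy(labels) - remainder

def mutation_probabilities(gains, base_rate=0.05):
    """ASSUMED rule, not from the paper: dimensions with higher information
    gain are mutated less often, down to half the base rate."""
    top = max(gains) or 1.0  # avoid division by zero when all gains are 0
    return [base_rate * (1.0 - 0.5 * g / top) for g in gains]

# Toy data: two binary term-presence dimensions for four documents.
labels = ["spam", "spam", "ham", "ham"]
features = [
    [1, 1, 0, 0],  # perfectly separates the classes
    [1, 0, 1, 0],  # carries no class information
]
gains = [information_gain(f, labels) for f in features]
probs = mutation_probabilities(gains)
print(gains, probs)
```

    Because the gains depend only on the training data, they can be computed once before the genetic algorithm starts, which matches the abstract's remark that the computational complexity is not affected.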

  12. Classification system to describe workpieces definitions

    CERN Document Server

    Macconnell, W R

    2013-01-01

    A Classification System to Describe Workpieces provides information pertinent to the fundamental aspects and principles of coding. This book discusses the various applications of the classification system of coding. Organized into three chapters, this book begins with an overview of the requirements of a system of classification pertaining adequately and equally to design, production, and work planning. This text then examines the purpose of the classification system in production to determine the most suitable means of machining a component. Other chapters consider the optimal utilization of m

  13. Prevailing Lexical-stylistic Features in Emirati Language Learners’ Digital Discourse | Caractéristiques lexicales stylistiques dominantes dans le discours numérique des apprenants en langue émirienne

    Directory of Open Access Journals (Sweden)

    Tsoghik Grigoryan

    2015-05-01

    Full Text Available Predicting the future path of digital classroom discourse is twofold. Today’s language classroom is undergoing an irreversible revolution, and one of the most powerful drivers of this transformation is ICT. The digital classroom exposes learners not only to the grammatical language of linguistics but also to the everyday life of the language in use (Thurlow and Mroczek, 2011). The aim of this study was to explore the nature of free digital discourse on iPads in a digital language classroom and to capture the lexical-stylistic features used in adolescent Emirati language learners’ online conversations through the Blackboard Learn discussion board. A mixed-method case study, which applied a theoretical framework of sentiment detection, was used to identify common or unique features of digital discourse in a paperless language classroom and to show how they affect students’ speech behaviors.

  14. Analysis of Tense Interferential of Verbs in Old Narrative Texts

    Directory of Open Access Journals (Sweden)

    Mahmood Barati khansari

    2014-08-01

    Full Text Available Abstract One of the admirable devices for composing stories in Persian verse and prose is the use of present-tense verbs with past-tense meaning. This grammatical point has escaped the notice of grammarians and stylists, although it occurs repeatedly in the texts; it is not mentioned in grammar books, but some researchers and literary scholars have pointed it out in their text editions. Their remarks are as follows. First, Allame Qazvini, with some hesitation, mentions the interference of verb tenses and the inconsistency of tenses in his edition of the Jahangoshaye – Joveini Book. He writes in the second footnote that the verb Mikonam (I do) has the form of the present tense but the meaning of the simple past, and that in most old books the form of the verb is present tense while its meaning is simple past (Joveini, 1367, p. 357). Later, Fruzanfar, in his grammatical notes on the edition of Ouhadoddin Kermani's Manaqeb, refers to this point and counts it as an instance of the literary device of Eltefat (Fruzanfar, 1347, p. 61). Mohammad Roushan was aware of this grammatical rule and writes in the introduction of his book of the application of this kind of verb that is not on the basis of the dependent and independent verbs (Khagushi, 1361, p. 24). Yusofi, in his edition of the Bidpay Stories, points out that this grammatical point has escaped the book's editors. He says that it is a characteristic of the book's prose and adds that the characteristic appears in the present stories (Yusofi, 1364, p. 36). Finally, Dr. Shfi'ee, in his valuable notes on the Mateqol alteir, mentions that this style of storytelling, with the verb in the present tense, is less common in verse, but verbs with the same meaning and form were used in old Persian as they are at present, although there were inconsistencies between the tense and the form of the verbs in the past and the grammarians

  15. Analysis of Tense Interferential of Verbs in Old Narrative Texts

    Directory of Open Access Journals (Sweden)

    Amir Zeighami

    2014-07-01

    Full Text Available Abstract: One of the admirable methods of composing stories in Persian verse and prose is the use of present-tense verbs with a past-tense meaning. This grammatical point has escaped the attention of grammarians and stylists, although it occurs repeatedly in the texts. It is not mentioned in the grammar books, but some investigators and literati have pointed it out in their text editions. We cite their remarks. First, Allame Qazvini tentatively mentions the interference of verb tenses and the inconsistencies of tense in his edition of Joveini's Jahangoshaye. He writes in the second footnote 2-3 that the verb Mikonam (I do) is present tense in form but simple past in meaning and that, as has been observed, in most old books the form of the verb is present tense but its meaning is simple past (Joveini, 1367, p. 357). Later, Fruzanfar, in his grammatical notes to the edition of Ouhadoddin Kermani's Manaqeb, points to this and counts it as an instance of the literary device Eltefat (Fruzanfar, 1347, p. 61). Mohammad Roushan noted this grammatical rule and writes in the introduction to his book of the use of this kind of verb, which does not follow the rules of dependent and independent verbs (Khagushi, 1361, p. 24). Yusofi, in his edition of the Bidpay Stories, points out that this grammatical point had escaped earlier editors of the book. He says that it is a characteristic of the book's prose and adds that the characteristic is found in the present stories (Yusofi, 1364, p. 36). Finally, Dr. Shfi'ee, in his valuable notes on the Mateqol altei their, mentions that this style of storytelling, with the verb in the present tense, is less common in verse, but that verbs with the same meaning and form were used in old Persian as in the present time, although there were inconsistencies between the time and the form of the verbs in the past and

  16. Facial aging: A clinical classification

    Directory of Open Access Journals (Sweden)

    Shiffman Melvin

    2007-01-01

    Full Text Available The purpose of this classification of facial aging is to have a simple clinical method to determine the severity of the aging process in the face. This allows a quick estimate as to the types of procedures that the patient would need to have the best results. Procedures that are presently used for facial rejuvenation include laser, chemical peels, suture lifts, fillers, modified facelift and full facelift. The physician is already using his best judgment to determine which procedure would be best for any particular patient. This classification may help to refine these decisions.

  17. SHIP CLASSIFICATION FROM MULTISPECTRAL VIDEOS

    Directory of Open Access Journals (Sweden)

    Frederique Robert-Inacio

    2012-05-01

    Full Text Available Surveillance of a seaport can be achieved by different means: radar, sonar, cameras, radio communications and so on. Such surveillance aims, on the one hand, to manage cargo and tanker traffic and, on the other hand, to prevent terrorist attacks in sensitive areas. This paper presents an application to video surveillance of a seaport entrance and, more particularly, the different steps that make it possible to classify mobile shapes. This classification is based on a parameter measuring the degree of similarity between the shape under study and a set of reference shapes. The classification result describes the mobile under consideration in terms of shape and speed.

  18. The Verbal Periphrasis ir a + Infinitive in the Narrative Texts by Gabriel García Márquez

    Directory of Open Access Journals (Sweden)

    Jasmina Markič

    2015-12-01

    Full Text Available This article deals with ir a + infinitive, one of the most frequent verbal periphrases in Spanish with temporal, aspectual and modal values. It presents the analysis of narrative texts written by the Colombian writer Gabriel García Márquez and emphasizes the expression of aspectual values of imminence of the event and the expression of future with the auxiliary verb in present replacing the simple future tense in Colombian Spanish corroborating, thus, the fact that in contemporary American Spanish the expression of futurity is in the process of transfer from a synthetic to an analytic form. However, these aspectual and temporal values are not always alone, frequently they merge with other mainly modal values which confer to this periphrasis additional expressive and stylistic uses.

  19. Text Authorship Identified Using the Dynamics of Word Co-Occurrence Networks

    Science.gov (United States)

    Akimushkin, Camilo; Amancio, Diego Raphael; Oliveira, Osvaldo Novais

    2017-01-01

    Automatic identification of authorship in disputed documents has benefited from complex network theory as this approach does not require human expertise or detailed semantic knowledge. Networks modeling entire books can be used to discriminate texts from different sources and understand network growth mechanisms, but only a few studies have probed the suitability of networks in modeling small chunks of text to grasp stylistic features. In this study, we introduce a methodology based on the dynamics of word co-occurrence networks representing written texts to classify a corpus of 80 texts by 8 authors. The texts were divided into sections with equal number of linguistic tokens, from which time series were created for 12 topological metrics. Since 73% of all series were stationary (ARIMA(p, 0, q)) and the remaining were integrable of first order (ARIMA(p, 1, q)), probability distributions could be obtained for the global network metrics. The metrics exhibit bell-shaped non-Gaussian distributions, and therefore distribution moments were used as learning attributes. With an optimized supervised learning procedure based on a nonlinear transformation performed by Isomap, 71 out of 80 texts were correctly classified using the K-nearest neighbors algorithm, i.e. a remarkable 88.75% author matching success rate was achieved. Hence, purely dynamic fluctuations in network metrics can characterize authorship, thus paving the way for a robust description of large texts in terms of small evolving networks. PMID:28125703
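
    The pipeline summarized above (equal-size chunks, one network per chunk, a time series per metric, distribution moments as attributes) can be illustrated with a minimal sketch. This is not the authors' implementation: the adjacency-based linking rule, the choice of average degree as the only metric, and all function names are assumptions made for illustration, whereas the study uses 12 topological metrics and an Isomap plus K-nearest-neighbors stage that is omitted here.

```python
from collections import defaultdict
from statistics import mean, pstdev


def cooccurrence_network(tokens):
    """Link each token to its immediate successor (undirected, no self-loops)."""
    graph = defaultdict(set)
    for a, b in zip(tokens, tokens[1:]):
        if a != b:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def average_degree(graph):
    """One simple topological metric: mean number of neighbours per node."""
    if not graph:
        return 0.0
    return mean(len(neighbours) for neighbours in graph.values())


def metric_series(tokens, chunk_size):
    """Split the text into equal-size chunks and measure each chunk's network.

    A trailing chunk shorter than chunk_size is dropped.
    """
    chunks = [tokens[i:i + chunk_size]
              for i in range(0, len(tokens) - chunk_size + 1, chunk_size)]
    return [average_degree(cooccurrence_network(chunk)) for chunk in chunks]


def moments(series):
    """Distribution moments used as learning attributes (here: mean and std)."""
    return mean(series), pstdev(series)


# Toy text, purely illustrative.
text = "the cat sat on the mat and the dog sat on the log".split()
series = metric_series(text, chunk_size=4)
features = moments(series)

# The success rate quoted in the abstract: 71 of 80 texts.
accuracy = 71 / 80
```

    In a fuller version, each text would yield one feature vector built from the moments of every metric's series, and those vectors would feed the supervised classifier.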

  20. Classification of cultivated plants.

    NARCIS (Netherlands)

    Brandenburg, W.A.

    1986-01-01

    Agricultural practice demands principles for classification, starting from the basal entity in cultivated plants: the cultivar. In establishing biosystematic relationships between wild, weedy and cultivated plants, the species concept needs re-examination. Combining of botanic classification, based

  1. Automatic Hierarchical Color Image Classification

    Directory of Open Access Journals (Sweden)

    Jing Huang

    2003-02-01

    Full Text Available Organizing images into semantic categories can be extremely useful for content-based image retrieval and image annotation. Grouping images into semantic classes is a difficult problem, however. Image classification attempts to solve this hard problem by using low-level image features. In this paper, we propose a method for hierarchical classification of images via supervised learning. This scheme relies on using a good low-level feature and subsequently performing feature-space reconfiguration using singular value decomposition to reduce noise and dimensionality. We use the training data to obtain a hierarchical classification tree that can be used to categorize new images. Our experimental results suggest that this scheme not only performs better than standard nearest-neighbor techniques, but also has both storage and computational advantages.

  2. A Semisupervised Cascade Classification Algorithm

    Directory of Open Access Journals (Sweden)

    Stamatis Karlos

    2016-01-01

    Full Text Available Classification is one of the most important tasks of data mining techniques, which have been adopted by several modern applications. The shortage of labeled data in the majority of these applications has shifted interest towards semisupervised methods. Under such schemes, the use of collected unlabeled data combined with a clearly smaller set of labeled examples leads to similar or even better classification accuracy compared with supervised algorithms, which use labeled examples exclusively during the training phase. This paper presents a novel approach for improving semisupervised classification using the Cascade Classifier technique. The main characteristic of the Cascade Classifier strategy is the use of a base classifier to enlarge the feature space by adding either the predicted class or the class probability distribution of the initial data. The second-level classifier is supplied with the new dataset and produces the decision for each instance. In this work, a self-trained NB∇C4.5 classifier algorithm is presented, which combines the characteristics of Naive Bayes as a base classifier and the speed of C4.5 for final classification. We performed an in-depth comparison with other well-known semisupervised classification methods on standard benchmark datasets and found that the presented technique has better accuracy in most cases.
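
    The cascade step described above, in which a base classifier's prediction is appended to the feature vector before a second-level classifier decides, can be sketched as follows. This is only an illustration under stated assumptions: a nearest-centroid rule stands in for Naive Bayes, a 1-nearest-neighbour rule stands in for C4.5, class labels are assumed to be numeric, and the self-training loop over unlabeled data is left out. All function names are hypothetical.

```python
import math


def centroids(X, y):
    """Fit the stand-in base classifier: one mean vector per class."""
    sums, counts = {}, {}
    for row, label in zip(X, y):
        acc = sums.setdefault(label, [0.0] * len(row))
        for i, value in enumerate(row):
            acc[i] += value
        counts[label] = counts.get(label, 0) + 1
    return {c: [v / counts[c] for v in s] for c, s in sums.items()}


def base_predict(model, row):
    """Predict the class whose centroid is closest to the row."""
    return min(model, key=lambda c: math.dist(model[c], row))


def augment(model, X):
    """Cascade step: append the base prediction as an extra feature."""
    return [row + [float(base_predict(model, row))] for row in X]


def second_level_predict(X_aug, y, row_aug):
    """Stand-in second-level classifier: 1-nearest neighbour on augmented rows."""
    best = min(range(len(X_aug)), key=lambda i: math.dist(X_aug[i], row_aug))
    return y[best]


# Toy labeled data with numeric class labels (an assumption of this sketch).
X = [[0.0, 0.1], [0.2, 0.0], [1.0, 0.9], [0.9, 1.1]]
y = [0, 0, 1, 1]

model = centroids(X, y)
X_aug = augment(model, X)                 # original features + predicted class
query = augment(model, [[0.8, 0.8]])[0]   # the same augmentation for a new instance
prediction = second_level_predict(X_aug, y, query)
```

    Appending the class probability distribution instead of the predicted class, the other option the abstract mentions, would only change what `augment` adds to each row.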

  3. Enquete stylistique sur la traduction automatique de documents techniques. (A Stylistic Inquiry into the Automatic Translation of Technical Documents).

    Science.gov (United States)

    Loffler-Laurian, Anne-Marie

    1987-01-01

    Describes a study that attempts to systematize the criteria required for accurate translations of technical documents. The results of a Linguistic Appreciation Questionnaire-Test, administered to 19 professional translators, were used to categorize the most common translation variables: style, structure, rhythm, and meaning of text in the hope of…

  4. The Application of Classification Structures in Knowledge Organization and Representation

    Directory of Open Access Journals (Sweden)

    Tzu-heng Chiu

    2002-12-01

    Full Text Available Classification is a way of seeing the world. In a classification scheme, phenomena of interest are represented in a context of relationships that provide description, explanation, prediction, heuristics, and the generation of new knowledge. Knowing that information organization isn’t equal to knowledge organization, the author first defines the scope of classification and knowledge organization, and then describes the relationship between the classification and the representation and organization of knowledge. At the end, four kinds of classification structures are compared to show their abilities in representing knowledge. In order to utilize these classification structures, it is very important to understand their advantages and disadvantages. [Article content in Chinese]

  5. [Another seizure classification--Semiological Seizure Classification].

    Science.gov (United States)

    Lin, Ji-Ho; Kwan, Shang-Yeong; Wu, Dean; Su, Min-Shin; Yiu, Chun-Hing

    2004-09-01

    The International League Against Epilepsy (ILAE) introduced in 1981 a seizure classification based on clinical semiology, interictal EEG findings, and ictal EEG patterns. Such classification depends heavily on detailed electroclinical correlation. After 20 years' progress in epileptology, many clinicians have found it difficult to make a "definite" seizure diagnosis clinically without a series of electrophysiological examinations, particularly in infants, and further advances in epileptology have made the previous classification inefficient. Lüders and colleagues have proposed a classification, the Semiological Seizure Classification (SSC), based exclusively on ictal semiology, which was published in Epilepsia, the official journal of the ILAE, in 1998. The EEG, neuroimaging and other laboratory results should be analyzed separately and then integrated to define the epileptic syndromes. The seizure diagnosis is thus made in a "what-you-see-is-what-you-get" way. It has also provoked an extensive discussion about the necessity of this new classification. In this review, we present the original guideline, which has been used at The Cleveland Clinic Foundation for years, to introduce another method of epileptic seizure classification.

  6. "El tratamiento estilístico de la tauromaquia en la narrativa breve y en los artículos periodísticos de Ernest Hemingway" [The stylistic treatment of bullfighting in the short stories and articles of Ernest Hemingway]

    Directory of Open Access Journals (Sweden)

    Ricardo Marín Ruiz

    2011-01-01

    Full Text Available ABSTRACT: This essay studies the stylistic treatment of bullfighting in Hemingway's short stories and articles, starting from the assumption that some features of the narrative pattern of his fiction are connected to his vision of bullfighting. The topic would be incomplete, and almost incomprehensible, without first considering how his ideas about bullfighting were formed and evolved, so the first part of the essay gives an overview of how the American author understood bullfighting. The essay then turns to its central subject: a stylistic analysis of the bullfighting short stories "The Undefeated" and "The Capital of the World", followed by the identification of the main stylistic similarities and differences between these stories and the bullfighting chronicles signed by Hemingway.

  7. Synoptic, redactional, stylistic and narratological observations on the retelling of Mark 7:30 in Matthew 15:28

    Directory of Open Access Journals (Sweden)

    Peter-Ben A. Smit

    2014-10-01

    Full Text Available The Matthean redaction of Mark 7:30 in Matthew 15:28 often receives scant attention in scholarship in terms of its narrative quality. At most, it is regarded as a truncation of the full Marcan version, while all attention is given to Matthew’s introduction of the notion of ‘faith’ in this verse. This article argues, by contrast, and on the basis of a synoptic comparison and narratological analysis of both texts, that more justice is done to both versions of the conclusion of this healing miracle when understanding them as achieving different narrative effects, with Matthew focusing on the immediacy of the healing, while Mark creates suspense, thus focusing on the veracity of Jesus’ statement that the girl in question is healed.

  8. Synoptic, redactional, stylistic and narratological observations on the retelling of Mark 7:30 in Matthew 15:28

    Directory of Open Access Journals (Sweden)

    Peter-Ben A. Smit

    2014-02-01

    Full Text Available The Matthean redaction of Mark 7:30 in Matthew 15:28 often receives scant attention in scholarship in terms of its narrative quality. At most, it is regarded as a truncation of the full Marcan version, while all attention is given to Matthew’s introduction of the notion of ‘faith’ in this verse. This article argues, by contrast, and on the basis of a synoptic comparison and narratological analysis of both texts, that more justice is done to both versions of the conclusion of this healing miracle when understanding them as achieving different narrative effects, with Matthew focusing on the immediacy of the healing, while Mark creates suspense, thus focusing on the veracity of Jesus’ statement that the girl in question is healed.

  9. Combining multiple classifiers for age classification

    CSIR Research Space (South Africa)

    Van Heerden, C

    2009-11-01

    Full Text Available The authors compare several different classifier combination methods on a single task, namely speaker age classification. This task is well suited to combination strategies, since significantly different feature classes are employed. Support vector...

  10. Improving settlement type classification of aerial images

    CSIR Research Space (South Africa)

    Mdakane, L

    2014-10-01

    Full Text Available (multitemporal imagery) tend to exhibit pronounced viewing- and illumination geometry effects, which result in a poor generalization performance in settlement type classification tasks. The study investigated the influence of contrast in settlement type...

  11. Improving a College Writing Instruction: Developing L2 Learners' Awareness and Strategies to Attain Near-native Stylistic Skill

    Directory of Open Access Journals (Sweden)

    Siusana Kweldju

    2016-02-01

    Full Text Available This article reports on a one-cycle classroom action research project conducted with non-degree students in the three-year English for business program at Universitas Negeri Malang. Initially, the students were reluctant to improve their own writing skills. The purpose of this study was to optimize the Writing III course as part of the effort to develop a possible model of lexically-based instruction for college writing. Continuous consciousness-raising among the students was needed before this model could really optimize their learning.

  12. Classification, disease, and diagnosis.

    Science.gov (United States)

    Jutel, Annemarie

    2011-01-01

    Classification shapes medicine and guides its practice. Understanding classification must be part of the quest to better understand the social context and implications of diagnosis. Classifications are part of the human work that provides a foundation for the recognition and study of illness: deciding how the vast expanse of nature can be partitioned into meaningful chunks, stabilizing and structuring what is otherwise disordered. This article explores the aims of classification, their embodiment in medical diagnosis, and the historical traditions of medical classification. It provides a brief overview of the aims and principles of classification and their relevance to contemporary medicine. It also demonstrates how classifications operate as social framing devices that enable and disable communication, assert and refute authority, and are important items for sociological study.

  13. Fuzzy One-Class Classification Model Using Contamination Neighborhoods

    Directory of Open Access Journals (Sweden)

    Lev V. Utkin

    2012-01-01

    Full Text Available A fuzzy classification model is studied in the paper. It is based on the contaminated (robust) model, which produces fuzzy expected risk measures characterizing classification errors. Optimal classification parameters of the models are derived by minimizing the fuzzy expected risk. It is shown that an algorithm for computing the classification parameters is reduced to a set of standard support vector machine tasks with weighted data points. Experimental results with synthetic data illustrate the proposed fuzzy model.

  14. The Importance of Classification to Business Model Research

    Directory of Open Access Journals (Sweden)

    Susan Lambert

    2015-01-01

    Full Text Available Purpose: To bring to the fore the scientific significance of classification and its role in business model theory building. To propose a method by which existing classifications of business models can be analyzed and new ones developed. Design/Methodology/Approach: A review of the scholarly literature relevant to classifications of business models is presented along with a brief overview of classification theory applicable to business model research. Existing business model classifications are evaluated in terms of their propensity to contribute to theory building and a method for designing classifications schemes is proposed. Findings: Little attention has been paid to the rationale underlying the design of business model classifications and often there is no explicit consideration of the suitability of the classification for its intended purpose. Each classification contributes to the understanding of business models in practice but there is a dearth of taxonomical research that can facilitate progression of business model research towards theorizing. Originality/Value: This paper addresses the research element of classification that is largely overlooked yet is crucial for business model theory building. The nature of business model classifications is examined in the light of classification philosophies and a structured method of classification design is proposed. A case is made for the development of a general classification of business models that can facilitate the progression of business model research towards theory building.

  15. Automatic Text Summarization for Indonesian Language Using TextTeaser

    Science.gov (United States)

    Gunawan, D.; Pasaribu, A.; Rahmat, R. F.; Budiarto, R.

    2017-04-01

    Text summarization is one of the solutions to information overload. Reducing text without losing its meaning not only saves reading time but also maintains the reader's understanding. One of the many algorithms for summarizing text is TextTeaser. Originally, this algorithm was intended for text in English. However, because the TextTeaser algorithm does not consider the meaning of the text, we apply it to text in the Indonesian language. The algorithm calculates four elements: title feature, sentence length, sentence position and keyword frequency. We use TextRank, an unsupervised and language-independent text summarization algorithm, to evaluate the summaries produced by TextTeaser. The results show that the TextTeaser algorithm needs further improvement to obtain better accuracy.
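
    The four elements named above can be combined into a sentence score roughly as in the sketch below. It is an illustration only, not the TextTeaser source: the equal weighting of the four features, the ideal sentence length of 20 words, the exact formula for each feature, and all function names are assumptions.

```python
import re
from collections import Counter


def words(text):
    """Lowercase word tokens; no stemming or stop-word removal in this sketch."""
    return re.findall(r"\w+", text.lower())


def score_sentences(title, sentences, ideal_len=20):
    """Score each sentence on four features, averaged with equal weights (assumed)."""
    title_words = set(words(title))
    freq = Counter(w for s in sentences for w in words(s))
    top = max(freq.values(), default=1)
    scores = []
    for i, sentence in enumerate(sentences):
        toks = words(sentence)
        if not toks:
            scores.append(0.0)
            continue
        # Title feature: share of title words that appear in the sentence.
        title_f = len(title_words & set(toks)) / max(len(title_words), 1)
        # Sentence length: closeness to an assumed ideal length.
        length_f = 1 - min(abs(ideal_len - len(toks)) / ideal_len, 1)
        # Sentence position: earlier sentences score higher.
        position_f = 1 - i / len(sentences)
        # Keyword frequency: mean word frequency, normalized by the top frequency.
        keyword_f = sum(freq[w] for w in toks) / (top * len(toks))
        scores.append((title_f + length_f + position_f + keyword_f) / 4)
    return scores


def summarize(title, sentences, k=1):
    """Return the k best-scoring sentences in their original order."""
    scores = score_sentences(title, sentences)
    best = sorted(range(len(sentences)), key=lambda i: scores[i], reverse=True)[:k]
    return [sentences[i] for i in sorted(best)]


title = "Text summarization for Indonesian"
sentences = [
    "Text summarization reduces information overload.",
    "The weather was pleasant.",
    "Nobody noticed.",
]
scores = score_sentences(title, sentences)
summary = summarize(title, sentences, k=1)
```

    None of the four features looks at what the words mean, which is why the same scoring can be applied to Indonesian text once tokenization works for it.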

  16. Recursive heuristic classification

    Science.gov (United States)

    Wilkins, David C.

    1994-01-01

    The author will describe a new problem-solving approach called recursive heuristic classification, whereby a subproblem of heuristic classification is itself formulated and solved by heuristic classification. This allows the construction of more knowledge-intensive classification programs in a way that yields a clean organization. Further, standard knowledge acquisition and learning techniques for heuristic classification can be used to create, refine, and maintain the knowledge base associated with the recursively called classification expert system. The method of recursive heuristic classification was used in the Minerva blackboard shell for heuristic classification. Minerva recursively calls itself every problem-solving cycle to solve the important blackboard scheduler task, which involves assigning a desirability rating to alternative problem-solving actions. Knowing these ratings is critical to the use of an expert system as a component of a critiquing or apprenticeship tutoring system. One innovation of this research is a method called dynamic heuristic classification, which allows selection among dynamically generated classification categories instead of requiring them to be pre-enumerated.

  17. Security classification of information

    Energy Technology Data Exchange (ETDEWEB)

    Quist, A.S.

    1993-04-01

    This document is the second of a planned four-volume work that comprehensively discusses the security classification of information. The main focus of Volume 2 is on the principles for classification of information. Included herein are descriptions of the two major types of information that governments classify for national security reasons (subjective and objective information), guidance to use when determining whether information under consideration for classification is controlled by the government (a necessary requirement for classification to be effective), information disclosure risks and benefits (the benefits and costs of classification), standards to use when balancing information disclosure risks and benefits, guidance for assigning classification levels (Top Secret, Secret, or Confidential) to classified information, guidance for determining how long information should be classified (classification duration), classification of associations of information, classification of compilations of information, and principles for declassifying and downgrading information. Rules or principles of certain areas of our legal system (e.g., trade secret law) are sometimes mentioned to provide added support to some of those classification principles.

  18. Important Text Characteristics for Early-Grades Text Complexity

    Science.gov (United States)

    Fitzgerald, Jill; Elmore, Jeff; Koons, Heather; Hiebert, Elfrieda H.; Bowen, Kimberly; Sanford-Moore, Eleanor E.; Stenner, A. Jackson

    2015-01-01

    The Common Core set a standard for all children to read increasingly complex texts throughout schooling. The purpose of the present study was to explore text characteristics specifically in relation to early-grades text complexity. Three hundred fifty primary-grades texts were selected and digitized. Twenty-two text characteristics were identified…

  19. THE AESTHETICS OF HAEMOTAPHONOMY: A STUDY OF THE STYLISTIC PARALLELS BETWEEN A SCIENCE AND LITERATURE AND THE VISUAL ARTS

    Directory of Open Access Journals (Sweden)

    Policarp Hortolà

    2009-01-01

    Full Text Available This study aims to examine in depth the aesthetics of the science of Haemotaphonomy (HTN) by identifying its stylistic parallels with Literature and the Visual Arts. The object of study of this science is the cytomorphology of blood cells in bloodstains. Its subjects of study are blood-stained specimens, while its method of study is the analysis of chiaroscuro images obtained by Scanning Electron Microscopy. In literary terms, HTN is stylistically parallel to the aesthetics of Tremendismo and Decadentism. Visually, HTN is stylistically parallel to the aesthetics of Churrigueresque architectural decoration and pictorial Tenebrism. According to the results of this study, HTN can also be regarded stylistically as a literary Realism and a visual Naturalism. Moreover, since Tremendismo, Decadentism, Churriguerism and Tenebrism converge in the culture of the Baroque, it must be concluded that, aesthetically, HTN is a Baroque-style science.

  20. THE EMERGENCE AND EVOLUTION OF ARABESQUE AS A MULTICULTURAL STYLISTIC FUSION IN ISLAMIC ART: THE CASE OF TURKISH ARCHITECTURE

    Directory of Open Access Journals (Sweden)

    Murat Cetin, M. Arif Kamal

    2012-04-01

    Full Text Available This paper elucidates the emergence and evolution of arabesque with specific reference to the case of arabesque in Turkish art and architecture. It is argued here that arabesque is a fusion of styles rather than a pure and homogenous style. Furthermore, the paper aims to show that although the arabesque style appears to be a fanciful and freely organized manner of artistic treatment, it is based on a very complex mathematical logic which is expressed through abstractionism. In this context, the grammar of geometry is elaborately used in the implementation of abstraction. Here, general characteristics as well as different modes or types of arabesque are discussed. Starting with the etymological roots of the term, the history of its use in the literature is explored through the paper. After the inquiry of its material and pragmatic aspects, the development of the arabesque style is evaluated with regard to its transformations that took place along with its injection to Anatolia and mixing with Turkish culture. Finally, the morphological character of this fusion is put forward. Keywords: Morphological character, geometry, fusion of styles, arabesque

  1. Learning From Short Text Streams With Topic Drifts.

    Science.gov (United States)

    Li, Peipei; He, Lu; Wang, Haiyan; Hu, Xuegang; Zhang, Yuhong; Li, Lei; Wu, Xindong

    2017-09-18

    Short text streams such as search snippets and micro blogs have been popular on the Web with the emergence of social media. Unlike traditional text streams, these data are characterized by short length, weak signal, high volume, high velocity, topic drift, etc. Short text stream classification is hence a very challenging and significant task. However, this challenge has received little attention from the research community. Therefore, a new feature extension approach is proposed for short text stream classification with the help of a large-scale semantic network obtained from a Web corpus. It is built on an incremental ensemble classification model for efficiency. First, more semantic contexts based on the senses of terms in short texts are introduced to make up for the data sparsity using the open semantic network, in which all terms are disambiguated by their semantics to reduce the impact of noise. Second, a concept cluster-based topic drift detection method is proposed to track hidden topic drifts effectively. Finally, extensive studies demonstrate that, compared with several well-known concept drift detection methods for data streams, our approach detects topic drifts effectively, and that it handles short text streams effectively while maintaining efficiency compared with several state-of-the-art short text classification approaches.

  2. The Study of Figurative Languages using Stylistics Theory in What My Mother Doesn’t Know by Sonya Sones

    Directory of Open Access Journals (Sweden)

    Winnie Winnie

    2008-11-01

    Full Text Available What My Mother Doesn’t Know is a novel-in-verse by Sonya Sones about a teenage girl named Sophie, her search for her Mr. Right, and her bad relationship with her parents. In the end, Sophie finds her Mr. Right, who is not actually her type of boyfriend but makes her feel happy, and even though she hates her parents, she realizes that she loves them nevertheless. The purpose of this article is to prove that the author’s use of style can reveal the themes of the novel-in-verse. The analysis focuses mainly on the themes and on the figurative language of simile, metaphor, personification, paradox, and hyperbole. The research uses a qualitative method in the form of library research, together with statistics to count how many poems use the five kinds of figurative language and are related to the themes. In the analysis, the author’s style is figurative language, and three themes can be drawn from the novel-in-verse: an overwhelmed feeling does not show the true feeling; the bond of family will always be there, no matter how hateful we are to our family; and love is not determined by someone’s physical appearance. The result shows that 91.36 percent of the poems that use the five kinds of figurative language are related to the themes. It can be concluded that the author’s style in writing the novel-in-verse can reveal the three themes.

  3. Improving text recognition by distinguishing scene and overlay text

    Science.gov (United States)

    Quehl, Bernhard; Yang, Haojin; Sack, Harald

    2015-02-01

    Video texts are closely related to the content of a video. They provide a valuable source for indexing and interpretation of video data. Text detection and recognition tasks in images or videos typically distinguish between overlay and scene text. Overlay text is artificially superimposed on the image at the time of editing, and scene text is text captured by the recording system. Typically, OCR systems are specialized for one type of text. However, in video images both types of text can be found. In this paper, we propose a method to automatically distinguish between overlay and scene text in order to dynamically control and optimize the post-processing steps following text detection. Based on a feature combination, a Support Vector Machine (SVM) is trained to classify scene and overlay text. We show how this distinction between overlay and scene text improves the word recognition rate. The accuracy of the proposed methods has been evaluated using publicly available test data sets.

  4. Statistical classification of images

    OpenAIRE

    Giuliodori, María Andrea

    2011-01-01

    Image classification is a burgeoning field of study. Despite the advances achieved in this field, there is no general agreement about which methods are the most effective for the classification of digital images. This dissertation contributes to this line of research by developing different statistical methods aimed at classifying digital images. In Chapter 1 we introduce basic concepts of image classification and review some results and methodologies proposed previously in the literature. In Chap...

  5. Text genres and registers the computation of linguistic features

    CERN Document Server

    Fang, Chengyu Alex

    2015-01-01

    This book is a description of some of the most recent advances in text classification as part of a concerted effort to achieve computer understanding of human language. In particular, it addresses state-of-the-art developments in the computation of higher-level linguistic features, ranging from etymology to grammar and syntax for the practical task of text classification according to genres, registers and subject domains. Serving as a bridge between computational methods and sophisticated linguistic analysis, this book will be of particular interest to academics and students of computational linguistics as well as professionals in natural language engineering.

  6. Definition and classification of epilepsy. Classification of epileptic seizures 2016

    Directory of Open Access Journals (Sweden)

    K. Yu. Mukhin

    2017-01-01

    Full Text Available Epilepsy is one of the most common neurological diseases, especially in childhood and adolescence. The incidence varies from 15 to 113 cases per 100 000 population, with the maximum among children under 1 year old. The prevalence of epilepsy is high, ranging from 5 to 8 cases (in some regions – 10 cases) per 1000 children under 15 years old. Classification of the disease has great importance for diagnosis, treatment and prognosis. The article presents a novel strategy for classification of epileptic seizures, developed in 2016. It contains a number of brand new concepts, including a very important one: some seizures previously considered as generalized or focal only can, in fact, be both focal and generalized. They include tonic, atonic, and myoclonic seizures and epileptic spasms. The term “secondarily generalized seizure” is replaced by the term “bilateral tonic-clonic seizure” (since it is not a separate type of epileptic seizure, and the term reflects the spread of discharge from any area of the cerebral cortex and the evolution of any type of focal seizure). The International League Against Epilepsy recommends abandoning the term “pseudo-epileptic seizures” and replacing it with the term “psychogenic non-epileptic seizures”. If a doctor is not sure that seizures are epileptic in nature, the term “paroxysmal event” should be used without specifying the disease. The concept of childhood epileptic encephalopathies, developed within this novel classification project, is one of the most significant achievements, since in this case not only the seizures but even epileptiform activity can induce severe disorders of higher mental functions. In addition to a detailed description of the new strategy for classification of epileptic seizures, the article contains a comprehensive review of the existing principles of classification of epilepsy and epileptic seizures.

  7. Text analysis methods, text analysis apparatuses, and articles of manufacture

    Science.gov (United States)

    Whitney, Paul D; Willse, Alan R; Lopresti, Charles A; White, Amanda M

    2014-10-28

    Text analysis methods, text analysis apparatuses, and articles of manufacture are described according to some aspects. In one aspect, a text analysis method includes accessing information indicative of data content of a collection of text comprising a plurality of different topics, using a computing device, analyzing the information indicative of the data content, and using results of the analysis, identifying a presence of a new topic in the collection of text.

  8. Classroom Texting in College Students

    Science.gov (United States)

    Pettijohn, Terry F.; Frazier, Erik; Rieser, Elizabeth; Vaughn, Nicholas; Hupp-Wilds, Bobbi

    2015-01-01

    A 21-item survey on texting in the classroom was given to 235 college students. Overall, 99.6% of students owned a cellphone and 98% texted daily. Of the 138 students who texted in the classroom, most texted friends or significant others, and indicated that the reason for classroom texting was boredom or work. Students who texted sent a mean of 12.21…

  9. The language of poetic texts in contemporary Tuvan pop songs

    Directory of Open Access Journals (Sweden)

    Oyumaa M. Saaya

    2017-06-01

    Full Text Available The article presents a linguistic analysis of the lyrics of modern Tuvan pop songs. While studying them is important for understanding contemporary songwriting in Tuva, it is also necessary to discover what linguistic means, functional styles and vocabulary are used by modern authors of popular lyrics. The study can also help identify how contemporary global trends influence songwriting in linguistic terms. Three groups of songs can be defined in Tuvan pop music. The first comprises songs written by both professional poets and amateurs with good writing skills. Their texts have a homogeneous literary style and are intended for a general audience (rather than specific groups of listeners). They do not feature any jargon or youth slang. The second group consists of “songs of the people” which are still popular and relevant, but not classified as folklore. This group also contains songs previously banned by censorship, and those written by ex-convicts. Their lyrics differ in style, and the vocabulary is also heterogeneous: they can include slang and contain vernacular language. The third group includes songs following popular global and Russian trends, which triggered rapid evolution in Tuvan songwriting. There is a significant number of authors, or even creative unions, who write both lyrics and music. These songs are stylistically uneven and contain many neologisms, borrowed vocabulary, slang and jargon words, and sometimes even macaronic (mixed) language. The author provides a more in-depth analysis of lyrics belonging to the third group of songs. They can be divided into 6 thematic subgroups which vary greatly in lexical content and the use of tropes. The lyrics of contemporary Tuvan songs are quite close to the everyday language young people use. Active use of jargon in the language of young and middle-aged people, especially in the lyrics of modern songs, steadily erodes the literary norms of the Tuvan language. The author emphasizes that

  10. MT Post-editing: A Text Repair Experience for the Foreign Language Class.

    Directory of Open Access Journals (Sweden)

    Ana Niño

    2007-04-01

    Full Text Available Communication also means having to sort out the problems involved in learning a foreign language, especially with regard to production rather than reception. These learning strategies or skills can also be applied to translation teaching methodology, where students put into practice their risk-taking, avoidance, reduction and/or compensatory strategies in getting the message across. We acknowledge translation as a writing task constrained by the source text. In addition, the translation and the writing cycles have in common a generation stage and a revision stage where grammatical, lexical and stylistic correctness is assessed. Somewhere in the middle between translation and writing skills lies MT (Machine Translation) post-editing, which involves correcting the raw MT output with the aim of providing a quality text according to the intended purpose. Our research is intended to test the suitability of MT post-editing as an activity to promote error correction and, subsequently, to enhance written production in second and foreign language teaching.

  11. Transcending the Versification of Oraliture: Song-Text as Oral Performance among the Ilaje

    Directory of Open Access Journals (Sweden)

    N. Akingbe

    2013-12-01

    Full Text Available Oraliture is a term often employed to describe the various genres of oral literature such as proverbs, legends, short stories, traditional songs and rhymes, song-poems, historical narratives, traditional symbols, images, oral performance, myths and other traditional stylistic devices. All these devices constitute vibrant appurtenances of oral narrative performance in Africa. Oral narrative performance is invariably situated within the domain of social communication, which brings together the raconteur/performer and the audience towards the realisation of communal entertainment. While the narrator/performer plays the leading role in an oral performance, the audience’s involvement and participation is realised through song, verbal/choral responses, gestures and/or instrumental/musical accompaniment. This oral practice usually takes place at one time or another in various African communities during festivals and ritual/religious processions, and ranges from storytelling and recitation of poems to song-text and dancing. This paper is essentially concerned with illustrating the use of song-text as oral performance among the Ilaje, a burgeoning coastal subethnic group of the Yoruba in South Western Nigeria. The paper will further examine how patriotism, history, death and anti-social behaviours are evaluated through the use of songs among the Ilaje.

  12. Classification System of Pathological Voices Using Correntropy

    Directory of Open Access Journals (Sweden)

    Aluisio I. R. Fontes

    2014-01-01

    Full Text Available This paper proposes the use of a similarity measure based on information theory, called correntropy, for the automatic classification of pathological voices. By using correntropy, it is possible to obtain descriptors that aggregate distinct spectral characteristics for healthy and pathological voices. Experiments using computational simulation demonstrate that such descriptors are very efficient in the characterization of vocal dysfunctions, leading to a success rate of 97% in the classification. With this new architecture, the classification process for vocal pathologies becomes much simpler and more efficient.
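
    As a rough illustration of the similarity measure named in this abstract, the sketch below computes the usual sample estimator of correntropy with a Gaussian kernel, i.e. the mean of the kernel evaluated on the pointwise differences of two signals. The function name, the `sigma` default and the toy signals are assumptions made for illustration; the abstract does not say how the paper derives its voice descriptors from correntropy.

```python
import math


def correntropy(x, y, sigma=1.0):
    """Sample estimator of correntropy between two equal-length sequences.

    Uses a Gaussian kernel of bandwidth `sigma` (an assumed choice):
    V(x, y) = (1/N) * sum_i G_sigma(x_i - y_i).
    """
    if len(x) != len(y) or len(x) == 0:
        raise ValueError("x and y must be non-empty and of equal length")
    norm = 1.0 / (math.sqrt(2.0 * math.pi) * sigma)
    total = sum(
        math.exp(-((a - b) ** 2) / (2.0 * sigma ** 2)) for a, b in zip(x, y)
    )
    return norm * total / len(x)


# Hypothetical toy signals, not data from the paper.
reference = [0.0, 0.5, 1.0, 0.5, 0.0]
similar = [0.1, 0.4, 1.1, 0.5, -0.1]
different = [2.0, -1.5, 3.0, -2.0, 2.5]

peak = correntropy(reference, reference)        # kernel peak, 1 / sqrt(2*pi) for sigma = 1
close_score = correntropy(reference, similar)   # slightly below the peak
far_score = correntropy(reference, different)   # much smaller
```

    Because the kernel decays quickly with the size of each difference, large isolated deviations contribute little to the score, which is one reason the measure is often described as robust to outliers.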

  13. Emotion models for textual emotion classification

    Science.gov (United States)

    Bruna, O.; Avetisyan, H.; Holub, J.

    2016-11-01

    This paper deals with textual emotion classification, which has gained attention in recent years. Emotion classification is used in user experience, product evaluation, national security, and tutoring applications. It attempts to detect the emotional content in the input text and, based on different approaches, to establish what kind of emotional content is present, if any. Textual emotion classification is the most difficult to handle, since it relies mainly on linguistic resources and introduces many challenges in assigning text to an emotion represented by a proper model. A crucial part of each emotion detector is the emotion model. The focus of this paper is to introduce emotion models used for classification. Categorical and dimensional models of emotion are explained and some more advanced approaches are mentioned.

  14. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    Science.gov (United States)

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  15. On the introduction of secondary fingerprint classification

    CSIR Research Space (South Africa)

    Msiza, IS

    2011-07-01

    Full Text Available improvement of small-scale applications such as access control systems and attendance registers of small to medium-sized institutions. They, however, may not be sufficient in the performance improvement of large-scale applications such as national Automatic... Fingerprint Identification Systems (AFIS). In order to enforce visible performance improvement on such large-scale applications, this chapter introduces a two-stage classification system, by taking advantage of the extensibility of the classification rules...

  16. Text as Statistical Mechanics Object

    OpenAIRE

    Koroutchev, K.; Korutcheva, E.

    2008-01-01

    In this article we present a model of human written text based on a statistical mechanics approach, deriving the potential energy for different parts of the text using a large text corpus. We have checked the results numerically and found that the specific heat parameter effectively separates the closed-class words from the specific terms used in the text.

  17. Text analysis devices, articles of manufacture, and text analysis methods

    Science.gov (United States)

    Turner, Alan E; Hetzler, Elizabeth G; Nakamura, Grant C

    2013-05-28

    Text analysis devices, articles of manufacture, and text analysis methods are described according to some aspects. In one aspect, a text analysis device includes processing circuitry configured to analyze initial text to generate a measurement basis usable in analysis of subsequent text, wherein the measurement basis comprises a plurality of measurement features from the initial text, a plurality of dimension anchors from the initial text and a plurality of associations of the measurement features with the dimension anchors, and wherein the processing circuitry is configured to access a viewpoint indicative of a perspective of interest of a user with respect to the analysis of the subsequent text, and wherein the processing circuitry is configured to use the viewpoint to generate the measurement basis.

  18. Library Classification 2020

    Science.gov (United States)

    Harris, Christopher

    2013-01-01

    In this article the author explores how a new library classification system might be designed using some aspects of the Dewey Decimal Classification (DDC) and ideas from other systems to create something that works for school libraries in the year 2020. By examining what works well with the Dewey Decimal System, what features should be carried…

  19. Linear Classification Functions.

    Science.gov (United States)

    Huberty, Carl J.; Smith, Jerry D.

    Linear classification functions (LCFs) arise in a predictive discriminant analysis for the purpose of classifying experimental units into criterion groups. The relative contribution of the response variables to classification accuracy may be based on LCF-variable correlations for each group. It is proved that, if the raw response measures are…

  20. Supported eText: Assistive Technology through Text Transformations

    Science.gov (United States)

    Anderson-Inman, Lynne; Horney, Mark A.

    2007-01-01

    To gain meaningful access to the curriculum, students with reading difficulties must overcome substantial barriers imposed by the printed materials they are asked to read. Technology can help students overcome these challenges by enabling a shift from printed text to electronic text. Electronic text here means textual material read using a…

  1. Stylistic Embedding in Yoruba Literature.

    Science.gov (United States)

    Olabode, Afolabi

    The process of embedding, a term used in generative grammar to refer to a construction in which a sentence is included within another sentence, is examined as it occurs in Yoruba literature. Examples are drawn from Yoruba praise poetry, in both written and oral form and within Yoruba novels. Forms of embedding identified include those to draw…

  2. A Road to Aesthetic Stylistics

    Science.gov (United States)

    Al-Sheikh, Samir

    2016-01-01

    Being a linguistic phenomenon, poetry is marked by the defamiliarization of language: in a poetic discourse there is an "aesthetic distortion" of the normal codes, in which the aesthetic value is the most prominent function of the poetic texture. This study is a new adventure in correlating linguistics to aesthetics by and through the…

  3. Saving our science from ourselves: the plight of biological classification

    Directory of Open Access Journals (Sweden)

    Malte C. Ebach

    2011-06-01

    Full Text Available Biological classification (nomenclature, taxonomy, and systematics) is being sold short. The desire for new technologies and for faster and cheaper taxonomic descriptions, identifications, and revisions is symptomatic of a lack of appreciation and understanding of classification. The problem of gadget-driven science, a lack of best practice and the inability to accept classification as a descriptive and empirical science are discussed. The worst-case scenario is a future in which classifications are purely artificial and uninformative.

  4. Challenges in prosthesis classification.

    Science.gov (United States)

    Robertsson, Otto; Mendenhall, Stan; Paxton, Elizabeth W; Inacio, Maria C S; Graves, Stephen

    2011-12-21

    Accurate prosthesis classification is critical for total joint arthroplasty surveillance and assessment of comparative effectiveness. Historically, prosthesis classification was based solely on the names of the prosthesis manufacturers. As a result, prosthesis designs changed without corresponding name changes, and other prostheses' names changed over time without substantial design modifications. As the number of prostheses used in total joint arthroplasty on the market increased, catalog and lot numbers associated with prosthesis descriptions were introduced by manufacturers. Currently, these catalog and lot numbers are not standardized, and there is no consensus on categorization of these numbers into brands or subbrands. Classification of the attributes of a prosthesis also varies, limiting comparisons of prostheses across studies and reports. The development of a universal prosthesis classification system would standardize prosthesis classification and enhance total joint arthroplasty research collaboration worldwide. This is a current area of focus for the International Consortium of Orthopaedic Registries (ICOR).

  5. Text mining from ontology learning to automated text processing applications

    CERN Document Server

    Biemann, Chris

    2014-01-01

    This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects

  6. Classification of anemia for gastroenterologists.

    Science.gov (United States)

    Moreno Chulilla, Jose Antonio; Romero Colás, Maria Soledad; Gutiérrez Martín, Martín

    2009-10-07

    Most anemia is related to the digestive system by dietary deficiency, malabsorption, or chronic bleeding. We review the World Health Organization definition of anemia, its morphological classification (microcytic, macrocytic and normocytic) and pathogenic classification (regenerative and hyporegenerative), and integration of these classifications. Interpretation of laboratory tests is included, from the simplest (blood count, routine biochemistry) to the more specific (iron metabolism, vitamin B12, folic acid, reticulocytes, erythropoietin, bone marrow examination and Schilling test). In the text and various algorithms, we propose a hierarchical and logical way to reach a diagnosis as quickly as possible, by properly managing the medical interview, physical examination, appropriate laboratory tests, bone marrow examination, and other complementary tests. Prevalence is emphasized in all sections so that the gastroenterologist can direct the diagnosis to the most common diseases, although the tables also include rare diseases. Digestive diseases potentially causing anemia have been given priority, but other causes of anemia have been included in the text and tables. Primary hematological diseases that cause anemia are only listed and are not discussed in depth. The last section is dedicated to simplifying all items discussed above, using practical rules to guide diagnosis and medical care with the greatest economy of resources and time.

  7. Working with text tools, techniques and approaches for text mining

    CERN Document Server

    Tourte, Gregory J L

    2016-01-01

    Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTeM (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...

  8. Odor Classification using Agent Technology

    Directory of Open Access Journals (Sweden)

    Sigeru OMATU

    2014-03-01

    Full Text Available In order to measure and classify odors, a Quartz Crystal Microbalance (QCM) can be used. In the present study, seven QCM sensors and three different odors are used. The system has been developed as a virtual organization of agents using an agent platform called PANGEA (Platform for Automatic coNstruction of orGanizations of intElligent Agents). This is a platform for developing open multi-agent systems, specifically those including organizational aspects. The main reason for the use of agents is the scalability of the platform, i.e. the way in which it models the services. The system models functionalities as services inside the agents, or as Service Oriented Approach (SOA) architecture compliant services using Web Services. In this way, the odor classification system can be adapted with new algorithms, tools and classification techniques.

  9. Kappa Coefficients for Circular Classifications

    NARCIS (Netherlands)

    Warrens, Matthijs J.; Pratiwi, Bunga C.

    2016-01-01

    Circular classifications are classification scales with categories that exhibit a certain periodicity. Since linear scales have endpoints, the standard weighted kappas used for linear scales are not appropriate for analyzing agreement between two circular classifications. A family of kappa
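
    To make the point about endpoints concrete, here is a minimal sketch of a weighted kappa computed from an agreement table, once with the standard linear disagreement weights and once with a wrap-around ("circular") distance. The circular weight and the toy table are illustrative assumptions; the abstract is truncated before it describes the family of kappas the authors actually propose.

```python
def weighted_kappa(table, weight):
    """Weighted kappa from a square agreement table of counts.

    `weight(i, j, k)` returns the disagreement weight between categories
    i and j on a k-category scale (0 means full agreement).
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    row_tot = [sum(row) for row in table]
    col_tot = [sum(table[i][j] for i in range(k)) for j in range(k)]
    observed = sum(
        weight(i, j, k) * table[i][j] for i in range(k) for j in range(k)
    ) / n
    expected = sum(
        weight(i, j, k) * row_tot[i] * col_tot[j]
        for i in range(k) for j in range(k)
    ) / (n * n)
    return 1.0 - observed / expected


def linear_weight(i, j, k):
    """Standard linear weight: categories 0 and k-1 are maximally distant."""
    return abs(i - j)


def circular_weight(i, j, k):
    """Assumed wrap-around distance: categories 0 and k-1 are neighbours."""
    d = abs(i - j)
    return min(d, k - d)


# Hypothetical 4-category table in which the two raters disagree only
# between the first and last category (neighbours on a circular scale).
table = [
    [10, 0, 0, 5],
    [0, 10, 0, 0],
    [0, 0, 10, 0],
    [5, 0, 0, 10],
]

kappa_linear = weighted_kappa(table, linear_weight)
kappa_circular = weighted_kappa(table, circular_weight)
```

    With linear weights the first-versus-last disagreements are penalized as the largest possible errors, whereas the wrap-around weight treats them as adjacent categories and yields a higher agreement value for the same table.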

  10. Classifications of patterned hair loss: a review

    Directory of Open Access Journals (Sweden)

    Mrinal Gupta

    2016-01-01

    Full Text Available Patterned hair loss is the most common cause of hair loss seen in both the sexes after puberty. Numerous classification systems have been proposed by various researchers for grading purposes. These systems vary from the simpler systems based on recession of the hairline to the more advanced multifactorial systems based on the morphological and dynamic parameters that affect the scalp and the hair itself. Most of these preexisting systems have certain limitations. Currently, the Hamilton-Norwood classification system for males and the Ludwig system for females are most commonly used to describe patterns of hair loss. In this article, we review the various classification systems for patterned hair loss in both the sexes. Relevant articles were identified through searches of MEDLINE and EMBASE. Search terms included but were not limited to androgenic alopecia classification, patterned hair loss classification, male pattern baldness classification, and female pattern hair loss classification. Further publications were identified from the reference lists of the reviewed articles.

  11. LC Classification as linked data (La LC classification come linked data)

    Directory of Open Access Journals (Sweden)

    Kevin Ford

    2013-01-01

    Full Text Available In 2009 and in 2011, the Library of Congress made two of its largest authority files – Subject Headings and Names – available as linked data via LC’s Linked Data Service, ID.LOC.GOV. Both are offered in MADS/RDF and SKOS. It is LC’s objective, in 2012, to publish another of its largest authority files as linked data: LC Classification. Whereas the source records for Subject Headings and Names are encoded in the MARC Authority format, from which there is a relatively straightforward mapping to MADS/RDF and SKOS, LC Classification records rely on the MARC Classification format. Mapping from LC Classification to MADS/RDF or SKOS has been a little more challenging. For example, records that represent classification ranges, which are not Concepts intended to be assigned, are not easily accommodated in SKOS. This presents additional problems when needing to accurately represent the relationships in RDF for LC Classification. With comparison to the publication of LCSH and Names at ID.LOC.GOV, this paper will examine issues encountered – and how those challenges were addressed – during the conversion of LC Classification to MADS/RDF and SKOS for release as linked data at ID.LOC.GOV.

  12. Understanding about the classification of pulp inflammation

    Directory of Open Access Journals (Sweden)

    Trijoedani Widodo

    2007-03-01

    Full Text Available Although most authors now use the classification of reversible and irreversible pulpitis, many dentists still do not apply it. A descriptive study was conducted by administering a questionnaire to 22 dentists from various dental clinics. All respondents used a diagnosis sheet when examining patients; nonetheless, it could not be determined which diagnosis card was used, and most of the dentists were still using the old classification. Regarding the new classification: (a) 36.3% had heard of it, but it was not clear to them; (b) 63.6% had never heard of it at all. Regarding whether new developments should be followed up: (a) 27.2% considered information about new developments very important; (b) 68.3% considered it important; (c) 8% considered it not important. It is concluded that information about developments in the classification of pulp inflammation did not reach the dentists.

  13. Modified Angle's classification for primary dentition

    Directory of Open Access Journals (Sweden)

    Kaushik Narendra Chandranee

    2017-01-01

    Full Text Available Aim: This study aims to propose a modification of Angle's classification for primary dentition and to assess its applicability in children from Nagpur, Central India. Methods: A modification of Angle's classification is proposed for application in primary dentition. Lowercase Roman numerals i/ii/iii are used as primary dentition notation to represent Angle's Class I/II/III molar relationships of the permanent dentition, respectively. To assess the applicability of the modified Angle's classification, a cross-sectional sample of 2000 preschool children, 3–6 years of age and residing in the metropolitan city of Nagpur, Maharashtra state, central India, was selected randomly as per the inclusion and exclusion criteria. Results: The majority of children (93.35%) had a bilateral Class i molar relationship, followed by 2.5% with bilateral Class ii and 0.2% with bilateral half-cusp Class iii molar relationships as per the modified Angle's classification for primary dentition. About 3.75% of children had various combinations of Class ii relationships, and 0.2% had a Class iii subdivision relationship. Conclusions: A modification of Angle's classification for application in primary dentition has been proposed. A cross-sectional investigation using the new classification revealed 6.25% Class ii and 0.4% Class iii molar relationship cases among preschool children in the metropolitan city of Nagpur. Application of the modified Angle's classification to other population groups is warranted to validate its routine use in clinical pediatric dentistry.
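
    The totals quoted in the Conclusions appear to follow from adding the percentages in the Results. The short check below is only a reader's sketch of that arithmetic; the variable names are assumptions, and the grouping of "combinations" and "subdivision" cases under Class ii and Class iii is inferred from the wording of the abstract.

```python
from decimal import Decimal

# Percentages as reported in the Results of the abstract.
bilateral_class_i = Decimal("93.35")
bilateral_class_ii = Decimal("2.5")
class_ii_combinations = Decimal("3.75")
bilateral_class_iii = Decimal("0.2")
class_iii_subdivision = Decimal("0.2")

# Totals as they seem to be aggregated in the Conclusions.
class_ii_total = bilateral_class_ii + class_ii_combinations    # 6.25
class_iii_total = bilateral_class_iii + class_iii_subdivision  # 0.4

# All reported categories together account for the whole sample.
overall = bilateral_class_i + class_ii_total + class_iii_total  # 100.00
```

    Decimal arithmetic is used so that the sums match the reported figures exactly, without floating-point rounding.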

  14. Text Association Analysis and Ambiguity in Text Mining

    Science.gov (United States)

    Bhonde, S. B.; Paikrao, R. L.; Rahane, K. U.

    2010-11-01

    Text Mining is the process of analyzing a semantically rich document or set of documents to understand the content and meaning of the information they contain. Research in Text Mining will enhance humans' ability to process massive quantities of information, and it has high commercial value. First, the paper introduces and defines Text Mining (TM), then gives an overview of the text mining process and its applications. Up to now, little text mining research, especially in concept/entity extraction, has focused on the ambiguity problem. This paper addresses ambiguity issues in natural language texts and presents a new technique for resolving the ambiguity problem when extracting concepts/entities from texts. Finally, it shows the importance of TM in knowledge discovery and highlights the upcoming challenges of document mining and the opportunities it offers.

  15. The Only Safe SMS Texting Is No SMS Texting.

    Science.gov (United States)

    Toth, Cheryl; Sacopulos, Michael J

    2015-01-01

    Many physicians and practice staff use short messaging service (SMS) text messaging to communicate with patients. But SMS text messaging is unencrypted, insecure, and does not meet HIPAA requirements. In addition, the short and abbreviated nature of text messages creates opportunities for misinterpretation, and can negatively impact patient safety and care. Until recently, asking patients to sign a statement that they understand and accept these risks--as well as having policies, device encryption, and cyber insurance in place--would have been enough to mitigate the risk of using SMS text in a medical practice. But new trends and policies have made SMS text messaging unsafe under any circumstance. This article explains these trends and policies, as well as why only secure texting or secure messaging should be used for physician-patient communication.

  16. A 38 Million Words Dutch Text Corpus and its Users

    African Journals Online (AJOL)

    the field of lexicography but also in computational linguistics and language technology. Consequently ... present paper reports on the characteristics of this corpus (design, text classification, linguistic annotation) and on its use, ..... the basis of the title of the book or information in the front or back matter (cf. Dutilh and Kruyt ...

  17. A concept-based approach to text categorization

    NARCIS (Netherlands)

    Schijvenaars, B.J.A.; Schuemie, M.J.; Mulligen, E.M. van; Weeber, M.; Jelier, R.; Mons, B.; Kors, J.A.; Kraaij, W.

    2005-01-01

    The Biosemantics group (Erasmus University Medical Center, Rotterdam) participated in the text categorization task of the Genomics Track. We followed a thesaurus-based approach, using the Collexis indexing system, in combination with a simple classification algorithm to assign a document to one of

  18. Multilingual Text Analysis for Text-to-Speech Synthesis

    CERN Document Server

    Sproat, R

    1996-01-01

    We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, the model has been applied to eight languages: Spanish, Italian, Romanian, French, German, Russian, Mandarin and Japanese.

  19. Multilingual Text Analysis for Text-to-Speech Synthesis

    OpenAIRE

    Sproat, Richard

    1996-01-01

    We present a model of text analysis for text-to-speech (TTS) synthesis based on (weighted) finite-state transducers, which serves as the text-analysis module of the multilingual Bell Labs TTS system. The transducers are constructed using a lexical toolkit that allows declarative descriptions of lexicons, morphological rules, numeral-expansion rules, and phonological rules, inter alia. To date, the model has been applied to eight languages: Spanish, Italian, Romanian, French, German, Russian, ...

  20. Incremental semantics for propositional texts

    NARCIS (Netherlands)

    Vermeulen, C.F.M.

    In this paper we are concerned with the special requirements that a semantics of texts should meet. It is argued that a semantics of texts should be incremental and should satisfy the break in principle. We develop a semantics for propositional texts that satisfies these constraints. We will see

  1. Knowledge Representation in Travelling Texts

    DEFF Research Database (Denmark)

    Mousten, Birthe; Locmele, Gunta

    2014-01-01

    and the purpose of the text in a new context as well as on predefined parameters for text travel. For texts used in marketing and in technology, the question is whether culture-bound knowledge representation should be domesticated or kept as foreign elements, or should be mirrored or moulded—or should not travel...

  2. Monitoring interaction and collective text production through text mining

    Directory of Open Access Journals (Sweden)

    Macedo, Alexandra Lorandi

    2014-04-01

    Full Text Available This article presents the Concepts Network tool, developed using text mining technology. The main objective of this tool is to extract and relate terms of greatest incidence from a text and exhibit the results in the form of a graph. The Network was implemented in the Collective Text Editor (CTE which is an online tool that allows the production of texts in synchronized or non-synchronized forms. This article describes the application of the Network both in texts produced collectively and texts produced in a forum. The purpose of the tool is to offer support to the teacher in managing the high volume of data generated in the process of interaction amongst students and in the construction of the text. Specifically, the aim is to facilitate the teacher’s job by allowing him/her to process data in a shorter time than is currently demanded. The results suggest that the Concepts Network can aid the teacher, as it provides indicators of the quality of the text produced. Moreover, messages posted in forums can be analyzed without their content necessarily having to be pre-read.
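
    The idea of extracting the terms of greatest incidence and relating them in a graph can be sketched as follows. This is only an illustration: the abstract does not say how the Concepts Network relates terms, so sentence-level co-occurrence is an assumption here, and the names `concept_network` and `STOP_WORDS` are hypothetical.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical stop-word list; a real tool would use a fuller, language-specific one.
STOP_WORDS = {"the", "a", "an", "of", "and", "in", "to", "is", "are", "with"}

def concept_network(text, top_n=3):
    """Return (top_terms, edges).

    top_terms: the top_n most frequent non-stop-word terms.
    edges: Counter mapping a sorted pair of top terms to the number of
    sentences in which both terms occur (assumed relation: co-occurrence).
    """
    sentences = [s for s in re.split(r"[.!?]+", text.lower()) if s.strip()]
    tokenized = [
        [w for w in re.findall(r"[a-z]+", s) if w not in STOP_WORDS]
        for s in sentences
    ]
    counts = Counter(w for words in tokenized for w in words)
    top_terms = [term for term, _ in counts.most_common(top_n)]
    edges = Counter()
    for words in tokenized:
        present = sorted(set(words) & set(top_terms))
        for a, b in combinations(present, 2):
            edges[(a, b)] += 1
    return top_terms, edges

# Invented sample text, not data from the article.
sample = (
    "Students write the text together. "
    "The teacher reads the text. "
    "Students ask the teacher about the text."
)
terms, edges = concept_network(sample)
print(terms)
print(dict(edges))
```

    The edge weights could then be passed to any graph-drawing tool to show which frequent terms tend to appear together.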

  3. Predicting Prosody from Text for Text-to-Speech Synthesis

    CERN Document Server

    Rao, K Sreenivasa

    2012-01-01

    Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications. Author K. Sreenivasa Rao discusses proposed methods along with state-of-the-art techniques for the acquisition and incorporation of prosodic knowledge for developing speech systems. Positional, contextual and phonological features are proposed for representing the linguistic and production constraints of the sound units present in the text. This book is intended for graduate students and researchers working in the area of speech processing.

  4. ParaText : scalable text modeling and analysis.

    Energy Technology Data Exchange (ETDEWEB)

    Dunlavy, Daniel M.; Stanton, Eric T.; Shead, Timothy M.

    2010-06-01

    Automated processing, modeling, and analysis of unstructured text (news documents, web content, journal articles, etc.) is a key task in many data analysis and decision making applications. As data sizes grow, scalability is essential for deep analysis. In many cases, documents are modeled as term or feature vectors and latent semantic analysis (LSA) is used to model latent, or hidden, relationships between documents and terms appearing in those documents. LSA supplies conceptual organization and analysis of document collections by modeling high-dimension feature vectors in many fewer dimensions. While past work on the scalability of LSA modeling has focused on the SVD, the goal of our work is to investigate the use of distributed memory architectures for the entire text analysis process, from data ingestion to semantic modeling and analysis. ParaText is a set of software components for distributed processing, modeling, and analysis of unstructured text. The ParaText source code is available under a BSD license, as an integral part of the Titan toolkit. ParaText components are chained-together into data-parallel pipelines that are replicated across processes on distributed-memory architectures. Individual components can be replaced or rewired to explore different computational strategies and implement new functionality. ParaText functionality can be embedded in applications on any platform using the native C++ API, Python, or Java. The ParaText MPI Process provides a 'generic' text analysis pipeline in a command-line executable that can be used for many serial and parallel analysis tasks. ParaText can also be deployed as a web service accessible via a RESTful (HTTP) API. In the web service configuration, any client can access the functionality provided by ParaText using commodity protocols ... from standard web browsers to custom clients written in any language.

  5. SparkText: Biomedical Text Mining on Big Data Framework.

    Directory of Open Access Journals (Sweden)

    Zhan Ye

    Full Text Available Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.
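
    As a rough illustration of one of the classifiers named above, the sketch below implements a small multinomial Naïve Bayes text classifier with add-one smoothing in plain Python. It is not SparkText's code and does not use Apache Spark; the function names `train_nb` and `predict_nb` and the toy snippets are invented for the example.

```python
import math
from collections import Counter, defaultdict

def train_nb(docs):
    """docs: list of (text, label). Returns the counts needed for prediction."""
    class_docs = Counter()              # number of documents per class
    word_counts = defaultdict(Counter)  # word frequencies per class
    vocab = set()
    for text, label in docs:
        class_docs[label] += 1
        for word in text.lower().split():
            word_counts[label][word] += 1
            vocab.add(word)
    return class_docs, word_counts, vocab

def predict_nb(model, text):
    """Return the label with the highest log posterior for the given text."""
    class_docs, word_counts, vocab = model
    total_docs = sum(class_docs.values())
    best_label, best_score = None, -math.inf
    for label in class_docs:
        # log prior + sum of log likelihoods with add-one (Laplace) smoothing
        score = math.log(class_docs[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for word in text.lower().split():
            if word in vocab:  # words never seen in training are ignored
                score += math.log(
                    (word_counts[label][word] + 1) / (total_words + len(vocab))
                )
        if score > best_score:
            best_label, best_score = label, score
    return best_label

# Invented toy snippets; not data from the study.
docs = [
    ("brca1 mutation mammography tumor", "breast"),
    ("mammography screening brca1 carriers", "breast"),
    ("psa screening prostatectomy tumor", "prostate"),
    ("androgen therapy psa levels", "prostate"),
    ("smoking nodule egfr tumor", "lung"),
    ("egfr inhibitor smoking history", "lung"),
]
model = train_nb(docs)
print(predict_nb(model, "elevated psa after androgen therapy"))  # prostate
```

    A distributed framework would compute the same per-class counts in parallel over partitions of the corpus; the toy version keeps everything in memory.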

  6. Generating Interpretable Fuzzy Systems for Classification Problems

    Directory of Open Access Journals (Sweden)

    Juan A. Contreras-Montes

    2009-12-01

    Full Text Available This paper presents a new method for generating interpretable fuzzy systems from training data for classification problems. The antecedent partition uses triangular sets with 0.5 interpolations, avoiding the complex overlapping that occurs in other methods. Singleton consequents are generated from the projection of the modal values of each triangular membership function into the output space. The least squares method is used to adjust the consequents. The proposed method achieves a higher average classification accuracy than existing methods, with fewer rules and parameters and without sacrificing the interpretability of the fuzzy system. The approach is applied to two classical classification problems: the Iris data and the Wisconsin Breast Cancer classification problem.
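
    A minimal sketch of the kind of partition described above, under stated assumptions: "0.5 interpolations" is read here as adjacent triangular sets crossing at membership 0.5, and the output is taken as a weighted average of singleton consequents (zero-order Sugeno style). The least squares adjustment of the consequents is omitted, and the names `triangular_partition` and `infer` are hypothetical, not the paper's.

```python
def triangular_partition(centers):
    """Build triangular membership functions whose neighbours cross at 0.5.

    Each function peaks (value 1) at its own centre (modal value) and falls
    linearly to 0 at the adjacent centres, so the memberships sum to 1
    everywhere between the first and last centre.
    """
    def make(i):
        c = centers[i]
        left = centers[i - 1] if i > 0 else None
        right = centers[i + 1] if i < len(centers) - 1 else None

        def mu(x):
            if x == c:
                return 1.0
            if x < c:
                # The leftmost set saturates at 1 below its centre.
                if left is None:
                    return 1.0
                return max(0.0, (x - left) / (c - left))
            # The rightmost set saturates at 1 above its centre.
            if right is None:
                return 1.0
            return max(0.0, (right - x) / (right - c))
        return mu
    return [make(i) for i in range(len(centers))]

def infer(x, memberships, singletons):
    """Weighted average of the singleton consequents."""
    weights = [mu(x) for mu in memberships]
    return sum(w * s for w, s in zip(weights, singletons)) / sum(weights)

# Hypothetical one-input example: three sets centred at 0, 5 and 10,
# with singleton consequents standing for class indices 0, 1 and 2.
sets = triangular_partition([0.0, 5.0, 10.0])
singletons = [0.0, 1.0, 2.0]
print(infer(2.5, sets, singletons))  # 0.5: halfway between the first two sets
```

    Because at most two sets are active for any input, each rule's contribution stays easy to read, which is the interpretability argument the abstract makes.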

  7. Multivariate Approaches to Classification in Extragalactic Astronomy

    Directory of Open Access Journals (Sweden)

    Didier Fraix-Burnet

    2015-08-01

    Full Text Available Clustering objects into synthetic groups is a natural activity of any science. Astrophysics is not an exception and is now facing a deluge of data. For galaxies, the one-century old Hubble classification and the Hubble tuning fork are still largely in use, together with numerous mono- or bivariate classifications most often made by eye. However, a classification must be driven by the data, and sophisticated multivariate statistical tools are used more and more often. In this paper we review these different approaches in order to situate them in the general context of unsupervised and supervised learning. We insist on the astrophysical outcomes of these studies to show that multivariate analyses provide an obvious path toward a renewal of our classification of galaxies and are invaluable tools to investigate the physics and evolution of galaxies.

  8. Enterprise Potential: Essence, Classification and Types

    Directory of Open Access Journals (Sweden)

    Turylo Anatolii M.

    2014-02-01

    Full Text Available The article considers existing approaches to the classification of enterprise potential as an economic notion. It offers its own view of the classification of enterprise potential, one that reflects modern tendencies in enterprise development. Classification allows a wider description and assessment of enterprise potential and the identification of its most significant characteristics. The classification of enterprise potential is developed by different criteria: by functions, by resource support, by ability to adapt, by the level of detection, by the spectrum of possibilities taken into account, by the period of coverage of possibilities and by the level of use. Analysis of the components of enterprise potential allows a complete and trustworthy assessment of the state of an enterprise. The adaptation potential of an enterprise is based on the principles of systematicity and dynamism; it characterises the enterprise's ability to adjust to external and internal economic conditions.

  9. KNN BASED CLASSIFICATION OF DIGITAL MODULATED SIGNALS

    Directory of Open Access Journals (Sweden)

    Sajjad Ahmed Ghauri

    2016-11-01

    Full Text Available Demodulation without knowledge of the modulation scheme requires Automatic Modulation Classification (AMC). When the receiver has limited information about the received signal, AMC becomes an essential process. AMC has an important place in many civil and military fields, such as modern electronic warfare, interfering source recognition, frequency management and link adaptation. In this paper we explore the use of K-nearest neighbor (KNN) for modulation classification with different distance measurement methods. Five modulation schemes are used for classification: Binary Phase Shift Keying (BPSK), Quadrature Phase Shift Keying (QPSK), Quadrature Amplitude Modulation (QAM), 16-QAM and 64-QAM. Higher order cumulants (HOC) are used as the input feature set to the classifier. Simulation results show that the proposed classification method provides better results for the considered modulation formats.
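
    To make the idea of KNN with interchangeable distance measures concrete, here is a small plain-Python sketch. The two-dimensional feature vectors are made-up stand-ins for cumulant features, not values from the paper, and the abstract does not say which distance measures were compared; Euclidean, Manhattan and Chebyshev are used here only as common examples.

```python
import math
from collections import Counter

def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))

def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))

def knn_predict(train, query, k=3, distance=euclidean):
    """Majority vote among the k training points closest to query.

    train is a list of (feature_vector, label) pairs.
    """
    nearest = sorted(train, key=lambda item: distance(item[0], query))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]

# Made-up feature vectors standing in for higher order cumulant features.
train = [
    ((-2.0, -2.0), "BPSK"), ((-1.9, -2.1), "BPSK"), ((-2.1, -1.8), "BPSK"),
    ((-1.0, -1.0), "QPSK"), ((-1.1, -0.9), "QPSK"), ((-0.9, -1.05), "QPSK"),
    ((-0.68, -0.68), "16-QAM"), ((-0.7, -0.66), "16-QAM"), ((-0.66, -0.7), "16-QAM"),
]
query = (-1.95, -2.0)
for metric in (euclidean, manhattan, chebyshev):
    print(metric.__name__, knn_predict(train, query, k=3, distance=metric))
```

    Passing the distance as a function keeps the classifier itself unchanged while the metric is varied, which is the comparison the abstract describes.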

  10. SparkText: Biomedical Text Mining on Big Data Framework.

    Science.gov (United States)

    Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M

    Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  11. SparkText: Biomedical Text Mining on Big Data Framework

    Science.gov (United States)

    He, Karen Y.; Wang, Kai

    2016-01-01

    Background Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. Results In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. Conclusions This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research. PMID:27685652

  12. Variación cultural, técnicas y procedimientos estilísticos a propósito de las autotraducciones al castellano de Carme Riera / Cultural variation, techniques and stylistic procedures used by Carme Riera to Spanish self-translation

    Directory of Open Access Journals (Sweden)

    Luisa Cotoner Cerdó

    2011-09-01

    Full Text Available Summary: The purpose of this article is to survey the techniques and stylistic procedures used by the writer Carme Riera to transfer her own work from the original Catalan to Spanish. The process of seeking to create the same aesthetic impact on a new Spanish readership has the effect of liberating the author, enabling her to modify, adapt, vary, enlarge or omit passages of the original text. Riera considers translation only as an exercise in recreation. As a product of this perspective, her recreations are not only successful in captivating a Spanish readership but also reflect one of the essential characteristics of her work: a multifaceted vision of reality, since her self-translations are a close parallel to the multiplicity of world views inherent in the diversity of languages. Her procedures may at times be questionable, but they lead to a new and enriching final standpoint.

  13. Text Mining Applications and Theory

    CERN Document Server

    Berry, Michael W

    2010-01-01

    Text Mining: Applications and Theory presents the state-of-the-art algorithms for text mining from both the academic and industrial perspectives. The contributors span several countries and scientific domains: universities, industrial corporations, and government laboratories, and demonstrate the use of techniques from machine learning, knowledge discovery, natural language processing and information retrieval to design computational models for automated text analysis and mining. This volume demonstrates how advancements in the fields of applied mathematics, computer science, machine learning

  14. Hermeneutic reading of classic texts.

    Science.gov (United States)

    Koskinen, Camilla A-L; Lindström, Unni Å

    2013-09-01

    The purpose of this article is to broaden the understanding of the hermeneutic reading of classic texts. The aim is to show how the choice of a specific scientific tradition in conjunction with a methodological approach creates the foundation that clarifies the actual realization of the reading. This hermeneutic reading of classic texts is inspired by Gadamer's notion that it is the researcher's own research tradition and a clearly formulated theoretical fundamental order that shape the researcher's attitude towards texts and create the starting point that guides all reading, uncovering and interpretation. The researcher's ethical position originates in a will to openness towards what is different in the text and which constantly sets the researcher's preunderstanding and research tradition in movement. It is the researcher's attitude towards the text that allows the text to address, touch and arouse wonder. Through a flexible, lingering and repeated reading of classic texts, what is different emerges with a timeless value. The reading of classic texts is an act that may rediscover and create understanding for essential dimensions and of human beings' reality on a deeper level. The hermeneutic reading of classic texts thus brings to light constantly new possibilities of uncovering for a new envisioning and interpretation for a new understanding of the essential concepts and phenomena within caring science. © 2012 The Authors Scandinavian Journal of Caring Sciences © 2012 Nordic College of Caring Science.

  15. [Classification of cardiomyopathy].

    Science.gov (United States)

    Asakura, Masanori; Kitakaze, Masafumi

    2014-01-01

    Cardiomyopathy is a group of cardiovascular diseases with poor prognosis. Some patients with dilated cardiomyopathy need heart transplantation because of severe heart failure. Some patients with hypertrophic cardiomyopathy die unexpectedly from malignant ventricular arrhythmias. The various phenotypes of cardiomyopathy reflect the heterogeneity of this group of diseases. Classification of cardiomyopathies is important and indispensable in clinical practice. However, no classification has been firmly established, because the causes of cardiomyopathies have not been fully elucidated. The definition and classification offered by the WHO/ISFC task force in 1995 are commonly used. Recently, several new definitions and classifications of the cardiomyopathies have been published by the American Heart Association, the European Society of Cardiology and the Japanese Circulation Society.

  16. Update on diabetes classification.

    Science.gov (United States)

    Thomas, Celeste C; Philipson, Louis H

    2015-01-01

    This article highlights the difficulties in creating a definitive classification of diabetes mellitus in the absence of a complete understanding of the pathogenesis of the major forms. This brief review shows the evolving nature of the classification of diabetes mellitus. No classification scheme is ideal, and all have some overlap and inconsistencies. The only form of diabetes that can be accurately diagnosed by DNA sequencing, monogenic diabetes, remains undiagnosed in more than 90% of the individuals who have diabetes caused by one of the known gene mutations. The point of classification, or taxonomy, of disease should be to give insight into both pathogenesis and treatment. It remains a source of frustration that all classification schemes for diabetes mellitus continue to fall short of this goal. Copyright © 2015 Elsevier Inc. All rights reserved.

  17. Classification in Medical Imaging

    DEFF Research Database (Denmark)

    Chen, Chen

    Classification is extensively used in the context of medical image analysis for the purpose of diagnosis or prognosis. In order to classify image content correctly, one needs to extract efficient features with discriminative properties and build classifiers based on these features. In addition, a good metric is required to measure distance or similarity between feature points so that the classification becomes feasible. Furthermore, in order to build a successful classifier, one needs to deeply understand how classifiers work. This thesis focuses on these three aspects of classification ... detection in a cardiovascular disease study. The third focus is to deepen the understanding of classification mechanism by visualizing the knowledge learned by a classifier. More specifically, to build the most typical patterns recognized by the Fisher's linear discriminant rule with applications ...

  18. Pitch Based Sound Classification

    DEFF Research Database (Denmark)

    Nielsen, Andreas Brinch; Hansen, Lars Kai; Kjems, U

    2006-01-01

    A sound classification model is presented that can classify signals into music, noise and speech. The model extracts the pitch of the signal using the harmonic product spectrum. Based on the pitch estimate and a pitch error measure, features are created and used in a probabilistic model with soft-max output function. Both linear and quadratic inputs are used. The model is trained on 2 hours of sound and tested on publicly available data. A test classification error below 0.05 with 1 s classification windows is achieved. Furthermore, it is shown that linear input performs as well as a quadratic, and that even though classification gets marginally better, not much is achieved by increasing the window size beyond 1 s.
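
    The harmonic product spectrum step mentioned in this abstract can be sketched in a few lines. This is a simplified illustration, not the authors' implementation: it uses a naive DFT instead of an FFT, a synthetic tone instead of real audio, and the names `magnitude_spectrum` and `hps_pitch` are hypothetical.

```python
import cmath
import math

def magnitude_spectrum(signal):
    """Magnitudes of the first half of a naive O(N^2) DFT."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal)))
        for k in range(n // 2)
    ]

def hps_pitch(signal, sample_rate, harmonics=3):
    """Estimate pitch (Hz) as the peak of the harmonic product spectrum.

    For each candidate bin k, multiply the magnitudes at k, 2k, ..., so that
    only a bin whose harmonics are all present yields a large product.
    """
    spectrum = magnitude_spectrum(signal)
    limit = len(spectrum) // harmonics
    best_bin, best_value = 1, 0.0
    for k in range(1, limit):  # skip the DC bin
        product = 1.0
        for h in range(1, harmonics + 1):
            product *= spectrum[k * h]
        if product > best_value:
            best_bin, best_value = k, product
    return best_bin * sample_rate / len(signal)

# Synthetic tone: 50 Hz fundamental plus two weaker harmonics, 0.5 s at 1 kHz.
sample_rate, n = 1000, 500
tone = [
    sum(math.sin(2 * math.pi * 50 * h * t / sample_rate) / h for h in (1, 2, 3))
    for t in range(n)
]
print(hps_pitch(tone, sample_rate))  # 50.0
```

    In a classifier such as the one described, the pitch estimate and a measure of its reliability over a window would then serve as input features; that stage is not shown here.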

  19. Learning Apache Mahout classification

    CERN Document Server

    Gupta, Ashish

    2015-01-01

    If you are a data scientist who has some experience with the Hadoop ecosystem and machine learning methods and want to try out classification on large datasets using Mahout, this book is ideal for you. Knowledge of Java is essential.

  20. Hand eczema classification

    DEFF Research Database (Denmark)

    Diepgen, T L; Andersen, Klaus Ejner; Brandao, F M

    2008-01-01

    Summary Background Hand eczema is a long-lasting disease with a high prevalence in the background population. The disease has severe, negative effects on quality of life and sometimes on social status. Epidemiological studies have identified risk factors for onset and prognosis, but treatment of the disease is rarely evidence based, and a classification system for different subdiagnoses of hand eczema is not agreed upon. Randomized controlled trials investigating the treatment of hand eczema are called for. For this, as well as for clinical purposes, a generally accepted classification system ... A classification system for hand eczema is proposed. Conclusions It is suggested that this classification be used in clinical work and in clinical trials.

  1. Texte et contre-texte en situation de diglossie

    OpenAIRE

    Carpanin Marimoutou, Jean-Claude

    2015-01-01

    The text in a diglossic situation is part of an unresolved, conflictual dialogic relationship that produces the counter-text and that the counter-text reproduces in turn, shifting not the conflict itself but the poles of the conflict. An overview of Réunionese literature is enough to bring this mirror game to light. A study of the prefaces shows the producers' awareness of the stakes hidden in the battle of texts, and how the one posited as Other seems to produce only a ...

  2. Zum Uebersetzen fachlicher Texte (On the Translation of Technical Texts)

    Science.gov (United States)

    Friederich, Wolf

    1975-01-01

    Reviews a 1974 East German publication on translation of scientific literature from Russian to German. Considers terminology, different standard levels of translation in East Germany, and other matters related to translation. (Text is in German.) (DH)

  3. Latent classification models

    DEFF Research Database (Denmark)

    Langseth, Helge; Nielsen, Thomas Dyhre

    2005-01-01

    parametric family of distributions. In this paper we propose a new set of models for classification in continuous domains, termed latent classification models. The latent classification model can roughly be seen as combining the naive Bayes (NB) model with a mixture of factor analyzers, thereby relaxing the assumptions ... classification model, and we demonstrate empirically that the accuracy of the proposed model is significantly higher than the accuracy of other probabilistic classifiers.

  4. Snake classification from images

    OpenAIRE

    James, Alex.

    2017-01-01

    Incorrect snake identification from observable visual traits is a major cause of death resulting from snake bites. So far, no automatic classification method has been proposed to distinguish snakes by deciphering the taxonomic features of the two major snake families, i.e. Elapidae and Viperidae. We present a parallel processed inter-feature product similarity fusion based automatic classification of Spectacled Cobra, Russell's Viper, King Cobra, Common Krait, Saw Scaled Viper, ...

  5. English Metafunction Analysis in Chemistry Text: Characterization of Scientific Text

    Directory of Open Access Journals (Sweden)

    Ahmad Amin Dalimunte, M.Hum

    2013-09-01

    Full Text Available The objectives of this research are to identify which Metafunctions are applied in a chemistry text and how they characterize a scientific text. The research was conducted by applying content analysis. The data were a twelve-paragraph chemistry text, collected by a documentary technique. The document was read and analyzed to identify the Metafunctions. The analysis followed these procedures: identifying the types of process; counting the processes; categorizing and counting the cohesion devices; classifying the types of modulation and determining modality value; and finally counting the sentences and clauses and scoring the grammatical intricacy index. The findings show that the Material process (71 of 100) is the most used, and circumstance of spatial location (26 of 56) is more dominant than the others. Modality (5) is used sparingly in order to avoid subjectivity. Impersonality is implied through limited use of reference, either pronouns (7) or demonstratives (7); conjunctions (60) are applied to develop ideas; and the total number of clauses (109) is much greater than the total number of sentences (40), which results in a high grammatical intricacy index. The Metafunctions found indicate that the chemistry text fulfils the characteristics of a scientific or academic text, which truly reflects it as natural science.
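
    The abstract reports the clause and sentence counts but not the index itself. As a worked example, and assuming the index is the usual ratio of clauses to sentences (the abstract does not state its formula), the figure can be computed as below; the function name `grammatical_intricacy` is hypothetical.

```python
def grammatical_intricacy(clauses, sentences):
    """Clauses per sentence; higher values indicate more intricate grammar.

    Assumed definition: the abstract does not give the formula it used.
    """
    if sentences <= 0:
        raise ValueError("sentences must be positive")
    return clauses / sentences

# Counts reported in the abstract: 109 clauses and 40 sentences.
print(round(grammatical_intricacy(109, 40), 3))  # 2.725
```

    Under that assumption the text averages a little under three clauses per sentence.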

  6. Comparison of Classification Algorithms and Training Sample Sizes in Urban Land Classification with Landsat Thematic Mapper Imagery

    Directory of Open Access Journals (Sweden)

    Congcong Li

    2014-01-01

    Full Text Available Although a large number of new image classification algorithms have been developed, they are rarely tested with the same classification task. In this research, with the same Landsat Thematic Mapper (TM) data set and the same classification scheme over Guangzhou City, China, we tested two unsupervised and 13 supervised classification algorithms, including a number of machine learning algorithms that became popular in remote sensing during the past 20 years. Our analysis focused primarily on the spectral information provided by the TM data. We assessed all algorithms in a per-pixel classification decision experiment and all supervised algorithms in a segment-based experiment. We found that when sufficiently representative training samples were used, most algorithms performed reasonably well. Lack of training samples led to greater classification accuracy discrepancies than classification algorithms themselves. Some algorithms were more tolerant of insufficient (less representative) training samples than others. Many algorithms improved the overall accuracy marginally with per-segment decision making.

  7. The Mapping Analysis Between the Cooperative Patent Classification System (CPC) and the International Patent Classification System (IPC)

    Directory of Open Access Journals (Sweden)

    Bai Linlin

    2017-10-01

    Full Text Available [Purpose/significance] A detailed comparison of the mapping between the Cooperative Patent Classification System (CPC) and the International Patent Classification System (IPC) is intended to provide a reference for the use of CPC and IPC in patent classification. [Method/process] This paper makes a detailed analysis of the number of mappings between CPC and IPC at the class, subclass, group and subgroup levels in their common A-H sections. [Result/conclusion] The classification principles of CPC and IPC are basically consistent, and CPC is more finely subdivided than IPC. The “2000 series” (indexing) and the Y section for new technology are added, showing CPC's high extensibility. We believe that, as a classification system with strong compatibility and a high degree of subdivision, CPC will become an efficient tool for improving retrieval efficiency and optimizing review quality.

  8. Strategies for Translating Vocative Texts

    Directory of Open Access Journals (Sweden)

    Olga COJOCARU

    2014-12-01

    Full Text Available The paper deals with the linguistic and cultural elements of vocative texts and the techniques used in translating them by giving some examples of texts that are typically vocative (i.e. advertisements and instructions for use). Semantic and communicative strategies are popular in translation studies and each of them has its own advantages and disadvantages in translating vocative texts. The advantage of semantic translation is that it takes more account of the aesthetic value of the SL text, while communicative translation attempts to render the exact contextual meaning of the original text in such a way that both content and language are readily acceptable and comprehensible to the readership. Focus is laid on the strategies used in translating vocative texts, strategies that highlight and introduce a cultural context to the target audience in order to achieve their overall purpose, that is, to sell or to persuade the reader to behave in a certain way. To that end, a number of advertisements from the cosmetics industry and the field of electronic gadgets were selected for analysis. The aim is to gather insights into vocative text translation and to create new perspectives on this field of research, now considered a process of innovation and diversion, especially in areas as important as economy and marketing.

  9. Text Genres in Information Organization

    Science.gov (United States)

    Nahotko, Marek

    2016-01-01

    Introduction: Text genres used by so-called information organizers in the processes of information organization in information systems were explored in this research. Method: The research employed text genre socio-functional analysis. Five genre groups in information organization were distinguished. Every genre group used in information…

  10. Intercultural Rhetoric Research: Beyond Texts

    Science.gov (United States)

    Connor, Ulla

    2004-01-01

    This paper proposes a set of new methods for intercultural rhetoric research that is context-sensitive and, in many instances, goes beyond mere text analysis. It considers changes in the field as intercultural rhetoric has moved from the EAP study of student essays to the study of writing in many disciplines and genres. New developments in text,…

  11. The Case for Multiple Texts

    Science.gov (United States)

    Cummins, Sunday

    2017-01-01

    Reading just one text on any topic, Cummins argues, isn't enough if we expect students to learn at deep levels about the topic, synthesize various sources of information, and gain the knowledge they need to write and speak seriously about the topic. Reading a second or third text expands a reader's knowledge on any topic or story--and the why…

  12. Understanding and Teaching Complex Texts

    Science.gov (United States)

    Fisher, Douglas; Frey, Nancy

    2014-01-01

    Teachers in today's classrooms struggle every day to design instructional interventions that would build students' reading skills and strategies in order to ensure their comprehension of complex texts. Text complexity can be determined in both qualitative and quantitative ways. In this article, the authors describe various innovative…

  13. LOTUS: Linked open text unleashed

    NARCIS (Netherlands)

    Ilievski, F.; Beek, Wouter; Van Erp, Marieke; Rietveld, Laurens; Schlobach, Stefan

    2015-01-01

    It is difficult to find resources on the Semantic Web today, in particular if one wants to search for resources based on natural language keywords and across multiple datasets. In this paper, we present LOTUS: Linked Open Text UnleaShed, a full-text lookup index over a huge Linked Open Data

  14. A comparative performance evaluation of neural network based approach for sentiment classification of online reviews

    OpenAIRE

    Vinodhini, G.; Chandrasekaran, R.M.

    2016-01-01

    The aim of sentiment classification is to efficiently identify the emotions expressed in the form of text messages. Machine learning methods for sentiment classification have been extensively studied, due to their predominant classification performance. Recent studies suggest that ensemble based machine learning methods provide better performance in classification. Artificial neural networks (ANNs) are rarely being investigated in the literature of sentiment classification. This paper compare...

  15. Text mining for systems biology.

    Science.gov (United States)

    Fluck, Juliane; Hofmann-Apitius, Martin

    2014-02-01

    Scientific communication in biomedicine is, by and large, still text based. Text mining technologies for the automated extraction of useful biomedical information from unstructured text that can be directly used for systems biology modelling have been substantially improved over the past few years. In this review, we underline the importance of named entity recognition and relationship extraction as fundamental approaches that are relevant to systems biology. Furthermore, we emphasize the role of publicly organized scientific benchmarking challenges that reflect the current status of text-mining technology and are important in moving the entire field forward. Given further interdisciplinary development of systems biology-orientated ontologies and training corpora, we expect a steadily increasing impact of text-mining technology on systems biology in the future. Copyright © 2013 Elsevier Ltd. All rights reserved.

  16. Proverbs 30:18-19 in the Light of Ancient Mesopotamian Cuneiform Texts

    Directory of Open Access Journals (Sweden)

    Böck, Barbara

    2009-12-01

    Full Text Available The meaning of Proverbs 30:18-19 has long been disputed. Most scholars interpret the Biblical couplets textually on stylistic features only; an explanation of the contextual association between the four motifs mentioned (eagle, serpent, boat, man and woman) has not yet been undertaken. The present paper aims at shedding light on the motivation for this association, taking into consideration ancient Near Eastern cuneiform compositions for the first time. It is further suggested that Proverbs 30:18-19 derived originally from a riddle that had its setting in a wedding ceremony.

  17. Lecture-production ou lecture-consommation? (Le texte litteraire dans une classe de francais langue etrangere) (Productive Reading or Reading for Consumption? [The Literary Text in a French Foreign Language Class]).

    Science.gov (United States)

    Coppolani, Rene

    1980-01-01

    Using literature to teach French is an unpopular method but is seen here as a potentially valuable one. A method of "productive reading" integrates linguistic, symbolic, cultural, thematic, and stylistic elements in language learning. Exercises and examples are given. (MSE)

  18. A New Classification Approach Based on Multiple Classification Rules

    OpenAIRE

    Zhongmei Zhou

    2014-01-01

    A good classifier can correctly predict new data for which the class label is unknown, so it is important to construct a high accuracy classifier. Hence, classification techniques are very useful in ubiquitous computing. Associative classification achieves higher classification accuracy than some traditional rule-based classification approaches. However, the approach also has two major deficiencies. First, it generates a very large number of association classification rules, especially when t...

  19. Mapping of the Universe of Knowledge in Different Classification Schemes

    Directory of Open Access Journals (Sweden)

    M. P. Satija

    2017-06-01

    Full Text Available Given the variety of approaches to mapping the universe of knowledge that have been presented and discussed in the literature, the purpose of this paper is to systematize their main principles and their applications in the major general modern library classification schemes. We conducted an analysis of the literature on classification and the main classification systems, namely Dewey/Universal Decimal Classification, Cutter’s Expansive Classification, Subject Classification of J.D. Brown, Colon Classification, Library of Congress Classification, Bibliographic Classification, Rider’s International Classification, Bibliothecal Bibliographic Klassification (BBK), and Broad System of Ordering (BSO). We conclude that the arrangement of the main classes can be done following four principles that are not mutually exclusive: ideological principle, social purpose principle, scientific order, and division by discipline. The paper provides examples and analysis of each system. We also conclude that as knowledge is ever-changing, classifications also change and present a different structure of knowledge depending upon the society and time of their design.

  20. Linguistic Dating of Biblical Texts

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2003-01-01

    For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed the chronology of the texts established by other means: the Hebrew of Genesis-2 Kings was judged to be early and that of Esther, Daniel, Ezra, Nehemiah, and Chronicles to be late. In the current debate where revisionists have questioned the traditional dating, linguistic arguments in the dating of texts have come more into focus. The study critically examines some linguistic arguments adduced to support the traditional position, and reviewing the arguments it points to weaknesses in the linguistic dating of EBH texts to pre-exilic times. When viewing the linguistic evidence in isolation it will be clear...

  1. Text structures in medical text processing: empirical evidence and a text understanding prototype.

    Science.gov (United States)

    Hahn, U.; Romacker, M.

    1997-01-01

    We consider the role of textual structures in medical texts. In particular, we examine the impact that failing to recognize text phenomena has on the validity of medical knowledge bases fed by a natural language understanding front-end. First, we review the results from an empirical study on a sample of medical texts, considering various forms of local coherence phenomena (anaphora and textual ellipses). We then discuss the representation bias in the text knowledge base that is likely to occur when these phenomena are not dealt with--mainly the emergence of referentially incoherent and invalid representations. We then turn to a medical text understanding system designed to account for local text coherence. PMID:9357739

  2. Biomarker Identification Using Text Mining

    Directory of Open Access Journals (Sweden)

    Hui Li

    2012-01-01

    Full Text Available Identifying molecular biomarkers has become one of the important tasks for scientists to assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we propose a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we use a finite state machine to identify the biomarkers. Our text-mining method provides a highly reliable approach to discovering biomarkers in the PubMed database.
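
    The dictionary-plus-finite-state-machine idea in this abstract can be illustrated with a minimal sketch. The abstract does not describe the authors' dictionary or automaton, so everything here is an assumption for illustration: the term list, the tokenizer, the function names `build_trie` and `find_terms`, and the longest-match policy are all hypothetical. A token trie serves as the state machine, with each node acting as a state and each token as a transition.

```python
import re

END = "$end"  # marker key; the tokenizer below can never produce this token


def build_trie(terms):
    """Compile dictionary terms into a token trie (the automaton's states)."""
    root = {}
    for term in terms:
        node = root
        for token in term.lower().split():
            node = node.setdefault(token, {})
        node[END] = term  # accepting state remembers the original term
    return root


def find_terms(text, trie):
    """Scan the text and return dictionary terms found, longest match first."""
    tokens = re.findall(r"[a-z0-9\-]+", text.lower())
    hits = []
    i = 0
    while i < len(tokens):
        node, j, last = trie, i, None
        # Follow transitions as long as the next token is accepted.
        while j < len(tokens) and tokens[j] in node:
            node = node[tokens[j]]
            j += 1
            if END in node:
                last = (node[END], j)
        if last is not None:
            hits.append(last[0])
            i = last[1]  # resume after the matched term
        else:
            i += 1
    return hits


# Illustrative dictionary only; not the dictionary used in the paper.
BIOMARKER_TERMS = ["PSA", "prostate specific antigen", "HER2", "CA 125"]
TRIE = build_trie(BIOMARKER_TERMS)
```

    Under these assumptions, `find_terms("Elevated prostate specific antigen and CA 125 were reported; HER2 was negative.", TRIE)` yields the three dictionary terms in order of appearance. A real system would also need to handle synonyms, abbreviations and ambiguous gene symbols, which this sketch ignores.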

  3. Stemming Malay Text and Its Application in Automatic Text Categorization

    Science.gov (United States)

    Yasukawa, Michiko; Lim, Hui Tian; Yokoo, Hidetoshi

    In the Malay language, there are no conjugations or declensions, and affixes have important grammatical functions. In Malay, the same word may function as a noun, an adjective, an adverb, or a verb, depending on its position in the sentence. Although simple root words are used extensively in informal conversation, it is essential to use precise words in formal speech or written texts. In Malay, derivative words are used to make sentences clear. Derivation is achieved mainly by the use of affixes. There are approximately a hundred possible derivative forms of a root word in the written language of educated Malay speakers. Therefore, the composition of Malay words may be complicated. Although there are several types of stemming algorithms available for text processing in English and some other languages, they cannot be used to overcome the difficulties in Malay word stemming. Stemming is the process of reducing various words to their root forms in order to improve the effectiveness of text processing in information systems. It is essential to avoid both over-stemming and under-stemming errors. We have developed a new Malay stemmer (stemming algorithm) for removing inflectional and derivational affixes. Our stemmer uses a set of affix rules and two types of dictionaries: a root-word dictionary and a derivative-word dictionary. The set of rules is aimed at reducing the occurrence of under-stemming errors, while the dictionaries are believed to reduce the occurrence of over-stemming errors. We performed an experiment to evaluate the application of our stemmer in text mining software. The text data used in the experiment were actual web pages collected from the World Wide Web, chosen to demonstrate the effectiveness of our Malay stemming algorithm. The experimental results showed that our stemmer can effectively increase the precision of the extracted Boolean expressions for text categorization.
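
    The interplay between affix rules and the two dictionaries can be sketched as follows. This is not the authors' stemmer: the rule set, the tiny word lists and the function name `stem` are hypothetical stand-ins chosen only to show the control flow, in which dictionary lookups guard against over-stemming and affix rules reduce under-stemming.

```python
# Toy resources, assumed for illustration; a real stemmer needs full dictionaries.
ROOT_WORDS = {"baca", "makan", "minum", "ajar"}
DERIVATIVE_WORDS = {"belajar": "ajar", "pelajar": "ajar"}  # irregular forms

PREFIXES = ("meng", "mem", "men", "me", "peng", "pem", "pen", "pe",
            "ber", "ter", "di")
SUFFIXES = ("kan", "an", "i")


def stem(word):
    """Return the root of a word, or the word itself if no root is found."""
    word = word.lower()
    # 1. A known root is never stripped further (prevents over-stemming,
    #    e.g. "makan" must not lose its final "kan").
    if word in ROOT_WORDS:
        return word
    # 2. Known derivatives map directly to their root.
    if word in DERIVATIVE_WORDS:
        return DERIVATIVE_WORDS[word]
    # 3. Try every prefix/suffix combination allowed by the rules and keep
    #    only candidates confirmed by the root-word dictionary.
    candidates = []
    for prefix in ("",) + PREFIXES:
        if not word.startswith(prefix):
            continue
        rest = word[len(prefix):]
        for suffix in ("",) + SUFFIXES:
            if suffix and not rest.endswith(suffix):
                continue
            candidate = rest[:len(rest) - len(suffix)] if suffix else rest
            if candidate in ROOT_WORDS:
                candidates.append(candidate)
    # Prefer the longest confirmed root; otherwise leave the word unchanged.
    return max(candidates, key=len) if candidates else word
```

    With these toy resources, `stem("membaca")` and `stem("makanan")` reduce to `baca` and `makan`, while an unknown word such as `komputer` is returned unchanged because no stripped form is confirmed by the root dictionary. The sketch ignores the spelling changes that some Malay prefixes trigger, which the derivative-word dictionary is only partly able to cover.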

  4. Pattern classification through fuzzy likelihood

    Directory of Open Access Journals (Sweden)

    Rosa M. Pidatella

    2015-12-01

    Full Text Available This paper introduces a novel way to compute the membership function of a fuzzy set approximating the distribution of some observed data, starting from their histogram. This membership function is in turn used to obtain an a posteriori probability through a suitable version of the Bayesian formula. The ordering imposed by an overtaking relation between fuzzy numbers translates immediately into a dominance of the a posteriori probability of one class over another for a given observed value. In this way a crisp classification is eventually obtained.
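
    A rough sketch of the pipeline described in this abstract is given below. The abstract does not specify how the membership function is derived or which version of the Bayesian formula is used, so two simplifying assumptions are made here and should not be read as the paper's method: membership is taken as the histogram normalized by its peak, and the posterior is taken as prior times membership, renormalized over classes. All function names are hypothetical.

```python
from bisect import bisect_right


def bin_index(x, edges):
    """Return the index of the histogram bin containing x, or None if outside."""
    if x < edges[0] or x > edges[-1]:
        return None
    if x == edges[-1]:
        return len(edges) - 2  # right edge belongs to the last bin
    return bisect_right(edges, x) - 1


def membership_from_histogram(samples, edges):
    """Assumed membership function: bin counts divided by the peak count."""
    counts = [0] * (len(edges) - 1)
    for x in samples:
        i = bin_index(x, edges)
        if i is not None:
            counts[i] += 1
    peak = max(counts)
    return [c / peak if peak else 0.0 for c in counts]


def posterior(x, memberships, priors, edges):
    """Assumed Bayes-like rule: prior * membership, renormalized over classes."""
    i = bin_index(x, edges)
    scores = {c: (priors[c] * mu[i] if i is not None else 0.0)
              for c, mu in memberships.items()}
    total = sum(scores.values())
    if total == 0:
        return {c: 0.0 for c in scores}
    return {c: s / total for c, s in scores.items()}


def classify(x, memberships, priors, edges):
    """Crisp decision: the class whose posterior dominates at x."""
    post = posterior(x, memberships, priors, edges)
    return max(post, key=post.get)
```

    The final `classify` step stands in for the dominance relation mentioned in the abstract by simply picking the class with the largest posterior at the observed value; the overtaking relation between fuzzy numbers used in the paper is not reproduced here.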

  5. Automated Classification of Power Signals

    Science.gov (United States)

    2008-06-01

    the classification code of the n th event. Boolean EVC [n] The ‘Event file created?’ Boolean is set to 1 if the event has had an event file created...indicate the type of event. int EVC [MAX_EVENTS]; // Boolean to indicate whether an event has had an .evt file created int local_det=0...i; // CLEAN THE EVENT TEXT DATA. for (i=0;i<MAX_EVENTS;i++) { Class[i]="Empty." Class_ID[i]=0; EVC [i]=FALSE; event_class_status[i

  6. Educational portals: Classification and components

    Directory of Open Access Journals (Sweden)

    Rosana López Carreño

    2008-02-01

    Full Text Available The growing number of educational portals in operation calls for a definition and classification by which to identify them, as well as an enumeration of their usual elements, so that a model of the educational portal can be established. This paper reviews the main institutional education portals of the Regional Governments in order to establish their level of development and identify possible trends. Furthermore, it notes the importance of implementing value-added services, such as blogs, wikis and webquests, to support and improve educational tasks.

  7. An Experimental Text-Commentary

    Science.gov (United States)

    O'Brien, Joan

    1976-01-01

    An experimental text-commentary of selected passages from Sophocles' "Antigone" is described. The commentary is intended for students seeking more than a conventional translation who do not know enough Greek to use a standard commentary. (RM)

  8. Anomaly Detection with Text Mining

    Data.gov (United States)

    National Aeronautics and Space Administration — Many existing complex space systems have a significant amount of historical maintenance and problem data bases that are stored in unstructured text forms. The...

  9. The classification of easement

    Directory of Open Access Journals (Sweden)

    Popov Danica D.

    2015-01-01

    Full Text Available An easement is a right enjoyed by the owner of land over the land of another, such as rights of way, rights of light, rights of support, and rights to a flow of air or water. The dominant tenement is the land owned by the possessor of the easement, and the servient tenement is the land over which the right is enjoyed. An easement must exist for the accommodation and better enjoyment of the land to which it is annexed; otherwise it may amount to a mere licence. An easement benefits and binds the land itself and therefore continues despite any change of ownership of either the dominant or the servient tenement, although it will be extinguished if the two tenements come into common ownership. An easement can only be enjoyed in respect of land, which means there must be two parcels of land: a dominant tenement, to which the benefit of the easement attaches, and a servient tenement, which bears the burden of the easement. A positive easement consists of a right to do something on the land of another; a negative easement restricts the use that the owner of the servient tenement may make of his land. An easement may be on land or on a house built on land. A further classification distinguishes easements on the ground from easements under the ground. An easement must be exercised in accordance with the principle of restriction, meaning that it should burden the servient tenement as little as possible. When there is doubt about the extent of an easement, the interpretation that is less burdensome for the servient tenement is to be adopted. New needs of the dominant estate do not result in the expansion of the servitude. The article compares the Draft Code of property and other real estate with the Draft of Civil Code of Serbia.

  10. Classification and clinical assessment

    Directory of Open Access Journals (Sweden)

    F. Cantini

    2012-06-01

    Full Text Available There are at least nine classification criteria for psoriatic arthritis (PsA) that have been proposed and used in clinical studies. With the exception of the ESSG and Bennett rules, all of the other criteria sets have a good performance in identifying PsA patients. As the CASPAR criteria are based on a robust study methodology, they are considered the current reference standard. However, while there seems to be no doubt that they are very good at classifying PsA patients (very high specificity), they might not be sensitive enough to diagnose patients with unknown early PsA. The vast clinical heterogeneity of PsA makes its assessment very challenging. Peripheral joint involvement is measured by 78/76 joint counts, spine involvement by the instruments used for ankylosing spondylitis (AS), dactylitis by involved digit count or by the Leeds dactylitis index, enthesitis by the number of affected entheses (several indices available) and psoriasis by the Psoriasis Area and Severity Index (PASI). Peripheral joint damage can be assessed by a modified van der Heijde-Sharp scoring system and axial damage by the methods used for AS or by the Psoriatic Arthritis Spondylitis Radiology Index (PASRI). As in other arthritides, global evaluation of disease activity and severity by patient and physician and assessment of disability and quality of life are widely used. Finally, composite indices that capture several clinical manifestations of PsA have been proposed and a new instrument, the Psoriatic ARthritis Disease Activity Score (PASDAS), is currently being developed.

  11. Individual Profiling Using Text Analysis

    Science.gov (United States)

    2016-04-15

    likelihood that it belongs to the input text, although early experiments showed that this added no benefit. Parts-of-speech In early experiments all...tweets were POS tagged as part of the pre-processing step using a Twitter specific part-of-speech tagger [8]. Various studies have identified POS tags as...AFRL-AFOSR-UK-TR-2016-0011 Individual Profiling using Text Analysis 140333 Mark Stevenson UNIVERSITY OF SHEFFIELD, DEPARTMENT OF PSYCHOLOGY Final

  12. Thinking About Religious Texts Anthropologically

    Directory of Open Access Journals (Sweden)

    Joel S. Kahn

    2016-01-01

    Full Text Available This paper addresses the conference themes by asking what contribution anthropology can make to the study of religious literature and heritage. In particular I will discuss ways in which anthropologists engage with religious texts. The paper begins with an assessment of what is probably the dominant approach to religious texts in mainstream anthropology and sociology, namely avoiding them and focussing instead on the religious ‘practices’ of ‘ordinary believers’. Arguing that this tendency to neglect the study of texts is ill-advised, the paper looks at the reasons why anthropologists need to engage with contemporary religious texts, particularly in their studies of/in the modern Muslim world. Drawing on the insights of anthropologist of religion Joel Robbins into what he called the “awkward relationship” between anthropology and theology, the paper proposes three possible ways in which anthropology might engage with religious literature. Based on a reading of three rather different modern texts on or about Islam, the strengths and weaknesses of each of the three modes of anthropological engagement are assessed and a case is made for Robbins’s third approach on the grounds that it offers a way out of the impasse in which mainstream anthropology of religion finds itself, caught as it is between the ‘emic’ and the ‘etic’, i.e. between ontologically different worlds.

  13. RESEARCH OF CLASSIFICATION FEATURES OF THE FINANCIAL CONTROL

    Directory of Open Access Journals (Sweden)

    Knarik K. Arabyan

    2013-01-01

    Full Text Available One of the major problems in the theory of financial control is the improvement of its classification features. There is no consensus on the classification of the forms and methods of financial control. This hinders the development of methodology and the investigation of other issues in the theory of financial control. In the article, the author summarizes scholars' approaches to studying the classification features of financial control.

  14. Analysing ESP Texts, but How?

    Directory of Open Access Journals (Sweden)

    Borza Natalia

    2015-03-01

    Full Text Available English as a second language (ESL) teachers instructing general English and English for specific purposes (ESP) in bilingual secondary schools face various challenges when it comes to choosing the main linguistic foci of language preparatory courses enabling non-native students to study academic subjects in English. ESL teachers who intend to analyse English-language subject textbooks written for secondary school students, with the aim of finding out what language bilingual secondary school students need in order to process academic textbooks, cannot avoid a dilemma: they must decide which way of analysing the texts in question is most appropriate. Handbooks of English applied linguistics are not immensely helpful with regard to this problem, as they tend not to recommend which major text analytical approaches are advisable to follow in a pre-college setting. The present theoretical research aims to address this lacuna. Accordingly, the purpose of this pedagogically motivated theoretical paper is to investigate two major approaches to ESP text analysis, register analysis and genre analysis, in order to find the more suitable one for exploring the language use of secondary school subject texts from the point of view of an English as a second language teacher. Comparing and contrasting the merits and limitations of the two contrasting approaches allows for a better understanding of the nature of the two different perspectives of text analysis. The study examines the goals, the scope of analysis, and the achievements of the register perspective and those of the genre approach alike. The paper also investigates and reviews in detail the starkly different methods of ESP text analysis applied by the two perspectives. Discovering text analysis from a theoretical and methodological angle supports a practical aspect of English teaching, namely making an informed choice when setting out to analyse

  15. Hydropedological insights when considering catchment classification

    Directory of Open Access Journals (Sweden)

    J. Bouma

    2011-06-01

    Full Text Available Soil classification systems are analysed to explore the potential of developing classification systems for catchments. Soil classifications are useful to create systematic order in the overwhelming quantity of different soils in the world and to extrapolate data available for a given soil type to soils elsewhere with identical classifications. This principle also applies to catchments. However, to be useful, soil classifications have to be based on permanent characteristics as formed by the soil forming factors over often very long periods of time. When defining permanent catchment characteristics, discharge data would therefore appear to be less suitable. But permanent soil characteristics do not necessarily match with characteristics and parameters needed for functional soil characterization focusing, for example, on catchment hydrology. Hydropedology has made contributions towards the required functional characterization of soils as is illustrated for three recent hydrological catchment studies. However, much still needs to be learned about the physical behaviour of anisotropic, heterogeneous soils with varying soil structures during the year and about spatial and temporal variability. The suggestion is made therefore to first focus on improving simulation of catchment hydrology, possibly incorporating hydropedological expertise, before embarking on a catchment classification effort which involves major input of time and involves the risk of distraction. In doing so, we suggest to also define other characteristics for catchment performance than the traditionally measured discharge rates. Such characteristics may well be derived from societal issues being studied, as is illustrated for the Green Water Credits program.

  16. CLASSIFICATION ACCURACY INCREASE USING MULTISENSOR DATA FUSION

    Directory of Open Access Journals (Sweden)

    A. Makarau

    2012-09-01

    Full Text Available The practical use of very high resolution visible and near-infrared (VNIR) data is still growing (IKONOS, Quickbird, GeoEye-1, etc.), but for classification purposes the number of bands is limited in comparison to full spectral imaging. These limitations may lead to the confusion of materials such as different roofs, pavements, roads, etc. and therefore may lead to wrong interpretation and use of classification products. Employment of hyperspectral data is another solution, but their low spatial resolution (compared to multispectral data) restricts their usage for many applications. Another improvement can be achieved by fusion approaches of multisensor data since this may increase the quality of scene classification. Integration of Synthetic Aperture Radar (SAR) and optical data is widely performed for automatic classification, interpretation, and change detection. In this paper we present an approach for very high resolution SAR and multispectral data fusion for automatic classification in urban areas. Single polarization TerraSAR-X (SpotLight mode) and multispectral data are integrated using the INFOFUSE framework, consisting of feature extraction (information fission), unsupervised clustering (data representation on a finite domain and dimensionality reduction), and data aggregation (Bayesian or neural network). This framework allows a relevant way of multisource data combination following consensus theory. The classification is not influenced by the limitations of dimensionality, and the calculation complexity primarily depends on the step of dimensionality reduction. Fusion of single polarization TerraSAR-X, WorldView-2 (VNIR or full set), and Digital Surface Model (DSM) data allows for different types of urban objects to be classified into predefined classes of interest with increased accuracy. The comparison to classification results of WorldView-2 multispectral data (8 spectral bands) is provided and the numerical evaluation of the method in

  17. A statistical approach to root system classification.

    Directory of Open Access Journals (Sweden)

    Gernot Bodner

    2013-08-01

    Full Text Available Plant root systems have a key role in ecology and agronomy. Despite the rapid increase in root studies, there is still no classification that allows distinguishing among distinctive characteristics within the diversity of rooting strategies. Our hypothesis is that a multivariate approach for plant functional type identification in ecology can be applied to the classification of root systems. We demonstrate that combining principal component and cluster analysis yields a meaningful classification of rooting types based on morphological traits. The classification method presented is based on a data-defined statistical procedure without a priori decision on the classifiers. Biplot inspection is used to determine key traits and to ensure stability in cluster-based grouping. The classification method is exemplified with simulated root architectures and morphological field data. Simulated root architectures showed that morphological attributes with spatial distribution parameters capture most distinctive features within root system diversity. While developmental type (tap vs. shoot-borne systems) is a strong but coarse classifier, topological traits provide the most detailed differentiation among distinctive groups. Adequacy of commonly available morphological traits for classification is supported by field data. Three rooting types emerged from measured data, distinguished by diameter/weight, density and spatial distribution respectively. Similarity of root systems within distinctive groups was the joint result of phylogenetic relation and environmental as well as human selection pressure. We concluded that the data-defined classification is appropriate for integration of knowledge obtained with different root measurement methods and at various scales. Currently root morphology is the most promising basis for classification due to widely used common measurement protocols. To capture details of root diversity efforts in architectural measurement

  18. GPU-Accelerated Text Mining

    Energy Technology Data Exchange (ETDEWEB)

    Cui, Xiaohui [ORNL]; Mueller, Frank [North Carolina State University]; Zhang, Yongpeng [ORNL]; Potok, Thomas E [ORNL]

    2009-01-01

    Hardware accelerators promise improved performance in many problem domains, but it is not clear which accelerators suit which domains. Because there is little room in general-purpose processor design to significantly increase processor frequency, developers are instead resorting to multi-core chips that duplicate conventional computing capabilities on a single die. Accelerators, by contrast, offer more radical designs with a much higher level of parallelism and novel programming environments. The present work assesses the viability of text mining on CUDA. Text mining has become prominent as an effective means of indexing the Internet, but its applications extend beyond this scope to providing document similarity metrics, the subject of this work. We have developed and optimized text search algorithms for GPUs to exploit their potential for massive data processing. We discuss the algorithmic challenges of parallelizing text search problems on GPUs and demonstrate the potential of these devices in experiments by reporting significant speedups. Our study may be one of the first to assess more complex text search problems for suitability for GPU devices, and it may also be one of the first to exploit and report on the use of atomic instructions that have recently become available in NVIDIA devices.
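
    The abstract does not say which document similarity metric was used. As a minimal CPU-side sketch of the kind of computation that might be parallelized, the following assumes cosine similarity over term-frequency vectors; the function names and the tokenization rule are illustrative assumptions, not the authors' implementation.

```python
import math
import re
from collections import Counter


def term_frequencies(text):
    """Lowercase the text and count alphabetic word tokens (assumed tokenizer)."""
    return Counter(re.findall(r"[a-z]+", text.lower()))


def cosine_similarity(doc_a, doc_b):
    """Cosine of the angle between the term-frequency vectors of two documents."""
    tf_a, tf_b = term_frequencies(doc_a), term_frequencies(doc_b)
    # Only terms present in both documents contribute to the dot product.
    dot = sum(count * tf_b[term] for term, count in tf_a.items())
    norm_a = math.sqrt(sum(c * c for c in tf_a.values()))
    norm_b = math.sqrt(sum(c * c for c in tf_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is treated as similar to nothing
    return dot / (norm_a * norm_b)
```

    Each pairwise comparison is independent of the others, which is one reason a workload of this shape is a plausible candidate for massively parallel hardware.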

  19. Comprehending text in literature class

    Directory of Open Access Journals (Sweden)

    Purić Daliborka S.

    2016-01-01

    Full Text Available The paper discusses the problem of understanding a text and the contribution of the methodological apparatus in reader books to the comprehension of texts read in the junior classes of elementary school. By applying content analysis to the methodological apparatuses of eight reader books for the fourth grade of elementary school, approved for use in the 2014/2015 academic year, and by surveying 350 teachers in 33 elementary schools and 11 administrative districts in the Republic of Serbia, we examined: (a) to what extent the contents of the Serbian language textbook enable junior students to understand a literary text; (b) to what extent teachers accept the suggestions offered in the textbook for preparing literature teaching. The results show that a large number of suggestions relate to reading comprehension, but some categories of understanding are unevenly distributed in the methodological apparatus. On the other hand, the majority of teachers use the methodological apparatus given in a textbook for preparing classes, drawing not only on the textbook they selected for teaching but also on other textbooks for the same grade.

  20. Introduction to Subject Indexing; a Programmed Text. Volume Two: UDC and Chain Procedure in Subject Cataloguing.

    Science.gov (United States)

    Brown, A. G.; And Others

    This is the second of two volumes dealing with practical classification and subject indexing. The programed text considers use of the Universal Decimal Classification (UDC) and techniques of cross referencing based on UDC in the construction of classified and alphabetical subject catalogs. (Author/LS)
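
    The record gives no worked example of chain procedure. As a rough sketch of the general idea, assuming the usual formulation in which each link of a classification chain yields an index entry qualified by its broader terms, the following generates entries from a chain ordered from general to specific. The function name is hypothetical and the sample notations are illustrative only.

```python
def chain_index_entries(chain):
    """Build alphabetical index entries from a classification chain.

    chain: list of (notation, term) pairs ordered general -> specific.
    Returns (heading, notation) pairs, most specific link first; each heading
    lists the link's term followed by its broader terms.
    """
    entries = []
    for depth in range(len(chain), 0, -1):
        terms = [term for _, term in chain[:depth]]
        heading = ": ".join(reversed(terms))
        entries.append((heading, chain[depth - 1][0]))
    return entries


# Illustrative chain; the notations are assumptions for the example.
sample_chain = [
    ("82", "Literature"),
    ("821.111", "English literature"),
    ("821.111-1", "Poetry"),
]
for heading, notation in chain_index_entries(sample_chain):
    print(f"{heading}  {notation}")
```

    Each entry points the catalog user from a term to the place in the classified sequence where that term, in that context, is filed.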

  1. Princess Brambilla - images/text

    Directory of Open Access Journals (Sweden)

    Maria Aparecida Barbosa

    2016-01-01

    Full Text Available Reading an illustrated literary text means thinking in pictures and words at once. This articulation between written text and pictures adds potential, expands, and grows complex. It coincides with current discussions of Giorgio Agamben's "contemporary", which adds to adherence to one's own time the displacement and distance needed to understand it, and which unsettles linear notions of historical chronology. The coincidence is in some way related to the current interest in the concept of "Nachleben" (survival), which involves the recovery of images of the past, postulated by the art historian Aby Warburg in his research on the characteristics of motion from ancient art in Botticelli's Renaissance pictures. For the translation into Portuguese of Princesa Brambilla – um capriccio segundo Jakob Callot, de E. T. A. Hoffmann, com 8 gravuras cunhadas a partir de moldes originais de Callot (1820), such discussions were fundamental, as I try to show in this article.

  2. Quality Inspection of Printed Texts

    DEFF Research Database (Denmark)

    Pedersen, Jesper Ballisager; Nasrollahi, Kamal; Moeslund, Thomas B.

    2016-01-01

    Inspecting the quality of printed texts is important in many industrial applications. To do so, this paper proposes a grading system which evaluates the performance of the printing task using quality measures for each character and symbol. The purpose of this grading system is two......-fold: for customers of the printing and verification system, the overall grade is used to verify whether the text is of sufficient quality, while for the printer's manufacturer, the detailed character/symbol grades and quality measurements are used for the improvement and optimization of the printing task. The proposed system...

  3. Text Mining in Biomedical Domain with Emphasis on Document Clustering

    Science.gov (United States)

    2017-01-01

    Objectives: With the exponential increase in the number of articles published every year in the biomedical domain, there is a need to build automated systems to extract unknown information from the articles published. Text mining techniques enable the extraction of unknown knowledge from unstructured documents. Methods: This paper reviews text mining processes in detail and the software tools available to carry out text mining. It also reviews the roles and applications of text mining in the biomedical domain. Results: Text mining processes, such as search and retrieval of documents, pre-processing of documents, natural language processing, methods for text clustering, and methods for text classification are described in detail. Conclusions: Text mining techniques can facilitate the mining of vast amounts of knowledge on a given topic from published biomedical research articles and draw meaningful conclusions that are not possible otherwise. PMID:28875048

  4. Text Mining in Biomedical Domain with Emphasis on Document Clustering.

    Science.gov (United States)

    Renganathan, Vinaitheerthan

    2017-07-01

    With the exponential increase in the number of articles published every year in the biomedical domain, there is a need to build automated systems to extract unknown information from the articles published. Text mining techniques enable the extraction of unknown knowledge from unstructured documents. This paper reviews text mining processes in detail and the software tools available to carry out text mining. It also reviews the roles and applications of text mining in the biomedical domain. Text mining processes, such as search and retrieval of documents, pre-processing of documents, natural language processing, methods for text clustering, and methods for text classification are described in detail. Text mining techniques can facilitate the mining of vast amounts of knowledge on a given topic from published biomedical research articles and draw meaningful conclusions that are not possible otherwise.
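
    The abstract names document clustering without describing a particular algorithm. As one simple illustration, and not a method taken from the review, the sketch below uses single-pass "leader" clustering with Jaccard similarity on token sets; the threshold value and all function names are assumptions made for the example.

```python
import re


def tokens(text):
    """Set of lowercase alphabetic word tokens in the text."""
    return set(re.findall(r"[a-z]+", text.lower()))


def jaccard(a, b):
    """Jaccard similarity of two token sets (0.0 when both are empty)."""
    union = a | b
    return len(a & b) / len(union) if union else 0.0


def leader_cluster(docs, threshold=0.3):
    """Single-pass clustering: each document joins the first cluster whose
    leader (its first member) is similar enough, else it starts a new cluster.
    Returns a list of clusters, each a list of document indices."""
    clusters = []
    for i, doc in enumerate(docs):
        doc_tokens = tokens(doc)
        for cluster in clusters:
            if jaccard(doc_tokens, tokens(docs[cluster[0]])) >= threshold:
                cluster.append(i)
                break
        else:
            clusters.append([i])  # no leader was close enough
    return clusters
```

    The result depends on document order and on the threshold, so a sketch like this is better suited to exploring a corpus than to producing a final grouping.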

  5. Mean translation of GLCM texture features for across-date settlement type classification of quickbird images

    CSIR Research Space (South Africa)

    Luus, FPS

    2013-07-01

    Full Text Available the spatial and temporal feature variance present in a settlement classification problem using panchromatic across-area and across-date high resolution QuickBird imagery. A numerical analysis indicates that a significant settlement classification accuracy...

  6. The effects of shadow removal on across-date settlement type classification of quickbird images

    CSIR Research Space (South Africa)

    Luus, FPS

    2012-07-01

    Full Text Available QuickBird imagery acquired on separate dates may have significant differences in viewing- and illumination geometries, which can negatively impact across-date settlement type classification accuracy. The effect of cast shadows on classification...

  7. Ecosystem services classification: A systems ecology perspective of the cascade framework

    CSIR Research Space (South Africa)

    La Notte, A

    2017-03-01

    Full Text Available Ecosystem services research faces several challenges stemming from the plurality of interpretations of classifications and terminologies. In this paper we identify two main challenges with current ecosystem services classification systems: i...

  8. REDD-İ MATLA IN DİVAN POETRY AS A STYLISTICS SYMPLOCE ELEMENT DEYİŞBİLİMSEL BİR PARALEL YİNELEME UNSURU OLARAK DİVAN ŞİİRİNDE REDD-İ MATLA

    Directory of Open Access Journals (Sweden)

    Özge ÖZTEKİN

    2010-10-01

    Full Text Available How well a poem is remembered depends not only on the effect of its form and content but also on the repetitions it contains. This study brings Divan poetry and stylistics together on the common ground of "style". Specifically, it deals with the "repetitions" used in the ghazals of Divan poetry and analyses these ghazals with linguistic methods. Linguistic elements in a poem, such as sounds, words, phrases, sentences and even lines, can be said to be repeated if they are used systematically and repeatedly. In other words, repetitions can be observed at each level of linguistic structure: phonological, morphological, syntactic and semantic. There is a conceptual similarity between symploce ("parallel repetition"), one of the types of morphemic repetition in stylistics, and the Divan poetry term "redd-i matla". The two terms point to similar concepts but carry different names in their own fields; the study presents this similarity using examples from Divan poetry.

  9. Seductive Texts with Serious Intentions.

    Science.gov (United States)

    Nielsen, Harriet Bjerrum

    1995-01-01

    Debates whether a text claiming to have scientific value is using seduction irresponsibly at the expense of the truth, and discusses who is the subject and who is the object of such seduction. It argues that, rather than being an assault against scientific ethics, seduction is a necessary premise for a sensible conversation to take place. (GR)

  10. Text linguistics: memory and representation

    Directory of Open Access Journals (Sweden)

    Leonor Lopes Fávero

    2012-12-01

    Full Text Available Text Linguistics originated in Brazil in the 1980s. The first work that we know of dates from 1981: Por uma gramática textual, by Prof. Ignacio Antonio Neiss. It was followed by two others in 1983: Linguística textual: o que é e como se faz, by Prof. Luiz Antônio Marcuschi, and Linguística textual: introdução, by Leonor Lopes Favero and Ingedore Villaça Koch. Professor Neiss shows how the initial attempts at text linguistics were generally related to structural and generative grammars. The work of Prof. Marcuschi focuses on the analysis of some definitions of text and on the study of theoretical aspects in relation to their applicability. Leonor Lopes Favero and Ingedore V. Koch aim to provide the Brazilian reader with an overview of text linguistics in Europe, then a recent branch of language science. This work is part of the History of Linguistic Ideas, itself part of Cultural History, which seeks to identify how, at different times, a social reality is constructed, designed, and enlightened (Chartier, 1990).

  11. Text Comprehension Processes in Bilinguals.

    Science.gov (United States)

    1985-08-01

    work unit 03.05 (utilization of bilingual Navy personnel). The objective of this work unit is to understand and improve the communicative competence of...project aimed at understanding and improving the communicative competence of bilingual personnel. Background Chang (1984) found that the text

  12. Hebrew Text Database ETCBC4

    NARCIS (Netherlands)

    Roorda, D.; Talstra, Eep; van Peursen, Wido Th.; Dyk, Janet; Sikkel, Constantijn; Glanz, Oliver; Oosting, Reinoud; Kalkman, Gino

    2014-01-01

    The ETCBC database of the Hebrew Bible (formerly known as the WIVU database) contains the scholarly text of the Hebrew Bible with linguistic markup. A previous version can be found in EASY (see the link below). The present dataset is an improvement in many ways: (A) it contains a new version of the

  13. Classification of hand eczema

    DEFF Research Database (Denmark)

    Agner, T; Aalto-Korte, K; Andersen, K E

    2015-01-01

    BACKGROUND: Classification of hand eczema (HE) is mandatory in epidemiological and clinical studies, and also important in clinical work. OBJECTIVES: The aim was to test a recently proposed classification system of HE in clinical practice in a prospective multicentre study. METHODS: Patients were...... HE, protein contact dermatitis/contact urticaria, hyperkeratotic endogenous eczema and vesicular endogenous eczema, respectively. An additional diagnosis was given if symptoms indicated that factors additional to the main diagnosis were of importance for the disease. RESULTS: Four hundred and twenty......%) could not be classified. 38% had one additional diagnosis and 26% had two or more additional diagnoses. Eczema on feet was found in 30% of the patients, statistically significantly more frequently associated with hyperkeratotic and vesicular endogenous eczema. CONCLUSION: We find that the classification...

  14. Cellular image classification

    CERN Document Server

    Xu, Xiang; Lin, Feng

    2017-01-01

    This book introduces new techniques for cellular image feature extraction, pattern recognition and classification. The authors use the antinuclear antibodies (ANAs) in patient serum as the subjects and the Indirect Immunofluorescence (IIF) technique as the imaging protocol to illustrate the applications of the described methods. Throughout the book, the authors provide evaluations for the proposed methods on two publicly available human epithelial (HEp-2) cell datasets: ICPR2012 dataset from the ICPR'12 HEp-2 cell classification contest and ICIP2013 training dataset from the ICIP'13 Competition on cells classification by fluorescent image analysis. First, the reading of imaging results is significantly influenced by one’s qualification and reading systems, causing high intra- and inter-laboratory variance. The authors present a low-order LP21 fiber mode for optical single cell manipulation and imaging staining patterns of HEp-2 cells. A focused four-lobed mode distribution is stable and effective in optical...

  15. AHP 45: REVIEW: TIBETAN LITERARY GENRES, TEXTS, AND TEXT TYPES

    Directory of Open Access Journals (Sweden)

    Zoe Tribur

    2017-03-01

    Full Text Available Following the quantitative tradition of sociolinguistic research pioneered by such scholars as William Labov, Walt Wolfram, and Penelope Eckert, Reynolds presents a detailed, coherent analysis of the social parameters behind a specific ongoing sound change, the merger of syllable-final bilabial nasal (m) with alveolar coronal nasal (n), in one small farming community in Qinghai Province. His is certainly not the first such study on Tibetan sound change. It is also not the first study to investigate the merger of (m) into (n), which is a prominent feature of so-called "farmer" dialects of Amdo Tibetan (Hua 2005). ...

  16. Text Plagi, detecció de text no citat

    OpenAIRE

    Martínez Vilanova, Albert

    2013-01-01

    This work presents a web application that detects plagiarized text in a file previously selected by the user, using the Summon Service API through a web environment. The idea came from the project director, Jordi Duran Cals, who proposed to me a collaboration with the Universitat Oberta de Catalunya to develop this new tool, which would meet teachers' need to detect possible plagiarism. With our application we have met the main objective...

  17. Acoustic classification of dwellings

    DEFF Research Database (Denmark)

    Berardi, Umberto; Rasmussen, Birgit

    2014-01-01

    insulation performance, national schemes for sound classification of dwellings have been developed in several European countries. These schemes define acoustic classes according to different levels of sound insulation. Due to the lack of coordination among countries, a significant diversity in terms...... of descriptors, number of classes, and class intervals occurred between national schemes. However, a proposal “acoustic classification scheme for dwellings” has been developed recently in the European COST Action TU0901 with 32 member countries. This proposal has been accepted as an ISO work item. This paper...

  18. The paradox of atheoretical classification

    DEFF Research Database (Denmark)

    Hjørland, Birger

    2016-01-01

    A distinction can be made between “artificial classifications” and “natural classifications,” where artificial classifications may adequately serve some limited purposes, but natural classifications are overall most fruitful by allowing inference and thus many different purposes. There is strong...... support for the view that a natural classification should be based on a theory (and, of course, that the most fruitful theory provides the most fruitful classification). Nevertheless, atheoretical (or “descriptive”) classifications are often produced. Paradoxically, atheoretical classifications may...... be very successful. The best example of a successful “atheoretical” classification is probably the prestigious Diagnostic and Statistical Manual of Mental Disorders (DSM) since its third edition from 1980. Based on such successes one may ask: Should the claim that classifications ideally are natural...

  19. Text Segmentation Using Exponential Models

    CERN Document Server

    Beeferman, Doug; Berger, Adam; Lafferty, John

    1997-01-01

    This paper introduces a new statistical approach to partitioning text automatically into coherent segments. Our approach enlists both short-range and long-range language models to help it sniff out likely sites of topic changes in text. To aid its search, the system consults a set of simple lexical hints it has learned to associate with the presence of boundaries through inspection of a large corpus of annotated data. We also propose a new probabilistically motivated error metric for use by the natural language processing and information retrieval communities, intended to supersede precision and recall for appraising segmentation algorithms. Qualitative assessment of our algorithm as well as evaluation using this new metric demonstrate the effectiveness of our approach in two very different domains, Wall Street Journal articles and the TDT Corpus, a collection of newswire articles and broadcast news transcripts.
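
    The abstract does not define the proposed error metric. The sketch below shows a fixed-window simplification of this family of metrics, often written P_k in later literature: it slides a window of width k over the text and counts how often the reference and the hypothesized segmentation disagree on whether the two window ends fall in the same segment. The default choice of k (half the mean reference segment length) and all names are assumptions for illustration, not the paper's definition.

```python
def to_labels(segment_lengths):
    """Expand segment lengths, e.g. [2, 3], into per-unit ids [0, 0, 1, 1, 1]."""
    labels = []
    for seg_id, length in enumerate(segment_lengths):
        labels.extend([seg_id] * length)
    return labels


def pk(reference, hypothesis, k=None):
    """Fraction of position pairs (i, i + k) on which the two segmentations
    disagree about whether both units lie in the same segment.
    Both arguments are lists of segment lengths over the same text."""
    ref, hyp = to_labels(reference), to_labels(hypothesis)
    if not ref or len(ref) != len(hyp):
        raise ValueError("segmentations must cover the same non-empty text")
    if k is None:
        # Assumed convention: half the mean reference segment length.
        k = max(1, round(len(ref) / len(reference) / 2))
    pairs = len(ref) - k
    if pairs <= 0:
        raise ValueError("window k must be smaller than the text length")
    errors = sum(
        (ref[i] == ref[i + k]) != (hyp[i] == hyp[i + k]) for i in range(pairs)
    )
    return errors / pairs
```

    Unlike precision and recall on exact boundary positions, a window-based score of this kind gives partial credit to a boundary that is placed close to, but not exactly at, the reference boundary.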

  20. Locative inferences in medical texts.

    Science.gov (United States)

    Mayer, P S; Bailey, G H; Mayer, R J; Hillis, A; Dvoracek, J E

    1987-06-01

    Medical research relies on epidemiological studies conducted on a large set of clinical records that have been collected from physicians recording individual patient observations. These clinical records are recorded for the purpose of individual care of the patient with little consideration for their use by a biostatistician interested in studying a disease over a large population. Natural language processing of clinical records for epidemiological studies must deal with temporal, locative, and conceptual issues. This makes text understanding and data extraction of clinical records an excellent area for applied research. While much has been done in making temporal or conceptual inferences in medical texts, parallel work in locative inferences has not been done. This paper examines the locative inferences as well as the integration of temporal, locative, and conceptual issues in the clinical record understanding domain by presenting an application that utilizes two key concepts in its parsing strategy--a knowledge-based parsing strategy and a minimal lexicon.