WorldWideScience

Sample records for stylistic text classification

  1. A stylistic classification of Russian-language texts based on the random walk model

    Science.gov (United States)

    Kramarenko, A. A.; Nekrasov, K. A.; Filimonov, V. V.; Zhivoderov, A. A.; Amieva, A. A.

    2017-09-01

    A formal approach to text analysis is suggested that is based on the random walk model. The frequencies and reciprocal positions of the vowel letters are matched up by a process of quasi-particle migration. Statistically significant difference in the migration parameters for the texts of different functional styles is found. Thus, a possibility of classification of texts using the suggested method is demonstrated. Five groups of the texts are singled out that can be distinguished from one another by the parameters of the quasi-particle migration process.

  2. STYLISTIC FEATURES OF ADVERTISING TEXTS OF INFORMATIVE AND COMPARATIVE TYPES

    Directory of Open Access Journals (Sweden)

    Poddubskaya, O.N.

    2016-06-01

    Full Text Available The relevance of this article is related to the fact that nowadays advertising has a very strong impact both on the consumer market and the political and cultural life of society, and on the language and its development as a system. Advertising has given rise to the development of a special set of stylistic features of a text, formed under the influence of reviving advertising traditions in the Russian language and under the active impact of energetic and pushy European advertising. The purpose of this study is to explore stylistic features of informative and comparative advertising texts. The object of research is Russian-language advertising in printed media and on television. At the end of the article, we draw conclusions about the groups of language means used for different stylistic devices in informative and comparative advertising texts. Analysis of stylistic features of modern informative and comparative advertising texts can be of great interest to specialists in the field of theoretical studies of modern advertising.

  3. DISCOURSE STYLISTICS AS CONTEXTUALIZED STYLISTICS

    Directory of Open Access Journals (Sweden)

    Marina Katnić-Bakaršić

    2003-01-01

    Full Text Available The focus of the paper is on discourse stylistics, viewed as a contextualized discipline. Context includes various factors (sociohistorical, cognitive, cultural and intertextual). The paper investigates the most important approaches to discourse stylistics: pragmatic stylistics, discourse and/or conversational analysis, cognitive stylistics, critical stylistics, and feminist stylistics. In discourse stylistics, analysis is always combined with interpretation, and description is followed by explanation and critique.

  4. Automated Determination of the Type of Genre and Stylistic Coloring of Russian Texts

    Directory of Open Access Journals (Sweden)

    Barakhnin Vladimir

    2017-01-01

    Full Text Available In this paper we propose an algorithm for the automated determination of the genre type and semantic characteristics of poetic texts in Russian. We formulate approaches to the construction of a joint (“two-dimensional”) classifier of genre types and stylistic colouring of poetic texts, based on determining the interdependence of the genre type and stylistic colouring of the text. On the basis of these approaches, the principles of forming the training samples for the algorithms that determine styles and genre types were analyzed. Computational experiments with a corpus of the Lyceum lyrics of A.S. Pushkin were carried out, which showed good results in determining the stylistic colouring of poetic texts and sufficient results in determining the genres. The proposed algorithms can be used to automate the complex analysis of Russian poetic texts, significantly facilitating the work of the expert in determining their styles and genres by providing appropriate recommendations.

  5. Speech Act Classification of German Advertising Texts

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2015-12-01

    Full Text Available This paper uses the theory of speech acts and the underlying concept of pragmalinguistics to determine the types of speech acts and their classification in German printed advertising texts. We ascertain that the advertising of cars and accessories, household appliances and computer equipment, watches, fancy goods, food, pharmaceuticals, and financial, insurance and legal services, as well as airline advertising, is dominated by a pragmatic principle based on demonstrating information about the benefits of a product/service. This influences the frequent usage of certain speech acts. The dominant form of exposure is to inform the recipient-user about the characteristics of the advertised product. This information is foregrounded by means of stylistic and syntactic constructions specific to the advertisement (participial constructions, appositional constructions), which help to emphasize certain notional components within the framework of the advertising text. Stylistic and syntactic devices of reduction (parceling constructions) convey the author's idea. Other means, such as repetitions and enumerations, are used by the advertiser to strengthen his selling power. The advertiser focuses the attention of the consumer on the characteristics of the product, seeking to convince him of the utility of the product and to influence his/her buying behavior.

  6. DISCOURSE STYLISTICS AS CONTEXTUALIZED STYLISTICS

    OpenAIRE

    Marina Katnić-Bakaršić

    2003-01-01

    The focus of the paper is on discourse stylistics, viewed as a contextualized discipline. Context includes various factors (sociohistorical, cognitive, cultural and intertextual). The paper investigates the most important approaches to discourse stylistics: pragmatic stylistics, discourse and/or conversational analysis, cognitive stylistics, critical stylistics, and feminist stylistics. In discourse stylistics, analysis is always combined with interpretation, and description is followed by explana...

  7. SAW Classification Algorithm for Chinese Text Classification

    OpenAIRE

    Xiaoli Guo; Huiyu Sun; Tiehua Zhou; Ling Wang; Zhaoyang Qu; Jiannan Zang

    2015-01-01

    With the explosive growth of data, the increasing volume of text places higher demands on the performance of text categorization, which existing classification methods cannot satisfy. Based on the study of existing text classification technology and semantics, this paper puts forward a SAW (Structural Auxiliary Word) algorithm oriented to Chinese text classification. The algorithm uses the special space effect of Chinese text where words...

  8. DYNAMIC FEATURE SELECTION FOR WEB USER IDENTIFICATION ON LINGUISTIC AND STYLISTIC FEATURES OF ONLINE TEXTS

    Directory of Open Access Journals (Sweden)

    A. A. Vorobeva

    2017-01-01

    Full Text Available The paper deals with identification and authentication of web users participating in Internet information processes (based on features of online texts). In digital forensics, web user identification based on various linguistic features can be used to discover the identity of individuals, criminals or terrorists using the Internet to commit cybercrimes. The Internet can be used as a tool in different types of cybercrimes (fraud and identity theft, harassment and anonymous threats, terrorist or extremist statements, distribution of illegal content and information warfare). Linguistic identification of web users is a kind of biometric identification; it can be used to narrow down the suspects, identify a criminal and prosecute him. The feature set includes various linguistic and stylistic features extracted from online texts. We propose dynamic feature selection for each web user identification task. Selection is based on calculating the Manhattan distance to the k nearest neighbors (Relief-f algorithm). This approach improves the identification accuracy and minimizes the number of features. Experiments were carried out on several datasets with different levels of class imbalance. Experimental results showed that feature relevance varies across different sets of web users (probable authors of some text); feature selection for each set of web users improves identification accuracy by 4% on average, which is approximately 1% higher than with a static set of features. The proposed approach is most effective for a small number of training samples (messages) per user.
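
    The selection step described above can be illustrated with a minimal sketch. It is not the authors' implementation: it is a simplified Relief-style weighting that assumes features are already scaled to [0, 1] and, unlike full ReliefF, pools the nearest misses from all other classes instead of weighting them by class prior. The names `relief_weights` and `select_features` and the toy data are hypothetical.

```python
from typing import List, Sequence


def manhattan(a: Sequence[float], b: Sequence[float]) -> float:
    """L1 (Manhattan) distance between two feature vectors."""
    return sum(abs(x - y) for x, y in zip(a, b))


def relief_weights(X: List[Sequence[float]], y: List[str], k: int = 1) -> List[float]:
    """Simplified Relief-style feature weights (assumes features in [0, 1]).

    A feature gains weight when it differs on nearest neighbours of another
    class (misses) and loses weight when it differs on nearest neighbours of
    the same class (hits).
    """
    n, d = len(X), len(X[0])
    w = [0.0] * d
    for i in range(n):
        # Rank all other samples by Manhattan distance to sample i.
        others = sorted((j for j in range(n) if j != i),
                        key=lambda j: manhattan(X[i], X[j]))
        hits = [j for j in others if y[j] == y[i]][:k]
        misses = [j for j in others if y[j] != y[i]][:k]
        for f in range(d):
            for j in hits:
                w[f] -= abs(X[i][f] - X[j][f]) / (n * k)
            for j in misses:
                w[f] += abs(X[i][f] - X[j][f]) / (n * k)
    return w


def select_features(w: List[float], top: int) -> List[int]:
    """Indices of the `top` highest-weighted features."""
    return sorted(range(len(w)), key=lambda f: w[f], reverse=True)[:top]


# Toy data: feature 0 separates the two authors, feature 1 is noise.
X = [[0.0, 0.2], [0.1, 0.9], [0.9, 0.1], [1.0, 0.8]]
y = ["author_a", "author_a", "author_b", "author_b"]
weights = relief_weights(X, y, k=1)
selected = select_features(weights, top=1)
```

    Recomputing the weights for each candidate set of authors, rather than once for all users, is what would make the selection "dynamic" in the sense of the abstract.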

  9. Text document classification

    Czech Academy of Sciences Publication Activity Database

    Novovičová, Jana

    č. 62 (2005), s. 53-54 ISSN 0926-4981 R&D Projects: GA AV ČR IAA2075302; GA AV ČR KSK1019101; GA MŠk 1M0572 Institutional research plan: CEZ:AV0Z10750506 Keywords : document representation * categorization * classification Subject RIV: BD - Theory of Information

  10. A contrastive-stylistic study into the tense distribution in English and Slovene fictional texts

    Directory of Open Access Journals (Sweden)

    Silvana Orel Kos

    2008-12-01

    Full Text Available The article addresses contrastive and narratological issues of the unity vs. diversity of temporal spheres in fictional texts. It focuses on the presentation of mimetic discourse within the past time-sphere narrative, trying to establish the narrative or stylistic functions of the present and past time-sphere verb actions with respect to the role of the narrator or that of the character. The diegetic and mimetic functions of verb actions in certain temporal spheres, i.e. tense usage in (free) indirect discourse and (free) direct discourse, will be contrastively studied in original fictional texts and their translations, in both directions between English and Slovene. The character’s mimetic discourse may be presented through different narrative forms, spanning the report-control cline from the forms “in total control” of the character, i.e. free direct discourse, to that “apparently in total control” of the narrator, i.e. speech act and thought act report (cf. Leech and Short 1981: 324). In addition to the character’s verbal and mental responses, the study includes mediated instances of the character’s sensory responses, the basic formula thus being: He said that/thought that/saw that. Our contrastive analysis considers only fictional texts whose diegesis is rendered in the narrative past tenses, as the English language system observes the sequence of tenses, while the Slovene language does not. The diegesis of a fictional text may be completely located in the present time-sphere, yet such texts do not present any major issues in terms of contrastive relevance for the studied language pair.

  11. Active Learning for Text Classification

    OpenAIRE

    Hu, Rong

    2011-01-01

    Text classification approaches are used extensively to solve real-world challenges. The success or failure of text classification systems hangs on the datasets used to train them; without a good dataset it is impossible to build a quality system. This thesis examines the applicability of active learning in text classification for the rapid and economical creation of labelled training data. Four main contributions are made in this thesis. First, we present two novel selection strategies to cho...

  12. Cluster Based Text Classification Model

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    We propose a cluster based classification model for suspicious email detection and other text classification tasks. The text classification tasks comprise many training examples that require a complex classification model. Using clusters for classification makes the model simpler and increases the accuracy at the same time. The test example is classified using a simpler and smaller model. The training examples in a particular cluster share a common vocabulary. At the time of clustering, we do not take into account the labels of the training examples. After the clusters have been created, the classifier is trained on each cluster, which has reduced dimensionality and fewer examples. The experimental results show that the proposed model outperforms the existing classification models for the task of suspicious email detection and topic categorization on the Reuters-21578 and 20 Newsgroups...
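
    The cluster-then-classify idea in this abstract can be sketched as follows. This is an illustration under stated assumptions, not the authors' model: the abstract does not name the clustering algorithm or the per-cluster classifier, so the sketch substitutes a small cosine k-means (seeded with the first k documents) and a per-cluster nearest-class-centroid classifier. All function names and the toy emails are hypothetical.

```python
import math
from collections import Counter, defaultdict


def vectorize(text):
    """Bag-of-words term counts for a whitespace-tokenized text."""
    return Counter(text.lower().split())


def cosine(a, b):
    dot = sum(v * b.get(t, 0) for t, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def centroid(vectors):
    """Sum of vectors; cosine similarity ignores the scale."""
    total = Counter()
    for v in vectors:
        total.update(v)
    return total


def kmeans(vectors, k, iters=10):
    # Assumption: deterministic seeding with the first k documents.
    centroids = [Counter(v) for v in vectors[:k]]
    assign = [0] * len(vectors)
    for _ in range(iters):
        assign = [max(range(k), key=lambda c: cosine(v, centroids[c]))
                  for v in vectors]
        for c in range(k):
            members = [v for v, a in zip(vectors, assign) if a == c]
            if members:
                centroids[c] = centroid(members)
    return centroids, assign


def train(texts, labels, k):
    vectors = [vectorize(t) for t in texts]
    centroids, assign = kmeans(vectors, k)  # labels are ignored while clustering
    models = []
    for c in range(k):
        by_label = defaultdict(list)
        for v, lab, a in zip(vectors, labels, assign):
            if a == c:
                by_label[lab].append(v)
        # One small model per cluster: a centroid for each label seen there.
        models.append({lab: centroid(vs) for lab, vs in by_label.items()})
    return centroids, models


def predict(text, centroids, models):
    v = vectorize(text)
    # Route the test example to the nearest non-empty cluster.
    candidates = [i for i in range(len(centroids)) if models[i]]
    c = max(candidates, key=lambda i: cosine(v, centroids[i]))
    return max(models[c], key=lambda lab: cosine(v, models[c][lab]))


texts = [
    "bank account transfer invoice payment",
    "football match score goal team",
    "bank account transfer secret attack plan",
    "football match stadium attack plan",
    "bank invoice payment receipt",
    "football team goal win",
]
labels = ["normal", "normal", "suspicious", "suspicious", "normal", "normal"]
centroids, models = train(texts, labels, k=2)
```

    Each per-cluster model only sees the vocabulary and examples of its own cluster, which mirrors the reduced dimensionality and smaller training sets mentioned in the abstract.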

  13. Transfer Learning beyond Text Classification

    Science.gov (United States)

    Yang, Qiang

    Transfer learning is a new machine learning and data mining framework that allows the training and test data to come from different distributions or feature spaces. We can find many novel applications of machine learning and data mining where transfer learning is necessary. While much has been done in transfer learning in text classification and reinforcement learning, there has been a lack of documented success stories of novel applications of transfer learning in other areas. In this invited article, I will argue that transfer learning is in fact quite ubiquitous in many real world applications. In this article, I will illustrate this point through an overview of a broad spectrum of applications of transfer learning that range from collaborative filtering to sensor based location estimation and logical action model learning for AI planning. I will also discuss some potential future directions of transfer learning.

  14. Towards an integrated corpus stylistics

    Directory of Open Access Journals (Sweden)

    McIntyre Dan

    2015-12-01

    Full Text Available Over recent years, the use of corpora in stylistic analysis has grown in popularity. However, questions still remain over the remit of corpus stylistics, its distinction from corpus linguistics generally and its capacity to explain complex stylistic effects. This article argues in favour of an integrated corpus stylistics; that is, an approach to corpus stylistics that integrates it with other stylistic methods and analytical frameworks. I suggest that this approach is needed for two main reasons: (i) it is analytically necessary in order to fully explain stylistic effects in texts, and (ii) integrating corpus methods with other stylistic tools is what will distinguish corpus stylistics from corpus linguistics. My argument is supported by reference to examples from Mark Haddon’s novel The Curious Incident of the Dog in the Night-time and the HBO TV series Deadwood. Both these examples rely for their explanation on a combination of corpus stylistic analytical techniques and other stylistic methods of analysis.

  15. Multimodal Stylistics: The Happy Marriage of Stylistics and Semiotics

    DEFF Research Database (Denmark)

    Nørgaard, Nina

    2010-01-01

    Stylistics is the systematic study of the ways in which meaning is created by linguistic means in literature and other types of text. It arose from a wish to make literary criticism more “scientific” by anchoring the analysis of literature more solidly in the actual grammar and lexis of the texts put up for analysis. Since the first major flourishing of stylistics in the 1960s, different linguistic paradigms and other academic trends of the times have caused the field to branch off into a great variety of sub-fields such as formalist stylistics, functionalist stylistics, cognitive stylistics ..., doctor-patient discourse, academic writing, etc. While forceful in its rigour and systematism, the traditional stylistic approach (whether of a formalist, functionalist, cognitive or other orientation) has until recently largely failed to embrace meanings which are created by semiotic systems other than...

  16. Arabic text classification using Polynomial Networks

    Directory of Open Access Journals (Sweden)

    Mayy M. Al-Tahrawi

    2015-10-01

    Full Text Available In this paper, an Arabic statistical learning-based text classification system has been developed using Polynomial Neural Networks. Polynomial Networks have been recently applied to English text classification, but they were never used for Arabic text classification. In this research, we investigate the performance of Polynomial Networks in classifying Arabic texts. Experiments are conducted on a widely used Arabic dataset in text classification: Al-Jazeera News dataset. We chose this dataset to enable direct comparisons of the performance of Polynomial Networks classifier versus other well-known classifiers on this dataset in the literature of Arabic text classification. Results of experiments show that Polynomial Networks classifier is a competitive algorithm to the state-of-the-art ones in the field of Arabic text classification.

  17. About the role of stylistic and syntactic devices of expansion in the informational complex of dicteme of a German advertising text

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2012-12-01

    Full Text Available The article highlights stylistic and syntactic devices of expansion, which act as compositional means, vary normative syntactic structure of an advertising text, contribute to sense formation, creating conditions for the purpose of advertiser’s intent. By means of these language elements expressing invariant tactic sense the advertiser consciously expands and/or complicates the informative complex of dicteme, an acting text unit, transmitting superfluous impressive information together with factual one. Combination of factual and impressive items of information activates both rational and emotional perceptional channels of prospective consumer, intensifies the positioning process of an advertised article.

  18. CCM: A Text Classification Method by Clustering

    DEFF Research Database (Denmark)

    Nizamani, Sarwat; Memon, Nasrullah; Wiil, Uffe Kock

    2011-01-01

    In this paper, a new Cluster based Classification Model (CCM) for suspicious email detection and other text classification tasks is presented. Comparative experiments of the proposed model against traditional classification models and the boosting algorithm are also discussed. Experimental results show that the CCM outperforms traditional classification models as well as the boosting algorithm for the task of suspicious email detection on a terrorism domain email dataset and topic categorization on the Reuters-21578 and 20 Newsgroups datasets. The overall finding is that applying a cluster based...

  19. Strategies to Increase Accuracy in Text Classification

    NARCIS (Netherlands)

    D. Blommesteijn (Dennis)

    2014-01-01

    Text classification via supervised learning involves various steps, from processing raw data and feature extraction to training and validating classifiers. Within these steps, implementation decisions are critical to the resulting classifier accuracy. This paper contains a report of the

  20. Stylistic devices in comical proverbs

    Directory of Open Access Journals (Sweden)

    Burmistrova L. V.

    2017-04-01

    Full Text Available The article analyses stylistic devices in Russian and English comical proverbs. The author shows their influence on the content of comical proverbs and reveals a comic effect in them.

  1. Text document classification based on mixture models

    Czech Academy of Sciences Publication Activity Database

    Novovičová, Jana; Malík, Antonín

    2004-01-01

    Roč. 40, č. 3 (2004), s. 293-304 ISSN 0023-5954 R&D Projects: GA AV ČR IAA2075302; GA ČR GA102/03/0049; GA AV ČR KSK1019101 Institutional research plan: CEZ:AV0Z1075907 Keywords : text classification * text categorization * multinomial mixture model Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.224, year: 2004

  2. A Road to Aesthetic Stylistics

    Directory of Open Access Journals (Sweden)

    Samir Al-Sheikh

    2016-08-01

    Full Text Available Being a linguistic phenomenon, poetry is marked by the defamiliarization of language. In a poetic discourse there is an aesthetic distortion of the normal codes, in which the aesthetic value is the most prominent function of the poetic texture. This study is a new adventure in correlating linguistics to aesthetics by and through the so-called approach of Aesthetic Stylistics (AS). Aesthetic stylistics is the application of the theory of beauty to the intentionally violated components in a literary text. It proceeds with the hypothesis that John Keats's Ode on a Grecian Urn and Kabbani's Maritime Poem are disinterested poetic experiences which create ecstatic responses to the reader's awareness; therefore, the judgment of the reader's taste is aesthetic. The study aims at highlighting the stylistic-aesthetic factors which generate the judgment of taste. While drawing heavily on the aestheticism of the Prague Linguistic Circle and Halliday's Functional Linguistics (FL), or what has come to be called the Traditional European Functionalism, the study will analyze Keats's Ode and Kabbani's poem in terms of Kant's Kritik der Urteilskraft (KdU). The two circles of the linguistic description and aesthetic interpretation will be internally interlinked to create the coherence of the stylistic process. The study consists of an introduction and two parts, one in theory and the other in analysis; it is eventually rounded up with concluding remarks elicited from the semiotic quest. Keywords: Stylistics, Functionalism, Aesthetics

  3. Automatic Amharic text news classification: A neural networks ...

    African Journals Online (AJOL)

    School of Computing and Electrical Engineering, Institute of Technology, Bahir Dar University, Bahir Dar ... The study is on classification of Amharic news automatically using neural networks approach. Learning Vector ... INTRODUCTION.

  4. Research on Classification of Chinese Text Data Based on SVM

    Science.gov (United States)

    Lin, Yuan; Yu, Hongzhi; Wan, Fucheng; Xu, Tao

    2017-09-01

    Data mining has important application value in today’s industry and academia. Text classification is a very important technology in data mining. At present, there are many mature algorithms for text classification. KNN, NB, AB, SVM, decision tree and other classification methods all show good classification performance. The Support Vector Machine (SVM) classification method is a good classifier in machine learning research. This paper studies the classification performance of the SVM method on Chinese text data, applies the support vector machine method to classify Chinese text, and aims to combine academic research with practical application.

  5. Functional Stylistics and Peripeteic Texts

    DEFF Research Database (Denmark)

    Borchmann, Simon

    2008-01-01

    Using a pragmatically based linguistic description apparatus on literary use of language is not unproblematic. Observations show that literary use of language violates the norms contained by this apparatus. With this paper I suggest how we can deal with this problem by setting up a frame for the ...

  6. Discourse Analysis in Stylistics and Literature Instruction.

    Science.gov (United States)

    Short, Mick

    1990-01-01

    A review of research regarding discourse analysis in stylistics and literature instruction covers studies of text, systematic analysis, meaning, style, literature pedagogy, and applied linguistics. A 10-citation annotated bibliography and a larger unannotated bibliography are included. (CB)

  7. Using Unlabeled Data to Improve Text Classification

    National Research Council Canada - National Science Library

    Nigam, Kamal P

    2001-01-01

    .... This dissertation demonstrates that supervised learning algorithms that use a small number of labeled examples and many inexpensive unlabeled examples can create high-accuracy text classifiers...

  8. Text Categorization Using Weight Adjusted k-Nearest Neighbor Classification

    National Research Council Canada - National Science Library

    Han, Euihong; Karypis, George; Kumar, Vipin

    1999-01-01

    .... The authors present a nearest neighbor classification scheme for text categorization in which the importance of discriminating words is learned using mutual information and weight adjustment techniques...
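
    To make the idea of learning term importance from mutual information concrete, here is a minimal sketch. It is not the authors' weight-adjusted scheme: the iterative weight adjustment step is omitted, documents are reduced to sets of terms, and the mutual information between term presence and class is used directly as the term weight in a weighted similarity. The function names and toy documents are hypothetical.

```python
import math
from collections import Counter


def mutual_information(docs, labels):
    """Mutual information (in nats) between term presence and class label."""
    n = len(docs)
    class_counts = Counter(labels)
    vocab = set().union(*docs)
    weights = {}
    for term in vocab:
        mi = 0.0
        for present in (True, False):
            n_t = sum(1 for d in docs if (term in d) == present)
            for c, n_c in class_counts.items():
                n_tc = sum(1 for d, lab in zip(docs, labels)
                           if (term in d) == present and lab == c)
                if n_tc:
                    mi += (n_tc / n) * math.log(n_tc * n / (n_t * n_c))
        weights[term] = max(mi, 0.0)  # clamp tiny negative rounding errors
    return weights


def weighted_similarity(a, b, w):
    """Cosine-like similarity of two term sets under term weights w."""
    shared = sum(w.get(t, 0.0) for t in a & b)
    norm = math.sqrt(sum(w.get(t, 0.0) for t in a) *
                     sum(w.get(t, 0.0) for t in b))
    return shared / norm if norm else 0.0


def knn_predict(query, docs, labels, w, k=3):
    """Similarity-weighted vote among the k most similar training documents."""
    ranked = sorted(zip(docs, labels),
                    key=lambda dl: weighted_similarity(query, dl[0], w),
                    reverse=True)[:k]
    votes = Counter()
    for d, lab in ranked:
        votes[lab] += weighted_similarity(query, d, w)
    return votes.most_common(1)[0][0]


docs = [
    {"goal", "match", "the"},
    {"team", "goal", "the"},
    {"vote", "election", "the"},
    {"party", "vote", "the"},
]
labels = ["sport", "sport", "politics", "politics"]
w = mutual_information(docs, labels)
prediction = knn_predict({"the", "goal", "party"}, docs, labels, w, k=3)
```

    In this toy data the term "the" occurs in every document, so its weight is zero and it no longer contributes to the neighbour ranking, while class-specific terms such as "goal" dominate it.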

  9. Categorization and Pathology of Persian Stylistic Researches

    Directory of Open Access Journals (Sweden)

    Maryam Dorpar

    2014-08-01

    Full Text Available Abstract In the following article, surveys and research on Persian style are categorized into two branches: historical and formalistic styles. Mohammad Taghi Bahar founded stylistics as an autonomous field of knowledge by publishing his book, History of the Evolution of Persian Prose (1331), for teaching at the University of Tehran. This book, which was influenced by the verbal instructions of the Qajar dynasty’s scholars, set the path that researchers in Persian stylistics have generally followed up to now. However, researchers and critics have introduced various theories and approaches during the last four decades. Stagnation in Persian stylistic research is the main problem considered in the current article. The main questions are: “What branch of stylistics should be the performed Persian stylistic researches?”, “What are the weak points of the surveys?” and “What should be done to get rid of this stagnation?” The main objective of the current article is to take steps toward removing stagnation from Persian stylistics. Malek osh-Shoara Bahar used periodization in studying prose styles and analyzed the revolution of Persian prose in its lexical aspect (obsolete words, Arabic words, synonyms, word repetition), morphological aspect (verbal prefixes, comparative adjective suffixes), syntactic aspect (precedence of the verb over its belongings, omission of verbs) and rhetorical aspect (simile and allegory, metonymy and metaphor, prolixity and periphrasis, rhyming prose and harmony). In fact, he tried to show both the periods of health and strength and the periods of laxity and corruption of prose. We call Bahar's stylistics, and all research done in his way, historical stylistics with a traditional attitude. In this method, the consistency and evolution of styles through history have been studied, and the periodization of styles has been taken into account. The mentioned research periodized styles, finding formal similarities and differences. Since, neglecting meaning and text functionality they have only paid

  10. Categorization and Pathology of Persian Stylistic Researches

    Directory of Open Access Journals (Sweden)

    Maryam Dorpar

    2014-07-01

    Full Text Available Abstract In the following article, surveys and research on Persian style are categorized into two branches: historical and formalistic styles. Mohammad Taghi Bahar founded stylistics as an autonomous field of knowledge by publishing his book, History of the Evolution of Persian Prose (1331), for teaching at the University of Tehran. This book, which was influenced by the verbal instructions of the Qajar dynasty’s scholars, set the path that researchers in Persian stylistics have generally followed up to now. However, researchers and critics have introduced various theories and approaches during the last four decades. Stagnation in Persian stylistic research is the main problem considered in the current article. The main questions are: “What branch of stylistics should be the performed Persian stylistic researches?”, “What are the weak points of the surveys?” and “What should be done to get rid of this stagnation?” The main objective of the current article is to take steps toward removing stagnation from Persian stylistics. Malek osh-Shoara Bahar used periodization in studying prose styles and analyzed the revolution of Persian prose in its lexical aspect (obsolete words, Arabic words, synonyms, word repetition), morphological aspect (verbal prefixes, comparative adjective suffixes), syntactic aspect (precedence of the verb over its belongings, omission of verbs) and rhetorical aspect (simile and allegory, metonymy and metaphor, prolixity and periphrasis, rhyming prose and harmony). In fact, he tried to show both the periods of health and strength and the periods of laxity and corruption of prose. We call Bahar's stylistics, and all research done in his way, historical stylistics with a traditional attitude. In this method, the consistency and evolution of styles through history have been studied, and the periodization of styles has been taken into account. The mentioned research periodized styles, finding formal similarities and differences. Since, neglecting meaning and text

  11. Empirical Studies On Machine Learning Based Text Classification Algorithms

    OpenAIRE

    Shweta C. Dharmadhikari; Maya Ingle; Parag Kulkarni

    2011-01-01

    Automatic classification of text documents has become an important research issue nowadays. Proper classification of text documents requires information retrieval, machine learning and natural language processing (NLP) techniques. Our aim is to focus on important approaches to automatic text classification based on machine learning techniques, viz. supervised, unsupervised and semi-supervised. In this paper we present a review of various text classification approaches under machine learning paradig...

  12. Classification process in a text document recommender system

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2005-12-01

    Full Text Available This paper presents the classification process in a recommender system used for textual documents taken especially from web. The system uses in the classification process a combination of content filters, event filters and collaborative filters and it uses implicit and explicit feedback for evaluating documents.

  13. Hot complaint intelligent classification based on text mining

    Directory of Open Access Journals (Sweden)

    XIA Haifeng

    2013-10-01

    Full Text Available The complaint recognizer system plays an important role in ensuring the correct classification of hot complaints and improving the service quantity of the telecommunications industry. Customer complaints in the telecommunications industry have the particularity that they must be handled within a limited time, which causes errors in the classification of hot complaints. The paper presents a model for the intelligent classification of hot complaints based on text mining, which can classify a hot complaint at the correct level of the complaint navigation. The examples show that the model can classify complaint texts efficiently.

  14. Text mining in the classification of digital documents

    Directory of Open Access Journals (Sweden)

    Marcial Contreras Barrera

    2016-11-01

    Full Text Available Objective: Develop an automated classifier for the classification of bibliographic material by means of the text mining. Methodology: The text mining is used for the development of the classifier, based on a method of type supervised, conformed by two phases; learning and recognition, in the learning phase, the classifier learns patterns across the analysis of bibliographical records, of the classification Z, belonging to library science, information sciences and information resources, recovered from the database LIBRUNAM, in this phase is obtained the classifier capable of recognizing different subclasses (LC. In the recognition phase the classifier is validated and evaluates across classification tests, for this end bibliographical records of the classification Z are taken randomly, classified by a cataloguer and processed by the automated classifier, in order to obtain the precision of the automated classifier. Results: The application of the text mining achieved the development of the automated classifier, through the method classifying documents supervised type. The precision of the classifier was calculated doing the comparison among the assigned topics manually and automated obtaining 75.70% of precision. Conclusions: The application of text mining facilitated the creation of automated classifier, allowing to obtain useful technology for the classification of bibliographical material with the aim of improving and speed up the process of organizing digital documents.

  15. The ‘indisciplinarity’ of stylistics

    Directory of Open Access Journals (Sweden)

    Sorlin Sandrine

    2014-12-01

    Full Text Available This paper aims at showing why the stylistician can be construed as a prolific “impostor” in a most positive sense: pledged to no specific linguistic prophet, she can opt for different theoretical linguistic tools (in the sphere of pragmatics, critical discourse analysis, cognitive grammar, etc.) depending on her object of study and what her research question is. The liberty claimed by the stylistician explains why stylistics is the “undisciplined” child of linguistics, shirking any clear definition of its boundaries. It will be argued that stylistics can only exist as a cross-disciplinary field given its conception of language as fundamentally contextualized. If it was a discipline determined by clear-cut pre-established boundaries, stylistics would be far more “disciplined” but would run the risk of serving only itself. The broad goal of this paper is thus to evince that the “indisciplinarity” of stylistics constitutes its very defining essence. With this aim in mind, it will demonstrate what stylistics owes to other disciplines, what it shares with similar language-based disciplines and what it can offer to other fields or practices of knowledge.

  16. Rational kernels for Arabic Root Extraction and Text Classification

    Directory of Open Access Journals (Sweden)

    Attia Nehar

    2016-04-01

    Full Text Available In this paper, we address the problems of Arabic Text Classification and root extraction using transducers and rational kernels. We introduce a new root extraction approach based on the use of Arabic patterns (Pattern Based Stemmer). Transducers are used to model these patterns, and root extraction is done without relying on any dictionary. Using transducers for extracting roots, documents are transformed into finite state transducers. This document representation allows us to use and explore rational kernels as a framework for Arabic Text Classification. Root extraction experiments are conducted on three word collections and yield 75.6% accuracy. Classification experiments are done on the Saudi Press Agency dataset, and N-gram kernels are tested with different values of N. Accuracy and F1 reach 90.79% and 62.93%, respectively. These results show that our approach, when compared with other approaches, is promising, especially in terms of accuracy and F1.
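
    The N-gram kernel idea mentioned above can be illustrated in a much simplified form: the kernel value of two documents is taken as the inner product of their n-gram count vectors. This is only a sketch on plain token lists, not the transducer-based rational kernels of the paper, and the root-like tokens below are invented placeholders.

```python
from collections import Counter


def ngram_counts(tokens, n):
    """Count every contiguous n-gram in a token sequence."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_kernel(a, b, n):
    """Inner product of the n-gram count vectors of two token sequences."""
    counts_a, counts_b = ngram_counts(a, n), ngram_counts(b, n)
    return sum(count * counts_b[gram] for gram, count in counts_a.items())


# Hypothetical documents already reduced to root-like tokens.
doc1 = ["ktb", "drs", "ktb"]
doc2 = ["drs", "ktb", "ktb"]

unigram_similarity = ngram_kernel(doc1, doc2, 1)  # shared single tokens
bigram_similarity = ngram_kernel(doc1, doc2, 2)   # shared adjacent pairs
```

    Larger values of N reward longer shared sequences, which is why the choice of N matters when such a kernel is passed to a kernel classifier.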

  17. Semantic Document Image Classification Based on Valuable Text Pattern

    Directory of Open Access Journals (Sweden)

    Hossein Pourghassem

    2011-01-01

    Full Text Available Knowledge extraction from detected document images is a complex problem in the field of information technology. This problem becomes more intricate when we know that only a negligible percentage of the detected document images are valuable. In this paper, a segmentation-based classification algorithm is used to analyze the document image. In this algorithm, using a two-stage segmentation approach, regions of the image are detected and then classified into document and non-document (pure region) regions in a hierarchical classification. A novel definition of valuable is proposed to classify document images into valuable or invaluable categories. The proposed algorithm is evaluated on a database of document and non-document images obtained from the Internet. Experimental results show the efficiency of the proposed algorithm in semantic document image classification. The proposed algorithm provides an accuracy rate of 98.8% for the valuable and invaluable document image classification problem.

  18. An Introduction to Literary Quaranic Stylistics

    Science.gov (United States)

    Almenoar, Lubna

    2010-01-01

    A stylistic analysis is one approach of analyzing a literary text using literary descriptions. The use of literary texts in the literature classroom has been limited to mostly Western sources. This paper is an attempt to create an awareness of the linguistic features present in the English language translations of the meaning of the Quran. The…

  19. A Chinese text classification system based on Naive Bayes algorithm

    Directory of Open Access Journals (Sweden)

    Cui Wei

    2016-01-01

    Full Text Available In this paper, aiming at the characteristics of Chinese text classification, we use ICTCLAS (the Chinese lexical analysis system of the Chinese Academy of Sciences) for document segmentation, clean the data and filter stop words, and use the information gain and document frequency feature selection algorithms for document feature selection. On this basis, we implement a text classifier based on the Naive Bayes algorithm, and carry out experiments and analysis on the system using the Chinese corpus of Fudan University.

  20. Categorizing Children: Automated Text Classification of CHILDES files

    NARCIS (Netherlands)

    Opsomer, Rob; Knoth, Peter; Wiering, Marco; van Polen, Freek; Trapman, Jantine

    2008-01-01

    In this paper we present the application of machine learning text classification methods to two tasks: categorization of children’s speech in the CHILDES Database according to gender and age. Both tasks are binary. For age, we distinguish two age groups between the age of 1.9 and 3.0 years old. The

  1. Improving imbalanced scientific text classification using sampling strategies and dictionaries

    Directory of Open Access Journals (Sweden)

    Borrajo L.

    2011-12-01

    Full Text Available Many real applications have the imbalanced class distribution problem, where one of the classes is represented by a very small number of cases compared to the other classes. Among the systems affected are those related to the retrieval and classification of scientific documentation.

  2. Social Media Text Classification by Enhancing Well-Formed Text Trained Model

    Directory of Open Access Journals (Sweden)

    Phat Jotikabukkana

    2016-09-01

    Full Text Available Social media are a powerful communication tool in our era of digital information. The large amount of user-generated data is a useful novel source of data, even though it is not easy to extract the treasures from this vast and noisy trove. Since classification is an important part of text mining, many techniques have been proposed to classify this kind of information. We developed an effective technique of social media text classification by semi-supervised learning utilizing an online news source consisting of well-formed text. The computer first automatically extracts news categories, well-categorized by publishers, as classes for topic classification. A bag of words taken from news articles provides the initial keywords related to their category in the form of word vectors. The principal task is to retrieve a set of new productive keywords. Term Frequency-Inverse Document Frequency weighting (TF-IDF) and Word Article Matrix (WAM) are used as the main methods. A modification of WAM is recomputed until it becomes the most effective model for social media text classification. The key success factor was enhancing our model with effective keywords from social media. A promising result of 99.50% accuracy was achieved, with more than 98.5% of Precision, Recall, and F-measure after updating the model three times.
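
    As a rough illustration of the TF-IDF weighting named above, the sketch below computes one common variant (relative term frequency times the natural log of inverse document frequency). The exact variant used by the authors is not stated in the abstract, the Word Article Matrix step is not modeled, and the toy documents are invented.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenized document.

    Assumed variant: tf = count / document length, idf = ln(N / document frequency).
    """
    n_docs = len(docs)
    # Document frequency: in how many documents each term occurs at least once.
    doc_freq = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)
        weights.append({
            term: (count / len(doc)) * math.log(n_docs / doc_freq[term])
            for term, count in counts.items()
        })
    return weights


# Hypothetical news snippets for two categories.
news = [["goal", "match", "team"], ["market", "team", "stock"]]
news_weights = tf_idf(news)
```

    A term that occurs in every document ("team" here) receives weight zero, while category-specific terms keep a positive weight, which is what makes them candidate keywords.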

  3. Overfitting Reduction of Text Classification Based on AdaBELM

    Directory of Open Access Journals (Sweden)

    Xiaoyue Feng

    2017-07-01

    Full Text Available Overfitting is an important problem in machine learning. Several algorithms, such as the extreme learning machine (ELM), suffer from this issue when facing high-dimensional sparse data, e.g., in text classification. One common issue is that the extent of overfitting is not well quantified. In this paper, we propose a quantitative measure of overfitting referred to as the rate of overfitting (RO) and a novel model, named AdaBELM, to reduce the overfitting. With RO, the overfitting problem can be quantitatively measured and identified. The newly proposed model can achieve high performance on multi-class text classification. To evaluate the generalizability of the new model, we designed experiments based on three datasets, i.e., the 20 Newsgroups, Reuters-21578, and BioMed corpora, which represent balanced, unbalanced, and real application data, respectively. Experiment results demonstrate that AdaBELM can reduce overfitting and outperform classical ELM, decision tree, random forests, and AdaBoost on all three text-classification datasets; for example, it can achieve 62.2% higher accuracy than ELM. Therefore, the proposed model has a good generalizability.

  4. Computational text analysis and reading comprehension exam complexity towards automatic text classification

    CERN Document Server

    Liontou, Trisevgeni

    2014-01-01

    This book delineates a range of linguistic features that characterise the reading texts used at the B2 (Independent User) and C1 (Proficient User) levels of the Greek State Certificate of English Language Proficiency exams in order to help define text difficulty per level of competence. In addition, it examines whether specific reader variables influence test takers' perceptions of reading comprehension difficulty. The end product is a Text Classification Profile per level of competence and a formula for automatically estimating text difficulty and assigning levels to texts consistently and re

  5. Classification of protein-protein interaction full-text documents using text and citation network features.

    Science.gov (United States)

    Kolchinsky, Artemy; Abi-Haidar, Alaa; Kaur, Jasleen; Hamed, Ahmed Abdeen; Rocha, Luis M

    2010-01-01

    We participated (as Team 9) in the Article Classification Task of the BioCreative II.5 Challenge: binary classification of full-text documents relevant for protein-protein interaction. We used two distinct classifiers for the online and offline challenges: 1) the lightweight Variable Trigonometric Threshold (VTT) linear classifier we successfully introduced in BioCreative 2 for binary classification of abstracts and 2) a novel Naive Bayes classifier using features from the citation network of the relevant literature. We supplemented the supplied training data with full-text documents from the MIPS database. The lightweight VTT classifier was very competitive in this new full-text scenario: it was a top-performing submission in this task, taking into account the rank product of the Area Under the interpolated precision and recall Curve, Accuracy, Balanced F-Score, and Matthews Correlation Coefficient performance measures. The novel citation network classifier for the biomedical text mining domain, while not a top performing classifier in the challenge, performed above the central tendency of all submissions, and therefore indicates a promising new avenue to investigate further in bibliome informatics.

  6. DEEP LEARNING MODEL FOR BILINGUAL SENTIMENT CLASSIFICATION OF SHORT TEXTS

    Directory of Open Access Journals (Sweden)

    Y. B. Abdullin

    2017-01-01

    Full Text Available Sentiment analysis of short texts such as Twitter messages and comments in news portals is challenging due to the lack of contextual information. We propose a deep neural network model that uses bilingual word embeddings to effectively solve the sentiment classification problem for a given pair of languages. We apply our approach to two corpora of two different language pairs: English-Russian and Russian-Kazakh. We show how to train a classifier in one language and predict in another. Our approach achieves 73% accuracy for English and 74% accuracy for Russian. For Kazakh sentiment analysis, we propose a baseline method that achieves 60% accuracy, and a method to learn bilingual embeddings from a large unlabeled corpus using bilingual word pairs.

  7. A journey through the stylistics of poetry

    DEFF Research Database (Denmark)

    Jensen, Kim Ebensgaard

    2015-01-01

    Review of Peter Verdonk, The Stylistics of Poetry: Context, Cognition, Discourse, History. (Series: Advances in Stylistics). London: Bloomsbury, 2013, xi + 198 pp., ISBN 978-1-4411-5878-9.

  8. A Stylistic Research of Western Advertisements

    Institute of Scientific and Technical Information of China (English)

    翟蕾

    2014-01-01

    The research involves the following two parts: the first part is the analysis of the advertising language; the second part is the analysis of the register, namely applying the advertising context to a wider social context to find a more effective communicative means. Stylistics enables one to make a discourse analysis of advertising texts from both a microcosmic and a macroscopic perspective. The twofold demonstrative function points out a new theoretical way for advertising research.

  9. Construction accident narrative classification: An evaluation of text mining techniques.

    Science.gov (United States)

    Goh, Yang Miang; Ubeynarayana, C U

    2017-11-01

    Learning from past accidents is fundamental to accident prevention. Thus, accident and near miss reporting are encouraged by organizations and regulators. However, for organizations managing large safety databases, the time taken to accurately classify accident and near miss narratives will be very significant. This study aims to evaluate the utility of various text mining classification techniques in classifying 1000 publicly available construction accident narratives obtained from the US OSHA website. The study evaluated six machine learning algorithms, including support vector machine (SVM), linear regression (LR), random forest (RF), k-nearest neighbor (KNN), decision tree (DT) and Naive Bayes (NB), and found that SVM produced the best performance in classifying the test set of 251 cases. Further experimentation with tokenization of the processed text and non-linear SVM was also conducted. In addition, a grid search was conducted on the hyperparameters of the SVM models. It was found that the best performing classifiers were linear SVM with unigram tokenization and radial basis function (RBF) SVM with unigram tokenization. In view of its relative simplicity, the linear SVM is recommended. Across the 11 labels of accident causes or types, the precision of the linear SVM ranged from 0.5 to 1, recall ranged from 0.36 to 0.9 and F1 score was between 0.45 and 0.92. The reasons for misclassification were discussed and suggestions on ways to improve the performance were provided. Copyright © 2017 Elsevier Ltd. All rights reserved.
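
    Per-label precision, recall and F1 ranges like those reported above are computed from a classifier's predictions on a test set. The sketch below shows one way to do this; the label names and predictions are invented for illustration and are not taken from the study.

```python
from collections import Counter


def per_label_scores(y_true, y_pred):
    """Return {label: (precision, recall, f1)} for single-label predictions."""
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted label was wrong
            fn[truth] += 1  # true label was missed
    scores = {}
    for label in set(y_true) | set(y_pred):
        predicted = tp[label] + fp[label]
        actual = tp[label] + fn[label]
        precision = tp[label] / predicted if predicted else 0.0
        recall = tp[label] / actual if actual else 0.0
        total = precision + recall
        f1 = 2 * precision * recall / total if total else 0.0
        scores[label] = (precision, recall, f1)
    return scores


# Hypothetical accident-type labels for four narratives.
truth = ["falls", "falls", "struck", "struck"]
predicted = ["falls", "struck", "struck", "struck"]
label_scores = per_label_scores(truth, predicted)
```

    Reporting the scores per label, rather than a single accuracy figure, exposes labels with few examples, where recall tends to be lowest.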

  10. MODERN LINGUODIDACTIC ASPECTS OF COGNITIVE APPROACH REALIZATION IN TEACHING STYLISTICS OF THE UKRAINIAN LANGUAGE TO STUDENTS

    Directory of Open Access Journals (Sweden)

    Anzhelika Popovych

    2017-09-01

    Full Text Available An approach to teaching stylistics – is a fundamental methodological category that defines the system of studying discipline, the ways of organizing the teaching material and the peculiarities of the interaction of all components of the educational process: principles, methods, ways of teaching. The linguocognitive approach in the study of stylistics aims at identifying aspects of the speech world picture, interpreting texts from the standpoint of cognitive processes, forming the cognitive and linguistic culture of students and the corresponding way of linguistic expression. The following levels of linguocognitive approach to the study of stylistics in higher education are distinguished, such as knowledge, practical and educational levels. The knowledge level involves students studying the foundations of cognitive linguistics and cognitive stylistics, systematic consideration of cognitive structures and processes, understanding the meaning of «concept» and interpreting the language and aesthetic characters of national culture. The perception of the text, its decoding, as well as the production are realized on a practical level. The educational level is aimed at forming the national linguistic and speech consciousness; respect for Ukrainian language traditions; education of speech culture; the desire to follow the aesthetic and ethical norms of communication. According to the contemporary aspects of the development of linguistic and linguistic-stylistic science, not only the clarification of the linguistic structural-level stylistic features of texts, the presence of traces and stylistic figures, but the identification of aspects of the linguistic picture of the world, the linguistic and aesthetic signs of national culture are relevant. Therefore, the cognitive-stylistic analysis of the text will be appropriate for the lessons of stylistics. The linguocognitive approach to the study of the stylistics of the Ukrainian language is extremely

  11. Stylistic Analysis of Maya Angelou’s Equality

    Directory of Open Access Journals (Sweden)

    arina isti'anah

    2017-11-01

    Full Text Available This research presents a stylistic analysis of a poem by Maya Angelou, Equality. The poem was chosen because it is one of Angelou’s well-known poems. Stylistic analysis aims at comprehending the meanings of either literary or non-literary texts by observing the language devices used in them. To achieve the goal of stylistic analysis, several language levels were observed: the phonological, graphological, grammatical, and semantic levels. At the phonological level, the repetition of rhyme in some stanzas, assonance, consonance, and alliteration were used to voice Angelou’s dream about freedom for black people. At the graphological level, the use of prominent punctuation in stanzas 3, 6, and 9 stressed equality as the requirement for the freedom she expected. At the grammatical level, Angelou used the pronouns I and you as the dominant words in the poem, revealing the different classes the poet experienced in the country. The use of metaphors in the poem conveyed meanings such as freedom, voice, effort, and the racism that black people experienced in America. This research concludes that stylistics can be applied to analyze literary works so that a thorough appreciation of them can be achieved.

  12. THE PRODUCT DESIGN PROCESS USING STYLISTIC SURFACES

    Directory of Open Access Journals (Sweden)

    Arkadiusz Gita

    2017-06-01

    Full Text Available Increasing consumer requirements regarding the appearance of everyday products force manufacturers to put more emphasis on product design. Constructors, apart from the functional aspects of the parts they create, are forced to pay attention to aesthetic aspects. Software for designing A-class surfaces is very helpful in this case. Extensive quality analysis modules facilitate the work and make it possible to obtain models with specific visual features. The authors present the design process of a product using stylistic surfaces, based on the front panel of a moped casing. In addition, methods of analysis of the design surface and product technology are presented.

  13. THE COMPOSER AND FOLKLORE PROBLEM: FACTORS OF STYLISTIC STRUCTURE

    Directory of Open Access Journals (Sweden)

    COCEAROVA GALINA

    2017-12-01

    Full Text Available This paper continues the author’s earlier study of the Composer and Folklore problem from the stylistic point of view. It is noted that in academic music, where the attention is focused not only on the speech or text characteristics, but primarily on the linguistic and stylistic material of folklore, the appeal to folk sources leads to the emergence of a number of stylistic factors, both in the formation of the national style and in the field of ethnic culture as a whole and integral stable system. The research points to the role of folklore as the genetic code of ethnic culture, as well as to other factors acting at the level of musical discourse and musical language, contributing to the formation of „language flexibility” (A. Kolmogorov) and, as a result, „flexibility of style”.

  14. Words Matter: Scene Text for Image Classification and Retrieval

    NARCIS (Netherlands)

    Karaoglu, S.; Tao, R.; Gevers, T.; Smeulders, A.W.M.

    Text in natural images typically adds meaning to an object or scene. In particular, text specifies which business places serve drinks (e.g., cafe, teahouse) or food (e.g., restaurant, pizzeria), and what kind of service is provided (e.g., massage, repair). The mere presence of text, its words, and

  15. Investigation into Text Classification With Kernel Based Schemes

    Science.gov (United States)

    2010-03-01

    Document Matrix TDMs Term-Document Matrices TMG Text to Matrix Generator TN True Negative TP True Positive VSM Vector Space Model xxii THIS PAGE...are represented as a term-document matrix, common evaluation metrics, and the software package Text to Matrix Generator (TMG). The classifier...AND METRICS This chapter introduces the indexing capabilities of the Text to Matrix Generator (TMG) Toolbox. Specific attention is placed on the

  16. Some Stylistic Aspects of Social Advertising in Russia

    Directory of Open Access Journals (Sweden)

    Aigul F. Khanova

    2017-10-01

    Full Text Available The article considers some stylistic aspects of advertising database in Russia. It examines linguistic and stylistic properties and peculiarities of social advertising and the impact it has on public consciousness. It determines that social advertisements in Russia are characteristic of the vocabulary belonging to the low language norms which reflects cultural and ethical context. Figurative language and stylistic devices aim at appealing to emotions and make the advertisement more memorable. The authors deem it necessary to create a common database on social advertising in Russia in order to facilitate the analysis of economic impact and evaluate the capacity to exert effect on mainstream audience as well as determine strategies to build advertising campaigns.

  17. Consideration on the history and the stylistic use of prefixes

    Directory of Open Access Journals (Sweden)

    Antonio Carlos Silva de Carvalho

    2016-07-01

    Full Text Available This paper aims to discuss prefixes from a historical perspective, as well as to observe nuances of stylistic value in them. The choice of subject was basically due to two reasons: (i) the considerations set out by Martins (2003) on the low stylistic productivity caused by prefixal derivation, especially if compared to suffixal derivation; and (ii) the considerations set out by Silva (2009) on the so-called de-language, of negativity, in Manoel de Barros. At first we worked on a brief historically and etymologically-oriented incursion on prefixes; and then, subjecting the reflections we gathered to a punctual corpus by the author, we highlighted examples in which features of the morpho-stylistic nature that contribute to the singularity of his work, also linked to the aesthetics of the fragmentary and to the smallest beings, can be explored.

  18. Translating children’s literature: some insights from corpus stylistics

    Directory of Open Access Journals (Sweden)

    Anna Čermáková

    2018-01-01

    Full Text Available In this paper I explore the potential of a corpus stylistic approach to the study of literary translation. The study focuses on translation of children’s literature with its specific constraints, and illustrates with two corpus linguistic techniques: keyword and cluster analysis (specific cases of repetition). So in a broader sense the paper discusses the phenomenon of repetition in different literary (stylistic) traditions. These are illustrated by examples from two children’s classics aimed at two different age groups, the Harry Potter and the Winnie the Pooh books, and their translations into Czech. Various shifts in translation, especially in the translation of children’s literature, are often explained by the operation of so-called ‘translation universals’. Though ‘repetition’ as such does not belong to the commonly discussed set of translation universals, the stylistic norms opposing repetition seem to be a strong explanation for the translation shifts identified.

  19. Text Classification and Distributional features techniques in Datamining and Warehousing

    OpenAIRE

    Bethu, Srikanth; Babu, G Charless; Vinoda, J; Priyadarshini, E; rao, M Raghavendra

    2013-01-01

    Text Categorization is traditionally done by using the term frequency and inverse document frequency. This type of method is not very good because some words which are not so important may appear in the document. The term frequency of unimportant words may increase and the document may be classified in the wrong category. To reduce the error of classifying documents in the wrong category, Distributional Features are introduced. In the Distributional Features, the Distribution of the words in ...

  20. TEXT CLASSIFICATION USING NAIVE BAYES UPDATEABLE ALGORITHM IN SBMPTN TEST QUESTIONS

    Directory of Open Access Journals (Sweden)

    Ristu Saptono

    2017-01-01

    Full Text Available Document classification is of growing interest in text mining research. Classification can be done based on topics, languages, and so on. This study was conducted to determine how Naive Bayes Updateable performs in classifying SBMPTN exam questions by theme. Naive Bayes Updateable is an incremental model of the Naive Bayes classifier, an algorithm often used in text classification; it can learn from new data introduced to the system even after the classifier has been built from the existing data. The Naive Bayes classifier classifies the exam questions by field-of-study theme by analyzing keywords that appear in the exam questions. One feature selection method, DF-Thresholding, is implemented to improve classification performance. Evaluation of the classification with the Naive Bayes classifier algorithm yields 84.61% accuracy.
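
    The incremental behaviour described above can be sketched with a multinomial Naive Bayes model that stores only counts, so that a new labeled example updates the model without retraining. This is a minimal illustration with Laplace smoothing, not the implementation used in the study; the class name, themes and keywords are hypothetical.

```python
import math
from collections import Counter, defaultdict


class UpdateableNaiveBayes:
    """Multinomial Naive Bayes whose counts can be updated one example at a time."""

    def __init__(self):
        self.class_counts = Counter()             # documents seen per class
        self.term_counts = defaultdict(Counter)   # term occurrences per class
        self.vocabulary = set()

    def update(self, tokens, label):
        """Fold one labeled document into the model."""
        self.class_counts[label] += 1
        self.term_counts[label].update(tokens)
        self.vocabulary.update(tokens)

    def predict(self, tokens):
        """Return the class with the highest log posterior (None if untrained)."""
        total_docs = sum(self.class_counts.values())
        best_label, best_score = None, -math.inf
        for label, n_docs in self.class_counts.items():
            counts = self.term_counts[label]
            # Laplace smoothing: add one to every term count.
            denominator = sum(counts.values()) + len(self.vocabulary)
            score = math.log(n_docs / total_docs)
            for token in tokens:
                score += math.log((counts[token] + 1) / denominator)
            if score > best_score:
                best_label, best_score = label, score
        return best_label


# Hypothetical exam-question keywords and themes.
model = UpdateableNaiveBayes()
model.update(["integral", "derivative"], "math")
model.update(["cell", "enzyme"], "biology")
first_guess = model.predict(["derivative"])

# New data arrives later; the existing model is updated in place.
model.update(["mitosis", "cell"], "biology")
second_guess = model.predict(["mitosis"])
```

    Because only counts are stored, each update costs time proportional to the length of the new document, independent of how much data was seen before.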

  1. AN IMPLEMENTATION OF EIS-SVM CLASSIFIER USING RESEARCH ARTICLES FOR TEXT CLASSIFICATION

    Directory of Open Access Journals (Sweden)

    B Ramesh

    2016-04-01

    Full Text Available Automatic text classification is a prominent research topic in text mining. Text pre-processing plays a major role in a text classifier, and efficient pre-processing techniques increase the classifier's performance. In this paper, we implement the ECAS stemmer, Efficient Instance Selection and a Pre-computed Kernel Support Vector Machine for text classification using recent research articles. We use the ECAS stemmer to find root words, Efficient Instance Selection for dimensionality reduction of the text data, and a Pre-computed Kernel Support Vector Machine for classification of the selected instances. Experiments were performed on 750 research articles in three classes: engineering articles, medical articles and educational articles. The EIS-SVM classifier provides better performance in real-time research article classification.

  2. METHODS OF TEXT INFORMATION CLASSIFICATION ON THE BASIS OF ARTIFICIAL NEURAL AND SEMANTIC NETWORKS

    Directory of Open Access Journals (Sweden)

    L. V. Serebryanaya

    2016-01-01

    Full Text Available The article covers the use of the perceptron, the Hopfield artificial neural network and the semantic network for classification of text information. Network training algorithms are studied. An error backpropagation algorithm for the perceptron network and a convergence algorithm for the Hopfield network are implemented. On the basis of the proposed models and algorithms, automatic text classification software is developed and its operation results are evaluated.
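
    For orientation, the sketch below trains a single-layer perceptron with the classic perceptron learning rule on bag-of-words count vectors. It is a deliberately simpler technique than the multi-layer network and training algorithm described in the article, and the feature vectors and labels are invented.

```python
def train_perceptron(samples, n_features, epochs=20, learning_rate=1.0):
    """Train on (feature_vector, label) pairs with labels in {+1, -1}."""
    weights = [0.0] * n_features
    bias = 0.0
    for _ in range(epochs):
        for features, label in samples:
            activation = sum(w * x for w, x in zip(weights, features)) + bias
            if label * activation <= 0:
                # Misclassified (or on the boundary): move the boundary toward the sample.
                weights = [w + learning_rate * label * x
                           for w, x in zip(weights, features)]
                bias += learning_rate * label
    return weights, bias


def predict(weights, bias, features):
    """Return +1 or -1 depending on the side of the decision boundary."""
    activation = sum(w * x for w, x in zip(weights, features)) + bias
    return 1 if activation > 0 else -1


# Hypothetical counts of two keywords per document; +1 and -1 are two text classes.
training_set = [([2, 0], 1), ([1, 0], 1), ([0, 2], -1), ([0, 1], -1)]
weights, bias = train_perceptron(training_set, n_features=2)
```

    The rule converges only when the classes are linearly separable, as they are in this toy data.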

  3. From defamiliarization to foregrounding and defeated expectancy: Linguo-stylistic and cognitive sketch

    Directory of Open Access Journals (Sweden)

    Kupchyshyna Yuliya

    2017-12-01

    Full Text Available The article focuses on revealing the nature of defamiliarization, foregrounding, and defeated expectancy from a linguo-stylistic and cognitive perspective. It has been stated that defamiliarization, composed by different types of foregrounding and defeated expectancy as deviation, generated with a certain stylistic purpose are complex phenomena. The article highlights cognitive factors which ensure the creation of defamiliarization and defeated expectancy in the literary texts.

  4. Stylistic Features of Comment in Arabic Blogosphere

    Directory of Open Access Journals (Sweden)

    Gabdulzyamil G. Zaynullin

    2017-11-01

    Full Text Available One of the most important issues in the study of the functioning of the Internet language is the definition of the features of each Internet genre presented in online communication, taking into account the linguocultural features of the language in question. This paper studies the genre of Internet comments in the Arabic-speaking blogosphere and reveals its stylistic features. The most common goal of a comment is gratitude, followed by praise. We created a corpus of comments from blogs on various subjects and then tagged it, identifying the group to which each comment belongs depending on its subject and communicative goal. With the help of the Lexico 3 software, the most frequent lexical units were identified, the lexical features of the comments were described, the main one being the widespread use of religionyms, and the relationship between the blog subject and the stylistic characteristics of communication was revealed. The article traces the correlation between the literary and colloquial functional styles in the comments, and also concludes that the comments are of a conversational, informal character. The main devices of expressiveness that are characteristic of both network and pre-network communication were revealed, as was the tendency to observe in the comments a stable three-part composition (greeting, message, final formula). The influence of traditional Arabic rhetoric, as well as of the epistolary genre, was preserved. The results of the paper can be used when studying other genres of Internet communication in Arabic and in comparative studies to create linguistic software.

  5. ARABIC TEXT CLASSIFICATION USING NEW STEMMER FOR FEATURE SELECTION AND DECISION TREES

    Directory of Open Access Journals (Sweden)

    SAID BAHASSINE

    2017-06-01

    Full Text Available Text classification is the process of assigning unclassified text to appropriate classes based on its content. The most prevalent representation for text classification is the bag-of-words vector. In this representation, the words that appear in documents often have multiple morphological structures and grammatical forms. In most cases, these morphological variants of a word belong to the same category. In the first part of this paper, a new stemming algorithm is developed in which each term of a given document is represented by its root. In the second part, a comparative study is conducted of the impact of two stemming algorithms, namely Khoja’s stemmer and our new stemmer (referred to hereafter as origin-stemmer), on Arabic text classification. This investigation was carried out using chi-square as a feature selection method to reduce the dimensionality of the feature space and a decision tree classifier. In order to evaluate the performance of the classifier, this study used a corpus of 5070 documents independently classified into six categories: sport, entertainment, business, Middle East, switch and world, on the WEKA toolkit. The recall, f-measure and precision measures are used to compare the performance of the obtained models. The experimental results show that text classification using the root stemmer outperforms classification using Khoja’s stemmer. The f-measure was 92.9% in the sport category and 89.1% in the business category.
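
    Chi-square feature selection, as mentioned above, scores each term by how strongly its presence depends on a class and keeps the highest-scoring terms. The sketch below computes the standard 2x2 chi-square statistic for one term and one class; the tiny corpus is invented and the stemming step is not modeled.

```python
def chi_square(docs, labels, term, target):
    """Chi-square statistic between the presence of `term` and the class `target`."""
    a = b = c = d = 0
    for tokens, label in zip(docs, labels):
        has_term = term in tokens
        in_class = label == target
        if has_term and in_class:
            a += 1  # term present, document in class
        elif has_term:
            b += 1  # term present, document in another class
        elif in_class:
            c += 1  # term absent, document in class
        else:
            d += 1  # term absent, document in another class
    n = a + b + c + d
    denominator = (a + c) * (b + d) * (a + b) * (c + d)
    return n * (a * d - b * c) ** 2 / denominator if denominator else 0.0


# Hypothetical stemmed documents for two of the categories.
corpus = [{"goal", "team"}, {"goal", "match"}, {"market", "team"}, {"market", "stock"}]
categories = ["sport", "sport", "business", "business"]

goal_score = chi_square(corpus, categories, "goal", "sport")
team_score = chi_square(corpus, categories, "team", "sport")
```

    A term spread evenly across classes ("team" here) scores zero and would be dropped, while a class-specific term ("goal") scores high and would be kept as a feature.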

  6. On stylistic automatization of lexical units in various types of contexts

    Directory of Open Access Journals (Sweden)

    В В Зуева

    2009-12-01

    Full Text Available Stylistic automatization of lexical units in various types of contexts is investigated in this article. Following the works of Boguslav Havranek and other linguists of the Prague Linguistic School automatization is treated as a contextual narrowing of the meaning of a lexical unit to the level of its complete predictability in situational contexts and the lack of stylistic contradiction with other lexical units in speech.

  7. MODERN LINGUODIDACTIC ASPECTS OF COGNITIVE APPROACH REALIZATION IN TEACHING STYLISTICS OF THE UKRAINIAN LANGUAGE TO STUDENTS

    OpenAIRE

    Popovych, Anzhelika

    2017-01-01

    An approach to teaching stylistics is a fundamental methodological category that defines the system of studying the discipline, the ways of organizing the teaching material and the peculiarities of the interaction of all components of the educational process: principles, methods, ways of teaching. The linguocognitive approach in the study of stylistics aims at identifying aspects of the speech world picture, interpreting texts from the standpoint of cognitive processes, forming the cognitive an...

  8. Healthcare Text Classification System and its Performance Evaluation: A Source of Better Intelligence by Characterizing Healthcare Text.

    Science.gov (United States)

    Srivastava, Saurabh Kumar; Singh, Sandeep Kumar; Suri, Jasjit S

    2018-04-13

    A machine learning (ML)-based text classification system has several classifiers. The performance evaluation (PE) of the ML system is typically driven by the training data size and the partition protocols used. Such systems lead to low accuracy because the text classification systems lack the ability to model the input text data in terms of noise characteristics. This research study proposes a concept of misrepresentation ratio (MRR) on input healthcare text data and models the PE criteria for validating the hypothesis. Further, such a novel system provides a platform to amalgamate several attributes of the ML system such as: data size, classifier type, partitioning protocol and percentage MRR. Our comprehensive data analysis consisted of five types of text data sets (TwitterA, WebKB4, Disease, Reuters (R8), and SMS); five kinds of classifiers (support vector machine with linear kernel (SVM-L), MLP-based neural network, AdaBoost, stochastic gradient descent and decision tree); and five types of training protocols (K2, K4, K5, K10 and JK). Using the decreasing order of MRR, our ML system demonstrates the mean classification accuracies as: 70.13 ± 0.15%, 87.34 ± 0.06%, 93.73 ± 0.03%, 94.45 ± 0.03% and 97.83 ± 0.01%, respectively, using all the classifiers and protocols. The corresponding AUC is 0.98 for SMS data using the Multi-Layer Perceptron (MLP)-based neural network. Among all the classifiers, the best accuracy of 91.84 ± 0.04% is shown by the MLP-based neural network, which is 6% better than previously published results. Further, we observed that as MRR decreases, the system robustness increases, as validated by the standard deviations. The overall text system accuracy using all data types, classifiers, protocols is 89%, thereby showing the entire ML system to be novel, robust and unique. The system is also tested for stability and reliability.

  9. Prediction of cause of death from forensic autopsy reports using text classification techniques: A comparative study.

    Science.gov (United States)

    Mujtaba, Ghulam; Shuib, Liyana; Raj, Ram Gopal; Rajandram, Retnagowri; Shaikh, Khairunisa

    2018-07-01

    Automatic text classification techniques are useful for classifying plaintext medical documents. This study aims to automatically predict the cause of death from free text forensic autopsy reports by comparing various schemes for feature extraction, term weighting or feature value representation, text classification, and feature reduction. For experiments, the autopsy reports belonging to eight different causes of death were collected, preprocessed and converted into 43 master feature vectors using various schemes for feature extraction, representation, and reduction. Six different text classification techniques were applied on these 43 master feature vectors to construct a classification model that can predict the cause of death. Finally, classification model performance was evaluated using four performance measures, i.e. overall accuracy, macro precision, macro-F-measure, and macro recall. From experiments, it was found that unigram features obtained the highest performance compared to bigram, trigram, and hybrid-gram features. Furthermore, in feature representation schemes, term frequency, and term frequency with inverse document frequency obtained similar and better results when compared with binary frequency, and normalized term frequency with inverse document frequency. Furthermore, the chi-square feature reduction approach outperformed Pearson correlation, and information gain approaches. Finally, in text classification algorithms, support vector machine classifier outperforms random forest, Naive Bayes, k-nearest neighbor, decision tree, and ensemble-voted classifier. Our results and comparisons hold practical importance and serve as references for future works. Moreover, the comparison outputs will act as state-of-the-art techniques to compare future proposals with existing automated text classification techniques. Copyright © 2017 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.
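
    The four performance measures named in this abstract can be computed as in the following sketch. The function name `macro_scores` and the toy labels are hypothetical, and macro-F is taken here as the unweighted mean of per-class F1 scores, which is one common convention; the abstract does not state which convention the study used.

```python
def macro_scores(y_true, y_pred):
    """Overall accuracy plus macro-averaged precision, recall and F-measure."""
    classes = sorted(set(y_true) | set(y_pred))
    precisions, recalls, f1s = [], [], []
    for c in classes:
        pairs = list(zip(y_true, y_pred))
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f1 = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f1s.append(f1)
    n = len(classes)
    return {
        "accuracy": sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true),
        "macro_precision": sum(precisions) / n,
        "macro_recall": sum(recalls) / n,
        "macro_f": sum(f1s) / n,  # assumed convention: mean of per-class F1
    }


# Hypothetical toy predictions for two classes.
scores = macro_scores(["a", "a", "b", "b"], ["a", "b", "b", "b"])
```

    Macro averaging weights every class equally, so a rare cause of death counts as much as a frequent one; overall accuracy does not have this property.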

  10. Food and Beverage Stylist and Photography

    OpenAIRE

    BEKAR, Aydan; KARAKULAK, Çisem

    2016-01-01

    A food and beverage stylist makes food and beverage look appetizing by preparing them properly in order to get customers’ attention. A food and beverage photographer gets the most impressive image by using different shooting techniques. Food and beverage stylists and photographers prepare attractive and unusual menus, brochures, banners and ads for food and beverage enterprises so that products can look better when customers see them. People see the works of food and beverage styling and phot...

  11. Categorization and Pathology of Persian Stylistic Researches

    OpenAIRE

    Maryam Dorpar

    2014-01-01

    Abstract In the following article, surveys and studies of Persian style were categorized into two branches, historical and formalistic. Mohammad Taghi Bahar founded stylistics as an autonomous discipline by publishing his book, History of the Evolution of Persian Prose (1331), for teaching at the University of Tehran. This book, which was influenced by the verbal instructions of the Qajar dynasty’s scholars, paved the way generally followed by researchers in Persian stylistics up ...

  12. Stylistic analysis of songs in beverage advertisement

    Institute of Scientific and Technical Information of China (English)

    周双卉

    2012-01-01

    With the development of advertising, people have tended to study its stylistics. In this paper, however, the focus is on the songs in beverage advertisements. The analysis concentrates on the features of beverage advertisement songs and their stylistics. The aim of the paper is to improve the understanding of beverage advertisement songs among the public and scholars.

  13. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification

    Directory of Open Access Journals (Sweden)

    Yin Wang

    2016-01-01

    Full Text Available Background. Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Results. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. Conclusions. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

  14. Stylistic Analysis of the Short Story ‘The Last Word’ by Dr. A. R. Tabassum

    Directory of Open Access Journals (Sweden)

    Abdul Bari Khan

    2015-06-01

    Full Text Available In this article a stylistic analysis of the short story ‘The Last Word’ by Dr. A. R. Tabassum is performed. The formative elements of the story, such as point of view, characters and allegorical element, are discussed in detail so as to give a better insight into the story. The story is analyzed stylistically in terms of figures of speech where grammatical, lexical and phonological schemes are considered, following the checklist of linguistic and stylistic categories proposed by Leech and Short. Features of repetition, parallelism, alliteration, consonance, assonance and rhyme are focused on. Finally, the findings and conclusion are given to sum up the discussion. Keywords: stylistics, analysis, short story, last word, allegory, Tabassum

  15. A Stylistic Analysis of Four Translations of J. D. Salinger's The Catcher in the Rye

    Directory of Open Access Journals (Sweden)

    Silva Bratož

    2004-12-01

    Full Text Available The paper looks at stylistic differences between four translations of J. D. Salinger’s Catcher in the Rye – two Slovene translations, a Serbo-Croatian, and an Italian translation. Firstly, stylistic components relevant to the novel in question are identified. In this respect, the translation of teenage speech and idiom appears to be not only the most conspicuous stylistic feature of the original but also the hardest to translate. Secondly, the ways in which the different translations have rendered certain formal and lexical features of style are compared by determining and describing their function. A large number of examples have been submitted to critical scrutiny, of which only a few representative ones are listed and explained in the paper. Finally, this paper points to some particular difficulties of the four translators in their attempts to reproduce the stylistic components of the original.

  16. DEVELOPMENT OF FOREIGN LANGUAGE STYLISTIC COMPETENCE OF FUTURE PHILOLOGISTS: GRAMMATICAL ASPECT

    Directory of Open Access Journals (Sweden)

    Олена Вовк

    2015-05-01

    Full Text Available The article studies a grammatical aspect of developing stylistic competence of students of linguistic departments. Particularly, the stylistic competence, which is defined as a capacity to create adequate utterances under natural conditions of communication according to a concrete situation, is characterized. To highlight the importance of acquiring stylistic competence, the levels of speech development of an individual are identified and the stages of teaching grammar are differentiated. The approaches to teaching stylistic grammar are characterized within a communicative framework and relevant skills are elucidated. The role of functional styles in teaching a foreign language is clarified. The idea of teaching students to make register shifts and mix speech registers in the process of acquiring foreign language competence is highlighted. The theoretical principles are illustrated with appropriate examples of exercises.

  17. Motif-Based Text Mining of Microbial Metagenome Redundancy Profiling Data for Disease Classification.

    Science.gov (United States)

    Wang, Yin; Li, Rudong; Zhou, Yuhua; Ling, Zongxin; Guo, Xiaokui; Xie, Lu; Liu, Lei

    2016-01-01

    Text data of 16S rRNA are informative for classifications of microbiota-associated diseases. However, the raw text data need to be systematically processed so that features for classification can be defined/extracted; moreover, the high-dimension feature spaces generated by the text data also pose an additional difficulty. Here we present a Phylogenetic Tree-Based Motif Finding algorithm (PMF) to analyze 16S rRNA text data. By integrating phylogenetic rules and other statistical indexes for classification, we can effectively reduce the dimension of the large feature spaces generated by the text datasets. Using the retrieved motifs in combination with common classification methods, we can discriminate different samples of both pneumonia and dental caries better than other existing methods. We extend the phylogenetic approaches to perform supervised learning on microbiota text data to discriminate the pathological states for pneumonia and dental caries. The results have shown that PMF may enhance the efficiency and reliability in analyzing high-dimension text data.

  18. Automatic topic identification of health-related messages in online health community using text classification.

    Science.gov (United States)

    Lu, Yingjie

    2013-01-01

    To facilitate patient involvement in online health communities and help patients obtain the informative and emotional support they need, a topic identification approach was proposed in this paper for automatically identifying the topics of health-related messages in an online health community, thus assisting patients in reaching the most relevant messages for their queries efficiently. A feature-based classification framework was presented for automatic topic identification in our study. We first collected the messages related to some predefined topics in an online health community. Then we combined three different types of features, n-gram-based features, domain-specific features and sentiment features, to build four feature sets for health-related text representation. Finally, three different text classification techniques, C4.5, Naïve Bayes and SVM, were adopted to evaluate our topic classification model. By comparing different feature sets and different classification techniques, we found that n-gram-based features, domain-specific features and sentiment features were all effective in distinguishing different types of health-related topics. In addition, a feature reduction technique based on information gain was also effective in improving the topic classification performance. In terms of classification techniques, SVM outperformed C4.5 and Naïve Bayes significantly. The experimental results demonstrated that the proposed approach could identify the topics of online health-related messages efficiently.
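
    The information-gain criterion used for feature reduction here can be sketched as follows, for a single binary feature (for example, whether a given n-gram occurs in a message). The function names and the toy data are assumptions for illustration; the abstract does not describe the study's actual implementation.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    n = len(labels)
    return -sum((c / n) * math.log2(c / n) for c in Counter(labels).values())


def information_gain(has_feature, labels):
    """Reduction in label entropy obtained by splitting on a binary feature."""
    n = len(labels)
    with_f = [l for f, l in zip(has_feature, labels) if f]
    without_f = [l for f, l in zip(has_feature, labels) if not f]
    conditional = sum(
        len(part) / n * entropy(part) for part in (with_f, without_f) if part
    )
    return entropy(labels) - conditional


# Hypothetical toy data: four messages, two topics.
topics = ["a", "a", "b", "b"]
ig_informative = information_gain([True, True, False, False], topics)
ig_useless = information_gain([True, False, True, False], topics)
```

    Features are ranked by this score and the lowest-ranked ones are dropped; the cut-off is a tuning choice that the abstract does not specify.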

  19. Academic origins and characteristics of the Chinese stylistic restoration

    Directory of Open Access Journals (Sweden)

    Xi Chen

    2016-09-01

    Full Text Available The conservation practice in China, termed “Chinese stylistic restoration” in this study, has been influenced by the traditional Chinese philosophy and construction principles, the modern Chinese conservation theory of Liang Sicheng and Liu Dunzhen, and Western and international theories and policies concerning conservation. This study uses three case studies, namely, Shanghai Zhenru Temple, Jianfu Palace Garden, and Angkor Wat Chau Say Tevoda Temple, to demonstrate the main characteristics of the Chinese stylistic restoration, including its emphasis on style over authenticity, pursuit of a gestalt form, and flexible attitude toward reconstruction. Accordingly, these practices have shaped the current Chinese conservation theory as reflected in the case studies reported in “Principles for the Conservation of Heritage Sites in China” and the Qufu Declaration.

  20. Toward an enhanced Arabic text classification using cosine similarity and Latent Semantic

    Directory of Open Access Journals (Sweden)

    Fawaz S. Al-Anzi

    2017-04-01

    Full Text Available Cosine similarity is one of the most popular distance measures in text classification problems. In this paper, we used this important measure to investigate the performance of Arabic language text classification. For textual features, vector space model (VSM) is generally used as a model to represent textual information as numerical vectors. However, Latent Semantic Indexing (LSI) is a better textual representation technique as it maintains semantic information between the words. Hence, we used the singular value decomposition (SVD) method to extract textual features based on LSI. In our experiments, we conducted comparison between some of the well-known classification methods such as Naïve Bayes, k-Nearest Neighbors, Neural Network, Random Forest, Support Vector Machine, and classification tree. We used a corpus that contains 4,000 documents of ten topics (400 documents for each topic). The corpus contains 2,127,197 words with about 139,168 unique words. The testing set contains 400 documents, 40 documents for each topic. As a weighting scheme, we used Term Frequency.Inverse Document Frequency (TF.IDF). This study reveals that the classification methods that use LSI features significantly outperform the TF.IDF-based methods. It also reveals that k-Nearest Neighbors (based on cosine measure) and support vector machine are the best performing classifiers.
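
    As a rough illustration of the TF.IDF weighting and cosine measure discussed in this abstract, the sketch below builds sparse TF.IDF vectors and compares them. The function names, the plain `log(N / df)` form of the inverse document frequency, and the English toy documents are assumptions; the LSI/SVD step that the study found superior is deliberately left out.

```python
import math
from collections import Counter


def tfidf_vectors(docs):
    """Map each tokenized document to a sparse {term: tf * idf} dictionary."""
    n = len(docs)
    df = Counter(t for doc in docs for t in set(doc))   # document frequency
    idf = {t: math.log(n / df[t]) for t in df}          # assumed idf variant
    return [{t: tf * idf[t] for t, tf in Counter(doc).items()} for doc in docs]


def cosine(u, v):
    """Cosine similarity between two sparse vectors stored as dictionaries."""
    dot = sum(w * v.get(t, 0.0) for t, w in u.items())
    norm_u = math.sqrt(sum(w * w for w in u.values()))
    norm_v = math.sqrt(sum(w * w for w in v.values()))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


# Hypothetical toy corpus of tokenized documents.
docs = [
    ["football", "match", "goal"],
    ["football", "league", "goal"],
    ["bank", "market", "stock"],
]
vecs = tfidf_vectors(docs)
sim_same_topic = cosine(vecs[0], vecs[1])
sim_other_topic = cosine(vecs[0], vecs[2])
```

    A cosine-based k-Nearest Neighbors classifier would assign a test document the majority label among the training documents with the highest similarity to it.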

  1. LEXICO-STYLISTIC CHOICES AND MEDIA IDEOLOGY IN NEWSPAPER REPORTS ON NIGER DELTA CONFLICTS

    Directory of Open Access Journals (Sweden)

    Chuka Fred Ononye

    2017-05-01

    Full Text Available Media reports on Niger Delta (henceforth, ND) conflicts have reflected a relationship between lexico-stylistic choices and media ideologies. The existing media studies on the discourse have predominantly utilised pragmatic, stylistic and discourse analytical tools in presenting and labelling discourse participants and/or their ideologies, but neglected how media ideologies can be revealed through lexico-stylistic choices made in the reports. This paper therefore examines the lexico-stylistic choices in the reports in order to establish their link to specific ideological goals of the newspapers in relaying the conflict news. Forty reports on ND conflicts, published between 2003 and 2007, sampled from two ND-based (The Tide and Pioneer) and two national (The Punch and THISDAY) newspapers, were subjected to stylistic and critical analyses, with insights from structural (relational) semantics and aspects of stylistics discourse. Two broad lexical stylistic choices are identified: paradigmatic features (61.8%), indexed by synonymous, antonymous, hyponymous, colloquial, and register items, and coinages; and syntagmatic features (38.2%), marked by collocations, metaphors, pleonasms, and lexical fields. The features are utilised for three ideological ends; namely, picking out and framing participants as perpetrators of the violence in the discourse, evaluating specific entities and their roles in the conflicts, and reducing the impact of the activities of the news actors. Although there are overlaps, the evaluative ideology is largely associated with the national newspapers, the impact reduction ideology with the ND-based newspapers, while the framist ideology is observed in the two sets of newspapers. With these findings the study has added the lexical stylistics angle to the existing scholarship on ND conflict news discourse. Thus, the newspaper reports on ND conflicts are motivated by their ideological goals to change the reader’s outlook on

  2. CLASSIFICATION OF TRAFFIC RELATED SHORT TEXTS TO ANALYSE ROAD PROBLEMS IN URBAN AREAS

    Directory of Open Access Journals (Sweden)

    A. M. M. Saldana-Perez

    2017-09-01

    Full Text Available The Volunteer Geographic Information (VGI) can be used to understand the urban dynamics. In the classification of traffic related short texts to analyze road problems in urban areas, a VGI data analysis is done over a social media’s publications, in order to classify traffic events at big cities that modify the movement of vehicles and people through the roads, such as car accidents, traffic and closures. The classification of traffic events described in short texts is done by applying a supervised machine learning algorithm. In the approach users are considered as sensors which describe their surroundings and provide their geographic position at the social network. The posts are treated by a text mining process and classified into five groups. Finally, the classified events are grouped in a data corpus and geo-visualized in the study area, to detect the places with more vehicular problems.

  3. Applying Active Learning to Assertion Classification of Concepts in Clinical Text

    Science.gov (United States)

    Chen, Yukun; Mani, Subramani; Xu, Hua

    2012-01-01

    Supervised machine learning methods for clinical natural language processing (NLP) research require a large number of annotated samples, which are very expensive to build because of the involvement of physicians. Active learning, an approach that actively samples from a large pool, provides an alternative solution. Its major goal in classification is to reduce the annotation effort while maintaining the quality of the predictive model. However, few studies have investigated its uses in clinical NLP. This paper reports an application of active learning to a clinical text classification task: to determine the assertion status of clinical concepts. The annotated corpus for the assertion classification task in the 2010 i2b2/VA Clinical NLP Challenge was used in this study. We implemented several existing and newly developed active learning algorithms and assessed their uses. The outcome is reported in the global ALC score, based on the Area under the average Learning Curve of the AUC (Area Under the Curve) score. Results showed that when the same number of annotated samples was used, active learning strategies could generate better classification models (best ALC – 0.7715) than the passive learning method (random sampling) (ALC – 0.7411). Moreover, to achieve the same classification performance, active learning strategies required fewer samples than the random sampling method. For example, to achieve an AUC of 0.79, the random sampling method used 32 samples, while our best active learning algorithm required only 12 samples, a reduction of 62.5% in manual annotation effort. PMID:22127105
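
    The selection step at the heart of active learning, and the annotation saving quoted above, can be illustrated with a small sketch. Uncertainty sampling is used here only as a generic, widely known query strategy; the abstract does not say which strategies the study implemented, and the function names and probability values are hypothetical.

```python
def most_uncertain(probabilities, k=1):
    """Indices of the k pool items whose predicted positive-class
    probability is closest to 0.5 (binary uncertainty sampling)."""
    ranked = sorted(
        range(len(probabilities)),
        key=lambda i: (abs(probabilities[i] - 0.5), i),
    )
    return ranked[:k]


def annotation_reduction(passive_samples, active_samples):
    """Fraction of annotation effort saved relative to random sampling."""
    return (passive_samples - active_samples) / passive_samples


# Hypothetical model outputs for four unlabeled pool items.
to_annotate = most_uncertain([0.9, 0.55, 0.1, 0.48], k=2)

# Figures from the abstract: 32 samples (random) versus 12 (active learning).
saving = annotation_reduction(32, 12)
```

    In a full loop, the selected items would be labeled by an annotator, added to the training set, and the model retrained before the next query round.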

  4. Natural Language Processing Based Instrument for Classification of Free Text Medical Records

    Directory of Open Access Journals (Sweden)

    Manana Khachidze

    2016-01-01

    Full Text Available According to the Ministry of Labor, Health and Social Affairs of Georgia a new health management system has to be introduced in the nearest future. In this context arises the problem of structuring and classifying documents containing all the history of medical services provided. The present work introduces the instrument for classification of medical records based on the Georgian language. It is the first attempt of such classification of the Georgian language based medical records. On the whole 24,855 examination records have been studied. The documents were classified into three main groups (ultrasonography, endoscopy, and X-ray) and 13 subgroups using two well-known methods: Support Vector Machine (SVM) and K-Nearest Neighbor (KNN). The results obtained demonstrated that both machine learning methods performed successfully, with a little supremacy of SVM. In the process of classification a “shrink” method, based on features selection, was introduced and applied. At the first stage of classification the results of the “shrink” case were better; however, on the second stage of classification into subclasses 23% of all documents could not be linked to only one definite individual subclass (liver or binary system) due to common features characterizing these subclasses. The overall results of the study were successful.

  5. Approach for Text Classification Based on the Similarity Measurement between Normal Cloud Models

    Directory of Open Access Journals (Sweden)

    Jin Dai

    2014-01-01

    Full Text Available The similarity between objects is a core research area of data mining. In order to reduce the interference caused by the uncertainty of natural language, a similarity measurement between normal cloud models is applied to text classification research. On this basis, a novel text classifier based on cloud concept jumping up (CCJU-TC) is proposed. It can efficiently accomplish conversion between qualitative concept and quantitative data. Through the conversion from text set to text information table based on the VSM model, the text qualitative concept, which is extracted from the same category, is jumped up as a whole category concept. According to the cloud similarity between the test text and each category concept, the test text is assigned to the most similar category. A comparison among different text classifiers on different feature selection sets shows not only that CCJU-TC adapts well to different text features, but also that its classification performance is better than that of the traditional classifiers.

  6. Classification of Traffic Related Short Texts to Analyse Road Problems in Urban Areas

    Science.gov (United States)

    Saldana-Perez, A. M. M.; Moreno-Ibarra, M.; Tores-Ruiz, M.

    2017-09-01

    The Volunteer Geographic Information (VGI) can be used to understand the urban dynamics. In the classification of traffic related short texts to analyze road problems in urban areas, a VGI data analysis is done over a social media's publications, in order to classify traffic events at big cities that modify the movement of vehicles and people through the roads, such as car accidents, traffic and closures. The classification of traffic events described in short texts is done by applying a supervised machine learning algorithm. In the approach users are considered as sensors which describe their surroundings and provide their geographic position at the social network. The posts are treated by a text mining process and classified into five groups. Finally, the classified events are grouped in a data corpus and geo-visualized in the study area, to detect the places with more vehicular problems.

  7. Comparisons and Selections of Features and Classifiers for Short Text Classification

    Science.gov (United States)

    Wang, Ye; Zhou, Zhi; Jin, Shan; Liu, Debin; Lu, Mi

    2017-10-01

    Short text is considerably different from traditional long text documents due to its shortness and conciseness, which somehow hinders the applications of conventional machine learning and data mining algorithms in short text classification. According to traditional artificial intelligence methods, we divide short text classification into three steps, namely preprocessing, feature selection and classifier comparison. In this paper, we have illustrated step-by-step how we approach our goals. Specifically, in feature selection, we compared the performance and robustness of the four methods of one-hot encoding, tf-idf weighting, word2vec and paragraph2vec, and in the classification part, we deliberately chose and compared Naive Bayes, Logistic Regression, Support Vector Machine, K-nearest Neighbor and Decision Tree as our classifiers. Then, we compared and analysed the classifiers horizontally with each other and vertically with feature selections. Regarding the datasets, we crawled more than 400,000 short text files from Shanghai and Shenzhen Stock Exchanges and manually labeled them into two classes, the big and the small. There are eight labels in the big class, and 59 labels in the small class.

  8. Short text sentiment classification based on feature extension and ensemble classifier

    Science.gov (United States)

    Liu, Yang; Zhu, Xie

    2018-05-01

    With the rapid development of Internet social media, mining the emotional tendencies of short texts on the Internet to acquire useful information has attracted the attention of researchers. At present, the commonly used methods can be divided into rule-based classification and statistical machine learning classification. Although micro-blog sentiment analysis has made good progress, some shortcomings remain, such as insufficient accuracy and a strong dependence on the sentiment classification effect. Aiming at the characteristics of Chinese short texts, such as less information, sparse features, and diverse expressions, this paper considers expanding the original text by mining related semantic information from the reviews, forwarding and other related information. First, this paper uses Word2vec to compute word similarity to extend the feature words. It then uses an ensemble classifier composed of SVM, KNN and HMM to analyze the emotion of micro-blog short texts. The experimental results show that the proposed method can make good use of the comment forwarding information to extend the original features. Compared with the traditional method, the accuracy, recall and F1 value obtained by this method have been improved.
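
    One simple way to combine the outputs of several classifiers, such as the SVM, KNN and HMM members mentioned above, is majority voting. The abstract does not state how the ensemble combines its members, so the voting rule, the tie-breaking behaviour, and the labels below are assumptions for illustration only.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label predicted by most classifiers.

    Ties are broken in favour of the label that appears first in
    `predictions`, i.e. the earliest-listed classifier wins (assumed rule).
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


# Hypothetical outputs of three classifiers for one micro-blog post.
clear_case = majority_vote(["pos", "neg", "pos"])
tie_case = majority_vote(["pos", "neg", "neu"])
```

    Weighted voting or stacking are alternative combination rules; which one suits a given ensemble depends on how reliable each member is.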

  9. Style and creativity: towards a theory of creative stylistics

    OpenAIRE

    Yoshifumi, Saitō

    1997-01-01

    The purpose of this thesis is to present a new theory of creative stylistics as an antithesis to traditional description-oriented stylistics. For this purpose it undertakes: (1) a selective historical survey of stylistics with special attention to its academic formation in the context of the theoretical dissociation between linguistics and literary criticism (Chapter 1), (2) a theoretical survey of stylistics with special attention to the way it has been defined and subcategorized (Cha...

  10. Evaluation and Classification of Syntax Usage in Determining Short-Text Semantic Similarity

    Directory of Open Access Journals (Sweden)

    V. Batanović

    2014-06-01

    Full Text Available This paper outlines and categorizes ways of using syntactic information in a number of algorithms for determining the semantic similarity of short texts. We consider the use of word order information, part-of-speech tagging, parsing and semantic role labeling. We analyze and evaluate the effects of syntax usage on algorithm performance by utilizing the results of a paraphrase detection test on the Microsoft Research Paraphrase Corpus. We also propose a new classification of algorithms based on their applicability to languages with scarce natural language processing tools.

  11. Stylistics and the Metaphysics of Poetry

    Science.gov (United States)

    Anderson, Neil

    2007-01-01

    In order to better understand the worth of aesthetic experience in encountering poetry, fresh perspectives are helpful. This paper introduces the reader to modern stylistics: that is linguistic examinations of "the speaker's meaning" in literature and notes such "scientific" approaches to poetry do find common metaphysical ground with leading…

  12. A Stylistic Analysis of Complexity in William Faulkner's "A Rose for Emily"

    Science.gov (United States)

    Abdurrahman, Israa' Burhanuddin

    2016-01-01

    Applying a stylistic analysis on certain texts refers to the identification of patterns of usage in writing. However, such an analysis is not restricted just to the description of the formal characteristics of texts, but it also tries to elucidate their functional importance for the interpretation of the text. This paper highlights complexity as a…

  13. Stylistic Analysis of Roald Dahl’s Cinderella

    Directory of Open Access Journals (Sweden)

    Henni Henni

    2010-01-01

    Full Text Available The paper presents a stylistic analysis of a rhyme, Cinderella, taken from Dahl’s rhyme collection, Revolting Rhymes. Roald Dahl is famous for his ability to create extraordinary stories in which linguistic elements, such as sounds and words, are manipulated to produce an amusing story with an unpredictable plot. The discussion covers an analysis of the narrative structure and the linguistic style applied in the rhyme, together with a discussion of the author’s purpose in applying such a style. The analysis finds that the style Dahl applies in the rhyme is especially useful for foregrounding.

  14. Active learning for clinical text classification: is it better than random sampling?

    Science.gov (United States)

    Figueroa, Rosa L; Ngo, Long H; Goryachev, Sergey; Wiechmann, Eduardo P

    2012-01-01

    Objective: This study explores active learning algorithms as a way to reduce the requirements for large training sets in medical text classification tasks. Design: Three existing active learning algorithms (distance-based (DIST), diversity-based (DIV), and a combination of both (CMB)) were used to classify text from five datasets. The performance of these algorithms was compared to that of passive learning on the five datasets. We then conducted a novel investigation of the interaction between dataset characteristics and the performance results. Measurements: Classification accuracy and area under receiver operating characteristics (ROC) curves for each algorithm at different sample sizes were generated. The performance of active learning algorithms was compared with that of passive learning using a weighted mean of paired differences. To determine why the performance varies on different datasets, we measured the diversity and uncertainty of each dataset using relative entropy and correlated the results with the performance differences. Results: The DIST and CMB algorithms performed better than passive learning. With a statistical significance level set at 0.05, DIST outperformed passive learning in all five datasets, while CMB was found to be better than passive learning in four datasets. We found strong correlations between the dataset diversity and the DIV performance, as well as the dataset uncertainty and the performance of the DIST algorithm. Conclusion: For medical text classification, appropriate active learning algorithms can yield performance comparable to that of passive learning with considerably smaller training sets. In particular, our results suggest that DIV performs better on data with higher diversity and DIST on data with lower uncertainty. PMID:22707743
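    The abstract above reports measuring dataset diversity and uncertainty with relative entropy. As a minimal sketch only (not the authors' code), relative entropy (Kullback-Leibler divergence) between two discrete distributions could be computed as follows; the function name `relative_entropy` and the toy distributions are assumptions made for illustration.

```python
import math


def relative_entropy(p, q):
    """Return D(p || q) in bits for two discrete distributions.

    p and q are sequences of probabilities over the same outcomes,
    in the same order. Illustrative helper, not from the cited study.
    """
    if len(p) != len(q):
        raise ValueError("p and q must cover the same outcomes")
    total = 0.0
    for p_i, q_i in zip(p, q):
        if p_i == 0:
            continue  # a zero-probability outcome contributes nothing
        if q_i == 0:
            return float("inf")  # p puts mass where q has none
        total += p_i * math.log2(p_i / q_i)
    return total


# Toy check: a certain outcome versus a fair coin differs by one bit.
print(relative_entropy([1.0, 0.0], [0.5, 0.5]))
```

    Relative entropy is not symmetric, so the order of the two arguments matters when comparing, for instance, a sample against a reference distribution.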

  15. Parcellation as a stylistic dominant characteristic in the novel Nigdina by Svetlana Velmar Janković

    Directory of Open Access Journals (Sweden)

    Mimović Milica P.

    2015-01-01

    Full Text Available The subject of this research is parcellation as a stylistic dominant characteristic in the novel 'Nigdina' by Svetlana Velmar Janković, and also as the means of expression which is superior in regard to the other linguistic procedures. Considering the aim of syntactic-stylistic analysis in this paper, the examples were divided according to their syntactic functions identifying and interpreting syntactic structures that change word order, and simultaneously point out stylistic marking of the parcellated structure. The most frequent structures are the ones with parcellated positions of adjuncts. Adverbs and constructions with prepositions and case are comparably found in these syntactic positions. Moreover, the most frequent are adjuncts for the parcellation of syntactic position, then intonational and positional emphasis of apposition, and finally the parcellation of elements such as object, subject and an attributive. In conclusion, this procedure of making syntactic units independent, their frequency and superiority to the other stylistic procedures, contribute to the style of this novel, and point out parcellation as a dominant stylistic characteristic of this novel.

  16. The ‘indisciplinarity’ of stylistics

    OpenAIRE

    Sorlin Sandrine

    2014-01-01

    This paper aims at showing why the stylistician can be construed as a prolific “impostor” in a most positive sense: pledged to no specific linguistic prophet, she can opt for different theoretical linguistic tools (in the sphere of pragmatics, critical discourse analysis, cognitive grammar, etc.) depending on her object of study and what her research question is. The liberty claimed by the stylistician explains why stylistics is the “undisciplined” child of linguistics, shirking any clear def...

  17. Stylistics of “Tarikh i Balami”

    Directory of Open Access Journals (Sweden)

    Ali Mohammadi asiyabadi

    2016-05-01

    Full Text Available "Tarikh i Balami", written by Abu Ali Mohammad Ibn Abdollah Balami, a minister of the Samanis in the fourth century, is one of the oldest books of the Islamic period; it covers the beginning of creation up to the Arab raid, the Prophet's life and the kings. Amir Mansur ibn Noah Samani ordered his minister, Balami, to translate Tabari's book “Tarikh al-Omam va al-Muluk”. On several occasions, however, Balami summarized the original book in his translation, drew on other sources and omitted some material. A comparison of eleven versions of the book shows many differences among them; no two are alike. The replacement of older words with new ones, and of Arabic words with Persian words of the same meaning, is one of the problems that copyists have created for the stylistics of this book. "Tarikh i Balami" belongs to the first period of Persian prose in Iran. During this period, which spans the Samani and Tahiri eras, authors wrote in a simple style. This style is also called the Khurasani style, because the authors lived in Khorasan, and the Balami style, after the author of the book, who was one of the most prominent writers of this genre. A comparison between this book and other books of the period, such as mughaddamih Shahnameh abu Mansuri, the translation of Tafsir i Tabari, Hudud al-alam min al-Mashriq ela al-Maqrib, Tafsir i Pak and…, shows that its most important stylistic features at the level of language are significant in several areas. First, the author tried to use Persian words, although shorter and more familiar Arabic words are sometimes substituted. Second, some verbs are used in specific ways; for example, the old prefixes “فرا”, “فرو”, “باز”,… are used with verbs.

  18. Analysis of Influence of Different Relations Types on the Quality of Thesaurus Application to Text Classification Problems

    Directory of Open Access Journals (Sweden)

    Nadezhda S. Lagutina

    2017-01-01

    Full Text Available The main purpose of the article is to analyze how effectively different types of thesaurus relations can be used to solve text classification tasks. The basis of the study is an automatically generated thesaurus of a subject area that contains three types of relations: synonymous, hierarchical and associative. To generate the thesaurus, the authors use a hybrid method based on several linguistic and statistical algorithms for the extraction of semantic relations. The method makes it possible to create a thesaurus with a sufficiently large number of terms and relations among them. The authors consider two problems: topical text classification and sentiment classification of large newspaper articles. To solve them, the authors developed two approaches that complement standard algorithms with a procedure that takes thesaurus relations into account to determine semantic features of texts. The approach to topical classification includes the standard unsupervised BM25 algorithm and a procedure that takes into account the synonymous and hierarchical relations of the subject-area thesaurus. The approach to sentiment classification consists of two steps. In the first step, a thesaurus is created whose term polarity weights are calculated from the term occurrences in the training set or from the weights of related thesaurus terms. In the second step, the thesaurus is used to compute the features of words from texts and to classify texts with the SVM or Naive Bayes algorithm. In experiments with the text corpora BBCSport, Reuters, PubMed and a corpus of articles about American immigrants, the authors varied the types of thesaurus relations involved in the classification and the degree of their use. The results of the experiments make it possible to evaluate the efficiency of applying thesaurus relations to the classification of raw texts and to determine under what conditions certain relations have a greater or lesser effect. In particular, the

  19. A Feature Selection Method Based on Fisher's Discriminant Ratio for Text Sentiment Classification

    Science.gov (United States)

    Wang, Suge; Li, Deyu; Wei, Yingjie; Li, Hongxia

    With the rapid growth of e-commerce, product reviews on the Web have become an important information source for customers deciding whether to buy a product. Because there are often too many reviews for customers to read, automatically classifying them into sentiment orientation categories (i.e. positive/negative) has become a research problem. In this paper, an effective feature selection method based on Fisher's discriminant ratio is proposed for sentiment classification of product review text. To validate the proposed method, we compared it with methods based on information gain and mutual information, with a support vector machine as the classifier. Six subexperiments are conducted by combining different feature selection methods with two kinds of candidate feature sets. On 1006 car review documents, the experimental results indicate that Fisher's discriminant ratio based on word frequency estimation performs best, with an F value of 83.3%, when the candidate features are the words that appear in both positive and negative texts.
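    The feature selection idea in the abstract above can be illustrated with the textbook two-class form of Fisher's discriminant ratio: the squared difference of the class means divided by the sum of the class variances. The sketch below assumes that form; the paper's own estimators (for example, its word-frequency-based variant) are not reproduced here, and the function names, the use of population variance and the toy data are all hypothetical.

```python
from statistics import mean, pvariance


def fisher_ratio(pos_values, neg_values):
    """Two-class Fisher's discriminant ratio of a single feature.

    Squared difference of the class means over the sum of the class
    variances (population variance is an illustrative choice).
    """
    m_pos, m_neg = mean(pos_values), mean(neg_values)
    spread = pvariance(pos_values) + pvariance(neg_values)
    if spread == 0:
        # No within-class spread: perfectly separating or useless.
        return float("inf") if m_pos != m_neg else 0.0
    return (m_pos - m_neg) ** 2 / spread


def rank_features(feature_values):
    """Return feature names sorted by descending Fisher ratio.

    feature_values maps a (hypothetical) term to a pair:
    (values in positive reviews, values in negative reviews).
    """
    scores = {term: fisher_ratio(pos, neg)
              for term, (pos, neg) in feature_values.items()}
    return sorted(scores, key=scores.get, reverse=True)


# Toy per-document term counts, invented for illustration.
toy = {
    "reliable": ([3.0, 5.0], [0.0, 2.0]),  # means differ: ratio 4.5
    "engine": ([2.0, 4.0], [2.0, 4.0]),    # identical classes: ratio 0
}
print(rank_features(toy))
```

    A term with a high ratio separates the two sentiment classes well relative to its within-class spread, so the top-ranked terms would be kept as classifier features.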

  20. Stylistic Variation In Three English Translations Of The Dead Sea ...

    African Journals Online (AJOL)

    Since the discovery of the Dead Sea Scrolls in 1947, different English translations have been published. In this article the stylistic variation of three of these translations is analysed. It is suggested that the issue of stylistic variation boils down to linguistically inscribed preference in the choice and construction of discourses in the ...

  1. Chiasmus as a Stylistic Device in Donne's and Vaughan's Poetry

    Science.gov (United States)

    I'jam, Dunya Muhammad Miqdad; Fadhil, Zahraa Adnan

    2016-01-01

    This study investigates chiasmus as a stylistic device in ten metaphysical poems (five for John Donne and five for Henry Vaughan). It aims at showing how both, Donne and Vaughan, utilize chiasmus at the different linguistic levels as a stylistic device in their poetry. Thus, to achieve this aim, it is hypothesized that chiasmus as used by Donne…

  2. METAPHOR AS A STYLISTIC DEVICE OF ISLAMIC TEACHING

    Directory of Open Access Journals (Sweden)

    Jumino Suhadi

    2011-06-01

    Full Text Available Metaphor as a Stylistic Device of Islamic Teaching. This article discusses the various types of metaphor found in the holy verses of the Qur’an and in the Hadith, based on a framework of modern literary and linguistic theory. The primary data sources in this study are the holy verses of the Qur’an as translated into English by Abdullah Yusuf Ali and several Hadith texts (matan) from the Hadith collection compiled by Habib Muhammad al-Haddar. The aim of the analysis is to present strong evidence that metaphor is a stylistic device used widely in the Qur’an and the Hadith to convey the teachings of Islam. The results of the study show that all types of metaphor in the broad sense are found in various holy verses of the Qur’an and in the Hadith. According to the author, this argument is very convincing evidence that metaphor is one of the stylistic devices for conveying the teachings of Islam.

  3. The Stylistics Analysis of Internet Language

    Institute of Scientific and Technical Information of China (English)

    ZHOU Huan-huan

    2015-01-01

    Internet language is a product of modern technology, especially of the advancement of information technology. It is a social and linguistic phenomenon with its own stylistic and rhetorical patterns compared with other languages. The reasons for the emergence of Internet language can be summarised into three kinds: firstly, netizens need distinct languages to show their personalities and to enhance the emotional impact of language, such as sadness, anger and happiness; secondly, the keyboard makes it hard to type whole sentences when people are chatting online or engaged in other activities; lastly, it is fast and convenient, especially when some online activities are time-consuming. Internet language makes the most of the functions of linguistic deviation and satisfies the psychological and practical needs of netizens.

  4. A Stylistic Analysis of D.H. Lawrence’s ‘Sons and Lovers’

    Directory of Open Access Journals (Sweden)

    Nozar Niazi

    2013-05-01

    Full Text Available This paper aims at analyzing D.H. Lawrence’s ‘Sons and Lovers’ using a stylistic approach. Stylistics is a study of the amalgamation of form with content. The stylistic analysis of a novel goes beyond the traditional, intuitive interpretation, because it combines intuition and detailed linguistic analysis of the text. The defining elements of modern language are within the text itself, not prescribed from outside. With modernist texts, usually understanding comes from close study of the language system defined within the text itself. Form, technique and style are considered not as a mere vehicle of the content of the story, but an integral part of the work’s meaning and value. In our analysis of ‘Sons and Lovers’ the resources of language: lexis, syntax, phonology, figurative language, cohesion and coherence, are discussed in relation to the style of discourse in order to explore hidden meanings in the text. The resources of language are shown to be an essential part of the meaning of the novel.

  5. A (FORENSIC) STYLISTIC ANALYSIS OF ADVERBIALS OF ATTITUDE AND EMPHASIS IN SUPREME COURT DECISIONS IN PHILIPPINE ENGLISH

    Directory of Open Access Journals (Sweden)

    Hjalmar Punla Hernandez

    2017-09-01

    Full Text Available Stylistics today has developed into a multiplicity of branches, one of which is forensic stylistics. As a powerful form of legal written discourse, Supreme Court decisions are a rich corpus in which the linguistic vis-a-vis stylistic choices of Court justices can be examined. This study is a humble attempt at stylistically analyzing Supreme Court decisions in Philippine English (PhE) drafted by two Filipino justices. Specifically, it sought to investigate the classes, placements, and environments of adverbials of attitude and emphasis employed by the two justices, and drew their implications for teaching and learning English for Legal Purposes (ELP). Using the frameworks of McMenamin (2012), Quirk, Greenbaum, Leech, and Svartvik (1985), and Dita (2011), 54 randomly selected Supreme Court decisions were analyzed as primary sources of legal language. The results are as follows. Firstly, the classes of adverbials of attitude used by the two justices in Supreme Court decisions in PhE were evaluation of the subject of the clause, judgment of the whole clause, and evaluation of an action performed by the subject of the clause, while the adverbials of emphasis were adverbials of conviction and doubt. Secondly, both kinds of adverbials were frequently placed medially and less often initially in their sentences. Thirdly, the two justices put their adverbials within two principal environments, i.e. within the functor, and before/after the verb, among others. In these regards, legal and stylistic explanations of these recurrent linguistic features in the two justices’ Court decisions were revealed. Implications of the study for ELP are explained. Lastly, trajectories for future (forensic) stylistic analyses are recommended.

  6. A new approach to the classification of African oral texts | Kam ...

    African Journals Online (AJOL)

    All these reasons have led to a re-examination of the various oral genres in the African context and to a proposed division of these texts into five broad categories. Keywords: oral literature, oral genres, oral texts, discourse, utterances, joking games, researchers in oral literature. Tydskrif vir Letterkunde ...

  7. Beyond the Law of Transitivity: A Functional Stylistic Study of Maya Angelou's I Know Why the Caged Bird Sings

    Directory of Open Access Journals (Sweden)

    Muthanna Makki Muhammed

    2017-03-01

    Full Text Available The dominant critical focus on Maya Angelou’s writings has been on the thematic features of her texts. Linguistic and stylistic appraisals of her works are generally sparse. This paper is a stylistic study of Maya Angelou’s autobiographical novel I Know Why the Caged Bird Sings. It aims at examining the stylistic features of the text vis-à-vis the semantic Law of Transitivity so as to investigate the features that contribute to the discourse’s crossing from the sphere of informing into the sphere of interaction and influence. The paper starts with brief notes on stylistics in relation to semantics. This is followed by a discussion of the Law of Transitivity, with frequent references to John R. Searle’s patterns of metaphor. The varied forms of the relations between the signified or the source (the vehicle) and the signifier or the target (the tenor) in relation to the sign (the common ground) are discussed in the light of the figurative devices employed by the author and the functions achieved in revealing the ideological issues of race and gender in the book. The study also attempts to position the formal and psychological elements within a sociocultural context in order to promote the reader’s understanding of the purposes and functions for which certain linguistic choices are made.

  8. Shared Features of L2 Writing: Intergroup Homogeneity and Text Classification

    Science.gov (United States)

    Crossley, Scott A.; McNamara, Danielle S.

    2011-01-01

    This study investigates intergroup homogeneity within high intermediate and advanced L2 writers of English from Czech, Finnish, German, and Spanish first language backgrounds. A variety of linguistic features related to lexical sophistication, syntactic complexity, and cohesion were used to compare texts written by L1 speakers of English to L2…

  9. Tula song folklore: genre-stylistic and dialectic peculiarities

    Directory of Open Access Journals (Sweden)

    Krasovskaya Nelli Alexandrovna

    2016-06-01

    Full Text Available The article analyzes works of Tula folklore recorded in the western part of the Tula region in terms of their genre, stylistic and linguistic features. The study is relevant because Tula folk songs have not been studied and the linguistic features of the works have not been subjected to serious analysis. The article describes the genre features of songs recorded in the Belevsky district of the Tula region, including ancient fortunetelling chants, wedding ceremony songs, romantic ballads, etc., and cites numerous examples from the lyrics that reflect dialectal features at the phonetic, grammatical and lexical levels. According to the authors, the modern folk song genre retains its diversity and is a kind of storeroom containing priceless linguistic wealth. The analysis leads to conclusions about the presence and good preservation of South Russian dialect phonetic and grammatical features in the recorded music. So far, there is no established typology of Tula dialects; therefore, according to the authors, recording folklore in the territories bordering on Tula dialects is very important and interesting for further descriptive and comparative work on identifying the eastern and south-south-west differences in Tula dialects.

  10. Refutation of stylistic constructs in palaeolithic rock art

    International Nuclear Information System (INIS)

    Bednarik, R.G.

    1995-01-01

    This paper describes the first experiment of applying a series of dating methods at a single rock art site in a ''blind test''. The rock art in question, in northeastern Portugal, had been unanimously attributed to the Upper Palaeolithic by stylistic comparison. Four independent assessments have produced the identical result that the rock art is in fact of the second half of the Holocene, and mostly under 3,000 years old. This finding is compared with other recent dating results which together show that stylistic dating is not an admissible method of determining the age of Palaeolithic art. (author). 17 refs., 1 fig., 1 photo

  11. "If You Have to Ask, You'll Never Know": Effects of Specialised Stylistic Expertise on Predictive Processing of Music.

    Directory of Open Access Journals (Sweden)

    Niels Chr Hansen

    Full Text Available Musical expertise entails meticulous stylistic specialisation and enculturation. Even so, research on musical training effects has focused on generalised comparisons between musicians and non-musicians, and cross-cultural work addressing specialised expertise has traded cultural specificity and sensitivity for other methodological limitations. This study aimed to experimentally dissociate the effects of specialised stylistic training and general musical expertise on the perception of melodies. Non-musicians and professional musicians specialising in classical music or jazz listened to sampled renditions of saxophone solos improvised by Charlie Parker in the bebop style. Ratings of explicit uncertainty and expectedness for different continuations of each melodic excerpt were collected. An information-theoretic model of expectation enabled selection of stimuli affording highly certain continuations in the bebop style, but highly uncertain continuations in the context of general tonal expectations, and vice versa. The results showed that expert musicians have acquired probabilistic characteristics of music influencing their experience of expectedness and predictive uncertainty. While classical musicians had internalised key aspects of the bebop style implicitly, only jazz musicians' explicit uncertainty ratings reflected the computational estimates, and jazz-specific expertise modulated the relationship between explicit and inferred uncertainty data. In spite of this, there was no evidence that non-musicians and classical musicians used a stylistically irrelevant cognitive model of general tonal music providing support for the theory of cognitive firewalls between stylistic models in predictive processing of music.
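    The abstract above refers to an information-theoretic model of expectation without giving its details. In such models, predictive uncertainty is commonly quantified as the Shannon entropy of the distribution over possible continuations, and the unexpectedness of a continuation as its information content; whether the study used exactly these quantities is an assumption here. The sketch below illustrates both measures, with hypothetical function names and an invented toy distribution over next notes.

```python
import math


def entropy_bits(distribution):
    """Shannon entropy (bits) of a mapping from event to probability.

    Higher values mean a less certain prediction. Illustrative only.
    """
    return -sum(p * math.log2(p) for p in distribution.values() if p > 0)


def information_content_bits(distribution, event):
    """Unexpectedness (bits) of one event: -log2 of its probability."""
    return -math.log2(distribution[event])


# Invented probabilities for the next note after a melodic context.
next_note = {"C": 0.5, "E": 0.25, "G": 0.25}
print(entropy_bits(next_note))                   # uncertainty of the context
print(information_content_bits(next_note, "E"))  # unexpectedness of "E"
```

    Under this reading, a "highly certain" context is one whose continuation distribution has low entropy, and the same context can be certain under one stylistic model and uncertain under another.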

  12. Stylistic Analysis of Robert Browning's Poem "Patriot into Traitor"

    Science.gov (United States)

    Ahmed, Mumtaz; Irshad, Ayesha

    2015-01-01

    The stylistic analysis of Robert Browning's poem "Patriot into Traitor" is done by using graphological, phonological, morphological and lexico-syntactic patterns. This analysis is helpful in decoding the underlying meanings of the poem. It clearly brings to surface what the poet really wants to impart.

  13. Stylistic Devices in Ben Okri's The Famished Road | Ikechi ...

    African Journals Online (AJOL)

    This paper discusses stylistic devices in Ben Okri's The Famished Road. In the presentation of his story, the novelist makes use of literary devices which enrich readers' understanding and enjoyment of his subject matter. Satire, register, cliché, pidgin and proverbs are some of the devices. Others include: figurative language ...

  14. A STYLISTIC ANALYSIS OF THE LANGUAGE OF POLITICAL ...

    African Journals Online (AJOL)

    user

    job. This study is a synchronic stylistic analysis of the various political utterances used during the electioneering process in ... as “The variety of language according to use” in situations such as ..... “Bruharity” is adapted from the English word.

  15. Preparing Stylistically Challenging Contemporary Classical Repertoire for Performance: Interpreting "Kumari"

    Science.gov (United States)

    Viney, Liam; Blom, Diana

    2015-01-01

    Research involving the learning processes of musicians seldom examines specific pieces of music, and limited attention has been devoted to the earliest stages of learning a stylistically challenging or new piece of 20th-/21st-century art music. This article describes the processes by which two pianists (the authors) learned Ross Edwards's…

  16. Comparison of Aminpour’s Qhazal and Qhazve‘s Qhazal Based on Structural Stylistics

    Directory of Open Access Journals (Sweden)

    Ahmad Rezae

    2014-12-01

    Full Text Available The importance of stylistics in examining texts has resulted in the burgeoning of various stylistic schools with differing methodologies. Among them, structural stylistics, which is the more polished version of formal stylistics, looks over the structure of words, sounds and syntax in the text and then focuses on their relation to the content. In fact, structural stylistics, drawing upon the structuralists' views, is formed on the basis of structural linguistics, and works to analyze a literary text with regard to its organic unity and the inter-relationship of the parts to the whole. In other words, the main purpose is to approach the content of a work through its form and structure. In this method, stylistic features of the work are recognized through understanding the structural proportions between sounds, words and syntax. Accordingly, the structuralists regard style as the manner of deviation and extra-regularity and the frequency of its occurrence in a particular era. The present article first refers to the definitions of style and stylistic schools and enumerates the features of structural stylistics, and then sets out to study and compare two ghazals on the subject of the Holy Defense, by Gheisar Aminpour ("Taghvimha" or "Calendars") and Alireza Ghazve ("Ghesmat" or "Destiny"), in the light of structural stylistics. Through analyzing the different parts of the texts, with regard to balances and deviation, we will deal with the relatedness and proportion of these parts to the content. "Calendars" is among the best-known ghazals of Aminpour. It contains the issues of feeling ashamed of martyrs, lamenting over our negligence, and feeling left away from the martyrs. A special sense of grief and sadness, hidden in the particular rhythm and cadence of the words and combinations, helps the poet to express his feelings and thoughts. The ghazal "Destiny" deals also with the distress and exhaustion the poet feels in this

  17. Classification

    Science.gov (United States)

    Clary, Renee; Wandersee, James

    2013-01-01

    In this article, Renee Clary and James Wandersee describe the beginnings of "Classification," which lies at the very heart of science and depends upon pattern recognition. Clary and Wandersee approach patterns by first telling the story of the "Linnaean classification system," introduced by Carl Linnaeus (1707-1778), who is…

  18. Clipped Wings and the Great Abyss: Cognitive Stylistics and Implicatures in Abiezer Coppe’s ‘Prophetic’ Recantation

    OpenAIRE

    Borgogni Daniele

    2017-01-01

    In this article, two major paradigms within cognitive stylistics, the Conceptual Metaphor Theory (CMT) and the Conceptual Integration Theory (CIT), are applied as largely complementary approaches to discuss the scope and implicatures of the central metaphorical image of Coppe’s Return to the wayes of Truth (1651), a text written by one of the most famous radical preachers of the Civil War period as a plea to be released from prison. The article will focus on how the linguistic and ...

  19. The primitives of Santa Clara of Ubeda: stylistic and iconographic study, critical appraisals and vicissitudes of a dispersed heritage

    Directory of Open Access Journals (Sweden)

    Clara Beltrán Catalán

    2016-12-01

    Full Text Available The authors study the stylistic and iconographic aspects of a collection of paintings on wood dating from the 15th and 16th centuries, originally in the Royal Monastery of Santa Clara at Úbeda. This collection was sold in the 1920s with the participation of the antique dealer Celestino Dupont. The research is complemented by an analysis of the critical appraisals given to these works and their history since their introduction into the art market.

  20. Assessing Unmet Information Needs of Breast Cancer Survivors: Exploratory Study of Online Health Forums Using Text Classification and Retrieval.

    Science.gov (United States)

    McRoy, Susan; Rastegar-Mojarad, Majid; Wang, Yanshan; Ruddy, Kathryn J; Haddad, Tufia C; Liu, Hongfang

    2018-05-15

    Patient education materials given to breast cancer survivors may not be a good fit for their information needs. Needs may change over time, be forgotten, or be misreported, for a variety of reasons. An automated content analysis of survivors' postings to online health forums can identify expressed information needs over a span of time and be repeated regularly at low cost. Identifying these unmet needs can guide improvements to existing education materials and the creation of new resources. The primary goals of this project are to assess the unmet information needs of breast cancer survivors from their own perspectives and to identify gaps between information needs and current education materials. This approach employs computational methods for content modeling and supervised text classification to data from online health forums to identify explicit and implicit requests for health-related information. Potential gaps between needs and education materials are identified using techniques from information retrieval. We provide a new taxonomy for the classification of sentences in online health forum data. 260 postings from two online health forums were selected, yielding 4179 sentences for coding. After annotation of data and training alternative one-versus-others classifiers, a random forest-based approach achieved F1 scores from 66% (Other, dataset2) to 90% (Medical, dataset1) on the primary information types. 136 expressions of need were used to generate queries to indexed education materials. Upon examination of the best two pages retrieved for each query, 12% (17/136) of queries were found to have relevant content by all coders, and 33% (45/136) were judged to have relevant content by at least one. Text from online health forums can be analyzed effectively using automated methods. 
Our analysis confirms that breast cancer survivors have many information needs that are not covered by the written documents they typically receive, as our results suggest that at most

  1. Diagnostic investigation and historical-stylistic evaluation of oil painting on metal board. Example of “Christ Crucified with two mourning angels”

    Directory of Open Access Journals (Sweden)

    Salvatore Lorusso

    2007-07-01

    Full Text Available The oil painting on metal board (40 x 30 cm under study was bought from the antiquarian French market and bears a very common representation that derives from one of Michelangelo’s designs: “Cristo Crocifisso con due angeli dolenti”. The present paper not only refers to a stylistic and historical-artistic assessment, but also alludes to knowledge in a general sense through diagnostic technique and preservation conditions. The results of the diagnostic study, together with the stylistic analysis, have confirmed that the painting is an ancient one that dates back to the first decades of the XVIIth century.

  2. Classification

    DEFF Research Database (Denmark)

    Hjørland, Birger

    2017-01-01

    This article presents and discusses definitions of the term “classification” and the related concepts “concept/conceptualization,” “categorization,” “ordering,” “taxonomy” and “typology.” It further presents and discusses theories of classification including the influences of Aristotle...... and Wittgenstein. It presents different views on forming classes, including logical division, numerical taxonomy, historical classification, hermeneutical and pragmatic/critical views. Finally, issues related to artificial versus natural classification and taxonomic monism versus taxonomic pluralism are briefly...

  3. Stylistics Analysis in Advertising Discourse: A Case of the Dangote Cement Advertisement in Bamenda- Cameroon

    Directory of Open Access Journals (Sweden)

    Seino Evangeline Agwa Fomukong

    2016-12-01

    Full Text Available Language is used for many purposes, and these determine how a writer or speaker chooses words, syntactic expressions and figurative language, because language has a very powerful effect on people, their actions and thoughts. This is seen in the use of language in various discourse types, including advertisements. The powerful influence language has on people makes encoders selective in their use of language, especially in advertising, because they have to persuade readers. Consequently, they make the language of advertisements positive and emphasize the superiority of their products. This study discusses the advertisement of Dangote Cement on billboards in Bamenda, North West Region, Cameroon, analysing what is communicated, how it is communicated and how it is interpreted. The analysis uses as tools the Textual Conceptual Functions given by Jeffries (2016), uncovering ideologies and social meanings expressed in the Dangote Cement advertisement through the following apparatus: prioritisation, implying and assumption, listing, naming and description. The study emphasizes structural analysis and the role of context in revealing the functions and underlying meanings of the text. It concludes that the advertisers use different stylistic devices that carry positivity, and a common ground that makes readers identify with the advertisements, urging them to go for Dangote Cement. Keywords: stylistics, language, context, advertisements, ideologies, Dangote

  4. TEXT CLASSIFICATION FOR AUTOMATIC DETECTION OF E-CIGARETTE USE AND USE FOR SMOKING CESSATION FROM TWITTER: A FEASIBILITY PILOT.

    Science.gov (United States)

    Aphinyanaphongs, Yin; Lulejian, Armine; Brown, Duncan Penfold; Bonneau, Richard; Krebs, Paul

    2016-01-01

    Rapid increases in e-cigarette use and potential exposure to harmful byproducts have shifted public health focus to e-cigarettes as a possible drug of abuse. Effective surveillance of use and prevalence would allow appropriate regulatory responses. An ideal surveillance system would collect usage data in real time, focus on populations of interest, include populations unable to take the survey, allow a breadth of questions to answer, and enable geo-location analysis. Social media streams may provide this ideal system. To realize this use case, a foundational question is whether we can detect e-cigarette use at all. This work reports two pilot tasks using text classification to identify automatically Tweets that indicate e-cigarette use and/or e-cigarette use for smoking cessation. We build and define both datasets and compare performance of 4 state of the art classifiers and a keyword search for each task. Our results demonstrate excellent classifier performance of up to 0.90 and 0.94 area under the curve in each category. These promising initial results form the foundation for further studies to realize the ideal surveillance solution.
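
    The "area under the curve" figures quoted above can be illustrated with a small rank-based calculation, assuming the ROC curve is meant. This is only a sketch: the function name `roc_auc` and the toy labels and scores are hypothetical and do not come from the study.

```python
def roc_auc(labels, scores):
    """Area under the ROC curve via pairwise comparison.

    Equals the probability that a randomly chosen positive example is
    scored higher than a randomly chosen negative one (ties count 0.5).
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one positive and one negative example")
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical classifier scores for six tweets (1 = indicates e-cigarette use).
toy_labels = [1, 1, 0, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.3, 0.6, 0.2]
print(roc_auc(toy_labels, toy_scores))
```

    A keyword search can be scored the same way by treating a match as score 1 and a non-match as score 0, which is one plausible way to compare it with the classifiers on a common scale.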

  5. Negation handling in sentiment classification using rule-based adapted from Indonesian language syntactic for Indonesian text in Twitter

    Science.gov (United States)

    Amalia, Rizkiana; Arif Bijaksana, Moch; Darmantoro, Dhinta

    2018-03-01

    The presence of a negation word can change the polarity of a text; if it is not handled properly, it will affect the performance of sentiment classification. Negation words in Indonesian are ‘tidak’, ‘bukan’, ‘belum’ and ‘jangan’. There are also conjunctions that can reverse the actual value, such as ‘tetapi’ or ‘tapi’. The unigram model has shortcomings in dealing with negation because it treats the negation word and the negated words as separate words. A general approach to negation handling in English text adds the tag ‘NEG_’ to the words following a negation until the first punctuation mark. However, this may tag words that are not negated, and the approach does not handle negation and conjunction within one sentence. This study proposes a rule-based method that determines which words are negated by adapting the syntactic rules of negation in Indonesian to determine the scope of negation. With adapted syntactic rules and ‘NEG_’ tagging, an SVM classifier with an RBF kernel performs better than the other experiments. In terms of average F1-score, the proposed method improves on the baselines by 1.79% (baseline without negation handling) and 5% (baseline with existing negation handling) for a dataset in which all tweets contain negation words. For the second dataset, in which the number of negation words per tweet varies, it improves on the baselines by 2.69% (without negation handling) and 3.17% (with existing negation handling).
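
    The general baseline mentioned in this abstract (tag every word after a negation word with NEG_ until the first punctuation mark) can be sketched as below. This illustrates only that baseline, not the authors' rule-based scope detection; the function name `tag_negation`, the tokenizing regular expression and the punctuation set are assumptions made for the example.

```python
import re

# Indonesian negation words listed in the abstract.
NEGATION_WORDS = {"tidak", "bukan", "belum", "jangan"}
# Assumed set of punctuation marks that end a negation scope.
SCOPE_END = {".", ",", ";", ":", "!", "?"}


def tag_negation(text):
    """Prefix tokens that follow a negation word with 'NEG_' until punctuation."""
    # Split into word tokens and single punctuation characters.
    tokens = re.findall(r"\w+|[^\w\s]", text.lower())
    tagged = []
    in_scope = False
    for tok in tokens:
        if tok in SCOPE_END:
            in_scope = False          # punctuation closes the scope
            tagged.append(tok)
        elif tok in NEGATION_WORDS:
            in_scope = True           # the negation word itself is left untagged
            tagged.append(tok)
        elif in_scope and tok[0].isalnum():
            tagged.append("NEG_" + tok)
        else:
            tagged.append(tok)
    return tagged


print(tag_negation("Saya tidak suka film ini, tapi musiknya bagus."))
```

    In this toy sentence the scope happens to end at the comma before ‘tapi’; a sentence without that comma would have every remaining word tagged, which is the over-tagging problem the abstract describes.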

  6. Stylistic features of narrative procedure in a psychological short story in the context of teaching interpretation

    Directory of Open Access Journals (Sweden)

    Stakić Mirjana M.

    2016-01-01

    Full Text Available The paper investigates the stylistic features of narrative procedure in a psychological short story in the context of its interpretation in the teaching of the Serbian language and literature. The narrative procedure in a psychological short story is characterized by the use of first-person narration (the I-form), interior monologue and direct interior monologue, dreams, oversights and introspection. It is also characterized by a particular sentence structure, often incomplete and elliptical in form, used to express the conflicts going on in the characters' inner sphere and the verbal interaction between the characters. The narrative procedure applied in a psychological short story indicates that its plot is subordinated to internal psychological experiences. During the interpretation of a psychological short story, students are directed and encouraged, through the interpretation of stylistic and narrative procedures, to discover the complex and often hidden psychological mechanisms which spur the characters to act and which influence their behavior, verbal expression and mutual relations. The interpretation of language signs which may have psychological and semantic potential leads to the revealing of unconscious internal psychological processes and mechanisms which take place within a literary character.

  7. THE DYNAMICS OF STYLISTICALLY MARKED VERBAL LEXIS IN THE INFINITIVE FORM IN THE RUSSIAN LITERARY CRITICISM OF THE MIDDLE AND SECOND HALF OF THE 19th CENTURY

    Directory of Open Access Journals (Sweden)

    Yakovenko Larisa Aleksandrovna

    2014-06-01

    Full Text Available The article studies the functioning of stylistically marked verbal lexis in the infinitive form in literary critical articles by Russian publicists of the middle and second half of the 19th century. The critical texts of that period are characterized by the use of verbal lexemes with different functional-stylistic and expressive-emotional coloring. The author reveals the lexical content of infinitive forms and determines the character of their markedness (functional and stylistic, or expressive and emotional). The article presents the dynamics of the use of infinitive forms, showing that in 19th-century texts they are used to express critics' attitudes to works of fiction and literary images, and that this attitude is determined by the publicists' ideas about the ways of depicting reality. It is revealed that in the second half of the 19th century this form reflects the urge to evaluate the social maturity and literary skill of a writer, which leads to an increasing number of stylistically marked lexemes in the texts of that period.

  8. Parcellation as a stylistic dominant characteristic in the novel Nigdina by Svetlana Velmar Janković

    OpenAIRE

    Mimović Milica P.

    2015-01-01

    The subject of this research is parcellation as a stylistic dominant characteristic in the novel 'Nigdina' by Svetlana Velmar Janković, and also as the means of expression which is superior in regard to the other linguistic procedures. Considering the aim of syntactic-stylistic analysis in this paper, the examples were divided according to their syntactic functions identifying and interpreting syntactic structures that change word order, and simultaneously point out stylistic marking of the p...

  9. Combining automatic table classification and relationship extraction in extracting anticancer drug-side effect pairs from full-text articles.

    Science.gov (United States)

    Xu, Rong; Wang, QuanQiu

    2015-02-01

    Anticancer drug-associated side effect knowledge often exists in multiple heterogeneous and complementary data sources. A comprehensive anticancer drug-side effect (drug-SE) relationship knowledge base is important for computation-based drug target discovery, drug toxicity prediction and drug repositioning. In this study, we present a two-step approach that combines table classification and relationship extraction to extract drug-SE pairs from a large number of high-profile oncological full-text articles. The data consist of 31,255 tables downloaded from the Journal of Oncology (JCO). We first trained a statistical classifier to classify tables into SE-related and -unrelated categories. We then extracted drug-SE pairs from SE-related tables. We compared drug side effect knowledge extracted from JCO tables to that derived from FDA drug labels. Finally, we systematically analyzed relationships between anti-cancer drug-associated side effects and drug-associated gene targets, metabolism genes, and disease indications. The statistical table classifier is effective in classifying tables into SE-related and -unrelated (precision: 0.711; recall: 0.941; F1: 0.810). We extracted a total of 26,918 drug-SE pairs from SE-related tables with a precision of 0.605, a recall of 0.460, and an F1 of 0.520. Drug-SE pairs extracted from JCO tables are largely complementary to those derived from FDA drug labels; as many as 84.7% of the pairs extracted from JCO tables have not been included in a side effect database constructed from FDA drug labels. Side effects associated with anticancer drugs positively correlate with drug target genes, drug metabolism genes, and disease indications. Copyright © 2014 Elsevier Inc. All rights reserved.
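
    The F1 values in this abstract follow from precision and recall as their harmonic mean. A minimal sketch, using the table-classifier figures reported above (the function name `f1_score` is chosen here for illustration only):

```python
def f1_score(precision, recall):
    """Harmonic mean of precision and recall; 0.0 when both are zero."""
    if precision + recall == 0:
        return 0.0
    return 2 * precision * recall / (precision + recall)


# Table classifier: precision 0.711, recall 0.941 as reported in the abstract.
print(round(f1_score(0.711, 0.941), 3))
```

    Because the harmonic mean is pulled toward the smaller of the two values, a classifier with high recall but modest precision, as here, ends up with an F1 closer to its precision than a simple average would suggest.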

  10. THE INTENTIONAL ASPECT OF MODERN STYLISTICS

    Directory of Open Access Journals (Sweden)

    Natalia Klushina

    2013-10-01

    Full Text Available There are cognitive, communicative, and pragmatic scientific methods in the modern anthropocentric linguistic paradigm. We have created the intentional method as a new integrative linguistic method for studying modern Russian discourse. The main scientific categories of the intentional method include: intention, intentionality, intentional categories, author, and addressee. The intention of the addressee consists of cognitive, communicative, and pragmatic constituents. We can divide intention into cognitive intention, which helps to understand the world, and communicative intention, which organizes communication between addressee and addresser. The intentional method can help to search out creative and subjective factors of human communication. This method can help to understand the nonlinearity and creativity of the communicative processes. The different types of effects of modern communication are analysed in this article. The effect holds not only a perlocutive quality in integral communication; therefore we announce that there are both positive and negative intentional effects in modern communication. Communicative failures are situations when an addressee refuses to carry on a dialogue or experiences indifference to the interlocutor.

  11. Aesthetic Proximity: the Role of Stylistic Programme Elements in Format Localisation

    Directory of Open Access Journals (Sweden)

    Jolien van Keulen

    2016-08-01

    Full Text Available Implications of the transnationalisation of television are often studied by focusing on the localisation of the content of formatted programmes. Although television is essentially an audio-visual medium, little attention has been paid to the aesthetic aspects of television texts in relation to transnationalisation and formatting. Transnationalisation of production practices, such as through formatting, implies a transnational aesthetic. At the same time, aspects of style are specific to place, culture or audience. In this article, the localisation of stylistic programme elements is explored using a comparison of two reality format adaptations. It is argued that style plays an important role in the expression of the local in a transnational industry.

  12. Simile: the most salient stylistic feature in Kelile and Demne

    Directory of Open Access Journals (Sweden)

    Maryam Mahmoodi

    2014-12-01

    Full Text Available Abstract Kelile and Demne is one of the most salient samples of Persian technical prose. Rhetorical and semantic figures and figures of speech, namely simile, metaphor, metonymy and irony, are among the stylistic features of this book. Among these, simile, as the most influential imaginative tool, plays a dominant role in the illustrations of the book. In this article, simile has been analyzed and investigated in all its variations in Kelile and Demne. In this book, simile appears from its most laconic form (eloquent simile) to its most extensive form. But the major features of these similes are their outspokenness, explicitness and sometimes their novelty. Among the likening components, the range of image vocabulary is one of the likening features in this book. Also, the point of similarity has usually been abstracted from man's states, shape, place, space, volume and generally affairs concerning the visual and tactile senses, so its perception is not too difficult. The variety and extension of likening vehicles in this work are worth contemplating. In analysis of simile on the credit of both parties, we can conclude that ratio-emotional similes are among the most frequent kinds of simile. Nasrollah Monshi has extended the field of emotional similes and has manipulated the relations between objects in a novel way. Allegoric simile has been used abundantly in Kelile and Demne. It justifies the didactic function of this text. Allegory approaches its main role in this book, i.e. arguing and convincing. The contents of allegories in this book are moral and political and, in terms of form, they are anecdotes of animals and human beings. The types of similes on the credit of form, namely equalization similes, implied comparative similes and subtrahend similes, have also been used. Among the salient features of this book, several images together or in interference with each other have been used in one word or sentence. Sometimes similes accompany other

  13. Simile: the most salient stylistic feature in Kelile and Demne

    Directory of Open Access Journals (Sweden)

    Maryam Mahmoodi

    2014-11-01

    Full Text Available Abstract Kelile and Demne is one of the most salient samples of Persian technical prose. Rhetorical and semantic figures and figures of speech, namely simile, metaphor, metonymy and irony, are among the stylistic features of this book. Among these, simile, as the most influential imaginative tool, plays a dominant role in the illustrations of the book. In this article, simile has been analyzed and investigated in all its variations in Kelile and Demne. In this book, simile appears from its most laconic form (eloquent simile) to its most extensive form. But the major features of these similes are their outspokenness, explicitness and sometimes their novelty. Among the likening components, the range of image vocabulary is one of the likening features in this book. Also, the point of similarity has usually been abstracted from man's states, shape, place, space, volume and generally affairs concerning the visual and tactile senses, so its perception is not too difficult. The variety and extension of likening vehicles in this work are worth contemplating. In analysis of simile on the credit of both parties, we can conclude that ratio-emotional similes are among the most frequent kinds of simile. Nasrollah Monshi has extended the field of emotional similes and has manipulated the relations between objects in a novel way. Allegoric simile has been used abundantly in Kelile and Demne. It justifies the didactic function of this text. Allegory approaches its main role in this book, i.e. arguing and convincing. The contents of allegories in this book are moral and political and, in terms of form, they are anecdotes of animals and human beings. The types of similes on the credit of form, namely equalization similes, implied comparative similes and subtrahend similes, have also been used. Among the salient features of this book, several images together or in interference with each other have been used in one word or sentence. Sometimes similes accompany other

  14. Stylistics of Nafthat ol-Masdur by Zeidari Nasvi

    Directory of Open Access Journals (Sweden)

    Fereydon Tahmasbi

    2016-05-01

    disrupt the sequence of the events. The author, in accordance with the society and the available prose, adopted the mix of Arabic and Persian prose and this is one of the effects of socio-political structure on his prose. The presence of different social classes in his book reflects the interaction between literature and society.

  15. Five-way Smoking Status Classification Using Text Hot-Spot Identification and Error-correcting Output Codes

    OpenAIRE

    Cohen, Aaron M.

    2008-01-01

    We participated in the i2b2 smoking status classification challenge task. The purpose of this task was to evaluate the ability of systems to automatically identify patient smoking status from discharge summaries. Our submission included several techniques that we compared and studied, including hot-spot identification, zero-vector filtering, inverse class frequency weighting, error-correcting output codes, and post-processing rules. We evaluated our approaches using the same methods as the i2...

  16. Stylistics in the Southeast Asian ESL or EFL Classroom: A Collection of Potential Teaching Activities

    Science.gov (United States)

    Gonzales, Wilkinson Daniel Wong; Flores, Eden R.

    2016-01-01

    For the past few decades, stylistics has emerged as a discipline that encompasses both literary criticism and linguistics. The integration of both disciplines opened many opportunities for English literature and language teachers to get creative in their teaching--by introducing the stylistic approach in their classrooms. However, in a typical…

  17. Stylistic Analysis of the Short Story "The Last Word" by Dr. A. R. Tabassum

    Science.gov (United States)

    Bari Khan, Abdul; Ahmad, Madiha; Ahmad, Sofia; Ijaz, Nida

    2015-01-01

    In this article stylistic analysis of short story "The Last Word" by Dr. A. R. Tabassum is performed. The formative elements of the story, such as point of view, characters and allegorical element, are discussed in detail so as to give a better insight of the story. The story is analyzed stylistically in terms of figures of speech where…

  18. A Stylistic Analysis of Linguistic Patterns in Chichamanda Ngozi Adichie’s Purple Hibiscus

    Directory of Open Access Journals (Sweden)

    Muchamad Sholakhuddin Al Fajri

    2017-06-01

    Full Text Available This study aims to carry out a detailed and systematic stylistic analysis of linguistic patterns in the novel Purple Hibiscus by Chichamanda Ngozi Adichie. It analyses a specific extract of the novel in terms of narration and point of view, conversational analysis, speech and thought presentation and mind style, and examines how these linguistic devices and patterns are employed by the author to shape the characters’ personalities and the relationships between them in the reader’s mind. The results suggest that the author successfully represents the protagonist, Kambili, as an obedient and salient daughter who deeply respects her father, while her father, Eugene, is constructed as a strict and religious father who imposes absolute control on his daughter.

  19. Diagnostic investigations and historical-stylistic evaluation on the oil painting: "reading man by oil lamp light"

    Directory of Open Access Journals (Sweden)

    Salvatore Lorusso

    2006-02-01

    Full Text Available This investigation aims to verify the attribution to Gerrit van Hontorst of an oil painting (70 x 50.5 cm) portraying a man reading by oil lamp light. The note covers not only a stylistic and historical-artistic evaluation but also what diagnostic techniques reveal about the material components, the execution technique and the preservation conditions. The investigation rejects the attribution to the painter Gerrit van Hontorst, but it does not exclude a dating within the 17th century.

  20. Five-way smoking status classification using text hot-spot identification and error-correcting output codes.

    Science.gov (United States)

    Cohen, Aaron M

    2008-01-01

    We participated in the i2b2 smoking status classification challenge task. The purpose of this task was to evaluate the ability of systems to automatically identify patient smoking status from discharge summaries. Our submission included several techniques that we compared and studied, including hot-spot identification, zero-vector filtering, inverse class frequency weighting, error-correcting output codes, and post-processing rules. We evaluated our approaches using the same methods as the i2b2 task organizers, using micro- and macro-averaged F1 as the primary performance metric. Our best performing system achieved a micro-F1 of 0.9000 on the test collection, equivalent to the best performing system submitted to the i2b2 challenge. Hot-spot identification, zero-vector filtering, classifier weighting, and error correcting output coding contributed additively to increased performance, with hot-spot identification having by far the largest positive effect. High performance on automatic identification of patient smoking status from discharge summaries is achievable with the efficient and straightforward machine learning techniques studied here.
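
    The difference between the micro- and macro-averaged F1 metrics named in this abstract can be shown with a short sketch. The function name `micro_macro_f1` and the toy labels are hypothetical; they are not the i2b2 data or the task organizers' evaluation code.

```python
from collections import Counter


def micro_macro_f1(gold, pred):
    """Return (micro_f1, macro_f1) for single-label multi-class predictions."""
    labels = sorted(set(gold) | set(pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1   # predicted class gains a false positive
            fn[g] += 1   # true class gains a false negative

    def f1(t, f_pos, f_neg):
        denom = 2 * t + f_pos + f_neg
        return 2 * t / denom if denom else 0.0

    # Micro: pool the counts over all classes, then compute one F1.
    micro = f1(sum(tp.values()), sum(fp.values()), sum(fn.values()))
    # Macro: compute F1 per class, then take the unweighted mean.
    macro = sum(f1(tp[c], fp[c], fn[c]) for c in labels) / len(labels)
    return micro, macro


# Hypothetical gold and predicted labels for six discharge summaries.
gold = ["non", "non", "non", "current", "past", "unknown"]
pred = ["non", "non", "current", "current", "past", "past"]
print(micro_macro_f1(gold, pred))
```

    In this toy case the macro average is lower than the micro average because the single "unknown" document is missed entirely, and macro averaging gives that rare class the same weight as the frequent ones.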

  1. Criticism versus stylistics: an analysis of their areas of overlap and ...

    African Journals Online (AJOL)

    Criticism versus stylistics: an analysis of their areas of overlap and contrast. ... AFRREV LALIGENS: An International Journal of Language, Literature and Gender Studies ... in a number of areas, a great deal of discrepancy exists between them.

  2. Stylistic analysis of headlines in science journalism: A case study of New Scientist.

    Science.gov (United States)

    Molek-Kozakowska, Katarzyna

    2017-11-01

    This article explores science journalism in the context of the media competition for readers' attention. It offers a qualitative stylistic perspective on how popular journalism colonizes science communication. It examines a sample of 400 headlines collected over the period of 15 months from the ranking of five 'most-read' articles on the website of the international magazine New Scientist. Dominant lexical properties of the sample are first identified through frequency and keyness survey and then analysed qualitatively from the perspective of the stylistic projection of newsworthiness. The analysis illustrates various degrees of stylistic 'hybridity' in online popularization of scientific research. Stylistic patterns that celebrate, domesticate or personalize science coverage (characteristic of popular journalism) are intertwined with devices that foreground tentativeness, precision and informativeness (characteristic of science communication). The article reflects on the implications of including various proportions of academic and popular styles in science journalism.

  3. Nigerian Visual Arts (1970-2003) and the Impact of Some Stylistic ...

    African Journals Online (AJOL)

    African Research Review ... The art productions and techniques from the stylistic tendencies have created vista of ... they experiment with materials and techniques without losing touch with their African identity. ... AJOL African Journals Online.

  4. Computing symmetrical strength of N-grams: a two pass filtering approach in automatic classification of text documents.

    Science.gov (United States)

    Agnihotri, Deepak; Verma, Kesari; Tripathi, Priyanka

    2016-01-01

    Contiguous sequences of terms (N-grams) in documents are symmetrically distributed among different classes. This symmetrical distribution raises uncertainty about which class an N-gram belongs to. In this paper, we focus on selecting the most discriminating N-grams by reducing the effects of symmetrical distribution. In this context, a new text feature selection method, named the symmetrical strength of the N-grams (SSNG), is proposed using a two-pass filtering based feature selection (TPF) approach. In the first pass of the TPF, the SSNG method chooses various informative N-grams from all the N-grams extracted from the corpus. In the second pass, the well-known Chi-square (χ²) method is used to select a few of the most informative N-grams. To classify the documents, two standard classifiers, Multinomial Naive Bayes and Linear Support Vector Machine, are applied to ten standard text data sets. On most of the datasets, the experimental results show that the performance and success rate of the SSNG method using the TPF approach are superior to those of state-of-the-art methods, viz. Mutual Information, Information Gain, Odds Ratio, Discriminating Feature Selection and χ².
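
    The chi-square statistic used in the second pass can be sketched for a single N-gram and a single class from a 2x2 document-count table. This shows only the standard chi-square feature score, not the SSNG measure itself, whose formula is not given in the abstract; the function name and the counts are hypothetical.

```python
def chi_square(a, b, c, d):
    """Chi-square score of a term for a class from a 2x2 contingency table.

    a: documents in the class that contain the term
    b: documents outside the class that contain the term
    c: documents in the class that lack the term
    d: documents outside the class that lack the term
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        return 0.0
    return n * (a * d - b * c) ** 2 / denom


# An N-gram concentrated in one class scores high ...
print(chi_square(40, 10, 10, 40))
# ... while one spread evenly over both classes scores zero.
print(chi_square(25, 25, 25, 25))
```

    The second call corresponds to the situation the abstract calls a symmetrical distribution: the N-gram appears equally often inside and outside the class, so it carries no information for separating them.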

  5. The contribution of the vaccine adverse event text mining system to the classification of possible Guillain-Barré syndrome reports.

    Science.gov (United States)

    Botsis, T; Woo, E J; Ball, R

    2013-01-01

    We previously demonstrated that a general purpose text mining system, the Vaccine adverse event Text Mining (VaeTM) system, could be used to automatically classify reports of anaphylaxis for post-marketing safety surveillance of vaccines. To evaluate the ability of VaeTM to classify reports to the Vaccine Adverse Event Reporting System (VAERS) of possible Guillain-Barré Syndrome (GBS). We used VaeTM to extract the key diagnostic features from the text of reports in VAERS. Then, we applied the Brighton Collaboration (BC) case definition for GBS, and an information retrieval strategy (i.e. the vector space model) to quantify the specific information that is included in the key features extracted by VaeTM and compared it with the encoded information that is already stored in VAERS as Medical Dictionary for Regulatory Activities (MedDRA) Preferred Terms (PTs). We also evaluated the contribution of the primary (diagnosis and cause of death) and secondary (second level diagnosis and symptoms) diagnostic VaeTM-based features to the total VaeTM-based information. MedDRA captured more information and better supported the classification of reports for GBS than VaeTM (AUC: 0.904 vs. 0.777); the lower performance of VaeTM is likely due to the lack of extraction by VaeTM of specific laboratory results that are included in the BC criteria for GBS. On the other hand, the VaeTM-based classification exhibited greater specificity than the MedDRA-based approach (94.96% vs. 87.65%). Most of the VaeTM-based information was contained in the secondary diagnostic features. For GBS, clinical signs and symptoms alone are not sufficient to match MedDRA coding for purposes of case classification, but are preferred if specificity is the priority.

  6. The Contribution of the Vaccine Adverse Event Text Mining System to the Classification of Possible Guillain-Barré Syndrome Reports

    Science.gov (United States)

    Botsis, T.; Woo, E. J.; Ball, R.

    2013-01-01

    Background We previously demonstrated that a general purpose text mining system, the Vaccine adverse event Text Mining (VaeTM) system, could be used to automatically classify reports of anaphylaxis for post-marketing safety surveillance of vaccines. Objective To evaluate the ability of VaeTM to classify reports to the Vaccine Adverse Event Reporting System (VAERS) of possible Guillain-Barré Syndrome (GBS). Methods We used VaeTM to extract the key diagnostic features from the text of reports in VAERS. Then, we applied the Brighton Collaboration (BC) case definition for GBS, and an information retrieval strategy (i.e. the vector space model) to quantify the specific information that is included in the key features extracted by VaeTM and compared it with the encoded information that is already stored in VAERS as Medical Dictionary for Regulatory Activities (MedDRA) Preferred Terms (PTs). We also evaluated the contribution of the primary (diagnosis and cause of death) and secondary (second level diagnosis and symptoms) diagnostic VaeTM-based features to the total VaeTM-based information. Results MedDRA captured more information and better supported the classification of reports for GBS than VaeTM (AUC: 0.904 vs. 0.777); the lower performance of VaeTM is likely due to the lack of extraction by VaeTM of specific laboratory results that are included in the BC criteria for GBS. On the other hand, the VaeTM-based classification exhibited greater specificity than the MedDRA-based approach (94.96% vs. 87.65%). Most of the VaeTM-based information was contained in the secondary diagnostic features. Conclusion For GBS, clinical signs and symptoms alone are not sufficient to match MedDRA coding for purposes of case classification, but are preferred if specificity is the priority. PMID:23650490
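
    The vector space model mentioned in the Methods typically compares two texts as term-count vectors using cosine similarity. The sketch below assumes that plain setup; the abstract does not say which term weighting was used, and the function name and the example terms are hypothetical rather than taken from the BC case definition or from VAERS.

```python
import math
from collections import Counter


def cosine_similarity(tokens_a, tokens_b):
    """Cosine similarity between two bags of words (raw term counts)."""
    va, vb = Counter(tokens_a), Counter(tokens_b)
    # Dot product over the terms the two vectors share.
    dot = sum(va[t] * vb[t] for t in va.keys() & vb.keys())
    norm_a = math.sqrt(sum(v * v for v in va.values()))
    norm_b = math.sqrt(sum(v * v for v in vb.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


# Hypothetical feature terms from a report and from a case-definition query.
report_terms = ["weakness", "areflexia", "paresthesia"]
query_terms = ["weakness", "areflexia", "fever"]
print(cosine_similarity(report_terms, query_terms))
```

    Ranking reports by such a score against a query built from a case definition yields an ordering from which AUC and specificity can then be computed at a chosen threshold.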

  7. Language Personality of the Publicist: Rhetorical and Stylistic Canon (Yu. Senkevich “To “Ra” Across the Atlantic”)

    Directory of Open Access Journals (Sweden)

    Olga V. Shatalova

    2017-10-01

    Full Text Available The article identifies the communicative and linguistic parameters of the speech of the twentieth-century publicist Yu. Senkevich, which are proposed as a model for forming the language personality in the developing modern information and communicative environment. An analysis of the book «On “Ra” through Atlantic» shows that Yu.N. Senkevich’s work meets the main criteria of popular scientific journalism: a high degree of reliability and authoritativeness of the publicist, and dynamism, dramatic nature and intelligence of the statement. Unostentatious enlightenment, based on updating scientific knowledge for the addressee and on the dialogization of publicistic discourse, together with a high level of psychological and philosophical generality and the declaration of humanistic values, forms a specific rhetoric that is supported by the formal and grammatical organization of the publicist’s speech. The priority given to syntactic constructions of a certain type and to stylistic ladders and figures determines the dynamism of the statement and the scale of the representation of the material. Humour and light self-irony, as significant characteristics of the language personality, define the basis of the rhetorical and stylistic canon realized in Yu.N. Senkevich’s publicistic works: «the human view of people and society», which becomes a necessary reference point in the modern information and communicative environment.

  8. Development of Feature Set, Classification Implementation and Applications for Vowel Migration/Modification in Sung Filipino (Tagalog) Texts and Perceived Intelligibility

    Directory of Open Access Journals (Sweden)

    Virginia B. Bustos

    2009-12-01

    Full Text Available With the emergence of research on real-time visual feedback to supplement vocal pedagogy, the utilization of technology in the world of music is now seen to accelerate skills learning and enhance cognitive development. The researchers of this project aim to further analyze vowel intelligibility and develop software applications intended to be used not only by professional singers but also by individuals who wish to improve their singing capability. Data in the form of sung vowels and song pieces were obtained from 46 singers. A Listening Test was then conducted on these samples to obtain the ground truth for vowel classification based on human perception. Simulation of the human auditory perception of sung Filipino vowels was performed using formant frequencies and Mel-frequency cepstral coefficients as feature vector inputs to a two-stage Discriminant Analysis classifier. The setup resulted in an over-all Training Set accuracy of 89.4% and an over-all Test Set accuracy of 90.9%. The accuracy of the classifier, measured in terms of the correspondence of vowel classifications obtained from the classifier with the results of the Listening Test, reached 92.3%. Using information obtained from the classifier, offline and online/real-time software applications were developed. The main application features include the display of the spectral envelope and spectrogram, pitch and vibrato analysis and direct feedback on the classification of the sung vowel. These features were recommended by singers who were surveyed and were incorporated in the applications to aid singers to adjust formant locations, directly determine listener’s perception of sung vowels, perform modeling effectively and carry out vowel migration.
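
    The record above uses Mel-frequency cepstral coefficients as classifier features without saying how they were computed. As a small, hedged illustration of the Mel scale those coefficients are built on, the sketch below uses one widely cited Hz-to-Mel conversion formula. Whether the authors used this variant is not stated in the abstract, and the frequency values in the loop are hypothetical, not measurements from the study.

```python
import math


def hz_to_mel(f_hz):
    """Convert a frequency in Hz to the Mel scale (common 2595 * log10 variant)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)


def mel_to_hz(mel):
    """Inverse of hz_to_mel."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


# Hypothetical frequencies chosen only to show the compression of high values.
for f in (300.0, 1000.0, 2300.0):
    print(f"{f:7.1f} Hz -> {hz_to_mel(f):7.1f} mel")
```

    The scale is roughly linear at low frequencies and logarithmic above, which is why equal steps in Mel correspond to ever larger steps in Hz.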

  9. Clipped Wings and the Great Abyss: Cognitive Stylistics and Implicatures in Abiezer Coppe’s ‘Prophetic’ Recantation

    Directory of Open Access Journals (Sweden)

    Borgogni Daniele

    2017-03-01

    Full Text Available In this article, two major paradigms within cognitive stylistics, the Conceptual Metaphor Theory (CMT) and the Conceptual Integration Theory (CIT), are applied as largely complementary approaches to discuss the scope and implicatures of the central metaphorical image of Coppe’s Return to the wayes of Truth (1651), a text written by one of the most famous radical preachers of the Civil War period as a plea to be released from prison. The article will focus on how the linguistic and cultural contexts of Coppe’s prophetic writing, in their interaction with the dynamic conceptual relationships of a conceptual integration network, open up new possibilities of perspectivizing and insinuating radically different meanings and implicatures: the use of blends in Coppe’s text has a direct effect on the structure of the analogies that can be made between mental spaces, thereby triggering new meaning effects, supplementary symbolizing patterns, and unpredictable perlocutionary effects.

  10. Some Issues in the Automatic Classification of U.S. Patents Working Notes for the AAAI-98 Workshop on Learning for Text Categorization

    National Research Council Canada - National Science Library

    Larkey, Leah

    1998-01-01

    The classification of U.S. patents poses some special problems due to the enormous size of the corpus, the size and complex hierarchical structure of the classification system, and the size and structure of patent documents...

  11. POETICS OF TRANSCENDENCE: STYLISTIC REDUCTION AS A TOOL FOR REPRESENTATION OF SACRED MEANINGS

    Directory of Open Access Journals (Sweden)

    Elena Brazgovskaya

    2016-10-01

    Full Text Available The main direction of the work is connected to the representation of abstract (transcendent) objects in music and literature. The article analyses "Cantus in Memoriam Benjamin Britten" by Arvo Pärt and some poems of Czesław Miłosz. The metaphysical dimension of reality involves forms and things existing beyond the boundaries of empirical perception and, at first sight, beyond descriptive practices. Abstract objects are available in intellectual experience, but culture must transform them into a symbolic form. As a rule, this is connected to the practice of art minimalism. The essence of minimalism is the reduction of the number of stylistic tools and the “purification” of perception from visual / auditory images (not a mimetic use of language). For the representation of the sacred Pärt uses only mensural canon form, scale and chord. These “characters” are deprived of descriptive function, but have symbolic potential (canon as a sign of stopped time, the eternal return). The distinctive feature of Miłoszʼs style is the pursuit to “clean” the signs (indexical and symbolic). There is a reverse side to this distillation of language: the rejection of the subjective position and emotional experience, and the distance between the person and the object of representation.

  12. MASCULINE LANGUAGE IN INDONESIAN NOVELS: A FEMINIST STYLISTIC APPROACH ON BELENGGU AND PENGAKUAN PARIYEM

    Directory of Open Access Journals (Sweden)

    Supriyadi .

    2014-06-01

    Full Text Available Belenggu is a novel written by Armijn Pane in 1938, whereas Pengakuan Pariyem is a lyrical novel written by Linus Suryadi AG and published in 1980. Both are interesting to analyze from linguistic aspects, especially in relation to gender and patriarchal issues. In this case, the proper approach is the feminist stylistics of Sara Mills, since it analyzes literary works from linguistic aspects and then extends the analysis to the surrounding context at the time of publication. The results show that Belenggu basically uses masculine language, including words, phrases, clauses, sentences, and discourses, when related to its context. Contextually, Belenggu represents the author’s response to the conditions of his society, in which women tried to insist on their rights to equality (with men). It also represents the author’s criticism of women, holding that it is better for women to keep working domestically and supporting their husbands. Meanwhile, Pengakuan Pariyem is a lyrical novel that regards men and women as having mutual relationships, although women still work domestically and men work outside.

  13. A Stylistic Analysis of the Dialogues in Pirates of the Caribbean: On Strange Tides

    Institute of Scientific and Technical Information of China (English)

    李冯茹

    2017-01-01

    Dialogues in classical films are always the concentrated scripts studied by scholars. This thesis performs a stylistic analysis of dialogues from Pirates of the Caribbean: On Strange Tides at the levels of phonology, lexicon, syntax, semantics and pragmatics to make a good attempt in the application of stylistic analysis.

  14. Magical cooking: Some stylistic characteristics of the novel Like water for chocolate

    Directory of Open Access Journals (Sweden)

    Uršula Kastelic Vukadinović

    2013-12-01

    Full Text Available In the paper we analyze some of the stylistic characteristics of the novel Like Water for Chocolate from the perspective of a non-Mexican reader. The narrator of the story interlaces the typical linguistic structures specific to cooking recipes and the story of a forbidden love between Tita and Pedro. As a good cook she gives advice, describes the procedures for the preparation of the food, all of which she intertwines with the story as once told to her, which is explicitly expressed by using reporting verbs. We found out that one of the characteristics of the novel is the visualization of the story. The author herself says that her literary creation is based on visual images and that afterwards she turns them into a story. In the novel this is reflected in Tita’s intense response to sensory stimuli. In the article, we highlight examples from the text, structured according to the predominant senses involved. Through the food that Tita is preparing Tita and Pedro establish an unusual, passionate and sensual relationship. Among the ingredients for the dishes that are connected to their love story, the reader encounters many unknown Mexican cultural expressions, which do not hinder the understanding of the story. The text shows that we are dealing with a dish or an ingredient that is mentioned at the beginning as a part of the recipe or presented in such a context. Therefore, the coherence of the text is maintained. The reader takes pleasure in the reading that transports him to the distant worlds of magical realism.

  15. Puerto Nuevo and the Origins of the Stylistic-Religious Paracas Tradition

    OpenAIRE

    García, Rubén

    2012-01-01

    Puerto Nuevo was previously defined by García y Pinilla as a phase/style where there is an amalgam of Chavín and Cupisnique cultural elements with those of the south coast at the end of the Initial Period and the beginning of the Early Horizon. This paper presents new evidences and comparative stylistic analyses of contemporary societies that propose that it was during Puerto Nuevo times that the Paracas stylistic and religious tradition initiated, and therefore place it chronologically at th...

  16. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text.

    Science.gov (United States)

    Krallinger, Martin; Vazquez, Miguel; Leitner, Florian; Salgado, David; Chatr-Aryamontri, Andrew; Winter, Andrew; Perfetto, Livia; Briganti, Leonardo; Licata, Luana; Iannuccelli, Marta; Castagnoli, Luisa; Cesareni, Gianni; Tyers, Mike; Schneider, Gerold; Rinaldi, Fabio; Leaman, Robert; Gonzalez, Graciela; Matos, Sergio; Kim, Sun; Wilbur, W John; Rocha, Luis; Shatkay, Hagit; Tendulkar, Ashish V; Agarwal, Shashank; Liu, Feifan; Wang, Xinglong; Rak, Rafal; Noto, Keith; Elkan, Charles; Lu, Zhiyong; Dogan, Rezarta Islamaj; Fontaine, Jean-Fred; Andrade-Navarro, Miguel A; Valencia, Alfonso

    2011-10-03

    Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthews correlation coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were evaluated by comparing
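
    The record above reports a best MCC of 0.55 alongside an accuracy of 89%. As a hedged sketch of why the two figures can diverge on an imbalanced set, the Python below computes both from a binary confusion matrix. The counts are hypothetical, chosen only to give 89% accuracy; they are not the BioCreative III results, and the convention of returning 0.0 for a zero denominator is an assumption of this sketch.

```python
import math


def matthews_corrcoef(tp, tn, fp, fn):
    """Matthews correlation coefficient for a binary confusion matrix.

    Returns 0.0 when any marginal sum is zero (a common convention, assumed here).
    """
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    if denom == 0:
        return 0.0
    return (tp * tn - fp * fn) / denom


def accuracy(tp, tn, fp, fn):
    """Fraction of all items that were classified correctly."""
    return (tp + tn) / (tp + tn + fp + fn)


# Hypothetical counts for an imbalanced test set (100 relevant, 900 non-relevant).
counts = dict(tp=50, tn=840, fp=60, fn=50)
print(f"accuracy = {accuracy(**counts):.2f}")
print(f"MCC      = {matthews_corrcoef(**counts):.2f}")
```

    Because the non-relevant class dominates, accuracy stays high even though half of the relevant items are missed; MCC uses all four cells of the matrix and so reflects that weakness.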

  17. The Protein-Protein Interaction tasks of BioCreative III: classification/ranking of articles and linking bio-ontology concepts to full text

    Science.gov (United States)

    2011-01-01

    Background Determining usefulness of biomedical text mining systems requires realistic task definition and data selection criteria without artificial constraints, measuring performance aspects that go beyond traditional metrics. The BioCreative III Protein-Protein Interaction (PPI) tasks were motivated by such considerations, trying to address aspects including how the end user would oversee the generated output, for instance by providing ranked results, textual evidence for human interpretation or measuring time savings by using automated systems. Detecting articles describing complex biological events like PPIs was addressed in the Article Classification Task (ACT), where participants were asked to implement tools for detecting PPI-describing abstracts. Therefore the BCIII-ACT corpus was provided, which includes a training, development and test set of over 12,000 PPI relevant and non-relevant PubMed abstracts labeled manually by domain experts and recording also the human classification times. The Interaction Method Task (IMT) went beyond abstracts and required mining for associations between more than 3,500 full text articles and interaction detection method ontology concepts that had been applied to detect the PPIs reported in them. Results A total of 11 teams participated in at least one of the two PPI tasks (10 in ACT and 8 in the IMT) and a total of 62 persons were involved either as participants or in preparing data sets/evaluating these tasks. Per task, each team was allowed to submit five runs offline and another five online via the BioCreative Meta-Server. From the 52 runs submitted for the ACT, the highest Matthews correlation coefficient (MCC) score measured was 0.55 at an accuracy of 89% and the best AUC iP/R was 68%. Most ACT teams explored machine learning methods, some of them also used lexical resources like MeSH terms, PSI-MI concepts or particular lists of verbs and nouns, some integrated NER approaches. For the IMT, a total of 42 runs were

  18. "If You Have to Ask, You'll Never Know": Effects of Specialised Stylistic Expertise on Predictive Processing of Music

    DEFF Research Database (Denmark)

    Hansen, Niels Christian; Vuust, Peter; Pearce, Marcus

    2016-01-01

    Musical expertise entails meticulous stylistic specialisation and enculturation. Even so, research on musical training effects has focused on generalised comparisons between musicians and non-musicians, and cross-cultural work addressing specialised expertise has traded cultural specificity and sensitivity for other methodological limitations. This study aimed to experimentally dissociate the effects of specialised stylistic training and general musical expertise on the perception of melodies. Non-musicians and professional musicians specialising in classical music or jazz listened to sampled... ...-musicians and classical musicians used a stylistically irrelevant cognitive model of general tonal music providing support for the theory of cognitive firewalls between stylistic models in predictive processing of music.

  19. Mapping the stylistic affiliations of Le Corbusier's work

    NARCIS (Netherlands)

    Panigyrakis, P.I.

    2015-01-01

    The paper deals with Le Corbusier's connection to the term "style". A classification of his work in specific architectural styles is discussed, followed by a description of the procedure through which the style of the man Le Corbusier was constructed; in an attempt to search meaning in his continual

  20. A Pragma-Stylistic Analysis of President Goodluck Ebele Jonathan Inaugural Speech

    Science.gov (United States)

    Abuya, Eromosele John

    2012-01-01

    The study was an examination through the pragma-stylistic approach to meaning of the linguistic acts that manifest in the Inaugural Speech of Goodluck Ebele Jonathan as the democratically elected president in May 2011 General Elections in Nigeria. Hence, the study focused on speech acts type of locution, illocutionary and perlocutionary in the…

  1. Multidimensionality of Teachers' Graded Responses for Preschoolers' Stylistic Learning Behavior: The Learning-to-Learn Scales

    Science.gov (United States)

    McDermott, Paul A.; Fantuzzo, John W.; Warley, Heather P.; Waterman, Clare; Angelo, Lauren E.; Gadsden, Vivian L.; Sekino, Yumiko

    2011-01-01

    Assessment of preschool learning behavior has become very popular as a mechanism to inform cognitive development and promote successful interventions. The most widely used measures offer sound predictions but distinguish only a few types of stylistic learning and lack sensitive growth detection. The Learning-to-Learn Scales was designed to…

  2. The Territory of Language: Linguistics, Stylistics, and the Teaching of Composition.

    Science.gov (United States)

    McQuade, Donald A.

    Intended to chart the interconnections of linguistics, stylistics, and the teaching of composition, this book encourages a productive collective effort to cultivate linguistics among teachers of writing. Chapter titles and their authors are as follows: (1) "Grammar in American College Composition: An Historical Overview" (R. J. Connors);…

  3. A STYLISTIC ANALYSIS OF “THE RIME OF THE ANCIENT MARINER”

    Directory of Open Access Journals (Sweden)

    Shaukat Khan

    2016-12-01

    Full Text Available If a specimen of literary art is seen as a fine tapestry of words made by the skilled seamstress—the poet, then the lexis and structure of a language are the raw materials—the fabric and the thread—by weaving which into specific patterns the finished product is achieved. The choice of materials and their arrangement into unique patterns always bear an image of their creator, or the artist; thus, a close view of them reveals the artist’s identity and brings out the meaningful message that underlies the ornate running threads. Mostly, the students of literary studies cannot appreciate the beauty of the literary classics on their own. Consequently, they simply mimic the ideas, and sometimes even the words, of famous professional critics when asked to give their own critical judgment on the aesthetic merit or the thematic quality of a literary work in the shape of a home assignment, classroom presentation or an annual assessment test. Now, the researcher has got the inspiration for carrying out this study from an idea expounded in Widdowson (1975) that this mimicry can be replaced by genuine individual opinion if the students, or even those people who have non-academic concerns with literature, are brought to a standpoint from where they can have a closer view of the raw materials, the language resources, which are involved in the making of a literary product. And, if the product in its finished form cannot elicit a desired response from them then making them sensitive to the process of its making can be quite effective in this regard. Through the present study, an attempt has been made to show an easy access to the outlandish world of verse by means of the linguistic route which is laid with the familiar flagstones of grammar and vocabulary. That is, in this study the elusiveness of poetry will be dealt with the precision of a social scientist, the linguist. The approach which serves as the basis of this study is not an invention

  4. Comparison of Aminpour’s Qhazal and Qhazve’s Qhazal Based on Structural Stylistics

    Directory of Open Access Journals (Sweden)

    Somayye Khorshidi

    2014-11-01

    Full Text Available The importance of stylistics in examining texts has resulted in the burgeoning of various stylistic schools, with their differing methodologies. Among them, structural stylistics, which is the more polished version of formal stylistics, looks over the structure of words, sounds and syntax in the text and then focuses on their relation to the content. In fact, structural stylistics, drawing upon the structuralists' views, is formed on the basis of structural linguistics, and works to analyze a literary text with regard to its organic unity and the inter-relationship of the parts to the whole. In other words, the main purpose is to approach the content of a work through its form and structure. In this method, stylistic features of the work are recognized through understanding the structural proportions between sounds, words and syntax. Accordingly, the structuralists regard the style as the manner of deviation and extra-regularity and the frequency of its occurrence in a particular era. The present article, first, refers to the definitions of style and stylistic schools and enumerates the features of structural stylistics and then sets out to study and compare two ghazals on the subject of the Holy Defense by Gheisar Aminpour ("Taghvimha" or "Calendars") and Alireza Ghazve ("Ghesmat" or "Destiny"), in the light of structural stylistics. Through analyzing the different parts of the texts, with regard to balances and deviation, we will deal with the relatedness and proportion of these parts to the content. "Calendars" is among the best-known ghazals of Aminpour. It contains the issues of feeling ashamed of martyrs, lamenting over our negligence, and feeling left away from the martyrs. A special sense of grief and sadness, hidden in the particular rhythm and cadence of the words and combinations, helps the poet to express his feelings and thoughts. The ghazal "Destiny" deals also with the distress and exhaustion the poet feels

  5. "If You Have to Ask, You'll Never Know": Effects of Specialised Stylistic Expertise on Predictive Processing of Music

    OpenAIRE

    Hansen, Niels Chr.; Vuust, Peter; Pearce, Marcus

    2016-01-01

    Musical expertise entails meticulous stylistic specialisation and enculturation. Even so, research on musical training effects has focused on generalised comparisons between musicians and non-musicians, and cross-cultural work addressing specialised expertise has traded cultural specificity and sensitivity for other methodological limitations. This study aimed to experimentally dissociate the effects of specialised stylistic training and general musical expertise on the perception of melodies...

  6. I will proclaim myself what I am : corpus stylistics and the language of Shakespeare’s soliloquies

    OpenAIRE

    Murphy, Sean Edward

    2015-01-01

    This article reports on a corpus stylistic study of the language of soliloquies in Shakespeare’s plays. Literary corpus stylistics can use corpus linguistic methods to test claims made by literary critics and identify hitherto unnoticed features. Existing literary studies of soliloquies tend to define and classify them, to trace the history of the form or to offer literary appreciation; yet they pay surprisingly little attention to the language which characterises soliloquies. By creating a s...

  7. Probing the topological properties of complex networks modeling short written texts.

    Directory of Open Access Journals (Sweden)

    Diego R Amancio

    Full Text Available In recent years, graph theory has been widely employed to probe several language properties. More specifically, the so-called word adjacency model has been proven useful for tackling several practical problems, especially those relying on textual stylistic analysis. The most common approach to treat texts as networks has simply considered either large pieces of texts or entire books. This approach has certainly worked well, and many informative discoveries have been made this way, but it raises an uncomfortable question: could there be important topological patterns in small pieces of texts? To address this problem, the topological properties of subtexts sampled from entire books were probed. Statistical analyses performed on a dataset comprising 50 novels revealed that most of the traditional topological measurements are stable for short subtexts. When the performance of the authorship recognition task was analyzed, it was found that a proper sampling yields a discriminability similar to the one found with full texts. Surprisingly, the support vector machine classification based on the characterization of short texts outperformed the one performed with entire books. These findings suggest that a local topological analysis of large documents might improve its global characterization. Most importantly, it was verified, as a proof of principle, that short texts can be analyzed with the methods and concepts of complex networks. As a consequence, the techniques described here can be extended in a straightforward fashion to analyze texts as time-varying complex networks.
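
    The word adjacency model mentioned in the record above links words that occur next to each other in a text. The sketch below builds such a network and computes two standard topological measurements, degree and local clustering coefficient. It is a simplified illustration under stated assumptions: the graph is undirected and unweighted, no stopword removal or lemmatization is applied, and the sample sentence is invented. The abstract does not say which preprocessing or measurements the study itself used.

```python
from collections import defaultdict


def word_adjacency_network(text):
    """Link each pair of consecutive words with an undirected edge."""
    words = [w.strip('.,;:!?"()').lower() for w in text.split()]
    words = [w for w in words if w]
    graph = defaultdict(set)
    for a, b in zip(words, words[1:]):
        if a != b:  # ignore self-loops from repeated words
            graph[a].add(b)
            graph[b].add(a)
    return dict(graph)


def degree(graph, node):
    """Number of distinct neighbours of a word."""
    return len(graph.get(node, set()))


def clustering_coefficient(graph, node):
    """Fraction of a word's neighbour pairs that are themselves linked."""
    nbrs = sorted(graph.get(node, set()))
    k = len(nbrs)
    if k < 2:
        return 0.0
    links = sum(
        1
        for i in range(k)
        for j in range(i + 1, k)
        if nbrs[j] in graph[nbrs[i]]
    )
    return 2.0 * links / (k * (k - 1))


# Invented sample text, used only to exercise the functions.
net = word_adjacency_network("The cat saw the dog and the cat ran.")
print(degree(net, "the"), round(clustering_coefficient(net, "the"), 3))
```

    Sampling short subtexts, as the study describes, would amount to calling `word_adjacency_network` on slices of a longer text and comparing the resulting measurements.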

  8. English for Science and Technology - Stylistics and Methods

    DEFF Research Database (Denmark)

    Mousten, Birthe

    The book covers basic methods for summarizing and editing of EST writing (English for Science and Technology). In addition, translation of basically technically oriented texts is covered with a view to an evaluation of formality, complexity and audience recognition in connection with different text...

  9. The Language of Satire: An Exploration of Stylistic Devices in ...

    African Journals Online (AJOL)

    Lwati: A Journal of Contemporary Research, Vol 7, No 2 (2010).

  10. A Corpus-based Stylistic Analysis of Body-Soul and Heaviness-Lightness Metaphors in Kundera's Novel The Unbearable Lightness of Being

    Directory of Open Access Journals (Sweden)

    Khalid Shakir Hussein

    2015-10-01

    Full Text Available This paper represents an attempt to conduct a corpus-based stylistic analysis of  two conceptual metaphors in The Unbearable Lightness of Being, which is a novel written by Milan Kundera. Soul-body and lightness-heaviness metaphors are foregrounded as being central themes all through the novel. The way such metaphors are used in the novel indicates an insightful employment of metaphor as a cognitive tool which empowers language users with a capacity of conceptualizing different experiences. The researcher adopts conceptual metaphor theory to produce a sort of conceptual analysis incorporating Leech's semantic componential analysis within the overall analytic procedure. Different techniques are figured out in relation to the creative ways of manipulating the cognitive level of language, such as conceptual switching, conceptual extension, and conceptual fusion. These creative techniques are carefully used in the novel under investigation with different ranges of metaphorical creativity. Conceptual switching might be simple but very active in deviating from the conventional conceptual system. Conceptual extension marks certain minute elaborations conventional metaphors undergo extending the limits of cognitive conceptualization. As for conceptual fusion, it proves to be interestingly powerful in producing certain aggregations of metaphorical mappings. Keywords: Conceptual Metaphor Theory, Metaphorical Creativity, Metaphorical Mappings, Corpus Stylistics

  11. The stylistic ordering of English modifiers | Sam | International ...

    African Journals Online (AJOL)

    International Journal of Pedagogy, Policy and ICT in Education, Vol 4, No 1 (2014).

  12. Universally Designed Text on the Web: Towards Readability Criteria Based on Anti-Patterns.

    Science.gov (United States)

    Eika, Evelyn

    2016-01-01

    The readability of web texts affects accessibility. The Web Content Accessibility Guidelines (WCAG) state that the recommended reading level should match that of someone who has completed basic schooling. However, WCAG does not give advice on what constitutes an appropriate reading level. Web authors need tools to help compose WCAG-compliant texts, and specific criteria are needed. Classic readability metrics are generally based on lengths of words and sentences and have been criticized for being over-simplistic. Automatic measures and classifications of texts' reading levels employing more advanced constructs remain an unresolved problem. If such measures were feasible, what should these be? This work examines three language constructs not captured by current readability indices but believed to significantly affect actual readability, namely, relative clauses, garden path sentences, and left-branching structures. The goal is to see whether quantifications of these stylistic features reflect readability and how they correspond to common readability measures. Manual assessments of a set of authentic web texts for such uses were conducted. The results reveal that texts related to narratives such as children's stories, which are given the highest readability value, do not contain these constructs. The structures in question occur more frequently in expository texts that aim at educating or disseminating information such as strategy and journal articles. The results suggest that language anti-patterns hold potential for establishing a set of deeper readability criteria.
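
    To make concrete what the record above means by classic metrics based on word and sentence length, the sketch below computes the Automated Readability Index (ARI), which uses only characters per word and words per sentence. ARI is chosen here as one example of the family; the abstract does not say which indices the study compared against. The regular-expression tokenizer is a crude assumption and the sample texts are invented.

```python
import re


def automated_readability_index(text):
    """ARI = 4.71 * (chars / words) + 0.5 * (words / sentences) - 21.43."""
    # Crude segmentation: sentences end at ., ! or ?; words are alphanumeric runs.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z0-9]+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one word and one sentence")
    chars = sum(len(w) for w in words)
    return 4.71 * chars / len(words) + 0.5 * len(words) / len(sentences) - 21.43


# Invented sample texts: short narrative style versus long expository style.
simple = "The cat sat. The dog ran."
dense = "Organizations disseminating strategic information frequently employ complicated subordinate constructions."
print(round(automated_readability_index(simple), 2))
print(round(automated_readability_index(dense), 2))
```

    The index sees nothing of syntax: a garden path sentence built from short words would score as easy, which is the kind of gap the anti-pattern approach in the abstract is meant to address.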

  13. LANGUAGE STYLE OF HABIBURRAHMAN EL-SHIRAZY IN THE DWILOGY OF AYAT-AYAT CINTA: A STYLISTIC STUDY

    Directory of Open Access Journals (Sweden)

    Aflahah Aflahah

    2017-05-01

    Full Text Available A novel is an artwork that closely relates to human life and is considered a representation of the human life journey. The language style in a novel is the embodiment of an author’s use of language to express ideas, emotions, and opinions, and to create a certain effect. The main problem discussed in this study is the language style found in the novels AAC 1 and 2. The study concerns individual language style, namely that of Habiburrahman El-Shirazy (HES), an author who has written best-selling novels. The approach considered most appropriate for understanding HES’s use of language in the Dwilogy of Ayat-Ayat Cinta (DAAC) is a stylistic study, which reveals how HES uses language style and what effects result. The research is descriptive and qualitative. The descriptive method is used to describe linguistic facts such as language style based on lexical choice, sentence structure, and direct or indirect meaning. The language style based on lexical choice (diction) in DAAC shows the typical language style of HES as well as his ability as a Da’i and man of letters. The analysis of language style based on sentence structure and direct/indirect meaning illustrates the descriptive style of HES: whatever he describes is rendered very accurately. He gives very detailed information about the object being discussed, whether the background of the story or the characterization. Through language style, HES describes the emotions experienced by the characters properly, so that readers are able to empathize. The data obtained reveal that HES uses language style to describe characters or characterization (physical condition, character, and characteristics), to describe the background, to tell the plot, and to convey the message. The results showed the lexical choice of scientific words and religious words are typical of HES’s language style in his DAAC, the

  14. Slogans and meanings – a grammatical, semantic and stylistic analysis

    Directory of Open Access Journals (Sweden)

    Marinela Doina NISTEA

    2012-01-01

    Full Text Available According to DEX, explanatory dictionary of the Romanian language, published by the Institute of Linguistics of the Romanian Academy, a slogan is “an eminent, compelling phrasing expressing in a lapidary manner the political or economic aims of groups, organizations, etc., a topical issue” or “a battle cry of old Scottish clans”. Extrapolating, one may say that, nowadays, a product’s slogan is equivalent to the old battle cry: it is defined by concision and memorability, it has a precise purpose and emphasizes the offerer’s personality. Its efficiency for advertising is undeniable. The role of a slogan is to evoke the message in the most focused possible wording, meant to draw the addressee’s attention, even to surprise him/her, getting them to remember the idea suggested, regardless of whether it concerns a certain product offered for sale, or it regards a candidate, a political program or an ideology.

  15. Study children's literature by comparative stylistics approach (Poetry book in Ahmed Shoghi, and Mohammed Alhravy and Abbas Yamini Sharif model)

    Directory of Open Access Journals (Sweden)

    salah addin abdi

    2014-12-01

    Full Text Available Stylistics is disclosure laws creativity in literary discourse structure. And case under limitation in the idea of interdependence between the texts and to look at the texts in interdependence to only Angle comparison especially as were texts between different languages and what it was limited in literary texts turned out to be working round will be the technical side of any aesthetic. This literature poets any three Ahmed Shawki and Mohamed Hrawi Egyptians and Iranian poet Abbas Yamini Sharif appear in children's literature be texture, beautiful word baptism imagination and purpose of the enjoyment of the small receiver and educated and refined. This literature poets look alike poets sing one topic in their hair for children is "Alktab" models of pedagogy in their poets be of standard noodles in a level voice, coordination and harmony between the internal and external music. And role of repetition with different check rhythm music and show a sense of psychological and the emphasis on meaning. Be guaranteed their poets raising a child and learned, passion is pride and love of science and learning. The imagination in the poet be kind of Yamini imagination Altaleva in while an innovative Shawki’s imagination and fantasy Hrawi graph. The study of each relying on poetic texts for all three of them and the energy of poetic language and its technical and creative aesthetic and stylistic comparison methodology which is lean on comparison mainly her and emerged from the comparison in two languages literary two would different Comparative Literature but of its focus on language and style.

  16. Stylistics of Khaje Abd ol-Lah Ansari’s Epistles

    Directory of Open Access Journals (Sweden)

    Azadeh Poode

    2014-07-01

    Full Text Available Stylistics is a field of knowledge that has received particular attention from writers and speakers. A writer's literary style shows how the speaker's thought is expressed and is the key to how his speech conveys meaning to the addressee's mind. The types of words, structures, and sentences and the way of interpreting meaning are elements of literary style. The influence of Khaje Abd ol-Lah Ansari's words on Sufi didactic literature is well known among literary scholars. The integration of deeply mystical concepts with an eloquent style in song has made Khaje's works eternal, so stylistic research on his works can bring out the fine points of their literary aesthetics. Besides Al-Sufi categories, his five epistles are the most preferred among his works, and they are described in this study. Khaje Abd ol-Lah Ansari is among the celebrated figures of mysticism and Persian literature; besides his numerous writings, he merits analysis in this respect in order to study his methods in writing his Sufi didactic works, especially the five epistles Kanz ol-Salekin, Vareda'at, Del va Jan, Haft Hesar, and Ghalandar nama, and the pattern they provide for later works. At the lexical level, Khaje Abd ol-Lah's style does not have a manifest feature. His only lexical feature is repetition, which is seen at three levels: letter, word, and sentence. Sometimes he repeats a word in several consecutive sentences and even across several pages. Old words and sounds are very few in the epistles; this simplicity of language relates to the addressees, who are common people, and to the didactic nature of his works. At the linguistic and literary level, the epistles are closer to the second period of Persian prose than to the first, and there is no sign of archaism in these works. Arabic words are used moderately, and most of them are used in their modern sense. At the syntactic level, he has used prefix verbs, especially "Dar" and

  17. Prominent stylistic aspects in music of Nāser Khosrow's poetry

    Directory of Open Access Journals (Sweden)

    Morteza Mohseni

    2017-04-01

    Full Text Available It is axiomatic for those who are even slightly familiar with the stylistic periods of Persian poetry that the qasidas of Nāser-e Khosrow are totally different from those of the other poets of the fifth lunar century, both in content and in technique. This difference is seen even in those areas of his poetry in which the poet is confined in making innovations. This paper investigates Nāser-e Khosrow's style in the field of the music of poetry. It also aims to identify his stylistic differences from his most famous contemporaries (Onsori, Farrokhi and Manoochehri) in the external, lateral and internal areas. As the first step, all of Nāser Khosrow's poems, except for the additions section, were taken as the scope of this study, together with the main parts of the divans (poetical works) of the other three poets, amounting to almost 23600 distiches. In the second stage, each poet's divan was separately scrutinized in the three fields of the music of poetry. The frequency of each case was recorded, and after comparing the statistics for Nāser-e Khosrow's poems with those of the other three poets, the stylistic characteristics of his poetry were explored. Statistical information on the poets was recorded in a table, and parts of it were shown in a bar graph. It should be noted that the researcher considered the two items of innovation and frequency in all phases of the study. Most of the studies done in the field of Nāser-e Khosrow's music of poetry have investigated the prosody and meter of his poetry. Most of these studies considered difficulty and relevancy to be the important prosodic characteristics of his poetry. Regarding the lateral and internal areas of Nāser-e Khosrow's poetry, not much argument has been proposed except for the use of difficult rhymes and nominal radifs (for the lateral music of poetry) and attending to figures of

  18. PSYCHOLOGICAL FACTORS OF STYLISTIC FEATURES MANIFESTATION IN PEDAGOGICAL COMMUNICATION

    Directory of Open Access Journals (Sweden)

    I V Arendachuk

    2016-12-01

    Full Text Available The article deals with contemporary theoretical approaches to defining such concepts as “pedagogical communication” and “the style of pedagogical communication”, and reveals the substantial features of pedagogical communication in the system of teachers’ attitudes to their students, their professional work and themselves. The article presents the results of empirical research conducted on a sample of Saratov secondary school teachers (n = 67; average age 42 years; average length of service 18 years), intended to identify the relationships between the style of pedagogical communication and its psychological factors (personal and practical). We used methods for determining the style of interpersonal interaction (S.V. Maksimov, Yu.A. Lobeiko), diagnosing the dominant emotional modality in teachers (L.A. Rabinovich, in T.G. Syritso’s modification), evaluating satisfaction with the teaching profession and identifying crisis factors and factors of overcoming the professional crisis of teachers (O.M. Chorosova, R.E. Gerasimova), and the method of “psychological portrait of a teacher” (Z.V. Rezapkina, G.V. Rezapkina). The article demonstrates that the more constructive the style of pedagogical communication is, the larger the number of psychological factors it has, and that the system of these factors is dominated by those that characterize the individual features of the teacher’s personality.

  19. Mind-modelling with corpus stylistics in David Copperfield.

    Science.gov (United States)

    Stockwell, Peter; Mahlberg, Michaela

    2015-05-01

    We suggest an innovative approach to literary discourse by using corpus linguistic methods to address research questions from cognitive poetics. In this article, we focus on the way that readers engage in mind-modelling in the process of characterisation. The article sets out our cognitive poetic model of characterisation that emphasises the continuity between literary characterisation and real-life human relationships. The model also aims to deal with the modelling of the author's mind in line with the modelling of the minds of fictional characters. Crucially, our approach to mind-modelling is text-driven. Therefore we are able to employ corpus linguistic techniques systematically to identify textual patterns that function as cues triggering character information. In this article, we explore our understanding of mind-modelling through the characterisation of Mr. Dick from David Copperfield by Charles Dickens. Using the CLiC tool (Corpus Linguistics in Cheshire) developed for the exploration of 19th-century fiction, we investigate the textual traces in non-quotations around this character, in order to draw out the techniques of characterisation other than speech presentation. We show that Mr. Dick is a thematically and authorially significant character in the novel, and we move towards a rigorous account of the reader's modelling of authorial intention.

  20. Floral foregrounding: A corpus-assisted, cognitive stylistic study of the foregrounding of flowers in Mrs Dalloway

    DEFF Research Database (Denmark)

    Jensen, Marie Møller; Lottrup, Katrine; Nordentoft, Signe

    2018-01-01

    The study reported here combines quantitative and qualitative methods from both cognitive stylistics and corpus stylistics to analyze the flower-motif in Virginia Woolf’s novel Mrs Dalloway. The quantitative analysis compared the frequency of flower lemmas in the novel to both a reference corpus...... consisting of Woolf’s other works as well as a general corpus (the BNC). The analysis found significant differences between the frequencies in the novel and both corpora. The qualitative analysis is based on the statistically significant results and considers cognitive entrenchment and salience...... in relation to these. Furthermore, the analysis also links these two notions to different types of foregrounding as conceptualized in stylistics proper. Finally, aspects of repetition, parallelism and symbolism in relation to the flower-motif are considered. In conclusion, it is found that the flower...
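
    A frequency comparison of this kind can be illustrated with a keyness statistic. The abstract does not say which test the authors used, so the sketch below assumes the log-likelihood (G2) statistic that is common in corpus stylistics; the function name and all counts are hypothetical and are not taken from the study.

```python
import math


def log_likelihood(freq_study, size_study, freq_ref, size_ref):
    """Log-likelihood (G2) keyness of one lemma in a study corpus vs. a reference corpus.

    freq_* are occurrence counts of the lemma, size_* are corpus sizes in tokens.
    All values passed in below are made-up illustration data.
    """
    total_freq = freq_study + freq_ref
    total_size = size_study + size_ref
    # Expected counts if the lemma were equally frequent in both corpora.
    expected_study = size_study * total_freq / total_size
    expected_ref = size_ref * total_freq / total_size
    g2 = 0.0
    for observed, expected in ((freq_study, expected_study), (freq_ref, expected_ref)):
        if observed > 0:  # a zero count contributes nothing to the sum
            g2 += observed * math.log(observed / expected)
    return 2 * g2


# Hypothetical counts: a lemma occurring 50 times in a 1,000-token text
# and 100 times in a 10,000-token reference corpus.
score = log_likelihood(50, 1_000, 100, 10_000)
print(f"G2 = {score:.2f}")
```

    With one degree of freedom, a G2 value above 3.84 corresponds to p < 0.05, which is the usual threshold for calling a frequency difference significant.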

  1. Real-time stylistic prediction for whole-body human motions.

    Science.gov (United States)

    Matsubara, Takamitsu; Hyon, Sang-Ho; Morimoto, Jun

    2012-01-01

    The ability to predict human motion is crucial in several contexts such as human tracking by computer vision and the synthesis of human-like computer graphics. Previous work has focused on off-line processes with well-segmented data; however, many applications such as robotics require real-time control with efficient computation. In this paper, we propose a novel approach called real-time stylistic prediction for whole-body human motions to satisfy these requirements. This approach uses a novel generative model to represent a whole-body human motion including rhythmic motion (e.g., walking) and discrete motion (e.g., jumping). The generative model is composed of a low-dimensional state (phase) dynamics and a two-factor observation model, allowing it to capture the diversity of motion styles in humans. A real-time adaptation algorithm was derived to estimate both state variables and style parameter of the model from non-stationary unlabeled sequential observations. Moreover, with a simple modification, the algorithm allows real-time adaptation even from incomplete (partial) observations. Based on the estimated state and style, a future motion sequence can be accurately predicted. In our implementation, it takes less than 15 ms for both adaptation and prediction at each observation. Our real-time stylistic prediction was evaluated for human walking, running, and jumping behaviors. Copyright © 2011 Elsevier Ltd. All rights reserved.

  2. Translation of Lexical Stylistic Devices from English to Chinese in Commercial Advertisements

    Institute of Scientific and Technical Information of China (English)

    林鑫

    2014-01-01

    With the rapid development of China, a growing number of foreign products are entering the Chinese market. An excellent translation of a product’s advertisement from English to Chinese undoubtedly contributes to its successful promotion in the Chinese market. Although translation practice involves multiple difficulties, the translation of lexical stylistic devices is a particularly big challenge for translators. This is not simply because lexical stylistic devices are diverse in form, but also because most devices involve linguistic and cultural differences between English and Chinese. This thesis analyzed a number of current English to Chinese translations of the devices in commercial advertisements, which mainly come from two translation scholars’ works and the official websites of world-known brands. By analyzing the selected data, seven translation strategies are found to be the major translation strategies in this respect, namely literal translation, free translation, flexible translation, extended translation, adaptation translation, compensation translation and amplification translation strategies. Moreover, a number of linguistic and cultural issues which need to be considered by translators are also illustrated here.

  3. Linguistic and Cognitive Characteristics of the Composition of the Text of J. K. Rowling's English Tales

    Science.gov (United States)

    Solodova, Elena

    2015-01-01

    This article focuses on linguistic and cognitive characteristics inherent in the composition of the English postmodern tales written by J.K. Rowling. The composition of the text is viewed as linguistic and cognitive construal that integrates compositional plot structure, compositional meaning structure, linguistic and stylistic means of their…

  4. Stylistic features of case reports as a genre of medical discourse.

    Science.gov (United States)

    Lysanets, Yuliia; Morokhovets, Halyna; Bieliaieva, Olena

    2017-03-13

    The present paper discusses the lexical and grammatical peculiarities of English language medical case reports, taking into account their communicative purposes and intentions. The objective of the research is to clarify the principal mechanisms of producing an effective English language medical case report and thus to provide recommendations and guidelines for medical professionals who will deal with this genre. The analysis of medical case reports will largely focus on the most significant linguistic peculiarities, such as the use of active and passive voice, the choice of particular verb tenses, and pronouns. The selected medical case reports will be considered using methods of lexico-grammatical analysis, quantitative examination, and contextual, structural, narrative, and stylistic analyses. The research revealed a range of important stylistic features of medical case reports which markedly distinguish them from other genres of medical scientific writing: educational and instructive intentions, conciseness and brevity, direct and personal tone, and material presented in a narrative style. The present research has shown that the communicative strategies of the analyzed discourse, mentioned immediately above, are effectively implemented by means of specific lexical units and grammatical structures: the dominance of active voice sentences, past simple tense, personal pronouns, and modal verbs. The research has also detected the occasional use of the present perfect, present simple, and future simple tenses and passive voice which also serve particular communicative purposes of medical case reports. Medical case reports possess a range of unique characteristics which differ from those of research articles and other scientific genres within the framework of written medical discourse. It is to be emphasized that it is highly important for medical professionals to master the major stylistic principles and communicative intentions of medical case report as a genre in

  5. "If You Have to Ask, You'll Never Know": Effects of Specialised Stylistic Expertise on Predictive Processing of Music.

    Science.gov (United States)

    Hansen, Niels Chr; Vuust, Peter; Pearce, Marcus

    2016-01-01

    Musical expertise entails meticulous stylistic specialisation and enculturation. Even so, research on musical training effects has focused on generalised comparisons between musicians and non-musicians, and cross-cultural work addressing specialised expertise has traded cultural specificity and sensitivity for other methodological limitations. This study aimed to experimentally dissociate the effects of specialised stylistic training and general musical expertise on the perception of melodies. Non-musicians and professional musicians specialising in classical music or jazz listened to sampled renditions of saxophone solos improvised by Charlie Parker in the bebop style. Ratings of explicit uncertainty and expectedness for different continuations of each melodic excerpt were collected. An information-theoretic model of expectation enabled selection of stimuli affording highly certain continuations in the bebop style, but highly uncertain continuations in the context of general tonal expectations, and vice versa. The results showed that expert musicians have acquired probabilistic characteristics of music influencing their experience of expectedness and predictive uncertainty. While classical musicians had internalised key aspects of the bebop style implicitly, only jazz musicians' explicit uncertainty ratings reflected the computational estimates, and jazz-specific expertise modulated the relationship between explicit and inferred uncertainty data. In spite of this, there was no evidence that non-musicians and classical musicians used a stylistically irrelevant cognitive model of general tonal music, providing support for the theory of cognitive firewalls between stylistic models in predictive processing of music.
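
    The notions of predictive uncertainty and expectedness can be made concrete with two standard information-theoretic quantities. The sketch below assumes that uncertainty is measured as the Shannon entropy of a distribution over possible next notes and that the unexpectedness of a continuation is its information content; the abstract does not specify the model's internals, and the two distributions are invented for illustration.

```python
import math


def entropy(distribution):
    """Shannon entropy in bits: higher means a less certain prediction."""
    return -sum(p * math.log2(p) for p in distribution.values() if p > 0)


def information_content(distribution, event):
    """Surprisal in bits of one continuation: higher means less expected."""
    return -math.log2(distribution[event])


# Hypothetical next-note distributions for the same melodic context under
# two stylistic models (the probabilities are made up for this example).
style_a = {"C": 0.85, "D": 0.05, "E": 0.05, "G": 0.05}
style_b = {"C": 0.25, "D": 0.25, "E": 0.25, "G": 0.25}

for name, dist in (("style A", style_a), ("style B", style_b)):
    print(name, "entropy:", round(entropy(dist), 3),
          "surprisal of C:", round(information_content(dist, "C"), 3))
```

    A context like this one, with low entropy under one model and high entropy under the other, is the kind of stimulus that would afford a highly certain continuation in one style and a highly uncertain one in another.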

  6. Impact of Stylistic Features, Architectural and Urban Rules of the Algiers Architectural Heritage Dating Between 1830 and 1930 on the Strength of its Buildings during the Earthquake

    Science.gov (United States)

    Souami, M. A.

    2013-07-01

    In another work, we highlighted, from a theoretical point of view, that there is a relation between earthquake-resistant architectural design codes and the urban and stylistic characteristics of the buildings and urban forms of the Algiers architectural heritage dating between 1830 and 1930. Following this, we hypothesized that its various stylistic and urban characteristics have a direct impact on the resilience of buildings to earthquakes. The purpose of this article is to test, through computer simulation of examples of some stylistic and urban characteristics, whether or not our hypothesis is valid.

  7. Inductive inference for large scale text classification

    OpenAIRE

    Silva, Catarina Helena Branco Simões da

    2009-01-01

    Doctoral thesis in Informatics Engineering presented to the Faculty of Sciences and Technology of the University of Coimbra. In recent decades the availability and importance of texts in digital format have been increasing exponentially, and such texts are now present in almost every aspect of modern life. Text classification is therefore an active area of research, justified by many real-world applications. Even so, dealing with the overload of texts in digital format and...

  8. Text Mining.

    Science.gov (United States)

    Trybula, Walter J.

    1999-01-01

    Reviews the state of research in text mining, focusing on newer developments. The intent is to describe the disparate investigations currently included under the term text mining and provide a cohesive structure for these efforts. A summary of research identifies key organizations responsible for pushing the development of text mining. A section…

  9. Sounding Sacred: The Adoption of Biblical Archaisms in the Book of Mormon and Other 19th Century Texts

    Science.gov (United States)

    Bowen, Gregory A.

    2016-01-01

    The Book of Mormon is a text published in 1830 and considered a sacred work of scripture by adherents of the Latter-day Saint movement. Although written 200 years later, it exhibits many linguistic features of the King James translation of the Bible. Such stylistic imitation has been little studied, though a notable exception is Sigelman &…

  10. Representativeness in corpora of literary texts: introducing the C18P project

    Directory of Open Access Journals (Sweden)

    Gemeinböck, Iris

    2016-07-01

    Full Text Available Currently there are very few specialised corpora of literary texts that are tailored to the needs of literary critics who are interested in corpus stylistic analyses of prose fiction. Many existing corpora including literary texts were compiled for linguistic research interests and are often unsuitable for corpus stylistic purposes. The paper addresses three of the main problems: the absence of labelling of the texts for literary genre, the use of extracts, and the prevalence of linguistic periodisation schemes. C18P is a corpus of prose fiction designed specifically to address these issues. It traces the early development of the novel from 1700 up until the Victorian era. It can, for instance, be used for an analysis of the characteristic linguistic features of individual literary genres and forms. The following paper introduces the design of the corpus as well as some of its potential uses.

  11. Retrogressive harmonic motion as structural and stylistic characteristic of pop-rock music

    Science.gov (United States)

    Carter, Paul S.

    The central issue addressed in this dissertation is that of progressive and retrogressive harmonic motion as it is utilized in the repertoire of pop-rock music. I believe that analysis in these terms may prove to be a valuable tool for the understanding of the structure, style and perception of this music. Throughout my study of this music, various patterns of progressive and retrogressive harmonic motions within a piece reveal a kind of musical character about it, a character on which much of a work's style, organization and extramusical nature often depends. Several influential theorists, especially Jean-Philippe Rameau, Hugo Riemann, and Arnold Schoenberg, have addressed the issues of functional harmony and the nature of the motion between chords of a tonal harmonic space. After assessing these views, I have found that it is possible to differentiate between two fundamental types of harmonic motions. This difference, one that I believe is instrumental in characterizing pop-rock music, is the basis for the analytical perspective I wish to embrace. After establishing a method of evaluating tonal harmonic root motions in these terms, I wish to examine a corpus of this music in order to discover what a characterization of its harmonic motion may reveal about each piece. Determining this harmonic character may help to establish structural and stylistic traits for that piece, its genre, composer, period, or even its sociological purpose. Conclusions may then be drawn regarding the role these patterns play in defining musical style traits of pop-rock. Partly as a tool for serving the study mentioned above, I develop a graphical method of accounting for root motion that I name the tonal "Space-Plot". This apparatus allows the analyst to measure several facets of the harmonic motion of the music, and to see a wide scope of relations in and around a diatonic key.

  12. Text World Theory and real world readers: From literature to life in a Belfast prison.

    Science.gov (United States)

    Canning, Patricia

    2017-05-01

    Cognitive stylistics offers a range of frameworks for understanding (amongst other things) what producers of literary texts 'do' with language and how they 'do' it. Less prevalent, however, is an understanding of the ways in which these same frameworks offer insights into what readers 'do' (and how they 'do' it). Text World Theory (Werth, 1999; Gavins, 2007; Whiteley, 2011) has proved useful for understanding how and why readers construct mental representations engendered by the act of reading. However, research on readers' responses to literature has largely focused on an 'idealised' reader or an 'experimental' subject-reader often derived from within the academy and conducted using contrived or amended literary fiction. Moreover, the format of traditional book groups (participants read texts privately and discuss them at a later date) as well as online community forums such as Goodreads, means that such studies derive data from post-hoc, rather than real-time textual encounters and discussions. The current study is the first of its kind in analysing real-time reading contexts with real readers during a researcher-led literary project ('read.live.learn') in Northern Ireland's only female prison. In doing so, the study is unique in addressing experimental and post hoc bias. Using Text World Theory, the paper considers the personal and social impact of reader engagement in the talk of the participants. As such, it has three interrelated aims: to argue for the social and personal benefits of reading stylistically rich literature in real-time reading groups; to demonstrate the efficacy of stylistics for understanding how those benefits come about, and to demonstrate the inter-disciplinary value of stylistics, particularly its potential for traversing traditional research parameters.

  13. The Use of Systemic-Functional Linguistics in Automated Text Mining

    Science.gov (United States)

    2009-03-01

    what degree two or more documents are similar in terms of their meaning. Simply put, such a cognitive model aims to link the physical manifestation...These features, both in terms of frequency and their chaining across a text, were taken as salient stylistic features that had a direct relationship to...because SFL attempts to model these cognitive processes, this has the potential to improve NLP tasks by making them more ’human-like’. Secondly

  14. Classifying Classifications

    DEFF Research Database (Denmark)

    Debus, Michael S.

    2017-01-01

    This paper critically analyzes seventeen game classifications. The classifications were chosen on the basis of diversity, ranging from pre-digital classification (e.g. Murray 1952), over game studies classifications (e.g. Elverdam & Aarseth 2007) to classifications of drinking games (e.g. LaBrie et...... al. 2013). The analysis aims at three goals: The classifications’ internal consistency, the abstraction of classification criteria and the identification of differences in classification across fields and/or time. Especially the abstraction of classification criteria can be used in future endeavors...... into the topic of game classifications....

  15. Creative and Stylistic Devices Employed by Children During a Storybook Narrative Task: A Cross-Cultural Study

    Science.gov (United States)

    Gorman, Brenda K.; Fiestas, Christine E.; Peña, Elizabeth D.; Clark, Maya Reynolds

    2018-01-01

    Purpose: The purpose of this study was to analyze the effects of culture on the creative and stylistic features children employ when producing narratives based on wordless picture books. Method: Participants included 60 first- and second-grade African American, Latino American, and Caucasian children. A subset of narratives based on wordless picture books collected as part of a larger study was coded and analyzed for the following creative and stylistic conventions: organizational style (topic centered, linear, cyclical), dialogue (direct, indirect), reference to character relationships (nature, naming, conduct), embellishment (fantasy, suspense, conflict), and paralinguistic devices (expressive sounds, exclamatory utterances). Results: Many similarities and differences between ethnic groups were found. No significant differences were found between ethnic groups in organizational style or use of paralinguistic devices. African American children included more fantasy in their stories, Latino children named their characters more often, and Caucasian children made more references to the nature of character relationships. Conclusion: Even within the context of a highly structured narrative task based on wordless picture books, culture influences children’s production of narratives. Enhanced understanding of narrative structure, creativity, and style is necessary to provide ecologically valid narrative assessment and intervention for children from diverse cultural backgrounds. PMID: 21278258

  16. Monolingual accounting dictionaries for EFL text production

    Directory of Open Access Journals (Sweden)

    Sandro Nielsen

    2006-10-01

    Full Text Available Monolingual accounting dictionaries are important for producing financial reporting texts in English in an international setting, because of the lack of specialised bilingual dictionaries. As the intended user groups have different factual and linguistic competences, they require specific types of information. By identifying and analysing the users' factual and linguistic competences, user needs, use-situations and the stages involved in producing accounting texts in English as a foreign language, lexicographers will have a sound basis for designing the optimal English accounting dictionary for EFL text production. The monolingual accounting dictionary needs to include information about UK, US and international accounting terms, their grammatical properties, their potential for being combined with other words in collocations, phrases and sentences in order to meet user requirements. Data items that deal with these aspects are necessary for the international user group as they produce subject-field specific and register-specific texts in a foreign language, and the data items are relevant for the various stages in text production: draft writing, copyediting, stylistic editing and proofreading.

  17. Science and Technology Text Mining Basic Concepts

    National Research Council Canada - National Science Library

    Losiewicz, Paul

    2003-01-01

    ...). It then presents some of the most widely used data and text mining techniques, including clustering and classification methods, such as nearest neighbor, relational learning models, and genetic...

  18. Managing interactions between technological and stylistic innovation in the media industries, insights from the introduction of ebook technology in the publishing industry

    NARCIS (Netherlands)

    T.S. Schweizer (Sophie)

    2002-01-01

    The mainstream of innovation research pays a lot of attention to technological innovation, but has neglected its interaction with another type of innovation, which is particularly important in sectors like the furniture, fashion and media content industries: stylistic innovation.

  19. Classification of Flotation Frothers

    Directory of Open Access Journals (Sweden)

    Jan Drzymala

    2018-02-01

    Full Text Available In this paper, a scheme for classifying flotation frothers is presented. The scheme first identifies the physical system in which a frother is present; four such systems are distinguished: pure state, aqueous solution, aqueous solution/gas system and aqueous solution/gas/solid system. As a result, there are numerous classifications of flotation frothers, which can be organized into the scheme described in detail in this paper. The paper concludes that a meaningful classification of frothers relies on choosing the physical system and then the feature, trend, parameter or parameters according to which the classification is performed. The proposed classification can play a useful role in the characterization and evaluation of flotation frothers.

  20. Phraseosemantic peculiarities of idioms with the word «silki» (snares) (a case study of Russian classics and modern literature texts)

    Directory of Open Access Journals (Sweden)

    Andrianova D.A.

    2017-03-01

    Full Text Available This article explores changes in the semantic and stylistic meaning of idioms with the word “silki” (snares) during the XVIII–XXI centuries on the basis of Russian classics, modern literature texts and publicistic writing. It is shown that the word “silki” (snares) was used as a biblical expression in ecclesiastic and some fiction texts; this explains its strong negative connotation, which is out of use in up-to-date contexts.

  1. Information gathering for CLP classification

    Directory of Open Access Journals (Sweden)

    Ida Marcello

    2011-01-01

    Full Text Available Regulation 1272/2008 includes provisions for two types of classification: harmonised classification and self-classification. The harmonised classification of substances is decided at Community level, and a list of harmonised classifications is included in Annex VI of the classification, labelling and packaging Regulation (CLP). If a chemical substance is not included in the harmonised classification list, it must be self-classified, based on available information, according to the requirements of Annex I of the CLP Regulation. CLP stipulates that harmonised classification will be performed for carcinogenic, mutagenic or toxic to reproduction substances (CMR substances) and for respiratory sensitisers category 1, and for other hazard classes on a case-by-case basis. The first step of classification is the gathering of available and relevant information. This paper presents the procedure for gathering information and obtaining data. Data quality is also discussed.

  2. Türkçe Ders Kitaplarında Yer Alan Metinlerin Edimsel Sınıflandırılması / Pragmatic Classification of the Texts in Turkish Course Books

    Directory of Open Access Journals (Sweden)

    İ. Seçkin AYDIN

    2013-03-01

    Full Text Available Globally, one of the most discussed problems in education is the issue of teaching in mother tongue and in a foreign language. Accordingly, various methods and techniques have been tried. In any case, using a text is indispensable. Especially listening, reading, speaking, writing and grammar drills are conducted in the classroom environment. Therefore, text preference is an important element in language teaching. These texts are determined according to certain criteria. The texts determined upon criteria are used to improve the language skills of students. Certain linguistic criteria have to be determined for text choice in language teaching. In this context, functional display in the texts chosen is crucial for students to grasp the potential of language. The aim of this research is to assess the texts in 8th Grade Turkish Course Books in a pragmatic perspective. Survey method was used in the research. The statements in the texts were classified according to “speech acts theory” of Searle. In the research, the texts in the common theme, “Social Life”, were analyzed. The texts assessed in three course books were limited with a theme and prose types. The criteria were grouped under the following headings: commissives, declarations, directives, expressives and assertives. It was determined that most of the expressions in all the course books were descriptive and expressive while the others were insufficient. Using various expressions in texts contributes to perceptive and expressive skills of students. It was suggested in the research that expression types diverse in pragmatic aspect should be used in the texts for students.

  3. Zero-Shot Style Transfer in Text Using Recurrent Neural Networks

    OpenAIRE

    Carlson, Keith; Riddell, Allen; Rockmore, Daniel

    2017-01-01

    Zero-shot translation is the task of translating between a language pair where no aligned data for the pair is provided during training. In this work we employ a model that creates paraphrases which are written in the style of another existing text. Since we provide the model with no paired examples from the source style to the target style during training, we call this task zero-shot style transfer. Herein, we identify a high-quality source of aligned, stylistically distinct text in Bible ve...

  4. Making School Development Credible. Text, Context, Irony

    Directory of Open Access Journals (Sweden)

    Mats Börjesson

    2012-01-01

    Full Text Available The article argues for the importance of an open, reflexive-methodological approach when switching between studying text, context and researcher activity. Close linguistic analysis can benefit from being linked with the researcher’s contextualisation of his empirical material as well as with more distanced readings. The more specific starting point for this article is that school development, like other similar terms such as school improvement and the like, makes use of linguistic building blocks with which whole narratives about today’s and tomorrow’s schools can be constructed. The subject of the study is a short text issued by the Swedish Schools Inspectorate (Skolinspektionen). Government language changes according to the authorities’ role in society and their own definitions of their functions, and an important aspect here is the legitimacy of the authorities’ texts. By means of various kinds of close linguistic analysis, the above-mentioned text is studied with regard to choice of categories, hierarchies of modalisation and the rhetorical effects of different types of formulations in a broader political-social landscape. The article concludes with a reflective discussion on the relationship between government language and irony as a stylistic device, a device that is based on the results of the close empirical analysis.

    Note: The article is part of the project “School Development as Narrative”, funded by the Swedish Research Council. The author would like to thank the two reviewers for very valuable comments.

  5. Stylistic Performance through Affective Marking: A Case of Multilingual Literary Discourse

    Directory of Open Access Journals (Sweden)

    Urjani Chakravarty

    2016-12-01

    Full Text Available This paper provides an overall analysis of how a multilingual writer like Amitav Ghosh writes about emotion in his literary text, and emphasizes how multilingual authors display emotion/affect through the use of literary multilingualism (affective markers) combined with writer style. Through the use of multiple strategies, they reduce the limitations on the interpretation of their texts. Furthermore, this paper highlights the centrally sociolinguistic and cognitive dimensions of the relationships between multilingualism and emotion and how these are influenced by assumptions of Relevance Theory, i.e. optimal relevance in a literary text. One should expect to find relationships between sociolinguistic diversity and affective expression for most authors in locally specific ways, whether multilingual or not. Such scholarship can then illuminate how authors, by using literary multilingualism through writer style and affective markers, can shape emotions across various contexts in a literary text. Future research into multilingualism and emotion should continue to distinguish between how multilingual authors use linguistic forms to show feeling and how they write about feeling in their created texts. Keywords: Language, Culture, Literary Multilingualism, Style, Affect and Relevance Theory

  6. Towards secondary fingerprint classification

    CSIR Research Space (South Africa)

    Msiza, IS

    2011-07-01

    Full Text Available an accuracy figure of 76.8%. This small difference between the two figures is indicative of the validity of the proposed secondary classification module. Keywords: fingerprint core; fingerprint delta; primary classification; secondary classification I..., namely, the fingerprint core and the fingerprint delta. Forensically, a fingerprint core is defined as the innermost turning point where the fingerprint ridges form a loop, while the fingerprint delta is defined as the point where these ridges form a...

  7. Expected Classification Accuracy

    Directory of Open Access Journals (Sweden)

    Lawrence M. Rudner

    2005-08-01

    Full Text Available Every time we make a classification based on a test score, we should expect some number of misclassifications. Some examinees whose true ability is within a score range will have observed scores outside of that range. A procedure for providing a classification table of true and expected scores is developed for polytomously scored items under item response theory and applied to state assessment data. A simplified procedure for estimating the table entries is also presented.
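
    The idea of expected misclassification can be illustrated with a minimal sketch. It is not the procedure from the paper: it assumes a single cut score, two categories, and normally distributed measurement error around each ability estimate, and the function name and inputs are hypothetical.

```python
from statistics import NormalDist


def expected_accuracy(estimates, cut):
    """Average probability that each examinee's true ability lies on the
    same side of the cut score as the observed estimate.

    estimates: list of (theta_hat, standard_error) pairs (hypothetical data).
    Assumption: true ability ~ Normal(theta_hat, standard_error).
    """
    total = 0.0
    for theta_hat, se in estimates:
        # Probability that the true ability is at or above the cut score.
        p_above = 1.0 - NormalDist(theta_hat, se).cdf(cut)
        # Credit the probability mass on the side where the examinee was placed.
        total += p_above if theta_hat >= cut else 1.0 - p_above
    return total / len(estimates)
```

    An examinee whose estimate sits exactly on the cut contributes 0.5 under these assumptions, while one far from the cut contributes close to 1, so precise estimates near the cut matter most.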

  8. Directed Activities Related to Text: Text Analysis and Text Reconstruction.

    Science.gov (United States)

    Davies, Florence; Greene, Terry

    This paper describes Directed Activities Related to Text (DART), procedures that were developed and are used in the Reading for Learning Project at the University of Nottingham (England) to enhance learning from texts and that fall into two broad categories: (1) text analysis procedures, which require students to engage in some form of analysis of…

  9. Semantic and stylistic peculiarities of Slavicisms in language of modern newspapers

    Directory of Open Access Journals (Sweden)

    Жанар Кабдыляшымовна Киынова

    2012-12-01

    Full Text Available The article considers the functioning of Slavonicisms in the language of modern Kazakhstani and Russian newspapers. On the basis of examples excerpted from modern newspapers, an informative picture of the tendencies and regularities of modern word usage in mass media is given.

  10. Stylistic Performance through Affective Marking: A Case of Multilingual Literary Discourse

    Science.gov (United States)

    Chakravarty, Urjani

    2016-01-01

    This paper provides an overall analysis of how a multilingual writer like Amitav Ghosh writes about emotion in his literary text, and emphasizes how multilingual authors display emotion/affect through the use of literary multilingualism (affective markers) combined with writer style. Through the use of multiple strategies, they reduce the limitations of…

  11. download full text

    African Journals Online (AJOL)

    Adopting a surveillance system for antibacterial use has therefore become a more realistic ..... Financial support was obtained from the African Poverty Related Infection ... classification and Defined Daily Dose system methodology in Canada.

  12. A Customizable Text Classifier for Text Mining

    Directory of Open Access Journals (Sweden)

    Yun-liang Zhang

    2007-12-01

    Full Text Available Text mining deals with complex and unstructured texts. Usually a particular collection of texts that is specific to one or more domains is necessary. We have developed a customizable text classifier for users to mine the collection automatically. It derives from the sentence category of the HNC theory and corresponding techniques. It can start with a few texts, and it can adjust automatically or be adjusted by the user. The user can also control the number of domains chosen and decide the standard with which to choose the texts based on demand and abundance of materials. The performance of the classifier varies with the user's choice.

  13. A quick survey of text categorization algorithms

    Directory of Open Access Journals (Sweden)

    Dan MUNTEANU

    2007-12-01

    Full Text Available This paper contains an overview of basic formulations and approaches to text classification. This paper surveys the algorithms used in text categorization: handcrafted rules, decision trees, decision rules, on-line learning, linear classifier, Rocchio’s algorithm, k Nearest Neighbor (kNN), Support Vector Machines (SVM).
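
    One of the listed methods can be sketched briefly. The code below is a simplified, centroid-only variant of Rocchio's algorithm (no weighting of negative examples), using raw term counts and cosine similarity; the training texts, labels, and function names are hypothetical and are not taken from the survey.

```python
import math
from collections import Counter, defaultdict


def vectorize(text):
    """Bag-of-words vector: lowercase whitespace tokens mapped to counts."""
    return Counter(text.lower().split())


def cosine(a, b):
    """Cosine similarity between two sparse term vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def train_centroids(labeled_docs):
    """Average the term vectors of each category into one centroid."""
    sums = defaultdict(Counter)
    counts = Counter()
    for text, label in labeled_docs:
        sums[label].update(vectorize(text))
        counts[label] += 1
    return {
        label: Counter({t: v / counts[label] for t, v in vec.items()})
        for label, vec in sums.items()
    }


def classify(text, centroids):
    """Assign the category whose centroid is most similar to the text."""
    vec = vectorize(text)
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))


# Hypothetical toy training set, for illustration only.
TRAINING = [
    ("goal match team score", "sport"),
    ("team wins the match", "sport"),
    ("stock market price falls", "finance"),
    ("bank raises interest price", "finance"),
]
CENTROIDS = train_centroids(TRAINING)
```

    Calling `classify("the team lost the match", CENTROIDS)` picks the category whose averaged vector shares the most weighted terms with the input. A kNN classifier would instead compare the input with each training document individually.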

  14. The linguistic and stylistic content in teaching Spanish in Gabon: attitudes and teaching practices

    Directory of Open Access Journals (Sweden)

    Jeanne Berthe MAKAYA MAVOUNGOU

    2016-07-01

    Full Text Available We have long been perplexed by the position taken by the majority of Gabonese teachers on whether or not attention should be paid to the partial or complete study of linguistic and semantic aspects when explaining and commenting on a document. Teaching Spanish language and literature requires the teacher to know, and take into account, the synergy that exists between the different genres, aesthetic expression, semantic content and pragmatic use. For this reason the teacher is expected to take into account the contributions of related sciences such as linguistics, aesthetics, sociology and psychology, which continue to suggest to pedagogy and didactics other ways of treating the appropriate content in the Spanish language class. The contribution of this investigation lies, above all, in the analysis of teaching practices and of what these gain from semantic and linguistic approaches.

  15. Stylistics of Abou-al-Qasem Payandeh´s Short Stories

    Directory of Open Access Journals (Sweden)

    Dr. M. R. Nasr Isfahani

    2011-01-01

    Full Text Available This study is an introduction to the style of Abou-al-Qasem Payandeh´s short stories; his literary side has been left unmentioned among the authors of the Isfahan school of story writing. Abou-al-Qasem Payandeh owes his great fame to his translation of the Holy Qoran, and critics have considered his works mainly through his activities in the field of press and translation. Style is a special method that every writer or poet applies in literary works, like it or not, in order to express his aims, and it differs from author to author. The special style that Payandeh uses in his short stories is examined from the linguistic and literary points of view. The use of archaic language, Arabic words and idioms, in the frame of simile, allusion and prolixity, gives a new structure to his stories. The main subject of his stories is society and the pain of poor people. We examine the stylistic features of Abou-al-Qasem Payandeh´s short stories based on Defae az molla Nasr-al-din (Defending of Molla Nasr-al-din), Morde keshane Joozan (Killing the dead Joozan People) and Zolemate edalat (The Darkness of Justice). Then we present a brief analysis of three aspects: linguistic, literary and thought.

  16. Stylistics of Abou-al-Qasem Payandeh´s Short Stories

    Directory of Open Access Journals (Sweden)

    Mohammadreza Nasr Isfahani

    2011-04-01

    Full Text Available This study is an introduction to the style of Abou-al-Qasem Payandeh´s short stories; his literary side has been left unmentioned among the authors of the Isfahan school of story writing. Abou-al-Qasem Payandeh owes his great fame to his translation of the Holy Qoran, and critics have considered his works mainly through his activities in the field of press and translation. Style is a special method that every writer or poet applies in literary works, like it or not, in order to express his aims, and it differs from author to author. The special style that Payandeh uses in his short stories is examined from the linguistic and literary points of view. The use of archaic language, Arabic words and idioms, in the frame of simile, allusion and prolixity, gives a new structure to his stories. The main subject of his stories is society and the pain of poor people. We examine the stylistic features of Abou-al-Qasem Payandeh´s short stories based on Defae az molla Nasr-al-din (Defending of Molla Nasr-al-din), Morde keshane Joozan (Killing the dead Joozan People) and Zolemate edalat (The Darkness of Justice). Then we present a brief analysis of three aspects: linguistic, literary and thought.

  17. Two Aspects of Language and a Poem: One Stylistics Analysis of "Água Forte"

    Directory of Open Access Journals (Sweden)

    Isabel Cristina Ferreira Teixeira

    2016-07-01

    Full Text Available This study is organized around an experience developed in the classroom, specifically in Linguistic Studies, a course taken by students in the first semesters of the Licentiate degree in Letters. In it we begin reflecting on the disciplinary constitution of linguistics through structuralist ideas. Jakobson soon appears with his founding articles on the axes of language organization associated with aphasia and on the functions of language, characterizing the poetic function as the one that “projects the principle of equivalence from the axis of selection onto the axis of combination” (1985, p. 130). But how are these concepts, which describe the expressiveness of linguistic elements, to be understood? The idea is to apply them to language in its different uses. The poem is one of these possibilities. We then present the study, carried out for the students and with the students, which analyzes the functioning of the two polar figures of style, metaphor and metonymy, applied to the poem “Água-forte” by Manuel Bandeira (1986). Based on the structuralist principles of Saussure (1974) and Jakobson (1985), we relate the axes of language organization to these figures. This analysis favors the discovery of meaning, produces possibilities of reading and, above all, allows the observation of discursive effects related to knowledge of the expressiveness of linguistic elements.

  18. STYLISTIC ORIGINALITY OF SPENSER’S EPITHALAMION AND ITS REFLECTION IN RUSSIAN AND CHINESE TRANSLATIONS

    Directory of Open Access Journals (Sweden)

    I.I. Burova

    2017-06-01

    Full Text Available The contemporary notion of style as a set of rules that allows the author to choose and combine the elements of content and form when producing a literary work (V.V. Vinogradov, A.N. Sokolov et al.) was in many respects prefigured in “The Arte of English Poesie” (1589), attributed to G. Puttenham, and most of the principles stated in it were mirrored in Edmund Spenser’s Epithalamion (1595). The style of the poem is remarkable for its combination of heterogeneous elements borrowed from both the earlier epithalamic tradition and the toolbox of such arts as music, painting, and architecture. A number of papers published in recent decades have revealed the picturesqueness and musicality of the poem, seeing Epithalamion as a work bearing typical characteristics of the poet’s idiostyle. This paper suggests that it is possible to single it out from the rest of Spenser’s work and read it as a poem whose peculiar style results from the interplay of the emergent grand styles of the time such as Mannerism and/or Baroque, on the one hand, and the elements anticipating Neo-Classicism, on the other. The elaborate Mannerist/Baroque structure of Epithalamion is permeated with number symbolism supporting the idea of the perfect harmony of the wedlock blessed by Holy Church in the bridal poem, while its Neo-Classical elements reveal Spenser as a successor of Sappho and Catullus. In the concluding part of the essay, we attempt to evaluate how these peculiarities of the Epithalamion style were rendered in Russian and Chinese translations of the poem.

  19. Text Maps: Helping Students Navigate Informational Texts.

    Science.gov (United States)

    Spencer, Brenda H.

    2003-01-01

    Notes that a text map is an instructional approach designed to help students gain fluency in reading content area materials. Discusses how the goal is to teach students about the important features of the material and how the maps can be used to build new understandings. Presents the procedures for preparing and using a text map. (SG)

  20. The Form is the Substance: Classification of Genres in Text

    Science.gov (United States)

    2001-01-01

    particularly good discriminator for a horoscope genre, the fact that it is an astrological sign and appears with other terms deemed astrological may well be...denominator prior to taking logs.) A misclassification matrix example is shown in Table 3. The true genre tag is indicated by row and the classifier’s...classifiers and is true even for the Naïve Bayes classifier which does poorly with presentation features alone. Applying a threshold to the classifier output

  1. Machine learning versus knowledge based classification of legal texts

    NARCIS (Netherlands)

    de Maat, E.; Krabben, K.; Winkels, R.; Winkels, R.G.F.

    2010-01-01

    This paper presents results of an experiment in which we used machine learning (ML) techniques to classify sentences in Dutch legislation. These results are compared to the results of a pattern-based classifier. Overall, the ML classifier performs as accurately (>90%) as the pattern-based one, but

  2. On Dataless Hierarchical Text Classification (Author’s Manuscript)

    Science.gov (United States)

    2014-07-27

    compound talk.politics.mideast politics mideast israel arab jews jewish muslim talk.politics.misc politics gay homosexual sexual alt.atheism atheism...tion in NLP tasks; it was further used in several NLP works, such as by Liang (2005), to measure words’ distributional similarity. This method...embedding trained by neural networks has been used widely in the NLP community and has become a hot trend recently. In this paper, we test the suitability

  3. Event Classification using Concepts

    NARCIS (Netherlands)

    Boer, M.H.T. de; Schutte, K.; Kraaij, W.

    2013-01-01

    The semantic gap is one of the challenges in the GOOSE project. In this paper a Semantic Event Classification (SEC) system is proposed as an initial step in tackling the semantic gap challenge in the GOOSE project. This system uses semantic text analysis, multiple feature detectors using the BoW

  4. An Intelligent System For Arabic Text Categorization

    NARCIS (Netherlands)

    Syiam, M.M.; Tolba, Mohamed F.; Fayed, Z.T.; Abdel-Wahab, Mohamed S.; Ghoniemy, Said A.; Habib, Mena Badieh

    Text Categorization (classification) is the process of classifying documents into a predefined set of categories based on their content. In this paper, an intelligent Arabic text categorization system is presented. Machine learning algorithms are used in this system. Many algorithms for stemming and

  5. Tissue Classification

    DEFF Research Database (Denmark)

    Van Leemput, Koen; Puonti, Oula

    2015-01-01

    Computational methods for automatically segmenting magnetic resonance images of the brain have seen tremendous advances in recent years. So-called tissue classification techniques, aimed at extracting the three main brain tissue classes (white matter, gray matter, and cerebrospinal fluid), are now...... well established. In their simplest form, these methods classify voxels independently based on their intensity alone, although much more sophisticated models are typically used in practice. This article aims to give an overview of often-used computational techniques for brain tissue classification...

  6. Stylistics and comics

    NARCIS (Netherlands)

    Forceville, C.; El Refaie, E.; Meesters, G.; Burke, M.

    2014-01-01

    Comics is rapidly developing into a scholarly discipline in its own right, with a growing output of books, journals, and conferences. Two practitioners have been crucial influences. Eisner (1985) and McCloud (1993, 2000, 2006) have done much to aid the understanding of how comics create meaning. But

  7. Text Mining in Organizational Research.

    Science.gov (United States)

    Kobayashi, Vladimer B; Mol, Stefan T; Berkers, Hannah A; Kismihók, Gábor; Den Hartog, Deanne N

    2018-07-01

    Despite the ubiquity of textual data, so far few researchers have applied text mining to answer organizational research questions. Text mining, which essentially entails a quantitative approach to the analysis of (usually) voluminous textual data, helps accelerate knowledge discovery by radically increasing the amount of data that can be analyzed. This article aims to acquaint organizational researchers with the fundamental logic underpinning text mining, the analytical stages involved, and contemporary techniques that may be used to achieve different types of objectives. The specific analytical techniques reviewed are (a) dimensionality reduction, (b) distance and similarity computing, (c) clustering, (d) topic modeling, and (e) classification. We describe how text mining may extend contemporary organizational research by allowing the testing of existing or new research questions with data that are likely to be rich, contextualized, and ecologically valid. After an exploration of how evidence for the validity of text mining output may be generated, we conclude the article by illustrating the text mining process in a job analysis setting using a dataset composed of job vacancies.
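
    Technique (b), distance and similarity computing, can be illustrated with a minimal sketch. It uses Jaccard similarity on token sets, which is one common choice and not necessarily the measure used in the article; the vacancy texts and function names are hypothetical.

```python
def tokens(text):
    """Set of lowercase whitespace-separated tokens in a text."""
    return set(text.lower().split())


def jaccard(a, b):
    """Jaccard similarity: shared tokens divided by all distinct tokens."""
    ta, tb = tokens(a), tokens(b)
    union = ta | tb
    return len(ta & tb) / len(union) if union else 0.0


def most_similar(query, documents):
    """Return the document with the highest Jaccard similarity to the query."""
    return max(documents, key=lambda doc: jaccard(query, doc))


# Hypothetical job-vacancy snippets, for illustration only.
VACANCIES = [
    "data analyst sql reporting",
    "nurse patient care hospital",
    "data engineer sql pipelines",
]
```

    Here `jaccard("data analyst sql", "data engineer sql")` is 0.5, since two of the four distinct tokens are shared. Pairwise values like this form the distance matrix that clustering methods in step (c) typically start from.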

  8. Classification of remotely sensed images

    CSIR Research Space (South Africa)

    Dudeni, N

    2008-10-01

    Full Text Available For this research, the researchers examine various existing image classification algorithms with the aim of demonstrating how these algorithms can be applied to remote sensing images. These algorithms are broadly divided into supervised...

  9. Classification of Cortical Brain Malformations

    Directory of Open Access Journals (Sweden)

    J Gordon Millichap

    2008-03-01

    Full Text Available Clinical, radiological, and genetic classifications of 113 cases of malformations of cortical development (MCD) were evaluated at the Erasmus Medical Center-Sophia Children's Hospital, Rotterdam, the Netherlands.

  10. PRIMERA SISTEMATIZACIÓN DE LAS CARACTERÍSTICAS ESTILÍSTICAS DE LA ALFARERÍA FINA DEL SITIO SORIA 2 (VALLE DE YOCAVIL, NOROESTE ARGENTINO) / First systematization of stylistic characters of fine pottery from Soria 2 site (Yocavil, Northwestern Argentina)

    Directory of Open Access Journals (Sweden)

    Romina Clara Spano

    2011-12-01

    Full Text Available In this paper we present a first systematization of the features of early pottery found at the site Soria 2 (Yocavil Valley, Northwestern Argentina), focusing the analysis on the specimens belonging to the so-called fine pottery group. We aim to characterize a sample of the abundant ceramic material found in a primary context, for which there is a radiocarbon date from the beginning of the Christian era. The material is classified using the category of style, understood here as the integration of morphological, technological and decorative aspects, which converge in the “ways of doing” current during the occupation of the site. The analytical variables used are detailed: form, manufacturing technique, paste, firing, surface treatment and decoration. The conjunction of these variables is the basis for proposing stylistic modalities. Additionally, we refer briefly to the practices in which the vessels were involved, taking into account the contexts of discovery (domestic and funerary). The analysis suggests that some specimens in the studied sample exhibit affinities with surrounding areas. Keywords: pottery; stylistic modalities; primary context; Formative; Yocavil Valley.

  11. Transporter Classification Database (TCDB)

    Data.gov (United States)

    U.S. Department of Health & Human Services — The Transporter Classification Database details a comprehensive classification system for membrane transport proteins known as the Transporter Classification (TC)...

  12. FACET CLASSIFICATIONS OF E-LEARNING TOOLS

    Directory of Open Access Journals (Sweden)

    Olena Yu. Balalaieva

    2013-12-01

    Full Text Available The article deals with the classification of e-learning tools based on the facet method, which separates a set of objects in parallel into independent classification groups; it assumes no rigid classification structure or pre-built finite groups, and classification groups are formed by combining values taken from the relevant facets. For the first time, an attempt is made to systematize the existing classifications of e-learning tools from the standpoint of classification theory. Modern Ukrainian and foreign facet classifications of e-learning tools are described, and their positive and negative features compared with classifications based on a hierarchical method are analyzed. The author's original facet classification of e-learning tools is proposed.
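
    The facet method described above can be sketched as a small data structure. The facet names and values below are hypothetical placeholders, not the facets proposed in the article; the sketch only shows how classification groups arise from combinations of facet values rather than from a fixed hierarchy.

```python
from itertools import product

# Hypothetical facets and values, for illustration only.
FACETS = {
    "purpose": ["teaching", "assessment"],
    "medium": ["text", "video"],
}


def classification_groups(facets):
    """Every combination of one value per facet forms a classification group."""
    names = list(facets)
    return [
        dict(zip(names, combo))
        for combo in product(*(facets[name] for name in names))
    ]


def matches(tool, **criteria):
    """Check whether a tool's facet values satisfy the given criteria."""
    return all(tool.get(facet) == value for facet, value in criteria.items())
```

    Adding a facet or a value changes the set of possible groups without restructuring anything else, which is the flexibility the facet method trades against the fixed paths of a hierarchical scheme.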

  13. Orienting task effects on text recall in adulthood.

    Science.gov (United States)

    Simon, E W; Dixon, R A; Nowak, C A; Hultsch, D F

    1982-09-01

    This investigation examined the effects of orienting task-controlled processing on the text recall of younger (18 to 32 years), middle-aged (39 to 51 years), and older (59 to 76 years) adults. The participants were presented with a 500-word narrative text. Three groups performed orienting tasks (syntactic, stylistic, advice) within an incidental memory paradigm. A fourth group was asked for intentional recall. Analysis indicated a significant age by orienting task interaction. Younger adults recalled more propositions when recall was intentional or when it was preceded by a deep-orienting task than when it was preceded by a shallow-orienting task. Middle-aged and older adults recalled more propositions when recall was intentional than when it was incidental, regardless of the depth of the orienting task. There were no significant differences in intentional recall. In addition, a significant age x orienting task x propositional level interaction indicated that younger adults recalled more of the main ideas of the text following deep processing, whereas the middle-aged and older adults recalled more of these ideas following intentional processing.

  14. Text, Style, and Author in Hamlet Q1

    Directory of Open Access Journals (Sweden)

    Christy Desmet

    2016-03-01

    Full Text Available The first quarto of Hamlet has traditionally been an embarrassment to attribution studies. Textual and bibliographical studies from the 1980s and beyond have permitted suspect texts to be recovered and performed, but critical appreciation tends to focus on such matters as characterization and performance possibilities rather than the text’s rhetorical integrity and aesthetic qualities. More recently, we have seen greater critical attention to Shakespeare’s suspect texts, which has increased our appreciation for and expanded our notion of Q1 Hamlet as a ‘text’. Opinion remains divided, however, on the question of who ‘wrote’ this play. This essay addresses the authorship debate somewhat indirectly by providing a different view of Hamlet Q1 based on a stylistic analysis that is grounded in Renaissance rhetoric. It characterizes the play’s style as the rhetoric of speed, with brachylogia as its representative rhetorical figure. Through review of theories about the composition of Hamlet Q1 and a rhetorical analysis of its style, the essay seeks to examine how Hamlet’s first quarto might have a recognizable style and how that style might be related to current concepts of authorship.

  15. Text-Fabric

    NARCIS (Netherlands)

    Roorda, Dirk

    2016-01-01

    Text-Fabric is a Python3 package for Text plus Annotations. It provides a data model, a text file format, and a binary format for (ancient) text plus (linguistic) annotations. The emphasis of this all is on: data processing; sharing data; and contributing modules. A defining characteristic is that

  16. Contextual Text Mining

    Science.gov (United States)

    Mei, Qiaozhu

    2009-01-01

    With the dramatic growth of text information, there is an increasing need for powerful text mining systems that can automatically discover useful knowledge from text. Text is generally associated with all kinds of contextual information. Those contexts can be explicit, such as the time and the location where a blog article is written, and the…

  17. XML and Free Text.

    Science.gov (United States)

    Riggs, Ken Roger

    2002-01-01

    Discusses problems with marking free text, text that is either natural language or semigrammatical but unstructured, that prevent well-formed XML from marking text for readily available meaning. Proposes a solution to mark meaning in free text that is consistent with the intended simplicity of XML versus SGML. (Author/LRW)

  18. Vietnamese Document Representation and Classification

    Science.gov (United States)

    Nguyen, Giang-Son; Gao, Xiaoying; Andreae, Peter

    Vietnamese is very different from English and little research has been done on Vietnamese document classification, or indeed, on any kind of Vietnamese language processing, and only a few small corpora are available for research. We created a large Vietnamese text corpus with about 18000 documents, and manually classified them based on different criteria such as topics and styles, giving several classification tasks of different difficulty levels. This paper introduces a new syllable-based document representation at the morphological level of the language for efficient classification. We tested the representation on our corpus with different classification tasks using six classification algorithms and two feature selection techniques. Our experiments show that the new representation is effective for Vietnamese categorization, and suggest that best performance can be achieved using syllable-pair document representation, an SVM with a polynomial kernel as the learning algorithm, and using Information gain and an external dictionary for feature selection.

  19. E-text

    DEFF Research Database (Denmark)

    Finnemann, Niels Ole

    2018-01-01

    text can be defined by taking as point of departure the digital format in which everything is represented in the binary alphabet. While the notion of text, in most cases, lends itself to be independent of medium and embodiment, it is also often tacitly assumed that it is, in fact, modeled around...... the print medium, rather than written text or speech. In late 20th century, the notion of text was subject to increasing criticism as in the question raised within literary text theory: is there a text in this class? At the same time, the notion was expanded by including extra linguistic sign modalities...

  20. Texting on the Move

    Science.gov (United States)

    ... text. What's the Big Deal? The problem is multitasking. No matter how young and agile we are, ... on something other than the road. In fact, driving while texting (DWT) can be more dangerous than ...

  1. Classification and regression trees

    CERN Document Server

    Breiman, Leo; Olshen, Richard A; Stone, Charles J

    1984-01-01

    The methodology used to construct tree structured rules is the focus of this monograph. Unlike many other statistical procedures, which moved from pencil and paper to calculators, this text's use of trees was unthinkable before computers. Both the practical and theoretical sides have been developed in the authors' study of tree methods. Classification and Regression Trees reflects these two sides, covering the use of trees as a data analysis method, and in a more mathematical framework, proving some of their fundamental properties.

  2. Decimal Classification Editions

    Directory of Open Access Journals (Sweden)

    Zenovia Niculescu

    2009-01-01

    Full Text Available The study approaches the evolution of Dewey Decimal Classification editions from the perspective of updating the terminology, reallocating and expanding the main and auxiliary structure of the Dewey indexing language. The comparative analysis of DDC editions emphasizes the efficiency of the Dewey scheme from the point of view of improving the informational offer, through basic index terms, revised and developed, as well as valuing the auxiliary notations.

  3. Text Coherence in Translation

    Science.gov (United States)

    Zheng, Yanping

    2009-01-01

    In the thesis a coherent text is defined as a continuity of senses of the outcome of combining concepts and relations into a network composed of knowledge space centered around main topics. And the author maintains that in order to obtain the coherence of a target language text from a source text during the process of translation, a translator can…

  4. Vocabulary Constraint on Texts

    Directory of Open Access Journals (Sweden)

    C. Sutarsyah

    2008-01-01

    Full Text Available This case study was carried out in the English Education Department of State University of Malang. The aim of the study was to identify and describe the vocabulary in the reading texts and to determine whether the texts are useful for reading skill development. A descriptive qualitative design was applied to obtain the data. For this purpose, some available computer programs were used to describe the vocabulary in the texts. It was found that the 20 texts, containing 7,945 words, are dominated by low frequency words, which account for 16.97% of the words in the texts. The high frequency words occurring in the texts were dominated by function words. In terms of word levels, it was found that the texts have a very limited number of words from the GSL (General Service List of English Words) (West, 1953). The first 1,000 words of the GSL account for only 44.6%. The data also show that the texts contain too large a proportion of words which are not in the three levels (the first 2,000 and UWL). These words account for 26.44% of the running words in the texts. It is believed that the constraints are due to the selection of the texts, which are made up of a series of short, unrelated texts. This kind of text is subject to the accumulation of low frequency words, especially content words, and to a limited number of words from the GSL. It could also defeat the development of students' reading skills and vocabulary enrichment.
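
    The percentages reported above are shares of running words (tokens) that fall inside or outside a given word list. A minimal sketch of that calculation follows; it is not the software used in the study, and the tokenizer, the function name `coverage`, and the toy word list are all assumptions made for illustration.

```python
import re

def coverage(text, word_list):
    """Return the share of running words (tokens) in `text` found in `word_list`."""
    # Assumed tokenizer: lowercase runs of letters and apostrophes.
    tokens = re.findall(r"[a-z']+", text.lower())
    if not tokens:
        return 0.0
    known = {w.lower() for w in word_list}
    covered = sum(1 for t in tokens if t in known)
    return covered / len(tokens)

# Toy data standing in for a reading text and a frequency list such as the GSL.
sample = "The cat sat on the mat and the dog sat too"
toy_list = ["the", "cat", "sat", "on", "and"]
share = coverage(sample, toy_list)
print(f"{share:.1%} of running words are covered by the list")
```

    Counting tokens rather than distinct word types matches the abstract's phrase "running words"; a type-based count would give different figures.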

  5. Dictionaries for text production

    DEFF Research Database (Denmark)

    Fuertes-Olivera, Pedro; Bergenholtz, Henning

    2018-01-01

    Dictionaries for Text Production are information tools that are designed and constructed for helping users to produce (i.e. encode) texts, both oral and written texts. These can be broadly divided into two groups: (a) specialized text production dictionaries, i.e., dictionaries that only offer...... a small amount of lexicographic data, most or all of which are typically used in a production situation, e.g. synonym dictionaries, grammar and spelling dictionaries, collocation dictionaries, concept dictionaries such as the Longman Language Activator, which is advertised as the World’s First Production...... Dictionary; (b) general text production dictionaries, i.e., dictionaries that offer all or most of the lexicographic data that are typically used in a production situation. A review of existing production dictionaries reveals that there are many specialized text production dictionaries but only a few general...

  6. Instant Sublime Text starter

    CERN Document Server

    Haughee, Eric

    2013-01-01

    A starter which teaches the basic tasks to be performed with Sublime Text with the necessary practical examples and screenshots. This book requires only basic knowledge of the Internet and basic familiarity with any one of the three major operating systems, Windows, Linux, or Mac OS X. However, as Sublime Text 2 is primarily a text editor for writing software, many of the topics discussed will be specifically relevant to software development. That being said, the Sublime Text 2 Starter is also suitable for someone without a programming background who may be looking to learn one of the tools of

  7. NEW CLASSIFICATION OF ECOPOLICES

    Directory of Open Access Journals (Sweden)

    VOROBYOV V. V.

    2016-09-01

    Full Text Available Problem statement. Ecopolices are the newest stage of urban planning. They have to be considered as material and energy-informational structures included in the dynamic-evolutionary matrix networks of exchange processes in ecosystems. However, no ecopolice classifications have yet been developed on the basis of such approaches, which determines the topicality of the article. Analysis of publications on theoretical and applied aspects of ecopolice formation showed that work on them is carried out mainly in the context of the latest scientific and technological achievements in various fields of knowledge. These settlements are technocratic. They are connected with the morphology of space and the network structures of regional and local natural ecosystems, have no independent stability, and cannot exist without continuous human support. In other words, they do not fit the idea of an ecopolice. The time has come for an objective, symbiotic search for the ecopolice concept, together with the development of ecopolice classifications. Purpose statement. To develop objective evidence for ecopolices and to propose a new classification of them. Conclusion. The classification of ecopolices has to rest on the idea of correlating the elements of their general plans and the types of human activity with the natural mechanisms of accepting, reworking and transmitting material, energy and information between geo-ecosystems, the planet, man, the material part of ecopolices and the Cosmos. A new ecopolice classification should be based on the principles of multi-dimensional, time-spaced symbiotic clarity with ecosystem exchange networks. With this approach, the function of an ecopolice derives not from a subjective anthropocentric economy but from the holistic, objective paradigm of Genesis; in other words, not from the Consequence but from the Cause.

  8. Differential Classification of Dementia

    Directory of Open Access Journals (Sweden)

    E. Mohr

    1995-01-01

    Full Text Available In the absence of biological markers, dementia classification remains complex both in terms of characterization as well as early detection of the presence or absence of dementing symptoms, particularly in diseases with possible secondary dementia. An empirical, statistical approach using neuropsychological measures was therefore developed to distinguish demented from non-demented patients and to identify differential patterns of cognitive dysfunction in neurodegenerative disease. Age-scaled neurobehavioral test results (Wechsler Adult Intelligence Scale-Revised and Wechsler Memory Scale) from Alzheimer's (AD) and Huntington's (HD) patients, matched for intellectual disability, as well as normal controls were used to derive a classification formula. Stepwise discriminant analysis accurately (99% correct) distinguished controls from demented patients, and separated the two patient groups (79% correct). Variables discriminating between HD and AD patient groups consisted of complex psychomotor tasks, visuospatial function, attention and memory. The reliability of the classification formula was demonstrated with a new, independent sample of AD and HD patients which yielded virtually identical results (classification accuracy for dementia: 96%; AD versus HD: 78%). To validate the formula, the discriminant function was applied to Parkinson's (PD) patients, 38% of whom were classified as demented. The validity of the classification was demonstrated by significant PD subgroup differences on measures of dementia not included in the discriminant function. Moreover, a majority of demented PD patients (65%) were classified as having an HD-like pattern of cognitive deficits, in line with previous reports of the subcortical nature of PD dementia. This approach may thus be useful in classifying presence or absence of dementia and in discriminating between dementia subtypes in cases of secondary or coincidental dementia.

  9. Linguistics in Text Interpretation

    DEFF Research Database (Denmark)

    Togeby, Ole

    2011-01-01

    A model for how text interpretation proceeds from what is pronounced, through what is said to what is communicated, and definition of the concepts 'presupposition' and 'implicature'.

  10. LocText

    DEFF Research Database (Denmark)

    Cejuela, Juan Miguel; Vinchurkar, Shrikant; Goldberg, Tatyana

    2018-01-01

    trees and was trained and evaluated on a newly improved LocTextCorpus. Combined with an automatic named-entity recognizer, LocText achieved high precision (P = 86%±4). After completing development, we mined the latest research publications for three organisms: human (Homo sapiens), budding yeast...

  11. Systematic text condensation

    DEFF Research Database (Denmark)

    Malterud, Kirsti

    2012-01-01

    To present background, principles, and procedures for a strategy for qualitative analysis called systematic text condensation and discuss this approach compared with related strategies.

  12. The Perfect Text.

    Science.gov (United States)

    Russo, Ruth

    1998-01-01

    A chemistry teacher describes the elements of the ideal chemistry textbook. The perfect text is focused and helps students draw a coherent whole out of the myriad fragments of information and interpretation. The text would show chemistry as the central science necessary for understanding other sciences and would also root chemistry firmly in the…

  13. Text 2 Mind Map

    OpenAIRE

    Iona, John

    2017-01-01

    This is a review of the web resource 'Text 2 Mind Map' www.Text2MindMap.com. It covers what the resource is and how it might be used in library and education contexts, in particular by school librarians.

  14. Text File Comparator

    Science.gov (United States)

    Kotler, R. S.

    1983-01-01

    The File Comparator program, IFCOMP, is a text file comparator for IBM OS/VS-compatible systems. IFCOMP accepts two text files as input and produces a listing of differences in pseudo-update form. IFCOMP is very useful in monitoring changes made to software at the source code level.
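
    The kind of comparison described above (two text inputs in, a listing of differences out) can be sketched with Python's standard `difflib` module. This is only an illustration: the record does not describe IFCOMP's "pseudo-update form", so the unified diff format is swapped in here, and the function name `compare_texts` and the sample strings are assumptions.

```python
import difflib

def compare_texts(old_text, new_text, old_name="old", new_name="new"):
    """Return a unified-diff listing of the differences between two texts."""
    old_lines = old_text.splitlines(keepends=True)
    new_lines = new_text.splitlines(keepends=True)
    return list(
        difflib.unified_diff(old_lines, new_lines, fromfile=old_name, tofile=new_name)
    )

# Two small in-memory "files" that differ in their second line.
old = "line one\nline two\nline three\n"
new = "line one\nline 2\nline three\n"
diff = compare_texts(old, new)
print("".join(diff), end="")
```

    Lines prefixed with `-` appear only in the first input and lines prefixed with `+` only in the second, which is what makes such a listing useful for tracking source-level changes.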

  15. Classification in context

    DEFF Research Database (Denmark)

    Mai, Jens Erik

    2004-01-01

    This paper surveys classification research literature, discusses various classification theories, and shows that the focus has traditionally been on establishing a scientific foundation for classification research. This paper argues that a shift has taken place, and suggests that contemporary...... classification research focus on contextual information as the guide for the design and construction of classification schemes....

  16. Classification of the web

    DEFF Research Database (Denmark)

    Mai, Jens Erik

    2004-01-01

    This paper discusses the challenges faced by investigations into the classification of the Web and outlines inquiries that are needed to use principles for bibliographic classification to construct classifications of the Web. This paper suggests that the classification of the Web meets challenges...... that call for inquiries into the theoretical foundation of bibliographic classification theory....

  17. Zum Bildungspotenzial biblischer Texte

    Directory of Open Access Journals (Sweden)

    Theis, Joachim

    2017-11-01

    Full Text Available Biblical education as a holistic process goes far beyond biblical learning. It must be understood as a lifelong process in which biblical texts and those who seek to understand them appropriate one another in a dialogical way. The recipient's horizon of understanding is not an empty room waiting only to be filled with the text, nor is the text dead material that can only be examined cognitively. The recipient discovers the meaning of the biblical text by recomposing it through existential appropriation, so the text is brought to life in each individual reality. Scientific insights, subjective structures and the community of readers must all be included to avoid potential one-sidedness. Unfortunately, a particular negative association very often obscures the approach to the Bible: biblical work within religious education still appears in a cognitively oriented form that respects neither the vitality and sovereignty of the biblical texts nor the students' desire for meaning. Moreover, the Bible is misused for teaching moral terms or for pontificating. Such pitfalls can be avoided by a biblical didactics that is an empowerment didactics. Respecting the sovereignty of biblical texts, this didactics supports readers in their individuation by opening up the texts with a focus on the reader's otherness. Thus both the text and the recipient become subjects in a dialogue. The approach of Biblical-Enabling-Didactics lets the Bible become, ever anew, a book of life. Understood from within its hermeneutics, empowerment didactics could be raised to the principle of biblical didactics in general and grow into an essential element of holistic education.

  18. EST: Evading Scientific Text.

    Science.gov (United States)

    Ward, Jeremy

    2001-01-01

    Examines chemical engineering students' attitudes to text and other parts of English language textbooks. A questionnaire was administered to a group of undergraduates. Results reveal one way students get around the problem of textbook reading. (Author/VWL)

  19. nal Sesotho texts

    African Journals Online (AJOL)

    with literary texts written in indigenous South African languages. The project ... Homi Bhabha uses the words of Salman Rushdie to underline the fact that new .... I could not conceptualise an African-language-to-African-language dictionary. An.

  20. Plagiarism in Academic Texts

    Directory of Open Access Journals (Sweden)

    Marta Eugenia Rojas-Porras

    2012-08-01

    Full Text Available The ethical and social responsibility of citing the sources in a scientific or artistic work is undeniable. This paper explores, in a preliminary way, academic plagiarism in its various forms. It includes findings based on a forensic analysis. The purpose of this paper is to raise awareness on the importance of considering these details when writing and publishing a text. Hopefully, this analysis may put the issue under discussion.

  1. Machine Translation from Text

    Science.gov (United States)

    Habash, Nizar; Olive, Joseph; Christianson, Caitlin; McCary, John

    Machine translation (MT) from text, the topic of this chapter, is perhaps the heart of the GALE project. Beyond being a well defined application that stands on its own, MT from text is the link between the automatic speech recognition component and the distillation component. The focus of MT in GALE is on translating from Arabic or Chinese to English. The three languages represent a wide range of linguistic diversity and make the GALE MT task rather challenging and exciting.

  2. Hazard classification methodology

    International Nuclear Information System (INIS)

    Brereton, S.J.

    1996-01-01

    This document outlines the hazard classification methodology used to determine the hazard classification of the NIF LTAB, OAB, and the support facilities on the basis of radionuclides and chemicals. The hazard classification determines the safety analysis requirements for a facility

  3. TEXT Energy Storage System

    International Nuclear Information System (INIS)

    Weldon, W.F.; Rylander, H.G.; Woodson, H.H.

    1977-01-01

    The Texas Experimental Tokamak (TEXT) Energy Storage System, designed by the Center for Electromechanics (CEM), consists of four 50 MJ, 125 V homopolar generators and their auxiliaries and is designed to power the toroidal and poloidal field coils of TEXT on a two-minute duty cycle. The four 50 MJ generators connected in series were chosen because they represent the minimum cost configuration and also represent a minimal scale up from the successful 5.0 MJ homopolar generator designed, built, and operated by the CEM.

  4. New mathematical cuneiform texts

    CERN Document Server

    Friberg, Jöran

    2016-01-01

    This monograph presents in great detail a large number of both unpublished and previously published Babylonian mathematical texts in the cuneiform script. It is a continuation of the work A Remarkable Collection of Babylonian Mathematical Texts (Springer 2007) written by Jöran Friberg, the leading expert on Babylonian mathematics. Focussing on the big picture, Friberg explores in this book several Late Babylonian arithmetical and metro-mathematical table texts from the sites of Babylon, Uruk and Sippar, collections of mathematical exercises from four Old Babylonian sites, as well as a new text from Early Dynastic/Early Sargonic Umma, which is the oldest known collection of mathematical exercises. A table of reciprocals from the end of the third millennium BC, differing radically from well-documented but younger tables of reciprocals from the Neo-Sumerian and Old-Babylonian periods, as well as a fragment of a Neo-Sumerian clay tablet showing a new type of a labyrinth are also discussed. The material is presen...

  5. The Emar Lexical Texts

    NARCIS (Netherlands)

    Gantzert, Merijn

    2011-01-01

    This four-part work provides a philological analysis and a theoretical interpretation of the cuneiform lexical texts found in the Late Bronze Age city of Emar, in present-day Syria. These word and sign lists, commonly dated to around 1100 BC, were almost all found in the archive of a single school.

  6. Text Induced Spelling Correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from a very large corpus of raw text, without supervision, and contains word

  7. Texts and Readers.

    Science.gov (United States)

    Iser, Wolfgang

    1980-01-01

    Notes that, since fictional discourse need not reflect prevailing systems of meaning and norms or values, readers gain detachment from their own presuppositions; by constituting and formulating text-sense, readers are constituting and formulating their own cognition and becoming aware of the operations for doing so. (FL)

  8. Documents and legal texts

    International Nuclear Information System (INIS)

    2017-01-01

    This section treats of the following documents and legal texts: 1 - Belgium 29 June 2014 - Act amending the Act of 22 July 1985 on Third-Party Liability in the Field of Nuclear Energy; 2 - Belgium, 7 December 2016. - Act amending the Act of 22 July 1985 on Third-Party Liability in the Field of Nuclear Energy

  9. Technological and stylistic evaluation of the Early Bronze Age pottery at Tarsus-Gozlukule, Turkey: Pottery production and its interaction with economic, social, and cultural spheres

    Science.gov (United States)

    Unlu, Elif

    This dissertation presents a technological and stylistic assessment of Early Bronze Age pottery production at Tarsus-Gozlukule, a multi-period mound settlement located in the Cilician Plain in southern Turkey. Pottery production, like the making of all other man-made objects, is firstly a technological act. This dissertation maintains that the material style of an artifact (involving formal, technical, and decorative choices expressed by the artisan) should be investigated as a whole, as such an integrative study would be the most adequate way of understanding economic circumstances, social representation, and cultural boundaries. To facilitate this integrative investigation, seventy-two samples of Early Bronze Age pottery excavated from Tarsus-Gozlukule in the 1930s and 1940s were selected for mineralogical, morphological, and chemical analyses. Petrographic and powder X-Ray Diffraction analyses were performed to determine the mineralogical makeup, Environmental Scanning Electron Microscope imagery was used to determine the morphology of these samples, and semi-quantitative Energy Dispersive X-Ray Spectroscopy analysis was performed on some samples to determine chemical properties of the clays. As a result of these scientific analyses various fabric groups were established. Afterwards, formal shape and stylistic analysis was performed, in which shapes and surface treatments of the samples were analyzed and compared to known local and non-local examples. Such an integrative approach to pottery production facilitates a better definition of the local pottery production process and enables an assessment of the technological know-how of the local pottery producers, their labor organization and its role within the operating markets, their function within the sociopolitical structure, and how such issues relate to the cultural boundaries within the community. Defining the paradigm of the local pottery production process leads to a broader investigation of issues related to the technological

  10. Strategy as Texts

    DEFF Research Database (Denmark)

    Obed Madsen, Søren

    of the strategy into four categories. Second, the managers produce new texts based on the original strategy document by using four different ways of translation models. The study’s findings contribute to three areas. Firstly, it shows that translation is more than a sociological process. It is also...... a craftsmanship that requires knowledge and skills, which unfortunately seems to be overlooked in both the literature and in practice. Secondly, it shows that even though a strategy text is in singular, the translation makes strategy plural. Thirdly, the article proposes a way to open up the black box of what......This article shows empirically how managers translate a strategy plan at an individual level. By analysing how managers in three organizations translate strategies, it identifies that the translation happens in two steps: First, the managers decipher the strategy by coding the different parts...

  11. Clustering and classification of email contents

    Directory of Open Access Journals (Sweden)

    Izzat Alsmadi

    2015-01-01

    Full Text Available Users depend heavily on email systems as one of their major means of communication. The importance and usage of email continue to grow despite the evolution of mobile applications, social networks, etc. Emails are used at both the personal and professional levels, and they can be considered official documents in communication among users. Email data mining and analysis can be conducted for several purposes, such as spam detection and classification, subject classification, etc. In this paper, a large set of personal emails is used for the purpose of folder and subject classification. Algorithms are developed to perform clustering and classification for this large text collection. Classification based on n-grams is shown to be the best for such a large text collection, especially as the text is bilingual (i.e. with English and Arabic content).
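
    A minimal sketch of n-gram-based classification is given below. It is not the paper's algorithm: the record does not say whether character or word n-grams were used, so character trigrams with cosine similarity against per-label profiles are assumed here, and the labels and training messages are invented for illustration.

```python
import math
from collections import Counter

def char_ngrams(text, n=3):
    """Count overlapping character n-grams of a lowercased, space-padded text."""
    padded = f" {text.lower()} "
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))

def cosine(a, b):
    """Cosine similarity between two n-gram count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items() if gram in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

def train(labelled):
    """Build one aggregated n-gram profile per label from (text, label) pairs."""
    profiles = {}
    for text, label in labelled:
        profiles.setdefault(label, Counter()).update(char_ngrams(text))
    return profiles

def classify(text, profiles):
    """Return the label whose profile is most similar to the text."""
    grams = char_ngrams(text)
    return max(profiles, key=lambda label: cosine(grams, profiles[label]))

# Hypothetical folders and messages, for illustration only.
training = [
    ("meeting agenda for the project review", "work"),
    ("please send the quarterly project report", "work"),
    ("dinner with family this weekend", "personal"),
    ("happy birthday see you at the party", "personal"),
]
profiles = train(training)
predicted = classify("project report for the meeting", profiles)
print(predicted)
```

    Character n-grams need no language-specific tokenizer, which is one plausible reason an n-gram approach would suit a collection that mixes English and Arabic text.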

  12. SPORT FOOD ADDITIVE CLASSIFICATION

    Directory of Open Access Journals (Sweden)

    I. P. Prokopenko

    2015-01-01

    Full Text Available Correctly organized nutritive and pharmacological support is an important component of an athlete's preparation for competitions, maintenance of optimal shape, and fast recovery and rehabilitation after injuries and fatigue. Special products of enhanced biological value (BAS) for athletes' nutrition are used for this purpose. They supply the athlete's body with easy-to-use energy sources, building materials and biologically active substances that regulate and activate metabolic reactions which proceed with difficulty during certain kinds of physical training. The article presents a classification of sport supplements that can be used before warm-up and training, after training and in breaks between competitions.

  13. Reading Authentic Texts

    DEFF Research Database (Denmark)

    Balling, Laura Winther

    2013-01-01

    Most research on cognates has focused on words presented in isolation that are easily defined as cognate between L1 and L2. In contrast, this study investigates what counts as cognate in authentic texts and how such cognates are read. Participants with L1 Danish read news articles in their highly...... proficient L2, English, while their eye-movements were monitored. The experiment shows a cognate advantage for morphologically simple words, but only when cognateness is defined relative to translation equivalents that are appropriate in the context. For morphologically complex words, a cognate disadvantage...... word predictability indexed by the conditional probability of each word....

  14. Documents and legal texts

    International Nuclear Information System (INIS)

    2016-01-01

    This section treats of the following documents and legal texts: 1 - Brazil: Law No. 13,260 of 16 March 2016 (To regulate the provisions of item XLIII of Article 5 of the Federal Constitution on terrorism, dealing with investigative and procedural provisions and redefining the concept of a terrorist organisation; and amends Laws No. 7,960 of 21 December 1989 and No. 12,850 of 2 August 2013); 2 - India: The Atomic Energy (Amendment) Act, 2015; Department Of Atomic Energy Notification (Civil Liability for Nuclear Damage); 3 - Japan: Act on Subsidisation, etc. for Nuclear Damage Compensation Funds following the implementation of the Convention on Supplementary Compensation for Nuclear Damage

  15. Journalistic Text Production

    DEFF Research Database (Denmark)

    Haugaard, Rikke Hartmann

    , a multiple case study investigated three professional text producers’ practices as they unfolded in their natural setting at the Spanish newspaper, El Mundo. • Results indicate that journalists’ revisions are related to form markedly more often than to content. • Results suggest two writing phases serving...... at the Spanish newspaper, El Mundo, in Madrid. The study applied a combination of quantitative and qualitative methods, i.e. keystroke logging, participant observation and retrospective interview. Results indicate that journalists’ revisions are related to form markedly more often than to content (approx. three...

  16. Weitere Texte physiognomischen Inhalts

    Directory of Open Access Journals (Sweden)

    Böck, Barbara

    2004-12-01

    Full Text Available The present article offers the edition of three cuneiform texts belonging to the Akkadian handbook of omens drawn from the physical appearance as well as the morals and behaviour of man. The book, comprising up to 27 chapters with more than 100 omens each, was entitled Alamdimmû in antiquity. The edition of the three cuneiform tablets thus completes the author's monographic study on the ancient Mesopotamian divinatory discipline of physiognomy (Die babylonisch-assyrische Morphoskopie, Wien 2000 [= AfO Beih. 27]).

    This article presents the editio princeps of three cuneiform texts held in the British Museum (London) and the Vorderasiatisches Museum (Berlin), which belong to the Assyro-Babylonian book of physiognomic omens. This book, originally titled Alamdimmû ('form, figure'), consists of 27 chapters, each with more than one hundred omens written in Akkadian. The three texts thus complete the author's monographic study of the divinatory discipline of physiognomy in the ancient Near East (Die babylonisch-assyrische Morphoskopie, Wien 2000 [= AfO Beih. 27]).

  17. Utah Text Retrieval Project

    Energy Technology Data Exchange (ETDEWEB)

    Hollaar, L A

    1983-10-01

    The Utah Text Retrieval project seeks well-engineered solutions to the implementation of large, inexpensive, rapid text information retrieval systems. The project has three major components. Perhaps the best known is the work on the specialized processors, particularly search engines, necessary to achieve the desired performance and cost. The other two concern the user interface to the system and the system's internal structure. The work on user interface development is not only concentrating on the syntax and semantics of the query language, but also on the overall environment the system presents to the user. Environmental enhancements include convenient ways to browse through retrieved documents, access to other information retrieval systems through gateways supporting a common command interface, and interfaces to word processing systems. The system's internal structure is based on a high-level data communications protocol linking the user interface, index processor, search processor, and other system modules. This allows them to be easily distributed in a multi- or specialized-processor configuration. It also allows new modules, such as a knowledge-based query reformulator, to be added. 15 references.

  18. Raw materials resources classification and characterisation for ...

    African Journals Online (AJOL)

    Raw materials resources classification and characterisation for ceramic tableware production in Nigeria. PSA Irabor. Abstract. No Abstract. Journal of Applied Science, Engineering and Technology Vol. 2(1) 2002: 48-52.

  19. Some stylistic and syntactic devices of expansion and complication of a German advertising sentence in translation into Russian

    Directory of Open Access Journals (Sweden)

    Артур Нарманович Мамедов

    2011-06-01

    Full Text Available The translation of a source-language advertising text does not fully meet the criteria of communicative equivalence unless it adequately conveys the invariant functional dominant, that is, the construction that expands or complicates the syntactic structure of an advertising sentence. Alternative target-language correspondences that fully convey the meaning of such a construction in particular cases of its use are often found in the macrocontext during translation.

  20. Radar transmitter classification using non-stationary signal classifier

    CSIR Research Space (South Africa)

    Du Plessis, MC

    2009-07-01

    Full Text Available support vector machine which is applied to the radar pulse's time-frequency representation. The time-frequency representation is refined using particle swarm optimization to increase the classification accuracy. The classification accuracy is tested...

  1. Documents and legal texts

    International Nuclear Information System (INIS)

    2013-01-01

    This section reprints a selection of recently published legislative texts and documents: - Russian Federation: Federal Law No.170 of 21 November 1995 on the use of atomic energy, Adopted by the State Duma on 20 October 1995; - Uruguay: Law No.19.056 On the Radiological Protection and Safety of Persons, Property and the Environment (4 January 2013); - Japan: Third Supplement to Interim Guidelines on Determination of the Scope of Nuclear Damage resulting from the Accident at the Tokyo Electric Power Company Fukushima Daiichi and Daini Nuclear Power Plants (concerning Damages related to Rumour-Related Damage in the Agriculture, Forestry, Fishery and Food Industries), 30 January 2013; - France and the United States: Joint Statement on Liability for Nuclear Damage (Aug 2013); - Franco-Russian Nuclear Power Declaration (1 November 2013)

  2. The book classification of William Torrey Harris: influences of Bacon and Hegel in library classification

    Directory of Open Access Journals (Sweden)

    Rodrigo de Sales

    2017-09-01

    Full Text Available The studies of library classification generally interact with the historical contextualization approach and with the classification ideas typical of Philosophy. In the 19th century, the North-American philosopher and educator William Torrey Harris developed a book classification at the St. Louis Public School, based on Francis Bacon and Georg Wilhelm Friedrich Hegel. The objective of this essay is to analyze Harris’s classification, reflecting upon his theoretical and philosophical backgrounds. To achieve such objective, this essay adopts a critical-descriptive approach for analysis. Results show some influences of Bacon and Hegel in Harris’s classification.

  3. Sentiment classification technology based on Markov logic networks

    Science.gov (United States)

    He, Hui; Li, Zhigang; Yao, Chongchong; Zhang, Weizhe

    2016-07-01

    With diverse online media emerging, sentiment classification is attracting growing attention. At present, text sentiment classification mainly relies on supervised machine learning methods, which are domain dependent to some degree. On the basis of Markov logic networks (MLNs), this study proposed a cross-domain multi-task text sentiment classification method rooted in transfer learning. Through many-to-one knowledge transfer, knowledge from labeled text sentiment classification was successfully transferred into other domains, and the precision of the sentiment classification analysis in the text tendency domain was improved. The experimental results revealed the following: (1) the model based on an MLN demonstrated higher precision than the single individual learning plan model; (2) multi-task transfer learning based on Markov logic networks could acquire more knowledge than self-domain learning. The cross-domain text sentiment classification model could significantly improve the precision and efficiency of text sentiment classification.

  4. Interconnectedness und digitale Texte

    Directory of Open Access Journals (Sweden)

    Detlev Doherr

    2013-04-01

    Full Text Available Abstract Multimedia information services on the Internet are becoming ever more extensive and comprehensive, and libraries are also digitizing documents that exist only in print and putting them online. These documents can be found through online document management systems or search engines and then provided in common formats such as PDF. This article examines how the Humboldt Digital Library (HDL) works; for more than ten years it has made documents by Alexander von Humboldt in English translation freely available on the Web. Unlike a conventional digital library, however, it does not merely provide digitized documents as scans or PDFs but makes the text itself available in interlinked form. The system therefore resembles an information system more than a digital library, which is reflected in the available functions for finding texts in different versions and translations, comparing paragraphs of different documents, and displaying images in their context. The development of dynamic hyperlinks based on the individual text paragraphs of Humboldt's works, in the form of media assets, makes it possible to use the Google Maps programming interface for geographic as well as content-based navigation. Going beyond the service of a digital library, the HDL offers the prototype of a multidimensional information system that works with dynamic structures and enables extensive thematic analyses and comparisons. Summary The multimedia information services on Internet are becoming more and more comprehensive, even the printed documents are digitized and republished as digital Web documents by the libraries. Those digital files can be found by search engines or management tools and provided as files in usual formats as

  5. Using Shakespeare's Sotto Voce to Determine True Identity From Text

    Directory of Open Access Journals (Sweden)

    David Kernot

    2018-03-01

    Full Text Available Little is known of the private life of William Shakespeare, but he is famous for his collection of plays and poems, even though many of the works attributed to him were published anonymously. Determining the identity of Shakespeare has fascinated scholars for 400 years, and four significant figures in English literary history have been suggested as likely alternatives to Shakespeare for some disputed works: Bacon, de Vere, Stanley, and Marlowe. A myriad of computational and statistical tools and techniques have been used to determine the true authorship of his works. Many of these techniques rely on basic statistical correlations, word counts, collocated word groups, or keyword density, but no one method has been decided on. We suggest that an alternative technique that uses word semantics to draw on personality can provide an accurate profile of a person. To test this claim, we analyse the works of Shakespeare, Christopher Marlowe, and Elizabeth Cary. We use Word Accumulation Curves, Hierarchical Clustering overlays, Principal Component Analysis, and Linear Discriminant Analysis techniques in combination with RPAS, a multi-faceted text analysis approach that draws on a writer's personality, or self to identify subtle characteristics within a person's writing style. Here we find that RPAS can separate the known authored works of Shakespeare from Marlowe and Cary. Further, it separates their contested works, works suspected of being written by others. While few authorship identification techniques identify self from the way a person writes, we demonstrate that these stylistic characteristics are as applicable 400 years ago as they are today and have the potential to be used within cyberspace for law enforcement purposes.

  6. Prevailing Lexical-stylistic Features in Emirati Language Learners’ Digital Discourse | Caractéristiques lexicales stylistiques dominantes dans le discours numérique des apprenants en langue émirienne

    Directory of Open Access Journals (Sweden)

    Tsoghik Grigoryan

    2015-05-01

    Full Text Available Predicting the future path of the digital classroom discourse is twofold. Today's language classroom is undergoing an irreversible revolution, and one of the most powerful drivers of this transformation is ICT. The digital classroom exposes learners not only to the grammatical language of linguistics but also to the everyday life of the language in use (Thurlow and Mroczek, 2011). The aim of this study was to explore the nature of free digital discourse in a digital language classroom and to capture the lexical-stylistic features used in students' online conversations on a Blackboard Learn discussion board. A mixed-method case study was used to identify common or unique features of digital discourse in a paperless language classroom and to show how they affect students' speech behaviors. Today, language teaching is caught up in an irreversible upheaval, and ICT is one of the most powerful drivers of this transformation. Digital classrooms expose learners not only to linguistic grammar but also to the everyday life of the language in use (Thurlow and Mroczek, 2011). The aim of this study was to explore the nature of free digital discourse on iPad in a digital classroom for language learning and to capture the lexical-stylistic features used in the online communications of adolescent Emirati language learners. This mixed-method case study applied a theoretical sentiment-detection framework on a Blackboard learning platform to identify the common or unique features of digital discourse in a paperless classroom and to show how they affect the linguistic behaviour of students whose mother tongue is Emirati.

  7. Documents and legal texts

    International Nuclear Information System (INIS)

    2015-01-01

    This section treats of the following Documents and legal texts: 1 - Canada: Nuclear Liability and Compensation Act (An Act respecting civil liability and compensation for damage in case of a nuclear incident, repealing the Nuclear Liability Act and making consequential amendments to other acts); 2 - Japan: Act on Compensation for Nuclear Damage (The purpose of this act is to protect persons suffering from nuclear damage and to contribute to the sound development of the nuclear industry by establishing a basic system regarding compensation in case of nuclear damage caused by reactor operation etc.); Act on Indemnity Agreements for Compensation of Nuclear Damage; 3 - Slovak Republic: Act on Civil Liability for Nuclear Damage and on its Financial Coverage and on Changes and Amendments to Certain Laws (This Act regulates: a) The civil liability for nuclear damage incurred in the causation of a nuclear incident, b) The scope of powers of the Nuclear Regulatory Authority (hereinafter only as the 'Authority') in relation to the application of this Act, c) The competence of the National Bank of Slovakia in relation to the supervised financial market entities in the financial coverage of liability for nuclear damage; and d) The penalties for violation of this Act)

  8. Documents and legal texts

    International Nuclear Information System (INIS)

    2014-01-01

    This section of the Bulletin presents the recently published documents and legal texts sorted by country: - Brazil: Resolution No. 169 of 30 April 2014. - Japan: Act Concerning Exceptions to Interruption of Prescription Pertaining to Use of Settlement Mediation Procedures by the Dispute Reconciliation Committee for Nuclear Damage Compensation in relation to Nuclear Damage Compensation Disputes Pertaining to the Great East Japan Earthquake (Act No. 32 of 5 June 2013); Act Concerning Measures to Achieve Prompt and Assured Compensation for Nuclear Damage Arising from the Nuclear Plant Accident following the Great East Japan Earthquake and Exceptions to the Extinctive Prescription, etc. of the Right to Claim Compensation for Nuclear Damage (Act No. 97 of 11 December 2013); Fourth Supplement to Interim Guidelines on Determination of the Scope of Nuclear Damage Resulting from the Accident at the Tokyo Electric Power Company Fukushima Daiichi and Daini Nuclear Power Plants (Concerning Damages Associated with the Prolongation of Evacuation Orders, etc.); Outline of 'Fourth Supplement to Interim Guidelines (Concerning Damages Associated with the Prolongation of Evacuation Orders, etc.)'. - OECD Nuclear Energy Agency: Decision and Recommendation of the Steering Committee Concerning the Application of the Paris Convention to Nuclear Installations in the Process of Being Decommissioned; Joint Declaration on the Security of Supply of Medical Radioisotopes. - United Arab Emirates: Federal Decree No. (51) of 2014 Ratifying the Convention on Supplementary Compensation for Nuclear Damage; Ratification of the Federal Supreme Council of Federal Decree No. (51) of 2014 Ratifying the Convention on Supplementary Compensation for Nuclear Damage

  9. Information Gain Based Dimensionality Selection for Classifying Text Documents

    Energy Technology Data Exchange (ETDEWEB)

    Dumidu Wijayasekara; Milos Manic; Miles McQueen

    2013-06-01

    Selecting the optimal dimensions for various knowledge extraction applications is an essential component of data mining. Dimensionality selection techniques are utilized in classification applications to increase the classification accuracy and reduce the computational complexity. In text classification, where the dimensionality of the dataset is extremely high, dimensionality selection is even more important. This paper presents a novel, genetic algorithm based methodology, for dimensionality selection in text mining applications that utilizes information gain. The presented methodology uses information gain of each dimension to change the mutation probability of chromosomes dynamically. Since the information gain is calculated a priori, the computational complexity is not affected. The presented method was tested on a specific text classification problem and compared with conventional genetic algorithm based dimensionality selection. The results show an improvement of 3% in the true positives and 1.6% in the true negatives over conventional dimensionality selection methods.
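
    The information gain that the abstract above relies on can be illustrated with a minimal sketch. It assumes each document is a set of tokens and each dimension is the presence or absence of one term; the toy documents, labels and function names are hypothetical, and the genetic algorithm and its mutation schedule from the paper are not reproduced here.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum((n / total) * math.log2(n / total)
                for n in Counter(labels).values())


def information_gain(docs, labels, term):
    """Entropy reduction from splitting the documents on presence of `term`."""
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without_term = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    conditional = ((len(with_term) / n) * entropy(with_term)
                   + (len(without_term) / n) * entropy(without_term))
    return entropy(labels) - conditional


# Hypothetical toy corpus: each document is a set of tokens.
docs = [{"reactor", "fuel"}, {"reactor", "safety"},
        {"poem", "style"}, {"style", "novel"}]
labels = ["nuclear", "nuclear", "literary", "literary"]

# Score every term once, up front, and rank the dimensions by their gain.
vocabulary = sorted(set().union(*docs))
gains = {term: information_gain(docs, labels, term) for term in vocabulary}
ranked_terms = sorted(vocabulary, key=gains.get, reverse=True)
```

    Because the gains are computed once before any search starts, a selection procedure can consult the `gains` table at no extra cost per iteration, which is consistent with the abstract's remark that the a priori calculation does not affect computational complexity.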

  10. The Role of Text Mining in Export Control

    Energy Technology Data Exchange (ETDEWEB)

    Tae, Jae-woong; Son, Choul-woong; Shin, Dong-hoon [Korea Institute of Nuclear Nonproliferation and Control, Daejeon (Korea, Republic of)

    2015-10-15

    The Korean government provides classification services to exporters. Technology such as documents and drawings is simple to copy, and new technology is easily derived from existing technology. The diversity of technology makes classification difficult because the boundary between strategic and nonstrategic technology is unclear and ambiguous. Reviewers should give sufficient consideration to previous classification cases. However, the growing number of classification cases hinders consistent classification, which makes new, innovative and effective approaches necessary. IXCRS (Intelligent Export Control Review System) is proposed to meet this demand. IXCRS consists of an expert system, a semantic searching system, a full text retrieval system, an image retrieval system and a document retrieval system. The aim of the present paper is to examine the document retrieval system based on text mining and to discuss how to utilize the system. This study has demonstrated how text mining techniques can be applied to export control. The document retrieval system helps reviewers handle previous classification cases effectively. In particular, it is highly probable that similarity data will help to specify classification criteria. However, an analysis of the system revealed a number of problems that remain to be explored, such as a multilanguage problem and an inclusion relationship problem. Further research should be directed at solving these problems and applying more data mining techniques so that the system can serve as a useful tool for export control.
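
    How similarity data might help a reviewer find comparable earlier cases can be sketched as follows. The abstract does not say which similarity measure the document retrieval system uses, so TF-IDF weighting with cosine similarity is an assumption here, and the case texts, query and function names are hypothetical.

```python
import math
from collections import Counter


def tfidf_vectors(token_lists):
    """Build one sparse TF-IDF vector (a dict) per tokenized document."""
    n_docs = len(token_lists)
    doc_freq = Counter(t for tokens in token_lists for t in set(tokens))
    vectors = []
    for tokens in token_lists:
        counts = Counter(tokens)
        vectors.append({
            # Smoothed inverse document frequency keeps every weight positive.
            t: c * (math.log((1 + n_docs) / (1 + doc_freq[t])) + 1.0)
            for t, c in counts.items()
        })
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse vectors."""
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def rank_cases(query_tokens, case_token_lists):
    """Return (case index, similarity) pairs, most similar case first."""
    vectors = tfidf_vectors(case_token_lists + [query_tokens])
    query_vec, case_vecs = vectors[-1], vectors[:-1]
    scored = [(i, cosine(query_vec, v)) for i, v in enumerate(case_vecs)]
    return sorted(scored, key=lambda pair: pair[1], reverse=True)


# Hypothetical earlier classification cases and a new request.
cases = [
    ["centrifuge", "rotor", "design", "drawing"],
    ["pump", "valve", "manual"],
    ["rotor", "balancing", "centrifuge", "report"],
]
query = ["centrifuge", "rotor", "specification"]
ranked = rank_cases(query, cases)
```

    In this toy run the two centrifuge cases receive a positive score and the unrelated pump manual scores zero, so a reviewer would be pointed to the comparable precedents first.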

  11. The Role of Text Mining in Export Control

    International Nuclear Information System (INIS)

    Tae, Jae-woong; Son, Choul-woong; Shin, Dong-hoon

    2015-01-01

    The Korean government provides classification services to exporters. Technology such as documents and drawings is simple to copy, and new technology is easily derived from existing technology. The diversity of technology makes classification difficult because the boundary between strategic and nonstrategic technology is unclear and ambiguous. Reviewers should give sufficient consideration to previous classification cases. However, the growing number of classification cases hinders consistent classification, which makes new, innovative and effective approaches necessary. IXCRS (Intelligent Export Control Review System) is proposed to meet this demand. IXCRS consists of an expert system, a semantic searching system, a full text retrieval system, an image retrieval system and a document retrieval system. The aim of the present paper is to examine the document retrieval system based on text mining and to discuss how to utilize the system. This study has demonstrated how text mining techniques can be applied to export control. The document retrieval system helps reviewers handle previous classification cases effectively. In particular, it is highly probable that similarity data will help to specify classification criteria. However, an analysis of the system revealed a number of problems that remain to be explored, such as a multilanguage problem and an inclusion relationship problem. Further research should be directed at solving these problems and applying more data mining techniques so that the system can serve as a useful tool for export control.

  12. Synoptic, redactional, stylistic and narratological observations on the retelling of Mark 7:30 in Matthew 15:28

    Directory of Open Access Journals (Sweden)

    Peter-Ben A. Smit

    2014-10-01

    Full Text Available The Matthean redaction of Mark 7:30 in Matthew 15:28 often receives scant attention in scholarship in terms of its narrative quality. At most, it is regarded as a truncation of the full Marcan version, while all attention is given to Matthew’s introduction of the notion of ‘faith’ in this verse. This article argues, by contrast, and on the basis of a synoptic comparison and narratological analysis of both texts, that more justice is done to both versions of the conclusion of this healing miracle when understanding them as achieving different narrative effects, with Matthew focusing on the immediacy of the healing, while Mark creates suspense, thus focusing on the veracity of Jesus’ statement that the girl in question is healed.

  13. SHIP CLASSIFICATION FROM MULTISPECTRAL VIDEOS

    Directory of Open Access Journals (Sweden)

    Frederique Robert-Inacio

    2012-05-01

    Full Text Available Surveillance of a seaport can be achieved by different means: radar, sonar, cameras, radio communications and so on. Such surveillance aims, on the one hand, to manage cargo and tanker traffic and, on the other hand, to prevent terrorist attacks in sensitive areas. This paper presents an application to video surveillance of a seaport entrance and, more particularly, the steps that enable mobile shapes to be classified. The classification is based on a parameter measuring the degree of similarity between the shape under study and a set of reference shapes. The classification result describes the mobile object in terms of shape and speed.

  14. Facial aging: A clinical classification

    Directory of Open Access Journals (Sweden)

    Shiffman Melvin

    2007-01-01

    Full Text Available The purpose of this classification of facial aging is to have a simple clinical method to determine the severity of the aging process in the face. This allows a quick estimate as to the types of procedures that the patient would need to have the best results. Procedures that are presently used for facial rejuvenation include laser, chemical peels, suture lifts, fillers, modified facelift and full facelift. The physician is already using his best judgment to determine which procedure would be best for any particular patient. This classification may help to refine these decisions.

  15. Analysis of Tense Interferential of Verbs in Old Narrative Texts

    Directory of Open Access Journals (Sweden)

    Mahmood Barati khansari

    2014-08-01

    Full Text Available Abstract One notable device for composing stories in Persian verse and prose is the use of present-tense verbs with a past-tense meaning. This grammatical point has escaped the notice of grammarians and stylists, although it occurs repeatedly in the texts; it is not mentioned in grammar books, but some researchers and literary scholars have pointed it out in their text editions. We cite their remarks. First, Allame Qazvini, with some hesitation, mentions the interference of verb tenses and the inconsistency of tenses in his edition of Joveini's Jahangoshaye. He writes in a footnote (2-3) that the verb Mikonam ('I do') is present tense in form but simple past in meaning and that, as has been observed, in most old books the form of the verb is present while its meaning is simple past (Joveini, 1367, p. 357). Later, Fruzanfar, in the grammatical notes to his edition of Ouhadoddin Kermani's Manaqeb, points to this feature and counts it as an instance of the literary device Eltefat (Fruzanfar, 1347, p. 61). Mohammad Roushan noted this grammatical rule and, in the introduction to his book, describes the use of this kind of verb as not being based on dependent and independent verbs (Khagushi, 1361, p. 24). Yusofi, in his edition of the Bidpay Stories, points out that this feature had escaped the book's earlier editors. He says that it is a characteristic of the book's prose and adds that it is found in the present stories (Yusofi, 1364, p. 36). Finally, Dr. Shfi'ee, in his valuable notes on the Mateqol altei, mentions that this style of storytelling, with the verb in the present tense, is less common in verse; verbs were used in old Persian with the same meanings and forms as at present, but there were inconsistencies between the tense and the form of verbs in the past and the grammarians

  16. Analysis of Tense Interferential of Verbs in Old Narrative Texts

    Directory of Open Access Journals (Sweden)

    Amir Zeighami

    2014-07-01

    Full Text Available Abstract One notable device for composing stories in Persian verse and prose is the use of present-tense verbs with a past-tense meaning. This grammatical point has escaped the notice of grammarians and stylists, although it occurs repeatedly in the texts; it is not mentioned in grammar books, but some researchers and literary scholars have pointed it out in their text editions. We cite their remarks. First, Allame Qazvini, with some hesitation, mentions the interference of verb tenses and the inconsistency of tenses in his edition of Joveini's Jahangoshaye. He writes in a footnote (2-3) that the verb Mikonam ('I do') is present tense in form but simple past in meaning and that, as has been observed, in most old books the form of the verb is present while its meaning is simple past (Joveini, 1367, p. 357). Later, Fruzanfar, in the grammatical notes to his edition of Ouhadoddin Kermani's Manaqeb, points to this feature and counts it as an instance of the literary device Eltefat (Fruzanfar, 1347, p. 61). Mohammad Roushan noted this grammatical rule and, in the introduction to his book, describes the use of this kind of verb as not being based on dependent and independent verbs (Khagushi, 1361, p. 24). Yusofi, in his edition of the Bidpay Stories, points out that this feature had escaped the book's earlier editors. He says that it is a characteristic of the book's prose and adds that it is found in the present stories (Yusofi, 1364, p. 36). Finally, Dr. Shfi'ee, in his valuable notes on the Mateqol altei, mentions that this style of storytelling, with the verb in the present tense, is less common in verse; verbs were used in old Persian with the same meanings and forms as at present, but there were inconsistencies between the tense and the form of verbs in the past and

  17. Using Shakespeare's Sotto Voce to Determine True Identity From Text

    Science.gov (United States)

    Kernot, David; Bossomaier, Terry; Bradbury, Roger

    2018-01-01

    Little is known of the private life of William Shakespeare, but he is famous for his collection of plays and poems, even though many of the works attributed to him were published anonymously. Determining the identity of Shakespeare has fascinated scholars for 400 years, and four significant figures in English literary history have been suggested as likely alternatives to Shakespeare for some disputed works: Bacon, de Vere, Stanley, and Marlowe. A myriad of computational and statistical tools and techniques have been used to determine the true authorship of his works. Many of these techniques rely on basic statistical correlations, word counts, collocated word groups, or keyword density, but no one method has been decided on. We suggest that an alternative technique that uses word semantics to draw on personality can provide an accurate profile of a person. To test this claim, we analyse the works of Shakespeare, Christopher Marlowe, and Elizabeth Cary. We use Word Accumulation Curves, Hierarchical Clustering overlays, Principal Component Analysis, and Linear Discriminant Analysis techniques in combination with RPAS, a multi-faceted text analysis approach that draws on a writer's personality, or self to identify subtle characteristics within a person's writing style. Here we find that RPAS can separate the known authored works of Shakespeare from Marlowe and Cary. Further, it separates their contested works, works suspected of being written by others. While few authorship identification techniques identify self from the way a person writes, we demonstrate that these stylistic characteristics are as applicable 400 years ago as they are today and have the potential to be used within cyberspace for law enforcement purposes. PMID:29599734

  18. Statistical text classifier to detect specific type of medical incidents.

    Science.gov (United States)

    Wong, Zoie Shui-Yee; Akiyama, Masanori

    2013-01-01

    WHO Patient Safety has focused on increasing the coherence and expressiveness of patient safety classification with the foundation of the International Classification for Patient Safety (ICPS). Text classification and statistical approaches have been shown to be successful in identifying safety problems in the aviation industry using incident text information. It has been challenging to comprehend the taxonomy of medical incidents in a structured manner. Independent reporting mechanisms for patient safety incidents have been established in the UK, Canada, Australia, Japan, Hong Kong, etc. This research demonstrates the potential to construct statistical text classifiers that detect specific types of medical incidents using incident text data. An illustrative example of classifying look-alike sound-alike (LASA) medication incidents, using structured text from 227 advisories related to medication errors from Global Patient Safety Alerts (GPSA), is shown in this poster presentation. The classifier was built using a logistic regression model. The ROC curve and the AUC value indicated that the model is satisfactory.
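
    The combination named in the abstract above, a logistic regression text classifier assessed by AUC, can be sketched in a few lines. This is an illustration only: the six incident texts and their LASA labels are invented, the bag-of-words features and plain gradient descent are assumptions, and nothing here reproduces the GPSA data or the authors' model.

```python
import math


def featurize(text, vocab):
    """Bag-of-words count vector over a fixed vocabulary."""
    tokens = text.lower().split()
    return [tokens.count(term) for term in vocab]


def predict_proba(x, w, b):
    """Logistic model: probability that x belongs to the positive class."""
    return 1.0 / (1.0 + math.exp(-(sum(wj * xj for wj, xj in zip(w, x)) + b)))


def train_logistic(X, y, lr=0.1, epochs=200):
    """Fit weights by batch gradient descent on the mean log-loss."""
    n, d = len(X), len(X[0])
    w, b = [0.0] * d, 0.0
    for _ in range(epochs):
        grad_w, grad_b = [0.0] * d, 0.0
        for xi, yi in zip(X, y):
            err = predict_proba(xi, w, b) - yi
            for j in range(d):
                grad_w[j] += err * xi[j]
            grad_b += err
        w = [wj - lr * g / n for wj, g in zip(w, grad_w)]
        b -= lr * grad_b / n
    return w, b


def auc(scores, labels):
    """Area under the ROC curve: the fraction of (positive, negative) pairs
    ranked correctly, counting ties as half."""
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    wins = sum(1.0 if p > q else 0.5 if p == q else 0.0
               for p in pos for q in neg)
    return wins / (len(pos) * len(neg))


# Invented toy incidents; 1 marks a look-alike sound-alike (LASA) incident.
texts = [
    "patient given hydralazine instead of hydroxyzine similar name",
    "confused look alike packaging of two drugs",
    "sound alike names led to wrong drug dispensed",
    "patient fell from bed during night",
    "infusion pump battery failed",
    "wrong dose calculated for child",
]
labels = [1, 1, 1, 0, 0, 0]

vocab = sorted({token for text in texts for token in text.lower().split()})
X = [featurize(text, vocab) for text in texts]
w, b = train_logistic(X, labels)
train_auc = auc([predict_proba(x, w, b) for x in X], labels)
```

    The value computed here is measured on the training texts themselves and is therefore optimistic; a real evaluation would score held-out advisories.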

  19. THE PIANO COLLECTION SPRING MOOD BY O.P.NEGRUTSI: STYLISTIC AND GENRE FEATURES OF THE WORKS

    Directory of Open Access Journals (Sweden)

    GUPALOVA ELENA

    2015-12-01

    Full Text Available This article focuses on the music collection "Spring Mood" by O. Negrutsi, published in 2009 and edited by I. Stolyar. The anthology contains piano miniatures that vary in artistic idea and content. All the works in the collection share a number of teaching and performing features, a typical figurative and emotional system, and evident technical issues based on national folklore. All this makes them indispensable for the pedagogical and concert repertoire of the Special Piano course in the secondary and higher educational institutions of the Republic of Moldova.

  20. Automatic Hierarchical Color Image Classification

    Directory of Open Access Journals (Sweden)

    Jing Huang

    2003-02-01

    Full Text Available Organizing images into semantic categories can be extremely useful for content-based image retrieval and image annotation. Grouping images into semantic classes is a difficult problem, however. Image classification attempts to solve this hard problem by using low-level image features. In this paper, we propose a method for hierarchical classification of images via supervised learning. This scheme relies on using a good low-level feature and subsequently performing feature-space reconfiguration using singular value decomposition to reduce noise and dimensionality. We use the training data to obtain a hierarchical classification tree that can be used to categorize new images. Our experimental results suggest that this scheme not only performs better than standard nearest-neighbor techniques, but also has both storage and computational advantages.

  1. A Semisupervised Cascade Classification Algorithm

    Directory of Open Access Journals (Sweden)

    Stamatis Karlos

    2016-01-01

    Full Text Available Classification is one of the most important tasks of data mining, and its techniques have been adopted by several modern applications. The shortage of labeled data in the majority of these applications has shifted interest towards semisupervised methods. Under such schemes, collected unlabeled data combined with a clearly smaller set of labeled examples leads to similar or even better classification accuracy than supervised algorithms, which use labeled examples exclusively during the training phase. This paper presents a novel approach for improving semisupervised classification using the Cascade Classifier technique. The main characteristic of the Cascade Classifier strategy is the use of a base classifier to enlarge the feature space by adding either the predicted class or the class probability distribution of the initial data. The second-level classifier is supplied with the new dataset and extracts the decision for each instance. In this work, a self-trained NB∇C4.5 classifier algorithm is presented, which combines the characteristics of Naive Bayes as a base classifier and the speed of C4.5 for final classification. We performed an in-depth comparison with other well-known semisupervised classification methods on standard benchmark datasets and concluded that the presented technique has better accuracy in most cases.
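
    The cascade step described in this abstract, where a base classifier's class probabilities are appended to the original features before a second classifier decides, can be sketched as follows. This is only an illustration under stated assumptions: the base learner is a small Gaussian Naive Bayes written for the example, the second level is a 1-nearest-neighbour rule standing in for C4.5, the self-training loop over unlabeled data is omitted, and the data and all names are hypothetical.

```python
import math

class GaussianNB:
    """Tiny Gaussian Naive Bayes used as the base (first-level) classifier."""

    def fit(self, X, y):
        self.classes = sorted(set(y))
        self.priors, self.stats = {}, {}
        for c in self.classes:
            rows = [x for x, yi in zip(X, y) if yi == c]
            self.priors[c] = len(rows) / len(X)
            cols = list(zip(*rows))
            means = [sum(col) / len(col) for col in cols]
            # Small constant keeps variances strictly positive.
            variances = [sum((v - m) ** 2 for v in col) / len(col) + 1e-6
                         for col, m in zip(cols, means)]
            self.stats[c] = (means, variances)
        return self

    def predict_proba(self, x):
        logs = []
        for c in self.classes:
            means, variances = self.stats[c]
            lp = math.log(self.priors[c])
            for v, m, s2 in zip(x, means, variances):
                lp += -0.5 * math.log(2 * math.pi * s2) - (v - m) ** 2 / (2 * s2)
            logs.append(lp)
        top = max(logs)  # subtract the max for numerical stability
        exps = [math.exp(l - top) for l in logs]
        total = sum(exps)
        return [e / total for e in exps]

def augment(base, x):
    """Cascade step: original features plus the base class probabilities."""
    return list(x) + base.predict_proba(x)

def cascade_predict(base, X_aug, y, x):
    """Second level: 1-nearest neighbour (a stand-in for C4.5) on augmented rows."""
    x_aug = augment(base, x)
    def sq_dist(row):
        return sum((a - b) ** 2 for a, b in zip(row, x_aug))
    best = min(range(len(X_aug)), key=lambda i: sq_dist(X_aug[i]))
    return y[best]

# Hypothetical labeled data: two well-separated clusters.
X = [[1.0, 1.2], [0.8, 1.0], [1.1, 0.9], [3.0, 3.2], [3.1, 2.9], [2.9, 3.0]]
y = [0, 0, 0, 1, 1, 1]
base = GaussianNB().fit(X, y)
X_aug = [augment(base, x) for x in X]
```

    Calling `cascade_predict(base, X_aug, y, [1.0, 1.0])` then classifies a new point with the second-level rule on the enlarged feature space.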

  2. On the introduction of secondary fingerprint classification

    CSIR Research Space (South Africa)

    Msiza, IS

    2011-07-01

    Full Text Available The concept of fingerprint classification is an important one because of the need to, before executing a database search procedure, virtually break the fingerprint template database into smaller, manageable partitions. This is done in order to avoid...

  3. Combining multiple classifiers for age classification

    CSIR Research Space (South Africa)

    Van Heerden, C

    2009-11-01

    Full Text Available The authors compare several different classifier combination methods on a single task, namely speaker age classification. This task is well suited to combination strategies, since significantly different feature classes are employed. Support vector...

  4. Asteroid taxonomic classifications

    International Nuclear Information System (INIS)

    Tholen, D.J.

    1989-01-01

    This paper reports on three taxonomic classification schemes developed and applied to the body of available color and albedo data. Asteroid taxonomic classifications according to two of these schemes are reproduced

  5. Hand eczema classification

    DEFF Research Database (Denmark)

    Diepgen, T L; Andersen, Klaus Ejner; Brandao, F M

    2008-01-01

    of the disease is rarely evidence based, and a classification system for different subdiagnoses of hand eczema is not agreed upon. Randomized controlled trials investigating the treatment of hand eczema are called for. For this, as well as for clinical purposes, a generally accepted classification system...... A classification system for hand eczema is proposed. Conclusions It is suggested that this classification be used in clinical work and in clinical trials....

  6. Text

    International Nuclear Information System (INIS)

    Anon.

    2009-01-01

    The purpose of this act is to safeguard against the dangers and harmful effects of radioactive waste and to contribute to public safety and environmental protection by laying down requirements for the safe and efficient management of radioactive waste. The act covers definitions; its relationship to other legislation; the responsibilities of the state and local governments; the responsibilities of radioactive waste management companies and generators; formulation of the basic plan for the control of radioactive waste; radioactive waste management (including public information, financing, and part of spent fuel management); the Korea radioactive waste management corporation (business activities, budget); establishment of a radioactive waste fund to secure the financial resources required for radioactive waste management; and penalties for improper operation of radioactive waste management. (N.C.)

  7. Classification with support hyperplanes

    NARCIS (Netherlands)

    G.I. Nalbantov (Georgi); J.C. Bioch (Cor); P.J.F. Groenen (Patrick)

    2006-01-01

    A new classification method is proposed, called Support Hyperplanes (SHs). To solve the binary classification task, SHs consider the set of all hyperplanes that do not make classification mistakes, referred to as semi-consistent hyperplanes. A test object is classified using

  8. Standard classification: Physics

    International Nuclear Information System (INIS)

    1977-01-01

    This is a draft standard classification of physics. The conception is based on the physics part of the systematic catalogue of the Bayerische Staatsbibliothek and on the classification given in standard textbooks. The ICSU-AB classification now used worldwide by physics information services was not taken into account. (BJ) [de

  9. Stable classification of the energy-momentum tensor. Summary

    International Nuclear Information System (INIS)

    Guzman-Sanchez, A.R.; Przanowski, M.; Plevansky, J.

    1990-01-01

    Starting with the algebraic classification of the energy-momentum tensor given by Plebansky, it is established that this classification is unstable under versal deformations and a new (stable) classification is given. In order to keep the text to reasonable length, we just write the basic ideas and some results. (Author)

  10. THE AESTHETICS OF HAEMOTAPHONOMY: A STUDY OF THE STYLISTIC PARALLELS BETWEEN A SCIENCE AND LITERATURE AND THE VISUAL ARTS

    Directory of Open Access Journals (Sweden)

    Policarp Hortolà

    2009-01-01

    Full Text Available This study aims to explore the aesthetics of the science of haemotaphonomy (HTN) by identifying its stylistic parallels with literature and the visual arts. The object of study of this science is the cytomorphology of blood cells in bloodstains. Its subjects of study are blood-stained specimens, while its method of study is the analysis of chiaroscuro images obtained by scanning electron microscopy. In literary terms, HTN is stylistically parallel to the aesthetics of Tremendismo and Decadentism. Visually, HTN is stylistically parallel to the aesthetics of Churrigueresque architectural decoration and pictorial Tenebrism. According to the results of this study, HTN can also be regarded stylistically as a literary Realism and a visual Naturalism. Moreover, since Tremendismo, Decadentism, the Churrigueresque, and Tenebrism converge in the culture of the Baroque, it must be concluded that, aesthetically, HTN is a Baroque-leaning science.

  11. THE EMERGENCE AND EVOLUTION OF ARABESQUE AS A MULTICULTURAL STYLISTIC FUSION IN ISLAMIC ART: THE CASE OF TURKISH ARCHITECTURE

    Directory of Open Access Journals (Sweden)

    Murat Cetin, M. Arif Kamal

    2012-04-01

    Full Text Available This paper elucidates the emergence and evolution of arabesque with specific reference to the case of arabesque in Turkish art and architecture. It is argued here that arabesque is a fusion of styles rather than a pure and homogenous style. Furthermore, the paper aims to show that although the arabesque style appears to be a fanciful and freely organized manner of artistic treatment, it is based on a very complex mathematical logic which is expressed through abstractionism. In this context, the grammar of geometry is elaborately used in the implementation of abstraction. Here, general characteristics as well as different modes or types of arabesque are discussed. Starting with the etymological roots of the term, the history of its use in the literature is explored through the paper. After the inquiry of its material and pragmatic aspects, the development of the arabesque style is evaluated with regard to the transformations that took place along with its injection to Anatolia and mixing with Turkish culture. Finally, the morphological character of this fusion is put forward. Keywords: Morphological character, geometry, fusion of styles, arabesque

  12. Classification of refrigerants; Classification des fluides frigorigenes

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    2001-07-01

    This document is based on the US standard ANSI/ASHRAE 34, published in 2001 and entitled 'designation and safety classification of refrigerants'. The classification organizes, on an international basis, all refrigerants used worldwide by assigning each refrigerant a code that corresponds to its chemical composition. This note explains the coding scheme: prefix, suffixes (hydrocarbons and derived fluids, azeotropic and non-azeotropic mixtures, various organic compounds, non-organic compounds), and safety classification (toxicity, flammability, case of mixtures). (J.S.)

  13. The Study of Figurative Languages using Stylistics Theory in What My Mother Doesn’t Know by Sonya Sones

    Directory of Open Access Journals (Sweden)

    Winnie Winnie

    2008-11-01

    Full Text Available What My Mother Doesn’t Know is a novel-in-verse by Sonya Sones about a teenage girl named Sophie who tries to find her Mr. Right and who has a bad relationship with her parents. In the end, Sophie finds her Mr. Right, who is not actually her type of boyfriend but can make her feel happy, and although she hates her parents, she realizes that she loves them nevertheless. The purpose of this article is to prove that the author’s use of style can reveal the themes of the novel-in-verse. The analysis focuses mainly on the themes and on the figurative language of simile, metaphor, personification, paradox, and hyperbole. The research uses a qualitative method in the form of library research, together with statistics to count how many poems use the five types of figurative language and are related to the themes. In the analysis, the author’s style is figurative language, and three themes can be taken from the novel-in-verse: an overwhelmed feeling does not show the true feeling; the bond of family will always be there, no matter how hateful we are to them; and love is not determined by someone’s physical appearance. The result shows that 91.36 percent of the poems that use the five types of figurative language are related to the themes. It can be concluded that the author’s style in writing the novel-in-verse can reveal the three themes.

  14. Teaching Text Structure: Examining the Affordances of Children's Informational Texts

    Science.gov (United States)

    Jones, Cindy D.; Clark, Sarah K.; Reutzel, D. Ray

    2016-01-01

    This study investigated the affordances of informational texts to serve as model texts for teaching text structure to elementary school children. Content analysis of a random sampling of children's informational texts from top publishers was conducted on text structure organization and on the inclusion of text features as signals of text…

  15. Fuzzy One-Class Classification Model Using Contamination Neighborhoods

    Directory of Open Access Journals (Sweden)

    Lev V. Utkin

    2012-01-01

    Full Text Available A fuzzy classification model is studied in the paper. It is based on the contaminated (robust) model, which produces fuzzy expected risk measures characterizing classification errors. Optimal classification parameters of the models are derived by minimizing the fuzzy expected risk. It is shown that an algorithm for computing the classification parameters is reduced to a set of standard support vector machine tasks with weighted data points. Experimental results with synthetic data illustrate the proposed fuzzy model.

  16. Classification, disease, and diagnosis.

    Science.gov (United States)

    Jutel, Annemarie

    2011-01-01

    Classification shapes medicine and guides its practice. Understanding classification must be part of the quest to better understand the social context and implications of diagnosis. Classifications are part of the human work that provides a foundation for the recognition and study of illness: deciding how the vast expanse of nature can be partitioned into meaningful chunks, stabilizing and structuring what is otherwise disordered. This article explores the aims of classification, their embodiment in medical diagnosis, and the historical traditions of medical classification. It provides a brief overview of the aims and principles of classification and their relevance to contemporary medicine. It also demonstrates how classifications operate as social framing devices that enable and disable communication, assert and refute authority, and are important items for sociological study.

  17. Issues surrounding the classification of accounting information

    Directory of Open Access Journals (Sweden)

    Huibrecht Van der Poll

    2011-06-01

    Full Text Available The act of classifying information created by accounting practices is ubiquitous in the accounting process; from recording to reporting, it has almost become second nature. The classification has to correspond to the requirements and demands of the changing environment in which it is practised. Evidence suggests that the current classification of items in financial statements is not keeping pace with the needs of users and the new financial constructs generated by the industry. This study addresses the issue of classification in two ways: by means of a critical analysis of classification theory and practices and by means of a questionnaire that was developed and sent to compilers and users of financial statements. A new classification framework for accounting information in the balance sheet and income statement is proposed.

  18. Important Text Characteristics for Early-Grades Text Complexity

    Science.gov (United States)

    Fitzgerald, Jill; Elmore, Jeff; Koons, Heather; Hiebert, Elfrieda H.; Bowen, Kimberly; Sanford-Moore, Eleanor E.; Stenner, A. Jackson

    2015-01-01

    The Common Core set a standard for all children to read increasingly complex texts throughout schooling. The purpose of the present study was to explore text characteristics specifically in relation to early-grades text complexity. Three hundred fifty primary-grades texts were selected and digitized. Twenty-two text characteristics were identified…

  19. Learning From Short Text Streams With Topic Drifts.

    Science.gov (United States)

    Li, Peipei; He, Lu; Wang, Haiyan; Hu, Xuegang; Zhang, Yuhong; Li, Lei; Wu, Xindong

    2017-09-18

    Short text streams such as search snippets and microblogs have become popular on the Web with the emergence of social media. Unlike traditional text streams, these data are characterized by short length, weak signal, high volume, high velocity, topic drift, and so on. Short text stream classification is hence a very challenging and significant task. However, this challenge has received little attention from the research community. Therefore, a new feature extension approach is proposed for short text stream classification with the help of a large-scale semantic network obtained from a Web corpus. It is built on an incremental ensemble classification model for efficiency. First, more semantic contexts based on the senses of terms in short texts are introduced to make up for the data sparsity using the open semantic network, in which all terms are disambiguated by their semantics to reduce the impact of noise. Second, a concept cluster-based topic drift detection method is proposed to effectively track hidden topic drifts. Finally, extensive studies demonstrate that, compared to several well-known concept drift detection methods for data streams, our approach detects topic drifts effectively, and that it handles short text streams effectively while maintaining efficiency compared to several state-of-the-art short text classification approaches.

  20. Security classification of information

    Energy Technology Data Exchange (ETDEWEB)

    Quist, A.S.

    1993-04-01

    This document is the second of a planned four-volume work that comprehensively discusses the security classification of information. The main focus of Volume 2 is on the principles for classification of information. Included herein are descriptions of the two major types of information that governments classify for national security reasons (subjective and objective information), guidance to use when determining whether information under consideration for classification is controlled by the government (a necessary requirement for classification to be effective), information disclosure risks and benefits (the benefits and costs of classification), standards to use when balancing information disclosure risks and benefits, guidance for assigning classification levels (Top Secret, Secret, or Confidential) to classified information, guidance for determining how long information should be classified (classification duration), classification of associations of information, classification of compilations of information, and principles for declassifying and downgrading information. Rules or principles of certain areas of our legal system (e.g., trade secret law) are sometimes mentioned to provide added support to some of those classification principles.

  1. Using ontology network structure in text mining.

    Science.gov (United States)

    Berndt, Donald J; McCart, James A; Luther, Stephen L

    2010-11-13

    Statistical text mining treats documents as bags of words, with a focus on term frequencies within documents and across document collections. Unlike natural language processing (NLP) techniques that rely on an engineered vocabulary or a full-featured ontology, statistical approaches do not make use of domain-specific knowledge. The freedom from biases can be an advantage, but at the cost of ignoring potentially valuable knowledge. The approach proposed here investigates a hybrid strategy based on computing graph measures of term importance over an entire ontology and injecting the measures into the statistical text mining process. As a starting point, we adapt existing search engine algorithms such as PageRank and HITS to determine term importance within an ontology graph. The graph-theoretic approach is evaluated using a smoking data set from the i2b2 National Center for Biomedical Computing, cast as a simple binary classification task for categorizing smoking-related documents, demonstrating consistent improvements in accuracy.
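
    The hybrid idea in this abstract, scoring terms by a graph measure over an ontology and feeding that score into a term-frequency representation, can be sketched with a plain power-iteration PageRank. The sketch rests on assumptions that are not in the abstract: the five-term toy ontology, the child-to-parent edge direction, the damping factor of 0.85, and the simple product of term count and rank used as the injected weight. The names `pagerank` and `weight_terms` are hypothetical.

```python
def pagerank(graph, damping=0.85, iterations=50):
    """Power-iteration PageRank over a dict mapping node -> list of out-links."""
    nodes = sorted(set(graph) | {t for targets in graph.values() for t in targets})
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    for _ in range(iterations):
        new = {v: (1.0 - damping) / n for v in nodes}
        for v in nodes:
            targets = graph.get(v, [])
            if targets:
                share = damping * rank[v] / len(targets)
                for t in targets:
                    new[t] += share
            else:
                # Dangling node: spread its rank evenly over all nodes.
                share = damping * rank[v] / n
                for t in nodes:
                    new[t] += share
        rank = new
    return rank

def weight_terms(term_counts, rank):
    """Scale raw term counts by ontology importance (terms off the graph get 0)."""
    return {term: count * rank.get(term, 0.0)
            for term, count in term_counts.items()}

# Hypothetical ontology fragment; each edge points from a term to its parent.
ontology = {
    "cigarette": ["tobacco product"],
    "cigar": ["tobacco product"],
    "tobacco product": ["substance"],
    "nicotine": ["substance"],
    "substance": [],
}
rank = pagerank(ontology)
weighted = weight_terms({"cigarette": 3, "nicotine": 1, "weather": 2}, rank)
```

    With child-to-parent edges, general concepts accumulate rank from their descendants; reversing the edges, or using HITS hub and authority scores as the abstract also mentions, would emphasize different terms.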

  2. CLASSIFICATION OF LEARNING MANAGEMENT SYSTEMS

    Directory of Open Access Journals (Sweden)

    Yu. B. Popova

    2016-01-01

    Full Text Available The use of information technologies, and learning management systems in particular, expands the opportunities of teachers and students to reach their educational goals. Such systems provide learning content, help organize and monitor training, collect progress statistics, and take into account the individual characteristics of each user. Currently, there is a huge inventory of both paid and free systems, physically located either on college servers or in the cloud, that offer different feature sets, licensing schemes, and costs. This creates the problem of choosing the best system, a problem partly due to the lack of a comprehensive classification of such systems. Analysis of more than 30 of the most common automated learning management systems has shown that such systems should be classified according to defined criteria under which systems of the same type can be compared. The classification features proposed by the author are: cost, functionality, modularity, meeting the customer’s requirements, content integration, the physical location of the system, and adaptability of training. Considering learning management systems within these classifications, and taking into account current trends in their development, makes it possible to identify the main requirements for them: functionality, reliability, ease of use, low cost, support for the SCORM standard or the Tin Can API, modularity, and adaptability. In line with these requirements, the Software Department of FITR BNTU has been developing, using, and continuously improving its own learning management system since 2009 under the guidance of the author.

  3. Text genres and registers the computation of linguistic features

    CERN Document Server

    Fang, Chengyu Alex

    2015-01-01

    This book is a description of some of the most recent advances in text classification as part of a concerted effort to achieve computer understanding of human language. In particular, it addresses state-of-the-art developments in the computation of higher-level linguistic features, ranging from etymology to grammar and syntax for the practical task of text classification according to genres, registers and subject domains. Serving as a bridge between computational methods and sophisticated linguistic analysis, this book will be of particular interest to academics and students of computational linguistics as well as professionals in natural language engineering.

  4. GLOBAL LAND COVER CLASSIFICATION USING MODIS SURFACE REFLECTANCE PRODUCTS

    Directory of Open Access Journals (Sweden)

    K. Fukue

    2016-06-01

    Full Text Available The objective of this study is to develop a high-accuracy land cover classification algorithm at global scale using multi-temporal MODIS land reflectance products. In this study, a time-domain co-occurrence matrix was introduced as a classification feature that provides a time-series signature of land covers. Further, a non-parametric minimum distance classifier was introduced for the time-domain co-occurrence matrix; it performs multi-dimensional pattern matching between the time-domain co-occurrence matrices of a classification target pixel and of each classification class. Global land cover classification experiments were conducted by applying the proposed classification method to 46 multi-temporal (in one year) SR (Surface Reflectance) and NBAR (Nadir BRDF-Adjusted Reflectance) products, respectively. The 17 IGBP land cover categories were used in the classification experiments. As a result, the SR and NBAR products showed similar classification accuracy of 99%.
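
    One possible reading of the feature and classifier named in this abstract is sketched below. The abstract does not define the time-domain co-occurrence matrix, so the construction here is an assumption: reflectance values are quantized into a few levels, and the matrix records how often level a at one time step is followed by level b at the next. The L1 distance, the two reference classes, and the sample series are likewise hypothetical.

```python
def cooccurrence(series, levels=4, lag=1):
    """Normalized counts of (level at time t, level at time t + lag) pairs.

    Values are assumed to lie in [0, 1]; each is quantized into `levels` bins.
    """
    q = [min(int(v * levels), levels - 1) for v in series]
    pairs = list(zip(q, q[lag:]))
    matrix = [[0.0] * levels for _ in range(levels)]
    for a, b in pairs:
        matrix[a][b] += 1.0 / len(pairs)
    return matrix

def distance(m1, m2):
    """L1 distance between two matrices of the same shape."""
    return sum(abs(x - y) for r1, r2 in zip(m1, m2) for x, y in zip(r1, r2))

def classify(series, references):
    """Minimum distance rule: pick the class whose reference matrix is closest."""
    m = cooccurrence(series)
    return min(references, key=lambda name: distance(m, references[name]))

# Hypothetical one-band reference series for two land cover classes.
references = {
    "evergreen forest": cooccurrence([0.8] * 8),
    "cropland": cooccurrence([0.1, 0.3, 0.6, 0.9, 0.9, 0.6, 0.3, 0.1]),
}
stable_pixel = [0.78, 0.80, 0.82, 0.79, 0.80, 0.81, 0.80, 0.77]
seasonal_pixel = [0.15, 0.35, 0.55, 0.85, 0.95, 0.65, 0.30, 0.05]
```

    Here `classify(stable_pixel, references)` compares one pixel's matrix against each class reference; the study itself works with 46 multi-temporal products per year rather than a single short series.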

  5. Definition and classification of epilepsy. Classification of epileptic seizures 2016

    Directory of Open Access Journals (Sweden)

    K. Yu. Mukhin

    2017-01-01

    Full Text Available Epilepsy is one of the most common neurological diseases, especially in childhood and adolescence. The incidence varies from 15 to 113 cases per 100 000 population, with the maximum among children under 1 year old. The prevalence of epilepsy is high, ranging from 5 to 8 cases (in some regions, 10 cases) per 1000 children under 15 years old. Classification of the disease is of great importance for diagnosis, treatment, and prognosis. The article presents a novel strategy for the classification of epileptic seizures, developed in 2016. It contains a number of brand new concepts, including the very important one that some seizures previously considered as only generalized or only focal can, in fact, be both focal and generalized. They include tonic, atonic, and myoclonic seizures and epileptic spasms. The term “secondarily generalized seizure” is replaced by the term “bilateral tonic-clonic seizure”, since it is not a separate type of epileptic seizure and the term reflects the spread of discharge from any area of the cerebral cortex and the evolution of any type of focal seizure. The International League Against Epilepsy recommends abandoning the term “pseudo-epileptic seizures” and replacing it with the term “psychogenic non-epileptic seizures”. If a doctor is not sure that seizures are epileptic in nature, the term “paroxysmal event” should be used without specifying the disease. The concept of childhood epileptic encephalopathies, developed within this novel classification project, is one of the most significant achievements, since in this case not only the seizures but even epileptiform activity can induce severe disorders of higher mental functions. In addition to a detailed description of the new strategy for the classification of epileptic seizures, the article contains a comprehensive review of the existing principles of epilepsy and epileptic seizure classification.

  6. Emotion models for textual emotion classification

    Science.gov (United States)

    Bruna, O.; Avetisyan, H.; Holub, J.

    2016-11-01

    This paper deals with textual emotion classification, which has gained attention in recent years. Emotion classification is used in user experience, product evaluation, national security, and tutoring applications. It attempts to detect the emotional content in the input text and, based on different approaches, establish what kind of emotional content is present, if any. Textual emotion classification is the most difficult to handle, since it relies mainly on linguistic resources and introduces many challenges in assigning text to an emotion represented by a proper model. A crucial part of each emotion detector is the emotion model. The focus of this paper is to introduce emotion models used for classification. Categorical and dimensional models of emotion are explained and some more advanced approaches are mentioned.

  7. Transcending the Versification of Oraliture: Song-Text as Oral Performance among the Ilaje

    Directory of Open Access Journals (Sweden)

    N. Akingbe

    2013-12-01

    Full Text Available Oraliture is a term often employed to describe the various genres of oral literature, such as proverbs, legends, short stories, traditional songs and rhymes, song-poems, historical narratives, traditional symbols, images, oral performance, myths, and other traditional stylistic devices. All these devices constitute vibrant appurtenances of oral narrative performance in Africa. Oral narrative performance is invariably situated within the domain of social communication, which brings together the raconteur/performer and the audience towards the realisation of communal entertainment. While the narrator/performer plays the leading role in an oral performance, the audience’s involvement and participation is realised through song, verbal/choral responses, gestures, and/or instrumental/musical accompaniment. This oral practice usually takes place at one time or another in various African communities during festivals and ritual/religious processions, and ranges from story-telling to the recitation of poems, song-text, and dancing. This paper is essentially concerned with illustrating the use of song-text as oral performance among the Ilaje, a burgeoning coastal subethnic group of the Yoruba in South Western Nigeria. The paper further examines how patriotism, history, death, and anti-social behaviours are evaluated through the use of songs among the Ilaje.

  8. MT Post-editing: A Text Repair Experience for the Foreign Language Class.

    Directory of Open Access Journals (Sweden)

    Ana Niño

    2007-04-01

    Full Text Available Communication also means having to sort out the problems involved in learning a foreign language, especially with regard to production rather than reception. These learning strategies or skills can also be applied to translation teaching methodology, where students put into practice their risk-taking, avoidance, reduction, and/or compensatory strategies in getting the message across. We acknowledge translation as a writing task constrained by the source text. In addition, the translation and the writing cycles have in common a generation stage and a revision stage where grammatical, lexical, and stylistic correctness is assessed. Somewhere in the middle between translation and writing skills lies MT (Machine Translation) post-editing, which involves correcting the raw MT output with the aim of providing a quality text according to the intended purpose. Our research is intended to test the suitability of MT post-editing as an activity to promote error correction and, subsequently, to enhance written production in second and foreign language teaching.

  9. Text analysis methods, text analysis apparatuses, and articles of manufacture

    Science.gov (United States)

    Whitney, Paul D; Willse, Alan R; Lopresti, Charles A; White, Amanda M

    2014-10-28

    Text analysis methods, text analysis apparatuses, and articles of manufacture are described according to some aspects. In one aspect, a text analysis method includes accessing information indicative of data content of a collection of text comprising a plurality of different topics, using a computing device, analyzing the information indicative of the data content, and using results of the analysis, identifying a presence of a new topic in the collection of text.

  10. Ontologies vs. Classification Systems

    DEFF Research Database (Denmark)

    Madsen, Bodil Nistrup; Erdman Thomsen, Hanne

    2009-01-01

    What is an ontology compared to a classification system? Is a taxonomy a kind of classification system or a kind of ontology? These are questions that we meet when working with people from industry and public authorities, who need methods and tools for concept clarification, for developing meta...... data sets or for obtaining advanced search facilities. In this paper we will present an attempt at answering these questions. We will give a presentation of various types of ontologies and briefly introduce terminological ontologies. Furthermore we will argue that classification systems, e.g. product...... classification systems and meta data taxonomies, should be based on ontologies....

  11. Classroom Texting in College Students

    Science.gov (United States)

    Pettijohn, Terry F.; Frazier, Erik; Rieser, Elizabeth; Vaughn, Nicholas; Hupp-Wilds, Bobbi

    2015-01-01

    A 21-item survey on texting in the classroom was given to 235 college students. Overall, 99.6% of students owned a cellphone and 98% texted daily. Of the 138 students who texted in the classroom, most texted friends or significant others and indicated that the reason for classroom texting was boredom or work. Students who texted sent a mean of 12.21…

  12. The language of poetic texts in contemporary Tuvan pop songs

    Directory of Open Access Journals (Sweden)

    Oyumaa M. Saaya

    2017-06-01

    Full Text Available The article presents a linguistic analysis of the lyrics of modern Tuvan pop songs. While studying them is important for understanding contemporary songwriting in Tuva, it is also necessary to discover what linguistic means, functional styles and vocabulary are used by modern authors of popular lyrics. The study can also help identify how contemporary global trends influence songwriting in linguistic terms. Three groups of songs can be defined in Tuvan pop music. The first comprises songs written by both professional poets and amateurs with good writing skills. Their texts have a homogeneous literary style and are intended for a general audience (rather than specific groups of listeners). They do not feature any jargon or youth slang. The second group consists of “songs of the people”, which are still popular and relevant but not classified as folklore. This group also contains songs previously banned by censorship, and those written by ex-convicts. Their lyrics differ in style, and the vocabulary is also heterogeneous: they can include slang and contain vernacular language. The third group includes songs following popular global and Russian trends, which triggered rapid evolution in Tuvan songwriting. There is a significant number of authors, or even creative unions, who write both lyrics and music. These songs are stylistically uneven and contain many neologisms, borrowed vocabulary, slang and jargon words, and sometimes even macaronic (mixed) language. The author provides a more in-depth analysis of lyrics belonging to the third group of songs. They can be divided into 6 thematic subgroups which vary greatly in lexical content and the use of tropes. The lyrics of contemporary Tuvan songs are quite close to the everyday language young people use. Active employment of jargon in the language of young and middle-aged people, especially in lyrics of modern songs, steadily erodes the literary norms of the Tuvan language. The author emphasizes that

  13. Observation of [Formula: see text] and [Formula: see text] decays.

    Science.gov (United States)

    Aaij, R; Adeva, B; Adinolfi, M; Ajaltouni, Z; Akar, S; Albrecht, J; Alessio, F; Alexander, M; Ali, S; Alkhazov, G; Alvarez Cartelle, P; Alves, A A; Amato, S; Amerio, S; Amhis, Y; An, L; Anderlini, L; Andreassi, G; Andreotti, M; Andrews, J E; Appleby, R B; Archilli, F; d'Argent, P; Arnau Romeu, J; Artamonov, A; Artuso, M; Aslanides, E; Auriemma, G; Baalouch, M; Babuschkin, I; Bachmann, S; Back, J J; Badalov, A; Baesso, C; Baker, S; Baldini, W; Barlow, R J; Barschel, C; Barsuk, S; Barter, W; Baszczyk, M; Batozskaya, V; Batsukh, B; Battista, V; Bay, A; Beaucourt, L; Beddow, J; Bedeschi, F; Bediaga, I; Bel, L J; Bellee, V; Belloli, N; Belous, K; Belyaev, I; Ben-Haim, E; Bencivenni, G; Benson, S; Benton, J; Berezhnoy, A; Bernet, R; Bertolin, A; Betancourt, C; Betti, F; Bettler, M-O; van Beuzekom, M; Bezshyiko, Ia; Bifani, S; Billoir, P; Bird, T; Birnkraut, A; Bitadze, A; Bizzeti, A; Blake, T; Blanc, F; Blouw, J; Blusk, S; Bocci, V; Boettcher, T; Bondar, A; Bondar, N; Bonivento, W; Bordyuzhin, I; Borgheresi, A; Borghi, S; Borisyak, M; Borsato, M; Bossu, F; Boubdir, M; Bowcock, T J V; Bowen, E; Bozzi, C; Braun, S; Britsch, M; Britton, T; Brodzicka, J; Buchanan, E; Burr, C; Bursche, A; Buytaert, J; Cadeddu, S; Calabrese, R; Calvi, M; Calvo Gomez, M; Camboni, A; Campana, P; Campora Perez, D H; Capriotti, L; Carbone, A; Carboni, G; Cardinale, R; Cardini, A; Carniti, P; Carson, L; Carvalho Akiba, K; Casse, G; Cassina, L; Castillo Garcia, L; Cattaneo, M; Cauet, Ch; Cavallero, G; Cenci, R; Charles, M; Charpentier, Ph; Chatzikonstantinidis, G; Chefdeville, M; Chen, S; Cheung, S-F; Chobanova, V; Chrzaszcz, M; Cid Vidal, X; Ciezarek, G; Clarke, P E L; Clemencic, M; Cliff, H V; Closier, J; Coco, V; Cogan, J; Cogneras, E; Cogoni, V; Cojocariu, L; Collazuol, G; Collins, P; Comerma-Montells, A; Contu, A; Cook, A; Coombs, G; Coquereau, S; Corti, G; Corvo, M; Costa Sobral, C M; Couturier, B; Cowan, G A; Craik, D C; Crocombe, A; Cruz Torres, M; Cunliffe, S; Currie, R; D'Ambrosio, C; 
Da Cunha Marinho, F; Dall'Occo, E; Dalseno, J; David, P N Y; Davis, A; De Aguiar Francisco, O; De Bruyn, K; De Capua, S; De Cian, M; De Miranda, J M; De Paula, L; De Serio, M; De Simone, P; Dean, C-T; Decamp, D; Deckenhoff, M; Del Buono, L; Demmer, M; Dendek, A; Derkach, D; Deschamps, O; Dettori, F; Dey, B; Di Canto, A; Dijkstra, H; Dordei, F; Dorigo, M; Dosil Suárez, A; Dovbnya, A; Dreimanis, K; Dufour, L; Dujany, G; Dungs, K; Durante, P; Dzhelyadin, R; Dziurda, A; Dzyuba, A; Déléage, N; Easo, S; Ebert, M; Egede, U; Egorychev, V; Eidelman, S; Eisenhardt, S; Eitschberger, U; Ekelhof, R; Eklund, L; Ely, S; Esen, S; Evans, H M; Evans, T; Falabella, A; Farley, N; Farry, S; Fay, R; Fazzini, D; Ferguson, D; Fernandez Prieto, A; Ferrari, F; Ferreira Rodrigues, F; Ferro-Luzzi, M; Filippov, S; Fini, R A; Fiore, M; Fiorini, M; Firlej, M; Fitzpatrick, C; Fiutowski, T; Fleuret, F; Fohl, K; Fontana, M; Fontanelli, F; Forshaw, D C; Forty, R; Franco Lima, V; Frank, M; Frei, C; Fu, J; Furfaro, E; Färber, C; Gallas Torreira, A; Galli, D; Gallorini, S; Gambetta, S; Gandelman, M; Gandini, P; Gao, Y; Garcia Martin, L M; García Pardiñas, J; Garra Tico, J; Garrido, L; Garsed, P J; Gascon, D; Gaspar, C; Gavardi, L; Gazzoni, G; Gerick, D; Gersabeck, E; Gersabeck, M; Gershon, T; Ghez, Ph; Gianì, S; Gibson, V; Girard, O G; Giubega, L; Gizdov, K; Gligorov, V V; Golubkov, D; Golutvin, A; Gomes, A; Gorelov, I V; Gotti, C; Govorkova, E; Grabalosa Gándara, M; Graciani Diaz, R; Granado Cardoso, L A; Graugés, E; Graverini, E; Graziani, G; Grecu, A; Griffith, P; Grillo, L; Gruberg Cazon, B R; Grünberg, O; Gushchin, E; Guz, Yu; Gys, T; Göbel, C; Hadavizadeh, T; Hadjivasiliou, C; Haefeli, G; Haen, C; Haines, S C; Hall, S; Hamilton, B; Han, X; Hansmann-Menzemer, S; Harnew, N; Harnew, S T; Harrison, J; Hatch, M; He, J; Head, T; Heister, A; Hennessy, K; Henrard, P; Henry, L; Hernando Morata, J A; van Herwijnen, E; Heß, M; Hicheur, A; Hill, D; Hombach, C; Hopchev, H; Hulsbergen, W; Humair, T; Hushchyn, 
M; Hussain, N; Hutchcroft, D; Idzik, M; Ilten, P; Jacobsson, R; Jaeger, A; Jalocha, J; Jans, E; Jawahery, A; Jiang, F; John, M; Johnson, D; Jones, C R; Joram, C; Jost, B; Jurik, N; Kandybei, S; Kanso, W; Karacson, M; Kariuki, J M; Karodia, S; Kecke, M; Kelsey, M; Kenyon, I R; Kenzie, M; Ketel, T; Khairullin, E; Khanji, B; Khurewathanakul, C; Kirn, T; Klaver, S; Klimaszewski, K; Koliiev, S; Kolpin, M; Komarov, I; Koopman, R F; Koppenburg, P; Kosmyntseva, A; Kozachuk, A; Kozeiha, M; Kravchuk, L; Kreplin, K; Kreps, M; Krokovny, P; Kruse, F; Krzemien, W; Kucewicz, W; Kucharczyk, M; Kudryavtsev, V; Kuonen, A K; Kurek, K; Kvaratskheliya, T; Lacarrere, D; Lafferty, G; Lai, A; Lanfranchi, G; Langenbruch, C; Latham, T; Lazzeroni, C; Le Gac, R; van Leerdam, J; Lees, J-P; Leflat, A; Lefrançois, J; Lefèvre, R; Lemaitre, F; Lemos Cid, E; Leroy, O; Lesiak, T; Leverington, B; Li, Y; Likhomanenko, T; Lindner, R; Linn, C; Lionetto, F; Liu, B; Liu, X; Loh, D; Longstaff, I; Lopes, J H; Lucchesi, D; Lucio Martinez, M; Luo, H; Lupato, A; Luppi, E; Lupton, O; Lusiani, A; Lyu, X; Machefert, F; Maciuc, F; Maev, O; Maguire, K; Malde, S; Malinin, A; Maltsev, T; Manca, G; Mancinelli, G; Manning, P; Maratas, J; Marchand, J F; Marconi, U; Marin Benito, C; Marino, P; Marks, J; Martellotti, G; Martin, M; Martinelli, M; Martinez Santos, D; Martinez Vidal, F; Martins Tostes, D; Massacrier, L M; Massafferri, A; Matev, R; Mathad, A; Mathe, Z; Matteuzzi, C; Mauri, A; Maurin, B; Mazurov, A; McCann, M; McCarthy, J; McNab, A; McNulty, R; Meadows, B; Meier, F; Meissner, M; Melnychuk, D; Merk, M; Merli, A; Michielin, E; Milanes, D A; Minard, M-N; Mitzel, D S; Mogini, A; Molina Rodriguez, J; Monroy, I A; Monteil, S; Morandin, M; Morawski, P; Mordà, A; Morello, M J; Moron, J; Morris, A B; Mountain, R; Muheim, F; Mulder, M; Mussini, M; Müller, D; Müller, J; Müller, K; Müller, V; Naik, P; Nakada, T; Nandakumar, R; Nandi, A; Nasteva, I; Needham, M; Neri, N; Neubert, S; Neufeld, N; Neuner, M; Nguyen, A D; 
Nguyen, T D; Nguyen-Mau, C; Nieswand, S; Niet, R; Nikitin, N; Nikodem, T; Novoselov, A; O'Hanlon, D P; Oblakowska-Mucha, A; Obraztsov, V; Ogilvy, S; Oldeman, R; Onderwater, C J G; Otalora Goicochea, J M; Otto, A; Owen, P; Oyanguren, A; Pais, P R; Palano, A; Palombo, F; Palutan, M; Panman, J; Papanestis, A; Pappagallo, M; Pappalardo, L L; Parker, W; Parkes, C; Passaleva, G; Pastore, A; Patel, G D; Patel, M; Patrignani, C; Pearce, A; Pellegrino, A; Penso, G; Pepe Altarelli, M; Perazzini, S; Perret, P; Pescatore, L; Petridis, K; Petrolini, A; Petrov, A; Petruzzo, M; Picatoste Olloqui, E; Pietrzyk, B; Pikies, M; Pinci, D; Pistone, A; Piucci, A; Playfer, S; Plo Casasus, M; Poikela, T; Polci, F; Poluektov, A; Polyakov, I; Polycarpo, E; Pomery, G J; Popov, A; Popov, D; Popovici, B; Poslavskii, S; Potterat, C; Price, E; Price, J D; Prisciandaro, J; Pritchard, A; Prouve, C; Pugatch, V; Puig Navarro, A; Punzi, G; Qian, W; Quagliani, R; Rachwal, B; Rademacker, J H; Rama, M; Ramos Pernas, M; Rangel, M S; Raniuk, I; Ratnikov, F; Raven, G; Redi, F; Reichert, S; Dos Reis, A C; Remon Alepuz, C; Renaudin, V; Ricciardi, S; Richards, S; Rihl, M; Rinnert, K; Rives Molina, V; Robbe, P; Rodrigues, A B; Rodrigues, E; Rodriguez Lopez, J A; Rodriguez Perez, P; Rogozhnikov, A; Roiser, S; Rollings, A; Romanovskiy, V; Romero Vidal, A; Ronayne, J W; Rotondo, M; Rudolph, M S; Ruf, T; Ruiz Valls, P; Saborido Silva, J J; Sadykhov, E; Sagidova, N; Saitta, B; Salustino Guimaraes, V; Sanchez Mayordomo, C; Sanmartin Sedes, B; Santacesaria, R; Santamarina Rios, C; Santimaria, M; Santovetti, E; Sarti, A; Satriano, C; Satta, A; Saunders, D M; Savrina, D; Schael, S; Schellenberg, M; Schiller, M; Schindler, H; Schlupp, M; Schmelling, M; Schmelzer, T; Schmidt, B; Schneider, O; Schopper, A; Schubert, K; Schubiger, M; Schune, M-H; Schwemmer, R; Sciascia, B; Sciubba, A; Semennikov, A; Sergi, A; Serra, N; Serrano, J; Sestini, L; Seyfert, P; Shapkin, M; Shapoval, I; Shcheglov, Y; Shears, T; Shekhtman, L; 
Shevchenko, V; Siddi, B G; Silva Coutinho, R; Silva de Oliveira, L; Simi, G; Simone, S; Sirendi, M; Skidmore, N; Skwarnicki, T; Smith, E; Smith, I T; Smith, J; Smith, M; Snoek, H; Sokoloff, M D; Soler, F J P; Souza De Paula, B; Spaan, B; Spradlin, P; Sridharan, S; Stagni, F; Stahl, M; Stahl, S; Stefko, P; Stefkova, S; Steinkamp, O; Stemmle, S; Stenyakin, O; Stevenson, S; Stoica, S; Stone, S; Storaci, B; Stracka, S; Straticiuc, M; Straumann, U; Sun, L; Sutcliffe, W; Swientek, K; Syropoulos, V; Szczekowski, M; Szumlak, T; T'Jampens, S; Tayduganov, A; Tekampe, T; Tellarini, G; Teubert, F; Thomas, E; van Tilburg, J; Tilley, M J; Tisserand, V; Tobin, M; Tolk, S; Tomassetti, L; Tonelli, D; Topp-Joergensen, S; Toriello, F; Tournefier, E; Tourneur, S; Trabelsi, K; Traill, M; Tran, M T; Tresch, M; Trisovic, A; Tsaregorodtsev, A; Tsopelas, P; Tully, A; Tuning, N; Ukleja, A; Ustyuzhanin, A; Uwer, U; Vacca, C; Vagnoni, V; Valassi, A; Valat, S; Valenti, G; Vallier, A; Vazquez Gomez, R; Vazquez Regueiro, P; Vecchi, S; van Veghel, M; Velthuis, J J; Veltri, M; Veneziano, G; Venkateswaran, A; Vernet, M; Vesterinen, M; Viaud, B; Vieira, D; Vieites Diaz, M; Viemann, H; Vilasis-Cardona, X; Vitti, M; Volkov, V; Vollhardt, A; Voneki, B; Vorobyev, A; Vorobyev, V; Voß, C; de Vries, J A; Vázquez Sierra, C; Waldi, R; Wallace, C; Wallace, R; Walsh, J; Wang, J; Ward, D R; Wark, H M; Watson, N K; Websdale, D; Weiden, A; Whitehead, M; Wicht, J; Wilkinson, G; Wilkinson, M; Williams, M; Williams, M P; Williams, M; Williams, T; Wilson, F F; Wimberley, J; Wishahi, J; Wislicki, W; Witek, M; Wormser, G; Wotton, S A; Wraight, K; Wyllie, K; Xie, Y; Xing, Z; Xu, Z; Yang, Z; Yin, H; Yu, J; Yuan, X; Yushchenko, O; Zarebski, K A; Zavertyaev, M; Zhang, L; Zhang, Y; Zhang, Y; Zhelezov, A; Zheng, Y; Zhokhov, A; Zhu, X; Zhukov, V; Zucchelli, S

    2017-01-01

    The decays [Formula: see text] and [Formula: see text] are observed for the first time using a data sample corresponding to an integrated luminosity of 3.0 fb[Formula: see text], collected by the LHCb experiment in proton-proton collisions at the centre-of-mass energies of 7 and 8[Formula: see text]. The branching fractions relative to that of [Formula: see text] are measured to be [Formula: see text] where the first uncertainties are statistical and the second are systematic.

  14. Mining the Text: 34 Text Features that Can Ease or Obstruct Text Comprehension and Use

    Science.gov (United States)

    White, Sheida

    2012-01-01

    This article presents 34 characteristics of texts and tasks ("text features") that can make continuous (prose), noncontinuous (document), and quantitative texts easier or more difficult for adolescents and adults to comprehend and use. The text features were identified by examining the assessment tasks and associated texts in the national…

  15. Saving our science from ourselves: the plight of biological classification

    Directory of Open Access Journals (Sweden)

    Malte C. Ebach

    2011-06-01

    Full Text Available Biological classification (nomenclature, taxonomy, and systematics) is being sold short. The desire for new technologies, faster and cheaper taxonomic descriptions, identifications, and revisions is symptomatic of a lack of appreciation and understanding of classification. The problem of gadget-driven science, a lack of best practice and the inability to accept classification as a descriptive and empirical science are discussed. The worst-case scenario is a future in which classifications are purely artificial and uninformative.

  16. From Text to Political Positions: Text analysis across disciplines

    NARCIS (Netherlands)

    Kaal, A.R.; Maks, I.; van Elfrinkhof, A.M.E.

    2014-01-01

    From Text to Political Positions addresses cross-disciplinary innovation in political text analysis for party positioning. Drawing on political science, computational methods and discourse analysis, it presents a diverse collection of analytical models including pure quantitative and

  17. Classification of radiological procedures

    International Nuclear Information System (INIS)

    1989-01-01

    A classification for departments in Danish hospitals which use radiological procedures. The classification codes consist of 4 digits, where the first 2 are the codes for the main groups. The first digit represents the procedure's topographical object and the second the techniques. The last 2 digits describe individual procedures. (CLS)
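    The four-digit structure described in this record can be illustrated with a minimal Python sketch. The function name `split_procedure_code`, the field names and the sample code "1234" are all hypothetical; the abstract does not give the actual Danish code tables, so only the positional split is shown.

```python
from typing import NamedTuple


class ProcedureCode(NamedTuple):
    """Parts of a 4-digit radiological procedure code (field names are illustrative)."""
    main_group: str   # digits 1-2 together: the main group
    topography: str   # digit 1: topographical object of the procedure
    technique: str    # digit 2: technique used
    procedure: str    # digits 3-4: the individual procedure


def split_procedure_code(code: str) -> ProcedureCode:
    """Split a code into its parts; raise ValueError unless it is exactly 4 ASCII digits."""
    if len(code) != 4 or not (code.isascii() and code.isdigit()):
        raise ValueError(f"expected a 4-digit code, got {code!r}")
    return ProcedureCode(
        main_group=code[:2],
        topography=code[0],
        technique=code[1],
        procedure=code[2:],
    )


# "1234" is a made-up code, used only to show the positional split.
print(split_procedure_code("1234"))
```

    Keeping the code as a string rather than an integer preserves any leading zero, which would matter if a main group such as "01" exists.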

  18. Colombia: Territorial classification

    International Nuclear Information System (INIS)

    Mendoza Morales, Alberto

    1998-01-01

    The article discusses approaches to territorial classification, thematic axes, principles for the management and occupation of territory, political and administrative units, and administrative regions, among other topics. Territorial classification is understood here as the spatial distribution, across the country's territory, of geographical configurations, human communities, political-administrative units and land uses, both urban and rural, existing and proposed.

  19. Munitions Classification Library

    Science.gov (United States)

    2016-04-04

    members of the community to make their own additions to any, or all, of the classification libraries. The next phase entailed data collection over less... Report form data: 04/04/2016; Final Report; August 2014 - August 2015; Munitions Classification Library; Mr. Craig Murray, Parsons; Dr. Thomas H. Bell, Leidos

  20. Recursive automatic classification algorithms

    Energy Technology Data Exchange (ETDEWEB)

    Bauman, E V; Dorofeyuk, A A

    1982-03-01

    A variational statement of the automatic classification problem is given. The dependence of the form of the optimal partition surface on the form of the classification objective functional is investigated. A recursive algorithm is proposed for maximising a functional of reasonably general form. The convergence problem is analysed in connection with the proposed algorithm. 8 references.

  1. Library Classification 2020

    Science.gov (United States)

    Harris, Christopher

    2013-01-01

    In this article the author explores how a new library classification system might be designed using some aspects of the Dewey Decimal Classification (DDC) and ideas from other systems to create something that works for school libraries in the year 2020. By examining what works well with the Dewey Decimal System, what features should be carried…

  2. Spectroscopic classification of transients

    DEFF Research Database (Denmark)

    Stritzinger, M. D.; Fraser, M.; Hummelmose, N. N.

    2017-01-01

    We report the spectroscopic classification of several transients based on observations taken with the Nordic Optical Telescope (NOT) equipped with ALFOSC, over the nights 23-25 August 2017.

  3. Odor Classification using Agent Technology

    Directory of Open Access Journals (Sweden)

    Sigeru OMATU

    2014-03-01

    Full Text Available In order to measure and classify odors, a Quartz Crystal Microbalance (QCM) can be used. In the present study, seven QCM sensors and three different odors are used. The system has been developed as a virtual organization of agents using an agent platform called PANGEA (Platform for Automatic coNstruction of orGanizations of intElligent Agents). This is a platform for developing open multi-agent systems, specifically those including organizational aspects. The main reason for the use of agents is the scalability of the platform, i.e. the way in which it models the services. The system models functionalities as services inside the agents, or as Service Oriented Approach (SOA) architecture-compliant services using Web Services. This allows the odor classification system to be adapted with new algorithms, tools and classification techniques.

  4. Text mining from ontology learning to automated text processing applications

    CERN Document Server

    Biemann, Chris

    2014-01-01

    This book comprises a set of articles that specify the methodology of text mining, describe the creation of lexical resources in the framework of text mining and use text mining for various tasks in natural language processing (NLP). The analysis of large amounts of textual data is a prerequisite to build lexical resources such as dictionaries and ontologies and also has direct applications in automated text processing in fields such as history, healthcare and mobile applications, just to name a few. This volume gives an update in terms of the recent gains in text mining methods and reflects

  5. DOE LLW classification rationale

    International Nuclear Information System (INIS)

    Flores, A.Y.

    1991-01-01

    This report describes the US Department of Energy's rationale for low-level radioactive waste (LLW) classification. It is based on the Nuclear Regulatory Commission's classification system. DOE site operators met to review the qualifications and characteristics of the classification systems. They evaluated performance objectives, developed waste classification tables, and compiled dose limits on the waste. A goal of the LLW classification system was to allow each disposal site the freedom to develop limits to radionuclide inventories and concentrations according to its own site-specific characteristics. This goal was achieved with the adoption of a performance objectives system based on a performance assessment, with site-specific environmental conditions and engineered disposal systems.

  6. Constructing criticality by classification

    DEFF Research Database (Denmark)

    Machacek, Erika

    2017-01-01

    " in the bureaucratic practice of classification: Experts construct material criticality in assessments as they allot information on the materials to the parameters of the assessment framework. In so doing, they ascribe a new set of connotations to the materials, namely supply risk, and their importance to clean energy......, legitimizing a criticality discourse. Specifically, the paper introduces a typology delineating the inferences made by the experts from their produced recommendations in the classification of rare earth element criticality. The paper argues that the classification is a specific process of constructing risk....... It proposes that the expert bureaucratic practice of classification legitimizes (i) the valorisation that was made in the drafting of the assessment framework for the classification, and (ii) political operationalization when enacted that might have (non-)distributive implications for the allocation of public...

  7. Working with text tools, techniques and approaches for text mining

    CERN Document Server

    Tourte, Gregory J L

    2016-01-01

    Text mining tools and technologies have long been a part of the repository world, where they have been applied to a variety of purposes, from pragmatic aims to support tools. Research areas as diverse as biology, chemistry, sociology and criminology have seen effective use made of text mining technologies. Working With Text collects a subset of the best contributions from the 'Working with text: Tools, techniques and approaches for text mining' workshop, alongside contributions from experts in the area. Text mining tools and technologies in support of academic research include supporting research on the basis of a large body of documents, facilitating access to and reuse of extant work, and bridging between the formal academic world and areas such as traditional and social media. Jisc have funded a number of projects, including NaCTeM (the National Centre for Text Mining) and the ResDis programme. Contents are developed from workshop submissions and invited contributions, including: Legal considerations in te...

  8. Variación cultural, técnicas y procedimientos estilísticos a propósito de las autotraducciones al castellano de Carme Riera / Cultural variation, techniques and stylistic procedures used by Carme Riera to Spanish self-translation

    Directory of Open Access Journals (Sweden)

    Luisa Cotoner Cerdó

    2011-09-01

    Full Text Available The purpose of this article is to survey the techniques and stylistic procedures used by the writer Carme Riera to transfer her own work from the original Catalan to Spanish. The process of seeking to create the same aesthetic impact on a new Spanish readership has the effect of liberating the author, enabling her to modify, adapt, vary, enlarge or omit passages of the original text. Riera considers translation only as an exercise in recreation. As a product of this perspective, her recreations are not only successful in captivating a Spanish readership but also reflect one of the essential characteristics of her work: a multifaceted vision of reality, since her self-translations are a close parallel to the multiplicity of world views inherent in the diversity of languages. Her procedures may at times be questionable, but they lead to a new and enriching final standpoint.

  9. Classifications of patterned hair loss: a review

    Directory of Open Access Journals (Sweden)

    Mrinal Gupta

    2016-01-01

    Full Text Available Patterned hair loss is the most common cause of hair loss seen in both the sexes after puberty. Numerous classification systems have been proposed by various researchers for grading purposes. These systems vary from the simpler systems based on recession of the hairline to the more advanced multifactorial systems based on the morphological and dynamic parameters that affect the scalp and the hair itself. Most of these preexisting systems have certain limitations. Currently, the Hamilton-Norwood classification system for males and the Ludwig system for females are most commonly used to describe patterns of hair loss. In this article, we review the various classification systems for patterned hair loss in both the sexes. Relevant articles were identified through searches of MEDLINE and EMBASE. Search terms included but were not limited to androgenic alopecia classification, patterned hair loss classification, male pattern baldness classification, and female pattern hair loss classification. Further publications were identified from the reference lists of the reviewed articles.

  10. A simple phenotypic classification for celiac disease

    Directory of Open Access Journals (Sweden)

    Ajit Sood

    2018-04-01

    Full Text Available Background/Aims: Celiac disease is a global health problem. The presentation of celiac disease has unfolded over years and it is now known that it can manifest at different ages, has varied presentations, and is prone to develop complications, if not managed properly. Although the Oslo definitions provide consensus on the various terminologies used in literature, there is no phenotypic classification providing a composite diagnosis for the disease. Methods: The variables identified for phenotypic classification included age at diagnosis, age at onset of symptoms, clinical presentation, family history and complications. These were applied to the existing registry of 1,664 patients at Dayanand Medical College and Hospital, Ludhiana, India. In addition, age was evaluated as below 15 and below 18 years. Cross tabulations were used for the verification of the classification using the existing data. Expert opinion was sought from both international and national experts of varying fields. Results: After empirical verification, age at diagnosis was divided into A1 (<18 years) and A2 (≥18 years). The disease presentation has been classified into 3 types: P1 (classical), P2 (non-classical) and P3 (asymptomatic). Complications were considered as absent (C0) or present (C1). A single phenotypic classification based on these 3 characteristics, namely age at diagnosis, clinical presentation, and intestinal complications (APC classification), was derived. Conclusions: The APC classification (age at diagnosis, presentation, complications) is a simple, disease-explanatory classification for patients with celiac disease aimed at providing a composite diagnosis.
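    As a rough illustration of how the three APC characteristics in this record combine, the following Python sketch maps a patient's data to the A, P and C categories named in the abstract. The function name `apc_label` and the concatenated output format (for example "A1P2C0") are assumptions made for illustration; the abstract does not specify how a composite diagnosis is written down.

```python
def apc_label(age_at_diagnosis: int, presentation: str, has_complications: bool) -> str:
    """Return an illustrative APC label; the concatenated format is an assumption."""
    # Presentation categories as listed in the abstract.
    presentation_codes = {
        "classical": "P1",
        "non-classical": "P2",
        "asymptomatic": "P3",
    }
    if presentation not in presentation_codes:
        raise ValueError(f"unknown presentation: {presentation!r}")
    if age_at_diagnosis < 0:
        raise ValueError("age at diagnosis cannot be negative")

    # A1: diagnosed before 18 years of age; A2: diagnosed at 18 or older.
    age_code = "A1" if age_at_diagnosis < 18 else "A2"
    # C0: complications absent; C1: complications present.
    complication_code = "C1" if has_complications else "C0"
    return age_code + presentation_codes[presentation] + complication_code


# Hypothetical patients, not taken from the registry described above.
print(apc_label(12, "classical", False))
print(apc_label(34, "asymptomatic", True))
```

    With two age groups, three presentations and two complication states, a scheme of this shape yields 2 × 3 × 2 = 12 possible phenotypes.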

  11. An edit script for taxonomic classifications

    Directory of Open Access Journals (Sweden)

    Valiente Gabriel

    2005-08-01

    Full Text Available Background: The NCBI taxonomy provides one of the most powerful ways to navigate sequence databases, but currently users are forced to formulate queries according to a single taxonomic classification. Given that there is no universal agreement on the classification of organisms, providing a single classification places constraints on the questions biologists can ask. However, maintaining multiple classifications is burdensome in the face of a constantly growing NCBI classification. Results: In this paper, we present a solution to the problem of generating modifications of the NCBI taxonomy, based on the computation of an edit script that summarises the differences between two classification trees. Our algorithms find the shortest possible edit script based on the identification of all shared subtrees, and take only quasi-linear time in the size of the trees because classification trees have unique node labels. Conclusion: These algorithms have been recently implemented, and the software is freely available for download from http://darwin.zoology.gla.ac.uk/~rpage/forest/.
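    To give a feel for why unique node labels make tree comparison cheap, here is a deliberately naive Python sketch. It is not the authors' shortest-edit-script algorithm based on shared subtrees; it is a simple per-node comparison of parent links. The name `naive_edit_script`, the operation names and the toy trees are hypothetical.

```python
from typing import Dict, List, Optional, Tuple

# A tree is stored as: node label -> parent label (None for the root).
# This representation relies on every label being unique within a tree.
ParentMap = Dict[str, Optional[str]]
Operation = Tuple[str, str, Optional[str]]  # (operation, label, new parent or None)


def naive_edit_script(old: ParentMap, new: ParentMap) -> List[Operation]:
    """List per-node differences between two labelled trees (not a minimal script)."""
    script: List[Operation] = []
    # Labels that exist only in the old tree were removed.
    for label in sorted(old.keys() - new.keys()):
        script.append(("delete", label, None))
    # Labels that exist only in the new tree were added under their new parent.
    for label in sorted(new.keys() - old.keys()):
        script.append(("insert", label, new[label]))
    # Labels in both trees whose parent changed were moved.
    for label in sorted(old.keys() & new.keys()):
        if old[label] != new[label]:
            script.append(("move", label, new[label]))
    return script


# Toy trees for illustration only; they are not extracted from the NCBI taxonomy.
old_tree: ParentMap = {"Life": None, "Animalia": "Life", "Fungi": "Life", "Chordata": "Animalia"}
new_tree: ParentMap = {
    "Life": None,
    "Opisthokonta": "Life",
    "Animalia": "Opisthokonta",
    "Fungi": "Opisthokonta",
    "Chordata": "Animalia",
}

for operation in naive_edit_script(old_tree, new_tree):
    print(operation)
```

    Because each label occurs at most once per tree, every node can be matched by a dictionary lookup, so no search over candidate node correspondences is needed. The list is a summary of differences; it is not ordered for safe replay.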

  12. Understanding about the classification of pulp inflammation

    Directory of Open Access Journals (Sweden)

    Trijoedani Widodo

    2007-03-01

    Full Text Available Although most authors now use the reversible pulpitis and irreversible pulpitis classification, many dentists still do not apply this newer classification. The research used a descriptive method, administering a questionnaire to dentists from various dental clinics. Twenty-two dentists participated. All respondents use a diagnosis sheet when examining patients. Nonetheless, it could not be determined which diagnosis card was used, and most of the dentists are still using the old classification. Responses concerning the new classification were: (a) the respondent had heard of the new classification, but it was not clear (36.3%); (b) the respondent had never heard of the new classification at all (63.6%). Responses concerning whether new developments are important to follow up were: (a) information concerning new developments is very important (27.2%); (b) it is important to have new information (68.3%); (c) new information is not important (8%). It was concluded that information concerning the development of the classification of pulp inflammation did not reach the dentists.

  13. Modified angle's classification for primary dentition

    Directory of Open Access Journals (Sweden)

    Kaushik Narendra Chandranee

    2017-01-01

    Full Text Available Aim: This study aims to propose a modification of Angle's classification for primary dentition and to assess its applicability in children from Nagpur, central India. Methods: A modification of Angle's classification has been proposed for application in primary dentition. Small roman numerals i/ii/iii are used for primary dentition notation to represent Angle's Class I/II/III molar relationships as in permanent dentition, respectively. To assess the applicability of the modified Angle's classification, a cross-sectional sample of 2000 preschool children from central India, 3–6 years of age and residing in the metropolitan city of Nagpur in Maharashtra state, was selected randomly as per the inclusion and exclusion criteria. Results: The majority (93.35%) of children were found to have bilateral Class i, followed by 2.5% with bilateral Class ii and 0.2% with bilateral half cusp Class iii molar relationships as per the modified Angle's classification for primary dentition. About 3.75% of children had various combinations of Class ii relationships and 0.2% of children had a Class iii subdivision relationship. Conclusions: A modification of Angle's classification for application in primary dentition has been proposed. A cross-sectional investigation using the new classification revealed 6.25% Class ii and 0.4% Class iii molar relationship cases in the preschool population of the metropolitan city of Nagpur. Application of the modified Angle's classification to other population groups is warranted to validate its routine application in clinical pediatric dentistry.
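    The percentages reported in this record can be cross-checked with a short Python sketch. The dictionary keys are informal labels chosen here, and the conversion to head counts assumes that every percentage refers to the full sample of 2000 children, which the abstract implies but does not state outright.

```python
from decimal import Decimal

SAMPLE_SIZE = 2000  # children in the cross-sectional sample

# Percentages as reported in the Results section (keys are informal labels).
reported = {
    "bilateral Class i": Decimal("93.35"),
    "bilateral Class ii": Decimal("2.5"),
    "bilateral half cusp Class iii": Decimal("0.2"),
    "Class ii combinations": Decimal("3.75"),
    "Class iii subdivision": Decimal("0.2"),
}

# The five categories should account for the whole sample.
total = sum(reported.values())

# Head counts implied by each percentage, assuming the full sample as denominator.
counts = {name: pct * SAMPLE_SIZE / 100 for name, pct in reported.items()}

# Aggregates quoted in the Conclusions section.
class_ii_total = reported["bilateral Class ii"] + reported["Class ii combinations"]
class_iii_total = reported["bilateral half cusp Class iii"] + reported["Class iii subdivision"]

print("total percent:", total)
for name, count in counts.items():
    print(f"{name}: {count} children")
print("all Class ii:", class_ii_total, "percent")
print("all Class iii:", class_iii_total, "percent")
```

    `Decimal` is used instead of `float` so that the sums are exact. The five categories add up to 100%, and the two aggregates reproduce the 6.25% and 0.4% figures given in the Conclusions.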

  14. Informational Text and the CCSS

    Science.gov (United States)

    Aspen Institute, 2012

    2012-01-01

    What constitutes an informational text covers a broad swath of different types of texts. Biographies & memoirs, speeches, opinion pieces & argumentative essays, and historical, scientific or technical accounts of a non-narrative nature are all included in what the Common Core State Standards (CCSS) envisions as informational text. Also included…

  15. The Only Safe SMS Texting Is No SMS Texting.

    Science.gov (United States)

    Toth, Cheryl; Sacopulos, Michael J

    2015-01-01

    Many physicians and practice staff use short messaging service (SMS) text messaging to communicate with patients. But SMS text messaging is unencrypted, insecure, and does not meet HIPAA requirements. In addition, the short and abbreviated nature of text messages creates opportunities for misinterpretation, and can negatively impact patient safety and care. Until recently, asking patients to sign a statement that they understand and accept these risks--as well as having policies, device encryption, and cyber insurance in place--would have been enough to mitigate the risk of using SMS text in a medical practice. But new trends and policies have made SMS text messaging unsafe under any circumstance. This article explains these trends and policies, as well as why only secure texting or secure messaging should be used for physician-patient communication.

  16. Internet and library classification as determinants of students ...

    African Journals Online (AJOL)

    Internet and library classification as determinants of students utilisation of information resources in University of Calabar Library. ... DOWNLOAD FULL TEXT Open Access DOWNLOAD FULL TEXT Subscription or Fee Access ...

  17. Predicting Prosody from Text for Text-to-Speech Synthesis

    CERN Document Server

    Rao, K Sreenivasa

    2012-01-01

    Predicting Prosody from Text for Text-to-Speech Synthesis covers the specific aspects of prosody, mainly focusing on how to predict the prosodic information from linguistic text, and then how to exploit the predicted prosodic knowledge for various speech applications. Author K. Sreenivasa Rao discusses proposed methods along with state-of-the-art techniques for the acquisition and incorporation of prosodic knowledge for developing speech systems. Positional, contextual and phonological features are proposed for representing the linguistic and production constraints of the sound units present in the text. This book is intended for graduate students and researchers working in the area of speech processing.

  18. Monitoring interaction and collective text production through text mining

    Directory of Open Access Journals (Sweden)

    Macedo, Alexandra Lorandi

    2014-04-01

    Full Text Available This article presents the Concepts Network tool, developed using text mining technology. The main objective of this tool is to extract and relate the terms of greatest incidence in a text and to exhibit the results in the form of a graph. The Network was implemented in the Collective Text Editor (CTE), which is an online tool that allows the production of texts in synchronized or non-synchronized forms. This article describes the application of the Network both in texts produced collectively and in texts produced in a forum. The purpose of the tool is to offer support to the teacher in managing the high volume of data generated in the process of interaction amongst students and in the construction of the text. Specifically, the aim is to facilitate the teacher’s job by allowing him/her to process data in a shorter time than is currently demanded. The results suggest that the Concepts Network can aid the teacher, as it provides indicators of the quality of the text produced. Moreover, messages posted in forums can be analyzed without their content necessarily having to be pre-read.
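
    The idea of extracting the most frequent terms of a text and relating them in a graph can be illustrated with a minimal sketch. This is not the Concepts Network implementation, which the abstract does not describe in detail. The stop-word list, the sentence-level co-occurrence rule, and the names tokenize and concept_graph are assumptions made for illustration only.

```python
import re
from collections import Counter
from itertools import combinations

# Hypothetical stop-word list, for illustration only.
STOPWORDS = {"the", "a", "of", "and", "in", "to", "is", "that", "on"}


def tokenize(sentence):
    """Lowercase a sentence and keep alphabetic tokens that are not stop words."""
    return [w for w in re.findall(r"[a-z]+", sentence.lower()) if w not in STOPWORDS]


def concept_graph(text, top_n=5):
    """Return (top terms, weighted edges) for the most frequent terms in `text`.

    An edge (a, b) counts the sentences in which both terms occur
    (an assumed relation; other co-occurrence windows are possible).
    """
    sentences = [tokenize(s) for s in re.split(r"[.!?]+", text) if s.strip()]
    freq = Counter(w for s in sentences for w in s)
    top = [w for w, _ in freq.most_common(top_n)]
    keep = set(top)
    edges = Counter()
    for s in sentences:
        present = sorted(set(s) & keep)
        for a, b in combinations(present, 2):
            edges[(a, b)] += 1
    return top, dict(edges)


# Made-up sample text.
sample = (
    "Students write the text together. "
    "The teacher reads the text. "
    "Students ask the teacher questions in the forum."
)
terms, edges = concept_graph(sample, top_n=3)
print(terms)
print(edges)
```

    Each edge key is an alphabetically ordered pair of terms, so the resulting dictionary can be handed to any graph-drawing tool as a weighted edge list.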

  19. Text recycling: acceptable or misconduct?

    Science.gov (United States)

    Harriman, Stephanie; Patel, Jigisha

    2014-08-16

    Text recycling, also referred to as self-plagiarism, is the reproduction of an author's own text from a previous publication in a new publication. Opinions on the acceptability of this practice vary, with some viewing it as acceptable and efficient, and others as misleading and unacceptable. In light of the lack of consensus, journal editors often have difficulty deciding how to act upon the discovery of text recycling. In response to these difficulties, we have created a set of guidelines for journal editors on how to deal with text recycling. In this editorial, we discuss some of the challenges of developing these guidelines, and how authors can avoid undisclosed text recycling.

  20. TEXT DEIXIS IN NARRATIVE SEQUENCES

    Directory of Open Access Journals (Sweden)

    Josep Rivera

    2007-06-01

    Full Text Available This study looks at demonstrative descriptions, regarding them as text-deictic procedures which contribute to weave discourse reference. Text deixis is thought of as a metaphorical referential device which maps the ground of utterance onto the text itself. Demonstrative expressions with textual antecedent-triggers, considered as the most important text-deictic units, are identified in a narrative corpus consisting of J. M. Barrie’s Peter Pan and its translation into Catalan. Some linguistic and discourse variables related to DemNPs are analysed to characterise adequately text deixis. It is shown that this referential device is usually combined with abstract nouns, thus categorising and encapsulating (non-nominal complex discourse entities as nouns, while performing a referential cohesive function by means of the text deixis + general noun type of lexical cohesion.

  1. KNN BASED CLASSIFICATION OF DIGITAL MODULATED SIGNALS

    Directory of Open Access Journals (Sweden)

    Sajjad Ahmed Ghauri

    2016-11-01

    Full Text Available Demodulation without knowledge of the modulation scheme requires Automatic Modulation Classification (AMC). When the receiver has limited information about the received signal, AMC becomes an essential process. AMC has an important place in many civil and military fields, such as modern electronic warfare, interfering source recognition, frequency management, and link adaptation. In this paper we explore the use of K-nearest neighbor (KNN) for modulation classification with different distance measurement methods. Five modulation schemes are used for classification: Binary Phase Shift Keying (BPSK), Quadrature Phase Shift Keying (QPSK), Quadrature Amplitude Modulation (QAM), 16-QAM and 64-QAM. Higher order cumulants (HOC) are used as the input feature set to the classifier. Simulation results show that the proposed classification method provides better results for the considered modulation formats.
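
    A KNN classifier with interchangeable distance measures, as described above, can be sketched in a few lines. The feature vectors below are made-up numbers standing in for cumulant features, not values from the paper, and the three distance functions are common choices that the abstract does not name.

```python
import math
from collections import Counter


def euclidean(a, b):
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def manhattan(a, b):
    return sum(abs(x - y) for x, y in zip(a, b))


def chebyshev(a, b):
    return max(abs(x - y) for x, y in zip(a, b))


def knn_predict(train, query, k=3, distance=euclidean):
    """Majority vote among the k training points closest to `query`.

    train: list of (feature_vector, label) pairs.
    """
    neighbours = sorted(train, key=lambda item: distance(item[0], query))[:k]
    votes = Counter(label for _, label in neighbours)
    return votes.most_common(1)[0][0]


# Made-up two-dimensional features, for illustration only.
train = [
    ((-2.0, 0.1), "BPSK"), ((-1.9, -0.1), "BPSK"), ((-2.1, 0.0), "BPSK"),
    ((-1.0, 0.1), "QPSK"), ((-0.9, -0.1), "QPSK"), ((-1.1, 0.0), "QPSK"),
    ((-0.68, 0.05), "16-QAM"), ((-0.66, -0.05), "16-QAM"), ((-0.70, 0.0), "16-QAM"),
]

for metric in (euclidean, manhattan, chebyshev):
    print(metric.__name__, knn_predict(train, (-1.95, 0.02), k=3, distance=metric))
```

    Passing the distance as a function keeps the voting logic unchanged while the metric varies, which is the comparison the abstract describes.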

  2. Enterprise Potential: Essence, Classification and Types

    Directory of Open Access Journals (Sweden)

    Turylo Anatolii M.

    2014-02-01

    Full Text Available The article considers existing approaches to the classification of enterprise potential as an economic notion. It offers its own view of the classification of enterprise potential, which meets modern tendencies of enterprise development. Classification makes possible a wider description and assessment of enterprise potential and also allows identification of its most significant characteristics. The classification of enterprise potential is developed by different criteria: by functions, by resource support, by ability to adapt, by the level of detection, by the spectrum of possibilities taken into account, by the period of coverage of possibilities and by the level of use. Analysis of the components of enterprise potential allows a complete and trustworthy assessment of the state of an enterprise. The adaptation potential of an enterprise is based on the principles of systematicity and dynamism; it characterises the possibilities of adjusting an enterprise to external and internal economic conditions.

  3. Multivariate Approaches to Classification in Extragalactic Astronomy

    Directory of Open Access Journals (Sweden)

    Didier Fraix-Burnet

    2015-08-01

    Full Text Available Clustering objects into synthetic groups is a natural activity of any science. Astrophysics is not an exception and is now facing a deluge of data. For galaxies, the one-century old Hubble classification and the Hubble tuning fork are still largely in use, together with numerous mono- or bivariate classifications most often made by eye. However, a classification must be driven by the data, and sophisticated multivariate statistical tools are used more and more often. In this paper we review these different approaches in order to situate them in the general context of unsupervised and supervised learning. We insist on the astrophysical outcomes of these studies to show that multivariate analyses provide an obvious path toward a renewal of our classification of galaxies and are invaluable tools to investigate the physics and evolution of galaxies.

  4. Pedoinformatics Approach to Soil Text Analytics

    Science.gov (United States)

    Furey, J.; Seiter, J.; Davis, A.

    2017-12-01

    The several extant schema for the classification of soils rely on differing criteria, but the major soil science taxonomies, including the United States Department of Agriculture (USDA) and the international harmonized World Reference Base for Soil Resources systems, are based principally on inferred pedogenic properties. These taxonomies largely result from compiled individual observations of soil morphologies within soil profiles, and the vast majority of this pedologic information is contained in qualitative text descriptions. We present text mining analyses of hundreds of gigabytes of parsed text and other data in the digitally available USDA soil taxonomy documentation, the Soil Survey Geographic (SSURGO) database, and the National Cooperative Soil Survey (NCSS) soil characterization database. These analyses implemented iPython calls to Gensim modules for topic modelling, with latent semantic indexing completed down to the lowest taxon level (soil series) paragraphs. Via a custom extension of the Natural Language Toolkit (NLTK), approximately one percent of the USDA soil series descriptions were used to train a classifier for the remainder of the documents, essentially by treating soil science words as comprising a novel language. While location-specific descriptors at the soil series level are amenable to geomatics methods, unsupervised clustering of the occurrence of other soil science words did not closely follow the usual hierarchy of soil taxa. We present preliminary phrasal analyses that may account for some of these effects.

  5. Facilitating text reading in posterior cortical atrophy

    Science.gov (United States)

    Rajdev, Kishan; Shakespeare, Timothy J.; Leff, Alexander P.; Crutch, Sebastian J.

    2015-01-01

    Objective: We report (1) the quantitative investigation of text reading in posterior cortical atrophy (PCA), and (2) the effects of 2 novel software-based reading aids that result in dramatic improvements in the reading ability of patients with PCA. Methods: Reading performance, eye movements, and fixations were assessed in patients with PCA and typical Alzheimer disease and in healthy controls (experiment 1). Two reading aids (single- and double-word) were evaluated based on the notion that reducing the spatial and oculomotor demands of text reading might support reading in PCA (experiment 2). Results: Mean reading accuracy in patients with PCA was significantly worse (57%) compared with both patients with typical Alzheimer disease (98%) and healthy controls (99%); spatial aspects of passages were the primary determinants of text reading ability in PCA. Both aids led to considerable gains in reading accuracy (PCA mean reading accuracy: single-word reading aid = 96%; individual patient improvement range: 6%–270%) and self-rated measures of reading. Data suggest a greater efficiency of fixations and eye movements under the single-word reading aid in patients with PCA. Conclusions: These findings demonstrate how neurologic characterization of a neurodegenerative syndrome (PCA) and detailed cognitive analysis of an important everyday skill (reading) can combine to yield aids capable of supporting important everyday functional abilities. Classification of evidence: This study provides Class III evidence that for patients with PCA, 2 software-based reading aids (single-word and double-word) improve reading accuracy. PMID:26138948

  6. Text against Text: Counterbalancing the Hegemony of Assessment.

    Science.gov (United States)

    Cosgrove, Cornelius

    A study examined whether composition specialists can counterbalance the potential privileging of the assessment perspective, or of self-appointed interpreters of that perspective, through the study of assessment discourse as text. Fourteen assessment texts were examined, most of them journal articles and most of them featuring the common…

  7. SparkText: Biomedical Text Mining on Big Data Framework.

    Directory of Open Access Journals (Sweden)

    Zhan Ye

    Full Text Available Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  8. Classification of movement disorders.

    Science.gov (United States)

    Fahn, Stanley

    2011-05-01

    The classification of movement disorders has evolved. Even the terminology has shifted, from an anatomical one of extrapyramidal disorders to a phenomenological one of movement disorders. The history of how this shift came about is described. The history of both the definitions and the classifications of the various neurologic conditions is then reviewed. First is a review of movement disorders as a group; then, the evolving classifications for 3 of them--parkinsonism, dystonia, and tremor--are covered in detail. Copyright © 2011 Movement Disorder Society.

  9. Learning Convolutional Text Representations for Visual Question Answering

    OpenAIRE

    Wang, Zhengyang; Ji, Shuiwang

    2017-01-01

    Visual question answering is a recently proposed artificial intelligence task that requires a deep understanding of both images and texts. In deep learning, images are typically modeled through convolutional neural networks, and texts are typically modeled through recurrent neural networks. While the requirement for modeling images is similar to traditional computer vision tasks, such as object recognition and image classification, visual question answering raises a different need for textual...

  10. Knowledge Representation in Travelling Texts

    DEFF Research Database (Denmark)

    Mousten, Birthe; Locmele, Gunta

    2014-01-01

    Today, information travels fast. Texts travel, too. In a corporate context, the question is how to manage which knowledge elements should travel to a new language area or market and in which form? The decision to let knowledge elements travel or not travel highly depends on the limitation...... and the purpose of the text in a new context as well as on predefined parameters for text travel. For texts used in marketing and in technology, the question is whether culture-bound knowledge representation should be domesticated or kept as foreign elements, or should be mirrored or moulded—or should not travel...... at all! When should semantic and pragmatic elements in a text be replaced and by which other elements? The empirical basis of our work is marketing and technical texts in English, which travel into the Latvian and Danish markets, respectively....

  11. Texting while driving: is speech-based text entry less risky than handheld text entry?

    Science.gov (United States)

    He, J; Chaparro, A; Nguyen, B; Burge, R J; Crandall, J; Chaparro, B; Ni, R; Cao, S

    2014-11-01

    Research indicates that using a cell phone to talk or text while maneuvering a vehicle impairs driving performance. However, few published studies directly compare the distracting effects of texting using a hands-free (i.e., speech-based interface) versus handheld cell phone, which is an important issue for legislation, automotive interface design and driving safety training. This study compared the effect of speech-based versus handheld text entries on simulated driving performance by asking participants to perform a car following task while controlling the duration of a secondary text-entry task. Results showed that both speech-based and handheld text entries impaired driving performance relative to the drive-only condition by causing more variation in speed and lane position. Handheld text entry also increased the brake response time and increased variation in headway distance. Text entry using a speech-based cell phone was less detrimental to driving performance than handheld text entry. Nevertheless, the speech-based text entry task still significantly impaired driving compared to the drive-only condition. These results suggest that speech-based text entry disrupts driving, but reduces the level of performance interference compared to text entry with a handheld device. In addition, the difference in the distraction effect caused by speech-based and handheld text entry is not simply due to the difference in task duration. Copyright © 2014 Elsevier Ltd. All rights reserved.

  12. SparkText: Biomedical Text Mining on Big Data Framework

    Science.gov (United States)

    He, Karen Y.; Wang, Kai

    2016-01-01

    Background: Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. Results: In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. Conclusions: This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research. PMID:27685652
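
    Of the three classifiers named above, Naïve Bayes is the simplest to illustrate. The following is a minimal single-machine sketch of a multinomial Naïve Bayes text classifier with add-one smoothing. It is not SparkText's code and does not use Apache Spark. The toy documents, the labels, and the names train_nb and predict_nb are invented for illustration.

```python
import math
from collections import Counter, defaultdict


def train_nb(docs):
    """docs: list of (tokens, label). Returns (class counts, per-class word counts, vocabulary)."""
    class_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for tokens, label in docs:
        class_counts[label] += 1
        word_counts[label].update(tokens)
        vocab.update(tokens)
    return class_counts, word_counts, vocab


def predict_nb(model, tokens):
    """Return the label with the highest log posterior under add-one smoothing."""
    class_counts, word_counts, vocab = model
    total_docs = sum(class_counts.values())
    best_label, best_score = None, -math.inf
    for label in class_counts:
        score = math.log(class_counts[label] / total_docs)
        total_words = sum(word_counts[label].values())
        for t in tokens:
            if t in vocab:  # words never seen in training are ignored
                score += math.log((word_counts[label][t] + 1) / (total_words + len(vocab)))
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Invented toy corpus, for illustration only.
docs = [
    ("brca1 mutation breast tumor".split(), "breast"),
    ("mammography screening breast".split(), "breast"),
    ("psa screening prostate tumor".split(), "prostate"),
    ("androgen receptor prostate".split(), "prostate"),
    ("smoking lung nodule tumor".split(), "lung"),
    ("egfr mutation lung".split(), "lung"),
]
model = train_nb(docs)
print(predict_nb(model, "brca1 breast screening".split()))
print(predict_nb(model, "egfr lung nodule".split()))
```

    Scores are summed in log space so that long documents do not underflow, a standard precaution for this kind of model.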

  13. SparkText: Biomedical Text Mining on Big Data Framework.

    Science.gov (United States)

    Ye, Zhan; Tafti, Ahmad P; He, Karen Y; Wang, Kai; He, Max M

    Many new biomedical research articles are published every day, accumulating rich information, such as genetic variants, genes, diseases, and treatments. Rapid yet accurate text mining on large-scale scientific literature can discover novel knowledge to better understand human diseases and to improve the quality of disease diagnosis, prevention, and treatment. In this study, we designed and developed an efficient text mining framework called SparkText on a Big Data infrastructure, which is composed of Apache Spark data streaming and machine learning methods, combined with a Cassandra NoSQL database. To demonstrate its performance for classifying cancer types, we extracted information (e.g., breast, prostate, and lung cancers) from tens of thousands of articles downloaded from PubMed, and then employed Naïve Bayes, Support Vector Machine (SVM), and Logistic Regression to build prediction models to mine the articles. The accuracy of predicting a cancer type by SVM using the 29,437 full-text articles was 93.81%. While competing text-mining tools took more than 11 hours, SparkText mined the dataset in approximately 6 minutes. This study demonstrates the potential for mining large-scale scientific articles on a Big Data infrastructure, with real-time update from new articles published daily. SparkText can be extended to other areas of biomedical research.

  14. Text Mining Applications and Theory

    CERN Document Server

    Berry, Michael W

    2010-01-01

    Text Mining: Applications and Theory presents the state-of-the-art algorithms for text mining from both the academic and industrial perspectives. The contributors span several countries and scientific domains: universities, industrial corporations, and government laboratories, and demonstrate the use of techniques from machine learning, knowledge discovery, natural language processing and information retrieval to design computational models for automated text analysis and mining. This volume demonstrates how advancements in the fields of applied mathematics, computer science, machine learning

  15. Update on diabetes classification.

    Science.gov (United States)

    Thomas, Celeste C; Philipson, Louis H

    2015-01-01

    This article highlights the difficulties in creating a definitive classification of diabetes mellitus in the absence of a complete understanding of the pathogenesis of the major forms. This brief review shows the evolving nature of the classification of diabetes mellitus. No classification scheme is ideal, and all have some overlap and inconsistencies. The only diabetes in which it is possible to accurately diagnose by DNA sequencing, monogenic diabetes, remains undiagnosed in more than 90% of the individuals who have diabetes caused by one of the known gene mutations. The point of classification, or taxonomy, of disease, should be to give insight into both pathogenesis and treatment. It remains a source of frustration that all schemes of diabetes mellitus continue to fall short of this goal. Copyright © 2015 Elsevier Inc. All rights reserved.

  16. Learning Apache Mahout classification

    CERN Document Server

    Gupta, Ashish

    2015-01-01

    If you are a data scientist who has some experience with the Hadoop ecosystem and machine learning methods and want to try out classification on large datasets using Mahout, this book is ideal for you. Knowledge of Java is essential.

  17. CLASSIFICATION OF VIRUSES

    Indian Academy of Sciences (India)

    CLASSIFICATION OF VIRUSES. On basis of morphology. On basis of chemical composition. On basis of structure of genome. On basis of mode of replication.

  18. Pitch Based Sound Classification

    DEFF Research Database (Denmark)

    Nielsen, Andreas Brinch; Hansen, Lars Kai; Kjems, U

    2006-01-01

    A sound classification model is presented that can classify signals into music, noise and speech. The model extracts the pitch of the signal using the harmonic product spectrum. Based on the pitch estimate and a pitch error measure, features are created and used in a probabilistic model with soft......-max output function. Both linear and quadratic inputs are used. The model is trained on 2 hours of sound and tested on publicly available data. A test classification error below 0.05 with 1 s classification windows is achieved. Furthermore, it is shown that linear input performs as well as a quadratic......, and that even though classification gets marginally better, not much is achieved by increasing the window size beyond 1 s....
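
    The harmonic product spectrum mentioned above estimates pitch by multiplying the magnitude spectrum with downsampled copies of itself, so that the harmonics reinforce the bin of the fundamental. The sketch below is a minimal illustration, not the authors' implementation. The test signal, the sample rate, the number of harmonics, and the naive DFT (used only to avoid external libraries) are assumptions.

```python
import cmath
import math


def magnitude_spectrum(signal):
    """Naive DFT magnitudes for bins 0..N/2-1 (O(N^2), acceptable for short frames)."""
    n = len(signal)
    return [
        abs(sum(x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(signal)))
        for k in range(n // 2)
    ]


def hps_pitch(signal, sample_rate, harmonics=3):
    """Estimate the fundamental frequency (Hz) with the harmonic product spectrum."""
    spectrum = magnitude_spectrum(signal)
    limit = len(spectrum) // harmonics  # keeps k * h inside the spectrum
    product = [
        math.prod(spectrum[k * h] for h in range(1, harmonics + 1))
        for k in range(limit)
    ]
    best_bin = max(range(1, limit), key=product.__getitem__)  # skip the DC bin
    return best_bin * sample_rate / len(signal)


# Synthetic tone: 200 Hz fundamental plus two weaker harmonics (made-up values).
SAMPLE_RATE = 2560
N = 256
signal = [
    math.sin(2 * math.pi * 200 * t / SAMPLE_RATE)
    + 0.5 * math.sin(2 * math.pi * 400 * t / SAMPLE_RATE)
    + 0.25 * math.sin(2 * math.pi * 600 * t / SAMPLE_RATE)
    for t in range(N)
]
print(hps_pitch(signal, SAMPLE_RATE))
```

    In practice a windowed FFT would replace the naive DFT. The frequency resolution here is the sample rate divided by the frame length, 10 Hz for the values chosen.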

  19. Text and ideology: text-oriented discourse analysis

    Directory of Open Access Journals (Sweden)

    Maria Eduarda Gonçalves Peixoto

    2018-04-01

    Full Text Available The article aims to contribute to the understanding of the connection between text and ideology articulated by the text-oriented analysis of discourse (ADTO). Based on the reflections of Fairclough (1989, 2001, 2003) and Fairclough and Chouliaraki (1999), the debate presents the social ontology that ADTO uses to base its conception of social life as an open system and textually mediated; the article then explains the chronological-narrative development of the main critical theories of ideology, by virtue of which ADTO organizes the assumptions that underpin the particular use it makes of the term. Finally, the discussion presents the main aspects of the connection between text and ideology, offering a conceptual framework that can contribute to the domain of the theme according to a critical discourse analysis approach.

  20. Latent classification models

    DEFF Research Database (Denmark)

    Langseth, Helge; Nielsen, Thomas Dyhre

    2005-01-01

    parametric family of distributions. In this paper we propose a new set of models for classification in continuous domains, termed latent classification models. The latent classification model can roughly be seen as combining the Naive Bayes model with a mixture of factor analyzers, thereby relaxing the assumptions...... classification model, and we demonstrate empirically that the accuracy of the proposed model is significantly higher than the accuracy of other probabilistic classifiers....

  1. Proverbs 30:18-19 in the Light of Ancient Mesopotamian Cuneiform Texts

    Directory of Open Access Journals (Sweden)

    Böck, Barbara

    2009-12-01

    Full Text Available The meaning of Proverbs 30:18-19 has long been disputed. Most scholars interpret the Biblical couplets textually on stylistic features only; an explanation of the contextual association between the four motifs mentioned (eagle, serpent, boat, man and woman) has not yet been undertaken. The present paper aims at shedding light on the motivation for this association, taking into consideration ancient Near Eastern cuneiform compositions for the first time. It is further suggested that Proverbs 30:18-19 derived originally from a riddle that had its setting in a wedding ceremony.

  2. 78 FR 68983 - Cotton Futures Classification: Optional Classification Procedure

    Science.gov (United States)

    2013-11-18

    ...-AD33 Cotton Futures Classification: Optional Classification Procedure AGENCY: Agricultural Marketing... regulations to allow for the addition of an optional cotton futures classification procedure--identified and... response to requests from the U.S. cotton industry and ICE, AMS will offer a futures classification option...

  3. English Metafunction Analysis in Chemistry Text: Characterization of Scientific Text

    Directory of Open Access Journals (Sweden)

    Ahmad Amin Dalimunte, M.Hum

    2013-09-01

    Full Text Available The objectives of this research are to identify which Metafunctions are applied in a chemistry text and how they characterize a scientific text. It was conducted by applying content analysis. The data for this research was a twelve-paragraph chemistry text. The data were collected by applying a documentary technique. The document was read and analyzed to find the Metafunctions. The data were analyzed in several steps: identifying the types of process, counting the number of processes, categorizing and counting the cohesion devices, classifying the types of modulation and determining modality value, and finally counting the number of sentences and clauses and then scoring the grammatical intricacy index. The findings of the research show that the Material process (71 of 100) is used most, and circumstance of spatial location (26 of 56) is more dominant than the others. Modality (5) is used less in order to avoid subjectivity. Impersonality is implied through limited use of reference, either pronouns (7) or demonstratives (7); conjunctions (60) are applied to develop ideas; and the total number of clauses (109) is much greater than the total number of sentences (40), which results in a high grammatical intricacy index. The Metafunctions found indicate that the chemistry text has fulfilled the characteristics of scientific or academic text, which truly reflects it as a natural science.
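
    The counts reported above allow the grammatical intricacy index to be worked out directly, assuming it is computed as the number of clauses divided by the number of sentences. The abstract reports the two counts but not the formula or the resulting value, so the definition used here is an assumption, as is the function name.

```python
def grammatical_intricacy(clauses, sentences):
    """Clauses per sentence (assumed definition of the index)."""
    if sentences <= 0:
        raise ValueError("sentences must be positive")
    return clauses / sentences


# Counts taken from the abstract: 109 clauses, 40 sentences.
index = grammatical_intricacy(109, 40)
# Share of Material processes among all processes: 71 of 100.
material_share = 71 / 100
print(round(index, 3), material_share)
```

    Under that assumption, the text averages between two and three clauses per sentence.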

  4. A Classification of BPEL Extensions

    Directory of Open Access Journals (Sweden)

    Oliver Kopp

    2011-10-01

    Full Text Available The Business Process Execution Language (BPEL) has emerged as the de facto standard for business process implementation. This language is designed to be extensible for including additional valuable features in a standardized manner. There are a number of BPEL extensions available. They are, however, neither classified nor evaluated with respect to their compliance with the BPEL standard. This article fills this gap by providing a framework for classifying BPEL extensions, a classification of existing extensions, and a guideline for designing BPEL extensions.

  5. Text Genres in Information Organization

    Science.gov (United States)

    Nahotko, Marek

    2016-01-01

    Introduction: Text genres used by so-called information organizers in the processes of information organization in information systems were explored in this research. Method: The research employed text genre socio-functional analysis. Five genre groups in information organization were distinguished. Every genre group used in information…

  6. Strategies for Translating Vocative Texts

    Directory of Open Access Journals (Sweden)

    Olga COJOCARU

    2014-12-01

    Full Text Available The paper deals with the linguistic and cultural elements of vocative texts and the techniques used in translating them by giving some examples of texts that are typically vocative (i.e. advertisements and instructions for use. Semantic and communicative strategies are popular in translation studies and each of them has its own advantages and disadvantages in translating vocative texts. The advantage of semantic translation is that it takes more account of the aesthetic value of the SL text, while communicative translation attempts to render the exact contextual meaning of the original text in such a way that both content and language are readily acceptable and comprehensible to the readership. Focus is laid on the strategies used in translating vocative texts, strategies that highlight and introduce a cultural context to the target audience, in order to achieve their overall purpose, that is to sell or persuade the reader to behave in a certain way. Thus, in order to do that, a number of advertisements from the field of cosmetics industry and electronic gadgets were selected for analysis. The aim is to gather insights into vocative text translation and to create new perspectives on this field of research, now considered a process of innovation and diversion, especially in areas as important as economy and marketing.

  7. Supernova Photometric Lightcurve Classification

    Science.gov (United States)

    Zaidi, Tayeb; Narayan, Gautham

    2016-01-01

    This is a preliminary report on photometric supernova classification. We first explore the properties of supernova light curves, and attempt to restructure the unevenly sampled and sparse data from assorted datasets to allow for processing and classification. The data was primarily drawn from the Dark Energy Survey (DES) simulated data, created for the Supernova Photometric Classification Challenge. This poster shows a method for producing a non-parametric representation of the light curve data, and applying a Random Forest classifier algorithm to distinguish between supernova types. We examine the impact of Principal Component Analysis to reduce the dimensionality of the dataset, for future classification work. The classification code will be used in a stage of the ANTARES pipeline, created for use on the Large Synoptic Survey Telescope alert data and other wide-field surveys. The final figure-of-merit for the DES data in the r band was 60% for binary classification (Type I vs II). Zaidi was supported by the NOAO/KPNO Research Experiences for Undergraduates (REU) Program, which is funded by the National Science Foundation Research Experiences for Undergraduates Program (AST-1262829).

  8. Systematic characterizations of text similarity in full text biomedical publications.

    Science.gov (United States)

    Sun, Zhaohui; Errami, Mounir; Long, Tara; Renard, Chris; Choradia, Nishant; Garner, Harold

    2010-09-15

    Computational methods have been used to find duplicate biomedical publications in MEDLINE. Full text articles are becoming increasingly available, yet the similarities among them have not been systematically studied. Here, we quantitatively investigated the full text similarity of biomedical publications in PubMed Central. 72,011 full text articles from PubMed Central (PMC) were parsed to generate three different datasets: full texts, sections, and paragraphs. Text similarity comparisons were performed on these datasets using the text similarity algorithm eTBLAST. We measured the frequency of similar text pairs and compared it among different datasets. We found that high abstract similarity can be used to predict high full text similarity with a specificity of 20.1% (95% CI [17.3%, 23.1%]) and sensitivity of 99.999%. Abstract similarity and full text similarity have a moderate correlation (Pearson correlation coefficient: -0.423) when the similarity ratio is above 0.4. Among pairs of articles in PMC, method sections are found to be the most repetitive (frequency of similar pairs, methods: 0.029, introduction: 0.0076, results: 0.0043). In contrast, among a set of manually verified duplicate articles, results are the most repetitive sections (frequency of similar pairs, results: 0.94, methods: 0.89, introduction: 0.82). Repetition of introduction and methods sections is more likely to be committed by the same authors (odds of a highly similar pair having at least one shared author, introduction: 2.31, methods: 1.83, results: 1.03). There is also significantly more similarity in pairs of review articles than in pairs containing one review and one nonreview paper (frequency of similar pairs: 0.0167 and 0.0023, respectively). While quantifying abstract similarity is an effective approach for finding duplicate citations, a comprehensive full text analysis is necessary to uncover all potential duplicate citations in the scientific literature and is helpful when

  9. Mapping of the Universe of Knowledge in Different Classification Schemes

    Directory of Open Access Journals (Sweden)

    M. P. Satija

    2017-06-01

    Full Text Available Given the variety of approaches to mapping the universe of knowledge that have been presented and discussed in the literature, the purpose of this paper is to systematize their main principles and their applications in the major general modern library classification schemes. We conducted an analysis of the literature on classification and the main classification systems, namely Dewey/Universal Decimal Classification, Cutter’s Expansive Classification, Subject Classification of J.D. Brown, Colon Classification, Library of Congress Classification, Bibliographic Classification, Rider’s International Classification, Bibliothecal Bibliographic Klassification (BBK), and Broad System of Ordering (BSO). We conclude that the arrangement of the main classes can be done following four principles that are not mutually exclusive: ideological principle, social purpose principle, scientific order, and division by discipline. The paper provides examples and analysis of each system. We also conclude that as knowledge is ever-changing, classifications also change and present a different structure of knowledge depending upon the society and time of their design.

  10. RESEARCH OF CLASSIFICATION FEATURES OF THE FINANCIAL CONTROL

    Directory of Open Access Journals (Sweden)

    Knarik K. Arabyan

    2013-01-01

    Full Text Available One of the major problems in financial control theory is the improvement of classification features. There is no consensus on how the forms and methods of financial control should be classified. This hinders the development of methodology and the investigation of other issues in financial control theory. In the article, the author summarizes scientists’ approaches to studying the classification features of financial control.

  11. Classification and clinical assessment

    Directory of Open Access Journals (Sweden)

    F. Cantini

    2012-06-01

    Full Text Available There are at least nine classification criteria for psoriatic arthritis (PsA) that have been proposed and used in clinical studies. With the exception of the ESSG and Bennett rules, all of the other criteria sets have a good performance in identifying PsA patients. As the CASPAR criteria are based on a robust study methodology, they are considered the current reference standard. However, while there seems to be no doubt that they are very good at classifying PsA patients (very high specificity), they might not be sensitive enough to diagnose patients with unknown early PsA. The vast clinical heterogeneity of PsA makes its assessment very challenging. Peripheral joint involvement is measured by 78/76 joint counts, spine involvement by the instruments used for ankylosing spondylitis (AS), dactylitis by involved digit count or by the Leeds dactylitis index, enthesitis by the number of affected entheses (several indices available) and psoriasis by the Psoriasis Area and Severity Index (PASI). Peripheral joint damage can be assessed by a modified van der Heijde-Sharp scoring system and axial damage by the methods used for AS or by the Psoriatic Arthritis Spondylitis Radiology Index (PASRI). As in other arthritides, global evaluation of disease activity and severity by patient and physician and assessment of disability and quality of life are widely used. Finally, composite indices that capture several clinical manifestations of PsA have been proposed and a new instrument, the Psoriatic ARthritis Disease Activity Score (PASDAS), is currently being developed.

  12. The classification of easement

    Directory of Open Access Journals (Sweden)

    Popov Danica D.

    2015-01-01

    Full Text Available An easement is a right enjoyed by the owner of land over the land of another, such as rights of way, rights of light, rights of support, and rights to a flow of air or water. The dominant tenement is the land owned by the possessor of the easement, and the servient tenement is the land over which the right is enjoyed. An easement must exist for the accommodation and better enjoyment of the land to which it is annexed; otherwise it may amount to a mere licence. An easement benefits and binds the land itself and therefore continues despite any change of ownership of either the dominant or the servient tenement, although it is extinguished if the two tenements come into common ownership. An easement can only be enjoyed in respect of land, which requires two parcels of land: a dominant tenement, to which the benefit of the easement attaches, and a servient tenement, which bears the burden of the easement. A positive easement consists of a right to do something on the land of another; a negative easement restricts the use that the owner of the servient tenement may make of his land. An easement may relate to land or to a house built on the land. A further classification distinguishes easements on the ground from easements under the ground. An easement must be exercised in accordance with the principle of restriction, meaning that it should burden the servient tenement as little as possible. When there is doubt about the extent of an easement, the interpretation that is less burdensome for the servient tenement applies. New needs of the dominant estate do not result in the expansion of the servitude. The article compares the Draft Code of property and other real estate with the Draft of the Civil Code of Serbia.

  13. Linguistic Dating of Biblical Texts

    DEFF Research Database (Denmark)

    Ehrensvärd, Martin Gustaf

    2003-01-01

    For two centuries, scholars have pointed to consistent differences in the Hebrew of certain biblical texts and interpreted these differences as reflecting the date of composition of the texts. Until the 1980s, this was quite uncontroversial as the linguistic findings largely confirmed the chronology of the texts established by other means: the Hebrew of Genesis-2 Kings was judged to be early and that of Esther, Daniel, Ezra, Nehemiah, and Chronicles to be late. In the current debate where revisionists have questioned the traditional dating, linguistic arguments in the dating of texts have come more into focus. The study critically examines some linguistic arguments adduced to support the traditional position, and reviewing the arguments it points to weaknesses in the linguistic dating of EBH texts to pre-exilic times. When viewing the linguistic evidence in isolation it will be clear...

  14. A New Classification Approach Based on Multiple Classification Rules

    OpenAIRE

    Zhongmei Zhou

    2014-01-01

    A good classifier can correctly predict new data for which the class label is unknown, so it is important to construct a high accuracy classifier. Hence, classification techniques are very useful in ubiquitous computing. Associative classification achieves higher classification accuracy than some traditional rule-based classification approaches. However, the approach also has two major deficiencies. First, it generates a very large number of association classification rules, especially when t...

  15. Biomarker Identification Using Text Mining

    Directory of Open Access Journals (Sweden)

    Hui Li

    2012-01-01

    Full Text Available Identifying molecular biomarkers has become an important task for scientists who assess the different phenotypic states of cells or organisms correlated to the genotypes of diseases from large-scale biological data. In this paper, we propose a text-mining-based method to discover biomarkers from PubMed. First, we construct a database based on a dictionary, and then we use a finite state machine to identify the biomarkers. Our text mining method provides a highly reliable approach to discovering biomarkers in the PubMed database.
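
    The dictionary-plus-finite-state-machine idea described above can be illustrated with a minimal sketch. It is not the authors' implementation: the dictionary entries, the function names `build_fsm` and `find_terms`, and the choice of a token-level trie as the state machine are all assumptions made for illustration.

```python
import re


def build_fsm(terms):
    """Build a token-level trie from dictionary terms.

    Each nested dict is a state; the key "$" marks an accepting state
    and stores the dictionary term that ends there.
    """
    root = {}
    for term in terms:
        state = root
        for token in term.lower().split():
            state = state.setdefault(token, {})
        state["$"] = term
    return root


def find_terms(text, fsm):
    """Scan the text and return the longest dictionary match at each position."""
    tokens = re.findall(r"[a-z0-9\-]+", text.lower())
    hits = []
    i = 0
    while i < len(tokens):
        state, j, last = fsm, i, None
        # Follow transitions for as long as the next token is accepted.
        while j < len(tokens) and tokens[j] in state:
            state = state[tokens[j]]
            j += 1
            if "$" in state:
                last = (state["$"], j)
        if last:
            hits.append(last[0])
            i = last[1]  # resume after the matched term
        else:
            i += 1
    return hits


# Hypothetical dictionary and sentence, for illustration only.
DICTIONARY = ["PSA", "C-reactive protein", "HER2"]
FSM = build_fsm(DICTIONARY)
print(find_terms("Elevated C-reactive protein and PSA levels were reported.", FSM))
```

    Multi-word terms are matched as a whole because the machine only reports a hit when it reaches an accepting state, and it keeps the longest one seen from each starting token.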

  16. Text Mining in Biomedical Domain with Emphasis on Document Clustering.

    Science.gov (United States)

    Renganathan, Vinaitheerthan

    2017-07-01

    With the exponential increase in the number of articles published every year in the biomedical domain, there is a need to build automated systems to extract unknown information from the articles published. Text mining techniques enable the extraction of unknown knowledge from unstructured documents. This paper reviews text mining processes in detail and the software tools available to carry out text mining. It also reviews the roles and applications of text mining in the biomedical domain. Text mining processes, such as search and retrieval of documents, pre-processing of documents, natural language processing, methods for text clustering, and methods for text classification are described in detail. Text mining techniques can facilitate the mining of vast amounts of knowledge on a given topic from published biomedical research articles and draw meaningful conclusions that are not possible otherwise.

  17. A New English–Arabic Parallel Text Corpus for Lexicographic Applications

    Directory of Open Access Journals (Sweden)

    Hashan Al-Ajmi

    2011-10-01

    Full Text Available

    Abstract: Bilingual lexicographers, translation specialists and English teachers in the Arab world do not have access to computerized corpora of parallel texts for the English–Arabic language pair. This project has been carried out to meet this requirement by establishing the first general parallel corpus of English texts and their Arabic translations. The first phase of the project involved the selection of general source texts having appropriate lexical and stylistic features. The chosen source texts deal with a variety of topics such as the environment, globalization, psychology, history, politics, drama, etc. Their Arabic translations were taken from The World of Knowledge series published by the National Council for Culture, Arts and Letters (NCCAL) in Kuwait.

    Keywords: PARALLEL CORPUS, LEXICOGRAPHY, TRANSLATION, BILINGUAL DICTIONARY, COLLOCATIONS, ALIGNMENT, SYNONYMS, DERIVATIVES, ANTONYMS, GLOSSARY, FREQUENCY

    Opsomming: 'n Nuwe Engels–Arabiese parallelletekskorpus vir leksikografiese toepassings. Tweetalige leksikograwe, vertaalkundiges en Engelsonderwysers in die Arabiese wêreld het nie toegang tot gerekenariseerde korpusse van parallelle tekste vir die Engels–Arabiese taalpaar nie. Hierdie projek is onderneem om in dié behoefte te voorsien deur die eerste algemene parallelle korpus van Engelse tekste en hul Arabiese vertalings tot stand te bring. Die eerste fase van die projek het die keuse van algemene brontekste behels wat geskikte leksikale en stilistiese eienskappe besit. Die gekose brontekste handel oor 'n verskeidenheid onderwerpe soos die omgewing, globalisering, psigologie, geskiedenis, politiek, drama, ens. Hul Arabiese vertalings is geneem uit The World of Knowledge-reeks gepubliseer deur die National Council for Culture, Arts and Letters (NCCAL) in Koeweit.

    Sleutelwoorde: PARALLELLE KORPUS, LEKSIKOGRAFIE, VERTALING, TWEETALIGE WOORDEBOEK, KOLLOKASIES, OOREENSTEMMING, SINONIEME, AFLEIDINGS, ANTONIEME

  18. Stemming Malay Text and Its Application in Automatic Text Categorization

    Science.gov (United States)

    Yasukawa, Michiko; Lim, Hui Tian; Yokoo, Hidetoshi

    In the Malay language there are no conjugations or declensions, and affixes have important grammatical functions. The same word may function as a noun, an adjective, an adverb, or a verb, depending on its position in the sentence. Although simple root words are used extensively in informal conversation, it is essential to use precise words in formal speech and written texts. To make sentences clear, Malay uses derivative words, and derivation is achieved mainly through affixes. There are approximately a hundred possible derivative forms of a root word in the written language of educated Malay speakers, so the composition of Malay words can be complicated. Although several types of stemming algorithms are available for text processing in English and some other languages, they cannot be used to overcome the difficulties of Malay word stemming. Stemming is the process of reducing various words to their root forms in order to improve the effectiveness of text processing in information systems. It is essential to avoid both over-stemming and under-stemming errors. We have developed a new Malay stemmer (stemming algorithm) for removing inflectional and derivational affixes. Our stemmer uses a set of affix rules and two types of dictionaries: a root-word dictionary and a derivative-word dictionary. The set of rules is aimed at reducing the occurrence of under-stemming errors, while the dictionaries are intended to reduce the occurrence of over-stemming errors. We performed an experiment to evaluate the application of our stemmer in text mining software. The text data used in the experiment were actual web pages collected from the World Wide Web, in order to demonstrate the effectiveness of our Malay stemming algorithm. The experimental results showed that our stemmer can effectively increase the precision of the extracted Boolean expressions for text categorization.
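
    How affix rules and the two dictionaries might interact can be shown with a small sketch. It is an illustration under stated assumptions, not the authors' stemmer: the affix lists, both dictionaries, and the function name `stem` are hypothetical and far smaller than anything a real system would need.

```python
# Hypothetical, deliberately tiny resources for illustration only.
PREFIXES = ("ber", "di", "ter")
SUFFIXES = ("kan", "an", "i")
ROOT_WORDS = {"makan", "jalan", "baca", "ajar"}
DERIVATIVE_WORDS = {"pelajar": "ajar"}  # derivative form -> root


def stem(word):
    """Return the root of a word using dictionary checks plus affix rules."""
    word = word.lower()
    # Dictionary checks come first: a known root is never stripped further,
    # which guards against over-stemming (e.g. "makan" losing its final "an").
    if word in ROOT_WORDS:
        return word
    if word in DERIVATIVE_WORDS:
        return DERIVATIVE_WORDS[word]
    # Affix rules generate candidates: prefix removed, suffix removed, or both.
    candidates = [word]
    for prefix in PREFIXES:
        if word.startswith(prefix):
            candidates.append(word[len(prefix):])
    for candidate in list(candidates):
        for suffix in SUFFIXES:
            if candidate.endswith(suffix):
                candidates.append(candidate[:-len(suffix)])
    # Accept only a candidate that the root-word dictionary confirms.
    for candidate in candidates:
        if candidate in ROOT_WORDS:
            return candidate
    return word  # no confirmed root: leave the word unchanged


for w in ("makan", "makanan", "berjalan", "dibacakan", "pelajar"):
    print(w, "->", stem(w))
```

    In this sketch the rules propose stems and the root-word dictionary decides which proposal to accept, which mirrors the division of labour the abstract describes: rules address under-stemming, dictionaries address over-stemming.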

  19. Improvement of Classification of Enterprise Circulating Funds

    Directory of Open Access Journals (Sweden)

    Rohanova Hanna O.

    2014-02-01

    Full Text Available The goal of the article is to reveal possibilities for increasing the efficiency of managing enterprise circulating funds by improving their classification features. Having analysed, systematised and supplemented the approaches of many economists to the classification of enterprise circulating funds, the article proposes a grouping of their classification features. As a result of the study, the article offers an expanded classification of circulating funds, which clearly shows the role of circulating funds in managing enterprise finance and the economy in general. The article supplements and groups the classification features of enterprise circulating funds by organisation level, functioning character, sources of formation and their cost, and level of management efficiency. The article shows that this grouping of classification features allows comprehensive and purposeful influence on the indicators of efficiency of circulating funds and facilitates their rational management in general. A prospect for further study in this direction is identifying the level at which production enterprises attract loan resources to finance circulating funds.

  20. Anomaly Detection with Text Mining

    Data.gov (United States)

    National Aeronautics and Space Administration — Many existing complex space systems have a significant amount of historical maintenance and problem data bases that are stored in unstructured text forms. The...

  1. Social Studies: Texts and Supplements.

    Science.gov (United States)

    Curriculum Review, 1979

    1979-01-01

    This review of selected social studies texts, series, and supplements, mainly for the secondary level, includes a special section examining eight titles on warfare and terrorism for grades 4-12. (SJL)

  2. The effects of shadow removal on across-date settlement type classification of quickbird images

    CSIR Research Space (South Africa)

    Luus, FPS

    2012-07-01

    Full Text Available QuickBird imagery acquired on separate dates may have significant differences in viewing- and illumination geometries, which can negatively impact across-date settlement type classification accuracy. The effect of cast shadows on classification...

  3. A statistical approach to root system classification.

    Directory of Open Access Journals (Sweden)

    Gernot Bodner

    2013-08-01

    Full Text Available Plant root systems have a key role in ecology and agronomy. In spite of the rapid increase in root studies, there is still no classification that allows distinguishing among distinctive characteristics within the diversity of rooting strategies. Our hypothesis is that a multivariate approach for plant functional type identification in ecology can be applied to the classification of root systems. We demonstrate that combining principal component and cluster analysis yields a meaningful classification of rooting types based on morphological traits. The classification method presented is based on a data-defined statistical procedure without a priori decision on the classifiers. Biplot inspection is used to determine key traits and to ensure stability in cluster-based grouping. The classification method is exemplified with simulated root architectures and morphological field data. Simulated root architectures showed that morphological attributes with spatial distribution parameters capture most distinctive features within root system diversity. While developmental type (tap vs. shoot-borne systems) is a strong but coarse classifier, topological traits provide the most detailed differentiation among distinctive groups. Adequacy of commonly available morphologic traits for classification is supported by field data. Three rooting types emerged from measured data, distinguished by diameter/weight, density and spatial distribution respectively. Similarity of root systems within distinctive groups was the joint result of phylogenetic relation and environmental as well as human selection pressure. We concluded that the data-defined classification is appropriate for integration of knowledge obtained with different root measurement methods and at various scales. Currently root morphology is the most promising basis for classification due to widely used common measurement protocols. To capture details of root diversity efforts in architectural measurement

  4. GPU-Accelerated Text Mining

    International Nuclear Information System (INIS)

    Cui, X.; Mueller, F.; Zhang, Y.; Potok, Thomas E.

    2009-01-01

    Accelerating hardware devices represent a novel promise for improving performance in many problem domains, but it is not clear which accelerators are suitable for which domains. While there is no room in general-purpose processor design to significantly increase the processor frequency, developers are instead resorting to multi-core chips that duplicate conventional computing capabilities on a single die. Yet accelerators offer more radical designs with a much higher level of parallelism and novel programming environments. The present work assesses the viability of text mining on CUDA. Text mining is one of the key concepts that has become prominent as an effective means to index the Internet, but its applications range beyond this scope and extend to providing document similarity metrics, the subject of this work. We have developed and optimized text search algorithms for GPUs to exploit their potential for massive data processing. We discuss the algorithmic challenges of parallelization for text search problems on GPUs and demonstrate the potential of these devices in experiments by reporting significant speedups. Our study may be one of the first to assess more complex text search problems for suitability for GPU devices, and it may also be one of the first to exploit and report on the use of atomic instructions that have recently become available in NVIDIA devices.
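
    As a point of reference for what a document similarity metric computes, the following CPU-only sketch scores two documents by the cosine of their term-frequency vectors. The abstract does not say which metric or data layout the authors used, so the choice of cosine similarity, the whitespace tokenization, and the names `term_vector` and `cosine_similarity` are assumptions; no GPU code from the work is reproduced here.

```python
import math
from collections import Counter


def term_vector(text):
    """Count how often each whitespace-separated, lowercased term occurs."""
    return Counter(text.lower().split())


def cosine_similarity(a, b):
    """Cosine of the angle between two term-frequency vectors (0.0 to 1.0)."""
    shared = a.keys() & b.keys()
    dot = sum(a[t] * b[t] for t in shared)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty document is similar to nothing
    return dot / (norm_a * norm_b)


doc1 = term_vector("text mining on gpu devices")
doc2 = term_vector("text mining on cpu clusters")
print(round(cosine_similarity(doc1, doc2), 3))
```

    Each pairwise comparison is independent of every other pair, which is the kind of workload that lends itself to the massive parallelism the abstract discusses.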

  5. Comprehending text in literature class

    Directory of Open Access Journals (Sweden)

    Purić Daliborka S.

    2016-01-01

    Full Text Available The paper discusses the problem of understanding a text and the contribution of the methodological apparatus in the reader to the comprehension of a text being read in the junior classes of elementary school. By applying content analysis to the methodological apparatuses in eight readers for the fourth grade of elementary school, approved for use in the 2014/2015 academic year, and by surveying 350 teachers in 33 elementary schools and 11 administrative districts in the Republic of Serbia, we examined: (a) to what extent the contents of the Serbian language textbook enable junior students to understand a literary text; (b) to what extent teachers accept the suggestions offered in the textbook for preparing literature teaching. The results show that a large number of suggestions relate to reading comprehension, but some categories of understanding are unevenly distributed in the methodological apparatus. On the other hand, the majority of teachers use the methodological apparatus given in a textbook for preparing classes, drawing not only on the textbook they selected for teaching but also on other textbooks for the same grade.

  6. A Guide Text or Many Texts? “That is the Question”

    Directory of Open Access Journals (Sweden)

    Delgado de Valencia Sonia

    2001-08-01

    Full Text Available The use of supplementary materials in the classroom has always been an essential part of the teaching and learning process. To restrict our teaching to the scope of one single textbook means falling behind the advances of knowledge, in any area and context. Young learners appreciate any new and varied support that expands their knowledge of the world: diaries, letters, panels, free texts, magazines, short stories, poems or literary excerpts, and articles taken from the Internet are materials that will allow learners to share more and work more collaboratively. In this article we deal with some of these materials and with the criteria for selecting, adapting, and creating materials that may be of interest to the learner and that may promote reading and writing processes. Since no text can entirely satisfy the needs of students and teachers, the creativity of both parties will be necessary to improve the quality of teaching through the adequate use and adaptation of supplementary materials.

  7. Individual Profiling Using Text Analysis

    Science.gov (United States)

    2016-04-15

    Report AFRL-AFOSR-UK-TR-2016-0011: Individual Profiling using Text Analysis (140333). Mark Stevenson, University of Sheffield, Department of Psychology. Report type: Final. Dates covered: 15 Sep 2014 to 14 Sep 2015. ...consisted of collections of tweets for a number of Twitter users whose gender, age and personality scores are known. The task was to construct some system

  8. Identifying issue frames in text.

    Directory of Open Access Journals (Sweden)

    Eyal Sagi

    Full Text Available Framing, the effect of context on cognitive processes, is a prominent topic of research in psychology and public opinion research. Research on framing has traditionally relied on controlled experiments and manually annotated document collections. In this paper we present a method that allows for quantifying the relative strengths of competing linguistic frames based on corpus analysis. This method requires little human intervention and can therefore be efficiently applied to large bodies of text. We demonstrate its effectiveness by tracking changes in the framing of terror over time and comparing the framing of abortion by Democrats and Republicans in the U.S.

  9. Finding text in color images

    Science.gov (United States)

    Zhou, Jiangying; Lopresti, Daniel P.; Tasdizen, Tolga

    1998-04-01

    In this paper, we consider the problem of locating and extracting text from WWW images. A previous algorithm based on color clustering and connected components analysis works well as long as the color of each character is relatively uniform and the typography is fairly simple. It breaks down quickly, however, when these assumptions are violated. In this paper, we describe more robust techniques for dealing with this challenging problem. We present an improved color clustering algorithm that measures similarity based on both RGB and spatial proximity. Layout analysis is also incorporated to handle more complex typography. These changes significantly enhance the performance of our text detection procedure.
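
    One simple way to measure similarity on both RGB and spatial proximity is to add a weighted spatial term to the color distance. The sketch below shows that idea only; the abstract does not give the authors' actual measure, so the additive form, the `spatial_weight` parameter, and the function name `pixel_distance` are assumptions.

```python
import math


def pixel_distance(p, q, spatial_weight=0.5):
    """Distance between two pixels given as (x, y, r, g, b) tuples.

    The result is the Euclidean distance in RGB space plus a weighted
    Euclidean distance in image coordinates, so pixels must be close in
    both color and position to be considered similar.
    """
    color = math.sqrt(sum((a - b) ** 2 for a, b in zip(p[2:], q[2:])))
    spatial = math.hypot(p[0] - q[0], p[1] - q[1])
    return color + spatial_weight * spatial


# Two pixels of identical color that are 5 pixels apart (a 3-4-5 triangle).
near = (0, 0, 10, 10, 10)
far = (3, 4, 10, 10, 10)
print(pixel_distance(near, far))
```

    Setting `spatial_weight` to zero reduces the measure to plain color clustering, which is the behavior the abstract describes as breaking down when character colors are not uniform.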

  10. Optimizing tree-species classification in hyperspectral images

    CSIR Research Space (South Africa)

    Barnard, E

    2010-11-01

    Full Text Available for classification. Scaling of these components so that all features have equal variance is found to be useful, and their best performance (88.9% accurate classification) is achieved with 15 scaled features and a support vector machine as classifier. A graphical...

  11. A proposed data base system for detection, classification and ...

    African Journals Online (AJOL)

    Isaac Owusu-Nyarko, Mensah-Ananoo Eugine. No abstract. Keywords: database, classification of fault, power, distribution system, SCADA, ECG.

  12. Titulus Scuola: the new file classification schema for Italian schools

    Directory of Open Access Journals (Sweden)

    Gianni Penzo Doria

    2017-05-01

    Full Text Available This article presents the new national file classification schema for Italian schools, produced by the Italian Directorate General of Archives of the Ministry for Cultural Heritage within the project Titulus Scuola. This classification schema represents the starting point for a standard document system aimed at digital administration.

  13. Improving settlement type classification of aerial images

    CSIR Research Space (South Africa)

    Mdakane, L

    2014-10-01

    Full Text Available , an automated method can be used to help identify human settlements in a fixed, repeatable and timely manner. The main contribution of this work is to improve generalisation on settlement type classification of aerial imagery. Images acquired at different dates...

  14. Cellular image classification

    CERN Document Server

    Xu, Xiang; Lin, Feng

    2017-01-01

    This book introduces new techniques for cellular image feature extraction, pattern recognition and classification. The authors use the antinuclear antibodies (ANAs) in patient serum as the subjects and the Indirect Immunofluorescence (IIF) technique as the imaging protocol to illustrate the applications of the described methods. Throughout the book, the authors provide evaluations for the proposed methods on two publicly available human epithelial (HEp-2) cell datasets: ICPR2012 dataset from the ICPR'12 HEp-2 cell classification contest and ICIP2013 training dataset from the ICIP'13 Competition on cells classification by fluorescent image analysis. First, the reading of imaging results is significantly influenced by one’s qualification and reading systems, causing high intra- and inter-laboratory variance. The authors present a low-order LP21 fiber mode for optical single cell manipulation and imaging staining patterns of HEp-2 cells. A focused four-lobed mode distribution is stable and effective in optical...

  15. Bosniak classification system

    DEFF Research Database (Denmark)

    Graumann, Ole; Osther, Susanne Sloth; Karstoft, Jens

    2016-01-01

    BACKGROUND: The Bosniak classification was originally based on computed tomographic (CT) findings. Magnetic resonance (MR) and contrast-enhanced ultrasonography (CEUS) imaging may demonstrate findings that are not depicted at CT, and there may not always be a clear correlation between the findings...... at MR and CEUS imaging and those at CT. PURPOSE: To compare diagnostic accuracy of MR, CEUS, and CT when categorizing complex renal cystic masses according to the Bosniak classification. MATERIAL AND METHODS: From February 2011 to June 2012, 46 complex renal cysts were prospectively evaluated by three...... readers. Each mass was categorized according to the Bosniak classification and CT was chosen as gold standard. Kappa was calculated for diagnostic accuracy and data was compared with pathological results. RESULTS: CT images found 27 BII, six BIIF, seven BIII, and six BIV. Forty-three cysts could...

  16. Bosniak Classification system

    DEFF Research Database (Denmark)

    Graumann, Ole; Osther, Susanne Sloth; Karstoft, Jens

    2014-01-01

    Background: The Bosniak classification is a diagnostic tool for the differentiation of cystic changes in the kidney. The process of categorizing renal cysts may be challenging, involving a series of decisions that may affect the final diagnosis and clinical outcome such as surgical management....... Purpose: To investigate the inter- and intra-observer agreement among experienced uroradiologists when categorizing complex renal cysts according to the Bosniak classification. Material and Methods: The original categories of 100 cystic renal masses were chosen as “Gold Standard” (GS), established...... to the calculated weighted κ all readers performed “very good” for both inter-observer and intra-observer variation. Most variation was seen in cysts categorized as Bosniak II, IIF, and III. These results show that radiologists who evaluate complex renal cysts routinely may apply the Bosniak classification...

  17. Multilingual text induced spelling correction

    NARCIS (Netherlands)

    Reynaert, M.W.C.

    2004-01-01

    We present TISC, a multilingual, language-independent and context-sensitive spelling checking and correction system designed to facilitate the automatic removal of non-word spelling errors in large corpora. Its lexicon is derived from raw text corpora, without supervision, and contains word unigrams

  18. Automated analysis of instructional text

    Energy Technology Data Exchange (ETDEWEB)

    Norton, L.M.

    1983-05-01

    The development of a capability for automated processing of natural language text is a long-range goal of artificial intelligence. This paper discusses an investigation into the issues involved in the comprehension of descriptive, as opposed to illustrative, textual material. The comprehension process is viewed as the conversion of knowledge from one representation into another. The proposed target representation consists of statements of the Prolog language, which can be interpreted both declaratively and procedurally, much like production rules. A computer program has been written to model in detail some ideas about this process. The program successfully analyzes several heavily edited paragraphs adapted from an elementary textbook on programming, automatically synthesizing as a result of the analysis a working Prolog program which, when executed, can parse and interpret LET commands in the BASIC language. The paper discusses the motivations and philosophy of the project, the many kinds of prerequisite knowledge which are necessary, and the structure of the text analysis program. A sentence-by-sentence account of the analysis of the sample text is presented, describing the syntactic and semantic processing which is involved. The paper closes with a discussion of lessons learned from the project, possible alternative approaches, and possible extensions for future work. The entire project is presented as illustrative of the nature and complexity of the text analysis process, rather than as providing definitive or optimal solutions to any aspects of the task. 12 references.

  19. Solar Concepts: A Background Text.

    Science.gov (United States)

    Gorham, Jonathan W.

    This text is designed to provide teachers, students, and the general public with an overview of key solar energy concepts. Various energy terms are defined and explained. Basic thermodynamic laws are discussed. Alternative energy production is described in the context of the present energy situation. Described are the principal contemporary solar…

  20. FTP: Full-Text Publishing?

    Science.gov (United States)

    Jul, Erik

    1992-01-01

    Describes the use of file transfer protocol (FTP) on the INTERNET computer network and considers its use as an electronic publishing system. The differing electronic formats of text files are discussed; the preparation and access of documents are described; and problems are addressed, including a lack of consistency. (LRW)

  1. Quality Inspection of Printed Texts

    DEFF Research Database (Denmark)

    Pedersen, Jesper Ballisager; Nasrollahi, Kamal; Moeslund, Thomas B.

    2016-01-01

    -folded: for customers of the printing and verification system, the overall grade is used to verify whether the text is of sufficient quality, while for the printer's manufacturer, the detailed character/symbol grades and quality measurements are used for the improvement and optimization of the printing task. The proposed system...

  2. Acoustic classification of dwellings

    DEFF Research Database (Denmark)

    Berardi, Umberto; Rasmussen, Birgit

    2014-01-01

    Schemes for the classification of dwellings according to different building performances have been proposed in the last years worldwide. The general idea behind these schemes relates to the positive impact a higher label, and thus a better performance, should have. In particular, focusing on sound insulation performance, national schemes for sound classification of dwellings have been developed in several European countries. These schemes define acoustic classes according to different levels of sound insulation. Due to the lack of coordination among countries, a significant diversity in terms...... exchanging experiences about constructions fulfilling different classes, reducing trade barriers, and finally increasing the sound insulation of dwellings.

  3. Minimum Error Entropy Classification

    CERN Document Server

    Marques de Sá, Joaquim P; Santos, Jorge M F; Alexandre, Luís A

    2013-01-01

    This book explains the minimum error entropy (MEE) concept applied to data classification machines. Theoretical results on the inner workings of the MEE concept, in its application to solving a variety of classification problems, are presented in the wider realm of risk functionals. Researchers and practitioners also find in the book a detailed presentation of practical data classifiers using MEE. These include multi-layer perceptrons, recurrent neural networks, complex-valued neural networks, modular neural networks, and decision trees. A clustering algorithm using a MEE-like concept is also presented. Examples, tests, evaluation experiments and comparison with similar machines using classic approaches complement the descriptions.

  4. Classification of iconic images

    OpenAIRE

    Zrianina, Mariia; Kopf, Stephan

    2016-01-01

    Iconic images represent an abstract topic and use a presentation that is intuitively understood within a certain cultural context. For example, the abstract topic “global warming” may be represented by a polar bear standing alone on an ice floe. Such images are widely used in media and their automatic classification can help to identify high-level semantic concepts. This paper presents a system for the classification of iconic images. It uses a variation of the Bag of Visual Words approach wi...

  5. Casemix classification systems.

    Science.gov (United States)

    Fetter, R B

    1999-01-01

    The idea of using casemix classification to manage hospital services is not new, but has been limited by available technology. It was not until after the introduction of Medicare in the United States in 1965 that serious attempts were made to measure hospital production in order to contain spiralling costs. This resulted in a system of casemix classification known as diagnosis related groups (DRGs). This paper traces the development of DRGs and their evolution from the initial version to the All Patient Refined DRGs developed in 1991.

  6. A Way Forward for Ship Classification and Technical Services

    Directory of Open Access Journals (Sweden)

    Lam-Bee Goh

    2014-04-01

    Full Text Available Classification societies are one of the key organizations that promote the highest standards in ship safety and quality shipping. The paper reviews the ship classification industry and identifies what the classification societies can do to add value to the maritime industry more effectively. To meet this objective, an analysis of the five competitive forces is carried out, together with an opinion survey performed on some of the leading shipping companies, to assess and to establish some of the key factors which should be considered when formulating an overall business strategy for the growth of the classification services business. The findings from the study are discussed with the strategic options and choices. A classification services industrial value chain analysis together with ship management and operation is undertaken to explore the opportunities for classification societies. These findings also provide guidance to policy-makers who design and seek to implement more effective international shipping policies.

  7. CADASTRAL CLASSIFICATION OF THE LAND PLOTS IN UKRAINE

    Directory of Open Access Journals (Sweden)

    KIRICHEK Yu. O.

    2016-04-01

    Full Text Available Summary. This work concerns the development of a national system for classifying land plots. The developed classification will make it possible to solve correctly a number of cadastral, land management, valuation and other tasks. Existing classifications of land, improvements and real estate in general are analysed, and proposals are made for a new classification of land plots in Ukraine. Today the Ukrainian real estate market has no single system that divides property into groups, classes and types. This significantly complicates work and makes it impossible to be fully aware of the specific situation in the real estate market. Property classification is intended to solve this problem: it is used to move from a diversity of individual properties to a limited number of classes of valuation objects. The classification is distinguished by the functional purpose (use) of the valuation objects, which determines the difference in value.

  8. Extension classification method for low-carbon product cases

    Directory of Open Access Journals (Sweden)

    Yanwei Zhao

    2016-05-01

    Full Text Available In product low-carbon design, intelligent decision systems integrated with certain classification algorithms recommend existing design cases to designers. However, these systems depend mostly on prior experience, and product designers not only expect to get a satisfactory case from an intelligent system but also hope to receive assistance in modifying unsatisfactory cases. In this article, we propose a new categorization method composed of static and dynamic classification based on extension theory. This classification method can be integrated into a case-based reasoning system to get accurate classification results and to give designers detailed information about unsatisfactory cases. First, we establish the static classification model for cases using a dependent function in a hierarchical structure. Then, for dynamic classification, we transform cases based on the case model, attributes, attribute values, and dependent function, so that cases can undergo qualitative changes. Finally, the applicability of the proposed method is demonstrated through a case study of screw air compressor cases.

  9. Quality-Oriented Classification of Aircraft Material Based on SVM

    Directory of Open Access Journals (Sweden)

    Hongxia Cai

    2014-01-01

    Full Text Available The existing material classification is proposed to improve inventory management. However, different materials have different quality-related attributes, especially in the aircraft industry. In order to reduce cost without sacrificing quality, we propose a quality-oriented material classification system considering the material quality character, quality cost, and quality influence. The Analytic Hierarchy Process helps to make feature selection and classification decisions. We use the improved Kraljic Portfolio Matrix to establish the three-dimensional classification model. The aircraft materials can be divided into eight types, including general type, key type, risk type, and leveraged type. Aiming to improve the classification accuracy for various materials, the Support Vector Machine (SVM) algorithm is introduced. Finally, we compare the SVM and the BP neural network in the application. The results show that the SVM algorithm is more efficient and accurate and that the quality-oriented material classification is valuable.

  10. Integrating Globality and Locality for Robust Representation Based Classification

    Directory of Open Access Journals (Sweden)

    Zheng Zhang

    2014-01-01

    Full Text Available The representation based classification method (RBCM) has shown huge potential for face recognition since it first emerged. The linear regression classification (LRC) method and the collaborative representation classification (CRC) method are two well-known RBCMs. LRC and CRC exploit training samples of each class and all the training samples to represent the testing sample, respectively, and subsequently conduct classification on the basis of the representation residual. The LRC method can be viewed as a “locality representation” method because it just uses the training samples of each class to represent the testing sample and it cannot embody the effectiveness of the “globality representation.” On the contrary, it seems that the CRC method cannot own the benefit of locality of the general RBCM. Thus we propose to integrate CRC and LRC to perform more robust representation based classification. The experimental results on benchmark face databases substantially demonstrate that the proposed method achieves high classification accuracy.
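
    The contrast between the two residual-based schemes described in this abstract can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the helper names (`solve`, `ridge_coeffs`, `lrc_predict`, `crc_predict`), the ridge parameter `lam`, and the toy data are all hypothetical, and the integrated CRC+LRC method proposed in the paper is not reproduced here.

```python
def solve(M, b):
    """Solve M x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    aug = [row[:] + [b[i]] for i, row in enumerate(M)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        aug[col], aug[pivot] = aug[pivot], aug[col]
        for r in range(n):
            if r != col:
                f = aug[r][col] / aug[col][col]
                aug[r] = [a - f * c for a, c in zip(aug[r], aug[col])]
    return [aug[i][n] / aug[i][i] for i in range(n)]

def ridge_coeffs(samples, y, lam):
    """Coefficients x minimising ||y - sum_k x_k * samples[k]||^2 + lam * ||x||^2."""
    n = len(samples)
    gram = [[sum(a * b for a, b in zip(samples[i], samples[j])) + (lam if i == j else 0.0)
             for j in range(n)] for i in range(n)]
    rhs = [sum(a * b for a, b in zip(s, y)) for s in samples]
    return solve(gram, rhs)

def residual(samples, coeffs, y):
    """Euclidean distance between y and its reconstruction from the samples."""
    recon = [sum(c * s[d] for c, s in zip(coeffs, samples)) for d in range(len(y))]
    return sum((a - b) ** 2 for a, b in zip(y, recon)) ** 0.5

def lrc_predict(train, y, lam=1e-6):
    """LRC: represent y with each class's own samples; smallest residual wins."""
    scores = {label: residual(group, ridge_coeffs(group, y, lam), y)
              for label, group in train.items()}
    return min(scores, key=scores.get)

def crc_predict(train, y, lam=0.01):
    """CRC: represent y with all samples jointly, then compare class-wise residuals."""
    labels, samples = [], []
    for label, group in train.items():
        labels.extend([label] * len(group))
        samples.extend(group)
    x = ridge_coeffs(samples, y, lam)
    scores = {}
    for label in train:
        part = [c if lab == label else 0.0 for c, lab in zip(x, labels)]
        scores[label] = residual(samples, part, y)
    return min(scores, key=scores.get)

# Toy data (hypothetical): two classes of 3-dimensional "images".
train = {"a": [[1.0, 0.0, 0.0], [0.9, 0.1, 0.0]],
         "b": [[0.0, 1.0, 0.0], [0.0, 0.9, 0.1]]}
print(lrc_predict(train, [1.0, 0.05, 0.0]), crc_predict(train, [1.0, 0.05, 0.0]))
```

    LRC solves one small regression per class, whereas CRC solves a single regression over all training samples and only splits the coefficients by class afterwards; that is the locality/globality distinction drawn in the abstract.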

  11. The paradox of atheoretical classification

    DEFF Research Database (Denmark)

    Hjørland, Birger

    2016-01-01

    A distinction can be made between “artificial classifications” and “natural classifications,” where artificial classifications may adequately serve some limited purposes, but natural classifications are overall most fruitful by allowing inference and thus many different purposes. There is strong...... support for the view that a natural classification should be based on a theory (and, of course, that the most fruitful theory provides the most fruitful classification). Nevertheless, atheoretical (or “descriptive”) classifications are often produced. Paradoxically, atheoretical classifications may...... be very successful. The best example of a successful “atheoretical” classification is probably the prestigious Diagnostic and Statistical Manual of Mental Disorders (DSM) since its third edition from 1980. Based on such successes one may ask: Should the claim that classifications ideally are natural...

  12. CNN for breaking text-based CAPTCHA with noise

    Science.gov (United States)

    Liu, Kaixuan; Zhang, Rong; Qing, Ke

    2017-07-01

    A CAPTCHA ("Completely Automated Public Turing test to tell Computers and Humans Apart") system is a program that most humans can pass but current computer programs can hardly pass. As the most common type of CAPTCHA, text-based CAPTCHA has been widely used on websites to defend against network bots. To break text-based CAPTCHAs, in this paper two trained CNN models are connected for the segmentation and classification of CAPTCHA images. Based on these two models, we apply sliding-window segmentation and voting classification to realize an end-to-end CAPTCHA-breaking system with a high success rate. The experimental results show that our method is robust and effective in breaking text-based CAPTCHAs with noise.
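
    The general idea of combining sliding windows with a majority vote can be sketched as below. This is only an illustration under assumptions: the record does not describe the window geometry or the voting rule, the `classify` callable stands in for a trained CNN, and the names `sliding_windows` and `vote` are hypothetical.

```python
from collections import Counter

def sliding_windows(columns, width, step):
    """Yield (start, window) pairs covering a sequence of image columns."""
    for start in range(0, len(columns) - width + 1, step):
        yield start, columns[start:start + width]

def vote(columns, classify, width, step):
    """Classify every window and return the most frequent label."""
    votes = Counter(classify(window) for _, window in sliding_windows(columns, width, step))
    return votes.most_common(1)[0][0]

# Toy stand-in for a per-window classifier: the most frequent symbol in the window.
def toy_classifier(window):
    return max(set(window), key=window.count)

# One noisy column ("B") is outvoted by the surrounding windows.
print(vote(list("AAABA"), toy_classifier, width=3, step=1))
```

    Voting over overlapping windows makes the final label less sensitive to a single noisy window, which is one plausible reason to pair the two steps.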

  13. Linguistic dating of biblical texts

    DEFF Research Database (Denmark)

    Young, Ian; Rezetko, Robert; Ehrensvärd, Martin Gustaf

    Since the beginning of critical scholarship biblical texts have been dated using linguistic evidence. In recent years this has become a controversial topic, especially with the publication of Ian Young (ed.), Biblical Hebrew: Studies in Chronology and Typology (2003). However, until now there has...... been no introduction and comprehensive study of the field. Volume 1 introduces the field of linguistic dating of biblical texts, particularly to intermediate and advanced students of biblical Hebrew who have a reasonable background in the language, having completed at least an introductory course...... in this volume are: What is it that makes Archaic Biblical Hebrew archaic, Early Biblical Hebrew early, and Late Biblical Hebrew late? Does linguistic typology, i.e. different linguistic characteristics, convert easily and neatly into linguistic chronology, i.e. different historical origins? A large amount...

  14. Text as an Autopoietic System

    DEFF Research Database (Denmark)

    Nicolaisen, Maria Skou

    2016-01-01

    The aim of the present research article is to discuss the possibilities and limitations in addressing text as an autopoietic system. The theory of autopoiesis originated in the field of biology in order to explain the dynamic processes entailed in sustaining living organisms at cellular level. Th....... By comparing the biological with the textual account of autopoietic agency, the end conclusion is that a newly derived concept of sociopoiesis might be better suited for discussing the architecture of textual systems....

  15. The TEXT upgrade vertical interferometer

    International Nuclear Information System (INIS)

    Hallock, G.A.; Gartman, M.L.; Li, W.; Chiang, K.; Shin, S.; Castles, R.L.; Chatterjee, R.; Rahman, A.S.

    1992-01-01

    A far-infrared interferometer has been installed on TEXT upgrade to obtain electron density profiles. The primary system views the plasma vertically through a set of large (60-cm radial × 7.62-cm toroidal) diagnostic ports. A 1-cm channel spacing (59 channels total) and fast electronic time response is used, to provide high resolution for radial profiles and perturbation experiments. Initial operation of the vertical system was obtained late in 1991, with six operating channels.

  16. Reasoning with Annotations of Texts

    OpenAIRE

    Ma , Yue; Lévy , François; Ghimire , Sudeep

    2011-01-01

    Linguistic and semantic annotations are important features for text-based applications. However, achieving and maintaining a good quality of a set of annotations is known to be a complex task. Many ad hoc approaches have been developed to produce various types of annotations, while comparing those annotations to improve their quality is still rare. In this paper, we propose a framework in which both linguistic and domain information can cooperate to reason with annotat...

  17. Adaptive SVM for Data Stream Classification

    Directory of Open Access Journals (Sweden)

    Isah A. Lawal

    2017-07-01

    Full Text Available In this paper, we address the problem of learning an adaptive classifier for the classification of continuous streams of data. We present a solution based on incremental extensions of the Support Vector Machine (SVM) learning paradigm that updates an existing SVM whenever new training data are acquired. To ensure that the SVM effectiveness is guaranteed while exploiting the newly gathered data, we introduce an on-line model selection approach in the incremental learning process. We evaluated the proposed method on real-world applications including on-line spam email filtering and human action classification from videos. Experimental results show the effectiveness and the potential of the proposed approach.
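
    To illustrate what "updating an existing classifier whenever new training data are acquired" can look like, here is a minimal sketch of a linear SVM trained by stochastic sub-gradient steps on the regularised hinge loss. This is a swapped-in, simpler technique, not the incremental SVM or the on-line model selection of the paper; the class name `OnlineLinearSVM`, the learning rate `lr`, and the regularisation weight `lam` are assumptions made for the example.

```python
class OnlineLinearSVM:
    """Linear SVM updated batch by batch with hinge-loss sub-gradient steps."""

    def __init__(self, dim, lr=0.1, lam=0.01):
        self.w = [0.0] * dim   # weight vector
        self.b = 0.0           # bias term
        self.lr = lr           # step size (assumed constant)
        self.lam = lam         # L2 regularisation strength

    def decision(self, x):
        return sum(wi * xi for wi, xi in zip(self.w, x)) + self.b

    def partial_fit(self, batch):
        """Update the current model with newly arrived (x, y) pairs, y in {-1, +1}."""
        for x, y in batch:
            violated = y * self.decision(x) < 1.0
            # Shrink the weights (regularisation), then correct margin violations.
            self.w = [wi * (1.0 - self.lr * self.lam) for wi in self.w]
            if violated:
                self.w = [wi + self.lr * y * xi for wi, xi in zip(self.w, x)]
                self.b += self.lr * y

    def predict(self, x):
        return 1 if self.decision(x) >= 0 else -1

# Two batches arriving one after the other, as in a data stream.
model = OnlineLinearSVM(dim=2)
model.partial_fit([([2.0, 2.0], 1), ([-2.0, -2.0], -1)])
model.partial_fit([([3.0, 1.0], 1), ([-1.0, -3.0], -1)])
print(model.predict([2.0, 2.0]), model.predict([-3.0, -1.0]))
```

    The model is never retrained from scratch: each call to `partial_fit` starts from the weights left by the previous batch.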

  18. Memristive Perceptron for Combinational Logic Classification

    Directory of Open Access Journals (Sweden)

    Lidan Wang

    2013-01-01

    Full Text Available The resistance of the memristor depends upon the past history of the input current or voltage, so it can function as a synapse in neural networks. In this paper, a novel perceptron combined with the memristor is proposed to implement the combinational logic classification. The relationship between the memristive conductance change and the synapse weight update is deduced, and the memristive perceptron model and its synaptic weight update rule are explored. The feasibility of the novel memristive perceptron for implementing the combinational logic classification (NAND, NOR, XOR, and NXOR) is confirmed by MATLAB simulation.
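
    For orientation, the classical perceptron learning rule on logic gates can be sketched in plain software. This sketch does not model the memristive conductance update described in the abstract; the function names and hyperparameters are assumptions. A single threshold unit can learn NAND and NOR, but XOR is not linearly separable, so here XOR and NXOR are composed from learned NAND units.

```python
GATES = {
    "NAND": [((0, 0), 1), ((0, 1), 1), ((1, 0), 1), ((1, 1), 0)],
    "NOR":  [((0, 0), 1), ((0, 1), 0), ((1, 0), 0), ((1, 1), 0)],
}

def train_perceptron(samples, epochs=20, lr=1.0):
    """Classical perceptron rule: w += lr * (target - output) * input."""
    w, b = [0.0, 0.0], 0.0
    for _ in range(epochs):
        for (x1, x2), target in samples:
            out = 1 if w[0] * x1 + w[1] * x2 + b > 0 else 0
            err = target - out
            w[0] += lr * err * x1
            w[1] += lr * err * x2
            b += lr * err
    return w, b

def make_gate(samples):
    """Return a two-input gate function backed by a trained perceptron."""
    w, b = train_perceptron(samples)
    return lambda x1, x2: 1 if w[0] * x1 + w[1] * x2 + b > 0 else 0

nand = make_gate(GATES["NAND"])
nor = make_gate(GATES["NOR"])

def xor(a, b):
    """XOR built from four NAND units (a two-layer arrangement)."""
    n = nand(a, b)
    return nand(nand(a, n), nand(b, n))

def nxor(a, b):
    """NXOR as NOT(XOR), with NOT(x) realised as NAND(x, x)."""
    x = xor(a, b)
    return nand(x, x)

for a in (0, 1):
    for b in (0, 1):
        print(a, b, nand(a, b), nor(a, b), xor(a, b), nxor(a, b))
```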

  19. Oscillating feature subset search algorithm for text categorization

    Czech Academy of Sciences Publication Activity Database

    Novovičová, Jana; Somol, Petr; Pudil, Pavel

    2006-01-01

    Vol. 44, No. 4225 (2006), pp. 578-587. ISSN 0302-9743. R&D Projects: GA AV ČR IAA2075302; GA MŠk 2C06019. EU Projects: European Commission (XE) 507752 - MUSCLE. Institutional research plan: CEZ:AV0Z10750506. Keywords: text classification; feature selection; oscillating search algorithm; Bhattacharyya distance. Subject RIV: BB - Applied Statistics, Operational Research. Impact factor: 0.402, year: 2005

  20. Classification across gene expression microarray studies

    Directory of Open Access Journals (Sweden)

    Kuner Ruprecht

    2009-12-01

    Full Text Available Background: The increasing number of gene expression microarray studies represents an important resource in biomedical research. As a result, gene expression based diagnosis has entered clinical practice for patient stratification in breast cancer. However, the integration and combined analysis of microarray studies remains a challenge. We assessed the potential benefit of data integration on the classification accuracy and systematically evaluated the generalization performance of selected methods on four breast cancer studies comprising almost 1000 independent samples. To this end, we introduced an evaluation framework which aims to establish good statistical practice and a graphical way to monitor differences. The classification goal was to correctly predict estrogen receptor status (negative/positive) and histological grade (low/high) of each tumor sample in an independent study which was not used for the training. For the classification we chose support vector machines (SVM), predictive analysis of microarrays (PAM), random forest (RF) and k-top scoring pairs (kTSP). Guided by considerations relevant for classification across studies we developed a generalization of kTSP which we evaluated in addition. Our derived version (DV) aims to improve the robustness of the intrinsic invariance of kTSP with respect to technologies and preprocessing. Results: For each individual study the generalization error was benchmarked via complete cross-validation and was found to be similar for all classification methods. The misclassification rates were substantially higher in classification across studies, when each single study was used as an independent test set while all remaining studies were combined for the training of the classifier. However, with increasing number of independent microarray studies used in the training, the overall classification performance improved. DV performed better than the average and showed slightly less variance. In
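
    The cross-study protocol described above (each study held out in turn as an independent test set, all remaining studies pooled for training) can be sketched as follows. The nearest-centroid classifier on a single feature, the function names, and the toy numbers are assumptions chosen to keep the example short; they are not the SVM, PAM, RF or kTSP classifiers evaluated in the paper.

```python
def nearest_centroid(train):
    """Fit a one-feature nearest-centroid classifier on (value, label) pairs."""
    sums = {}
    for x, label in train:
        total, n = sums.get(label, (0.0, 0))
        sums[label] = (total + x, n + 1)
    centroids = {label: total / n for label, (total, n) in sums.items()}
    return lambda x: min(centroids, key=lambda label: abs(x - centroids[label]))

def cross_study_errors(studies):
    """Hold out each study in turn; train on the pooled remaining studies."""
    errors = {}
    for held_out, test in studies.items():
        train = [s for name, samples in studies.items() if name != held_out for s in samples]
        predict = nearest_centroid(train)
        wrong = sum(predict(x) != label for x, label in test)
        errors[held_out] = wrong / len(test)
    return errors

# Hypothetical toy data: three "studies" with one expression value per sample.
studies = {
    "A": [(0.1, "neg"), (0.9, "pos")],
    "B": [(0.2, "neg"), (0.8, "pos")],
    "C": [(0.0, "neg"), (1.0, "pos"), (0.45, "pos")],
}
print(cross_study_errors(studies))
```

    Because no sample from the held-out study is seen during training, the reported error reflects generalization across studies rather than within one study.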

  1. Ecosystem classification, Chapter 2

    Science.gov (United States)

    M.J. Robin-Abbott; L.H. Pardo

    2011-01-01

    The ecosystem classification in this report is based on the ecoregions developed through the Commission for Environmental Cooperation (CEC) for North America (CEC 1997). Only ecosystems that occur in the United States are included. CEC ecoregions are described, with slight modifications, below (CEC 1997) and shown in Figures 2.1 and 2.2. We chose this ecosystem...

  2. The classification of phocomelia.

    Science.gov (United States)

    Tytherleigh-Strong, G; Hooper, G

    2003-06-01

    We studied 24 patients with 44 phocomelic upper limbs. Only 11 limbs could be grouped in the classification system of Frantz and O'Rahilly. The non-classifiable limbs were further studied and their characteristics identified. It is confirmed that phocomelia is not an intercalary defect.

  3. Principles for ecological classification

    Science.gov (United States)

    Dennis H. Grossman; Patrick Bourgeron; Wolf-Dieter N. Busch; David T. Cleland; William Platts; G. Ray; C. Robins; Gary Roloff

    1999-01-01

    The principal purpose of any classification is to relate common properties among different entities to facilitate understanding of evolutionary and adaptive processes. In the context of this volume, it is to facilitate ecosystem stewardship, i.e., to help support ecosystem conservation and management objectives.

  4. Mimicking human texture classification

    NARCIS (Netherlands)

    Rogowitz, B.E.; van Rikxoort, Eva M.; van den Broek, Egon; Pappas, T.N.; Schouten, Theo E.; Daly, S.J.

    2005-01-01

    In an attempt to mimic human (colorful) texture classification by a clustering algorithm three lines of research have been encountered, in which as test set 180 texture images (both their color and gray-scale equivalent) were drawn from the OuTex and VisTex databases. First, a k-means algorithm was

  5. Classification, confusion and misclassification

    African Journals Online (AJOL)

    The classification of objects and phenomena in science and nature has fascinated academics since Carl Linnaeus, the Swedish botanist and zoologist, created his binomial description of living things in the 1700s and probably long before in accounts of others in textbooks long since gone. It must have concerned human ...

  6. Classifications in popular music

    NARCIS (Netherlands)

    van Venrooij, A.; Schmutz, V.; Wright, J.D.

    2015-01-01

    The categorical system of popular music, such as genre categories, is a highly differentiated and dynamic classification system. In this article we present work that studies different aspects of these categorical systems in popular music. Following the work of Paul DiMaggio, we focus on four

  7. Shark Teeth Classification

    Science.gov (United States)

    Brown, Tom; Creel, Sally; Lee, Velda

    2009-01-01

    On a recent autumn afternoon at Harmony Leland Elementary in Mableton, Georgia, students in a fifth-grade science class investigated the essential process of classification--the act of putting things into groups according to some common characteristics or attributes. While they may have honed these skills earlier in the week by grouping their own…

  8. Classification in Medical Imaging

    DEFF Research Database (Denmark)

    Chen, Chen

    Classification is extensively used in the context of medical image analysis for the purpose of diagnosis or prognosis. In order to classify image content correctly, one needs to extract efficient features with discriminative properties and build classifiers based on these features. In addition...... on characterizing human faces and emphysema disease in lung CT images....

  9. Improving Student Question Classification

    Science.gov (United States)

    Heiner, Cecily; Zachary, Joseph L.

    2009-01-01

    Students in introductory programming classes often articulate their questions and information needs incompletely. Consequently, the automatic classification of student questions to provide automated tutorial responses is a challenging problem. This paper analyzes 411 questions from an introductory Java programming course by reducing the natural…

  10. NOUN CLASSIFICATION IN ESAHIE

    African Journals Online (AJOL)

    The present work deals with noun classification in Esahie (Kwa, Niger ... phonological information influences the noun (form) class system of Esahie. ... between noun classes and (grammatical) Gender is interrogated (in the light of ..... the (A) argument precedes the verb and the (P) argument follows the verb in a simple.

  11. Dynamic Latent Classification Model

    DEFF Research Database (Denmark)

    Zhong, Shengtong; Martínez, Ana M.; Nielsen, Thomas Dyhre

    as possible. Motivated by this problem setting, we propose a generative model for dynamic classification in continuous domains. At each time point the model can be seen as combining a naive Bayes model with a mixture of factor analyzers (FA). The latent variables of the FA are used to capture the dynamics...

  12. Classification of myocardial infarction

    DEFF Research Database (Denmark)

    Saaby, Lotte; Poulsen, Tina Svenstrup; Hosbond, Susanne Elisabeth

    2013-01-01

    The classification of myocardial infarction into 5 types was introduced in 2007 as an important component of the universal definition. In contrast to the plaque rupture-related type 1 myocardial infarction, type 2 myocardial infarction is considered to be caused by an imbalance between demand...

  13. Text Mining for Protein Docking.

    Directory of Open Access Journals (Sweden)

    Varsha D Badal

    2015-12-01

    Full Text Available The rapidly growing amount of publicly available information from biomedical research is readily accessible on the Internet, providing a powerful resource for predictive biomolecular modeling. The accumulated data on experimentally determined structures transformed structure prediction of proteins and protein complexes. Instead of exploring the enormous search space, predictive tools can simply proceed to the solution based on similarity to the existing, previously determined structures. A similar major paradigm shift is emerging due to the rapidly expanding amount of information, other than experimentally determined structures, which still can be used as constraints in biomolecular structure prediction. Automated text mining has been widely used in recreating protein interaction networks, as well as in detecting small ligand binding sites on protein structures. Combining and expanding these two well-developed areas of research, we applied text mining to structural modeling of protein-protein complexes (protein docking). Protein docking can be significantly improved when constraints on the docking mode are available. We developed a procedure that retrieves published abstracts on a specific protein-protein interaction and extracts information relevant to docking. The procedure was assessed on protein complexes from Dockground (http://dockground.compbio.ku.edu). The results show that correct information on binding residues can be extracted for about half of the complexes. The amount of irrelevant information was reduced by conceptual analysis of a subset of the retrieved abstracts, based on the bag-of-words (features) approach. Support Vector Machine models were trained and validated on the subset. The remaining abstracts were filtered by the best-performing models, which decreased the irrelevant information for ~25% of the complexes in the dataset. The extracted constraints were incorporated in the docking protocol and tested on the Dockground unbound

  14. The Balinese Unicode Text Processing

    Directory of Open Access Journals (Sweden)

    Imam Habibi

    2009-06-01

    Full Text Available In principle, the computer only recognizes numbers as the representation of a character. Therefore, there are many encoding systems to allocate these numbers, although not all characters are covered. In Europe, even a single language needs more than one encoding system. Hence, a new encoding system known as Unicode has been established to overcome this problem. Unicode provides a unique ID for each character, which does not depend on platform, program, or language. The Unicode standard has been applied by a number of industry vendors, such as Apple, HP, IBM, JustSystem, Microsoft, Oracle, SAP, Sun, Sybase, and Unisys. In addition, language standards and modern information exchanges such as XML, Java, ECMAScript (JavaScript), LDAP, CORBA 3.0, and WML make use of Unicode as an official tool for implementing ISO/IEC 10646. There are four things to do for Balinese script: the algorithms of transliteration, searching, sorting, and word boundary analysis (spell checking). To verify the correctness of the algorithms, some applications were made. These applications can run on the Linux/Windows OS platform using J2SDK 1.5 and the J2ME WTK2 library. The input and output of the algorithm/application are character sequences obtained from keyboard input and external files. This research produces a module or library which is able to process Balinese text based on the Unicode standard. The output of this research is the ability, skill, and mastery of: 1. The Unicode standard (21-bit) as a substitute for ASCII (7-bit) and ISO8859-1 (8-bit) as the former default character sets in many applications. 2. The Balinese Unicode text processing algorithm. 3. An experience of working with and learning from an international team that consists of the foremost experts in the area: Michael Everson (Ireland), Peter Constable (Microsoft US), I Made Suatjana, and Ida Bagus Adi Sudewa.
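
    The point that Unicode assigns each character a platform-independent number can be illustrated with a short sketch. It relies only on the fact that the Unicode "Balinese" block spans U+1B00 to U+1B7F; the helper names are hypothetical, and none of the paper's transliteration, searching, sorting or word-boundary algorithms is reproduced.

```python
import unicodedata

# Range of the Unicode "Balinese" block.
BALINESE_START, BALINESE_END = 0x1B00, 0x1B7F

def is_balinese(ch):
    """True if the character's code point lies in the Balinese block."""
    return BALINESE_START <= ord(ch) <= BALINESE_END

def code_points(text):
    """List the code points of a string in the usual U+XXXX notation."""
    return [f"U+{ord(ch):04X}" for ch in text]

sample = "\u1B05a"  # one Balinese letter followed by Latin "a"
for ch in sample:
    # The same code point is used on every platform; only the byte encoding varies.
    print(code_points(ch)[0], unicodedata.name(ch), is_balinese(ch), len(ch.encode("utf-8")))
```

    Code points in this block need three bytes in UTF-8, whereas the Latin letter needs one, which is why the 7-bit and 8-bit character sets mentioned in the abstract cannot represent Balinese text.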

  15. Text mining by Tsallis entropy

    Science.gov (United States)

    Jamaati, Maryam; Mehri, Ali

    2018-01-01

    Long-range correlations between the elements of natural languages enable them to convey very complex information. The complex structure of human language, as a manifestation of natural languages, motivates us to apply nonextensive statistical mechanics to text mining. Tsallis entropy appropriately ranks the terms' relevance to the document subject, taking advantage of their spatial correlation length. We apply this statistical concept as a new, powerful word ranking metric in order to extract the keywords of a single document. We carry out an experimental evaluation, which shows the capability of the presented method in keyword extraction. We find that Tsallis entropy has reliable word ranking performance, on par with the best previous ranking methods.
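
    The entropy named in this abstract has the standard form S_q = (1 - sum_i p_i^q) / (q - 1), which reduces to the Shannon entropy as q approaches 1. The sketch below computes it for the distribution of a word's occurrences over equal-length text segments. The segmentation helper is a hypothetical simplification for illustration and is not the ranking procedure of the paper.

```python
import math


def tsallis_entropy(probs, q):
    """Tsallis entropy S_q = (1 - sum(p_i**q)) / (q - 1).

    For q == 1 the Shannon entropy (natural log) is returned, which is
    the q -> 1 limit of the formula.
    """
    probs = [p for p in probs if p > 0]
    if abs(q - 1.0) < 1e-12:
        return -sum(p * math.log(p) for p in probs)
    return (1.0 - sum(p ** q for p in probs)) / (q - 1.0)


def segment_distribution(tokens, word, n_segments):
    """Share of a word's occurrences that fall in each equal-length segment.

    Hypothetical helper, not taken from the paper.
    """
    counts = [0] * n_segments
    for i, tok in enumerate(tokens):
        if tok == word:
            counts[i * n_segments // len(tokens)] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else []


tokens = "a b a c a d e f g a".split()
dist = segment_distribution(tokens, "a", 2)
print(dist, tsallis_entropy(dist, q=2.0))
```

    A word spread evenly over the text gives a high entropy, while a word clustered in a few segments gives a low one; how such values are turned into a keyword ranking is left open here.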

  16. Biased limiter experiments on text

    International Nuclear Information System (INIS)

    Phillips, P.E.; Wootton, A.J.; Rowan, W.L.; Ritz, C.P.; Rhodes, T.L.; Bengtson, R.D.; Hodge, W.L.; Durst, R.D.; McCool, S.C.; Richards, B.; Gentle, K.W.; Schoch, P.; Forster, J.C.; Hickok, R.L.; Evans, T.E.

    1987-01-01

    Experiments using an electrically biased limiter have been performed on the Texas Experimental Tokamak (TEXT). A small movable limiter is inserted past the main poloidal ring limiter (which is electrically connected to the vacuum vessel) and biased at V_Lim with respect to it. The floating potential, plasma potential and shear layer position can be controlled. With |V_Lim| ≥ 50 V the plasma density increases. For V_Lim > 0 the results obtained are inconclusive. Variation of V_Lim changes the electrostatic turbulence, which may explain the observed total flux changes. (orig.)

  17. New Historicism: Text and Context

    Directory of Open Access Journals (Sweden)

    Violeta M. Vesić

    2016-02-01

    Full Text Available During most of the twentieth century history was seen as a phenomenon outside of literature that guaranteed the veracity of literary interpretation. History was unique and it functioned as a basis for reading literary works. During the seventies of the twentieth century there occurred a change of attitude towards history in American literary theory, and there appeared a new theoretical approach which soon became known as New Historicism. Since its inception, New Historicism has been identified with the study of Renaissance and Romanticism, but nowadays it has been increasingly involved in other literary trends. Although there are great differences in the arguments and practices among the various representatives of this school, New Historicism has clearly recognizable features, and many new historicists will agree with the statement of Walter Cohen that New Historicism, when it appeared in the eighties, represented something quite new in reference to the studies of theory, criticism and history (Cohen 1987, 33). The theoretical connection with Bakhtin, Foucault and Marx is clear, as well as a kind of uneasy tie with deconstruction and the work of Paul de Man. At the center of this approach is a renewed interest in the study of literary works in the light of the historical and political circumstances in which they were created. Foucault encouraged readers to begin to move literary texts and to link them with discourses and representations that are not literary, as well as to examine the sociological aspects of the texts in order to take part in the social struggles of today. The study of literary works using New Historicism is the study of politics, history, culture and circumstances in which these works were created.
With regard to one of the main facts at the center of this criticism, that history cannot be viewed objectively and that reality can only be understood through a cultural context that reveals the work, re-reading and interpretation of

  18. A Gameplay Definition through Videogame Classification

    Directory of Open Access Journals (Sweden)

    Damien Djaouti

    2008-01-01

    Full Text Available This paper is part of an experimental approach aimed at building a classification of videogames. Inspired by the methodology that Propp used for the classification of Russian fairy tales, we have identified recurrent diagrams within the rules of videogames, which we call “Gameplay Bricks”. The combinations of these different bricks should allow us to represent a classification of all videogames in accordance with their rules. In this article, we study the nature of these bricks, especially the link they seem to have with two types of game rules: the rules that allow the player to “manipulate” the elements of the game, and the rules defining the “goal” of the game. This study leads to a hypothesis about the nature of gameplay.

  19. Towards Automatic Classification of Wikipedia Content

    Science.gov (United States)

    Szymański, Julian

    Wikipedia - the Free Encyclopedia encounters the problem of proper classification of new articles every day. The assignment of articles to categories is performed manually and is a time-consuming task. It requires knowledge of the Wikipedia structure that is beyond typical editor competence, which leads to human-caused mistakes: omitted or wrong assignments of articles to categories. The article presents an application of an SVM classifier for automatic classification of documents from The Free Encyclopedia. The classifier has been tested using two text representations: inter-document connections (hyperlinks) and word content. The results of the experiments, evaluated on hand-crafted data, show that the Wikipedia classification process can be partially automated. The proposed approach can be used to build a decision support system that suggests to editors the categories that best fit new content entered into Wikipedia.

  20. Inventory classification based on decoupling points

    Directory of Open Access Journals (Sweden)

    Joakim Wikner

    2015-01-01

    Full Text Available The ideal state of continuous one-piece flow may never be achieved. Still the logistics manager can improve the flow by carefully positioning inventory to buffer against variations. Strategies such as lean, postponement, mass customization, and outsourcing all rely on strategic positioning of decoupling points to separate forecast-driven from customer-order-driven flows. Planning and scheduling of the flow are also based on classification of decoupling points as master scheduled or not. A comprehensive classification scheme for these types of decoupling points is introduced. The approach rests on identification of flows as being either demand based or supply based. The demand or supply is then combined with exogenous factors, classified as independent, or endogenous factors, classified as dependent. As a result, eight types of strategic as well as tactical decoupling points are identified resulting in a process-based framework for inventory classification that can be used for flow design.

  1. Exploring different approaches for music genre classification

    Directory of Open Access Journals (Sweden)

    Antonio Jose Homsi Goulart

    2012-07-01

    Full Text Available In this letter, we present different approaches for music genre classification. The proposed techniques, which are composed of a feature extraction stage followed by a classification procedure, explore both the variations of parameters used as input and the classifier architecture. Tests were carried out with three styles of music, namely blues, classical, and lounge, which are considered informally by some musicians as being “big dividers” among music genres, showing the efficacy of the proposed algorithms and establishing a relationship between the relevance of each set of parameters for each music style and each classifier. In contrast to other works, entropies and fractal dimensions are the features adopted for the classifications.

  2. Transportation Modes Classification Using Sensors on Smartphones

    Directory of Open Access Journals (Sweden)

    Shih-Hau Fang

    2016-08-01

    Full Text Available This paper investigates the classification of transportation and vehicular modes using big data from smartphone sensors. The three types of sensors used in this paper are the accelerometer, magnetometer, and gyroscope. This study proposes improved features and uses three machine learning algorithms, namely decision trees, K-nearest neighbor, and support vector machine, to classify the user’s transportation and vehicular modes. In the experiments, we discuss and compare the performance from different perspectives, including the accuracy for both modes, the execution time, and the model size. Results show that the proposed features enhance the accuracy, and that the support vector machine provides the best classification accuracy but requires the longest prediction time. This paper also investigates the vehicle classification mode and compares the results with those of the transportation modes.

  3. Global Optimization Ensemble Model for Classification Methods

    Directory of Open Access Journals (Sweden)

    Hina Anwar

    2014-01-01

    Full Text Available Supervised learning is the process of data mining for deducing rules from training datasets. A broad array of supervised learning algorithms exists, each with its own advantages and drawbacks. Some basic issues affect the accuracy of a classifier when solving a supervised learning problem, such as the bias-variance tradeoff, the dimensionality of the input space, and noise in the input data space. These issues are the reason that there is no globally optimal method for classification, and no generalized improvement method can increase the accuracy of any classifier while addressing all of them. This paper proposes a global optimization ensemble model for classification methods (GMC) that can improve the overall accuracy for supervised learning problems. The experimental results on various public datasets showed that the proposed model improved the accuracy of the classification models by 1% to 30%, depending upon the algorithm complexity.

  4. Automatic Classification of Attacks on IP Telephony

    Directory of Open Access Journals (Sweden)

    Jakub Safarik

    2013-01-01

    Full Text Available This article proposes an algorithm for automatic analysis of attack data in an IP telephony network using a neural network. Data for the analysis is gathered from various monitoring applications running in the network. Such monitoring systems are a typical part of today's networks, but the information from them is usually used only after an attack. An automatic classification of IP telephony attacks makes nearly real-time classification possible, along with countering or mitigating potential attacks. The classification uses the proposed neural network, and the article covers the design of the neural network and its practical implementation. It also describes methods for neural network learning and functions for gathering data from a honeypot application.

  5. Efficient Fingercode Classification

    Science.gov (United States)

    Sun, Hong-Wei; Law, Kwok-Yan; Gollmann, Dieter; Chung, Siu-Leung; Li, Jian-Bin; Sun, Jia-Guang

    In this paper, we present an efficient fingerprint classification algorithm, which is an essential component in many critical security application systems, e.g., systems in the e-government and e-finance domains. Fingerprint identification is one of the most important security requirements in homeland security systems such as personnel screening and anti-money laundering. The problem of fingerprint identification involves searching (matching) the fingerprint of a person against each of the fingerprints of all registered persons. To enhance performance and reliability, a common approach is to reduce the search space by first classifying the fingerprints and then performing the search in the respective class. Jain et al. proposed a fingerprint classification algorithm based on a two-stage classifier, which uses a K-nearest neighbor classifier in its first stage. The algorithm is based on the fingercode representation, an encoding of fingerprints that has been demonstrated to be an effective fingerprint biometric scheme because of its ability to capture both local and global details in a fingerprint image. We enhance this approach by improving the efficiency of the K-nearest neighbor classifier for fingercode-based fingerprint classification. Our research first investigates the various fast search algorithms in vector quantization (VQ) and their potential application in fingerprint classification, and then proposes two efficient algorithms based on the pyramid-based search algorithms in VQ. Experimental results on DB1 of FVC 2004 demonstrate that our algorithms can outperform the full search algorithm and the original pyramid-based search algorithms in terms of computational efficiency without sacrificing accuracy.
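
    To make the idea of a fast nearest-neighbor search concrete, here is a minimal sketch of K-nearest neighbor classification with partial-distance elimination, a well-known fast search technique from vector quantization. It is not the pyramid-based algorithm proposed in the paper; the two-dimensional vectors and the class labels are toy assumptions standing in for real fingercodes.

```python
import heapq
from collections import Counter


def knn_partial_distance(query, vectors, k):
    """Return the k nearest (squared_distance, index) pairs, closest first.

    Partial-distance elimination: stop accumulating a candidate's squared
    distance as soon as it exceeds the current k-th best distance.
    """
    best = []  # max-heap emulated with negated distances: (-dist, index)
    for idx, vec in enumerate(vectors):
        limit = -best[0][0] if len(best) == k else float("inf")
        dist = 0.0
        for a, b in zip(query, vec):
            dist += (a - b) ** 2
            if dist > limit:
                break  # cannot be among the k nearest; skip remaining dims
        else:
            if len(best) < k:
                heapq.heappush(best, (-dist, idx))
            else:
                heapq.heapreplace(best, (-dist, idx))
    return sorted((-d, i) for d, i in best)


def classify(query, vectors, labels, k=3):
    """Majority vote over the labels of the k nearest vectors."""
    neighbours = knn_partial_distance(query, vectors, k)
    votes = Counter(labels[i] for _, i in neighbours)
    return votes.most_common(1)[0][0]


# Toy data (assumption): real fingercodes have many more dimensions.
vectors = [(0, 0), (0, 1), (1, 0), (5, 5), (6, 5), (5, 6)]
labels = ["arch", "arch", "arch", "whorl", "whorl", "whorl"]
print(classify((5.4, 5.2), vectors, labels))
```

    The early exit returns the same neighbors as a full search, since a candidate is dropped only once its partial distance already exceeds the current k-th best. The saving grows with the number of dimensions.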

  6. 78 FR 54970 - Cotton Futures Classification: Optional Classification Procedure

    Science.gov (United States)

    2013-09-09

    ... Service 7 CFR Part 27 [AMS-CN-13-0043] RIN 0581-AD33 Cotton Futures Classification: Optional Classification Procedure AGENCY: Agricultural Marketing Service, USDA. ACTION: Proposed rule. SUMMARY: The... optional cotton futures classification procedure--identified and known as ``registration'' by the U.S...

  7. PASTEC: an automatic transposable element classification tool.

    Directory of Open Access Journals (Sweden)

    Claire Hoede

    Full Text Available SUMMARY: The classification of transposable elements (TEs) is a key step towards deciphering their potential impact on the genome. However, this process is often based on manual sequence inspection by TE experts. With the wealth of genomic sequences now available, this task requires automation, making it accessible to most scientists. We propose a new tool, PASTEC, which classifies TEs by searching for structural features and similarities. This tool outperforms currently available software for TE classification. The main innovation of PASTEC is the search for HMM profiles, which is useful for inferring the classification of unknown TEs on the basis of conserved functional domains of the proteins. In addition, PASTEC is the only tool providing an exhaustive spectrum of possible classifications to the order level of the Wicker hierarchical TE classification system. It can also automatically classify other repeated elements, such as SSR (Simple Sequence Repeats), rDNA or potential repeated host genes. Finally, the output of this new tool is designed to facilitate manual curation by providing biologists with all the evidence accumulated for each TE consensus. AVAILABILITY: PASTEC is available as a REPET module or standalone software (http://urgi.versailles.inra.fr/download/repet/REPET_linux-x64-2.2.tar.gz). It requires a Unix-like system. There are two standalone versions: one is parallelized (requiring Sun Grid Engine or Torque), and the other is not.

  8. A comparative evaluation of sequence classification programs

    Directory of Open Access Journals (Sweden)

    Bazinet Adam L

    2012-05-01

    Full Text Available Background: A fundamental problem in modern genomics is to taxonomically or functionally classify DNA sequence fragments derived from environmental sampling (i.e., metagenomics). Several different methods have been proposed for doing this effectively and efficiently, and many have been implemented in software. In addition to varying their basic algorithmic approach to classification, some methods screen sequence reads for ’barcoding genes’ like 16S rRNA, or various types of protein-coding genes. Due to the sheer number and complexity of methods, it can be difficult for a researcher to choose one that is well-suited for a particular analysis. Results: We divided the very large number of programs that have been released in recent years for solving the sequence classification problem into three main categories based on the general algorithm they use to compare a query sequence against a database of sequences. We also evaluated the performance of the leading programs in each category on data sets whose taxonomic and functional composition is known. Conclusions: We found significant variability in classification accuracy, precision, and resource consumption of sequence classification programs when used to analyze various metagenomics data sets. However, we observe some general trends and patterns that will be useful to researchers who use sequence classification programs.

  9. LDA boost classification: boosting by topics

    Science.gov (United States)

    Lei, La; Qiao, Guo; Qimin, Cao; Qitao, Li

    2012-12-01

    AdaBoost is an effective classification algorithm, especially in text categorization (TC) tasks. Its methodology of setting up a classifier committee and voting on the documents to be classified can achieve high categorization precision. However, the traditional Vector Space Model easily leads to the curse of dimensionality and to feature sparsity problems, which seriously affect classification performance. This article proposes a novel classification algorithm called LDABoost, based on the boosting ideology, which uses Latent Dirichlet Allocation (LDA) to model the feature space. Instead of using words or phrases, LDABoost uses latent topics as the features. In this way, the feature dimension is significantly reduced. An improved Naïve Bayes (NB) is designed as the weak classifier, which keeps the efficiency advantage of the classic NB algorithm and has higher precision. Moreover, a two-stage iterative weighted method called Cute Integration is proposed for improving accuracy by integrating weak classifiers into a strong classifier in a more rational way. Mutual Information is used as the metric for weight allocation. The voting information and the categorization decisions made by the base classifiers are fully utilized for generating the strong classifier. Experimental results reveal that LDABoost, which performs categorization in a low-dimensional space, has higher accuracy than traditional AdaBoost algorithms and many other classic classification algorithms. Moreover, its runtime consumption is lower than that of different versions of AdaBoost and of TC algorithms based on support vector machines and neural networks.

  10. Application of Cocktail method in vegetation classification

    Directory of Open Access Journals (Sweden)

    Hamed Asadi

    2016-09-01

    Full Text Available This study assesses the application of the Cocktail method in the classification of large vegetation databases. For this purpose, a Buxus hyrcana dataset consisting of 442 relevés with 89 species was used. To run the Cocktail method, a preliminary classification was first done with modified TWINSPAN, and by performing phi analysis on the resulting groups, the five species with the highest fidelity values were selected. Sociological species groups were then formed by examining the co-occurrence of these 5 species with other species in the database. Twenty-one plant communities belonging to 6 variants, 17 subassociations, 11 associations, 4 alliances, 1 order and 1 class were recognized by assigning 379 relevés to the sociological species groups using logical formulas. In addition, 63 relevés that the logical formulas did not assign to any sociological species group were assigned by the FPFI index to the group with the highest index value. Given the 91% agreement between the Braun-Blanquet and Cocktail classifications, we suggest the Cocktail method to vegetation scientists as an efficient alternative to the Braun-Blanquet method for classifying large vegetation databases.
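
    The phi analysis mentioned in this abstract is commonly computed as the phi coefficient of association between a species and a group of relevés. The sketch below uses that standard formula and assumes it matches the study's fidelity measure. The counts in the example are hypothetical; only the total of 442 relevés comes from the abstract.

```python
import math


def phi_fidelity(N, N_p, n, n_p):
    """Phi coefficient of association between a species and a target group.

    N   : total number of relevés in the dataset
    N_p : number of relevés in the target group
    n   : number of relevés containing the species
    n_p : number of target-group relevés containing the species
    """
    denom = math.sqrt(n * N_p * (N - n) * (N - N_p))
    if denom == 0:
        return 0.0  # degenerate case: species or group covers all or nothing
    return (N * n_p - n * N_p) / denom


def rank_by_fidelity(N, N_p, species_counts):
    """Sort species by decreasing phi; species_counts maps name -> (n, n_p)."""
    return sorted(
        species_counts,
        key=lambda s: phi_fidelity(N, N_p, *species_counts[s]),
        reverse=True,
    )


# Hypothetical counts for a target group of 50 relevés out of 442.
counts = {"species A": (60, 40), "species B": (200, 25), "species C": (30, 2)}
print(rank_by_fidelity(442, 50, counts))
```

    Phi ranges from -1 to 1: a value of 1 means the species occurs in every relevé of the group and nowhere else, and 0 means its occurrence is independent of the group.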

  11. CLASSIFICATION BY USING MULTISPECTRAL POINT CLOUD DATA

    Directory of Open Access Journals (Sweden)

    C. T. Liao

    2012-07-01

    Full Text Available Remote sensing images are generally recorded in a two-dimensional format containing multispectral information. The semantic information is also clearly visualized, so ground features can be readily recognized and classified via supervised or unsupervised classification methods. Nevertheless, multispectral images depend heavily on light conditions, and their classification results lack three-dimensional semantic information. On the other hand, LiDAR has become a main technology for acquiring high-accuracy point cloud data. The advantages of LiDAR are a high data acquisition rate, independence from light conditions, and direct production of three-dimensional coordinates. However, compared with multispectral images, its disadvantage is the shortage of multispectral information, which remains a challenge for ground feature classification with massive point cloud data. Consequently, by combining the advantages of both LiDAR and multispectral images, point cloud data with three-dimensional coordinates and multispectral information can provide an integrated solution for point cloud classification. This research therefore acquires visible light and near infrared images via close range photogrammetry and matches the images automatically through a free online service to generate multispectral point clouds. A three-dimensional affine coordinate transformation is then used to compare the data increment. Finally, given thresholds on height and color information are used for classification.

  12. Physio-climatic classification of South Africa's woodland biome

    CSIR Research Space (South Africa)

    Fairbanks, DHK

    2000-07-01

    Full Text Available monthly temperature, total plant-available water balance of soil, elevation, landscape topographic position, and landscape soil fertility were used as input classification variables. The map data were submitted to a factor analysis and varimax axis...

  13. 32 CFR 2700.22 - Classification guides.

    Science.gov (United States)

    2010-07-01

    ... SECURITY INFORMATION REGULATIONS Derivative Classification § 2700.22 Classification guides. OMSN shall... direct derivative classification, shall identify the information to be protected in specific and uniform...

  14. Experiments on Supervised Learning Algorithms for Text Categorization

    Science.gov (United States)

    Namburu, Setu Madhavi; Tu, Haiying; Luo, Jianhui; Pattipati, Krishna R.

    2005-01-01

    Modern information society is facing the challenge of handling massive volumes of online documents, news, intelligence reports, and so on. How to use the information accurately and in a timely manner becomes a major concern in many areas. While the general information may also include images and voice, we focus on the categorization of text data in this paper. We provide a brief overview of the information processing flow for text categorization, and discuss two supervised learning algorithms, viz., support vector machines (SVM) and partial least squares (PLS), which have been successfully applied in other domains, e.g., fault diagnosis [9]. While SVM has been well explored for binary classification and was reported as an efficient algorithm for text categorization, PLS has not yet been applied to text categorization. Our experiments are conducted on three data sets: the Reuters-21578 dataset about corporate mergers and data acquisitions (ACQ), WebKB and 20-Newsgroups. Results show that the performance of PLS is comparable to SVM in text categorization. A major drawback of SVM for multi-class categorization is that it requires a voting scheme based on the results of pair-wise classification. PLS does not have this drawback and could be a better candidate for multi-class text categorization.
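
    The pair-wise voting scheme mentioned in this abstract can be sketched as follows: one binary classifier is consulted per pair of classes, and the class with the most votes wins. To keep the example dependency-free, a nearest-centroid rule on one-dimensional features stands in for the trained binary SVMs. The centroids and class names are illustrative assumptions.

```python
from collections import Counter
from itertools import combinations


def one_vs_one_predict(x, classes, pairwise_decide):
    """Predict a class by majority vote over all class pairs.

    pairwise_decide(x, a, b) is a binary classifier returning a or b;
    in an SVM setting it would be the SVM trained on classes a and b.
    """
    votes = Counter(
        pairwise_decide(x, a, b) for a, b in combinations(classes, 2)
    )
    return votes.most_common(1)[0][0]


# Toy stand-in for trained binary classifiers (assumption: nearest
# centroid on a 1-D feature instead of real SVMs).
centroids = {"acq": 0.0, "course": 5.0, "sci.space": 10.0}


def nearest_centroid_pair(x, a, b):
    """Choose whichever of the two classes has the closer centroid."""
    return a if abs(x - centroids[a]) <= abs(x - centroids[b]) else b


print(one_vs_one_predict(4.2, list(centroids), nearest_centroid_pair))
```

    With k classes the scheme needs k(k-1)/2 binary classifiers, which is the overhead the abstract refers to.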

  15. On the Feature Selection and Classification Based on Information Gain for Document Sentiment Analysis

    Directory of Open Access Journals (Sweden)

    Asriyanti Indah Pratiwi

    2018-01-01

    Full Text Available Sentiment analysis of movie reviews is a need of today's lifestyle. Unfortunately, an enormous number of features makes sentiment analysis slow and less sensitive. Finding the optimum feature selection and classification is still a challenge. In order to handle an enormous number of features and provide better sentiment classification, an information-based feature selection and classification are proposed. The proposed method removes more than 90% of unnecessary features, while the proposed classification scheme achieves 96% accuracy in sentiment classification. From the experimental results, it can be concluded that the combination of the proposed feature selection and classification achieves the best performance so far.
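
    As a rough illustration of feature selection by information gain, the sketch below scores each term by how much knowing its presence reduces the entropy of the class labels, then keeps the top-scoring terms. This is the textbook definition, assumed rather than confirmed to match the paper's variant. The documents and labels are toy data.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in bits) of a list of class labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    return -sum(
        (c / total) * math.log2(c / total) for c in Counter(labels).values()
    )


def information_gain(docs, labels, term):
    """Entropy reduction from knowing whether term occurs in a document.

    docs is a list of token sets, aligned with labels.
    """
    with_term = [y for d, y in zip(docs, labels) if term in d]
    without = [y for d, y in zip(docs, labels) if term not in d]
    n = len(labels)
    conditional = (len(with_term) / n) * entropy(with_term) + (
        len(without) / n
    ) * entropy(without)
    return entropy(labels) - conditional


def select_features(docs, labels, keep):
    """Return the keep terms with the highest information gain."""
    vocab = sorted(set().union(*docs))
    ranked = sorted(
        vocab, key=lambda t: information_gain(docs, labels, t), reverse=True
    )
    return ranked[:keep]


# Toy reviews (assumption): each document is a set of tokens.
docs = [{"great", "movie"}, {"great", "plot"}, {"boring", "movie"}, {"boring", "plot"}]
labels = ["pos", "pos", "neg", "neg"]
print(select_features(docs, labels, keep=2))
```

    Terms such as "movie" that occur equally often in both classes get a gain of zero and are the first to be discarded.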

  16. IAEA Classification of Uranium Deposits

    International Nuclear Information System (INIS)

    Bruneton, Patrice

    2014-01-01

    Classifications of uranium deposits follow two general approaches, focusing on: • descriptive features such as the geotectonic position, the host rock type, the orebody morphology, etc. («geologic classification»); • or on genetic aspects («genetic classification»)

  17. Classification of Osteogenesis Imperfecta revisited

    NARCIS (Netherlands)

    van Dijk, F. S.; Pals, G.; van Rijn, R. R.; Nikkels, P. G. J.; Cobben, J. M.

    2010-01-01

    In 1979 Sillence proposed a classification of Osteogenesis Imperfecta (OI) in OI types I, II, III and IV. In 2004 and 2007 this classification was expanded with OI types V-VIII because of distinct clinical features and/or different causative gene mutations. We propose a revised classification of OI

  18. The future of general classification

    DEFF Research Database (Denmark)

    Mai, Jens Erik

    2013-01-01

    Discusses problems related to accessing multiple collections using a single retrieval language. Surveys the concepts of interoperability and switching language. Finds that mapping between more indexing languages always will be an approximation. Surveys the issues related to general classification...... and contrasts that to special classifications. Argues for the use of general classifications to provide access to collections nationally and internationally....

  19. Revue bibliographique: les méthodes chimiques d'identification et de classification des champignons

    Directory of Open Access Journals (Sweden)

    Verscheure M.

    2002-01-01

    Full Text Available Chemotaxonomy of fungi: a review. In recent years, advances in molecular methods and analytical techniques have enabled scientists to classify microorganisms based on biochemical characteristics. This classification, called chemotaxonomy, includes molecular methods and chemical methods, which provide additional data and lead to better identification and/or classification.

  20. Machine printed text and handwriting identification in noisy document images.

    Science.gov (United States)

    Zheng, Yefeng; Li, Huiping; Doermann, David

    2004-03-01

    In this paper, we address the problem of identifying text in noisy document images. We focus especially on segmenting and distinguishing between handwriting and machine printed text because: 1) handwriting in a document often indicates corrections, additions, or other supplemental information that should be treated differently from the main content, and 2) the segmentation and recognition techniques required for machine printed and handwritten text are significantly different. A novel aspect of our approach is that we treat noise as a separate class and model noise based on selected features. Trained Fisher classifiers are used to distinguish machine printed text and handwriting from noise, and we further exploit context to refine the classification. A Markov Random Field (MRF) based approach is used to model the geometrical structure of the printed text, handwriting, and noise to rectify misclassifications. Experimental results show that our approach is robust and can significantly improve page segmentation in noisy document collections.

  1. MULTI-TEMPORAL CLASSIFICATION AND CHANGE DETECTION USING UAV IMAGES

    Directory of Open Access Journals (Sweden)

    S. Makuti

    2018-05-01

    Full Text Available In this paper, different methodologies for the classification and change detection of UAV image blocks are explored. The UAV is not only the cheapest platform for image acquisition but also the easiest platform to operate in repeated data collections over a changing area such as a building construction site. Two change detection techniques have been evaluated in this study: the pre-classification and the post-classification algorithms. These methods are based on three main steps: feature extraction, classification and change detection. A set of state-of-the-art features has been used in the tests: colour features (HSV), textural features (GLCM) and 3D geometric features. For classification purposes a Conditional Random Field (CRF) has been used: the unary potential was determined using the Random Forest algorithm, while the pairwise potential was defined by the fully connected CRF. In the performed tests, different feature configurations and settings have been considered to assess the performance of these methods in such a challenging task. Experimental results showed that the post-classification approach outperforms the pre-classification change detection method. This was analysed using the overall accuracy: the post-classification approach reached an accuracy of up to 62.6%, and the pre-classification change detection an accuracy of 46.5%. These results represent a first useful indication for future works and developments.

  2. Defuzzification Strategies for Fuzzy Classifications of Remote Sensing Data

    Directory of Open Access Journals (Sweden)

    Peter Hofmann

    2016-06-01

    Full Text Available The classes in fuzzy classification schemes are defined as fuzzy sets, partitioning the feature space through fuzzy rules, defined by fuzzy membership functions. Applying fuzzy classification schemes in remote sensing allows each pixel or segment to be an incomplete member of more than one class simultaneously, i.e., one that does not fully meet all of the classification criteria for any one of the classes. This can lead to fuzzy, ambiguous and uncertain class assignation, which is unacceptable for many applications, indicating the need for a reliable defuzzification method. Defuzzification in remote sensing has to date been performed by “crisp-assigning” each fuzzy-classified pixel or segment to the class for which it best fulfills the fuzzy classification rules, regardless of its classification fuzziness, uncertainty or ambiguity (maximum method). The defuzzification of an uncertain or ambiguous fuzzy classification leads to a more or less reliable crisp classification. In this paper the most common parameters for expressing classification uncertainty, fuzziness and ambiguity are analysed and discussed in terms of their ability to express the reliability of a crisp classification. This is done by means of a typical practical example from Object Based Image Analysis (OBIA).
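
    The maximum method described in this abstract can be sketched in a few lines: each segment is assigned to the class with the highest membership degree. The ambiguity value returned alongside it (one minus the gap between the two highest memberships) is just one simple indicator, chosen here as an assumption; the paper discusses several parameters, which are not reproduced. Class names and membership values are made up.

```python
def defuzzify_max(memberships):
    """Maximum method: return (best_class, best_value, ambiguity).

    memberships maps class name -> fuzzy membership degree in [0, 1].
    ambiguity = 1 - (best - second best); values near 1 mean the two top
    classes are almost tied, so the crisp assignment is less reliable.
    """
    ranked = sorted(memberships.items(), key=lambda kv: kv[1], reverse=True)
    best_class, best = ranked[0]
    second = ranked[1][1] if len(ranked) > 1 else 0.0
    return best_class, best, 1.0 - (best - second)


# Hypothetical membership degrees for two image segments.
clear_case = {"forest": 0.9, "meadow": 0.2, "water": 0.05}
tied_case = {"forest": 0.8, "meadow": 0.75, "water": 0.1}

for segment in (clear_case, tied_case):
    print(defuzzify_max(segment))
```

    Both segments end up as "forest", but the second assignment is far more ambiguous, which is the reliability information that is lost when only the crisp class is kept.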

  3. Wagner classification and culture analysis of diabetic foot infection

    Directory of Open Access Journals (Sweden)

    Fatma Bozkurt

    2011-03-01

    Full Text Available The aim of this study was to determine the concordance ratio between microorganisms isolated from deep tissue culture and those from superficial culture in patients with diabetic foot according to Wagner’s wound classification method. Materials and methods: A total of 63 patients with diabetic foot infection, who were admitted to Dicle University Hospital between October 2006 and November 2007, were included in the study. Wagner’s classification method was used for wound classification. For microbiologic studies, superficial and deep tissue specimens were obtained from each patient and were rapidly sent to the laboratory for aerobic and anaerobic cultures. Microbiologic data were analyzed and interpreted using the sensitivity and specificity formulas. Results: Thirty-eight (60%) of the patients were Wagner classification ≤2, while 25 (40%) patients were Wagner classification ≥3. According to our culture results, 66 (69%) Gram-positive and 30 (31%) Gram-negative microorganisms grew in patients with Wagner classification ≤2, while in Wagner classification ≥3, 25 (35%) Gram-positive and 46 (65%) Gram-negative microorganisms grew. Microorganisms grew in 89% of superficial cultures and 64% of the deep tissue cultures in patients with Wagner classification ≤2, while microorganisms grew in 64% of Wagner classification ≥3. Conclusion: In ulcers of diabetic foot infections, initial treatment should be started according to the result of sterile superficial culture, but a deep tissue culture should be taken if there is no response to initial treatment.
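
    The sensitivity and specificity formulas referred to in this abstract are the standard ones. The sketch below assumes the deep tissue culture serves as the reference and the superficial culture as the test being evaluated. The counts are hypothetical and are not taken from the study.

```python
def sensitivity_specificity(tp, fp, fn, tn):
    """Return (sensitivity, specificity) from a 2x2 table.

    tp: test positive, reference positive
    fp: test positive, reference negative
    fn: test negative, reference positive
    tn: test negative, reference negative
    """
    sensitivity = tp / (tp + fn) if (tp + fn) else float("nan")
    specificity = tn / (tn + fp) if (tn + fp) else float("nan")
    return sensitivity, specificity


# Hypothetical counts: superficial culture (test) against deep tissue
# culture (reference) for one organism.
sens, spec = sensitivity_specificity(tp=40, fp=5, fn=10, tn=45)
print(f"sensitivity={sens:.2f}, specificity={spec:.2f}")
```

    Sensitivity is the share of reference-positive cases that the test also finds, and specificity is the share of reference-negative cases that the test correctly leaves out.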

  4. Specific classification of financial analysis of enterprise activity

    Directory of Open Access Journals (Sweden)

    Synkevych Nadiia I.

    2014-01-01

    Full Text Available Although modern scientific literature offers a wide variety of classifications of types of financial analysis of enterprise activity, which differ in their approach to classification and in the number and content of their classification features, no comprehensive comparison and analysis of the existing classifications has been done. This explains the urgency of this study. The article studies the classifications of types of financial analysis proposed by scientists and presents its own approach to this problem. Based on the results of the analysis, the article refines and builds a specific classification of financial analysis of enterprise activity and offers classification by the following features: objects, subjects, goals of study, automation level, time period of the analytical base, scope of study, organisation system, classification features of the subject, spatial belonging, sufficiency, information sources, periodicity, criterial base, method of data selection for analysis and time direction. All types of financial analysis differ significantly in their inherent properties and parameters depending on the goals of financial analysis. The developed specific classification enables subjects of financial analysis of enterprise activity to identify a specific type of financial analysis that correctly meets the set goals.

  5. A programmed text in statistics

    CERN Document Server

    Hine, J

    1975-01-01

    Exercises for Section 2: Physical sciences and engineering; Biological sciences; Social sciences. Solutions to Exercises, Section 1: Physical sciences and engineering; Biological sciences; Social sciences. Solutions to Exercises, Section 2: Physical sciences and engineering; Biological sciences; Social sciences. Tables: χ² tests involving variances; χ² one-tailed tests; χ² two-tailed tests; F-distribution. Preface: This project started some years ago when the Nuffield Foundation kindly gave a grant for writing a programmed text to use with service courses in statistics. The work was carried out by Mrs. Joan Hine and Professor G. B. Wetherill at Bath University, together with some other help from time to time by colleagues at Bath University and elsewhere. Testing was done at various colleges and universities, and some helpful comments were received, but we particularly mention King Edwards School, Bath, who provided some sixth formers as 'guinea pigs' for the fir...

  6. [Headache: classification and diagnosis].

    Science.gov (United States)

    Carbaat, P A T; Couturier, E G M

    2016-11-01

    There are many types of headache and, moreover, many people have different types of headache at the same time. Adequate treatment is possible only on the basis of the correct diagnosis. Technically and in terms of content the current diagnostics process for headache is based on the 'International Classification of Headache Disorders' (ICHD-3-beta) that was produced under the auspices of the International Headache Society. This classification is based on a distinction between primary and secondary headaches. The most common primary headache types are the tension type headache, migraine and the cluster headache. Application of uniform diagnostic concepts is essential to come to the most appropriate treatment of the various types of headache.

  7. Classification of hand eczema

    DEFF Research Database (Denmark)

    Agner, T; Aalto-Korte, K; Andersen, K E

    2015-01-01

    BACKGROUND: Classification of hand eczema (HE) is mandatory in epidemiological and clinical studies, and also important in clinical work. OBJECTIVES: The aim was to test a recently proposed classification system of HE in clinical practice in a prospective multicentre study. METHODS: Patients were...... recruited from nine different tertiary referral centres. All patients underwent examination by specialists in dermatology and were checked using relevant allergy testing. Patients were classified into one of the six diagnostic subgroups of HE: allergic contact dermatitis, irritant contact dermatitis, atopic...... system investigated in the present study was useful, being able to give an appropriate main diagnosis for 89% of HE patients, and for another 7% when using two main diagnoses. The fact that more than half of the patients had one or more additional diagnoses illustrates that HE is a multifactorial disease....

  8. Sound classification of dwellings

    DEFF Research Database (Denmark)

    Rasmussen, Birgit

    2012-01-01

    National schemes for sound classification of dwellings exist in more than ten countries in Europe, typically published as national standards. The schemes define quality classes reflecting different levels of acoustical comfort. Main criteria concern airborne and impact sound insulation between...... dwellings, facade sound insulation and installation noise. The schemes have been developed, implemented and revised gradually since the early 1990s. However, due to lack of coordination between countries, there are significant discrepancies, and new standards and revisions continue to increase the diversity...... is needed, and a European COST Action TU0901 "Integrating and Harmonizing Sound Insulation Aspects in Sustainable Urban Housing Constructions", has been established and runs 2009-2013, one of the main objectives being to prepare a proposal for a European sound classification scheme with a number of quality...

  9. Knowledge Dictionary for Information Extraction on the Arabic Text Data

    Directory of Open Access Journals (Sweden)

    Wahyu Jauharis Saputra

    2013-04-01

    Full Text Available Information extraction is an early stage of textual data analysis. It is required to obtain information from textual data that can be used for analysis processes such as classification and categorization. Textual data is strongly influenced by its language. Arabic is gaining significant attention in many studies because the Arabic language is very different from others and, in contrast to other languages, tools and research on the Arabic language are still lacking. The information extracted using the knowledge dictionary is a concept of expression. A knowledge dictionary is usually constructed manually by an expert, which takes a long time and is specific to a single problem. This paper proposes a method for automatically building a knowledge dictionary. The knowledge dictionary is formed by classifying sentences having the same concept, assuming that they will have a high similarity value. The extracted concepts can be used as features for subsequent computational processes such as classification or categorization. The dataset used in this paper was an Arabic text dataset. The extraction result was tested using a decision tree classification engine; the highest precision value obtained was 71.0%, while the highest recall value was 75.0%.

  10. Granular loess classification based

    International Nuclear Information System (INIS)

    Browzin, B.S.

    1985-01-01

    This paper discusses how loess might be identified by two index properties: the granulometric composition and the dry unit weight. These two indices are necessary but not always sufficient for identification of loess. On the basis of analyses of samples from three continents, it was concluded that the 0.01-0.5-mm fraction deserves the name loessial fraction. Based on the loessial fraction concept, a granulometric classification of loess is proposed. A triangular chart is used to classify loess.

  11. CLASSIFICATION OF CRIMINAL GROUPS

    OpenAIRE

    Natalia Romanova

    2013-01-01

    New types of criminal groups are emerging in modern society. These types have their special criminal subculture. The research objective is to develop new parameters of classification of modern criminal groups, create a new typology of criminal groups and identify some features of their subculture. Research methodology is based on the system approach that includes using the method of analysis of documentary sources (materials of a criminal case), method of conversations with the members of the...

  12. Decimal Classification Editions

    OpenAIRE

    Zenovia Niculescu

    2009-01-01

    The study approaches the evolution of Dewey Decimal Classification editions from the perspective of updating the terminology and of reallocating and expanding the main and auxiliary structure of the Dewey indexing language. The comparative analysis of DDC editions emphasizes the efficiency of the Dewey scheme from the point of view of improving the informational offer, through basic index terms, revised and developed, as well as valuing the auxiliary notations.

  13. Explicit Foreground and Background Modeling in The Classification of Text Blocks in Scene Images

    NARCIS (Netherlands)

    Sriman, Bowornrat; Schomaker, Lambertus

    2015-01-01

    Achieving high accuracy for classifying foreground and background is an interesting challenge in the field of scene image analysis because of the wide range of illumination, complex background, and scale changes. Classifying foreground and background using a bag-of-features model gives a good result.

  14. Making diffusion work for you: Classification sans text, finding culprits and filling missing values

    OpenAIRE

    Sundareisan, Shashidhar

    2014-01-01

    Can we find people infected with the flu virus even though they did not visit a doctor? Can the temporal features of a trending hashtag or a keyword indicate which topic it belongs to without any textual information? Given a history of interactions between blogs and news websites, can we predict blog posts/news websites that are not in the sample but talk about "the state of the economy" in 2008? These questions have two things in common: a network (social networks or human co...

  15. Classifications of track structures

    International Nuclear Information System (INIS)

    Paretzke, H.G.

    1984-01-01

    When ionizing particles interact with matter they produce random topological structures of primary activations which represent the initial boundary conditions for all subsequent physical, chemical and/or biological reactions. There are two important aspects of research on such track structures, namely their experimental or theoretical determination on the one hand and, on the other, the quantitative classification of these complex structures, which is a basic prerequisite for the understanding of mechanisms of radiation actions. This paper deals only with the latter topic, i.e. the problems encountered in and possible approaches to quantitative ordering and grouping of these multidimensional objects by their degrees of similarity with respect to their efficiency in producing certain final radiation effects, i.e. to their "radiation quality." Various attempts at taxonometric classification with respect to radiation efficiency have been made in basic and applied radiation research, including macro- and microdosimetric concepts as well as track entities and stopping power based theories. In this paper no review of those well-known approaches is given, but rather an outline and discussion of alternative methods new to this field of radiation research which have some very promising features and which could possibly solve at least some major classification problems.

  16. Neuromuscular disease classification system

    Science.gov (United States)

    Sáez, Aurora; Acha, Begoña; Montero-Sánchez, Adoración; Rivas, Eloy; Escudero, Luis M.; Serrano, Carmen

    2013-06-01

    Diagnosis of neuromuscular diseases is based on subjective visual assessment of biopsies from patients by the pathologist specialist. A system for objective analysis and classification of muscular dystrophies and neurogenic atrophies through muscle biopsy images of fluorescence microscopy is presented. The procedure starts with an accurate segmentation of the muscle fibers using mathematical morphology and a watershed transform. A feature extraction step is carried out in two parts: 24 features that pathologists take into account to diagnose the diseases and 58 structural features that the human eye cannot see, based on the assumption that the biopsy is considered as a graph, where the nodes are represented by each fiber, and two nodes are connected if two fibers are adjacent. A feature selection using sequential forward selection and sequential backward selection methods, a classification using a Fuzzy ARTMAP neural network, and a study of grading the severity are performed on these two sets of features. A database consisting of 91 images was used: 71 images for the training step and 20 as the test. A classification error of 0% was obtained. It is concluded that the addition of features undetectable by the human visual inspection improves the categorization of atrophic patterns.

  17. An automated cirrus classification

    Science.gov (United States)

    Gryspeerdt, Edward; Quaas, Johannes; Goren, Tom; Klocke, Daniel; Brueck, Matthias

    2018-05-01

    Cirrus clouds play an important role in determining the radiation budget of the earth, but many of their properties remain uncertain, particularly their response to aerosol variations and to warming. Part of the reason for this uncertainty is the dependence of cirrus cloud properties on the cloud formation mechanism, which itself is strongly dependent on the local meteorological conditions. In this work, a classification system (Identification and Classification of Cirrus or IC-CIR) is introduced to identify cirrus clouds by the cloud formation mechanism. Using reanalysis and satellite data, cirrus clouds are separated into four main types: orographic, frontal, convective and synoptic. Through a comparison to convection-permitting model simulations and back-trajectory-based analysis, it is shown that these observation-based regimes can provide extra information on the cloud-scale updraughts and the frequency of occurrence of liquid-origin ice, with the convective regime having higher updraughts and a greater occurrence of liquid-origin ice compared to the synoptic regimes. Despite having different cloud formation mechanisms, the radiative properties of the regimes are not distinct, indicating that retrieved cloud properties alone are insufficient to completely describe them. This classification is designed to be easily implemented in GCMs, helping improve future model-observation comparisons and leading to improved parametrisations of cirrus cloud processes.

  18. Boosting bonsai trees for handwritten/printed text discrimination

    Science.gov (United States)

    Ricquebourg, Yann; Raymond, Christian; Poirriez, Baptiste; Lemaitre, Aurélie; Coüasnon, Bertrand

    2013-12-01

    Boosting over decision-stumps proved its efficiency in Natural Language Processing essentially with symbolic features, and its good properties (fast, few and not critical parameters, not sensitive to over-fitting) could be of great interest in the numeric world of pixel images. In this article we investigated the use of boosting over small decision trees, in image classification processing, for the discrimination of handwritten/printed text. Then, we conducted experiments to compare it to usual SVM-based classification revealing convincing results with very close performance, but with faster predictions and behaving far less as a black-box. Those promising results tend to make use of this classifier in more complex recognition tasks like multiclass problems.
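
    As a rough illustration of boosting over decision stumps, the simplest case of the small trees this abstract refers to, the following is a minimal AdaBoost sketch in plain Python. It is not the authors' implementation: the two numeric features and the toy samples are hypothetical stand-ins for pixel-derived descriptors, and labels +1 and -1 stand for the two classes (e.g. handwritten and printed).

```python
from math import exp, log


def stump_predict(row, feature, threshold, polarity):
    """A decision stump: one threshold test on one feature."""
    return polarity if row[feature] <= threshold else -polarity


def best_stump(X, y, weights):
    """Exhaustively pick the stump with the lowest weighted error."""
    best = None
    for feature in range(len(X[0])):
        for threshold in sorted({row[feature] for row in X}):
            for polarity in (1, -1):
                err = sum(
                    w
                    for row, label, w in zip(X, y, weights)
                    if stump_predict(row, feature, threshold, polarity) != label
                )
                if best is None or err < best[0]:
                    best = (err, feature, threshold, polarity)
    return best


def adaboost(X, y, rounds=5):
    """Train a weighted vote of stumps; labels must be +1 or -1."""
    n = len(X)
    weights = [1.0 / n] * n
    model = []
    for _ in range(rounds):
        err, feature, threshold, polarity = best_stump(X, y, weights)
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0) and division by 0
        alpha = 0.5 * log((1 - err) / err)
        model.append((alpha, feature, threshold, polarity))
        # Increase the weight of misclassified samples, then renormalise.
        weights = [
            w * exp(-alpha * label * stump_predict(row, feature, threshold, polarity))
            for row, label, w in zip(X, y, weights)
        ]
        total = sum(weights)
        weights = [w / total for w in weights]
    return model


def predict(model, row):
    score = sum(a * stump_predict(row, f, t, p) for a, f, t, p in model)
    return 1 if score >= 0 else -1


# Hypothetical toy data: two made-up features per text block.
X = [[0.1, 0.9], [0.2, 0.8], [0.3, 0.7], [0.7, 0.2], [0.8, 0.3], [0.9, 0.1]]
y = [1, 1, 1, -1, -1, -1]
model = adaboost(X, y)
print([predict(model, row) for row in X])
```

    Each stump is a single readable threshold test, which is one way to read the abstract's remark that the boosted classifier behaves less like a black box than an SVM.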

  19. Maximum mutual information regularized classification

    KAUST Repository

    Wang, Jim Jing-Yan

    2014-09-07

    In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned classifier, the uncertainty of the true class label of a data sample should be reduced by knowing its classification response as much as possible. The reduced uncertainty is measured by the mutual information between the classification response and the true class label. To this end, when learning a linear classifier, we propose to maximize the mutual information between classification responses and true class labels of training samples, besides minimizing the classification error and reducing the classifier complexity. An objective function is constructed by modeling mutual information with entropy estimation, and it is optimized by a gradient descent method in an iterative algorithm. Experiments on two real-world pattern classification problems show the significant improvements achieved by maximum mutual information regularization.
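
    The quantity this abstract regularizes, the mutual information between classification responses and true class labels, can be illustrated with a simple plug-in estimate for discrete responses. This is only a sketch under that assumption: the paper models mutual information through entropy estimation for a linear classifier and optimizes it by gradient descent, which is not reproduced here, and the function name and sample data are hypothetical.

```python
from collections import Counter
from math import log2


def mutual_information(responses, labels):
    """Plug-in estimate of I(response; label) in bits from paired samples."""
    n = len(labels)
    joint = Counter(zip(responses, labels))
    resp_counts = Counter(responses)
    label_counts = Counter(labels)
    mi = 0.0
    for (r, l), count in joint.items():
        p_joint = count / n
        # p(r, l) / (p(r) * p(l)) written with raw counts.
        ratio = count * n / (resp_counts[r] * label_counts[l])
        mi += p_joint * log2(ratio)
    return mi


labels = [0, 0, 1, 1]
informative = [0, 0, 1, 1]    # response determines the label
uninformative = [0, 1, 0, 1]  # response is independent of the label
print(mutual_information(informative, labels))
print(mutual_information(uninformative, labels))
```

    A response that fully determines the label removes all uncertainty about it, while an independent response removes none, which is the sense in which the abstract uses mutual information as a measure of reduced uncertainty.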

  20. Maximum mutual information regularized classification

    KAUST Repository

    Wang, Jim Jing-Yan; Wang, Yi; Zhao, Shiguang; Gao, Xin

    2014-01-01

    In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned classifier, the uncertainty of the true class label of a data sample should be reduced by knowing its classification response as much as possible. The reduced uncertainty is measured by the mutual information between the classification response and the true class label. To this end, when learning a linear classifier, we propose to maximize the mutual information between classification responses and true class labels of training samples, besides minimizing the classification error and reducing the classifier complexity. An objective function is constructed by modeling mutual information with entropy estimation, and it is optimized by a gradient descent method in an iterative algorithm. Experiments on two real-world pattern classification problems show the significant improvements achieved by maximum mutual information regularization.