cell classification problem: Topics by WorldWideScience.org

Sample records for cell classification problem

Classification of cancerous cells based on the one-class problem approach

Science.gov (United States)

Murshed, Nabeel A.; Bortolozzi, Flavio; Sabourin, Robert

1996-03-01

One of the most important factors in reducing the effect of cancerous diseases is the early diagnosis, which requires a good and a robust method. With the advancement of computer technologies and digital image processing, the development of a computer-based system has become feasible. In this paper, we introduce a new approach for the detection of cancerous cells. This approach is based on the one-class problem approach, through which the classification system need only be trained with patterns of cancerous cells. This reduces the burden of the training task by about 50%. Based on this approach, a computer-based classification system is developed, based on the Fuzzy ARTMAP neural networks. Experimental results were performed using a set of 542 patterns taken from a sample of breast cancer. Results of the experiment show 98% correct identification of cancerous cells and 95% correct identification of non-cancerous cells.
Cell dynamic morphology classification using deep convolutional neural networks.

Science.gov (United States)

Li, Heng; Pang, Fengqian; Shi, Yonggang; Liu, Zhiwen

2018-05-15

Cell morphology is often used as a proxy measurement of cell status to understand cell physiology. Hence, interpretation of cell dynamic morphology is a meaningful task in biomedical research. Inspired by the recent success of deep learning, we here explore the application of convolutional neural networks (CNNs) to cell dynamic morphology classification. An innovative strategy for the implementation of CNNs is introduced in this study. Mouse lymphocytes were collected to observe the dynamic morphology, and two datasets were thus set up to investigate the performances of CNNs. Considering the installation of deep learning, the classification problem was simplified from video data to image data, and was then solved by CNNs in a self-taught manner with the generated image data. CNNs were separately performed in three installation scenarios and compared with existing methods. Experimental results demonstrated the potential of CNNs in cell dynamic morphology classification, and validated the effectiveness of the proposed strategy. CNNs were successfully applied to the classification problem, and outperformed the existing methods in the classification accuracy. For the installation of CNNs, transfer learning was proved to be a promising scheme. © 2018 International Society for Advancement of Cytometry. © 2018 International Society for Advancement of Cytometry.
Cancer classification in the genomic era: five contemporary problems.

Science.gov (United States)

Song, Qingxuan; Merajver, Sofia D; Li, Jun Z

2015-10-19

Classification is an everyday instinct as well as a full-fledged scientific discipline. Throughout the history of medicine, disease classification is central to how we develop knowledge, make diagnosis, and assign treatment. Here, we discuss the classification of cancer and the process of categorizing cancer subtypes based on their observed clinical and biological features. Traditionally, cancer nomenclature is primarily based on organ location, e.g., "lung cancer" designates a tumor originating in lung structures. Within each organ-specific major type, finer subgroups can be defined based on patient age, cell type, histological grades, and sometimes molecular markers, e.g., hormonal receptor status in breast cancer or microsatellite instability in colorectal cancer. In the past 15+ years, high-throughput technologies have generated rich new data regarding somatic variations in DNA, RNA, protein, or epigenomic features for many cancers. These data, collected for increasingly large tumor cohorts, have provided not only new insights into the biological diversity of human cancers but also exciting opportunities to discover previously unrecognized cancer subtypes. Meanwhile, the unprecedented volume and complexity of these data pose significant challenges for biostatisticians, cancer biologists, and clinicians alike. Here, we review five related issues that represent contemporary problems in cancer taxonomy and interpretation. (1) How many cancer subtypes are there? (2) How can we evaluate the robustness of a new classification system? (3) How are classification systems affected by intratumor heterogeneity and tumor evolution? (4) How should we interpret cancer subtypes? (5) Can multiple classification systems co-exist? While related issues have existed for a long time, we will focus on those aspects that have been magnified by the recent influx of complex multi-omics data. Exploration of these problems is essential for data-driven refinement of cancer classification
A New Method for Solving Supervised Data Classification Problems

Directory of Open Access Journals (Sweden)

Parvaneh Shabanzadeh

2014-01-01

Full Text Available Supervised data classification is one of the techniques used to extract nontrivial information from data. Classification is a widely used technique in various fields, including data mining, industry, medicine, science, and law. This paper considers a new algorithm for supervised data classification problems associated with the cluster analysis. The mathematical formulations for this algorithm are based on nonsmooth, nonconvex optimization. A new algorithm for solving this optimization problem is utilized. The new algorithm uses a derivative-free technique, with robustness and efficiency. To improve classification performance and efficiency in generating classification model, a new feature selection algorithm based on techniques of convex programming is suggested. Proposed methods are tested on real-world datasets. Results of numerical experiments have been presented which demonstrate the effectiveness of the proposed algorithms.
Cost-sensitive classification problem (Poster)

NARCIS (Netherlands)

Calders, T.G.K.; Pechenizkiy, M.

2012-01-01

In practical situations almost all classification problems are cost-sensitive or utility based one way or another. This exercise mimics a real situation in which students first have to translate a description into a datamining workflow, learn a prediction model, apply it to new data, and set up a
Simulation Modeling by Classification of Problems: A Case of Cellular Manufacturing

International Nuclear Information System (INIS)

Afiqah, K N; Mahayuddin, Z R

2016-01-01

Cellular manufacturing provides good solution approach to manufacturing area by applying Group Technology concept. The evolution of cellular manufacturing can enhance performance of the cell and to increase the quality of the product manufactured but it triggers other problem. Generally, this paper highlights factors and problems which emerge commonly in cellular manufacturing. The aim of the research is to develop a thorough understanding of common problems in cellular manufacturing. A part from that, in order to find a solution to the problems exist using simulation technique, this classification framework is very useful to be adapted during model building. Biology evolution tool was used in the research in order to classify the problems emerge. The result reveals 22 problems and 25 factors using cladistic technique. In this research, the expected result is the cladogram established based on the problems in cellular manufacturing gathered. (paper)
Morphological classification of plant cell deaths

DEFF Research Database (Denmark)

van Doorn, W.G.; Beers, E.P.; Dangl, J.L.

2011-01-01

, which can express features of both necrosis and vacuolar cell death, PCD in starchy cereal endosperm and during self-incompatibility. The present classification is not static, but will be subject to further revision, especially when specific biochemical pathways are better defined....... the classification of PCD in plants. Here we suggest a classification based on morphological criteria. According to this classification, the use of the term 'apoptosis' is not justified in plants, but at least two classes of PCD can be distinguished: vacuolar cell death and necrosis. During vacuolar cell death...
Various forms of indexing HDMR for modelling multivariate classification problems

Energy Technology Data Exchange (ETDEWEB)

Aksu, Çağrı [Bahçeşehir University, Information Technologies Master Program, Beşiktaş, 34349 İstanbul (Turkey); Tunga, M. Alper [Bahçeşehir University, Software Engineering Department, Beşiktaş, 34349 İstanbul (Turkey)

2014-12-10

The Indexing HDMR method was recently developed for modelling multivariate interpolation problems. The method uses the Plain HDMR philosophy in partitioning the given multivariate data set into less variate data sets and then constructing an analytical structure through these partitioned data sets to represent the given multidimensional problem. Indexing HDMR makes HDMR be applicable to classification problems having real world data. Mostly, we do not know all possible class values in the domain of the given problem, that is, we have a non-orthogonal data structure. However, Plain HDMR needs an orthogonal data structure in the given problem to be modelled. In this sense, the main idea of this work is to offer various forms of Indexing HDMR to successfully model these real life classification problems. To test these different forms, several well-known multivariate classification problems given in UCI Machine Learning Repository were used and it was observed that the accuracy results lie between 80% and 95% which are very satisfactory.
Predicting Assignment Submissions in a Multiclass Classification Problem

Directory of Open Access Journals (Sweden)

Bogdan Drăgulescu

2015-08-01

Full Text Available Predicting student failure is an important task that can empower educators to counteract the factors that affect student performance. In this paper, a part of the bigger problem of predicting student failure is addressed: predicting the students that do not complete their assignment tasks. For solving this problem, real data collected by our university’s educational platform was used. Because the problem consisted of predicting one of three possible classes (multi-class classification, the appropriate algorithms and methods were selected. Several experiments were carried out to find the best approach for this prediction problem and the used data set. An approach of time segmentation is proposed in order to facilitate the prediction from early on. Methods that address the problems of high dimensionality and imbalanced data were also evaluated. The outcome of each approach is shown and compared in order to select the best performing classification algorithm for the problem at hand.
Building and Solving Odd-One-Out Classification Problems: A Systematic Approach

Science.gov (United States)

Ruiz, Philippe E.

2011-01-01

Classification problems ("find the odd-one-out") are frequently used as tests of inductive reasoning to evaluate human or animal intelligence. This paper introduces a systematic method for building the set of all possible classification problems, followed by a simple algorithm for solving the problems of the R-ASCM, a psychometric test derived…
Psychological and social problems in primary care patients - general practitioners' assessment and classification.

Science.gov (United States)

Rosendal, Marianne; Vedsted, Peter; Christensen, Kaj Sparle; Moth, Grete

2013-03-01

To estimate the frequency of psychological and social classification codes employed by general practitioners (GPs) and to explore the extent to which GPs ascribed health problems to biomedical, psychological, or social factors. A cross-sectional survey based on questionnaire data from GPs. Setting. Danish primary care. 387 GPs and their face-to-face contacts with 5543 patients. GPs registered consecutive patients on registration forms including reason for encounter, diagnostic classification of main problem, and a GP assessment of biomedical, psychological, and social factors' influence on the contact. The GP-stated reasons for encounter largely overlapped with their classification of the managed problem. Using the International Classification of Primary Care (ICPC-2-R), GPs classified 600 (11%) patients with psychological problems and 30 (0.5%) with social problems. Both codes for problems/complaints and specific disorders were used as the GP's diagnostic classification of the main problem. Two problems (depression and acute stress reaction/adjustment disorder) accounted for 51% of all psychological classifications made. GPs generally emphasized biomedical aspects of the contacts. Psychological aspects were given greater importance in follow-up consultations than in first-episode consultations, whereas social factors were rarely seen as essential to the consultation. Psychological problems are frequently seen and managed in primary care and most are classified within a few diagnostic categories. Social matters are rarely considered or classified.
THE PROBLEMS OF FIXED ASSETS CLASSIFICATION FOR ACCOUNTING

Directory of Open Access Journals (Sweden)

Sophiia Kafka

2016-06-01

Full Text Available This article provides a critical analysis of research in accounting of fixed assets; the basic issues of fixed assets accounting that have been developed by the Ukrainian scientists during 1999-2016 have been determined. It is established that the problems of non-current assets taxation and their classification are the most noteworthy. In the dissertations the issues of fixed assets classification are of exclusively particular branch nature, so its improvement is important. The purpose of the article is developing science-based classification of fixed assets for accounting purposes since their composition is quite diverse. The classification of fixed assets for accounting purposes have been summarized and developed in Figure 1 according to the results of the research. The accomplished analysis of existing approaches to classification of fixed assets has made it possible to specify its basic types and justify the classification criteria of fixed assets for the main objects of fixed assets. Key words: non-current assets, fixed assets, accounting, valuation, classification of the fixed assets. JEL:G M41
Solving and interpreting binary classification problems in marketing with SVMs

NARCIS (Netherlands)

J.C. Bioch (Cor); P.J.F. Groenen (Patrick); G.I. Nalbantov (Georgi)

2005-01-01

textabstractMarketing problems often involve inary classification of customers into ``buyers'' versus ``non-buyers'' or ``prefers brand A'' versus ``prefers brand B''. These cases require binary classification models such as logistic regression, linear, and quadratic discriminant analysis. A
Morphological classification of plant cell deaths.

Science.gov (United States)

van Doorn, W G; Beers, E P; Dangl, J L; Franklin-Tong, V E; Gallois, P; Hara-Nishimura, I; Jones, A M; Kawai-Yamada, M; Lam, E; Mundy, J; Mur, L A J; Petersen, M; Smertenko, A; Taliansky, M; Van Breusegem, F; Wolpert, T; Woltering, E; Zhivotovsky, B; Bozhkov, P V

2011-08-01

Programmed cell death (PCD) is an integral part of plant development and of responses to abiotic stress or pathogens. Although the morphology of plant PCD is, in some cases, well characterised and molecular mechanisms controlling plant PCD are beginning to emerge, there is still confusion about the classification of PCD in plants. Here we suggest a classification based on morphological criteria. According to this classification, the use of the term 'apoptosis' is not justified in plants, but at least two classes of PCD can be distinguished: vacuolar cell death and necrosis. During vacuolar cell death, the cell contents are removed by a combination of autophagy-like process and release of hydrolases from collapsed lytic vacuoles. Necrosis is characterised by early rupture of the plasma membrane, shrinkage of the protoplast and absence of vacuolar cell death features. Vacuolar cell death is common during tissue and organ formation and elimination, whereas necrosis is typically found under abiotic stress. Some examples of plant PCD cannot be ascribed to either major class and are therefore classified as separate modalities. These are PCD associated with the hypersensitive response to biotrophic pathogens, which can express features of both necrosis and vacuolar cell death, PCD in starchy cereal endosperm and during self-incompatibility. The present classification is not static, but will be subject to further revision, especially when specific biochemical pathways are better defined.
Classification of Ship Routing and Scheduling Problems in Liner Shipping

DEFF Research Database (Denmark)

Kjeldsen, Karina Hjortshøj

2011-01-01

This article provides a classification scheme for ship routing and scheduling problems in liner shipping in line with the current and future operational conditions of the liner shipping industry. Based on the classification, the literature is divided into groups whose main characteristics...
Nonlinear programming for classification problems in machine learning

Science.gov (United States)

Astorino, Annabella; Fuduli, Antonio; Gaudioso, Manlio

2016-10-01

We survey some nonlinear models for classification problems arising in machine learning. In the last years this field has become more and more relevant due to a lot of practical applications, such as text and web classification, object recognition in machine vision, gene expression profile analysis, DNA and protein analysis, medical diagnosis, customer profiling etc. Classification deals with separation of sets by means of appropriate separation surfaces, which is generally obtained by solving a numerical optimization model. While linear separability is the basis of the most popular approach to classification, the Support Vector Machine (SVM), in the recent years using nonlinear separating surfaces has received some attention. The objective of this work is to recall some of such proposals, mainly in terms of the numerical optimization models. In particular we tackle the polyhedral, ellipsoidal, spherical and conical separation approaches and, for some of them, we also consider the semisupervised versions.
Problems of classification in the family Paramyxoviridae.

Science.gov (United States)

Rima, Bert; Collins, Peter; Easton, Andrew; Fouchier, Ron; Kurath, Gael; Lamb, Robert A; Lee, Benhur; Maisner, Andrea; Rota, Paul; Wang, Lin-Fa

2018-05-01

A number of unassigned viruses in the family Paramyxoviridae need to be classified either as a new genus or placed into one of the seven genera currently recognized in this family. Furthermore, numerous new paramyxoviruses continue to be discovered. However, attempts at classification have highlighted the difficulties that arise by applying historic criteria or criteria based on sequence alone to the classification of the viruses in this family. While the recent taxonomic change that elevated the previous subfamily Pneumovirinae into a separate family Pneumoviridae is readily justified on the basis of RNA dependent -RNA polymerase (RdRp or L protein) sequence motifs, using RdRp sequence comparisons for assignment to lower level taxa raises problems that would require an overhaul of the current criteria for assignment into genera in the family Paramyxoviridae. Arbitrary cut off points to delineate genera and species would have to be set if classification was based on the amino acid sequence of the RdRp alone or on pairwise analysis of sequence complementarity (PASC) of all open reading frames (ORFs). While these cut-offs cannot be made consistent with the current classification in this family, resorting to genus-level demarcation criteria with additional input from the biological context may afford a way forward. Such criteria would reflect the increasingly dynamic nature of virus taxonomy even if it would require a complete revision of the current classification.
Lymphoma classification update: B-cell non-Hodgkin lymphomas.

Science.gov (United States)

Jiang, Manli; Bennani, N Nora; Feldman, Andrew L

2017-05-01

Lymphomas are classified based on the normal counterpart, or cell of origin, from which they arise. Because lymphocytes have physiologic immune functions that vary both by lineage and by stage of differentiation, the classification of lymphomas arising from these normal lymphoid populations is complex. Recent genomic data have contributed additional complexity. Areas covered: Lymphoma classification follows the World Health Organization (WHO) system, which reflects international consensus and is based on pathological, genetic, and clinical factors. A 2016 revision to the WHO classification of lymphoid neoplasms recently was reported. The present review focuses on B-cell non-Hodgkin lymphomas, the most common group of lymphomas, and summarizes recent changes most relevant to hematologists and other clinicians who care for lymphoma patients. Expert commentary: Lymphoma classification is a continually evolving field that needs to be responsive to new clinical, pathological, and molecular understanding of lymphoid neoplasia. Among the entities covered in this review, the 2016 revision of the WHO classification particularly impact the subclassification and genetic stratification of diffuse large B-cell lymphoma and high-grade B-cell lymphomas, and reflect evolving criteria and nomenclature for indolent B-cell lymphomas and lymphoproliferative disorders.
The cell method for electrical engineering and multiphysics problems an introduction

CERN Document Server

Alotto, Piergiorgio; Repetto, Maurizio; Rosso, Carlo

2013-01-01

This book presents a numerical scheme for the solution of field problems governed by partial differential equations: the cell method. The technique lends itself naturally to the solution of multiphysics problems with several interacting phenomena. The Cell Method, based on a space-time tessellation, is intimately related to the work of Tonti and to his ideas of classification diagrams or, as they are nowadays called, Tonti diagrams: a graphical representation of the problem's equations made possible by a suitable selection of a space-time framework relating physical variables to each other. The main features of the cell method are presented and links with many other discrete numerical methods (finite integration techniques, finite difference time domain, finite volumes, mimetic finite differences, etc.) are discussed. After outlining the theoretical basis of the method, a set of physical problems which have been solved with the cell method is described. These single and multiphysics problems stem from the aut...
An Efficient Optimization Method for Solving Unsupervised Data Classification Problems

Directory of Open Access Journals (Sweden)

Parvaneh Shabanzadeh

2015-01-01

Full Text Available Unsupervised data classification (or clustering analysis is one of the most useful tools and a descriptive task in data mining that seeks to classify homogeneous groups of objects based on similarity and is used in many medical disciplines and various applications. In general, there is no single algorithm that is suitable for all types of data, conditions, and applications. Each algorithm has its own advantages, limitations, and deficiencies. Hence, research for novel and effective approaches for unsupervised data classification is still active. In this paper a heuristic algorithm, Biogeography-Based Optimization (BBO algorithm, was adapted for data clustering problems by modifying the main operators of BBO algorithm, which is inspired from the natural biogeography distribution of different species. Similar to other population-based algorithms, BBO algorithm starts with an initial population of candidate solutions to an optimization problem and an objective function that is calculated for them. To evaluate the performance of the proposed algorithm assessment was carried on six medical and real life datasets and was compared with eight well known and recent unsupervised data classification algorithms. Numerical results demonstrate that the proposed evolutionary optimization algorithm is efficient for unsupervised data classification.

On the problem of classification of radioactive waste.; K voprosu o klassifikatsii radioaktivnykh otkhodov.

Energy Technology Data Exchange (ETDEWEB)

Bogachev, O M; Ermolin, G A [Naukovo-Tekhnyichnij Tsentr z dezaktivatsyiyi ta kompleksnogo povodzhennya z radyioaktivnimi vyidkhodami, Zhovtyi Vodi (Ukraine)

1994-12-31

The available classification of radioactive waste, classification problems on processing, storage and burial technology have been considered. Complex classification of radioactive waste with regard for the state of aggregation, activity, radiation kind, half-life period, processing technology, storage terms and storehouse types has been suggested.
Automated classification of cell morphology by coherence-controlled holographic microscopy

Science.gov (United States)

Strbkova, Lenka; Zicha, Daniel; Vesely, Pavel; Chmelik, Radim

2017-08-01

In the last few years, classification of cells by machine learning has become frequently used in biology. However, most of the approaches are based on morphometric (MO) features, which are not quantitative in terms of cell mass. This may result in poor classification accuracy. Here, we study the potential contribution of coherence-controlled holographic microscopy enabling quantitative phase imaging for the classification of cell morphologies. We compare our approach with the commonly used method based on MO features. We tested both classification approaches in an experiment with nutritionally deprived cancer tissue cells, while employing several supervised machine learning algorithms. Most of the classifiers provided higher performance when quantitative phase features were employed. Based on the results, it can be concluded that the quantitative phase features played an important role in improving the performance of the classification. The methodology could be valuable help in refining the monitoring of live cells in an automated fashion. We believe that coherence-controlled holographic microscopy, as a tool for quantitative phase imaging, offers all preconditions for the accurate automated analysis of live cell behavior while enabling noninvasive label-free imaging with sufficient contrast and high-spatiotemporal phase sensitivity.
Automated classification of cell morphology by coherence-controlled holographic microscopy.

Science.gov (United States)

Strbkova, Lenka; Zicha, Daniel; Vesely, Pavel; Chmelik, Radim

2017-08-01

In the last few years, classification of cells by machine learning has become frequently used in biology. However, most of the approaches are based on morphometric (MO) features, which are not quantitative in terms of cell mass. This may result in poor classification accuracy. Here, we study the potential contribution of coherence-controlled holographic microscopy enabling quantitative phase imaging for the classification of cell morphologies. We compare our approach with the commonly used method based on MO features. We tested both classification approaches in an experiment with nutritionally deprived cancer tissue cells, while employing several supervised machine learning algorithms. Most of the classifiers provided higher performance when quantitative phase features were employed. Based on the results, it can be concluded that the quantitative phase features played an important role in improving the performance of the classification. The methodology could be valuable help in refining the monitoring of live cells in an automated fashion. We believe that coherence-controlled holographic microscopy, as a tool for quantitative phase imaging, offers all preconditions for the accurate automated analysis of live cell behavior while enabling noninvasive label-free imaging with sufficient contrast and high-spatiotemporal phase sensitivity. (2017) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE).
Topological Classification of Morse Functions and Generalisations of Hilbert's 16-th Problem

International Nuclear Information System (INIS)

Arnold, Vladimir I.

2007-01-01

The topological structures of the generic smooth functions on a smooth manifold belong to the small quantity of the most fundamental objects of study both in pure and applied mathematics. The problem of their study has been formulated by A. Cayley in 1868, who required the classification of the possible configurations of the horizontal lines on the topographical maps of mountain regions, and created the first elements of what is called today 'Morse Theory' and 'Catastrophes Theory'. In the paper we describe this problem, and in particular describe the classification of Morse functions on the 2 sphere and on the torus
The Texas low-level waste compact: Classification and semantic problems

International Nuclear Information System (INIS)

LeMone, D.V.

1995-01-01

The disposal of low-level radioactive wastes for the State of Texas, as well as the participating compact states of Maine and Vermont, will require a stable classification scheme and a mutually acceptable series of definitions for the orderly planning, development, emplacement, and closure of the proposed Texas low-level site. Under the currently utilized system of classification, low-level radioactive wastes are usually segregated under six basic classes. These classes are: Class A, Class B, Class C, NARM, NORM, and Mixed Low-Level Waste. These wastes originate from two primary sources: utility generators and non-utility generators (medical/industrial/university). The Texas Low-Level Radioactive Waste Disposal Site currently will not accept either Greater Than Class C (GTCC) waste or Transuranic (TRU) waste (exceeding 370 Bq/g (10 nCi/g)), thereby establishing the upper limits for disposal. One basic problem for all low-level entities is the national classification scheme. There is no currently defined lower limit for radioactive wastes. This standard is essential and must be addressed in order to effectively project future waste streams. Semantic problems include the rendering of precise definitions for such common words as processing, recycling, generation, etc.; they are not necessarily defined or used in the same sense between generators or states. Consistency in terminology is an absolute essential for adequate nuclear waste management. Other problems that must be addressed include such areas as: types of beneficiation of waste (supercompaction and incineration versus untreated waste), validation of point of origin, consistent and easily recognizable labeling that includes an inventory, transport tracking, and package standards
Comparing Linear Discriminant Function with Logistic Regression for the Two-Group Classification Problem.

Science.gov (United States)

Fan, Xitao; Wang, Lin

The Monte Carlo study compared the performance of predictive discriminant analysis (PDA) and that of logistic regression (LR) for the two-group classification problem. Prior probabilities were used for classification, but the cost of misclassification was assumed to be equal. The study used a fully crossed three-factor experimental design (with…
Training echo state networks for rotation-invariant bone marrow cell classification.

Science.gov (United States)

Kainz, Philipp; Burgsteiner, Harald; Asslaber, Martin; Ahammer, Helmut

2017-01-01

The main principle of diagnostic pathology is the reliable interpretation of individual cells in context of the tissue architecture. Especially a confident examination of bone marrow specimen is dependent on a valid classification of myeloid cells. In this work, we propose a novel rotation-invariant learning scheme for multi-class echo state networks (ESNs), which achieves very high performance in automated bone marrow cell classification. Based on representing static images as temporal sequence of rotations, we show how ESNs robustly recognize cells of arbitrary rotations by taking advantage of their short-term memory capacity. The performance of our approach is compared to a classification random forest that learns rotation-invariance in a conventional way by exhaustively training on multiple rotations of individual samples. The methods were evaluated on a human bone marrow image database consisting of granulopoietic and erythropoietic cells in different maturation stages. Our ESN approach to cell classification does not rely on segmentation of cells or manual feature extraction and can therefore directly be applied to image data.
A Neural-Network-Based Approach to White Blood Cell Classification

Directory of Open Access Journals (Sweden)

Mu-Chun Su

2014-01-01

Full Text Available This paper presents a new white blood cell classification system for the recognition of five types of white blood cells. We propose a new segmentation algorithm for the segmentation of white blood cells from smear images. The core idea of the proposed segmentation algorithm is to find a discriminating region of white blood cells on the HSI color space. Pixels with color lying in the discriminating region described by an ellipsoidal region will be regarded as the nucleus and granule of cytoplasm of a white blood cell. Then, through a further morphological process, we can segment a white blood cell from a smear image. Three kinds of features (i.e., geometrical features, color features, and LDP-based texture features are extracted from the segmented cell. These features are fed into three different kinds of neural networks to recognize the types of the white blood cells. To test the effectiveness of the proposed white blood cell classification system, a total of 450 white blood cells images were used. The highest overall correct recognition rate could reach 99.11% correct. Simulation results showed that the proposed white blood cell classification system was very competitive to some existing systems.
Evaluation of nuclear power plant operating procedures classifications and interfaces: Problems and techniques for improvement

International Nuclear Information System (INIS)

Barnes, V.E.; Radford, L.R.

1987-02-01

This report presents activities and findings of a project designed to evaluate current practices and problems related to procedure classification schemes and procedure interfaces in commercial nuclear power plants. The phrase ''procedure classification scheme'' refers to how plant operating procedures are categorized and indexed (e.g., normal, abnormal, emergency operating procedures). The term ''procedure interface'' refers to how reactor operators are instructed to transition within and between procedures. The project consisted of four key tasks, including (1) a survey of literature regarding problems associated with procedure classifications and interfaces, as well as techniques for overcoming them; (2) interviews with experts in the nuclear industry to discuss the appropriate scope of different classes of operating procedures and techniques for managing interfaces between them; (3) a reanalysis of data gathered about nuclear power plant normal operating and off-normal operating procedures in a related project, ''Program Plan for Assessing and Upgrading Operating Procedures for Nuclear Power Plants''; and (4) solicitation of the comments and expert opinions of a peer review group on the draft project report and on proposed techniques for resolving classification and interface issues. In addition to describing these activities and their results, recommendations for NRC and utility actions to address procedure classification and interface problems are offered
Cell-based therapy technology classifications and translational challenges

Science.gov (United States)

Mount, Natalie M.; Ward, Stephen J.; Kefalas, Panos; Hyllner, Johan

2015-01-01

Cell therapies offer the promise of treating and altering the course of diseases which cannot be addressed adequately by existing pharmaceuticals. Cell therapies are a diverse group across cell types and therapeutic indications and have been an active area of research for many years but are now strongly emerging through translation and towards successful commercial development and patient access. In this article, we present a description of a classification of cell therapies on the basis of their underlying technologies rather than the more commonly used classification by cell type because the regulatory path and manufacturing solutions are often similar within a technology area due to the nature of the methods used. We analyse the progress of new cell therapies towards clinical translation, examine how they are addressing the clinical, regulatory, manufacturing and reimbursement requirements, describe some of the remaining challenges and provide perspectives on how the field may progress for the future. PMID:26416686
Cell of origin associated classification of B-cell malignancies by gene signatures of the normal B-cell hierarchy.

Science.gov (United States)

Johnsen, Hans Erik; Bergkvist, Kim Steve; Schmitz, Alexander; Kjeldsen, Malene Krag; Hansen, Steen Møller; Gaihede, Michael; Nørgaard, Martin Agge; Bæch, John; Grønholdt, Marie-Louise; Jensen, Frank Svendsen; Johansen, Preben; Bødker, Julie Støve; Bøgsted, Martin; Dybkær, Karen

2014-06-01

Recent findings have suggested biological classification of B-cell malignancies as exemplified by the "activated B-cell-like" (ABC), the "germinal-center B-cell-like" (GCB) and primary mediastinal B-cell lymphoma (PMBL) subtypes of diffuse large B-cell lymphoma and "recurrent translocation and cyclin D" (TC) classification of multiple myeloma. Biological classification of B-cell derived cancers may be refined by a direct and systematic strategy where identification and characterization of normal B-cell differentiation subsets are used to define the cancer cell of origin phenotype. Here we propose a strategy combining multiparametric flow cytometry, global gene expression profiling and biostatistical modeling to generate B-cell subset specific gene signatures from sorted normal human immature, naive, germinal centrocytes and centroblasts, post-germinal memory B-cells, plasmablasts and plasma cells from available lymphoid tissues including lymph nodes, tonsils, thymus, peripheral blood and bone marrow. This strategy will provide an accurate image of the stage of differentiation, which prospectively can be used to classify any B-cell malignancy and eventually purify tumor cells. This report briefly describes the current models of the normal B-cell subset differentiation in multiple tissues and the pathogenesis of malignancies originating from the normal germinal B-cell hierarchy.
Density-conserving affine continuous cellular automata solving the relaxed density classification problem

International Nuclear Information System (INIS)

Wolnik, Barbara; Dembowski, Marcin; Bołt, Witold; Baetens, Jan M; De Baets, Bernard

2017-01-01

The focus of this paper is on the density classification problem in the context of affine continuous cellular automata. Although such cellular automata cannot solve this problem in the classical sense, most density-conserving affine continuous cellular automata with a unit neighborhood radius are valid solutions of a slightly relaxed version of this problem. This result follows from a detailed study of the dynamics of the density-conserving affine continuous cellular automata that we introduce. (paper)
Stock Market Index Data and indicators for Day Trading as a Binary Classification problem.

Science.gov (United States)

Bruni, Renato

2017-02-01

Classification is the attribution of labels to records according to a criterion automatically learned from a training set of labeled records. This task is needed in a huge number of practical applications, and consequently it has been studied intensively and several classification algorithms are available today. In finance, a stock market index is a measurement of value of a section of the stock market. It is often used to describe the aggregate trend of a market. One basic financial issue would be forecasting this trend. Clearly, such a stochastic value is very difficult to predict. However, technical analysis is a security analysis methodology developed to forecast the direction of prices through the study of past market data. Day trading consists in buying and selling financial instruments within the same trading day. In this case, one interesting problem is the automatic individuation of favorable days for trading. We model this problem as a binary classification problem, and we provide datasets containing daily index values, the corresponding values of a selection of technical indicators, and the class label, which is 1 if the subsequent time period is favorable for day trading and 0 otherwise. These datasets can be used to test the behavior of different approaches in solving the day trading problem.
Stock Market Index Data and indicators for Day Trading as a Binary Classification problem

Directory of Open Access Journals (Sweden)

Renato Bruni

2017-02-01

Full Text Available Classification is the attribution of labels to records according to a criterion automatically learned from a training set of labeled records. This task is needed in a huge number of practical applications, and consequently it has been studied intensively and several classification algorithms are available today. In finance, a stock market index is a measurement of value of a section of the stock market. It is often used to describe the aggregate trend of a market. One basic financial issue would be forecasting this trend. Clearly, such a stochastic value is very difficult to predict. However, technical analysis is a security analysis methodology developed to forecast the direction of prices through the study of past market data. Day trading consists in buying and selling financial instruments within the same trading day. In this case, one interesting problem is the automatic individuation of favorable days for trading. We model this problem as a binary classification problem, and we provide datasets containing daily index values, the corresponding values of a selection of technical indicators, and the class label, which is 1 if the subsequent time period is favorable for day trading and 0 otherwise. These datasets can be used to test the behavior of different approaches in solving the day trading problem.
Segmentation and classification of cell cycle phases in fluorescence imaging.

Science.gov (United States)

Ersoy, Ilker; Bunyak, Filiz; Chagin, Vadim; Cardoso, M Christina; Palaniappan, Kannappan

2009-01-01

Current chemical biology methods for studying spatiotemporal correlation between biochemical networks and cell cycle phase progression in live-cells typically use fluorescence-based imaging of fusion proteins. Stable cell lines expressing fluorescently tagged protein GFP-PCNA produce rich, dynamically varying sub-cellular foci patterns characterizing the cell cycle phases, including the progress during the S-phase. Variable fluorescence patterns, drastic changes in SNR, shape and position changes and abundance of touching cells require sophisticated algorithms for reliable automatic segmentation and cell cycle classification. We extend the recently proposed graph partitioning active contours (GPAC) for fluorescence-based nucleus segmentation using regional density functions and dramatically improve its efficiency, making it scalable for high content microscopy imaging. We utilize surface shape properties of GFP-PCNA intensity field to obtain descriptors of foci patterns and perform automated cell cycle phase classification, and give quantitative performance by comparing our results to manually labeled data.
HEp-2 Cell Classification Using Shape Index Histograms With Donut-Shaped Spatial Pooling

DEFF Research Database (Denmark)

Larsen, Anders Boesen Lindbo; Vestergaard, Jacob Schack; Larsen, Rasmus

2014-01-01

We present a new method for automatic classification of indirect immunoflourescence images of HEp-2 cells into different staining pattern classes. Our method is based on a new texture measure called shape index histograms that captures second-order image structure at multiple scales. Moreover, we...... datasets. Our results show that shape index histograms are superior to other popular texture descriptors for HEp-2 cell classification. Moreover, when comparing to other automated systems for HEp-2 cell classification we show that shape index histograms are very competitive; especially considering...
Comparison of four approaches to a rock facies classification problem

Science.gov (United States)

Dubois, M.K.; Bohling, Geoffrey C.; Chakrabarti, S.

2007-01-01

In this study, seven classifiers based on four different approaches were tested in a rock facies classification problem: classical parametric methods using Bayes' rule, and non-parametric methods using fuzzy logic, k-nearest neighbor, and feed forward-back propagating artificial neural network. Determining the most effective classifier for geologic facies prediction in wells without cores in the Panoma gas field, in Southwest Kansas, was the objective. Study data include 3600 samples with known rock facies class (from core) with each sample having either four or five measured properties (wire-line log curves), and two derived geologic properties (geologic constraining variables). The sample set was divided into two subsets, one for training and one for testing the ability of the trained classifier to correctly assign classes. Artificial neural networks clearly outperformed all other classifiers and are effective tools for this particular classification problem. Classical parametric models were inadequate due to the nature of the predictor variables (high dimensional and not linearly correlated), and feature space of the classes (overlapping). The other non-parametric methods tested, k-nearest neighbor and fuzzy logic, would need considerable improvement to match the neural network effectiveness, but further work, possibly combining certain aspects of the three non-parametric methods, may be justified. ?? 2006 Elsevier Ltd. All rights reserved.
A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem

Directory of Open Access Journals (Sweden)

Zekić-Sušac Marijana

2014-09-01

Full Text Available Background: Large-dimensional data modelling often relies on variable reduction methods in the pre-processing and in the post-processing stage. However, such a reduction usually provides less information and yields a lower accuracy of the model. Objectives: The aim of this paper is to assess the high-dimensional classification problem of recognizing entrepreneurial intentions of students by machine learning methods. Methods/Approach: Four methods were tested: artificial neural networks, CART classification trees, support vector machines, and k-nearest neighbour on the same dataset in order to compare their efficiency in the sense of classification accuracy. The performance of each method was compared on ten subsamples in a 10-fold cross-validation procedure in order to assess computing sensitivity and specificity of each model. Results: The artificial neural network model based on multilayer perceptron yielded a higher classification rate than the models produced by other methods. The pairwise t-test showed a statistical significance between the artificial neural network and the k-nearest neighbour model, while the difference among other methods was not statistically significant. Conclusions: Tested machine learning methods are able to learn fast and achieve high classification accuracy. However, further advancement can be assured by testing a few additional methodological refinements in machine learning methods.
Parallel computation for blood cell classification in medical hyperspectral imagery

International Nuclear Information System (INIS)

Li, Wei; Wu, Lucheng; Qiu, Xianbo; Ran, Qiong; Xie, Xiaoming

2016-01-01

With the advantage of fine spectral resolution, hyperspectral imagery provides great potential for cell classification. This paper provides a promising classification system including the following three stages: (1) band selection for a subset of spectral bands with distinctive and informative features, (2) spectral-spatial feature extraction, such as local binary patterns (LBP), and (3) followed by an effective classifier. Moreover, these three steps are further implemented on graphics processing units (GPU) respectively, which makes the system real-time and more practical. The GPU parallel implementation is compared with the serial implementation on central processing units (CPU). Experimental results based on real medical hyperspectral data demonstrate that the proposed system is able to offer high accuracy and fast speed, which are appealing for cell classification in medical hyperspectral imagery. (paper)
Identification and Classification of Diseases: Fundamental Problems in Medical Ontology and Epistemology

Directory of Open Access Journals (Sweden)

Lennart Nordenfelt

2013-05-01

Full Text Available During the last three centuries there has been remarkable development in the area of the identification and classification of diseases. The taxonomic systems adopted in the 18th century by, for instance, Sauvages and Linnaeus bare no resemblance to the modern nomenclatures for pathological phenomena. The aim of this paper is to give a brief historical presentation, but also a critical analysis, of a number of crucial ideas and theories behind the construction of certain major disease classifications. My focus in the second half of the paper is on the most influential modern systems of classification, the International Statistical Classification of Diseases and Related Health Problems (ICD and the International Systematized Nomenclature of Human and Veterinary Medicine (SNOMED. The former is the official classification adopted by the World Health Organization and is used mainly for clinical and administrative purposes. The latter is a highly complex system of classification which has recently been developed for a variety of purposes (including medical research and is meant to be read and handled by computers. ICD, although widely used all over the world, has salient and well-known logical deficiencies. SNOMED has been introduced partly to remedy these deficiencies. I conclude, however, that SNOMED, in spite of its sophisticated resources, cannot completely replace ICD. For many clinical and administrative purposes there is need of a relatively simple system that can be handled by the ordinary doctor and the ordinary health-care administrator.

CLASSIFICATION OF TRAFFIC RELATED SHORT TEXTS TO ANALYSE ROAD PROBLEMS IN URBAN AREAS

Directory of Open Access Journals (Sweden)

A. M. M. Saldana-Perez

2017-09-01

Full Text Available The Volunteer Geographic Information (VGI can be used to understand the urban dynamics. In the classification of traffic related short texts to analyze road problems in urban areas, a VGI data analysis is done over a social media’s publications, in order to classify traffic events at big cities that modify the movement of vehicles and people through the roads, such as car accidents, traffic and closures. The classification of traffic events described in short texts is done by applying a supervised machine learning algorithm. In the approach users are considered as sensors which describe their surroundings and provide their geographic position at the social network. The posts are treated by a text mining process and classified into five groups. Finally, the classified events are grouped in a data corpus and geo-visualized in the study area, to detect the places with more vehicular problems.
Classification of Traffic Related Short Texts to Analyse Road Problems in Urban Areas

Science.gov (United States)

Saldana-Perez, A. M. M.; Moreno-Ibarra, M.; Tores-Ruiz, M.

2017-09-01

The Volunteer Geographic Information (VGI) can be used to understand the urban dynamics. In the classification of traffic related short texts to analyze road problems in urban areas, a VGI data analysis is done over a social media's publications, in order to classify traffic events at big cities that modify the movement of vehicles and people through the roads, such as car accidents, traffic and closures. The classification of traffic events described in short texts is done by applying a supervised machine learning algorithm. In the approach users are considered as sensors which describe their surroundings and provide their geographic position at the social network. The posts are treated by a text mining process and classified into five groups. Finally, the classified events are grouped in a data corpus and geo-visualized in the study area, to detect the places with more vehicular problems.
Automated cell type discovery and classification through knowledge transfer

Science.gov (United States)

Lee, Hao-Chih; Kosoy, Roman; Becker, Christine E.

2017-01-01

Abstract Motivation: Recent advances in mass cytometry allow simultaneous measurements of up to 50 markers at single-cell resolution. However, the high dimensionality of mass cytometry data introduces computational challenges for automated data analysis and hinders translation of new biological understanding into clinical applications. Previous studies have applied machine learning to facilitate processing of mass cytometry data. However, manual inspection is still inevitable and becoming the barrier to reliable large-scale analysis. Results: We present a new algorithm called Automated Cell-type Discovery and Classification (ACDC) that fully automates the classification of canonical cell populations and highlights novel cell types in mass cytometry data. Evaluations on real-world data show ACDC provides accurate and reliable estimations compared to manual gating results. Additionally, ACDC automatically classifies previously ambiguous cell types to facilitate discovery. Our findings suggest that ACDC substantially improves both reliability and interpretability of results obtained from high-dimensional mass cytometry profiling data. Availability and Implementation: A Python package (Python 3) and analysis scripts for reproducing the results are availability on https://bitbucket.org/dudleylab/acdc. Contact: brian.kidd@mssm.edu or joel.dudley@mssm.edu Supplementary information: Supplementary data are available at Bioinformatics online. PMID:28158442
Links between early baseline cortisol, attachment classification, and problem behaviors: A test of differential susceptibility versus diathesis-stress.

Science.gov (United States)

Fong, Michelle C; Measelle, Jeffrey; Conradt, Elisabeth; Ablow, Jennifer C

2017-02-01

The purpose of the current study was to predict concurrent levels of problem behaviors from young children's baseline cortisol and attachment classification, a proxy for the quality of caregiving experienced. In a sample of 58 children living at or below the federal poverty threshold, children's baseline cortisol levels, attachment classification, and problem behaviors were assessed at 17 months of age. We hypothesized that an interaction between baseline cortisol and attachment classification would predict problem behaviors above and beyond any main effects of baseline cortisol and attachment. However, based on limited prior research, we did not predict whether or not this interaction would be more consistent with diathesis-stress or differential susceptibility models. Consistent with diathesis-stress theory, the results indicated no significant differences in problem behavior levels among children with high baseline cortisol. In contrast, children with low baseline cortisol had the highest level of problem behaviors in the context of a disorganized attachment relationship. However, in the context of a secure attachment relationship, children with low baseline cortisol looked no different, with respect to problem behavior levels, then children with high cortisol levels. These findings have substantive implications for the socioemotional development of children reared in poverty. Copyright © 2017 Elsevier Inc. All rights reserved.
Numeric pathologic lymph node classification shows prognostic superiority to topographic pN classification in esophageal squamous cell carcinoma.

Science.gov (United States)

Sugawara, Kotaro; Yamashita, Hiroharu; Uemura, Yukari; Mitsui, Takashi; Yagi, Koichi; Nishida, Masato; Aikou, Susumu; Mori, Kazuhiko; Nomura, Sachiyo; Seto, Yasuyuki

2017-10-01

The current eighth tumor node metastasis lymph node category pathologic lymph node staging system for esophageal squamous cell carcinoma is based solely on the number of metastatic nodes and does not consider anatomic distribution. We aimed to assess the prognostic capability of the eighth tumor node metastasis pathologic lymph node staging system (numeric-based) compared with the 11th Japan Esophageal Society (topography-based) pathologic lymph node staging system in patients with esophageal squamous cell carcinoma. We retrospectively reviewed the clinical records of 289 patients with esophageal squamous cell carcinoma who underwent esophagectomy with extended lymph node dissection during the period from January 2006 through June 2016. We compared discrimination abilities for overall survival, recurrence-free survival, and cancer-specific survival between these 2 staging systems using C-statistics. The median number of dissected and metastatic nodes was 61 (25% to 75% quartile range, 45 to 79) and 1 (25% to 75% quartile range, 0 to 3), respectively. The eighth tumor node metastasis pathologic lymph node staging system had a greater ability to accurately determine overall survival (C-statistics: tumor node metastasis classification, 0.69, 95% confidence interval, 0.62-0.76; Japan Esophageal Society classification; 0.65, 95% confidence interval, 0.58-0.71; P = .014) and cancer-specific survival (C-statistics: tumor node metastasis classification, 0.78, 95% confidence interval, 0.70-0.87; Japan Esophageal Society classification; 0.72, 95% confidence interval, 0.64-0.80; P = .018). Rates of total recurrence rose as the eighth tumor node metastasis pathologic lymph node stage increased, while stratification of patients according to the topography-based node classification system was not feasible. Numeric nodal staging is an essential tool for stratifying the oncologic outcomes of patients with esophageal squamous cell carcinoma even in the cohort in which adequate
Intraoperative neuropathology of glioma recurrence: cell detection and classification

Science.gov (United States)

Abas, Fazly S.; Gokozan, Hamza N.; Goksel, Behiye; Otero, Jose J.; Gurcan, Metin N.

2016-03-01

Intraoperative neuropathology of glioma recurrence represents significant visual challenges to pathologists as they carry significant clinical implications. For example, rendering a diagnosis of recurrent glioma can help the surgeon decide to perform more aggressive resection if surgically appropriate. In addition, the success of recent clinical trials for intraoperative administration of therapies, such as inoculation with oncolytic viruses, may suggest that refinement of the intraoperative diagnosis during neurosurgery is an emerging need for pathologists. Typically, these diagnoses require rapid/STAT processing lasting only 20-30 minutes after receipt from neurosurgery. In this relatively short time frame, only dyes, such as hematoxylin and eosin (H and E), can be implemented. The visual challenge lies in the fact that these patients have undergone chemotherapy and radiation, both of which induce cytological atypia in astrocytes, and pathologists are unable to implement helpful biomarkers in their diagnoses. Therefore, there is a need to help pathologists differentiate between astrocytes that are cytologically atypical due to treatment versus infiltrating, recurrent, neoplastic astrocytes. This study focuses on classification of neoplastic versus non-neoplastic astrocytes with the long term goal of providing a better neuropathological computer-aided consultation via classification of cells into reactive gliosis versus recurrent glioma. We present a method to detect cells in H and E stained digitized slides of intraoperative cytologic preparations. The method uses a combination of the `value' component of the HSV color space and `b*' component of the CIE L*a*b* color space to create an enhanced image that suppresses the background while revealing cells on an image. A composite image is formed based on the morphological closing of the hue-luminance combined image. Geometrical and textural features extracted from Discrete Wavelet Frames and combined to classify
Cutaneous B-cell lymphoma : classification, prognostic factors and management recommendations

NARCIS (Netherlands)

Senff, Nancy Johanna

2009-01-01

The term primary cutaneous B-cell lymphomas refers to a heterogeneous group of B-cell non-Hodgkin lymphomas, that present in the skin without evidence of extracutaneous disease at the time of diagnosis. In recent years, there has been considerable debate regarding the classification and terminology
HEp-2 cell image classification method based on very deep convolutional networks with small datasets

Science.gov (United States)

Lu, Mengchi; Gao, Long; Guo, Xifeng; Liu, Qiang; Yin, Jianping

2017-07-01

Human Epithelial-2 (HEp-2) cell images staining patterns classification have been widely used to identify autoimmune diseases by the anti-Nuclear antibodies (ANA) test in the Indirect Immunofluorescence (IIF) protocol. Because manual test is time consuming, subjective and labor intensive, image-based Computer Aided Diagnosis (CAD) systems for HEp-2 cell classification are developing. However, methods proposed recently are mostly manual features extraction with low accuracy. Besides, the scale of available benchmark datasets is small, which does not exactly suitable for using deep learning methods. This issue will influence the accuracy of cell classification directly even after data augmentation. To address these issues, this paper presents a high accuracy automatic HEp-2 cell classification method with small datasets, by utilizing very deep convolutional networks (VGGNet). Specifically, the proposed method consists of three main phases, namely image preprocessing, feature extraction and classification. Moreover, an improved VGGNet is presented to address the challenges of small-scale datasets. Experimental results over two benchmark datasets demonstrate that the proposed method achieves superior performance in terms of accuracy compared with existing methods.
Multiple kernel learning using single stage function approximation for binary classification problems

Science.gov (United States)

Shiju, S.; Sumitra, S.

2017-12-01

In this paper, the multiple kernel learning (MKL) is formulated as a supervised classification problem. We dealt with binary classification data and hence the data modelling problem involves the computation of two decision boundaries of which one related with that of kernel learning and the other with that of input data. In our approach, they are found with the aid of a single cost function by constructing a global reproducing kernel Hilbert space (RKHS) as the direct sum of the RKHSs corresponding to the decision boundaries of kernel learning and input data and searching that function from the global RKHS, which can be represented as the direct sum of the decision boundaries under consideration. In our experimental analysis, the proposed model had shown superior performance in comparison with that of existing two stage function approximation formulation of MKL, where the decision functions of kernel learning and input data are found separately using two different cost functions. This is due to the fact that single stage representation helps the knowledge transfer between the computation procedures for finding the decision boundaries of kernel learning and input data, which inturn boosts the generalisation capacity of the model.
Improvement of Bioactive Compound Classification through Integration of Orthogonal Cell-Based Biosensing Methods

Directory of Open Access Journals (Sweden)

Goran N. Jovanovic

2007-01-01

Full Text Available Lack of specificity for different classes of chemical and biological agents, and false positives and negatives, can limit the range of applications for cell-based biosensors. This study suggests that the integration of results from algal cells (Mesotaenium caldariorum and fish chromatophores (Betta splendens improves classification efficiency and detection reliability. Cells were challenged with paraquat, mercuric chloride, sodium arsenite and clonidine. The two detection systems were independently investigated for classification of the toxin set by performing discriminant analysis. The algal system correctly classified 72% of the bioactive compounds, whereas the fish chromatophore system correctly classified 68%. The combined classification efficiency was 95%. The algal sensor readout is based on fluorescence measurements of changes in the energy producing pathways of photosynthetic cells, whereas the response from fish chromatophores was quantified using optical density. Change in optical density reflects interference with the functioning of cellular signal transduction networks. Thus, algal cells and fish chromatophores respond to the challenge agents through sufficiently different mechanisms of action to be considered orthogonal.
CLASSIFICATION OF RESTRAINTS IN THE OPTIMIZATION PROBLEM OF A COLD-FORMED PROFILE

Directory of Open Access Journals (Sweden)

Agnieszka Łukowicz

2015-11-01

Full Text Available This work describes the restraints in the optimization problem. This is an important and complicated issue because it requires taking into account a vast range of information related to the design and production. In order to describe the relations of a specific optimization problem, it is essential to adopt appropriate criteria and to collect information on all kinds of restraints, i.e. boundary conditions. The following paper verifies the various restraints and defines three subsets: design assumptions, technological limitations and standard conditions. The provided classification was made with reference to the analysis of the construction applicability of the newly patented cold-formed profile.
A Comparison of Machine Learning Methods in a High-Dimensional Classification Problem

OpenAIRE

Zekić-Sušac, Marijana; Pfeifer, Sanja; Šarlija, Nataša

2014-01-01

Background: Large-dimensional data modelling often relies on variable reduction methods in the pre-processing and in the post-processing stage. However, such a reduction usually provides less information and yields a lower accuracy of the model. Objectives: The aim of this paper is to assess the high-dimensional classification problem of recognizing entrepreneurial intentions of students by machine learning methods. Methods/Approach: Four methods were tested: artificial neural networks, CART ...
Knowledge acquisition from natural language for expert systems based on classification problem-solving methods

Science.gov (United States)

Gomez, Fernando

1989-01-01

It is shown how certain kinds of domain independent expert systems based on classification problem-solving methods can be constructed directly from natural language descriptions by a human expert. The expert knowledge is not translated into production rules. Rather, it is mapped into conceptual structures which are integrated into long-term memory (LTM). The resulting system is one in which problem-solving, retrieval and memory organization are integrated processes. In other words, the same algorithm and knowledge representation structures are shared by these processes. As a result of this, the system can answer questions, solve problems or reorganize LTM.
The classification problem in machine learning: an overview with study cases in emotion recognition and music-speech differentiation

OpenAIRE

Rodríguez Cadavid, Santiago

2015-01-01

This work addresses the well-known classification problem in machine learning -- The goal of this study is to approach the reader to the methodological aspects of the feature extraction, feature selection and classifier performance through simple and understandable theoretical aspects and two study cases -- Finally, a very good classification performance was obtained for the emotion recognition from speech
Some debatable problems of stratigraphic classification

Science.gov (United States)

Gladenkov, Yury

2014-05-01

Russian geologists perform large-scale geological mapping in Russia and abroad. Therefore we urge unification of legends of geological maps compiled in different countries. It seems important to continuously organize discussions on problems of stratigraphic classification. 1. The stratigraphic schools (conventionally called "European" and "American") define "stratigraphy" in different ways. The former prefers "single" stratigraphy that uses data proved by many methods. The latter divides stratigraphy into several independent stratigraphers (litho-, bio-, magneto- and others). Russian geologists classify stratigraphic units into general (chronostratigraphic) and special (in accordance with a method applied). 2. There exist different interpretations of chronostratigraphy. Some stratigraphers suppose that a chronostratigraphic unit corresponds to rock strata formed during a certain time interval (it is somewhat formalistic because a length of interval is frequently unspecified). Russian specialists emphasize the historical-geological background of chronostratigraphic units. Every stratigraphic unit (global and regional) reflects a stage of geological evolution of biosphere and stratisphere. 3. In the view of Russian stratigraphers, the main stratigraphic units may have different extent: a) global (stage), b) regional (regional stage,local zone), and c) local (suite). There is no such hierarchy in the ISG. 4. Russian specialists think that local "lithostratigraphic" units (formations) which may have diachronous boundaries are not chronostratigraphic ones in strict sense (actually they are lithological bodies). In this case "lithostratigraphy" can be considered as "prostratigraphy" and employed in initial studies of sequences. Therefore, a suite is a main local unit of the Russian Code and differs from a formation, although it is somewhat similar. It does not mean that lithostratigraphy is unnecessary. Usage of marker horizons, members and other bodies is of great help
Automatic white blood cell classification using pre-trained deep learning models: ResNet and Inception

Science.gov (United States)

Habibzadeh, Mehdi; Jannesari, Mahboobeh; Rezaei, Zahra; Baharvand, Hossein; Totonchi, Mehdi

2018-04-01

This works gives an account of evaluation of white blood cell differential counts via computer aided diagnosis (CAD) system and hematology rules. Leukocytes, also called white blood cells (WBCs) play main role of the immune system. Leukocyte is responsible for phagocytosis and immunity and therefore in defense against infection involving the fatal diseases incidence and mortality related issues. Admittedly, microscopic examination of blood samples is a time consuming, expensive and error-prone task. A manual diagnosis would search for specific Leukocytes and number abnormalities in the blood slides while complete blood count (CBC) examination is performed. Complications may arise from the large number of varying samples including different types of Leukocytes, related sub-types and concentration in blood, which makes the analysis prone to human error. This process can be automated by computerized techniques which are more reliable and economical. In essence, we seek to determine a fast, accurate mechanism for classification and gather information about distribution of white blood evidences which may help to diagnose the degree of any abnormalities during CBC test. In this work, we consider the problem of pre-processing and supervised classification of white blood cells into their four primary types including Neutrophils, Eosinophils, Lymphocytes, and Monocytes using a consecutive proposed deep learning framework. For first step, this research proposes three consecutive pre-processing calculations namely are color distortion; bounding box distortion (crop) and image flipping mirroring. In second phase, white blood cell recognition performed with hierarchy topological feature extraction using Inception and ResNet architectures. Finally, the results obtained from the preliminary analysis of cell classification with (11200) training samples and 1244 white blood cells evaluation data set are presented in confusion matrices and interpreted using accuracy rate, and false
Automated classification of bone marrow cells in microscopic images for diagnosis of leukemia: a comparison of two classification schemes with respect to the segmentation quality

Science.gov (United States)

Krappe, Sebastian; Benz, Michaela; Wittenberg, Thomas; Haferlach, Torsten; Münzenmayer, Christian

2015-03-01

The morphological analysis of bone marrow smears is fundamental for the diagnosis of leukemia. Currently, the counting and classification of the different types of bone marrow cells is done manually with the use of bright field microscope. This is a time consuming, partly subjective and tedious process. Furthermore, repeated examinations of a slide yield intra- and inter-observer variances. For this reason an automation of morphological bone marrow analysis is pursued. This analysis comprises several steps: image acquisition and smear detection, cell localization and segmentation, feature extraction and cell classification. The automated classification of bone marrow cells is depending on the automated cell segmentation and the choice of adequate features extracted from different parts of the cell. In this work we focus on the evaluation of support vector machines (SVMs) and random forests (RFs) for the differentiation of bone marrow cells in 16 different classes, including immature and abnormal cell classes. Data sets of different segmentation quality are used to test the two approaches. Automated solutions for the morphological analysis for bone marrow smears could use such a classifier to pre-classify bone marrow cells and thereby shortening the examination duration.
Wavelet-SVM classification and automatic recognition of unstained viable cells in phase-contrast microscopy

International Nuclear Information System (INIS)

Skoczylas, M.; Rakowski, W.; Cherubini, R.; Gerardi, S.

2011-01-01

Irradiation of individual cultured mammalian cells with a pre-selected number of ions down to one ion per single cell is a useful experimental approach to investigating the low-dose ionising radiation exposure effects and thus contributing to a more realistic human cancer risk assessment. One of the crucial tasks of all the microbeam apparatuses is the visualisation, recognition and positioning of every individual cell of the cell culture to be irradiated. Before irradiations, mammalian cells (specifically, Chinese hamster V79 cells) are seeded and grown as a monolayer on a mylar surface used as the bottom of a specially designed holder. Manual recognition of unstained cells in a bright-field microscope is a time-consuming procedure; therefore, a parallel algorithm has been conceived and developed in order to speed up this irradiation protocol step. Many technical problems have been faced to overcome the complexity of the images to be analysed: cell discrimination in an inhomogeneous background, among many disturbing bodies mainly due to the mylar surface roughness and culture medium bodies; cell shapes, depending on how they attach to the surface, which phase of the cell cycle they are in and on cell density. Preliminary results of the recognition and classification based on a method of wavelet kernels for the support vector machine classifier will be presented. (authors)
Statistical-mechanics analysis of Gaussian labeled-unlabeled classification problems

International Nuclear Information System (INIS)

Tanaka, Toshiyuki

2013-01-01

The labeled-unlabeled classification problem in semi-supervised learning is studied via statistical-mechanics approach. We analytically investigate performance of a learner with an equal-weight mixture of two symmetrically-located Gaussians, performing posterior mean estimation of the parameter vector on the basis of a dataset consisting of labeled and unlabeled data generated from the same probability model as that assumed by the learner. Under the assumption of replica symmetry, we have analytically obtained a set of saddle-point equations, which allows us to numerically evaluate performance of the learner. On the basis of the analytical result we have observed interesting phenomena, in particular the coexistence of good and bad solutions, which may happen when the number of unlabeled data is relatively large compared with that of labeled data
Study of Image Analysis Algorithms for Segmentation, Feature Extraction and Classification of Cells

Directory of Open Access Journals (Sweden)

Margarita Gamarra

2017-08-01

Full Text Available Recent advances in microcopy and improvements in image processing algorithms have allowed the development of computer-assisted analytical approaches in cell identification. Several applications could be mentioned in this field: Cellular phenotype identification, disease detection and treatment, identifying virus entry in cells and virus classification; these applications could help to complement the opinion of medical experts. Although many surveys have been presented in medical image analysis, they focus mainly in tissues and organs and none of the surveys about image cells consider an analysis following the stages in the typical image processing: Segmentation, feature extraction and classification. The goal of this study is to provide comprehensive and critical analyses about the trends in each stage of cell image processing. In this paper, we present a literature survey about cell identification using different image processing techniques.

Modern classification of neoplasms: reconciling differences between morphologic and molecular approaches

International Nuclear Information System (INIS)

Berman, Jules

2005-01-01

For over 150 years, pathologists have relied on histomorphology to classify and diagnose neoplasms. Their success has been stunning, permitting the accurate diagnosis of thousands of different types of neoplasms using only a microscope and a trained eye. In the past two decades, cancer genomics has challenged the supremacy of histomorphology by identifying genetic alterations shared by morphologically diverse tumors and by finding genetic features that distinguish subgroups of morphologically homogeneous tumors. The Developmental Lineage Classification and Taxonomy of Neoplasms groups neoplasms by their embryologic origin. The putative value of this classification is based on the expectation that tumors of a common developmental lineage will share common metabolic pathways and common responses to drugs that target these pathways. The purpose of this manuscript is to show that grouping tumors according to their developmental lineage can reconcile certain fundamental discrepancies resulting from morphologic and molecular approaches to neoplasm classification. In this study, six issues in tumor classification are described that exemplify the growing rift between morphologic and molecular approaches to tumor classification: 1) the morphologic separation between epithelial and non-epithelial tumors; 2) the grouping of tumors based on shared cellular functions; 3) the distinction between germ cell tumors and pluripotent tumors of non-germ cell origin; 4) the distinction between tumors that have lost their differentiation and tumors that arise from uncommitted stem cells; 5) the molecular properties shared by morphologically disparate tumors that have a common developmental lineage, and 6) the problem of re-classifying morphologically identical but clinically distinct subsets of tumors. The discussion of these issues in the context of describing different methods of tumor classification is intended to underscore the clinical value of a robust tumor classification. A
Mapping online transportation service quality and multiclass classification problem solving priorities

Science.gov (United States)

Alamsyah, Andry; Rachmadiansyah, Imam

2018-03-01

Online transportation service is known for its accessibility, transparency, and tariff affordability. These points make online transportation have advantages over the existing conventional transportation service. Online transportation service is an example of disruptive technology that change the relationship between customers and companies. In Indonesia, there are high competition among online transportation provider, hence the companies must maintain and monitor their service level. To understand their position, we apply both sentiment analysis and multiclass classification to understand customer opinions. From negative sentiments, we can identify problems and establish problem-solving priorities. As a case study, we use the most popular online transportation provider in Indonesia: Gojek and Grab. Since many customers are actively give compliment and complain about company’s service level on Twitter, therefore we collect 61,721 tweets in Bahasa during one month observations. We apply Naive Bayes and Support Vector Machine methods to see which model perform best for our data. The result reveal Gojek has better service quality with 19.76% positive and 80.23% negative sentiments than Grab with 9.2% positive and 90.8% negative. The Gojek highest problem-solving priority is regarding application problems, while Grab is about unusable promos. The overall result shows general problems of both case study are related to accessibility dimension which indicate lack of capability to provide good digital access to the end users.
Tight bounds on the size of neural networks for classification problems

Energy Technology Data Exchange (ETDEWEB)

Beiu, V. [Los Alamos National Lab., NM (United States); Pauw, T. de [Universite Catholique de Louvain, Louvain-la-Neuve (Belgium). Dept. de Mathematique

1997-06-01

This paper relies on the entropy of a data-set (i.e., number-of-bits) to prove tight bounds on the size of neural networks solving a classification problem. First, based on a sequence of geometrical steps, the authors constructively compute an upper bound of O(mn) on the number-of-bits for a given data-set - here m is the number of examples and n is the number of dimensions (i.e., R{sup n}). This result is used further in a nonconstructive way to bound the size of neural networks which correctly classify that data-set.
Charting the landscape of priority problems in psychiatry, part 1: classification and diagnosis.

Science.gov (United States)

Stephan, Klaas E; Bach, Dominik R; Fletcher, Paul C; Flint, Jonathan; Frank, Michael J; Friston, Karl J; Heinz, Andreas; Huys, Quentin J M; Owen, Michael J; Binder, Elisabeth B; Dayan, Peter; Johnstone, Eve C; Meyer-Lindenberg, Andreas; Montague, P Read; Schnyder, Ulrich; Wang, Xiao-Jing; Breakspear, Michael

2016-01-01

Contemporary psychiatry faces major challenges. Its syndrome-based disease classification is not based on mechanisms and does not guide treatment, which largely depends on trial and error. The development of therapies is hindered by ignorance of potential beneficiary patient subgroups. Neuroscientific and genetics research have yet to affect disease definitions or contribute to clinical decision making. In this challenging setting, what should psychiatric research focus on? In two companion papers, we present a list of problems nominated by clinicians and researchers from different disciplines as candidates for future scientific investigation of mental disorders. These problems are loosely grouped into challenges concerning nosology and diagnosis (this Personal View) and problems related to pathogenesis and aetiology (in the companion Personal View). Motivated by successful examples in other disciplines, particularly the list of Hilbert's problems in mathematics, this subjective and eclectic list of priority problems is intended for psychiatric researchers, helping to re-focus existing research and providing perspectives for future psychiatric science. Copyright © 2016 Elsevier Ltd. All rights reserved.
Oriented Shape Index Histograms for Cell Classification

DEFF Research Database (Denmark)

Larsen, Anders Boesen Lindbo; Dahl, Anders Bjorholm; Larsen, Rasmus

2015-01-01

We propose a novel extension to the shape index histogram feature descriptor where the orientation of the second-order curvature is included in the histograms. The orientation of the shape index is reminiscent but not equal to gradient orientation which is widely used for feature description. We...... evaluate our new feature descriptor using a public dataset consisting of HEp-2 cell images from indirect immunoflourescence lighting. Our results show that we can improve classification performance significantly when including the shape index orientation. Notably, we show that shape index orientation...
Performance of the majority voting rule in solving the density classification problem in high dimensions

Energy Technology Data Exchange (ETDEWEB)

Gomez Soto, Jose Manuel [Unidad Academica de Matematicas, Universidad Autonoma de Zacatecas, Calzada Solidaridad entronque Paseo a la Bufa, Zacatecas, Zac. (Mexico); Fuks, Henryk, E-mail: jmgomezgoo@gmail.com, E-mail: hfuks@brocku.ca [Department of Mathematics, Brock University, St. Catharines, ON (Canada)

2011-11-04

The density classification problem (DCP) is one of the most widely studied problems in the theory of cellular automata. After it was shown that the DCP cannot be solved perfectly, the research in this area has been focused on finding better rules that could solve the DCP approximately. In this paper, we argue that the majority voting rule in high dimensions can achieve high performance in solving the DCP, and that its performance increases with dimension. We support this conjecture with arguments based on the mean-field approximation and direct computer simulations. (paper)
Performance of the majority voting rule in solving the density classification problem in high dimensions

International Nuclear Information System (INIS)

Gomez Soto, Jose Manuel; Fuks, Henryk

2011-01-01

The density classification problem (DCP) is one of the most widely studied problems in the theory of cellular automata. After it was shown that the DCP cannot be solved perfectly, the research in this area has been focused on finding better rules that could solve the DCP approximately. In this paper, we argue that the majority voting rule in high dimensions can achieve high performance in solving the DCP, and that its performance increases with dimension. We support this conjecture with arguments based on the mean-field approximation and direct computer simulations. (paper)
Proposals for Paraphilic Disorders in the International Classification of Diseases and Related Health Problems, Eleventh Revision (ICD-11)

OpenAIRE

Krueger, Richard B.; Reed, Geoffrey M.; First, Michael B.; Marais, Adele; Kismodi, Eszter; Briken, Peer

2017-01-01

The World Health Organization is currently developing the 11th revision of the International Classifications of Diseases and Related Health Problems (ICD-11), with approval of the ICD-11 by the World Health Assembly anticipated in 2018. The Working Group on the Classification of Sexual Disorders and Sexual Health (WGSDSH) was created and charged with reviewing and making recommendations for categories related to sexuality that are contained in the chapter of Mental and Behavioural Disorders i...
Hierarchical multi-scale classification of nearshore aquatic habitats of the Great Lakes: Western Lake Erie

Science.gov (United States)

McKenna, J.E.; Castiglione, C.

2010-01-01

Classification is a valuable conservation tool for examining natural resource status and problems and is being developed for coastal aquatic habitats. We present an objective, multi-scale hydrospatial framework for nearshore areas of the Great Lakes. The hydrospatial framework consists of spatial units at eight hierarchical scales from the North American Continent to the individual 270-m spatial cell. Characterization of spatial units based on fish abundance and diversity provides a fish-guided classification of aquatic areas at each spatial scale and demonstrates how classifications may be generated from that framework. Those classification units then provide information about habitat, as well as biotic conditions, which can be compared, contrasted, and hierarchically related spatially. Examples within several representative coastal or open water zones of the Western Lake Erie pilot area highlight potential application of this classification system to management problems. This classification system can assist natural resource managers with planning and establishing priorities for aquatic habitat protection, developing rehabilitation strategies, or identifying special management actions.
Selective ablation of Copper-Indium-Diselenide solar cells monitored by laser-induced breakdown spectroscopy and classification methods

Energy Technology Data Exchange (ETDEWEB)

Diego-Vallejo, David [Technische Universität Berlin, Institute of Optics and Atomic Physics, Straße des 17, Juni 135, 10623 Berlin (Germany); Laser- und Medizin- Technologie Berlin GmbH (LMTB), Applied Laser Technology, Fabeckstr. 60-62, 14195 Berlin (Germany); Ashkenasi, David, E-mail: d.ashkenasi@lmtb.de [Laser- und Medizin- Technologie Berlin GmbH (LMTB), Applied Laser Technology, Fabeckstr. 60-62, 14195 Berlin (Germany); Lemke, Andreas [Laser- und Medizin- Technologie Berlin GmbH (LMTB), Applied Laser Technology, Fabeckstr. 60-62, 14195 Berlin (Germany); Eichler, Hans Joachim [Technische Universität Berlin, Institute of Optics and Atomic Physics, Straße des 17, Juni 135, 10623 Berlin (Germany); Laser- und Medizin- Technologie Berlin GmbH (LMTB), Applied Laser Technology, Fabeckstr. 60-62, 14195 Berlin (Germany)

2013-09-01

Laser-induced breakdown spectroscopy (LIBS) and two classification methods, i.e. linear correlation and artificial neural networks (ANN), are used to monitor P1, P2 and P3 scribing steps of Copper-Indium-Diselenide (CIS) solar cells. Narrow channels featuring complete removal of desired layers with minimum damage on the underlying film are expected to enhance efficiency of solar cells. The monitoring technique is intended to determine that enough material has been removed to reach the desired layer based on the analysis of plasma emission acquired during multiple pass laser scribing. When successful selective scribing is achieved, a high degree of similarity between test and reference spectra has to be identified by classification methods in order to stop the scribing procedure and avoid damaging the bottom layer. Performance of linear correlation and artificial neural networks is compared and evaluated for two spectral bandwidths. By using experimentally determined combinations of classifier and analyzed spectral band for each step, classification performance achieves errors of 7, 1 and 4% for steps P1, P2 and P3, respectively. The feasibility of using plasma emission for the supervision of processing steps of solar cell manufacturing is demonstrated. This method has the potential to be implemented as an online monitoring procedure assisting the production of solar cells. - Highlights: • LIBS and two classification methods were used to monitor CIS solar cells processing. • Selective ablation of thin-film solar cells was improved with inspection system. • Customized classification method and analyzed spectral band enhanced performance.
Recursive automatic classification algorithms

Energy Technology Data Exchange (ETDEWEB)

Bauman, E V; Dorofeyuk, A A

1982-03-01

A variational statement of the automatic classification problem is given. The dependence of the form of the optimal partition surface on the form of the classification objective functional is investigated. A recursive algorithm is proposed for maximising a functional of reasonably general form. The convergence problem is analysed in connection with the proposed algorithm. 8 references.
Feature Importance for Human Epithelial (HEp-2 Cell Image Classification

Directory of Open Access Journals (Sweden)

Vibha Gupta

2018-02-01

Full Text Available Indirect Immuno-Fluorescence (IIF microscopy imaging of human epithelial (HEp-2 cells is a popular method for diagnosing autoimmune diseases. Considering large data volumes, computer-aided diagnosis (CAD systems, based on image-based classification, can help in terms of time, effort, and reliability of diagnosis. Such approaches are based on extracting some representative features from the images. This work explores the selection of the most distinctive features for HEp-2 cell images using various feature selection (FS methods. Considering that there is no single universally optimal feature selection technique, we also propose hybridization of one class of FS methods (filter methods. Furthermore, the notion of variable importance for ranking features, provided by another type of approaches (embedded methods such as Random forest, Random uniform forest is exploited to select a good subset of features from a large set, such that addition of new features does not increase classification accuracy. In this work, we have also, with great consideration, designed class-specific features to capture morphological visual traits of the cell patterns. We perform various experiments and discussions to demonstrate the effectiveness of FS methods along with proposed and a standard feature set. We achieve state-of-the-art performance even with small number of features, obtained after the feature selection.
Comparative Study of Classification Techniques on Breast Cancer FNA Biopsy Data

Directory of Open Access Journals (Sweden)

George Rumbe

2010-12-01

Full Text Available Accurate diagnostic detection of the cancerous cells in a patient is critical and may alter the subsequent treatment and increase the chances of survival rate. Machine learning techniques have been instrumental in disease detection and are currently being used in various classification problems due to their accurate prediction performance. Various techniques may provide different desired accuracies and it is therefore imperative to use the most suitable method which provides the best desired results. This research seeks to provide comparative analysis of Support Vector Machine, Bayesian classifier and other Artificial neural network classifiers (Backpropagation, linear programming, Learning vector quantization, and K nearest neighborhood on the Wisconsin breast cancer classification problem.
[Classification of cell-based medicinal products and legal implications: An overview and an update].

Science.gov (United States)

Scherer, Jürgen; Flory, Egbert

2015-11-01

In general, cell-based medicinal products do not represent a uniform class of medicinal products, but instead comprise medicinal products with diverse regulatory classification as advanced-therapy medicinal products (ATMP), medicinal products (MP), tissue preparations, or blood products. Due to the legal and scientific consequences of the development and approval of MPs, classification should be clarified as early as possible. This paper describes the legal situation in Germany and highlights specific criteria and concepts for classification, with a focus on, but not limited to, ATMPs and non-ATMPs. Depending on the stage of product development and the specific application submitted to a competent authority, legally binding classification is done by the German Länder Authorities, Paul-Ehrlich-Institut, or European Medicines Agency. On request by the applicants, the Committee for Advanced Therapies may issue scientific recommendations for classification.
Support Vector Machine and Parametric Wavelet-Based Texture Classification of Stem Cell Images

National Research Council Canada - National Science Library

Jeffreys, Christopher

2004-01-01

.... Since colony texture is a major discriminating feature in determining quality, we introduce a non-invasive, semi-automated texture-based stem cell colony classification methodology to aid researchers...
Multi-element neutron activation analysis and solution of classification problems using multidimensional statistics

International Nuclear Information System (INIS)

Vaganov, P.A.; Kol'tsov, A.A.; Kulikov, V.D.; Mejer, V.A.

1983-01-01

The multi-element instrumental neutron activation analysis of samples of mountain rocks (sandstones, aleurolites and shales of one of gold deposits) is performed. The spectra of irradiated samples are measured by Ge(Li) detector of the volume of 35 mm 3 . The content of 22 chemical elements is determined in each sample. The results of analysis serve as reliable basis for multi-dimensional statistic information processing, they constitute the basis for the generalized characteristics of rocks which brings about the solution of classification problem for rocks of different deposits
Revisiting Classification of Eating Disorders-toward Diagnostic and Statistical Manual of Mental Disorders-5 and International Statistical Classification of Diseases and Related Health Problems-11.

Science.gov (United States)

Goyal, Shrigopal; Balhara, Yatan Pal Singh; Khandelwal, S K

2012-07-01

Two of the most commonly used nosological systems- International Statistical Classification of Diseases and Related Health Problems (ICD)-10 and Diagnostic and Statistical Manual of Mental Disorders (DSM)-IV are under revision. This process has generated a lot of interesting debates with regards to future of the current diagnostic categories. In fact, the status of categorical approach in the upcoming versions of ICD and DSM is also being debated. The current article focuses on the debate with regards to the eating disorders. The existing classification of eating disorders has been criticized for its limitations. A host of new diagnostic categories have been recommended for inclusion in the upcoming revisions. Also the structure of the existing categories has also been put under scrutiny.
Health problems among detainees in Switzerland: a study using the ICPC-2 classification

Directory of Open Access Journals (Sweden)

Bertrand Dominique

2011-04-01

Full Text Available Abstract Background Little is known about the health status of prisoners in Switzerland. The aim of this study was to provide a detailed description of the health problems presented by detainees in Switzerland's largest remand prison. Methods In this retrospective cross-sectional study we reviewed the health records of all detainees leaving Switzerland's largest remand prison in 2007. The health problems were coded using the International Classification for Primary Care (ICPC-2. Analyses were descriptive, stratified by gender. Results A total of 2195 health records were reviewed. Mean age was 29.5 years (SD 9.5; 95% were male; 87.8% were migrants. Mean length of stay was 80 days (SD 160. Illicit drug use (40.2% and mental health problems (32.6% were frequent, but most of these detainees (57.6% had more generic primary care problems, such as skin (27.0%, infectious diseases (23.5%, musculoskeletal (19.2%, injury related (18.3%, digestive (15.0% or respiratory problems (14.0%. Furthermore, 7.9% reported exposure to violence during arrest by the police. Conclusion Morbidity is high in this young, predominantly male population of detainees, in particular in relation to substance abuse. Other health problems more commonly seen in general practice are also frequent. These findings support the further development of coordinated primary care and mental health services within detention centers.
Segmentation and Classification of Bone Marrow Cells Images Using Contextual Information for Medical Diagnosis of Acute Leukemias.

Directory of Open Access Journals (Sweden)

Carolina Reta

Full Text Available Morphological identification of acute leukemia is a powerful tool used by hematologists to determine the family of such a disease. In some cases, experienced physicians are even able to determine the leukemia subtype of the sample. However, the identification process may have error rates up to 40% (when classifying acute leukemia subtypes depending on the physician's experience and the sample quality. This problem raises the need to create automatic tools that provide hematologists with a second opinion during the classification process. Our research presents a contextual analysis methodology for the detection of acute leukemia subtypes from bone marrow cells images. We propose a cells separation algorithm to break up overlapped regions. In this phase, we achieved an average accuracy of 95% in the evaluation of the segmentation process. In a second phase, we extract descriptive features to the nucleus and cytoplasm obtained in the segmentation phase in order to classify leukemia families and subtypes. We finally created a decision algorithm that provides an automatic diagnosis for a patient. In our experiments, we achieved an overall accuracy of 92% in the supervised classification of acute leukemia families, 84% for the lymphoblastic subtypes, and 92% for the myeloblastic subtypes. Finally, we achieved accuracies of 95% in the diagnosis of leukemia families and 90% in the diagnosis of leukemia subtypes.
Cellular image classification

CERN Document Server

Xu, Xiang; Lin, Feng

2017-01-01

This book introduces new techniques for cellular image feature extraction, pattern recognition and classification. The authors use the antinuclear antibodies (ANAs) in patient serum as the subjects and the Indirect Immunofluorescence (IIF) technique as the imaging protocol to illustrate the applications of the described methods. Throughout the book, the authors provide evaluations for the proposed methods on two publicly available human epithelial (HEp-2) cell datasets: ICPR2012 dataset from the ICPR'12 HEp-2 cell classification contest and ICIP2013 training dataset from the ICIP'13 Competition on cells classification by fluorescent image analysis. First, the reading of imaging results is significantly influenced by one’s qualification and reading systems, causing high intra- and inter-laboratory variance. The authors present a low-order LP21 fiber mode for optical single cell manipulation and imaging staining patterns of HEp-2 cells. A focused four-lobed mode distribution is stable and effective in optical...

Automated morphological analysis of bone marrow cells in microscopic images for diagnosis of leukemia: nucleus-plasma separation and cell classification using a hierarchical tree model of hematopoesis

Science.gov (United States)

Krappe, Sebastian; Wittenberg, Thomas; Haferlach, Torsten; Münzenmayer, Christian

2016-03-01

The morphological differentiation of bone marrow is fundamental for the diagnosis of leukemia. Currently, the counting and classification of the different types of bone marrow cells is done manually under the use of bright field microscopy. This is a time-consuming, subjective, tedious and error-prone process. Furthermore, repeated examinations of a slide may yield intra- and inter-observer variances. For that reason a computer assisted diagnosis system for bone marrow differentiation is pursued. In this work we focus (a) on a new method for the separation of nucleus and plasma parts and (b) on a knowledge-based hierarchical tree classifier for the differentiation of bone marrow cells in 16 different classes. Classification trees are easily interpretable and understandable and provide a classification together with an explanation. Using classification trees, expert knowledge (i.e. knowledge about similar classes and cell lines in the tree model of hematopoiesis) is integrated in the structure of the tree. The proposed segmentation method is evaluated with more than 10,000 manually segmented cells. For the evaluation of the proposed hierarchical classifier more than 140,000 automatically segmented bone marrow cells are used. Future automated solutions for the morphological analysis of bone marrow smears could potentially apply such an approach for the pre-classification of bone marrow cells and thereby shortening the examination time.
New fuzzy support vector machine for the class imbalance problem in medical datasets classification.

Science.gov (United States)

Gu, Xiaoqing; Ni, Tongguang; Wang, Hongyuan

2014-01-01

In medical datasets classification, support vector machine (SVM) is considered to be one of the most successful methods. However, most of the real-world medical datasets usually contain some outliers/noise and data often have class imbalance problems. In this paper, a fuzzy support machine (FSVM) for the class imbalance problem (called FSVM-CIP) is presented, which can be seen as a modified class of FSVM by extending manifold regularization and assigning two misclassification costs for two classes. The proposed FSVM-CIP can be used to handle the class imbalance problem in the presence of outliers/noise, and enhance the locality maximum margin. Five real-world medical datasets, breast, heart, hepatitis, BUPA liver, and pima diabetes, from the UCI medical database are employed to illustrate the method presented in this paper. Experimental results on these datasets show the outperformed or comparable effectiveness of FSVM-CIP.
New Fuzzy Support Vector Machine for the Class Imbalance Problem in Medical Datasets Classification

Directory of Open Access Journals (Sweden)

Xiaoqing Gu

2014-01-01

Full Text Available In medical datasets classification, support vector machine (SVM is considered to be one of the most successful methods. However, most of the real-world medical datasets usually contain some outliers/noise and data often have class imbalance problems. In this paper, a fuzzy support machine (FSVM for the class imbalance problem (called FSVM-CIP is presented, which can be seen as a modified class of FSVM by extending manifold regularization and assigning two misclassification costs for two classes. The proposed FSVM-CIP can be used to handle the class imbalance problem in the presence of outliers/noise, and enhance the locality maximum margin. Five real-world medical datasets, breast, heart, hepatitis, BUPA liver, and pima diabetes, from the UCI medical database are employed to illustrate the method presented in this paper. Experimental results on these datasets show the outperformed or comparable effectiveness of FSVM-CIP.
Cell-based product classification procedure: What can be done differently to improve decisions on borderline products?

Science.gov (United States)

Izeta, Ander; Herrera, Concha; Mata, Rosario; Astori, Giuseppe; Giordano, Rosaria; Hernández, Carmen; Leyva, Laura; Arias, Salvador; Oyonarte, Salvador; Carmona, Gloria; Cuende, Natividad

2016-07-01

In June 2015, European Medicines Agency/Committee for Advanced Therapies (CAT) released the new version of the reflection paper on classification of advanced therapy medicinal products (ATMPs) established to address questions of borderline cases in which classification of a product based on genes, cells or tissues is unclear. The paper shows CAT's understanding of substantial manipulation and essential function(s) criteria that define the legal scope of cell-based medicinal products. This article aims to define the authors' viewpoint on the reflection paper. ATMP classification has intrinsic weaknesses derived from the lack of clarity of the evolving concepts of substantial manipulation and essential function(s) as stated in the EU Regulation, leading to the risk of differing interpretations and misclassification. This might result in the broadening of ATMP scope at the expense of other products such as cell/tissue transplants and blood products, or even putting some present and future clinical practice at risk of being classified as ATMP. Because of the major organizational, economic and regulatory implications of product classification, we advocate for increased interaction between CAT and competent authorities (CAs) for medicines, blood and blood components and tissues and cells or for the creation of working groups including representatives of all parties as recently suggested by several CAs. Copyright © 2016 International Society for Cellular Therapy. Published by Elsevier Inc. All rights reserved.
Integrated pillar scatterers for speeding up classification of cell holograms.

Science.gov (United States)

Lugnan, Alessio; Dambre, Joni; Bienstman, Peter

2017-11-27

The computational power required to classify cell holograms is a major limit to the throughput of label-free cell sorting based on digital holographic microscopy. In this work, a simple integrated photonic stage comprising a collection of silica pillar scatterers is proposed as an effective nonlinear mixing interface between the light scattered by a cell and an image sensor. The light processing provided by the photonic stage allows for the use of a simple linear classifier implemented in the electric domain and applied on a limited number of pixels. A proof-of-concept of the presented machine learning technique, which is based on the extreme learning machine (ELM) paradigm, is provided by the classification results on samples generated by 2D FDTD simulations of cells in a microfluidic channel.
Artificial neural networks for classification in metabolomic studies of whole cells using 1H nuclear magnetic resonance.

LENUS (Irish Health Repository)

Brougham, D F

2011-01-01

We report the successful classification, by artificial neural networks (ANNs), of (1)H NMR spectroscopic data recorded on whole-cell culture samples of four different lung carcinoma cell lines, which display different drug resistance patterns. The robustness of the approach was demonstrated by its ability to classify the cell line correctly in 100% of cases, despite the demonstrated presence of operator-induced sources of variation, and irrespective of which spectra are used for training and for validation. The study demonstrates the potential of ANN for lung carcinoma classification in realistic situations.
Deep Learning for ECG Classification

Science.gov (United States)

Pyakillya, B.; Kazachenko, N.; Mikhailovsky, N.

2017-10-01

The importance of ECG classification is very high now due to many current medical applications where this problem can be stated. Currently, there are many machine learning (ML) solutions which can be used for analyzing and classifying ECG data. However, the main disadvantages of these ML results is use of heuristic hand-crafted or engineered features with shallow feature learning architectures. The problem relies in the possibility not to find most appropriate features which will give high classification accuracy in this ECG problem. One of the proposing solution is to use deep learning architectures where first layers of convolutional neurons behave as feature extractors and in the end some fully-connected (FCN) layers are used for making final decision about ECG classes. In this work the deep learning architecture with 1D convolutional layers and FCN layers for ECG classification is presented and some classification results are showed.
Correlation between patients' reasons for encounters/health problems and population density in Japan: a systematic review of observational studies coded by the International Classification of Health Problems in Primary Care (ICHPPC) and the International Classification of Primary care (ICPC).

Science.gov (United States)

Kaneko, Makoto; Ohta, Ryuichi; Nago, Naoki; Fukushi, Motoharu; Matsushima, Masato

2017-09-13

The Japanese health care system has yet to establish structured training for primary care physicians; therefore, physicians who received an internal medicine based training program continue to play a principal role in the primary care setting. To promote the development of a more efficient primary health care system, the assessment of its current status in regard to the spectrum of patients' reasons for encounters (RFEs) and health problems is an important step. Recognizing the proportions of patients' RFEs and health problems, which are not generally covered by an internist, can provide valuable information to promote the development of a primary care physician-centered system. We conducted a systematic review in which we searched six databases (PubMed, the Cochrane Library, Google Scholar, Ichushi-Web, JDreamIII and CiNii) for observational studies in Japan coded by International Classification of Health Problems in Primary Care (ICHPPC) and International Classification of Primary Care (ICPC) up to March 2015. We employed population density as index of accessibility. We calculated Spearman's rank correlation coefficient to examine the correlation between the proportion of "non-internal medicine-related" RFEs and health problems in each study area in consideration of the population density. We found 17 studies with diverse designs and settings. Among these studies, "non-internal medicine-related" RFEs, which was not thought to be covered by internists, ranged from about 4% to 40%. In addition, "non-internal medicine-related" health problems ranged from about 10% to 40%. However, no significant correlation was found between population density and the proportion of "non-internal medicine-related" RFEs and health problems. This is the first systematic review on RFEs and health problems coded by ICHPPC and ICPC undertaken to reveal the diversity of health problems in Japanese primary care. These results suggest that primary care physicians in some rural areas of Japan
BIOCAT: a pattern recognition platform for customizable biological image classification and annotation.

Science.gov (United States)

Zhou, Jie; Lamichhane, Santosh; Sterne, Gabriella; Ye, Bing; Peng, Hanchuan

2013-10-04

Pattern recognition algorithms are useful in bioimage informatics applications such as quantifying cellular and subcellular objects, annotating gene expressions, and classifying phenotypes. To provide effective and efficient image classification and annotation for the ever-increasing microscopic images, it is desirable to have tools that can combine and compare various algorithms, and build customizable solution for different biological problems. However, current tools often offer a limited solution in generating user-friendly and extensible tools for annotating higher dimensional images that correspond to multiple complicated categories. We develop the BIOimage Classification and Annotation Tool (BIOCAT). It is able to apply pattern recognition algorithms to two- and three-dimensional biological image sets as well as regions of interest (ROIs) in individual images for automatic classification and annotation. We also propose a 3D anisotropic wavelet feature extractor for extracting textural features from 3D images with xy-z resolution disparity. The extractor is one of the about 20 built-in algorithms of feature extractors, selectors and classifiers in BIOCAT. The algorithms are modularized so that they can be "chained" in a customizable way to form adaptive solution for various problems, and the plugin-based extensibility gives the tool an open architecture to incorporate future algorithms. We have applied BIOCAT to classification and annotation of images and ROIs of different properties with applications in cell biology and neuroscience. BIOCAT provides a user-friendly, portable platform for pattern recognition based biological image classification of two- and three- dimensional images and ROIs. We show, via diverse case studies, that different algorithms and their combinations have different suitability for various problems. The customizability of BIOCAT is thus expected to be useful for providing effective and efficient solutions for a variety of biological
Microarray-based classification of diffuse large B-cell lymphoma

DEFF Research Database (Denmark)

Poulsen, Christian Bjørn; Borup, Rehannah; Nielsen, Finn Cilius

2005-01-01

on the Affymetrix HG-U133A oligonucleotide arrays and improve the classification, we determined the expression profiles of pretreatment, diagnostic samples from 52 primary nodal DLBCL. METHODS AND RESULTS: First, three previously published gene lists were converted to the HG-U133A probe sets and used......OBJECTIVE: Hierarchical clusterings of diffuse large B-cell lymphoma (DLBCL) based on gene expression signatures have previously been used to classify DLBCL into Germinal Center B-cell (GCB) and Activated B-cell (ABC) types. To examine if it was feasible to perform a cross-platform validation...... for hierarchical clustering. In this way, three subtypes, including the GCB type (n = 20), the ABC type (n = 25) and an intermediate group, Type-3 (n = 5), were distinguished. The CD10 and Bcl-6 expression as well as t(14;18) translocation were prevalent, but not exclusive to the GCB type. By contrast, MUM1...
New Approach to Analyzing Physics Problems: A Taxonomy of Introductory Physics Problems

Science.gov (United States)

Teodorescu, Raluca E.; Bennhold, Cornelius; Feldman, Gerald; Medsker, Larry

2013-01-01

This paper describes research on a classification of physics problems in the context of introductory physics courses. This classification, called the Taxonomy of Introductory Physics Problems (TIPP), relates physics problems to the cognitive processes required to solve them. TIPP was created in order to design educational objectives, to develop…
Analysis of Influence of Different Relations Types on the Quality of Thesaurus Application to Text Classification Problems

Directory of Open Access Journals (Sweden)

Nadezhda S. Lagutina

2017-01-01

Full Text Available The main purpose of the article is to analyze how effectively different types of thesaurus relations can be used for solutions of text classification tasks. The basis of the study is an automatically generated thesaurus of a subject area, that contains three types of relations: synonymous, hierarchical and associative. To generate the thesaurus the authors use a hybrid method based on several linguistic and statistical algorithms for extraction of semantic relations. The method allows to create a thesaurus with a sufficiently large number of terms and relations among them. The authors consider two problems: topical text classification and sentiment classification of large newspaper articles. To solve them, the authors developed two approaches that complement standard algorithms with a procedure that take into account thesaurus relations to determine semantic features of texts. The approach to topical classification includes the standard unsupervised BM25 algorithm and the procedure, that take into account synonymous and hierarchical relations of the thesaurus of the subject area. The approach to sentiment classification consists of two steps. At the first step, a thesaurus is created, whose terms weight polarities are calculated depending on the term occurrences in the training set or on the weights of related thesaurus terms. At the second step, the thesaurus is used to compute the features of words from texts and to classify texts by the algorithm SVM or Naive Bayes. In experiments with text corpora BBCSport, Reuters, PubMed and the corpus of articles about American immigrants, the authors varied the types of thesaurus relations that are involved in the classification and the degree of their use. The results of the experiments make it possible to evaluate the efficiency of the application of thesaurus relations for classification of raw texts and to determine under what conditions certain relationships affect more or less. In particular, the
Binary classification posed as a quadratically constrained quadratic ...

Indian Academy of Sciences (India)

Binary classification is posed as a quadratically constrained quadratic problem and solved using the proposed method. Each class in the binary classification problem is modeled as a multidimensional ellipsoid to forma quadratic constraint in the problem. Particle swarms help in determining the optimal hyperplane or ...
Developing a case mix classification for child and adolescent mental health services: the influence of presenting problems, complexity factors and service providers on number of appointments.

Science.gov (United States)

Martin, Peter; Davies, Roger; Macdougall, Amy; Ritchie, Benjamin; Vostanis, Panos; Whale, Andy; Wolpert, Miranda

2017-09-01

Case-mix classification is a focus of international attention in considering how best to manage and fund services, by providing a basis for fairer comparison of resource utilization. Yet there is little evidence of the best ways to establish case mix for child and adolescent mental health services (CAMHS). To develop a case mix classification for CAMHS that is clinically meaningful and predictive of number of appointments attended and to investigate the influence of presenting problems, context and complexity factors and provider variation. We analysed 4573 completed episodes of outpatient care from 11 English CAMHS. Cluster analysis, regression trees and a conceptual classification based on clinical best practice guidelines were compared regarding their ability to predict number of appointments, using mixed effects negative binomial regression. The conceptual classification is clinically meaningful and did as well as data-driven classifications in accounting for number of appointments. There was little evidence for effects of complexity or context factors, with the possible exception of school attendance problems. Substantial variation in resource provision between providers was not explained well by case mix. The conceptually-derived classification merits further testing and development in the context of collaborative decision making.
GA Based Optimal Feature Extraction Method for Functional Data Classification

OpenAIRE

Jun Wan; Zehua Chen; Yingwu Chen; Zhidong Bai

2010-01-01

Classification is an interesting problem in functional data analysis (FDA), because many science and application problems end up with classification problems, such as recognition, prediction, control, decision making, management, etc. As the high dimension and high correlation in functional data (FD), it is a key problem to extract features from FD whereas keeping its global characters, which relates to the classification efficiency and precision to heavens. In this paper...
Data quality objectives for the B-Cell waste stream classification sampling

International Nuclear Information System (INIS)

Barnett, J.M.

1998-01-01

This document defines the data quality objectives, (DQOS) for sampling the B-Cell racks waste stream. The sampling effort is concentrated on determining a ratio of Cs-137 to Sr-90 and Cs-137 to transuranics (TRU). Figure 1.0 shows the logic path of sampling effort. The flow chart begins with sample and data acquisition and progresses toward (a) statistical confidence and waste classification boundaries, (b) management decisions based on the input parameters and technical methods available, and (c) grout container volume/weight limits and radiation limits. The end result will be accurately classifying the B-Cell rack waste stream
The future of general classification

DEFF Research Database (Denmark)

Mai, Jens Erik

2013-01-01

Discusses problems related to accessing multiple collections using a single retrieval language. Surveys the concepts of interoperability and switching language. Finds that mapping between more indexing languages always will be an approximation. Surveys the issues related to general classification...... and contrasts that to special classifications. Argues for the use of general classifications to provide access to collections nationally and internationally....
On the classification of the spectrally stable standing waves of the Hartree problem

Science.gov (United States)

Georgiev, Vladimir; Stefanov, Atanas

2018-05-01

We consider the fractional Hartree model, with general power non-linearity and arbitrary spatial dimension. We construct variationally the "normalized" solutions for the corresponding Choquard-Pekar model-in particular a number of key properties, like smoothness and bell-shapedness are established. As a consequence of the construction, we show that these solitons are spectrally stable as solutions to the time-dependent Hartree model. In addition, we analyze the spectral stability of the Moroz-Van Schaftingen solitons of the classical Hartree problem, in any dimensions and power non-linearity. A full classification is obtained, the main conclusion of which is that only and exactly the "normalized" solutions (which exist only in a portion of the range) are spectrally stable.
A reliable Raman-spectroscopy-based approach for diagnosis, classification and follow-up of B-cell acute lymphoblastic leukemia

Science.gov (United States)

Managò, Stefano; Valente, Carmen; Mirabelli, Peppino; Circolo, Diego; Basile, Filomena; Corda, Daniela; de Luca, Anna Chiara

2016-04-01

Acute lymphoblastic leukemia type B (B-ALL) is a neoplastic disorder that shows high mortality rates due to immature lymphocyte B-cell proliferation. B-ALL diagnosis requires identification and classification of the leukemia cells. Here, we demonstrate the use of Raman spectroscopy to discriminate normal lymphocytic B-cells from three different B-leukemia transformed cell lines (i.e., RS4;11, REH, MN60 cells) based on their biochemical features. In combination with immunofluorescence and Western blotting, we show that these Raman markers reflect the relative changes in the potential biological markers from cell surface antigens, cytoplasmic proteins, and DNA content and correlate with the lymphoblastic B-cell maturation/differentiation stages. Our study demonstrates the potential of this technique for classification of B-leukemia cells into the different differentiation/maturation stages, as well as for the identification of key biochemical changes under chemotherapeutic treatments. Finally, preliminary results from clinical samples indicate high consistency of, and potential applications for, this Raman spectroscopy approach.
Voice based gender classification using machine learning

Science.gov (United States)

Raahul, A.; Sapthagiri, R.; Pankaj, K.; Vijayarajan, V.

2017-11-01

Gender identification is one of the major problem speech analysis today. Tracing the gender from acoustic data i.e., pitch, median, frequency etc. Machine learning gives promising results for classification problem in all the research domains. There are several performance metrics to evaluate algorithms of an area. Our Comparative model algorithm for evaluating 5 different machine learning algorithms based on eight different metrics in gender classification from acoustic data. Agenda is to identify gender, with five different algorithms: Linear Discriminant Analysis (LDA), K-Nearest Neighbour (KNN), Classification and Regression Trees (CART), Random Forest (RF), and Support Vector Machine (SVM) on basis of eight different metrics. The main parameter in evaluating any algorithms is its performance. Misclassification rate must be less in classification problems, which says that the accuracy rate must be high. Location and gender of the person have become very crucial in economic markets in the form of AdSense. Here with this comparative model algorithm, we are trying to assess the different ML algorithms and find the best fit for gender classification of acoustic data.

A proposed United States resource classification system

International Nuclear Information System (INIS)

Masters, C.D.

1980-01-01

Energy is a world-wide problem calling for world-wide communication to resolve the many supply and distribution problems. Essential to a communication problem are a definition and comparability of elements being communicated. The US Geological Survey, with the co-operation of the US Bureau of Mines and the US Department of Energy, has devised a classification system for all mineral resources, the principles of which, it is felt, offer the possibility of world communication. At present several other systems, extant or under development (Potential Gas Committee of the USA, United Nations Resource Committee, and the American Society of Testing and Materials) are internally consistent and provide easy communication linkage. The system in use by the uranium community in the United States of America, however, ties resource quantities to forward-cost dollar values rendering them inconsistent with other classifications and therefore not comparable. This paper develops the rationale for the new USGS resource classification and notes its benefits relative to a forward-cost classification and its relationship specifically to other current classifications. (author)
New Dandelion Algorithm Optimizes Extreme Learning Machine for Biomedical Classification Problems

Directory of Open Access Journals (Sweden)

Xiguang Li

2017-01-01

Full Text Available Inspired by the behavior of dandelion sowing, a new novel swarm intelligence algorithm, namely, dandelion algorithm (DA, is proposed for global optimization of complex functions in this paper. In DA, the dandelion population will be divided into two subpopulations, and different subpopulations will undergo different sowing behaviors. Moreover, another sowing method is designed to jump out of local optimum. In order to demonstrate the validation of DA, we compare the proposed algorithm with other existing algorithms, including bat algorithm, particle swarm optimization, and enhanced fireworks algorithm. Simulations show that the proposed algorithm seems much superior to other algorithms. At the same time, the proposed algorithm can be applied to optimize extreme learning machine (ELM for biomedical classification problems, and the effect is considerable. At last, we use different fusion methods to form different fusion classifiers, and the fusion classifiers can achieve higher accuracy and better stability to some extent.
Learning to recognise : A study on one-class classification and active learning

NARCIS (Netherlands)

Juszczak, P.

2006-01-01

The thesis treats classification problems which are undersampled or where there exist an unbalance between classes in the sampling. The thesis is divided into three parts. The first two parts treat the problem of one-class classification. In the one-class classification problem, it is assumed that
Quantitative Cell Cycle Analysis Based on an Endogenous All-in-One Reporter for Cell Tracking and Classification

Directory of Open Access Journals (Sweden)

Thomas Zerjatke

2017-05-01

Full Text Available Cell cycle kinetics are crucial to cell fate decisions. Although live imaging has provided extensive insights into this relationship at the single-cell level, the limited number of fluorescent markers that can be used in a single experiment has hindered efforts to link the dynamics of individual proteins responsible for decision making directly to cell cycle progression. Here, we present fluorescently tagged endogenous proliferating cell nuclear antigen (PCNA as an all-in-one cell cycle reporter that allows simultaneous analysis of cell cycle progression, including the transition into quiescence, and the dynamics of individual fate determinants. We also provide an image analysis pipeline for automated segmentation, tracking, and classification of all cell cycle phases. Combining the all-in-one reporter with labeled endogenous cyclin D1 and p21 as prime examples of cell-cycle-regulated fate determinants, we show how cell cycle and quantitative protein dynamics can be simultaneously extracted to gain insights into G1 phase regulation and responses to perturbations.
Development of the Contiguous-cells Transportation Problem

Directory of Open Access Journals (Sweden)

O. E. Charles-Owaba

2015-08-01

Full Text Available The issue of scheduling a long string of multi-period activities which have to be completed without interruption has always been an industrial challenge. The existing production/maintenance scheduling algorithms can only handle situations where activities can be split into two or more sets of activities carried out in non-contiguous sets of work periods. This study proposes a contiguous-periods production/maintenance scheduling approach using the Transportation Model. Relevant variables and parameters of contiguous-cells scheduling problem were taken from the literature. A scheduling optimization problem was defined and solved using a contiguous-cells transportation algorithm (CCTA which was applied in order to determine the optimal maintenance schedule of a fleet of ships at a dockyard in South-Western Nigeria. Fifteen different problems were solved. It is concluded that the contiguous-cells transportation approach to production/ maintenance scheduling is feasible. The model will be a useful decision support tool for scheduling maintenance operations.
Automatic Hierarchical Color Image Classification

Directory of Open Access Journals (Sweden)

Jing Huang

2003-02-01

Full Text Available Organizing images into semantic categories can be extremely useful for content-based image retrieval and image annotation. Grouping images into semantic classes is a difficult problem, however. Image classification attempts to solve this hard problem by using low-level image features. In this paper, we propose a method for hierarchical classification of images via supervised learning. This scheme relies on using a good low-level feature and subsequently performing feature-space reconfiguration using singular value decomposition to reduce noise and dimensionality. We use the training data to obtain a hierarchical classification tree that can be used to categorize new images. Our experimental results suggest that this scheme not only performs better than standard nearest-neighbor techniques, but also has both storage and computational advantages.
ON DEPARTURES FROM INDEPENDENCE IN CROSS-CLASSIFICATIONS.

Science.gov (United States)

CASE, C. MARSTON

THIS NOTE IS CONCERNED WITH IDEAS AND PROBLEMS INVOLVED IN CROSS-CLASSIFICATION OF OBSERVATIONS ON A GIVEN POPULATION, ESPECIALLY TWO-DIMENSIONAL CROSS-CLASSIFICATIONS. MAIN OBJECTIVES OF THE NOTE INCLUDE--(1) ESTABLISHMENT OF A CONCEPTUAL FRAMEWORK FOR CHARACTERIZATION AND COMPARISON OF CROSS-CLASSIFICATIONS, (2) DISCUSSION OF EXISTING METHODS…
Using fuzzy logic for automatic control: Case study of a problem of cereals samples classification

Directory of Open Access Journals (Sweden)

Lakhoua Najeh Mohamed

2009-01-01

Full Text Available The aim of this paper is to present the use of fuzzy logic for automatic control of industrial systems particularly the way to approach a problem of classification. We present a case study of a grading system of cereals that allows us to determine the price of transactions of cereals in Tunisia. Our contribution in this work consists in proposing not only an application of the fuzzy logic on the grading system of cereals but also a methodology enabling the proposing of a new grading system based on the concept of 'Grade' while using the fuzzy logic techniques. .
Requirements Elicitation Problems: A Literature Analysis

Directory of Open Access Journals (Sweden)

Bill Davey

2015-06-01

Full Text Available Requirements elicitation is the process through which analysts determine the software requirements of stakeholders. Requirements elicitation is seldom well done, and an inaccurate or incomplete understanding of user requirements has led to the downfall of many software projects. This paper proposes a classification of problem types that occur in requirements elicitation. The classification has been derived from a literature analysis. Papers reporting on techniques for improving requirements elicitation practice were examined for the problem the technique was designed to address. In each classification the most recent or prominent techniques for ameliorating the problems are presented. The classification allows the requirements engineer to be sensitive to problems as they arise and the educator to structure delivery of requirements elicitation training.
Fuzzy Expert System based on a Novel Hybrid Stem Cell (HSC) Algorithm for Classification of Micro Array Data.

Science.gov (United States)

Vijay, S Arul Antran; GaneshKumar, P

2018-02-21

In the growing scenario, microarray data is extensively used since it provides a more comprehensive understanding of genetic variants among diseases. As the gene expression samples have high dimensionality it becomes tedious to analyze the samples manually. Hence an automated system is needed to analyze these samples. The fuzzy expert system offers a clear classification when compared to the machine learning and statistical methodologies. In fuzzy classification, knowledge acquisition would be a major concern. Despite several existing approaches for knowledge acquisition much effort is necessary to enhance the learning process. This paper proposes an innovative Hybrid Stem Cell (HSC) algorithm that utilizes Ant Colony optimization and Stem Cell algorithm for designing fuzzy classification system to extract the informative rules to form the membership functions from the microarray dataset. The HSC algorithm uses a novel Adaptive Stem Cell Optimization (ASCO) to improve the points of membership function and Ant Colony Optimization to produce the near optimum rule set. In order to extract the most informative genes from the large microarray dataset a method called Mutual Information is used. The performance results of the proposed technique evaluated using the five microarray datasets are simulated. These results prove that the proposed Hybrid Stem Cell (HSC) algorithm produces a precise fuzzy system than the existing methodologies.
Assessing the Psychosocial Problems In Parenting Sickle-Cell ...

African Journals Online (AJOL)

Aim: To assess the psycho social problems encountered in parenting sickle-cell children in Enugu. Method: The subjects include all parents, guardian, foster parents of sickle cell children who have the responsibility of caring for sickle-cell children and who have attended the sickle-cell clinic of the UNTH between June to ...
Artificial neural network classification using a minimal training set - Comparison to conventional supervised classification

Science.gov (United States)

Hepner, George F.; Logan, Thomas; Ritter, Niles; Bryant, Nevin

1990-01-01

Recent research has shown an artificial neural network (ANN) to be capable of pattern recognition and the classification of image data. This paper examines the potential for the application of neural network computing to satellite image processing. A second objective is to provide a preliminary comparison and ANN classification. An artificial neural network can be trained to do land-cover classification of satellite imagery using selected sites representative of each class in a manner similar to conventional supervised classification. One of the major problems associated with recognition and classifications of pattern from remotely sensed data is the time and cost of developing a set of training sites. This reseach compares the use of an ANN back propagation classification procedure with a conventional supervised maximum likelihood classification procedure using a minimal training set. When using a minimal training set, the neural network is able to provide a land-cover classification superior to the classification derived from the conventional classification procedure. This research is the foundation for developing application parameters for further prototyping of software and hardware implementations for artificial neural networks in satellite image and geographic information processing.
A Multi-layer Hybrid Framework for Dimensional Emotion Classification

NARCIS (Netherlands)

Nicolaou, Mihalis A.; Gunes, Hatice; Pantic, Maja

2011-01-01

This paper investigates dimensional emotion prediction and classification from naturalistic facial expressions. Similarly to many pattern recognition problems, dimensional emotion classification requires generating multi-dimensional outputs. To date, classification for valence and arousal dimensions
EEG Eye State Identification Using Incremental Attribute Learning with Time-Series Classification

Directory of Open Access Journals (Sweden)

Ting Wang

2014-01-01

Full Text Available Eye state identification is a kind of common time-series classification problem which is also a hot spot in recent research. Electroencephalography (EEG is widely used in eye state classification to detect human's cognition state. Previous research has validated the feasibility of machine learning and statistical approaches for EEG eye state classification. This paper aims to propose a novel approach for EEG eye state identification using incremental attribute learning (IAL based on neural networks. IAL is a novel machine learning strategy which gradually imports and trains features one by one. Previous studies have verified that such an approach is applicable for solving a number of pattern recognition problems. However, in these previous works, little research on IAL focused on its application to time-series problems. Therefore, it is still unknown whether IAL can be employed to cope with time-series problems like EEG eye state classification. Experimental results in this study demonstrates that, with proper feature extraction and feature ordering, IAL can not only efficiently cope with time-series classification problems, but also exhibit better classification performance in terms of classification error rates in comparison with conventional and some other approaches.
Global Optimization Ensemble Model for Classification Methods

Science.gov (United States)

Anwar, Hina; Qamar, Usman; Muzaffar Qureshi, Abdul Wahab

2014-01-01

Supervised learning is the process of data mining for deducing rules from training datasets. A broad array of supervised learning algorithms exists, every one of them with its own advantages and drawbacks. There are some basic issues that affect the accuracy of classifier while solving a supervised learning problem, like bias-variance tradeoff, dimensionality of input space, and noise in the input data space. All these problems affect the accuracy of classifier and are the reason that there is no global optimal method for classification. There is not any generalized improvement method that can increase the accuracy of any classifier while addressing all the problems stated above. This paper proposes a global optimization ensemble model for classification methods (GMC) that can improve the overall accuracy for supervised learning problems. The experimental results on various public datasets showed that the proposed model improved the accuracy of the classification models from 1% to 30% depending upon the algorithm complexity. PMID:24883382
Global Optimization Ensemble Model for Classification Methods

Directory of Open Access Journals (Sweden)

Hina Anwar

2014-01-01

Full Text Available Supervised learning is the process of data mining for deducing rules from training datasets. A broad array of supervised learning algorithms exists, every one of them with its own advantages and drawbacks. There are some basic issues that affect the accuracy of classifier while solving a supervised learning problem, like bias-variance tradeoff, dimensionality of input space, and noise in the input data space. All these problems affect the accuracy of classifier and are the reason that there is no global optimal method for classification. There is not any generalized improvement method that can increase the accuracy of any classifier while addressing all the problems stated above. This paper proposes a global optimization ensemble model for classification methods (GMC that can improve the overall accuracy for supervised learning problems. The experimental results on various public datasets showed that the proposed model improved the accuracy of the classification models from 1% to 30% depending upon the algorithm complexity.
Solar Cell Production in Nigeria: Prospects, Options and Problems

International Nuclear Information System (INIS)

Fasasi, A. Y.; Siyanbola, W.O.; Ibitoye, F. I.; Pelemo, D. A.

2002-01-01

The prospects and problems facing solar cell production in Nigeria are discussed. The paper reviews many proven solar cell materials in terms of their current efficiencies and production costs. Silicon solar cell production appears to be the best technology option for Nigeria because of the abundant quartz sand and waste products from our phosphate fertiliser company that can be employed as starting materials to produce solar grade silicon. Factors affecting solar cell efficiency, choice of solar cell as well as financial and material problems limiting the progress on silicon solar cell production are also discussed. Finally, the paper recommends the simultaneous production of solar grade silicon and coordinated development of the balance of system components as first steps towards actualizing this objective
Maximum mutual information regularized classification

KAUST Repository

Wang, Jim Jing-Yan

2014-09-07

In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned classifier, the uncertainty of the true class label of a data sample should be reduced by knowing its classification response as much as possible. The reduced uncertainty is measured by the mutual information between the classification response and the true class label. To this end, when learning a linear classifier, we propose to maximize the mutual information between classification responses and true class labels of training samples, besides minimizing the classification error and reducing the classifier complexity. An objective function is constructed by modeling mutual information with entropy estimation, and it is optimized by a gradient descend method in an iterative algorithm. Experiments on two real world pattern classification problems show the significant improvements achieved by maximum mutual information regularization.
Maximum mutual information regularized classification

KAUST Repository

Wang, Jim Jing-Yan; Wang, Yi; Zhao, Shiguang; Gao, Xin

2014-01-01

In this paper, a novel pattern classification approach is proposed by regularizing the classifier learning to maximize mutual information between the classification response and the true class label. We argue that, with the learned classifier, the uncertainty of the true class label of a data sample should be reduced by knowing its classification response as much as possible. The reduced uncertainty is measured by the mutual information between the classification response and the true class label. To this end, when learning a linear classifier, we propose to maximize the mutual information between classification responses and true class labels of training samples, besides minimizing the classification error and reducing the classifier complexity. An objective function is constructed by modeling mutual information with entropy estimation, and it is optimized by a gradient descend method in an iterative algorithm. Experiments on two real world pattern classification problems show the significant improvements achieved by maximum mutual information regularization.
Data Clustering and Evolving Fuzzy Decision Tree for Data Base Classification Problems

Science.gov (United States)

Chang, Pei-Chann; Fan, Chin-Yuan; Wang, Yen-Wen

Data base classification suffers from two well known difficulties, i.e., the high dimensionality and non-stationary variations within the large historic data. This paper presents a hybrid classification model by integrating a case based reasoning technique, a Fuzzy Decision Tree (FDT), and Genetic Algorithms (GA) to construct a decision-making system for data classification in various data base applications. The model is major based on the idea that the historic data base can be transformed into a smaller case-base together with a group of fuzzy decision rules. As a result, the model can be more accurately respond to the current data under classifying from the inductions by these smaller cases based fuzzy decision trees. Hit rate is applied as a performance measure and the effectiveness of our proposed model is demonstrated by experimentally compared with other approaches on different data base classification applications. The average hit rate of our proposed model is the highest among others.

Effective Exchange Rate Classifications and Growth

OpenAIRE

Justin M. Dubas; Byung-Joo Lee; Nelson C. Mark

2005-01-01

We propose an econometric procedure for obtaining de facto exchange rate regime classifications which we apply to study the relationship between exchange rate regimes and economic growth. Our classification method models the de jure regimes as outcomes of a multinomial logit choice problem conditional on the volatility of a country's effective exchange rate, a bilateral exchange rate and international reserves. An `effective' de facto exchange rate regime classification is then obtained by as...
Cell nuclei attributed relational graphs for efficient representation and classification of gastric cancer in digital histopathology

Science.gov (United States)

Sharma, Harshita; Zerbe, Norman; Heim, Daniel; Wienert, Stephan; Lohmann, Sebastian; Hellwich, Olaf; Hufnagl, Peter

2016-03-01

This paper describes a novel graph-based method for efficient representation and subsequent classification in histological whole slide images of gastric cancer. Her2/neu immunohistochemically stained and haematoxylin and eosin stained histological sections of gastric carcinoma are digitized. Immunohistochemical staining is used in practice by pathologists to determine extent of malignancy, however, it is laborious to visually discriminate the corresponding malignancy levels in the more commonly used haematoxylin and eosin stain, and this study attempts to solve this problem using a computer-based method. Cell nuclei are first isolated at high magnification using an automatic cell nuclei segmentation strategy, followed by construction of cell nuclei attributed relational graphs of the tissue regions. These graphs represent tissue architecture comprehensively, as they contain information about cell nuclei morphology as vertex attributes, along with knowledge of neighborhood in the form of edge linking and edge attributes. Global graph characteristics are derived and ensemble learning is used to discriminate between three types of malignancy levels, namely, non-tumor, Her2/neu positive tumor and Her2/neu negative tumor. Performance is compared with state of the art methods including four texture feature groups (Haralick, Gabor, Local Binary Patterns and Varma Zisserman features), color and intensity features, and Voronoi diagram and Delaunay triangulation. Texture, color and intensity information is also combined with graph-based knowledge, followed by correlation analysis. Quantitative assessment is performed using two cross validation strategies. On investigating the experimental results, it can be concluded that the proposed method provides a promising way for computer-based analysis of histopathological images of gastric cancer.
ON THE WAYS OF AUTOMATED PROCESSING OF SPATIAL GEOMETRY OF THE SYSTEM “GATE-CASTING” FOR SOLVING OF THE CLASSIFICATION PROBLEMS

Directory of Open Access Journals (Sweden)

A. N. Chichko

2007-01-01

Full Text Available The system parameterization of castings, allowing to formalize spatial geometry of casting, is offered. The algorithm of taxonomy, which can be used for solving of problems of castings classification in the systems of computeraided design of foundry technologies, is described. The method is approved on castings of type ''cover”.
A Quantum Hybrid PSO Combined with Fuzzy k-NN Approach to Feature Selection and Cell Classification in Cervical Cancer Detection

Directory of Open Access Journals (Sweden)

Abdullah M. Iliyasu

2017-12-01

Full Text Available A quantum hybrid (QH intelligent approach that blends the adaptive search capability of the quantum-behaved particle swarm optimisation (QPSO method with the intuitionistic rationality of traditional fuzzy k-nearest neighbours (Fuzzy k-NN algorithm (known simply as the Q-Fuzzy approach is proposed for efficient feature selection and classification of cells in cervical smeared (CS images. From an initial multitude of 17 features describing the geometry, colour, and texture of the CS images, the QPSO stage of our proposed technique is used to select the best subset features (i.e., global best particles that represent a pruned down collection of seven features. Using a dataset of almost 1000 images, performance evaluation of our proposed Q-Fuzzy approach assesses the impact of our feature selection on classification accuracy by way of three experimental scenarios that are compared alongside two other approaches: the All-features (i.e., classification without prior feature selection and another hybrid technique combining the standard PSO algorithm with the Fuzzy k-NN technique (P-Fuzzy approach. In the first and second scenarios, we further divided the assessment criteria in terms of classification accuracy based on the choice of best features and those in terms of the different categories of the cervical cells. In the third scenario, we introduced new QH hybrid techniques, i.e., QPSO combined with other supervised learning methods, and compared the classification accuracy alongside our proposed Q-Fuzzy approach. Furthermore, we employed statistical approaches to establish qualitative agreement with regards to the feature selection in the experimental scenarios 1 and 3. The synergy between the QPSO and Fuzzy k-NN in the proposed Q-Fuzzy approach improves classification accuracy as manifest in the reduction in number cell features, which is crucial for effective cervical cancer detection and diagnosis.
Maxillectomy defects: a suggested classification scheme.

Science.gov (United States)

Akinmoladun, V I; Dosumu, O O; Olusanya, A A; Ikusika, O F

2013-06-01

The term "maxillectomy" has been used to describe a variety of surgical procedures for a spectrum of diseases involving a diverse anatomical site. Hence, classifications of maxillectomy defects have often made communication difficult. This article highlights this problem, emphasises the need for a uniform system of classification and suggests a classification system which is simple and comprehensive. Articles related to this subject, especially those with specified classifications of maxillary surgical defects were sourced from the internet through Google, Scopus and PubMed using the search terms maxillectomy defects classification. A manual search through available literature was also done. The review of the materials revealed many classifications and modifications of classifications from the descriptive, reconstructive and prosthodontic perspectives. No globally acceptable classification exists among practitioners involved in the management of diseases in the mid-facial region. There were over 14 classifications of maxillary defects found in the English literature. Attempts made to address the inadequacies of previous classifications have tended to result in cumbersome and relatively complex classifications. A single classification that is based on both surgical and prosthetic considerations is most desirable and is hereby proposed.
Contributions for classification of platelet rich plasma - proposal of a new classification: MARSPILL.

Science.gov (United States)

Lana, Jose Fabio Santos Duarte; Purita, Joseph; Paulus, Christian; Huber, Stephany Cares; Rodrigues, Bruno Lima; Rodrigues, Ana Amélia; Santana, Maria Helena; Madureira, João Lopo; Malheiros Luzo, Ângela Cristina; Belangero, William Dias; Annichino-Bizzacchi, Joyce Maria

2017-07-01

Platelet-rich plasma (PRP) has emerged as a significant therapy used in medical conditions with heterogeneous results. There are some important classifications to try to standardize the PRP procedure. The aim of this report is to describe PRP contents studying celular and molecular components, and also propose a new classification for PRP. The main focus is on mononuclear cells, which comprise progenitor cells and monocytes. In addition, there are important variables related to PRP application incorporated in this study, which are the harvest method, activation, red blood cells, number of spins, image guidance, leukocytes number and light activation. The other focus is the discussion about progenitor cells presence on peripherial blood which are interesting due to neovasculogenesis and proliferation. The function of monocytes (in tissue-macrophages) are discussed here and also its plasticity, a potential property for regenerative medicine treatments.
Urogenital tuberculosis: definition and classification.

Science.gov (United States)

Kulchavenya, Ekaterina

2014-10-01

To improve the approach to the diagnosis and management of urogenital tuberculosis (UGTB), we need clear and unique classification. UGTB remains an important problem, especially in developing countries, but it is often an overlooked disease. As with any other infection, UGTB should be cured by antibacterial therapy, but because of late diagnosis it may often require surgery. Scientific literature dedicated to this problem was critically analyzed and juxtaposed with the author's own more than 30 years' experience in tuberculosis urology. The conception, terms and definition were consolidated into one system; classification stage by stage as well as complications are presented. Classification of any disease includes dispersion on forms and stages and exact definitions for each stage. Clinical features and symptoms significantly vary between different forms and stages of UGTB. A simple diagnostic algorithm was constructed. UGTB is multivariant disease and a standard unified approach to it is impossible. Clear definition as well as unique classification are necessary for real estimation of epidemiology and the optimization of therapy. The term 'UGTB' has insufficient information in order to estimate therapy, surgery and prognosis, or to evaluate the epidemiology.
Searching bioremediation patents through Cooperative Patent Classification (CPC).

Science.gov (United States)

Prasad, Rajendra

2016-03-01

Patent classification systems have traditionally evolved independently at each patent jurisdiction to classify patents handled by their examiners to be able to search previous patents while dealing with new patent applications. As patent databases maintained by them went online for free access to public as also for global search of prior art by examiners, the need arose for a common platform and uniform structure of patent databases. The diversity of different classification, however, posed problems of integrating and searching relevant patents across patent jurisdictions. To address this problem of comparability of data from different sources and searching patents, WIPO in the recent past developed what is known as International Patent Classification (IPC) system which most countries readily adopted to code their patents with IPC codes along with their own codes. The Cooperative Patent Classification (CPC) is the latest patent classification system based on IPC/European Classification (ECLA) system, developed by the European Patent Office (EPO) and the United States Patent and Trademark Office (USPTO) which is likely to become a global standard. This paper discusses this new classification system with reference to patents on bioremediation.
Classification of pyodestructive pulmonary diseases

International Nuclear Information System (INIS)

Muromskij, Yu.A.; Semivolkov, V.I.; Shlenova, L.A.

1993-01-01

Classification of pyodestructive lungs diseases, thier complications and outcomes is proposed which makes it possible for physioians engaged in studying respiratory organs pathology to orient themselves in problems of diagnosis and treatment tactics. The above classification is developed on the basis of studying the disease anamnesis and its clinical process, as well as on the basis of roentgenological and morphological study results by more than 10000 patients
Analysis of mesenchymal stem cell differentiation in vitro using classification association rule mining.

Science.gov (United States)

Wang, Weiqi; Wang, Yanbo Justin; Bañares-Alcántara, René; Coenen, Frans; Cui, Zhanfeng

2009-12-01

In this paper, data mining is used to analyze the data on the differentiation of mammalian Mesenchymal Stem Cells (MSCs), aiming at discovering known and hidden rules governing MSC differentiation, following the establishment of a web-based public database containing experimental data on the MSC proliferation and differentiation. To this effect, a web-based public interactive database comprising the key parameters which influence the fate and destiny of mammalian MSCs has been constructed and analyzed using Classification Association Rule Mining (CARM) as a data-mining technique. The results show that the proposed approach is technically feasible and performs well with respect to the accuracy of (classification) prediction. Key rules mined from the constructed MSC database are consistent with experimental observations, indicating the validity of the method developed and the first step in the application of data mining to the study of MSCs.
Racial classification in the evolutionary sciences: a comparative analysis.

Science.gov (United States)

Billinger, Michael S

2007-01-01

Human racial classification has long been a problem for the discipline of anthropology, but much of the criticism of the race concept has focused on its social and political connotations. The central argument of this paper is that race is not a specifically human problem, but one that exists in evolutionary thought in general. This paper looks at various disciplinary approaches to racial or subspecies classification, extending its focus beyond the anthropological race concept by providing a comparative analysis of the use of racial classification in evolutionary biology, genetics, and anthropology.
Giant cell arteritis. Part I. Terminology, classification, clinical manifestations, diagnosis

Directory of Open Access Journals (Sweden)

Azamat Makhmudovich Satybaldyev

2012-01-01

Full Text Available Giant cell arteritis (GCA is a vasculitis affecting mainly large and medium-sized arteries, which the classification of systemic vasculitides refers to as those mainly involving the large vessels. GCA is typified by the involvement of extracranial aortic branches and intracranial vessels, the aorta and its large vessels are being affected most frequently. The paper considers the terminology, classification, prevalence, major pathogenic mechanisms, and morphology of GCA. A broad spectrum of its clinical subtypes is due to target vessel stenosis caused by intimal hyperplasia. In 40% of cases, GCA is shown to be accompanied by polymyalgia rheumatica that may either precede or manifest simultaneously with GCA, or follow this disease. The menacing complications of GCA may be visual loss or ischemic strokes at various sites depending on the location of the occluded vessel. Along with the gold standard verification of the diagnosis of GCA, namely temporal artery biopsy, the author indicates other (noninvasive methods for detection of vascular lesions: color Doppler ultrasonography of the temporal arteries, fluorescein angiography of the retina, mag-netic resonance angiography, magnetic resonance imaging, and computed tomography to rule out aortic aneurysm. Dynamic 18F positron emission tomography is demonstrated to play a role in the evaluation of therapeutic effectiveness.
Packet Classification by Multilevel Cutting of the Classification Space: An Algorithmic-Architectural Solution for IP Packet Classification in Next Generation Networks

Directory of Open Access Journals (Sweden)

Motasem Aldiab

2008-01-01

Full Text Available Traditionally, the Internet provides only a “best-effort” service, treating all packets going to the same destination equally. However, providing differentiated services for different users based on their quality requirements is increasingly becoming a demanding issue. For this, routers need to have the capability to distinguish and isolate traffic belonging to different flows. This ability to determine the flow each packet belongs to is called packet classification. Technology vendors are reluctant to support algorithmic solutions for classification due to their nondeterministic performance. Although content addressable memories (CAMs are favoured by technology vendors due to their deterministic high-lookup rates, they suffer from the problems of high-power consumption and high-silicon cost. This paper provides a new algorithmic-architectural solution for packet classification that mixes CAMs with algorithms based on multilevel cutting of the classification space into smaller spaces. The provided solution utilizes the geometrical distribution of rules in the classification space. It provides the deterministic performance of CAMs, support for dynamic updates, and added flexibility for system designers.
Semantic Document Image Classification Based on Valuable Text Pattern

Directory of Open Access Journals (Sweden)

Hossein Pourghassem

2011-01-01

Full Text Available Knowledge extraction from detected document image is a complex problem in the field of information technology. This problem becomes more intricate when we know, a negligible percentage of the detected document images are valuable. In this paper, a segmentation-based classification algorithm is used to analysis the document image. In this algorithm, using a two-stage segmentation approach, regions of the image are detected, and then classified to document and non-document (pure region regions in the hierarchical classification. In this paper, a novel valuable definition is proposed to classify document image in to valuable or invaluable categories. The proposed algorithm is evaluated on a database consisting of the document and non-document image that provide from Internet. Experimental results show the efficiency of the proposed algorithm in the semantic document image classification. The proposed algorithm provides accuracy rate of 98.8% for valuable and invaluable document image classification problem.
Support Vector Machines for Hyperspectral Remote Sensing Classification

Science.gov (United States)

Gualtieri, J. Anthony; Cromp, R. F.

1998-01-01

The Support Vector Machine provides a new way to design classification algorithms which learn from examples (supervised learning) and generalize when applied to new data. We demonstrate its success on a difficult classification problem from hyperspectral remote sensing, where we obtain performances of 96%, and 87% correct for a 4 class problem, and a 16 class problem respectively. These results are somewhat better than other recent results on the same data. A key feature of this classifier is its ability to use high-dimensional data without the usual recourse to a feature selection step to reduce the dimensionality of the data. For this application, this is important, as hyperspectral data consists of several hundred contiguous spectral channels for each exemplar. We provide an introduction to this new approach, and demonstrate its application to classification of an agriculture scene.
An edit script for taxonomic classifications

Directory of Open Access Journals (Sweden)

Valiente Gabriel

2005-08-01

Full Text Available Abstract Background The NCBI taxonomy provides one of the most powerful ways to navigate sequence data bases but currently users are forced to formulate queries according to a single taxonomic classification. Given that there is not universal agreement on the classification of organisms, providing a single classification places constraints on the questions biologists can ask. However, maintaining multiple classifications is burdensome in the face of a constantly growing NCBI classification. Results In this paper, we present a solution to the problem of generating modifications of the NCBI taxonomy, based on the computation of an edit script that summarises the differences between two classification trees. Our algorithms find the shortest possible edit script based on the identification of all shared subtrees, and only take time quasi linear in the size of the trees because classification trees have unique node labels. Conclusion These algorithms have been recently implemented, and the software is freely available for download from http://darwin.zoology.gla.ac.uk/~rpage/forest/.
An Incremental Classification Algorithm for Mining Data with Feature Space Heterogeneity

Directory of Open Access Journals (Sweden)

Yu Wang

2014-01-01

Full Text Available Feature space heterogeneity often exists in many real world data sets so that some features are of different importance for classification over different subsets. Moreover, the pattern of feature space heterogeneity might dynamically change over time as more and more data are accumulated. In this paper, we develop an incremental classification algorithm, Supervised Clustering for Classification with Feature Space Heterogeneity (SCCFSH, to address this problem. In our approach, supervised clustering is implemented to obtain a number of clusters such that samples in each cluster are from the same class. After the removal of outliers, relevance of features in each cluster is calculated based on their variations in this cluster. The feature relevance is incorporated into distance calculation for classification. The main advantage of SCCFSH lies in the fact that it is capable of solving a classification problem with feature space heterogeneity in an incremental way, which is favorable for online classification tasks with continuously changing data. Experimental results on a series of data sets and application to a database marketing problem show the efficiency and effectiveness of the proposed approach.
Density Based Support Vector Machines for Classification

OpenAIRE

Zahra Nazari; Dongshik Kang

2015-01-01

Support Vector Machines (SVM) is the most successful algorithm for classification problems. SVM learns the decision boundary from two classes (for Binary Classification) of training points. However, sometimes there are some less meaningful samples amongst training points, which are corrupted by noises or misplaced in wrong side, called outliers. These outliers are affecting on margin and classification performance, and machine should better to discard them. SVM as a popular and widely used cl...
The Problem of Classification of Rumours: Peculliarities of Cultural Rumours

Directory of Open Access Journals (Sweden)

Valdas Pruskus

2011-04-01

Full Text Available The paper analyses classification of rumours. There were many attempts to classify rumours using different criteria. Some authors (A. Dmitrijev 1995 classify them in accordance with three main spheres of social life where they function: political, economic and ideological rumours. However, such a classification is rather conditional, thus it will always seem to be roughcast.Other authors (P. Sorokin 1991 classify rumours in accordance to social elements of interaction systems: the quality and quantity of communicating (interacting individuals, type of interaction and character of information conveyors. This classification seems to be more sociological because it enables to identify which groups spread rumours and how they do it. However, this classification does not mention other important things: the content of a rumour, its relation to reality and so on. The American sociologists W. A. Peterson and N. P. Gist (1951 classify rumours into types according to their content (political, economic, etc., time orientation (explaining past, predictive or foretelling, origin (spontaneous, purposive or relation with reality (rational, fantastic. So there is not one classification of rumours. Partially it is conditioned by a multiple nature of rumours. On the other hand, it is important not only to classify rumours but also to have some mechanism which is able to reveal functioning peculiarities of a particular rumour.The author of this study supposes that every rumour despite its topics has certain features which can be set using a particular system of criteria. There are ten criteria which can describe a rumour and name the peculiarities of its functioning and spread.They help to define any rumour and include the social actualness of a rumour (actual and unreal, the purpose of a rumour (popular, unpopular, the nature of a rumour (malicious or entertaining (jokes, the depth of a rumour (superficial or deep, the supplier (author of a rumour (known
Handling Imbalanced Data Sets in Multistage Classification

Science.gov (United States)

López, M.

Multistage classification is a logical approach, based on a divide-and-conquer solution, for dealing with problems with a high number of classes. The classification problem is divided into several sequential steps, each one associated to a single classifier that works with subgroups of the original classes. In each level, the current set of classes is split into smaller subgroups of classes until they (the subgroups) are composed of only one class. The resulting chain of classifiers can be represented as a tree, which (1) simplifies the classification process by using fewer categories in each classifier and (2) makes it possible to combine several algorithms or use different attributes in each stage. Most of the classification algorithms can be biased in the sense of selecting the most populated class in overlapping areas of the input space. This can degrade a multistage classifier performance if the training set sample frequencies do not reflect the real prevalence in the population. Several techniques such as applying prior probabilities, assigning weights to the classes, or replicating instances have been developed to overcome this handicap. Most of them are designed for two-class (accept-reject) problems. In this article, we evaluate several of these techniques as applied to multistage classification and analyze how they can be useful for astronomy. We compare the results obtained by classifying a data set based on Hipparcos with and without these methods.

Parallel geometric classification of stem cells by their three-dimensional morphology

International Nuclear Information System (INIS)

Juba, Derek; Cardone, Antonio; Yiu Ip, Cheuk; Varshney, Amitabh; Simon Jr, Carl G; K Tison, Christopher; Kumar, Girish; Brady, Mary

2013-01-01

There is a need for tools to classify cells based on their three-dimensional (3D) shape. Cells exist in vivo in 3D, cells are frequently cultured within 3D scaffolds in vitro and 3D scaffolds are used for cell delivery in tissue engineering therapies. Recent work indicates that the physical structure of a tissue engineering scaffold can direct stem cell function by driving stem cells into morphologies that induce their differentiation. Thus, we have developed a rapid method for classifying cells based on their 3D shape. First, random lines are intersected with 3D Z-stacks of confocal images of stem cells. The intersection lengths are stored in histograms, which are then used to train a support vector machine (SVM) learning algorithm to distinguish between stem cells cultured on differentiation-inducing 3D scaffolds and those cultured on non-differentiating flat substrates. The trained SVM is able to properly classify the ‘new’ query cells over 80% of the time. The algorithm is easily parallelizable and we demonstrate its implementation on a commodity graphics processing unit (GPU). Use of a GPU to run the algorithm increases throughput by over 100-fold as compared to use of a CPU. The algorithm is also progressive, providing an approximate answer quickly and refining the answer over time. This allows further increase in the throughput of the algorithm by allowing the SVM classification scheme to terminate early if it becomes confident enough of the class of the cell being analyzed. These results demonstrate a rapid method for classifying stem cells based on their 3D shape that can be used by tissue engineers for identifying 3D tissue scaffold structures that drive stem cells into shapes that correlate with differentiation. (paper)
Classifications of track structures

International Nuclear Information System (INIS)

Paretzke, H.G.

1984-01-01

When ionizing particles interact with matter they produce random topological structures of primary activations which represent the initial boundary conditions for all subsequent physical, chemical and/or biological reactions. There are two important aspects of research on such track structures, namely their experimental or theoretical determination on one hand and the quantitative classification of these complex structures which is a basic pre-requisite for the understanding of mechanisms of radiation actions. This paper deals only with the latter topic, i.e. the problems encountered in and possible approaches to quantitative ordering and grouping of these multidimensional objects by their degrees of similarity with respect to their efficiency in producing certain final radiation effects, i.e. to their ''radiation quality.'' Various attempts of taxonometric classification with respect to radiation efficiency have been made in basic and applied radiation research including macro- and microdosimetric concepts as well as track entities and stopping power based theories. In this paper no review of those well-known approaches is given but rather an outline and discussion of alternative methods new to this field of radiation research which have some very promising features and which could possibly solve at least some major classification problems
Feasibility study of stain-free classification of cell apoptosis based on diffraction imaging flow cytometry and supervised machine learning techniques.

Science.gov (United States)

Feng, Jingwen; Feng, Tong; Yang, Chengwen; Wang, Wei; Sa, Yu; Feng, Yuanming

2018-06-01

This study was to explore the feasibility of prediction and classification of cells in different stages of apoptosis with a stain-free method based on diffraction images and supervised machine learning. Apoptosis was induced in human chronic myelogenous leukemia K562 cells by cis-platinum (DDP). A newly developed technique of polarization diffraction imaging flow cytometry (p-DIFC) was performed to acquire diffraction images of the cells in three different statuses (viable, early apoptotic and late apoptotic/necrotic) after cell separation through fluorescence activated cell sorting with Annexin V-PE and SYTOX® Green double staining. The texture features of the diffraction images were extracted with in-house software based on the Gray-level co-occurrence matrix algorithm to generate datasets for cell classification with supervised machine learning method. Therefore, this new method has been verified in hydrogen peroxide induced apoptosis model of HL-60. Results show that accuracy of higher than 90% was achieved respectively in independent test datasets from each cell type based on logistic regression with ridge estimators, which indicated that p-DIFC system has a great potential in predicting and classifying cells in different stages of apoptosis.
Sparse Representation Based Multi-Instance Learning for Breast Ultrasound Image Classification

Directory of Open Access Journals (Sweden)

Lu Bing

2017-01-01

Full Text Available We propose a novel method based on sparse representation for breast ultrasound image classification under the framework of multi-instance learning (MIL. After image enhancement and segmentation, concentric circle is used to extract the global and local features for improving the accuracy in diagnosis and prediction. The classification problem of ultrasound image is converted to sparse representation based MIL problem. Each instance of a bag is represented as a sparse linear combination of all basis vectors in the dictionary, and then the bag is represented by one feature vector which is obtained via sparse representations of all instances within the bag. The sparse and MIL problem is further converted to a conventional learning problem that is solved by relevance vector machine (RVM. Results of single classifiers are combined to be used for classification. Experimental results on the breast cancer datasets demonstrate the superiority of the proposed method in terms of classification accuracy as compared with state-of-the-art MIL methods.
Sparse Representation Based Multi-Instance Learning for Breast Ultrasound Image Classification.

Science.gov (United States)

Bing, Lu; Wang, Wei

2017-01-01

We propose a novel method based on sparse representation for breast ultrasound image classification under the framework of multi-instance learning (MIL). After image enhancement and segmentation, concentric circle is used to extract the global and local features for improving the accuracy in diagnosis and prediction. The classification problem of ultrasound image is converted to sparse representation based MIL problem. Each instance of a bag is represented as a sparse linear combination of all basis vectors in the dictionary, and then the bag is represented by one feature vector which is obtained via sparse representations of all instances within the bag. The sparse and MIL problem is further converted to a conventional learning problem that is solved by relevance vector machine (RVM). Results of single classifiers are combined to be used for classification. Experimental results on the breast cancer datasets demonstrate the superiority of the proposed method in terms of classification accuracy as compared with state-of-the-art MIL methods.
Cancer cell detection and classification using transformation invariant template learning methods

International Nuclear Information System (INIS)

Talware, Rajendra; Abhyankar, Aditya

2011-01-01

In traditional cancer cell detection, pathologists examine biopsies to make diagnostic assessments, largely based on cell morphology and tissue distribution. The process of image acquisition is very much subjective and the pattern undergoes unknown or random transformations during data acquisition (e.g. variation in illumination, orientation, translation and perspective) results in high degree of variability. Transformed Component Analysis (TCA) incorporates a discrete, hidden variable that accounts for transformations and uses the Expectation Maximization (EM) algorithm to jointly extract components and normalize for transformations. Further the TEMPLAR framework developed takes advantage of hierarchical pattern models and adds probabilistic modeling for local transformations. Pattern classification is based on Expectation Maximization algorithm and General Likelihood Ratio Tests (GLRT). Performance of TEMPLAR is certainly improved by defining area of interest on slide a priori. Performance can be further enhanced by making the kernel function adaptive during learning. (author)
Classification of Hydrogels Based on Their Source: A Review and Application in Stem Cell Regulation

Science.gov (United States)

Khansari, Maziyar M.; Sorokina, Lioudmila V.; Mukherjee, Prithviraj; Mukhtar, Farrukh; Shirdar, Mostafa Rezazadeh; Shahidi, Mahnaz; Shokuhfar, Tolou

2017-08-01

Stem cells are recognized by their self-renewal ability and can give rise to specialized progeny. Hydrogels are an established class of biomaterials with the ability to control stem cell fate via mechanotransduction. They can mimic various physiological conditions to influence the fate of stem cells and are an ideal platform to support stem cell regulation. This review article provides a summary of recent advances in the application of different classes of hydrogels based on their source (e.g., natural, synthetic, or hybrid). This classification is important because the chemistry of substrate affects stem cell differentiation and proliferation. Natural and synthetic hydrogels have been widely used in stem cell regulation. Nevertheless, they have limitations that necessitate a new class of material. Hybrid hydrogels obtained by manipulation of the natural and synthetic ones can potentially overcome these limitations and shape the future of research in application of hydrogels in stem cell regulation.
RESEARCH OF CLASSIFICATION FEATURES OF THE FINANCIAL CONTROL

Directory of Open Access Journals (Sweden)

Knarik K. Arabyan

2013-01-01

Full Text Available One of the major problems is an improvement of classification features in the financial control theory. There is not a consensus concerning the form classification and the methods of financial control. This factor hinders the development of methodology and investigation of other issues of the financial control theory. The author summarizes scientists’ approaches to studying the classification features of financial control in the article.
Adjacent-cell Preconditioners for solving optically thick neutron transport problems

International Nuclear Information System (INIS)

Azmy, Y.Y.

1994-01-01

We develop, analyze, and test a new acceleration scheme for neutron transport methods, the Adjacent-cell Preconditioner (AP) that is particularly suited for solving optically thick problems. Our method goes beyond Diffusion Synthetic Acceleration (DSA) methods in that it's spectral radius vanishes with increasing cell thickness. In particular, for the ID case the AP method converges immediately, i.e. in one iteration, to 10 -4 pointwise relative criterion in problems with dominant cell size of 10 mfp or thicker. Also the AP has a simple formalism and is cell-centered hence, multidimensional and high order extensions are easier to develop, and more efficient to implement
Polarimetric SAR image classification based on discriminative dictionary learning model

Science.gov (United States)

Sang, Cheng Wei; Sun, Hong

2018-03-01

Polarimetric SAR (PolSAR) image classification is one of the important applications of PolSAR remote sensing. It is a difficult high-dimension nonlinear mapping problem, the sparse representations based on learning overcomplete dictionary have shown great potential to solve such problem. The overcomplete dictionary plays an important role in PolSAR image classification, however for PolSAR image complex scenes, features shared by different classes will weaken the discrimination of learned dictionary, so as to degrade classification performance. In this paper, we propose a novel overcomplete dictionary learning model to enhance the discrimination of dictionary. The learned overcomplete dictionary by the proposed model is more discriminative and very suitable for PolSAR classification.
Efficient Generation and Selection of Combined Features for Improved Classification

KAUST Repository

Shono, Ahmad N.

2014-05-01

This study contributes a methodology and associated toolkit developed to allow users to experiment with the use of combined features in classification problems. Methods are provided for efficiently generating combined features from an original feature set, for efficiently selecting the most discriminating of these generated combined features, and for efficiently performing a preliminary comparison of the classification results when using the original features exclusively against the results when using the selected combined features. The potential benefit of considering combined features in classification problems is demonstrated by applying the developed methodology and toolkit to three sample data sets where the discovery of combined features containing new discriminating information led to improved classification results.
Deep Multi-Task Learning for Tree Genera Classification

Science.gov (United States)

Ko, C.; Kang, J.; Sohn, G.

2018-05-01

The goal for our paper is to classify tree genera using airborne Light Detection and Ranging (LiDAR) data with Convolution Neural Network (CNN) - Multi-task Network (MTN) implementation. Unlike Single-task Network (STN) where only one task is assigned to the learning outcome, MTN is a deep learning architect for learning a main task (classification of tree genera) with other tasks (in our study, classification of coniferous and deciduous) simultaneously, with shared classification features. The main contribution of this paper is to improve classification accuracy from CNN-STN to CNN-MTN. This is achieved by introducing a concurrence loss (Lcd) to the designed MTN. This term regulates the overall network performance by minimizing the inconsistencies between the two tasks. Results show that we can increase the classification accuracy from 88.7 % to 91.0 % (from STN to MTN). The second goal of this paper is to solve the problem of small training sample size by multiple-view data generation. The motivation of this goal is to address one of the most common problems in implementing deep learning architecture, the insufficient number of training data. We address this problem by simulating training dataset with multiple-view approach. The promising results from this paper are providing a basis for classifying a larger number of dataset and number of classes in the future.
How to solve mathematical problems

CERN Document Server

Wickelgren, Wayne A

1995-01-01

Seven problem-solving techniques include inference, classification of action sequences, subgoals, contradiction, working backward, relations between problems, and mathematical representation. Also, problems from mathematics, science, and engineering with complete solutions.
The 2015 World Health Organization Classification of Lung Tumors: Impact of Genetic, Clinical and Radiologic Advances Since the 2004 Classification.

Science.gov (United States)

Travis, William D; Brambilla, Elisabeth; Nicholson, Andrew G; Yatabe, Yasushi; Austin, John H M; Beasley, Mary Beth; Chirieac, Lucian R; Dacic, Sanja; Duhig, Edwina; Flieder, Douglas B; Geisinger, Kim; Hirsch, Fred R; Ishikawa, Yuichi; Kerr, Keith M; Noguchi, Masayuki; Pelosi, Giuseppe; Powell, Charles A; Tsao, Ming Sound; Wistuba, Ignacio

2015-09-01

The 2015 World Health Organization (WHO) Classification of Tumors of the Lung, Pleura, Thymus and Heart has just been published with numerous important changes from the 2004 WHO classification. The most significant changes in this edition involve (1) use of immunohistochemistry throughout the classification, (2) a new emphasis on genetic studies, in particular, integration of molecular testing to help personalize treatment strategies for advanced lung cancer patients, (3) a new classification for small biopsies and cytology similar to that proposed in the 2011 Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society classification, (4) a completely different approach to lung adenocarcinoma as proposed by the 2011 Association for the Study of Lung Cancer/American Thoracic Society/European Respiratory Society classification, (5) restricting the diagnosis of large cell carcinoma only to resected tumors that lack any clear morphologic or immunohistochemical differentiation with reclassification of the remaining former large cell carcinoma subtypes into different categories, (6) reclassifying squamous cell carcinomas into keratinizing, nonkeratinizing, and basaloid subtypes with the nonkeratinizing tumors requiring immunohistochemistry proof of squamous differentiation, (7) grouping of neuroendocrine tumors together in one category, (8) adding NUT carcinoma, (9) changing the term sclerosing hemangioma to sclerosing pneumocytoma, (10) changing the name hamartoma to "pulmonary hamartoma," (11) creating a group of PEComatous tumors that include (a) lymphangioleiomyomatosis, (b) PEComa, benign (with clear cell tumor as a variant) and (c) PEComa, malignant, (12) introducing the entity pulmonary myxoid sarcoma with an EWSR1-CREB1 translocation, (13) adding the entities myoepithelioma and myoepithelial carcinomas, which can show EWSR1 gene rearrangements, (14) recognition of usefulness of WWTR1-CAMTA1 fusions in diagnosis of epithelioid
Renal cell carcinoma: histological classification and correlation with imaging findings

Energy Technology Data Exchange (ETDEWEB)

Muglia, Valdair F., E-mail: fmuglia@fmrp.usp.br [Universidade de Sao Paulo (CCIFM/FMRP/USP), Ribeirao Preto, SP (Brazil). Centro de Ciencias das Imagens e Fisica Medica. Faculdade de Medicina; Prando, Adilson [Universidade Estadual de Campinas (UNICAMP), SP (Brazil); Hospital Vera Cruz, Campinas, SP (Brazil). Dept. de Imaginologia

2015-05-15

Renal cell carcinoma (RCC) is the seventh most common histological type of cancer in the Western world and has shown a sustained increase in its prevalence. The histological classification of RCCs is of utmost importance, considering the significant prognostic and therapeutic implications of its histological subtypes. Imaging methods play an outstanding role in the diagnosis, staging and follow-up of RCC. Clear cell, papillary and chromophobe are the most common histological subtypes of RCC, and their preoperative radiological characterization, either followed or not by confirmatory percutaneous biopsy, may be particularly useful in cases of poor surgical condition, metastatic disease, central mass in a solitary kidney, and in patients eligible for molecular targeted therapy. New strategies recently developed for treating renal cancer, such as cryo and radiofrequency ablation, molecularly targeted therapy and active surveillance also require appropriate preoperative characterization of renal masses. Less common histological types, although sharing nonspecific imaging features, may be suspected on the basis of clinical and epidemiological data. The present study is aimed at reviewing the main clinical and imaging findings of histological RCC subtypes. (author)
GMDH-Based Semi-Supervised Feature Selection for Electricity Load Classification Forecasting

Directory of Open Access Journals (Sweden)

Lintao Yang

2018-01-01

Full Text Available With the development of smart power grids, communication network technology and sensor technology, there has been an exponential growth in complex electricity load data. Irregular electricity load fluctuations caused by the weather and holiday factors disrupt the daily operation of the power companies. To deal with these challenges, this paper investigates a day-ahead electricity peak load interval forecasting problem. It transforms the conventional continuous forecasting problem into a novel interval forecasting problem, and then further converts the interval forecasting problem into the classification forecasting problem. In addition, an indicator system influencing the electricity load is established from three dimensions, namely the load series, calendar data, and weather data. A semi-supervised feature selection algorithm is proposed to address an electricity load classification forecasting issue based on the group method of data handling (GMDH technology. The proposed algorithm consists of three main stages: (1 training the basic classifier; (2 selectively marking the most suitable samples from the unclassified label data, and adding them to an initial training set; and (3 training the classification models on the final training set and classifying the test samples. An empirical analysis of electricity load dataset from four Chinese cities is conducted. Results show that the proposed model can address the electricity load classification forecasting problem more efficiently and effectively than the FW-Semi FS (forward semi-supervised feature selection and GMDH-U (GMDH-based semi-supervised feature selection for customer classification models.
The brain MRI classification problem from wavelets perspective

Science.gov (United States)

Bendib, Mohamed M.; Merouani, Hayet F.; Diaba, Fatma

2015-02-01

Haar and Daubechies 4 (DB4) are the most used wavelets for brain MRI (Magnetic Resonance Imaging) classification. The former is simple and fast to compute while the latter is more complex and offers a better resolution. This paper explores the potential of both of them in performing Normal versus Pathological discrimination on the one hand, and Multiclassification on the other hand. The Whole Brain Atlas is used as a validation database, and the Random Forest (RF) algorithm is employed as a learning approach. The achieved results are discussed and statistically compared.
CLASS-PAIR-GUIDED MULTIPLE KERNEL LEARNING OF INTEGRATING HETEROGENEOUS FEATURES FOR CLASSIFICATION

Directory of Open Access Journals (Sweden)

Q. Wang

2017-10-01

Full Text Available In recent years, many studies on remote sensing image classification have shown that using multiple features from different data sources can effectively improve the classification accuracy. As a very powerful means of learning, multiple kernel learning (MKL can conveniently be embedded in a variety of characteristics. The conventional combined kernel learned by MKL can be regarded as the compromise of all basic kernels for all classes in classification. It is the best of the whole, but not optimal for each specific class. For this problem, this paper proposes a class-pair-guided MKL method to integrate the heterogeneous features (HFs from multispectral image (MSI and light detection and ranging (LiDAR data. In particular, the one-against-one strategy is adopted, which converts multiclass classification problem to a plurality of two-class classification problem. Then, we select the best kernel from pre-constructed basic kernels set for each class-pair by kernel alignment (KA in the process of classification. The advantage of the proposed method is that only the best kernel for the classification of any two classes can be retained, which leads to greatly enhanced discriminability. Experiments are conducted on two real data sets, and the experimental results show that the proposed method achieves the best performance in terms of classification accuracies in integrating the HFs for classification when compared with several state-of-the-art algorithms.
HIV classification using coalescent theory

Energy Technology Data Exchange (ETDEWEB)

Zhang, Ming [Los Alamos National Laboratory; Letiner, Thomas K [Los Alamos National Laboratory; Korber, Bette T [Los Alamos National Laboratory

2008-01-01

Algorithms for subtype classification and breakpoint detection of HIV-I sequences are based on a classification system of HIV-l. Hence, their quality highly depend on this system. Due to the history of creation of the current HIV-I nomenclature, the current one contains inconsistencies like: The phylogenetic distance between the subtype B and D is remarkably small compared with other pairs of subtypes. In fact, it is more like the distance of a pair of subsubtypes Robertson et al. (2000); Subtypes E and I do not exist any more since they were discovered to be composed of recombinants Robertson et al. (2000); It is currently discussed whether -- instead of CRF02 being a recombinant of subtype A and G -- subtype G should be designated as a circulating recombination form (CRF) nd CRF02 as a subtype Abecasis et al. (2007); There are 8 complete and over 400 partial HIV genomes in the LANL-database which belong neither to a subtype nor to a CRF (denoted by U). Moreover, the current classification system is somehow arbitrary like all complex classification systems that were created manually. To this end, it is desirable to deduce the classification system of HIV systematically by an algorithm. Of course, this problem is not restricted to HIV, but applies to all fast mutating and recombining viruses. Our work addresses the simpler subproblem to score classifications of given input sequences of some virus species (classification denotes a partition of the input sequences in several subtypes and CRFs). To this end, we reconstruct ancestral recombination graphs (ARG) of the input sequences under restrictions determined by the given classification. These restritions are imposed in order to ensure that the reconstructed ARGs do not contradict the classification under consideration. Then, we find the ARG with maximal probability by means of Markov Chain Monte Carlo methods. The probability of the most probable ARG is interpreted as a score for the classification. To our
The Periodic Table and the Philosophy of Classification

DEFF Research Database (Denmark)

Hjørland, Birger

2011-01-01

This paper discusses some problems in the philosophy of classification based on a discussion of the periodic system of chemistry and physics. The emerging interdisciplinary field ‘philosophy of classification’ is briefly introduced and related to the field of knowledge organization (KO) within...... Library and Information Science (LIS). It is argued that KO needs to be better integrated with the broader field of classification theory and research. The paper considers some core issues such as whether classifications are pragmatic human tools or neutral reflections of nature, how classifications...

Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning

Directory of Open Access Journals (Sweden)

Tanel Pärnamaa

2017-05-01

Full Text Available High-throughput microscopy of many single cells generates high-dimensional data that are far from straightforward to analyze. One important problem is automatically detecting the cellular compartment where a fluorescently-tagged protein resides, a task relatively simple for an experienced human, but difficult to automate on a computer. Here, we train an 11-layer neural network on data from mapping thousands of yeast proteins, achieving per cell localization classification accuracy of 91%, and per protein accuracy of 99% on held-out images. We confirm that low-level network features correspond to basic image characteristics, while deeper layers separate localization classes. Using this network as a feature calculator, we train standard classifiers that assign proteins to previously unseen compartments after observing only a small number of training examples. Our results are the most accurate subcellular localization classifications to date, and demonstrate the usefulness of deep learning for high-throughput microscopy.
Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning.

Science.gov (United States)

Pärnamaa, Tanel; Parts, Leopold

2017-05-05

High-throughput microscopy of many single cells generates high-dimensional data that are far from straightforward to analyze. One important problem is automatically detecting the cellular compartment where a fluorescently-tagged protein resides, a task relatively simple for an experienced human, but difficult to automate on a computer. Here, we train an 11-layer neural network on data from mapping thousands of yeast proteins, achieving per cell localization classification accuracy of 91%, and per protein accuracy of 99% on held-out images. We confirm that low-level network features correspond to basic image characteristics, while deeper layers separate localization classes. Using this network as a feature calculator, we train standard classifiers that assign proteins to previously unseen compartments after observing only a small number of training examples. Our results are the most accurate subcellular localization classifications to date, and demonstrate the usefulness of deep learning for high-throughput microscopy. Copyright © 2017 Parnamaa and Parts.
Taxonomies of Educational Objectives and Theories of Classification.

Science.gov (United States)

Travers, Robert M. W.

1980-01-01

Classification is the taxonomic science in which a system of categories is established and in which the categories have some logical structure. Scientific classifications have included those by Aristotle, Linnaeus, and Lavoisier. Educational taxonomies include those developed by Bloom, Herbart, Dewey, and Piaget. The problems of taxonomy…
A comparative evaluation of sequence classification programs

Directory of Open Access Journals (Sweden)

Bazinet Adam L

2012-05-01

Full Text Available Abstract Background A fundamental problem in modern genomics is to taxonomically or functionally classify DNA sequence fragments derived from environmental sampling (i.e., metagenomics. Several different methods have been proposed for doing this effectively and efficiently, and many have been implemented in software. In addition to varying their basic algorithmic approach to classification, some methods screen sequence reads for ’barcoding genes’ like 16S rRNA, or various types of protein-coding genes. Due to the sheer number and complexity of methods, it can be difficult for a researcher to choose one that is well-suited for a particular analysis. Results We divided the very large number of programs that have been released in recent years for solving the sequence classification problem into three main categories based on the general algorithm they use to compare a query sequence against a database of sequences. We also evaluated the performance of the leading programs in each category on data sets whose taxonomic and functional composition is known. Conclusions We found significant variability in classification accuracy, precision, and resource consumption of sequence classification programs when used to analyze various metagenomics data sets. However, we observe some general trends and patterns that will be useful to researchers who use sequence classification programs.
[Landscape classification: research progress and development trend].

Science.gov (United States)

Liang, Fa-Chao; Liu, Li-Ming

2011-06-01

Landscape classification is the basis of the researches on landscape structure, process, and function, and also, the prerequisite for landscape evaluation, planning, protection, and management, directly affecting the precision and practicability of landscape research. This paper reviewed the research progress on the landscape classification system, theory, and methodology, and summarized the key problems and deficiencies of current researches. Some major landscape classification systems, e. g. , LANMAP and MUFIC, were introduced and discussed. It was suggested that a qualitative and quantitative comprehensive classification based on the ideology of functional structure shape and on the integral consideration of landscape classification utility, landscape function, landscape structure, physiogeographical factors, and human disturbance intensity should be the major research directions in the future. The integration of mapping, 3S technology, quantitative mathematics modeling, computer artificial intelligence, and professional knowledge to enhance the precision of landscape classification would be the key issues and the development trend in the researches of landscape classification.
Improving Student Question Classification

Science.gov (United States)

Heiner, Cecily; Zachary, Joseph L.

2009-01-01

Students in introductory programming classes often articulate their questions and information needs incompletely. Consequently, the automatic classification of student questions to provide automated tutorial responses is a challenging problem. This paper analyzes 411 questions from an introductory Java programming course by reducing the natural…
Joint Feature Selection and Classification for Multilabel Learning.

Science.gov (United States)

Huang, Jun; Li, Guorong; Huang, Qingming; Wu, Xindong

2018-03-01

Multilabel learning deals with examples having multiple class labels simultaneously. It has been applied to a variety of applications, such as text categorization and image annotation. A large number of algorithms have been proposed for multilabel learning, most of which concentrate on multilabel classification problems and only a few of them are feature selection algorithms. Current multilabel classification models are mainly built on a single data representation composed of all the features which are shared by all the class labels. Since each class label might be decided by some specific features of its own, and the problems of classification and feature selection are often addressed independently, in this paper, we propose a novel method which can perform joint feature selection and classification for multilabel learning, named JFSC. Different from many existing methods, JFSC learns both shared features and label-specific features by considering pairwise label correlations, and builds the multilabel classifier on the learned low-dimensional data representations simultaneously. A comparative study with state-of-the-art approaches manifests a competitive performance of our proposed method both in classification and feature selection for multilabel learning.
Minimum Error Entropy Classification

CERN Document Server

Marques de Sá, Joaquim P; Santos, Jorge M F; Alexandre, Luís A

2013-01-01

This book explains the minimum error entropy (MEE) concept applied to data classification machines. Theoretical results on the inner workings of the MEE concept, in its application to solving a variety of classification problems, are presented in the wider realm of risk functionals. Researchers and practitioners also find in the book a detailed presentation of practical data classifiers using MEE. These include multi‐layer perceptrons, recurrent neural networks, complexvalued neural networks, modular neural networks, and decision trees. A clustering algorithm using a MEE‐like concept is also presented. Examples, tests, evaluation experiments and comparison with similar machines using classic approaches, complement the descriptions.
Scientific and General Subject Classifications in the Digital World

CERN Document Server

De Robbio, Antonella; Marini, A

2001-01-01

In the present work we discuss opportunities, problems, tools and techniques encountered when interconnecting discipline-specific subject classifications, primarily organized as search devices in bibliographic databases, with general classifications originally devised for book shelving in public libraries. We first state the fundamental distinction between topical (or subject) classifications and object classifications. Then we trace the structural limitations that have constrained subject classifications since their library origins, and the devices that were used to overcome the gap with genuine knowledge representation. After recalling some general notions on structure, dynamics and interferences of subject classifications and of the objects they refer to, we sketch a synthetic overview on discipline-specific classifications in Mathematics, Computing and Physics, on one hand, and on general classifications on the other. In this setting we present The Scientific Classifications Page, which collects groups of...
Software support for irregular and loosely synchronous problems

Science.gov (United States)

Choudhary, A.; Fox, G.; Hiranandani, S.; Kennedy, K.; Koelbel, C.; Ranka, S.; Saltz, J.

1992-01-01

A large class of scientific and engineering applications may be classified as irregular and loosely synchronous from the perspective of parallel processing. We present a partial classification of such problems. This classification has motivated us to enhance FORTRAN D to provide language support for irregular, loosely synchronous problems. We present techniques for parallelization of such problems in the context of FORTRAN D.
Activation analysis. A basis for chemical similarity and classification

Energy Technology Data Exchange (ETDEWEB)

Beeck, J OP de [Ghent Rijksuniversiteit (Belgium). Instituut voor Kernwetenschappen

1977-01-01

It is shown that activation analysis is especially suited to serve as a basis for determining the chemical similarity between samples defined by their trace-element concentration patterns. The general problem of classification and identification is discussed. The nature of possible classification structures and their appropriate clustering strategies is considered. A practical computer method is suggested and its application as well as the graphical representation of classification results are given. The possibility for classification using information theory is mentioned. Classification of chemical elements is discussed and practically realized after Hadamard transformation of the concentration variation patterns in a series of samples.
Inter Genre Similarity Modelling For Automatic Music Genre Classification

OpenAIRE

Bagci, Ulas; Erzin, Engin

2009-01-01

Music genre classification is an essential tool for music information retrieval systems and it has been finding critical applications in various media platforms. Two important problems of the automatic music genre classification are feature extraction and classifier design. This paper investigates inter-genre similarity modelling (IGS) to improve the performance of automatic music genre classification. Inter-genre similarity information is extracted over the mis-classified feature population....
Research on the re-establishment of the classification criteria of strategic items

Energy Technology Data Exchange (ETDEWEB)

Han, Seong Mi; Yang, Seunghyo; Shin, Dong Hoon [Korea Institute of Nuclear Nonproliferation and Control, Daejeon (Korea, Republic of)

2014-05-15

According to these export control laws and regulations, the exporters have to apply the review for classification and export licensing to their own government. In this process, a technical review institute such as Korea Institute of Nuclear Nonproliferation and Control (institute under the NSSC) are referring to Minister's Regulation for the Export and Import of Strategic Goods. In this regulation, there are many criteria to classify the strategic items to be exported. But there are some problems in these criteria. At Typical problem is that classification criteria of Trigger List Items generally is very qualitative and very obscure in contrast with Dual Use Items. So, in most cases, this characteristics of classification criteria of trigger list items have caused much trouble for stakeholders such as government and nuclear related companies. So, there were needs that the classification criteria had to be more correct, obvious and objective. To solve these problems, the past classification cases for technology were re-analyzed and the general criteria were deducted in this study. Previously mentioned, the classification process and criteria were very qualitative and very obscure for the Trigger List Items. So, the re-establishment of the classification criteria was done to solve these problems in this study. Each extracted results were shown in Tables I and II. This re-established criteria are expected to contribute to quantification, disambiguation and objectification of the classification review process. As the future works, we will establish the probability or numerical factor for the extracted criteria through statistical surveys, to make better use of these criteria. And we will push ahead with the NSSC approval to use as the classification guidelines of the trigger list items in review processes.
Saving our science from ourselves: the plight of biological classification

Directory of Open Access Journals (Sweden)

Malte C. Ebach

2011-06-01

Full Text Available Saving our science from ourselves: the plight of biological classification. Biological classification ( nomenclature, taxonomy, and systematics is being sold short. The desire for new technologies, faster and cheaper taxonomic descriptions, identifications, and revisions is symptomatic of a lack of appreciation and understanding of classification. The problem of gadget-driven science, a lack of best practice and the inability to accept classification as a descriptive and empirical science are discussed. The worst cases scenario is a future in which classifications are purely artificial and uninformative.
Effectiveness of Multivariate Time Series Classification Using Shapelets

Directory of Open Access Journals (Sweden)

A. P. Karpenko

2015-01-01

Full Text Available Typically, time series classifiers require signal pre-processing (filtering signals from noise and artifact removal, etc., enhancement of signal features (amplitude, frequency, spectrum, etc., classification of signal features in space using the classical techniques and classification algorithms of multivariate data. We consider a method of classifying time series, which does not require enhancement of the signal features. The method uses the shapelets of time series (time series shapelets i.e. small fragments of this series, which reflect properties of one of its classes most of all.Despite the significant number of publications on the theory and shapelet applications for classification of time series, the task to evaluate the effectiveness of this technique remains relevant. An objective of this publication is to study the effectiveness of a number of modifications of the original shapelet method as applied to the multivariate series classification that is a littlestudied problem. The paper presents the problem statement of multivariate time series classification using the shapelets and describes the shapelet–based basic method of binary classification, as well as various generalizations and proposed modification of the method. It also offers the software that implements a modified method and results of computational experiments confirming the effectiveness of the algorithmic and software solutions.The paper shows that the modified method and the software to use it allow us to reach the classification accuracy of about 85%, at best. The shapelet search time increases in proportion to input data dimension.
Reducing the Complexity of Genetic Fuzzy Classifiers in Highly-Dimensional Classification Problems

Directory of Open Access Journals (Sweden)

DimitrisG. Stavrakoudis

2012-04-01

Full Text Available This paper introduces the Fast Iterative Rule-based Linguistic Classifier (FaIRLiC, a Genetic Fuzzy Rule-Based Classification System (GFRBCS which targets at reducing the structural complexity of the resulting rule base, as well as its learning algorithm's computational requirements, especially when dealing with high-dimensional feature spaces. The proposed methodology follows the principles of the iterative rule learning (IRL approach, whereby a rule extraction algorithm (REA is invoked in an iterative fashion, producing one fuzzy rule at a time. The REA is performed in two successive steps: the first one selects the relevant features of the currently extracted rule, whereas the second one decides the antecedent part of the fuzzy rule, using the previously selected subset of features. The performance of the classifier is finally optimized through a genetic tuning post-processing stage. Comparative results in a hyperspectral remote sensing classification as well as in 12 real-world classification datasets indicate the effectiveness of the proposed methodology in generating high-performing and compact fuzzy rule-based classifiers, even for very high-dimensional feature spaces.
Proposals for Paraphilic Disorders in the International Classification of Diseases and Related Health Problems, Eleventh Revision (ICD-11).

Science.gov (United States)

Krueger, Richard B; Reed, Geoffrey M; First, Michael B; Marais, Adele; Kismodi, Eszter; Briken, Peer

2017-07-01

The World Health Organization is currently developing the 11th revision of the International Classifications of Diseases and Related Health Problems (ICD-11), with approval of the ICD-11 by the World Health Assembly anticipated in 2018. The Working Group on the Classification of Sexual Disorders and Sexual Health (WGSDSH) was created and charged with reviewing and making recommendations for categories related to sexuality that are contained in the chapter of Mental and Behavioural Disorders in ICD-10 (World Health Organization 1992a). Among these categories was the ICD-10 grouping F65, Disorders of sexual preference, which describes conditions now widely referred to as Paraphilic Disorders. This article reviews the evidence base, rationale, and recommendations for the proposed revisions in this area for ICD-11 and compares them with DSM-5. The WGSDSH recommended that the grouping, Disorders of sexual preference, be renamed to Paraphilic Disorders and be limited to disorders that involve sexual arousal patterns that focus on non-consenting others or are associated with substantial distress or direct risk of injury or death. Consistent with this framework, the WGSDSH also recommended that the ICD-10 categories of Fetishism, Fetishistic Transvestism, and Sadomasochism be removed from the classification and new categories of Coercive Sexual Sadism Disorder, Frotteuristic Disorder, Other Paraphilic Disorder Involving Non-Consenting Individuals, and Other Paraphilic Disorder Involving Solitary Behaviour or Consenting Individuals be added. The WGSDSH's proposals for Paraphilic Disorders in ICD-11 are based on the WHO's role as a global public health agency and the ICD's function as a public health reporting tool.
solving the cell formation problem in group technology

Directory of Open Access Journals (Sweden)

Prafulla Joglekar

2001-01-01

Full Text Available Over the last three decades, numerous algorithms have been proposed to solve the work-cell formation problem. For practicing manufacturing managers it would be nice to know as to which algorithm would be most effective and efficient for their specific situation. While several studies have attempted to fulfill this need, most have not resulted in any definitive recommendations and a better methodology of evaluation of cell formation algorithms is urgently needed. Prima facie, the methodology underlying Miltenburg and Zhang's (M&Z (1991 evaluation of nine well-known cell formation algorithms seems very promising. The primary performance measure proposed by M&Z effectively captures the objectives of a good solution to a cell formation problem and is worthy of use in future studies. Unfortunately, a critical review of M&Z's methodology also reveals certain important flaws in M&Z's methodology. For example, M&Z may not have duplicated each algorithm precisely as the developer(s of that algorithm intended. Second, M&Z's misrepresent Chandrasekharan and Rajagopalan's [C&R's] (1986 grouping efficiency measure. Third, M&Z's secondary performance measures lead them to unnecessarily ambivalent results. Fourth, several of M&Z's empirical conclusions can be theoretically deduced. It is hoped that future evaluations of cell formation algorithms will benefit from both the strengths and weaknesses of M&Z's work.
Sentiment classification technology based on Markov logic networks

Science.gov (United States)

He, Hui; Li, Zhigang; Yao, Chongchong; Zhang, Weizhe

2016-07-01

With diverse online media emerging, there is a growing concern of sentiment classification problem. At present, text sentiment classification mainly utilizes supervised machine learning methods, which feature certain domain dependency. On the basis of Markov logic networks (MLNs), this study proposed a cross-domain multi-task text sentiment classification method rooted in transfer learning. Through many-to-one knowledge transfer, labeled text sentiment classification, knowledge was successfully transferred into other domains, and the precision of the sentiment classification analysis in the text tendency domain was improved. The experimental results revealed the following: (1) the model based on a MLN demonstrated higher precision than the single individual learning plan model. (2) Multi-task transfer learning based on Markov logical networks could acquire more knowledge than self-domain learning. The cross-domain text sentiment classification model could significantly improve the precision and efficiency of text sentiment classification.
Maternal cell phone use during pregnancy and child behavioral problems in five birth cohorts.

Science.gov (United States)

Birks, Laura; Guxens, Mònica; Papadopoulou, Eleni; Alexander, Jan; Ballester, Ferran; Estarlich, Marisa; Gallastegi, Mara; Ha, Mina; Haugen, Margaretha; Huss, Anke; Kheifets, Leeka; Lim, Hyungryul; Olsen, Jørn; Santa-Marina, Loreto; Sudan, Madhuri; Vermeulen, Roel; Vrijkotte, Tanja; Cardis, Elisabeth; Vrijheid, Martine

2017-07-01

Previous studies have reported associations between prenatal cell phone use and child behavioral problems, but findings have been inconsistent and based on retrospective assessment of cell phone use. This study aimed to assess this association in a multi-national analysis, using data from three cohorts with prospective data on prenatal cell phone use, together with previously published data from two cohorts with retrospectively collected cell phone use data. We used individual participant data from 83,884 mother-child pairs in the five cohorts from Denmark (1996-2002), Korea (2006-2011), the Netherlands (2003-2004), Norway (2004-2008), and Spain (2003-2008). We categorized cell phone use into none, low, medium, and high, based on frequency of calls during pregnancy reported by the mothers. Child behavioral problems (reported by mothers using the Strengths and Difficulties Questionnaire or Child Behavior Checklist) were classified in the borderline/clinical and clinical ranges using validated cut-offs in children aged 5-7years. Cohort specific risk estimates were meta-analyzed. Overall, 38.8% of mothers, mostly from the Danish cohort, reported no cell phone use during pregnancy and these mothers were less likely to have a child with overall behavioral, hyperactivity/inattention or emotional problems. Evidence for a trend of increasing risk of child behavioral problems through the maternal cell phone use categories was observed for hyperactivity/inattention problems (OR for problems in the clinical range: 1.11, 95%CI 1.01, 1.22; 1.28, 95%CI 1.12, 1.48, among children of medium and high users, respectively). This association was fairly consistent across cohorts and between cohorts with retrospectively and prospectively collected cell phone use data. Maternal cell phone use during pregnancy may be associated with an increased risk for behavioral problems, particularly hyperactivity/inattention problems, in the offspring. The interpretation of these results is unclear

Time-reversal of electromagnetic scattering for small scatterer classification

International Nuclear Information System (INIS)

Smith, J Torquil; Berryman, James G

2012-01-01

Time-reversal operators, or the alternatively labelled, but equivalent, multistatic response matrix methods, are used to show how to determine the number of scatterers present in an electromagnetic scattering scenario that might be typical of UneXploded Ordinance (UXO) detection, classification and removal applications. Because the nature of the target UXO application differs from that of many other common inversion problems, emphasis is placed here on classification and enumeration rather than on detailed imaging. The main technical issues necessarily revolve around showing that it is possible to find a sufficient number of constraints via multiple measurements (i.e. using several distinct views at the target site) to solve the enumeration problem. The main results show that five measurements with antenna pairs are generally adequate to solve the classification and enumeration problems. However, these results also demonstrate a need for decreasing noise levels in the multistatic matrix as the number n of scatterers increases for the intended practical applications of the method. (paper)
An Ultrasonic Pattern Recognition Approach to Welding Defect Classification

International Nuclear Information System (INIS)

Song, Sung Jin

1995-01-01

Classification of flaws in weldments from their ultrasonic scattering signals is very important in quantitative nondestructive evaluation. This problem is ideally suited to a modern ultrasonic pattern recognition technique. Here brief discussion on systematic approach to this methodology is presented including ultrasonic feature extraction, feature selection and classification. A stronger emphasis is placed on probabilistic neural networks as efficient classifiers for many practical classification problems. In an example probabilistic neural networks are applied to classify flaws in weldments into 3 classes such as cracks, porosity and slag inclusions. Probabilistic nets are shown to be able to exhibit high performance of other classifiers without any training time overhead. In addition, forward selection scheme for sensitive features is addressed to enhance network performance
Video based object representation and classification using multiple covariance matrices.

Science.gov (United States)

Zhang, Yurong; Liu, Quan

2017-01-01

Video based object recognition and classification has been widely studied in computer vision and image processing area. One main issue of this task is to develop an effective representation for video. This problem can generally be formulated as image set representation. In this paper, we present a new method called Multiple Covariance Discriminative Learning (MCDL) for image set representation and classification problem. The core idea of MCDL is to represent an image set using multiple covariance matrices with each covariance matrix representing one cluster of images. Firstly, we use the Nonnegative Matrix Factorization (NMF) method to do image clustering within each image set, and then adopt Covariance Discriminative Learning on each cluster (subset) of images. At last, we adopt KLDA and nearest neighborhood classification method for image set classification. Promising experimental results on several datasets show the effectiveness of our MCDL method.
Support Vector Machines for Pattern Classification

CERN Document Server

Abe, Shigeo

2010-01-01

A guide on the use of SVMs in pattern classification, including a rigorous performance comparison of classifiers and regressors. The book presents architectures for multiclass classification and function approximation problems, as well as evaluation criteria for classifiers and regressors. Features: Clarifies the characteristics of two-class SVMs; Discusses kernel methods for improving the generalization ability of neural networks and fuzzy systems; Contains ample illustrations and examples; Includes performance evaluation using publicly available data sets; Examines Mahalanobis kernels, empir
Collateral in Loan Classification and Provisioning

OpenAIRE

In W Song

2002-01-01

Adequate loan classification practices are an essential part of a sound and effective credit risk-management process in a bank. Failure to identify deterioration in credit quality in a timely manner can aggravate and prolong the problem. Two key issues arise with regard to the use of collateral in the context of loan classification and provisioning. In particular, the questions arise whether collateral should be taken into account in classifying a collateralized loan, and whether it should be...
A Survey on Sentiment Classification in Face Recognition

Science.gov (United States)

Qian, Jingyu

2018-01-01

Face recognition has been an important topic for both industry and academia for a long time. K-means clustering, autoencoder, and convolutional neural network, each representing a design idea for face recognition method, are three popular algorithms to deal with face recognition problems. It is worthwhile to summarize and compare these three different algorithms. This paper will focus on one specific face recognition problem-sentiment classification from images. Three different algorithms for sentiment classification problems will be summarized, including k-means clustering, autoencoder, and convolutional neural network. An experiment with the application of these algorithms on a specific dataset of human faces will be conducted to illustrate how these algorithms are applied and their accuracy. Finally, the three algorithms are compared based on the accuracy result.
Large Uptake of Titania and Iron Oxide Nanoparticles in the Nucleus of Lung Epithelial Cells as Measured by Raman Imaging and Multivariate Classification

Science.gov (United States)

Ahlinder, Linnea; Ekstrand-Hammarström, Barbro; Geladi, Paul; Österlund, Lars

2013-01-01

It is a challenging task to characterize the biodistribution of nanoparticles in cells and tissue on a subcellular level. Conventional methods to study the interaction of nanoparticles with living cells rely on labeling techniques that either selectively stain the particles or selectively tag them with tracer molecules. In this work, Raman imaging, a label-free technique that requires no extensive sample preparation, was combined with multivariate classification to quantify the spatial distribution of oxide nanoparticles inside living lung epithelial cells (A549). Cells were exposed to TiO2 (titania) and/or α-FeO(OH) (goethite) nanoparticles at various incubation times (4 or 48 h). Using multivariate classification of hyperspectral Raman data with partial least-squares discriminant analysis, we show that a surprisingly large fraction of spectra, classified as belonging to the cell nucleus, show Raman bands associated with nanoparticles. Up to 40% of spectra from the cell nucleus show Raman bands associated with nanoparticles. Complementary transmission electron microscopy data for thin cell sections qualitatively support the conclusions. PMID:23870252
Classification and its applications for drug-target interaction identification

OpenAIRE

Mei, Jian-Ping; Kwoh, Chee-Keong; Yang, Peng; Li, Xiao-Li

2015-01-01

Classification is one of the most popular and widely used supervised learning tasks, which categorizes objects into predefined classes based on known knowledge. Classification has been an important research topic in machine learning and data mining. Different classification methods have been proposed and applied to deal with various real-world problems. Unlike unsupervised learning such as clustering, a classifier is typically trained with labeled data before being used to make prediction, an...
[Outstanding problems of normal and pathological morphology of the diffuse endocrine system].

Science.gov (United States)

Iaglov, V V; Iaglova, N V

2011-01-01

The diffuse endocrine system (DES)--a mosaic-cellular endoepithelial gland--is the biggest part of the human endocrine system. Scientists used to consider cells of DES as neuroectodermal. According to modem data cells of DES are different cytogenetic types because they develop from the different embryonic blastophyllum. So that any hormone-active tumors originated from DES of the digestive, respiratory and urogenital system shouldn't be considered as neuroendocrinal tumors. The basic problems of DES morphology and pathology are the creation of scientifically substantiated histogenetic classification of DES tumors.
Polarimetry based partial least square classification of ex vivo healthy and basal cell carcinoma human skin tissues.

Science.gov (United States)

Ahmad, Iftikhar; Ahmad, Manzoor; Khan, Karim; Ikram, Masroor

2016-06-01

Optical polarimetry was employed for assessment of ex vivo healthy and basal cell carcinoma (BCC) tissue samples from human skin. Polarimetric analyses revealed that depolarization and retardance for healthy tissue group were significantly higher (ppolarimetry together with PLS statistics hold promise for automated pathology classification. Copyright © 2016 Elsevier B.V. All rights reserved.
Observation and inverse problems in coupled cell networks

International Nuclear Information System (INIS)

Joly, Romain

2012-01-01

A coupled cell network is a model for many situations such as food webs in ecosystems, cellular metabolism and economic networks. It consists in a directed graph G, each node (or cell) representing an agent of the network and each directed arrow representing which agent acts on which. It yields a system of differential equations .x(t)=f(x(t)), where the component i of f depends only on the cells x j (t) for which the arrow j → i exists in G. In this paper, we investigate the observation problems in coupled cell networks: can one deduce the behaviour of the whole network (oscillations, stabilization, etc) by observing only one of the cells? We show that the natural observation properties hold for almost all the interactions f
Review of international solutions to NEACRP benchmark BWR lattice cell problems

International Nuclear Information System (INIS)

Halsall, M.J.

1977-12-01

This paper summarises international solutions to a set of BWR benchmark problems. The problems, posed as an activity sponsored by the Nuclear Energy Agency Committee on Reactor Physics, were as follows: 9-pin supercell with central burnable poison pin, mini-BWR with 4 pin-cells and water gaps and control rod cruciform, full 7 x 7 pin BWR lattice cell with differential U 235 enrichment, and full 8 x 8 pin BWR lattice cell with water-hole, Pu-loading, burnable poison, and homogenised cruciform control rod. Solutions have been contributed by Denmark, Japan, Sweden, Switzerland and the UK. (author)
Data Augmentation for Plant Classification

NARCIS (Netherlands)

Pawara, Pornntiwa; Okafor, Emmanuel; Schomaker, Lambertus; Wiering, Marco

2017-01-01

Data augmentation plays a crucial role in increasing the number of training images, which often aids to improve classification performances of deep learning techniques for computer vision problems. In this paper, we employ the deep learning framework and determine the effects of several
The classification of osteonecrosis in patients with cancer: validation of a new radiological classification system

International Nuclear Information System (INIS)

Niinimäki, T.; Niinimäki, J.; Halonen, J.; Hänninen, P.; Harila-Saari, A.; Niinimäki, R.

2015-01-01

Aim: To validate a new, non-joint-specific radiological classification system that is suitable regardless of the site of the osteonecrosis (ON) in patients with cancer. Material and methods: Critical deficiencies in the existing ON classification systems were identified and a new, non-joint-specific radiological classification system was developed. Seventy-two magnetic resonance imaging (MRI) images of patients with cancer and ON lesions were graded, and the validation of the new system was performed by assessing inter- and intra-observer reliability. Results: Intra-observer reliability of ON grading was good or very good, with kappa values of 0.79–0.86. Interobserver agreement was lower but still good, with kappa values of 0.62–0.77. Ninety-eight percent of all intra- or interobserver differences were within one grade. Interobserver reliability of assessing the location of ON was very good, with kappa values of 0.93–0.98. Conclusion: All the available radiological ON classification systems are joint specific. This limitation has spurred the development of multiple systems, which has led to the insufficient use of classifications in ON studies among patients with cancer. The introduced radiological classification system overcomes the problem of joint-specificity, was found to be reliable, and can be used to classify all ON lesions regardless of the affected site. - Highlights: • Patients with cancer may have osteonecrosis lesions at multiple sites. • There is no non-joint-specific osteonecrosis classification available. • We introduced a new non-joint-specific osteonecrosis classification. • The validation was performed by assessing inter- and intra-observer reliability. • The classification was reliable and could be used regardless of the affected site.
Efficient Fingercode Classification

Science.gov (United States)

Sun, Hong-Wei; Law, Kwok-Yan; Gollmann, Dieter; Chung, Siu-Leung; Li, Jian-Bin; Sun, Jia-Guang

In this paper, we present an efficient fingerprint classification algorithm which is an essential component in many critical security application systems e. g. systems in the e-government and e-finance domains. Fingerprint identification is one of the most important security requirements in homeland security systems such as personnel screening and anti-money laundering. The problem of fingerprint identification involves searching (matching) the fingerprint of a person against each of the fingerprints of all registered persons. To enhance performance and reliability, a common approach is to reduce the search space by firstly classifying the fingerprints and then performing the search in the respective class. Jain et al. proposed a fingerprint classification algorithm based on a two-stage classifier, which uses a K-nearest neighbor classifier in its first stage. The fingerprint classification algorithm is based on the fingercode representation which is an encoding of fingerprints that has been demonstrated to be an effective fingerprint biometric scheme because of its ability to capture both local and global details in a fingerprint image. We enhance this approach by improving the efficiency of the K-nearest neighbor classifier for fingercode-based fingerprint classification. Our research firstly investigates the various fast search algorithms in vector quantization (VQ) and the potential application in fingerprint classification, and then proposes two efficient algorithms based on the pyramid-based search algorithms in VQ. Experimental results on DB1 of FVC 2004 demonstrate that our algorithms can outperform the full search algorithm and the original pyramid-based search algorithms in terms of computational efficiency without sacrificing accuracy.
Dynamic Latent Classification Model

DEFF Research Database (Denmark)

Zhong, Shengtong; Martínez, Ana M.; Nielsen, Thomas Dyhre

as possible. Motivated by this problem setting, we propose a generative model for dynamic classification in continuous domains. At each time point the model can be seen as combining a naive Bayes model with a mixture of factor analyzers (FA). The latent variables of the FA are used to capture the dynamics...
The Influence of Hindu Epistemology on Ranganathan's Colon Classification.

Science.gov (United States)

Maurer, Bradley Gerald

This study attempted to determine the influence of Hindu epistemology on Ranganathan's Colon Classification. Only the epistemological schools of Hindu philosophy and the Idea Plane element of Colon Classification were included. A literature search revealed that, although there is significant literature on each side of the problem, no bridges exist…
Woven fabric defects detection based on texture classification algorithm

International Nuclear Information System (INIS)

Ben Salem, Y.; Nasri, S.

2011-01-01

In this paper we have compared two famous methods in texture classification to solve the problem of recognition and classification of defects occurring in a textile manufacture. We have compared local binary patterns method with co-occurrence matrix. The classifier used is the support vector machines (SVM). The system has been tested using TILDA database. The results obtained are interesting and show that LBP is a good method for the problems of recognition and classifcation defects, it gives a good running time especially for the real time applications.
CLASSIFICATION OF LEARNING MANAGEMENT SYSTEMS

Directory of Open Access Journals (Sweden)

Yu. B. Popova

2016-01-01

Full Text Available Using of information technologies and, in particular, learning management systems, increases opportunities of teachers and students in reaching their goals in education. Such systems provide learning content, help organize and monitor training, collect progress statistics and take into account the individual characteristics of each user. Currently, there is a huge inventory of both paid and free systems are physically located both on college servers and in the cloud, offering different features sets of different licensing scheme and the cost. This creates the problem of choosing the best system. This problem is partly due to the lack of comprehensive classification of such systems. Analysis of more than 30 of the most common now automated learning management systems has shown that a classification of such systems should be carried out according to certain criteria, under which the same type of system can be considered. As classification features offered by the author are: cost, functionality, modularity, keeping the customer’s requirements, the integration of content, the physical location of a system, adaptability training. Considering the learning management system within these classifications and taking into account the current trends of their development, it is possible to identify the main requirements to them: functionality, reliability, ease of use, low cost, support for SCORM standard or Tin Can API, modularity and adaptability. According to the requirements at the Software Department of FITR BNTU under the guidance of the author since 2009 take place the development, the use and continuous improvement of their own learning management system.
Classification of sudden and arrhythmic death

DEFF Research Database (Denmark)

Torp-Pedersen, C; Køber, L; Elming, H

1997-01-01

was nearly abolished by the implantable defibrillator, indicating that arrhythmic death by this classification is meaningful, at least in the population studied. For future investigations, a call is made for committees to present data in a way that allows the reader to examine the quality of the data used......Since all death is (eventually) sudden and associated with cardiac arrhythmias, the concept of sudden death is only meaningful if it is unexpected, while arrhythmic death is only meaningful if life could have continued had the arrhythmia been prevented or treated. Current classifications of death...... or autopsy) are available in only a few percent of cases. A main problem in using classifications is the lack of validation data. This situation has, with the MADIT trial, changed in the case of the Thaler and Hinkle classification of arrhythmic death. The MADIT trial demonstrated that arrhythmic death...

Power noise spectrum classification in the problem of the IBR-2 reactor

International Nuclear Information System (INIS)

Bargel, M.; Kitowski, J.; Pepelyshev, Yu.N.

1988-01-01

The classification spectrum results of random fluctuations in the IBR-2 energy pulse are presented. The work is performed for the application of the obtained results to the reactor diagnostics and the study of its noise uncontrolled states. For classification of the spectra the method of pattern recognition based upon the ISODATA heuristic algorithm is used. It is shown that a set of noise uncontrolled reactor states, registered during the reactor operation period at power of 0.4-2 MVt with the first variant of moving reflector (1983-1986) is formed into 4(5) most typical states. Each of the states corresponds to the general conditions of the reactor core cooling and provides the normal work of the moving reflector. However, these states differ in coolant flow, power level and peculiarities of the moving reflector rotation regime. One type of anomal power noise, connected with some disorder in the moving reflctor work, is isolated. This work also presents the possibility of control over the state of moving reflectors according to the change in the amplitude of power oscillations at some frequences. The reactor noise classification results can be used as the data bank for the IBR-2 reactor diagnostic system
Specific classification of financial analysis of enterprise activity

Directory of Open Access Journals (Sweden)

Synkevych Nadiia I.

2014-01-01

Full Text Available Despite the fact that one can find a big variety of classifications of types of financial analysis of enterprise activity, which differ with their approach to classification and a number of classification features and their content, in modern scientific literature, their complex comparison and analysis of existing classification have not been done. This explains urgency of this study. The article studies classification of types of financial analysis of scientists and presents own approach to this problem. By the results of analysis the article improves and builds up a specific classification of financial analysis of enterprise activity and offers classification by the following features: objects, subjects, goals of study, automation level, time period of the analytical base, scope of study, organisation system, classification features of the subject, spatial belonging, sufficiency, information sources, periodicity, criterial base, method of data selection for analysis and time direction. All types of financial analysis significantly differ with their inherent properties and parameters depending on the goals of financial analysis. The developed specific classification provides subjects of financial analysis of enterprise activity with a possibility to identify a specific type of financial analysis, which would correctly meet the set goals.
Clever Toolbox - the Art of Automated Genre Classification

DEFF Research Database (Denmark)

2005-01-01

Automatic musical genre classification can be defined as the science of finding computer algorithms that a digitized sound clip as input and yield a musical genre as output. The goal of automated genre classification is, of course, that the musical genre should agree with the human classificasion....... This demo illustrates an approach to the problem that first extract frequency-based sound features followed by a "linear regression" classifier. The basic features are the so-called mel-frequency cepstral coefficients (MFCCs), which are extracted on a time-scale of 30 msec. From these MFCC features, auto......) is subsequently used for classification. This classifier is rather simple; current research investigates more advanced methods of classification....
Land-cover classification with an expert classification algorithm using digital aerial photographs

Directory of Open Access Journals (Sweden)

José L. de la Cruz

2010-05-01

Full Text Available The purpose of this study was to evaluate the usefulness of the spectral information of digital aerial sensors in determining land-cover classification using new digital techniques. The land covers that have been evaluated are the following, (1 bare soil, (2 cereals, including maize (Zea mays L., oats (Avena sativa L., rye (Secale cereale L., wheat (Triticum aestivum L. and barley (Hordeun vulgare L., (3 high protein crops, such as peas (Pisum sativum L. and beans (Vicia faba L., (4 alfalfa (Medicago sativa L., (5 woodlands and scrublands, including holly oak (Quercus ilex L. and common retama (Retama sphaerocarpa L., (6 urban soil, (7 olive groves (Olea europaea L. and (8 burnt crop stubble. The best result was obtained using an expert classification algorithm, achieving a reliability rate of 95%. This result showed that the images of digital airborne sensors hold considerable promise for the future in the field of digital classifications because these images contain valuable information that takes advantage of the geometric viewpoint. Moreover, new classification techniques reduce problems encountered using high-resolution images; while reliabilities are achieved that are better than those achieved with traditional methods.
Pap-smear Benchmark Data For Pattern Classification

DEFF Research Database (Denmark)

Jantzen, Jan; Norup, Jonas; Dounias, Georgios

2005-01-01

This case study provides data and a baseline for comparing classification methods. The data consists of 917 images of Pap-smear cells, classified carefully by cyto-technicians and doctors. Each cell is described by 20 numerical features, and the cells fall into 7 classes. A basic data analysis in...
Weakly supervised classification in high energy physics

Energy Technology Data Exchange (ETDEWEB)

Dery, Lucio Mwinmaarong [Physics Department, Stanford University,Stanford, CA, 94305 (United States); Nachman, Benjamin [Physics Division, Lawrence Berkeley National Laboratory,1 Cyclotron Rd, Berkeley, CA, 94720 (United States); Rubbo, Francesco; Schwartzman, Ariel [SLAC National Accelerator Laboratory, Stanford University,2575 Sand Hill Rd, Menlo Park, CA, 94025 (United States)

2017-05-29

As machine learning algorithms become increasingly sophisticated to exploit subtle features of the data, they often become more dependent on simulations. This paper presents a new approach called weakly supervised classification in which class proportions are the only input into the machine learning algorithm. Using one of the most challenging binary classification tasks in high energy physics — quark versus gluon tagging — we show that weakly supervised classification can match the performance of fully supervised algorithms. Furthermore, by design, the new algorithm is insensitive to any mis-modeling of discriminating features in the data by the simulation. Weakly supervised classification is a general procedure that can be applied to a wide variety of learning problems to boost performance and robustness when detailed simulations are not reliable or not available.
Weakly supervised classification in high energy physics

International Nuclear Information System (INIS)

Dery, Lucio Mwinmaarong; Nachman, Benjamin; Rubbo, Francesco; Schwartzman, Ariel

2017-01-01

As machine learning algorithms become increasingly sophisticated to exploit subtle features of the data, they often become more dependent on simulations. This paper presents a new approach called weakly supervised classification in which class proportions are the only input into the machine learning algorithm. Using one of the most challenging binary classification tasks in high energy physics — quark versus gluon tagging — we show that weakly supervised classification can match the performance of fully supervised algorithms. Furthermore, by design, the new algorithm is insensitive to any mis-modeling of discriminating features in the data by the simulation. Weakly supervised classification is a general procedure that can be applied to a wide variety of learning problems to boost performance and robustness when detailed simulations are not reliable or not available.
Proteomic classification of breast cancer.

LENUS (Irish Health Repository)

Kamel, Dalia

2012-11-01

Being a significant health problem that affects patients in various age groups, breast cancer has been extensively studied to date. Recently, molecular breast cancer classification has advanced significantly with the availability of genomic profiling technologies. Proteomic technologies have also advanced from traditional protein assays including enzyme-linked immunosorbent assay, immunoblotting and immunohistochemistry to more comprehensive approaches including mass spectrometry and reverse phase protein lysate arrays (RPPA). The purpose of this manuscript is to review the current protein markers that influence breast cancer prediction and prognosis and to focus on novel advances in proteomic classification of breast cancer.
Solving Classification Problems for Large Sets of Protein Sequences with the Example of Hox and ParaHox Proteins

Directory of Open Access Journals (Sweden)

Stefanie D. Hueber

2016-02-01

Full Text Available Phylogenetic methods are key to providing models for how a given protein family evolved. However, these methods run into difficulties when sequence divergence is either too low or too high. Here, we provide a case study of Hox and ParaHox proteins so that additional insights can be gained using a new computational approach to help solve old classification problems. For two (Gsx and Cdx out of three ParaHox proteins the assignments differ between the currently most established view and four alternative scenarios. We use a non-phylogenetic, pairwise-sequence-similarity-based method to assess which of the previous predictions, if any, are best supported by the sequence-similarity relationships between Hox and ParaHox proteins. The overall sequence-similarities show Gsx to be most similar to Hox2–3, and Cdx to be most similar to Hox4–8. The results indicate that a purely pairwise-sequence-similarity-based approach can provide additional information not only when phylogenetic inference methods have insufficient information to provide reliable classifications (as was shown previously for central Hox proteins, but also when the sequence variation is so high that the resulting phylogenetic reconstructions are likely plagued by long-branch-attraction artifacts.
Exact smooth classification of Hamiltonian vector fields on symplectic 2-manifolds

International Nuclear Information System (INIS)

Krouglikov, B.S.

1994-10-01

Complete exact classification of Hamiltonian systems with one degree of freedom and Morse Hamiltonian is carried out. As it is a main part of trajectory classification of integrable Hamiltonian systems with two degrees of freedom, the corresponding generalization is considered. The dual problem of classification of symplectic form together with Morse foliation is carried out as well. (author). 10 refs, 16 figs
Hierarchical structure for audio-video based semantic classification of sports video sequences

Science.gov (United States)

Kolekar, M. H.; Sengupta, S.

2005-07-01

A hierarchical structure for sports event classification based on audio and video content analysis is proposed in this paper. Compared to the event classifications in other games, those of cricket are very challenging and yet unexplored. We have successfully solved cricket video classification problem using a six level hierarchical structure. The first level performs event detection based on audio energy and Zero Crossing Rate (ZCR) of short-time audio signal. In the subsequent levels, we classify the events based on video features using a Hidden Markov Model implemented through Dynamic Programming (HMM-DP) using color or motion as a likelihood function. For some of the game-specific decisions, a rule-based classification is also performed. Our proposed hierarchical structure can easily be applied to any other sports. Our results are very promising and we have moved a step forward towards addressing semantic classification problems in general.
Land Cover Classification in a Complex Urban-Rural Landscape with Quickbird Imagery

OpenAIRE

Moran, Emilio Federico.

2010-01-01

High spatial resolution images have been increasingly used for urban land use/cover classification, but the high spectral variation within the same land cover, the spectral confusion among different land covers, and the shadow problem often lead to poor classification performance based on the traditional per-pixel spectral-based classification methods. This paper explores approaches to improve urban land cover classification with Quickbird imagery. Traditional per-pixel spectral-based supervi...
Vision-Based Perception and Classification of Mosquitoes Using Support Vector Machine

Directory of Open Access Journals (Sweden)

Masataka Fuchida

2017-01-01

Full Text Available The need for a novel automated mosquito perception and classification method is becoming increasingly essential in recent years, with steeply increasing number of mosquito-borne diseases and associated casualties. There exist remote sensing and GIS-based methods for mapping potential mosquito inhabitants and locations that are prone to mosquito-borne diseases, but these methods generally do not account for species-wise identification of mosquitoes in closed-perimeter regions. Traditional methods for mosquito classification involve highly manual processes requiring tedious sample collection and supervised laboratory analysis. In this research work, we present the design and experimental validation of an automated vision-based mosquito classification module that can deploy in closed-perimeter mosquito inhabitants. The module is capable of identifying mosquitoes from other bugs such as bees and flies by extracting the morphological features, followed by support vector machine-based classification. In addition, this paper presents the results of three variants of support vector machine classifier in the context of mosquito classification problem. This vision-based approach to the mosquito classification problem presents an efficient alternative to the conventional methods for mosquito surveillance, mapping and sample image collection. Experimental results involving classification between mosquitoes and a predefined set of other bugs using multiple classification strategies demonstrate the efficacy and validity of the proposed approach with a maximum recall of 98%.
Domain Adaptation for Opinion Classification: A Self-Training Approach

Directory of Open Access Journals (Sweden)

Yu, Ning

2013-03-01

Full Text Available Domain transfer is a widely recognized problem for machine learning algorithms because models built upon one data domain generally do not perform well in another data domain. This is especially a challenge for tasks such as opinion classification, which often has to deal with insufficient quantities of labeled data. This study investigates the feasibility of self-training in dealing with the domain transfer problem in opinion classification via leveraging labeled data in non-target data domain(s and unlabeled data in the target-domain. Specifically, self-training is evaluated for effectiveness in sparse data situations and feasibility for domain adaptation in opinion classification. Three types of Web content are tested: edited news articles, semi-structured movie reviews, and the informal and unstructured content of the blogosphere. Findings of this study suggest that, when there are limited labeled data, self-training is a promising approach for opinion classification, although the contributions vary across data domains. Significant improvement was demonstrated for the most challenging data domain-the blogosphere-when a domain transfer-based self-training strategy was implemented.
Definitions, Criteria and Global Classification of Mast Cell Disorders with Special Reference to Mast Cell Activation Syndromes: A Consensus Proposal

Science.gov (United States)

Valent, Peter; Akin, Cem; Arock, Michel; Brockow, Knut; Butterfield, Joseph H.; Carter, Melody C.; Castells, Mariana; Escribano, Luis; Hartmann, Karin; Lieberman, Philip; Nedoszytko, Boguslaw; Orfao, Alberto; Schwartz, Lawrence B.; Sotlar, Karl; Sperr, Wolfgang R.; Triggiani, Massimo; Valenta, Rudolf; Horny, Hans-Peter; Metcalfe, Dean D.

2012-01-01

Activation of tissue mast cells (MCs) and their abnormal growth and accumulation in various organs are typically found in primary MC disorders also referred to as mastocytosis. However, increasing numbers of patients are now being informed that their clinical findings are due to MC activation (MCA) that is neither associated with mastocytosis nor with a defined allergic or inflammatory reaction. In other patients with MCA, MCs appear to be clonal cells, but criteria for diagnosing mastocytosis are not met. A working conference was organized in 2010 with the aim to define criteria for diagnosing MCA and related disorders, and to propose a global unifying classification of all MC disorders and pathologic MC reactions. This classification includes three types of ‘MCA syndromes’ (MCASs), namely primary MCAS, secondary MCAS and idiopathic MCAS. MCA is now defined by robust and generally applicable criteria, including (1) typical clinical symptoms, (2) a substantial transient increase in serum total tryptase level or an increase in other MC-derived mediators, such as histamine or prostaglandin D2, or their urinary metabolites, and (3) a response of clinical symptoms to agents that attenuate the production or activities of MC mediators. These criteria should assist in the identification and diagnosis of patients with MCAS, and in avoiding misdiagnoses or overinterpretation of clinical symptoms in daily practice. Moreover, the MCAS concept should stimulate research in order to identify and exploit new molecular mechanisms and therapeutic targets. PMID:22041891
CIN classification and prediction using machine learning methods

Science.gov (United States)

Chirkina, Anastasia; Medvedeva, Marina; Komotskiy, Evgeny

2017-06-01

The aim of this paper is a comparison of the existing classification algorithms with different parameters, and selection those ones, which allows solving the problem of primary diagnosis of cervical intraepithelial neoplasia (CIN), as it characterizes the condition of the body in the precancerous stage. The paper describes a feature selection process, as well as selection of the best models for a multiclass classification.
Sensitivity versus accuracy in multiclass problems using memetic Pareto evolutionary neural networks.

Science.gov (United States)

Fernández Caballero, Juan Carlos; Martínez, Francisco José; Hervás, César; Gutiérrez, Pedro Antonio

2010-05-01

This paper proposes a multiclassification algorithm using multilayer perceptron neural network models. It tries to boost two conflicting main objectives of multiclassifiers: a high correct classification rate level and a high classification rate for each class. This last objective is not usually optimized in classification, but is considered here given the need to obtain high precision in each class in real problems. To solve this machine learning problem, we use a Pareto-based multiobjective optimization methodology based on a memetic evolutionary algorithm. We consider a memetic Pareto evolutionary approach based on the NSGA2 evolutionary algorithm (MPENSGA2). Once the Pareto front is built, two strategies or automatic individual selection are used: the best model in accuracy and the best model in sensitivity (extremes in the Pareto front). These methodologies are applied to solve 17 classification benchmark problems obtained from the University of California at Irvine (UCI) repository and one complex real classification problem. The models obtained show high accuracy and a high classification rate for each class.
[Clinical aspects and classification of echinococcosis].

Science.gov (United States)

Nabokov, Sh A; Vasil'ev, R Kh

1978-04-01

350 cases of alveococcosis were examined with the use of clinical and generally available methods of laboratory analysis. This study helped to find out the characteristic symptoms of the disease and their incidence rate. A clinico-anatomical classification of alveoccoccosis, based on local and general manifestations, localization of a primary focus, anatomic form of the growth of an alveococcal node and the degree of its propagation in the liver parenchima, has been developed. The suggested classification promotes a correct construction of a detailed clinical diagnosis and complete solution of the problems of therapeutic tactics.
Standard classification of uranium resources-an illustrative example

International Nuclear Information System (INIS)

Krishna, P.M.; Babitzke, H.R.; Curry, D.; Masters, C.D.; McCammon, R.B.; Noble, R.B.; Rodriguez, J.A.; Schanz, J.J.; Schreiber, H.W.

1983-01-01

An example illustrates the use of ASTM Standard E901-82, Classification System for Uranium Resources. The example demonstrates the dynamic nature of the process of classification and attests to the necessity of addressing both the aggregate needs of broad-scale resource planning and the specific needs of individual property evaluation. Problems that remain in fixing the classification of a given uranium resource include the uncertainty in estimating the quantity of undiscovered resources and resolving the differences that may exist in deciding when the drill-hole spacing is adequate to determine the tonnage and grade of discovered resources
Automatic indexing, compiling and classification

International Nuclear Information System (INIS)

Andreewsky, Alexandre; Fluhr, Christian.

1975-06-01

A review of the principles of automatic indexing, is followed by a comparison and summing-up of work by the authors and by a Soviet staff from the Moscou INFORM-ELECTRO Institute. The mathematical and linguistic problems of the automatic building of thesaurus and automatic classification are examined [fr

A Soft Intelligent Risk Evaluation Model for Credit Scoring Classification

Directory of Open Access Journals (Sweden)

Mehdi Khashei

2015-09-01

Full Text Available Risk management is one of the most important branches of business and finance. Classification models are the most popular and widely used analytical group of data mining approaches that can greatly help financial decision makers and managers to tackle credit risk problems. However, the literature clearly indicates that, despite proposing numerous classification models, credit scoring is often a difficult task. On the other hand, there is no universal credit-scoring model in the literature that can be accurately and explanatorily used in all circumstances. Therefore, the research for improving the efficiency of credit-scoring models has never stopped. In this paper, a hybrid soft intelligent classification model is proposed for credit-scoring problems. In the proposed model, the unique advantages of the soft computing techniques are used in order to modify the performance of the traditional artificial neural networks in credit scoring. Empirical results of Australian credit card data classifications indicate that the proposed hybrid model outperforms its components, and also other classification models presented for credit scoring. Therefore, the proposed model can be considered as an appropriate alternative tool for binary decision making in business and finance, especially in high uncertainty conditions.
A Comprehensive Study of Features and Algorithms for URL-Based Topic Classification

CERN Document Server

Weber, I; Henzinger, M; Baykan, E

2011-01-01

Given only the URL of a Web page, can we identify its topic? We study this problem in detail by exploring a large number of different feature sets and algorithms on several datasets. We also show that the inherent overlap between topics and the sparsity of the information in URLs makes this a very challenging problem. Web page classification without a page's content is desirable when the content is not available at all, when a classification is needed before obtaining the content, or when classification speed is of utmost importance. For our experiments we used five different corpora comprising a total of about 3 million (URL, classification) pairs. We evaluated several techniques for feature generation and classification algorithms. The individual binary classifiers were then combined via boosting into metabinary classifiers. We achieve typical F-measure values between 80 and 85, and a typical precision of around 86. The precision can be pushed further over 90 while maintaining a typical level of recall betw...
On the Evaluation of Outlier Detection and One-Class Classification Methods

DEFF Research Database (Denmark)

Swersky, Lorne; Marques, Henrique O.; Sander, Jörg

2016-01-01

It has been shown that unsupervised outlier detection methods can be adapted to the one-class classification problem. In this paper, we focus on the comparison of oneclass classification algorithms with such adapted unsupervised outlier detection methods, improving on previous comparison studies ...
Motor Oil Classification using Color Histograms and Pattern Recognition Techniques.

Science.gov (United States)

Ahmadi, Shiva; Mani-Varnosfaderani, Ahmad; Habibi, Biuck

2018-04-20

Motor oil classification is important for quality control and the identification of oil adulteration. In thiswork, we propose a simple, rapid, inexpensive and nondestructive approach based on image analysis and pattern recognition techniques for the classification of nine different types of motor oils according to their corresponding color histograms. For this, we applied color histogram in different color spaces such as red green blue (RGB), grayscale, and hue saturation intensity (HSI) in order to extract features that can help with the classification procedure. These color histograms and their combinations were used as input for model development and then were statistically evaluated by using linear discriminant analysis (LDA), quadratic discriminant analysis (QDA), and support vector machine (SVM) techniques. Here, two common solutions for solving a multiclass classification problem were applied: (1) transformation to binary classification problem using a one-against-all (OAA) approach and (2) extension from binary classifiers to a single globally optimized multilabel classification model. In the OAA strategy, LDA, QDA, and SVM reached up to 97% in terms of accuracy, sensitivity, and specificity for both the training and test sets. In extension from binary case, despite good performances by the SVM classification model, QDA and LDA provided better results up to 92% for RGB-grayscale-HSI color histograms and up to 93% for the HSI color map, respectively. In order to reduce the numbers of independent variables for modeling, a principle component analysis algorithm was used. Our results suggest that the proposed method is promising for the identification and classification of different types of motor oils.
Analysis of composition-based metagenomic classification.

Science.gov (United States)

Higashi, Susan; Barreto, André da Motta Salles; Cantão, Maurício Egidio; de Vasconcelos, Ana Tereza Ribeiro

2012-01-01

An essential step of a metagenomic study is the taxonomic classification, that is, the identification of the taxonomic lineage of the organisms in a given sample. The taxonomic classification process involves a series of decisions. Currently, in the context of metagenomics, such decisions are usually based on empirical studies that consider one specific type of classifier. In this study we propose a general framework for analyzing the impact that several decisions can have on the classification problem. Instead of focusing on any specific classifier, we define a generic score function that provides a measure of the difficulty of the classification task. Using this framework, we analyze the impact of the following parameters on the taxonomic classification problem: (i) the length of n-mers used to encode the metagenomic sequences, (ii) the similarity measure used to compare sequences, and (iii) the type of taxonomic classification, which can be conventional or hierarchical, depending on whether the classification process occurs in a single shot or in several steps according to the taxonomic tree. We defined a score function that measures the degree of separability of the taxonomic classes under a given configuration induced by the parameters above. We conducted an extensive computational experiment and found out that reasonable values for the parameters of interest could be (i) intermediate values of n, the length of the n-mers; (ii) any similarity measure, because all of them resulted in similar scores; and (iii) the hierarchical strategy, which performed better in all of the cases. As expected, short n-mers generate lower configuration scores because they give rise to frequency vectors that represent distinct sequences in a similar way. On the other hand, large values for n result in sparse frequency vectors that represent differently metagenomic fragments that are in fact similar, also leading to low configuration scores. Regarding the similarity measure, in
T-ray relevant frequencies for osteosarcoma classification

Science.gov (United States)

Withayachumnankul, W.; Ferguson, B.; Rainsford, T.; Findlay, D.; Mickan, S. P.; Abbott, D.

2006-01-01

We investigate the classification of the T-ray response of normal human bone cells and human osteosarcoma cells, grown in culture. Given the magnitude and phase responses within a reliable spectral range as features for input vectors, a trained support vector machine can correctly classify the two cell types to some extent. Performance of the support vector machine is deteriorated by the curse of dimensionality, resulting from the comparatively large number of features in the input vectors. Feature subset selection methods are used to select only an optimal number of relevant features for inputs. As a result, an improvement in generalization performance is attainable, and the selected frequencies can be used for further describing different mechanisms of the cells, responding to T-rays. We demonstrate a consistent classification accuracy of 89.6%, while the only one fifth of the original features are retained in the data set.
Iris Data Classification Using Quantum Neural Networks

International Nuclear Information System (INIS)

Sahni, Vishal; Patvardhan, C.

2006-01-01

Quantum computing is a novel paradigm that promises to be the future of computing. The performance of quantum algorithms has proved to be stunning. ANN within the context of classical computation has been used for approximation and classification tasks with some success. This paper presents an idea of quantum neural networks along with the training algorithm and its convergence property. It synergizes the unique properties of quantum bits or qubits with the various techniques in vogue in neural networks. An example application of Fisher's Iris data set, a benchmark classification problem has also been presented. The results obtained amply demonstrate the classification capabilities of the quantum neuron and give an idea of their promising capabilities
Graph-Based Semi-Supervised Hyperspectral Image Classification Using Spatial Information

Science.gov (United States)

Jamshidpour, N.; Homayouni, S.; Safari, A.

2017-09-01

Hyperspectral image classification has been one of the most popular research areas in the remote sensing community in the past decades. However, there are still some problems that need specific attentions. For example, the lack of enough labeled samples and the high dimensionality problem are two most important issues which degrade the performance of supervised classification dramatically. The main idea of semi-supervised learning is to overcome these issues by the contribution of unlabeled samples, which are available in an enormous amount. In this paper, we propose a graph-based semi-supervised classification method, which uses both spectral and spatial information for hyperspectral image classification. More specifically, two graphs were designed and constructed in order to exploit the relationship among pixels in spectral and spatial spaces respectively. Then, the Laplacians of both graphs were merged to form a weighted joint graph. The experiments were carried out on two different benchmark hyperspectral data sets. The proposed method performed significantly better than the well-known supervised classification methods, such as SVM. The assessments consisted of both accuracy and homogeneity analyses of the produced classification maps. The proposed spectral-spatial SSL method considerably increased the classification accuracy when the labeled training data set is too scarce.When there were only five labeled samples for each class, the performance improved 5.92% and 10.76% compared to spatial graph-based SSL, for AVIRIS Indian Pine and Pavia University data sets respectively.
GRAPH-BASED SEMI-SUPERVISED HYPERSPECTRAL IMAGE CLASSIFICATION USING SPATIAL INFORMATION

Directory of Open Access Journals (Sweden)

N. Jamshidpour

2017-09-01

Full Text Available Hyperspectral image classification has been one of the most popular research areas in the remote sensing community in the past decades. However, there are still some problems that need specific attentions. For example, the lack of enough labeled samples and the high dimensionality problem are two most important issues which degrade the performance of supervised classification dramatically. The main idea of semi-supervised learning is to overcome these issues by the contribution of unlabeled samples, which are available in an enormous amount. In this paper, we propose a graph-based semi-supervised classification method, which uses both spectral and spatial information for hyperspectral image classification. More specifically, two graphs were designed and constructed in order to exploit the relationship among pixels in spectral and spatial spaces respectively. Then, the Laplacians of both graphs were merged to form a weighted joint graph. The experiments were carried out on two different benchmark hyperspectral data sets. The proposed method performed significantly better than the well-known supervised classification methods, such as SVM. The assessments consisted of both accuracy and homogeneity analyses of the produced classification maps. The proposed spectral-spatial SSL method considerably increased the classification accuracy when the labeled training data set is too scarce.When there were only five labeled samples for each class, the performance improved 5.92% and 10.76% compared to spatial graph-based SSL, for AVIRIS Indian Pine and Pavia University data sets respectively.
Quantum Ensemble Classification: A Sampling-Based Learning Control Approach.

Science.gov (United States)

Chen, Chunlin; Dong, Daoyi; Qi, Bo; Petersen, Ian R; Rabitz, Herschel

2017-06-01

Quantum ensemble classification (QEC) has significant applications in discrimination of atoms (or molecules), separation of isotopes, and quantum information extraction. However, quantum mechanics forbids deterministic discrimination among nonorthogonal states. The classification of inhomogeneous quantum ensembles is very challenging, since there exist variations in the parameters characterizing the members within different classes. In this paper, we recast QEC as a supervised quantum learning problem. A systematic classification methodology is presented by using a sampling-based learning control (SLC) approach for quantum discrimination. The classification task is accomplished via simultaneously steering members belonging to different classes to their corresponding target states (e.g., mutually orthogonal states). First, a new discrimination method is proposed for two similar quantum systems. Then, an SLC method is presented for QEC. Numerical results demonstrate the effectiveness of the proposed approach for the binary classification of two-level quantum ensembles and the multiclass classification of multilevel quantum ensembles.
Issues and Ethical Problems of Stem Cell Therapy – Where is Hippocrates?

Directory of Open Access Journals (Sweden)

Lucie Rousková

2008-01-01

Full Text Available Stem cells and their therapeutic use present many questions associated with ethical problems in medicine. There is great effort on the part of physicians to help millions of patients while there are ethical problems with the use of new methods and technologies and all of these are affected by economic and political influences. How will the current generation deal with these problems? Medicine, in this begard, is experiencing a stormy evolution of human culture in the relationships between disease, patient and doctor. Philosophy approaches the same juncture of human culture, but seemingly from the other side. Both disciplines are facing a great problem: How to unite the content of current human morality and the desire for health? Both philosophers and physicians perceive this deficit in human culture as it does not provide immediately usable normatives, which the living generation of healthy and ill is waiting for. It may be said that medicine, as many times before, has reached a stage where it cannot rely only on the proved axiologic values from the past, ethical normatives or cultivated moral sense of its subjects. Medicine has no other alternative than to take an active part in resolution of interdisciplinary problems originating from philosophic-biologic or philosophic-medical inquiries of axiologic, ethical, and moral issues. Our paper indicates some ways of the search in forming ethical principles of the stem-cell therapy from the view of biologists and physicians. New ways are recommended in theoretical-methodological interdisciplinary research, especially, in theoretical and experimental biology, and theoretical and clinical medicine, as well as philosophy. In this paper important ethical problems are pointed out in order to find answers to some key problems connected with cell therapy and the use of stem cells.
The SOGS-RA vs. the MAGS-7: prevalence estimates and classification congruence.

Science.gov (United States)

Langhinrichsen-Rohling, Jennifer; Rohling, Martin L; Rohde, Paul; Seeley, John R

2004-01-01

The purpose of this study was to compare the prevalence rate estimates and congruence in classification status derived from two popular measures of adolescent gambling (SOGS-RA and MAGS-7). Adolescents from three states (Alabama, Mississippi, and Oregon) completed an anonymous questionnaire ( n =1846 high school students total). Results indicate that the prevalence of probable adolescent pathological gambling varied both as a function of instrument and cut-off point utilized for classification (range 1.7%-8.2%). Classification groups (non-problem, at-risk, and problem gamblers) generated by both instruments were found to be associated with reports of gambling frequency, amount of money lost in one gambling occasion, and parental gambling problems. However, concern was raised because the MAGS-7 and the SOGS-RA had little congruence in their three-group classification decisions for specific individuals (e.g., only 20.5% agreement for problem gamblers). To improve clinical utility, an empirical case was made for using the SOGS-RA to generate a fourth group of adolescent gamblers, which we labeled "probable pathological gamblers" (SOGS-RA > or = 6). This group was differentiated from the remaining gambling groups on all the validity indices. The implications and limitations of these findings, as well as future directions, are discussed.
Learning semantic histopathological representation for basal cell carcinoma classification

Science.gov (United States)

Gutiérrez, Ricardo; Rueda, Andrea; Romero, Eduardo

2013-03-01

Diagnosis of a histopathology glass slide is a complex process that involves accurate recognition of several structures, their function in the tissue and their relation with other structures. The way in which the pathologist represents the image content and the relations between those objects yields a better and accurate diagnoses. Therefore, an appropriate semantic representation of the image content will be useful in several analysis tasks such as cancer classification, tissue retrieval and histopahological image analysis, among others. Nevertheless, to automatically recognize those structures and extract their inner semantic meaning are still very challenging tasks. In this paper we introduce a new semantic representation that allows to describe histopathological concepts suitable for classification. The approach herein identify local concepts using a dictionary learning approach, i.e., the algorithm learns the most representative atoms from a set of random sampled patches, and then models the spatial relations among them by counting the co-occurrence between atoms, while penalizing the spatial distance. The proposed approach was compared with a bag-of-features representation in a tissue classification task. For this purpose, 240 histological microscopical fields of view, 24 per tissue class, were collected. Those images fed a Support Vector Machine classifier per class, using 120 images as train set and the remaining ones for testing, maintaining the same proportion of each concept in the train and test sets. The obtained classification results, averaged from 100 random partitions of training and test sets, shows that our approach is more sensitive in average than the bag-of-features representation in almost 6%.
[Methods of substances and organelles introduction in living cell for cell engineering technologies].

Science.gov (United States)

Nikitin, V A

2007-01-01

We have presented the classification of more than 40 methods of genetic material, substances and organelles introduction into a living cell. Each of them has its characteristic advantages, disadvantages and limitations with respect to cell viability, transfer efficiency, general applicability, and technical requirements. It this article we have enlarged on the description of our developments of several new and improved approaches, methods and devices of the direct microinjection into a single cell and cell microsurgery with the help of glass micropipettes. The problem of low efficiency of mammalian cloning is discussed with emphasis on the necessity of expertizing of each step of single cell reconstruction to begin with microsurgical manipulations and necessity of the development of such methods of single cell resonstruction that could minimize the possible damage of the cell.
Fuzzy support vector machine for microarray imbalanced data classification

Science.gov (United States)

Ladayya, Faroh; Purnami, Santi Wulan; Irhamah

2017-11-01

DNA microarrays are data containing gene expression with small sample sizes and high number of features. Furthermore, imbalanced classes is a common problem in microarray data. This occurs when a dataset is dominated by a class which have significantly more instances than the other minority classes. Therefore, it is needed a classification method that solve the problem of high dimensional and imbalanced data. Support Vector Machine (SVM) is one of the classification methods that is capable of handling large or small samples, nonlinear, high dimensional, over learning and local minimum issues. SVM has been widely applied to DNA microarray data classification and it has been shown that SVM provides the best performance among other machine learning methods. However, imbalanced data will be a problem because SVM treats all samples in the same importance thus the results is bias for minority class. To overcome the imbalanced data, Fuzzy SVM (FSVM) is proposed. This method apply a fuzzy membership to each input point and reformulate the SVM such that different input points provide different contributions to the classifier. The minority classes have large fuzzy membership so FSVM can pay more attention to the samples with larger fuzzy membership. Given DNA microarray data is a high dimensional data with a very large number of features, it is necessary to do feature selection first using Fast Correlation based Filter (FCBF). In this study will be analyzed by SVM, FSVM and both methods by applying FCBF and get the classification performance of them. Based on the overall results, FSVM on selected features has the best classification performance compared to SVM.
Selection of Objective Function For Imbalanced Classification: An Industrial Case Study

DEFF Research Database (Denmark)

Khan, Abdul Rauf; Schiøler, Henrik; Kulahci, Murat

2017-01-01

In this article we discuss the issue of selecting suitable objective function for Genetic Algorithm to solve an imbalanced classification problem. More precisely, first we discuss the need of specialized objective function to solve a real classification problem from our industrial partner and the...... and then we compare the results of our proposed objective function with commonly used candidates to serve this purpose. Our comparison is based on the analysis of real data collected during the quality control stages of the manufacturing process....
Global case studies of soft-sediment deformation structures (SSDS: Definitions, classifications, advances, origins, and problems

Directory of Open Access Journals (Sweden)

G. Shanmugam

2017-10-01

Problems that hinder our understanding of SSDS still remain. They are: (1 vague definitions of the phrase “soft-sediment deformation”; (2 complex factors that govern the origin of SSDS; (3 omission of vital empirical data in documenting vertical changes in facies using measured sedimentological logs; (4 difficulties in distinguishing depositional processes from tectonic events; (5 a model-driven interpretation of SSDS (i.e., earthquake being the singular cause; (6 routine application of the genetic term “seismites” to the “SSDS”, thus undermining the basic tenet of process sedimentology (i.e., separation of interpretation from observation; (7 the absence of objective criteria to differentiate 21 triggering mechanisms of liquefaction and related SSDS; (8 application of the process concept “high-density turbidity currents”, a process that has never been documented in modern oceans; (9 application of the process concept “sediment creep” with a velocity connotation that cannot be inferred from the ancient record; (10 classification of pockmarks, which are hollow spaces (i.e., without sediments as SSDS, with their problematic origins by fluid expulsion, sediment degassing, fish activity, etc.; (11 application of the Earth's climate-change model; and most importantly, (12 an arbitrary distinction between depositional process and sediment deformation. Despite a profusion of literature on SSDS, our understanding of their origin remains muddled. A solution to the chronic SSDS problem is to utilize the robust core dataset from scientific drilling at sea (DSDP/ODP/IODP with a constrained definition of SSDS.
Predictive Manufacturing: Classification of categorical data

DEFF Research Database (Denmark)

Khan, Abdul Rauf; Schiøler, Henrik; Kulahci, Murat

2018-01-01

and classification capabilities of our methodology (on different experimental settings) is done through a specially designed simulation experiment. Secondly, in order to demonstrate the applicability in a real life problem a data set from electronics component manufacturing is being analysed through our proposed...
Integrated tracking, classification, and sensor management theory and applications

CERN Document Server

Krishnamurthy, Vikram; Vo, Ba-Ngu

2012-01-01

A unique guide to the state of the art of tracking, classification, and sensor management. This book addresses the tremendous progress made over the last few decades in algorithm development and mathematical analysis for filtering, multi-target multi-sensor tracking, sensor management and control, and target classification. It provides for the first time an integrated treatment of these advanced topics, complete with careful mathematical formulation, clear description of the theory, and real-world applications. Written by experts in the field, Integrated Tracking, Classification, and Sensor Management provides readers with easy access to key Bayesian modeling and filtering methods, multi-target tracking approaches, target classification procedures, and large scale sensor management problem-solving techniques.
Congenital muscular dystrophies--problems of classification.

Science.gov (United States)

Lenard, H G

1991-04-01

The classification of congenital muscular dystrophies (CMD), based on perceived clinical and morphological similarities or differences, is controversial. CMD without cerebral involvement has sometimes been divided into a mild and a severe form. This distinction is, however, arbitrary and not uncontested. Whether Ullrich's disease, formerly called atonic-sclerotic dystrophy, is a disease entity and if so, whether it is a primary muscle disorder, is uncertain. CMD without cerebral involvement is inherited in an autosomal recessive fashion in the great majority of cases. CMDs with cerebral involvement are usually classified into at least three forms: the Fukuyama type of CMD, occurring almost exclusively in Japanese patients; CMD with hypomyelination, sometimes also called the occidental type of cerebromuscular dystrophy; and Walker-Warburg syndrome. Muscle-eye-brain disease, described in a number of Finnish patients, may or may not belong in this last category. In CMD with cerebral involvement inheritance is also autosomal recessive. It is possible that single sporadic cases are phenocopies due to infectious or other exogenous causes. Reports of clinical and morphological findings from an increasing number of patients show a high degree of variability within and, on the other hand, certain similarities between the forms of CMD with cerebral involvement. In addition, neuroradiological changes are also found with increasing frequency in CMD patients without clinical neuropsychological abnormalities. It is not unreasonable to speculate that molecular genetic techniques will reveal in the near future a variable defect in one gene locus or defects in a few gene loci as the cause of the various clinical forms of CMDs.

A rule-learning program in high energy physics event classification

International Nuclear Information System (INIS)

Clearwater, S.H.; Stern, E.G.

1991-01-01

We have applied a rule-learning program to the problem of event classification in high energy physics. The program searches for event classifications, i.e. rules, and effectively allows an exploration of many more possible classifications than is practical by a physicist. The program, RL4, is particularly useful because it can easily explore multi-dimensional rules as well as rules that may seem non-intuitive at first to the physicist. RL4 is also contrasted with other learning programs. (orig.)
High Dimensional Classification Using Features Annealed Independence Rules.

Science.gov (United States)

Fan, Jianqing; Fan, Yingying

2008-01-01

Classification using high-dimensional features arises frequently in many contemporary statistical studies such as tumor classification using microarray or other high-throughput data. The impact of dimensionality on classifications is largely poorly understood. In a seminal paper, Bickel and Levina (2004) show that the Fisher discriminant performs poorly due to diverging spectra and they propose to use the independence rule to overcome the problem. We first demonstrate that even for the independence classification rule, classification using all the features can be as bad as the random guessing due to noise accumulation in estimating population centroids in high-dimensional feature space. In fact, we demonstrate further that almost all linear discriminants can perform as bad as the random guessing. Thus, it is paramountly important to select a subset of important features for high-dimensional classification, resulting in Features Annealed Independence Rules (FAIR). The conditions under which all the important features can be selected by the two-sample t-statistic are established. The choice of the optimal number of features, or equivalently, the threshold value of the test statistics are proposed based on an upper bound of the classification error. Simulation studies and real data analysis support our theoretical results and demonstrate convincingly the advantage of our new classification procedure.
Cell heterogeneity problems in the analysis of zero power experiments

International Nuclear Information System (INIS)

Grimstone, M.J.; Stevenson, J.M.

1979-01-01

Methods are described for treating plate and pin cell heterogeneity in the preparation of broad group cross-sections used in the analysis of zero power fast reactor experiments. Methods used at Karlsruhe and Winfrith are summarised and compared, with particular reference to the treatment of resonance shielding, the calculation of broad group spatial fine structure, the treatment of leakage and the calculation of anisotropic diffusion coefficients. The problems of cells near boundaries such as core-breeder interfaces and of singularities such as control rods are also considered briefly. Numerical studies carried out to investigate approximations in the methods are described. These include tests of the accuracy of one-dimensional cell modelling techniques, and the validation by Monte Carlo of methods for treating streaming in the calculation of diffusion coefficients. Comparisons are shown between the heterogeneity effects calculated by the Karlsruhe and Winfrith methods for typical pin and plate cells used in the BIZET experimental programme, and their effect in a whole reactor calculation is indicated. Comparisons are given with measurements which provide tests of the heterogeneity calculations. These include reaction rate scans within pin and plate cells, and reaction rate measurements across sectors of pin and plate fuel, where the flux tilt is determined by the relative reactivity of the pin and plate cells. Finally, the heterogeneity problems arising in the interpretation of reaction rate measurements are discussed. (author)
Objects Classification by Learning-Based Visual Saliency Model and Convolutional Neural Network.

Science.gov (United States)

Li, Na; Zhao, Xinbo; Yang, Yongjia; Zou, Xiaochun

2016-01-01

Humans can easily classify different kinds of objects whereas it is quite difficult for computers. As a hot and difficult problem, objects classification has been receiving extensive interests with broad prospects. Inspired by neuroscience, deep learning concept is proposed. Convolutional neural network (CNN) as one of the methods of deep learning can be used to solve classification problem. But most of deep learning methods, including CNN, all ignore the human visual information processing mechanism when a person is classifying objects. Therefore, in this paper, inspiring the completed processing that humans classify different kinds of objects, we bring forth a new classification method which combines visual attention model and CNN. Firstly, we use the visual attention model to simulate the processing of human visual selection mechanism. Secondly, we use CNN to simulate the processing of how humans select features and extract the local features of those selected areas. Finally, not only does our classification method depend on those local features, but also it adds the human semantic features to classify objects. Our classification method has apparently advantages in biology. Experimental results demonstrated that our method made the efficiency of classification improve significantly.
Comparison Of Power Quality Disturbances Classification Based On Neural Network

Directory of Open Access Journals (Sweden)

Nway Nway Kyaw Win

2015-07-01

Full Text Available Abstract Power quality disturbances PQDs result serious problems in the reliability safety and economy of power system network. In order to improve electric power quality events the detection and classification of PQDs must be made type of transient fault. Software analysis of wavelet transform with multiresolution analysis MRA algorithm and feed forward neural network probabilistic and multilayer feed forward neural network based methodology for automatic classification of eight types of PQ signals flicker harmonics sag swell impulse fluctuation notch and oscillatory will be presented. The wavelet family Db4 is chosen in this system to calculate the values of detailed energy distributions as input features for classification because it can perform well in detecting and localizing various types of PQ disturbances. This technique classifies the types of PQDs problem sevents.The classifiers classify and identify the disturbance type according to the energy distribution. The results show that the PNN can analyze different power disturbance types efficiently. Therefore it can be seen that PNN has better classification accuracy than MLFF.
Automated Processing of Imaging Data through Multi-tiered Classification of Biological Structures Illustrated Using Caenorhabditis elegans.

Directory of Open Access Journals (Sweden)

Mei Zhan

2015-04-01

Full Text Available Quantitative imaging has become a vital technique in biological discovery and clinical diagnostics; a plethora of tools have recently been developed to enable new and accelerated forms of biological investigation. Increasingly, the capacity for high-throughput experimentation provided by new imaging modalities, contrast techniques, microscopy tools, microfluidics and computer controlled systems shifts the experimental bottleneck from the level of physical manipulation and raw data collection to automated recognition and data processing. Yet, despite their broad importance, image analysis solutions to address these needs have been narrowly tailored. Here, we present a generalizable formulation for autonomous identification of specific biological structures that is applicable for many problems. The process flow architecture we present here utilizes standard image processing techniques and the multi-tiered application of classification models such as support vector machines (SVM. These low-level functions are readily available in a large array of image processing software packages and programming languages. Our framework is thus both easy to implement at the modular level and provides specific high-level architecture to guide the solution of more complicated image-processing problems. We demonstrate the utility of the classification routine by developing two specific classifiers as a toolset for automation and cell identification in the model organism Caenorhabditis elegans. To serve a common need for automated high-resolution imaging and behavior applications in the C. elegans research community, we contribute a ready-to-use classifier for the identification of the head of the animal under bright field imaging. Furthermore, we extend our framework to address the pervasive problem of cell-specific identification under fluorescent imaging, which is critical for biological investigation in multicellular organisms or tissues. Using these examples as a
Automated Processing of Imaging Data through Multi-tiered Classification of Biological Structures Illustrated Using Caenorhabditis elegans.

Science.gov (United States)

Zhan, Mei; Crane, Matthew M; Entchev, Eugeni V; Caballero, Antonio; Fernandes de Abreu, Diana Andrea; Ch'ng, QueeLim; Lu, Hang

2015-04-01

Quantitative imaging has become a vital technique in biological discovery and clinical diagnostics; a plethora of tools have recently been developed to enable new and accelerated forms of biological investigation. Increasingly, the capacity for high-throughput experimentation provided by new imaging modalities, contrast techniques, microscopy tools, microfluidics and computer controlled systems shifts the experimental bottleneck from the level of physical manipulation and raw data collection to automated recognition and data processing. Yet, despite their broad importance, image analysis solutions to address these needs have been narrowly tailored. Here, we present a generalizable formulation for autonomous identification of specific biological structures that is applicable for many problems. The process flow architecture we present here utilizes standard image processing techniques and the multi-tiered application of classification models such as support vector machines (SVM). These low-level functions are readily available in a large array of image processing software packages and programming languages. Our framework is thus both easy to implement at the modular level and provides specific high-level architecture to guide the solution of more complicated image-processing problems. We demonstrate the utility of the classification routine by developing two specific classifiers as a toolset for automation and cell identification in the model organism Caenorhabditis elegans. To serve a common need for automated high-resolution imaging and behavior applications in the C. elegans research community, we contribute a ready-to-use classifier for the identification of the head of the animal under bright field imaging. Furthermore, we extend our framework to address the pervasive problem of cell-specific identification under fluorescent imaging, which is critical for biological investigation in multicellular organisms or tissues. Using these examples as a guide, we envision
PROBLEMS AND CLASSIFICATION OF FORMER MILITARY AREAS

Directory of Open Access Journals (Sweden)

Svirezhev C.A.

2014-09-01

Full Text Available Integration of the Russian Federation in the international community, to find the most effective ways to implement the military and land reforms require a comprehensive study. The paper identifies the main problems that hinder the effective implementation of the reform of the conversion, the ways of their solutions, including use of the experience of the advanced countries of the European Union. Identified military objects to be conversion, shown combining them into groups according to various criteria. Proposed a typology of ex-military territories. Notes the role of the organization of effective land use conversion in the areas of land use planning, identifies the main documents required for the implementation of planned activities. The problems of land use planning conversion ex-military territories.
Preliminary discussion on the classification of uranium deposits in China

International Nuclear Information System (INIS)

Zhou Weixun; Liu Xinzhong; Wang Zubang.

1991-01-01

The classification of uranium deposits is a comprehensive and complicated problem which is of great importance for the guide in prospecting and exploration. The authors review the merits and shortcomings of various classifications sumitted by uranium geologists in the world based on origin, geotectonics and host rocks. Considering the reasonable parts in previous classifications and characteristics of uranium metallogenesis in China, the authors suggest a new classification of uranium deposits of China mainly according to host rocks, and also deposits' structure and morphology of ore bodies. This classification is composed of 7 goups divided into 25 subgroups. Finally, an indication and explanation are presented in order to draw attention of the Chinese uranium geologists and make further discussions among them
An adaptive simplex cut-cell method for high-order discontinuous Galerkin discretizations of elliptic interface problems and conjugate heat transfer problems

Science.gov (United States)

Sun, Huafei; Darmofal, David L.

2014-12-01

In this paper we propose a new high-order solution framework for interface problems on non-interface-conforming meshes. The framework consists of a discontinuous Galerkin (DG) discretization, a simplex cut-cell technique, and an output-based adaptive scheme. We first present a DG discretization with a dual-consistent output evaluation for elliptic interface problems on interface-conforming meshes, and then extend the method to handle multi-physics interface problems, in particular conjugate heat transfer (CHT) problems. The method is then applied to non-interface-conforming meshes using a cut-cell technique, where the interface definition is completely separate from the mesh generation process. No assumption is made on the interface shape (other than Lipschitz continuity). We then equip our strategy with an output-based adaptive scheme for an accurate output prediction. Through numerical examples, we demonstrate high-order convergence for elliptic interface problems and CHT problems with both smooth and non-smooth interface shapes.
Dense Iterative Contextual Pixel Classification using Kriging

DEFF Research Database (Denmark)

Ganz, Melanie; Loog, Marco; Brandt, Sami

2009-01-01

have been proposed to this end, e.g., iterative contextual pixel classification, iterated conditional modes, and other approaches related to Markov random fields. A problem of these methods, however, is their computational complexity, especially when dealing with high-resolution images in which......In medical applications, segmentation has become an ever more important task. One of the competitive schemes to perform such segmentation is by means of pixel classification. Simple pixel-based classification schemes can be improved by incorporating contextual label information. Various methods...... relatively long range interactions may play a role. We propose a new method based on Kriging that makes it possible to include such long range interactions, while keeping the computations manageable when dealing with large medical images....
Towards Automatic Classification of Wikipedia Content

Science.gov (United States)

Szymański, Julian

Wikipedia - the Free Encyclopedia encounters the problem of proper classification of new articles everyday. The process of assignment of articles to categories is performed manually and it is a time consuming task. It requires knowledge about Wikipedia structure, which is beyond typical editor competence, which leads to human-caused mistakes - omitting or wrong assignments of articles to categories. The article presents application of SVM classifier for automatic classification of documents from The Free Encyclopedia. The classifier application has been tested while using two text representations: inter-documents connections (hyperlinks) and word content. The results of the performed experiments evaluated on hand crafted data show that the Wikipedia classification process can be partially automated. The proposed approach can be used for building a decision support system which suggests editors the best categories that fit new content entered to Wikipedia.
Automated image processing method for the diagnosis and classification of malaria on thin blood smears.

Science.gov (United States)

Ross, Nicholas E; Pritchard, Charles J; Rubin, David M; Dusé, Adriano G

2006-05-01

Malaria is a serious global health problem, and rapid, accurate diagnosis is required to control the disease. An image processing algorithm to automate the diagnosis of malaria on thin blood smears is developed. The image classification system is designed to positively identify malaria parasites present in thin blood smears, and differentiate the species of malaria. Images are acquired using a charge-coupled device camera connected to a light microscope. Morphological and novel threshold selection techniques are used to identify erythrocytes (red blood cells) and possible parasites present on microscopic slides. Image features based on colour, texture and the geometry of the cells and parasites are generated, as well as features that make use of a priori knowledge of the classification problem and mimic features used by human technicians. A two-stage tree classifier using backpropogation feedforward neural networks distinguishes between true and false positives, and then diagnoses the species (Plasmodium falciparum, P. vivax, P. ovale or P. malariae) of the infection. Malaria samples obtained from the Department of Clinical Microbiology and Infectious Diseases at the University of the Witwatersrand Medical School are used for training and testing of the system. Infected erythrocytes are positively identified with a sensitivity of 85% and a positive predictive value (PPV) of 81%, which makes the method highly sensitive at diagnosing a complete sample provided many views are analysed. Species were correctly determined for 11 out of 15 samples.
Waste classification sampling plan

International Nuclear Information System (INIS)

Landsman, S.D.

1998-01-01

The purpose of this sampling is to explain the method used to collect and analyze data necessary to verify and/or determine the radionuclide content of the B-Cell decontamination and decommissioning waste stream so that the correct waste classification for the waste stream can be made, and to collect samples for studies of decontamination methods that could be used to remove fixed contamination present on the waste. The scope of this plan is to establish the technical basis for collecting samples and compiling quantitative data on the radioactive constituents present in waste generated during deactivation activities in B-Cell. Sampling and radioisotopic analysis will be performed on the fixed layers of contamination present on structural material and internal surfaces of process piping and tanks. In addition, dose rate measurements on existing waste material will be performed to determine the fraction of dose rate attributable to both removable and fixed contamination. Samples will also be collected to support studies of decontamination methods that are effective in removing the fixed contamination present on the waste. Sampling performed under this plan will meet criteria established in BNF-2596, Data Quality Objectives for the B-Cell Waste Stream Classification Sampling, J. M. Barnett, May 1998
VOCAL SEGMENT CLASSIFICATION IN POPULAR MUSIC

DEFF Research Database (Denmark)

Feng, Ling; Nielsen, Andreas Brinch; Hansen, Lars Kai

2008-01-01

This paper explores the vocal and non-vocal music classification problem within popular songs. A newly built labeled database covering 147 popular songs is announced. It is designed for classifying signals from 1sec time windows. Features are selected for this particular task, in order to capture...
Cell shape characterization and classification with discrete Fourier transforms and self-organizing maps.

Science.gov (United States)

Kriegel, Fabian L; Köhler, Ralf; Bayat-Sarmadi, Jannike; Bayerl, Simon; Hauser, Anja E; Niesner, Raluca; Luch, Andreas; Cseresnyes, Zoltan

2018-03-01

Cells in their natural environment often exhibit complex kinetic behavior and radical adjustments of their shapes. This enables them to accommodate to short- and long-term changes in their surroundings under physiological and pathological conditions. Intravital multi-photon microscopy is a powerful tool to record this complex behavior. Traditionally, cell behavior is characterized by tracking the cells' movements, which yields numerous parameters describing the spatiotemporal characteristics of cells. Cells can be classified according to their tracking behavior using all or a subset of these kinetic parameters. This categorization can be supported by the a priori knowledge of experts. While such an approach provides an excellent starting point for analyzing complex intravital imaging data, faster methods are required for automated and unbiased characterization. In addition to their kinetic behavior, the 3D shape of these cells also provide essential clues about the cells' status and functionality. New approaches that include the study of cell shapes as well may also allow the discovery of correlations amongst the track- and shape-describing parameters. In the current study, we examine the applicability of a set of Fourier components produced by Discrete Fourier Transform (DFT) as a tool for more efficient and less biased classification of complex cell shapes. By carrying out a number of 3D-to-2D projections of surface-rendered cells, the applied method reduces the more complex 3D shape characterization to a series of 2D DFTs. The resulting shape factors are used to train a Self-Organizing Map (SOM), which provides an unbiased estimate for the best clustering of the data, thereby characterizing groups of cells according to their shape. We propose and demonstrate that such shape characterization is a powerful addition to, or a replacement for kinetic analysis. This would make it especially useful in situations where live kinetic imaging is less practical or not
IRIS COLOUR CLASSIFICATION SCALES--THEN AND NOW.

Science.gov (United States)

Grigore, Mariana; Avram, Alina

2015-01-01

Eye colour is one of the most obvious phenotypic traits of an individual. Since the first documented classification scale developed in 1843, there have been numerous attempts to classify the iris colour. In the past centuries, iris colour classification scales has had various colour categories and mostly relied on comparison of an individual's eye with painted glass eyes. Once photography techniques were refined, standard iris photographs replaced painted eyes, but this did not solve the problem of painted/ printed colour variability in time. Early clinical scales were easy to use, but lacked objectivity and were not standardised or statistically tested for reproducibility. The era of automated iris colour classification systems came with the technological development. Spectrophotometry, digital analysis of high-resolution iris images, hyper spectral analysis of the human real iris and the dedicated iris colour analysis software, all accomplished an objective, accurate iris colour classification, but are quite expensive and limited in use to research environment. Iris colour classification systems evolved continuously due to their use in a wide range of studies, especially in the fields of anthropology, epidemiology and genetics. Despite the wide range of the existing scales, up until present there has been no generally accepted iris colour classification scale.
Acute leukemia classification by ensemble particle swarm model selection.

Science.gov (United States)

Escalante, Hugo Jair; Montes-y-Gómez, Manuel; González, Jesús A; Gómez-Gil, Pilar; Altamirano, Leopoldo; Reyes, Carlos A; Reta, Carolina; Rosales, Alejandro

2012-07-01

Acute leukemia is a malignant disease that affects a large proportion of the world population. Different types and subtypes of acute leukemia require different treatments. In order to assign the correct treatment, a physician must identify the leukemia type or subtype. Advanced and precise methods are available for identifying leukemia types, but they are very expensive and not available in most hospitals in developing countries. Thus, alternative methods have been proposed. An option explored in this paper is based on the morphological properties of bone marrow images, where features are extracted from medical images and standard machine learning techniques are used to build leukemia type classifiers. This paper studies the use of ensemble particle swarm model selection (EPSMS), which is an automated tool for the selection of classification models, in the context of acute leukemia classification. EPSMS is the application of particle swarm optimization to the exploration of the search space of ensembles that can be formed by heterogeneous classification models in a machine learning toolbox. EPSMS does not require prior domain knowledge and it is able to select highly accurate classification models without user intervention. Furthermore, specific models can be used for different classification tasks. We report experimental results for acute leukemia classification with real data and show that EPSMS outperformed the best results obtained using manually designed classifiers with the same data. The highest performance using EPSMS was of 97.68% for two-type classification problems and of 94.21% for more than two types problems. To the best of our knowledge, these are the best results reported for this data set. Compared with previous studies, these improvements were consistent among different type/subtype classification tasks, different features extracted from images, and different feature extraction regions. The performance improvements were statistically significant
Classification of Polarimetric SAR Data Using Dictionary Learning

DEFF Research Database (Denmark)

Vestergaard, Jacob Schack; Nielsen, Allan Aasbjerg; Dahl, Anders Lindbjerg

2012-01-01

This contribution deals with classification of multilook fully polarimetric synthetic aperture radar (SAR) data by learning a dictionary of crop types present in the Foulum test site. The Foulum test site contains a large number of agricultural fields, as well as lakes, forests, natural vegetation......, grasslands and urban areas, which make it ideally suited for evaluation of classification algorithms. Dictionary learning centers around building a collection of image patches typical for the classification problem at hand. This requires initial manual labeling of the classes present in the data and is thus...... a method for supervised classification. Sparse coding of these image patches aims to maintain a proficient number of typical patches and associated labels. Data is consecutively classified by a nearest neighbor search of the dictionary elements and labeled with probabilities of each class. Each dictionary...
Classifying Classifications

DEFF Research Database (Denmark)

Debus, Michael S.

2017-01-01

This paper critically analyzes seventeen game classifications. The classifications were chosen on the basis of diversity, ranging from pre-digital classification (e.g. Murray 1952), over game studies classifications (e.g. Elverdam & Aarseth 2007) to classifications of drinking games (e.g. LaBrie et...... al. 2013). The analysis aims at three goals: The classifications’ internal consistency, the abstraction of classification criteria and the identification of differences in classification across fields and/or time. Especially the abstraction of classification criteria can be used in future endeavors...... into the topic of game classifications....

New decision support tool for acute lymphoblastic leukemia classification

Science.gov (United States)

Madhukar, Monica; Agaian, Sos; Chronopoulos, Anthony T.

2012-03-01

In this paper, we build up a new decision support tool to improve treatment intensity choice in childhood ALL. The developed system includes different methods to accurately measure furthermore cell properties in microscope blood film images. The blood images are exposed to series of pre-processing steps which include color correlation, and contrast enhancement. By performing K-means clustering on the resultant images, the nuclei of the cells under consideration are obtained. Shape features and texture features are then extracted for classification. The system is further tested on the classification of spectra measured from the cell nuclei in blood samples in order to distinguish normal cells from those affected by Acute Lymphoblastic Leukemia. The results show that the proposed system robustly segments and classifies acute lymphoblastic leukemia based on complete microscopic blood images.
Heuristic Classification. Technical Report Number 12.

Science.gov (United States)

Clancey, William J.

A broad range of well-structured problems--embracing forms of diagnosis, catalog selection, and skeletal planning--are solved in expert computer systems by the method of heuristic classification. These programs have a characteristic inference structure that systematically relates data to a pre-enumerated set of solutions by abstraction, heuristic…
A contextual image segmentation system using a priori information for automatic data classification in nuclear physics

International Nuclear Information System (INIS)

Benkirane, A.; Auger, G.; Chbihi, A.; Bloyet, D.; Plagnol, E.

1994-01-01

This paper presents an original approach to solve an automatic data classification problem by means of image processing techniques. The classification is achieved using image segmentation techniques for extracting the meaningful classes. Two types of information are merged for this purpose: the information contained in experimental images and a priori information derived from underlying physics (and adapted to image segmentation problem). This data fusion is widely used at different stages of the segmentation process. This approach yields interesting results in terms of segmentation performances, even in very noisy cases. Satisfactory classification results are obtained in cases where more ''classical'' automatic data classification methods fail. (authors). 25 refs., 14 figs., 1 append
A contextual image segmentation system using a priori information for automatic data classification in nuclear physics

Energy Technology Data Exchange (ETDEWEB)

Benkirane, A; Auger, G; Chbihi, A [Grand Accelerateur National d` Ions Lourds (GANIL), 14 - Caen (France); Bloyet, D [Caen Univ., 14 (France); Plagnol, E [Paris-11 Univ., 91 - Orsay (France). Inst. de Physique Nucleaire

1994-12-31

This paper presents an original approach to solve an automatic data classification problem by means of image processing techniques. The classification is achieved using image segmentation techniques for extracting the meaningful classes. Two types of information are merged for this purpose: the information contained in experimental images and a priori information derived from underlying physics (and adapted to image segmentation problem). This data fusion is widely used at different stages of the segmentation process. This approach yields interesting results in terms of segmentation performances, even in very noisy cases. Satisfactory classification results are obtained in cases where more ``classical`` automatic data classification methods fail. (authors). 25 refs., 14 figs., 1 append.
Classification of EEG Signals using adaptive weighted distance nearest neighbor algorithm

Directory of Open Access Journals (Sweden)

E. Parvinnia

2014-01-01

Full Text Available Electroencephalogram (EEG signals are often used to diagnose diseases such as seizure, alzheimer, and schizophrenia. One main problem with the recorded EEG samples is that they are not equally reliable due to the artifacts at the time of recording. EEG signal classification algorithms should have a mechanism to handle this issue. It seems that using adaptive classifiers can be useful for the biological signals such as EEG. In this paper, a general adaptive method named weighted distance nearest neighbor (WDNN is applied for EEG signal classification to tackle this problem. This classification algorithm assigns a weight to each training sample to control its influence in classifying test samples. The weights of training samples are used to find the nearest neighbor of an input query pattern. To assess the performance of this scheme, EEG signals of thirteen schizophrenic patients and eighteen normal subjects are analyzed for the classification of these two groups. Several features including, fractal dimension, band power and autoregressive (AR model are extracted from EEG signals. The classification results are evaluated using Leave one (subject out cross validation for reliable estimation. The results indicate that combination of WDNN and selected features can significantly outperform the basic nearest-neighbor and the other methods proposed in the past for the classification of these two groups. Therefore, this method can be a complementary tool for specialists to distinguish schizophrenia disorder.
NEW CLASSIFICATION OF ECOPOLICES

Directory of Open Access Journals (Sweden)

VOROBYOV V. V.

2016-09-01

Full Text Available Problem statement. Ecopolices are the newest stage of the urban planning. They have to be consideredsuchas material and energy informational structures, included to the dynamic-evolutionary matrix netsofex change processes in the ecosystems. However, there are not made the ecopolice classifications, developing on suchapproaches basis. And this determined the topicality of the article. Analysis of publications on theoretical and applied aspects of the ecopolices formation showed, that the work on them is managed mainly in the context of the latest scientific and technological achievements in the various knowledge fields. These settlements are technocratic. They are connected with the morphology of space, network structures of regional and local natural ecosystems, without independent stability, can not exist without continuous man support. Another words, they do not work in with an ecopolices idea. It is come to a head for objective, symbiotic searching of ecopolices concept with the development of their classifications. Purpose statement is to develop the objective evidence for ecopolices and to propose their new classification. Conclusion. On the base of the ecopolices classification have to lie an elements correlation idea of their general plans and men activity type according with natural mechanism of accepting, reworking and transmission of material, energy and information between geo-ecosystems, planet, man, ecopolices material part and Cosmos. New ecopolices classification should be based on the principles of multi-dimensional, time-spaced symbiotic clarity with exchange ecosystem networks. The ecopolice function with this approach comes not from the subjective anthropocentric economy but from the holistic objective of Genesis paradigm. Or, otherwise - not from the Consequence, but from the Cause.
Application of machine learning on brain cancer multiclass classification

Science.gov (United States)

Panca, V.; Rustam, Z.

2017-07-01

Classification of brain cancer is a problem of multiclass classification. One approach to solve this problem is by first transforming it into several binary problems. The microarray gene expression dataset has the two main characteristics of medical data: extremely many features (genes) and only a few number of samples. The application of machine learning on microarray gene expression dataset mainly consists of two steps: feature selection and classification. In this paper, the features are selected using a method based on support vector machine recursive feature elimination (SVM-RFE) principle which is improved to solve multiclass classification, called multiple multiclass SVM-RFE. Instead of using only the selected features on a single classifier, this method combines the result of multiple classifiers. The features are divided into subsets and SVM-RFE is used on each subset. Then, the selected features on each subset are put on separate classifiers. This method enhances the feature selection ability of each single SVM-RFE. Twin support vector machine (TWSVM) is used as the method of the classifier to reduce computational complexity. While ordinary SVM finds single optimum hyperplane, the main objective Twin SVM is to find two non-parallel optimum hyperplanes. The experiment on the brain cancer microarray gene expression dataset shows this method could classify 71,4% of the overall test data correctly, using 100 and 1000 genes selected from multiple multiclass SVM-RFE feature selection method. Furthermore, the per class results show that this method could classify data of normal and MD class with 100% accuracy.
Comparison Effectiveness of Pixel Based Classification and Object Based Classification Using High Resolution Image In Floristic Composition Mapping (Study Case: Gunung Tidar Magelang City)

Science.gov (United States)

Ardha Aryaguna, Prama; Danoedoro, Projo

2016-11-01

Developments of analysis remote sensing have same way with development of technology especially in sensor and plane. Now, a lot of image have high spatial and radiometric resolution, that's why a lot information. Vegetation object analysis such floristic composition got a lot advantage of that development. Floristic composition can be interpreted using a lot of method such pixel based classification and object based classification. The problems for pixel based method on high spatial resolution image are salt and paper who appear in result of classification. The purpose of this research are compare effectiveness between pixel based classification and object based classification for composition vegetation mapping on high resolution image Worldview-2. The results show that pixel based classification using majority 5×5 kernel windows give the highest accuracy between another classifications. The highest accuracy is 73.32% from image Worldview-2 are being radiometric corrected level surface reflectance, but for overall accuracy in every class, object based are the best between another methods. Reviewed from effectiveness aspect, pixel based are more effective then object based for vegetation composition mapping in Tidar forest.
Automated Decision Tree Classification of Corneal Shape

Science.gov (United States)

Twa, Michael D.; Parthasarathy, Srinivasan; Roberts, Cynthia; Mahmoud, Ashraf M.; Raasch, Thomas W.; Bullimore, Mark A.

2011-01-01

Purpose The volume and complexity of data produced during videokeratography examinations present a challenge of interpretation. As a consequence, results are often analyzed qualitatively by subjective pattern recognition or reduced to comparisons of summary indices. We describe the application of decision tree induction, an automated machine learning classification method, to discriminate between normal and keratoconic corneal shapes in an objective and quantitative way. We then compared this method with other known classification methods. Methods The corneal surface was modeled with a seventh-order Zernike polynomial for 132 normal eyes of 92 subjects and 112 eyes of 71 subjects diagnosed with keratoconus. A decision tree classifier was induced using the C4.5 algorithm, and its classification performance was compared with the modified Rabinowitz–McDonnell index, Schwiegerling’s Z3 index (Z3), Keratoconus Prediction Index (KPI), KISA%, and Cone Location and Magnitude Index using recommended classification thresholds for each method. We also evaluated the area under the receiver operator characteristic (ROC) curve for each classification method. Results Our decision tree classifier performed equal to or better than the other classifiers tested: accuracy was 92% and the area under the ROC curve was 0.97. Our decision tree classifier reduced the information needed to distinguish between normal and keratoconus eyes using four of 36 Zernike polynomial coefficients. The four surface features selected as classification attributes by the decision tree method were inferior elevation, greater sagittal depth, oblique toricity, and trefoil. Conclusions Automated decision tree classification of corneal shape through Zernike polynomials is an accurate quantitative method of classification that is interpretable and can be generated from any instrument platform capable of raw elevation data output. This method of pattern classification is extendable to other classification
Advances in the classification and treatment of mastocytosis

DEFF Research Database (Denmark)

Valent, Peter; Akin, Cem; Hartmann, Karin

2017-01-01

Mastocytosis is a term used to denote a heterogeneous group of conditions defined by the expansion and accumulation of clonal (neoplastic) tissue mast cells in various organs. The classification of the World Health Organization (WHO) divides the disease into cutaneous mastocytosis, systemic...... leukemia. The clinical impact and prognostic value of this classification has been confirmed in numerous studies, and its basic concept remains valid. However, refinements have recently been proposed by the consensus group, the WHO, and the European Competence Network on Mastocytosis. In addition, new...... of mastocytosis, with emphasis on classification, prognostication, and emerging new treatment options in advanced systemic mastocytosis....
Adaptive SVM for Data Stream Classification

Directory of Open Access Journals (Sweden)

Isah A. Lawal

2017-07-01

Full Text Available In this paper, we address the problem of learning an adaptive classifier for the classification of continuous streams of data. We present a solution based on incremental extensions of the Support Vector Machine (SVM learning paradigm that updates an existing SVM whenever new training data are acquired. To ensure that the SVM effectiveness is guaranteed while exploiting the newly gathered data, we introduce an on-line model selection approach in the incremental learning process. We evaluated the proposed method on real world applications including on-line spam email filtering and human action classification from videos. Experimental results show the effectiveness and the potential of the proposed approach.
Mixed isogeometric finite cell methods for the stokes problem

NARCIS (Netherlands)

Hoang, T.; Verhoosel, C.V.; Auricchio, F.; van Brummelen, E.H.; Reali, A.

2017-01-01

We study the application of the Isogeometric Finite Cell Method (IGA-FCM) to mixed formulations in the context of the Stokes problem. We investigate the performance of the IGA-FCM when utilizing some isogeometric mixed finite elements, namely: Taylor-Hood, Sub-grid, Raviart-Thomas, and Nédélec
Maternal cell phone and cordless phone use during pregnancy and behaviour problems in 5-year-old children.

Science.gov (United States)

Guxens, Mònica; van Eijsden, Manon; Vermeulen, Roel; Loomans, Eva; Vrijkotte, Tanja G M; Komhout, Hans; van Strien, Rob T; Huss, Anke

2013-05-01

A previous study found an association between maternal cell phone use during pregnancy and maternal-reported child behaviour problems at age 7. Together with cell phones, cordless phones represent the main exposure source of radiofrequency-electromagnetic fields to the head. Therefore, we assessed the association between maternal cell phone and cordless phone use during pregnancy and teacher-reported and maternal-reported child behaviour problems at age 5. The study was embedded in the Amsterdam Born Children and their Development study, a population-based birth cohort study in Amsterdam, the Netherlands (2003-2004). Teachers and mothers reported child behaviour problems using the Strength and Difficulties Questionnaire at age 5. Maternal cell phone and cordless phone use during pregnancy was asked when children were 7 years old. A total of 2618 children were included. As compared to non-users, those exposed to prenatal cell phone use showed an increased but non-significant association of having teacher-reported overall behaviour problems, although without dose-response relationship with the number of calls (OR=2.12 (95% CI 0.95 to 4.74) for cell phone and cordless phone use with maternal-reported overall behaviour problems remained non-significant. Non-significant associations were found for the specific behaviour problem subscales. Our results do not suggest that maternal cell phone or cordless phone use during pregnancy increases the odds of behaviour problems in their children.
Data preprocessing techniques for classification without discrimination

NARCIS (Netherlands)

Kamiran, F.; Calders, T.G.K.

2012-01-01

Recently, the following Discrimination-Aware Classification Problem was introduced: Suppose we are given training data that exhibit unlawful discrimination; e.g., toward sensitive attributes such as gender or ethnicity. The task is to learn a classifier that optimizes accuracy, but does not have
Classification and data acquisition with incomplete data

Science.gov (United States)

Williams, David P.

In remote-sensing applications, incomplete data can result when only a subset of sensors (e.g., radar, infrared, acoustic) are deployed at certain regions. The limitations of single sensor systems have spurred interest in employing multiple sensor modalities simultaneously. For example, in land mine detection tasks, different sensor modalities are better-suited to capture different aspects of the underlying physics of the mines. Synthetic aperture radar sensors may be better at detecting surface mines, while infrared sensors may be better at detecting buried mines. By employing multiple sensor modalities to address the detection task, the strengths of the disparate sensors can be exploited in a synergistic manner to improve performance beyond that which would be achievable with either single sensor alone. When multi-sensor approaches are employed, however, incomplete data can be manifested. If each sensor is located on a separate platform ( e.g., aircraft), each sensor may interrogate---and hence collect data over---only partially overlapping areas of land. As a result, some data points may be characterized by data (i.e., features) from only a subset of the possible sensors employed in the task. Equivalently, this scenario implies that some data points will be missing features. Increasing focus in the future on using---and fusing data from---multiple sensors will make such incomplete-data problems commonplace. In many applications involving incomplete data, it is possible to acquire the missing data at a cost. In multi-sensor remote-sensing applications, data is acquired by deploying sensors to data points. Acquiring data is usually an expensive, time-consuming task, a fact that necessitates an intelligent data acquisition process. Incomplete data is not limited to remote-sensing applications, but rather, can arise in virtually any data set. In this dissertation, we address the general problem of classification when faced with incomplete data. We also address the
Rule-guided human classification of Volunteered Geographic Information

Science.gov (United States)

Ali, Ahmed Loai; Falomir, Zoe; Schmid, Falko; Freksa, Christian

2017-05-01

During the last decade, web technologies and location sensing devices have evolved generating a form of crowdsourcing known as Volunteered Geographic Information (VGI). VGI acted as a platform of spatial data collection, in particular, when a group of public participants are involved in collaborative mapping activities: they work together to collect, share, and use information about geographic features. VGI exploits participants' local knowledge to produce rich data sources. However, the resulting data inherits problematic data classification. In VGI projects, the challenges of data classification are due to the following: (i) data is likely prone to subjective classification, (ii) remote contributions and flexible contribution mechanisms in most projects, and (iii) the uncertainty of spatial data and non-strict definitions of geographic features. These factors lead to various forms of problematic classification: inconsistent, incomplete, and imprecise data classification. This research addresses classification appropriateness. Whether the classification of an entity is appropriate or inappropriate is related to quantitative and/or qualitative observations. Small differences between observations may be not recognizable particularly for non-expert participants. Hence, in this paper, the problem is tackled by developing a rule-guided classification approach. This approach exploits data mining techniques of Association Classification (AC) to extract descriptive (qualitative) rules of specific geographic features. The rules are extracted based on the investigation of qualitative topological relations between target features and their context. Afterwards, the extracted rules are used to develop a recommendation system able to guide participants to the most appropriate classification. The approach proposes two scenarios to guide participants towards enhancing the quality of data classification. An empirical study is conducted to investigate the classification of grass
Novel whole-cell Reporter Assay for Stress-Based Classification of Antibacterial Compounds Produced by Locally Isolated Bacillus spp.

OpenAIRE

Nithya, Vadakedath; Halami, Prakash M.

2012-01-01

Reporter bacteria are beneficial for the rapid and sensitive screening of cultures producing peptide antibiotics, which can be an addition or alternative to the established antibiotics. This study was carried out to validate the usability of specific reporter strains for the target mediated identification of antibiotics produced by native Bacillus spp. isolated from different food sources. During preliminary classification, cell wall stress causing Bacillus isolates were screened by using rep...
Lauren classification and individualized chemotherapy in gastric cancer.

Science.gov (United States)

Ma, Junli; Shen, Hong; Kapesa, Linda; Zeng, Shan

2016-05-01

Gastric cancer is one of the most common malignancies worldwide. During the last 50 years, the histological classification of gastric carcinoma has been largely based on Lauren's criteria, in which gastric cancer is classified into two major histological subtypes, namely intestinal type and diffuse type adenocarcinoma. This classification was introduced in 1965, and remains currently widely accepted and employed, since it constitutes a simple and robust classification approach. The two histological subtypes of gastric cancer proposed by the Lauren classification exhibit a number of distinct clinical and molecular characteristics, including histogenesis, cell differentiation, epidemiology, etiology, carcinogenesis, biological behaviors and prognosis. Gastric cancer exhibits varied sensitivity to chemotherapy drugs and significant heterogeneity; therefore, the disease may be a target for individualized therapy. The Lauren classification may provide the basis for individualized treatment for advanced gastric cancer, which is increasingly gaining attention in the scientific field. However, few studies have investigated individualized treatment that is guided by pathological classification. The aim of the current review is to analyze the two major histological subtypes of gastric cancer, as proposed by the Lauren classification, and to discuss the implications of this for personalized chemotherapy.
Artificial intelligence in label-free microscopy biological cell classification by time stretch

CERN Document Server

Mahjoubfar, Ata; Jalali, Bahram

2017-01-01

This book introduces time-stretch quantitative phase imaging (TS-QPI), a high-throughput label-free imaging flow cytometer developed for big data acquisition and analysis in phenotypic screening. TS-QPI is able to capture quantitative optical phase and intensity images simultaneously, enabling high-content cell analysis, cancer diagnostics, personalized genomics, and drug development. The authors also demonstrate a complete machine learning pipeline that performs optical phase measurement, image processing, feature extraction, and classification, enabling high-throughput quantitative imaging that achieves record high accuracy in label -free cellular phenotypic screening and opens up a new path to data-driven diagnosis. • Demonstrates how machine learning is used in high-speed microscopy imaging to facilitate medical diagnosis; • Provides a systematic and comprehensive illustration of time stretch technology; • Enables multidisciplinary application, including industrial, biomedical, and artificial intell...
Spectral Classification of Similar Materials using the Tetracorder Algorithm: The Calcite-Epidote-Chlorite Problem

Science.gov (United States)

Dalton, J. Brad; Bove, Dana; Mladinich, Carol; Clark, Roger; Rockwell, Barnaby; Swayze, Gregg; King, Trude; Church, Stanley

2001-01-01

Recent work on automated spectral classification algorithms has sought to distinguish ever-more similar materials. From modest beginnings separating shade, soil, rock and vegetation to ambitious attempts to discriminate mineral types and specific plant species, the trend seems to be toward using increasingly subtle spectral differences to perform the classification. Rule-based expert systems exploiting the underlying physics of spectroscopy such as the US Geological Society Tetracorder system are now taking advantage of the high spectral resolution and dimensionality of current imaging spectrometer designs to discriminate spectrally similar materials. The current paper details recent efforts to discriminate three minerals having absorptions centered at the same wavelength, with encouraging results.

Automatic music genres classification as a pattern recognition problem

Science.gov (United States)

Ul Haq, Ihtisham; Khan, Fauzia; Sharif, Sana; Shaukat, Arsalan

2013-12-01

Music genres are the simplest and effect descriptors for searching music libraries stores or catalogues. The paper compares the results of two automatic music genres classification systems implemented by using two different yet simple classifiers (K-Nearest Neighbor and Naïve Bayes). First a 10-12 second sample is selected and features are extracted from it, and then based on those features results of both classifiers are represented in the form of accuracy table and confusion matrix. An experiment carried out on test 60 taken from middle of a song represents the true essence of its genre as compared to the samples taken from beginning and ending of a song. The novel techniques have achieved an accuracy of 91% and 78% by using Naïve Bayes and KNN classifiers respectively.
Classifications, definitions and concepts of locality in Africa.

Science.gov (United States)

1983-01-01

Sub-Saharan Africa, one of the world's least urbanized regions, has within the last 30 years or so been experiencing a very high rate of urban population growth. The prevalence of small localities, invariably spread out over large land surfaces, complicates the classifications and definitions of localities as well as making the identification of settlement patterns in a mapping exercise and during a census or survey field operation difficult. The distinguishing features of a locality are: 1) a distinct (separate) population cluster, 2) inhabitants live in neighboring quarters, and 3) it has a name or a locally recognized status. Even in African countries, where the definition of locality as a distinct population cluster has been employed, problems have cropped up with respect to classifications and identifications. Another problem with the classification of localities is related to the rapid changes that occur in rural settlement patterns. Despite the rapid growth of urban localities within the past few years in sub-Saharan Africa, the proportion of the population living in urban localities is still low. Whereas a locality is a distinct population cluster, with definable boundaries, the village as employed in some countries does not have clear cut boundaries. The definition of a locality as a distinct population cluster in which the inhabitants live in neighboring living quarters and which has a name or a locally recognized status is highly recommended. The UN proposal for the adoption of a classification scheme by size of locality needs to be examined by African countries. The urban/rural dichotomy is recommended for the classification of some tabulations from censuses and surveys--especially on the total population and population of major and minor civil divisions.
Classification of time series patterns from complex dynamic systems

Energy Technology Data Exchange (ETDEWEB)

Schryver, J.C.; Rao, N.

1998-07-01

An increasing availability of high-performance computing and data storage media at decreasing cost is making possible the proliferation of large-scale numerical databases and data warehouses. Numeric warehousing enterprises on the order of hundreds of gigabytes to terabytes are a reality in many fields such as finance, retail sales, process systems monitoring, biomedical monitoring, surveillance and transportation. Large-scale databases are becoming more accessible to larger user communities through the internet, web-based applications and database connectivity. Consequently, most researchers now have access to a variety of massive datasets. This trend will probably only continue to grow over the next several years. Unfortunately, the availability of integrated tools to explore, analyze and understand the data warehoused in these archives is lagging far behind the ability to gain access to the same data. In particular, locating and identifying patterns of interest in numerical time series data is an increasingly important problem for which there are few available techniques. Temporal pattern recognition poses many interesting problems in classification, segmentation, prediction, diagnosis and anomaly detection. This research focuses on the problem of classification or characterization of numerical time series data. Highway vehicles and their drivers are examples of complex dynamic systems (CDS) which are being used by transportation agencies for field testing to generate large-scale time series datasets. Tools for effective analysis of numerical time series in databases generated by highway vehicle systems are not yet available, or have not been adapted to the target problem domain. However, analysis tools from similar domains may be adapted to the problem of classification of numerical time series data.
Overfitting Reduction of Text Classification Based on AdaBELM

Directory of Open Access Journals (Sweden)

Xiaoyue Feng

2017-07-01

Full Text Available Overfitting is an important problem in machine learning. Several algorithms, such as the extreme learning machine (ELM, suffer from this issue when facing high-dimensional sparse data, e.g., in text classification. One common issue is that the extent of overfitting is not well quantified. In this paper, we propose a quantitative measure of overfitting referred to as the rate of overfitting (RO and a novel model, named AdaBELM, to reduce the overfitting. With RO, the overfitting problem can be quantitatively measured and identified. The newly proposed model can achieve high performance on multi-class text classification. To evaluate the generalizability of the new model, we designed experiments based on three datasets, i.e., the 20 Newsgroups, Reuters-21578, and BioMed corpora, which represent balanced, unbalanced, and real application data, respectively. Experiment results demonstrate that AdaBELM can reduce overfitting and outperform classical ELM, decision tree, random forests, and AdaBoost on all three text-classification datasets; for example, it can achieve 62.2% higher accuracy than ELM. Therefore, the proposed model has a good generalizability.
[Current problems of deontology].

Science.gov (United States)

Dimov, A S

2010-01-01

The scope of knowledge in medical ethics continues to extend. Deontology as a science needs systematization of the accumulated data. This review may give impetus to classification of problems pertaining to this important area of medical activity.
The cranial cartilages of teleosts and their classification.

OpenAIRE

Benjamin, M

1990-01-01

The structure and distribution of cartilages has been studied in 45 species from 24 families. The resulting data have been used as a basis for establishing a new classification. A cartilage is regarded as 'cell-rich' if its cells or their lacunae occupy more than half of the tissue volume. Five classes of cell-rich cartilage are recognised (a) hyaline-cell cartilage (common in the lips of bottom-dwelling cyprinids) and its subtypes fibro/hyaline-cell cartilage, elastic/hyaline-cell cartilage ...
Machine learning algorithms for mode-of-action classification in toxicity assessment.

Science.gov (United States)

Zhang, Yile; Wong, Yau Shu; Deng, Jian; Anton, Cristina; Gabos, Stephan; Zhang, Weiping; Huang, Dorothy Yu; Jin, Can

2016-01-01

Real Time Cell Analysis (RTCA) technology is used to monitor cellular changes continuously over the entire exposure period. Combining with different testing concentrations, the profiles have potential in probing the mode of action (MOA) of the testing substances. In this paper, we present machine learning approaches for MOA assessment. Computational tools based on artificial neural network (ANN) and support vector machine (SVM) are developed to analyze the time-concentration response curves (TCRCs) of human cell lines responding to tested chemicals. The techniques are capable of learning data from given TCRCs with known MOA information and then making MOA classification for the unknown toxicity. A novel data processing step based on wavelet transform is introduced to extract important features from the original TCRC data. From the dose response curves, time interval leading to higher classification success rate can be selected as input to enhance the performance of the machine learning algorithm. This is particularly helpful when handling cases with limited and imbalanced data. The validation of the proposed method is demonstrated by the supervised learning algorithm applied to the exposure data of HepG2 cell line to 63 chemicals with 11 concentrations in each test case. Classification success rate in the range of 85 to 95 % are obtained using SVM for MOA classification with two clusters to cases up to four clusters. Wavelet transform is capable of capturing important features of TCRCs for MOA classification. The proposed SVM scheme incorporated with wavelet transform has a great potential for large scale MOA classification and high-through output chemical screening.
The study of electrochemical cell taught by problem-based learning

Science.gov (United States)

Srichaitung, Paisan

2018-01-01

According to the teaching activity of Chemistry, researcher found that students were not able to seek self knowledge even applied knowledge to their everyday life. Therefore, the researcher is interested in creating an activity to have students constructed their knowledge, science process skills, and can apply knowledge in their everyday life. The researcher presented form of teaching activity of electrochemical cell by using problem-based learning for Mathayom five students of Thai Christian School. The teaching activity focused on electron transfer in galvanic cell. In this activity, the researcher assigned students to design the electron transfer in galvanic cell using any solution that could light up the bulb. Then students were separated into a group of two, which were total seven groups. Each group of students searched the information about the electron transfer in galvanic cell from books, internet, or other sources of information. After students received concepts, or knowledge they searched for, Students designed and did the experiment. Finally, the students in each groups had twenty minutes to give a presentation in front of the classroom about the electron transfer in galvanic using any solution to light up the bulb with showing the experiment, and five minutes to answer their classmates' questions. Giving the presentation took four periods with total seven groups. After students finished their presentation, the researcher had students discussed and summarized the teaching activity's main idea of electron transfer in galvanic. Then, researcher observed students' behavior in each group found that 85.7 percentages of total students developed science process skills, and transferred their knowledge through presentation completely. When students done the post test, the researcher found that 92.85 percentages of total students were able to explain the concept of galvanic cell, described the preparation and the selection of experimental equipment. Furthermore
Multicenter validation of recursive partitioning analysis classification for patients with squamous cell head and neck carcinoma treated with surgery and postoperative radiotherapy.

NARCIS (Netherlands)

Jonkman, A.; Kaanders, J.H.A.M.; Terhaard, C.H.J.; Hoebers, F.J.; Ende, P.L. van den; Wijers, O.B.; Verhoef, C.G.; Jong, M. de; Leemans, C.R.; Langendijk, J.A.

2007-01-01

PURPOSE: To validate the recursive partitioning analysis (RPA) classification system for squamous cell head and neck cancer as recently reported by the VU University Medical Center. METHODS AND MATERIALS: In eight Dutch head and neck cancer centers, data necessary to classify patients according to
An Online Multisensor Data Fusion Framework for Radar Emitter Classification

Directory of Open Access Journals (Sweden)

Dongqing Zhou

2016-01-01

Full Text Available Radar emitter classification is a special application of data clustering for classifying unknown radar emitters in airborne electronic support system. In this paper, a novel online multisensor data fusion framework is proposed for radar emitter classification under the background of network centric warfare. The framework is composed of local processing and multisensor fusion processing, from which the rough and precise classification results are obtained, respectively. What is more, the proposed algorithm does not need prior knowledge and training process; it can dynamically update the number of the clusters and the cluster centers when new pulses arrive. At last, the experimental results show that the proposed framework is an efficacious way to solve radar emitter classification problem in networked warfare.
Modern problems of DNA repair in mammalian cells and some unsettled questions

International Nuclear Information System (INIS)

Gaziev, A.I.

1978-01-01

A comparison of DNA repair process in the cells of mammals and E. coli revealed no principal differences in the enzymic mechanisms of DNA repair in the cells of higher and lower organisms. It has been found that when given is the same number of impairments in the section of DNA chain in the cells of mammals and bacteria the regeneration in the former occurs more slowly than in the latter. Low rate elimination of impairments of DNA in the cells of mammals is due to a more complex intracellular and permolecular organization. It is stressed that the investigation into the mechanisms of fixing impairments in case of postreplication DNA repair is a very important and unresolved problem, especially in terms of radiation mutagenesis and cancerogenesis. Much thought is given to the problem of repairing double stranded ruptures of DNA. It is proposed that DNA repair should be considered not only in terms of functioning of enzymes in DNA metabolism, but also permolecular organization of genome in the cell
Nursing outcome "Severity of infection": conceptual definitions for indicators related to respiratory problems

Directory of Open Access Journals (Sweden)

Alba Luz Rodríguez-Acelas

Full Text Available Objective.Build conceptual definitions for some indicators of the nursing outcome Infection Severity in the Nursing Outcomes Classification (NOC related to respiratory problems, based on scientific evidence of signs and symptoms of infection in adults. Methods. Integrative literature review with search in the databases PubMed, CINAHL, LILACS and SCOPUS. Studies whose full texts were available, published in Spanish, Portuguese or English, using the descriptors infection severity, nursing outcomes classification NOC, respiratory infections and respiratory signs and symptoms. Results. Nine publications were analyzed that supported the elaboration of the conceptual definitions for eight indicators of the Nursing Outcome Infection Severity: purulent drainage, fever, chilling, unstable temperature, pain, colonization of drainage cultivation, white blood cell count elevation and white blood cell count drop. Conclusion. This study contributed to understand the terms used in the nursing outcome Infection Severity, in order to improve and facilitate the use of the NOC, as it enhances the conceptual clarity of the selected indicators with a view to producing better scientific evidence.
Adaptive phase k-means algorithm for waveform classification

Science.gov (United States)

Song, Chengyun; Liu, Zhining; Wang, Yaojun; Xu, Feng; Li, Xingming; Hu, Guangmin

2018-01-01

Waveform classification is a powerful technique for seismic facies analysis that describes the heterogeneity and compartments within a reservoir. Horizon interpretation is a critical step in waveform classification. However, the horizon often produces inconsistent waveform phase, and thus results in an unsatisfied classification. To alleviate this problem, an adaptive phase waveform classification method called the adaptive phase k-means is introduced in this paper. Our method improves the traditional k-means algorithm using an adaptive phase distance for waveform similarity measure. The proposed distance is a measure with variable phases as it moves from sample to sample along the traces. Model traces are also updated with the best phase interference in the iterative process. Therefore, our method is robust to phase variations caused by the interpretation horizon. We tested the effectiveness of our algorithm by applying it to synthetic and real data. The satisfactory results reveal that the proposed method tolerates certain waveform phase variation and is a good tool for seismic facies analysis.
A classification scheme for risk assessment methods.

Energy Technology Data Exchange (ETDEWEB)

Stamp, Jason Edwin; Campbell, Philip LaRoche

2004-08-01

This report presents a classification scheme for risk assessment methods. This scheme, like all classification schemes, provides meaning by imposing a structure that identifies relationships. Our scheme is based on two orthogonal aspects--level of detail, and approach. The resulting structure is shown in Table 1 and is explained in the body of the report. Each cell in the Table represent a different arrangement of strengths and weaknesses. Those arrangements shift gradually as one moves through the table, each cell optimal for a particular situation. The intention of this report is to enable informed use of the methods so that a method chosen is optimal for a situation given. This report imposes structure on the set of risk assessment methods in order to reveal their relationships and thus optimize their usage.We present a two-dimensional structure in the form of a matrix, using three abstraction levels for the rows and three approaches for the columns. For each of the nine cells in the matrix we identify the method type by name and example. The matrix helps the user understand: (1) what to expect from a given method, (2) how it relates to other methods, and (3) how best to use it. Each cell in the matrix represent a different arrangement of strengths and weaknesses. Those arrangements shift gradually as one moves through the table, each cell optimal for a particular situation. The intention of this report is to enable informed use of the methods so that a method chosen is optimal for a situation given. The matrix, with type names in the cells, is introduced in Table 2 on page 13 below. Unless otherwise stated we use the word 'method' in this report to refer to a 'risk assessment method', though often times we use the full phrase. The use of the terms 'risk assessment' and 'risk management' are close enough that we do not attempt to distinguish them in this report. The remainder of this report is organized as follows. In
IRIS COLOUR CLASSIFICATION SCALES – THEN AND NOW

Science.gov (United States)

Grigore, Mariana; Avram, Alina

2015-01-01

Eye colour is one of the most obvious phenotypic traits of an individual. Since the first documented classification scale developed in 1843, there have been numerous attempts to classify the iris colour. In the past centuries, iris colour classification scales has had various colour categories and mostly relied on comparison of an individual’s eye with painted glass eyes. Once photography techniques were refined, standard iris photographs replaced painted eyes, but this did not solve the problem of painted/ printed colour variability in time. Early clinical scales were easy to use, but lacked objectivity and were not standardised or statistically tested for reproducibility. The era of automated iris colour classification systems came with the technological development. Spectrophotometry, digital analysis of high-resolution iris images, hyper spectral analysis of the human real iris and the dedicated iris colour analysis software, all accomplished an objective, accurate iris colour classification, but are quite expensive and limited in use to research environment. Iris colour classification systems evolved continuously due to their use in a wide range of studies, especially in the fields of anthropology, epidemiology and genetics. Despite the wide range of the existing scales, up until present there has been no generally accepted iris colour classification scale. PMID:27373112
Effective automated feature construction and selection for classification of biological sequences.

Directory of Open Access Journals (Sweden)

Uday Kamath

Full Text Available Many open problems in bioinformatics involve elucidating underlying functional signals in biological sequences. DNA sequences, in particular, are characterized by rich architectures in which functional signals are increasingly found to combine local and distal interactions at the nucleotide level. Problems of interest include detection of regulatory regions, splice sites, exons, hypersensitive sites, and more. These problems naturally lend themselves to formulation as classification problems in machine learning. When classification is based on features extracted from the sequences under investigation, success is critically dependent on the chosen set of features.We present an algorithmic framework (EFFECT for automated detection of functional signals in biological sequences. We focus here on classification problems involving DNA sequences which state-of-the-art work in machine learning shows to be challenging and involve complex combinations of local and distal features. EFFECT uses a two-stage process to first construct a set of candidate sequence-based features and then select a most effective subset for the classification task at hand. Both stages make heavy use of evolutionary algorithms to efficiently guide the search towards informative features capable of discriminating between sequences that contain a particular functional signal and those that do not.To demonstrate its generality, EFFECT is applied to three separate problems of importance in DNA research: the recognition of hypersensitive sites, splice sites, and ALU sites. Comparisons with state-of-the-art algorithms show that the framework is both general and powerful. In addition, a detailed analysis of the constructed features shows that they contain valuable biological information about DNA architecture, allowing biologists and other researchers to directly inspect the features and potentially use the insights obtained to assist wet-laboratory studies on retainment or modification
Large margin image set representation and classification

KAUST Repository

Wang, Jim Jing-Yan; Alzahrani, Majed A.; Gao, Xin

2014-01-01

In this paper, we propose a novel image set representation and classification method by maximizing the margin of image sets. The margin of an image set is defined as the difference of the distance to its nearest image set from different classes and the distance to its nearest image set of the same class. By modeling the image sets by using both their image samples and their affine hull models, and maximizing the margins of the images sets, the image set representation parameter learning problem is formulated as an minimization problem, which is further optimized by an expectation - maximization (EM) strategy with accelerated proximal gradient (APG) optimization in an iterative algorithm. To classify a given test image set, we assign it to the class which could provide the largest margin. Experiments on two applications of video-sequence-based face recognition demonstrate that the proposed method significantly outperforms state-of-the-art image set classification methods in terms of both effectiveness and efficiency.
Large margin image set representation and classification

KAUST Repository

Wang, Jim Jing-Yan

2014-07-06

In this paper, we propose a novel image set representation and classification method by maximizing the margin of image sets. The margin of an image set is defined as the difference of the distance to its nearest image set from different classes and the distance to its nearest image set of the same class. By modeling the image sets by using both their image samples and their affine hull models, and maximizing the margins of the images sets, the image set representation parameter learning problem is formulated as an minimization problem, which is further optimized by an expectation - maximization (EM) strategy with accelerated proximal gradient (APG) optimization in an iterative algorithm. To classify a given test image set, we assign it to the class which could provide the largest margin. Experiments on two applications of video-sequence-based face recognition demonstrate that the proposed method significantly outperforms state-of-the-art image set classification methods in terms of both effectiveness and efficiency.
E-LEARNING TOOLS: STRUCTURE, CONTENT, CLASSIFICATION

Directory of Open Access Journals (Sweden)

Yuliya H. Loboda

2012-05-01

Full Text Available The article analyses the problems of organization of educational process with use of electronic means of education. Specifies the definition of "electronic learning", their structure and content. Didactic principles are considered, which are the basis of their creation and use. Given the detailed characteristics of e-learning tools for methodological purposes. On the basis of the allocated pedagogical problems of the use of electronic means of education presented and complemented by their classification, namely the means of theoretical and technological training, means of practical training, support tools, and comprehensive facilities.
Borel reductibility and classification of von neumann algebras

DEFF Research Database (Denmark)

Sasyk, R.; Törnquist, Asger Dag

2009-01-01

We announce some new results regarding the classification problem for separable von Neumann algebras. Our results are obtained by applying the notion of Borel reducibility and Hjorth's theory of turbulence to the isomorphism relation for separable von Neumann algebras....

A simple phenotypic classification for celiac disease

Directory of Open Access Journals (Sweden)

Ajit Sood

2018-04-01

Full Text Available Background/Aims : Celiac disease is a global health problem. The presentation of celiac disease has unfolded over years and it is now known that it can manifest at different ages, has varied presentations, and is prone to develop complications, if not managed properly. Although the Oslo definitions provide consensus on the various terminologies used in literature, there is no phenotypic classification providing a composite diagnosis for the disease. Methods : Various variables identified for phenotypic classification included age at diagnosis, age at onset of symptoms, clinical presentation, family history and complications. These were applied to the existing registry of 1,664 patients at Dayanand Medical College and Hospital, Ludhiana, India. In addition, age was evaluated as below 15 and below 18 years. Cross tabulations were used for the verification of the classification using the existing data. Expert opinion was sought from both international and national experts of varying fields. Results : After empirical verification, age at diagnosis was considered appropriate in between A1 (<18 and A2 (≧18. The disease presentation has been classified into 3 types–P1 (classical, P2 (non-classical and P3 (asymptomatic. Complications were considered as absent (C0 or present (C1. A single phenotypic classification based on these 3 characteristics, namely age at the diagnosis, clinical presentation, and intestinal complications (APC classification was derived. Conclusions : APC classification (age at diagnosis, presentation, complications is a simple disease explanatory classification for patients with celiac disease aimed at providing a composite diagnosis.
Automatic Classification Using Supervised Learning in a Medical Document Filtering Application.

Science.gov (United States)

Mostafa, J.; Lam, W.

2000-01-01

Presents a multilevel model of the information filtering process that permits document classification. Evaluates a document classification approach based on a supervised learning algorithm, measures the accuracy of the algorithm in a neural network that was trained to classify medical documents on cell biology, and discusses filtering…
A Novel Texture Classification Procedure by using Association Rules

Directory of Open Access Journals (Sweden)

L. Jaba Sheela

2008-11-01

Full Text Available Texture can be defined as a local statistical pattern of texture primitives in observer’s domain of interest. Texture classification aims to assign texture labels to unknown textures, according to training samples and classification rules. Association rules have been used in various applications during the past decades. Association rules capture both structural and statistical information, and automatically identify the structures that occur most frequently and relationships that have significant discriminative power. So, association rules can be adapted to capture frequently occurring local structures in textures. This paper describes the usage of association rules for texture classification problem. The performed experimental studies show the effectiveness of the association rules. The overall success rate is about 98%.
Evolutionary fuzzy ARTMAP neural networks for classification of semiconductor defects.

Science.gov (United States)

Tan, Shing Chiang; Watada, Junzo; Ibrahim, Zuwairie; Khalid, Marzuki

2015-05-01

Wafer defect detection using an intelligent system is an approach of quality improvement in semiconductor manufacturing that aims to enhance its process stability, increase production capacity, and improve yields. Occasionally, only few records that indicate defective units are available and they are classified as a minority group in a large database. Such a situation leads to an imbalanced data set problem, wherein it engenders a great challenge to deal with by applying machine-learning techniques for obtaining effective solution. In addition, the database may comprise overlapping samples of different classes. This paper introduces two models of evolutionary fuzzy ARTMAP (FAM) neural networks to deal with the imbalanced data set problems in a semiconductor manufacturing operations. In particular, both the FAM models and hybrid genetic algorithms are integrated in the proposed evolutionary artificial neural networks (EANNs) to classify an imbalanced data set. In addition, one of the proposed EANNs incorporates a facility to learn overlapping samples of different classes from the imbalanced data environment. The classification results of the proposed evolutionary FAM neural networks are presented, compared, and analyzed using several classification metrics. The outcomes positively indicate the effectiveness of the proposed networks in handling classification problems with imbalanced data sets.
Binary Stochastic Representations for Large Multi-class Classification

KAUST Repository

Gerald, Thomas; Baskiotis, Nicolas; Denoyer, Ludovic

2017-01-01

Classification with a large number of classes is a key problem in machine learning and corresponds to many real-world applications like tagging of images or textual documents in social networks. If one-vs-all methods usually reach top performance
Modeling PSA Problems - II: A Cell-to-Cell Transport Theory Approach

International Nuclear Information System (INIS)

Labeau, P.E.; Izquierdo, J.M.

2005-01-01

In the first paper of this series, we presented an extension of the classical theory of dynamic reliability in which the actual occurrence of an event causing a change in the system dynamics is possibly delayed. The concept of stimulus activation, which triggers the realization of an event after a distributed time delay, was introduced. This gives a new understanding of competing events in the sequence delineation process.In the context of the level-2 probabilistic safety analysis (PSA), the information on stimulus activation mainly consists of regions of the process variables space where the activation can occur with a given probability. The evolution equations of the extended theory of probabilistic dynamics are therefore particularized to a transport process between discrete cells defined in phase-space on this basis. Doing so, an integrated and coherent approach to level-2 PSA problems is propounded. This amounts to including the stimulus concept and the associated stochastic delays discussed in the first paper in the frame of a cell-to-cell transport process.In addition, this discrete model provides a theoretical basis for the definition of appropriate numerical schemes for integrated level-2 PSA applications
Image classification independent of orientation and scale

Science.gov (United States)

Arsenault, Henri H.; Parent, Sebastien; Moisan, Sylvain

1998-04-01

The recognition of targets independently of orientation has become fairly well developed in recent years for in-plane rotation. The out-of-plane rotation problem is much less advanced. When both out-of-plane rotations and changes of scale are present, the problem becomes very difficult. In this paper we describe our research on the combined out-of- plane rotation problem and the scale invariance problem. The rotations were limited to rotations about an axis perpendicular to the line of sight. The objects to be classified were three kinds of military vehicles. The inputs used were infrared imagery and photographs. We used a variation of a method proposed by Neiberg and Casasent, where a neural network is trained with a subset of the database and a minimum distances from lines in feature space are used for classification instead of nearest neighbors. Each line in the feature space corresponds to one class of objects, and points on one line correspond to different orientations of the same target. We found that the training samples needed to be closer for some orientations than for others, and that the most difficult orientations are where the target is head-on to the observer. By means of some additional training of the neural network, we were able to achieve 100% correct classification for 360 degree rotation and a range of scales over a factor of five.
Electroencephalography Signal Grouping and Feature Classification Using Harmony Search for BCI

Directory of Open Access Journals (Sweden)

Tae-Ju Lee

2013-01-01

Full Text Available This paper presents a heuristic method for electroencephalography (EEG grouping and feature classification using harmony search (HS for improving the accuracy of the brain-computer interface (BCI system. EEG, a noninvasive BCI method, uses many electrodes on the scalp, and a large number of electrodes make the resulting analysis difficult. In addition, traditional EEG analysis cannot handle multiple stimuli. On the other hand, the classification method using the EEG signal has a low accuracy. To solve these problems, we use a heuristic approach to reduce the complexities in multichannel problems and classification. In this study, we build a group of stimuli using the HS algorithm. Then, the features from common spatial patterns are classified by the HS classifier. To confirm the proposed method, we perform experiments using 64-channel EEG equipment. The subjects are subjected to three kinds of stimuli: audio, visual, and motion. Each stimulus is applied alone or in combination with the others. The acquired signals are processed by the proposed method. The classification results in an accuracy of approximately 63%. We conclude that the heuristic approach using the HS algorithm on the BCI is beneficial for EEG signal analysis.
Classification of urine sediment based on convolution neural network

Science.gov (United States)

Pan, Jingjing; Jiang, Cunbo; Zhu, Tiantian

2018-04-01

By designing a new convolution neural network framework, this paper breaks the constraints of the original convolution neural network framework requiring large training samples and samples of the same size. Move and cropping the input images, generate the same size of the sub-graph. And then, the generated sub-graph uses the method of dropout, increasing the diversity of samples and preventing the fitting generation. Randomly select some proper subset in the sub-graphic set and ensure that the number of elements in the proper subset is same and the proper subset is not the same. The proper subsets are used as input layers for the convolution neural network. Through the convolution layer, the pooling, the full connection layer and output layer, we can obtained the classification loss rate of test set and training set. In the red blood cells, white blood cells, calcium oxalate crystallization classification experiment, the classification accuracy rate of 97% or more.
Development of an intelligent ultrasonic welding defect classification software

International Nuclear Information System (INIS)

Song, Sung Jin; Kim, Hak Joon; Jeong, Hee Don

1997-01-01

Ultrasonic pattern recognition is the most effective approach to the problem of discriminating types of flaws in weldments based on ultrasonic flaw signals. In spite of significant progress in the research on this methodology, it has not been widely used in many practical ultrasonic inspections of weldments in industry. Hence, for the convenient application of this approach in many practical situations, we develop an intelligent ultrasonic signature classification software which can discriminate types of flaws in weldments based on their ultrasonic signals using various tools in artificial intelligence such as neural networks. This software shows the excellent performance in an experimental problem where flaws in weldments are classified into two categories of cracks and non-cracks. This performance demonstrates the high possibility of this software as a practical tool for ultrasonic flaw classification in weldments.
Image Classification Using Biomimetic Pattern Recognition with Convolutional Neural Networks Features

Science.gov (United States)

Huo, Guanying

2017-01-01

As a typical deep-learning model, Convolutional Neural Networks (CNNs) can be exploited to automatically extract features from images using the hierarchical structure inspired by mammalian visual system. For image classification tasks, traditional CNN models employ the softmax function for classification. However, owing to the limited capacity of the softmax function, there are some shortcomings of traditional CNN models in image classification. To deal with this problem, a new method combining Biomimetic Pattern Recognition (BPR) with CNNs is proposed for image classification. BPR performs class recognition by a union of geometrical cover sets in a high-dimensional feature space and therefore can overcome some disadvantages of traditional pattern recognition. The proposed method is evaluated on three famous image classification benchmarks, that is, MNIST, AR, and CIFAR-10. The classification accuracies of the proposed method for the three datasets are 99.01%, 98.40%, and 87.11%, respectively, which are much higher in comparison with the other four methods in most cases. PMID:28316614
LDA boost classification: boosting by topics

Science.gov (United States)

Lei, La; Qiao, Guo; Qimin, Cao; Qitao, Li

2012-12-01

AdaBoost is an efficacious classification algorithm especially in text categorization (TC) tasks. The methodology of setting up a classifier committee and voting on the documents for classification can achieve high categorization precision. However, traditional Vector Space Model can easily lead to the curse of dimensionality and feature sparsity problems; so it affects classification performance seriously. This article proposed a novel classification algorithm called LDABoost based on boosting ideology which uses Latent Dirichlet Allocation (LDA) to modeling the feature space. Instead of using words or phrase, LDABoost use latent topics as the features. In this way, the feature dimension is significantly reduced. Improved Naïve Bayes (NB) is designed as the weaker classifier which keeps the efficiency advantage of classic NB algorithm and has higher precision. Moreover, a two-stage iterative weighted method called Cute Integration in this article is proposed for improving the accuracy by integrating weak classifiers into strong classifier in a more rational way. Mutual Information is used as metrics of weights allocation. The voting information and the categorization decision made by basis classifiers are fully utilized for generating the strong classifier. Experimental results reveals LDABoost making categorization in a low-dimensional space, it has higher accuracy than traditional AdaBoost algorithms and many other classic classification algorithms. Moreover, its runtime consumption is lower than different versions of AdaBoost, TC algorithms based on support vector machine and Neural Networks.
Genome-Wide Comparative Gene Family Classification

Science.gov (United States)

Frech, Christian; Chen, Nansheng

2010-01-01

Correct classification of genes into gene families is important for understanding gene function and evolution. Although gene families of many species have been resolved both computationally and experimentally with high accuracy, gene family classification in most newly sequenced genomes has not been done with the same high standard. This project has been designed to develop a strategy to effectively and accurately classify gene families across genomes. We first examine and compare the performance of computer programs developed for automated gene family classification. We demonstrate that some programs, including the hierarchical average-linkage clustering algorithm MC-UPGMA and the popular Markov clustering algorithm TRIBE-MCL, can reconstruct manual curation of gene families accurately. However, their performance is highly sensitive to parameter setting, i.e. different gene families require different program parameters for correct resolution. To circumvent the problem of parameterization, we have developed a comparative strategy for gene family classification. This strategy takes advantage of existing curated gene families of reference species to find suitable parameters for classifying genes in related genomes. To demonstrate the effectiveness of this novel strategy, we use TRIBE-MCL to classify chemosensory and ABC transporter gene families in C. elegans and its four sister species. We conclude that fully automated programs can establish biologically accurate gene families if parameterized accordingly. Comparative gene family classification finds optimal parameters automatically, thus allowing rapid insights into gene families of newly sequenced species. PMID:20976221
Fast Solution in Sparse LDA for Binary Classification

Science.gov (United States)

Moghaddam, Baback

2010-01-01

An algorithm that performs sparse linear discriminant analysis (Sparse-LDA) finds near-optimal solutions in far less time than the prior art when specialized to binary classification (of 2 classes). Sparse-LDA is a type of feature- or variable- selection problem with numerous applications in statistics, machine learning, computer vision, computational finance, operations research, and bio-informatics. Because of its combinatorial nature, feature- or variable-selection problems are NP-hard or computationally intractable in cases involving more than 30 variables or features. Therefore, one typically seeks approximate solutions by means of greedy search algorithms. The prior Sparse-LDA algorithm was a greedy algorithm that considered the best variable or feature to add/ delete to/ from its subsets in order to maximally discriminate between multiple classes of data. The present algorithm is designed for the special but prevalent case of 2-class or binary classification (e.g. 1 vs. 0, functioning vs. malfunctioning, or change versus no change). The present algorithm provides near-optimal solutions on large real-world datasets having hundreds or even thousands of variables or features (e.g. selecting the fewest wavelength bands in a hyperspectral sensor to do terrain classification) and does so in typical computation times of minutes as compared to days or weeks as taken by the prior art. Sparse LDA requires solving generalized eigenvalue problems for a large number of variable subsets (represented by the submatrices of the input within-class and between-class covariance matrices). In the general (fullrank) case, the amount of computation scales at least cubically with the number of variables and thus the size of the problems that can be solved is limited accordingly. However, in binary classification, the principal eigenvalues can be found using a special analytic formula, without resorting to costly iterative techniques. The present algorithm exploits this analytic
Texture classification using autoregressive filtering

Science.gov (United States)

Lawton, W. M.; Lee, M.

1984-01-01

A general theory of image texture models is proposed and its applicability to the problem of scene segmentation using texture classification is discussed. An algorithm, based on half-plane autoregressive filtering, which optimally utilizes second order statistics to discriminate between texture classes represented by arbitrary wide sense stationary random fields is described. Empirical results of applying this algorithm to natural and sysnthesized scenes are presented and future research is outlined.
A new formulation for the problem of fuel cell homogenization

International Nuclear Information System (INIS)

Chao, Y.-A.; Martinez, A.S.

1982-01-01

A new homogenization method for reactor cells is described. This new method consists in eliminating the NR approximation for the fuel resonance and the Wigner approximation for the resonance escape probability; the background cross section is then redefined and the problem studied is reanalyzed. (E.G.) [pt
Automatic Segmentation of Dermoscopic Images by Iterative Classification

Directory of Open Access Journals (Sweden)

Maciel Zortea

2011-01-01

Full Text Available Accurate detection of the borders of skin lesions is a vital first step for computer aided diagnostic systems. This paper presents a novel automatic approach to segmentation of skin lesions that is particularly suitable for analysis of dermoscopic images. Assumptions about the image acquisition, in particular, the approximate location and color, are used to derive an automatic rule to select small seed regions, likely to correspond to samples of skin and the lesion of interest. The seed regions are used as initial training samples, and the lesion segmentation problem is treated as binary classification problem. An iterative hybrid classification strategy, based on a weighted combination of estimated posteriors of a linear and quadratic classifier, is used to update both the automatically selected training samples and the segmentation, increasing reliability and final accuracy, especially for those challenging images, where the contrast between the background skin and lesion is low.
Cyclic flow shop scheduling problem with two-machine cells

Directory of Open Access Journals (Sweden)

Bożejko Wojciech

2017-06-01

Full Text Available In the paper a variant of cyclic production with setups and two-machine cell is considered. One of the stages of the problem solving consists of assigning each operation to the machine on which it will be carried out. The total number of such assignments is exponential. We propose a polynomial time algorithm finding the optimal operations to machines assignment.
Deep learning application: rubbish classification with aid of an android device

Science.gov (United States)

Liu, Sijiang; Jiang, Bo; Zhan, Jie

2017-06-01

Deep learning is a very hot topic currently in pattern recognition and artificial intelligence researches. Aiming at the practical problem that people usually don't know correct classifications some rubbish should belong to, based on the powerful image classification ability of the deep learning method, we have designed a prototype system to help users to classify kinds of rubbish. Firstly the CaffeNet Model was adopted for our classification network training on the ImageNet dataset, and the trained network was deployed on a web server. Secondly an android app was developed for users to capture images of unclassified rubbish, upload images to the web server for analyzing backstage and retrieve the feedback, so that users can obtain the classification guide by an android device conveniently. Tests on our prototype system of rubbish classification show that: an image of one single type of rubbish with origin shape can be better used to judge its classification, while an image containing kinds of rubbish or rubbish with changed shape may fail to help users to decide rubbish's classification. However, the system still shows promising auxiliary function for rubbish classification if the network training strategy can be optimized further.
UAS Detection Classification and Neutralization: Market Survey 2015

Energy Technology Data Exchange (ETDEWEB)

Birch, Gabriel Carisle [Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States); Griffin, John Clark [Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States); Erdman, Matthew Kelly [Sandia National Laboratories (SNL-NM), Albuquerque, NM (United States)

2015-07-01

The purpose of this document is to briefly frame the challenges of detecting low, slow, and small (LSS) unmanned aerial systems (UAS). The conclusion drawn from internal discussions and external reports is the following; detection of LSS UAS is a challenging problem that can- not be achieved with a single detection modality for all potential targets. Classification of LSS UAS, especially classification in the presence of background clutter (e.g., urban environment) or other non-threating targets (e.g., birds), is under-explored. Though information of avail- able technologies is sparse, many of the existing options for UAS detection appear to be in their infancy (when compared to more established ground-based air defense systems for larger and/or faster threats). Companies currently providing or developing technologies to combat the UAS safety and security problem are certainly worth investigating, however, no company has provided the statistical evidence necessary to support robust detection, identification, and/or neutralization of LSS UAS targets. The results of a market survey are included that highlights potential commercial entities that could contribute some technology that assists in the detection, classification, and neutral- ization of a LSS UAS. This survey found no clear and obvious commercial solution, though recommendations are given for further investigation of several potential systems.

Estimating Rates of Psychosocial Problems in Urban and Poor Children with Sickle Cell Anemia.

Science.gov (United States)

Barbarin, Oscar A.; And Others

1994-01-01

Examined adjustment problems for children and adolescents with sickle cell anemia (SCA). Parents provided information on social, emotional, academic, and family adjustment of 327 children with SCA. Over 25% of children had emotional adjustment problems in form of internalizing symptoms (anxiety and depression); at least 20% had problems related to…
The Solving of Problems in Chemistry: the more open-ended problems

Science.gov (United States)

Reid, Norman; Yang, Mei-Jung

2002-01-01

Most problem solving in chemistry tends to be algorithmic in nature, while problems in life tend to be very open ended. This paper offers a simple classification of problems and seeks to explore the many factors which may be important in the successful solving of problems. It considers the place of procedures and algorithms. It analyses the role of long-term memory, not only in terms of what is known, but how that knowledge was acquired. It notes the great importance of the limitations of working memory space and the importance of confidence which comes from experience. Finally, various psychological factors are discussed. This paper argues that solving open-ended problems is extremely important in education and that offering learners experience of this in a group work context is a helpful way forward.
Classification of heterogeneous electron microscopic projections into homogeneous subsets

International Nuclear Information System (INIS)

Herman, G.T.; Kalinowski, M.

2008-01-01

The co-existence of different states of a macromolecular complex in samples used by three-dimensional electron microscopy (3D-EM) constitutes a serious challenge. The single particle method applied directly to such heterogeneous sets is unable to provide useful information about the encountered conformational diversity and produces reconstructions with severely reduced resolution. One approach to solving this problem is to partition heterogeneous projection set into homogeneous components and apply existing reconstruction techniques to each of them. Due to the nature of the projection images and the high noise level present in them, this classification task is difficult. A method is presented to achieve the desired classification by using a novel image similarity measure and solving the corresponding optimization problem. Unlike the majority of competing approaches, the presented method employs unsupervised classification (it does not require any prior knowledge about the objects being classified) and does not involve a 3D reconstruction procedure. We demonstrate a fast implementation of this method, capable of classifying projection sets that originate from 3D-EM. The method's performance is evaluated on synthetically generated data sets produced by projecting 3D objects that resemble biological structures
On the relevance of spectral features for instrument classification

DEFF Research Database (Denmark)

Nielsen, Andreas Brinch; Sigurdsson, Sigurdur; Hansen, Lars Kai

2007-01-01

Automatic knowledge extraction from music signals is a key component for most music organization and music information retrieval systems. In this paper, we consider the problem of instrument modelling and instrument classification from the rough audio data. Existing systems for automatic instrument...... classification operate normally on a relatively large number of features, from which those related to the spectrum of the audio signal are particularly relevant. In this paper, we confront two different models about the spectral characterization of musical instruments. The first assumes a constant envelope...
Classification of Motor Imagery EEG Signals with Support Vector Machines and Particle Swarm Optimization

Science.gov (United States)

Ma, Yuliang; Ding, Xiaohui; She, Qingshan; Luo, Zhizeng; Potter, Thomas; Zhang, Yingchun

2016-01-01

Support vector machines are powerful tools used to solve the small sample and nonlinear classification problems, but their ultimate classification performance depends heavily upon the selection of appropriate kernel and penalty parameters. In this study, we propose using a particle swarm optimization algorithm to optimize the selection of both the kernel and penalty parameters in order to improve the classification performance of support vector machines. The performance of the optimized classifier was evaluated with motor imagery EEG signals in terms of both classification and prediction. Results show that the optimized classifier can significantly improve the classification accuracy of motor imagery EEG signals. PMID:27313656
A New Classification Approach Based on Multiple Classification Rules

OpenAIRE

Zhongmei Zhou

2014-01-01

A good classifier can correctly predict new data for which the class label is unknown, so it is important to construct a high accuracy classifier. Hence, classification techniques are much useful in ubiquitous computing. Associative classification achieves higher classification accuracy than some traditional rule-based classification approaches. However, the approach also has two major deficiencies. First, it generates a very large number of association classification rules, especially when t...
The 2017 World Health Organization classification of tumors of the pituitary gland: a summary.

Science.gov (United States)

Lopes, M Beatriz S

2017-10-01

The 4th edition of the World Health Organization (WHO) classification of endocrine tumors has been recently released. In this new edition, major changes are recommended in several areas of the classification of tumors of the anterior pituitary gland (adenophypophysis). The scope of the present manuscript is to summarize these recommended changes, emphasizing a few significant topics. These changes include the following: (1) a novel approach for classifying pituitary neuroendocrine tumors according to pituitary adenohypophyseal cell lineages; (2) changes to the histological grading of pituitary neuroendocrine tumors with the elimination of the term "atypical adenoma;" and (3) introduction of new entities like the pituitary blastoma and re-definition of old entities like the null-cell adenoma. This new classification is very practical and mostly based on immunohistochemistry for pituitary hormones, pituitary-specific transcription factors, and other immunohistochemical markers commonly used in pathology practice, not requiring routine ultrastructural analysis of the tumors. Evaluation of tumor proliferation potential, by mitotic count and Ki-67 labeling index, and tumor invasion is strongly recommended on individual case basis to identify clinically aggressive adenomas. In addition, the classification offers the treating clinical team information on tumor prognosis by identifying specific variants of adenomas associated with an elevated risk for recurrence. Changes in the classification of non-neuroendocrine tumors are also proposed, in particular those tumors arising in the posterior pituitary including pituicytoma, granular cell tumor of the posterior pituitary, and spindle cell oncocytoma. These changes endorse those previously published in the 2016 WHO classification of CNS tumors. Other tumors arising in the sellar region are also reviewed in detail including craniopharyngiomas, mesenchymal and stromal tumors, germ cell tumors, and hematopoietic tumors. It is
78 FR 68983 - Cotton Futures Classification: Optional Classification Procedure

Science.gov (United States)

2013-11-18

...-AD33 Cotton Futures Classification: Optional Classification Procedure AGENCY: Agricultural Marketing... regulations to allow for the addition of an optional cotton futures classification procedure--identified and... response to requests from the U.S. cotton industry and ICE, AMS will offer a futures classification option...
Unsupervised classification of variable stars

Science.gov (United States)

Valenzuela, Lucas; Pichara, Karim

2018-03-01

During the past 10 years, a considerable amount of effort has been made to develop algorithms for automatic classification of variable stars. That has been primarily achieved by applying machine learning methods to photometric data sets where objects are represented as light curves. Classifiers require training sets to learn the underlying patterns that allow the separation among classes. Unfortunately, building training sets is an expensive process that demands a lot of human efforts. Every time data come from new surveys; the only available training instances are the ones that have a cross-match with previously labelled objects, consequently generating insufficient training sets compared with the large amounts of unlabelled sources. In this work, we present an algorithm that performs unsupervised classification of variable stars, relying only on the similarity among light curves. We tackle the unsupervised classification problem by proposing an untraditional approach. Instead of trying to match classes of stars with clusters found by a clustering algorithm, we propose a query-based method where astronomers can find groups of variable stars ranked by similarity. We also develop a fast similarity function specific for light curves, based on a novel data structure that allows scaling the search over the entire data set of unlabelled objects. Experiments show that our unsupervised model achieves high accuracy in the classification of different types of variable stars and that the proposed algorithm scales up to massive amounts of light curves.
Vlsi implementation of flexible architecture for decision tree classification in data mining

Science.gov (United States)

Sharma, K. Venkatesh; Shewandagn, Behailu; Bhukya, Shankar Nayak

2017-07-01

The Data mining algorithms have become vital to researchers in science, engineering, medicine, business, search and security domains. In recent years, there has been a terrific raise in the size of the data being collected and analyzed. Classification is the main difficulty faced in data mining. In a number of the solutions developed for this problem, most accepted one is Decision Tree Classification (DTC) that gives high precision while handling very large amount of data. This paper presents VLSI implementation of flexible architecture for Decision Tree classification in data mining using c4.5 algorithm.
Classification using Hierarchical Naive Bayes models

DEFF Research Database (Denmark)

Langseth, Helge; Dyhre Nielsen, Thomas

2006-01-01

Classification problems have a long history in the machine learning literature. One of the simplest, and yet most consistently well-performing set of classifiers is the Naïve Bayes models. However, an inherent problem with these classifiers is the assumption that all attributes used to describe......, termed Hierarchical Naïve Bayes models. Hierarchical Naïve Bayes models extend the modeling flexibility of Naïve Bayes models by introducing latent variables to relax some of the independence statements in these models. We propose a simple algorithm for learning Hierarchical Naïve Bayes models...
Customer and performance rating in QFD using SVM classification

Science.gov (United States)

Dzulkifli, Syarizul Amri; Salleh, Mohd Najib Mohd; Leman, A. M.

2017-09-01

In a classification problem, where each input is associated to one output. Training data is used to create a model which predicts values to the true function. SVM is a popular method for binary classification due to their theoretical foundation and good generalization performance. However, when trained with noisy data, the decision hyperplane might deviate from optimal position because of the sum of misclassification errors in the objective function. In this paper, we introduce fuzzy in weighted learning approach for improving the accuracy of Support Vector Machine (SVM) classification. The main aim of this work is to determine appropriate weighted for SVM to adjust the parameters of learning method from a given set of noisy input to output data. The performance and customer rating in Quality Function Deployment (QFD) is used as our case study to determine implementing fuzzy SVM is highly scalable for very large data sets and generating high classification accuracy.
Integrating Human and Machine Intelligence in Galaxy Morphology Classification Tasks

Science.gov (United States)

Beck, Melanie Renee

The large flood of data flowing from observatories presents significant challenges to astronomy and cosmology--challenges that will only be magnified by projects currently under development. Growth in both volume and velocity of astrophysics data is accelerating: whereas the Sloan Digital Sky Survey (SDSS) has produced 60 terabytes of data in the last decade, the upcoming Large Synoptic Survey Telescope (LSST) plans to register 30 terabytes per night starting in the year 2020. Additionally, the Euclid Mission will acquire imaging for 5 x 107 resolvable galaxies. The field of galaxy evolution faces a particularly challenging future as complete understanding often cannot be reached without analysis of detailed morphological galaxy features. Historically, morphological analysis has relied on visual classification by astronomers, accessing the human brains capacity for advanced pattern recognition. However, this accurate but inefficient method falters when confronted with many thousands (or millions) of images. In the SDSS era, efforts to automate morphological classifications of galaxies (e.g., Conselice et al., 2000; Lotz et al., 2004) are reasonably successful and can distinguish between elliptical and disk-dominated galaxies with accuracies of 80%. While this is statistically very useful, a key problem with these methods is that they often cannot say which 80% of their samples are accurate. Furthermore, when confronted with the more complex task of identifying key substructure within galaxies, automated classification algorithms begin to fail. The Galaxy Zoo project uses a highly innovative approach to solving the scalability problem of visual classification. Displaying images of SDSS galaxies to volunteers via a simple and engaging web interface, www.galaxyzoo.org asks people to classify images by eye. Within the first year hundreds of thousands of members of the general public had classified each of the 1 million SDSS galaxies an average of 40 times. Galaxy Zoo
Problem of the Classification of Quantitative Noun in the German Language

Directory of Open Access Journals (Sweden)

Elvira L. Shubina

2015-01-01

Full Text Available This work is dedicated to the semantic classification of quantitative noun on the basis of a structural study (Nquant + (Adj +N (ein Glas frisches Wasser, since this model reveals the greatest variety of grammatical formulation. These word combinations can form by the genitive government eine Tasse starken Kaffees by the grammatical agreement ein Eimer kaltes Wasser, or by the adjunction mit einem Korb reife Apfel. The suggested classification of the noun performing the function of the first components is based on the form of the noun acting as the first component. Types of the first components fall into three groups: 1. The nouns, which specify quantitative characteristics of objects and substances. Two subgroups are also distingshed: word combinations with a noun in a singular form Nquant1a as the second component and word combinations with a noun in a plural form as the second component Nquant1b; 2. The nouns defining a group of living beings and objects Nquant2; 3. The nouns which formation is grounded on quantitative nouns Nquant3. Normative recommendations on the choice of subordinate connection type should be connected at least at the present stage of existence of German literary language, exactly with the semantics of the nouns which are the first components in these word combinations. The article illustrates that all types of constructions (organizes whether on the basis of government, agreement and or adjunction are connected with the completely specific semantic characteristics of the name, i.e., these nouns belong to one of three groups of noun - first components.
Seismic Target Classification Using a Wavelet Packet Manifold in Unattended Ground Sensors Systems

Directory of Open Access Journals (Sweden)

Enliang Song

2013-07-01

Full Text Available One of the most challenging problems in target classification is the extraction of a robust feature, which can effectively represent a specific type of targets. The use of seismic signals in unattended ground sensor (UGS systems makes this problem more complicated, because the seismic target signal is non-stationary, geology-dependent and with high-dimensional feature space. This paper proposes a new feature extraction algorithm, called wavelet packet manifold (WPM, by addressing the neighborhood preserving embedding (NPE algorithm of manifold learning on the wavelet packet node energy (WPNE of seismic signals. By combining non-stationary information and low-dimensional manifold information, WPM provides a more robust representation for seismic target classification. By using a K nearest neighbors classifier on the WPM signature, the algorithm of wavelet packet manifold classification (WPMC is proposed. Experimental results show that the proposed WPMC can not only reduce feature dimensionality, but also improve the classification accuracy up to 95.03%. Moreover, compared with state-of-the-art methods, WPMC is more suitable for UGS in terms of recognition ratio and computational complexity.
Improving Music Genre Classification by Short-Time Feature Integration

DEFF Research Database (Denmark)

Meng, Anders; Ahrendt, Peter; Larsen, Jan

2005-01-01

Many different short-time features, using time windows in the size of 10-30 ms, have been proposed for music segmentation, retrieval and genre classification. However, often the available time frame of the music to make the actual decision or comparison (the decision time horizon) is in the range...... of seconds instead of milliseconds. The problem of making new features on the larger time scale from the short-time features (feature integration) has only received little attention. This paper investigates different methods for feature integration and late information fusion for music genre classification...
Precision automation of cell type classification and sub-cellular fluorescence quantification from laser scanning confocal images

Directory of Open Access Journals (Sweden)

Hardy Craig Hall

2016-02-01

Full Text Available While novel whole-plant phenotyping technologies have been successfully implemented into functional genomics and breeding programs, the potential of automated phenotyping with cellular resolution is largely unexploited. Laser scanning confocal microscopy has the potential to close this gap by providing spatially highly resolved images containing anatomic as well as chemical information on a subcellular basis. However, in the absence of automated methods, the assessment of the spatial patterns and abundance of fluorescent markers with subcellular resolution is still largely qualitative and time-consuming. Recent advances in image acquisition and analysis, coupled with improvements in microprocessor performance, have brought such automated methods within reach, so that information from thousands of cells per image for hundreds of images may be derived in an experimentally convenient time-frame. Here, we present a MATLAB-based analytical pipeline to 1 segment radial plant organs into individual cells, 2 classify cells into cell type categories based upon random forest classification, 3 divide each cell into sub-regions, and 4 quantify fluorescence intensity to a subcellular degree of precision for a separate fluorescence channel. In this research advance, we demonstrate the precision of this analytical process for the relatively complex tissues of Arabidopsis hypocotyls at various stages of development. High speed and robustness make our approach suitable for phenotyping of large collections of stem-like material and other tissue types.
Low-Rank Sparse Coding for Image Classification

KAUST Repository

Zhang, Tianzhu; Ghanem, Bernard; Liu, Si; Xu, Changsheng; Ahuja, Narendra

2013-01-01

In this paper, we propose a low-rank sparse coding (LRSC) method that exploits local structure information among features in an image for the purpose of image-level classification. LRSC represents densely sampled SIFT descriptors, in a spatial neighborhood, collectively as low-rank, sparse linear combinations of code words. As such, it casts the feature coding problem as a low-rank matrix learning problem, which is different from previous methods that encode features independently. This LRSC has a number of attractive properties. (1) It encourages sparsity in feature codes, locality in codebook construction, and low-rankness for spatial consistency. (2) LRSC encodes local features jointly by considering their low-rank structure information, and is computationally attractive. We evaluate the LRSC by comparing its performance on a set of challenging benchmarks with that of 7 popular coding and other state-of-the-art methods. Our experiments show that by representing local features jointly, LRSC not only outperforms the state-of-the-art in classification accuracy but also improves the time complexity of methods that use a similar sparse linear representation model for feature coding.
Low-Rank Sparse Coding for Image Classification

KAUST Repository

Zhang, Tianzhu

2013-12-01

In this paper, we propose a low-rank sparse coding (LRSC) method that exploits local structure information among features in an image for the purpose of image-level classification. LRSC represents densely sampled SIFT descriptors, in a spatial neighborhood, collectively as low-rank, sparse linear combinations of code words. As such, it casts the feature coding problem as a low-rank matrix learning problem, which is different from previous methods that encode features independently. This LRSC has a number of attractive properties. (1) It encourages sparsity in feature codes, locality in codebook construction, and low-rankness for spatial consistency. (2) LRSC encodes local features jointly by considering their low-rank structure information, and is computationally attractive. We evaluate the LRSC by comparing its performance on a set of challenging benchmarks with that of 7 popular coding and other state-of-the-art methods. Our experiments show that by representing local features jointly, LRSC not only outperforms the state-of-the-art in classification accuracy but also improves the time complexity of methods that use a similar sparse linear representation model for feature coding.
A New Direction of Cancer Classification: Positive Effect of Low-Ranking MicroRNAs.

Science.gov (United States)

Li, Feifei; Piao, Minghao; Piao, Yongjun; Li, Meijing; Ryu, Keun Ho

2014-10-01

Many studies based on microRNA (miRNA) expression profiles showed a new aspect of cancer classification. Because one characteristic of miRNA expression data is the high dimensionality, feature selection methods have been used to facilitate dimensionality reduction. The feature selection methods have one shortcoming thus far: they just consider the problem of where feature to class is 1:1 or n:1. However, because one miRNA may influence more than one type of cancer, human miRNA is considered to be ranked low in traditional feature selection methods and are removed most of the time. In view of the limitation of the miRNA number, low-ranking miRNAs are also important to cancer classification. We considered both high- and low-ranking features to cover all problems (1:1, n:1, 1:n, and m:n) in cancer classification. First, we used the correlation-based feature selection method to select the high-ranking miRNAs, and chose the support vector machine, Bayes network, decision tree, k-nearest-neighbor, and logistic classifier to construct cancer classification. Then, we chose Chi-square test, information gain, gain ratio, and Pearson's correlation feature selection methods to build the m:n feature subset, and used the selected miRNAs to determine cancer classification. The low-ranking miRNA expression profiles achieved higher classification accuracy compared with just using high-ranking miRNAs in traditional feature selection methods. Our results demonstrate that the m:n feature subset made a positive impression of low-ranking miRNAs in cancer classification.

Vehicle Classification Using an Imbalanced Dataset Based on a Single Magnetic Sensor

Directory of Open Access Journals (Sweden)

Chang Xu

2018-05-01

Full Text Available This paper aims to improve the accuracy of automatic vehicle classifiers for imbalanced datasets. Classification is made through utilizing a single anisotropic magnetoresistive sensor, with the models of vehicles involved being classified into hatchbacks, sedans, buses, and multi-purpose vehicles (MPVs. Using time domain and frequency domain features in combination with three common classification algorithms in pattern recognition, we develop a novel feature extraction method for vehicle classification. These three common classification algorithms are the k-nearest neighbor, the support vector machine, and the back-propagation neural network. Nevertheless, a problem remains with the original vehicle magnetic dataset collected being imbalanced, and may lead to inaccurate classification results. With this in mind, we propose an approach called SMOTE, which can further boost the performance of classifiers. Experimental results show that the k-nearest neighbor (KNN classifier with the SMOTE algorithm can reach a classification accuracy of 95.46%, thus minimizing the effect of the imbalance.
Vehicle Classification Using an Imbalanced Dataset Based on a Single Magnetic Sensor.

Science.gov (United States)

Xu, Chang; Wang, Yingguan; Bao, Xinghe; Li, Fengrong

2018-05-24

This paper aims to improve the accuracy of automatic vehicle classifiers for imbalanced datasets. Classification is made through utilizing a single anisotropic magnetoresistive sensor, with the models of vehicles involved being classified into hatchbacks, sedans, buses, and multi-purpose vehicles (MPVs). Using time domain and frequency domain features in combination with three common classification algorithms in pattern recognition, we develop a novel feature extraction method for vehicle classification. These three common classification algorithms are the k-nearest neighbor, the support vector machine, and the back-propagation neural network. Nevertheless, a problem remains with the original vehicle magnetic dataset collected being imbalanced, and may lead to inaccurate classification results. With this in mind, we propose an approach called SMOTE, which can further boost the performance of classifiers. Experimental results show that the k-nearest neighbor (KNN) classifier with the SMOTE algorithm can reach a classification accuracy of 95.46%, thus minimizing the effect of the imbalance.
Is overall similarity classification less effortful than single-dimension classification?

Science.gov (United States)

Wills, Andy J; Milton, Fraser; Longmore, Christopher A; Hester, Sarah; Robinson, Jo

2013-01-01

It is sometimes argued that the implementation of an overall similarity classification is less effortful than the implementation of a single-dimension classification. In the current article, we argue that the evidence securely in support of this view is limited, and report additional evidence in support of the opposite proposition--overall similarity classification is more effortful than single-dimension classification. Using a match-to-standards procedure, Experiments 1A, 1B and 2 demonstrate that concurrent load reduces the prevalence of overall similarity classification, and that this effect is robust to changes in the concurrent load task employed, the level of time pressure experienced, and the short-term memory requirements of the classification task. Experiment 3 demonstrates that participants who produced overall similarity classifications from the outset have larger working memory capacities than those who produced single-dimension classifications initially, and Experiment 4 demonstrates that instructions to respond meticulously increase the prevalence of overall similarity classification.
Conceptual process models and quantitative analysis of classification problems in Scrum software development practices

NARCIS (Netherlands)

Helwerda, L.S.; Niessink, F.; Verbeek, F.J.

2017-01-01

We propose a novel classification method that integrates into existing agile software development practices by collecting data records generated by software and tools used in the development process. We extract features from the collected data and create visualizations that provide insights,
Assessment of survival rates compared according to the Tamai and Yamano classifications in fingertip replantations

OpenAIRE

Mehmet Dadaci; Bilsev Ince; Zeynep Altuntas; Ozan Bitik; Haldun Onuralp Kamburoglu; Hakan Uzun

2016-01-01

Background: The fingertip is the most frequently injured and amputated segment of the hand. There are controversies about defining clear indications for microsurgical replantation. Many classification systems have been proposed to solve this problem. No previous study has simultaneously correlated different classification systems with replant survival rate. The aim of the study is to compare the outcomes of fingertip replantations according to Tamai and Yamano classifications. Materials a...
Proposed declassification of disease categories related to sexual orientation in the International Statistical Classification of Diseases and Related Health Problems (ICD-11).

Science.gov (United States)

Cochran, Susan D; Drescher, Jack; Kismödi, Eszter; Giami, Alain; García-Moreno, Claudia; Atalla, Elham; Marais, Adele; Vieira, Elisabeth Meloni; Reed, Geoffrey M

2014-09-01

The World Health Organization is developing the 11th revision of the International Statistical Classification of Diseases and Related Health Problems (ICD-11), planned for publication in 2017. The Working Group on the Classification of Sexual Disorders and Sexual Health was charged with reviewing and making recommendations on disease categories related to sexuality in the chapter on mental and behavioural disorders in the 10th revision (ICD-10), published in 1990. This chapter includes categories for diagnoses based primarily on sexual orientation even though ICD-10 states that sexual orientation alone is not a disorder. This article reviews the scientific evidence and clinical rationale for continuing to include these categories in the ICD. A review of the evidence published since 1990 found little scientific interest in these categories. In addition, the Working Group found no evidence that they are clinically useful: they neither contribute to health service delivery or treatment selection nor provide essential information for public health surveillance. Moreover, use of these categories may create unnecessary harm by delaying accurate diagnosis and treatment. The Working Group recommends that these categories be deleted entirely from ICD-11. Health concerns related to sexual orientation can be better addressed using other ICD categories.
Efficient Generation and Selection of Combined Features for Improved Classification

KAUST Repository

Shono, Ahmad N.

2014-01-01

This study contributes a methodology and associated toolkit developed to allow users to experiment with the use of combined features in classification problems. Methods are provided for efficiently generating combined features from an original
Search and Classification Using Multiple Autonomous Vehicles Decision-Making and Sensor Management

CERN Document Server

Wang, Yue

2012-01-01

Search and Classification Using Multiple Autonomous Vehicles provides a comprehensive study of decision-making strategies for domain search and object classification using multiple autonomous vehicles (MAV) under both deterministic and probabilistic frameworks. It serves as a first discussion of the problem of effective resource allocation using MAV with sensing limitations, i.e., for search and classification missions over large-scale domains, or when there are far more objects to be found and classified than there are autonomous vehicles available. Under such scenarios, search and classification compete for limited sensing resources. This is because search requires vehicle mobility while classification restricts the vehicles to the vicinity of any objects found. The authors develop decision-making strategies to choose between these competing tasks and vehicle-motion-control laws to achieve the proposed management scheme. Deterministic Lyapunov-based, probabilistic Bayesian-based, and risk-based decision-mak...
Classification of Clouds in Satellite Imagery Using Adaptive Fuzzy Sparse Representation

Directory of Open Access Journals (Sweden)

Wei Jin

2016-12-01

Full Text Available Automatic cloud detection and classification using satellite cloud imagery have various meteorological applications such as weather forecasting and climate monitoring. Cloud pattern analysis is one of the research hotspots recently. Since satellites sense the clouds remotely from space, and different cloud types often overlap and convert into each other, there must be some fuzziness and uncertainty in satellite cloud imagery. Satellite observation is susceptible to noises, while traditional cloud classification methods are sensitive to noises and outliers; it is hard for traditional cloud classification methods to achieve reliable results. To deal with these problems, a satellite cloud classification method using adaptive fuzzy sparse representation-based classification (AFSRC is proposed. Firstly, by defining adaptive parameters related to attenuation rate and critical membership, an improved fuzzy membership is introduced to accommodate the fuzziness and uncertainty of satellite cloud imagery; secondly, by effective combination of the improved fuzzy membership function and sparse representation-based classification (SRC, atoms in training dictionary are optimized; finally, an adaptive fuzzy sparse representation classifier for cloud classification is proposed. Experiment results on FY-2G satellite cloud image show that, the proposed method not only improves the accuracy of cloud classification, but also has strong stability and adaptability with high computational efficiency.
Classification of Clouds in Satellite Imagery Using Adaptive Fuzzy Sparse Representation

Science.gov (United States)

Jin, Wei; Gong, Fei; Zeng, Xingbin; Fu, Randi

2016-01-01

Automatic cloud detection and classification using satellite cloud imagery have various meteorological applications such as weather forecasting and climate monitoring. Cloud pattern analysis is one of the research hotspots recently. Since satellites sense the clouds remotely from space, and different cloud types often overlap and convert into each other, there must be some fuzziness and uncertainty in satellite cloud imagery. Satellite observation is susceptible to noises, while traditional cloud classification methods are sensitive to noises and outliers; it is hard for traditional cloud classification methods to achieve reliable results. To deal with these problems, a satellite cloud classification method using adaptive fuzzy sparse representation-based classification (AFSRC) is proposed. Firstly, by defining adaptive parameters related to attenuation rate and critical membership, an improved fuzzy membership is introduced to accommodate the fuzziness and uncertainty of satellite cloud imagery; secondly, by effective combination of the improved fuzzy membership function and sparse representation-based classification (SRC), atoms in training dictionary are optimized; finally, an adaptive fuzzy sparse representation classifier for cloud classification is proposed. Experiment results on FY-2G satellite cloud image show that, the proposed method not only improves the accuracy of cloud classification, but also has strong stability and adaptability with high computational efficiency. PMID:27999261
Classification of high resolution imagery based on fusion of multiscale texture features

International Nuclear Information System (INIS)

Liu, Jinxiu; Liu, Huiping; Lv, Ying; Xue, Xiaojuan

2014-01-01

In high resolution data classification process, combining texture features with spectral bands can effectively improve the classification accuracy. However, the window size which is difficult to choose is regarded as an important factor influencing overall classification accuracy in textural classification and current approaches to image texture analysis only depend on a single moving window which ignores different scale features of various land cover types. In this paper, we propose a new method based on the fusion of multiscale texture features to overcome these problems. The main steps in new method include the classification of fixed window size spectral/textural images from 3×3 to 15×15 and comparison of all the posterior possibility values for every pixel, as a result the biggest probability value is given to the pixel and the pixel belongs to a certain land cover type automatically. The proposed approach is tested on University of Pavia ROSIS data. The results indicate that the new method improve the classification accuracy compared to results of methods based on fixed window size textural classification
Computer Aided Design for Soil Classification Relational Database ...

African Journals Online (AJOL)

The paper focuses on the problems associated with classification, storage and retrieval of information on soil data, such as the incompatibility of soil data semantics; inadequate documentation, and lack of indexing; hence it is pretty difficult to efficiently access large database. Consequently, information on soil is very difficult ...
Use of genetic toxicity data in GHS mutagenicity classification and labeling of substances.

Science.gov (United States)

Ball, Nicholas S; Hollnagel, Heli M

2017-06-01

One of the key outcomes of testing the potential genotoxicity or mutagenicity of a substance is the conclusion on whether the substance should be classified as a germ cell mutagen and the significance of this for other endpoints such as carcinogenicity. The basis for this conclusion are the criteria presented in classification and labelling systems such as the Globally Harmonized System for classification and labeling (GHS). This article reviews the classification criteria for germ cell mutagenicity and carcinogenicity and how they are applied to substances with evidence of mutagenicity. The implications and suitability of such a classification for hazard communication, risk assessment, and risk management are discussed. It is proposed that genotoxicity assessments should not focus on specifically identifying germ cell mutagens, particularly given the challenges associated with communicating this information in a meaningful way. Rather the focus should be on deriving data to characterize the mode of action and for use in the risk assessment of mutagens, which could then feed into a more robust, risk based management of mutagenic substances versus the current more hazard based approaches. Environ. Mol. Mutagen. 58:354-360, 2017. © 2017 Wiley Periodicals, Inc. © 2017 Wiley Periodicals, Inc.
Nanog Fluctuations in Embryonic Stem Cells Highlight the Problem of Measurement in Cell Biology.

Science.gov (United States)

Smith, Rosanna C G; Stumpf, Patrick S; Ridden, Sonya J; Sim, Aaron; Filippi, Sarah; Harrington, Heather A; MacArthur, Ben D

2017-06-20

A number of important pluripotency regulators, including the transcription factor Nanog, are observed to fluctuate stochastically in individual embryonic stem cells. By transiently priming cells for commitment to different lineages, these fluctuations are thought to be important to the maintenance of, and exit from, pluripotency. However, because temporal changes in intracellular protein abundances cannot be measured directly in live cells, fluctuations are typically assessed using genetically engineered reporter cell lines that produce a fluorescent signal as a proxy for protein expression. Here, using a combination of mathematical modeling and experiment, we show that there are unforeseen ways in which widely used reporter strategies can systematically disturb the dynamics they are intended to monitor, sometimes giving profoundly misleading results. In the case of Nanog, we show how genetic reporters can compromise the behavior of important pluripotency-sustaining positive feedback loops, and induce a bifurcation in the underlying dynamics that gives rise to heterogeneous Nanog expression patterns in reporter cell lines that are not representative of the wild-type. These findings help explain the range of published observations of Nanog variability and highlight the problem of measurement in live cells. Copyright © 2017 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Classification of nutrient emission sources in the Vistula River system

International Nuclear Information System (INIS)

Kowalkowski, Tomasz

2009-01-01

Eutrophication of the Baltic sea still remains one of the biggest problems in the north-eastern area of Europe. Recognizing the sources of nutrient emission, classification of their importance and finding the way towards reduction of pollution are the most important tasks for scientists researching this area. This article presents the chemometric approach to the classification of nutrient emission with respect to the regionalisation of emission sources within the Vistula River basin (Poland). Modelled data for mean yearly emission of nitrogen and phosphorus in 1991-2000 has been used for the classification. Seventeen subcatchements in the Vistula basin have been classified according to cluster and factor analyses. The results of this analysis allowed determination of groups of areas with similar pollution characteristics and indicate the need for spatial differentiation of policies and strategies. Three major factors indicating urban, erosion and agricultural sources have been identified as major discriminants of the groups. - Two classification methods applied to evaluate the results of nutrient emission allow definition of major sources of the emissions and classification of catchments with similar pollution.
Machine Learning Classification of Buildings for Map Generalization

Directory of Open Access Journals (Sweden)

Jaeeun Lee

2017-10-01

Full Text Available A critical problem in mapping data is the frequent updating of large data sets. To solve this problem, the updating of small-scale data based on large-scale data is very effective. Various map generalization techniques, such as simplification, displacement, typification, elimination, and aggregation, must therefore be applied. In this study, we focused on the elimination and aggregation of the building layer, for which each building in a large scale was classified as “0-eliminated,” “1-retained,” or “2-aggregated.” Machine-learning classification algorithms were then used for classifying the buildings. The data of 1:1000 scale and 1:25,000 scale digital maps obtained from the National Geographic Information Institute were used. We applied to these data various machine-learning classification algorithms, including naive Bayes (NB, decision tree (DT, k-nearest neighbor (k-NN, and support vector machine (SVM. The overall accuracies of each algorithm were satisfactory: DT, 88.96%; k-NN, 88.27%; SVM, 87.57%; and NB, 79.50%. Although elimination is a direct part of the proposed process, generalization operations, such as simplification and aggregation of polygons, must still be performed for buildings classified as retained and aggregated. Thus, these algorithms can be used for building classification and can serve as preparatory steps for building generalization.
Feature extraction for classification in the data mining process

NARCIS (Netherlands)

Pechenizkiy, M.; Puuronen, S.; Tsymbal, A.

2003-01-01

Dimensionality reduction is a very important step in the data mining process. In this paper, we consider feature extraction for classification tasks as a technique to overcome problems occurring because of "the curse of dimensionality". Three different eigenvector-based feature extraction approaches
Progress, problems and prospects of porcine pluripotent stem cells

Directory of Open Access Journals (Sweden)

Hanning WANG,Yangli PEI,Ning LI,Jianyong HAN

2014-02-01

Full Text Available Pluripotent stem cells (PSCs, including embryonic stem cells (ESCs and induced PSCs (iPSCs, can differentiate into cells of the three germ layers, suggesting that PSCs have great potential for basic developmental biology research and wide applications for clinical medicine. Genuine ESCs and iPSCs have been derived from mice and rats, but not from livestock such as the pig─an ideal animal model for studying human disease and regenerative medicine due to similarities with human physiologic processes. Efforts to derive porcine ESCs and iPSCs have not yielded high-quality PSCs that can produce chimeras with germline transmission. Thus, exploration of the unique porcine gene regulation network of preimplantation embryonic development may permit optimization of in vitro culture systems for raising porcine PSCs. Here we summarize the recent progress in porcine PSC generation as well as the problems encountered during this progress and we depict prospects for generating porcine naive PSCs.
On the classification techniques in data mining for microarray data classification

Science.gov (United States)

Aydadenta, Husna; Adiwijaya

2018-03-01

Cancer is one of the deadly diseases, according to data from WHO by 2015 there are 8.8 million more deaths caused by cancer, and this will increase every year if not resolved earlier. Microarray data has become one of the most popular cancer-identification studies in the field of health, since microarray data can be used to look at levels of gene expression in certain cell samples that serve to analyze thousands of genes simultaneously. By using data mining technique, we can classify the sample of microarray data thus it can be identified with cancer or not. In this paper we will discuss some research using some data mining techniques using microarray data, such as Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5, and simulation of Random Forest algorithm with technique of reduction dimension using Relief. The result of this paper show performance measure (accuracy) from classification algorithm (SVM, ANN, Naive Bayes, kNN, C4.5, and Random Forets).The results in this paper show the accuracy of Random Forest algorithm higher than other classification algorithms (Support Vector Machine (SVM), Artificial Neural Network (ANN), Naive Bayes, k-Nearest Neighbor (kNN), and C4.5). It is hoped that this paper can provide some information about the speed, accuracy, performance and computational cost generated from each Data Mining Classification Technique based on microarray data.
A Comparative Study of Classification and Regression Algorithms for Modelling Students' Academic Performance

Science.gov (United States)

Strecht, Pedro; Cruz, Luís; Soares, Carlos; Mendes-Moreira, João; Abreu, Rui

2015-01-01

Predicting the success or failure of a student in a course or program is a problem that has recently been addressed using data mining techniques. In this paper we evaluate some of the most popular classification and regression algorithms on this problem. We address two problems: prediction of approval/failure and prediction of grade. The former is…

Web Page Classification Method Using Neural Networks

Science.gov (United States)

Selamat, Ali; Omatu, Sigeru; Yanagimoto, Hidekazu; Fujinaka, Toru; Yoshioka, Michifumi

Automatic categorization is the only viable method to deal with the scaling problem of the World Wide Web (WWW). In this paper, we propose a news web page classification method (WPCM). The WPCM uses a neural network with inputs obtained by both the principal components and class profile-based features (CPBF). Each news web page is represented by the term-weighting scheme. As the number of unique words in the collection set is big, the principal component analysis (PCA) has been used to select the most relevant features for the classification. Then the final output of the PCA is combined with the feature vectors from the class-profile which contains the most regular words in each class before feeding them to the neural networks. We have manually selected the most regular words that exist in each class and weighted them using an entropy weighting scheme. The fixed number of regular words from each class will be used as a feature vectors together with the reduced principal components from the PCA. These feature vectors are then used as the input to the neural networks for classification. The experimental evaluation demonstrates that the WPCM method provides acceptable classification accuracy with the sports news datasets.
OmniGA: Optimized Omnivariate Decision Trees for Generalizable Classification Models

KAUST Repository

Magana-Mora, Arturo

2017-06-14

Classification problems from different domains vary in complexity, size, and imbalance of the number of samples from different classes. Although several classification models have been proposed, selecting the right model and parameters for a given classification task to achieve good performance is not trivial. Therefore, there is a constant interest in developing novel robust and efficient models suitable for a great variety of data. Here, we propose OmniGA, a framework for the optimization of omnivariate decision trees based on a parallel genetic algorithm, coupled with deep learning structure and ensemble learning methods. The performance of the OmniGA framework is evaluated on 12 different datasets taken mainly from biomedical problems and compared with the results obtained by several robust and commonly used machine-learning models with optimized parameters. The results show that OmniGA systematically outperformed these models for all the considered datasets, reducing the F score error in the range from 100% to 2.25%, compared to the best performing model. This demonstrates that OmniGA produces robust models with improved performance. OmniGA code and datasets are available at www.cbrc.kaust.edu.sa/omniga/.
OmniGA: Optimized Omnivariate Decision Trees for Generalizable Classification Models

KAUST Repository

Magana-Mora, Arturo; Bajic, Vladimir B.

2017-01-01

Classification problems from different domains vary in complexity, size, and imbalance of the number of samples from different classes. Although several classification models have been proposed, selecting the right model and parameters for a given classification task to achieve good performance is not trivial. Therefore, there is a constant interest in developing novel robust and efficient models suitable for a great variety of data. Here, we propose OmniGA, a framework for the optimization of omnivariate decision trees based on a parallel genetic algorithm, coupled with deep learning structure and ensemble learning methods. The performance of the OmniGA framework is evaluated on 12 different datasets taken mainly from biomedical problems and compared with the results obtained by several robust and commonly used machine-learning models with optimized parameters. The results show that OmniGA systematically outperformed these models for all the considered datasets, reducing the F score error in the range from 100% to 2.25%, compared to the best performing model. This demonstrates that OmniGA produces robust models with improved performance. OmniGA code and datasets are available at www.cbrc.kaust.edu.sa/omniga/.
Classification and Criticism of Nigeria Literary Drama | Iwuchukwu ...

African Journals Online (AJOL)

Nigerian drama has gained prominent and permanent position on the world literary map especially with the winning of the Nobel Prize by Wole Soyinka. In spite of this, problems of definition and criticism of Nigerian drama still persists. The Relativist-Evolution controversies on the origin and classification of Nigerian drama ...
Deep learning architectures for multi-label classification of intelligent health risk prediction.

Science.gov (United States)

Maxwell, Andrew; Li, Runzhi; Yang, Bei; Weng, Heng; Ou, Aihua; Hong, Huixiao; Zhou, Zhaoxian; Gong, Ping; Zhang, Chaoyang

2017-12-28

Multi-label classification of data remains to be a challenging problem. Because of the complexity of the data, it is sometimes difficult to infer information about classes that are not mutually exclusive. For medical data, patients could have symptoms of multiple different diseases at the same time and it is important to develop tools that help to identify problems early. Intelligent health risk prediction models built with deep learning architectures offer a powerful tool for physicians to identify patterns in patient data that indicate risks associated with certain types of chronic diseases. Physical examination records of 110,300 anonymous patients were used to predict diabetes, hypertension, fatty liver, a combination of these three chronic diseases, and the absence of disease (8 classes in total). The dataset was split into training (90%) and testing (10%) sub-datasets. Ten-fold cross validation was used to evaluate prediction accuracy with metrics such as precision, recall, and F-score. Deep Learning (DL) architectures were compared with standard and state-of-the-art multi-label classification methods. Preliminary results suggest that Deep Neural Networks (DNN), a DL architecture, when applied to multi-label classification of chronic diseases, produced accuracy that was comparable to that of common methods such as Support Vector Machines. We have implemented DNNs to handle both problem transformation and algorithm adaption type multi-label methods and compare both to see which is preferable. Deep Learning architectures have the potential of inferring more information about the patterns of physical examination data than common classification methods. The advanced techniques of Deep Learning can be used to identify the significance of different features from physical examination data as well as to learn the contributions of each feature that impact a patient's risk for chronic diseases. However, accurate prediction of chronic disease risks remains a challenging
SAW Classification Algorithm for Chinese Text Classification

OpenAIRE

Xiaoli Guo; Huiyu Sun; Tiehua Zhou; Ling Wang; Zhaoyang Qu; Jiannan Zang

2015-01-01

Considering the explosive growth of data, the increased amount of text data’s effect on the performance of text categorization forward the need for higher requirements, such that the existing classification method cannot be satisfied. Based on the study of existing text classification technology and semantics, this paper puts forward a kind of Chinese text classification oriented SAW (Structural Auxiliary Word) algorithm. The algorithm uses the special space effect of Chinese text where words...
Word Embedding Perturbation for Sentence Classification

OpenAIRE

Zhang, Dongxu; Yang, Zhichao

2018-01-01

In this technique report, we aim to mitigate the overfitting problem of natural language by applying data augmentation methods. Specifically, we attempt several types of noise to perturb the input word embedding, such as Gaussian noise, Bernoulli noise, and adversarial noise, etc. We also apply several constraints on different types of noise. By implementing these proposed data augmentation methods, the baseline models can gain improvements on several sentence classification tasks.
Classification in context

DEFF Research Database (Denmark)

Mai, Jens Erik

2004-01-01

This paper surveys classification research literature, discusses various classification theories, and shows that the focus has traditionally been on establishing a scientific foundation for classification research. This paper argues that a shift has taken place, and suggests that contemporary...... classification research focus on contextual information as the guide for the design and construction of classification schemes....
Design and Evaluation of Smart Glasses for Food Intake and Physical Activity Classification.

Science.gov (United States)

Chung, Jungman; Oh, Wonjoon; Baek, Dongyoub; Ryu, Sunwoong; Lee, Won Gu; Bang, Hyunwoo

2018-02-14

This study presents a series of protocols of designing and manufacturing a glasses-type wearable device that detects the patterns of temporalis muscle activities during food intake and other physical activities. We fabricated a 3D-printed frame of the glasses and a load cell-integrated printed circuit board (PCB) module inserted in both hinges of the frame. The module was used to acquire the force signals, and transmit them wirelessly. These procedures provide the system with higher mobility, which can be evaluated in practical wearing conditions such as walking and waggling. A performance of the classification is also evaluated by distinguishing the patterns of food intake from those physical activities. A series of algorithms were used to preprocess the signals, generate feature vectors, and recognize the patterns of several featured activities (chewing and winking), and other physical activities (sedentary rest, talking, and walking). The results showed that the average F1 score of the classification among the featured activities was 91.4%. We believe this approach can be potentially useful for automatic and objective monitoring of ingestive behaviors with higher accuracy as practical means to treat ingestive problems.
3D representations of amino acids—applications to protein sequence comparison and classification

Directory of Open Access Journals (Sweden)

Jie Li

2014-08-01

Full Text Available The amino acid sequence of a protein is the key to understanding its structure and ultimately its function in the cell. This paper addresses the fundamental issue of encoding amino acids in ways that the representation of such a protein sequence facilitates the decoding of its information content. We show that a feature-based representation in a three-dimensional (3D space derived from amino acid substitution matrices provides an adequate representation that can be used for direct comparison of protein sequences based on geometry. We measure the performance of such a representation in the context of the protein structural fold prediction problem. We compare the results of classifying different sets of proteins belonging to distinct structural folds against classifications of the same proteins obtained from sequence alone or directly from structural information. We find that sequence alone performs poorly as a structure classifier. We show in contrast that the use of the three dimensional representation of the sequences significantly improves the classification accuracy. We conclude with a discussion of the current limitations of such a representation and with a description of potential improvements.
Convergence of Cell Based Finite Volume Discretizations for Problems of Control in the Conduction Coefficients

DEFF Research Database (Denmark)

Evgrafov, Anton; Gregersen, Misha Marie; Sørensen, Mads Peter

2011-01-01

We present a convergence analysis of a cell-based finite volume (FV) discretization scheme applied to a problem of control in the coefficients of a generalized Laplace equation modelling, for example, a steady state heat conduction. Such problems arise in applications dealing with geometric optimal......, whereas the convergence of the coefficients happens only with respect to the "volumetric" Lebesgue measure. Additionally, depending on whether the stationarity conditions are stated for the discretized or the original continuous problem, two distinct concepts of stationarity at a discrete level arise. We...... provide characterizations of limit points, with respect to FV mesh size, of globally optimal solutions and two types of stationary points to the discretized problems. We illustrate the practical behaviour of our cell-based FV discretization algorithm on a numerical example....
Trends and concepts in fern classification

Science.gov (United States)

Christenhusz, Maarten J. M.; Chase, Mark W.

2014-01-01

sister to all other vascular plants, whereas the whisk ferns (Psilotaceae), often included in the lycopods or believed to be associated with the first vascular plants, are sister to Ophioglossaceae and thus belong to the fern clade. The horsetails (Equisetaceae) are also members of the fern clade (sometimes inappropriately called ‘monilophytes’), but, within that clade, their placement is still uncertain. Leptosporangiate ferns are better understood, although deep relationships within this group are still unresolved. Earlier, almost all leptosporangiate ferns were placed in a single family (Polypodiaceae or Dennstaedtiaceae), but these families have been redefined to narrower more natural entities. Conclusions Concluding this paper, a classification is presented based on our current understanding of relationships of fern and lycopod clades. Major changes in our understanding of these families are highlighted, illustrating issues of classification in relation to convergent evolution and false homologies. Problems with the current classification and groups that still need study are pointed out. A summary phylogenetic tree is also presented. A new classification in which Aspleniaceae, Cyatheaceae, Polypodiaceae and Schizaeaceae are expanded in comparison with the most recent classifications is presented, which is a modification of those proposed by Smith et al. (2006, 2008) and Christenhusz et al. (2011). These classifications are now finding a wider acceptance and use, and even though a few amendments are made based on recently published results from molecular analyses, we have aimed for a stable family and generic classification of ferns. PMID:24532607
Trends and concepts in fern classification.

Science.gov (United States)

Christenhusz, Maarten J M; Chase, Mark W

2014-03-01

the whisk ferns (Psilotaceae), often included in the lycopods or believed to be associated with the first vascular plants, are sister to Ophioglossaceae and thus belong to the fern clade. The horsetails (Equisetaceae) are also members of the fern clade (sometimes inappropriately called 'monilophytes'), but, within that clade, their placement is still uncertain. Leptosporangiate ferns are better understood, although deep relationships within this group are still unresolved. Earlier, almost all leptosporangiate ferns were placed in a single family (Polypodiaceae or Dennstaedtiaceae), but these families have been redefined to narrower more natural entities. Concluding this paper, a classification is presented based on our current understanding of relationships of fern and lycopod clades. Major changes in our understanding of these families are highlighted, illustrating issues of classification in relation to convergent evolution and false homologies. Problems with the current classification and groups that still need study are pointed out. A summary phylogenetic tree is also presented. A new classification in which Aspleniaceae, Cyatheaceae, Polypodiaceae and Schizaeaceae are expanded in comparison with the most recent classifications is presented, which is a modification of those proposed by Smith et al. (2006, 2008) and Christenhusz et al. (2011). These classifications are now finding a wider acceptance and use, and even though a few amendments are made based on recently published results from molecular analyses, we have aimed for a stable family and generic classification of ferns.
Maternal cell phone use during pregnancy and child behavioral problems in five birth cohorts

NARCIS (Netherlands)

Birks, Laura; Guxens, Mònica; Papadopoulou, Eleni; Alexander, Jan; Ballester, Ferran; Estarlich, Marisa; Gallastegi, Mara; Ha, Mina; Haugen, Margaretha; Huss, Anke; Kheifets, Leeka; Lim, Hyungryul; Olsen, Jørn; Santa-Marina, Loreto; Sudan, Madhuri; Vermeulen, Roel; Vrijkotte, Tanja; Cardis, Elisabeth; Vrijheid, Martine

INTRODUCTION: Previous studies have reported associations between prenatal cell phone use and child behavioral problems, but findings have been inconsistent and based on retrospective assessment of cell phone use. This study aimed to assess this association in a multi-national analysis, using data
Colour based off-road environment and terrain type classification

NARCIS (Netherlands)

Jansen, P.; Mark, W. van der; Heuvel, J.C. van den; Groen, F.C.A.

2005-01-01

Terrain classification is an important problem that still remains to be solved for off-road autonomous robot vehicle guidance. Often, obstacle detection systems are used which cannot distinguish between solid obstacles such as rocks or soft obstacles such as tall patches of grass. Terrain
Seeing is believing: video classification for computed tomographic colonography using multiple-instance learning.

Science.gov (United States)

Wang, Shijun; McKenna, Matthew T; Nguyen, Tan B; Burns, Joseph E; Petrick, Nicholas; Sahiner, Berkman; Summers, Ronald M

2012-05-01

In this paper, we present development and testing results for a novel colonic polyp classification method for use as part of a computed tomographic colonography (CTC) computer-aided detection (CAD) system. Inspired by the interpretative methodology of radiologists using 3-D fly-through mode in CTC reading, we have developed an algorithm which utilizes sequences of images (referred to here as videos) for classification of CAD marks. For each CAD mark, we created a video composed of a series of intraluminal, volume-rendered images visualizing the detection from multiple viewpoints. We then framed the video classification question as a multiple-instance learning (MIL) problem. Since a positive (negative) bag may contain negative (positive) instances, which in our case depends on the viewing angles and camera distance to the target, we developed a novel MIL paradigm to accommodate this class of problems. We solved the new MIL problem by maximizing a L2-norm soft margin using semidefinite programming, which can optimize relevant parameters automatically. We tested our method by analyzing a CTC data set obtained from 50 patients from three medical centers. Our proposed method showed significantly better performance compared with several traditional MIL methods.
Classification

DEFF Research Database (Denmark)

Hjørland, Birger

2017-01-01

This article presents and discusses definitions of the term “classification” and the related concepts “Concept/conceptualization,”“categorization,” “ordering,” “taxonomy” and “typology.” It further presents and discusses theories of classification including the influences of Aristotle...... and Wittgenstein. It presents different views on forming classes, including logical division, numerical taxonomy, historical classification, hermeneutical and pragmatic/critical views. Finally, issues related to artificial versus natural classification and taxonomic monism versus taxonomic pluralism are briefly...
Likelihood ratio model for classification of forensic evidence

Energy Technology Data Exchange (ETDEWEB)

Zadora, G., E-mail: gzadora@ies.krakow.pl [Institute of Forensic Research, Westerplatte 9, 31-033 Krakow (Poland); Neocleous, T., E-mail: tereza@stats.gla.ac.uk [University of Glasgow, Department of Statistics, 15 University Gardens, Glasgow G12 8QW (United Kingdom)

2009-05-29

One of the problems of analysis of forensic evidence such as glass fragments, is the determination of their use-type category, e.g. does a glass fragment originate from an unknown window or container? Very small glass fragments arise during various accidents and criminal offences, and could be carried on the clothes, shoes and hair of participants. It is therefore necessary to obtain information on their physicochemical composition in order to solve the classification problem. Scanning Electron Microscopy coupled with an Energy Dispersive X-ray Spectrometer and the Glass Refractive Index Measurement method are routinely used in many forensic institutes for the investigation of glass. A natural form of glass evidence evaluation for forensic purposes is the likelihood ratio-LR = p(E|H{sub 1})/p(E|H{sub 2}). The main aim of this paper was to study the performance of LR models for glass object classification which considered one or two sources of data variability, i.e. between-glass-object variability and(or) within-glass-object variability. Within the proposed model a multivariate kernel density approach was adopted for modelling the between-object distribution and a multivariate normal distribution was adopted for modelling within-object distributions. Moreover, a graphical method of estimating the dependence structure was employed to reduce the highly multivariate problem to several lower-dimensional problems. The performed analysis showed that the best likelihood model was the one which allows to include information about between and within-object variability, and with variables derived from elemental compositions measured by SEM-EDX, and refractive values determined before (RI{sub b}) and after (RI{sub a}) the annealing process, in the form of dRI = log{sub 10}|RI{sub a} - RI{sub b}|. This model gave better results than the model with only between-object variability considered. In addition, when dRI and variables derived from elemental compositions were used, this
Likelihood ratio model for classification of forensic evidence

International Nuclear Information System (INIS)

Zadora, G.; Neocleous, T.

2009-01-01

One of the problems of analysis of forensic evidence such as glass fragments, is the determination of their use-type category, e.g. does a glass fragment originate from an unknown window or container? Very small glass fragments arise during various accidents and criminal offences, and could be carried on the clothes, shoes and hair of participants. It is therefore necessary to obtain information on their physicochemical composition in order to solve the classification problem. Scanning Electron Microscopy coupled with an Energy Dispersive X-ray Spectrometer and the Glass Refractive Index Measurement method are routinely used in many forensic institutes for the investigation of glass. A natural form of glass evidence evaluation for forensic purposes is the likelihood ratio-LR = p(E|H 1 )/p(E|H 2 ). The main aim of this paper was to study the performance of LR models for glass object classification which considered one or two sources of data variability, i.e. between-glass-object variability and(or) within-glass-object variability. Within the proposed model a multivariate kernel density approach was adopted for modelling the between-object distribution and a multivariate normal distribution was adopted for modelling within-object distributions. Moreover, a graphical method of estimating the dependence structure was employed to reduce the highly multivariate problem to several lower-dimensional problems. The performed analysis showed that the best likelihood model was the one which allows to include information about between and within-object variability, and with variables derived from elemental compositions measured by SEM-EDX, and refractive values determined before (RI b ) and after (RI a ) the annealing process, in the form of dRI = log 10 |RI a - RI b |. This model gave better results than the model with only between-object variability considered. In addition, when dRI and variables derived from elemental compositions were used, this model outperformed two other
Dirichlet problem for quasi-linear elliptic equations

Directory of Open Access Journals (Sweden)

Azeddine Baalal

2002-10-01

Full Text Available We study the Dirichlet Problem associated to the quasilinear elliptic problem $$ -sum_{i=1}^{n}frac{partial }{partial x_i}mathcal{A}_i(x,u(x, abla u(x+mathcal{B}(x,u(x,abla u(x=0. $$ Then we define a potential theory related to this problem and we show that the sheaf of continuous solutions satisfies the Bauer axiomatic theory. Submitted April 9, 2002. Published October 2, 2002. Math Subject Classifications: 31C15, 35B65, 35J60. Key Words: Supersolution; Dirichlet problem; obstacle problem; nonlinear potential theory.

Prediction of Infertility Treatment Outcomes Using Classification Trees

Directory of Open Access Journals (Sweden)

Milewska Anna Justyna

2016-12-01

Full Text Available Infertility is currently a common problem with causes that are often unexplained, which complicates treatment. In many cases, the use of ART methods provides the only possibility of getting pregnant. Analysis of this type of data is very complex. More and more often, data mining methods or artificial intelligence techniques are appropriate for solving such problems. In this study, classification trees were used for analysis. This resulted in obtaining a group of patients characterized most likely to get pregnant while using in vitro fertilization.
Hybrid Model Based on Genetic Algorithms and SVM Applied to Variable Selection within Fruit Juice Classification

Directory of Open Access Journals (Sweden)

C. Fernandez-Lozano

2013-01-01

Full Text Available Given the background of the use of Neural Networks in problems of apple juice classification, this paper aim at implementing a newly developed method in the field of machine learning: the Support Vector Machines (SVM. Therefore, a hybrid model that combines genetic algorithms and support vector machines is suggested in such a way that, when using SVM as a fitness function of the Genetic Algorithm (GA, the most representative variables for a specific classification problem can be selected.
Deteksi Penyakit Dengue Hemorrhagic Fever dengan Pendekatan One Class Classification

Directory of Open Access Journals (Sweden)

Zida Ziyan Azkiya

2017-10-01

Full Text Available Two class classification problem maps input into two target classes. In certain cases, training data is available only in the form of a single class, as in the case of Dengue Hemorrhagic Fever (DHF patients, where only data of positive patients is available. In this paper, we report our experiment in building a classification model for detecting DHF infection using One Class Classification (OCC approach. Data from this study is sourced from laboratory tests of patients with dengue fever. The OCC methods compared are One-Class Support Vector Machine and One-Class K-Means. The result shows SVM method obtained precision value = 1.0, recall = 0.993, f-1 score = 0.997, and accuracy of 99.7% while the K-Means method obtained precision value = 0.901, recall = 0.973, f- 1 score = 0.936, and accuracy of 93.3%. This indicates that the SVM method is slightly superior to K-Means for One-Class Classification of DHF patients.
Automatic classification of time-variable X-ray sources

Energy Technology Data Exchange (ETDEWEB)

Lo, Kitty K.; Farrell, Sean; Murphy, Tara; Gaensler, B. M. [Sydney Institute for Astronomy, School of Physics, The University of Sydney, Sydney, NSW 2006 (Australia)

2014-05-01

To maximize the discovery potential of future synoptic surveys, especially in the field of transient science, it will be necessary to use automatic classification to identify some of the astronomical sources. The data mining technique of supervised classification is suitable for this problem. Here, we present a supervised learning method to automatically classify variable X-ray sources in the Second XMM-Newton Serendipitous Source Catalog (2XMMi-DR2). Random Forest is our classifier of choice since it is one of the most accurate learning algorithms available. Our training set consists of 873 variable sources and their features are derived from time series, spectra, and other multi-wavelength contextual information. The 10 fold cross validation accuracy of the training data is ∼97% on a 7 class data set. We applied the trained classification model to 411 unknown variable 2XMM sources to produce a probabilistically classified catalog. Using the classification margin and the Random Forest derived outlier measure, we identified 12 anomalous sources, of which 2XMM J180658.7–500250 appears to be the most unusual source in the sample. Its X-ray spectra is suggestive of a ultraluminous X-ray source but its variability makes it highly unusual. Machine-learned classification and anomaly detection will facilitate scientific discoveries in the era of all-sky surveys.
Automatic classification of time-variable X-ray sources

International Nuclear Information System (INIS)

Lo, Kitty K.; Farrell, Sean; Murphy, Tara; Gaensler, B. M.

2014-01-01

To maximize the discovery potential of future synoptic surveys, especially in the field of transient science, it will be necessary to use automatic classification to identify some of the astronomical sources. The data mining technique of supervised classification is suitable for this problem. Here, we present a supervised learning method to automatically classify variable X-ray sources in the Second XMM-Newton Serendipitous Source Catalog (2XMMi-DR2). Random Forest is our classifier of choice since it is one of the most accurate learning algorithms available. Our training set consists of 873 variable sources and their features are derived from time series, spectra, and other multi-wavelength contextual information. The 10 fold cross validation accuracy of the training data is ∼97% on a 7 class data set. We applied the trained classification model to 411 unknown variable 2XMM sources to produce a probabilistically classified catalog. Using the classification margin and the Random Forest derived outlier measure, we identified 12 anomalous sources, of which 2XMM J180658.7–500250 appears to be the most unusual source in the sample. Its X-ray spectra is suggestive of a ultraluminous X-ray source but its variability makes it highly unusual. Machine-learned classification and anomaly detection will facilitate scientific discoveries in the era of all-sky surveys.
Semi-Supervised Learning for Classification of Protein Sequence Data

Directory of Open Access Journals (Sweden)

Brian R. King

2008-01-01

Full Text Available Protein sequence data continue to become available at an exponential rate. Annotation of functional and structural attributes of these data lags far behind, with only a small fraction of the data understood and labeled by experimental methods. Classification methods that are based on semi-supervised learning can increase the overall accuracy of classifying partly labeled data in many domains, but very few methods exist that have shown their effect on protein sequence classification. We show how proven methods from text classification can be applied to protein sequence data, as we consider both existing and novel extensions to the basic methods, and demonstrate restrictions and differences that must be considered. We demonstrate comparative results against the transductive support vector machine, and show superior results on the most difficult classification problems. Our results show that large repositories of unlabeled protein sequence data can indeed be used to improve predictive performance, particularly in situations where there are fewer labeled protein sequences available, and/or the data are highly unbalanced in nature.
Etiological classification of depression based on the enzymes of tryptophan metabolism.

Science.gov (United States)

Fukuda, Katsuhiko

2014-12-24

Viewed in terms of input and output, the mechanisms of depression are still akin to a black box. However, there must be main pivots for diverse types of depression. From recent therapeutic observations, both the serotonin (5-HT) and kynurenine pathways of tryptophan metabolism may be of particular importance to improved understanding of depression. Here, I propose an etiological classification of depression, based on key peripheral and central enzymes of tryptophan metabolism. Endogenous depression is caused by a larger genetic component than reactive depression. Besides enterochromaffin and mast cells, tryptophan hydroxylase 1 (TPH1), primarily expressed in the gastrointestinal tract, is also found in 5-hydroxytryptophan-producing cells (5-HTP cells) in normal intestinal enterocytes, which are thought to essentially shunt 5-HT production in 5-HT-producing cells. Genetic studies have reported an association between TPH1 and depression, or the responsiveness of depression to antidepressive medication. Therefore, it is possible that hypofunctional 5-HTP cells (reflecting TPH1 dysfunction) in the periphery lead to deficient brain 5-HT levels. Additionally,it has been reported that higher TPH2 expression in depressed suicides may reflect a homeostatic response to deficient 5-HT levels. Subsequently, endogenous depression may be caused by TPH1 dysfunction combined with compensatory TPH2 activation. Reactive depression results from life stresses and involves the hypothalamic-pituitary-adrenal axis, with resulting cortisol production inducing tryptophan 2,3-dioxygenase (TDO) activation. In secondary depression, caused by inflammation, infection, or oxidative stress, indoleamine 2,3-dioxygenase (IDO) is activated. In both reactive and secondary depression, the balance between 3-hydroxykynurenine (3-HK) and kynurenic acid may shift towards 3-HK production via kynurenine-3-monooxygenase (KMO) activation. By shifting the equilibrium position of key enzymes of tryptophan
Underwater object classification using scattering transform of sonar signals

Science.gov (United States)

Saito, Naoki; Weber, David S.

2017-08-01

In this paper, we apply the scattering transform (ST)-a nonlinear map based off of a convolutional neural network (CNN)-to classification of underwater objects using sonar signals. The ST formalizes the observation that the filters learned by a CNN have wavelet-like structure. We achieve effective binary classification both on a real dataset of Unexploded Ordinance (UXOs), as well as synthetically generated examples. We also explore the effects on the waveforms with respect to changes in the object domain (e.g., translation, rotation, and acoustic impedance, etc.), and examine the consequences coming from theoretical results for the scattering transform. We show that the scattering transform is capable of excellent classification on both the synthetic and real problems, thanks to having more quasi-invariance properties that are well-suited to translation and rotation of the object.
Three-class classification in computer-aided diagnosis of breast cancer by support vector machine

Science.gov (United States)

Sun, Xuejun; Qian, Wei; Song, Dansheng

2004-05-01

Design of classifier in computer-aided diagnosis (CAD) scheme of breast cancer plays important role to its overall performance in sensitivity and specificity. Classification of a detected object as malignant lesion, benign lesion, or normal tissue on mammogram is a typical three-class pattern recognition problem. This paper presents a three-class classification approach by using two-stage classifier combined with support vector machine (SVM) learning algorithm for classification of breast cancer on mammograms. The first classification stage is used to detect abnormal areas and normal breast tissues, and the second stage is for classification of malignant or benign in detected abnormal objects. A series of spatial, morphology and texture features have been extracted on detected objects areas. By using genetic algorithm (GA), different feature groups for different stage classification have been investigated. Computerized free-response receiver operating characteristic (FROC) and receiver operating characteristic (ROC) analyses have been employed in different classification stages. Results have shown that obvious performance improvement in both sensitivity and specificity was observed through proposed classification approach compared with conventional two-class classification approaches, indicating its effectiveness in classification of breast cancer on mammograms.
Maternal cell phone use during pregnancy and child behavioral problems in five birth cohorts

NARCIS (Netherlands)

Birks, Laura; Guxens, Mònica; Papadopoulou, Eleni; Alexander, Jan; Ballester, Ferran; Estarlich, Marisa; Gallastegi, Mara; Ha, Mina; Haugen, Margaretha; Huss, Anke; Kheifets, Leeka; Lim, Hyungryul; Olsen, Jørn; Santa-Marina, Loreto; Sudan, Madhuri; Vermeulen, Roel; Vrijkotte, Tanja; Cardis, Elisabeth; Vrijheid, Martine

2017-01-01

Previous studies have reported associations between prenatal cell phone use and child behavioral problems, but findings have been inconsistent and based on retrospective assessment of cell phone use. This study aimed to assess this association in a multi-national analysis, using data from three
McKenzie Classification of Extremity Lesions - An audit of primary care patients in 3 clinics

DEFF Research Database (Denmark)

Melbye, Martin

2007-01-01

Syndrome classification based on mechanical testing guides clinical decision making in conservative musculoskeletal care. The aim of this audit was to investigate how many patients presenting with problems in the extremities could be classified into the mechanical syndromes described by Robin Mc...... ranged from 4,5 to 6 years. The mechanical classification determined by the therapists, and was recorded on the first three visits. Mechanical classification was based on strict operational definitions. Assessment sheets were collected from each therapist, to determine their adherence...... to the operational definitions. 135 consecutive patients were included over an 18 months period and 28 patients were excluded. Of the 107 patients with extremity joint problems, 73% were classified into one of McKenzie's mechanical syndromes by therapists trained in the McKenzie method. 34% of patients were...
Modified DCTNet for audio signals classification

Science.gov (United States)

Xian, Yin; Pu, Yunchen; Gan, Zhe; Lu, Liang; Thompson, Andrew

2016-10-01

In this paper, we investigate DCTNet for audio signal classification. Its output feature is related to Cohen's class of time-frequency distributions. We introduce the use of adaptive DCTNet (A-DCTNet) for audio signals feature extraction. The A-DCTNet applies the idea of constant-Q transform, with its center frequencies of filterbanks geometrically spaced. The A-DCTNet is adaptive to different acoustic scales, and it can better capture low frequency acoustic information that is sensitive to human audio perception than features such as Mel-frequency spectral coefficients (MFSC). We use features extracted by the A-DCTNet as input for classifiers. Experimental results show that the A-DCTNet and Recurrent Neural Networks (RNN) achieve state-of-the-art performance in bird song classification rate, and improve artist identification accuracy in music data. They demonstrate A-DCTNet's applicability to signal processing problems.
Comparing complete and partial classification for identifying customers at risk

NARCIS (Netherlands)

Bloemer, J.M.M.; Brijs, T.; Swinnen, S.P.; Vanhoof, K.

2003-01-01

This paper evaluates complete versus partial classification for the problem of identifying customers at risk. We define customers at risk as customers reporting overall satisfaction, but these customers also possess characteristics that are strongly associated with dissatisfied customers. This
78 FR 54970 - Cotton Futures Classification: Optional Classification Procedure

Science.gov (United States)

2013-09-09

... Service 7 CFR Part 27 [AMS-CN-13-0043] RIN 0581-AD33 Cotton Futures Classification: Optional Classification Procedure AGENCY: Agricultural Marketing Service, USDA. ACTION: Proposed rule. SUMMARY: The... optional cotton futures classification procedure--identified and known as ``registration'' by the U.S...
Multiview Discriminative Geometry Preserving Projection for Image Classification

Directory of Open Access Journals (Sweden)

Ziqiang Wang

2014-01-01

Full Text Available In many image classification applications, it is common to extract multiple visual features from different views to describe an image. Since different visual features have their own specific statistical properties and discriminative powers for image classification, the conventional solution for multiple view data is to concatenate these feature vectors as a new feature vector. However, this simple concatenation strategy not only ignores the complementary nature of different views, but also ends up with “curse of dimensionality.” To address this problem, we propose a novel multiview subspace learning algorithm in this paper, named multiview discriminative geometry preserving projection (MDGPP for feature extraction and classification. MDGPP can not only preserve the intraclass geometry and interclass discrimination information under a single view, but also explore the complementary property of different views to obtain a low-dimensional optimal consensus embedding by using an alternating-optimization-based iterative algorithm. Experimental results on face recognition and facial expression recognition demonstrate the effectiveness of the proposed algorithm.
Correlation of clinicopathologic features and lung squamous cell carcinoma subtypes according to the 2015 WHO classification.

Science.gov (United States)

Chen, Rongrong; Ding, Zhengping; Zhu, Lei; Lu, Shun; Yu, Yongfeng

2017-12-01

This study aimed to determine the relationship between clinicopathologic features and lung squamous cell carcinoma (LSCC) subtypes according to the 2015 WHO classification. We identified 824 operable LSCC patients undergoing a complete surgical resection at Shanghai Chest Hospital between April 2015 and January 2017. Immunohistochemistry was used to investigate the clinicopathologic features. Among them, the percentages of LSCC subtypes were 66.1% (545/824), 28.6% (236/824), and 5.2% (43/824) for keratinizing squamous cell carcinoma (KSCC), nonkeratinizing squamous cell carcinoma (NKSCC), and basaloid squamous cell carcinoma (BSCC), respectively. There were more males, more smokers, and more pneumonectomy surgeries in KSCC patients (p = 0.008, p = 0.000, p = 0.043). There were more N2 lymph node involvement and pathological stage III in NKSCC patients (p = 0.01, p = 0.03). BSCC did not demonstrate specificity to anything, but expressed adenocarcinoma markers more frequently. No significant difference existed between pathological subtypes and other clinicopathologic features, such as age, location type, visceral pleural involvement and lymphovascular invasion. The frequencies of EGFR sensitive mutations and ALK rearrangements were not significantly different among three subtypes. Significant relationships exist between some clinicopathologic features and LSCC subtypes. Copyright © 2017 Elsevier Ltd, BASO ~ The Association for Cancer Surgery, and the European Society of Surgical Oncology. All rights reserved.
Consensus embedding: theory, algorithms and application to segmentation and classification of biomedical data

Directory of Open Access Journals (Sweden)

Viswanath Satish

2012-02-01

Full Text Available Abstract Background Dimensionality reduction (DR enables the construction of a lower dimensional space (embedding from a higher dimensional feature space while preserving object-class discriminability. However several popular DR approaches suffer from sensitivity to choice of parameters and/or presence of noise in the data. In this paper, we present a novel DR technique known as consensus embedding that aims to overcome these problems by generating and combining multiple low-dimensional embeddings, hence exploiting the variance among them in a manner similar to ensemble classifier schemes such as Bagging. We demonstrate theoretical properties of consensus embedding which show that it will result in a single stable embedding solution that preserves information more accurately as compared to any individual embedding (generated via DR schemes such as Principal Component Analysis, Graph Embedding, or Locally Linear Embedding. Intelligent sub-sampling (via mean-shift and code parallelization are utilized to provide for an efficient implementation of the scheme. Results Applications of consensus embedding are shown in the context of classification and clustering as applied to: (1 image partitioning of white matter and gray matter on 10 different synthetic brain MRI images corrupted with 18 different combinations of noise and bias field inhomogeneity, (2 classification of 4 high-dimensional gene-expression datasets, (3 cancer detection (at a pixel-level on 16 image slices obtained from 2 different high-resolution prostate MRI datasets. In over 200 different experiments concerning classification and segmentation of biomedical data, consensus embedding was found to consistently outperform both linear and non-linear DR methods within all applications considered. Conclusions We have presented a novel framework termed consensus embedding which leverages ensemble classification theory within dimensionality reduction, allowing for application to a wide range
ETIOLOGY CLASSIFICATION AND TREATMENT NEEDS (TN FOR ORAL MALODOR

Directory of Open Access Journals (Sweden)

Anton Raharjo

2015-08-01

Full Text Available Background: Oral malodor, a generic descriptor term for foul smells emanating from the mouth can be classified as either pathological or physiological halitosis. Some problems are often confounded by the clinician's mismanagement. Objective: This paper reviews the etiology of classification and determination of treatment needs (TN for oral malodor. Literature review and discussion: In the majority of cases the problem has been shown to originate in the oral cavity. Although oral malodor cases are often related to physiological aspects, sometimes they can be related to extra oral sources and psychological aspects. Classification methods of oral malodor with corresponding treatment needs (TN have already been established. Although PTC & tongue brushing and appropriate mouthrinses are both important and basic treatment measures for halitosis, other dental treatments are sometimes required. Conclusion: Accurate screening and diagnosis of halitosis followed by appropriate TN may give better results and consequently reduce the risk of mismanagement.
PN solutions for the slowing-down and the cell calculation problems in plane geometry

International Nuclear Information System (INIS)

Caldeira, Alexandre David

1999-01-01

In this work P N solutions for the slowing-down and cell problems in slab geometry are developed. To highlight the main contributions of this development, one can mention: the new particular solution developed for the P N method applied to the slowing-down problem in the multigroup model, originating a new class of polynomials denominated Chandrasekhar generalized polynomials; the treatment of a specific situation, known as a degeneracy, arising from a particularity in the group constants and the first application of the P N method, for arbitrary N, in criticality calculations at the cell level reported in literature. (author)
Evolution of managerial problems from the perspective of management science

Directory of Open Access Journals (Sweden)

Marek Szarucki

2015-12-01

Full Text Available Managerial problems and the process of their solving play an important role both in the theory of management science and practice of organisations’ functioning. There is a gap in the literature related to the evolution of management problems in the context of the methodological approaches to solve them. The main goal of this paper was to analyse the evolution of the managerial problems from the perspective of management science and to present dominant methodological approaches for problem solving. Based on the extensive literature analysis in the discipline of management science, the evolution of the managerial problems was described with relation to the sixteen streams of management science. The author reviewed the selected classifications of the management theory as well as proposed his own perspective, which took into account managerial problems and their evolution over time. Moreover, there was presented an attempt to depict sources of management problems from the historical perspective within the methodological approaches of management science. Despite the broad view on management problems presented in this paper, such perspective gives a good ground for developing new more specific problem classifications, addressing different facets of managerial problems.

Multi-sparse dictionary colorization algorithm based on the feature classification and detail enhancement

Science.gov (United States)

Yan, Dan; Bai, Lianfa; Zhang, Yi; Han, Jing

2018-02-01

For the problems of missing details and performance of the colorization based on sparse representation, we propose a conceptual model framework for colorizing gray-scale images, and then a multi-sparse dictionary colorization algorithm based on the feature classification and detail enhancement (CEMDC) is proposed based on this framework. The algorithm can achieve a natural colorized effect for a gray-scale image, and it is consistent with the human vision. First, the algorithm establishes a multi-sparse dictionary classification colorization model. Then, to improve the accuracy rate of the classification, the corresponding local constraint algorithm is proposed. Finally, we propose a detail enhancement based on Laplacian Pyramid, which is effective in solving the problem of missing details and improving the speed of image colorization. In addition, the algorithm not only realizes the colorization of the visual gray-scale image, but also can be applied to the other areas, such as color transfer between color images, colorizing gray fusion images, and infrared images.
Supervised Self-Organizing Classification of Superresolution ISAR Images: An Anechoic Chamber Experiment

Directory of Open Access Journals (Sweden)

Radoi Emanuel

2006-01-01

Full Text Available The problem of the automatic classification of superresolution ISAR images is addressed in the paper. We describe an anechoic chamber experiment involving ten-scale-reduced aircraft models. The radar images of these targets are reconstructed using MUSIC-2D (multiple signal classification method coupled with two additional processing steps: phase unwrapping and symmetry enhancement. A feature vector is then proposed including Fourier descriptors and moment invariants, which are calculated from the target shape and the scattering center distribution extracted from each reconstructed image. The classification is finally performed by a new self-organizing neural network called SART (supervised ART, which is compared to two standard classifiers, MLP (multilayer perceptron and fuzzy KNN ( nearest neighbors. While the classification accuracy is similar, SART is shown to outperform the two other classifiers in terms of training speed and classification speed, especially for large databases. It is also easier to use since it does not require any input parameter related to its structure.
CLASSIFICATION OF CROPLANDS THROUGH FUSION OF OPTICAL AND SAR TIME SERIES DATA

Directory of Open Access Journals (Sweden)

S. Park

2016-06-01

Full Text Available Many satellite sensors including Landsat series have been extensively used for land cover classification. Studies have been conducted to mitigate classification problems associated with the use of single data (e.g., such as cloud contamination through multi-sensor data fusion and the use of time series data. This study investigated two areas with different environment and climate conditions: one in South Korea and the other in US. Cropland classification was conducted by using multi-temporal Landsat 5, Radarsat-1 and digital elevation models (DEM based on two machine learning approaches (i.e., random forest and support vector machines. Seven classification scenarios were examined and evaluated through accuracy assessment. Results show that SVM produced the best performance (overall accuracy of 93.87% when using all temporal and spectral data as input variables. Normalized Difference Water Index (NDWI, SAR backscattering, and Normalized Difference Vegetation Index (NDVI were identified as more contributing variables than the others for cropland classification.
Improved Quantum Particle Swarm Optimization for Mangroves Classification

Directory of Open Access Journals (Sweden)

Zhehuang Huang

2016-01-01

Full Text Available Quantum particle swarm optimization (QPSO is a population based optimization algorithm inspired by social behavior of bird flocking which combines the ideas of quantum computing. For many optimization problems, traditional QPSO algorithm can produce high-quality solution within a reasonable computation time and relatively stable convergence characteristics. But QPSO algorithm also showed some unsatisfactory issues in practical applications, such as premature convergence and poor ability in global optimization. To solve these problems, an improved quantum particle swarm optimization algorithm is proposed and implemented in this paper. There are three main works in this paper. Firstly, an improved QPSO algorithm is introduced which can enhance decision making ability of the model. Secondly, we introduce synergetic neural network model to mangroves classification for the first time which can better handle fuzzy matching of remote sensing image. Finally, the improved QPSO algorithm is used to realize the optimization of network parameter. The experiments on mangroves classification showed that the improved algorithm has more powerful global exploration ability and faster convergence speed.
Classification of refrigerants; Classification des fluides frigorigenes

Energy Technology Data Exchange (ETDEWEB)

NONE

2001-07-01

This document was made from the US standard ANSI/ASHRAE 34 published in 2001 and entitled 'designation and safety classification of refrigerants'. This classification allows to clearly organize in an international way the overall refrigerants used in the world thanks to a codification of the refrigerants in correspondence with their chemical composition. This note explains this codification: prefix, suffixes (hydrocarbons and derived fluids, azeotropic and non-azeotropic mixtures, various organic compounds, non-organic compounds), safety classification (toxicity, flammability, case of mixtures). (J.S.)
Failure diagnosis using deep belief learning based health state classification

International Nuclear Information System (INIS)

Tamilselvan, Prasanna; Wang, Pingfeng

2013-01-01

Effective health diagnosis provides multifarious benefits such as improved safety, improved reliability and reduced costs for operation and maintenance of complex engineered systems. This paper presents a novel multi-sensor health diagnosis method using deep belief network (DBN). DBN has recently become a popular approach in machine learning for its promised advantages such as fast inference and the ability to encode richer and higher order network structures. The DBN employs a hierarchical structure with multiple stacked restricted Boltzmann machines and works through a layer by layer successive learning process. The proposed multi-sensor health diagnosis methodology using DBN based state classification can be structured in three consecutive stages: first, defining health states and preprocessing sensory data for DBN training and testing; second, developing DBN based classification models for diagnosis of predefined health states; third, validating DBN classification models with testing sensory dataset. Health diagnosis using DBN based health state classification technique is compared with four existing diagnosis techniques. Benchmark classification problems and two engineering health diagnosis applications: aircraft engine health diagnosis and electric power transformer health diagnosis are employed to demonstrate the efficacy of the proposed approach
Deep learning for EEG-Based preference classification

Science.gov (United States)

Teo, Jason; Hou, Chew Lin; Mountstephens, James

2017-10-01

Electroencephalogram (EEG)-based emotion classification is rapidly becoming one of the most intensely studied areas of brain-computer interfacing (BCI). The ability to passively identify yet accurately correlate brainwaves with our immediate emotions opens up truly meaningful and previously unattainable human-computer interactions such as in forensic neuroscience, rehabilitative medicine, affective entertainment and neuro-marketing. One particularly useful yet rarely explored areas of EEG-based emotion classification is preference recognition [1], which is simply the detection of like versus dislike. Within the limited investigations into preference classification, all reported studies were based on musically-induced stimuli except for a single study which used 2D images. The main objective of this study is to apply deep learning, which has been shown to produce state-of-the-art results in diverse hard problems such as in computer vision, natural language processing and audio recognition, to 3D object preference classification over a larger group of test subjects. A cohort of 16 users was shown 60 bracelet-like objects as rotating visual stimuli on a computer display while their preferences and EEGs were recorded. After training a variety of machine learning approaches which included deep neural networks, we then attempted to classify the users' preferences for the 3D visual stimuli based on their EEGs. Here, we show that that deep learning outperforms a variety of other machine learning classifiers for this EEG-based preference classification task particularly in a highly challenging dataset with large inter- and intra-subject variability.
Multi-label literature classification based on the Gene Ontology graph

Directory of Open Access Journals (Sweden)

Lu Xinghua

2008-12-01

Full Text Available Abstract Background The Gene Ontology is a controlled vocabulary for representing knowledge related to genes and proteins in a computable form. The current effort of manually annotating proteins with the Gene Ontology is outpaced by the rate of accumulation of biomedical knowledge in literature, which urges the development of text mining approaches to facilitate the process by automatically extracting the Gene Ontology annotation from literature. The task is usually cast as a text classification problem, and contemporary methods are confronted with unbalanced training data and the difficulties associated with multi-label classification. Results In this research, we investigated the methods of enhancing automatic multi-label classification of biomedical literature by utilizing the structure of the Gene Ontology graph. We have studied three graph-based multi-label classification algorithms, including a novel stochastic algorithm and two top-down hierarchical classification methods for multi-label literature classification. We systematically evaluated and compared these graph-based classification algorithms to a conventional flat multi-label algorithm. The results indicate that, through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods can significantly improve predictions of the Gene Ontology terms implied by the analyzed text. Furthermore, the graph-based multi-label classifiers are capable of suggesting Gene Ontology annotations (to curators that are closely related to the true annotations even if they fail to predict the true ones directly. A software package implementing the studied algorithms is available for the research community. Conclusion Through utilizing the information from the structure of the Gene Ontology graph, the graph-based multi-label classification methods have better potential than the conventional flat multi-label classification approach to facilitate
Multi-view Multi-sparsity Kernel Reconstruction for Multi-class Image Classification

KAUST Repository

Zhu, Xiaofeng; Xie, Qing; Zhu, Yonghua; Liu, Xingyi; Zhang, Shichao

2015-01-01

This paper addresses the problem of multi-class image classification by proposing a novel multi-view multi-sparsity kernel reconstruction (MMKR for short) model. Given images (including test images and training images) representing with multiple
Classification of Pulse Waveforms Using Edit Distance with Real Penalty

Directory of Open Access Journals (Sweden)

Zhang Dongyu

2010-01-01

Full Text Available Abstract Advances in sensor and signal processing techniques have provided effective tools for quantitative research in traditional Chinese pulse diagnosis (TCPD. Because of the inevitable intraclass variation of pulse patterns, the automatic classification of pulse waveforms has remained a difficult problem. In this paper, by referring to the edit distance with real penalty (ERP and the recent progress in -nearest neighbors (KNN classifiers, we propose two novel ERP-based KNN classifiers. Taking advantage of the metric property of ERP, we first develop an ERP-induced inner product and a Gaussian ERP kernel, then embed them into difference-weighted KNN classifiers, and finally develop two novel classifiers for pulse waveform classification. The experimental results show that the proposed classifiers are effective for accurate classification of pulse waveform.
Collaborative classification of hyperspectral and visible images with convolutional neural network

Science.gov (United States)

Zhang, Mengmeng; Li, Wei; Du, Qian

2017-10-01

Recent advances in remote sensing technology have made multisensor data available for the same area, and it is well-known that remote sensing data processing and analysis often benefit from multisource data fusion. Specifically, low spatial resolution of hyperspectral imagery (HSI) degrades the quality of the subsequent classification task while using visible (VIS) images with high spatial resolution enables high-fidelity spatial analysis. A collaborative classification framework is proposed to fuse HSI and VIS images for finer classification. First, the convolutional neural network model is employed to extract deep spectral features for HSI classification. Second, effective binarized statistical image features are learned as contextual basis vectors for the high-resolution VIS image, followed by a classifier. The proposed approach employs diversified data in a decision fusion, leading to an integration of the rich spectral information, spatial information, and statistical representation information. In particular, the proposed approach eliminates the potential problems of the curse of dimensionality and excessive computation time. The experiments evaluated on two standard data sets demonstrate better classification performance offered by this framework.
Exploiting monotonicity constraints for active learning in ordinal classification

NARCIS (Netherlands)

Soons, Pieter; Feelders, Adrianus

2014-01-01

We consider ordinal classification and instance ranking problems where each attribute is known to have an increasing or decreasing relation with the class label or rank. For example, it stands to reason that the number of query terms occurring in a document has a positive influence on its relevance
A Classification Table for Achondrites

Science.gov (United States)

Chennaoui-Aoudjehane, H.; Larouci, N.; Jambon, A.; Mittlefehldt, D. W.

2014-01-01

Classifying chondrites is relatively easy and the criteria are well documented. It is based on mineral compositions, textural characteristics and more recently, magnetic susceptibility. It can be more difficult to classify achondrites, especially those that are very similar to terrestrial igneous rocks, because mineralogical, textural and compositional properties can be quite variable. Achondrites contain essentially olivine, pyroxenes, plagioclases, oxides, sulphides and accessory minerals. Their origin is attributed to differentiated parents bodies: large asteroids (Vesta); planets (Mars); a satellite (the Moon); and numerous asteroids of unknown size. In most cases, achondrites are not eye witnessed falls and some do not have fusion crust. Because of the mineralogical and magnetic susceptibility similarity with terrestrial igneous rocks for some achondrites, it can be difficult for classifiers to confirm their extra-terrestrial origin. We -as classifiers of meteorites- are confronted with this problem with every suspected achondrite we receive for identification. We are developing a "grid" of classification to provide an easier approach for initial classification. We use simple but reproducible criteria based on mineralogical, petrological and geochemical studies. We presented the classes: acapulcoites, lodranites, winonaites and Martian meteorites (shergottite, chassignites, nakhlites). In this work we are completing the classification table by including the groups: angrites, aubrites, brachinites, ureilites, HED (howardites, eucrites, and diogenites), lunar meteorites, pallasites and mesosiderites. Iron meteorites are not presented in this abstract.
Abnormal uterine bleeding: advantages of formal classification to patients, clinicians and researchers.

Science.gov (United States)

Madhra, Mayank; Fraser, Ian S; Munro, Malcolm G; Critchley, Hilary O D

2014-07-01

To highlight the advantages of formal classification of causes of abnormal uterine bleeding from a clinical and scientific perspective. Review and recommendations for local implementation. In the past, research in the field of menstrual disorders has not been funded adequately with respect to the impact of symptoms on individuals, healthcare systems and society. This was confounded by a diverse terminology, which lead to confusion between clinical and scientific groups, ultimately harming the underlying evidence base. To address this, a formal classification system (PALM-COEIN) for the causes of abnormal uterine bleeding has been published for worldwide use by FIGO (International Federation of Gynecology and Obstetrics). This commentary explains problems created by the prior absence of such a system, the potential advantages stemming from its use, and practical suggestions for local implementation. The PALM-COEIN classification is applicable globally and, as momentum gathers, will ameliorate recurrence of historic problems, and harmonise reporting of clinical and scientific research to facilitate future progress in women's health. © 2014 Nordic Federation of Societies of Obstetrics and Gynecology.
WHO/ISUP classification of the urothelial tumors of the urinary bladder

Directory of Open Access Journals (Sweden)

Zdenka Ovčak

2005-09-01

Full Text Available Background: The authors present the current classification of urothelial neoplasms of the urinary bladder. The classification of urothelial tumors of the urinary bladder of 1973 was despite some imperfection relatively successfuly used for more than thirty years. The three grade classification of papillary urothelial tumors without invasion has been based on evaluation of variations in architecture of covering epithelium and tumor cell anaplasia. As reccomended by the International Society of Urological Pathologists (ISUP, the World Health Organisation (WHO accepted the new WHO/ ISUP classification in 1998 that was revised in 2002 and finally published in 2004. With intention to avoid unnecessary diagnosis of cancer in patients having papillary urothelial tumors with rare invasive or metastastatic growth, this classification introduced a new entity, the papillary urothelial neoplasia of low malignant potential (PUNLMP. The additional change in classification was the division of invasive urothelial neoplasms only to low and high grade urothelial carcinomas.Conclusions: The authors’ opinion is that although the old classification is not recommended for use anymore the new one is not solving the elementary reproaches to previous classification such as terminological unsuitability and insufficient scientific reasoning. Our proposed solution in classification of papillary urothelial neoplasms would be the application of criteria analogous to that used in diagnostics of papillary noninvasive tumors of the head and neck or alimentary tract.
Analytical solution and experimental validation of the energy management problem for fuel cell hybrid vehicles

NARCIS (Netherlands)

P.P.J. van den Bosch; Edwin Tazelaar; M. Grimminck; Stijn Hoppenbrouwers; Bram Veenhuizen

2011-01-01

The objective of an energy management strategy for fuel cell hybrid propulsion systems is to minimize the fuel needed to provide the required power demand. This minimization is defined as an optimization problem. Methods such as dynamic programming numerically solve this optimization problem.
Multiloculated hydrocephalus: a review of current problems in classification and treatment

DEFF Research Database (Denmark)

Andresen, Morten; Juhler, Marianne

2012-01-01

PURPOSE: Loculated hydrocephalus is a condition in which discrete fluid-filled compartments form in or in relation to the ventricular system of the brain. Both uni- and multiloculated variants exist, with marked differences in outcome. However, several competing and seemingly interchangeable...... of Systematic Reviews, and the U.S. NIH ClinicalTrials.gov database was carried out with the search terms: "multicystic," "multiloculated," "multicompartment," "uniloculated," and "loculated." All were used in conjunction with the search term "hydrocephalus." RESULTS: A single study with a control group......, evidence is in favor of the neuroendoscopic approach. CONCLUSIONS: In order to ensure a consistent nomenclature as well as to guide future research, we propose a new system of classification for loculated hydrocephalus. It acknowledges the differences between uniloculated and multiloculated hydrocephalus...
Cancer classification using the Immunoscore: a worldwide task force.

Science.gov (United States)

Galon, Jérôme; Pagès, Franck; Marincola, Francesco M; Angell, Helen K; Thurin, Magdalena; Lugli, Alessandro; Zlobec, Inti; Berger, Anne; Bifulco, Carlo; Botti, Gerardo; Tatangelo, Fabiana; Britten, Cedrik M; Kreiter, Sebastian; Chouchane, Lotfi; Delrio, Paolo; Arndt, Hartmann; Asslaber, Martin; Maio, Michele; Masucci, Giuseppe V; Mihm, Martin; Vidal-Vanaclocha, Fernando; Allison, James P; Gnjatic, Sacha; Hakansson, Leif; Huber, Christoph; Singh-Jasuja, Harpreet; Ottensmeier, Christian; Zwierzina, Heinz; Laghi, Luigi; Grizzi, Fabio; Ohashi, Pamela S; Shaw, Patricia A; Clarke, Blaise A; Wouters, Bradly G; Kawakami, Yutaka; Hazama, Shoichi; Okuno, Kiyotaka; Wang, Ena; O'Donnell-Tormey, Jill; Lagorce, Christine; Pawelec, Graham; Nishimura, Michael I; Hawkins, Robert; Lapointe, Réjean; Lundqvist, Andreas; Khleif, Samir N; Ogino, Shuji; Gibbs, Peter; Waring, Paul; Sato, Noriyuki; Torigoe, Toshihiko; Itoh, Kyogo; Patel, Prabhu S; Shukla, Shilin N; Palmqvist, Richard; Nagtegaal, Iris D; Wang, Yili; D'Arrigo, Corrado; Kopetz, Scott; Sinicrope, Frank A; Trinchieri, Giorgio; Gajewski, Thomas F; Ascierto, Paolo A; Fox, Bernard A

2012-10-03

Prediction of clinical outcome in cancer is usually achieved by histopathological evaluation of tissue samples obtained during surgical resection of the primary tumor. Traditional tumor staging (AJCC/UICC-TNM classification) summarizes data on tumor burden (T), presence of cancer cells in draining and regional lymph nodes (N) and evidence for metastases (M). However, it is now recognized that clinical outcome can significantly vary among patients within the same stage. The current classification provides limited prognostic information, and does not predict response to therapy. Recent literature has alluded to the importance of the host immune system in controlling tumor progression. Thus, evidence supports the notion to include immunological biomarkers, implemented as a tool for the prediction of prognosis and response to therapy. Accumulating data, collected from large cohorts of human cancers, has demonstrated the impact of immune-classification, which has a prognostic value that may add to the significance of the AJCC/UICC TNM-classification. It is therefore imperative to begin to incorporate the 'Immunoscore' into traditional classification, thus providing an essential prognostic and potentially predictive tool. Introduction of this parameter as a biomarker to classify cancers, as part of routine diagnostic and prognostic assessment of tumors, will facilitate clinical decision-making including rational stratification of patient treatment. Equally, the inherent complexity of quantitative immunohistochemistry, in conjunction with protocol variation across laboratories, analysis of different immune cell types, inconsistent region selection criteria, and variable ways to quantify immune infiltration, all underline the urgent requirement to reach assay harmonization. In an effort to promote the Immunoscore in routine clinical settings, an international task force was initiated. This review represents a follow-up of the announcement of this initiative, and of the J
Cancer classification using the Immunoscore: a worldwide task force

Directory of Open Access Journals (Sweden)

Galon Jérôme

2012-10-01

Full Text Available Abstract Prediction of clinical outcome in cancer is usually achieved by histopathological evaluation of tissue samples obtained during surgical resection of the primary tumor. Traditional tumor staging (AJCC/UICC-TNM classification summarizes data on tumor burden (T, presence of cancer cells in draining and regional lymph nodes (N and evidence for metastases (M. However, it is now recognized that clinical outcome can significantly vary among patients within the same stage. The current classification provides limited prognostic information, and does not predict response to therapy. Recent literature has alluded to the importance of the host immune system in controlling tumor progression. Thus, evidence supports the notion to include immunological biomarkers, implemented as a tool for the prediction of prognosis and response to therapy. Accumulating data, collected from large cohorts of human cancers, has demonstrated the impact of immune-classification, which has a prognostic value that may add to the significance of the AJCC/UICC TNM-classification. It is therefore imperative to begin to incorporate the ‘Immunoscore’ into traditional classification, thus providing an essential prognostic and potentially predictive tool. Introduction of this parameter as a biomarker to classify cancers, as part of routine diagnostic and prognostic assessment of tumors, will facilitate clinical decision-making including rational stratification of patient treatment. Equally, the inherent complexity of quantitative immunohistochemistry, in conjunction with protocol variation across laboratories, analysis of different immune cell types, inconsistent region selection criteria, and variable ways to quantify immune infiltration, all underline the urgent requirement to reach assay harmonization. In an effort to promote the Immunoscore in routine clinical settings, an international task force was initiated. This review represents a follow-up of the announcement of
Guidelines to classification and nomenclature of Arabian felsic plutonic rocks

Science.gov (United States)

Ramsay, C.R.; Stoeser, D.B.; Drysdall, A.R.

1986-01-01

Well-defined procedures for classifying the felsic plutonic rocks of the Arabian Shield on the basis of petrographic, chemical and lithostratigraphic criteria and mineral-resource potential have been adopted and developed in the Saudi Arabian Deputy Ministry for Mineral Resources over the past decade. A number of problems with conventional classification schemes have been identified and resolved; others, notably those arising from difficulties in identifying precise mineral compositions, continue to present difficulties. The petrographic nomenclature used is essentially that recommended by the International Union of Geological Sciences. Problems that have arisen include the definition of: (1) rocks with sodic, zoned or perthitic feldspar, (2) trondhjemites, and (3) alkali granites. Chemical classification has been largely based on relative molar amounts of alumina, lime and alkalis, and the use of conventional variation diagrams, but pilot studies utilizing univariate and multivariate statistical techniques have been made. The classification used in Saudi Arabia for stratigraphic purposes is a hierarchy of formation-rank units, suites and super-suites as defined in the Saudi Arabian stratigraphic code. For genetic and petrological studies, a grouping as 'associations' of similar and genetically related lithologies is commonly used. In order to indicate mineral-resource potential, the felsic plutons are classed as common, precursor, specialized or mineralized, in order of increasing exploration significance. ?? 1986.

Classification of Near-Horizon Geometries of Extremal Black Holes

Directory of Open Access Journals (Sweden)

Hari K. Kunduri

2013-09-01

Full Text Available Any spacetime containing a degenerate Killing horizon, such as an extremal black hole, possesses a well-defined notion of a near-horizon geometry. We review such near-horizon geometry solutions in a variety of dimensions and theories in a unified manner. We discuss various general results including horizon topology and near-horizon symmetry enhancement. We also discuss the status of the classification of near-horizon geometries in theories ranging from vacuum gravity to Einstein–Maxwell theory and supergravity theories. Finally, we discuss applications to the classification of extremal black holes and various related topics. Several new results are presented and open problems are highlighted throughout.
Classification of Near-Horizon Geometries of Extremal Black Holes.

Science.gov (United States)

Kunduri, Hari K; Lucietti, James

2013-01-01

Any spacetime containing a degenerate Killing horizon, such as an extremal black hole, possesses a well-defined notion of a near-horizon geometry. We review such near-horizon geometry solutions in a variety of dimensions and theories in a unified manner. We discuss various general results including horizon topology and near-horizon symmetry enhancement. We also discuss the status of the classification of near-horizon geometries in theories ranging from vacuum gravity to Einstein-Maxwell theory and supergravity theories. Finally, we discuss applications to the classification of extremal black holes and various related topics. Several new results are presented and open problems are highlighted throughout.
Efficient Parallel Sorting for Migrating Birds Optimization When Solving Machine-Part Cell Formation Problems

Directory of Open Access Journals (Sweden)

Ricardo Soto

2016-01-01

Full Text Available The Machine-Part Cell Formation Problem (MPCFP is a NP-Hard optimization problem that consists in grouping machines and parts in a set of cells, so that each cell can operate independently and the intercell movements are minimized. This problem has largely been tackled in the literature by using different techniques ranging from classic methods such as linear programming to more modern nature-inspired metaheuristics. In this paper, we present an efficient parallel version of the Migrating Birds Optimization metaheuristic for solving the MPCFP. Migrating Birds Optimization is a population metaheuristic based on the V-Flight formation of the migrating birds, which is proven to be an effective formation in energy saving. This approach is enhanced by the smart incorporation of parallel procedures that notably improve performance of the several sorting processes performed by the metaheuristic. We perform computational experiments on 1080 benchmarks resulting from the combination of 90 well-known MPCFP instances with 12 sorting configurations with and without threads. We illustrate promising results where the proposal is able to reach the global optimum in all instances, while the solving time with respect to a nonparallel approach is notably reduced.
A Novel Algorithm for Imbalance Data Classification Based on Neighborhood Hypergraph

Directory of Open Access Journals (Sweden)

Feng Hu

2014-01-01

Full Text Available The classification problem for imbalance data is paid more attention to. So far, many significant methods are proposed and applied to many fields. But more efficient methods are needed still. Hypergraph may not be powerful enough to deal with the data in boundary region, although it is an efficient tool to knowledge discovery. In this paper, the neighborhood hypergraph is presented, combining rough set theory and hypergraph. After that, a novel classification algorithm for imbalance data based on neighborhood hypergraph is developed, which is composed of three steps: initialization of hyperedge, classification of training data set, and substitution of hyperedge. After conducting an experiment of 10-fold cross validation on 18 data sets, the proposed algorithm has higher average accuracy than others.
Effective Packet Number for 5G IM WeChat Application at Early Stage Traffic Classification

Directory of Open Access Journals (Sweden)

Muhammad Shafiq

2017-01-01

Full Text Available Accurate network traffic classification at early stage is very important for 5G network applications. During the last few years, researchers endeavored hard to propose effective machine learning model for classification of Internet traffic applications at early stage with few packets. Nevertheless, this essential problem still needs to be studied profoundly to find out effective packet number as well as effective machine learning (ML model. In this paper, we tried to solve the above-mentioned problem. For this purpose, five Internet traffic datasets are utilized. Initially, we extract packet size of 20 packets and then mutual information analysis is carried out to find out the mutual information of each packet on n flow type. Thereafter, we execute 10 well-known machine learning algorithms using crossover classification method. Two statistical analysis tests, Friedman and Wilcoxon pairwise tests, are applied for the experimental results. Moreover, we also apply the statistical tests for classifiers to find out effective ML classifier. Our experimental results show that 13–19 packets are the effective packet numbers for 5G IM WeChat application at early stage network traffic classification. We also find out effective ML classifier, where Random Forest ML classifier is effective classifier at early stage Internet traffic classification.
Joint learning and weighting of visual vocabulary for bag-of-feature based tissue classification

KAUST Repository

Wang, Jim Jing-Yan

2013-12-01

Automated classification of tissue types of Region of Interest (ROI) in medical images has been an important application in Computer-Aided Diagnosis (CAD). Recently, bag-of-feature methods which treat each ROI as a set of local features have shown their power in this field. Two important issues of bag-of-feature strategy for tissue classification are investigated in this paper: the visual vocabulary learning and weighting, which are always considered independently in traditional methods by neglecting the inner relationship between the visual words and their weights. To overcome this problem, we develop a novel algorithm, Joint-ViVo, which learns the vocabulary and visual word weights jointly. A unified objective function based on large margin is defined for learning of both visual vocabulary and visual word weights, and optimized alternately in the iterative algorithm. We test our algorithm on three tissue classification tasks: classifying breast tissue density in mammograms, classifying lung tissue in High-Resolution Computed Tomography (HRCT) images, and identifying brain tissue type in Magnetic Resonance Imaging (MRI). The results show that Joint-ViVo outperforms the state-of-art methods on tissue classification problems. © 2013 Elsevier Ltd.
Exploring diversity in ensemble classification: Applications in large area land cover mapping

Science.gov (United States)

Mellor, Andrew; Boukir, Samia

2017-07-01

Ensemble classifiers, such as random forests, are now commonly applied in the field of remote sensing, and have been shown to perform better than single classifier systems, resulting in reduced generalisation error. Diversity across the members of ensemble classifiers is known to have a strong influence on classification performance - whereby classifier errors are uncorrelated and more uniformly distributed across ensemble members. The relationship between ensemble diversity and classification performance has not yet been fully explored in the fields of information science and machine learning and has never been examined in the field of remote sensing. This study is a novel exploration of ensemble diversity and its link to classification performance, applied to a multi-class canopy cover classification problem using random forests and multisource remote sensing and ancillary GIS data, across seven million hectares of diverse dry-sclerophyll dominated public forests in Victoria Australia. A particular emphasis is placed on analysing the relationship between ensemble diversity and ensemble margin - two key concepts in ensemble learning. The main novelty of our work is on boosting diversity by emphasizing the contribution of lower margin instances used in the learning process. Exploring the influence of tree pruning on diversity is also a new empirical analysis that contributes to a better understanding of ensemble performance. Results reveal insights into the trade-off between ensemble classification accuracy and diversity, and through the ensemble margin, demonstrate how inducing diversity by targeting lower margin training samples is a means of achieving better classifier performance for more difficult or rarer classes and reducing information redundancy in classification problems. Our findings inform strategies for collecting training data and designing and parameterising ensemble classifiers, such as random forests. This is particularly important in large area
Classification of hydrocephalus: critical analysis of classification categories and advantages of "Multi-categorical Hydrocephalus Classification" (Mc HC).

Science.gov (United States)

Oi, Shizuo

2011-10-01

Hydrocephalus is a complex pathophysiology with disturbed cerebrospinal fluid (CSF) circulation. There are numerous numbers of classification trials published focusing on various criteria, such as associated anomalies/underlying lesions, CSF circulation/intracranial pressure patterns, clinical features, and other categories. However, no definitive classification exists comprehensively to cover the variety of these aspects. The new classification of hydrocephalus, "Multi-categorical Hydrocephalus Classification" (Mc HC), was invented and developed to cover the entire aspects of hydrocephalus with all considerable classification items and categories. Ten categories include "Mc HC" category I: onset (age, phase), II: cause, III: underlying lesion, IV: symptomatology, V: pathophysiology 1-CSF circulation, VI: pathophysiology 2-ICP dynamics, VII: chronology, VII: post-shunt, VIII: post-endoscopic third ventriculostomy, and X: others. From a 100-year search of publication related to the classification of hydrocephalus, 14 representative publications were reviewed and divided into the 10 categories. The Baumkuchen classification graph made from the round o'clock classification demonstrated the historical tendency of deviation to the categories in pathophysiology, either CSF or ICP dynamics. In the preliminary clinical application, it was concluded that "Mc HC" is extremely effective in expressing the individual state with various categories in the past and present condition or among the compatible cases of hydrocephalus along with the possible chronological change in the future.
Pathological classification of human iPSC-derived neural stem/progenitor cells towards safety assessment of transplantation therapy for CNS diseases.

Science.gov (United States)

Sugai, Keiko; Fukuzawa, Ryuji; Shofuda, Tomoko; Fukusumi, Hayato; Kawabata, Soya; Nishiyama, Yuichiro; Higuchi, Yuichiro; Kawai, Kenji; Isoda, Miho; Kanematsu, Daisuke; Hashimoto-Tamaoki, Tomoko; Kohyama, Jun; Iwanami, Akio; Suemizu, Hiroshi; Ikeda, Eiji; Matsumoto, Morio; Kanemura, Yonehiro; Nakamura, Masaya; Okano, Hideyuki

2016-09-19

The risk of tumorigenicity is a hurdle for regenerative medicine using induced pluripotent stem cells (iPSCs). Although teratoma formation is readily distinguishable, the malignant transformation of iPSC derivatives has not been clearly defined due to insufficient analysis of histology and phenotype. In the present study, we evaluated the histology of neural stem/progenitor cells (NSPCs) generated from integration-free human peripheral blood mononuclear cell (PBMC)-derived iPSCs (iPSC-NSPCs) following transplantation into central nervous system (CNS) of immunodeficient mice. We found that transplanted iPSC-NSPCs produced differentiation patterns resembling those in embryonic CNS development, and that the microenvironment of the final site of migration affected their maturational stage. Genomic instability of iPSCs correlated with increased proliferation of transplants, although no carcinogenesis was evident. The histological classifications presented here may provide cues for addressing potential safety issues confronting regenerative medicine involving iPSCs.
The 2016 WHO Classification of Tumours of the Urinary System and Male Genital Organs-Part A: Renal, Penile, and Testicular Tumours.

Science.gov (United States)

Moch, Holger; Cubilla, Antonio L; Humphrey, Peter A; Reuter, Victor E; Ulbright, Thomas M

2016-07-01

The fourth edition of the World Health Organization (WHO) classification of urogenital tumours (WHO "blue book"), published in 2016, contains significant revisions. These revisions were performed after consideration by a large international group of pathologists with special expertise in this area. A subgroup of these persons met at the WHO Consensus Conference in Zurich, Switzerland, in 2015 to finalize the revisions. This review summarizes the most significant differences between the newly published classification and the prior version for renal, penile, and testicular tumours. Newly recognized epithelial renal tumours are hereditary leiomyomatosis and renal cell carcinoma (RCC) syndrome-associated RCC, succinate dehydrogenase-deficient RCC, tubulocystic RCC, acquired cystic disease-associated RCC, and clear cell papillary RCC. The WHO/International Society of Urological Pathology renal tumour grading system was recommended, and the definition of renal papillary adenoma was modified. The new WHO classification of penile squamous cell carcinomas is based on the presence of human papillomavirus and defines histologic subtypes accordingly. Germ cell neoplasia in situ (GCNIS) of the testis is the WHO-recommended term for precursor lesions of invasive germ cell tumours, and testicular germ cell tumours are now separated into two fundamentally different groups: those derived from GCNIS and those unrelated to GCNIS. Spermatocytic seminoma has been designated as a spermatocytic tumour and placed within the group of non-GCNIS-related tumours in the 2016 WHO classification. The 2016 World Health Organization (WHO) classification contains new renal tumour entities. The classification of penile squamous cell carcinomas is based on the presence of human papillomavirus. Germ cell neoplasia in situ of the testis is the WHO-recommended term for precursor lesions of invasive germ cell tumours. Copyright © 2016 European Association of Urology. Published by Elsevier B.V. All
The integration of marketing problem-solving modes and marketing management support systems

NARCIS (Netherlands)

B. Wierenga (Berend); G.H. van Bruggen (Gerrit)

1997-01-01

textabstractFocuses on the issue of problem solving in marketing and develops a classification of marketing problem-solving modes (MPSMs). Typology of MPSMs; Relationship among MPSMs; Marketing management support systems.
Nonlinear Inertia Classification Model and Application

Directory of Open Access Journals (Sweden)

Mei Wang

2014-01-01

Full Text Available Classification model of support vector machine (SVM overcomes the problem of a big number of samples. But the kernel parameter and the punishment factor have great influence on the quality of SVM model. Particle swarm optimization (PSO is an evolutionary search algorithm based on the swarm intelligence, which is suitable for parameter optimization. Accordingly, a nonlinear inertia convergence classification model (NICCM is proposed after the nonlinear inertia convergence (NICPSO is developed in this paper. The velocity of NICPSO is firstly defined as the weighted velocity of the inertia PSO, and the inertia factor is selected to be a nonlinear function. NICPSO is used to optimize the kernel parameter and a punishment factor of SVM. Then, NICCM classifier is trained by using the optical punishment factor and the optical kernel parameter that comes from the optimal particle. Finally, NICCM is applied to the classification of the normal state and fault states of online power cable. It is experimentally proved that the iteration number for the proposed NICPSO to reach the optimal position decreases from 15 to 5 compared with PSO; the training duration is decreased by 0.0052 s and the recognition precision is increased by 4.12% compared with SVM.
Classification

Science.gov (United States)

Clary, Renee; Wandersee, James

2013-01-01

In this article, Renee Clary and James Wandersee describe the beginnings of "Classification," which lies at the very heart of science and depends upon pattern recognition. Clary and Wandersee approach patterns by first telling the story of the "Linnaean classification system," introduced by Carl Linnacus (1707-1778), who is…
Similarity-dissimilarity plot for visualization of high dimensional data in biomedical pattern classification.

Science.gov (United States)

Arif, Muhammad

2012-06-01

In pattern classification problems, feature extraction is an important step. Quality of features in discriminating different classes plays an important role in pattern classification problems. In real life, pattern classification may require high dimensional feature space and it is impossible to visualize the feature space if the dimension of feature space is greater than four. In this paper, we have proposed a Similarity-Dissimilarity plot which can project high dimensional space to a two dimensional space while retaining important characteristics required to assess the discrimination quality of the features. Similarity-dissimilarity plot can reveal information about the amount of overlap of features of different classes. Separable data points of different classes will also be visible on the plot which can be classified correctly using appropriate classifier. Hence, approximate classification accuracy can be predicted. Moreover, it is possible to know about whom class the misclassified data points will be confused by the classifier. Outlier data points can also be located on the similarity-dissimilarity plot. Various examples of synthetic data are used to highlight important characteristics of the proposed plot. Some real life examples from biomedical data are also used for the analysis. The proposed plot is independent of number of dimensions of the feature space.
Molecular Classification of Lobular Carcinoma of the Breast

Science.gov (United States)

Fu, Denggang; Zuo, Qi; Huang, Qi; Su, Li; Ring, Huijun Z.; Ring, Brian Z.

2017-01-01

The morphology of breast tumors is complicated and diagnosis can be difficult. We present here a novel diagnostic model which we validate on both array-based and RNA sequencing platforms which reliably distinguishes this tumor type across multiple cohorts. We also examine how this molecular classification predicts sensitivity to common chemotherapeutics in cell-line based assays. A total of 1845 invasive breast cancer cases in six cohorts were collected, split into discovery and validation cohorts, and a classifier was created and compared to pathological diagnosis, grade and survival. In the validation cohorts the concordance of predicted diagnosis with a pathological diagnosis was 92%, and 97% when inconclusively classified cases were excluded. Tumor-derived cell lines were classified with the model as having predominantly ductal or lobular-like molecular physiologies, and sensitivity of these lines to relevant compounds was analyzed. A diagnostic tool can be created that reliably distinguishes lobular from ductal carcinoma and allows the classification of cell lines on the basis of molecular profiles associated with these tumor types. This tool may assist in improved diagnosis and aid in explorations of the response of lobular type breast tumor models to different compounds. PMID:28303886
Class Association Rule Pada Metode Associative Classification

Directory of Open Access Journals (Sweden)

Eka Karyawati

2011-11-01

Full Text Available Frequent patterns (itemsets discovery is an important problem in associative classification rule mining. Differents approaches have been proposed such as the Apriori-like, Frequent Pattern (FP-growth, and Transaction Data Location (Tid-list Intersection algorithm. This paper focuses on surveying and comparing the state of the art associative classification techniques with regards to the rule generation phase of associative classification algorithms. This phase includes frequent itemsets discovery and rules mining/extracting methods to generate the set of class association rules (CARs. There are some techniques proposed to improve the rule generation method. A technique by utilizing the concepts of discriminative power of itemsets can reduce the size of frequent itemset. It can prune the useless frequent itemsets. The closed frequent itemset concept can be utilized to compress the rules to be compact rules. This technique may reduce the size of generated rules. Other technique is in determining the support threshold value of the itemset. Specifying not single but multiple support threshold values with regard to the class label frequencies can give more appropriate support threshold value. This technique may generate more accurate rules. Alternative technique to generate rule is utilizing the vertical layout to represent dataset. This method is very effective because it only needs one scan over dataset, compare with other techniques that need multiple scan over dataset. However, one problem with these approaches is that the initial set of tid-lists may be too large to fit into main memory. It requires more sophisticated techniques to compress the tid-lists.
Astrophysical Information from Objective Prism Digitized Images: Classification with an Artificial Neural Network

Directory of Open Access Journals (Sweden)

Bratsolis Emmanuel

2005-01-01

Full Text Available Stellar spectral classification is not only a tool for labeling individual stars but is also useful in studies of stellar population synthesis. Extracting the physical quantities from the digitized spectral plates involves three main stages: detection, extraction, and classification of spectra. Low-dispersion objective prism images have been used and automated methods have been developed. The detection and extraction problems have been presented in previous works. In this paper, we present a classification method based on an artificial neural network (ANN. We make a brief presentation of the entire automated system and we compare the new classification method with the previously used method of maximum correlation coefficient (MCC. Digitized photographic material has been used here. The method can also be used on CCD spectral images.
The diagnosis and management of pre-invasive breast disease: Pathological diagnosis – problems with existing classifications

International Nuclear Information System (INIS)

Van de Vijver, Marc J; Peterse, Hans

2003-01-01

In this review, we comment on the reasons for disagreement in the concepts, diagnosis and classifications of pre-invasive intraductal proliferations. In view of these disagreements, our proposal is to distinguish epithelial hyperplasia, lobular carcinoma in situ and ductal carcinoma in situ, and to abandon the use of poorly reproducible categories, such as atypical ductal hyperplasia or ductal intraepithelial neoplasia, followed by a number to indicate the degree of proliferation and atypia, as these are not practical for clinical decision making, nor for studies aimed at improving the understanding of breast cancer development. If there is doubt about the classification of an intraductal proliferation, a differential diagnosis and the reason for and degree of uncertainty should be given, rather than categorizing a proliferation as atypical
Minimisation de fonctions de perte calibrée pour la classification des images

OpenAIRE

Bel Haj Ali , Wafa

2013-01-01

Image classification becomes a big challenge since it concerns on the one hand millions or billions of images that are available on the web and on the other hand images used for critical real-time applications. This classification involves in general learning methods and classifiers that must require both precision as well as speed performance. These learning problems concern a large number of application areas: namely, web applications (profiling, targeting, social networks, search engines),...
A Novel Vehicle Classification Using Embedded Strain Gauge Sensors

Directory of Open Access Journals (Sweden)

Qi Wang

2008-11-01

Full Text Available Abstract: This paper presents a new vehicle classification and develops a traffic monitoring detector to provide reliable vehicle classification to aid traffic management systems. The basic principle of this approach is based on measuring the dynamic strain caused by vehicles across pavement to obtain the corresponding vehicle parameters Ã¢Â€Â“ wheelbase and number of axles Ã¢Â€Â“ to then accurately classify the vehicle. A system prototype with five embedded strain sensors was developed to validate the accuracy and effectiveness of the classification method. According to the special arrangement of the sensors and the different time a vehicle arrived at the sensors one can estimate the vehicleÃ¢Â€Â™s speed accurately, corresponding to the estimated vehicle wheelbase and number of axles. Because of measurement errors and vehicle characteristics, there is a lot of overlap between vehicle wheelbase patterns. Therefore, directly setting up a fixed threshold for vehicle classification often leads to low-accuracy results. Using the machine learning pattern recognition method to deal with this problem is believed as one of the most effective tools. In this study, support vector machines (SVMs were used to integrate the classification features extracted from the strain sensors to automatically classify vehicles into five types, ranging from small vehicles to combination trucks, along the lines of the Federal Highway Administration vehicle classification guide. Test bench and field experiments will be introduced in this paper. Two support vector machines classification algorithms (one-against-all, one-against-one are used to classify single sensor data and multiple sensor combination data. Comparison of the two classification method results shows that the classification accuracy is very close using single data or multiple data. Our results indicate that using multiclass SVM-based fusion multiple sensor data significantly improves

A genetic algorithm for a bi-objective mathematical model for dynamic virtual cell formation problem

Science.gov (United States)

Moradgholi, Mostafa; Paydar, Mohammad Mahdi; Mahdavi, Iraj; Jouzdani, Javid

2016-09-01

Nowadays, with the increasing pressure of the competitive business environment and demand for diverse products, manufacturers are force to seek for solutions that reduce production costs and rise product quality. Cellular manufacturing system (CMS), as a means to this end, has been a point of attraction to both researchers and practitioners. Limitations of cell formation problem (CFP), as one of important topics in CMS, have led to the introduction of virtual CMS (VCMS). This research addresses a bi-objective dynamic virtual cell formation problem (DVCFP) with the objective of finding the optimal formation of cells, considering the material handling costs, fixed machine installation costs and variable production costs of machines and workforce. Furthermore, we consider different skills on different machines in workforce assignment in a multi-period planning horizon. The bi-objective model is transformed to a single-objective fuzzy goal programming model and to show its performance; numerical examples are solved using the LINGO software. In addition, genetic algorithm (GA) is customized to tackle large-scale instances of the problems to show the performance of the solution method.
An Outlyingness Matrix for Multivariate Functional Data Classification

KAUST Repository

Dai, Wenlin

2017-08-25

The classification of multivariate functional data is an important task in scientific research. Unlike point-wise data, functional data are usually classified by their shapes rather than by their scales. We define an outlyingness matrix by extending directional outlyingness, an effective measure of the shape variation of curves that combines the direction of outlyingness with conventional statistical depth. We propose two classifiers based on directional outlyingness and the outlyingness matrix, respectively. Our classifiers provide better performance compared with existing depth-based classifiers when applied on both univariate and multivariate functional data from simulation studies. We also test our methods on two data problems: speech recognition and gesture classification, and obtain results that are consistent with the findings from the simulated data.
Development and test of a classification scheme for human factors in incident reports

International Nuclear Information System (INIS)

Miller, R.; Freitag, M.; Wilpert, B.

1997-01-01

The Research Center System Safety of the Berlin University of Technology conducted a research project on the analysis of Human Factors (HF) aspects in incident reported by German Nuclear Power Plants. Based on psychological theories and empirical studies a classification scheme was developed which permits the identification of human involvement in incidents. The classification scheme was applied in an epidemiological study to a selection of more than 600 HF - relevant incidents. The results allow insights into HF related problem areas. An additional study proved that the application of the classification scheme produces results which are reliable and independent from raters. (author). 13 refs, 1 fig
Improving imbalanced scientific text classification using sampling strategies and dictionaries

Directory of Open Access Journals (Sweden)

Borrajo L.

2011-12-01

Full Text Available Many real applications have the imbalanced class distribution problem, where one of the classes is represented by a very small number of cases compared to the other classes. One of the systems affected are those related to the recovery and classification of scientific documentation.
A combined reconstruction-classification method for diffuse optical tomography

Energy Technology Data Exchange (ETDEWEB)

Hiltunen, P [Department of Biomedical Engineering and Computational Science, Helsinki University of Technology, PO Box 3310, FI-02015 TKK (Finland); Prince, S J D; Arridge, S [Department of Computer Science, University College London, Gower Street London, WC1E 6B (United Kingdom)], E-mail: petri.hiltunen@tkk.fi, E-mail: s.prince@cs.ucl.ac.uk, E-mail: s.arridge@cs.ucl.ac.uk

2009-11-07

We present a combined classification and reconstruction algorithm for diffuse optical tomography (DOT). DOT is a nonlinear ill-posed inverse problem. Therefore, some regularization is needed. We present a mixture of Gaussians prior, which regularizes the DOT reconstruction step. During each iteration, the parameters of a mixture model are estimated. These associate each reconstructed pixel with one of several classes based on the current estimate of the optical parameters. This classification is exploited to form a new prior distribution to regularize the reconstruction step and update the optical parameters. The algorithm can be described as an iteration between an optimization scheme with zeroth-order variable mean and variance Tikhonov regularization and an expectation-maximization scheme for estimation of the model parameters. We describe the algorithm in a general Bayesian framework. Results from simulated test cases and phantom measurements show that the algorithm enhances the contrast of the reconstructed images with good spatial accuracy. The probabilistic classifications of each image contain only a few misclassified pixels.
Rational kernels for Arabic Root Extraction and Text Classification

Directory of Open Access Journals (Sweden)

Attia Nehar

2016-04-01

Full Text Available In this paper, we address the problems of Arabic Text Classification and root extraction using transducers and rational kernels. We introduce a new root extraction approach on the basis of the use of Arabic patterns (Pattern Based Stemmer. Transducers are used to model these patterns and root extraction is done without relying on any dictionary. Using transducers for extracting roots, documents are transformed into finite state transducers. This document representation allows us to use and explore rational kernels as a framework for Arabic Text Classification. Root extraction experiments are conducted on three word collections and yield 75.6% of accuracy. Classification experiments are done on the Saudi Press Agency dataset and N-gram kernels are tested with different values of N. Accuracy and F1 report 90.79% and 62.93% respectively. These results show that our approach, when compared with other approaches, is promising specially in terms of accuracy and F1.
Automated Classification of Asteroids into Families at Work

Science.gov (United States)

Knežević, Zoran; Milani, Andrea; Cellino, Alberto; Novaković, Bojan; Spoto, Federica; Paolicchi, Paolo

2014-07-01

We have recently proposed a new approach to the asteroid family classification by combining the classical HCM method with an automated procedure to add newly discovered members to existing families. This approach is specifically intended to cope with ever increasing asteroid data sets, and consists of several steps to segment the problem and handle the very large amount of data in an efficient and accurate manner. We briefly present all these steps and show the results from three subsequent updates making use of only the automated step of attributing the newly numbered asteroids to the known families. We describe the changes of the individual families membership, as well as the evolution of the classification due to the newly added intersections between the families, resolved candidate family mergers, and emergence of the new candidates for the mergers. We thus demonstrate how by the new approach the asteroid family classification becomes stable in general terms (converging towards a permanent list of confirmed families), and in the same time evolving in details (to account for the newly discovered asteroids) at each update.
Classification of JET Neutron and Gamma Emissivity Profiles

Science.gov (United States)

Craciunescu, T.; Murari, A.; Kiptily, V.; Vega, J.; Contributors, JET

2016-05-01

In thermonuclear plasmas, emission tomography uses integrated measurements along lines of sight (LOS) to determine the two-dimensional (2-D) spatial distribution of the volume emission intensity. Due to the availability of only a limited number views and to the coarse sampling of the LOS, the tomographic inversion is a limited data set problem. Several techniques have been developed for tomographic reconstruction of the 2-D gamma and neutron emissivity on JET. In specific experimental conditions the availability of LOSs is restricted to a single view. In this case an explicit reconstruction of the emissivity profile is no longer possible. However, machine learning classification methods can be used in order to derive the type of the distribution. In the present approach the classification is developed using the theory of belief functions which provide the support to fuse the results of independent clustering and supervised classification. The method allows to represent the uncertainty of the results provided by different independent techniques, to combine them and to manage possible conflicts.
Classification of uranium reserves/resources

International Nuclear Information System (INIS)

1998-08-01

Projections of future availability of uranium to meet present and future nuclear power requirements depend on the reliability of uranium resource estimates. Lack of harmony of the definition of the different classes of uranium reserves and resources between countries makes the compilation and analysis of such information difficult. The problem was accentuated in the early 1990s with the entry of uranium producing countries from the former Soviet Union, eastern Europe and China into the world uranium supply market. The need for an internationally acceptable reserve/resource classification system and terminology using market based criteria is therefore obvious. This publication was compiled from participant's contributions and findings of the Consultants Meetings on Harmonization of Uranium Resource Assessment Concepts held in Vienna from 22 to 25 June 1992, and two Consultants Meetings on the Development of a More Meaningful Classification of Uranium Resources held in Kiev, Ukraine on 24-26 April 1995 and 20-23 August 1996. This document includes 11 contributions, summary, list of participants of the Consultants Meetings. Each contribution has been indexed and provided with an abstract
Radioactive facilities classification criteria

International Nuclear Information System (INIS)

Briso C, H.A.; Riesle W, J.

1992-01-01

Appropriate classification of radioactive facilities into groups of comparable risk constitutes one of the problems faced by most Regulatory Bodies. Regarding the radiological risk, the main facts to be considered are the radioactive inventory and the processes to which these radionuclides are subjected. Normally, operations are ruled by strict safety procedures. Thus, the total activity of the radionuclides existing in a given facility is the varying feature that defines its risk. In order to rely on a quantitative criterion and, considering that the Annual Limits of Intake are widely accepted references, an index based on these limits, to support decisions related to radioactive facilities, is proposed. (author)
Current Trends in the Molecular Classification of Renal Neoplasms

Directory of Open Access Journals (Sweden)

Andrew N. Young

2006-01-01

Full Text Available Renal cell carcinoma (RCC is the most common form of kidney cancer in adults. RCC is a significant challenge for pathologic diagnosis and clinical management. The primary approach to diagnosis is by light microscopy, using the World Health Organization (WHO classification system, which defines histopathologic tumor subtypes with distinct clinical behavior and underlying genetic mutations. However, light microscopic diagnosis of RCC subtypes is often difficult due to variable histology. In addition, the clinical behavior of RCC is highly variable and therapeutic response rates are poor. Few clinical assays are available to predict outcome in RCC or correlate behavior with histology. Therefore, novel RCC classification systems based on gene expression should be useful for diagnosis, prognosis, and treatment. Recent microarray studies have shown that renal tumors are characterized by distinct gene expression profiles, which can be used to discover novel diagnostic and prognostic biomarkers. Here, we review clinical features of kidney cancer, the WHO classification system, and the growing role of molecular classification for diagnosis, prognosis, and therapy of this disease.
An Empirical Overview of the No Free Lunch Theorem and Its Effect on Real-World Machine Learning Classification.

Science.gov (United States)

Gómez, David; Rojas, Alfonso

2016-01-01

A sizable amount of research has been done to improve the mechanisms for knowledge extraction such as machine learning classification or regression. Quite unintuitively, the no free lunch (NFL) theorem states that all optimization problem strategies perform equally well when averaged over all possible problems. This fact seems to clash with the effort put forth toward better algorithms. This letter explores empirically the effect of the NFL theorem on some popular machine learning classification techniques over real-world data sets.
Hand eczema classification

DEFF Research Database (Denmark)

Diepgen, T L; Andersen, Klaus Ejner; Brandao, F M

2008-01-01

of the disease is rarely evidence based, and a classification system for different subdiagnoses of hand eczema is not agreed upon. Randomized controlled trials investigating the treatment of hand eczema are called for. For this, as well as for clinical purposes, a generally accepted classification system...... A classification system for hand eczema is proposed. Conclusions It is suggested that this classification be used in clinical work and in clinical trials....
A Two-Stream Deep Fusion Framework for High-Resolution Aerial Scene Classification

Directory of Open Access Journals (Sweden)

Yunlong Yu

2018-01-01

Full Text Available One of the challenging problems in understanding high-resolution remote sensing images is aerial scene classification. A well-designed feature representation method and classifier can improve classification accuracy. In this paper, we construct a new two-stream deep architecture for aerial scene classification. First, we use two pretrained convolutional neural networks (CNNs as feature extractor to learn deep features from the original aerial image and the processed aerial image through saliency detection, respectively. Second, two feature fusion strategies are adopted to fuse the two different types of deep convolutional features extracted by the original RGB stream and the saliency stream. Finally, we use the extreme learning machine (ELM classifier for final classification with the fused features. The effectiveness of the proposed architecture is tested on four challenging datasets: UC-Merced dataset with 21 scene categories, WHU-RS dataset with 19 scene categories, AID dataset with 30 scene categories, and NWPU-RESISC45 dataset with 45 challenging scene categories. The experimental results demonstrate that our architecture gets a significant classification accuracy improvement over all state-of-the-art references.
Classification of the web

DEFF Research Database (Denmark)

Mai, Jens Erik

2004-01-01

This paper discusses the challenges faced by investigations into the classification of the Web and outlines inquiries that are needed to use principles for bibliographic classification to construct classifications of the Web. This paper suggests that the classification of the Web meets challenges...... that call for inquiries into the theoretical foundation of bibliographic classification theory....
Security classification of information

Energy Technology Data Exchange (ETDEWEB)

Quist, A.S.

1993-04-01

This document is the second of a planned four-volume work that comprehensively discusses the security classification of information. The main focus of Volume 2 is on the principles for classification of information. Included herein are descriptions of the two major types of information that governments classify for national security reasons (subjective and objective information), guidance to use when determining whether information under consideration for classification is controlled by the government (a necessary requirement for classification to be effective), information disclosure risks and benefits (the benefits and costs of classification), standards to use when balancing information disclosure risks and benefits, guidance for assigning classification levels (Top Secret, Secret, or Confidential) to classified information, guidance for determining how long information should be classified (classification duration), classification of associations of information, classification of compilations of information, and principles for declassifying and downgrading information. Rules or principles of certain areas of our legal system (e.g., trade secret law) are sometimes mentioned to .provide added support to some of those classification principles.
[The importance of classifications in psychiatry].

Science.gov (United States)

Lempérière, T

1995-12-01

The classifications currently used in psychiatry have different aims: to facilitate communication between researchers and clinicians at national and international levels through the use of a common language, or at least a clearly and precisely defined nomenclature; to provide a nosographical reference system which can be used in practice (diagnosis, prognosis, treatment); to optimize research by ensuring that sample cases are as homogeneous as possible; to facilitate statistical records for public health institutions. A classification is of practical interest only if it is reliable, valid and acceptable to all potential users. In recent decades, there has been a considerable systematic and coordinated effort to improve the methodological approach to classification and categorization in the field of psychiatry, including attempts to create operational definitions, field trials of inter-assessor reliability, attempts to validate the selected nosological categories by analysis of correlation between progression, treatment response, family history and additional examinations. The introduction of glossaries, and particularly of diagnostic criteria, marked a decisive step in this new approach. The key problem remains that of the validity of diagnostic criteria. Ideally, these should be based on demonstrable etiologic or pathogenic data, but such information is rarely available in psychiatry. Current classifications rely on the use of extremely diverse elements in differing degrees: descriptive criteria, evolutive criteria, etiopathogenic criteria, psychopathogenic criteria, etc. Certain syndrome-based classifications such as DSM III and its successors aim to be atheoretical and pragmatic. Others, such as ICD-10, while more eclectic than the different versions of DSM, follow suit by abandoning the terms "disease" and "illness" in favor of the more consensual "disorder". The legitimacy of classifications in the field of psychiatry has been fiercely contested, being
Complexity classifications for different equivalence and audit problems for Boolean circuits

OpenAIRE

BÃ¶hler, Elmar; Creignou, Nadia; Galota, Matthias; Reith, Steffen; Schnoor, Henning; Vollmer, Heribert

2010-01-01

We study Boolean circuits as a representation of Boolean functions and conskier different equivalence, audit, and enumeration problems. For a number of restricted sets of gate types (bases) we obtain efficient algorithms, while for all other gate types we show these problems are at least NP-hard.
Disorders of sex development: a new definition and classification.

Science.gov (United States)

Hughes, Ieuan A

2008-02-01

A newborn infant with ambiguous genitalia is a complex enough problem to unravel without any further clouding by confusing terms. The nomenclature 'intersex', 'hermaphrodite' and 'pseudohermaphrodite' is anachronistic, unhelpful, and perceived to be pejorative by some affected families. In its place, a consensus statement recommends the term 'disorder of sex development' (DSD), a generic definition encompassing any problem noted at birth where the genitalia are atypical in relation to the chromosomes or gonads. The karyotype is used as a prefix to define the category of DSD, replacing the arcane terminology of male or female pseudohermaphroditism (now known as XY DSD or XX DSD, respectively). The new nomenclature has spawned a simple and logical classification of the causes of DSD. In this chapter new facets of gonadal dysgenesis and novel defects in steroid biosynthesis are reviewed in relation to the DSD classification, and options for early, non-invasive fetal sexing are described. Future research to determine many causes of DSD will benefit from the use of this universal language of scientific communication.
ISBDD Model for Classification of Hyperspectral Remote Sensing Imagery

Directory of Open Access Journals (Sweden)

Na Li

2018-03-01

Full Text Available The diverse density (DD algorithm was proposed to handle the problem of low classification accuracy when training samples contain interference such as mixed pixels. The DD algorithm can learn a feature vector from training bags, which comprise instances (pixels. However, the feature vector learned by the DD algorithm cannot always effectively represent one type of ground cover. To handle this problem, an instance space-based diverse density (ISBDD model that employs a novel training strategy is proposed in this paper. In the ISBDD model, DD values of each pixel are computed instead of learning a feature vector, and as a result, the pixel can be classified according to its DD values. Airborne hyperspectral data collected by the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS sensor and the Push-broom Hyperspectral Imager (PHI are applied to evaluate the performance of the proposed model. Results show that the overall classification accuracy of ISBDD model on the AVIRIS and PHI images is up to 97.65% and 89.02%, respectively, while the kappa coefficient is up to 0.97 and 0.88, respectively.

A New Classification of Endodontic-Periodontal Lesions

Directory of Open Access Journals (Sweden)

Khalid S. Al-Fouzan

2014-01-01

Full Text Available The interrelationship between periodontal and endodontic disease has always aroused confusion, queries, and controversy. Differentiating between a periodontal and an endodontic problem can be difficult. A symptomatic tooth may have pain of periodontal and/or pulpal origin. The nature of that pain is often the first clue in determining the etiology of such a problem. Radiographic and clinical evaluation can help clarify the nature of the problem. In some cases, the influence of pulpal pathology may cause the periodontal involvement and vice versa. The simultaneous existence of pulpal problems and inflammatory periodontal disease can complicate diagnosis and treatment planning. An endo-perio lesion can have a varied pathogenesis which ranges from simple to relatively complex one. The differential diagnosis of endodontic and periodontal diseases can sometimes be difficult, but it is of vital importance to make a correct diagnosis for providing the appropriate treatment. This paper aims to discuss a modified clinical classification to be considered for accurately diagnosing and treating endo-perio lesion.
A new classification of endodontic-periodontal lesions.

Science.gov (United States)

Al-Fouzan, Khalid S

2014-01-01

The interrelationship between periodontal and endodontic disease has always aroused confusion, queries, and controversy. Differentiating between a periodontal and an endodontic problem can be difficult. A symptomatic tooth may have pain of periodontal and/or pulpal origin. The nature of that pain is often the first clue in determining the etiology of such a problem. Radiographic and clinical evaluation can help clarify the nature of the problem. In some cases, the influence of pulpal pathology may cause the periodontal involvement and vice versa. The simultaneous existence of pulpal problems and inflammatory periodontal disease can complicate diagnosis and treatment planning. An endo-perio lesion can have a varied pathogenesis which ranges from simple to relatively complex one. The differential diagnosis of endodontic and periodontal diseases can sometimes be difficult, but it is of vital importance to make a correct diagnosis for providing the appropriate treatment. This paper aims to discuss a modified clinical classification to be considered for accurately diagnosing and treating endo-perio lesion.
Toward functional classification of neuronal types.

Science.gov (United States)

Sharpee, Tatyana O

2014-09-17

How many types of neurons are there in the brain? This basic neuroscience question remains unsettled despite many decades of research. Classification schemes have been proposed based on anatomical, electrophysiological, or molecular properties. However, different schemes do not always agree with each other. This raises the question of whether one can classify neurons based on their function directly. For example, among sensory neurons, can a classification scheme be devised that is based on their role in encoding sensory stimuli? Here, theoretical arguments are outlined for how this can be achieved using information theory by looking at optimal numbers of cell types and paying attention to two key properties: correlations between inputs and noise in neural responses. This theoretical framework could help to map the hierarchical tree relating different neuronal classes within and across species. Copyright © 2014 Elsevier Inc. All rights reserved.
Hazard classification methodology

International Nuclear Information System (INIS)

Brereton, S.J.

1996-01-01

This document outlines the hazard classification methodology used to determine the hazard classification of the NIF LTAB, OAB, and the support facilities on the basis of radionuclides and chemicals. The hazard classification determines the safety analysis requirements for a facility
Welcoming the new WHO classification of pituitary tumors 2017: revolution in TTF-1-positive posterior pituitary tumors.

Science.gov (United States)

Shibuya, Makoto

2018-04-01

The fourth edition of the World Health Organization classification of endocrine tumors (EN-WHO2017) was released in 2017. In this new edition, changes in the classification of non-neuroendocrine tumors are proposed particularly in tumors arising in the posterior pituitary. These tumors are a distinct group of low-grade neoplasms of the sellar region that express thyroid transcription factor-1, and include pituicytoma, granular cell tumor of the sellar region, spindle cell oncocytoma, and sellar ependymoma. This short review focuses on the classification of posterior pituitary tumors newly proposed in EN-WHO2017, and controversies in their pathological differential diagnosis are discussed based on recent cases.
Measuring External Face Appearance for Face Classification

OpenAIRE

Masip, David; Lapedriza, Agata; Vitria, Jordi

2007-01-01

In this chapter we introduce the importance of the external features in face classification problems, and propose a methodology to extract the external features obtaining an aligned feature set. The extracted features can be used as input to any standard pattern recognition classifier, as the classic feature extraction approaches dealing with internal face regions in the literature. The resulting scheme follows a top-down segmentation approach to deal with the diversity inherent to the extern...
Psychology of development of moral reasoning: Problem-oriented overview of the field

Directory of Open Access Journals (Sweden)

Mirić Jovan

2008-01-01

Full Text Available First and foremost, this paper provides a short historical reminder of the emergence of the field of psychology of development of moral reasoning. In the second part of the paper, the author offers a problem-oriented overview of the field, that is, one possible classification of particular groups of problems for empirical research. This overview does not only point out to the problems that were more and that were less studied (e.g.. evaluative moral judgment and reasoning, distinguishing between moral and extra-moral rules and norms and to those that were relatively neglected (i.e. understanding moral situations, but also to the problems that psychologists did not even recognize as research problems. Such are the problems of development of moral concepts, meaning of moral words etc. Finally, the author also points out to the fact that this classification could be taken as one way to define the field, that is, the way to determine the boundaries of its subject of studying.
Automatic classification of pulmonary peri-fissural nodules in computed tomography using an ensemble of 2D views and a convolutional neural network out-of-the-box

NARCIS (Netherlands)

Ciompi, Francesco; de Hoop, Bartjan; van Riel, Sarah J.; Chung, Kaman; Scholten, Ernst Th.; Oudkerk, Matthijs; de Jong, Pim A.; Prokop, Mathias; van Ginneken, Bram

2015-01-01

In this paper, we tackle the problem of automatic classification of pulmonary peri-fissural nodules (PFNs). The classification problem is formulated as a machine learning approach, where detected nodule candidates are classified as PFNs or non-PFNs. Supervised learning is used, where a classifier is
[Categorization of uterine cervix tumors : What's new in the 2014 WHO classification].

Science.gov (United States)

Lax, S F; Horn, L-C; Löning, T

2016-11-01

In the 2014 WHO classification, squamous cell precursor lesions are classified as low-grade and high-grade intraepithelial lesions. LSIL corresponds to CIN1, HSIL includes CIN2 and CIN3. Only adenocarcinoma in situ (AIS) is accepted as precursor of adenocarcinoma and includes the stratified mucin-producing intraepithelial lesion (SMILE). Although relatively rare, adenocarcinoma and squamous cell carcinoma can be mixed with a poorly differentiated neuroendocrine carcinoma. Most cervical adenocarcinomas are low grade and of endocervical type. Mucinous carcinomas show marked intra- and extracellular mucin production. Almost all squamous cell carcinomas, the vast majority of adenocarcinomas, and many rare carcinoma types are HPV related. For low grade endocervical adenocarcinomas, the pattern-based classification according to Silva should be reported. Neuroendocrine tumors are rare and are classified into low-grade and high-grade, whereby the term carcinoid is still used.
Classification Formula and Generation Algorithm of Cycle Decomposition Expression for Dihedral Groups

Directory of Open Access Journals (Sweden)

Dakun Zhang

2013-01-01

Full Text Available The necessary of classification research on common formula of group (dihedral group cycle decomposition expression is illustrated. It includes the reflection and rotation conversion, which derived six common formulae on cycle decomposition expressions of group; it designed the generation algorithm on the cycle decomposition expressions of group, which is based on the method of replacement conversion and the classification formula; algorithm analysis and the results of the process show that the generation algorithm which is based on the classification formula is outperformed by the general algorithm which is based on replacement conversion; it has great significance to solve the enumeration of the necklace combinational scheme, especially the structural problems of combinational scheme, by using group theory and computer.
Supervised Cross-Modal Factor Analysis for Multiple Modal Data Classification

KAUST Repository

Wang, Jingbin

2015-10-09

In this paper we study the problem of learning from multiple modal data for purpose of document classification. In this problem, each document is composed two different modals of data, i.e., An image and a text. Cross-modal factor analysis (CFA) has been proposed to project the two different modals of data to a shared data space, so that the classification of a image or a text can be performed directly in this space. A disadvantage of CFA is that it has ignored the supervision information. In this paper, we improve CFA by incorporating the supervision information to represent and classify both image and text modals of documents. We project both image and text data to a shared data space by factor analysis, and then train a class label predictor in the shared space to use the class label information. The factor analysis parameter and the predictor parameter are learned jointly by solving one single objective function. With this objective function, we minimize the distance between the projections of image and text of the same document, and the classification error of the projection measured by hinge loss function. The objective function is optimized by an alternate optimization strategy in an iterative algorithm. Experiments in two different multiple modal document data sets show the advantage of the proposed algorithm over other CFA methods.
A Ternary Hybrid EEG-NIRS Brain-Computer Interface for the Classification of Brain Activation Patterns during Mental Arithmetic, Motor Imagery, and Idle State.

Science.gov (United States)

Shin, Jaeyoung; Kwon, Jinuk; Im, Chang-Hwan

2018-01-01

The performance of a brain-computer interface (BCI) can be enhanced by simultaneously using two or more modalities to record brain activity, which is generally referred to as a hybrid BCI. To date, many BCI researchers have tried to implement a hybrid BCI system by combining electroencephalography (EEG) and functional near-infrared spectroscopy (NIRS) to improve the overall accuracy of binary classification. However, since hybrid EEG-NIRS BCI, which will be denoted by hBCI in this paper, has not been applied to ternary classification problems, paradigms and classification strategies appropriate for ternary classification using hBCI are not well investigated. Here we propose the use of an hBCI for the classification of three brain activation patterns elicited by mental arithmetic, motor imagery, and idle state, with the aim to elevate the information transfer rate (ITR) of hBCI by increasing the number of classes while minimizing the loss of accuracy. EEG electrodes were placed over the prefrontal cortex and the central cortex, and NIRS optodes were placed only on the forehead. The ternary classification problem was decomposed into three binary classification problems using the "one-versus-one" (OVO) classification strategy to apply the filter-bank common spatial patterns filter to EEG data. A 10 × 10-fold cross validation was performed using shrinkage linear discriminant analysis (sLDA) to evaluate the average classification accuracies for EEG-BCI, NIRS-BCI, and hBCI when the meta-classification method was adopted to enhance classification accuracy. The ternary classification accuracies for EEG-BCI, NIRS-BCI, and hBCI were 76.1 ± 12.8, 64.1 ± 9.7, and 82.2 ± 10.2%, respectively. The classification accuracy of the proposed hBCI was thus significantly higher than those of the other BCIs ( p < 0.005). The average ITR for the proposed hBCI was calculated to be 4.70 ± 1.92 bits/minute, which was 34.3% higher than that reported for a previous binary hBCI study.
Malware distributed collection and pre-classification system using honeypot technology

Science.gov (United States)

Grégio, André R. A.; Oliveira, Isabela L.; Santos, Rafael D. C.; Cansian, Adriano M.; de Geus, Paulo L.

2009-04-01

Malware has become a major threat in the last years due to the ease of spread through the Internet. Malware detection has become difficult with the use of compression, polymorphic methods and techniques to detect and disable security software. Those and other obfuscation techniques pose a problem for detection and classification schemes that analyze malware behavior. In this paper we propose a distributed architecture to improve malware collection using different honeypot technologies to increase the variety of malware collected. We also present a daemon tool developed to grab malware distributed through spam and a pre-classification technique that uses antivirus technology to separate malware in generic classes.
Identification of immune cell infiltration in hematoxylin-eosin stained breast cancer samples: texture-based classification of tissue morphologies

Science.gov (United States)

Turkki, Riku; Linder, Nina; Kovanen, Panu E.; Pellinen, Teijo; Lundin, Johan

2016-03-01

The characteristics of immune cells in the tumor microenvironment of breast cancer capture clinically important information. Despite the heterogeneity of tumor-infiltrating immune cells, it has been shown that the degree of infiltration assessed by visual evaluation of hematoxylin-eosin (H and E) stained samples has prognostic and possibly predictive value. However, quantification of the infiltration in H and E-stained tissue samples is currently dependent on visual scoring by an expert. Computer vision enables automated characterization of the components of the tumor microenvironment, and texture-based methods have successfully been used to discriminate between different tissue morphologies and cell phenotypes. In this study, we evaluate whether local binary pattern texture features with superpixel segmentation and classification with support vector machine can be utilized to identify immune cell infiltration in H and E-stained breast cancer samples. Guided with the pan-leukocyte CD45 marker, we annotated training and test sets from 20 primary breast cancer samples. In the training set of arbitrary sized image regions (n=1,116) a 3-fold cross-validation resulted in 98% accuracy and an area under the receiver-operating characteristic curve (AUC) of 0.98 to discriminate between immune cell -rich and - poor areas. In the test set (n=204), we achieved an accuracy of 96% and AUC of 0.99 to label cropped tissue regions correctly into immune cell -rich and -poor categories. The obtained results demonstrate strong discrimination between immune cell -rich and -poor tissue morphologies. The proposed method can provide a quantitative measurement of the degree of immune cell infiltration and applied to digitally scanned H and E-stained breast cancer samples for diagnostic purposes.
Classification of phytoplankton cells as live or dead using the vital stains fluorescein diacetate and 5-chloromethylfluorescein diacetate.

Science.gov (United States)

MacIntyre, Hugh L; Cullen, John J

2016-08-01

Regulations for ballast water treatment specify limits on the concentrations of living cells in discharge water. The vital stains fluorescein diacetate (FDA) and 5-chloromethylfluorescein diacetate (CMFDA) in combination have been recommended for use in verification of ballast water treatment technology. We tested the effectiveness of FDA and CMFDA, singly and in combination, in discriminating between living and heat-killed populations of 24 species of phytoplankton from seven divisions, verifying with quantitative growth assays that uniformly live and dead populations were compared. The diagnostic signal, per-cell fluorescence intensity, was measured by flow cytometry and alternate discriminatory thresholds were defined statistically from the frequency distributions of the dead or living cells. Species were clustered by staining patterns: for four species, the staining of live versus dead cells was distinct, and live-dead classification was essentially error free. But overlap between the frequency distributions of living and heat-killed cells in the other taxa led to unavoidable errors, well in excess of 20% in many. In 4 very weakly staining taxa, the mean fluorescence intensity in the heat-killed cells was higher than that of the living cells, which is inconsistent with the assumptions of the method. Applying the criteria of ≤5% false negative plus ≤5% false positive errors, and no significant loss of cells due to staining, FDA and FDA+CMFDA gave acceptably accurate results for only 8-10 of 24 species (i.e., 33%-42%). CMFDA was the least effective stain and its addition to FDA did not improve the performance of FDA alone. © 2016 The Authors. Journal of Phycology published by Wiley Periodicals, Inc. on behalf of Phycological Society of America.
Fast Gaussian kernel learning for classification tasks based on specially structured global optimization.

Science.gov (United States)

Zhong, Shangping; Chen, Tianshun; He, Fengying; Niu, Yuzhen

2014-09-01

For a practical pattern classification task solved by kernel methods, the computing time is mainly spent on kernel learning (or training). However, the current kernel learning approaches are based on local optimization techniques, and hard to have good time performances, especially for large datasets. Thus the existing algorithms cannot be easily extended to large-scale tasks. In this paper, we present a fast Gaussian kernel learning method by solving a specially structured global optimization (SSGO) problem. We optimize the Gaussian kernel function by using the formulated kernel target alignment criterion, which is a difference of increasing (d.i.) functions. Through using a power-transformation based convexification method, the objective criterion can be represented as a difference of convex (d.c.) functions with a fixed power-transformation parameter. And the objective programming problem can then be converted to a SSGO problem: globally minimizing a concave function over a convex set. The SSGO problem is classical and has good solvability. Thus, to find the global optimal solution efficiently, we can adopt the improved Hoffman's outer approximation method, which need not repeat the searching procedure with different starting points to locate the best local minimum. Also, the proposed method can be proven to converge to the global solution for any classification task. We evaluate the proposed method on twenty benchmark datasets, and compare it with four other Gaussian kernel learning methods. Experimental results show that the proposed method stably achieves both good time-efficiency performance and good classification performance. Copyright © 2014 Elsevier Ltd. All rights reserved.
Evaluation of Current Approaches to Stream Classification and a Heuristic Guide to Developing Classifications of Integrated Aquatic Networks

Science.gov (United States)

Melles, S. J.; Jones, N. E.; Schmidt, B. J.

2014-03-01

Conservation and management of fresh flowing waters involves evaluating and managing effects of cumulative impacts on the aquatic environment from disturbances such as: land use change, point and nonpoint source pollution, the creation of dams and reservoirs, mining, and fishing. To assess effects of these changes on associated biotic communities it is necessary to monitor and report on the status of lotic ecosystems. A variety of stream classification methods are available to assist with these tasks, and such methods attempt to provide a systematic approach to modeling and understanding complex aquatic systems at various spatial and temporal scales. Of the vast number of approaches that exist, it is useful to group them into three main types. The first involves modeling longitudinal species turnover patterns within large drainage basins and relating these patterns to environmental predictors collected at reach and upstream catchment scales; the second uses regionalized hierarchical classification to create multi-scale, spatially homogenous aquatic ecoregions by grouping adjacent catchments together based on environmental similarities; and the third approach groups sites together on the basis of similarities in their environmental conditions both within and between catchments, independent of their geographic location. We review the literature with a focus on more recent classifications to examine the strengths and weaknesses of the different approaches. We identify gaps or problems with the current approaches, and we propose an eight-step heuristic process that may assist with development of more flexible and integrated aquatic classifications based on the current understanding, network thinking, and theoretical underpinnings.
Simple adaptive sparse representation based classification schemes for EEG based brain-computer interface applications.

Science.gov (United States)

Shin, Younghak; Lee, Seungchan; Ahn, Minkyu; Cho, Hohyun; Jun, Sung Chan; Lee, Heung-No

2015-11-01

One of the main problems related to electroencephalogram (EEG) based brain-computer interface (BCI) systems is the non-stationarity of the underlying EEG signals. This results in the deterioration of the classification performance during experimental sessions. Therefore, adaptive classification techniques are required for EEG based BCI applications. In this paper, we propose simple adaptive sparse representation based classification (SRC) schemes. Supervised and unsupervised dictionary update techniques for new test data and a dictionary modification method by using the incoherence measure of the training data are investigated. The proposed methods are very simple and additional computation for the re-training of the classifier is not needed. The proposed adaptive SRC schemes are evaluated using two BCI experimental datasets. The proposed methods are assessed by comparing classification results with the conventional SRC and other adaptive classification methods. On the basis of the results, we find that the proposed adaptive schemes show relatively improved classification accuracy as compared to conventional methods without requiring additional computation. Copyright © 2015 Elsevier Ltd. All rights reserved.
A Comparative Study of Feature Selection and Classification Methods for Gene Expression Data

KAUST Repository

Abusamra, Heba

2013-05-01

Microarray technology has enriched the study of gene expression in such a way that scientists are now able to measure the expression levels of thousands of genes in a single experiment. Microarray gene expression data gained great importance in recent years due to its role in disease diagnoses and prognoses which help to choose the appropriate treatment plan for patients. This technology has shifted a new era in molecular classification, interpreting gene expression data remains a difficult problem and an active research area due to their native nature of “high dimensional low sample size”. Such problems pose great challenges to existing classification methods. Thus, effective feature selection techniques are often needed in this case to aid to correctly classify different tumor types and consequently lead to a better understanding of genetic signatures as well as improve treatment strategies. This thesis aims on a comparative study of state-of-the-art feature selection methods, classification methods, and the combination of them, based on gene expression data. We compared the efficiency of three different classification methods including: support vector machines, k- nearest neighbor and random forest, and eight different feature selection methods, including: information gain, twoing rule, sum minority, max minority, gini index, sum of variances, t- statistics, and one-dimension support vector machine. Five-fold cross validation was used to evaluate the classification performance. Two publicly available gene expression data sets of glioma were used for this study. Different experiments have been applied to compare the performance of the classification methods with and without performing feature selection. Results revealed the important role of feature selection in classifying gene expression data. By performing feature selection, the classification accuracy can be significantly boosted by using a small number of genes. The relationship of features selected in
Multi-temporal and Dual-polarization Interferometric SAR for Land Cover Type Classification

Directory of Open Access Journals (Sweden)

WANG Xinshuang

2015-05-01

Full Text Available In order to study SAR land cover classification method, this paper uses the multi-dimensional combination of temporal,polarization and InSAR data. The area covered by space borne data of ALOS PALSAR in Xunke County,Heilongjiang Province was chosen as test site. A land cover classification technique of SVM based on multi-temporal, multi-polarization and InSAR data had been proposed, using the sensitivity to land cover type of multi-temporal, multi-polarization SAR data and InSAR measurements, and combing time series characteristic of backscatter coefficient and correlation coefficient to identify ground objects. The results showed the problem of confusion between forest land and urban construction land can be nicely solved, using the correlation coefficient between HH and HV, and also combing the selected temporal, polarization and InSAR characteristics. The land cover classification result with higher accuracy is gotten using the classification algorithm proposed in this paper.

Classification, disease, and diagnosis.

Science.gov (United States)

Jutel, Annemarie

2011-01-01

Classification shapes medicine and guides its practice. Understanding classification must be part of the quest to better understand the social context and implications of diagnosis. Classifications are part of the human work that provides a foundation for the recognition and study of illness: deciding how the vast expanse of nature can be partitioned into meaningful chunks, stabilizing and structuring what is otherwise disordered. This article explores the aims of classification, their embodiment in medical diagnosis, and the historical traditions of medical classification. It provides a brief overview of the aims and principles of classification and their relevance to contemporary medicine. It also demonstrates how classifications operate as social framing devices that enable and disable communication, assert and refute authority, and are important items for sociological study.
Combined principal component preprocessing and n-tuple neural networks for improved classification

DEFF Research Database (Denmark)

Høskuldsson, Agnar; Linneberg, Christian

2000-01-01

We present a combined principal component analysis/neural network scheme for classification. The data used to illustrate the method consist of spectral fluorescence recordings from seven different production facilities, and the task is to relate an unknown sample to one of these seven factories....... The data are first preprocessed by performing an individual principal component analysis on each of the seven groups of data. The components found are then used for classifying the data, but instead of making a single multiclass classifier, we follow the ideas of turning a multiclass problem into a number...... of two-class problems. For each possible pair of classes we further apply a transformation to the calculated principal components in order to increase the separation between the classes. Finally we apply the so-called n-tuple neural network to the transformed data in order to give the classification...
[Aggressive B‑cell lymphomas : Recommendations from the German Panel of Reference Pathologists in the Competence Network on Malignant Lymphomas on diagnostic procedures according to the current WHO classification, update 2017].

Science.gov (United States)

Klapper, W; Fend, F; Feller, A; Hansmann, M L; Möller, P; Stein, H; Rosenwald, A; Ott, G

2018-04-17

The update of the 4th edition of the WHO classification for hematopoietic neoplasms introduces changes in the field of mature aggressive B‑cell lymphomas that are relevant to diagnostic pathologists. In daily practice, the question arises of which analysis should be performed when diagnosing the most common lymphoma entity, diffuse large B‑cell lymphoma. We discuss the importance of the cell of origin, the analysis of MYC translocations, and the delineation of the new WHO entities of high-grade B‑cell lymphomas.
Example-Dependent Cost-Sensitive Classification with Applications in Financial Risk Modeling and Marketing Analytics

OpenAIRE

Correa Bahnsen, Alejandro

2015-01-01

Several real-world binary classification problems are example-dependent cost-sensitive in nature, where the costs due to misclassification vary between examples and not only within classes. However, standard binary classification methods do not take these costs into account, and assume a constant cost of misclassification errors. This approach is not realistic in many real-world applications. For example in credit card fraud detection, failing to detect a fraudulent transaction may have an ec...
Multi-view Multi-sparsity Kernel Reconstruction for Multi-class Image Classification

KAUST Repository

Zhu, Xiaofeng

2015-05-28

This paper addresses the problem of multi-class image classification by proposing a novel multi-view multi-sparsity kernel reconstruction (MMKR for short) model. Given images (including test images and training images) representing with multiple visual features, the MMKR first maps them into a high-dimensional space, e.g., a reproducing kernel Hilbert space (RKHS), where test images are then linearly reconstructed by some representative training images, rather than all of them. Furthermore a classification rule is proposed to classify test images. Experimental results on real datasets show the effectiveness of the proposed MMKR while comparing to state-of-the-art algorithms.
Classification of medication incidents associated with information technology.

Science.gov (United States)

Cheung, Ka-Chun; van der Veen, Willem; Bouvy, Marcel L; Wensing, Michel; van den Bemt, Patricia M L A; de Smet, Peter A G M

2014-02-01

Information technology (IT) plays a pivotal role in improving patient safety, but can also cause new problems for patient safety. This study analyzed the nature and consequences of a large sample of IT-related medication incidents, as reported by healthcare professionals in community pharmacies and hospitals. The medication incidents submitted to the Dutch central medication incidents registration (CMR) reporting system were analyzed from the perspective of the healthcare professional with the Magrabi classification. During classification new terms were added, if necessary. The principal source of the IT-related problem, nature of error. Additional measures: consequences of incidents, IT systems, phases of the medication process. From March 2010 to February 2011 the CMR received 4161 incidents: 1643 (39.5%) from community pharmacies and 2518 (60.5%) from hospitals. Eventually one of six incidents (16.1%, n=668) were related to IT; in community pharmacies more incidents (21.5%, n=351) were related to IT than in hospitals (12.6%, n=317). In community pharmacies 41.0% (n=150) of the incidents were about choosing the wrong medicine. Most of the erroneous exchanges were associated with confusion of medicine names and poor design of screens. In hospitals 55.3% (n=187) of incidents concerned human-machine interaction-related input during the use of computerized prescriber order entry. These use problems were also a major problem in pharmacy information systems outside the hospital. A large sample of incidents shows that many of the incidents are related to IT, both in community pharmacies and hospitals. The interaction between human and machine plays a pivotal role in IT incidents in both settings.
NIM: A Node Influence Based Method for Cancer Classification

Directory of Open Access Journals (Sweden)

Yiwen Wang

2014-01-01

Full Text Available The classification of different cancer types owns great significance in the medical field. However, the great majority of existing cancer classification methods are clinical-based and have relatively weak diagnostic ability. With the rapid development of gene expression technology, it is able to classify different kinds of cancers using DNA microarray. Our main idea is to confront the problem of cancer classification using gene expression data from a graph-based view. Based on a new node influence model we proposed, this paper presents a novel high accuracy method for cancer classification, which is composed of four parts: the first is to calculate the similarity matrix of all samples, the second is to compute the node influence of training samples, the third is to obtain the similarity between every test sample and each class using weighted sum of node influence and similarity matrix, and the last is to classify each test sample based on its similarity between every class. The data sets used in our experiments are breast cancer, central nervous system, colon tumor, prostate cancer, acute lymphoblastic leukemia, and lung cancer. experimental results showed that our node influence based method (NIM is more efficient and robust than the support vector machine, K-nearest neighbor, C4.5, naive Bayes, and CART.
Ebola Virus Infection Modelling and Identifiability Problems

Directory of Open Access Journals (Sweden)

Van-Kinh eNguyen

2015-04-01

Full Text Available The recent outbreaks of Ebola virus (EBOV infections have underlined the impact of the virus as a major threat for human health. Due to the high biosafety classification of EBOV (level 4, basic research is very limited. Therefore, the development of new avenues of thinking to advance quantitative comprehension of the virus and its interaction with the host cells is urgently neededto tackle this lethal disease. Mathematical modelling of the EBOV dynamics can be instrumental to interpret Ebola infection kinetics on quantitative grounds. To the best of our knowledge, a mathematical modelling approach to unravel the interaction between EBOV and the host cells isstill missing. In this paper, a mathematical model based on differential equations is used to represent the basic interactions between EBOV and wild-type Vero cells in vitro. Parameter sets that represent infectivity of pathogens are estimated for EBOV infection and compared with influenza virus infection kinetics. The average infecting time of wild-type Vero cells in EBOV is slower than in influenza infection. Simulation results suggest that the slow infecting time of EBOV could be compensated by its efficient replication. This study reveals several identifiability problems and what kind of experiments are necessary to advance the quantification of EBOV infection. A first mathematical approach of EBOV dynamics and the estimation of standard parametersin viral infections kinetics is the key contribution of this work, paving the way for future modelling work on EBOV infection.
Identification of health problems in patients with acute inflammatory arthritis, using the International Classification of Functioning, Disability and Health (ICF).

Science.gov (United States)

Zochling, J; Grill, E; Scheuringer, M; Liman, W; Stucki, G; Braun, J

2006-01-01

To identify the most common health problems experienced by patients with acute inflammatory arthritis using the International Classification of Functioning, Disability and Health (ICF), and to provide empirical data for the development of an ICF Core Set for acute inflammatory arthritis. Cross-sectional survey of patients with acute inflammatory arthritis of two or more joints requiring admission to an acute hospital. The second level categories of the ICF were used to collect information on patients' health problems. Relative frequencies of impairments, limitations and restrictions in the study population were reported for the ICF components Body Functions, Body Structures, and Activities and Participations. For the component Environmental Factors absolute and relative frequencies of perceived barriers or facilitators were reported. In total, 130 patients were included in the survey. The mean age of the population was 59.9 years (median age 63.0 years), 75% of the patients were female. Most had rheumatoid arthritis (57%) or early inflammatory polyarthritis (22%). Fifty-four second-level ICF categories had a prevalence of 30% or more: 3 (8%) belonged to the component Body Structures and 10 (13%) to the component Body Functions. Most categories were identified in the components Activities and Participation (19; 23%) and Environmental Factors (22; 56%). Patients with acute inflammatory arthritis can be well described by ICF categories and components. This study is the first step towards the development of an ICF Core Set for patients with acute inflammatory arthritis.
Pathological Bases for a Robust Application of Cancer Molecular Classification

Directory of Open Access Journals (Sweden)

Salvador J. Diaz-Cano

2015-04-01

Full Text Available Any robust classification system depends on its purpose and must refer to accepted standards, its strength relying on predictive values and a careful consideration of known factors that can affect its reliability. In this context, a molecular classification of human cancer must refer to the current gold standard (histological classification and try to improve it with key prognosticators for metastatic potential, staging and grading. Although organ-specific examples have been published based on proteomics, transcriptomics and genomics evaluations, the most popular approach uses gene expression analysis as a direct correlate of cellular differentiation, which represents the key feature of the histological classification. RNA is a labile molecule that varies significantly according with the preservation protocol, its transcription reflect the adaptation of the tumor cells to the microenvironment, it can be passed through mechanisms of intercellular transference of genetic information (exosomes, and it is exposed to epigenetic modifications. More robust classifications should be based on stable molecules, at the genetic level represented by DNA to improve reliability, and its analysis must deal with the concept of intratumoral heterogeneity, which is at the origin of tumor progression and is the byproduct of the selection process during the clonal expansion and progression of neoplasms. The simultaneous analysis of multiple DNA targets and next generation sequencing offer the best practical approach for an analytical genomic classification of tumors.
Out-of-Sample Generalizations for Supervised Manifold Learning for Classification.

Science.gov (United States)

Vural, Elif; Guillemot, Christine

2016-03-01

Supervised manifold learning methods for data classification map high-dimensional data samples to a lower dimensional domain in a structure-preserving way while increasing the separation between different classes. Most manifold learning methods compute the embedding only of the initially available data; however, the generalization of the embedding to novel points, i.e., the out-of-sample extension problem, becomes especially important in classification applications. In this paper, we propose a semi-supervised method for building an interpolation function that provides an out-of-sample extension for general supervised manifold learning algorithms studied in the context of classification. The proposed algorithm computes a radial basis function interpolator that minimizes an objective function consisting of the total embedding error of unlabeled test samples, defined as their distance to the embeddings of the manifolds of their own class, as well as a regularization term that controls the smoothness of the interpolation function in a direction-dependent way. The class labels of test data and the interpolation function parameters are estimated jointly with an iterative process. Experimental results on face and object images demonstrate the potential of the proposed out-of-sample extension algorithm for the classification of manifold-modeled data sets.
Standard classification: Physics

International Nuclear Information System (INIS)

1977-01-01

This is a draft standard classification of physics. The conception is based on the physics part of the systematic catalogue of the Bayerische Staatsbibliothek and on the classification given in standard textbooks. The ICSU-AB classification now used worldwide by physics information services was not taken into account. (BJ) [de
On the classification of elliptic foliations induced by real quadratic fields with center

Science.gov (United States)

Puchuri, Liliana; Bueno, Orestes

2016-12-01

Related to the study of Hilbert's infinitesimal problem, is the problem of determining the existence and estimating the number of limit cycles of the linear perturbation of Hamiltonian fields. A classification of the elliptic foliations in the projective plane induced by the fields obtained by quadratic fields with center was already studied by several authors. In this work, we devise a unified proof of the classification of elliptic foliations induced by quadratic fields with center. This technique involves using a formula due to Cerveau & Lins Neto to calculate the genus of the generic fiber of a first integral of foliations of these kinds. Furthermore, we show that these foliations induce several examples of linear families of foliations which are not bimeromorphically equivalent to certain remarkable examples given by Lins Neto.
Classification and risk assessment of individuals with familial polyposis, Gardner's syndrome, and familial non-polyposis colon cancer from [3H]thymidine labeling patterns in colonic epithelial cells

International Nuclear Information System (INIS)

Lipkin, M.; Blattner, W.A.; Gardner, E.J.; Burt, R.W.; Lynch, H.; Deschner, E.; Winawer, S.; Fraumeni, J.F. Jr.

1984-01-01

A probabilistic analysis has been developed to assist the binary classification and risk assessment of members of familial colon cancer kindreds. The analysis is based on the microautoradiographic observation of [ 3 H]thymidine-labeled epithelial cells in colonic mucosa of the kindred members. From biopsies of colonic mucosa which are labeled with [ 3 H]thymidine in vitro, the degree of similarity of each subject's cell-labeling pattern measured over entire crypts was automatically compared to the labeling patterns of high-risk and low-risk reference populations. Each individual was then presumptively classified and assigned to one of the reference populations, and a degree of risk for the classification was provided. In carrying out the analysis, a linear score was calculated for each individual relative to each of the reference populations, and the classification was based on the polarity of the score difference; the degree of risk was then quantitated from the magnitude of the score difference. When the method was applied to kindreds having either familial polyposis or familial non-polyposis colon cancer, it effectively segregated individuals affected with disease from others at low risk, with sensitivity and specificity ranging from 71 to 92%. Further application of the method to asymptomatic family members believed to be at 50% risk on the basis of pedigree evaluation revealed a biomodal distribution to nearly zero or full risk. The accuracy and simplicity of this approach and its capability of revealing early stages of abnormal colonic epithelial cell development indicate potential for preclinical screening of subjects at risk in cancer-prone kindreds and for assisting the analysis of modes of inheritance
Multi-Objective Particle Swarm Optimization Approach for Cost-Based Feature Selection in Classification.

Science.gov (United States)

Zhang, Yong; Gong, Dun-Wei; Cheng, Jian

2017-01-01

Feature selection is an important data-preprocessing technique in classification problems such as bioinformatics and signal processing. Generally, there are some situations where a user is interested in not only maximizing the classification performance but also minimizing the cost that may be associated with features. This kind of problem is called cost-based feature selection. However, most existing feature selection approaches treat this task as a single-objective optimization problem. This paper presents the first study of multi-objective particle swarm optimization (PSO) for cost-based feature selection problems. The task of this paper is to generate a Pareto front of nondominated solutions, that is, feature subsets, to meet different requirements of decision-makers in real-world applications. In order to enhance the search capability of the proposed algorithm, a probability-based encoding technology and an effective hybrid operator, together with the ideas of the crowding distance, the external archive, and the Pareto domination relationship, are applied to PSO. The proposed PSO-based multi-objective feature selection algorithm is compared with several multi-objective feature selection algorithms on five benchmark datasets. Experimental results show that the proposed algorithm can automatically evolve a set of nondominated solutions, and it is a highly competitive feature selection method for solving cost-based feature selection problems.
Robust nuclear lamina-based cell classification of aging and senescent cells.

Science.gov (United States)

Righolt, Christiaan H; van 't Hoff, Merel L R; Vermolen, Bart J; Young, Ian T; Raz, Vered

2011-12-01

Changes in the shape of the nuclear lamina are exhibited in senescent cells, as well as in cells expressing mutations in lamina genes. To identify cells with defects in the nuclear lamina we developed an imaging method that quantifies the intensity and curvature of the nuclear lamina. We show that this method accurately describes changes in the nuclear lamina. Spatial changes in nuclear lamina coincide with redistribution of lamin A proteins and local reduction in protein mobility in senescent cell. We suggest that local accumulation of lamin A in the nuclear envelope leads to bending of the structure. A quantitative distinction of the nuclear lamina shape in cell populations was found between fresh and senescent cells, and between primary myoblasts from young and old donors. Moreover, with this method mutations in lamina genes were significantly distinct from cells with wild-type genes. We suggest that this method can be applied to identify abnormal cells during aging, in in vitro propagation, and in lamina disorders.
China's Classification-Based Forest Management: Procedures, Problems, and Prospects

Science.gov (United States)

Dai, Limin; Zhao, Fuqiang; Shao, Guofan; Zhou, Li; Tang, Lina

2009-06-01

China’s new Classification-Based Forest Management (CFM) is a two-class system, including Commodity Forest (CoF) and Ecological Welfare Forest (EWF) lands, so named according to differences in their distinct functions and services. The purposes of CFM are to improve forestry economic systems, strengthen resource management in a market economy, ease the conflicts between wood demands and public welfare, and meet the diversified needs for forest services in China. The formative process of China’s CFM has involved a series of trials and revisions. China’s central government accelerated the reform of CFM in the year 2000 and completed the final version in 2003. CFM was implemented at the provincial level with the aid of subsidies from the central government. About a quarter of the forestland in China was approved as National EWF lands by the State Forestry Administration in 2006 and 2007. Logging is prohibited on National EWF lands, and their landowners or managers receive subsidies of about 70 RMB (US10) per hectare from the central government. CFM represents a new forestry strategy in China and its implementation inevitably faces challenges in promoting the understanding of forest ecological services, generalizing nationwide criteria for identifying EWF and CoF lands, setting up forest-specific compensation mechanisms for ecological benefits, enhancing the knowledge of administrators and the general public about CFM, and sustaining EWF lands under China’s current forestland tenure system. CFM does, however, offer a viable pathway toward sustainable forest management in China.
A Proposal for Cardiac Arrhythmia Classification using Complexity Measures

Directory of Open Access Journals (Sweden)

AROTARITEI, D.

2017-08-01

Full Text Available Cardiovascular diseases are one of the major problems of humanity and therefore one of their component, arrhythmia detection and classification drawn an increased attention worldwide. The presence of randomness in discrete time series, like those arising in electrophysiology, is firmly connected with computational complexity measure. This connection can be used, for instance, in the analysis of RR-intervals of electrocardiographic (ECG signal, coded as binary string, to detect and classify arrhythmia. Our approach uses three algorithms (Lempel-Ziv, Sample Entropy and T-Code to compute the information complexity applied and a classification tree to detect 13 types of arrhythmia with encouraging results. To overcome the computational effort required for complexity calculus, a cloud computing solution with executable code deployment is also proposed.
Joint classification and contour extraction of large 3D point clouds

Science.gov (United States)

Hackel, Timo; Wegner, Jan D.; Schindler, Konrad

2017-08-01

We present an effective and efficient method for point-wise semantic classification and extraction of object contours of large-scale 3D point clouds. What makes point cloud interpretation challenging is the sheer size of several millions of points per scan and the non-grid, sparse, and uneven distribution of points. Standard image processing tools like texture filters, for example, cannot handle such data efficiently, which calls for dedicated point cloud labeling methods. It turns out that one of the major drivers for efficient computation and handling of strong variations in point density, is a careful formulation of per-point neighborhoods at multiple scales. This allows, both, to define an expressive feature set and to extract topologically meaningful object contours. Semantic classification and contour extraction are interlaced problems. Point-wise semantic classification enables extracting a meaningful candidate set of contour points while contours help generating a rich feature representation that benefits point-wise classification. These methods are tailored to have fast run time and small memory footprint for processing large-scale, unstructured, and inhomogeneous point clouds, while still achieving high classification accuracy. We evaluate our methods on the semantic3d.net benchmark for terrestrial laser scans with >109 points.
An Outlyingness Matrix for Multivariate Functional Data Classification

KAUST Repository

Dai, Wenlin; Genton, Marc G.

2017-01-01

outlyingness with conventional statistical depth. We propose two classifiers based on directional outlyingness and the outlyingness matrix, respectively. Our classifiers provide better performance compared with existing depth-based classifiers when applied on both univariate and multivariate functional data from simulation studies. We also test our methods on two data problems: speech recognition and gesture classification, and obtain results that are consistent with the findings from the simulated data.

Some links on this page may take you to non-federal websites. Their policies may differ from this site.