WorldWideScience

Sample records for nonparametric discriminant analysis

  1. Bayesian nonparametric data analysis

    CERN Document Server

    Müller, Peter; Jara, Alejandro; Hanson, Tim

    2015-01-01

    This book reviews nonparametric Bayesian methods and models that have proven useful in the context of data analysis. Rather than providing an encyclopedic review of probability models, the book’s structure follows a data analysis perspective. As such, the chapters are organized by traditional data analysis problems. In selecting specific nonparametric models, simpler and more traditional models are favored over specialized ones. The discussed methods are illustrated with a wealth of examples, including applications ranging from stylized examples to case studies from recent literature. The book also includes an extensive discussion of computational methods and details on their implementation. R code for many examples is included in on-line software pages.

  2. Noise filtering and nonparametric analysis of microarray data underscores discriminating markers of oral, prostate, lung, ovarian and breast cancer

    Directory of Open Access Journals (Sweden)

    Dermody James J

    2004-11-01

    Full Text Available Abstract Background A major goal of cancer research is to identify discrete biomarkers that specifically characterize a given malignancy. These markers are useful in diagnosis, may identify potential targets for drug development, and can aid in evaluating treatment efficacy and predicting patient outcome. Microarray technology has enabled marker discovery from human cells by permitting measurement of steady-state mRNA levels derived from thousands of genes. However many challenging and unresolved issues regarding the acquisition and analysis of microarray data remain, such as accounting for both experimental and biological noise, transcripts whose expression profiles are not normally distributed, guidelines for statistical assessment of false positive/negative rates and comparing data derived from different research groups. This study addresses these issues using Affymetrix HG-U95A and HG-U133 GeneChip data derived from different research groups. Results We present here a simple non parametric approach coupled with noise filtering to identify sets of genes differentially expressed between the normal and cancer states in oral, breast, lung, prostate and ovarian tumors. An important feature of this study is the ability to integrate data from different laboratories, improving the analytical power of the individual results. One of the most interesting findings is the down regulation of genes involved in tissue differentiation. Conclusions This study presents the development and application of a noise model that suppresses noise, limits false positives in the results, and allows integration of results from individual studies derived from different research groups.

  3. Bayesian Nonparametric Longitudinal Data Analysis.

    Science.gov (United States)

    Quintana, Fernando A; Johnson, Wesley O; Waetjen, Elaine; Gold, Ellen

    2016-01-01

    Practical Bayesian nonparametric methods have been developed across a wide variety of contexts. Here, we develop a novel statistical model that generalizes standard mixed models for longitudinal data that include flexible mean functions as well as combined compound symmetry (CS) and autoregressive (AR) covariance structures. AR structure is often specified through the use of a Gaussian process (GP) with covariance functions that allow longitudinal data to be more correlated if they are observed closer in time than if they are observed farther apart. We allow for AR structure by considering a broader class of models that incorporates a Dirichlet Process Mixture (DPM) over the covariance parameters of the GP. We are able to take advantage of modern Bayesian statistical methods in making full predictive inferences and about characteristics of longitudinal profiles and their differences across covariate combinations. We also take advantage of the generality of our model, which provides for estimation of a variety of covariance structures. We observe that models that fail to incorporate CS or AR structure can result in very poor estimation of a covariance or correlation matrix. In our illustration using hormone data observed on women through the menopausal transition, biology dictates the use of a generalized family of sigmoid functions as a model for time trends across subpopulation categories.

  4. Nonparametric factor analysis of time series

    OpenAIRE

    Rodríguez-Poo, Juan M.; Linton, Oliver Bruce

    1998-01-01

    We introduce a nonparametric smoothing procedure for nonparametric factor analaysis of multivariate time series. The asymptotic properties of the proposed procedures are derived. We present an application based on the residuals from the Fair macromodel.

  5. Nonparametric analysis of blocked ordered categories data: some examples revisited

    Directory of Open Access Journals (Sweden)

    O. Thas

    2006-08-01

    Full Text Available Nonparametric analysis for general block designs can be given by using the Cochran-Mantel-Haenszel (CMH statistics. We demonstrate this with four examples and note that several well-known nonparametric statistics are special cases of CMH statistics.

  6. A Bayesian Nonparametric Approach to Factor Analysis

    DEFF Research Database (Denmark)

    Piatek, Rémi; Papaspiliopoulos, Omiros

    2018-01-01

    This paper introduces a new approach for the inference of non-Gaussian factor models based on Bayesian nonparametric methods. It relaxes the usual normality assumption on the latent factors, widely used in practice, which is too restrictive in many settings. Our approach, on the contrary, does no...

  7. Weak Disposability in Nonparametric Production Analysis with Undesirable Outputs

    NARCIS (Netherlands)

    Kuosmanen, T.K.

    2005-01-01

    Environmental Economics and Natural Resources Group at Wageningen University in The Netherlands Weak disposability of outputs means that firms can abate harmful emissions by decreasing the activity level. Modeling weak disposability in nonparametric production analysis has caused some confusion.

  8. Non-parametric analysis of production efficiency of poultry egg ...

    African Journals Online (AJOL)

    Non-parametric analysis of production efficiency of poultry egg farmers in Delta ... analysis of factors affecting the output of poultry farmers showed that stock ... should be put in place for farmers to learn the best farm practices carried out on the ...

  9. Discriminant analysis of plasma fusion data

    International Nuclear Information System (INIS)

    Kardaun, O.J.W.F.; Kardaun, J.W.P.F.; Itoh, S.; Itoh, K.

    1992-06-01

    Several discriminant analysis methods has been applied and compared to predict the type of ELM's in H-mode discharges: (a) quadratic discriminant analysis (linear discriminant analysis being a special case), (b) discrimination by non-parametric (kernel-) density estimates, and (c) discrimination by a product multinomial model on a discretised scale. Practical evaluation was performed using SAS in the first two cases, and INDEP, a standard FORTRAN program, initially developed for medical applications, in the last case. We give here a flavour of the approach and its results. In summary, discriminant analysis can be used as a useful descriptive method of specifying regions where particular types of plasma discharges can be produced. Parametric methods have the advantage of a rather compact mathematical formulation . Pertinent graphical representations are useful to make the theory and the results more palatable to the experimental physicists. (J.P.N.)

  10. Comparative analysis of automotive paints by laser induced breakdown spectroscopy and nonparametric permutation tests

    International Nuclear Information System (INIS)

    McIntee, Erin; Viglino, Emilie; Rinke, Caitlin; Kumor, Stephanie; Ni Liqiang; Sigman, Michael E.

    2010-01-01

    Laser-induced breakdown spectroscopy (LIBS) has been investigated for the discrimination of automobile paint samples. Paint samples from automobiles of different makes, models, and years were collected and separated into sets based on the color, presence or absence of effect pigments and the number of paint layers. Twelve LIBS spectra were obtained for each paint sample, each an average of a five single shot 'drill down' spectra from consecutive laser ablations in the same spot on the sample. Analyses by a nonparametric permutation test and a parametric Wald test were performed to determine the extent of discrimination within each set of paint samples. The discrimination power and Type I error were assessed for each data analysis method. Conversion of the spectral intensity to a log-scale (base 10) resulted in a higher overall discrimination power while observing the same significance level. Working on the log-scale, the nonparametric permutation tests gave an overall 89.83% discrimination power with a size of Type I error being 4.44% at the nominal significance level of 5%. White paint samples, as a group, were the most difficult to differentiate with the power being only 86.56% followed by 95.83% for black paint samples. Parametric analysis of the data set produced lower discrimination (85.17%) with 3.33% Type I errors, which is not recommended for both theoretical and practical considerations. The nonparametric testing method is applicable across many analytical comparisons, with the specific application described here being the pairwise comparison of automotive paint samples.

  11. A Bayesian Nonparametric Meta-Analysis Model

    Science.gov (United States)

    Karabatsos, George; Talbott, Elizabeth; Walker, Stephen G.

    2015-01-01

    In a meta-analysis, it is important to specify a model that adequately describes the effect-size distribution of the underlying population of studies. The conventional normal fixed-effect and normal random-effects models assume a normal effect-size population distribution, conditionally on parameters and covariates. For estimating the mean overall…

  12. Using non-parametric methods in econometric production analysis

    DEFF Research Database (Denmark)

    Czekaj, Tomasz Gerard; Henningsen, Arne

    2012-01-01

    by investigating the relationship between the elasticity of scale and the farm size. We use a balanced panel data set of 371~specialised crop farms for the years 2004-2007. A non-parametric specification test shows that neither the Cobb-Douglas function nor the Translog function are consistent with the "true......Econometric estimation of production functions is one of the most common methods in applied economic production analysis. These studies usually apply parametric estimation techniques, which obligate the researcher to specify a functional form of the production function of which the Cobb...... parameter estimates, but also in biased measures which are derived from the parameters, such as elasticities. Therefore, we propose to use non-parametric econometric methods. First, these can be applied to verify the functional form used in parametric production analysis. Second, they can be directly used...

  13. Digital spectral analysis parametric, non-parametric and advanced methods

    CERN Document Server

    Castanié, Francis

    2013-01-01

    Digital Spectral Analysis provides a single source that offers complete coverage of the spectral analysis domain. This self-contained work includes details on advanced topics that are usually presented in scattered sources throughout the literature.The theoretical principles necessary for the understanding of spectral analysis are discussed in the first four chapters: fundamentals, digital signal processing, estimation in spectral analysis, and time-series models.An entire chapter is devoted to the non-parametric methods most widely used in industry.High resolution methods a

  14. Using non-parametric methods in econometric production analysis

    DEFF Research Database (Denmark)

    Czekaj, Tomasz Gerard; Henningsen, Arne

    Econometric estimation of production functions is one of the most common methods in applied economic production analysis. These studies usually apply parametric estimation techniques, which obligate the researcher to specify the functional form of the production function. Most often, the Cobb...... results—including measures that are of interest of applied economists, such as elasticities. Therefore, we propose to use nonparametric econometric methods. First, they can be applied to verify the functional form used in parametric estimations of production functions. Second, they can be directly used...

  15. STATCAT, Statistical Analysis of Parametric and Non-Parametric Data

    International Nuclear Information System (INIS)

    David, Hugh

    1990-01-01

    1 - Description of program or function: A suite of 26 programs designed to facilitate the appropriate statistical analysis and data handling of parametric and non-parametric data, using classical and modern univariate and multivariate methods. 2 - Method of solution: Data is read entry by entry, using a choice of input formats, and the resultant data bank is checked for out-of- range, rare, extreme or missing data. The completed STATCAT data bank can be treated by a variety of descriptive and inferential statistical methods, and modified, using other standard programs as required

  16. Glaucoma Monitoring in a Clinical Setting Glaucoma Progression Analysis vs Nonparametric Progression Analysis in the Groningen Longitudinal Glaucoma Study

    NARCIS (Netherlands)

    Wesselink, Christiaan; Heeg, Govert P.; Jansonius, Nomdo M.

    Objective: To compare prospectively 2 perimetric progression detection algorithms for glaucoma, the Early Manifest Glaucoma Trial algorithm (glaucoma progression analysis [GPA]) and a nonparametric algorithm applied to the mean deviation (MD) (nonparametric progression analysis [NPA]). Methods:

  17. Multi-Directional Non-Parametric Analysis of Agricultural Efficiency

    DEFF Research Database (Denmark)

    Balezentis, Tomas

    This thesis seeks to develop methodologies for assessment of agricultural efficiency and employ them to Lithuanian family farms. In particular, we focus on three particular objectives throughout the research: (i) to perform a fully non-parametric analysis of efficiency effects, (ii) to extend...... to the Multi-Directional Efficiency Analysis approach when the proposed models were employed to analyse empirical data of Lithuanian family farm performance, we saw substantial differences in efficiencies associated with different inputs. In particular, assets appeared to be the least efficiently used input...... relative to labour, intermediate consumption and land (in some cases land was not treated as a discretionary input). These findings call for further research on relationships among financial structure, investment decisions, and efficiency in Lithuanian family farms. Application of different techniques...

  18. CADDIS Volume 4. Data Analysis: PECBO Appendix - R Scripts for Non-Parametric Regressions

    Science.gov (United States)

    Script for computing nonparametric regression analysis. Overview of using scripts to infer environmental conditions from biological observations, statistically estimating species-environment relationships, statistical scripts.

  19. Discrete non-parametric kernel estimation for global sensitivity analysis

    International Nuclear Information System (INIS)

    Senga Kiessé, Tristan; Ventura, Anne

    2016-01-01

    This work investigates the discrete kernel approach for evaluating the contribution of the variance of discrete input variables to the variance of model output, via analysis of variance (ANOVA) decomposition. Until recently only the continuous kernel approach has been applied as a metamodeling approach within sensitivity analysis framework, for both discrete and continuous input variables. Now the discrete kernel estimation is known to be suitable for smoothing discrete functions. We present a discrete non-parametric kernel estimator of ANOVA decomposition of a given model. An estimator of sensitivity indices is also presented with its asymtotic convergence rate. Some simulations on a test function analysis and a real case study from agricultural have shown that the discrete kernel approach outperforms the continuous kernel one for evaluating the contribution of moderate or most influential discrete parameters to the model output. - Highlights: • We study a discrete kernel estimation for sensitivity analysis of a model. • A discrete kernel estimator of ANOVA decomposition of the model is presented. • Sensitivity indices are calculated for discrete input parameters. • An estimator of sensitivity indices is also presented with its convergence rate. • An application is realized for improving the reliability of environmental models.

  20. Bayesian nonparametric meta-analysis using Polya tree mixture models.

    Science.gov (United States)

    Branscum, Adam J; Hanson, Timothy E

    2008-09-01

    Summary. A common goal in meta-analysis is estimation of a single effect measure using data from several studies that are each designed to address the same scientific inquiry. Because studies are typically conducted in geographically disperse locations, recent developments in the statistical analysis of meta-analytic data involve the use of random effects models that account for study-to-study variability attributable to differences in environments, demographics, genetics, and other sources that lead to heterogeneity in populations. Stemming from asymptotic theory, study-specific summary statistics are modeled according to normal distributions with means representing latent true effect measures. A parametric approach subsequently models these latent measures using a normal distribution, which is strictly a convenient modeling assumption absent of theoretical justification. To eliminate the influence of overly restrictive parametric models on inferences, we consider a broader class of random effects distributions. We develop a novel hierarchical Bayesian nonparametric Polya tree mixture (PTM) model. We present methodology for testing the PTM versus a normal random effects model. These methods provide researchers a straightforward approach for conducting a sensitivity analysis of the normality assumption for random effects. An application involving meta-analysis of epidemiologic studies designed to characterize the association between alcohol consumption and breast cancer is presented, which together with results from simulated data highlight the performance of PTMs in the presence of nonnormality of effect measures in the source population.

  1. The Use of Nonparametric Kernel Regression Methods in Econometric Production Analysis

    DEFF Research Database (Denmark)

    Czekaj, Tomasz Gerard

    and nonparametric estimations of production functions in order to evaluate the optimal firm size. The second paper discusses the use of parametric and nonparametric regression methods to estimate panel data regression models. The third paper analyses production risk, price uncertainty, and farmers' risk preferences...... within a nonparametric panel data regression framework. The fourth paper analyses the technical efficiency of dairy farms with environmental output using nonparametric kernel regression in a semiparametric stochastic frontier analysis. The results provided in this PhD thesis show that nonparametric......This PhD thesis addresses one of the fundamental problems in applied econometric analysis, namely the econometric estimation of regression functions. The conventional approach to regression analysis is the parametric approach, which requires the researcher to specify the form of the regression...

  2. Non-Parametric Analysis of Rating Transition and Default Data

    DEFF Research Database (Denmark)

    Fledelius, Peter; Lando, David; Perch Nielsen, Jens

    2004-01-01

    We demonstrate the use of non-parametric intensity estimation - including construction of pointwise confidence sets - for analyzing rating transition data. We find that transition intensities away from the class studied here for illustration strongly depend on the direction of the previous move b...

  3. Analysis of small sample size studies using nonparametric bootstrap test with pooled resampling method.

    Science.gov (United States)

    Dwivedi, Alok Kumar; Mallawaarachchi, Indika; Alvarado, Luis A

    2017-06-30

    Experimental studies in biomedical research frequently pose analytical problems related to small sample size. In such studies, there are conflicting findings regarding the choice of parametric and nonparametric analysis, especially with non-normal data. In such instances, some methodologists questioned the validity of parametric tests and suggested nonparametric tests. In contrast, other methodologists found nonparametric tests to be too conservative and less powerful and thus preferred using parametric tests. Some researchers have recommended using a bootstrap test; however, this method also has small sample size limitation. We used a pooled method in nonparametric bootstrap test that may overcome the problem related with small samples in hypothesis testing. The present study compared nonparametric bootstrap test with pooled resampling method corresponding to parametric, nonparametric, and permutation tests through extensive simulations under various conditions and using real data examples. The nonparametric pooled bootstrap t-test provided equal or greater power for comparing two means as compared with unpaired t-test, Welch t-test, Wilcoxon rank sum test, and permutation test while maintaining type I error probability for any conditions except for Cauchy and extreme variable lognormal distributions. In such cases, we suggest using an exact Wilcoxon rank sum test. Nonparametric bootstrap paired t-test also provided better performance than other alternatives. Nonparametric bootstrap test provided benefit over exact Kruskal-Wallis test. We suggest using nonparametric bootstrap test with pooled resampling method for comparing paired or unpaired means and for validating the one way analysis of variance test results for non-normal data in small sample size studies. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  4. Categorical and nonparametric data analysis choosing the best statistical technique

    CERN Document Server

    Nussbaum, E Michael

    2014-01-01

    Featuring in-depth coverage of categorical and nonparametric statistics, this book provides a conceptual framework for choosing the most appropriate type of test in various research scenarios. Class tested at the University of Nevada, the book's clear explanations of the underlying assumptions, computer simulations, and Exploring the Concept boxes help reduce reader anxiety. Problems inspired by actual studies provide meaningful illustrations of the techniques. The underlying assumptions of each test and the factors that impact validity and statistical power are reviewed so readers can explain

  5. An example of multidimensional analysis: Discriminant analysis

    International Nuclear Information System (INIS)

    Lutz, P.

    1990-01-01

    Among the approaches on the data multi-dimensional analysis, lectures on the discriminant analysis including theoretical and practical aspects are presented. The discrimination problem, the analysis steps and the discrimination categories are stressed. Examples on the descriptive historical analysis, the discrimination for decision making, the demonstration and separation of the top quark are given. In the linear discriminant analysis the following subjects are discussed: Huyghens theorem, projection, discriminant variable, geometrical interpretation, case for g=2, classification method, separation of the top events. Criteria allowing the obtention of relevant results are included [fr

  6. Theoretical remarks on the statistics of three discriminants in Piety's automated signature analysis of PSD [Power Spectral Density] data

    International Nuclear Information System (INIS)

    Behringer, K.; Spiekerman, G.

    1984-01-01

    Piety (1977) proposed an automated signature analysis of power spectral density data. Eight statistical decision discriminants are introduced. For nearly all the discriminants, improved confidence statements can be made. The statistical characteristics of the last three discriminants, which are applications of non-parametric tests, are considered. (author)

  7. Hierarchical Discriminant Analysis

    Directory of Open Access Journals (Sweden)

    Di Lu

    2018-01-01

    Full Text Available The Internet of Things (IoT generates lots of high-dimensional sensor intelligent data. The processing of high-dimensional data (e.g., data visualization and data classification is very difficult, so it requires excellent subspace learning algorithms to learn a latent subspace to preserve the intrinsic structure of the high-dimensional data, and abandon the least useful information in the subsequent processing. In this context, many subspace learning algorithms have been presented. However, in the process of transforming the high-dimensional data into the low-dimensional space, the huge difference between the sum of inter-class distance and the sum of intra-class distance for distinct data may cause a bias problem. That means that the impact of intra-class distance is overwhelmed. To address this problem, we propose a novel algorithm called Hierarchical Discriminant Analysis (HDA. It minimizes the sum of intra-class distance first, and then maximizes the sum of inter-class distance. This proposed method balances the bias from the inter-class and that from the intra-class to achieve better performance. Extensive experiments are conducted on several benchmark face datasets. The results reveal that HDA obtains better performance than other dimensionality reduction algorithms.

  8. Non-parametric correlative uncertainty quantification and sensitivity analysis: Application to a Langmuir bimolecular adsorption model

    Science.gov (United States)

    Feng, Jinchao; Lansford, Joshua; Mironenko, Alexander; Pourkargar, Davood Babaei; Vlachos, Dionisios G.; Katsoulakis, Markos A.

    2018-03-01

    We propose non-parametric methods for both local and global sensitivity analysis of chemical reaction models with correlated parameter dependencies. The developed mathematical and statistical tools are applied to a benchmark Langmuir competitive adsorption model on a close packed platinum surface, whose parameters, estimated from quantum-scale computations, are correlated and are limited in size (small data). The proposed mathematical methodology employs gradient-based methods to compute sensitivity indices. We observe that ranking influential parameters depends critically on whether or not correlations between parameters are taken into account. The impact of uncertainty in the correlation and the necessity of the proposed non-parametric perspective are demonstrated.

  9. Non-parametric correlative uncertainty quantification and sensitivity analysis: Application to a Langmuir bimolecular adsorption model

    Directory of Open Access Journals (Sweden)

    Jinchao Feng

    2018-03-01

    Full Text Available We propose non-parametric methods for both local and global sensitivity analysis of chemical reaction models with correlated parameter dependencies. The developed mathematical and statistical tools are applied to a benchmark Langmuir competitive adsorption model on a close packed platinum surface, whose parameters, estimated from quantum-scale computations, are correlated and are limited in size (small data. The proposed mathematical methodology employs gradient-based methods to compute sensitivity indices. We observe that ranking influential parameters depends critically on whether or not correlations between parameters are taken into account. The impact of uncertainty in the correlation and the necessity of the proposed non-parametric perspective are demonstrated.

  10. Using discriminant analysis as a nucleation event classification method

    Directory of Open Access Journals (Sweden)

    S. Mikkonen

    2006-01-01

    Full Text Available More than three years of measurements of aerosol size-distribution and different gas and meteorological parameters made in Po Valley, Italy were analysed for this study to examine which of the meteorological and trace gas variables effect on the emergence of nucleation events. As the analysis method, we used discriminant analysis with non-parametric Epanechnikov kernel, included in non-parametric density estimation method. The best classification result in our data was reached with the combination of relative humidity, ozone concentration and a third degree polynomial of radiation. RH appeared to have a preventing effect on the new particle formation whereas the effects of O3 and radiation were more conductive. The concentration of SO2 and NO2 also appeared to have significant effect on the emergence of nucleation events but because of the great amount of missing observations, we had to exclude them from the final analysis.

  11. Non-parametric production analysis of pesticides use in the Netherlands

    NARCIS (Netherlands)

    Oude Lansink, A.G.J.M.; Silva, E.

    2004-01-01

    Many previous empirical studies on the productivity of pesticides suggest that pesticides are under-utilized in agriculture despite the general held believe that these inputs are substantially over-utilized. This paper uses data envelopment analysis (DEA) to calculate non-parametric measures of the

  12. Scale-Free Nonparametric Factor Analysis: A User-Friendly Introduction with Concrete Heuristic Examples.

    Science.gov (United States)

    Mittag, Kathleen Cage

    Most researchers using factor analysis extract factors from a matrix of Pearson product-moment correlation coefficients. A method is presented for extracting factors in a non-parametric way, by extracting factors from a matrix of Spearman rho (rank correlation) coefficients. It is possible to factor analyze a matrix of association such that…

  13. Data analysis with small samples and non-normal data nonparametrics and other strategies

    CERN Document Server

    Siebert, Carl F

    2017-01-01

    Written in everyday language for non-statisticians, this book provides all the information needed to successfully conduct nonparametric analyses. This ideal reference book provides step-by-step instructions to lead the reader through each analysis, screenshots of the software and output, and case scenarios to illustrate of all the analytic techniques.

  14. Nonparametric inference in nonlinear principal components analysis : exploration and beyond

    NARCIS (Netherlands)

    Linting, Mariëlle

    2007-01-01

    In the social and behavioral sciences, data sets often do not meet the assumptions of traditional analysis methods. Therefore, nonlinear alternatives to traditional methods have been developed. This thesis starts with a didactic discussion of nonlinear principal components analysis (NLPCA),

  15. Multilevel Latent Class Analysis: Parametric and Nonparametric Models

    Science.gov (United States)

    Finch, W. Holmes; French, Brian F.

    2014-01-01

    Latent class analysis is an analytic technique often used in educational and psychological research to identify meaningful groups of individuals within a larger heterogeneous population based on a set of variables. This technique is flexible, encompassing not only a static set of variables but also longitudinal data in the form of growth mixture…

  16. Driving Style Analysis Using Primitive Driving Patterns With Bayesian Nonparametric Approaches

    OpenAIRE

    Wang, Wenshuo; Xi, Junqiang; Zhao, Ding

    2017-01-01

    Analysis and recognition of driving styles are profoundly important to intelligent transportation and vehicle calibration. This paper presents a novel driving style analysis framework using the primitive driving patterns learned from naturalistic driving data. In order to achieve this, first, a Bayesian nonparametric learning method based on a hidden semi-Markov model (HSMM) is introduced to extract primitive driving patterns from time series driving data without prior knowledge of the number...

  17. Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data

    DEFF Research Database (Denmark)

    Tan, Qihua; Thomassen, Mads; Burton, Mark

    2017-01-01

    the heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that the non-parametric nature in the generalized correlation analysis could be an useful and efficient tool for analyzing microarray...... time-course data and for exploring the complex relationships in the omics data for studying their association with disease and health....

  18. Orthogonal sparse linear discriminant analysis

    Science.gov (United States)

    Liu, Zhonghua; Liu, Gang; Pu, Jiexin; Wang, Xiaohong; Wang, Haijun

    2018-03-01

    Linear discriminant analysis (LDA) is a linear feature extraction approach, and it has received much attention. On the basis of LDA, researchers have done a lot of research work on it, and many variant versions of LDA were proposed. However, the inherent problem of LDA cannot be solved very well by the variant methods. The major disadvantages of the classical LDA are as follows. First, it is sensitive to outliers and noises. Second, only the global discriminant structure is preserved, while the local discriminant information is ignored. In this paper, we present a new orthogonal sparse linear discriminant analysis (OSLDA) algorithm. The k nearest neighbour graph is first constructed to preserve the locality discriminant information of sample points. Then, L2,1-norm constraint on the projection matrix is used to act as loss function, which can make the proposed method robust to outliers in data points. Extensive experiments have been performed on several standard public image databases, and the experiment results demonstrate the performance of the proposed OSLDA algorithm.

  19. Nonparametric Bounds and Sensitivity Analysis of Treatment Effects

    Science.gov (United States)

    Richardson, Amy; Hudgens, Michael G.; Gilbert, Peter B.; Fine, Jason P.

    2015-01-01

    This paper considers conducting inference about the effect of a treatment (or exposure) on an outcome of interest. In the ideal setting where treatment is assigned randomly, under certain assumptions the treatment effect is identifiable from the observable data and inference is straightforward. However, in other settings such as observational studies or randomized trials with noncompliance, the treatment effect is no longer identifiable without relying on untestable assumptions. Nonetheless, the observable data often do provide some information about the effect of treatment, that is, the parameter of interest is partially identifiable. Two approaches are often employed in this setting: (i) bounds are derived for the treatment effect under minimal assumptions, or (ii) additional untestable assumptions are invoked that render the treatment effect identifiable and then sensitivity analysis is conducted to assess how inference about the treatment effect changes as the untestable assumptions are varied. Approaches (i) and (ii) are considered in various settings, including assessing principal strata effects, direct and indirect effects and effects of time-varying exposures. Methods for drawing formal inference about partially identified parameters are also discussed. PMID:25663743

  20. Bayesian nonparametric estimation of continuous monotone functions with applications to dose-response analysis.

    Science.gov (United States)

    Bornkamp, Björn; Ickstadt, Katja

    2009-03-01

    In this article, we consider monotone nonparametric regression in a Bayesian framework. The monotone function is modeled as a mixture of shifted and scaled parametric probability distribution functions, and a general random probability measure is assumed as the prior for the mixing distribution. We investigate the choice of the underlying parametric distribution function and find that the two-sided power distribution function is well suited both from a computational and mathematical point of view. The model is motivated by traditional nonlinear models for dose-response analysis, and provides possibilities to elicitate informative prior distributions on different aspects of the curve. The method is compared with other recent approaches to monotone nonparametric regression in a simulation study and is illustrated on a data set from dose-response analysis.

  1. The Discriminant Analysis Flare Forecasting System (DAFFS)

    Science.gov (United States)

    Leka, K. D.; Barnes, Graham; Wagner, Eric; Hill, Frank; Marble, Andrew R.

    2016-05-01

    The Discriminant Analysis Flare Forecasting System (DAFFS) has been developed under NOAA/Small Business Innovative Research funds to quantitatively improve upon the NOAA/SWPC flare prediction. In the Phase-I of this project, it was demonstrated that DAFFS could indeed improve by the requested 25% most of the standard flare prediction data products from NOAA/SWPC. In the Phase-II of this project, a prototype has been developed and is presently running autonomously at NWRA.DAFFS uses near-real-time data from NOAA/GOES, SDO/HMI, and the NSO/GONG network to issue both region- and full-disk forecasts of solar flares, based on multi-variable non-parametric Discriminant Analysis. Presently, DAFFS provides forecasts which match those provided by NOAA/SWPC in terms of thresholds and validity periods (including 1-, 2-, and 3- day forecasts), although issued twice daily. Of particular note regarding DAFFS capabilities are the redundant system design, automatically-generated validation statistics and the large range of customizable options available. As part of this poster, a description of the data used, algorithm, performance and customizable options will be presented, as well as a demonstration of the DAFFS prototype.DAFFS development at NWRA is supported by NOAA/SBIR contracts WC-133R-13-CN-0079 and WC-133R-14-CN-0103, with additional support from NASA contract NNH12CG10C, plus acknowledgment to the SDO/HMI and NSO/GONG facilities and NOAA/SWPC personnel for data products, support, and feedback. DAFFS is presently ready for Phase-III development.

  2. Discriminant analysis of normal and malignant breast tissue based upon INAA investigation of elemental concentration

    International Nuclear Information System (INIS)

    Kwanhoong Ng; Senghuat Ong; Bradley, D.A.; Laimeng Looi

    1997-01-01

    Discriminant analysis of six trace element concentrations measured by instrumental neutron activation analysis (INAA) in 26 paired-samples of malignant and histologically normal human breast tissues shows the technique to be a potentially valuable clinical tool for making malignant-normal classification. Nonparametric discriminant analysis is performed for the data obtained. Linear and quadratic discriminant analyses are also carried out for comparison. For this data set a formal analysis shows that the elements which may be useful in distinguishing between malignant and normal tissues are Ca, Rb and Br, providing correct classification for 24 out of 26 normal samples and 22 out of 26 malignant samples. (Author)

  3. Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data.

    Science.gov (United States)

    Tan, Qihua; Thomassen, Mads; Burton, Mark; Mose, Kristian Fredløv; Andersen, Klaus Ejner; Hjelmborg, Jacob; Kruse, Torben

    2017-06-06

    Modeling complex time-course patterns is a challenging issue in microarray study due to complex gene expression patterns in response to the time-course experiment. We introduce the generalized correlation coefficient and propose a combinatory approach for detecting, testing and clustering the heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that the non-parametric nature in the generalized correlation analysis could be an useful and efficient tool for analyzing microarray time-course data and for exploring the complex relationships in the omics data for studying their association with disease and health.

  4. Short-term forecasting of meteorological time series using Nonparametric Functional Data Analysis (NPFDA)

    Science.gov (United States)

    Curceac, S.; Ternynck, C.; Ouarda, T.

    2015-12-01

    Over the past decades, a substantial amount of research has been conducted to model and forecast climatic variables. In this study, Nonparametric Functional Data Analysis (NPFDA) methods are applied to forecast air temperature and wind speed time series in Abu Dhabi, UAE. The dataset consists of hourly measurements recorded for a period of 29 years, 1982-2010. The novelty of the Functional Data Analysis approach is in expressing the data as curves. In the present work, the focus is on daily forecasting and the functional observations (curves) express the daily measurements of the above mentioned variables. We apply a non-linear regression model with a functional non-parametric kernel estimator. The computation of the estimator is performed using an asymmetrical quadratic kernel function for local weighting based on the bandwidth obtained by a cross validation procedure. The proximities between functional objects are calculated by families of semi-metrics based on derivatives and Functional Principal Component Analysis (FPCA). Additionally, functional conditional mode and functional conditional median estimators are applied and the advantages of combining their results are analysed. A different approach employs a SARIMA model selected according to the minimum Akaike (AIC) and Bayessian (BIC) Information Criteria and based on the residuals of the model. The performance of the models is assessed by calculating error indices such as the root mean square error (RMSE), relative RMSE, BIAS and relative BIAS. The results indicate that the NPFDA models provide more accurate forecasts than the SARIMA models. Key words: Nonparametric functional data analysis, SARIMA, time series forecast, air temperature, wind speed

  5. Nonparametric bootstrap analysis with applications to demographic effects in demand functions.

    Science.gov (United States)

    Gozalo, P L

    1997-12-01

    "A new bootstrap proposal, labeled smooth conditional moment (SCM) bootstrap, is introduced for independent but not necessarily identically distributed data, where the classical bootstrap procedure fails.... A good example of the benefits of using nonparametric and bootstrap methods is the area of empirical demand analysis. In particular, we will be concerned with their application to the study of two important topics: what are the most relevant effects of household demographic variables on demand behavior, and to what extent present parametric specifications capture these effects." excerpt

  6. CATDAT - A program for parametric and nonparametric categorical data analysis user's manual, Version 1.0

    International Nuclear Information System (INIS)

    Peterson, James R.; Haas, Timothy C.; Lee, Danny C.

    2000-01-01

    Natural resource professionals are increasingly required to develop rigorous statistical models that relate environmental data to categorical responses data. Recent advances in the statistical and computing sciences have led to the development of sophisticated methods for parametric and nonparametric analysis of data with categorical responses. The statistical software package CATDAT was designed to make some of these relatively new and powerful techniques available to scientists. The CATDAT statistical package includes 4 analytical techniques: generalized logit modeling; binary classification tree; extended K-nearest neighbor classification; and modular neural network

  7. Trend Analysis of Pahang River Using Non-Parametric Analysis: Mann Kendalls Trend Test

    International Nuclear Information System (INIS)

    Nur Hishaam Sulaiman; Mohd Khairul Amri Kamarudin; Mohd Khairul Amri Kamarudin; Ahmad Dasuki Mustafa; Muhammad Azizi Amran; Fazureen Azaman; Ismail Zainal Abidin; Norsyuhada Hairoma

    2015-01-01

    Flood is common in Pahang especially during northeast monsoon season from November to February. Three river cross station: Lubuk Paku, Sg. Yap and Temerloh were selected as area of this study. The stream flow and water level data were gathered from DID record. Data set for this study were analysed by using non-parametric analysis, Mann-Kendall Trend Test. The results that obtained from stream flow and water level analysis indicate that there are positively significant trend for Lubuk Paku (0.001) and Sg. Yap (<0.0001) from 1972-2011 with the p-value < 0.05. Temerloh (0.178) data from 1963-2011 recorded no trend for stream flow parameter but negative trend for water level parameter. Hydrological pattern and trend are extremely affected by outside factors such as north east monsoon season that occurred in South China Sea and affected Pahang during November to March. There are other factors such as development and management of the areas which can be considered as factors affected the data and results. Hydrological Pattern is important to indicate the river trend such as stream flow and water level. It can be used as flood mitigation by local authorities. (author)

  8. Bayesian Nonparametric Regression Analysis of Data with Random Effects Covariates from Longitudinal Measurements

    KAUST Repository

    Ryu, Duchwan

    2010-09-28

    We consider nonparametric regression analysis in a generalized linear model (GLM) framework for data with covariates that are the subject-specific random effects of longitudinal measurements. The usual assumption that the effects of the longitudinal covariate processes are linear in the GLM may be unrealistic and if this happens it can cast doubt on the inference of observed covariate effects. Allowing the regression functions to be unknown, we propose to apply Bayesian nonparametric methods including cubic smoothing splines or P-splines for the possible nonlinearity and use an additive model in this complex setting. To improve computational efficiency, we propose the use of data-augmentation schemes. The approach allows flexible covariance structures for the random effects and within-subject measurement errors of the longitudinal processes. The posterior model space is explored through a Markov chain Monte Carlo (MCMC) sampler. The proposed methods are illustrated and compared to other approaches, the "naive" approach and the regression calibration, via simulations and by an application that investigates the relationship between obesity in adulthood and childhood growth curves. © 2010, The International Biometric Society.

  9. Genomic outlier profile analysis: mixture models, null hypotheses, and nonparametric estimation.

    Science.gov (United States)

    Ghosh, Debashis; Chinnaiyan, Arul M

    2009-01-01

    In most analyses of large-scale genomic data sets, differential expression analysis is typically assessed by testing for differences in the mean of the distributions between 2 groups. A recent finding by Tomlins and others (2005) is of a different type of pattern of differential expression in which a fraction of samples in one group have overexpression relative to samples in the other group. In this work, we describe a general mixture model framework for the assessment of this type of expression, called outlier profile analysis. We start by considering the single-gene situation and establishing results on identifiability. We propose 2 nonparametric estimation procedures that have natural links to familiar multiple testing procedures. We then develop multivariate extensions of this methodology to handle genome-wide measurements. The proposed methodologies are compared using simulation studies as well as data from a prostate cancer gene expression study.

  10. Statistical analysis using the Bayesian nonparametric method for irradiation embrittlement of reactor pressure vessels

    Energy Technology Data Exchange (ETDEWEB)

    Takamizawa, Hisashi, E-mail: takamizawa.hisashi@jaea.go.jp; Itoh, Hiroto, E-mail: ito.hiroto@jaea.go.jp; Nishiyama, Yutaka, E-mail: nishiyama.yutaka93@jaea.go.jp

    2016-10-15

    In order to understand neutron irradiation embrittlement in high fluence regions, statistical analysis using the Bayesian nonparametric (BNP) method was performed for the Japanese surveillance and material test reactor irradiation database. The BNP method is essentially expressed as an infinite summation of normal distributions, with input data being subdivided into clusters with identical statistical parameters, such as mean and standard deviation, for each cluster to estimate shifts in ductile-to-brittle transition temperature (DBTT). The clusters typically depend on chemical compositions, irradiation conditions, and the irradiation embrittlement. Specific variables contributing to the irradiation embrittlement include the content of Cu, Ni, P, Si, and Mn in the pressure vessel steels, neutron flux, neutron fluence, and irradiation temperatures. It was found that the measured shifts of DBTT correlated well with the calculated ones. Data associated with the same materials were subdivided into the same clusters even if neutron fluences were increased.

  11. Performances of non-parametric statistics in sensitivity analysis and parameter ranking

    International Nuclear Information System (INIS)

    Saltelli, A.

    1987-01-01

    Twelve parametric and non-parametric sensitivity analysis techniques are compared in the case of non-linear model responses. The test models used are taken from the long-term risk analysis for the disposal of high level radioactive waste in a geological formation. They describe the transport of radionuclides through a set of engineered and natural barriers from the repository to the biosphere and to man. The output data from these models are the dose rates affecting the maximum exposed individual of a critical group at a given point in time. All the techniques are applied to the output from the same Monte Carlo simulations, where a modified version of Latin Hypercube method is used for the sample selection. Hypothesis testing is systematically applied to quantify the degree of confidence in the results given by the various sensitivity estimators. The estimators are ranked according to their robustness and stability, on the basis of two test cases. The conclusions are that no estimator can be considered the best from all points of view and recommend the use of more than just one estimator in sensitivity analysis

  12. Single molecule force spectroscopy at high data acquisition: A Bayesian nonparametric analysis

    Science.gov (United States)

    Sgouralis, Ioannis; Whitmore, Miles; Lapidus, Lisa; Comstock, Matthew J.; Pressé, Steve

    2018-03-01

    Bayesian nonparametrics (BNPs) are poised to have a deep impact in the analysis of single molecule data as they provide posterior probabilities over entire models consistent with the supplied data, not just model parameters of one preferred model. Thus they provide an elegant and rigorous solution to the difficult problem encountered when selecting an appropriate candidate model. Nevertheless, BNPs' flexibility to learn models and their associated parameters from experimental data is a double-edged sword. Most importantly, BNPs are prone to increasing the complexity of the estimated models due to artifactual features present in time traces. Thus, because of experimental challenges unique to single molecule methods, naive application of available BNP tools is not possible. Here we consider traces with time correlations and, as a specific example, we deal with force spectroscopy traces collected at high acquisition rates. While high acquisition rates are required in order to capture dwells in short-lived molecular states, in this setup, a slow response of the optical trap instrumentation (i.e., trapped beads, ambient fluid, and tethering handles) distorts the molecular signals introducing time correlations into the data that may be misinterpreted as true states by naive BNPs. Our adaptation of BNP tools explicitly takes into consideration these response dynamics, in addition to drift and noise, and makes unsupervised time series analysis of correlated single molecule force spectroscopy measurements possible, even at acquisition rates similar to or below the trap's response times.

  13. Nonparametric Bayesian inference for mean residual life functions in survival analysis.

    Science.gov (United States)

    Poynor, Valerie; Kottas, Athanasios

    2018-01-19

    Modeling and inference for survival analysis problems typically revolves around different functions related to the survival distribution. Here, we focus on the mean residual life (MRL) function, which provides the expected remaining lifetime given that a subject has survived (i.e. is event-free) up to a particular time. This function is of direct interest in reliability, medical, and actuarial fields. In addition to its practical interpretation, the MRL function characterizes the survival distribution. We develop general Bayesian nonparametric inference for MRL functions built from a Dirichlet process mixture model for the associated survival distribution. The resulting model for the MRL function admits a representation as a mixture of the kernel MRL functions with time-dependent mixture weights. This model structure allows for a wide range of shapes for the MRL function. Particular emphasis is placed on the selection of the mixture kernel, taken to be a gamma distribution, to obtain desirable properties for the MRL function arising from the mixture model. The inference method is illustrated with a data set of two experimental groups and a data set involving right censoring. The supplementary material available at Biostatistics online provides further results on empirical performance of the model, using simulated data examples. © The Author 2018. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  14. European regional efficiency and geographical externalities: a spatial nonparametric frontier analysis

    Science.gov (United States)

    Ramajo, Julián; Cordero, José Manuel; Márquez, Miguel Ángel

    2017-10-01

    This paper analyses region-level technical efficiency in nine European countries over the 1995-2007 period. We propose the application of a nonparametric conditional frontier approach to account for the presence of heterogeneous conditions in the form of geographical externalities. Such environmental factors are beyond the control of regional authorities, but may affect the production function. Therefore, they need to be considered in the frontier estimation. Specifically, a spatial autoregressive term is included as an external conditioning factor in a robust order- m model. Thus we can test the hypothesis of non-separability (the external factor impacts both the input-output space and the distribution of efficiencies), demonstrating the existence of significant global interregional spillovers into the production process. Our findings show that geographical externalities affect both the frontier level and the probability of being more or less efficient. Specifically, the results support the fact that the spatial lag variable has an inverted U-shaped non-linear impact on the performance of regions. This finding can be interpreted as a differential effect of interregional spillovers depending on the size of the neighboring economies: positive externalities for small values, possibly related to agglomeration economies, and negative externalities for high values, indicating the possibility of production congestion. Additionally, evidence of the existence of a strong geographic pattern of European regional efficiency is reported and the levels of technical efficiency are acknowledged to have converged during the period under analysis.

  15. Discriminant Analysis of Student Loan Applications

    Science.gov (United States)

    Dyl, Edward A.; McGann, Anthony F.

    1977-01-01

    The use of discriminant analysis in identifying potentially "good" versus potentially "bad" student loans is explained. The technique is applied to a sample of 200 student loan applications at the University of Wyoming. (LBH)

  16. USING DISCRIMINANT ANALYSIS IN RELATIONSHIP MARKETING

    OpenAIRE

    Iacob Catoiu; Mihai Èšichindelean; Simona Vinerean

    2013-01-01

    The purpose of the present paper is to describe and apply discriminant analysis withina relationship marketing context. The paper is structured into two parts; the first part contains aliterature review regarding the value chain concept and the dimensions it is built on, while thesecond part includes the results of applying discriminant analysis on several value chaindimensions. The authors have considered the client-company relationships of the gas-station marketas proper for studying the di...

  17. Credit scoring analysis using kernel discriminant

    Science.gov (United States)

    Widiharih, T.; Mukid, M. A.; Mustafid

    2018-05-01

    Credit scoring model is an important tool for reducing the risk of wrong decisions when granting credit facilities to applicants. This paper investigate the performance of kernel discriminant model in assessing customer credit risk. Kernel discriminant analysis is a non- parametric method which means that it does not require any assumptions about the probability distribution of the input. The main ingredient is a kernel that allows an efficient computation of Fisher discriminant. We use several kernel such as normal, epanechnikov, biweight, and triweight. The models accuracy was compared each other using data from a financial institution in Indonesia. The results show that kernel discriminant can be an alternative method that can be used to determine who is eligible for a credit loan. In the data we use, it shows that a normal kernel is relevant to be selected for credit scoring using kernel discriminant model. Sensitivity and specificity reach to 0.5556 and 0.5488 respectively.

  18. A Bayesian approach to the analysis of quantal bioassay studies using nonparametric mixture models.

    Science.gov (United States)

    Fronczyk, Kassandra; Kottas, Athanasios

    2014-03-01

    We develop a Bayesian nonparametric mixture modeling framework for quantal bioassay settings. The approach is built upon modeling dose-dependent response distributions. We adopt a structured nonparametric prior mixture model, which induces a monotonicity restriction for the dose-response curve. Particular emphasis is placed on the key risk assessment goal of calibration for the dose level that corresponds to a specified response. The proposed methodology yields flexible inference for the dose-response relationship as well as for other inferential objectives, as illustrated with two data sets from the literature. © 2013, The International Biometric Society.

  19. Does Private Tutoring Work? The Effectiveness of Private Tutoring: A Nonparametric Bounds Analysis

    Science.gov (United States)

    Hof, Stefanie

    2014-01-01

    Private tutoring has become popular throughout the world. However, evidence for the effect of private tutoring on students' academic outcome is inconclusive; therefore, this paper presents an alternative framework: a nonparametric bounds method. The present examination uses, for the first time, a large representative data-set in a European setting…

  20. Non-Parametric Kinetic (NPK Analysis of Thermal Oxidation of Carbon Aerogels

    Directory of Open Access Journals (Sweden)

    Azadeh Seifi

    2017-05-01

    Full Text Available In recent years, much attention has been paid to aerogel materials (especially carbon aerogels due to their potential uses in energy-related applications, such as thermal energy storage and thermal protection systems. These open cell carbon-based porous materials (carbon aerogels can strongly react with oxygen at relatively low temperatures (~ 400°C. Therefore, it is necessary to evaluate the thermal performance of carbon aerogels in view of their energy-related applications at high temperatures and under thermal oxidation conditions. The objective of this paper is to study theoretically and experimentally the oxidation reaction kinetics of carbon aerogel using the non-parametric kinetic (NPK as a powerful method. For this purpose, a non-isothermal thermogravimetric analysis, at three different heating rates, was performed on three samples each with its specific pore structure, density and specific surface area. The most significant feature of this method, in comparison with the model-free isoconversional methods, is its ability to separate the functionality of the reaction rate with the degree of conversion and temperature by the direct use of thermogravimetric data. Using this method, it was observed that the Nomen-Sempere model could provide the best fit to the data, while the temperature dependence of the rate constant was best explained by a Vogel-Fulcher relationship, where the reference temperature was the onset temperature of oxidation. Moreover, it was found from the results of this work that the assumption of the Arrhenius relation for the temperature dependence of the rate constant led to over-estimation of the apparent activation energy (up to 160 kJ/mol that was considerably different from the values (up to 3.5 kJ/mol predicted by the Vogel-Fulcher relationship in isoconversional methods

  1. A nonparametric approach to medical survival data: Uncertainty in the context of risk in mortality analysis

    International Nuclear Information System (INIS)

    Janurová, Kateřina; Briš, Radim

    2014-01-01

    Medical survival right-censored data of about 850 patients are evaluated to analyze the uncertainty related to the risk of mortality on one hand and compare two basic surgery techniques in the context of risk of mortality on the other hand. Colorectal data come from patients who underwent colectomy in the University Hospital of Ostrava. Two basic surgery operating techniques are used for the colectomy: either traditional (open) or minimally invasive (laparoscopic). Basic question arising at the colectomy operation is, which type of operation to choose to guarantee longer overall survival time. Two non-parametric approaches have been used to quantify probability of mortality with uncertainties. In fact, complement of the probability to one, i.e. survival function with corresponding confidence levels is calculated and evaluated. First approach considers standard nonparametric estimators resulting from both the Kaplan–Meier estimator of survival function in connection with Greenwood's formula and the Nelson–Aalen estimator of cumulative hazard function including confidence interval for survival function as well. The second innovative approach, represented by Nonparametric Predictive Inference (NPI), uses lower and upper probabilities for quantifying uncertainty and provides a model of predictive survival function instead of the population survival function. The traditional log-rank test on one hand and the nonparametric predictive comparison of two groups of lifetime data on the other hand have been compared to evaluate risk of mortality in the context of mentioned surgery techniques. The size of the difference between two groups of lifetime data has been considered and analyzed as well. Both nonparametric approaches led to the same conclusion, that the minimally invasive operating technique guarantees the patient significantly longer survival time in comparison with the traditional operating technique

  2. A non-parametric Data Envelopment Analysis approach for improving energy efficiency of grape production

    International Nuclear Information System (INIS)

    Khoshroo, Alireza; Mulwa, Richard; Emrouznejad, Ali; Arabi, Behrouz

    2013-01-01

    Grape is one of the world's largest fruit crops with approximately 67.5 million tonnes produced each year and energy is an important element in modern grape productions as it heavily depends on fossil and other energy resources. Efficient use of these energies is a necessary step toward reducing environmental hazards, preventing destruction of natural resources and ensuring agricultural sustainability. Hence, identifying excessive use of energy as well as reducing energy resources is the main focus of this paper to optimize energy consumption in grape production. In this study we use a two-stage methodology to find the association of energy efficiency and performance explained by farmers' specific characteristics. In the first stage a non-parametric Data Envelopment Analysis is used to model efficiencies as an explicit function of human labor, machinery, chemicals, FYM (farmyard manure), diesel fuel, electricity and water for irrigation energies. In the second step, farm specific variables such as farmers' age, gender, level of education and agricultural experience are used in a Tobit regression framework to explain how these factors influence efficiency of grape farming. The result of the first stage shows substantial inefficiency between the grape producers in the studied area while the second stage shows that the main difference between efficient and inefficient farmers was in the use of chemicals, diesel fuel and water for irrigation. The use of chemicals such as insecticides, herbicides and fungicides were considerably less than inefficient ones. The results revealed that the more educated farmers are more energy efficient in comparison with their less educated counterparts. - Highlights: • The focus of this paper is to identify excessive use of energy and optimize energy consumption in grape production. • We measure the efficiency as a function of labor/machinery/chemicals/farmyard manure/diesel-fuel/electricity/water. • Data were obtained from 41 grape

  3. Efficiency Analysis of German Electricity Distribution Utilities : Non-Parametric and Parametric Tests

    OpenAIRE

    von Hirschhausen, Christian R.; Cullmann, Astrid

    2005-01-01

    Abstract This paper applies parametric and non-parametric and parametric tests to assess the efficiency of electricity distribution companies in Germany. We address traditional issues in electricity sector benchmarking, such as the role of scale effects and optimal utility size, as well as new evidence specific to the situation in Germany. We use labour, capital, and peak load capacity as inputs, and units sold and the number of customers as output. The data cover 307 (out of 553) ...

  4. The 12-item World Health Organization Disability Assessment Schedule II (WHO-DAS II: a nonparametric item response analysis

    Directory of Open Access Journals (Sweden)

    Fernandez Ana

    2010-05-01

    Full Text Available Abstract Background Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF in men and women. Methods The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (Kernel-Smoothing implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves and their options (option characteristic curves in discriminating between changes in the underliying disability level. We examined composite DIF to know whether women had a higher probability than men of endorsing each item. Results Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.

  5. Bayesian Sensitivity Analysis of a Nonlinear Dynamic Factor Analysis Model with Nonparametric Prior and Possible Nonignorable Missingness.

    Science.gov (United States)

    Tang, Niansheng; Chow, Sy-Miin; Ibrahim, Joseph G; Zhu, Hongtu

    2017-12-01

    Many psychological concepts are unobserved and usually represented as latent factors apprehended through multiple observed indicators. When multiple-subject multivariate time series data are available, dynamic factor analysis models with random effects offer one way of modeling patterns of within- and between-person variations by combining factor analysis and time series analysis at the factor level. Using the Dirichlet process (DP) as a nonparametric prior for individual-specific time series parameters further allows the distributional forms of these parameters to deviate from commonly imposed (e.g., normal or other symmetric) functional forms, arising as a result of these parameters' restricted ranges. Given the complexity of such models, a thorough sensitivity analysis is critical but computationally prohibitive. We propose a Bayesian local influence method that allows for simultaneous sensitivity analysis of multiple modeling components within a single fitting of the model of choice. Five illustrations and an empirical example are provided to demonstrate the utility of the proposed approach in facilitating the detection of outlying cases and common sources of misspecification in dynamic factor analysis models, as well as identification of modeling components that are sensitive to changes in the DP prior specification.

  6. Parametric and Nonparametric EEG Analysis for the Evaluation of EEG Activity in Young Children with Controlled Epilepsy

    Directory of Open Access Journals (Sweden)

    Vangelis Sakkalis

    2008-01-01

    Full Text Available There is an important evidence of differences in the EEG frequency spectrum of control subjects as compared to epileptic subjects. In particular, the study of children presents difficulties due to the early stages of brain development and the various forms of epilepsy indications. In this study, we consider children that developed epileptic crises in the past but without any other clinical, psychological, or visible neurophysiological findings. The aim of the paper is to develop reliable techniques for testing if such controlled epilepsy induces related spectral differences in the EEG. Spectral features extracted by using nonparametric, signal representation techniques (Fourier and wavelet transform and a parametric, signal modeling technique (ARMA are compared and their effect on the classification of the two groups is analyzed. The subjects performed two different tasks: a control (rest task and a relatively difficult math task. The results show that spectral features extracted by modeling the EEG signals recorded from individual channels by an ARMA model give a higher discrimination between the two subject groups for the control task, where classification scores of up to 100% were obtained with a linear discriminant classifier.

  7. Robust non-parametric one-sample tests for the analysis of recurrent events.

    Science.gov (United States)

    Rebora, Paola; Galimberti, Stefania; Valsecchi, Maria Grazia

    2010-12-30

    One-sample non-parametric tests are proposed here for inference on recurring events. The focus is on the marginal mean function of events and the basis for inference is the standardized distance between the observed and the expected number of events under a specified reference rate. Different weights are considered in order to account for various types of alternative hypotheses on the mean function of the recurrent events process. A robust version and a stratified version of the test are also proposed. The performance of these tests was investigated through simulation studies under various underlying event generation processes, such as homogeneous and nonhomogeneous Poisson processes, autoregressive and renewal processes, with and without frailty effects. The robust versions of the test have been shown to be suitable in a wide variety of event generating processes. The motivating context is a study on gene therapy in a very rare immunodeficiency in children, where a major end-point is the recurrence of severe infections. Robust non-parametric one-sample tests for recurrent events can be useful to assess efficacy and especially safety in non-randomized studies or in epidemiological studies for comparison with a standard population. Copyright © 2010 John Wiley & Sons, Ltd.

  8. Linear discriminant analysis for welding fault detection

    International Nuclear Information System (INIS)

    Li, X.; Simpson, S.W.

    2010-01-01

    This work presents a new method for real time welding fault detection in industry based on Linear Discriminant Analysis (LDA). A set of parameters was calculated from one second blocks of electrical data recorded during welding and based on control data from reference welds under good conditions, as well as faulty welds. Optimised linear combinations of the parameters were determined with LDA and tested with independent data. Short arc welds in overlap joints were studied with various power sources, shielding gases, wire diameters, and process geometries. Out-of-position faults were investigated. Application of LDA fault detection to a broad range of welding procedures was investigated using a similarity measure based on Principal Component Analysis. The measure determines which reference data are most similar to a given industrial procedure and the appropriate LDA weights are then employed. Overall, results show that Linear Discriminant Analysis gives an effective and consistent performance in real-time welding fault detection.

  9. Regularized Discriminant Analysis: A Large Dimensional Study

    KAUST Repository

    Yang, Xiaoke

    2018-04-28

    In this thesis, we focus on studying the performance of general regularized discriminant analysis (RDA) classifiers. The data used for analysis is assumed to follow Gaussian mixture model with different means and covariances. RDA offers a rich class of regularization options, covering as special cases the regularized linear discriminant analysis (RLDA) and the regularized quadratic discriminant analysis (RQDA) classi ers. We analyze RDA under the double asymptotic regime where the data dimension and the training size both increase in a proportional way. This double asymptotic regime allows for application of fundamental results from random matrix theory. Under the double asymptotic regime and some mild assumptions, we show that the asymptotic classification error converges to a deterministic quantity that only depends on the data statistical parameters and dimensions. This result not only implicates some mathematical relations between the misclassification error and the class statistics, but also can be leveraged to select the optimal parameters that minimize the classification error, thus yielding the optimal classifier. Validation results on the synthetic data show a good accuracy of our theoretical findings. We also construct a general consistent estimator to approximate the true classification error in consideration of the unknown previous statistics. We benchmark the performance of our proposed consistent estimator against classical estimator on synthetic data. The observations demonstrate that the general estimator outperforms others in terms of mean squared error (MSE).

  10. NParCov3: A SAS/IML Macro for Nonparametric Randomization-Based Analysis of Covariance

    Directory of Open Access Journals (Sweden)

    Richard C. Zink

    2012-07-01

    Full Text Available Analysis of covariance serves two important purposes in a randomized clinical trial. First, there is a reduction of variance for the treatment effect which provides more powerful statistical tests and more precise confidence intervals. Second, it provides estimates of the treatment effect which are adjusted for random imbalances of covariates between the treatment groups. The nonparametric analysis of covariance method of Koch, Tangen, Jung, and Amara (1998 defines a very general methodology using weighted least-squares to generate covariate-adjusted treatment effects with minimal assumptions. This methodology is general in its applicability to a variety of outcomes, whether continuous, binary, ordinal, incidence density or time-to-event. Further, its use has been illustrated in many clinical trial settings, such as multi-center, dose-response and non-inferiority trials.NParCov3 is a SAS/IML macro written to conduct the nonparametric randomization-based covariance analyses of Koch et al. (1998. The software can analyze a variety of outcomes and can account for stratification. Data from multiple clinical trials will be used for illustration.

  11. Nonparametric statistical inference

    CERN Document Server

    Gibbons, Jean Dickinson

    2014-01-01

    Thoroughly revised and reorganized, the fourth edition presents in-depth coverage of the theory and methods of the most widely used nonparametric procedures in statistical analysis and offers example applications appropriate for all areas of the social, behavioral, and life sciences. The book presents new material on the quantiles, the calculation of exact and simulated power, multiple comparisons, additional goodness-of-fit tests, methods of analysis of count data, and modern computer applications using MINITAB, SAS, and STATXACT. It includes tabular guides for simplified applications of tests and finding P values and confidence interval estimates.

  12. A menu-driven software package of Bayesian nonparametric (and parametric) mixed models for regression analysis and density estimation.

    Science.gov (United States)

    Karabatsos, George

    2017-02-01

    Most of applied statistics involves regression analysis of data. In practice, it is important to specify a regression model that has minimal assumptions which are not violated by data, to ensure that statistical inferences from the model are informative and not misleading. This paper presents a stand-alone and menu-driven software package, Bayesian Regression: Nonparametric and Parametric Models, constructed from MATLAB Compiler. Currently, this package gives the user a choice from 83 Bayesian models for data analysis. They include 47 Bayesian nonparametric (BNP) infinite-mixture regression models; 5 BNP infinite-mixture models for density estimation; and 31 normal random effects models (HLMs), including normal linear models. Each of the 78 regression models handles either a continuous, binary, or ordinal dependent variable, and can handle multi-level (grouped) data. All 83 Bayesian models can handle the analysis of weighted observations (e.g., for meta-analysis), and the analysis of left-censored, right-censored, and/or interval-censored data. Each BNP infinite-mixture model has a mixture distribution assigned one of various BNP prior distributions, including priors defined by either the Dirichlet process, Pitman-Yor process (including the normalized stable process), beta (two-parameter) process, normalized inverse-Gaussian process, geometric weights prior, dependent Dirichlet process, or the dependent infinite-probits prior. The software user can mouse-click to select a Bayesian model and perform data analysis via Markov chain Monte Carlo (MCMC) sampling. After the sampling completes, the software automatically opens text output that reports MCMC-based estimates of the model's posterior distribution and model predictive fit to the data. Additional text and/or graphical output can be generated by mouse-clicking other menu options. This includes output of MCMC convergence analyses, and estimates of the model's posterior predictive distribution, for selected

  13. A Computational Discriminability Analysis on Twin Fingerprints

    Science.gov (United States)

    Liu, Yu; Srihari, Sargur N.

    Sharing similar genetic traits makes the investigation of twins an important study in forensics and biometrics. Fingerprints are one of the most commonly found types of forensic evidence. The similarity between twins’ prints is critical establish to the reliability of fingerprint identification. We present a quantitative analysis of the discriminability of twin fingerprints on a new data set (227 pairs of identical twins and fraternal twins) recently collected from a twin population using both level 1 and level 2 features. Although the patterns of minutiae among twins are more similar than in the general population, the similarity of fingerprints of twins is significantly different from that between genuine prints of the same finger. Twins fingerprints are discriminable with a 1.5%~1.7% higher EER than non-twins. And identical twins can be distinguished by examine fingerprint with a slightly higher error rate than fraternal twins.

  14. Bootstrap-based procedures for inference in nonparametric receiver-operating characteristic curve regression analysis.

    Science.gov (United States)

    Rodríguez-Álvarez, María Xosé; Roca-Pardiñas, Javier; Cadarso-Suárez, Carmen; Tahoces, Pablo G

    2018-03-01

    Prior to using a diagnostic test in a routine clinical setting, the rigorous evaluation of its diagnostic accuracy is essential. The receiver-operating characteristic curve is the measure of accuracy most widely used for continuous diagnostic tests. However, the possible impact of extra information about the patient (or even the environment) on diagnostic accuracy also needs to be assessed. In this paper, we focus on an estimator for the covariate-specific receiver-operating characteristic curve based on direct regression modelling and nonparametric smoothing techniques. This approach defines the class of generalised additive models for the receiver-operating characteristic curve. The main aim of the paper is to offer new inferential procedures for testing the effect of covariates on the conditional receiver-operating characteristic curve within the above-mentioned class. Specifically, two different bootstrap-based tests are suggested to check (a) the possible effect of continuous covariates on the receiver-operating characteristic curve and (b) the presence of factor-by-curve interaction terms. The validity of the proposed bootstrap-based procedures is supported by simulations. To facilitate the application of these new procedures in practice, an R-package, known as npROCRegression, is provided and briefly described. Finally, data derived from a computer-aided diagnostic system for the automatic detection of tumour masses in breast cancer is analysed.

  15. Bayesian Nonparametric Hidden Markov Models with application to the analysis of copy-number-variation in mammalian genomes.

    Science.gov (United States)

    Yau, C; Papaspiliopoulos, O; Roberts, G O; Holmes, C

    2011-01-01

    We consider the development of Bayesian Nonparametric methods for product partition models such as Hidden Markov Models and change point models. Our approach uses a Mixture of Dirichlet Process (MDP) model for the unknown sampling distribution (likelihood) for the observations arising in each state and a computationally efficient data augmentation scheme to aid inference. The method uses novel MCMC methodology which combines recent retrospective sampling methods with the use of slice sampler variables. The methodology is computationally efficient, both in terms of MCMC mixing properties, and robustness to the length of the time series being investigated. Moreover, the method is easy to implement requiring little or no user-interaction. We apply our methodology to the analysis of genomic copy number variation.

  16. Nonparametric statistics for social and behavioral sciences

    CERN Document Server

    Kraska-MIller, M

    2013-01-01

    Introduction to Research in Social and Behavioral SciencesBasic Principles of ResearchPlanning for ResearchTypes of Research Designs Sampling ProceduresValidity and Reliability of Measurement InstrumentsSteps of the Research Process Introduction to Nonparametric StatisticsData AnalysisOverview of Nonparametric Statistics and Parametric Statistics Overview of Parametric Statistics Overview of Nonparametric StatisticsImportance of Nonparametric MethodsMeasurement InstrumentsAnalysis of Data to Determine Association and Agreement Pearson Chi-Square Test of Association and IndependenceContingency

  17. Nonparametric correlation models for portfolio allocation

    DEFF Research Database (Denmark)

    Aslanidis, Nektarios; Casas, Isabel

    2013-01-01

    This article proposes time-varying nonparametric and semiparametric estimators of the conditional cross-correlation matrix in the context of portfolio allocation. Simulations results show that the nonparametric and semiparametric models are best in DGPs with substantial variability or structural ...... currencies. Results show the nonparametric model generally dominates the others when evaluating in-sample. However, the semiparametric model is best for out-of-sample analysis....

  18. DISCRIMINANT ANALYSIS OF BANK PROFITABILITY LEVELS

    Directory of Open Access Journals (Sweden)

    Ante Rozga

    2013-02-01

    Full Text Available Discriminant analysis has been employed in this paper in order to identify and explain key features of bank profitability levels. Bank profitability is set up in the form of two categorical variables: profit or loss recorded and above or below average return on equity. Predictor variables are selected from various groups of financial indicators usually included in the empirical work on microeconomic determinants of bank profitability. The data from the Croatian banking sector is analyzed using the Enter method. General recommendations for a more profitable business of banking found in the bank management literature and existing empirical framework such as rationalization of overhead costs, asset growth, increase of non-interest income by expanding scale and scope of financial products proved to be important for classification of banks in different profitability levels. A higher market share may bring additional advantages. Classification results, canonical correlation and Wilks’ Lambda test confirm statistical significance of research results. Altogether, discriminant analysis turns out to be a suitable statistical method for solving presented research problem and moving forward from the bankruptcy, credit rating or default issues in finance.

  19. UN ANÁLISIS NO PARAMÉTRICO DE ÍTEMS DE LA PRUEBA DEL BENDER/A NONPARAMETRIC ITEM ANALYSIS OF THE BENDER GESTALT TEST MODIFIED

    Directory of Open Access Journals (Sweden)

    César Merino Soto

    2009-05-01

    Full Text Available Resumen:La presente investigación hace un estudio psicométrico de un nuevo sistema de calificación de la Prueba Gestáltica del Bendermodificada para niños, que es el Sistema de Calificación Cualitativa (Brannigan y Brunner, 2002, en un muestra de 244 niñosingresantes a primer grado de primaria en cuatro colegios públicos, ubicados en Lima. El enfoque usado es un análisis noparamétrico de ítems mediante el programa Testgraf (Ramsay, 1991. Los resultados indican niveles apropiados deconsistencia interna, identificándose la unidimensionalidad, y el buen nivel discriminativo de las categorías de calificación deeste Sistema Cualitativo. No se hallaron diferencias demográficas respecto al género ni la edad. Se discuten los presenteshallazgos en el contexto del potencial uso del Sistema de Calificación Cualitativa y del análisis no paramétrico de ítems en lainvestigación psicométrica.AbstracThis research designs a psychometric study of a new scoring system of the Bender Gestalt test modified to children: it is theQualitative Scoring System (Brannigan & Brunner, 2002, in a sample of 244 first grade children of primary level, in four public school of Lima. The approach aplied is the nonparametric item analysis using The test graft computer program (Ramsay, 1991. Our findings point to good levels of internal consistency, unidimensionality and good discriminative level ofthe categories of scoring from the Qualitative Scoring System. There are not demographic differences between gender or age.We discuss our findings within the context of the potential use of the Qualitative Scoring System and of the nonparametricitem analysis approach in the psychometric research.

  20. Decision support using nonparametric statistics

    CERN Document Server

    Beatty, Warren

    2018-01-01

    This concise volume covers nonparametric statistics topics that most are most likely to be seen and used from a practical decision support perspective. While many degree programs require a course in parametric statistics, these methods are often inadequate for real-world decision making in business environments. Much of the data collected today by business executives (for example, customer satisfaction opinions) requires nonparametric statistics for valid analysis, and this book provides the reader with a set of tools that can be used to validly analyze all data, regardless of type. Through numerous examples and exercises, this book explains why nonparametric statistics will lead to better decisions and how they are used to reach a decision, with a wide array of business applications. Online resources include exercise data, spreadsheets, and solutions.

  1. Nonparametric statistical inference

    CERN Document Server

    Gibbons, Jean Dickinson

    2010-01-01

    Overall, this remains a very fine book suitable for a graduate-level course in nonparametric statistics. I recommend it for all people interested in learning the basic ideas of nonparametric statistical inference.-Eugenia Stoimenova, Journal of Applied Statistics, June 2012… one of the best books available for a graduate (or advanced undergraduate) text for a theory course on nonparametric statistics. … a very well-written and organized book on nonparametric statistics, especially useful and recommended for teachers and graduate students.-Biometrics, 67, September 2011This excellently presente

  2. Application of Discriminant Analysis on Romanian Insurance Market

    OpenAIRE

    Constantin Anghelache; Dan Armeanu

    2008-01-01

    Discriminant analysis is a supervised learning technique that can be used in order to determine which variables are the best predictors of the classification of objects belonging to a population into predetermined classes. At the same time, discriminant analysis provides a powerful tool that enables researchers to make predictions regarding the classification of new objects into predefined classes. The main goal of discriminant analysis is to determine which of the N descrip...

  3. Implementation and evaluation of nonparametric regression procedures for sensitivity analysis of computationally demanding models

    International Nuclear Information System (INIS)

    Storlie, Curtis B.; Swiler, Laura P.; Helton, Jon C.; Sallaberry, Cedric J.

    2009-01-01

    The analysis of many physical and engineering problems involves running complex computational models (simulation models, computer codes). With problems of this type, it is important to understand the relationships between the input variables (whose values are often imprecisely known) and the output. The goal of sensitivity analysis (SA) is to study this relationship and identify the most significant factors or variables affecting the results of the model. In this presentation, an improvement on existing methods for SA of complex computer models is described for use when the model is too computationally expensive for a standard Monte-Carlo analysis. In these situations, a meta-model or surrogate model can be used to estimate the necessary sensitivity index for each input. A sensitivity index is a measure of the variance in the response that is due to the uncertainty in an input. Most existing approaches to this problem either do not work well with a large number of input variables and/or they ignore the error involved in estimating a sensitivity index. Here, a new approach to sensitivity index estimation using meta-models and bootstrap confidence intervals is described that provides solutions to these drawbacks. Further, an efficient yet effective approach to incorporate this methodology into an actual SA is presented. Several simulated and real examples illustrate the utility of this approach. This framework can be extended to uncertainty analysis as well.

  4. Non-parametric analysis of technical efficiency: factors affecting efficiency of West Java rice farms

    Czech Academy of Sciences Publication Activity Database

    Brázdik, František

    -, č. 286 (2006), s. 1-45 ISSN 1211-3298 R&D Projects: GA MŠk LC542 Institutional research plan: CEZ:AV0Z70850503 Keywords : rice farms * data envelopment analysis Subject RIV: AH - Economics http://www.cerge-ei.cz/pdf/wp/Wp286.pdf

  5. Using discriminant analysis for credit decision

    Directory of Open Access Journals (Sweden)

    Gheorghiţa DINCĂ

    2015-12-01

    Full Text Available This paper follows to highlight the link between the results obtained applying discriminant analysis and lending decision. For this purpose, we have carried out the research on a sample of 24 Romanian private companies, pertaining to 12 different economic sectors, from I and II categories of Bucharest Stock Exchange, for the period 2010-2012. Our study works with two popular bankruptcy risk’s prediction models, the Altman model and the Anghel model. We have double-checked and confirmed the results of our research by comparing the results from applying the two fore-mentioned models as well as by checking existing debt commitments of each analyzed company to credit institutions during the 2010-2012 period. The aim of this paper was the classification of studied companies into potential bankrupt and non-bankrupt, to assist credit institutions in their decision to grant credit, understanding the approval or rejection algorithm of loan applications and even help potential investors in these ompanies.

  6. Zero- vs. one-dimensional, parametric vs. non-parametric, and confidence interval vs. hypothesis testing procedures in one-dimensional biomechanical trajectory analysis.

    Science.gov (United States)

    Pataky, Todd C; Vanrenterghem, Jos; Robinson, Mark A

    2015-05-01

    Biomechanical processes are often manifested as one-dimensional (1D) trajectories. It has been shown that 1D confidence intervals (CIs) are biased when based on 0D statistical procedures, and the non-parametric 1D bootstrap CI has emerged in the Biomechanics literature as a viable solution. The primary purpose of this paper was to clarify that, for 1D biomechanics datasets, the distinction between 0D and 1D methods is much more important than the distinction between parametric and non-parametric procedures. A secondary purpose was to demonstrate that a parametric equivalent to the 1D bootstrap exists in the form of a random field theory (RFT) correction for multiple comparisons. To emphasize these points we analyzed six datasets consisting of force and kinematic trajectories in one-sample, paired, two-sample and regression designs. Results showed, first, that the 1D bootstrap and other 1D non-parametric CIs were qualitatively identical to RFT CIs, and all were very different from 0D CIs. Second, 1D parametric and 1D non-parametric hypothesis testing results were qualitatively identical for all six datasets. Last, we highlight the limitations of 1D CIs by demonstrating that they are complex, design-dependent, and thus non-generalizable. These results suggest that (i) analyses of 1D data based on 0D models of randomness are generally biased unless one explicitly identifies 0D variables before the experiment, and (ii) parametric and non-parametric 1D hypothesis testing provide an unambiguous framework for analysis when one׳s hypothesis explicitly or implicitly pertains to whole 1D trajectories. Copyright © 2015 Elsevier Ltd. All rights reserved.

  7. Measuring the efficiency of dental departments in medical centers: a nonparametric analysis approach.

    Science.gov (United States)

    Wang, Su-Chen; Tsai, Chi-Cheng; Huang, Shun-Te; Hong, Yu-Jue

    2002-12-01

    Data envelopment analysis (DEA), a cross-sectional study design based on secondary data analysis, was used to evaluate the relative operational efficiency of 16 dental departments in medical centers in Taiwan in 1999. The results indicated that 68.7% of all dental departments in medical centers had poor performance in terms of overall efficiency and scale efficiency. All relatively efficient dental departments were in private medical centers. Half of these dental departments were unable to fully utilize available medical resources. 75.0% of public medical centers did not take full advantage of medical resources at their disposal. In the returns to scale, 56.3% of dental departments in medical centers exhibited increasing returns to scale, due to the insufficient scale influencing overall hospital operational efficiency. Public medical centers accounted for 77.8% of the institutions affected. The scale of dental departments in private medical centers was more appropriate than those in public medical centers. In the sensitivity analysis, the numbers of residents, interns, and published papers were used to assess teaching and research. Greater emphasis on teaching and research in medical centers has a large effect on the relative inefficiency of hospital operation. Dental departments in private medical centers had a higher mean overall efficiency score than those in public medical centers, and the overall efficiency of dental departments in non-university hospitals was greater than those in university hospitals. There was no information to evaluate the long-term efficiency of each dental department in all hospitals. A different combination of input and output variables, using common multipliers for efficiency value measurements in DEA, may help establish different pioneering dental departments in hospitals.

  8. Non-parametric trend analysis of the aridity index for three large arid and semi-arid basins in Iran

    Science.gov (United States)

    Ahani, Hossien; Kherad, Mehrzad; Kousari, Mohammad Reza; van Roosmalen, Lieke; Aryanfar, Ramin; Hosseini, Seyyed Mashaallah

    2013-05-01

    Currently, an important scientific challenge that researchers are facing is to gain a better understanding of climate change at the regional scale, which can be especially challenging in an area with low and highly variable precipitation amounts such as Iran. Trend analysis of the medium-term change using ground station observations of meteorological variables can enhance our knowledge of the dominant processes in an area and contribute to the analysis of future climate projections. Generally, studies focus on the long-term variability of temperature and precipitation and to a lesser extent on other important parameters such as moisture indices. In this study the recent 50-year trends (1955-2005) of precipitation (P), potential evapotranspiration (PET), and aridity index (AI) in monthly time scale were studied over 14 synoptic stations in three large Iran basins using the Mann-Kendall non-parametric test. Additionally, an analysis of the monthly, seasonal and annual trend of each parameter was performed. Results showed no significant trends in the monthly time series. However, PET showed significant, mostly decreasing trends, for the seasonal values, which resulted in a significant negative trend in annual PET at five stations. Significant negative trends in seasonal P values were only found at a number of stations in spring and summer and no station showed significant negative trends in annual P. Due to the varied positive and negative trends in annual P and to a lesser extent PET, almost as many stations with negative as positive trends in annual AI were found, indicating that both drying and wetting trends occurred in Iran. Overall, the northern part of the study area showed an increasing trend in annual AI which meant that the region became wetter, while the south showed decreasing trends in AI.

  9. Adaptive Kernel Canonical Correlation Analysis Algorithms for Nonparametric Identification of Wiener and Hammerstein Systems

    Directory of Open Access Journals (Sweden)

    Ignacio Santamaría

    2008-04-01

    Full Text Available This paper treats the identification of nonlinear systems that consist of a cascade of a linear channel and a nonlinearity, such as the well-known Wiener and Hammerstein systems. In particular, we follow a supervised identification approach that simultaneously identifies both parts of the nonlinear system. Given the correct restrictions on the identification problem, we show how kernel canonical correlation analysis (KCCA emerges as the logical solution to this problem. We then extend the proposed identification algorithm to an adaptive version allowing to deal with time-varying systems. In order to avoid overfitting problems, we discuss and compare three possible regularization techniques for both the batch and the adaptive versions of the proposed algorithm. Simulations are included to demonstrate the effectiveness of the presented algorithm.

  10. A Parcellation Based Nonparametric Algorithm for Independent Component Analysis with Application to fMRI Data

    Directory of Open Access Journals (Sweden)

    Shanshan eLi

    2016-01-01

    Full Text Available Independent Component analysis (ICA is a widely used technique for separating signals that have been mixed together. In this manuscript, we propose a novel ICA algorithm using density estimation and maximum likelihood, where the densities of the signals are estimated via p-spline based histogram smoothing and the mixing matrix is simultaneously estimated using an optimization algorithm. The algorithm is exceedingly simple, easy to implement and blind to the underlying distributions of the source signals. To relax the identically distributed assumption in the density function, a modified algorithm is proposed to allow for different density functions on different regions. The performance of the proposed algorithm is evaluated in different simulation settings. For illustration, the algorithm is applied to a research investigation with a large collection of resting state fMRI datasets. The results show that the algorithm successfully recovers the established brain networks.

  11. Contributions to sensitivity analysis and generalized discriminant analysis

    International Nuclear Information System (INIS)

    Jacques, J.

    2005-12-01

    Two topics are studied in this thesis: sensitivity analysis and generalized discriminant analysis. Global sensitivity analysis of a mathematical model studies how the output variables of this last react to variations of its inputs. The methods based on the study of the variance quantify the part of variance of the response of the model due to each input variable and each subset of input variables. The first subject of this thesis is the impact of a model uncertainty on results of a sensitivity analysis. Two particular forms of uncertainty are studied: that due to a change of the model of reference, and that due to the use of a simplified model with the place of the model of reference. A second problem was studied during this thesis, that of models with correlated inputs. Indeed, classical sensitivity indices not having significance (from an interpretation point of view) in the presence of correlation of the inputs, we propose a multidimensional approach consisting in expressing the sensitivity of the output of the model to groups of correlated variables. Applications in the field of nuclear engineering illustrate this work. Generalized discriminant analysis consists in classifying the individuals of a test sample in groups, by using information contained in a training sample, when these two samples do not come from the same population. This work extends existing methods in a Gaussian context to the case of binary data. An application in public health illustrates the utility of generalized discrimination models thus defined. (author)

  12. A critique of non-parametric efficiency analysis in energy economics studies

    International Nuclear Information System (INIS)

    Chen, Chien-Ming

    2013-01-01

    The paper reexamines non-additive environmental efficiency models with weakly-disposable undesirable outputs appeared in the literature of energy economics. These efficiency models are used in numerous studies published in this journal and other energy-related outlets. Recent studies, however, have found key limitations of the weak-disposability assumption in its application to environmental efficiency analysis. It is found that efficiency scores obtained from non-additive efficiency models can be non-monotonic in pollution quantities under the weak-disposability assumption — which is against common intuition and the principle of environmental economics. In this paper, I present taxonomy of efficiency models found in the energy economics literature and illustrate the above limitations and discuss implications of monotonicity from a practical viewpoint. Finally, I review the formulations for a variable returns-to-scale technology with weakly-disposable undesirable outputs, which has been misused in a number of papers in the energy economics literature. An application to evaluating the energy efficiencies of 23 European Union states is presented to illustrate the problem. - Highlights: • Review different environmental efficiency model used in energy economics studies • Highlight limitations of these environmental efficiency models • These limitations have not been recognized in the existing energy economics literature. • Data from 23 European Union states are used to illustrate the methodological consequences

  13. Industrial energy efficiency with CO2 emissions in China: A nonparametric analysis

    International Nuclear Information System (INIS)

    Wu, F.; Fan, L.W.; Zhou, P.; Zhou, D.Q.

    2012-01-01

    Global awareness on energy security and climate change has created much interest in assessing economy-wide energy efficiency performance. A number of previous studies have contributed to evaluate energy efficiency performance using different analytical techniques among which data envelopment analysis (DEA) has recently received increasing attention. Most of DEA-related energy efficiency studies do not consider undesirable outputs such as CO 2 emissions in their modeling framework, which may lead to biased energy efficiency values. Within a joint production framework of desirable and undesirable outputs, in this paper we construct both static and dynamic energy efficiency performance indexes for measuring industrial energy efficiency performance by using several environmental DEA models with CO 2 emissions. The dynamic energy efficiency performance indexes have further been decomposed into two contributing components. We finally apply the indexes proposed to assess the industrial energy efficiency performance of different provinces in China over time. Our empirical study shows that the energy efficiency improvement in China's industrial sector was mainly driven by technological improvement. - Highlights: ► China's industrial energy efficiency is evaluated by DEA models with CO 2 emissions. ► China's industrial energy efficiency improved by 5.6% annually since 1997. ► Industrial energy efficiency improvement in China was mainly driven by technological improvement.

  14. Transit Timing Observations from Kepler: II. Confirmation of Two Multiplanet Systems via a Non-parametric Correlation Analysis

    Energy Technology Data Exchange (ETDEWEB)

    Ford, Eric B.; /Florida U.; Fabrycky, Daniel C.; /Lick Observ.; Steffen, Jason H.; /Fermilab; Carter, Joshua A.; /Harvard-Smithsonian Ctr. Astrophys.; Fressin, Francois; /Harvard-Smithsonian Ctr. Astrophys.; Holman, Matthew J.; /Harvard-Smithsonian Ctr. Astrophys.; Lissauer, Jack J.; /NASA, Ames; Moorhead, Althea V.; /Florida U.; Morehead, Robert C.; /Florida U.; Ragozzine, Darin; /Harvard-Smithsonian Ctr. Astrophys.; Rowe, Jason F.; /NASA, Ames /SETI Inst., Mtn. View /San Diego State U., Astron. Dept.

    2012-01-01

    We present a new method for confirming transiting planets based on the combination of transit timing variations (TTVs) and dynamical stability. Correlated TTVs provide evidence that the pair of bodies are in the same physical system. Orbital stability provides upper limits for the masses of the transiting companions that are in the planetary regime. This paper describes a non-parametric technique for quantifying the statistical significance of TTVs based on the correlation of two TTV data sets. We apply this method to an analysis of the transit timing variations of two stars with multiple transiting planet candidates identified by Kepler. We confirm four transiting planets in two multiple planet systems based on their TTVs and the constraints imposed by dynamical stability. An additional three candidates in these same systems are not confirmed as planets, but are likely to be validated as real planets once further observations and analyses are possible. If all were confirmed, these systems would be near 4:6:9 and 2:4:6:9 period commensurabilities. Our results demonstrate that TTVs provide a powerful tool for confirming transiting planets, including low-mass planets and planets around faint stars for which Doppler follow-up is not practical with existing facilities. Continued Kepler observations will dramatically improve the constraints on the planet masses and orbits and provide sensitivity for detecting additional non-transiting planets. If Kepler observations were extended to eight years, then a similar analysis could likely confirm systems with multiple closely spaced, small transiting planets in or near the habitable zone of solar-type stars.

  15. TRANSIT TIMING OBSERVATIONS FROM KEPLER. II. CONFIRMATION OF TWO MULTIPLANET SYSTEMS VIA A NON-PARAMETRIC CORRELATION ANALYSIS

    International Nuclear Information System (INIS)

    Ford, Eric B.; Moorhead, Althea V.; Morehead, Robert C.; Fabrycky, Daniel C.; Steffen, Jason H.; Carter, Joshua A.; Fressin, Francois; Holman, Matthew J.; Ragozzine, Darin; Charbonneau, David; Lissauer, Jack J.; Rowe, Jason F.; Borucki, William J.; Bryson, Stephen T.; Burke, Christopher J.; Caldwell, Douglas A.; Welsh, William F.; Allen, Christopher; Batalha, Natalie M.; Buchhave, Lars A.

    2012-01-01

    We present a new method for confirming transiting planets based on the combination of transit timing variations (TTVs) and dynamical stability. Correlated TTVs provide evidence that the pair of bodies is in the same physical system. Orbital stability provides upper limits for the masses of the transiting companions that are in the planetary regime. This paper describes a non-parametric technique for quantifying the statistical significance of TTVs based on the correlation of two TTV data sets. We apply this method to an analysis of the TTVs of two stars with multiple transiting planet candidates identified by Kepler. We confirm four transiting planets in two multiple-planet systems based on their TTVs and the constraints imposed by dynamical stability. An additional three candidates in these same systems are not confirmed as planets, but are likely to be validated as real planets once further observations and analyses are possible. If all were confirmed, these systems would be near 4:6:9 and 2:4:6:9 period commensurabilities. Our results demonstrate that TTVs provide a powerful tool for confirming transiting planets, including low-mass planets and planets around faint stars for which Doppler follow-up is not practical with existing facilities. Continued Kepler observations will dramatically improve the constraints on the planet masses and orbits and provide sensitivity for detecting additional non-transiting planets. If Kepler observations were extended to eight years, then a similar analysis could likely confirm systems with multiple closely spaced, small transiting planets in or near the habitable zone of solar-type stars.

  16. TRANSIT TIMING OBSERVATIONS FROM KEPLER. II. CONFIRMATION OF TWO MULTIPLANET SYSTEMS VIA A NON-PARAMETRIC CORRELATION ANALYSIS

    Energy Technology Data Exchange (ETDEWEB)

    Ford, Eric B.; Moorhead, Althea V.; Morehead, Robert C. [Astronomy Department, University of Florida, 211 Bryant Space Sciences Center, Gainesville, FL 32611 (United States); Fabrycky, Daniel C. [UCO/Lick Observatory, University of California, Santa Cruz, CA 95064 (United States); Steffen, Jason H. [Fermilab Center for Particle Astrophysics, P.O. Box 500, MS 127, Batavia, IL 60510 (United States); Carter, Joshua A.; Fressin, Francois; Holman, Matthew J.; Ragozzine, Darin; Charbonneau, David [Harvard-Smithsonian Center for Astrophysics, 60 Garden Street, Cambridge, MA 02138 (United States); Lissauer, Jack J.; Rowe, Jason F.; Borucki, William J.; Bryson, Stephen T.; Burke, Christopher J.; Caldwell, Douglas A. [NASA Ames Research Center, Moffett Field, CA 94035 (United States); Welsh, William F. [Astronomy Department, San Diego State University, San Diego, CA 92182-1221 (United States); Allen, Christopher [Orbital Sciences Corporation/NASA Ames Research Center, Moffett Field, CA 94035 (United States); Batalha, Natalie M. [Department of Physics and Astronomy, San Jose State University, San Jose, CA 95192 (United States); Buchhave, Lars A., E-mail: eford@astro.ufl.edu [Niels Bohr Institute, Copenhagen University, DK-2100 Copenhagen (Denmark); Collaboration: Kepler Science Team; and others

    2012-05-10

    We present a new method for confirming transiting planets based on the combination of transit timing variations (TTVs) and dynamical stability. Correlated TTVs provide evidence that the pair of bodies is in the same physical system. Orbital stability provides upper limits for the masses of the transiting companions that are in the planetary regime. This paper describes a non-parametric technique for quantifying the statistical significance of TTVs based on the correlation of two TTV data sets. We apply this method to an analysis of the TTVs of two stars with multiple transiting planet candidates identified by Kepler. We confirm four transiting planets in two multiple-planet systems based on their TTVs and the constraints imposed by dynamical stability. An additional three candidates in these same systems are not confirmed as planets, but are likely to be validated as real planets once further observations and analyses are possible. If all were confirmed, these systems would be near 4:6:9 and 2:4:6:9 period commensurabilities. Our results demonstrate that TTVs provide a powerful tool for confirming transiting planets, including low-mass planets and planets around faint stars for which Doppler follow-up is not practical with existing facilities. Continued Kepler observations will dramatically improve the constraints on the planet masses and orbits and provide sensitivity for detecting additional non-transiting planets. If Kepler observations were extended to eight years, then a similar analysis could likely confirm systems with multiple closely spaced, small transiting planets in or near the habitable zone of solar-type stars.

  17. Proposing a framework for airline service quality evaluation using Type-2 Fuzzy TOPSIS and non-parametric analysis

    Directory of Open Access Journals (Sweden)

    Navid Haghighat

    2017-12-01

    Full Text Available This paper focuses on evaluating airline service quality from the perspective of passengers' view. Until now a lot of researches has been performed in airline service quality evaluation in the world but a little research has been conducted in Iran, yet. In this study, a framework for measuring airline service quality in Iran is proposed. After reviewing airline service quality criteria, SSQAI model was selected because of its comprehensiveness in covering airline service quality dimensions. SSQAI questionnaire items were redesigned to adopt with Iranian airlines requirements and environmental circumstances in the Iran's economic and cultural context. This study includes fuzzy decision-making theory, considering the possible fuzzy subjective judgment of the evaluators during airline service quality evaluation. Fuzzy TOPSIS have been applied for ranking airlines service quality performances. Three major Iranian airlines which have the most passenger transfer volumes in domestic and foreign flights were chosen for evaluation in this research. Results demonstrated Mahan airline has got the best service quality performance rank in gaining passengers' satisfaction with delivery of high-quality services to its passengers, among the three major Iranian airlines. IranAir and Aseman airlines placed in the second and third rank, respectively, according to passenger's evaluation. Statistical analysis has been used in analyzing passenger responses. Due to the abnormality of data, Non-parametric tests were applied. To demonstrate airline ranks in every criterion separately, Friedman test was performed. Variance analysis and Tukey test were applied to study the influence of increasing in age and educational level of passengers on degree of their satisfaction from airline's service quality. Results showed that age has no significant relation to passenger satisfaction of airlines, however, increasing in educational level demonstrated a negative impact on

  18. On Cooper's Nonparametric Test.

    Science.gov (United States)

    Schmeidler, James

    1978-01-01

    The basic assumption of Cooper's nonparametric test for trend (EJ 125 069) is questioned. It is contended that the proper assumption alters the distribution of the statistic and reduces its usefulness. (JKS)

  19. PRICE DISCRIMINATION AND MARKET POWER: A THEORETICAL ANALYSIS

    Directory of Open Access Journals (Sweden)

    Olga Smirnova

    2015-07-01

    Full Text Available This paper analyzes the contemporary theoretical and empirical research in the field of impact assessment of market power and conclusions about the possibilities of the company to implement price discrimination in different market structures. The results of the analysis allow to evaluate current approaches to antitrust regulation of price discrimination.

  20. The NWRA Classification Infrastructure: description and extension to the Discriminant Analysis Flare Forecasting System (DAFFS)

    Science.gov (United States)

    Leka, K. D.; Barnes, Graham; Wagner, Eric

    2018-04-01

    A classification infrastructure built upon Discriminant Analysis (DA) has been developed at NorthWest Research Associates for examining the statistical differences between samples of two known populations. Originating to examine the physical differences between flare-quiet and flare-imminent solar active regions, we describe herein some details of the infrastructure including: parametrization of large datasets, schemes for handling "null" and "bad" data in multi-parameter analysis, application of non-parametric multi-dimensional DA, an extension through Bayes' theorem to probabilistic classification, and methods invoked for evaluating classifier success. The classifier infrastructure is applicable to a wide range of scientific questions in solar physics. We demonstrate its application to the question of distinguishing flare-imminent from flare-quiet solar active regions, updating results from the original publications that were based on different data and much smaller sample sizes. Finally, as a demonstration of "Research to Operations" efforts in the space-weather forecasting context, we present the Discriminant Analysis Flare Forecasting System (DAFFS), a near-real-time operationally-running solar flare forecasting tool that was developed from the research-directed infrastructure.

  1. Power of non-parametric linkage analysis in mapping genes contributing to human longevity in long-lived sib-pairs

    DEFF Research Database (Denmark)

    Tan, Qihua; Zhao, J H; Iachine, I

    2004-01-01

    This report investigates the power issue in applying the non-parametric linkage analysis of affected sib-pairs (ASP) [Kruglyak and Lander, 1995: Am J Hum Genet 57:439-454] to localize genes that contribute to human longevity using long-lived sib-pairs. Data were simulated by introducing a recently...... developed statistical model for measuring marker-longevity associations [Yashin et al., 1999: Am J Hum Genet 65:1178-1193], enabling direct power comparison between linkage and association approaches. The non-parametric linkage (NPL) scores estimated in the region harboring the causal allele are evaluated...... in case of a dominant effect. Although the power issue may depend heavily on the true genetic nature in maintaining survival, our study suggests that results from small-scale sib-pair investigations should be referred with caution, given the complexity of human longevity....

  2. Data analysis and approximate models model choice, location-scale, analysis of variance, nonparametric regression and image analysis

    CERN Document Server

    Davies, Patrick Laurie

    2014-01-01

    Introduction IntroductionApproximate Models Notation Two Modes of Statistical AnalysisTowards One Mode of Analysis Approximation, Randomness, Chaos, Determinism ApproximationA Concept of Approximation Approximation Approximating a Data Set by a Model Approximation Regions Functionals and EquivarianceRegularization and Optimality Metrics and DiscrepanciesStrong and Weak Topologies On Being (almost) Honest Simulations and Tables Degree of Approximation and p-values ScalesStability of Analysis The Choice of En(α, P) Independence Procedures, Approximation and VaguenessDiscrete Models The Empirical Density Metrics and Discrepancies The Total Variation Metric The Kullback-Leibler and Chi-Squared Discrepancies The Po(λ) ModelThe b(k, p) and nb(k, p) Models The Flying Bomb Data The Student Study Times Data OutliersOutliers, Data Analysis and Models Breakdown Points and Equivariance Identifying Outliers and Breakdown Outliers in Multivariate Data Outliers in Linear Regression Outliers in Structured Data The Location...

  3. A non-parametric meta-analysis approach for combining independent microarray datasets: application using two microarray datasets pertaining to chronic allograft nephropathy

    Directory of Open Access Journals (Sweden)

    Archer Kellie J

    2008-02-01

    Full Text Available Abstract Background With the popularity of DNA microarray technology, multiple groups of researchers have studied the gene expression of similar biological conditions. Different methods have been developed to integrate the results from various microarray studies, though most of them rely on distributional assumptions, such as the t-statistic based, mixed-effects model, or Bayesian model methods. However, often the sample size for each individual microarray experiment is small. Therefore, in this paper we present a non-parametric meta-analysis approach for combining data from independent microarray studies, and illustrate its application on two independent Affymetrix GeneChip studies that compared the gene expression of biopsies from kidney transplant recipients with chronic allograft nephropathy (CAN to those with normal functioning allograft. Results The simulation study comparing the non-parametric meta-analysis approach to a commonly used t-statistic based approach shows that the non-parametric approach has better sensitivity and specificity. For the application on the two CAN studies, we identified 309 distinct genes that expressed differently in CAN. By applying Fisher's exact test to identify enriched KEGG pathways among those genes called differentially expressed, we found 6 KEGG pathways to be over-represented among the identified genes. We used the expression measurements of the identified genes as predictors to predict the class labels for 6 additional biopsy samples, and the predicted results all conformed to their pathologist diagnosed class labels. Conclusion We present a new approach for combining data from multiple independent microarray studies. This approach is non-parametric and does not rely on any distributional assumptions. The rationale behind the approach is logically intuitive and can be easily understood by researchers not having advanced training in statistics. Some of the identified genes and pathways have been

  4. Introduction to nonparametric statistics for the biological sciences using R

    CERN Document Server

    MacFarland, Thomas W

    2016-01-01

    This book contains a rich set of tools for nonparametric analyses, and the purpose of this supplemental text is to provide guidance to students and professional researchers on how R is used for nonparametric data analysis in the biological sciences: To introduce when nonparametric approaches to data analysis are appropriate To introduce the leading nonparametric tests commonly used in biostatistics and how R is used to generate appropriate statistics for each test To introduce common figures typically associated with nonparametric data analysis and how R is used to generate appropriate figures in support of each data set The book focuses on how R is used to distinguish between data that could be classified as nonparametric as opposed to data that could be classified as parametric, with both approaches to data classification covered extensively. Following an introductory lesson on nonparametric statistics for the biological sciences, the book is organized into eight self-contained lessons on various analyses a...

  5. Robust variable selection method for nonparametric differential equation models with application to nonlinear dynamic gene regulatory network analysis.

    Science.gov (United States)

    Lu, Tao

    2016-01-01

    The gene regulation network (GRN) evaluates the interactions between genes and look for models to describe the gene expression behavior. These models have many applications; for instance, by characterizing the gene expression mechanisms that cause certain disorders, it would be possible to target those genes to block the progress of the disease. Many biological processes are driven by nonlinear dynamic GRN. In this article, we propose a nonparametric differential equation (ODE) to model the nonlinear dynamic GRN. Specially, we address following questions simultaneously: (i) extract information from noisy time course gene expression data; (ii) model the nonlinear ODE through a nonparametric smoothing function; (iii) identify the important regulatory gene(s) through a group smoothly clipped absolute deviation (SCAD) approach; (iv) test the robustness of the model against possible shortening of experimental duration. We illustrate the usefulness of the model and associated statistical methods through a simulation and a real application examples.

  6. Use of linear discriminant function analysis in seed morphotype ...

    African Journals Online (AJOL)

    Use of linear discriminant function analysis in seed morphotype relationship study in 31 ... Data were collected on 100-seed weight, seed length and seed width. ... to the Mesoamerican gene pool, comprising the cultigroups Sieva-Big Lima, ...

  7. Dimensional Analysis with space discrimination applied to Fickian difussion phenomena

    International Nuclear Information System (INIS)

    Diaz Sanchidrian, C.; Castans, M.

    1989-01-01

    Dimensional Analysis with space discrimination is applied to Fickian difussion phenomena in order to transform its partial differen-tial equations into ordinary ones, and also to obtain in a dimensionl-ess fom the Ficks second law. (Author)

  8. Discrimination between smiling faces: Human observers vs. automated face analysis.

    Science.gov (United States)

    Del Líbano, Mario; Calvo, Manuel G; Fernández-Martín, Andrés; Recio, Guillermo

    2018-05-11

    This study investigated (a) how prototypical happy faces (with happy eyes and a smile) can be discriminated from blended expressions with a smile but non-happy eyes, depending on type and intensity of the eye expression; and (b) how smile discrimination differs for human perceivers versus automated face analysis, depending on affective valence and morphological facial features. Human observers categorized faces as happy or non-happy, or rated their valence. Automated analysis (FACET software) computed seven expressions (including joy/happiness) and 20 facial action units (AUs). Physical properties (low-level image statistics and visual saliency) of the face stimuli were controlled. Results revealed, first, that some blended expressions (especially, with angry eyes) had lower discrimination thresholds (i.e., they were identified as "non-happy" at lower non-happy eye intensities) than others (especially, with neutral eyes). Second, discrimination sensitivity was better for human perceivers than for automated FACET analysis. As an additional finding, affective valence predicted human discrimination performance, whereas morphological AUs predicted FACET discrimination. FACET can be a valid tool for categorizing prototypical expressions, but is currently more limited than human observers for discrimination of blended expressions. Configural processing facilitates detection of in/congruence(s) across regions, and thus detection of non-genuine smiling faces (due to non-happy eyes). Copyright © 2018 Elsevier B.V. All rights reserved.

  9. The use of the discriminant analysis method for e π μ separation in BES

    International Nuclear Information System (INIS)

    Jiang Zhijin; Wang Taijie; Xie Yigang; Huang Tao

    1994-01-01

    We use the discriminant analysis method in multivariate statistical theory to handle the e π μ separation in BES, describing the principle of the discriminant analysis method, deriving the unstandardized discriminant functions (responsible for particle separation), giving the discriminant efficiency for e π μ and comparing the results from the discriminant analysis method with those obtained in a conventional way. ((orig.))

  10. Discrete Discriminant analysis based on tree-structured graphical models

    DEFF Research Database (Denmark)

    Perez de la Cruz, Gonzalo; Eslava, Guillermina

    The purpose of this paper is to illustrate the potential use of discriminant analysis based on tree{structured graphical models for discrete variables. This is done by comparing its empirical performance using estimated error rates for real and simulated data. The results show that discriminant a...... analysis based on tree{structured graphical models is a simple nonlinear method competitive with, and sometimes superior to, other well{known linear methods like those assuming mutual independence between variables and linear logistic regression.......The purpose of this paper is to illustrate the potential use of discriminant analysis based on tree{structured graphical models for discrete variables. This is done by comparing its empirical performance using estimated error rates for real and simulated data. The results show that discriminant...

  11. Nonparametric Transfer Function Models

    Science.gov (United States)

    Liu, Jun M.; Chen, Rong; Yao, Qiwei

    2009-01-01

    In this paper a class of nonparametric transfer function models is proposed to model nonlinear relationships between ‘input’ and ‘output’ time series. The transfer function is smooth with unknown functional forms, and the noise is assumed to be a stationary autoregressive-moving average (ARMA) process. The nonparametric transfer function is estimated jointly with the ARMA parameters. By modeling the correlation in the noise, the transfer function can be estimated more efficiently. The parsimonious ARMA structure improves the estimation efficiency in finite samples. The asymptotic properties of the estimators are investigated. The finite-sample properties are illustrated through simulations and one empirical example. PMID:20628584

  12. Theory of nonparametric tests

    CERN Document Server

    Dickhaus, Thorsten

    2018-01-01

    This textbook provides a self-contained presentation of the main concepts and methods of nonparametric statistical testing, with a particular focus on the theoretical foundations of goodness-of-fit tests, rank tests, resampling tests, and projection tests. The substitution principle is employed as a unified approach to the nonparametric test problems discussed. In addition to mathematical theory, it also includes numerous examples and computer implementations. The book is intended for advanced undergraduate, graduate, and postdoc students as well as young researchers. Readers should be familiar with the basic concepts of mathematical statistics typically covered in introductory statistics courses.

  13. A Large Dimensional Analysis of Regularized Discriminant Analysis Classifiers

    KAUST Repository

    Elkhalil, Khalil

    2017-11-01

    This article carries out a large dimensional analysis of standard regularized discriminant analysis classifiers designed on the assumption that data arise from a Gaussian mixture model with different means and covariances. The analysis relies on fundamental results from random matrix theory (RMT) when both the number of features and the cardinality of the training data within each class grow large at the same pace. Under mild assumptions, we show that the asymptotic classification error approaches a deterministic quantity that depends only on the means and covariances associated with each class as well as the problem dimensions. Such a result permits a better understanding of the performance of regularized discriminant analsysis, in practical large but finite dimensions, and can be used to determine and pre-estimate the optimal regularization parameter that minimizes the misclassification error probability. Despite being theoretically valid only for Gaussian data, our findings are shown to yield a high accuracy in predicting the performances achieved with real data sets drawn from the popular USPS data base, thereby making an interesting connection between theory and practice.

  14. Application of Discriminant Analysis on Romanian Insurance Market

    Directory of Open Access Journals (Sweden)

    Constantin Anghelache

    2008-11-01

    Full Text Available Discriminant analysis is a supervised learning technique that can be used in order to determine which variables are the best predictors of the classification of objects belonging to a population into predetermined classes. At the same time, discriminant analysis provides a powerful tool that enables researchers to make predictions regarding the classification of new objects into predefined classes. The main goal of discriminant analysis is to determine which of the N descriptive variables have the most discriminatory power, that is, which of them are the most relevant for the classification of objects into classes. In order to classify objects, we need a mathematical model that provides the rules for optimal allocation. This is the classifier. In this paper we will discuss three of the most important models of classification: the Bayesian criterion, the Mahalanobis criterion and the Fisher criterion. In this paper, we will use discriminant analysis to classify the insurance companies that operated on the Romanian market in 2006. We have selected a number of eigth (8 relevant variables: gross written premium (GR_WRI_PRE, net mathematical reserves (NET_M_PES, gross claims paid (GR_CL_PAID, net premium reserves (NET_PRE_RES, net claim reserves (NET_CL_RES, net income (NE—_INCOME, share capital (SHARE_CAP and gross written premium ceded in Reinsurance (GR_WRI_PRE_CED. Before proceeding to discriminant analysis, we performed cluster analysis on the initial data in order to identify classes (clusters that emerge from the data.

  15. Nonparametric Mixture of Regression Models.

    Science.gov (United States)

    Huang, Mian; Li, Runze; Wang, Shaoli

    2013-07-01

    Motivated by an analysis of US house price index data, we propose nonparametric finite mixture of regression models. We study the identifiability issue of the proposed models, and develop an estimation procedure by employing kernel regression. We further systematically study the sampling properties of the proposed estimators, and establish their asymptotic normality. A modified EM algorithm is proposed to carry out the estimation procedure. We show that our algorithm preserves the ascent property of the EM algorithm in an asymptotic sense. Monte Carlo simulations are conducted to examine the finite sample performance of the proposed estimation procedure. An empirical analysis of the US house price index data is illustrated for the proposed methodology.

  16. CATDAT : A Program for Parametric and Nonparametric Categorical Data Analysis : User's Manual Version 1.0, 1998-1999 Progress Report.

    Energy Technology Data Exchange (ETDEWEB)

    Peterson, James T.

    1999-12-01

    Natural resource professionals are increasingly required to develop rigorous statistical models that relate environmental data to categorical responses data. Recent advances in the statistical and computing sciences have led to the development of sophisticated methods for parametric and nonparametric analysis of data with categorical responses. The statistical software package CATDAT was designed to make some of these relatively new and powerful techniques available to scientists. The CATDAT statistical package includes 4 analytical techniques: generalized logit modeling; binary classification tree; extended K-nearest neighbor classification; and modular neural network.

  17. Statistical analysis of water-quality data containing multiple detection limits II: S-language software for nonparametric distribution modeling and hypothesis testing

    Science.gov (United States)

    Lee, L.; Helsel, D.

    2007-01-01

    Analysis of low concentrations of trace contaminants in environmental media often results in left-censored data that are below some limit of analytical precision. Interpretation of values becomes complicated when there are multiple detection limits in the data-perhaps as a result of changing analytical precision over time. Parametric and semi-parametric methods, such as maximum likelihood estimation and robust regression on order statistics, can be employed to model distributions of multiply censored data and provide estimates of summary statistics. However, these methods are based on assumptions about the underlying distribution of data. Nonparametric methods provide an alternative that does not require such assumptions. A standard nonparametric method for estimating summary statistics of multiply-censored data is the Kaplan-Meier (K-M) method. This method has seen widespread usage in the medical sciences within a general framework termed "survival analysis" where it is employed with right-censored time-to-failure data. However, K-M methods are equally valid for the left-censored data common in the geosciences. Our S-language software provides an analytical framework based on K-M methods that is tailored to the needs of the earth and environmental sciences community. This includes routines for the generation of empirical cumulative distribution functions, prediction or exceedance probabilities, and related confidence limits computation. Additionally, our software contains K-M-based routines for nonparametric hypothesis testing among an unlimited number of grouping variables. A primary characteristic of K-M methods is that they do not perform extrapolation and interpolation. Thus, these routines cannot be used to model statistics beyond the observed data range or when linear interpolation is desired. For such applications, the aforementioned parametric and semi-parametric methods must be used.

  18. Discriminant Function Analysis as a Proof for Sexual Dimorphism ...

    African Journals Online (AJOL)

    Background: Forensic scientists study human skeleton in legal setting. Discriminant function analysis has become important in forensic anthropology. The aim of this study was to determine the sex of adolescent Yoruba ethnic group of Nigeria using iscriminant function analysis. Methodology: One thousand (500 males and ...

  19. Principal Component Clustering Approach to Teaching Quality Discriminant Analysis

    Science.gov (United States)

    Xian, Sidong; Xia, Haibo; Yin, Yubo; Zhai, Zhansheng; Shang, Yan

    2016-01-01

    Teaching quality is the lifeline of the higher education. Many universities have made some effective achievement about evaluating the teaching quality. In this paper, we establish the Students' evaluation of teaching (SET) discriminant analysis model and algorithm based on principal component clustering analysis. Additionally, we classify the SET…

  20. Facial Affect Recognition Using Regularized Discriminant Analysis-Based Algorithms

    Directory of Open Access Journals (Sweden)

    Cheng-Yuan Shih

    2010-01-01

    Full Text Available This paper presents a novel and effective method for facial expression recognition including happiness, disgust, fear, anger, sadness, surprise, and neutral state. The proposed method utilizes a regularized discriminant analysis-based boosting algorithm (RDAB with effective Gabor features to recognize the facial expressions. Entropy criterion is applied to select the effective Gabor feature which is a subset of informative and nonredundant Gabor features. The proposed RDAB algorithm uses RDA as a learner in the boosting algorithm. The RDA combines strengths of linear discriminant analysis (LDA and quadratic discriminant analysis (QDA. It solves the small sample size and ill-posed problems suffered from QDA and LDA through a regularization technique. Additionally, this study uses the particle swarm optimization (PSO algorithm to estimate optimal parameters in RDA. Experiment results demonstrate that our approach can accurately and robustly recognize facial expressions.

  1. Bayesian nonparametric hierarchical modeling.

    Science.gov (United States)

    Dunson, David B

    2009-04-01

    In biomedical research, hierarchical models are very widely used to accommodate dependence in multivariate and longitudinal data and for borrowing of information across data from different sources. A primary concern in hierarchical modeling is sensitivity to parametric assumptions, such as linearity and normality of the random effects. Parametric assumptions on latent variable distributions can be challenging to check and are typically unwarranted, given available prior knowledge. This article reviews some recent developments in Bayesian nonparametric methods motivated by complex, multivariate and functional data collected in biomedical studies. The author provides a brief review of flexible parametric approaches relying on finite mixtures and latent class modeling. Dirichlet process mixture models are motivated by the need to generalize these approaches to avoid assuming a fixed finite number of classes. Focusing on an epidemiology application, the author illustrates the practical utility and potential of nonparametric Bayes methods.

  2. Quantal Response: Nonparametric Modeling

    Science.gov (United States)

    2017-01-01

    capture the behavior of observed phenomena. Higher-order polynomial and finite-dimensional spline basis models allow for more complicated responses as the...flexibility as these are nonparametric (not constrained to any particular functional form). These should be useful in identifying nonstandard behavior via... deviance ∆ = −2 log(Lreduced/Lfull) is defined in terms of the likelihood function L. For normal error, Lfull = 1, and based on Eq. A-2, we have log

  3. Pharmacokinetic-Pharmacodynamic (PKPD) Analysis with Drug Discrimination.

    Science.gov (United States)

    Negus, S Stevens; Banks, Matthew L

    2016-08-30

    Discriminative stimulus and other drug effects are determined by the concentration of drug at its target receptor and by the pharmacodynamic consequences of drug-receptor interaction. For in vivo procedures such as drug discrimination, drug concentration at receptors in a given anatomical location (e.g., the brain) is determined both by the dose of drug administered and by pharmacokinetic processes of absorption, distribution, metabolism, and excretion that deliver drug to and from that anatomical location. Drug discrimination data are often analyzed by strategies of dose-effect analysis to determine parameters such as potency and efficacy. Pharmacokinetic-Pharmacodynamic (PKPD) analysis is an alternative to conventional dose-effect analysis, and it relates drug effects to a measure of drug concentration in a body compartment (e.g., venous blood) rather than to drug dose. PKPD analysis can yield insights on pharmacokinetic and pharmacodynamic determinants of drug action. PKPD analysis can also facilitate translational research by identifying species differences in pharmacokinetics and providing a basis for integrating these differences into interpretation of drug effects. Examples are discussed here to illustrate the application of PKPD analysis to the evaluation of drug effects in rhesus monkeys trained to discriminate cocaine from saline.

  4. Enamel surface topography analysis for diet discrimination. A methodology to enhance and select discriminative parameters

    Science.gov (United States)

    Francisco, Arthur; Blondel, Cécile; Brunetière, Noël; Ramdarshan, Anusha; Merceron, Gildas

    2018-03-01

    Tooth wear and, more specifically, dental microwear texture is a dietary proxy that has been used for years in vertebrate paleoecology and ecology. DMTA, dental microwear texture analysis, relies on a few parameters related to the surface complexity, anisotropy and heterogeneity of the enamel facets at the micrometric scale. Working with few but physically meaningful parameters helps in comparing published results and in defining levels for classification purposes. Other dental microwear approaches are based on ISO parameters and coupled with statistical tests to find the more relevant ones. The present study roughly utilizes most of the aforementioned parameters in their more or less modified form. But more than parameters, we here propose a new approach: instead of a single parameter characterizing the whole surface, we sample the surface and thus generate 9 derived parameters in order to broaden the parameter set. The identification of the most discriminative parameters is performed with an automated procedure which is an extended and refined version of the workflows encountered in some studies. The procedure in its initial form includes the most common tools, like the ANOVA and the correlation analysis, along with the required mathematical tests. The discrimination results show that a simplified form of the procedure is able to more efficiently identify the desired number of discriminative parameters. Also highlighted are some trends like the relevance of working with both height and spatial parameters, as well as the potential benefits of dimensionless surfaces. On a set of 45 surfaces issued from 45 specimens of three modern ruminants with differences in feeding preferences (grazing, leaf-browsing and fruit-eating), it is clearly shown that the level of wear discrimination is improved with the new methodology compared to the other ones.

  5. Discrimination analysis of ononis repens and ononis spinosa of the ...

    African Journals Online (AJOL)

    Discrimination analysis of ononis repens and ononis spinosa of the British Isles. CE Stephens. Abstract. No Abstract. Journal of the Ghana Association Vol. 2 (3) 1999: pp.88-94. Full Text: EMAIL FULL TEXT EMAIL FULL TEXT · DOWNLOAD FULL TEXT DOWNLOAD FULL TEXT · http://dx.doi.org/10.4314/jgsa.v2i3.17997.

  6. Discriminant analysis of functional optical topography for schizophrenia diagnosis

    Science.gov (United States)

    Chuang, Ching-Cheng; Nakagome, Kazuyuki; Pu, Shenghong; Lan, Tsuo-Hung; Lee, Chia-Yen; Sun, Chia-Wei

    2014-01-01

    Abnormal prefrontal function plays a central role in the cognition deficits of schizophrenic patients; however, the character of the relationship between discriminant analysis and prefrontal activation remains undetermined. Recently, evidence of low prefrontal cortex (PFC) activation in individuals with schizophrenia has also been found during verbal fluency tests (VFT) and other cognitive tests with several neuroimaging methods. The purpose of this study is to assess the hemodynamic changes of the PFC and discriminant analysis between schizophrenia patients and healthy controls during VFT task by utilizing functional optical topography. A total of 99 subjects including 53 schizophrenic patients and 46 age- and gender-matched healthy controls were studied. The results showed that the healthy group had larger activation in the right and left PFC than in the middle PFC. Besides, the schizophrenic group showed weaker task performance and lower activation in the whole PFC than the healthy group. The result of the discriminant analysis showed a significant difference with P value <0.001 in six channels (CH 23, 29, 31, 40, 42, 52) between the schizophrenic and healthy groups. Finally, 68.69% and 71.72% of subjects are correctly classified as being schizophrenic or healthy with all 52 channels and six significantly different channels, respectively. Our findings suggest that the left PFC can be a feature region for discriminant analysis of schizophrenic diagnosis.

  7. Linear discriminant analysis of structure within African eggplant 'Shum'

    African Journals Online (AJOL)

    A MANOVA preceded linear discriminant analysis, to model each of 61 variables, as predicted by clusters and experiment to filter out non-significant traits. Four distinct clusters emerged, with a cophenetic relation coefficient of 0.87 (P<0.01). Canonical variates that best predicted the observed clusters include petiole length, ...

  8. Nonparametric trend estimation in the presence of fractal noise: application to fMRI time-series analysis.

    Science.gov (United States)

    Afshinpour, Babak; Hossein-Zadeh, Gholam-Ali; Soltanian-Zadeh, Hamid

    2008-06-30

    Unknown low frequency fluctuations called "trend" are observed in noisy time-series measured for different applications. In some disciplines, they carry primary information while in other fields such as functional magnetic resonance imaging (fMRI) they carry nuisance effects. In all cases, however, it is necessary to estimate them accurately. In this paper, a method for estimating trend in the presence of fractal noise is proposed and applied to fMRI time-series. To this end, a partly linear model (PLM) is fitted to each time-series. The parametric and nonparametric parts of PLM are considered as contributions of hemodynamic response and trend, respectively. Using the whitening property of wavelet transform, the unknown components of the model are estimated in the wavelet domain. The results of the proposed method are compared to those of other parametric trend-removal approaches such as spline and polynomial models. It is shown that the proposed method improves activation detection and decreases variance of the estimated parameters relative to the other methods.

  9. A Unified Discussion on the Concept of Score Functions Used in the Context of Nonparametric Linkage Analysis

    Directory of Open Access Journals (Sweden)

    Lars Ängquist

    2008-01-01

    Full Text Available In this article we try to discuss nonparametric linkage (NPL score functions within a broad and quite general framework. The main focus of the paper is the structure, derivation principles and interpretations of the score function entity itself. We define and discuss several families of one-locus score function definitions, i.e. the implicit, explicit and optimal ones. Some generalizations and comments to the two-locus, unconditional and conditional, cases are included as well. Although this article mainly aims at serving as an overview, where the concept of score functions are put into a covering context, we generalize the noncentrality parameter (NCP optimal score functions in Ängquist et al. (2007 to facilitate—through weighting—for incorporation of several plausible distinct genetic models. Since the genetic model itself most oftenly is to some extent unknown this facilitates weaker prior assumptions with respect to plausible true disease models without loosing the property of NCP-optimality. Moreover, we discuss general assumptions and properties of score functions in the above sense. For instance, the concept of identical by descent (IBD sharing structures and score function equivalence are discussed in some detail.

  10. Multi spectral imaging analysis for meat spoilage discrimination

    DEFF Research Database (Denmark)

    Christiansen, Asger Nyman; Carstensen, Jens Michael; Papadopoulou, Olga

    classification methods: Naive Bayes Classifier as a reference model, Canonical Discriminant Analysis (CDA) and Support Vector Classification (SVC). As the final step, generalization of the models was performed using k-fold validation (k=10). Results showed that image analysis provided good discrimination of meat......In the present study, fresh beef fillets were purchased from a local butcher shop and stored aerobically and in modified atmosphere packaging (MAP, CO2 40%/O2 30%/N2 30%) at six different temperatures (0, 4, 8, 12, 16 and 20°C). Microbiological analysis in terms of total viable counts (TVC......) was performed in parallel with videometer image snapshots and sensory analysis. Odour and colour characteristics of meat were determined by a test panel and attributed into three pre-characterized quality classes, namely Fresh; Semi Fresh and Spoiled during the days of its shelf life. So far, different...

  11. Fish otoliths analysis by PIXE: application to stock discrimination

    International Nuclear Information System (INIS)

    Arai, Nobuaki; Takai, Noriyuki; Sakamoto, Wataru; Yoshida, Koji; Maeda, Kuniko.

    1996-01-01

    Fish otoliths are continuously deposited from fish birth to its death along with encoding environmental information. In order to decode the information, PIXE was adopted as trace elemental analysis of the otoliths. Strontium to calcium concentration ratios of red sea bream otoliths varied among rearing stations. The Sr/Ca ratios of Lake Biwa catfishes also varied between male and female and among fishing grounds. The PIXE analysis was applied to the fish stock discrimination. (author)

  12. Energy-saving and emission-abatement potential of Chinese coal-fired power enterprise: A non-parametric analysis

    International Nuclear Information System (INIS)

    Wei, Chu; Löschel, Andreas; Liu, Bing

    2015-01-01

    In the context of soaring demand for electricity, mitigating and controlling greenhouse gas emissions is a great challenge for China's power sector. Increasing attention has been placed on the evaluation of energy efficiency and CO 2 abatement potential in the power sector. However, studies at the micro-level are relatively rare due to serious data limitations. This study uses the 2004 and 2008 Census data of Zhejiang province to construct a non-parametric frontier in order to assess the abatement space of energy and associated CO 2 emission from China's coal-fired power enterprises. A Weighted Russell Directional Distance Function (WRDDF) is applied to construct an energy-saving potential index and a CO 2 emission-abatement potential index. Both indicators depict the inefficiency level in terms of energy utilization and CO 2 emissions of electric power plants. Our results show a substantial variation of energy-saving potential and CO 2 abatement potential among enterprises. We find that large power enterprises are less efficient in 2004, but become more efficient than smaller enterprises in 2008. State-owned enterprises (SOE) are not significantly different in 2008 from 2004, but perform better than their non-SOE counterparts in 2008. This change in performance for large enterprises and SOE might be driven by the “top-1000 Enterprise Energy Conservation Action” that was implemented in 2006. - Highlights: • Energy-saving potential and CO 2 abatement-potential for Chinese power enterprise are evaluated. • The potential to curb energy and emission shows great variation and dynamic changes. • Large enterprise is less efficient than small enterprise in 2004, but more efficient in 2008. • The state-owned enterprise performs better than non-state-owned enterprise in 2008

  13. Nonparametric combinatorial sequence models.

    Science.gov (United States)

    Wauthier, Fabian L; Jordan, Michael I; Jojic, Nebojsa

    2011-11-01

    This work considers biological sequences that exhibit combinatorial structures in their composition: groups of positions of the aligned sequences are "linked" and covary as one unit across sequences. If multiple such groups exist, complex interactions can emerge between them. Sequences of this kind arise frequently in biology but methodologies for analyzing them are still being developed. This article presents a nonparametric prior on sequences which allows combinatorial structures to emerge and which induces a posterior distribution over factorized sequence representations. We carry out experiments on three biological sequence families which indicate that combinatorial structures are indeed present and that combinatorial sequence models can more succinctly describe them than simpler mixture models. We conclude with an application to MHC binding prediction which highlights the utility of the posterior distribution over sequence representations induced by the prior. By integrating out the posterior, our method compares favorably to leading binding predictors.

  14. Quark/gluon jet discrimination: a reproducible analysis using R

    CERN Multimedia

    CERN. Geneva

    2017-01-01

    The power to discriminate between light-quark jets and gluon jets would have a huge impact on many searches for new physics at CERN and beyond. This talk will present a walk-through of the development of a prototype machine learning classifier for differentiating between quark and gluon jets at experiments like those at the Large Hadron Collider at CERN. A new fast feature selection method that combines information theory and graph analytics will be outlined. This method has found new variables that promise significant improvements in discrimination power. The prototype jet tagger is simple, interpretable, parsimonious, and computationally extremely cheap, and therefore might be suitable for use in trigger systems for real-time data processing. Nested stratified k-fold cross validation was used to generate robust estimates of model performance. The data analysis was performed entirely in the R statistical programming language, and is fully reproducible. The entire analysis workflow is data-driven, automated a...

  15. A Structural Labor Supply Model with Nonparametric Preferences

    NARCIS (Netherlands)

    van Soest, A.H.O.; Das, J.W.M.; Gong, X.

    2000-01-01

    Nonparametric techniques are usually seen as a statistic device for data description and exploration, and not as a tool for estimating models with a richer economic structure, which are often required for policy analysis.This paper presents an example where nonparametric flexibility can be attained

  16. Multivariable Discriminant Analysis for the Differential Diagnosis of Microcytic Anemia

    Directory of Open Access Journals (Sweden)

    Eloísa Urrechaga

    2013-01-01

    Full Text Available Introduction. Iron deficiency anemia and thalassemia are the most common causes of microcytic anemia. Powerful statistical computer programming enables sensitive discriminant analyses to aid in the diagnosis. We aimed at investigating the performance of the multiple discriminant analysis (MDA to the differential diagnosis of microcytic anemia. Methods. The training group was composed of 200 β-thalassemia carriers, 65 α-thalassemia carriers, 170 iron deficiency anemia (IDA, and 45 mixed cases of thalassemia and acute phase response or iron deficiency. A set of potential predictor parameters that could detect differences among groups were selected: Red Blood Cells (RBC, hemoglobin (Hb, mean cell volume (MCV, mean cell hemoglobin (MCH, and RBC distribution width (RDW. The functions obtained with MDA analysis were applied to a set of 628 consecutive patients with microcytic anemia. Results. For classifying patients into two groups (genetic anemia and acquired anemia, only one function was needed; 87.9% β-thalassemia carriers, and 83.3% α-thalassemia carriers, and 72.1% in the mixed group were correctly classified. Conclusion. Linear discriminant functions based on hemogram data can aid in differentiating between IDA and thalassemia, so samples can be efficiently selected for further analysis to confirm the presence of genetic anemia.

  17. Nonparametric Bayesian inference in biostatistics

    CERN Document Server

    Müller, Peter

    2015-01-01

    As chapters in this book demonstrate, BNP has important uses in clinical sciences and inference for issues like unknown partitions in genomics. Nonparametric Bayesian approaches (BNP) play an ever expanding role in biostatistical inference from use in proteomics to clinical trials. Many research problems involve an abundance of data and require flexible and complex probability models beyond the traditional parametric approaches. As this book's expert contributors show, BNP approaches can be the answer. Survival Analysis, in particular survival regression, has traditionally used BNP, but BNP's potential is now very broad. This applies to important tasks like arrangement of patients into clinically meaningful subpopulations and segmenting the genome into functionally distinct regions. This book is designed to both review and introduce application areas for BNP. While existing books provide theoretical foundations, this book connects theory to practice through engaging examples and research questions. Chapters c...

  18. Nonparametric tests for censored data

    CERN Document Server

    Bagdonavicus, Vilijandas; Nikulin, Mikhail

    2013-01-01

    This book concerns testing hypotheses in non-parametric models. Generalizations of many non-parametric tests to the case of censored and truncated data are considered. Most of the test results are proved and real applications are illustrated using examples. Theories and exercises are provided. The incorrect use of many tests applying most statistical software is highlighted and discussed.

  19. Fluid Dynamic Models for Bhattacharyya-Based Discriminant Analysis.

    Science.gov (United States)

    Noh, Yung-Kyun; Hamm, Jihun; Park, Frank Chongwoo; Zhang, Byoung-Tak; Lee, Daniel D

    2018-01-01

    Classical discriminant analysis attempts to discover a low-dimensional subspace where class label information is maximally preserved under projection. Canonical methods for estimating the subspace optimize an information-theoretic criterion that measures the separation between the class-conditional distributions. Unfortunately, direct optimization of the information-theoretic criteria is generally non-convex and intractable in high-dimensional spaces. In this work, we propose a novel, tractable algorithm for discriminant analysis that considers the class-conditional densities as interacting fluids in the high-dimensional embedding space. We use the Bhattacharyya criterion as a potential function that generates forces between the interacting fluids, and derive a computationally tractable method for finding the low-dimensional subspace that optimally constrains the resulting fluid flow. We show that this model properly reduces to the optimal solution for homoscedastic data as well as for heteroscedastic Gaussian distributions with equal means. We also extend this model to discover optimal filters for discriminating Gaussian processes and provide experimental results and comparisons on a number of datasets.

  20. Phylogenetic comparative methods complement discriminant function analysis in ecomorphology.

    Science.gov (United States)

    Barr, W Andrew; Scott, Robert S

    2014-04-01

    In ecomorphology, Discriminant Function Analysis (DFA) has been used as evidence for the presence of functional links between morphometric variables and ecological categories. Here we conduct simulations of characters containing phylogenetic signal to explore the performance of DFA under a variety of conditions. Characters were simulated using a phylogeny of extant antelope species from known habitats. Characters were modeled with no biomechanical relationship to the habitat category; the only sources of variation were body mass, phylogenetic signal, or random "noise." DFA on the discriminability of habitat categories was performed using subsets of the simulated characters, and Phylogenetic Generalized Least Squares (PGLS) was performed for each character. Analyses were repeated with randomized habitat assignments. When simulated characters lacked phylogenetic signal and/or habitat assignments were random, ecomorphology. Copyright © 2013 Wiley Periodicals, Inc.

  1. Nonparametric statistics with applications to science and engineering

    CERN Document Server

    Kvam, Paul H

    2007-01-01

    A thorough and definitive book that fully addresses traditional and modern-day topics of nonparametric statistics This book presents a practical approach to nonparametric statistical analysis and provides comprehensive coverage of both established and newly developed methods. With the use of MATLAB, the authors present information on theorems and rank tests in an applied fashion, with an emphasis on modern methods in regression and curve fitting, bootstrap confidence intervals, splines, wavelets, empirical likelihood, and goodness-of-fit testing. Nonparametric Statistics with Applications to Science and Engineering begins with succinct coverage of basic results for order statistics, methods of categorical data analysis, nonparametric regression, and curve fitting methods. The authors then focus on nonparametric procedures that are becoming more relevant to engineering researchers and practitioners. The important fundamental materials needed to effectively learn and apply the discussed methods are also provide...

  2. Discriminant analysis in Polish manufacturing sector performance assessment

    Directory of Open Access Journals (Sweden)

    Józef Dziechciarz

    2004-01-01

    Full Text Available This is a presentation of the preliminary results of a larger project on the determination of the attractiveness of manufacturing branches. Results of the performance assessment of Polish manufacturing branches in 2000 (section D „Manufacturing” – based on NACE – Nomenclatures des Activites de Communite Europeene are shown. In the research, the classical (Fisher’s linear discriminant analysis technique was used for the analysis of the profit generation ability by the firms belonging to a certain production branch. For estimation, the data describing group level was used – for cross-validation, the classes data.

  3. Nonparametric e-Mixture Estimation.

    Science.gov (United States)

    Takano, Ken; Hino, Hideitsu; Akaho, Shotaro; Murata, Noboru

    2016-12-01

    This study considers the common situation in data analysis when there are few observations of the distribution of interest or the target distribution, while abundant observations are available from auxiliary distributions. In this situation, it is natural to compensate for the lack of data from the target distribution by using data sets from these auxiliary distributions-in other words, approximating the target distribution in a subspace spanned by a set of auxiliary distributions. Mixture modeling is one of the simplest ways to integrate information from the target and auxiliary distributions in order to express the target distribution as accurately as possible. There are two typical mixtures in the context of information geometry: the [Formula: see text]- and [Formula: see text]-mixtures. The [Formula: see text]-mixture is applied in a variety of research fields because of the presence of the well-known expectation-maximazation algorithm for parameter estimation, whereas the [Formula: see text]-mixture is rarely used because of its difficulty of estimation, particularly for nonparametric models. The [Formula: see text]-mixture, however, is a well-tempered distribution that satisfies the principle of maximum entropy. To model a target distribution with scarce observations accurately, this letter proposes a novel framework for a nonparametric modeling of the [Formula: see text]-mixture and a geometrically inspired estimation algorithm. As numerical examples of the proposed framework, a transfer learning setup is considered. The experimental results show that this framework works well for three types of synthetic data sets, as well as an EEG real-world data set.

  4. Visual Tracking via Feature Tensor Multimanifold Discriminate Analysis

    Directory of Open Access Journals (Sweden)

    Ting-quan Deng

    2014-01-01

    Full Text Available In the visual tracking scenarios, if there are multiple objects, due to the interference of similar objects, tracking may fail in the progress of occlusion to separation. To address this problem, this paper proposed a visual tracking algorithm with discrimination through multimanifold learning. Color-gradient-based feature tensor was used to describe object appearance for accommodation of partial occlusion. A prior multimanifold tensor dataset is established through the template matching tracking algorithm. For the purpose of discrimination, tensor distance was defined to determine the intramanifold and intermanifold neighborhood relationship in multimanifold space. Then multimanifold discriminate analysis was employed to construct multilinear projection matrices of submanifolds. Finally, object states were obtained by combining with sequence inference. Meanwhile, the multimanifold dataset and manifold learning embedded projection should be updated online. Experiments were conducted on two real visual surveillance sequences to evaluate the proposed algorithm with three state-of-the-art tracking methods qualitatively and quantitatively. Experimental results show that the proposed algorithm can achieve effective and robust effect in multi-similar-object mutual occlusion scenarios.

  5. Robust linear discriminant analysis with distance based estimators

    Science.gov (United States)

    Lim, Yai-Fung; Yahaya, Sharipah Soaad Syed; Ali, Hazlina

    2017-11-01

    Linear discriminant analysis (LDA) is one of the supervised classification techniques concerning relationship between a categorical variable and a set of continuous variables. The main objective of LDA is to create a function to distinguish between populations and allocating future observations to previously defined populations. Under the assumptions of normality and homoscedasticity, the LDA yields optimal linear discriminant rule (LDR) between two or more groups. However, the optimality of LDA highly relies on the sample mean and pooled sample covariance matrix which are known to be sensitive to outliers. To alleviate these conflicts, a new robust LDA using distance based estimators known as minimum variance vector (MVV) has been proposed in this study. The MVV estimators were used to substitute the classical sample mean and classical sample covariance to form a robust linear discriminant rule (RLDR). Simulation and real data study were conducted to examine on the performance of the proposed RLDR measured in terms of misclassification error rates. The computational result showed that the proposed RLDR is better than the classical LDR and was comparable with the existing robust LDR.

  6. Isokinetic evaluation of knee muscles in soccer players: discriminant analysis

    Directory of Open Access Journals (Sweden)

    Bruno Fles Mazuquin

    2015-10-01

    Full Text Available ABSTRACTIntroduction:Muscle activity in soccer players can be measured by isokinetic dynamometer, which is a reliable tool for assessing human performance.Objectives:To perform isokinetic analyses and to determine which variables differentiate the under-17 (U17 soccer category from the professional (PRO.Methods:Thirty four players were assessed (n=17 for each category. The isokinetic variables used for the knee extension-flexion analysis were: peak torque (Nm, total work (J, average power (W, angle of peak torque (deg., agonist/ antagonist ratio (%, measured for three velocities (60°/s, 120°/s and 300°/s, with each series containing five repetitions. Three Wilks' Lambda discriminant analyses were performed, to identify which variables were more significant for the definition of each of the categories.Results:The discriminative variables at 60°/s in the PRO category were: extension peak torque, flexion total work, extension average power and agonist/antagonist ratio; and for the U17s were: extension total work, flexion peak torque and flexion average power. At 120°/s for the PRO category the discriminant variables were: flexion peak torque and extension average power; for the U17s they were: extension total work and flexion average power. Finally at 300°/s, the variables found in the PRO and U17 categories respectively were: extension average power and extension total work.Conclusion:Isokinetic variables for flexion and extension knee muscles were able to significantly discriminate between PRO and U17 soccer players.

  7. 2nd Conference of the International Society for Nonparametric Statistics

    CERN Document Server

    Manteiga, Wenceslao; Romo, Juan

    2016-01-01

    This volume collects selected, peer-reviewed contributions from the 2nd Conference of the International Society for Nonparametric Statistics (ISNPS), held in Cádiz (Spain) between June 11–16 2014, and sponsored by the American Statistical Association, the Institute of Mathematical Statistics, the Bernoulli Society for Mathematical Statistics and Probability, the Journal of Nonparametric Statistics and Universidad Carlos III de Madrid. The 15 articles are a representative sample of the 336 contributed papers presented at the conference. They cover topics such as high-dimensional data modelling, inference for stochastic processes and for dependent data, nonparametric and goodness-of-fit testing, nonparametric curve estimation, object-oriented data analysis, and semiparametric inference. The aim of the ISNPS 2014 conference was to bring together recent advances and trends in several areas of nonparametric statistics in order to facilitate the exchange of research ideas, promote collaboration among researchers...

  8. A new kernel discriminant analysis framework for electronic nose recognition

    International Nuclear Information System (INIS)

    Zhang, Lei; Tian, Feng-Chun

    2014-01-01

    Graphical abstract: - Highlights: • This paper proposes a new discriminant analysis framework for feature extraction and recognition. • The principle of the proposed NDA is derived mathematically. • The NDA framework is coupled with kernel PCA for classification. • The proposed KNDA is compared with state of the art e-Nose recognition methods. • The proposed KNDA shows the best performance in e-Nose experiments. - Abstract: Electronic nose (e-Nose) technology based on metal oxide semiconductor gas sensor array is widely studied for detection of gas components. This paper proposes a new discriminant analysis framework (NDA) for dimension reduction and e-Nose recognition. In a NDA, the between-class and the within-class Laplacian scatter matrix are designed from sample to sample, respectively, to characterize the between-class separability and the within-class compactness by seeking for discriminant matrix to simultaneously maximize the between-class Laplacian scatter and minimize the within-class Laplacian scatter. In terms of the linear separability in high dimensional kernel mapping space and the dimension reduction of principal component analysis (PCA), an effective kernel PCA plus NDA method (KNDA) is proposed for rapid detection of gas mixture components by an e-Nose. The NDA framework is derived in this paper as well as the specific implementations of the proposed KNDA method in training and recognition process. The KNDA is examined on the e-Nose datasets of six kinds of gas components, and compared with state of the art e-Nose classification methods. Experimental results demonstrate that the proposed KNDA method shows the best performance with average recognition rate and total recognition rate as 94.14% and 95.06% which leads to a promising feature extraction and multi-class recognition in e-Nose

  9. General tensor discriminant analysis and gabor features for gait recognition.

    Science.gov (United States)

    Tao, Dacheng; Li, Xuelong; Wu, Xindong; Maybank, Stephen J

    2007-10-01

    The traditional image representations are not suited to conventional classification methods, such as the linear discriminant analysis (LDA), because of the under sample problem (USP): the dimensionality of the feature space is much higher than the number of training samples. Motivated by the successes of the two dimensional LDA (2DLDA) for face recognition, we develop a general tensor discriminant analysis (GTDA) as a preprocessing step for LDA. The benefits of GTDA compared with existing preprocessing methods, e.g., principal component analysis (PCA) and 2DLDA, include 1) the USP is reduced in subsequent classification by, for example, LDA; 2) the discriminative information in the training tensors is preserved; and 3) GTDA provides stable recognition rates because the alternating projection optimization algorithm to obtain a solution of GTDA converges, while that of 2DLDA does not. We use human gait recognition to validate the proposed GTDA. The averaged gait images are utilized for gait representation. Given the popularity of Gabor function based image decompositions for image understanding and object recognition, we develop three different Gabor function based image representations: 1) the GaborD representation is the sum of Gabor filter responses over directions, 2) GaborS is the sum of Gabor filter responses over scales, and 3) GaborSD is the sum of Gabor filter responses over scales and directions. The GaborD, GaborS and GaborSD representations are applied to the problem of recognizing people from their averaged gait images.A large number of experiments were carried out to evaluate the effectiveness (recognition rate) of gait recognition based on first obtaining a Gabor, GaborD, GaborS or GaborSD image representation, then using GDTA to extract features and finally using LDA for classification. The proposed methods achieved good performance for gait recognition based on image sequences from the USF HumanID Database. Experimental comparisons are made with nine

  10. International comparisons of the technical efficiency of the hospital sector: panel data analysis of OECD countries using parametric and non-parametric approaches.

    Science.gov (United States)

    Varabyova, Yauheniya; Schreyögg, Jonas

    2013-09-01

    There is a growing interest in the cross-country comparisons of the performance of national health care systems. The present work provides a comparison of the technical efficiency of the hospital sector using unbalanced panel data from OECD countries over the period 2000-2009. The estimation of the technical efficiency of the hospital sector is performed using nonparametric data envelopment analysis (DEA) and parametric stochastic frontier analysis (SFA). Internal and external validity of findings is assessed by estimating the Spearman rank correlations between the results obtained in different model specifications. The panel-data analyses using two-step DEA and one-stage SFA show that countries, which have higher health care expenditure per capita, tend to have a more technically efficient hospital sector. Whether the expenditure is financed through private or public sources is not related to the technical efficiency of the hospital sector. On the other hand, the hospital sector in countries with higher income inequality and longer average hospital length of stay is less technically efficient. Copyright © 2013 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.

  11. Nonparametric identification of copula structures

    KAUST Repository

    Li, Bo; Genton, Marc G.

    2013-01-01

    We propose a unified framework for testing a variety of assumptions commonly made about the structure of copulas, including symmetry, radial symmetry, joint symmetry, associativity and Archimedeanity, and max-stability. Our test is nonparametric

  12. An isoeffect approach to the study of combined effects of mixed radiations--the nonparametric analysis of in vivo data

    International Nuclear Information System (INIS)

    Lam, G.K.

    1989-01-01

    The combined effects of mixed radiations can be examined using a system of simple isoeffect relations which are derived from a recent analysis of in vitro results obtained for a variety of radiation mixtures. Similar isoeffect analysis methods have been used for over two decades in studies of the combined action of toxic agents such as drugs and antibiotics. Because of the isoeffect approach, the method is particularly useful for the analysis of ordinal data for which conventional models that are based on parametric dose-effect relations may not be suitable. This is illustrated by applying the method to the analysis of a set of recently published in vivo data using the mouse foot skin reaction system for mixtures of neutrons and X rays. The good agreement between this method and the ordinal data also helps to provide further experimental support for the existence of a class of radiobiological data for which the simple isoeffect relations are valid

  13. Tensor Rank Preserving Discriminant Analysis for Facial Recognition.

    Science.gov (United States)

    Tao, Dapeng; Guo, Yanan; Li, Yaotang; Gao, Xinbo

    2017-10-12

    Facial recognition, one of the basic topics in computer vision and pattern recognition, has received substantial attention in recent years. However, for those traditional facial recognition algorithms, the facial images are reshaped to a long vector, thereby losing part of the original spatial constraints of each pixel. In this paper, a new tensor-based feature extraction algorithm termed tensor rank preserving discriminant analysis (TRPDA) for facial image recognition is proposed; the proposed method involves two stages: in the first stage, the low-dimensional tensor subspace of the original input tensor samples was obtained; in the second stage, discriminative locality alignment was utilized to obtain the ultimate vector feature representation for subsequent facial recognition. On the one hand, the proposed TRPDA algorithm fully utilizes the natural structure of the input samples, and it applies an optimization criterion that can directly handle the tensor spectral analysis problem, thereby decreasing the computation cost compared those traditional tensor-based feature selection algorithms. On the other hand, the proposed TRPDA algorithm extracts feature by finding a tensor subspace that preserves most of the rank order information of the intra-class input samples. Experiments on the three facial databases are performed here to determine the effectiveness of the proposed TRPDA algorithm.

  14. Sustainable Production and Trade Discrimination: An Analysis of the WTO

    Directory of Open Access Journals (Sweden)

    María Alejandra Calle Saldarriaga

    2018-02-01

    Full Text Available This article aims to examine the legality of trade measures addressing environmental conditions of production (PPMs in the context of non-discrimination provisions under the General Agreement on Tariffs and Trade (GATT  and the Agreement on Technical Barriers to Trade (TBT Agreement.  It shows that the notion of de facto discrimination is still a sensitive subject in the analysis of origin-neutral measures, including those based on environmental PPMs. Much of the discussion regarding PPMs focuses on the issue of ‘like products’. The interpretation of ‘likeness’ has also served to classify PPMs into the two categories of product related and non-product related. Such distinction rests on how the PPM affects the final product. However, it is important to analyse to what extent these measures can accord less favourable treatment to like products. The author argues that this requires a competition analysis. This article also elucidates how depending upon the applicable law (the TBT Agreement or the GATT PPMs are likely to face different legal challenges, particularly in terms of less favourable treatment. The author also assesses the possibility of transposing concepts such as ‘legitimate regulatory distinctions’ stemming from the TBT jurisprudence into GATT cases involving PPMs, and whether there will be an additional ‘test’ for PPMs characterised as TBT measures. This article is based on an extensive literature review and doctrinal legal research

  15. Frontotemporal Dysfunction in Amyotrophic Lateral Sclerosis: A Discriminant Function Analysis.

    Science.gov (United States)

    Nidos, Andreas; Kasselimis, Dimitrios S; Simos, Panagiotis G; Rentzos, Michael; Alexakis, Theodoros; Zalonis, Ioannis; Zouvelou, Vassiliki; Potagas, Constantin; Evdokimidis, Ioannis; Woolley, Susan C

    2016-01-01

    There is growing evidence for extramotor dysfunction (EMd) in amyotrophic lateral sclerosis (ALS), with a reported prevalence of up to 52%. In the present study, we explore the clinical utility of a brief neuropsychological battery for the investigation of cognitive, behavioral, and language deficits in patients with ALS. Thirty-four consecutive ALS patients aged 44-89 years were tested with a brief neuropsychological battery, including executive, behavioral, and language measures. Patients were initially classified as EMd or non-EMd based on their scores on the frontal assessment battery (FAB). Between-group comparisons revealed significant differences in all measures (p < 0.01). Discriminant analysis resulted in a single canonical function, with all tests serving as significant predictors. This function agreed with the FAB in 13 of 17 patients screened as EMd and identified extramotor deficits in 2 additional patients. Overall sensitivity and specificity estimates against FAB were 88.2%. We stress the importance of discriminant function analysis in clinical neuropsychological assessment and argue that the proposed neuropsychological battery may be of clinical value, especially when the option of extensive and comprehensive neuropsychological testing is limited. The psychometric validity of an ALS-frontotemporal dementia diagnosis using neuropsychological tests is also discussed. © 2015 S. Karger AG, Basel.

  16. Anti-discrimination Analysis Using Privacy Attack Strategies

    KAUST Repository

    Ruggieri, Salvatore; Hajian, Sara; Kamiran, Faisal; Zhang, Xiangliang

    2014-01-01

    Social discrimination discovery from data is an important task to identify illegal and unethical discriminatory patterns towards protected-by-law groups, e.g., ethnic minorities. We deploy privacy attack strategies as tools for discrimination

  17. [Development of Tianma HPLC fingerprint and discriminant analysis].

    Science.gov (United States)

    Xiao, Jia-Jia; Huang, Hong; Lei, You-Cheng; Lin, Ting-Wen; Ma, Yue; Zhang, Jing; Zhang, Xing-Guo; Zhang, Da-Quan; Lv, Guang-Hua

    2017-07-01

    Tianma(the tuber of Gastrodia eleta) is a widely used and pricy Chinese herb. Its counterfeits are often found in herbal markets, which are the plant materials with similar macroscopic characteristics of Tianma. Moreover, the prices of Winter Tianma(cultivated Tianma) and Spring Tianma(mostly wild Tianma) have significant difference. However, it is difficult to identify the true or false, good or bad quality of Tianma samples. Thus, a total of 48 Tianma samples with different characteristics(including Winter Tianma, Spring Tianma, slice, powder, etc.) and 9 plant species 10 samples of Tianma counterfeits were collected and analyzed by HPLC-DAD-MS techniques. After optimizing the procedure of sample preparation, chromatographic and mass-spectral conditions, the HPLC chromatograms of all those samples were collected and compared. The similarities and Fisher discriminant analysis were further conducted between the HPLC chromatograms of Tianma and counterfeit, Winter Tianma and Spring Tianma. The results showed the HPLC chromatograms of 48 Tianma samples were similar at the correlation coefficient more than 0.848(n=48). Their mean chromatogram was simulated and used as Tianma HPLC fingerprint. There were 11 common peaks on the HPLC chromatograms of Tianma, in which 6 main peaks were chosen as characteristic peaks and identified as gastrodin, p-hydroxybenzyl alcohol, parishin A, parishin B, parishin C, parishin E, respectively by comparison of the retention time, UV and MS data with those of standard chemical compounds. All the six chemical compounds are bioactive in Tianma. However, the HPLC chromatograms of the 10 counterfeit samples were significantly different from Tianma fingerprint. The correlation coefficients between HPLC fingerprints of Tianma with the HPLC chromatograms of counterfeits were less than 0.042 and the characteristic peaks were not observed on the HPLC chromatograms of these counterfeit samples. It indicated the true or false Tianma can be

  18. The intersectionality of discrimination attributes and bullying among youth: an applied latent class analysis.

    Science.gov (United States)

    Garnett, Bernice Raveche; Masyn, Katherine E; Austin, S Bryn; Miller, Matthew; Williams, David R; Viswanath, Kasisomayajula

    2014-08-01

    Discrimination is commonly experienced among adolescents. However, little is known about the intersection of multiple attributes of discrimination and bullying. We used a latent class analysis (LCA) to illustrate the intersections of discrimination attributes and bullying, and to assess the associations of LCA membership to depressive symptoms, deliberate self harm and suicidal ideation among a sample of ethnically diverse adolescents. The data come from the 2006 Boston Youth Survey where students were asked whether they had experienced discrimination based on four attributes: race/ethnicity, immigration status, perceived sexual orientation and weight. They were also asked whether they had been bullied or assaulted for these attributes. A total of 965 (78%) students contributed to the LCA analytic sample (45% Non-Hispanic Black, 29% Hispanic, 58% Female). The LCA revealed that a 4-class solution had adequate relative and absolute fit. The 4-classes were characterized as: low discrimination (51%); racial discrimination (33%); sexual orientation discrimination (7%); racial and weight discrimination with high bullying (intersectional class) (7%). In multivariate models, compared to the low discrimination class, individuals in the sexual orientation discrimination class and the intersectional class had higher odds of engaging in deliberate self-harm. Students in the intersectional class also had higher odds of suicidal ideation. All three discrimination latent classes had significantly higher depressive symptoms compared to the low discrimination class. Multiple attributes of discrimination and bullying co-occur among adolescents. Research should consider the co-occurrence of bullying and discrimination.

  19. Discrimination against Latina/os: A Meta-Analysis of Individual-Level Resources and Outcomes

    Science.gov (United States)

    Lee, Debbiesiu L.; Ahn, Soyeon

    2012-01-01

    This meta-analysis synthesizes the findings of 60 independent samples from 51 studies examining racial/ethnic discrimination against Latina/os in the United States. The purpose was to identify individual-level resources and outcomes that most strongly relate to discrimination. Discrimination against Latina/os significantly results in outcomes…

  20. Linear discriminant analysis of character sequences using occurrences of words

    KAUST Repository

    Dutta, Subhajit; Chaudhuri, Probal; Ghosh, Anil

    2014-01-01

    Classification of character sequences, where the characters come from a finite set, arises in disciplines such as molecular biology and computer science. For discriminant analysis of such character sequences, the Bayes classifier based on Markov models turns out to have class boundaries defined by linear functions of occurrences of words in the sequences. It is shown that for such classifiers based on Markov models with unknown orders, if the orders are estimated from the data using cross-validation, the resulting classifier has Bayes risk consistency under suitable conditions. Even when Markov models are not valid for the data, we develop methods for constructing classifiers based on linear functions of occurrences of words, where the word length is chosen by cross-validation. Such linear classifiers are constructed using ideas of support vector machines, regression depth, and distance weighted discrimination. We show that classifiers with linear class boundaries have certain optimal properties in terms of their asymptotic misclassification probabilities. The performance of these classifiers is demonstrated in various simulated and benchmark data sets.

  1. Linear discriminant analysis of character sequences using occurrences of words

    KAUST Repository

    Dutta, Subhajit

    2014-02-01

    Classification of character sequences, where the characters come from a finite set, arises in disciplines such as molecular biology and computer science. For discriminant analysis of such character sequences, the Bayes classifier based on Markov models turns out to have class boundaries defined by linear functions of occurrences of words in the sequences. It is shown that for such classifiers based on Markov models with unknown orders, if the orders are estimated from the data using cross-validation, the resulting classifier has Bayes risk consistency under suitable conditions. Even when Markov models are not valid for the data, we develop methods for constructing classifiers based on linear functions of occurrences of words, where the word length is chosen by cross-validation. Such linear classifiers are constructed using ideas of support vector machines, regression depth, and distance weighted discrimination. We show that classifiers with linear class boundaries have certain optimal properties in terms of their asymptotic misclassification probabilities. The performance of these classifiers is demonstrated in various simulated and benchmark data sets.

  2. Semi-supervised learning for ordinal Kernel Discriminant Analysis.

    Science.gov (United States)

    Pérez-Ortiz, M; Gutiérrez, P A; Carbonero-Ruz, M; Hervás-Martínez, C

    2016-12-01

    Ordinal classification considers those classification problems where the labels of the variable to predict follow a given order. Naturally, labelled data is scarce or difficult to obtain in this type of problems because, in many cases, ordinal labels are given by a user or expert (e.g. in recommendation systems). Firstly, this paper develops a new strategy for ordinal classification where both labelled and unlabelled data are used in the model construction step (a scheme which is referred to as semi-supervised learning). More specifically, the ordinal version of kernel discriminant learning is extended for this setting considering the neighbourhood information of unlabelled data, which is proposed to be computed in the feature space induced by the kernel function. Secondly, a new method for semi-supervised kernel learning is devised in the context of ordinal classification, which is combined with our developed classification strategy to optimise the kernel parameters. The experiments conducted compare 6 different approaches for semi-supervised learning in the context of ordinal classification in a battery of 30 datasets, showing (1) the good synergy of the ordinal version of discriminant analysis and the use of unlabelled data and (2) the advantage of computing distances in the feature space induced by the kernel function. Copyright © 2016 Elsevier Ltd. All rights reserved.

  3. Anti-discrimination Analysis Using Privacy Attack Strategies

    KAUST Repository

    Ruggieri, Salvatore

    2014-09-15

    Social discrimination discovery from data is an important task to identify illegal and unethical discriminatory patterns towards protected-by-law groups, e.g., ethnic minorities. We deploy privacy attack strategies as tools for discrimination discovery under hard assumptions which have rarely tackled in the literature: indirect discrimination discovery, privacy-aware discrimination discovery, and discrimination data recovery. The intuition comes from the intriguing parallel between the role of the anti-discrimination authority in the three scenarios above and the role of an attacker in private data publishing. We design strategies and algorithms inspired/based on Frèchet bounds attacks, attribute inference attacks, and minimality attacks to the purpose of unveiling hidden discriminatory practices. Experimental results show that they can be effective tools in the hands of anti-discrimination authorities.

  4. HOMOGENEOUS UGRIZ PHOTOMETRY FOR ACS VIRGO CLUSTER SURVEY GALAXIES: A NON-PARAMETRIC ANALYSIS FROM SDSS IMAGING

    International Nuclear Information System (INIS)

    Chen, Chin-Wei; Cote, Patrick; Ferrarese, Laura; West, Andrew A.; Peng, Eric W.

    2010-01-01

    We present photometric and structural parameters for 100 ACS Virgo Cluster Survey (ACSVCS) galaxies based on homogeneous, multi-wavelength (ugriz), wide-field SDSS (DR5) imaging. These early-type galaxies, which trace out the red sequence in the Virgo Cluster, span a factor of nearly ∼10 3 in g-band luminosity. We describe an automated pipeline that generates background-subtracted mosaic images, masks field sources and measures mean shapes, total magnitudes, effective radii, and effective surface brightnesses using a model-independent approach. A parametric analysis of the surface brightness profiles is also carried out to obtain Sersic-based structural parameters and mean galaxy colors. We compare the galaxy parameters to those in the literature, including those from the ACSVCS, finding good agreement in most cases, although the sizes of the brightest, and most extended, galaxies are found to be most uncertain and model dependent. Our photometry provides an external measurement of the random errors on total magnitudes from the widely used Virgo Cluster Catalog, which we estimate to be σ(B T )∼ 0.13 mag for the brightest galaxies, rising to ∼ 0.3 mag for galaxies at the faint end of our sample (B T ∼ 16). The distribution of axial ratios of low-mass ( d warf ) galaxies bears a strong resemblance to the one observed for the higher-mass ( g iant ) galaxies. The global structural parameters for the full galaxy sample-profile shape, effective radius, and mean surface brightness-are found to vary smoothly and systematically as a function of luminosity, with unmistakable evidence for changes in structural homology along the red sequence. As noted in previous studies, the ugriz galaxy colors show a nonlinear but smooth variation over a ∼7 mag range in absolute magnitude, with an enhanced scatter for the faintest systems that is likely the signature of their more diverse star formation histories.

  5. How discriminating are discriminative instruments?

    Science.gov (United States)

    Hankins, Matthew

    2008-05-27

    The McMaster framework introduced by Kirshner & Guyatt is the dominant paradigm for the development of measures of health status and health-related quality of life (HRQL). The framework defines the functions of such instruments as evaluative, predictive or discriminative. Evaluative instruments are required to be sensitive to change (responsiveness), but there is no corresponding index of the degree to which discriminative instruments are sensitive to cross-sectional differences. This paper argues that indices of validity and reliability are not sufficient to demonstrate that a discriminative instrument performs its function of discriminating between individuals, and that the McMaster framework would be augmented by the addition of a separate index of discrimination. The coefficient proposed by Ferguson (Delta) is easily adapted to HRQL instruments and is a direct, non-parametric index of the degree to which an instrument distinguishes between individuals. While Delta should prove useful in the development and evaluation of discriminative instruments, further research is required to elucidate the relationship between the measurement properties of discrimination, reliability and responsiveness.

  6. How discriminating are discriminative instruments?

    Directory of Open Access Journals (Sweden)

    Hankins Matthew

    2008-05-01

    Full Text Available Abstract The McMaster framework introduced by Kirshner & Guyatt is the dominant paradigm for the development of measures of health status and health-related quality of life (HRQL. The framework defines the functions of such instruments as evaluative, predictive or discriminative. Evaluative instruments are required to be sensitive to change (responsiveness, but there is no corresponding index of the degree to which discriminative instruments are sensitive to cross-sectional differences. This paper argues that indices of validity and reliability are not sufficient to demonstrate that a discriminative instrument performs its function of discriminating between individuals, and that the McMaster framework would be augmented by the addition of a separate index of discrimination. The coefficient proposed by Ferguson (Delta is easily adapted to HRQL instruments and is a direct, non-parametric index of the degree to which an instrument distinguishes between individuals. While Delta should prove useful in the development and evaluation of discriminative instruments, further research is required to elucidate the relationship between the measurement properties of discrimination, reliability and responsiveness.

  7. On discriminant analysis techniques and correlation structures in high dimensions

    DEFF Research Database (Denmark)

    Clemmensen, Line Katrine Harder

    This paper compares several recently proposed techniques for performing discriminant analysis in high dimensions, and illustrates that the various sparse methods dier in prediction abilities depending on their underlying assumptions about the correlation structures in the data. The techniques...... the methods in two: Those who assume independence between the variables and thus use a diagonal estimate of the within-class covariance matrix, and those who assume dependence between the variables and thus use an estimate of the within-class covariance matrix, which also estimates the correlations between...... variables. The two groups of methods are compared and the pros and cons are exemplied using dierent cases of simulated data. The results illustrate that the estimate of the covariance matrix is an important factor with respect to choice of method, and the choice of method should thus be driven by the nature...

  8. Field-scale sensitivity of vegetation discrimination to hyperspectral reflectance and coupled statistics

    DEFF Research Database (Denmark)

    Manevski, Kiril; Jabloun, Mohamed; Gupta, Manika

    2016-01-01

    a more powerful input to a nonparametric analysis for discrimination at the field scale, when compared with unaltered reflectance and parametric analysis. However, the discrimination outputs interact and are very sensitive to the number of observations - an important implication for the design......Remote sensing of land covers utilizes an increasing number of methods for spectral reflectance processing and its accompanying statistics to discriminate between the covers’ spectral signatures at various scales. To this end, the present chapter deals with the field-scale sensitivity...... of the vegetation spectral discrimination to the most common types of reflectance (unaltered and continuum-removed) and statistical tests (parametric and nonparametric analysis of variance). It is divided into two distinct parts. The first part summarizes the current knowledge in relation to vegetation...

  9. Asymptotic performance of regularized quadratic discriminant analysis based classifiers

    KAUST Repository

    Elkhalil, Khalil

    2017-12-13

    This paper carries out a large dimensional analysis of the standard regularized quadratic discriminant analysis (QDA) classifier designed on the assumption that data arise from a Gaussian mixture model. The analysis relies on fundamental results from random matrix theory (RMT) when both the number of features and the cardinality of the training data within each class grow large at the same pace. Under some mild assumptions, we show that the asymptotic classification error converges to a deterministic quantity that depends only on the covariances and means associated with each class as well as the problem dimensions. Such a result permits a better understanding of the performance of regularized QDA and can be used to determine the optimal regularization parameter that minimizes the misclassification error probability. Despite being valid only for Gaussian data, our theoretical findings are shown to yield a high accuracy in predicting the performances achieved with real data sets drawn from popular real data bases, thereby making an interesting connection between theory and practice.

  10. An Analysis of Discrimination by Real Estate Brokers.

    Science.gov (United States)

    Yinger, John

    This paper focuses on designing policies to eliminate discrimination in the sale of single-family houses by analyzing the behavior of the agents who actually do most of the discriminating, namely real estate agents. Discriminatory practices are said to be supported by policies of house builders, lending institutions, and government, and by the…

  11. A nonparametric mixture model for cure rate estimation.

    Science.gov (United States)

    Peng, Y; Dear, K B

    2000-03-01

    Nonparametric methods have attracted less attention than their parametric counterparts for cure rate analysis. In this paper, we study a general nonparametric mixture model. The proportional hazards assumption is employed in modeling the effect of covariates on the failure time of patients who are not cured. The EM algorithm, the marginal likelihood approach, and multiple imputations are employed to estimate parameters of interest in the model. This model extends models and improves estimation methods proposed by other researchers. It also extends Cox's proportional hazards regression model by allowing a proportion of event-free patients and investigating covariate effects on that proportion. The model and its estimation method are investigated by simulations. An application to breast cancer data, including comparisons with previous analyses using a parametric model and an existing nonparametric model by other researchers, confirms the conclusions from the parametric model but not those from the existing nonparametric model.

  12. Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers

    Directory of Open Access Journals (Sweden)

    Stochl Jan

    2012-06-01

    Full Text Available Abstract Background Mokken scaling techniques are a useful tool for researchers who wish to construct unidimensional tests or use questionnaires that comprise multiple binary or polytomous items. The stochastic cumulative scaling model offered by this approach is ideally suited when the intention is to score an underlying latent trait by simple addition of the item response values. In our experience, the Mokken model appears to be less well-known than for example the (related Rasch model, but is seeing increasing use in contemporary clinical research and public health. Mokken's method is a generalisation of Guttman scaling that can assist in the determination of the dimensionality of tests or scales, and enables consideration of reliability, without reliance on Cronbach's alpha. This paper provides a practical guide to the application and interpretation of this non-parametric item response theory method in empirical research with health and well-being questionnaires. Methods Scalability of data from 1 a cross-sectional health survey (the Scottish Health Education Population Survey and 2 a general population birth cohort study (the National Child Development Study illustrate the method and modeling steps for dichotomous and polytomous items respectively. The questionnaire data analyzed comprise responses to the 12 item General Health Questionnaire, under the binary recoding recommended for screening applications, and the ordinal/polytomous responses to the Warwick-Edinburgh Mental Well-being Scale. Results and conclusions After an initial analysis example in which we select items by phrasing (six positive versus six negatively worded items we show that all items from the 12-item General Health Questionnaire (GHQ-12 – when binary scored – were scalable according to the double monotonicity model, in two short scales comprising six items each (Bech’s “well-being” and “distress” clinical scales. An illustration of ordinal item analysis

  13. Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers.

    Science.gov (United States)

    Stochl, Jan; Jones, Peter B; Croudace, Tim J

    2012-06-11

    Mokken scaling techniques are a useful tool for researchers who wish to construct unidimensional tests or use questionnaires that comprise multiple binary or polytomous items. The stochastic cumulative scaling model offered by this approach is ideally suited when the intention is to score an underlying latent trait by simple addition of the item response values. In our experience, the Mokken model appears to be less well-known than for example the (related) Rasch model, but is seeing increasing use in contemporary clinical research and public health. Mokken's method is a generalisation of Guttman scaling that can assist in the determination of the dimensionality of tests or scales, and enables consideration of reliability, without reliance on Cronbach's alpha. This paper provides a practical guide to the application and interpretation of this non-parametric item response theory method in empirical research with health and well-being questionnaires. Scalability of data from 1) a cross-sectional health survey (the Scottish Health Education Population Survey) and 2) a general population birth cohort study (the National Child Development Study) illustrate the method and modeling steps for dichotomous and polytomous items respectively. The questionnaire data analyzed comprise responses to the 12 item General Health Questionnaire, under the binary recoding recommended for screening applications, and the ordinal/polytomous responses to the Warwick-Edinburgh Mental Well-being Scale. After an initial analysis example in which we select items by phrasing (six positive versus six negatively worded items) we show that all items from the 12-item General Health Questionnaire (GHQ-12)--when binary scored--were scalable according to the double monotonicity model, in two short scales comprising six items each (Bech's "well-being" and "distress" clinical scales). An illustration of ordinal item analysis confirmed that all 14 positively worded items of the Warwick-Edinburgh Mental

  14. Testing discontinuities in nonparametric regression

    KAUST Repository

    Dai, Wenlin

    2017-01-19

    In nonparametric regression, it is often needed to detect whether there are jump discontinuities in the mean function. In this paper, we revisit the difference-based method in [13 H.-G. Müller and U. Stadtmüller, Discontinuous versus smooth regression, Ann. Stat. 27 (1999), pp. 299–337. doi: 10.1214/aos/1018031100

  15. Testing discontinuities in nonparametric regression

    KAUST Repository

    Dai, Wenlin; Zhou, Yuejin; Tong, Tiejun

    2017-01-01

    In nonparametric regression, it is often needed to detect whether there are jump discontinuities in the mean function. In this paper, we revisit the difference-based method in [13 H.-G. Müller and U. Stadtmüller, Discontinuous versus smooth regression, Ann. Stat. 27 (1999), pp. 299–337. doi: 10.1214/aos/1018031100

  16. Discriminant analysis of maintaining a vertical position in the water

    Directory of Open Access Journals (Sweden)

    Bratuša Zoran

    2015-01-01

    Full Text Available Water polo is the only sports game that takes place in the water. During the outplay, a vertical body position with the two basic mechanisms of the leg work - a breaststroke leg kick and an eggbeater leg kick, prevails. Starting from the significance of a vertical position during the game play, the methods of assessing physical preparedness of the athletes of all the categories also include the evaluation of maintaining a vertical position and consequently the load of the leg muscles. The measurements are performed during the maintenance of a vertical position (swimming in place through one of the specified mechanisms of leg work, i.e. a vertical position technique. The aim of this paper was to determine the application of different mechanisms of the leg kicks in maintaining a vertical position with young water polo players in relation to their position. The study included 29 selected junior water polo players (age_15.8 ± 0.8 years; BH_185.2 ± 5.3cm and BW_81.7 ± 7.7kg. The measurements were performed during the tests of swimming in place at the maximum intensity lasting 10 seconds, by the breaststroke and eggbeater leg kicks. The isometric tensiometry tests were used for the measurements. The results were analysed by the application of descriptive statistics, and the kinetic selection characteristic was defined by the application of discriminant analysis. Higher average values were achieved with the breaststroke leg kick technique Fmax, ImpF and RFD (avgFmaxLEGGBK =157.46±19.93N; avgImpF_LEGGBK =45.43±10.64Ns; avgRFD_LEGGBK=337.85±80.73N/s; avgFmaxLBKICK=227.18±49.17N; avgImpF_LBKICK=55.99±14.59Ns; avgRFD_LBKICK=545.47±159.15N/s. After discriminant analysis, the results have shown that the eggbeater leg kick is a selection technique, whereas the force - Fmax is a kinetic selection variable. Based on the obtained results and the analyses performed it may be concluded that a training factor dominant for maintaining a vertical position by

  17. Sparse Regression by Projection and Sparse Discriminant Analysis

    KAUST Repository

    Qi, Xin

    2015-04-03

    © 2015, © American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America. Recent years have seen active developments of various penalized regression methods, such as LASSO and elastic net, to analyze high-dimensional data. In these approaches, the direction and length of the regression coefficients are determined simultaneously. Due to the introduction of penalties, the length of the estimates can be far from being optimal for accurate predictions. We introduce a new framework, regression by projection, and its sparse version to analyze high-dimensional data. The unique nature of this framework is that the directions of the regression coefficients are inferred first, and the lengths and the tuning parameters are determined by a cross-validation procedure to achieve the largest prediction accuracy. We provide a theoretical result for simultaneous model selection consistency and parameter estimation consistency of our method in high dimension. This new framework is then generalized such that it can be applied to principal components analysis, partial least squares, and canonical correlation analysis. We also adapt this framework for discriminant analysis. Compared with the existing methods, where there is relatively little control of the dependency among the sparse components, our method can control the relationships among the components. We present efficient algorithms and related theory for solving the sparse regression by projection problem. Based on extensive simulations and real data analysis, we demonstrate that our method achieves good predictive performance and variable selection in the regression setting, and the ability to control relationships between the sparse components leads to more accurate classification. In supplementary materials available online, the details of the algorithms and theoretical proofs, and R codes for all simulation studies are provided.

  18. Discrimination of ginseng cultivation regions using light stable isotope analysis.

    Science.gov (United States)

    Kim, Kiwook; Song, Joo-Hyun; Heo, Sang-Cheol; Lee, Jin-Hee; Jung, In-Woo; Min, Ji-Sook

    2015-10-01

    Korean ginseng is considered to be a precious health food in Asia. Today, thieves frequently compromise ginseng farms by pervasive theft. Thus, studies regarding the characteristics of ginseng according to growth region are required in order to deter ginseng thieves and prevent theft. In this study, 6 regions were selected on the basis of Korea regional criteria (si, gun, gu), and two ginseng-farms were randomly selected from each of the 6 regions. Then 4-6 samples of ginseng were acquired from each ginseng farm. The stable isotopic compositions of H, O, C, and N of the collected ginseng samples were analyzed. As a result, differences in the hydrogen isotope ratios could be used to distinguish regional differences, and differences in the nitrogen isotope ratios yielded characteristic information regarding the farms from which the samples were obtained. Thus, stable isotope values could be used to differentiate samples according to regional differences. Therefore, stable isotope analysis serves as a powerful tool to discriminate the regional origin of Korean ginseng samples from across Korea. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  19. ANALYSIS ON WOMEN DISCRIMINATION IN THE LABOUR MARKET IN ROMANIA

    OpenAIRE

    Victoria-Mihaela Brînzea

    2011-01-01

    Eliminating gender-based discrimination is one of the important prerequisite for building a fair society; this can be achieved only through the active involvement of the authorities and of each person. Although during recent years there have been positive changes in the relationships between men and women, improving women's situation to some extent, it can be said that discrimination based on social gender was reduced but not eliminated entirely, equality of chances having not been achieved e...

  20. Nonparametric Inference for Periodic Sequences

    KAUST Repository

    Sun, Ying

    2012-02-01

    This article proposes a nonparametric method for estimating the period and values of a periodic sequence when the data are evenly spaced in time. The period is estimated by a "leave-out-one-cycle" version of cross-validation (CV) and complements the periodogram, a widely used tool for period estimation. The CV method is computationally simple and implicitly penalizes multiples of the smallest period, leading to a "virtually" consistent estimator of integer periods. This estimator is investigated both theoretically and by simulation.We also propose a nonparametric test of the null hypothesis that the data have constantmean against the alternative that the sequence of means is periodic. Finally, our methodology is demonstrated on three well-known time series: the sunspots and lynx trapping data, and the El Niño series of sea surface temperatures. © 2012 American Statistical Association and the American Society for Quality.

  1. Decompounding random sums: A nonparametric approach

    DEFF Research Database (Denmark)

    Hansen, Martin Bøgsted; Pitts, Susan M.

    Observations from sums of random variables with a random number of summands, known as random, compound or stopped sums arise within many areas of engineering and science. Quite often it is desirable to infer properties of the distribution of the terms in the random sum. In the present paper we...... review a number of applications and consider the nonlinear inverse problem of inferring the cumulative distribution function of the components in the random sum. We review the existing literature on non-parametric approaches to the problem. The models amenable to the analysis are generalized considerably...

  2. Nonparametric predictive inference in reliability

    International Nuclear Information System (INIS)

    Coolen, F.P.A.; Coolen-Schrijner, P.; Yan, K.J.

    2002-01-01

    We introduce a recently developed statistical approach, called nonparametric predictive inference (NPI), to reliability. Bounds for the survival function for a future observation are presented. We illustrate how NPI can deal with right-censored data, and discuss aspects of competing risks. We present possible applications of NPI for Bernoulli data, and we briefly outline applications of NPI for replacement decisions. The emphasis is on introduction and illustration of NPI in reliability contexts, detailed mathematical justifications are presented elsewhere

  3. Thyroid nodule classification using ultrasound elastography via linear discriminant analysis.

    Science.gov (United States)

    Luo, Si; Kim, Eung-Hun; Dighe, Manjiri; Kim, Yongmin

    2011-05-01

    The non-surgical diagnosis of thyroid nodules is currently made via a fine needle aspiration (FNA) biopsy. It is estimated that somewhere between 250,000 and 300,000 thyroid FNA biopsies are performed in the United States annually. However, a large percentage (approximately 70%) of these biopsies turn out to be benign. Since the aggressive FNA management of thyroid nodules is costly, quantitative risk assessment and stratification of a nodule's malignancy is of value in triage and more appropriate healthcare resources utilization. In this paper, we introduce a new method for classifying the thyroid nodules based on the ultrasound (US) elastography features. Unlike approaches to assess the stiffness of a thyroid nodule by visually inspecting the pseudo-color pattern in the strain image, we use a classification algorithm to stratify the nodule by using the power spectrum of strain rate waveform extracted from the US elastography image sequence. Pulsation from the carotid artery was used to compress the thyroid nodules. Ultrasound data previously acquired from 98 thyroid nodules were used in this retrospective study to evaluate our classification algorithm. A classifier was developed based on the linear discriminant analysis (LDA) and used to differentiate the thyroid nodules into two types: (I) no FNA (observation-only) and (II) FNA. Using our method, 62 nodules were classified as type I, all of which were benign, while 36 nodules were classified as Type-II, 16 malignant and 20 benign, resulting in a sensitivity of 100% and specificity of 75.6% in detecting malignant thyroid nodules. This indicates that our triage method based on US elastography has the potential to substantially reduce the number of FNA biopsies (63.3%) by detecting benign nodules and managing them via follow-up observations rather than an FNA biopsy. Published by Elsevier B.V.

  4. WOMEN RESISTANCE TOWARD DISCRIMINATIONS: A MODERN LITERARY WORK ANALYSIS ON FEMINISM REVIEW IN BEKISAR MERAH

    Directory of Open Access Journals (Sweden)

    Mujiono .

    2016-02-01

    Full Text Available This study was conducted to discover the discriminations against women in the Bekisar Merah novel and how they formulate resistance to those discriminations. To address the above objective, this study used descriptive qualitative research design with a feminism approach. Source of the data in this study was the second edition of Bekisar Merah novel written by Ahmad Tohari. The data included were words, phrases, sentences, and paragraphs on Bekisar Merah which portray womens discrimination toward Lasi, the women figure in the novel, and power types formulated by her who resisted the discrimination. To analyze the data, content analysis was applied. Triangulation was used to ensure the trustworthiness of the data. The result of the study showed eight forms of discriminations and three resistances. The discriminations were domestic abuse, molestation, gender harassment, seduction behavior, imposition, coercion, bribery, and subordination. The resistances were physically, mentally, and verbally.

  5. Non-Discrimination à la Cour: the ECJ’s (lack of) Comparability Analysis in Direct Tax Cases

    NARCIS (Netherlands)

    Wattel, P.

    2015-01-01

    The ECJ’s discrimination analysis in direct tax cases is inconsistent. It sometimes creates discrimination, condemns non-existent discrimination or fails to address discrimination. Only one comparability standard makes sense: to be (subject to tax) or not to be (subject to tax). The ECJ is not

  6. Discrimination of Transgenic Rice Based on Near Infrared Reflectance Spectroscopy and Partial Least Squares Regression Discriminant Analysis

    Directory of Open Access Journals (Sweden)

    ZHANG Long

    2015-09-01

    Full Text Available Near infrared reflectance spectroscopy (NIRS, a non-destructive measurement technique, was combined with partial least squares regression discrimiant analysis (PLS-DA to discriminate the transgenic (TCTP and mi166 and wild type (Zhonghua 11 rice. Furthermore, rice lines transformed with protein gene (OsTCTP and regulation gene (Osmi166 were also discriminated by the NIRS method. The performances of PLS-DA in spectral ranges of 4 000–8 000 cm-1 and 4 000–10 000 cm-1 were compared to obtain the optimal spectral range. As a result, the transgenic and wild type rice were distinguished from each other in the range of 4 000–10 000 cm-1, and the correct classification rate was 100.0% in the validation test. The transgenic rice TCTP and mi166 were also distinguished from each other in the range of 4 000–10 000 cm-1, and the correct classification rate was also 100.0%. In conclusion, NIRS combined with PLS-DA can be used for the discrimination of transgenic rice.

  7. Regularized generalized eigen-decomposition with applications to sparse supervised feature extraction and sparse discriminant analysis

    DEFF Research Database (Denmark)

    Han, Xixuan; Clemmensen, Line Katrine Harder

    2015-01-01

    We propose a general technique for obtaining sparse solutions to generalized eigenvalue problems, and call it Regularized Generalized Eigen-Decomposition (RGED). For decades, Fisher's discriminant criterion has been applied in supervised feature extraction and discriminant analysis, and it is for...

  8. Declining Bias and Gender Wage Discrimination? A Meta-Regression Analysis

    Science.gov (United States)

    Jarrell, Stephen B.; Stanley, T. D.

    2004-01-01

    The meta-regression analysis reveals that there is a strong tendency for discrimination estimates to fall and wage discrimination exist against the woman. The biasing effect of researchers' gender of not correcting for selection bias has weakened and changes in labor market have made it less important.

  9. Quantitative Phylogenomics of Within-Species Mitogenome Variation: Monte Carlo and Non-Parametric Analysis of Phylogeographic Structure among Discrete Transatlantic Breeding Areas of Harp Seals (Pagophilus groenlandicus.

    Directory of Open Access Journals (Sweden)

    Steven M Carr

    -stepping-stone biogeographic models, but not a simple 1-step trans-Atlantic model. Plots of the cumulative pairwise sequence difference curves among seals in each of the four populations provide continuous proxies for phylogenetic diversification within each. Non-parametric Kolmogorov-Smirnov (K-S tests of maximum pairwise differences between these curves indicates that the Greenland Sea population has a markedly younger phylogenetic structure than either the White Sea population or the two Northwest Atlantic populations, which are of intermediate age and homogeneous structure. The Monte Carlo and K-S assessments provide sensitive quantitative tests of within-species mitogenomic phylogeography. This is the first study to indicate that the White Sea and Greenland Sea populations have different population genetic histories. The analysis supports the hypothesis that Harp Seals comprises three genetically distinguishable breeding populations, in the White Sea, Greenland Sea, and Northwest Atlantic. Implications for an ice-dependent species during ongoing climate change are discussed.

  10. Non-parametric smoothing of experimental data

    International Nuclear Information System (INIS)

    Kuketayev, A.T.; Pen'kov, F.M.

    2007-01-01

    Full text: Rapid processing of experimental data samples in nuclear physics often requires differentiation in order to find extrema. Therefore, even at the preliminary stage of data analysis, a range of noise reduction methods are used to smooth experimental data. There are many non-parametric smoothing techniques: interval averages, moving averages, exponential smoothing, etc. Nevertheless, it is more common to use a priori information about the behavior of the experimental curve in order to construct smoothing schemes based on the least squares techniques. The latter methodology's advantage is that the area under the curve can be preserved, which is equivalent to conservation of total speed of counting. The disadvantages of this approach include the lack of a priori information. For example, very often the sums of undifferentiated (by a detector) peaks are replaced with one peak during the processing of data, introducing uncontrolled errors in the determination of the physical quantities. The problem is solvable only by having experienced personnel, whose skills are much greater than the challenge. We propose a set of non-parametric techniques, which allows the use of any additional information on the nature of experimental dependence. The method is based on a construction of a functional, which includes both experimental data and a priori information. Minimum of this functional is reached on a non-parametric smoothed curve. Euler (Lagrange) differential equations are constructed for these curves; then their solutions are obtained analytically or numerically. The proposed approach allows for automated processing of nuclear physics data, eliminating the need for highly skilled laboratory personnel. Pursuant to the proposed approach is the possibility to obtain smoothing curves in a given confidence interval, e.g. according to the χ 2 distribution. This approach is applicable when constructing smooth solutions of ill-posed problems, in particular when solving

  11. Statistical analysis of agarwood oil compounds in discriminating the ...

    African Journals Online (AJOL)

    Enhancing and improving the discrimination technique is the main aim to determine or grade the good quality of agarwood oil. In this paper, all statistical works were performed via SPSS software. Two parameters involved are abundance of compound (%) and quality of t agarwood oil either low or high quality. The result ...

  12. Logistic discriminant analysis of breast cancer using ultrasound measurement

    International Nuclear Information System (INIS)

    Abdolmaleki, P.; Mokhtari Dizaji, M.; Vahead, M.R.; Gity, M.

    2004-01-01

    Background: Logistic discriminant method was applied to differentiate malignant from benign in a group of patients with proved breast lesions of the basis of ultrasonic parameters. Materials and methods: Our database include 273 patients' ultrasonographic pictures consisting of 14 quantitative variables. The measured variables were ultrasound propagation velocity, acoustic impedance and attenuation coefficient at 10 MHz in breast lesions at 20, 25, 30 and 35 d ig c temperature, physical density and age. This database was randomly divided into the estimation of 201 and validation of 72 samples. The estimation samples were used to build the logistic discriminant model, and validation samples were used to validate the performance. Finally important criteria such as sensitivity, specificity, accuracy and area under the receiver operating characteristic curve were evaluated. Results: Our results showed that the logistic discriminant method was able to classify correctly 67 out of 72 cases presented in the validation sample. The results indicate a remarkable diagnostic accuracy of 93%. Conclusion: A logistic discriminator approach is capable of predicting the probability of malignancy of breast cancer. Features from ultrasonic measurement on ultrasound imaging is used in this approach

  13. An Application of Monte-Carlo-Based Sensitivity Analysis on the Overlap in Discriminant Analysis

    Directory of Open Access Journals (Sweden)

    S. Razmyan

    2012-01-01

    Full Text Available Discriminant analysis (DA is used for the measurement of estimates of a discriminant function by minimizing their group misclassifications to predict group membership of newly sampled data. A major source of misclassification in DA is due to the overlapping of groups. The uncertainty in the input variables and model parameters needs to be properly characterized in decision making. This study combines DEA-DA with a sensitivity analysis approach to an assessment of the influence of banks’ variables on the overall variance in overlap in a DA in order to determine which variables are most significant. A Monte-Carlo-based sensitivity analysis is considered for computing the set of first-order sensitivity indices of the variables to estimate the contribution of each uncertain variable. The results show that the uncertainties in the loans granted and different deposit variables are more significant than uncertainties in other banks’ variables in decision making.

  14. Classification of astrocyto-mas and meningiomas using statistical discriminant analysis on MRI data

    International Nuclear Information System (INIS)

    Siromoney, Anna; Prasad, G.N.S.; Raghuram, Lakshminarayan; Korah, Ipeson; Siromoney, Arul; Chandrasekaran, R.

    2001-01-01

    The objective of this study was to investigate the usefulness of Multivariate Discriminant Analysis for classifying two groups of primary brain tumours, astrocytomas and meningiomas, from Magnetic Resonance Images. Discriminant analysis is a multivariate technique concerned with separating distinct sets of objects and with allocating new objects to previously defined groups. Allocation or classification rules are usually developed from learning examples in a supervised learning environment. Data from signal intensity measurements in the multiple scan performed on each patient in routine clinical scanning was analysed using Fisher's Classification, which is one method of discriminant analysis

  15. Nonparametric identification of copula structures

    KAUST Repository

    Li, Bo

    2013-06-01

    We propose a unified framework for testing a variety of assumptions commonly made about the structure of copulas, including symmetry, radial symmetry, joint symmetry, associativity and Archimedeanity, and max-stability. Our test is nonparametric and based on the asymptotic distribution of the empirical copula process.We perform simulation experiments to evaluate our test and conclude that our method is reliable and powerful for assessing common assumptions on the structure of copulas, particularly when the sample size is moderately large. We illustrate our testing approach on two datasets. © 2013 American Statistical Association.

  16. Application of nonparametric statistic method for DNBR limit calculation

    International Nuclear Information System (INIS)

    Dong Bo; Kuang Bo; Zhu Xuenong

    2013-01-01

    Background: Nonparametric statistical method is a kind of statistical inference method not depending on a certain distribution; it calculates the tolerance limits under certain probability level and confidence through sampling methods. The DNBR margin is one important parameter of NPP design, which presents the safety level of NPP. Purpose and Methods: This paper uses nonparametric statistical method basing on Wilks formula and VIPER-01 subchannel analysis code to calculate the DNBR design limits (DL) of 300 MW NPP (Nuclear Power Plant) during the complete loss of flow accident, simultaneously compared with the DL of DNBR through means of ITDP to get certain DNBR margin. Results: The results indicate that this method can gain 2.96% DNBR margin more than that obtained by ITDP methodology. Conclusions: Because of the reduction of the conservation during analysis process, the nonparametric statistical method can provide greater DNBR margin and the increase of DNBR margin is benefited for the upgrading of core refuel scheme. (authors)

  17. Comparing parametric and nonparametric regression methods for panel data

    DEFF Research Database (Denmark)

    Czekaj, Tomasz Gerard; Henningsen, Arne

    We investigate and compare the suitability of parametric and non-parametric stochastic regression methods for analysing production technologies and the optimal firm size. Our theoretical analysis shows that the most commonly used functional forms in empirical production analysis, Cobb......-Douglas and Translog, are unsuitable for analysing the optimal firm size. We show that the Translog functional form implies an implausible linear relationship between the (logarithmic) firm size and the elasticity of scale, where the slope is artificially related to the substitutability between the inputs....... The practical applicability of the parametric and non-parametric regression methods is scrutinised and compared by an empirical example: we analyse the production technology and investigate the optimal size of Polish crop farms based on a firm-level balanced panel data set. A nonparametric specification test...

  18. MR PROSTATE SEGMENTATION VIA DISTRIBUTED DISCRIMINATIVE DICTIONARY (DDD) LEARNING.

    Science.gov (United States)

    Guo, Yanrong; Zhan, Yiqiang; Gao, Yaozong; Jiang, Jianguo; Shen, Dinggang

    2013-01-01

    Segmenting prostate from MR images is important yet challenging. Due to non-Gaussian distribution of prostate appearances in MR images, the popular active appearance model (AAM) has its limited performance. Although the newly developed sparse dictionary learning method[1, 2] can model the image appearance in a non-parametric fashion, the learned dictionaries still lack the discriminative power between prostate and non-prostate tissues, which is critical for accurate prostate segmentation. In this paper, we propose to integrate deformable model with a novel learning scheme, namely the Distributed Discriminative Dictionary ( DDD ) learning, which can capture image appearance in a non-parametric and discriminative fashion. In particular, three strategies are designed to boost the tissue discriminative power of DDD. First , minimum Redundancy Maximum Relevance (mRMR) feature selection is performed to constrain the dictionary learning in a discriminative feature space. Second , linear discriminant analysis (LDA) is employed to assemble residuals from different dictionaries for optimal separation between prostate and non-prostate tissues. Third , instead of learning the global dictionaries, we learn a set of local dictionaries for the local regions (each with small appearance variations) along prostate boundary, thus achieving better tissue differentiation locally. In the application stage, DDDs will provide the appearance cues to robustly drive the deformable model onto the prostate boundary. Experiments on 50 MR prostate images show that our method can yield a Dice Ratio of 88% compared to the manual segmentations, and have 7% improvement over the conventional AAM.

  19. Discrimination based on HIV/AIDS status: A comparative analysis of ...

    African Journals Online (AJOL)

    Discrimination based on HIV/AIDS status: A comparative analysis of the Nigerian court's decision in Festus Odaife & Ors v Attorney General of the Federation & Ors with other Commonwealth jurisdictions.

  20. Analysis of Financial Ratio to Distinguish Indonesia Joint Venture General Insurance Company Performance using Discriminant Analysis

    Directory of Open Access Journals (Sweden)

    Subiakto Soekarno

    2012-01-01

    Full Text Available Insurance industry stands as a service business that plays a significant role in Indonesiaeconomical condition. The development of insurance industry in Indonesia, both of generalinsurance and life insurance, has increased very fast. The general insurance industry itselfdivided into two major players which are local private company and Joint Venture Company.Lately, the use of statistical techniques and financial ratios models to asses financial institutionsuch as insurance company have been used as one of the appropriate combination inpredicting the performance of an industry. This research aims to distinguish between JointVenture General Insurance Companies that have a good performance and those who are lessperforming well using Discriminant Analysis. Further, the findings led that DiscriminantAnalysis is able to distinguish Joint Venture General Insurance Companies that have a goodperformance and those who are not performing well. There are also six ratios which are RBC,Technical Reserve to Investment Ratio, Debt Ratio, Return on Equity, Loss Ratio, and ExpenseRatio that stand as the most influential ratios to distinguish the performance of joint venturegeneral insurance companies. In addition, the result suggest business people to be concernedtoward those six ratios, to increase their companies’ performance.Key words: general insurance, financial ratio, discriminant analysis

  1. Research on n-γ discrimination method based on spectrum gradient analysis of signals

    International Nuclear Information System (INIS)

    Luo Xiaoliang; Liu Guofu; Yang Jun; Wang Yueke

    2013-01-01

    Having discovered that there are distinct differences between the spectrum gradient of the output neutron and γ-ray signal from liquid scintillator detectors, this paper presented a n-γ discrimination method called spectrum gradient analysis (SGA) based on frequency-domain features of the pulse signals. The basic principle and feasibility of SGA method were discussed and the validity of n-γ discrimination results of SGA was verified by the associated particle neutron flight experiment. The discrimination performance of SGA was evaluated under different conditions of sampling rates ranging from 5 G/s to 250 M/s. The results show that SGA method exhibits insensitivity to noise, strong anti-interference ability, stable discrimination performance and lower amount of calculation in contrast with time-domain n-γ discrimination methods. (authors)

  2. Application of discriminant analysis and generalized distance measures to uranium exploration

    International Nuclear Information System (INIS)

    Beauchamp, J.J.; Begovich, C.L.; Kane, V.E.; Wolf, D.A.

    1980-01-01

    The National Uranium Resource Evaluation (NURE) Program has as its goal the estimation of the nation's uranium resources. It is possile to use discriminant analysis methods on hydrogeochemical data collected in the NURE Program to aid in fomulating geochemical models that can be used to identify the anomalous areas used in resource estimation. Discriminant' analysis methods have been applied to data from the Plainview, Texas Quadrangle which has approximately 850 groundwater samples with more than 40 quantitative measurements per sample. Discriminant analysis topics involving estimation of misclassification probabilities, variable selection, and robust discrimination are applied. A method using generalized distance measures is given which enables the assignment of samples to a background population or a mineralized population whose parameters were estimated from separate studies. Each topic is related to its relevance in identifying areas of possible interest to uranium exploration. However, the methodology presented here is applicable to the identification of regions associated with other types of resources. 8 figures, 3 tables

  3. Application of discriminant analysis and generalized distance measures to uranium exploration

    International Nuclear Information System (INIS)

    Beauchamp, J.J.; Begovich, C.L.; Kane, V.E.; Wolf, D.A.

    1979-10-01

    The National Uranium Resource Evaluation (NURE) Project has as its goal estimation of the nation's uranium resources. It is possible to use discriminant analysis methods on hydrogeochemical data collected in the NURE Program to aid in formulating geochemical models which can be used to identify the anomalous regions necessary for resource estimation. Discriminant analysis methods have been applied to data from the Plainview, Texas Quadrangle which has approximately 850 groundwater samples with more than 40 quantitative measurements per sample. Discriminant analysis topics involving estimation of misclassification probabilities, variable selection, and robust discrimination are applied. A method using generalized distance measures is given which enables assigning samples to a background population or a mineralized population whose parameters were estimated from separate studies. Each topic is related to its relevance in identifying areas of possible interest to uranium exploration

  4. PIXE analysis of fish otoliths. Application to fish stock discrimination

    International Nuclear Information System (INIS)

    Arai, Nobuaki; Sakamoto, Wataru; Tateno, Koji; Yoshida, Koji.

    1996-01-01

    PIXE was adopted to analyze trace elements in otoliths of Japanese flounder to discriminate among several local fish stocks. The otoliths were removed from samples caught at five different sea areas along with the coast of the Sea of Japan: Akita, Ishikawa, Kyoto (2 stations), and Fukuoka. Besides calcium as main component, strontium, manganese, and zinc were detected. Especially Sr concentrations were different among 4 areas except between 2 stations in Kyoto. It suggested that the fish in the 2 stations in Kyoto were the same stock differed to the others. (author)

  5. A contingency table approach to nonparametric testing

    CERN Document Server

    Rayner, JCW

    2000-01-01

    Most texts on nonparametric techniques concentrate on location and linear-linear (correlation) tests, with less emphasis on dispersion effects and linear-quadratic tests. Tests for higher moment effects are virtually ignored. Using a fresh approach, A Contingency Table Approach to Nonparametric Testing unifies and extends the popular, standard tests by linking them to tests based on models for data that can be presented in contingency tables.This approach unifies popular nonparametric statistical inference and makes the traditional, most commonly performed nonparametric analyses much more comp

  6. Gas Classification Using Combined Features Based on a Discriminant Analysis for an Electronic Nose

    Directory of Open Access Journals (Sweden)

    Sang-Il Choi

    2016-01-01

    Full Text Available This paper proposes a gas classification method for an electronic nose (e-nose system, for which combined features that have been configured through discriminant analysis are used. First, each global feature is extracted from the entire measurement section of the data samples, while the same process is applied to the local features of the section that corresponds to the stabilization, exposure, and purge stages. The discriminative information amounts in the individual features are then measured based on the discriminant analysis, and the combined features are subsequently composed by selecting the features that have a large amount of discriminative information. Regarding a variety of volatile organic compound data, the results of the experiment show that, in a noisy environment, the proposed method exhibits classification performance that is relatively excellent compared to the other feature types.

  7. Discrimination of handlebar grip samples by fourier transform infrared microspectroscopy analysis and statistics

    Directory of Open Access Journals (Sweden)

    Zeyu Lin

    2017-01-01

    Full Text Available In this paper, the authors presented a study on the discrimination of handlebar grip samples, to provide effective forensic science service for hit and run traffic cases. 50 bicycle handlebar grip samples, 49 electric bike handlebar grip samples, and 96 motorcycle handlebar grip samples have been randomly collected by the local police in Beijing (China. Fourier transform infrared microspectroscopy (FTIR was utilized as analytical technology. Then, target absorption selection, data pretreatment, and discrimination of linked samples and unlinked samples were chosen as three steps to improve the discrimination of FTIR spectrums collected from different handlebar grip samples. Principal component analysis and receiver operating characteristic curve were utilized to evaluate different data selection methods and different data pretreatment methods, respectively. It is possible to explore the evidential value of handlebar grip residue evidence through instrumental analysis and statistical treatments. It will provide a universal discrimination method for other forensic science samples as well.

  8. Parametric vs. Nonparametric Regression Modelling within Clinical Decision Support

    Czech Academy of Sciences Publication Activity Database

    Kalina, Jan; Zvárová, Jana

    2017-01-01

    Roč. 5, č. 1 (2017), s. 21-27 ISSN 1805-8698 R&D Projects: GA ČR GA17-01251S Institutional support: RVO:67985807 Keywords : decision support systems * decision rules * statistical analysis * nonparametric regression Subject RIV: IN - Informatics, Computer Science OBOR OECD: Statistics and probability

  9. Non-parametric tests of productive efficiency with errors-in-variables

    NARCIS (Netherlands)

    Kuosmanen, T.K.; Post, T.; Scholtes, S.

    2007-01-01

    We develop a non-parametric test of productive efficiency that accounts for errors-in-variables, following the approach of Varian. [1985. Nonparametric analysis of optimizing behavior with measurement error. Journal of Econometrics 30(1/2), 445-458]. The test is based on the general Pareto-Koopmans

  10. Women ministers' experiences of gender discrimination in the Lutheran Church : a discourse analysis

    OpenAIRE

    2011-01-01

    M.A. The aim of this psychological study was to uncover women minister’s experiences of gender discrimination in the Lutheran Church by using a discourse analysis. Three female participants, who are involved in ministry in the Lutheran Church, were interviewed about their experiences and perceptions of gender discrimination. The resultant texts were analysed using Parker’s (2005) steps to discourse analytic reading. The discourses that were discovered indicate that power struggles are prev...

  11. Separability Analysis of Sentinel-2A Multi-Spectral Instrument (MSI Data for Burned Area Discrimination

    Directory of Open Access Journals (Sweden)

    Haiyan Huang

    2016-10-01

    Full Text Available Biomass burning is a global phenomenon and systematic burned area mapping is of increasing importance for science and applications. With high spatial resolution and novelty in band design, the recently launched Sentinel-2A satellite provides a new opportunity for moderate spatial resolution burned area mapping. This study examines the performance of the Sentinel-2A Multi Spectral Instrument (MSI bands and derived spectral indices to differentiate between unburned and burned areas. For this purpose, five pairs of pre-fire and post-fire top of atmosphere (TOA reflectance and atmospherically corrected (surface reflectance images were studied. The pixel values of locations that were unburned in the first image and burned in the second image, as well as the values of locations that were unburned in both images which served as a control, were compared and the discrimination of individual bands and spectral indices were evaluated using parametric (transformed divergence and non-parametric (decision tree approaches. Based on the results, the most suitable MSI bands to detect burned areas are the 20 m near-infrared, short wave infrared and red-edge bands, while the performance of the spectral indices varied with location. The atmospheric correction only significantly influenced the separability of the visible wavelength bands. The results provide insights that are useful for developing Sentinel-2 burned area mapping algorithms.

  12. [Comparison of Discriminant Analysis and Decision Trees for the Detection of Subclinical Keratoconus].

    Science.gov (United States)

    Kleinhans, Sonja; Herrmann, Eva; Kohnen, Thomas; Bühren, Jens

    2017-08-15

    Background Iatrogenic keratectasia is one of the most dreaded complications of refractive surgery. In most cases, keratectasia develops after refractive surgery of eyes suffering from subclinical stages of keratoconus with few or no signs. Unfortunately, there has been no reliable procedure for the early detection of keratoconus. In this study, we used binary decision trees (recursive partitioning) to assess their suitability for discrimination between normal eyes and eyes with subclinical keratoconus. Patients and Methods The method of decision tree analysis was compared with discriminant analysis which has shown good results in previous studies. Input data were 32 eyes of 32 patients with newly diagnosed keratoconus in the contralateral eye and preoperative data of 10 eyes of 5 patients with keratectasia after laser in-situ keratomileusis (LASIK). The control group was made up of 245 normal eyes after LASIK and 12-month follow-up without any signs of iatrogenic keratectasia. Results Decision trees gave better accuracy and specificity than did discriminant analysis. The sensitivity of decision trees was lower than the sensitivity of discriminant analysis. Conclusion On the basis of the patient population of this study, decision trees did not prove to be superior to linear discriminant analysis for the detection of subclinical keratoconus. Georg Thieme Verlag KG Stuttgart · New York.

  13. Factors that Affect Poverty Areas in North Sumatera Using Discriminant Analysis

    Science.gov (United States)

    Nasution, D. H.; Bangun, P.; Sitepu, H. R.

    2018-04-01

    In Indonesia, especially North Sumatera, the problem of poverty is one of the fundamental problems that become the focus of government both central and local government. Although the poverty rate decreased but the fact is there are many people who are poor. Poverty happens covers several aspects such as education, health, demographics, and also structural and cultural. This research will discuss about several factors such as population density, Unemployment Rate, GDP per capita ADHK, ADHB GDP per capita, economic growth and life expectancy that affect poverty in Indonesia. To determine the factors that most influence and differentiate the level of poverty of the Regency/City North Sumatra used discriminant analysis method. Discriminant analysis is one multivariate analysis technique are used to classify the data into a group based on the dependent variable and independent variable. Using discriminant analysis, it is evident that the factor affecting poverty is Unemployment Rate.

  14. Global classification of human facial healthy skin using PLS discriminant analysis and clustering analysis.

    Science.gov (United States)

    Guinot, C; Latreille, J; Tenenhaus, M; Malvy, D J

    2001-04-01

    Today's classifications of healthy skin are predominantly based on a very limited number of skin characteristics, such as skin oiliness or susceptibility to sun exposure. The aim of the present analysis was to set up a global classification of healthy facial skin, using mathematical models. This classification is based on clinical, biophysical skin characteristics and self-reported information related to the skin, as well as the results of a theoretical skin classification assessed separately for the frontal and the malar zones of the face. In order to maximize the predictive power of the models with a minimum of variables, the Partial Least Square (PLS) discriminant analysis method was used. The resulting PLS components were subjected to clustering analyses to identify the plausible number of clusters and to group the individuals according to their proximities. Using this approach, four PLS components could be constructed and six clusters were found relevant. So, from the 36 hypothetical combinations of the theoretical skin types classification, we tended to a strengthened six classes proposal. Our data suggest that the association of the PLS discriminant analysis and the clustering methods leads to a valid and simple way to classify healthy human skin and represents a potentially useful tool for cosmetic and dermatological research.

  15. Optical selection of trace elements for discriminant analysis

    International Nuclear Information System (INIS)

    Rasmussen, S.E.; Erasmus, C.S.; Watterson, J.I.W.; Sellschop, J.P.F.

    This report describes different methods of element selection; a combination of stepwise multivariate analysis of variance for primary element selection, and principle component analysis regression for the element interrelationship analysis. These offer a satisfactory solution to the problem of element selection

  16. Meta-analysis of field experiments shows no change in racial discrimination in hiring over time.

    Science.gov (United States)

    Quillian, Lincoln; Pager, Devah; Hexel, Ole; Midtbøen, Arnfinn H

    2017-10-10

    This study investigates change over time in the level of hiring discrimination in US labor markets. We perform a meta-analysis of every available field experiment of hiring discrimination against African Americans or Latinos ( n = 28). Together, these studies represent 55,842 applications submitted for 26,326 positions. We focus on trends since 1989 ( n = 24 studies), when field experiments became more common and improved methodologically. Since 1989, whites receive on average 36% more callbacks than African Americans, and 24% more callbacks than Latinos. We observe no change in the level of hiring discrimination against African Americans over the past 25 years, although we find modest evidence of a decline in discrimination against Latinos. Accounting for applicant education, applicant gender, study method, occupational groups, and local labor market conditions does little to alter this result. Contrary to claims of declining discrimination in American society, our estimates suggest that levels of discrimination remain largely unchanged, at least at the point of hire.

  17. L1-norm kernel discriminant analysis via Bayes error bound optimization for robust feature extraction.

    Science.gov (United States)

    Zheng, Wenming; Lin, Zhouchen; Wang, Haixian

    2014-04-01

    A novel discriminant analysis criterion is derived in this paper under the theoretical framework of Bayes optimality. In contrast to the conventional Fisher's discriminant criterion, the major novelty of the proposed one is the use of L1 norm rather than L2 norm, which makes it less sensitive to the outliers. With the L1-norm discriminant criterion, we propose a new linear discriminant analysis (L1-LDA) method for linear feature extraction problem. To solve the L1-LDA optimization problem, we propose an efficient iterative algorithm, in which a novel surrogate convex function is introduced such that the optimization problem in each iteration is to simply solve a convex programming problem and a close-form solution is guaranteed to this problem. Moreover, we also generalize the L1-LDA method to deal with the nonlinear robust feature extraction problems via the use of kernel trick, and hereafter proposed the L1-norm kernel discriminant analysis (L1-KDA) method. Extensive experiments on simulated and real data sets are conducted to evaluate the effectiveness of the proposed method in comparing with the state-of-the-art methods.

  18. Aberrant functional connectivity for diagnosis of major depressive disorder: a discriminant analysis.

    Science.gov (United States)

    Cao, Longlong; Guo, Shuixia; Xue, Zhimin; Hu, Yong; Liu, Haihong; Mwansisya, Tumbwene E; Pu, Weidan; Yang, Bo; Liu, Chang; Feng, Jianfeng; Chen, Eric Y H; Liu, Zhening

    2014-02-01

    Aberrant brain functional connectivity patterns have been reported in major depressive disorder (MDD). It is unknown whether they can be used in discriminant analysis for diagnosis of MDD. In the present study we examined the efficiency of discriminant analysis of MDD by individualized computer-assisted diagnosis. Based on resting-state functional magnetic resonance imaging data, a new approach was adopted to investigate functional connectivity changes in 39 MDD patients and 37 well-matched healthy controls. By using the proposed feature selection method, we identified significant altered functional connections in patients. They were subsequently applied to our analysis as discriminant features using a support vector machine classification method. Furthermore, the relative contribution of functional connectivity was estimated. After subset selection of high-dimension features, the support vector machine classifier reached up to approximately 84% with leave-one-out training during the discrimination process. Through summarizing the classification contribution of functional connectivities, we obtained four obvious contribution modules: inferior orbitofrontal module, supramarginal gyrus module, inferior parietal lobule-posterior cingulated gyrus module and middle temporal gyrus-inferior temporal gyrus module. The experimental results demonstrated that the proposed method is effective in discriminating MDD patients from healthy controls. Functional connectivities might be useful as new biomarkers to assist clinicians in computer auxiliary diagnosis of MDD. © 2013 The Authors. Psychiatry and Clinical Neurosciences © 2013 Japanese Society of Psychiatry and Neurology.

  19. Nonparametric Bayesian Modeling of Complex Networks

    DEFF Research Database (Denmark)

    Schmidt, Mikkel Nørgaard; Mørup, Morten

    2013-01-01

    an infinite mixture model as running example, we go through the steps of deriving the model as an infinite limit of a finite parametric model, inferring the model parameters by Markov chain Monte Carlo, and checking the model?s fit and predictive performance. We explain how advanced nonparametric models......Modeling structure in complex networks using Bayesian nonparametrics makes it possible to specify flexible model structures and infer the adequate model complexity from the observed data. This article provides a gentle introduction to nonparametric Bayesian modeling of complex networks: Using...

  20. Data mining methods in the prediction of Dementia: A real-data comparison of the accuracy, sensitivity and specificity of linear discriminant analysis, logistic regression, neural networks, support vector machines, classification trees and random forests

    Directory of Open Access Journals (Sweden)

    Santana Isabel

    2011-08-01

    Full Text Available Abstract Background Dementia and cognitive impairment associated with aging are a major medical and social concern. Neuropsychological testing is a key element in the diagnostic procedures of Mild Cognitive Impairment (MCI, but has presently a limited value in the prediction of progression to dementia. We advance the hypothesis that newer statistical classification methods derived from data mining and machine learning methods like Neural Networks, Support Vector Machines and Random Forests can improve accuracy, sensitivity and specificity of predictions obtained from neuropsychological testing. Seven non parametric classifiers derived from data mining methods (Multilayer Perceptrons Neural Networks, Radial Basis Function Neural Networks, Support Vector Machines, CART, CHAID and QUEST Classification Trees and Random Forests were compared to three traditional classifiers (Linear Discriminant Analysis, Quadratic Discriminant Analysis and Logistic Regression in terms of overall classification accuracy, specificity, sensitivity, Area under the ROC curve and Press'Q. Model predictors were 10 neuropsychological tests currently used in the diagnosis of dementia. Statistical distributions of classification parameters obtained from a 5-fold cross-validation were compared using the Friedman's nonparametric test. Results Press' Q test showed that all classifiers performed better than chance alone (p Conclusions When taking into account sensitivity, specificity and overall classification accuracy Random Forests and Linear Discriminant analysis rank first among all the classifiers tested in prediction of dementia using several neuropsychological tests. These methods may be used to improve accuracy, sensitivity and specificity of Dementia predictions from neuropsychological testing.

  1. Nonparametric methods in actigraphy: An update

    Directory of Open Access Journals (Sweden)

    Bruno S.B. Gonçalves

    2014-09-01

    Full Text Available Circadian rhythmicity in humans has been well studied using actigraphy, a method of measuring gross motor movement. As actigraphic technology continues to evolve, it is important for data analysis to keep pace with new variables and features. Our objective is to study the behavior of two variables, interdaily stability and intradaily variability, to describe rest activity rhythm. Simulated data and actigraphy data of humans, rats, and marmosets were used in this study. We modified the method of calculation for IV and IS by modifying the time intervals of analysis. For each variable, we calculated the average value (IVm and ISm results for each time interval. Simulated data showed that (1 synchronization analysis depends on sample size, and (2 fragmentation is independent of the amplitude of the generated noise. We were able to obtain a significant difference in the fragmentation patterns of stroke patients using an IVm variable, while the variable IV60 was not identified. Rhythmic synchronization of activity and rest was significantly higher in young than adults with Parkinson׳s when using the ISM variable; however, this difference was not seen using IS60. We propose an updated format to calculate rhythmic fragmentation, including two additional optional variables. These alternative methods of nonparametric analysis aim to more precisely detect sleep–wake cycle fragmentation and synchronization.

  2. Discrimination of whisky brands and counterfeit identification by UV-Vis spectroscopy and multivariate data analysis.

    Science.gov (United States)

    Martins, Angélica Rocha; Talhavini, Márcio; Vieira, Maurício Leite; Zacca, Jorge Jardim; Braga, Jez Willian Batista

    2017-08-15

    The discrimination of whisky brands and counterfeit identification were performed by UV-Vis spectroscopy combined with partial least squares for discriminant analysis (PLS-DA). In the proposed method all spectra were obtained with no sample preparation. The discrimination models were built with the employment of seven whisky brands: Red Label, Black Label, White Horse, Chivas Regal (12years), Ballantine's Finest, Old Parr and Natu Nobilis. The method was validated with an independent test set of authentic samples belonging to the seven selected brands and another eleven brands not included in the training samples. Furthermore, seventy-three counterfeit samples were also used to validate the method. Results showed correct classification rates for genuine and false samples over 98.6% and 93.1%, respectively, indicating that the method can be helpful for the forensic analysis of whisky samples. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Identification of roselle varieties through simple discriminating physicochemical characteristics using multivariate analysis

    Directory of Open Access Journals (Sweden)

    Alé KANE

    2018-01-01

    Full Text Available Abstract The objective of this work is to study the feasibility of a more objective and rigorous classification of the calices of Hibiscus sabdariffa based on their physicochemical profile. To do so, 19 analyses were carried out on 4 varieties of calices cultivated in Senegal: Vimto, Koor, Thaï and CLT92. Principal component analysis results showed that 15 physicochemical and biochemical parameters could be potentially used to discriminate the varieties of calices. Polyphenolic and anthocyanin contents were anti-correlated to protein content and could be used to differentiate the Vimto/CLT92 and the Koor/Thaï varieties. Within these two clusters, pH and lipid content could discriminate each variety. Finally, factorial discriminant analysis showed that total anthocyanin content, lipid content and chromaticity C* were the 3 parameters enabling the most efficient classification of calices according to variety and led to 100% classification accuracy.

  4. Sub-pattern based multi-manifold discriminant analysis for face recognition

    Science.gov (United States)

    Dai, Jiangyan; Guo, Changlu; Zhou, Wei; Shi, Yanjiao; Cong, Lin; Yi, Yugen

    2018-04-01

    In this paper, we present a Sub-pattern based Multi-manifold Discriminant Analysis (SpMMDA) algorithm for face recognition. Unlike existing Multi-manifold Discriminant Analysis (MMDA) approach which is based on holistic information of face image for recognition, SpMMDA operates on sub-images partitioned from the original face image and then extracts the discriminative local feature from the sub-images separately. Moreover, the structure information of different sub-images from the same face image is considered in the proposed method with the aim of further improve the recognition performance. Extensive experiments on three standard face databases (Extended YaleB, CMU PIE and AR) demonstrate that the proposed method is effective and outperforms some other sub-pattern based face recognition methods.

  5. Feature extraction with deep neural networks by a generalized discriminant analysis.

    Science.gov (United States)

    Stuhlsatz, André; Lippel, Jens; Zielke, Thomas

    2012-04-01

    We present an approach to feature extraction that is a generalization of the classical linear discriminant analysis (LDA) on the basis of deep neural networks (DNNs). As for LDA, discriminative features generated from independent Gaussian class conditionals are assumed. This modeling has the advantages that the intrinsic dimensionality of the feature space is bounded by the number of classes and that the optimal discriminant function is linear. Unfortunately, linear transformations are insufficient to extract optimal discriminative features from arbitrarily distributed raw measurements. The generalized discriminant analysis (GerDA) proposed in this paper uses nonlinear transformations that are learnt by DNNs in a semisupervised fashion. We show that the feature extraction based on our approach displays excellent performance on real-world recognition and detection tasks, such as handwritten digit recognition and face detection. In a series of experiments, we evaluate GerDA features with respect to dimensionality reduction, visualization, classification, and detection. Moreover, we show that GerDA DNNs can preprocess truly high-dimensional input data to low-dimensional representations that facilitate accurate predictions even if simple linear predictors or measures of similarity are used.

  6. Combined approach based on principal component analysis and canonical discriminant analysis for investigating hyperspectral plant response

    Directory of Open Access Journals (Sweden)

    Anna Maria Stellacci

    2012-07-01

    Full Text Available Hyperspectral (HS data represents an extremely powerful means for rapidly detecting crop stress and then aiding in the rational management of natural resources in agriculture. However, large volume of data poses a challenge for data processing and extracting crucial information. Multivariate statistical techniques can play a key role in the analysis of HS data, as they may allow to both eliminate redundant information and identify synthetic indices which maximize differences among levels of stress. In this paper we propose an integrated approach, based on the combined use of Principal Component Analysis (PCA and Canonical Discriminant Analysis (CDA, to investigate HS plant response and discriminate plant status. The approach was preliminary evaluated on a data set collected on durum wheat plants grown under different nitrogen (N stress levels. Hyperspectral measurements were performed at anthesis through a high resolution field spectroradiometer, ASD FieldSpec HandHeld, covering the 325-1075 nm region. Reflectance data were first restricted to the interval 510-1000 nm and then divided into five bands of the electromagnetic spectrum [green: 510-580 nm; yellow: 581-630 nm; red: 631-690 nm; red-edge: 705-770 nm; near-infrared (NIR: 771-1000 nm]. PCA was applied to each spectral interval. CDA was performed on the extracted components to identify the factors maximizing the differences among plants fertilised with increasing N rates. Within the intervals of green, yellow and red only the first principal component (PC had an eigenvalue greater than 1 and explained more than 95% of total variance; within the ranges of red-edge and NIR, the first two PCs had an eigenvalue higher than 1. Two canonical variables explained cumulatively more than 81% of total variance and the first was able to discriminate wheat plants differently fertilised, as confirmed also by the significant correlation with aboveground biomass and grain yield parameters. The combined

  7. Chemometric analysis for discrimination of extra virgin olive oils from whole and stoned olive pastes.

    Science.gov (United States)

    De Luca, Michele; Restuccia, Donatella; Clodoveo, Maria Lisa; Puoci, Francesco; Ragno, Gaetano

    2016-07-01

    Chemometric discrimination of extra virgin olive oils (EVOO) from whole and stoned olive pastes was carried out by using Fourier transform infrared (FTIR) data and partial least squares-discriminant analysis (PLS1-DA) approach. Four Italian commercial EVOO brands, all in both whole and stoned version, were considered in this study. The adopted chemometric methodologies were able to describe the different chemical features in phenolic and volatile compounds contained in the two types of oil by using unspecific IR spectral information. Principal component analysis (PCA) was employed in cluster analysis to capture data patterns and to highlight differences between technological processes and EVOO brands. The PLS1-DA algorithm was used as supervised discriminant analysis to identify the different oil extraction procedures. Discriminant analysis was extended to the evaluation of possible adulteration by addition of aliquots of oil from whole paste to the most valuable oil from stoned olives. The statistical parameters from external validation of all the PLS models were very satisfactory, with low root mean square error of prediction (RMSEP) and relative error (RE%). Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. Nonparametric functional mapping of quantitative trait loci.

    Science.gov (United States)

    Yang, Jie; Wu, Rongling; Casella, George

    2009-03-01

    Functional mapping is a useful tool for mapping quantitative trait loci (QTL) that control dynamic traits. It incorporates mathematical aspects of biological processes into the mixture model-based likelihood setting for QTL mapping, thus increasing the power of QTL detection and the precision of parameter estimation. However, in many situations there is no obvious functional form and, in such cases, this strategy will not be optimal. Here we propose to use nonparametric function estimation, typically implemented with B-splines, to estimate the underlying functional form of phenotypic trajectories, and then construct a nonparametric test to find evidence of existing QTL. Using the representation of a nonparametric regression as a mixed model, the final test statistic is a likelihood ratio test. We consider two types of genetic maps: dense maps and general maps, and the power of nonparametric functional mapping is investigated through simulation studies and demonstrated by examples.

  9. Essays on nonparametric econometrics of stochastic volatility

    NARCIS (Netherlands)

    Zu, Y.

    2012-01-01

    Volatility is a concept that describes the variation of financial returns. Measuring and modelling volatility dynamics is an important aspect of financial econometrics. This thesis is concerned with nonparametric approaches to volatility measurement and volatility model validation.

  10. Nonparametric methods for volatility density estimation

    NARCIS (Netherlands)

    Es, van Bert; Spreij, P.J.C.; Zanten, van J.H.

    2009-01-01

    Stochastic volatility modelling of financial processes has become increasingly popular. The proposed models usually contain a stationary volatility process. We will motivate and review several nonparametric methods for estimation of the density of the volatility process. Both models based on

  11. linear discriminant analysis of structure within african eggplant 'shum'

    African Journals Online (AJOL)

    ACSS

    observed clusters include petiole length, sepal length (or seed color), fruit calyx length, seeds per fruit, leaf fresh .... obtain means. A table of means per trait for each accession was then imported into R statistical software for UPGMA reordered hierarchical cluster analysis. ..... Mwale, S.E., Ssemakula, M.O., Sadik, K.,.

  12. Use of linear discriminant function analysis in seed morphotype ...

    African Journals Online (AJOL)

    Variation in seed morphology of the Lima bean in 31 accessions was studied. Data were collected on 100-seed weight, seed length and seed width. The differences among the accessions were significant, based on the three seed characteristics. K-means cluster analysis grouped the 31 accessions into four distinct groups, ...

  13. Use of Linear Discriminant Function Analysis in Five Yield Sub ...

    African Journals Online (AJOL)

    K-means cluster analysis grouped the 134 accessions into four distinct groups. Pairwise Mahalanobis 2 distance (D) among some of the groups was highly significant. From the study the yield sub-characters pod length, pod width, peduncle length and 100-seed weight contributed most to group separation in the cowpea ...

  14. Harassment and discrimination in medical training: a systematic review and meta-analysis.

    Science.gov (United States)

    Fnais, Naif; Soobiah, Charlene; Chen, Maggie Hong; Lillie, Erin; Perrier, Laure; Tashkhandi, Mariam; Straus, Sharon E; Mamdani, Muhammad; Al-Omran, Mohammed; Tricco, Andrea C

    2014-05-01

    Harassment and discrimination include a wide range of behaviors that medical trainees perceive as being humiliating, hostile, or abusive. To understand the significance of such mistreatment and to explore potential preventive strategies, the authors conducted a systematic review and meta-analysis to examine the prevalence, risk factors, and sources of harassment and discrimination among medical trainees. In 2011, the authors identified relevant studies by searching MEDLINE and EMBASE, scanning reference lists of relevant studies, and contacting experts. They included studies that reported the prevalence, risk factors, and sources of harassment and discrimination among medical trainees. Two reviewers independently screened all articles and abstracted study and participant characteristics and study results. The authors assessed the methodological quality in individual studies using the Newcastle-Ottawa Scale. They also conducted a meta-analysis. The authors included 57 cross-sectional and 2 cohort studies in their review. The meta-analysis of 51 studies demonstrated that 59.4% of medical trainees had experienced at least one form of harassment or discrimination during their training (95% confidence interval [CI]: 52.0%-66.7%). Verbal harassment was the most commonly cited form of harassment (prevalence: 63.0%; 95% CI: 54.8%-71.2%). Consultants were the most commonly cited source of harassment and discrimination, followed by patients or patients' families (34.4% and 21.9%, respectively). This review demonstrates the surprisingly high prevalence of harassment and discrimination among medical trainees that has not declined over time. The authors recommend both drafting policies and promoting cultural change within academic institutions to prevent future abuse.

  15. Use of discriminant analysis to determine black shales of the Lesser Carpathian crystal field

    Energy Technology Data Exchange (ETDEWEB)

    Khun, M.

    1980-01-01

    Discriminant analysis of results from geochemical testing was used to separate black shales of the ore level from the nonproductive deposits. Based on a large number of experiments, the accuracy of isolating the black shales according to content of vandium, copper and nickel reached 78%. These elements have basic importance for separation of productive shales from nonproductive.

  16. Comparison of cranial sex determination by discriminant analysis and logistic regression.

    Science.gov (United States)

    Amores-Ampuero, Anabel; Alemán, Inmaculada

    2016-04-05

    Various methods have been proposed for estimating dimorphism. The objective of this study was to compare sex determination results from cranial measurements using discriminant analysis or logistic regression. The study sample comprised 130 individuals (70 males) of known sex, age, and cause of death from San José cemetery in Granada (Spain). Measurements of 19 neurocranial dimensions and 11 splanchnocranial dimensions were subjected to discriminant analysis and logistic regression, and the percentages of correct classification were compared between the sex functions obtained with each method. The discriminant capacity of the selected variables was evaluated with a cross-validation procedure. The percentage accuracy with discriminant analysis was 78.2% for the neurocranium (82.4% in females and 74.6% in males) and 73.7% for the splanchnocranium (79.6% in females and 68.8% in males). These percentages were higher with logistic regression analysis: 85.7% for the neurocranium (in both sexes) and 94.1% for the splanchnocranium (100% in females and 91.7% in males).

  17. Discrimination of bromodeoxyuridine labelled and unlabelled mitotic cells in flow cytometric bromodeoxyuridine/DNA analysis

    DEFF Research Database (Denmark)

    Jensen, P O; Larsen, J K; Christensen, I J

    1994-01-01

    Bromodeoxyuridine (BrdUrd) labelled and unlabelled mitotic cells, respectively, can be discriminated from interphase cells using a new method, based on immunocytochemical staining of BrdUrd and flow cytometric four-parameter analysis of DNA content, BrdUrd incorporation, and forward and orthogona...

  18. Development and Validation of Discriminant Analysis Models for Student Loan Defaultees and Non-Defaultees.

    Science.gov (United States)

    Myers, Greeley; Siera, Steven

    1980-01-01

    Default on guaranteed student loans has been increasing. The use of discriminant analysis as a technique to identify "good" v "bad" student loans based on information available from the loan application is discussed. Research to test the ability of models to such predictions is reported. (Author/MLW)

  19. An Application of Discriminant Analysis to Pattern Recognition of Selected Contaminated Soil Features in Thin Sections

    DEFF Research Database (Denmark)

    Ribeiro, Alexandra B.; Nielsen, Allan Aasbjerg

    1997-01-01

    qualitative microprobe results: present elements Al, Si, Cr, Fe, As (associated with others). Selected groups of calibrated images (same light conditions and magnification) submitted to discriminant analysis, in order to find a pattern of recognition in the soil features corresponding to contamination already...

  20. Prediction Model of Collapse Risk Based on Information Entropy and Distance Discriminant Analysis Method

    Directory of Open Access Journals (Sweden)

    Hujun He

    2017-01-01

    Full Text Available The prediction and risk classification of collapse is an important issue in the process of highway construction in mountainous regions. Based on the principles of information entropy and Mahalanobis distance discriminant analysis, we have produced a collapse hazard prediction model. We used the entropy measure method to reduce the influence indexes of the collapse activity and extracted the nine main indexes affecting collapse activity as the discriminant factors of the distance discriminant analysis model (i.e., slope shape, aspect, gradient, and height, along with exposure of the structural face, stratum lithology, relationship between weakness face and free face, vegetation cover rate, and degree of rock weathering. We employ postearthquake collapse data in relation to construction of the Yingxiu-Wolong highway, Hanchuan County, China, as training samples for analysis. The results were analyzed using the back substitution estimation method, showing high accuracy and no errors, and were the same as the prediction result of uncertainty measure. Results show that the classification model based on information entropy and distance discriminant analysis achieves the purpose of index optimization and has excellent performance, high prediction accuracy, and a zero false-positive rate. The model can be used as a tool for future evaluation of collapse risk.

  1. A Comparative Analysis of the Evolution of Gender Wage Discrimination: Spain Versus Galicia

    OpenAIRE

    Pena-Boquete, Yolanda

    2006-01-01

    The aim of this paper is to analyze the degree of female wage discrimination in the Spanish region of Galicia relative to the rest of Spain. The analysis starts from an established fact: women's average earnings are lower than men's. First, we try to show the causes behind this wage differential. Next, we discuss the evolution of the wage gap between 1995 and 2002, in order to bring some light on the factors potentially accounting for wage discrimination persistence in Galicia and Spain. We w...

  2. A Comparative Analysis of the Evolution of Gender Wage Discrimination: Spain Versus Galicia.

    OpenAIRE

    Yolanda Pena-Boquete

    2006-01-01

    The aim of this paper is to analyze the degree of female wage discrimination in the Spanish region of Galicia relative to the rest of Spain. The analysis starts from an established fact: women’s average earnings are lower than men’s. First, we try to show the causes behind this wage differential. Next, we discuss the evolution of the wage gap between 1995 and 2002, in order to bring some light on the factors potentially accounting for wage discrimination persistence in Galicia and Spain. We w...

  3. Baseline drift effect on the performance of neutron and γ ray discrimination using frequency gradient analysis

    International Nuclear Information System (INIS)

    Liu Guofu; Luo Xiaoliang; Yang Jun; Lin Cunbao; Hu Qingqing; Peng Jinxian

    2013-01-01

    Frequency gradient analysis (FGA) effectively discriminates neutrons and γ rays by examining the frequency-domain features of the photomultiplier tube anode signal. This approach is insensitive to noise but is inevitably affected by the baseline drift similar to other pulse shape discrimination methods. The baseline drift effect is attributed to factors such as power line fluctuation, dark current, noise disturbances, hum, and pulse tail in front-end electronics. This effect needs to be elucidated and quantified before the baseline shift can be estimated and removed from the captured signal. Therefore, the effect of baseline shift on the discrimination performance of neutrons and γ rays with organic scintillation detectors using FGA is investigated in this paper. The relationship between the baseline shift and discrimination parameters of FGA is derived and verified by an experimental system consisting of an americium—beryllium source, a BC501A liquid scintillator detector, and a 5 GSample/s 8-bit oscilloscope. The theoretical and experimental results both show that the estimation of the baseline shift is necessary, and the removal of baseline drift from the pulse shapes can improve the discrimination performance of FGA. (authors)

  4. Discrimination of Temperature and Strain in Brillouin Optical Time Domain Analysis Using a Multicore Optical Fiber.

    Science.gov (United States)

    Zaghloul, Mohamed A S; Wang, Mohan; Milione, Giovanni; Li, Ming-Jun; Li, Shenping; Huang, Yue-Kai; Wang, Ting; Chen, Kevin P

    2018-04-12

    Brillouin optical time domain analysis is the sensing of temperature and strain changes along an optical fiber by measuring the frequency shift changes of Brillouin backscattering. Because frequency shift changes are a linear combination of temperature and strain changes, their discrimination is a challenge. Here, a multicore optical fiber that has two cores is fabricated. The differences between the cores' temperature and strain coefficients are such that temperature (strain) changes can be discriminated with error amplification factors of 4.57 °C/MHz (69.11 μ ϵ /MHz), which is 2.63 (3.67) times lower than previously demonstrated. As proof of principle, using the multicore optical fiber and a commercial Brillouin optical time domain analyzer, the temperature (strain) changes of a thermally expanding metal cylinder are discriminated with an error of 0.24% (3.7%).

  5. Discrimination of Temperature and Strain in Brillouin Optical Time Domain Analysis Using a Multicore Optical Fiber

    Directory of Open Access Journals (Sweden)

    Mohamed A. S. Zaghloul

    2018-04-01

    Full Text Available Brillouin optical time domain analysis is the sensing of temperature and strain changes along an optical fiber by measuring the frequency shift changes of Brillouin backscattering. Because frequency shift changes are a linear combination of temperature and strain changes, their discrimination is a challenge. Here, a multicore optical fiber that has two cores is fabricated. The differences between the cores’ temperature and strain coefficients are such that temperature (strain changes can be discriminated with error amplification factors of 4.57 °C/MHz (69.11 μ ϵ /MHz, which is 2.63 (3.67 times lower than previously demonstrated. As proof of principle, using the multicore optical fiber and a commercial Brillouin optical time domain analyzer, the temperature (strain changes of a thermally expanding metal cylinder are discriminated with an error of 0.24% (3.7%.

  6. Applicability of supervised discriminant analysis models to analyze astigmatism clinical trial data.

    Science.gov (United States)

    Sedghipour, Mohammad Reza; Sadeghi-Bazargani, Homayoun

    2012-01-01

    In astigmatism clinical trials where more complex measurements are common, especially in nonrandomized small sized clinical trials, there is a demand for the development and application of newer statistical methods. The source data belonged to a project on astigmatism treatment. Data were used regarding a total of 296 eyes undergoing different astigmatism treatment modalities: wavefront-guided photorefractive keratectomy, cross-cylinder photorefractive keratectomy, and monotoric (single) photorefractive keratectomy. Astigmatism analysis was primarily done using the Alpins method. Prior to fitting partial least squares regression discriminant analysis, a preliminary principal component analysis was done for data overview. Through fitting the partial least squares regression discriminant analysis statistical method, various model validity and predictability measures were assessed. The model found the patients treated by the wavefront method to be different from the two other treatments both in baseline and outcome measures. Also, the model found that patients treated with the cross-cylinder method versus the single method didn't appear to be different from each other. This analysis provided an opportunity to compare the three methods while including a substantial number of baseline and outcome variables. Partial least squares regression discriminant analysis had applicability for the statistical analysis of astigmatism clinical trials and it may be used as an adjunct or alternative analysis method in small sized clinical trials.

  7. Classification of Fusarium-Infected Korean Hulled Barley Using Near-Infrared Reflectance Spectroscopy and Partial Least Squares Discriminant Analysis

    Directory of Open Access Journals (Sweden)

    Jongguk Lim

    2017-09-01

    Full Text Available The purpose of this study is to use near-infrared reflectance (NIR spectroscopy equipment to nondestructively and rapidly discriminate Fusarium-infected hulled barley. Both normal hulled barley and Fusarium-infected hulled barley were scanned by using a NIR spectrometer with a wavelength range of 1175 to 2170 nm. Multiple mathematical pretreatments were applied to the reflectance spectra obtained for Fusarium discrimination and the multivariate analysis method of partial least squares discriminant analysis (PLS-DA was used for discriminant prediction. The PLS-DA prediction model developed by applying the second-order derivative pretreatment to the reflectance spectra obtained from the side of hulled barley without crease achieved 100% accuracy in discriminating the normal hulled barley and the Fusarium-infected hulled barley. These results demonstrated the feasibility of rapid discrimination of the Fusarium-infected hulled barley by combining multivariate analysis with the NIR spectroscopic technique, which is utilized as a nondestructive detection method.

  8. Single versus mixture Weibull distributions for nonparametric satellite reliability

    International Nuclear Information System (INIS)

    Castet, Jean-Francois; Saleh, Joseph H.

    2010-01-01

    Long recognized as a critical design attribute for space systems, satellite reliability has not yet received the proper attention as limited on-orbit failure data and statistical analyses can be found in the technical literature. To fill this gap, we recently conducted a nonparametric analysis of satellite reliability for 1584 Earth-orbiting satellites launched between January 1990 and October 2008. In this paper, we provide an advanced parametric fit, based on mixture of Weibull distributions, and compare it with the single Weibull distribution model obtained with the Maximum Likelihood Estimation (MLE) method. We demonstrate that both parametric fits are good approximations of the nonparametric satellite reliability, but that the mixture Weibull distribution provides significant accuracy in capturing all the failure trends in the failure data, as evidenced by the analysis of the residuals and their quasi-normal dispersion.

  9. Applicability of supervised discriminant analysis models to analyze astigmatism clinical trial data

    Directory of Open Access Journals (Sweden)

    Sedghipour MR

    2012-09-01

    Full Text Available Mohammad Reza Sedghipour,1 Homayoun Sadeghi-Bazargani2,31Nikoukari Ophthalmology University Hospital, Tabriz, Iran; 2Department of Statistics and Epidemiology, Neuroscience Research Center, Tabriz University of Medical Sciences, Tabriz, Iran; 3Department of Public Health Sciences, Karolinska Institute, Stockholm, SwedenBackground: In astigmatism clinical trials where more complex measurements are common, especially in nonrandomized small sized clinical trials, there is a demand for the development and application of newer statistical methods.Methods: The source data belonged to a project on astigmatism treatment. Data were used regarding a total of 296 eyes undergoing different astigmatism treatment modalities: wavefront-guided photorefractive keratectomy, cross-cylinder photorefractive keratectomy, and monotoric (single photorefractive keratectomy. Astigmatism analysis was primarily done using the Alpins method. Prior to fitting partial least squares regression discriminant analysis, a preliminary principal component analysis was done for data overview. Through fitting the partial least squares regression discriminant analysis statistical method, various model validity and predictability measures were assessed.Results: The model found the patients treated by the wavefront method to be different from the two other treatments both in baseline and outcome measures. Also, the model found that patients treated with the cross-cylinder method versus the single method didn't appear to be different from each other. This analysis provided an opportunity to compare the three methods while including a substantial number of baseline and outcome variables.Conclusion: Partial least squares regression discriminant analysis had applicability for the statistical analysis of astigmatism clinical trials and it may be used as an adjunct or alternative analysis method in small sized clinical trials.Keywords: astigmatism, regression, partial least squares regression

  10. Discrimination of honeys using colorimetric sensor arrays, sensory analysis and gas chromatography techniques.

    Science.gov (United States)

    Tahir, Haroon Elrasheid; Xiaobo, Zou; Xiaowei, Huang; Jiyong, Shi; Mariod, Abdalbasit Adam

    2016-09-01

    Aroma profiles of six honey varieties of different botanical origins were investigated using colorimetric sensor array, gas chromatography-mass spectrometry (GC-MS) and descriptive sensory analysis. Fifty-eight aroma compounds were identified, including 2 norisoprenoids, 5 hydrocarbons, 4 terpenes, 6 phenols, 7 ketones, 9 acids, 12 aldehydes and 13 alcohols. Twenty abundant or active compounds were chosen as key compounds to characterize honey aroma. Discrimination of the honeys was subsequently implemented using multivariate analysis, including hierarchical clustering analysis (HCA) and principal component analysis (PCA). Honeys of the same botanical origin were grouped together in the PCA score plot and HCA dendrogram. SPME-GC/MS and colorimetric sensor array were able to discriminate the honeys effectively with the advantages of being rapid, simple and low-cost. Moreover, partial least squares regression (PLSR) was applied to indicate the relationship between sensory descriptors and aroma compounds. Copyright © 2016 Elsevier Ltd. All rights reserved.

  11. Variations in students' perceived reasons for, sources of, and forms of in-school discrimination: A latent class analysis.

    Science.gov (United States)

    Byrd, Christy M; Carter Andrews, Dorinda J

    2016-08-01

    Although there exists a healthy body of literature related to discrimination in schools, this research has primarily focused on racial or ethnic discrimination as perceived and experienced by students of color. Few studies examine students' perceptions of discrimination from a variety of sources, such as adults and peers, their descriptions of the discrimination, or the frequency of discrimination in the learning environment. Middle and high school students in a Midwestern school district (N=1468) completed surveys identifying whether they experienced discrimination from seven sources (e.g., peers, teachers, administrators), for seven reasons (e.g., gender, race/ethnicity, religion), and in eight forms (e.g., punished more frequently, called names, excluded from social groups). The sample was 52% White, 15% Black/African American, 14% Multiracial, and 17% Other. Latent class analysis was used to cluster individuals based on reported sources of, reasons for, and forms of discrimination. Four clusters were found, and ANOVAs were used to test for differences between clusters on perceptions of school climate, relationships with teachers, perceptions that the school was a "good school," and engagement. The Low Discrimination cluster experienced the best outcomes, whereas an intersectional cluster experienced the most discrimination and the worst outcomes. The results confirm existing research on the negative effects of discrimination. Additionally, the paper adds to the literature by highlighting the importance of an intersectional approach to examining students' perceptions of in-school discrimination. Copyright © 2016 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.

  12. Predicting Insolvency : A comparison between discriminant analysis and logistic regression using principal components

    OpenAIRE

    Geroukis, Asterios; Brorson, Erik

    2014-01-01

    In this study, we compare the two statistical techniques logistic regression and discriminant analysis to see how well they classify companies based on clusters – made from the solvency ratio ­– using principal components as independent variables. The principal components are made with different financial ratios. We use cluster analysis to find groups with low, medium and high solvency ratio of 1200 different companies found on the NASDAQ stock market and use this as an apriori definition of ...

  13. Discriminative Nonlinear Analysis Operator Learning: When Cosparse Model Meets Image Classification.

    Science.gov (United States)

    Wen, Zaidao; Hou, Biao; Jiao, Licheng

    2017-05-03

    Linear synthesis model based dictionary learning framework has achieved remarkable performances in image classification in the last decade. Behaved as a generative feature model, it however suffers from some intrinsic deficiencies. In this paper, we propose a novel parametric nonlinear analysis cosparse model (NACM) with which a unique feature vector will be much more efficiently extracted. Additionally, we derive a deep insight to demonstrate that NACM is capable of simultaneously learning the task adapted feature transformation and regularization to encode our preferences, domain prior knowledge and task oriented supervised information into the features. The proposed NACM is devoted to the classification task as a discriminative feature model and yield a novel discriminative nonlinear analysis operator learning framework (DNAOL). The theoretical analysis and experimental performances clearly demonstrate that DNAOL will not only achieve the better or at least competitive classification accuracies than the state-of-the-art algorithms but it can also dramatically reduce the time complexities in both training and testing phases.

  14. Discriminant analysis method to determine the power of the boys 11-12 year

    Directory of Open Access Journals (Sweden)

    Mirosława Cieślicka

    2016-10-01

    Full Text Available Purpose: To determine the model of power in boys 11-12 years old. Material and methods: To achieve the objectives, the following methods: analysis of scientific literature, statistical methods for analysis of results. The study involved 35 boys 11 year (n = 35 and 32 boys 12 year (n = 32. Results: Analysis of the results shows that the statistical significance of differences in the test results of boys 11 and 12 years there has been research jump from the place of execution and the amount of squats (the amount of execution time (p <0.001, p <0. Conclusions: Structural factors discriminant function suggest that more attention is paid to training of speed and endurance, the more likely to increase the force to prepare the boys. The canonical discriminant function can  be used to assess and forecast the development of motor skills in boys.

  15. Analysis of pulse-shape discrimination techniques for BC501A using GHz digital signal processing

    International Nuclear Information System (INIS)

    Rooney, B.D.; Dinwiddie, D.R.; Nelson, M.A.; Rawool-Sullivan, Mohini W.

    2001-01-01

    A comparison study of pulse-shape analysis techniques was conducted for a BC501A scintillator using digital signal processing (DSP). In this study, output signals from a preamplifier were input directly into a 1 GHz analog-to-digital converter. The digitized data obtained with this method was post-processed for both pulse-height and pulse-shape information. Several different analysis techniques were evaluated for neutron and gamma-ray pulse-shape discrimination. It was surprising that one of the simplest and fastest techniques resulted in some of the best pulse-shape discrimination results. This technique, referred to here as the Integral Ratio technique, was able to effectively process several thousand detector pulses per second. This paper presents the results and findings of this study for various pulse-shape analysis techniques with digitized detector signals.

  16. Liquid contrabands classification based on energy dispersive X-ray diffraction and hybrid discriminant analysis

    International Nuclear Information System (INIS)

    YangDai, Tianyi; Zhang, Li

    2016-01-01

    Energy dispersive X-ray diffraction (EDXRD) combined with hybrid discriminant analysis (HDA) has been utilized for classifying the liquid materials for the first time. The XRD spectra of 37 kinds of liquid contrabands and daily supplies were obtained using an EDXRD test bed facility. The unique spectra of different samples reveal XRD's capability to distinguish liquid contrabands from daily supplies. In order to create a system to detect liquid contrabands, the diffraction spectra were subjected to HDA which is the combination of principal components analysis (PCA) and linear discriminant analysis (LDA). Experiments based on the leave-one-out method demonstrate that HDA is a practical method with higher classification accuracy and lower noise sensitivity than the other methods in this application. The study shows the great capability and potential of the combination of XRD and HDA for liquid contrabands classification.

  17. Liquid contrabands classification based on energy dispersive X-ray diffraction and hybrid discriminant analysis

    Energy Technology Data Exchange (ETDEWEB)

    YangDai, Tianyi [Department of Engineering Physics, Tsinghua University, Beijing 100084 (China); Key Laboratory of Particle & Radiation Imaging (Tsinghua University), Ministry of Education (China); Zhang, Li, E-mail: zhangli@nuctech.com [Department of Engineering Physics, Tsinghua University, Beijing 100084 (China); Key Laboratory of Particle & Radiation Imaging (Tsinghua University), Ministry of Education (China)

    2016-02-01

    Energy dispersive X-ray diffraction (EDXRD) combined with hybrid discriminant analysis (HDA) has been utilized for classifying the liquid materials for the first time. The XRD spectra of 37 kinds of liquid contrabands and daily supplies were obtained using an EDXRD test bed facility. The unique spectra of different samples reveal XRD's capability to distinguish liquid contrabands from daily supplies. In order to create a system to detect liquid contrabands, the diffraction spectra were subjected to HDA which is the combination of principal components analysis (PCA) and linear discriminant analysis (LDA). Experiments based on the leave-one-out method demonstrate that HDA is a practical method with higher classification accuracy and lower noise sensitivity than the other methods in this application. The study shows the great capability and potential of the combination of XRD and HDA for liquid contrabands classification.

  18. Liquid contrabands classification based on energy dispersive X-ray diffraction and hybrid discriminant analysis

    Science.gov (United States)

    YangDai, Tianyi; Zhang, Li

    2016-02-01

    Energy dispersive X-ray diffraction (EDXRD) combined with hybrid discriminant analysis (HDA) has been utilized for classifying the liquid materials for the first time. The XRD spectra of 37 kinds of liquid contrabands and daily supplies were obtained using an EDXRD test bed facility. The unique spectra of different samples reveal XRD's capability to distinguish liquid contrabands from daily supplies. In order to create a system to detect liquid contrabands, the diffraction spectra were subjected to HDA which is the combination of principal components analysis (PCA) and linear discriminant analysis (LDA). Experiments based on the leave-one-out method demonstrate that HDA is a practical method with higher classification accuracy and lower noise sensitivity than the other methods in this application. The study shows the great capability and potential of the combination of XRD and HDA for liquid contrabands classification.

  19. Financial consumer protection and customer satisfaction. A relationship study by using factor analysis and discriminant analysis

    Directory of Open Access Journals (Sweden)

    Marimuthu SELVAKUMAR

    2015-11-01

    Full Text Available This paper tries to make an attempt to study the relationship between the financial consumer protection and customer satisfaction by using factor analysis and discriminant analysis. The main objectives of the study are to analyze the financial consumer protection in commercial banks, to examine the customer satisfaction of commercial banks and to identify the factors of financial consumer protection lead customer satisfaction. There are many research work carried out on financial consumer protection in financial literacy, but the identification of factors which lead the financial consumer protection and the relationship between financial consumer protection and the customer satisfaction is very important, Particularly for banks to improve its quality and increase the customer satisfaction. Therefore this study is carried out with the aim of identifying the factors of financial consumer protection and its influence on customer satisfaction. This study is both descriptive and analytical in nature. It covers both primary and secondary data. The primary data has been collected from the customers of commercial banks using pre-tested interview schedule and the secondary data has been collected from standard books, journals, magazines, websites and so on.

  20. A novel electroencephalographic analysis method discriminates alcohol effects from those of other sedative/hypnotics.

    Science.gov (United States)

    Steffensen, Scott C; Lee, Rong-Sheng; Henriksen, Steven J; Packer, Thomas L; Cook, Daniel R

    2002-04-15

    Here we describe a mathematical and statistical signal processing strategy termed event resolution imaging (ERI). Our principal objective was to determine if the acute intoxicating effects of ethanol on spontaneous EEG activity could be discriminated from those of other sedative/hypnotics. We employed ERI to combine and integrate standard analysis methods to learn multiple signal features of time-varying EEG signals. We recorded cortical EEG, electromyographic activity, and motor activity during intravenous administration of saline, ethanol (1.0 g/kg), chlordiazepoxide (10 mg/kg), pentobarbital (6 mg/kg), heroin (0.3 mg/kg), and methamphetamine (2 mg/kg) administered on separate days in six rats. A blind treatment of one of the drugs was readministered to validate the efficacy of ERI analysis. Significant changes in spontaneous EEG activity produced by all five drugs were detected by ERI analysis with a time resolution of 5-10 s. ERI analysis of spontaneous EEG activity also discriminated, with 90-95% accuracy, an ataxic dose of ethanol versus equivalent ataxic doses of chlordiazepoxide or pentobarbital, as well as the effects of saline, a reinforcing dose of heroin, or a locomotor activating dose of methamphetamine. ERI correctly matched the 'blind drug' as ethanol. These findings indicate that ERI analysis can detect the central nervous system effects of various psychoactive drugs and accurately discriminate the electrocortical effects of select sedative/hypnotics, with similar behavioral endpoints, but with dissimilar mechanisms of action.

  1. Selecting predictors for discriminant analysis of species performance: an example from an amphibious softwater plant.

    Science.gov (United States)

    Vanderhaeghe, F; Smolders, A J P; Roelofs, J G M; Hoffmann, M

    2012-03-01

    Selecting an appropriate variable subset in linear multivariate methods is an important methodological issue for ecologists. Interest often exists in obtaining general predictive capacity or in finding causal inferences from predictor variables. Because of a lack of solid knowledge on a studied phenomenon, scientists explore predictor variables in order to find the most meaningful (i.e. discriminating) ones. As an example, we modelled the response of the amphibious softwater plant Eleocharis multicaulis using canonical discriminant function analysis. We asked how variables can be selected through comparison of several methods: univariate Pearson chi-square screening, principal components analysis (PCA) and step-wise analysis, as well as combinations of some methods. We expected PCA to perform best. The selected methods were evaluated through fit and stability of the resulting discriminant functions and through correlations between these functions and the predictor variables. The chi-square subset, at P < 0.05, followed by a step-wise sub-selection, gave the best results. In contrast to expectations, PCA performed poorly, as so did step-wise analysis. The different chi-square subset methods all yielded ecologically meaningful variables, while probable noise variables were also selected by PCA and step-wise analysis. We advise against the simple use of PCA or step-wise discriminant analysis to obtain an ecologically meaningful variable subset; the former because it does not take into account the response variable, the latter because noise variables are likely to be selected. We suggest that univariate screening techniques are a worthwhile alternative for variable selection in ecology. © 2011 German Botanical Society and The Royal Botanical Society of the Netherlands.

  2. Statistics that learn: can logistic discriminant analysis improve diagnosis in brain SPECT?

    International Nuclear Information System (INIS)

    Behin-Ain, S.; Barnden, L.; Kwiatek, R.; Del Fante, P.; Casse, R.; Burnet, R.; Chew, G.; Kitchener, M.; Boundy, K.; Unger, S.

    2002-01-01

    Full text: Logistic discriminant analysis (LDA) is a statistical technique capable of discriminating individuals within a diseased group against normals. It also enables classification of various diseases within a group of patients. This technique provides a quantitative, automated and non-subjective clinical diagnostic tool. Based on a population known to have the disease and a normal control group, an algorithm was developed and trained to identify regions in the human brain responsible for the disease in question. The algorithm outputs a statistical map representing diseased or normal probability on a voxel or cluster basis from which an index is generated for each subject. The algorithm also generates a set of coefficients which is used to generate an index for the purpose of classification of new subjects. The results are comparable and complement those of Statistical Parametric Mapping (SPM) which employs a more common linear discriminant technique. The results are presented for brain SPECT studies of two diseases: chronic fatigue syndrome (CFS) and fibromyalgia (FM). A 100% specificity and 94% sensitivity is achieved for the CFS study (similar to SPM results) and for the FM study 82% specificity and 94% sensitivity is achieved with corresponding SPM results showing 90% specificity and 82% sensitivity. The results encourages application of LDA for discrimination of new single subjects as well as of diseased and normal groups. Copyright (2002) The Australian and New Zealand Society of Nuclear Medicine Inc

  3. Optical spectroscopic analysis for the discrimination of extra-virgin olive-oil (Conference Presentation)

    Science.gov (United States)

    McReynolds, Naomi; Auñón Garcia, Juan M.; Guengerich, Zoe; Smith, Terry K.; Dholakia, Kishan

    2017-02-01

    We present an optical spectroscopic technique, making use of both Raman signals and fluorescence spectroscopy, for the identification of five brands of commercially available extra-virgin olive-oil (EVOO). We demonstrate our technique on both a `bulk-optics' free-space system and a compact device. Using the compact device, which is capable of recording both Raman and fluorescence signals, we achieved an average sensitivity and specificity of 98.4% and 99.6% for discrimination, respectively. Our approach demonstrates that both Raman and fluorescence spectroscopy can be used for portable discrimination of EVOOs which obviates the need to use centralised laboratories and opens up the prospect of in-field testing. This technique may enable detection of EVOO that has undergone counterfeiting or adulteration. One of the main challenges facing Raman spectroscopy for use in quality control of EVOOs is that the oxidation of EVOO, which naturally occurs due to aging, causes shifts in Raman spectra with time, which implies regular retraining would be necessary. We present a potential method of analysis to minimize the effect that aging has on discrimination efficiency; we show that by discarding the first principal component, which contains information on the variations due to oxidation, we can improve discrimination efficiency thus improving the robustness of our technique.

  4. Dimensionality Reduction of Hyperspectral Image with Graph-Based Discriminant Analysis Considering Spectral Similarity

    Directory of Open Access Journals (Sweden)

    Fubiao Feng

    2017-03-01

    Full Text Available Recently, graph embedding has drawn great attention for dimensionality reduction in hyperspectral imagery. For example, locality preserving projection (LPP utilizes typical Euclidean distance in a heat kernel to create an affinity matrix and projects the high-dimensional data into a lower-dimensional space. However, the Euclidean distance is not sufficiently correlated with intrinsic spectral variation of a material, which may result in inappropriate graph representation. In this work, a graph-based discriminant analysis with spectral similarity (denoted as GDA-SS measurement is proposed, which fully considers curves changing description among spectral bands. Experimental results based on real hyperspectral images demonstrate that the proposed method is superior to traditional methods, such as supervised LPP, and the state-of-the-art sparse graph-based discriminant analysis (SGDA.

  5. Cross coherence independent component analysis in resting and action states EEG discrimination

    International Nuclear Information System (INIS)

    Almurshedi, A; Ismail, A K

    2014-01-01

    Cross Coherence time frequency transform and independent component analysis (ICA) method were used to analyse the electroencephalogram (EEG) signals in resting and action states during open and close eyes conditions. From the topographical scalp distributions of delta, theta, alpha, and beta power spectrum can clearly discriminate between the signal when the eyes were open or closed, but it was difficult to distinguish between resting and action states when the eyes were closed. In open eyes condition, the frontal area (Fp1, Fp2) was activated (higher power) in delta and theta bands whilst occipital (O1, O2) and partial (P3, P4, Pz) area of brain was activated alpha band in closed eyes condition. The cross coherence method of time frequency analysis is capable of discrimination between rest and action brain signals in closed eyes condition

  6. Z-score linear discriminant analysis for EEG based brain-computer interfaces.

    Directory of Open Access Journals (Sweden)

    Rui Zhang

    Full Text Available Linear discriminant analysis (LDA is one of the most popular classification algorithms for brain-computer interfaces (BCI. LDA assumes Gaussian distribution of the data, with equal covariance matrices for the concerned classes, however, the assumption is not usually held in actual BCI applications, where the heteroscedastic class distributions are usually observed. This paper proposes an enhanced version of LDA, namely z-score linear discriminant analysis (Z-LDA, which introduces a new decision boundary definition strategy to handle with the heteroscedastic class distributions. Z-LDA defines decision boundary through z-score utilizing both mean and standard deviation information of the projected data, which can adaptively adjust the decision boundary to fit for heteroscedastic distribution situation. Results derived from both simulation dataset and two actual BCI datasets consistently show that Z-LDA achieves significantly higher average classification accuracies than conventional LDA, indicating the superiority of the new proposed decision boundary definition strategy.

  7. Using discriminant analysis to detect intrusions in external communication for self-driving vehicles

    Directory of Open Access Journals (Sweden)

    Khattab M.Ali Alheeti

    2017-08-01

    Full Text Available Security systems are a necessity for the deployment of smart vehicles in our society. Security in vehicular ad hoc networks is crucial to the reliable exchange of information and control data. In this paper, we propose an intelligent Intrusion Detection System (IDS to protect the external communication of self-driving and semi self-driving vehicles. This technology has the ability to detect Denial of Service (DoS and black hole attacks on vehicular ad hoc networks (VANETs. The advantage of the proposed IDS over existing security systems is that it detects attacks before they causes significant damage. The intrusion prediction technique is based on Linear Discriminant Analysis (LDA and Quadratic Discriminant Analysis (QDA which are used to predict attacks based on observed vehicle behavior. We perform simulations using Network Simulator 2 to demonstrate that the IDS achieves a low rate of false alarms and high accuracy in detection.

  8. Recent Advances and Trends in Nonparametric Statistics

    CERN Document Server

    Akritas, MG

    2003-01-01

    The advent of high-speed, affordable computers in the last two decades has given a new boost to the nonparametric way of thinking. Classical nonparametric procedures, such as function smoothing, suddenly lost their abstract flavour as they became practically implementable. In addition, many previously unthinkable possibilities became mainstream; prime examples include the bootstrap and resampling methods, wavelets and nonlinear smoothers, graphical methods, data mining, bioinformatics, as well as the more recent algorithmic approaches such as bagging and boosting. This volume is a collection o

  9. The application of sparse estimation of covariance matrix to quadratic discriminant analysis

    OpenAIRE

    Sun, Jiehuan; Zhao, Hongyu

    2015-01-01

    Background Although Linear Discriminant Analysis (LDA) is commonly used for classification, it may not be directly applied in genomics studies due to the large p, small n problem in these studies. Different versions of sparse LDA have been proposed to address this significant challenge. One implicit assumption of various LDA-based methods is that the covariance matrices are the same across different classes. However, rewiring of genetic networks (therefore different covariance matrices) acros...

  10. Nonparametric Identification and Estimation of Finite Mixture Models of Dynamic Discrete Choices

    OpenAIRE

    Hiroyuki Kasahara; Katsumi Shimotsu

    2006-01-01

    In dynamic discrete choice analysis, controlling for unobserved heterogeneity is an important issue, and finite mixture models provide flexible ways to account for unobserved heterogeneity. This paper studies nonparametric identifiability of type probabilities and type-specific component distributions in finite mixture models of dynamic discrete choices. We derive sufficient conditions for nonparametric identification for various finite mixture models of dynamic discrete choices used in appli...

  11. Discriminant analysis on the treatment results of interstitial radium tongue implants

    International Nuclear Information System (INIS)

    Hoshina, Masao; Shibuya, Hitoshi; Horiuchi, Jun-Ichi; Matsubara, Sho; Suzuki, Soji; Takeda, Masamune

    1989-01-01

    Discriminant analysis was carried out for 48 tongue cancer patients who were treated with radium single-plane implantation. The 48 patients were grouped into 32 successfully cured without complications, five successfully cured with complications, six successfully cured but requiring additional boost therapy and five with local recurrence. To evaluate the relation between the dose distribution and the local treatment results, the analysis was based on a volume-dose relationship. The functions introduced by this discriminant analysis were linear, and the parameters used were modal dose, average dose and shape factors of histograms. Each group of treatment results had a correction rate of >80%, except for the successfully cured group with ulcers. The discriminant functions were useful as an index to obtain a final clinical treatment result at the early time of implantation, and these functions could be used as a criterion for the optimal treatment of tongue carcinoma. We were also able to recognize the limitation of the actual arrangement of sources in the single-plane implant. (author)

  12. Differentiation of free-ranging chicken using discriminant analysis of phenotypic traits

    Directory of Open Access Journals (Sweden)

    Raed M. Al-Atiyat

    Full Text Available ABSTRACT In this study, we investigated the differentiation of five different chicken ecotypes - Center, North, South, West, and East - of Saudi Arabia using discriminate analysis. The analysis was based on nine important morphological and phenotypic traits: body color, beak color, earlobe color, eye color, shank color, comb color, comb type, comb size, and feather distribution. There was a strong significant relationship between the phenotype and effect of geographic height in terms of comb type and earlobe color in males as well as body, beak, eye, and shank color. In particular, the comb type and earlobe color differentiated the ecotypes of males. Among the females, the beak, earlobe, eye, shank color, and feather distribution had more differentiating power. Moreover, the discriminant analysis revealed that the five ecotypes were grouped into three clusters; the Center and the North in one cluster, the West and the South ecotypes in the second for males, and the East ecotype in the last cluster. The female dendogram branching was similar to the male dendrogram branching, except that the Center ecotype was grouped with the North instead of the South. The East ecotype was highly discriminated from the other ecotypes. Nevertheless, the potential of recent individual migration between ecotypes was also noted. Accordingly, the results of the utilized traits in this study might be effective in characterization and conservation of the genetic resources of the Saudi chicken.

  13. Spike detection, characterization, and discrimination using feature analysis software written in LabVIEW.

    Science.gov (United States)

    Stewart, C M; Newlands, S D; Perachio, A A

    2004-12-01

    Rapid and accurate discrimination of single units from extracellular recordings is a fundamental process for the analysis and interpretation of electrophysiological recordings. We present an algorithm that performs detection, characterization, discrimination, and analysis of action potentials from extracellular recording sessions. The program was entirely written in LabVIEW (National Instruments), and requires no external hardware devices or a priori information about action potential shapes. Waveform events are detected by scanning the digital record for voltages that exceed a user-adjustable trigger. Detected events are characterized to determine nine different time and voltage levels for each event. Various algebraic combinations of these waveform features are used as axis choices for 2-D Cartesian plots of events. The user selects axis choices that generate distinct clusters. Multiple clusters may be defined as action potentials by manually generating boundaries of arbitrary shape. Events defined as action potentials are validated by visual inspection of overlain waveforms. Stimulus-response relationships may be identified by selecting any recorded channel for comparison to continuous and average cycle histograms of binned unit data. The algorithm includes novel aspects of feature analysis and acquisition, including higher acquisition rates for electrophysiological data compared to other channels. The program confirms that electrophysiological data may be discriminated with high-speed and efficiency using algebraic combinations of waveform features derived from high-speed digital records.

  14. Machinery fault diagnosis using joint global and local/nonlocal discriminant analysis with selective ensemble learning

    Science.gov (United States)

    Yu, Jianbo

    2016-11-01

    The vibration signals of faulty machine are generally non-stationary and nonlinear under those complicated working conditions. Thus, it is a big challenge to extract and select the effective features from vibration signals for machinery fault diagnosis. This paper proposes a new manifold learning algorithm, joint global and local/nonlocal discriminant analysis (GLNDA), which aims to extract effective intrinsic geometrical information from the given vibration data. Comparisons with other regular methods, principal component analysis (PCA), local preserving projection (LPP), linear discriminant analysis (LDA) and local LDA (LLDA), illustrate the superiority of GLNDA in machinery fault diagnosis. Based on the extracted information by GLNDA, a GLNDA-based Fisher discriminant rule (FDR) is put forward and applied to machinery fault diagnosis without additional recognizer construction procedure. By importing Bagging into GLNDA score-based feature selection and FDR, a novel manifold ensemble method (selective GLNDA ensemble, SE-GLNDA) is investigated for machinery fault diagnosis. The motivation for developing ensemble of manifold learning components is that it can achieve higher accuracy and applicability than single component in machinery fault diagnosis. The effectiveness of the SE-GLNDA-based fault diagnosis method has been verified by experimental results from bearing full life testers.

  15. Using Dynamic Fourier Analysis to Discriminate Between Seismic Signals from Natural Earthquakes and Mining Explosions

    Directory of Open Access Journals (Sweden)

    Maria C. Mariani

    2017-08-01

    Full Text Available A sequence of intraplate earthquakes occurred in Arizona at the same location where miningexplosions were carried out in previous years. The explosions and some of the earthquakes generatedvery similar seismic signals. In this study Dynamic Fourier Analysis is used for discriminating signalsoriginating from natural earthquakes and mining explosions. Frequency analysis of seismogramsrecorded at regional distances shows that compared with the mining explosions the earthquake signalshave larger amplitudes in the frequency interval ~ 6 to 8 Hz and significantly smaller amplitudes inthe frequency interval ~ 2 to 4 Hz. This type of analysis permits identifying characteristics in theseismograms frequency yielding to detect potentially risky seismic events.

  16. [Etiological analysis and establishment of a discriminant model for lower respiratory tract infections in hospitalized patients].

    Science.gov (United States)

    Chen, Y S; Lin, X H; Li, H R; Hua, Z D; Lin, M Q; Huang, W S; Yu, T; Lyu, H Y; Mao, W P; Liang, Y Q; Peng, X R; Chen, S J; Zheng, H; Lian, S Q; Hu, X L; Yao, X Q

    2017-12-12

    Objective: To analyze the pathogens of lower respiratory tract infection(LRTI) including bacterial, viral and mixed infection, and to establish a discriminant model based on clinical features in order to predict the pathogens. Methods: A total of 243 hospitalized patients with lower respiratory tract infections were enrolled in Fujian Provincial Hospital from April 2012 to September 2015. The clinical data and airway (sputum and/or bronchoalveolar lavage) samples were collected. Microbes were identified by traditional culture (for bacteria), loop-mediated isothermal amplification(LAMP) and gene sequencing (for bacteria and atypical pathogen), or Real-time quantitative polymerase chain reaction (Real-time PCR)for viruses. Finally, a discriminant model was established by using the discriminant analysis methods to help to predict bacterial, viral and mixed infections. Results: Pathogens were detected in 53.9% (131/243) of the 243 cases.Bacteria accounted for 23.5%(57/243, of which 17 cases with the virus, 1 case with Mycoplasma pneumoniae and virus), mainly Pseudomonas Aeruginosa and Klebsiella Pneumonia. Atypical pathogens for 4.9% (12/243, of which 3 cases with the virus, 1 case of bacteria and viruses), all were mycoplasma pneumonia. Viruses for 34.6% (84/243, of which 17 cases of bacteria, 3 cases with Mycoplasma pneumoniae, 1 case with Mycoplasma pneumoniae and bacteria) of the cases, mainly Influenza A virus and Human Cytomegalovirus, and other virus like adenovirus, human parainfluenza virus, respiratory syncytial virus, human metapneumovirus, human boca virus were also detected fewly. Seven parameters including mental status, using antibiotics prior to admission, complications, abnormal breath sounds, neutrophil alkaline phosphatase (NAP) score, pneumonia severity index (PSI) score and CRUB-65 score were enrolled after univariate analysis, and discriminant analysis was used to establish the discriminant model by applying the identified pathogens as the

  17. The impact of ICT on educational performance and its efficiency in selected EU and OECD countries: a non-parametric analysis

    OpenAIRE

    Aristovnik, Aleksander

    2012-01-01

    The purpose of the paper is to review some previous researches examining ICT efficiency and the impact of ICT on educational output/outcome as well as different conceptual and methodological issues related to performance measurement. Moreover, a definition, measurements and the empirical application of a model measuring the efficiency of ICT use and its impact at national levels will be considered. For this purpose, the Data Envelopment Analysis (DEA) technique is presented and then applied t...

  18. Teaching Nonparametric Statistics Using Student Instrumental Values.

    Science.gov (United States)

    Anderson, Jonathan W.; Diddams, Margaret

    Nonparametric statistics are often difficult to teach in introduction to statistics courses because of the lack of real-world examples. This study demonstrated how teachers can use differences in the rankings and ratings of undergraduate and graduate values to discuss: (1) ipsative and normative scaling; (2) uses of the Mann-Whitney U-test; and…

  19. Nonparametric conditional predictive regions for time series

    NARCIS (Netherlands)

    de Gooijer, J.G.; Zerom Godefay, D.

    2000-01-01

    Several nonparametric predictors based on the Nadaraya-Watson kernel regression estimator have been proposed in the literature. They include the conditional mean, the conditional median, and the conditional mode. In this paper, we consider three types of predictive regions for these predictors — the

  20. Nonparametric predictive inference in statistical process control

    NARCIS (Netherlands)

    Arts, G.R.J.; Coolen, F.P.A.; Laan, van der P.

    2000-01-01

    New methods for statistical process control are presented, where the inferences have a nonparametric predictive nature. We consider several problems in process control in terms of uncertainties about future observable random quantities, and we develop inferences for these random quantities hased on

  1. Nonparametric predictive inference in statistical process control

    NARCIS (Netherlands)

    Arts, G.R.J.; Coolen, F.P.A.; Laan, van der P.

    2004-01-01

    Statistical process control (SPC) is used to decide when to stop a process as confidence in the quality of the next item(s) is low. Information to specify a parametric model is not always available, and as SPC is of a predictive nature, we present a control chart developed using nonparametric

  2. Non-Parametric Estimation of Correlation Functions

    DEFF Research Database (Denmark)

    Brincker, Rune; Rytter, Anders; Krenk, Steen

    In this paper three methods of non-parametric correlation function estimation are reviewed and evaluated: the direct method, estimation by the Fast Fourier Transform and finally estimation by the Random Decrement technique. The basic ideas of the techniques are reviewed, sources of bias are point...

  3. Nonparametric estimation in models for unobservable heterogeneity

    OpenAIRE

    Hohmann, Daniel

    2014-01-01

    Nonparametric models which allow for data with unobservable heterogeneity are studied. The first publication introduces new estimators and their asymptotic properties for conditional mixture models. The second publication considers estimation of a function from noisy observations of its Radon transform in a Gaussian white noise model.

  4. Nonparametric estimation of location and scale parameters

    KAUST Repository

    Potgieter, C.J.; Lombard, F.

    2012-01-01

    Two random variables X and Y belong to the same location-scale family if there are constants μ and σ such that Y and μ+σX have the same distribution. In this paper we consider non-parametric estimation of the parameters μ and σ under minimal

  5. Panel data specifications in nonparametric kernel regression

    DEFF Research Database (Denmark)

    Czekaj, Tomasz Gerard; Henningsen, Arne

    parametric panel data estimators to analyse the production technology of Polish crop farms. The results of our nonparametric kernel regressions generally differ from the estimates of the parametric models but they only slightly depend on the choice of the kernel functions. Based on economic reasoning, we...

  6. The contribution of cluster and discriminant analysis to the classification of complex aquifer systems.

    Science.gov (United States)

    Panagopoulos, G P; Angelopoulou, D; Tzirtzilakis, E E; Giannoulopoulos, P

    2016-10-01

    This paper presents an innovated method for the discrimination of groundwater samples in common groups representing the hydrogeological units from where they have been pumped. This method proved very efficient even in areas with complex hydrogeological regimes. The proposed method requires chemical analyses of water samples only for major ions, meaning that it is applicable to most of cases worldwide. Another benefit of the method is that it gives a further insight of the aquifer hydrogeochemistry as it provides the ions that are responsible for the discrimination of the group. The procedure begins with cluster analysis of the dataset in order to classify the samples in the corresponding hydrogeological unit. The feasibility of the method is proven from the fact that the samples of volcanic origin were separated into two different clusters, namely the lava units and the pyroclastic-ignimbritic aquifer. The second step is the discriminant analysis of the data which provides the functions that distinguish the groups from each other and the most significant variables that define the hydrochemical composition of the aquifer. The whole procedure was highly successful as the 94.7 % of the samples were classified to the correct aquifer system. Finally, the resulted functions can be safely used to categorize samples of either unknown or doubtful origin improving thus the quality and the size of existing hydrochemical databases.

  7. Detection of non-milk fat in milk fat by gas chromatography and linear discriminant analysis.

    Science.gov (United States)

    Gutiérrez, R; Vega, S; Díaz, G; Sánchez, J; Coronado, M; Ramírez, A; Pérez, J; González, M; Schettino, B

    2009-05-01

    Gas chromatography was utilized to determine triacylglycerol profiles in milk and non-milk fat. The values of triacylglycerol were subjected to linear discriminant analysis to detect and quantify non-milk fat in milk fat. Two groups of milk fat were analyzed: A) raw milk fat from the central region of Mexico (n = 216) and B) ultrapasteurized milk fat from 3 industries (n = 36), as well as pork lard (n = 2), bovine tallow (n = 2), fish oil (n = 2), peanut (n = 2), corn (n = 2), olive (n = 2), and soy (n = 2). The samples of raw milk fat were adulterated with non-milk fats in proportions of 0, 5, 10, 15, and 20% to form 5 groups. The first function obtained from the linear discriminant analysis allowed the correct classification of 94.4% of the samples with levels <10% of adulteration. The triacylglycerol values of the ultrapasteurized milk fats were evaluated with the discriminant function, demonstrating that one industry added non-milk fat to its product in 80% of the samples analyzed.

  8. A Normalization-Free and Nonparametric Method Sharpens Large-Scale Transcriptome Analysis and Reveals Common Gene Alteration Patterns in Cancers.

    Science.gov (United States)

    Li, Qi-Gang; He, Yong-Han; Wu, Huan; Yang, Cui-Ping; Pu, Shao-Yan; Fan, Song-Qing; Jiang, Li-Ping; Shen, Qiu-Shuo; Wang, Xiao-Xiong; Chen, Xiao-Qiong; Yu, Qin; Li, Ying; Sun, Chang; Wang, Xiangting; Zhou, Jumin; Li, Hai-Peng; Chen, Yong-Bin; Kong, Qing-Peng

    2017-01-01

    Heterogeneity in transcriptional data hampers the identification of differentially expressed genes (DEGs) and understanding of cancer, essentially because current methods rely on cross-sample normalization and/or distribution assumption-both sensitive to heterogeneous values. Here, we developed a new method, Cross-Value Association Analysis (CVAA), which overcomes the limitation and is more robust to heterogeneous data than the other methods. Applying CVAA to a more complex pan-cancer dataset containing 5,540 transcriptomes discovered numerous new DEGs and many previously rarely explored pathways/processes; some of them were validated, both in vitro and in vivo , to be crucial in tumorigenesis, e.g., alcohol metabolism ( ADH1B ), chromosome remodeling ( NCAPH ) and complement system ( Adipsin ). Together, we present a sharper tool to navigate large-scale expression data and gain new mechanistic insights into tumorigenesis.

  9. Background reduction and noise discrimination in the proportional counting of tritium using pulse-shape analysis

    Energy Technology Data Exchange (ETDEWEB)

    Hochel, R C; Hayes, D W [Du Pont de Nemours (E.I.) and Co., Aiken, S.C. (USA). Savannah River Lab.

    1975-12-01

    A pulse-shape analysis (PSA) unit of commercial design has been incorporated into a proportional counting system to determine the effectiveness of pulse-shape discrimination in increasing the sensitivity of tritium counting. It was found that a quantitative determination of tritium could be obtained directly from the PSA time spectrum eliminating the need for beta-ray energy selection used in the pulse-shape discrimination (PSD) technique. The performance of the proportional counting system was tested using the PSA unit and anticoincidence shielding, both singly and combined, under several types of background. A background reduction factor of 169 was obtained from the combined PSA-anticoincidence system with only a 2% loss in tritium counting efficiency. The PSA method was also found to offer significant reductions in noise background.

  10. Background reduction and noise discrimination in the proportional counting of tritium using pulse-shape analysis

    International Nuclear Information System (INIS)

    Hochel, R.C.; Hayes, D.W.

    1975-01-01

    A pulse-shape analysis (PSA) unit of commercial design has been incorporated into a proportional counting system to determine the effectiveness of pulse-shape discrimination in increasing the sensitivity of tritium counting. It was found that a quantitative determination of tritium could be obtained directly from the PSA time spectrum eliminating the need for beta-ray energy selection used in the pulse-shape discrimination (PSD) technique. The performance of the proportional counting system was tested using the PSA unit and anticoincidence shielding, both singly and combined, under several types of background. A background reduction factor of 169 was obtained from the combined PSA-anticoincidence system with only a 2% loss in tritium counting efficiency. The PSA method was also found to offer significant reductions in noise background. (Auth.)

  11. INCOME INEQUALITY IN SOME MAJOR EUROPEAN UNION ECONOMIES A DISCRIMINANT ANALYSIS

    Directory of Open Access Journals (Sweden)

    JYOTIRMAYEE KAR

    2012-12-01

    Full Text Available This exercise is an attempt to assess the importance of some social, economic, demographic and infrastructural factors which account for the prevailing income inequality across some of the EU countries. Using discriminant analysis the study suggests that crime recorded by police is the most important predictor in discriminating between the group of countries with relatively more equitable distribution of income from those with less. This variable is followed by number of students in the country. Reduction in the level of crime and improvement in the student strength could help in reducing income inequality. Quite intuitively, improvement in all the economic factors like GDP per capita and agricultural index will help to reduce income inequality. Identical is the case of the demographic factors. This calls for implementation of developmental policies towards improvement in these areas.

  12. Principal component analysis for neural electron/jet discrimination in highly segmented calorimeters

    International Nuclear Information System (INIS)

    Vassali, M.R.; Seixas, J.M.

    2001-01-01

    A neural electron/jet discriminator based on calorimetry is developed for the second-level trigger system of the ATLAS detector. As preprocessing of the calorimeter information, a principal component analysis is performed on each segment of the two sections (electromagnetic and hadronic) of the calorimeter system, in order to reduce significantly the dimension of the input data space and fully explore the detailed energy deposition profile, which is provided by the highly-segmented calorimeter system. It is shown that projecting calorimeter data onto 33 segmented principal components, the discrimination efficiency of the neural classifier reaches 98.9% for electrons (with only 1% of false alarm probability). Furthermore, restricting data projection onto only 9 components, an electron efficiency of 99.1% is achieved (with 3% of false alarm), which confirms that a fast triggering system may be designed using few components

  13. Sex assessment from carpals bones: discriminant function analysis in a contemporary Mexican sample.

    Science.gov (United States)

    Mastrangelo, Paola; De Luca, Stefano; Sánchez-Mejorada, Gabriela

    2011-06-15

    Sex assessment is one of the first essential steps in human identification, in both medico-legal cases and bio-archaeological contexts. Fragmentary human remains compromised by different types of burial or physical insults may frustrate the use of the traditional sex estimation methods, such as the analysis of the skull and pelvis. Currently, the application of discriminant functions to sex unidentified skeletal remains is steadily increasing. However, several studies have demonstrated that, due to variation in size and patterns of sexual dimorphism, discriminant functions are population-specific. In this study, in order to improve sex assessment from skeletal remains and to establish population-specific discriminant functions, the diagnostic values of the carpal bones were considered. A sample of 136 individuals (78 males, 58 females) of known sex and age was analyzed. They belong to a contemporary identified collection from the Laboratory of Physical Anthropology, Faculty of Medicine, UNAM (Universidad Nacional Autónoma de México, Mexico City). The age of the individuals ranged between 25 and 85 years. Between four and nine measurements of each carpal bone were taken. Independent t-tests confirm that all carpals are sexually dimorphic. Univariate measurements produce accuracy levels that range from 61.8% to 90.8%. Classification accuracies ranged between 81.3% and 92.3% in the multivariate stepwise discriminant analysis. In addition, intra- and inter-observer error tests were performed. These indicated that replication of measurements was satisfactory for the same observer over time and between observers. These results suggest that carpal bones can be used for assessing sex in both forensic and bio-archaeological identification procedures and that bone dimensions are population specific. Copyright © 2011 Elsevier Ireland Ltd. All rights reserved.

  14. Are Public-Private Partnerships a Source of Greater Efficiency in Water Supply? Results of a Non-Parametric Performance Analysis Relating to the Italian Industry

    Directory of Open Access Journals (Sweden)

    Corrado lo Storto

    2013-12-01

    Full Text Available This article reports the outcome of a performance study of the water service provision industry in Italy. The study evaluates the efficiency of 21 “private or public-private” equity and 32 “public” equity water service operators and investigates controlling factors. In particular, the influence that the operator typology and service management nature - private vs. public - has on efficiency is assessed. The study employed a two-stage Data Envelopment Analysis methodology. In the first stage, the operational efficiency of water supply operators is calculated by implementing a conventional BCC DEA model, that uses both physical infrastructure and financial input and output variables to explore economies of scale. In the second stage, bootstrapped DEA and Tobit regression are performed to estimate the influence that a number of environmental factors have on water supplier efficiency. The results show that the integrated water provision industry in Italy is characterized by operational inefficiencies of service operators, and scale and agglomeration economies may have a not negligible effect on efficiency. In addition, the operator typology and its geographical location affect efficiency.

  15. Study on discriminant analysis by military mental disorder prediction scale for mental disorder of new recruits

    Directory of Open Access Journals (Sweden)

    Li-yi ZHANG

    2011-11-01

    Full Text Available Objective To examine the predictive role of the Military Mental Disorder Prediction Scale on the mental disorder of new recruits.Methods The present study examined 115 new recruits diagnosed with mental disorder and 115 healthy new recruits.The recruits were tested using the Military Mental Disorder Prediction Scale.The discriminant function was built by discriminant analysis method.The current study analyzed the predictive value of 11 factors(family medical record and past medical record(X1,growth experience(X2,introversion(X3,stressor(X4,poor mental defense(X5,social support(X6,psychosis(X7,depression(X8,mania(X9,neurosis(X10,and personality disorder(X11 aside from lie factor on the mental disorder of new recruits.Results The mental disorder group has higher total score and factor score in family medical record and past medical record,introversion,stressor,poor mental defense,social support,psychosis,depression,mania,neurosis,personality disorder,and lie than those of the contrast group(P < 0.01.For the score of growth experience factor,that of the mental disorder group is higher than the score of the contrast group(P < 0.05.All 11 factors except the lie factor in the Mental Disorder Prediction Scale are taken as independent variables by enforced introduction to obtain the Fisher linear discriminant function as follows: The mental disorder group=-7.014-0.278X1+1.556X2+1.563X3+0.878X4+0.183X5-0.845X6-0.562X7-0.353X8+1.246X9-0.505X10+1.029X11.The contrast group=-2.971+0.056X1+2.194X2+0.707X3+0.592X4-0.086X5-0.888X6-0.133X7-0.360X8+0.654X9-0.467X10+0.308X11.The discriminant function has an accuracy rate of 76.5% on the new recruits with mental disorders and 100% on the healthy new recruits.The total accurate discrimination rate is 88.3% and the total inaccurate discrimination rate is 11.7%.Conclusion The Military Mental Disorder Prediction Scale has a high accuracy rate on the prediction of mental disorder of new recruits and is worthy of

  16. Textural Maturity Analysis and Sedimentary Environment Discrimination Based on Grain Shape Data

    Science.gov (United States)

    Tunwal, M.; Mulchrone, K. F.; Meere, P. A.

    2017-12-01

    Morphological analysis of clastic sedimentary grains is an important source of information regarding the processes involved in their formation, transportation and deposition. However, a standardised approach for quantitative grain shape analysis is generally lacking. In this contribution we report on a study where fully automated image analysis techniques were applied to loose sediment samples collected from glacial, aeolian, beach and fluvial environments. A range of shape parameters are evaluated for their usefulness in textural characterisation of populations of grains. The utility of grain shape data in ranking textural maturity of samples within a given sedimentary environment is evaluated. Furthermore, discrimination of sedimentary environment on the basis of grain shape information is explored. The data gathered demonstrates a clear progression in textural maturity in terms of roundness, angularity, irregularity, fractal dimension, convexity, solidity and rectangularity. Textural maturity can be readily categorised using automated grain shape parameter analysis. However, absolute discrimination between different depositional environments on the basis of shape parameters alone is less certain. For example, the aeolian environment is quite distinct whereas fluvial, glacial and beach samples are inherently variable and tend to overlap each other in terms of textural maturity. This is most likely due to a collection of similar processes and sources operating within these environments. This study strongly demonstrates the merit of quantitative population-based shape parameter analysis of texture and indicates that it can play a key role in characterising both loose and consolidated sediments. This project is funded by the Irish Petroleum Infrastructure Programme (www.pip.ie)

  17. Prediction of Depression in Cancer Patients With Different Classification Criteria, Linear Discriminant Analysis versus Logistic Regression.

    Science.gov (United States)

    Shayan, Zahra; Mohammad Gholi Mezerji, Naser; Shayan, Leila; Naseri, Parisa

    2015-11-03

    Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices. This cross-sectional study selected 243 cancer patients. Sample sets of different sizes (n = 50, 100, 150, 200, 220) were randomly selected and the CE, B, and Q classification indices were calculated by the LR and LDA models. CE revealed the a lack of superiority for one model over the other, but the results showed that LR performed better than LDA for the B and Q indices in all situations. No significant effect for sample size on CE was noted for selection of an optimal model. Assessment of the accuracy of prediction of real data indicated that the B and Q indices are appropriate for selection of an optimal model. The results of this study showed that LR performs better in some cases and LDA in others when based on CE. The CE index is not appropriate for classification, although the B and Q indices performed better and offered more efficient criteria for comparison and discrimination between groups.

  18. Attractor structure discriminates sleep states: recurrence plot analysis applied to infant breathing patterns.

    Science.gov (United States)

    Terrill, Philip Ian; Wilson, Stephen James; Suresh, Sadasivam; Cooper, David M; Dakin, Carolyn

    2010-05-01

    Breathing patterns are characteristically different between infant active sleep (AS) and quiet sleep (QS), and statistical quantifications of interbreath interval (IBI) data have previously been used to discriminate between infant sleep states. It has also been identified that breathing patterns are governed by a nonlinear controller. This study aims to investigate whether nonlinear quantifications of infant IBI data are characteristically different between AS and QS, and whether they may be used to discriminate between these infant sleep states. Polysomnograms were obtained from 24 healthy infants at six months of age. Periods of AS and QS were identified, and IBI data extracted. Recurrence quantification analysis (RQA) was applied to each period, and recurrence calculated for a fixed radius in the range of 0-8 in steps of 0.02, and embedding dimensions of 4, 6, 8, and 16. When a threshold classifier was trained, the RQA variable recurrence was able to correctly classify 94.3% of periods in a test dataset. It was concluded that RQA of IBI data is able to accurately discriminate between infant sleep states. This is a promising step toward development of a minimal-channel automatic sleep state classification system.

  19. Applying linear discriminant analysis to predict groundwater redox conditions conducive to denitrification

    Science.gov (United States)

    Wilson, S. R.; Close, M. E.; Abraham, P.

    2018-01-01

    Diffuse nitrate losses from agricultural land pollute groundwater resources worldwide, but can be attenuated under reducing subsurface conditions. In New Zealand, the ability to predict where groundwater denitrification occurs is important for understanding the linkage between land use and discharges of nitrate-bearing groundwater to streams. This study assesses the application of linear discriminant analysis (LDA) for predicting groundwater redox status for Southland, a major dairy farming region in New Zealand. Data cases were developed by assigning a redox status to samples derived from a regional groundwater quality database. Pre-existing regional-scale geospatial databases were used as training variables for the discriminant functions. The predictive accuracy of the discriminant functions was slightly improved by optimising the thresholds between sample depth classes. The models predict 23% of the region as being reducing at shallow depths (water table, and low-permeability clastic sediments. The coastal plains are an area of widespread groundwater discharge, and the soil and hydrology characteristics require the land to be artificially drained to render the land suitable for farming. For the improvement of water quality in coastal areas, it is therefore important that land and water management efforts focus on understanding hydrological bypassing that may occur via artificial drainage systems.

  20. Accurate palm vein recognition based on wavelet scattering and spectral regression kernel discriminant analysis

    Science.gov (United States)

    Elnasir, Selma; Shamsuddin, Siti Mariyam; Farokhi, Sajad

    2015-01-01

    Palm vein recognition (PVR) is a promising new biometric that has been applied successfully as a method of access control by many organizations, which has even further potential in the field of forensics. The palm vein pattern has highly discriminative features that are difficult to forge because of its subcutaneous position in the palm. Despite considerable progress and a few practical issues, providing accurate palm vein readings has remained an unsolved issue in biometrics. We propose a robust and more accurate PVR method based on the combination of wavelet scattering (WS) with spectral regression kernel discriminant analysis (SRKDA). As the dimension of WS generated features is quite large, SRKDA is required to reduce the extracted features to enhance the discrimination. The results based on two public databases-PolyU Hyper Spectral Palmprint public database and PolyU Multi Spectral Palmprint-show the high performance of the proposed scheme in comparison with state-of-the-art methods. The proposed approach scored a 99.44% identification rate and a 99.90% verification rate [equal error rate (EER)=0.1%] for the hyperspectral database and a 99.97% identification rate and a 99.98% verification rate (EER=0.019%) for the multispectral database.

  1. Rapid discrimination of plastic packaging materials using MIR spectroscopy coupled with independent components analysis (ICA).

    Science.gov (United States)

    Kassouf, Amine; Maalouly, Jacqueline; Rutledge, Douglas N; Chebib, Hanna; Ducruet, Violette

    2014-11-01

    Plastic packaging wastes increased considerably in recent decades, raising a major and serious public concern on political, economical and environmental levels. Dealing with this kind of problems is generally done by landfilling and energy recovery. However, these two methods are becoming more and more expensive, hazardous to the public health and the environment. Therefore, recycling is gaining worldwide consideration as a solution to decrease the growing volume of plastic packaging wastes and simultaneously reduce the consumption of oil required to produce virgin resin. Nevertheless, a major shortage is encountered in recycling which is related to the sorting of plastic wastes. In this paper, a feasibility study was performed in order to test the potential of an innovative approach combining mid infrared (MIR) spectroscopy with independent components analysis (ICA), as a simple and fast approach which could achieve high separation rates. This approach (MIR-ICA) gave 100% discrimination rates in the separation of all studied plastics: polyethylene terephthalate (PET), polyethylene (PE), polypropylene (PP), polystyrene (PS) and polylactide (PLA). In addition, some more specific discriminations were obtained separating plastic materials belonging to the same polymer family e.g. high density polyethylene (HDPE) from low density polyethylene (LDPE). High discrimination rates were obtained despite the heterogeneity among samples especially differences in colors, thicknesses and surface textures. The reproducibility of the proposed approach was also tested using two spectrometers with considerable differences in their sensitivities. Discrimination rates were not affected proving that the developed approach could be extrapolated to different spectrometers. MIR combined with ICA is a promising tool for plastic waste separation that can help improve performance in this field; however further technological improvements and developments are required before it can be applied

  2. Nonparametric Change Point Diagnosis Method of Concrete Dam Crack Behavior Abnormality

    Directory of Open Access Journals (Sweden)

    Zhanchao Li

    2013-01-01

    Full Text Available The study on diagnosis method of concrete crack behavior abnormality has always been a hot spot and difficulty in the safety monitoring field of hydraulic structure. Based on the performance of concrete dam crack behavior abnormality in parametric statistical model and nonparametric statistical model, the internal relation between concrete dam crack behavior abnormality and statistical change point theory is deeply analyzed from the model structure instability of parametric statistical model and change of sequence distribution law of nonparametric statistical model. On this basis, through the reduction of change point problem, the establishment of basic nonparametric change point model, and asymptotic analysis on test method of basic change point problem, the nonparametric change point diagnosis method of concrete dam crack behavior abnormality is created in consideration of the situation that in practice concrete dam crack behavior may have more abnormality points. And the nonparametric change point diagnosis method of concrete dam crack behavior abnormality is used in the actual project, demonstrating the effectiveness and scientific reasonableness of the method established. Meanwhile, the nonparametric change point diagnosis method of concrete dam crack behavior abnormality has a complete theoretical basis and strong practicality with a broad application prospect in actual project.

  3. Hyperplane distance neighbor clustering based on local discriminant analysis for complex chemical processes monitoring

    Energy Technology Data Exchange (ETDEWEB)

    Lu, Chunhong; Xiao, Shaoqing; Gu, Xiaofeng [Jiangnan University, Wuxi (China)

    2014-11-15

    The collected training data often include both normal and faulty samples for complex chemical processes. However, some monitoring methods, such as partial least squares (PLS), principal component analysis (PCA), independent component analysis (ICA) and Fisher discriminant analysis (FDA), require fault-free data to build the normal operation model. These techniques are applicable after the preliminary step of data clustering is applied. We here propose a novel hyperplane distance neighbor clustering (HDNC) based on the local discriminant analysis (LDA) for chemical process monitoring. First, faulty samples are separated from normal ones using the HDNC method. Then, the optimal subspace for fault detection and classification can be obtained using the LDA approach. The proposed method takes the multimodality within the faulty data into account, and thus improves the capability of process monitoring significantly. The HDNC-LDA monitoring approach is applied to two simulation processes and then compared with the conventional FDA based on the K-nearest neighbor (KNN-FDA) method. The results obtained in two different scenarios demonstrate the superiority of the HDNC-LDA approach in terms of fault detection and classification accuracy.

  4. Hyperplane distance neighbor clustering based on local discriminant analysis for complex chemical processes monitoring

    International Nuclear Information System (INIS)

    Lu, Chunhong; Xiao, Shaoqing; Gu, Xiaofeng

    2014-01-01

    The collected training data often include both normal and faulty samples for complex chemical processes. However, some monitoring methods, such as partial least squares (PLS), principal component analysis (PCA), independent component analysis (ICA) and Fisher discriminant analysis (FDA), require fault-free data to build the normal operation model. These techniques are applicable after the preliminary step of data clustering is applied. We here propose a novel hyperplane distance neighbor clustering (HDNC) based on the local discriminant analysis (LDA) for chemical process monitoring. First, faulty samples are separated from normal ones using the HDNC method. Then, the optimal subspace for fault detection and classification can be obtained using the LDA approach. The proposed method takes the multimodality within the faulty data into account, and thus improves the capability of process monitoring significantly. The HDNC-LDA monitoring approach is applied to two simulation processes and then compared with the conventional FDA based on the K-nearest neighbor (KNN-FDA) method. The results obtained in two different scenarios demonstrate the superiority of the HDNC-LDA approach in terms of fault detection and classification accuracy

  5. Microaggressions, Discrimination, and Phenotype among African Americans: A Latent Class Analysis of the Impact of Skin Tone and BMI.

    Science.gov (United States)

    Keith, Verna M; Nguyen, Ann W; Taylor, Robert Joseph; Mouzon, Dawne M; Chatters, Linda M

    2017-05-01

    Data from the 2001-2003National Survey of American Life are used to investigate the effects of phenotype on everyday experiences with discrimination among African Americans (N=3343). Latent class analysis is used to identify four classes of discriminatory treatment: 1) low levels of discrimination, 2) disrespect and condescension, 3) character-based discrimination, and 4) high levels of discrimination. We then employ latent class multinomial logistic regression to evaluate the association between skin tone and body weight and these four classes of discrimination. Designating the low level discrimination class as the reference group, findings revealed that respondents with darker skin were more likely to be classified into the disrespect/condescension and the high level microaggression types. BMI was unrelated to the discrimination type, although there was a significant interaction effect between gender and BMI. BMI was strongly and positively associated with membership in the disrespect and condescension type among men but not among women. These findings indicate that skin tone and body weight are two phenotypic characteristics that influence the type and frequency of discrimination experienced by African Americans.

  6. Rapid discrimination of bergamot essential oil by paper spray mass spectrometry and chemometric analysis.

    Science.gov (United States)

    Taverna, Domenico; Di Donna, Leonardo; Mazzotti, Fabio; Tagarelli, Antonio; Napoli, Anna; Furia, Emilia; Sindona, Giovanni

    2016-09-01

    A novel approach for the rapid discrimination of bergamot essential oil from other citrus fruits oils is presented. The method was developed using paper spray mass spectrometry (PS-MS) allowing for a rapid molecular profiling coupled with a statistic tool for a precise and reliable discrimination between the bergamot complex matrix and other similar matrices, commonly used for its reconstitution. Ambient mass spectrometry possesses the ability to record mass spectra of ordinary samples, in their native environment, without sample preparation or pre-separation by creating ions outside the instrument. The present study reports a PS-MS method for the determination of oxygen heterocyclic compounds such as furocoumarins, psoralens and flavonoids present in the non-volatile fraction of citrus fruits essential oils followed by chemometric analysis. The volatile fraction of Bergamot is one of the most known and fashionable natural products, which found applications in flavoring industry as ingredient in beverages and flavored foodstuff. The development of the presented method employed bergamot, sweet orange, orange, cedar, grapefruit and mandarin essential oils. PS-MS measurements were carried out in full scan mode for a total run time of 2 min. The capability of PS-MS profiling to act as marker for the classification of bergamot essential oils was evaluated by using multivariate statistical analysis. Two pattern recognition techniques, linear discriminant analysis and soft independent modeling of class analogy, were applied to MS data. The cross-validation procedure has shown excellent results in terms of the prediction ability because both models have correctly classified all samples for each category. Copyright © 2016 John Wiley & Sons, Ltd. Copyright © 2016 John Wiley & Sons, Ltd.

  7. Estimating the causes of traffic accidents using logistic regression and discriminant analysis.

    Science.gov (United States)

    Karacasu, Murat; Ergül, Barış; Altin Yavuz, Arzu

    2014-01-01

    Factors that affect traffic accidents have been analysed in various ways. In this study, we use the methods of logistic regression and discriminant analysis to determine the damages due to injury and non-injury accidents in the Eskisehir Province. Data were obtained from the accident reports of the General Directorate of Security in Eskisehir; 2552 traffic accidents between January and December 2009 were investigated regarding whether they resulted in injury. According to the results, the effects of traffic accidents were reflected in the variables. These results provide a wealth of information that may aid future measures toward the prevention of undesired results.

  8. Lameness detection challenges in automated milking systems addressed with partial least squares discriminant analysis

    DEFF Research Database (Denmark)

    Garcia, Emanuel; Klaas, Ilka Christine; Amigo Rubio, Jose Manuel

    2014-01-01

    Lameness is prevalent in dairy herds. It causes decreased animal welfare and leads to higher production costs. This study explored data from an automatic milking system (AMS) to model on-farm gait scoring from a commercial farm. A total of 88 cows were gait scored once per week, for 2 5-wk periods......). The reference gait scoring error was estimated in the first week of the study and was, on average, 15%. Two partial least squares discriminant analysis models were fitted to parity 1 and parity 2 groups, respectively, to assign the lameness class according to the predicted probability of being lame (score 3...

  9. Discrimination of bromodeoxyuridine labelled and unlabelled mitotic cells in flow cytometric bromodeoxyuridine/DNA analysis

    DEFF Research Database (Denmark)

    Jensen, P O; Larsen, J K; Christensen, I J

    1994-01-01

    Bromodeoxyuridine (BrdUrd) labelled and unlabelled mitotic cells, respectively, can be discriminated from interphase cells using a new method, based on immunocytochemical staining of BrdUrd and flow cytometric four-parameter analysis of DNA content, BrdUrd incorporation, and forward and orthogonal...... light scatter. The method was optimized using the human leukemia cell lines HL-60 and K-562. Samples of 10(5) ethanol-fixed cells were treated with pepsin/HCl and stained as a nuclear suspension with anti-BrdUrd antibody, FITC-conjugated secondary antibody, and propidium iodide. Labelled mitoses could...

  10. Classification of Error-Diffused Halftone Images Based on Spectral Regression Kernel Discriminant Analysis

    Directory of Open Access Journals (Sweden)

    Zhigao Zeng

    2016-01-01

    Full Text Available This paper proposes a novel algorithm to solve the challenging problem of classifying error-diffused halftone images. We firstly design the class feature matrices, after extracting the image patches according to their statistics characteristics, to classify the error-diffused halftone images. Then, the spectral regression kernel discriminant analysis is used for feature dimension reduction. The error-diffused halftone images are finally classified using an idea similar to the nearest centroids classifier. As demonstrated by the experimental results, our method is fast and can achieve a high classification accuracy rate with an added benefit of robustness in tackling noise.

  11. Parametric and Non-Parametric System Modelling

    DEFF Research Database (Denmark)

    Nielsen, Henrik Aalborg

    1999-01-01

    the focus is on combinations of parametric and non-parametric methods of regression. This combination can be in terms of additive models where e.g. one or more non-parametric term is added to a linear regression model. It can also be in terms of conditional parametric models where the coefficients...... considered. It is shown that adaptive estimation in conditional parametric models can be performed by combining the well known methods of local polynomial regression and recursive least squares with exponential forgetting. The approach used for estimation in conditional parametric models also highlights how...... networks is included. In this paper, neural networks are used for predicting the electricity production of a wind farm. The results are compared with results obtained using an adaptively estimated ARX-model. Finally, two papers on stochastic differential equations are included. In the first paper, among...

  12. Nonparametric Bayes Modeling of Multivariate Categorical Data.

    Science.gov (United States)

    Dunson, David B; Xing, Chuanhua

    2012-01-01

    Modeling of multivariate unordered categorical (nominal) data is a challenging problem, particularly in high dimensions and cases in which one wishes to avoid strong assumptions about the dependence structure. Commonly used approaches rely on the incorporation of latent Gaussian random variables or parametric latent class models. The goal of this article is to develop a nonparametric Bayes approach, which defines a prior with full support on the space of distributions for multiple unordered categorical variables. This support condition ensures that we are not restricting the dependence structure a priori. We show this can be accomplished through a Dirichlet process mixture of product multinomial distributions, which is also a convenient form for posterior computation. Methods for nonparametric testing of violations of independence are proposed, and the methods are applied to model positional dependence within transcription factor binding motifs.

  13. Rapid discrimination of plastic packaging materials using MIR spectroscopy coupled with independent components analysis (ICA)

    Energy Technology Data Exchange (ETDEWEB)

    Kassouf, Amine, E-mail: amine.kassouf@agroparistech.fr [ER004 “Lebanese Food Packaging”, Faculty of Sciences II, Lebanese University, 90656 Jdeideth El Matn, Fanar (Lebanon); INRA, UMR1145 Ingénierie Procédés Aliments, 1 Avenue des Olympiades, 91300 Massy (France); AgroParisTech, UMR1145 Ingénierie Procédés Aliments, 16 rue Claude Bernard, 75005 Paris (France); Maalouly, Jacqueline, E-mail: j_maalouly@hotmail.com [ER004 “Lebanese Food Packaging”, Faculty of Sciences II, Lebanese University, 90656 Jdeideth El Matn, Fanar (Lebanon); Rutledge, Douglas N., E-mail: douglas.rutledge@agroparistech.fr [INRA, UMR1145 Ingénierie Procédés Aliments, 1 Avenue des Olympiades, 91300 Massy (France); AgroParisTech, UMR1145 Ingénierie Procédés Aliments, 16 rue Claude Bernard, 75005 Paris (France); Chebib, Hanna, E-mail: hchebib@hotmail.com [ER004 “Lebanese Food Packaging”, Faculty of Sciences II, Lebanese University, 90656 Jdeideth El Matn, Fanar (Lebanon); Ducruet, Violette, E-mail: violette.ducruet@agroparistech.fr [INRA, UMR1145 Ingénierie Procédés Aliments, 1 Avenue des Olympiades, 91300 Massy (France); AgroParisTech, UMR1145 Ingénierie Procédés Aliments, 16 rue Claude Bernard, 75005 Paris (France)

    2014-11-15

    Highlights: • An innovative technique, MIR-ICA, was applied to plastic packaging separation. • This study was carried out on PE, PP, PS, PET and PLA plastic packaging materials. • ICA was applied to discriminate plastics and 100% separation rates were obtained. • Analyses performed on two spectrometers proved the reproducibility of the method. • MIR-ICA is a simple and fast technique allowing plastic identification/classification. - Abstract: Plastic packaging wastes increased considerably in recent decades, raising a major and serious public concern on political, economical and environmental levels. Dealing with this kind of problems is generally done by landfilling and energy recovery. However, these two methods are becoming more and more expensive, hazardous to the public health and the environment. Therefore, recycling is gaining worldwide consideration as a solution to decrease the growing volume of plastic packaging wastes and simultaneously reduce the consumption of oil required to produce virgin resin. Nevertheless, a major shortage is encountered in recycling which is related to the sorting of plastic wastes. In this paper, a feasibility study was performed in order to test the potential of an innovative approach combining mid infrared (MIR) spectroscopy with independent components analysis (ICA), as a simple and fast approach which could achieve high separation rates. This approach (MIR-ICA) gave 100% discrimination rates in the separation of all studied plastics: polyethylene terephthalate (PET), polyethylene (PE), polypropylene (PP), polystyrene (PS) and polylactide (PLA). In addition, some more specific discriminations were obtained separating plastic materials belonging to the same polymer family e.g. high density polyethylene (HDPE) from low density polyethylene (LDPE). High discrimination rates were obtained despite the heterogeneity among samples especially differences in colors, thicknesses and surface textures. The reproducibility of

  14. Rapid discrimination of plastic packaging materials using MIR spectroscopy coupled with independent components analysis (ICA)

    International Nuclear Information System (INIS)

    Kassouf, Amine; Maalouly, Jacqueline; Rutledge, Douglas N.; Chebib, Hanna; Ducruet, Violette

    2014-01-01

    Highlights: • An innovative technique, MIR-ICA, was applied to plastic packaging separation. • This study was carried out on PE, PP, PS, PET and PLA plastic packaging materials. • ICA was applied to discriminate plastics and 100% separation rates were obtained. • Analyses performed on two spectrometers proved the reproducibility of the method. • MIR-ICA is a simple and fast technique allowing plastic identification/classification. - Abstract: Plastic packaging wastes increased considerably in recent decades, raising a major and serious public concern on political, economical and environmental levels. Dealing with this kind of problems is generally done by landfilling and energy recovery. However, these two methods are becoming more and more expensive, hazardous to the public health and the environment. Therefore, recycling is gaining worldwide consideration as a solution to decrease the growing volume of plastic packaging wastes and simultaneously reduce the consumption of oil required to produce virgin resin. Nevertheless, a major shortage is encountered in recycling which is related to the sorting of plastic wastes. In this paper, a feasibility study was performed in order to test the potential of an innovative approach combining mid infrared (MIR) spectroscopy with independent components analysis (ICA), as a simple and fast approach which could achieve high separation rates. This approach (MIR-ICA) gave 100% discrimination rates in the separation of all studied plastics: polyethylene terephthalate (PET), polyethylene (PE), polypropylene (PP), polystyrene (PS) and polylactide (PLA). In addition, some more specific discriminations were obtained separating plastic materials belonging to the same polymer family e.g. high density polyethylene (HDPE) from low density polyethylene (LDPE). High discrimination rates were obtained despite the heterogeneity among samples especially differences in colors, thicknesses and surface textures. The reproducibility of

  15. The application of sparse estimation of covariance matrix to quadratic discriminant analysis.

    Science.gov (United States)

    Sun, Jiehuan; Zhao, Hongyu

    2015-02-18

    Although Linear Discriminant Analysis (LDA) is commonly used for classification, it may not be directly applied in genomics studies due to the large p, small n problem in these studies. Different versions of sparse LDA have been proposed to address this significant challenge. One implicit assumption of various LDA-based methods is that the covariance matrices are the same across different classes. However, rewiring of genetic networks (therefore different covariance matrices) across different diseases has been observed in many genomics studies, which suggests that LDA and its variations may be suboptimal for disease classifications. However, it is not clear whether considering differing genetic networks across diseases can improve classification in genomics studies. We propose a sparse version of Quadratic Discriminant Analysis (SQDA) to explicitly consider the differences of the genetic networks across diseases. Both simulation and real data analysis are performed to compare the performance of SQDA with six commonly used classification methods. SQDA provides more accurate classification results than other methods for both simulated and real data. Our method should prove useful for classification in genomics studies and other research settings, where covariances differ among classes.

  16. Discriminant analysis of Social Work’s performance in licensure examination

    Directory of Open Access Journals (Sweden)

    Jonel R. Alonzo

    2017-12-01

    Full Text Available Many research studies have examined academic factors as predictors of success in licensure examination. The purpose of this descriptive discriminant analysis was to explore possible factors in passing social work licensure examination. Data were examined from academic records of 69 (37 passed and 32 failed Social Work graduates of the University of Mindanao who took Social Work Licensure Examination 2014. This can be used as a basis of Social Work program in planning and administering strategies to improve its national passing rates. Discriminant analysis was employed along five academic factors which are Human Behavior and Social Environment (HBSE, Social Work Programs and Policies (SWPP, Social Work Methods (SWM, Field Practice (FP and Grade Point Average (GPA. The analysis generated three significant predictors accounting for 76.22% of between group variability. The function had a hit ratio of 100%. Structure matrix revealed that three cluster subjects were identified as good factors of passing the social work licensure examination: HBSE, SWPP and SWM had a correlation value of 0.713, 0.768 and 0.840, respectively.

  17. Quantization of liver tissue in dual kVp computed tomography using linear discriminant analysis

    Science.gov (United States)

    Tkaczyk, J. Eric; Langan, David; Wu, Xiaoye; Xu, Daniel; Benson, Thomas; Pack, Jed D.; Schmitz, Andrea; Hara, Amy; Palicek, William; Licato, Paul; Leverentz, Jaynne

    2009-02-01

    Linear discriminate analysis (LDA) is applied to dual kVp CT and used for tissue characterization. The potential to quantitatively model both malignant and benign, hypo-intense liver lesions is evaluated by analysis of portal-phase, intravenous CT scan data obtained on human patients. Masses with an a priori classification are mapped to a distribution of points in basis material space. The degree of localization of tissue types in the material basis space is related to both quantum noise and real compositional differences. The density maps are analyzed with LDA and studied with system simulations to differentiate these factors. The discriminant analysis is formulated so as to incorporate the known statistical properties of the data. Effective kVp separation and mAs relates to precision of tissue localization. Bias in the material position is related to the degree of X-ray scatter and partial-volume effect. Experimental data and simulations demonstrate that for single energy (HU) imaging or image-based decomposition pixel values of water-like tissues depend on proximity to other iodine-filled bodies. Beam-hardening errors cause a shift in image value on the scale of that difference sought between in cancerous and cystic lessons. In contrast, projection-based decomposition or its equivalent when implemented on a carefully calibrated system can provide accurate data. On such a system, LDA may provide novel quantitative capabilities for tissue characterization in dual energy CT.

  18. Network structure exploration via Bayesian nonparametric models

    International Nuclear Information System (INIS)

    Chen, Y; Wang, X L; Xiang, X; Tang, B Z; Bu, J Z

    2015-01-01

    Complex networks provide a powerful mathematical representation of complex systems in nature and society. To understand complex networks, it is crucial to explore their internal structures, also called structural regularities. The task of network structure exploration is to determine how many groups there are in a complex network and how to group the nodes of the network. Most existing structure exploration methods need to specify either a group number or a certain type of structure when they are applied to a network. In the real world, however, the group number and also the certain type of structure that a network has are usually unknown in advance. To explore structural regularities in complex networks automatically, without any prior knowledge of the group number or the certain type of structure, we extend a probabilistic mixture model that can handle networks with any type of structure but needs to specify a group number using Bayesian nonparametric theory. We also propose a novel Bayesian nonparametric model, called the Bayesian nonparametric mixture (BNPM) model. Experiments conducted on a large number of networks with different structures show that the BNPM model is able to explore structural regularities in networks automatically with a stable, state-of-the-art performance. (paper)

  19. portfolio optimization based on nonparametric estimation methods

    Directory of Open Access Journals (Sweden)

    mahsa ghandehari

    2017-03-01

    Full Text Available One of the major issues investors are facing with in capital markets is decision making about select an appropriate stock exchange for investing and selecting an optimal portfolio. This process is done through the risk and expected return assessment. On the other hand in portfolio selection problem if the assets expected returns are normally distributed, variance and standard deviation are used as a risk measure. But, the expected returns on assets are not necessarily normal and sometimes have dramatic differences from normal distribution. This paper with the introduction of conditional value at risk ( CVaR, as a measure of risk in a nonparametric framework, for a given expected return, offers the optimal portfolio and this method is compared with the linear programming method. The data used in this study consists of monthly returns of 15 companies selected from the top 50 companies in Tehran Stock Exchange during the winter of 1392 which is considered from April of 1388 to June of 1393. The results of this study show the superiority of nonparametric method over the linear programming method and the nonparametric method is much faster than the linear programming method.

  20. Nonparametric Mixture Models for Supervised Image Parcellation.

    Science.gov (United States)

    Sabuncu, Mert R; Yeo, B T Thomas; Van Leemput, Koen; Fischl, Bruce; Golland, Polina

    2009-09-01

    We present a nonparametric, probabilistic mixture model for the supervised parcellation of images. The proposed model yields segmentation algorithms conceptually similar to the recently developed label fusion methods, which register a new image with each training image separately. Segmentation is achieved via the fusion of transferred manual labels. We show that in our framework various settings of a model parameter yield algorithms that use image intensity information differently in determining the weight of a training subject during fusion. One particular setting computes a single, global weight per training subject, whereas another setting uses locally varying weights when fusing the training data. The proposed nonparametric parcellation approach capitalizes on recently developed fast and robust pairwise image alignment tools. The use of multiple registrations allows the algorithm to be robust to occasional registration failures. We report experiments on 39 volumetric brain MRI scans with expert manual labels for the white matter, cerebral cortex, ventricles and subcortical structures. The results demonstrate that the proposed nonparametric segmentation framework yields significantly better segmentation than state-of-the-art algorithms.

  1. Robustifying Bayesian nonparametric mixtures for count data.

    Science.gov (United States)

    Canale, Antonio; Prünster, Igor

    2017-03-01

    Our motivating application stems from surveys of natural populations and is characterized by large spatial heterogeneity in the counts, which makes parametric approaches to modeling local animal abundance too restrictive. We adopt a Bayesian nonparametric approach based on mixture models and innovate with respect to popular Dirichlet process mixture of Poisson kernels by increasing the model flexibility at the level both of the kernel and the nonparametric mixing measure. This allows to derive accurate and robust estimates of the distribution of local animal abundance and of the corresponding clusters. The application and a simulation study for different scenarios yield also some general methodological implications. Adding flexibility solely at the level of the mixing measure does not improve inferences, since its impact is severely limited by the rigidity of the Poisson kernel with considerable consequences in terms of bias. However, once a kernel more flexible than the Poisson is chosen, inferences can be robustified by choosing a prior more general than the Dirichlet process. Therefore, to improve the performance of Bayesian nonparametric mixtures for count data one has to enrich the model simultaneously at both levels, the kernel and the mixing measure. © 2016, The International Biometric Society.

  2. NBLDA: negative binomial linear discriminant analysis for RNA-Seq data.

    Science.gov (United States)

    Dong, Kai; Zhao, Hongyu; Tong, Tiejun; Wan, Xiang

    2016-09-13

    RNA-sequencing (RNA-Seq) has become a powerful technology to characterize gene expression profiles because it is more accurate and comprehensive than microarrays. Although statistical methods that have been developed for microarray data can be applied to RNA-Seq data, they are not ideal due to the discrete nature of RNA-Seq data. The Poisson distribution and negative binomial distribution are commonly used to model count data. Recently, Witten (Annals Appl Stat 5:2493-2518, 2011) proposed a Poisson linear discriminant analysis for RNA-Seq data. The Poisson assumption may not be as appropriate as the negative binomial distribution when biological replicates are available and in the presence of overdispersion (i.e., when the variance is larger than or equal to the mean). However, it is more complicated to model negative binomial variables because they involve a dispersion parameter that needs to be estimated. In this paper, we propose a negative binomial linear discriminant analysis for RNA-Seq data. By Bayes' rule, we construct the classifier by fitting a negative binomial model, and propose some plug-in rules to estimate the unknown parameters in the classifier. The relationship between the negative binomial classifier and the Poisson classifier is explored, with a numerical investigation of the impact of dispersion on the discriminant score. Simulation results show the superiority of our proposed method. We also analyze two real RNA-Seq data sets to demonstrate the advantages of our method in real-world applications. We have developed a new classifier using the negative binomial model for RNA-seq data classification. Our simulation results show that our proposed classifier has a better performance than existing works. The proposed classifier can serve as an effective tool for classifying RNA-seq data. Based on the comparison results, we have provided some guidelines for scientists to decide which method should be used in the discriminant analysis of RNA-Seq data

  3. Discriminant analysis to predict the occurrence of ELMs in H-mode discharges

    International Nuclear Information System (INIS)

    Kardaun, O.J.W.F.; Itoh, S.; Itoh, K.; Kardaun, J.W.P.F.

    1993-08-01

    After an exposition of its theoretical background, discriminant analysis is applied to the H-mode confinement database to find the region in plasma parameter space in which H-mode with small ELMs (Edge Localized Modes) is likely to occur. The boundary of this region is determined by the condition that the probability of appearance of such a type of H-mode, as a function of the plasma parameters, should be (1) larger than some threshold value and (2) larger than the corresponding probability for other types of H-mode (i.e., H-mode without ELMs or with giant ELMs). In practice, the discrimination has been performed for the ASDEX, JET and JFT-2M tokamaks (a) using four instantaneous plasma parameters (injected power P inj , magnetic field B t , plasma current I p and line averaged electron density (n-bar e ) and (b) taking also memory effects of the plasma and the distance between the plasma and the wall into account, while using variables that are normalised with respect to machine size. Generally speaking, it is found that there is a substantial overlap between the region of H-mode with small ELMs and the region of the two other types of H-mode. However, the ELM-free and the giant ELM H-modes relatively rarely appear in the region, that, according to the analysis, is allocated to small ELMs. A reliable production of H-mode with only small ELMs seems well possible by choosing this regime in parameter space. In the present study, it was not attempted to arrive at a unified discrimination across the machines. So, projection from one machine to another remains difficult, and a reliable determination of the region where small ELMs occur still requires a training sample from the device under consideration. (author) 53 refs

  4. Penalized discriminant analysis for the detection of wild-grown and cultivated Ganoderma lucidum using Fourier transform infrared spectroscopy.

    Science.gov (United States)

    Zhu, Ying; Tan, Tuck Lee

    2016-04-15

    An effective and simple analytical method using Fourier transform infrared (FTIR) spectroscopy to distinguish wild-grown high-quality Ganoderma lucidum (G. lucidum) from cultivated one is of essential importance for its quality assurance and medicinal value estimation. Commonly used chemical and analytical methods using full spectrum are not so effective for the detection and interpretation due to the complex system of the herbal medicine. In this study, two penalized discriminant analysis models, penalized linear discriminant analysis (PLDA) and elastic net (Elnet),using FTIR spectroscopy have been explored for the purpose of discrimination and interpretation. The classification performances of the two penalized models have been compared with two widely used multivariate methods, principal component discriminant analysis (PCDA) and partial least squares discriminant analysis (PLSDA). The Elnet model involving a combination of L1 and L2 norm penalties enabled an automatic selection of a small number of informative spectral absorption bands and gave an excellent classification accuracy of 99% for discrimination between spectra of wild-grown and cultivated G. lucidum. Its classification performance was superior to that of the PLDA model in a pure L1 setting and outperformed the PCDA and PLSDA models using full wavelength. The well-performed selection of informative spectral features leads to substantial reduction in model complexity and improvement of classification accuracy, and it is particularly helpful for the quantitative interpretations of the major chemical constituents of G. lucidum regarding its anti-cancer effects. Copyright © 2016 Elsevier B.V. All rights reserved.

  5. Penalized discriminant analysis for the detection of wild-grown and cultivated Ganoderma lucidum using Fourier transform infrared spectroscopy

    Science.gov (United States)

    Zhu, Ying; Tan, Tuck Lee

    2016-04-01

    An effective and simple analytical method using Fourier transform infrared (FTIR) spectroscopy to distinguish wild-grown high-quality Ganoderma lucidum (G. lucidum) from cultivated one is of essential importance for its quality assurance and medicinal value estimation. Commonly used chemical and analytical methods using full spectrum are not so effective for the detection and interpretation due to the complex system of the herbal medicine. In this study, two penalized discriminant analysis models, penalized linear discriminant analysis (PLDA) and elastic net (Elnet),using FTIR spectroscopy have been explored for the purpose of discrimination and interpretation. The classification performances of the two penalized models have been compared with two widely used multivariate methods, principal component discriminant analysis (PCDA) and partial least squares discriminant analysis (PLSDA). The Elnet model involving a combination of L1 and L2 norm penalties enabled an automatic selection of a small number of informative spectral absorption bands and gave an excellent classification accuracy of 99% for discrimination between spectra of wild-grown and cultivated G. lucidum. Its classification performance was superior to that of the PLDA model in a pure L1 setting and outperformed the PCDA and PLSDA models using full wavelength. The well-performed selection of informative spectral features leads to substantial reduction in model complexity and improvement of classification accuracy, and it is particularly helpful for the quantitative interpretations of the major chemical constituents of G. lucidum regarding its anti-cancer effects.

  6. Nonparametric, Coupled ,Bayesian ,Dictionary ,and Classifier Learning for Hyperspectral Classification.

    Science.gov (United States)

    Akhtar, Naveed; Mian, Ajmal

    2017-10-03

    We present a principled approach to learn a discriminative dictionary along a linear classifier for hyperspectral classification. Our approach places Gaussian Process priors over the dictionary to account for the relative smoothness of the natural spectra, whereas the classifier parameters are sampled from multivariate Gaussians. We employ two Beta-Bernoulli processes to jointly infer the dictionary and the classifier. These processes are coupled under the same sets of Bernoulli distributions. In our approach, these distributions signify the frequency of the dictionary atom usage in representing class-specific training spectra, which also makes the dictionary discriminative. Due to the coupling between the dictionary and the classifier, the popularity of the atoms for representing different classes gets encoded into the classifier. This helps in predicting the class labels of test spectra that are first represented over the dictionary by solving a simultaneous sparse optimization problem. The labels of the spectra are predicted by feeding the resulting representations to the classifier. Our approach exploits the nonparametric Bayesian framework to automatically infer the dictionary size--the key parameter in discriminative dictionary learning. Moreover, it also has the desirable property of adaptively learning the association between the dictionary atoms and the class labels by itself. We use Gibbs sampling to infer the posterior probability distributions over the dictionary and the classifier under the proposed model, for which, we derive analytical expressions. To establish the effectiveness of our approach, we test it on benchmark hyperspectral images. The classification performance is compared with the state-of-the-art dictionary learning-based classification methods.

  7. Comparative analysis of targeted metabolomics: dominance-based rough set approach versus orthogonal partial least square-discriminant analysis.

    Science.gov (United States)

    Blasco, H; Błaszczyński, J; Billaut, J C; Nadal-Desbarats, L; Pradat, P F; Devos, D; Moreau, C; Andres, C R; Emond, P; Corcia, P; Słowiński, R

    2015-02-01

    Metabolomics is an emerging field that includes ascertaining a metabolic profile from a combination of small molecules, and which has health applications. Metabolomic methods are currently applied to discover diagnostic biomarkers and to identify pathophysiological pathways involved in pathology. However, metabolomic data are complex and are usually analyzed by statistical methods. Although the methods have been widely described, most have not been either standardized or validated. Data analysis is the foundation of a robust methodology, so new mathematical methods need to be developed to assess and complement current methods. We therefore applied, for the first time, the dominance-based rough set approach (DRSA) to metabolomics data; we also assessed the complementarity of this method with standard statistical methods. Some attributes were transformed in a way allowing us to discover global and local monotonic relationships between condition and decision attributes. We used previously published metabolomics data (18 variables) for amyotrophic lateral sclerosis (ALS) and non-ALS patients. Principal Component Analysis (PCA) and Orthogonal Partial Least Square-Discriminant Analysis (OPLS-DA) allowed satisfactory discrimination (72.7%) between ALS and non-ALS patients. Some discriminant metabolites were identified: acetate, acetone, pyruvate and glutamine. The concentrations of acetate and pyruvate were also identified by univariate analysis as significantly different between ALS and non-ALS patients. DRSA correctly classified 68.7% of the cases and established rules involving some of the metabolites highlighted by OPLS-DA (acetate and acetone). Some rules identified potential biomarkers not revealed by OPLS-DA (beta-hydroxybutyrate). We also found a large number of common discriminating metabolites after Bayesian confirmation measures, particularly acetate, pyruvate, acetone and ascorbate, consistent with the pathophysiological pathways involved in ALS. DRSA provides

  8. A Recurrent Probabilistic Neural Network with Dimensionality Reduction Based on Time-series Discriminant Component Analysis.

    Science.gov (United States)

    Hayashi, Hideaki; Shibanoki, Taro; Shima, Keisuke; Kurita, Yuichi; Tsuji, Toshio

    2015-12-01

    This paper proposes a probabilistic neural network (NN) developed on the basis of time-series discriminant component analysis (TSDCA) that can be used to classify high-dimensional time-series patterns. TSDCA involves the compression of high-dimensional time series into a lower dimensional space using a set of orthogonal transformations and the calculation of posterior probabilities based on a continuous-density hidden Markov model with a Gaussian mixture model expressed in the reduced-dimensional space. The analysis can be incorporated into an NN, which is named a time-series discriminant component network (TSDCN), so that parameters of dimensionality reduction and classification can be obtained simultaneously as network coefficients according to a backpropagation through time-based learning algorithm with the Lagrange multiplier method. The TSDCN is considered to enable high-accuracy classification of high-dimensional time-series patterns and to reduce the computation time taken for network training. The validity of the TSDCN is demonstrated for high-dimensional artificial data and electroencephalogram signals in the experiments conducted during the study.

  9. Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis.

    Science.gov (United States)

    Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

    2015-01-01

    Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.

  10. Forensic analysis of Salvia divinorum using multivariate statistical procedures. Part I: discrimination from related Salvia species.

    Science.gov (United States)

    Willard, Melissa A Bodnar; McGuffin, Victoria L; Smith, Ruth Waddell

    2012-01-01

    Salvia divinorum is a hallucinogenic herb that is internationally regulated. In this study, salvinorin A, the active compound in S. divinorum, was extracted from S. divinorum plant leaves using a 5-min extraction with dichloromethane. Four additional Salvia species (Salvia officinalis, Salvia guaranitica, Salvia splendens, and Salvia nemorosa) were extracted using this procedure, and all extracts were analyzed by gas chromatography-mass spectrometry. Differentiation of S. divinorum from other Salvia species was successful based on visual assessment of the resulting chromatograms. To provide a more objective comparison, the total ion chromatograms (TICs) were subjected to principal components analysis (PCA). Prior to PCA, the TICs were subjected to a series of data pretreatment procedures to minimize non-chemical sources of variance in the data set. Successful discrimination of S. divinorum from the other four Salvia species was possible based on visual assessment of the PCA scores plot. To provide a numerical assessment of the discrimination, a series of statistical procedures such as Euclidean distance measurement, hierarchical cluster analysis, Student's t tests, Wilcoxon rank-sum tests, and Pearson product moment correlation were also applied to the PCA scores. The statistical procedures were then compared to determine the advantages and disadvantages for forensic applications.

  11. Combined use of correlation dimension and entropy as discriminating measures for time series analysis

    Science.gov (United States)

    Harikrishnan, K. P.; Misra, R.; Ambika, G.

    2009-09-01

    We show that the combined use of correlation dimension (D2) and correlation entropy (K2) as discriminating measures can extract a more accurate information regarding the different types of noise present in a time series data. For this, we make use of an algorithmic approach for computing D2 and K2 proposed by us recently [Harikrishnan KP, Misra R, Ambika G, Kembhavi AK. Physica D 2006;215:137; Harikrishnan KP, Ambika G, Misra R. Mod Phys Lett B 2007;21:129; Harikrishnan KP, Misra R, Ambika G. Pramana - J Phys, in press], which is a modification of the standard Grassberger-Proccacia scheme. While the presence of white noise can be easily identified by computing D2 of data and surrogates, K2 is a better discriminating measure to detect colored noise in the data. Analysis of time series from a real world system involving both white and colored noise is presented as evidence. To our knowledge, this is the first time that such a combined analysis is undertaken on a real world data.

  12. A Non-Parametric Item Response Theory Evaluation of the CAGE Instrument Among Older Adults.

    Science.gov (United States)

    Abdin, Edimansyah; Sagayadevan, Vathsala; Vaingankar, Janhavi Ajit; Picco, Louisa; Chong, Siow Ann; Subramaniam, Mythily

    2018-02-23

    The validity of the CAGE using item response theory (IRT) has not yet been examined in older adult population. This study aims to investigate the psychometric properties of the CAGE using both non-parametric and parametric IRT models, assess whether there is any differential item functioning (DIF) by age, gender and ethnicity and examine the measurement precision at the cut-off scores. We used data from the Well-being of the Singapore Elderly study to conduct Mokken scaling analysis (MSA), dichotomous Rasch and 2-parameter logistic IRT models. The measurement precision at the cut-off scores were evaluated using classification accuracy (CA) and classification consistency (CC). The MSA showed the overall scalability H index was 0.459, indicating a medium performing instrument. All items were found to be homogenous, measuring the same construct and able to discriminate well between respondents with high levels of the construct and the ones with lower levels. The item discrimination ranged from 1.07 to 6.73 while the item difficulty ranged from 0.33 to 2.80. Significant DIF was found for 2-item across ethnic group. More than 90% (CC and CA ranged from 92.5% to 94.3%) of the respondents were consistently and accurately classified by the CAGE cut-off scores of 2 and 3. The current study provides new evidence on the validity of the CAGE from the IRT perspective. This study provides valuable information of each item in the assessment of the overall severity of alcohol problem and the precision of the cut-off scores in older adult population.

  13. Discrimination Against State and Local Government LGBT Employees: An Analysis of Administrative Complaints

    OpenAIRE

    Mallory, Christy; Sears, Brad

    2014-01-01

    This article documents evidence of recent discrimination against lesbian, gay, bisexual, and transgender (LGBT) public sector workers by analyzing employment discrimination complaints filed with state and local administrative agencies. We present information about 589 complaints of sexual orientation and gender identity discrimination filed by public sector workers in 123 jurisdictions. We find that discrimination against LGBT people in the public sector is pervasive and occurs nearly as freq...

  14. Rapid direct analysis to discriminate geographic origin of extra virgin olive oils by flash gas chromatography electronic nose and chemometrics.

    Science.gov (United States)

    Melucci, Dora; Bendini, Alessandra; Tesini, Federica; Barbieri, Sara; Zappi, Alessandro; Vichi, Stefania; Conte, Lanfranco; Gallina Toschi, Tullia

    2016-08-01

    At present, the geographical origin of extra virgin olive oils can be ensured by documented traceability, although chemical analysis may add information that is useful for possible confirmation. This preliminary study investigated the effectiveness of flash gas chromatography electronic nose and multivariate data analysis to perform rapid screening of commercial extra virgin olive oils characterized by a different geographical origin declared in the label. A comparison with solid phase micro extraction coupled to gas chromatography mass spectrometry was also performed. The new method is suitable to verify the geographic origin of extra virgin olive oils based on principal components analysis and discriminant analysis applied to the volatile profile of the headspace as a fingerprint. The selected variables were suitable in discriminating between "100% Italian" and "non-100% Italian" oils. Partial least squares discriminant analysis also allowed prediction of the degree of membership of unknown samples to the classes examined. Copyright © 2016. Published by Elsevier Ltd.

  15. Evaluation of hierarchical agglomerative cluster analysis methods for discrimination of primary biological aerosol

    Directory of Open Access Journals (Sweden)

    I. Crawford

    2015-11-01

    Full Text Available In this paper we present improved methods for discriminating and quantifying primary biological aerosol particles (PBAPs by applying hierarchical agglomerative cluster analysis to multi-parameter ultraviolet-light-induced fluorescence (UV-LIF spectrometer data. The methods employed in this study can be applied to data sets in excess of 1 × 106 points on a desktop computer, allowing for each fluorescent particle in a data set to be explicitly clustered. This reduces the potential for misattribution found in subsampling and comparative attribution methods used in previous approaches, improving our capacity to discriminate and quantify PBAP meta-classes. We evaluate the performance of several hierarchical agglomerative cluster analysis linkages and data normalisation methods using laboratory samples of known particle types and an ambient data set. Fluorescent and non-fluorescent polystyrene latex spheres were sampled with a Wideband Integrated Bioaerosol Spectrometer (WIBS-4 where the optical size, asymmetry factor and fluorescent measurements were used as inputs to the analysis package. It was found that the Ward linkage with z-score or range normalisation performed best, correctly attributing 98 and 98.1 % of the data points respectively. The best-performing methods were applied to the BEACHON-RoMBAS (Bio–hydro–atmosphere interactions of Energy, Aerosols, Carbon, H2O, Organics and Nitrogen–Rocky Mountain Biogenic Aerosol Study ambient data set, where it was found that the z-score and range normalisation methods yield similar results, with each method producing clusters representative of fungal spores and bacterial aerosol, consistent with previous results. The z-score result was compared to clusters generated with previous approaches (WIBS AnalysiS Program, WASP where we observe that the subsampling and comparative attribution method employed by WASP results in the overestimation of the fungal spore concentration by a factor of 1.5 and the

  16. Colored inks analysis and differentiation: A first step in artistic contemporary prints discrimination

    International Nuclear Information System (INIS)

    Vila, Anna; Ferrer, Nuria; Garcia, Jose F.

    2007-01-01

    Prints are the most popular artistic technique. Due to their manufacturing procedure, they are also one of the most frequently falsified types of artwork. In terms of their economic and historic value, the chemical analysis and characterisation of coloured inks and their principal constituent materials (pigments), together with the historical and aesthetic information available in the Catalogues Raisonees, are important tools in distinguishing originals from non-original prints. The chemical characterisation and discrimination of coloured inks has test in this study. Analysis using Fourier transform infrared spectroscopy (FTIR), Scanning electron microscopy (SEM) and X-ray diffraction (XRD) has been done on blue pigments and inks, due to this colour is one of the most representative for the presence of organic and inorganic materials in their composition. Conclusion obtained for this colour would demonstrate the capability of the approach when it is applied to any other coloured set of inks

  17. Bioelectric signal classification using a recurrent probabilistic neural network with time-series discriminant component analysis.

    Science.gov (United States)

    Hayashi, Hideaki; Shima, Keisuke; Shibanoki, Taro; Kurita, Yuichi; Tsuji, Toshio

    2013-01-01

    This paper outlines a probabilistic neural network developed on the basis of time-series discriminant component analysis (TSDCA) that can be used to classify high-dimensional time-series patterns. TSDCA involves the compression of high-dimensional time series into a lower-dimensional space using a set of orthogonal transformations and the calculation of posterior probabilities based on a continuous-density hidden Markov model that incorporates a Gaussian mixture model expressed in the reduced-dimensional space. The analysis can be incorporated into a neural network so that parameters can be obtained appropriately as network coefficients according to backpropagation-through-time-based training algorithm. The network is considered to enable high-accuracy classification of high-dimensional time-series patterns and to reduce the computation time taken for network training. In the experiments conducted during the study, the validity of the proposed network was demonstrated for EEG signals.

  18. Colored inks analysis and differentiation: A first step in artistic contemporary prints discrimination

    Energy Technology Data Exchange (ETDEWEB)

    Vila, Anna [Department de Pintura, Conservacio-Restauracio, Facultat de Belles Arts, Universitat de Barcelona, C/Pau Gargallo 4, 08028 Barcelona (Spain)]. E-mail: avila@sct.ub.es; Ferrer, Nuria [Serveis Cientificotecnics, Universitat de Barcelona, C/Lluis Sole i Sabaris 1, 08028 Barcelona (Spain)]. E-mail: nferrer@sctub.es; Garcia, Jose F. [Department de Pintura, Conservacio-Restauracio, Facultat de Belles Arts, Universitat de Barcelona, C/Pau Gargallo 4, 08028 Barcelona (Spain)]. E-mail: ifgarcia@ub.edu

    2007-04-04

    Prints are the most popular artistic technique. Due to their manufacturing procedure, they are also one of the most frequently falsified types of artwork. In terms of their economic and historic value, the chemical analysis and characterisation of coloured inks and their principal constituent materials (pigments), together with the historical and aesthetic information available in the Catalogues Raisonees, are important tools in distinguishing originals from non-original prints. The chemical characterisation and discrimination of coloured inks has test in this study. Analysis using Fourier transform infrared spectroscopy (FTIR), Scanning electron microscopy (SEM) and X-ray diffraction (XRD) has been done on blue pigments and inks, due to this colour is one of the most representative for the presence of organic and inorganic materials in their composition. Conclusion obtained for this colour would demonstrate the capability of the approach when it is applied to any other coloured set of inks.

  19. Nonparametric Analyses of Log-Periodic Precursors to Financial Crashes

    Science.gov (United States)

    Zhou, Wei-Xing; Sornette, Didier

    We apply two nonparametric methods to further test the hypothesis that log-periodicity characterizes the detrended price trajectory of large financial indices prior to financial crashes or strong corrections. The term "parametric" refers here to the use of the log-periodic power law formula to fit the data; in contrast, "nonparametric" refers to the use of general tools such as Fourier transform, and in the present case the Hilbert transform and the so-called (H, q)-analysis. The analysis using the (H, q)-derivative is applied to seven time series ending with the October 1987 crash, the October 1997 correction and the April 2000 crash of the Dow Jones Industrial Average (DJIA), the Standard & Poor 500 and Nasdaq indices. The Hilbert transform is applied to two detrended price time series in terms of the ln(tc-t) variable, where tc is the time of the crash. Taking all results together, we find strong evidence for a universal fundamental log-frequency f=1.02±0.05 corresponding to the scaling ratio λ=2.67±0.12. These values are in very good agreement with those obtained in earlier works with different parametric techniques. This note is extracted from a long unpublished report with 58 figures available at , which extensively describes the evidence we have accumulated on these seven time series, in particular by presenting all relevant details so that the reader can judge for himself or herself the validity and robustness of the results.

  20. Discrimination of Aurantii Fructus Immaturus and Fructus Poniciri Trifoliatae Immaturus by Flow Injection UV Spectroscopy (FIUV) and 1H NMR using Partial Least-squares Discriminant Analysis (PLS-DA)

    Science.gov (United States)

    Two simple fingerprinting methods, flow-injection UV spectroscopy (FIUV) and 1H nuclear magnetic resonance (NMR), for discrimination of Aurantii FructusImmaturus and Fructus Poniciri TrifoliataeImmaturususing were described. Both methods were combined with partial least-squares discriminant analysis...

  1. Promises of silent salesman to the FMCG industry: an investigation using linear discriminant analysis approach

    Directory of Open Access Journals (Sweden)

    Shekhar Suraj Kushe

    2015-12-01

    Full Text Available Packaging which is often called as the ‘silent salesman’ is an important component of marketing. Today the importance of packaging has risen to such an extent that product packaging is rightly called as the fifth ‘P’ of marketing mix. FMCG are products which are utilized by large number of people. The present study examined the discriminating power of five selected FMCG packaging variables namely ‘picture’, ‘colour’, ‘size’, ‘shape’ and ‘material’ amidst those who purchased FMCG based on these packaging variables and for those who purchased FMCG not based on these packaging variables. Descriptive research was carried out in the study. Respondents (students were asked to rate four packaging variable on a five point Likert’s scale. Discriminant analysis showed that only two variables namely ‘Colour’ (.706 and ‘Shape’ (–.527 were good predictors. Variables ‘Picture’, ‘size’ and ‘material’ were considered as poor predictors as far as the student communities were considered. The cross validated classification showed that out of the 240 samples drawn, 91.8% of the cases were correctly classified.

  2. Image analysis of food particles can discriminate deficient mastication of mixed foodstuffs simulating daily meal.

    Science.gov (United States)

    Sugimoto, K; Hashimoto, Y; Fukuike, C; Kodama, N; Minagi, S

    2014-03-01

    Because food texture is regarded as an important factor for smooth deglutition, identification of objective parameters that could provide a basis for food texture selection for elderly or dysphagic patients is of great importance. We aimed to develop an objective evaluation method of mastication using a mixed test food comprising foodstuffs, simulating daily dietary life. The particle size distribution (>2 mm in diameter) in a bolus was analysed using a digital image under dark-field illumination. Ten female participants (mean age ± s.d., 27·6 ± 2·6 years) masticated a mixed test food comprising prescribed amounts of rice, sausage, hard omelette, raw cabbage and raw cucumber with 100%, 75%, 50% and 25% of the number of their masticatory strokes. A single set of coefficient thresholds of 0·10 for the homogeneity index and 1·62 for the particle size index showed excellent discrimination of deficient masticatory conditions with high sensitivity (0·90) and specificity (0·77). Based on the results of this study, normal mastication was discriminated from deficient masticatory conditions using a large particle analysis of mixed foodstuffs, thus showing the possibility of future application of this method for objective decision-making regarding the properties of meals served to dysphagic patients. © 2014 John Wiley & Sons Ltd.

  3. Two-Stage Regularized Linear Discriminant Analysis for 2-D Data.

    Science.gov (United States)

    Zhao, Jianhua; Shi, Lei; Zhu, Ji

    2015-08-01

    Fisher linear discriminant analysis (LDA) involves within-class and between-class covariance matrices. For 2-D data such as images, regularized LDA (RLDA) can improve LDA due to the regularized eigenvalues of the estimated within-class matrix. However, it fails to consider the eigenvectors and the estimated between-class matrix. To improve these two matrices simultaneously, we propose in this paper a new two-stage method for 2-D data, namely a bidirectional LDA (BLDA) in the first stage and the RLDA in the second stage, where both BLDA and RLDA are based on the Fisher criterion that tackles correlation. BLDA performs the LDA under special separable covariance constraints that incorporate the row and column correlations inherent in 2-D data. The main novelty is that we propose a simple but effective statistical test to determine the subspace dimensionality in the first stage. As a result, the first stage reduces the dimensionality substantially while keeping the significant discriminant information in the data. This enables the second stage to perform RLDA in a much lower dimensional subspace, and thus improves the two estimated matrices simultaneously. Experiments on a number of 2-D synthetic and real-world data sets show that BLDA+RLDA outperforms several closely related competitors.

  4. A New Method for Improving the Discrimination Power and Weights Dispersion in the Data Envelopment Analysis

    Directory of Open Access Journals (Sweden)

    S. Kordrostami

    2013-06-01

    Full Text Available The appropriate choice of input-output weights is necessary to have a successful DEA model. Generally, if the number of DMUs i.e., n, is less than number of inputs and outputs i.e., m+s, then many of DMUs are introduced as efficient then the discrimination between DMUs is not possible. Besides, DEA models are free to choose the best weights. For resolving the problems that are resulted from freedom of weights, some constraints are set on the input-output weights. Symmetric weight constraints are a kind of weight constrains. In this paper, we represent a new model based on a multi-criterion data envelopment analysis (MCDEA are developed to moderate the homogeneity of weights distribution by using symmetric weight constrains.Consequently, we show that the improvement of the dispersal of unrealistic input-output weights and the increasing discrimination power for our suggested models. Finally, as an application of the new model, we use this model to evaluate and ranking guilan selected hospitals.

  5. Forensic analysis of explosives using isotope ratio mass spectrometry (IRMS)--discrimination of ammonium nitrate sources.

    Science.gov (United States)

    Benson, Sarah J; Lennard, Christopher J; Maynard, Philip; Hill, David M; Andrew, Anita S; Roux, Claude

    2009-06-01

    An evaluation was undertaken to determine if isotope ratio mass spectrometry (IRMS) could assist in the investigation of complex forensic cases by providing a level of discrimination not achievable utilising traditional forensic techniques. The focus of the research was on ammonium nitrate (AN), a common oxidiser used in improvised explosive mixtures. The potential value of IRMS to attribute Australian AN samples to the manufacturing source was demonstrated through the development of a preliminary AN classification scheme based on nitrogen isotopes. Although the discrimination utilising nitrogen isotopes alone was limited and only relevant to samples from the three Australian manufacturers during the evaluated time period, the classification scheme has potential as an investigative aid. Combining oxygen and hydrogen stable isotope values permitted the differentiation of AN prills from three different Australian manufacturers. Samples from five different overseas sources could be differentiated utilising a combination of the nitrogen, oxygen and hydrogen isotope values. Limited differentiation between Australian and overseas prills was achieved for the samples analysed. The comparison of nitrogen isotope values from intact AN prill samples with those from post-blast AN prill residues highlighted that the nitrogen isotopic composition of the prills was not maintained post-blast; hence, limiting the technique to analysis of un-reacted explosive material.

  6. Hyperspectral image analysis for rapid and accurate discrimination of bacterial infections: A benchmark study.

    Science.gov (United States)

    Arrigoni, Simone; Turra, Giovanni; Signoroni, Alberto

    2017-09-01

    With the rapid diffusion of Full Laboratory Automation systems, Clinical Microbiology is currently experiencing a new digital revolution. The ability to capture and process large amounts of visual data from microbiological specimen processing enables the definition of completely new objectives. These include the direct identification of pathogens growing on culturing plates, with expected improvements in rapid definition of the right treatment for patients affected by bacterial infections. In this framework, the synergies between light spectroscopy and image analysis, offered by hyperspectral imaging, are of prominent interest. This leads us to assess the feasibility of a reliable and rapid discrimination of pathogens through the classification of their spectral signatures extracted from hyperspectral image acquisitions of bacteria colonies growing on blood agar plates. We designed and implemented the whole data acquisition and processing pipeline and performed a comprehensive comparison among 40 combinations of different data preprocessing and classification techniques. High discrimination performance has been achieved also thanks to improved colony segmentation and spectral signature extraction. Experimental results reveal the high accuracy and suitability of the proposed approach, driving the selection of most suitable and scalable classification pipelines and stimulating clinical validations. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. Bearing Performance Degradation Assessment Using Linear Discriminant Analysis and Coupled HMM

    International Nuclear Information System (INIS)

    Liu, T; Chen, J; Zhou, X N; Xiao, W B

    2012-01-01

    Bearing is one of the most important units in rotary machinery, its performance may vary significantly under different working stages. Thus it is critical to choose the most effective features for bearing performance degradation prediction. Linear Discriminant Analysis (LDA) is a useful method in finding few feature's dimensions that best discriminate a set of features extracted from original vibration signals. Another challenge in bearing performance degradation is how to build a model to recognize the different conditions with the data coming from different monitoring channels. In this paper, coupled hidden Markov models (CHMM) is presented to model interacting processes which can overcome the defections of the HMM. Because the input data in CHMM are collected by several sensors, and the interacting information can be fused by coupled modalities, it is more effective than HMM which used only one state chain. The model can be used in estimating the bearing performance degradation states according to several observation data. When becoming degradation pattern recognition, the new observation features should be input into the pre-trained CHMM and calculate the performance index (PI) of the outputs, the changing of PI could be used to describe the different degradation level of the bearings. The results show that PI will decline with the increase of the bearing degradation. Assessment results of the whole life time experimental bearing signals validate the feasibility and effectiveness of this method.

  8. Cognitive Strategies and Physical Activity in Older Adults: A Discriminant Analysis

    Directory of Open Access Journals (Sweden)

    Nathalie André

    2018-01-01

    Full Text Available Background. Although a number of studies have examined sociodemographic, psychosocial, and environmental determinants of the level of physical activity (PA for older people, little attention has been paid to the predictive power of cognitive strategies for independently living older adults. However, cognitive strategies have recently been considered to be critical in the management of day-to-day living. Methods. Data were collected from 243 men and women aged 55 years and older living in France using face-to-face interviews between 2011 and 2013. Results. A stepwise discriminant analysis selected five predictor variables (age, perceived health status, barriers’ self-efficacy, internal memory, and attentional control strategies of the level of PA. The function showed that the rate of correct prediction was 73% for the level of PA. The calculated discriminant function based on the five predictor variables is useful for detecting individuals at high risk of lapses once engaged in regular PA. Conclusions. This study highlighted the need to consider cognitive functions as a determinant of the level of PA and, more specifically, those cognitive functions related to executive functions (internal memory and attentional control, to facilitate the maintenance of regular PA. These results are discussed in relation to successful aging.

  9. Is it really organic? – Multi-isotopic analysis as a tool to discriminate between organic and conventional plants

    DEFF Research Database (Denmark)

    Laursen, K.H.; Mihailova, A.; Kelly, S.D.

    2013-01-01

    for discrimination of organically and conventionally grown plants. The study was based on wheat, barley, faba bean and potato produced in rigorously controlled long-term field trials comprising 144 experimental plots. Nitrogen isotope analysis revealed the use of animal manure, but was unable to discriminate between......Novel procedures for analytical authentication of organic plant products are urgently needed. Here we present the first study encompassing stable isotopes of hydrogen, carbon, nitrogen, oxygen, magnesium and sulphur as well as compound-specific nitrogen and oxygen isotope analysis of nitrate...... plants that were fertilised with synthetic nitrogen fertilisers or green manures from atmospheric nitrogen fixing legumes. This limitation was bypassed using oxygen isotope analysis of nitrate in potato tubers, while hydrogen isotope analysis allowed complete discrimination of organic and conventional...

  10. A Nonparametric Test for Seasonal Unit Roots

    OpenAIRE

    Kunst, Robert M.

    2009-01-01

    Abstract: We consider a nonparametric test for the null of seasonal unit roots in quarterly time series that builds on the RUR (records unit root) test by Aparicio, Escribano, and Sipols. We find that the test concept is more promising than a formalization of visual aids such as plots by quarter. In order to cope with the sensitivity of the original RUR test to autocorrelation under its null of a unit root, we suggest an augmentation step by autoregression. We present some evidence on the siz...

  11. A Bayesian nonparametric estimation of distributions and quantiles

    International Nuclear Information System (INIS)

    Poern, K.

    1988-11-01

    The report describes a Bayesian, nonparametric method for the estimation of a distribution function and its quantiles. The method, presupposing random sampling, is nonparametric, so the user has to specify a prior distribution on a space of distributions (and not on a parameter space). In the current application, where the method is used to estimate the uncertainty of a parametric calculational model, the Dirichlet prior distribution is to a large extent determined by the first batch of Monte Carlo-realizations. In this case the results of the estimation technique is very similar to the conventional empirical distribution function. The resulting posterior distribution is also Dirichlet, and thus facilitates the determination of probability (confidence) intervals at any given point in the space of interest. Another advantage is that also the posterior distribution of a specified quantitle can be derived and utilized to determine a probability interval for that quantile. The method was devised for use in the PROPER code package for uncertainty and sensitivity analysis. (orig.)

  12. On Kolmogorov asymptotics of estimators of the misclassification error rate in linear discriminant analysis

    KAUST Repository

    Zollanvari, Amin

    2013-05-24

    We provide a fundamental theorem that can be used in conjunction with Kolmogorov asymptotic conditions to derive the first moments of well-known estimators of the actual error rate in linear discriminant analysis of a multivariate Gaussian model under the assumption of a common known covariance matrix. The estimators studied in this paper are plug-in and smoothed resubstitution error estimators, both of which have not been studied before under Kolmogorov asymptotic conditions. As a result of this work, we present an optimal smoothing parameter that makes the smoothed resubstitution an unbiased estimator of the true error. For the sake of completeness, we further show how to utilize the presented fundamental theorem to achieve several previously reported results, namely the first moment of the resubstitution estimator and the actual error rate. We provide numerical examples to show the accuracy of the succeeding finite sample approximations in situations where the number of dimensions is comparable or even larger than the sample size.

  13. A discrimination technique for extensive air showers based on multiscale, lacunarity and neural network analysis

    International Nuclear Information System (INIS)

    Pagliaro, Antonio; D'Ali Staiti, G.; D'Anna, F.

    2011-01-01

    We present a new method for the identification of extensive air showers initiated by different primaries. The method uses the multiscale concept and is based on the analysis of multifractal behaviour and lacunarity of secondary particle distributions together with a properly designed and trained artificial neural network. In the present work the method is discussed and applied to a set of fully simulated vertical showers, in the experimental framework of ARGO-YBJ, to obtain hadron to gamma primary separation. We show that the presented approach gives very good results, leading, in the 1-10 TeV energy range, to a clear improvement of the discrimination power with respect to the existing figures for extended shower detectors.

  14. Detection of feigned mental disorders on the personality assessment inventory: a discriminant analysis.

    Science.gov (United States)

    Rogers, R; Sewell, K W; Morey, L C; Ustad, K L

    1996-12-01

    Psychological assessment with multiscale inventories is largely dependent on the honesty and forthrightness of those persons evaluated. We investigated the effectiveness of the Personality Assessment Inventory (PAI) in detecting participants feigning three specific disorders: schizophrenia, major depression, and generalized anxiety disorder. With a simulation design, we tested the PAI validity scales on 166 naive (undergraduates with minimal preparation) and 80 sophisticated (doctoral psychology students with 1 week preparation) participants. We compared their results to persons with the designated disorders: schizophrenia (n = 45), major depression (n = 136), and generalized anxiety disorder (n = 40). Although moderately effective with naive simulators, the validity scales evidenced only modest positive predictive power with their sophisticated counterparts. Therefore, we performed a two-stage discriminant analysis that yielded a moderately high hit rate (> 80%) that was maintained in the cross-validation sample, irrespective of the feigned disorder or the sophistication of the simulators.

  15. Quantitative Classification of Quartz by Laser Induced Breakdown Spectroscopy in Conjunction with Discriminant Function Analysis

    Directory of Open Access Journals (Sweden)

    A. Ali

    2016-01-01

    Full Text Available A responsive laser induced breakdown spectroscopic system was developed and improved for utilizing it as a sensor for the classification of quartz samples on the basis of trace elements present in the acquired samples. Laser induced breakdown spectroscopy (LIBS in conjunction with discriminant function analysis (DFA was applied for the classification of five different types of quartz samples. The quartz plasmas were produced at ambient pressure using Nd:YAG laser at fundamental harmonic mode (1064 nm. We optimized the detection system by finding the suitable delay time of the laser excitation. This is the first study, where the developed technique (LIBS+DFA was successfully employed to probe and confirm the elemental composition of quartz samples.

  16. Financial Distress Prediction using Linear Discriminant Analysis and Support Vector Machine

    Science.gov (United States)

    Santoso, Noviyanti; Wibowo, Wahyu

    2018-03-01

    A financial difficulty is the early stages before the bankruptcy. Bankruptcies caused by the financial distress can be seen from the financial statements of the company. The ability to predict financial distress became an important research topic because it can provide early warning for the company. In addition, predicting financial distress is also beneficial for investors and creditors. This research will be made the prediction model of financial distress at industrial companies in Indonesia by comparing the performance of Linear Discriminant Analysis (LDA) and Support Vector Machine (SVM) combined with variable selection technique. The result of this research is prediction model based on hybrid Stepwise-SVM obtains better balance among fitting ability, generalization ability and model stability than the other models.

  17. On Kolmogorov asymptotics of estimators of the misclassification error rate in linear discriminant analysis

    KAUST Repository

    Zollanvari, Amin; Genton, Marc G.

    2013-01-01

    We provide a fundamental theorem that can be used in conjunction with Kolmogorov asymptotic conditions to derive the first moments of well-known estimators of the actual error rate in linear discriminant analysis of a multivariate Gaussian model under the assumption of a common known covariance matrix. The estimators studied in this paper are plug-in and smoothed resubstitution error estimators, both of which have not been studied before under Kolmogorov asymptotic conditions. As a result of this work, we present an optimal smoothing parameter that makes the smoothed resubstitution an unbiased estimator of the true error. For the sake of completeness, we further show how to utilize the presented fundamental theorem to achieve several previously reported results, namely the first moment of the resubstitution estimator and the actual error rate. We provide numerical examples to show the accuracy of the succeeding finite sample approximations in situations where the number of dimensions is comparable or even larger than the sample size.

  18. Non-destructive Testing of Wood Defects Based on Discriminant Analysis Method

    Directory of Open Access Journals (Sweden)

    Wenshu LIN

    2015-09-01

    Full Text Available The defects of wood samples were tested by the technique of stress wave and ultrasonic technology, and the testing results were comparatively analyzed by using the Fisher discriminant analysis in the statistic software of SPSS. The differences of defect detection sensitivity and accuracy for stress wave and ultrasonic under different wood properties and defects were concluded. Therefore, in practical applications, according to different situations the corresponding wood non- destructive testing method should be used, or the two detection methods are applied at the same time in order to compensate for its shortcomings with each other to improve the ability to distinguish the timber defects. The results can provide a reference for further improvement of the reliability of timber defects detection.

  19. Photospheric Magnetic Field Properties of Flaring versus Flare-quiet Active Regions. II. Discriminant Analysis

    Science.gov (United States)

    Leka, K. D.; Barnes, G.

    2003-10-01

    We apply statistical tests based on discriminant analysis to the wide range of photospheric magnetic parameters described in a companion paper by Leka & Barnes, with the goal of identifying those properties that are important for the production of energetic events such as solar flares. The photospheric vector magnetic field data from the University of Hawai'i Imaging Vector Magnetograph are well sampled both temporally and spatially, and we include here data covering 24 flare-event and flare-quiet epochs taken from seven active regions. The mean value and rate of change of each magnetic parameter are treated as separate variables, thus evaluating both the parameter's state and its evolution, to determine which properties are associated with flaring. Considering single variables first, Hotelling's T2-tests show small statistical differences between flare-producing and flare-quiet epochs. Even pairs of variables considered simultaneously, which do show a statistical difference for a number of properties, have high error rates, implying a large degree of overlap of the samples. To better distinguish between flare-producing and flare-quiet populations, larger numbers of variables are simultaneously considered; lower error rates result, but no unique combination of variables is clearly the best discriminator. The sample size is too small to directly compare the predictive power of large numbers of variables simultaneously. Instead, we rank all possible four-variable permutations based on Hotelling's T2-test and look for the most frequently appearing variables in the best permutations, with the interpretation that they are most likely to be associated with flaring. These variables include an increasing kurtosis of the twist parameter and a larger standard deviation of the twist parameter, but a smaller standard deviation of the distribution of the horizontal shear angle and a horizontal field that has a smaller standard deviation but a larger kurtosis. To support the

  20. Generative Temporal Modelling of Neuroimaging - Decomposition and Nonparametric Testing

    DEFF Research Database (Denmark)

    Hald, Ditte Høvenhoff

    The goal of this thesis is to explore two improvements for functional magnetic resonance imaging (fMRI) analysis; namely our proposed decomposition method and an extension to the non-parametric testing framework. Analysis of fMRI allows researchers to investigate the functional processes...... of the brain, and provides insight into neuronal coupling during mental processes or tasks. The decomposition method is a Gaussian process-based independent components analysis (GPICA), which incorporates a temporal dependency in the sources. A hierarchical model specification is used, featuring both...... instantaneous and convolutive mixing, and the inferred temporal patterns. Spatial maps are seen to capture smooth and localized stimuli-related components, and often identifiable noise components. The implementation is freely available as a GUI/SPM plugin, and we recommend using GPICA as an additional tool when...

  1. A longitudinal analysis of Hispanic youth acculturation and cigarette smoking: the roles of gender, culture, family, and discrimination.

    Science.gov (United States)

    Lorenzo-Blanco, Elma I; Unger, Jennifer B; Ritt-Olson, Anamara; Soto, Daniel; Baezconde-Garbanati, Lourdes

    2013-05-01

    Risk for smoking initiation increases as Hispanic youth acculturate to U.S. society, and this association seems to be stronger for Hispanic girls than boys. To better understand the influence of culture, family, and everyday discrimination on cigarette smoking, we tested a process-oriented model of acculturation and cigarette smoking. Data came from Project RED (Reteniendo y Entendiendo Diversidad para Salud), which included 1,436 Hispanic students (54% girls) from Southern California. We used data from 9th to 11th grade (85% were 14 years old, and 86% were U.S. born) to test the influence of acculturation-related experiences on smoking over time. Multigroup structural equation analysis suggested that acculturation was associated with increased familismo and lower traditional gender roles, and enculturation was linked more with familismo and respeto. Familismo, respeto, and traditional gender roles were linked with lower family conflict and increased family cohesion, and these links were stronger for girls. Familismo and respeto were further associated with lower discrimination. Conversely, fatalismo was linked with worse family functioning (especially for boys) and increased discrimination in both the groups. Discrimination was the only predictor of smoking for boys and girls. In all, the results of the current study indicate that reducing discrimination and helping youth cope with discrimination may prevent or reduce smoking in Hispanic boys and girls. This may be achieved by promoting familismo and respeto and by discouraging fatalistic beliefs.

  2. A Longitudinal Analysis of Hispanic Youth Acculturation and Cigarette Smoking: The Roles of Gender, Culture, Family, and Discrimination

    Science.gov (United States)

    2013-01-01

    Introduction: Risk for smoking initiation increases as Hispanic youth acculturate to U.S. society, and this association seems to be stronger for Hispanic girls than boys. To better understand the influence of culture, family, and everyday discrimination on cigarette smoking, we tested a process-oriented model of acculturation and cigarette smoking. Methods: Data came from Project RED (Reteniendo y Entendiendo Diversidad para Salud), which included 1,436 Hispanic students (54% girls) from Southern California. We used data from 9th to 11th grade (85% were 14 years old, and 86% were U.S. born) to test the influence of acculturation-related experiences on smoking over time. Results: Multigroup structural equation analysis suggested that acculturation was associated with increased familismo and lower traditional gender roles, and enculturation was linked more with familismo and respeto. Familismo, respeto, and traditional gender roles were linked with lower family conflict and increased family cohesion, and these links were stronger for girls. Familismo and respeto were further associated with lower discrimination. Conversely, fatalismo was linked with worse family functioning (especially for boys) and increased discrimination in both the groups. Discrimination was the only predictor of smoking for boys and girls. Conclusions: In all, the results of the current study indicate that reducing discrimination and helping youth cope with discrimination may prevent or reduce smoking in Hispanic boys and girls. This may be achieved by promoting familismo and respeto and by discouraging fatalistic beliefs. PMID:23109671

  3. Bayesian Nonparametric Clustering for Positive Definite Matrices.

    Science.gov (United States)

    Cherian, Anoop; Morellas, Vassilios; Papanikolopoulos, Nikolaos

    2016-05-01

    Symmetric Positive Definite (SPD) matrices emerge as data descriptors in several applications of computer vision such as object tracking, texture recognition, and diffusion tensor imaging. Clustering these data matrices forms an integral part of these applications, for which soft-clustering algorithms (K-Means, expectation maximization, etc.) are generally used. As is well-known, these algorithms need the number of clusters to be specified, which is difficult when the dataset scales. To address this issue, we resort to the classical nonparametric Bayesian framework by modeling the data as a mixture model using the Dirichlet process (DP) prior. Since these matrices do not conform to the Euclidean geometry, rather belongs to a curved Riemannian manifold,existing DP models cannot be directly applied. Thus, in this paper, we propose a novel DP mixture model framework for SPD matrices. Using the log-determinant divergence as the underlying dissimilarity measure to compare these matrices, and further using the connection between this measure and the Wishart distribution, we derive a novel DPM model based on the Wishart-Inverse-Wishart conjugate pair. We apply this model to several applications in computer vision. Our experiments demonstrate that our model is scalable to the dataset size and at the same time achieves superior accuracy compared to several state-of-the-art parametric and nonparametric clustering algorithms.

  4. The Analysis of the Ethnical Discrimination on the Manpower’s Market under the Economical Crisis

    Directory of Open Access Journals (Sweden)

    Mihaela Hrisanta DOBRE

    2012-06-01

    Full Text Available Discrimination means any difference, exclusion, restriction, preference or different treatment that brings forth disadvantages for a person or a group as compared to other ones that are in similar situations. The reasons on which discrimination is based can be various, such as race, nationality, ethnics, religion, gender, sexual orientation, language, age, disabilities etc. and in this case we talk about multiple discrimination. In Romania the main forms of discrimination are linked to ethnics and to sexual appurtenance. Within this column we analysed the discrimination amongst the Romany ethnics people, according to a statistical investigation (Access onto the Labour Market – A Chance for You, the research goal being to identify the answer to the following questions: Is there any discrimination inside the Romany ethnic group? What is the correlation between their level of education and their income? What is the correlation between the level of education of the parents and the respondent’s?

  5. Demographic Consequences of Gender Discrimination in China: Simulation Analysis of Policy Options

    Science.gov (United States)

    Quanbao, Jiang; Marcus W., Feldman

    2013-01-01

    The large number of missing females in China, a consequence of gender discrimination, is having and will continue to have a profound effect on the country's population development. In this paper, we analyze the causes of this gender discrimination in terms of institutions, culture and, economy, and suggest public policies that might help eliminate gender discrimination. Using a population simulation model, we study the effect of public policies on the sex ratio at birth and excess female child mortality, and the effect of gender discrimination on China's population development. We find that gender discrimination will decrease China's population size, number of births, and working age population, accelerate population aging and exacerbate the male marriage squeeze. These results provide theoretical support for suggesting that the government enact and implement public policies aimed at eliminating gender discrimination. PMID:24363477

  6. Phonological experience modulates voice discrimination: Evidence from functional brain networks analysis.

    Science.gov (United States)

    Hu, Xueping; Wang, Xiangpeng; Gu, Yan; Luo, Pei; Yin, Shouhang; Wang, Lijun; Fu, Chao; Qiao, Lei; Du, Yi; Chen, Antao

    2017-10-01

    Numerous behavioral studies have found a modulation effect of phonological experience on voice discrimination. However, the neural substrates underpinning this phenomenon are poorly understood. Here we manipulated language familiarity to test the hypothesis that phonological experience affects voice discrimination via mediating the engagement of multiple perceptual and cognitive resources. The results showed that during voice discrimination, the activation of several prefrontal regions was modulated by language familiarity. More importantly, the same effect was observed concerning the functional connectivity from the fronto-parietal network to the voice-identity network (VIN), and from the default mode network to the VIN. Our findings indicate that phonological experience could bias the recruitment of cognitive control and information retrieval/comparison processes during voice discrimination. Therefore, the study unravels the neural substrates subserving the modulation effect of phonological experience on voice discrimination, and provides new insights into studying voice discrimination from the perspective of network interactions. Copyright © 2017. Published by Elsevier Inc.

  7. Demographic Consequences of Gender Discrimination in China: Simulation Analysis of Policy Options.

    Science.gov (United States)

    Quanbao, Jiang; Shuzhuo, Li; Marcus W, Feldman

    2011-08-01

    The large number of missing females in China, a consequence of gender discrimination, is having and will continue to have a profound effect on the country's population development. In this paper, we analyze the causes of this gender discrimination in terms of institutions, culture and, economy, and suggest public policies that might help eliminate gender discrimination. Using a population simulation model, we study the effect of public policies on the sex ratio at birth and excess female child mortality, and the effect of gender discrimination on China's population development. We find that gender discrimination will decrease China's population size, number of births, and working age population, accelerate population aging and exacerbate the male marriage squeeze. These results provide theoretical support for suggesting that the government enact and implement public policies aimed at eliminating gender discrimination.

  8. Discrimination in relation to parenthood reported by community psychiatric service users in the UK: a framework analysis.

    Science.gov (United States)

    Jeffery, Debra; Clement, Sarah; Corker, Elizabeth; Howard, Louise M; Murray, Joanna; Thornicroft, Graham

    2013-04-20

    Experienced discrimination refers to an individual's perception that they have been treated unfairly due to an attribute and is an important recent focus within stigma research. A significant proportion of mental health service users report experiencing mental illness-based discrimination in relation to parenthood. Existing studies in this area have not gone beyond prevalence, therefore little is known about the nature of experienced discrimination in relation to parenthood, and how is it constituted. This study aims to generate a typology of community psychiatric service users' reports of mental illness-based discrimination in relation to becoming or being a parent. A secondary aim is to assess the prevalence of these types of experienced discrimination. In a telephone survey 2026 community psychiatric service users in ten UK Mental Health service provider organisations (Trusts) were asked about discrimination experienced in the previous 12 months using the Discrimination and Stigma Scale (DISC). The sample were asked if, due to their mental health problem, they had been treated unfairly in starting a family, or in their role as a parent, and gave examples of this. Prevalence is reported and the examples of experienced discrimination in relation to parenthood were analysed using the framework method of qualitative analysis. Three hundred and four participants (73% female) reported experienced discrimination, with prevalences of 22.5% and 28.3% for starting a family and for the parenting role respectively. Participants gave 89 examples of discrimination about starting a family and 228 about parenting, and these occurred in social and professional contexts. Ten themes were identified. These related to being seen as an unfit parent; people not being understanding; being stopped from having children; not being allowed to see their children; not getting the support needed; children being affected; children avoiding their parents; children's difficulties being blamed

  9. Statistical analysis of Thematic Mapper Simulator data for the geobotanical discrimination of rock types in southwest Oregon

    Science.gov (United States)

    Morrissey, L. A.; Weinstock, K. J.; Mouat, D. A.; Card, D. H.

    1984-01-01

    An evaluation of Thematic Mapper Simulator (TMS) data for the geobotanical discrimination of rock types based on vegetative cover characteristics is addressed in this research. A methodology for accomplishing this evaluation utilizing univariate and multivariate techniques is presented. TMS data acquired with a Daedalus DEI-1260 multispectral scanner were integrated with vegetation and geologic information for subsequent statistical analyses, which included a chi-square test, an analysis of variance, stepwise discriminant analysis, and Duncan's multiple range test. Results indicate that ultramafic rock types are spectrally separable from nonultramafics based on vegetative cover through the use of statistical analyses.

  10. Statistical analysis for discrimination of prompt gamma ray peak induced by high energy neutron: Monte Carlo simulation study

    International Nuclear Information System (INIS)

    Do-Kun Yoon; Joo-Young Jung; Tae Suk Suh; Seong-Min Han

    2015-01-01

    The purpose of this research is a statistical analysis for discrimination of prompt gamma ray peak induced by the 14.1 MeV neutron particles from spectra using Monte Carlo simulation. For the simulation, the information of 18 detector materials was used to simulate spectra by the neutron capture reaction. The discrimination of nine prompt gamma ray peaks from the simulation of each detector material was performed. We presented the several comparison indexes of energy resolution performance depending on the detector material using the simulation and statistics for the prompt gamma activation analysis. (author)

  11. Demographic Consequences of Gender Discrimination in China: Simulation Analysis of Policy Options

    OpenAIRE

    Quanbao, Jiang; Shuzhuo, Li; Marcus W., Feldman

    2011-01-01

    The large number of missing females in China, a consequence of gender discrimination, is having and will continue to have a profound effect on the country's population development. In this paper, we analyze the causes of this gender discrimination in terms of institutions, culture and, economy, and suggest public policies that might help eliminate gender discrimination. Using a population simulation model, we study the effect of public policies on the sex ratio at birth and excess female chil...

  12. Laws' masks descriptors applied to bone texture analysis: an innovative and discriminant tool in osteoporosis

    International Nuclear Information System (INIS)

    Rachidi, M.; Marchadier, A.; Gadois, C.; Lespessailles, E.; Chappard, C.; Benhamou, C.L.

    2008-01-01

    The objective of this study was to explore Laws' masks analysis to describe structural variations of trabecular bone due to osteoporosis on high-resolution digital radiographs and to check its dependence on the spatial resolution. Laws' masks are well established as one of the best methods for texture analysis in image processing and are used in various applications, but not in bone tissue characterisation. This method is based on masks that aim to filter the images. From each mask, five classical statistical parameters can be calculated. The study was performed on 182 healthy postmenopausal women with no fractures and 114 age-matched women with fractures [26 hip fractures (HFs), 29 vertebrae fractures (VFs), 29 wrist fractures (WFs) and 30 other fractures (OFs)]. For all subjects radiographs were obtained of the calcaneus with a new high-resolution X-ray device with direct digitisation (BMA, D3A, France). The lumbar spine, femoral neck, and total hip bone mineral density (BMD) were assessed by dual-energy X-ray absorptiometry. In terms of reproducibility, the best results were obtained with the TR E5E5 mask, especially for three parameters: ''mean'', ''standard deviation'' and ''entropy'' with, respectively, in vivo mid-term root mean square average coefficient of variation (RMSCV)%=1.79, 4.24 and 2.05. The ''mean'' and ''entropy'' parameters had a better reproducibility but ''standard deviation'' showed a better discriminant power. Thus, for univariate analysis, the difference between subjects with fractures and controls was significant (P -3 ) and significant for each fracture group independently (P -4 for HF, P=0.025 for VF and P -3 for OF). After multivariate analysis with adjustment for age and total hip BMD, the difference concerning the ''standard deviation'' parameter remained statistically significant between the control group and the HF and VF groups (P -5 , and P=0.04, respectively). No significant correlation between these Laws' masks parameters and

  13. Nonparametric Bayesian density estimation on manifolds with applications to planar shapes.

    Science.gov (United States)

    Bhattacharya, Abhishek; Dunson, David B

    2010-12-01

    Statistical analysis on landmark-based shape spaces has diverse applications in morphometrics, medical diagnostics, machine vision and other areas. These shape spaces are non-Euclidean quotient manifolds. To conduct nonparametric inferences, one may define notions of centre and spread on this manifold and work with their estimates. However, it is useful to consider full likelihood-based methods, which allow nonparametric estimation of the probability density. This article proposes a broad class of mixture models constructed using suitable kernels on a general compact metric space and then on the planar shape space in particular. Following a Bayesian approach with a nonparametric prior on the mixing distribution, conditions are obtained under which the Kullback-Leibler property holds, implying large support and weak posterior consistency. Gibbs sampling methods are developed for posterior computation, and the methods are applied to problems in density estimation and classification with shape-based predictors. Simulation studies show improved estimation performance relative to existing approaches.

  14. NONPARAMETRIC FIXED EFFECT PANEL DATA MODELS: RELATIONSHIP BETWEEN AIR POLLUTION AND INCOME FOR TURKEY

    Directory of Open Access Journals (Sweden)

    Rabia Ece OMAY

    2013-06-01

    Full Text Available In this study, relationship between gross domestic product (GDP per capita and sulfur dioxide (SO2 and particulate matter (PM10 per capita is modeled for Turkey. Nonparametric fixed effect panel data analysis is used for the modeling. The panel data covers 12 territories, in first level of Nomenclature of Territorial Units for Statistics (NUTS, for period of 1990-2001. Modeling of the relationship between GDP and SO2 and PM10 for Turkey, the non-parametric models have given good results.

  15. A Comparative Study of Feature Selection Methods for the Discriminative Analysis of Temporal Lobe Epilepsy

    Directory of Open Access Journals (Sweden)

    Chunren Lai

    2017-12-01

    Full Text Available It is crucial to differentiate patients with temporal lobe epilepsy (TLE from the healthy population and determine abnormal brain regions in TLE. The cortical features and changes can reveal the unique anatomical patterns of brain regions from structural magnetic resonance (MR images. In this study, structural MR images from 41 patients with left TLE, 34 patients with right TLE, and 58 normal controls (NC were acquired, and four kinds of cortical measures, namely cortical thickness, cortical surface area, gray matter volume (GMV, and mean curvature, were explored for discriminative analysis. Three feature selection methods including the independent sample t-test filtering, the sparse-constrained dimensionality reduction model (SCDRM, and the support vector machine-recursive feature elimination (SVM-RFE were investigated to extract dominant features among the compared groups for classification using the support vector machine (SVM classifier. The results showed that the SVM-RFE achieved the highest performance (most classifications with more than 84% accuracy, followed by the SCDRM, and the t-test. Especially, the surface area and GMV exhibited prominent discriminative ability, and the performance of the SVM was improved significantly when the four cortical measures were combined. Additionally, the dominant regions with higher classification weights were mainly located in the temporal and the frontal lobe, including the entorhinal cortex, rostral middle frontal, parahippocampal cortex, superior frontal, insula, and cuneus. This study concluded that the cortical features provided effective information for the recognition of abnormal anatomical patterns and the proposed methods had the potential to improve the clinical diagnosis of TLE.

  16. The use of principal component, discriminate and rough sets analysis methods of radiological data

    International Nuclear Information System (INIS)

    Seddeek, M.K.; Kozae, A.M.; Sharshar, T.; Badran, H.M.

    2006-01-01

    In this work, computational methods of finding clusters of multivariate data points were explored using principal component analysis (PCA), discriminate analysis (DA) and rough set analysis (RSA) methods. The variables were the concentrations of four natural isotopes and the texture characteristics of 100 sand samples from the coast of North Sinai, Egypt. Beach and dune sands are the two types of samples included. These methods were used to reduce the dimensionality of multivariate data and as classification and clustering methods. The results showed that the classification of sands in the environment of North Sinai is dependent upon the radioactivity contents of the naturally occurring radioactive materials and not upon the characteristics of the sand. The application of DA enables the creation of a classification rule for sand type and it revealed that samples with high negatively values of the first score have the highest contamination of black sand. PCA revealed that radioactivity concentrations alone can be considered to predict the classification of other samples. The results of RSA showed that only one of the concentrations of 238 U, 226 Ra and 232 Th with 40 K content, can characterize the clusters together with characteristics of the sand. Both PCA and RSA result in the following conclusion: 238 U, 226 Ra and 232 Th behave similarly. RSA revealed that one/two of them may not be considered without affecting the body of knowledge

  17. Classification of Surface and Deep Soil Samples Using Linear Discriminant Analysis

    International Nuclear Information System (INIS)

    Wasim, M.; Ali, M.; Daud, M.

    2015-01-01

    A statistical analysis was made of the activity concentrations measured in surface and deep soil samples for natural and anthropogenic gamma-emitting radionuclides. Soil samples were obtained from 48 different locations in Gilgit, Pakistan covering about 50 km/sup 2/ areas at an average altitude of 1550 m above sea level. From each location two samples were collected: one from the top soil (2-6 cm) and another from a depth of 6-10 cm. Four radionuclides including /sup 226/Ra, /sup 232/Th, /sup 40/K and /sup 137/Cs were quantified. The data was analyzed using t-test to find out activity concentration difference between the surface and depth samples. At the surface, the median activity concentrations were 23.7, 29.1, 4.6 and 115 Bq kg/sup -1/ for 226Ra, 232Th, 137Cs and 40K respectively. For the same radionuclides, the activity concentrations were respectively 25.5, 26.2, 2.9 and 191 Bq kg/sup -1/ for the depth samples. Principal component analysis (PCA) was applied to explore patterns within the data. A positive significant correlation was observed between the radionuclides /sup 226/Ra and /sup 232/Th. The data from PCA was further utilized in linear discriminant analysis (LDA) for the classification of surface and depth samples. LDA classified surface and depth samples with good predictability. (author)

  18. Classification of root canal microorganisms using electronic-nose and discriminant analysis

    Directory of Open Access Journals (Sweden)

    Özbilge Hatice

    2010-11-01

    Full Text Available Abstract Background Root canal treatment is a debridement process which disrupts and removes entire microorganisms from the root canal system. Identification of microorganisms may help clinicians decide on treatment alternatives such as using different irrigants, intracanal medicaments and antibiotics. However, the difficulty in cultivation and the complexity in isolation of predominant anaerobic microorganisms make clinicians resort to empirical medical treatments. For this reason, identification of microorganisms is not a routinely used procedure in root canal treatment. In this study, we aimed at classifying 7 different standard microorganism strains which are frequently seen in root canal infections, using odor data collected using an electronic nose instrument. Method Our microorganism odor data set consisted of 5 repeated samples from 7 different classes at 4 concentration levels. For each concentration, 35 samples were classified using 3 different discriminant analysis methods. In order to determine an optimal setting for using electronic-nose in such an application, we have tried 3 different approaches in evaluating sensor responses. Moreover, we have used 3 different sensor baseline values in normalizing sensor responses. Since the number of sensors is relatively large compared to sample size, we have also investigated the influence of two different dimension reduction methods on classification performance. Results We have found that quadratic type dicriminant analysis outperforms other varieties of this method. We have also observed that classification performance decreases as the concentration decreases. Among different baseline values used for pre-processing the sensor responses, the model where the minimum values of sensor readings in the sample were accepted as the baseline yields better classification performance. Corresponding to this optimal choice of baseline value, we have noted that among different sensor response model and

  19. Structural Discrimination

    DEFF Research Database (Denmark)

    Thorsen, Mira Skadegård

    discrimination as two ways of articulating particular, opaque forms of racial discrimination that occur in everyday Danish (and other) contexts, and have therefore become normalized. I present and discuss discrimination as it surfaces in data from my empirical studies of discrimination in Danish contexts...

  20. Race, Sex, and Discrimination in School Settings: A Multilevel Analysis of Associations with Delinquency

    Science.gov (United States)

    Chambers, Brittany D.; Erausquin, Jennifer Toller

    2018-01-01

    Background: Adolescence is a critical phase of development and experimentation with delinquent behaviors. There is a growing body of literature exploring individual and structural impacts of discrimination on health outcomes and delinquent behaviors. However, there is limited research assessing how school diversity and discrimination impact…

  1. Perceived Discrimination among African American Adolescents and Allostatic Load: A Longitudinal Analysis with Buffering Effects

    Science.gov (United States)

    Brody, Gene H.; Lei, Man-Kit; Chae, David H.; Yu, Tianyi; Kogan, Steven M.; Beach, Steven R. H.

    2014-01-01

    This study was designed to examine the prospective relations of perceived racial discrimination with allostatic load (AL), along with a possible buffer of the association. A sample of 331 African Americans in the rural South provided assessments of perceived discrimination from ages 16 to 18 years. When youth were 18 years, caregivers reported…

  2. A nutritional risk screening model for patients with liver cirrhosis established using discriminant analysis

    Directory of Open Access Journals (Sweden)

    ZHU Binghua

    2017-06-01

    Full Text Available ObjectiveTo establish a nutritional risk screening model for patients with liver cirrhosis using discriminant analysis. MethodsThe clinical data of 273 patients with liver cirrhosis who were admitted to Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine from August 2015 to March 2016 were collected. Body height, body weight, upper arm circumference, triceps skinfold thickness, subscapular skinfold thickness, and hand grip strength were measured and recorded, and then body mass index (BMI and upper arm muscle circumference were calculated. Laboratory markers including liver function parameters, renal function parameters, and vitamins were measured. The patients were asked to complete Nutritional Risk Screening 2002 and Malnutrition Universal Screening Tool (MUST, and a self-developed nutritional risk screening pathway was used for nutritional risk classification. Observation scales of the four diagnostic methods in traditional Chinese medicine were used to collect patients′ symptoms and signs. Continuous data were expressed as mean±SD (x±s; an analysis of variance was used for comparison between multiple groups, and the least significant difference t-test was used for further comparison between two groups. Discriminant analysis was used for model establishment, and cross validation was used for model verification. ResultsThe nutritional risk screening pathway for patients with liver cirrhosis was used for the screening of respondents, and there were 49 patients (17.95% in non-risk group, 49 (17.95% in possible-risk group, and 175 (64.10% in risk group. The distance criterion function was used to establish the nutritional risk screening model for patients with liver cirrhosis: D1=-11.885+0.310×BMI+0150×MAC+0.005×P-Alb-0.001×Vit B12+0.103×Vit D-0.89×ascites-0.404×weakness-0.560×hypochondriac pain+0035×dysphoria with feverish sensation (note: if a patient has ascites, weakness, hypochondriac pain

  3. The geometry of distributional preferences and a non-parametric identification approach: The Equality Equivalence Test.

    Science.gov (United States)

    Kerschbamer, Rudolf

    2015-05-01

    This paper proposes a geometric delineation of distributional preference types and a non-parametric approach for their identification in a two-person context. It starts with a small set of assumptions on preferences and shows that this set (i) naturally results in a taxonomy of distributional archetypes that nests all empirically relevant types considered in previous work; and (ii) gives rise to a clean experimental identification procedure - the Equality Equivalence Test - that discriminates between archetypes according to core features of preferences rather than properties of specific modeling variants. As a by-product the test yields a two-dimensional index of preference intensity.

  4. On Parametric (and Non-Parametric Variation

    Directory of Open Access Journals (Sweden)

    Neil Smith

    2009-11-01

    Full Text Available This article raises the issue of the correct characterization of ‘Parametric Variation’ in syntax and phonology. After specifying their theoretical commitments, the authors outline the relevant parts of the Principles–and–Parameters framework, and draw a three-way distinction among Universal Principles, Parameters, and Accidents. The core of the contribution then consists of an attempt to provide identity criteria for parametric, as opposed to non-parametric, variation. Parametric choices must be antecedently known, and it is suggested that they must also satisfy seven individually necessary and jointly sufficient criteria. These are that they be cognitively represented, systematic, dependent on the input, deterministic, discrete, mutually exclusive, and irreversible.

  5. Nonparametric predictive pairwise comparison with competing risks

    International Nuclear Information System (INIS)

    Coolen-Maturi, Tahani

    2014-01-01

    In reliability, failure data often correspond to competing risks, where several failure modes can cause a unit to fail. This paper presents nonparametric predictive inference (NPI) for pairwise comparison with competing risks data, assuming that the failure modes are independent. These failure modes could be the same or different among the two groups, and these can be both observed and unobserved failure modes. NPI is a statistical approach based on few assumptions, with inferences strongly based on data and with uncertainty quantified via lower and upper probabilities. The focus is on the lower and upper probabilities for the event that the lifetime of a future unit from one group, say Y, is greater than the lifetime of a future unit from the second group, say X. The paper also shows how the two groups can be compared based on particular failure mode(s), and the comparison of the two groups when some of the competing risks are combined is discussed

  6. Nonparametric estimation of location and scale parameters

    KAUST Repository

    Potgieter, C.J.

    2012-12-01

    Two random variables X and Y belong to the same location-scale family if there are constants μ and σ such that Y and μ+σX have the same distribution. In this paper we consider non-parametric estimation of the parameters μ and σ under minimal assumptions regarding the form of the distribution functions of X and Y. We discuss an approach to the estimation problem that is based on asymptotic likelihood considerations. Our results enable us to provide a methodology that can be implemented easily and which yields estimators that are often near optimal when compared to fully parametric methods. We evaluate the performance of the estimators in a series of Monte Carlo simulations. © 2012 Elsevier B.V. All rights reserved.

  7. Nonparametric inference of network structure and dynamics

    Science.gov (United States)

    Peixoto, Tiago P.

    The network structure of complex systems determine their function and serve as evidence for the evolutionary mechanisms that lie behind them. Despite considerable effort in recent years, it remains an open challenge to formulate general descriptions of the large-scale structure of network systems, and how to reliably extract such information from data. Although many approaches have been proposed, few methods attempt to gauge the statistical significance of the uncovered structures, and hence the majority cannot reliably separate actual structure from stochastic fluctuations. Due to the sheer size and high-dimensionality of many networks, this represents a major limitation that prevents meaningful interpretations of the results obtained with such nonstatistical methods. In this talk, I will show how these issues can be tackled in a principled and efficient fashion by formulating appropriate generative models of network structure that can have their parameters inferred from data. By employing a Bayesian description of such models, the inference can be performed in a nonparametric fashion, that does not require any a priori knowledge or ad hoc assumptions about the data. I will show how this approach can be used to perform model comparison, and how hierarchical models yield the most appropriate trade-off between model complexity and quality of fit based on the statistical evidence present in the data. I will also show how this general approach can be elegantly extended to networks with edge attributes, that are embedded in latent spaces, and that change in time. The latter is obtained via a fully dynamic generative network model, based on arbitrary-order Markov chains, that can also be inferred in a nonparametric fashion. Throughout the talk I will illustrate the application of the methods with many empirical networks such as the internet at the autonomous systems level, the global airport network, the network of actors and films, social networks, citations among

  8. Combining pharmacophore fingerprints and PLS-discriminant analysis for virtual screening and SAR elucidation

    DEFF Research Database (Denmark)

    Askjær, Sune; Langgård, Morten

    2008-01-01

    The criterion of success for the initial stages of a ligand-based drug-discovery project is dual. First, a set of suitable lead compounds has to be identified. Second, a level of a preliminary structure-activity relationship (SAR) of the identified ligands has to be established in order to guide ...... by the protein-binding site known from X-ray complexes. The result of this analysis assists in explaining the efficiency of 2D pharmacophore fingerprints as descriptors in virtual screening....... the lead optimization toward a final drug candidate. This paper presents a combined approach to solving these two problems of ligand-based virtual screening and elucidation of SAR based on interplay between pharmacophore fingerprints and interpretation of PLS-discriminant analysis (PLS-DA) models....... The virtual screening capability of the PLS-DA method is compared to group fusion maximum similarity searching in a test using four graph-based pharmacophore fingerprints over a range of 10 diverse targets. The PLS-DA method was generally found to do better than the Smax method. The GpiDAPH3 and PCH...

  9. Molecular discrimination of lactobacilli used as starter and probiotic cultures by amplified ribosomal DNA restriction analysis.

    Science.gov (United States)

    Roy, D; Sirois, S; Vincent, D

    2001-04-01

    Lactic acid bacteria such as Lactobacillus helveticus, L. delbrueckii subsp. delbrueckii, L. delbrueckii subsp. lactis, L. delbrueckii subsp. bulgaricus, L. acidophilus, and L. casei related taxa which are widely used as starter or probiotic cultures can be identified by amplified ribosomal DNA restriction analysis (ARDRA). The genetic discrimination of the related species belonging to these groups was first obtained by PCR amplifications by using group-specific or species-specific 16S rDNA primers. The numerical analysis of the ARDRA patterns obtained by using CfoI, HinfI, Tru9I, and ScrFI was an efficient typing tool for identification of species of the L. acidophilus and L. casei complex. ARDRA by using CfoI was a reliable method for differentiation of L. delbrueckii subsp. bulgaricus and L. delbrueckii subsp. lactis. Finally, strains ATCC 393 and ATCC 15820 exhibited unique ARDRA patterns with CfoI and Tru9I restriction enzymes as compared with the other strains of L. casei, L. paracasei, and L. rhamnosus.

  10. Two-dimensional statistical linear discriminant analysis for real-time robust vehicle-type recognition

    Science.gov (United States)

    Zafar, I.; Edirisinghe, E. A.; Acar, S.; Bez, H. E.

    2007-02-01

    Automatic vehicle Make and Model Recognition (MMR) systems provide useful performance enhancements to vehicle recognitions systems that are solely based on Automatic License Plate Recognition (ALPR) systems. Several car MMR systems have been proposed in literature. However these approaches are based on feature detection algorithms that can perform sub-optimally under adverse lighting and/or occlusion conditions. In this paper we propose a real time, appearance based, car MMR approach using Two Dimensional Linear Discriminant Analysis that is capable of addressing this limitation. We provide experimental results to analyse the proposed algorithm's robustness under varying illumination and occlusions conditions. We have shown that the best performance with the proposed 2D-LDA based car MMR approach is obtained when the eigenvectors of lower significance are ignored. For the given database of 200 car images of 25 different make-model classifications, a best accuracy of 91% was obtained with the 2D-LDA approach. We use a direct Principle Component Analysis (PCA) based approach as a benchmark to compare and contrast the performance of the proposed 2D-LDA approach to car MMR. We conclude that in general the 2D-LDA based algorithm supersedes the performance of the PCA based approach.

  11. Using stable isotope analysis to discriminate gasoline on the basis of its origin.

    Science.gov (United States)

    Heo, Su-Young; Shin, Woo-Jin; Lee, Sin-Woo; Bong, Yeon-Sik; Lee, Kwang-Sik

    2012-03-15

    Leakage of gasoline and diesel from underground tanks has led to a severe environmental problem in many countries. Tracing the production origin of gasoline and diesel is required to enable the development of dispute resolution and appropriate remediation strategies for the oil-contaminated sites. We investigated the bulk and compound-specific isotopic compositions of gasoline produced by four oil companies in South Korea: S-Oil, SK, GS and Hyundai. The relative abundance of several compounds in gasoline was determined by the peak height of the major ion (m/z 44). The δ(13)C(Bulk) and δD(Bulk) values of gasoline produced by S-Oil were significantly different from those of SK, GS and Hyundai. In particular, the compound-specific isotopic value (δ(13)C(CSIA)) of methyl tert-butyl ether (MTBE) in S-Oil gasoline was significantly lower than that of gasoline produced by other oil companies. The abundance of several compounds in gasoline, such as n-pentane, MTBE, n-hexane, toluene, ethylbenzene and o-xylene, differed widely among gasoline from different oil companies. This study shows that gasoline can be forensically discriminated according to the oil company responsible for its manufacture using stable isotope analysis combined with multivariate statistical analysis. Copyright © 2012 John Wiley & Sons, Ltd.

  12. Ultrasonic analysis to discriminate bread dough of different types of flour

    Science.gov (United States)

    García-Álvarez, J.; Rosell, C. M.; García-Hernández, M. J.; Chávez, J. A.; Turó, A.; Salazar, J.

    2012-12-01

    Many varieties of bread are prepared using flour coming from wheat. However, there are other types of flours milled from rice, legumes and some fruits and vegetables that are also suitable for baking purposes, used alone or in combination with wheat flour. The type of flour employed strongly influences the dough consistency, which is a relevant property for determining the dough potential for breadmaking purposes. Traditional methods for dough testing are relatively expensive, time-consuming, off-line and often require skilled operators. In this work, ultrasonic analysis are performed in order to obtain acoustic properties of bread dough samples prepared using two different types of flour, wheat flour and rice flour. The dough acoustic properties can be related to its viscoelastic characteristics, which in turn determine the dough feasibility for baking. The main advantages of the ultrasonic dough testing can be, among others, its low cost, fast, hygienic and on-line performance. The obtained results point out the potential of the ultrasonic analysis to discriminate doughs of different types of flour.

  13. Ultrasonic analysis to discriminate bread dough of different types of flour

    International Nuclear Information System (INIS)

    García-Álvarez, J; García-Hernández, M J; Chávez, J A; Turó, A; Salazar, J; Rosell, C M

    2012-01-01

    Many varieties of bread are prepared using flour coming from wheat. However, there are other types of flours milled from rice, legumes and some fruits and vegetables that are also suitable for baking purposes, used alone or in combination with wheat flour. The type of flour employed strongly influences the dough consistency, which is a relevant property for determining the dough potential for breadmaking purposes. Traditional methods for dough testing are relatively expensive, time-consuming, off-line and often require skilled operators. In this work, ultrasonic analysis are performed in order to obtain acoustic properties of bread dough samples prepared using two different types of flour, wheat flour and rice flour. The dough acoustic properties can be related to its viscoelastic characteristics, which in turn determine the dough feasibility for baking. The main advantages of the ultrasonic dough testing can be, among others, its low cost, fast, hygienic and on-line performance. The obtained results point out the potential of the ultrasonic analysis to discriminate doughs of different types of flour.

  14. Further studies of crania from ancient northern Africa: an analysis of crania from first dynasty Egyptian tombs, using discriminant functions.

    Science.gov (United States)

    Keita, S O

    1992-03-01

    An analysis of First Dynasty crania from Abydos was undertaken using multiple discriminant functions. The results demonstrate greater affinity with Upper Nile Valley patterns, but also suggest change from earlier craniometric trends. Gene flow and movement of northern officials to the important southern city may explain the findings.

  15. Signal Detection Methods and Discriminant Analysis Applied to Categorization of Newspaper and Government Documents: A Preliminary Study.

    Science.gov (United States)

    Ng, Kwong Bor; Rieh, Soo Young; Kantor, Paul

    2000-01-01

    Discussion of natural language processing focuses on experiments using linear discriminant analysis to distinguish "Wall Street Journal" texts from "Federal Register" tests using information about the frequency of occurrence of word boundaries, sentence boundaries, and punctuation marks. Displays and interprets results in terms…

  16. Search for the standard model Higgs boson in $e^{+}e^{-}$ four- jet topology using neural networks and discriminant analysis

    CERN Document Server

    Mjahed, M

    2003-01-01

    We present an attempt to separate between Higgs boson events (e/sup + /e/sup -/ to ZH to qqbb) and other physics processes in the 4-jet channel (e/sup +/e/sup -/ to Z/ gamma , W/sup +/W, ZZ to 4jets), using the discriminant analysis and neural networks methods. Events were produced at LEP2 energies, using the Lund Monte Carlo generator and the Aleph package. The most discriminant variables as the reconstructed jet mass, the jet properties (b-tag, rapidity weighted moments) and other variables are used. (8 refs).

  17. Discrimination of Geographical Origin of Asian Garlic Using Isotopic and Chemical Datasets under Stepwise Principal Component Analysis.

    Science.gov (United States)

    Liu, Tsang-Sen; Lin, Jhen-Nan; Peng, Tsung-Ren

    2018-01-16

    Isotopic compositions of δ 2 H, δ 18 O, δ 13 C, and δ 15 N and concentrations of 22 trace elements from garlic samples were analyzed and processed with stepwise principal component analysis (PCA) to discriminate garlic's country of origin among Asian regions including South Korea, Vietnam, Taiwan, and China. Results indicate that there is no single trace-element concentration or isotopic composition that can accomplish the study's purpose and the stepwise PCA approach proposed does allow for discrimination between countries on a regional basis. Sequentially, Step-1 PCA distinguishes garlic's country of origin among Taiwanese, South Korean, and Vietnamese samples; Step-2 PCA discriminates Chinese garlic from South Korean garlic; and Step-3 and Step-4 PCA, Chinese garlic from Vietnamese garlic. In model tests, countries of origin of all audit samples were correctly discriminated by stepwise PCA. Consequently, this study demonstrates that stepwise PCA as applied is a simple and effective approach to discriminating country of origin among Asian garlics. © 2018 American Academy of Forensic Sciences.

  18. Identifying Plant Part Composition of Forest Logging Residue Using Infrared Spectral Data and Linear Discriminant Analysis

    Directory of Open Access Journals (Sweden)

    Gifty E. Acquah

    2016-08-01

    Full Text Available As new markets, technologies and economies evolve in the low carbon bioeconomy, forest logging residue, a largely untapped renewable resource will play a vital role. The feedstock can however be variable depending on plant species and plant part component. This heterogeneity can influence the physical, chemical and thermochemical properties of the material, and thus the final yield and quality of products. Although it is challenging to control compositional variability of a batch of feedstock, it is feasible to monitor this heterogeneity and make the necessary changes in process parameters. Such a system will be a first step towards optimization, quality assurance and cost-effectiveness of processes in the emerging biofuel/chemical industry. The objective of this study was therefore to qualitatively classify forest logging residue made up of different plant parts using both near infrared spectroscopy (NIRS and Fourier transform infrared spectroscopy (FTIRS together with linear discriminant analysis (LDA. Forest logging residue harvested from several Pinus taeda (loblolly pine plantations in Alabama, USA, were classified into three plant part components: clean wood, wood and bark and slash (i.e., limbs and foliage. Five-fold cross-validated linear discriminant functions had classification accuracies of over 96% for both NIRS and FTIRS based models. An extra factor/principal component (PC was however needed to achieve this in FTIRS modeling. Analysis of factor loadings of both NIR and FTIR spectra showed that, the statistically different amount of cellulose in the three plant part components of logging residue contributed to their initial separation. This study demonstrated that NIR or FTIR spectroscopy coupled with PCA and LDA has the potential to be used as a high throughput tool in classifying the plant part makeup of a batch of forest logging residue feedstock. Thus, NIR/FTIR could be employed as a tool to rapidly probe/monitor the variability

  19. Diagnosing basal cell carcinoma in vivo by near-infrared Raman spectroscopy: a Principal Components Analysis discrimination algorithm

    Science.gov (United States)

    Silveira, Landulfo, Jr.; Silveira, Fabrício L.; Bodanese, Benito; Pacheco, Marcos Tadeu T.; Zângaro, Renato A.

    2012-02-01

    This work demonstrated the discrimination among basal cell carcinoma (BCC) and normal human skin in vivo using near-infrared Raman spectroscopy. Spectra were obtained in the suspected lesion prior resectional surgery. After tissue withdrawn, biopsy fragments were submitted to histopathology. Spectra were also obtained in the adjacent, clinically normal skin. Raman spectra were measured using a Raman spectrometer (830 nm) with a fiber Raman probe. By comparing the mean spectra of BCC with the normal skin, it has been found important differences in the 800-1000 cm-1 and 1250-1350 cm-1 (vibrations of C-C and amide III, respectively, from lipids and proteins). A discrimination algorithm based on Principal Components Analysis and Mahalanobis distance (PCA/MD) could discriminate the spectra of both tissues with high sensitivity and specificity.

  20. Nonparametric Estimation of Distributions in Random Effects Models

    KAUST Repository

    Hart, Jeffrey D.

    2011-01-01

    We propose using minimum distance to obtain nonparametric estimates of the distributions of components in random effects models. A main setting considered is equivalent to having a large number of small datasets whose locations, and perhaps scales, vary randomly, but which otherwise have a common distribution. Interest focuses on estimating the distribution that is common to all datasets, knowledge of which is crucial in multiple testing problems where a location/scale invariant test is applied to every small dataset. A detailed algorithm for computing minimum distance estimates is proposed, and the usefulness of our methodology is illustrated by a simulation study and an analysis of microarray data. Supplemental materials for the article, including R-code and a dataset, are available online. © 2011 American Statistical Association.

  1. Spurious Seasonality Detection: A Non-Parametric Test Proposal

    Directory of Open Access Journals (Sweden)

    Aurelio F. Bariviera

    2018-01-01

    Full Text Available This paper offers a general and comprehensive definition of the day-of-the-week effect. Using symbolic dynamics, we develop a unique test based on ordinal patterns in order to detect it. This test uncovers the fact that the so-called “day-of-the-week” effect is partly an artifact of the hidden correlation structure of the data. We present simulations based on artificial time series as well. While time series generated with long memory are prone to exhibit daily seasonality, pure white noise signals exhibit no pattern preference. Since ours is a non-parametric test, it requires no assumptions about the distribution of returns, so that it could be a practical alternative to conventional econometric tests. We also made an exhaustive application of the here-proposed technique to 83 stock indexes around the world. Finally, the paper highlights the relevance of symbolic analysis in economic time series studies.

  2. Nonparametric estimation of stochastic differential equations with sparse Gaussian processes.

    Science.gov (United States)

    García, Constantino A; Otero, Abraham; Félix, Paulo; Presedo, Jesús; Márquez, David G

    2017-08-01

    The application of stochastic differential equations (SDEs) to the analysis of temporal data has attracted increasing attention, due to their ability to describe complex dynamics with physically interpretable equations. In this paper, we introduce a nonparametric method for estimating the drift and diffusion terms of SDEs from a densely observed discrete time series. The use of Gaussian processes as priors permits working directly in a function-space view and thus the inference takes place directly in this space. To cope with the computational complexity that requires the use of Gaussian processes, a sparse Gaussian process approximation is provided. This approximation permits the efficient computation of predictions for the drift and diffusion terms by using a distribution over a small subset of pseudosamples. The proposed method has been validated using both simulated data and real data from economy and paleoclimatology. The application of the method to real data demonstrates its ability to capture the behavior of complex systems.

  3. Nonparametric estimation of benchmark doses in environmental risk assessment

    Science.gov (United States)

    Piegorsch, Walter W.; Xiong, Hui; Bhattacharya, Rabi N.; Lin, Lizhen

    2013-01-01

    Summary An important statistical objective in environmental risk analysis is estimation of minimum exposure levels, called benchmark doses (BMDs), that induce a pre-specified benchmark response in a dose-response experiment. In such settings, representations of the risk are traditionally based on a parametric dose-response model. It is a well-known concern, however, that if the chosen parametric form is misspecified, inaccurate and possibly unsafe low-dose inferences can result. We apply a nonparametric approach for calculating benchmark doses, based on an isotonic regression method for dose-response estimation with quantal-response data (Bhattacharya and Kong, 2007). We determine the large-sample properties of the estimator, develop bootstrap-based confidence limits on the BMDs, and explore the confidence limits’ small-sample properties via a short simulation study. An example from cancer risk assessment illustrates the calculations. PMID:23914133

  4. A NONPARAMETRIC HYPOTHESIS TEST VIA THE BOOTSTRAP RESAMPLING

    OpenAIRE

    Temel, Tugrul T.

    2001-01-01

    This paper adapts an already existing nonparametric hypothesis test to the bootstrap framework. The test utilizes the nonparametric kernel regression method to estimate a measure of distance between the models stated under the null hypothesis. The bootstraped version of the test allows to approximate errors involved in the asymptotic hypothesis test. The paper also develops a Mathematica Code for the test algorithm.

  5. Simple nonparametric checks for model data fit in CAT

    NARCIS (Netherlands)

    Meijer, R.R.

    2005-01-01

    In this paper, the usefulness of several nonparametric checks is discussed in a computerized adaptive testing (CAT) context. Although there is no tradition of nonparametric scalability in CAT, it can be argued that scalability checks can be useful to investigate, for example, the quality of item

  6. Nonparametric Bayesian inference for multidimensional compound Poisson processes

    NARCIS (Netherlands)

    Gugushvili, S.; van der Meulen, F.; Spreij, P.

    2015-01-01

    Given a sample from a discretely observed multidimensional compound Poisson process, we study the problem of nonparametric estimation of its jump size density r0 and intensity λ0. We take a nonparametric Bayesian approach to the problem and determine posterior contraction rates in this context,

  7. Analysis of the discriminative methods for diagnosis of benign and malignant solitary pulmonary nodules based on serum markers.

    Science.gov (United States)

    Wang, Wanping; Liu, Mingyue; Wang, Jing; Tian, Rui; Dong, Junqiang; Liu, Qi; Zhao, Xianping; Wang, Yuanfang

    2014-01-01

    Screening indexes of tumor serum markers for benign and malignant solitary pulmonary nodules (SPNs) were analyzed to find the optimum method for diagnosis. Enzyme-linked immunosorbent assays, an automatic immune analyzer and radioimmunoassay methods were used to examine the levels of 8 serum markers in 164 SPN patients, and the sensitivity for differential diagnosis of malignant or benign SPN was compared for detection using a single plasma marker or a combination of markers. The results for serological indicators that closely relate to benign and malignant SPNs were screened using the Fisher discriminant analysis and a non-conditional logistic regression analysis method, respectively. The results were then verified by the k-means clustering analysis method. The sensitivity when using a combination of serum markers to detect SPN was higher than that using a single marker. By Fisher discriminant analysis, cytokeratin 19 fragments (CYFRA21-1), carbohydrate antigen 125 (CA125), squamous cell carcinoma antigen (SCC) and breast cancer antigen (CA153), which relate to the benign and malignant SPNs, were screened. Through non-conditional logistic regression analysis, CYFRA21-1, SCC and CA153 were obtained. Using the k-means clustering analysis, the cophenetic correlation coefficient (0.940) obtained by the Fisher discriminant analysis was higher than that obtained with logistic regression analysis (0.875). This study indicated that the Fisher discriminant analysis functioned better in screening out serum markers to recognize the benign and malignant SPN. The combined detection of CYFRA21-1, CA125, SCC and CA153 is an effective way to distinguish benign and malignant SPN, and will find an important clinical application in the early diagnosis of SPN. © 2014 S. Karger GmbH, Freiburg.

  8. Introduction to multivariate discrimination

    Science.gov (United States)

    Kégl, Balázs

    2013-07-01

    Multivariate discrimination or classification is one of the best-studied problem in machine learning, with a plethora of well-tested and well-performing algorithms. There are also several good general textbooks [1-9] on the subject written to an average engineering, computer science, or statistics graduate student; most of them are also accessible for an average physics student with some background on computer science and statistics. Hence, instead of writing a generic introduction, we concentrate here on relating the subject to a practitioner experimental physicist. After a short introduction on the basic setup (Section 1) we delve into the practical issues of complexity regularization, model selection, and hyperparameter optimization (Section 2), since it is this step that makes high-complexity non-parametric fitting so different from low-dimensional parametric fitting. To emphasize that this issue is not restricted to classification, we illustrate the concept on a low-dimensional but non-parametric regression example (Section 2.1). Section 3 describes the common algorithmic-statistical formal framework that unifies the main families of multivariate classification algorithms. We explain here the large-margin principle that partly explains why these algorithms work. Section 4 is devoted to the description of the three main (families of) classification algorithms, neural networks, the support vector machine, and AdaBoost. We do not go into the algorithmic details; the goal is to give an overview on the form of the functions these methods learn and on the objective functions they optimize. Besides their technical description, we also make an attempt to put these algorithm into a socio-historical context. We then briefly describe some rather heterogeneous applications to illustrate the pattern recognition pipeline and to show how widespread the use of these methods is (Section 5). We conclude the chapter with three essentially open research problems that are either

  9. Introduction to multivariate discrimination

    International Nuclear Information System (INIS)

    Kegl, B.

    2013-01-01

    Multivariate discrimination or classification is one of the best-studied problem in machine learning, with a plethora of well-tested and well-performing algorithms. There are also several good general textbooks [1-9] on the subject written to an average engineering, computer science, or statistics graduate student; most of them are also accessible for an average physics student with some background on computer science and statistics. Hence, instead of writing a generic introduction, we concentrate here on relating the subject to a practitioner experimental physicist. After a short introduction on the basic setup (Section 1) we delve into the practical issues of complexity regularization, model selection, and hyper-parameter optimization (Section 2), since it is this step that makes high-complexity non-parametric fitting so different from low-dimensional parametric fitting. To emphasize that this issue is not restricted to classification, we illustrate the concept on a low-dimensional but non-parametric regression example (Section 2.1). Section 3 describes the common algorithmic-statistical formal framework that unifies the main families of multivariate classification algorithms. We explain here the large-margin principle that partly explains why these algorithms work. Section 4 is devoted to the description of the three main (families of) classification algorithms, neural networks, the support vector machine, and AdaBoost. We do not go into the algorithmic details; the goal is to give an overview on the form of the functions these methods learn and on the objective functions they optimize. Besides their technical description, we also make an attempt to put these algorithm into a socio-historical context. We then briefly describe some rather heterogeneous applications to illustrate the pattern recognition pipeline and to show how widespread the use of these methods is (Section 5). We conclude the chapter with three essentially open research problems that are either

  10. Multivariate fault isolation of batch processes via variable selection in partial least squares discriminant analysis.

    Science.gov (United States)

    Yan, Zhengbing; Kuang, Te-Hui; Yao, Yuan

    2017-09-01

    In recent years, multivariate statistical monitoring of batch processes has become a popular research topic, wherein multivariate fault isolation is an important step aiming at the identification of the faulty variables contributing most to the detected process abnormality. Although contribution plots have been commonly used in statistical fault isolation, such methods suffer from the smearing effect between correlated variables. In particular, in batch process monitoring, the high autocorrelations and cross-correlations that exist in variable trajectories make the smearing effect unavoidable. To address such a problem, a variable selection-based fault isolation method is proposed in this research, which transforms the fault isolation problem into a variable selection problem in partial least squares discriminant analysis and solves it by calculating a sparse partial least squares model. As different from the traditional methods, the proposed method emphasizes the relative importance of each process variable. Such information may help process engineers in conducting root-cause diagnosis. Copyright © 2017 ISA. Published by Elsevier Ltd. All rights reserved.

  11. Predicting the aquatic toxicity mode of action using logistic regression and linear discriminant analysis.

    Science.gov (United States)

    Ren, Y Y; Zhou, L C; Yang, L; Liu, P Y; Zhao, B W; Liu, H X

    2016-09-01

    The paper highlights the use of the logistic regression (LR) method in the construction of acceptable statistically significant, robust and predictive models for the classification of chemicals according to their aquatic toxic modes of action. Essentials accounting for a reliable model were all considered carefully. The model predictors were selected by stepwise forward discriminant analysis (LDA) from a combined pool of experimental data and chemical structure-based descriptors calculated by the CODESSA and DRAGON software packages. Model predictive ability was validated both internally and externally. The applicability domain was checked by the leverage approach to verify prediction reliability. The obtained models are simple and easy to interpret. In general, LR performs much better than LDA and seems to be more attractive for the prediction of the more toxic compounds, i.e. compounds that exhibit excess toxicity versus non-polar narcotic compounds and more reactive compounds versus less reactive compounds. In addition, model fit and regression diagnostics was done through the influence plot which reflects the hat-values, studentized residuals, and Cook's distance statistics of each sample. Overdispersion was also checked for the LR model. The relationships between the descriptors and the aquatic toxic behaviour of compounds are also discussed.

  12. Partial Least Square Discriminant Analysis Discovered a Dietary Pattern Inversely Associated with Nasopharyngeal Carcinoma Risk.

    Science.gov (United States)

    Lo, Yen-Li; Pan, Wen-Harn; Hsu, Wan-Lun; Chien, Yin-Chu; Chen, Jen-Yang; Hsu, Mow-Ming; Lou, Pei-Jen; Chen, I-How; Hildesheim, Allan; Chen, Chien-Jen

    2016-01-01

    Evidence on the association between dietary component, dietary pattern and nasopharyngeal carcinoma (NPC) is scarce. A major challenge is the high degree of correlation among dietary constituents. We aimed to identify dietary pattern associated with NPC and to illustrate the dose-response relationship between the identified dietary pattern scores and the risk of NPC. Taking advantage of a matched NPC case-control study, data from a total of 319 incident cases and 319 matched controls were analyzed. Dietary pattern was derived employing partial least square discriminant analysis (PLS-DA) performed on energy-adjusted food frequencies derived from a 66-item food-frequency questionnaire. Odds ratios (ORs) and 95% confidence intervals (CIs) were estimated with multiple conditional logistic regression models, linking pattern scores and NPC risk. A high score of the PLS-DA derived pattern was characterized by high intakes of fruits, milk, fresh fish, vegetables, tea, and eggs ordered by loading values. We observed that one unit increase in the scores was associated with a significantly lower risk of NPC (ORadj = 0.73, 95% CI = 0.60-0.88) after controlling for potential confounders. Similar results were observed among Epstein-Barr virus seropositive subjects. An NPC protective diet is indicated with more phytonutrient-rich plant foods (fruits, vegetables), milk, other protein-rich foods (in particular fresh fish and eggs), and tea. This information may be used to design potential dietary regimen for NPC prevention.

  13. Why Does Rebalancing Class-Unbalanced Data Improve AUC for Linear Discriminant Analysis?

    Science.gov (United States)

    Xue, Jing-Hao; Hall, Peter

    2015-05-01

    Many established classifiers fail to identify the minority class when it is much smaller than the majority class. To tackle this problem, researchers often first rebalance the class sizes in the training dataset, through oversampling the minority class or undersampling the majority class, and then use the rebalanced data to train the classifiers. This leads to interesting empirical patterns. In particular, using the rebalanced training data can often improve the area under the receiver operating characteristic curve (AUC) for the original, unbalanced test data. The AUC is a widely-used quantitative measure of classification performance, but the property that it increases with rebalancing has, as yet, no theoretical explanation. In this note, using Gaussian-based linear discriminant analysis (LDA) as the classifier, we demonstrate that, at least for LDA, there is an intrinsic, positive relationship between the rebalancing of class sizes and the improvement of AUC. We show that the largest improvement of AUC is achieved, asymptotically, when the two classes are fully rebalanced to be of equal sizes.

  14. Protein Subcellular Localization with Gaussian Kernel Discriminant Analysis and Its Kernel Parameter Selection.

    Science.gov (United States)

    Wang, Shunfang; Nie, Bing; Yue, Kun; Fei, Yu; Li, Wenjia; Xu, Dongshu

    2017-12-15

    Kernel discriminant analysis (KDA) is a dimension reduction and classification algorithm based on nonlinear kernel trick, which can be novelly used to treat high-dimensional and complex biological data before undergoing classification processes such as protein subcellular localization. Kernel parameters make a great impact on the performance of the KDA model. Specifically, for KDA with the popular Gaussian kernel, to select the scale parameter is still a challenging problem. Thus, this paper introduces the KDA method and proposes a new method for Gaussian kernel parameter selection depending on the fact that the differences between reconstruction errors of edge normal samples and those of interior normal samples should be maximized for certain suitable kernel parameters. Experiments with various standard data sets of protein subcellular localization show that the overall accuracy of protein classification prediction with KDA is much higher than that without KDA. Meanwhile, the kernel parameter of KDA has a great impact on the efficiency, and the proposed method can produce an optimum parameter, which makes the new algorithm not only perform as effectively as the traditional ones, but also reduce the computational time and thus improve efficiency.

  15. Conversion Discriminative Analysis on Mild Cognitive Impairment Using Multiple Cortical Features from MR Images

    Directory of Open Access Journals (Sweden)

    Shengwen Guo

    2017-05-01

    Full Text Available Neuroimaging measurements derived from magnetic resonance imaging provide important information required for detecting changes related to the progression of mild cognitive impairment (MCI. Cortical features and changes play a crucial role in revealing unique anatomical patterns of brain regions, and further differentiate MCI patients from normal states. Four cortical features, namely, gray matter volume, cortical thickness, surface area, and mean curvature, were explored for discriminative analysis among three groups including the stable MCI (sMCI, the converted MCI (cMCI, and the normal control (NC groups. In this study, 158 subjects (72 NC, 46 sMCI, and 40 cMCI were selected from the Alzheimer's Disease Neuroimaging Initiative. A sparse-constrained regression model based on the l2-1-norm was introduced to reduce the feature dimensionality and retrieve essential features for the discrimination of the three groups by using a support vector machine (SVM. An optimized strategy of feature addition based on the weight of each feature was adopted for the SVM classifier in order to achieve the best classification performance. The baseline cortical features combined with the longitudinal measurements for 2 years of follow-up data yielded prominent classification results. In particular, the cortical thickness produced a classification with 98.84% accuracy, 97.5% sensitivity, and 100% specificity for the sMCI–cMCI comparison; 92.37% accuracy, 84.78% sensitivity, and 97.22% specificity for the cMCI–NC comparison; and 93.75% accuracy, 92.5% sensitivity, and 94.44% specificity for the sMCI–NC comparison. The best performances obtained by the SVM classifier using the essential features were 5–40% more than those using all of the retained features. The feasibility of the cortical features for the recognition of anatomical patterns was certified; thus, the proposed method has the potential to improve the clinical diagnosis of sub-types of MCI and

  16. Digital discrimination of neutrons and γ-rays in liquid scintillators using pulse gradient analysis

    International Nuclear Information System (INIS)

    D'Mellow, B.; Aspinall, M.D.; Mackin, R.O.; Joyce, M.J.; Peyton, A.J.

    2007-01-01

    A method for the digital discrimination of neutrons and γ-rays in mixed radiation fields is described. Pulses in the time domain, arising from the interaction of photons and neutrons in a liquid scintillator, have been produced using an accepted empirical model and from experimental measurements with an americium-beryllium source. Neutrons and γ-rays have been successfully discriminated in both of these data sets in the digital domain. The digital discrimination method described in this paper is simple and exploits samples early in the life of the pulse. It is thus compatible with current embedded system technologies, offers a degree of immunity to pulse pile-up and heralds a real-time means for neutron/γ discrimination that is fundamental to many potential industrial applications

  17. Discrimination of Clover and Citrus Honeys from Egypt According to Floral Type Using Easily Assessable Physicochemical Parameters and Discriminant Analysis: An External Validation of the Chemometric Approach

    Directory of Open Access Journals (Sweden)

    Ioannis K. Karabagias

    2018-05-01

    Full Text Available Twenty-two honey samples, namely clover and citrus honeys, were collected from the greater Cairo area during the harvesting year 2014–2015. The main purpose of the present study was to characterize the aforementioned honey types and to investigate whether the use of easily assessable physicochemical parameters, including color attributes in combination with chemometrics, could differentiate honey floral origin. Parameters taken into account were: pH, electrical conductivity, ash, free acidity, lactonic acidity, total acidity, moisture content, total sugars (degrees Brix-°Bx, total dissolved solids and their ratio to total acidity, salinity, CIELAB color parameters, along with browning index values. Results showed that all honey samples analyzed met the European quality standards set for honey and had variations in the aforementioned physicochemical parameters depending on floral origin. Application of linear discriminant analysis showed that eight physicochemical parameters, including color, could classify Egyptian honeys according to floral origin (p < 0.05. Correct classification rate was 95.5% using the original method and 90.9% using the cross validation method. The discriminatory ability of the developed model was further validated using unknown honey samples. The overall correct classification rate was not affected. Specific physicochemical parameter analysis in combination with chemometrics has the potential to enhance the differences in floral honeys produced in a given geographical zone.

  18. Discrimination of irradiated MOX fuel from UOX fuel by multivariate statistical analysis of simulated activities of gamma-emitting isotopes

    Science.gov (United States)

    Åberg Lindell, M.; Andersson, P.; Grape, S.; Hellesen, C.; Håkansson, A.; Thulin, M.

    2018-03-01

    This paper investigates how concentrations of certain fission products and their related gamma-ray emissions can be used to discriminate between uranium oxide (UOX) and mixed oxide (MOX) type fuel. Discrimination of irradiated MOX fuel from irradiated UOX fuel is important in nuclear facilities and for transport of nuclear fuel, for purposes of both criticality safety and nuclear safeguards. Although facility operators keep records on the identity and properties of each fuel, tools for nuclear safeguards inspectors that enable independent verification of the fuel are critical in the recovery of continuity of knowledge, should it be lost. A discrimination methodology for classification of UOX and MOX fuel, based on passive gamma-ray spectroscopy data and multivariate analysis methods, is presented. Nuclear fuels and their gamma-ray emissions were simulated in the Monte Carlo code Serpent, and the resulting data was used as input to train seven different multivariate classification techniques. The trained classifiers were subsequently implemented and evaluated with respect to their capabilities to correctly predict the classes of unknown fuel items. The best results concerning successful discrimination of UOX and MOX-fuel were acquired when using non-linear classification techniques, such as the k nearest neighbors method and the Gaussian kernel support vector machine. For fuel with cooling times up to 20 years, when it is considered that gamma-rays from the isotope 134Cs can still be efficiently measured, success rates of 100% were obtained. A sensitivity analysis indicated that these methods were also robust.

  19. Sex determination using discriminant function analysis in Indigenous (Kurubas children and adolescents of Coorg, Karnataka, India: A lateral cephalometric study

    Directory of Open Access Journals (Sweden)

    Darshan Devang Divakar

    2016-11-01

    Full Text Available Aim: To test the validity of sex discrimination using lateral cephalometric radiograph and discriminant function analysis in Indigenous (Kuruba children and adolescents of Coorg, Karnataka, India. Methods and materials: Six hundred and sixteen lateral cephalograms of 380 male and 236 females of age ranging from 6.5 to 18 years of Indigenous population of Coorg, Karnataka, India called Kurubas having a normal occlusion were included in the study. Lateral cephalograms were obtained in a standard position with teeth in centric occlusion and lips relaxed. Each radiograph was traced and cephalometric landmarks were measured using digital calliper. Calculations of 24 cephalometric measurements were performed. Results: Males exhibited significantly greater mean angular and linear cephalometric measurements as compared to females (p < 0.05 (Table 5. Also, significant differences (p < 0.05 were observed in all the variables according to age (Table 6. Out of 24 variables, only ULTc predicts the gender. The reliability of the derived discriminant function was assessed among study subjects; 100% of males and females were recognized correctly. Conclusion: The final outcome of this study validates the existence of sexual dimorphism in the skeleton as early as 6.5 years of age. There is a need for further research to determine other landmarks that can help in sex determination and norms for Indigenous (Kuruba population and also other Indigenous population of Coorg, Karnataka, India. Keywords: Discriminant function analysis, Forensic investigation, Indigenous, Lateral cephalograms, Sex determination

  20. Discrimination and chemical phylogenetic study of seven species of Dendrobium using infrared spectroscopy combined with cluster analysis

    Science.gov (United States)

    Luo, Congpei; He, Tao; Chun, Ze

    2013-04-01

    Dendrobium is a commonly used and precious herb in Traditional Chinese Medicine. The high biodiversity of Dendrobium and the therapeutic needs require tools for the correct and fast discrimination of different Dendrobium species. This study investigates Fourier transform infrared spectroscopy followed by cluster analysis for discrimination and chemical phylogenetic study of seven Dendrobium species. Despite the general pattern of the IR spectra, different intensities, shapes, peak positions were found in the IR spectra of these samples, especially in the range of 1800-800 cm-1. The second derivative transformation and alcoholic extracting procedure obviously enlarged the tiny spectral differences among these samples. The results indicated each Dendrobium species had a characteristic IR spectra profile, which could be used to discriminate them. The similarity coefficients among the samples were analyzed based on their second derivative IR spectra, which ranged from 0.7632 to 0.9700, among the seven Dendrobium species, and from 0.5163 to 0.9615, among the ethanol extracts. A dendrogram was constructed based on cluster analysis the IR spectra for studying the chemical phylogenetic relationships among the samples. The results indicated that D. denneanum and D. crepidatum could be the alternative resources to substitute D. chrysotoxum, D. officinale and D. nobile which were officially recorded in Chinese Pharmacopoeia. In conclusion, with the advantages of high resolution, speediness and convenience, the experimental approach can successfully discriminate and construct the chemical phylogenetic relationships of the seven Dendrobium species.

  1. Evaluation of sensory panels of consumers of specialty coffee beverages using the boosting method in discriminant analysis

    Directory of Open Access Journals (Sweden)

    Gilberto Rodrigues Liska

    2015-12-01

    Full Text Available Automatic classification methods have been widely used in numerous situations and the boosting method has become known for use of a classification algorithm, which considers a set of training data and, from that set, constructs a classifier with reweighted versions of the training set. Given this characteristic, the aim of this study is to assess a sensory experiment related to acceptance tests with specialty coffees, with reference to both trained and untrained consumer groups. For the consumer group, four sensory characteristics were evaluated, such as aroma, body, sweetness, and final score, attributed to four types of specialty coffees. In order to obtain a classification rule that discriminates trained and untrained tasters, we used the conventional Fisher’s Linear Discriminant Analysis (LDA and discriminant analysis via boosting algorithm (AdaBoost. The criteria used in the comparison of the two approaches were sensitivity, specificity, false positive rate, false negative rate, and accuracy of classification methods. Additionally, to evaluate the performance of the classifiers, the success rates and error rates were obtained by Monte Carlo simulation, considering 100 replicas of a random partition of 70% for the training set, and the remaining for the test set. It was concluded that the boosting method applied to discriminant analysis yielded a higher sensitivity rate in regard to the trained panel, at a value of 80.63% and, hence, reduction in the rate of false negatives, at 19.37%. Thus, the boosting method may be used as a means of improving the LDA classifier for discrimination of trained tasters.

  2. [External therapy of plasma cell mastitis by jiuyi powder using partial least-squares discriminant analysis: a safety analysis].

    Science.gov (United States)

    Ye, Mei-na; Yang, Ming; Cheng, Yi-qin; Wang, Bing; Zhu, Ying; Xia, Ya-ru; Meng, Tian; Chen, Hao; Chen, Li-ying; Cheng, Hong-feng

    2015-04-01

    To evaluate the safety and the clinical value of external use of jiuyi Powder (JP) in treating plasma cell mastitis using partial least-squares discriminant analysis (PLSDA). Totally 50 patients with plasma cell mastitis treated by external use of JP were observed and biochemical examinations of blood and urine detected before application, at day 4 after application, at day 1 and 14 after discontinuation. Blood mercury and urinary mercury were detected before application, at day 1, 4, and 7 after application, at day 1 and 14 after discontinuation. Urinary mercury was also detected at 28 after discontinuation and 3 months after discontinuation. The information of wound, days of external application and the total dosage of external application were recorded before application, at day 1, 4, and 7 after application, as well as at day 1 after discontinuation. Then a discriminant model covering potential safety factors was set up by PLSDA after screening safety indices with important effects. The applicability of the model was assessed using area under ROC curve. Potential safety factors were assessed using variable importance in the projection (VIP). Urinary β2-microglobulin (β2-MG), urinary N-acetyl-β-D-glucosaminidase (NAG), 24 h urinary protein, and urinary α1-microglobulin (α1-MG) were greatly affected by external use of JP in treating plasma cell mastitis. The accuracy rate of PLSDA discriminate model was 74. 00%. The sensitivity, specificity, and the area under ROC curve was 0. 7826, 0. 7037, and 0. 8084, respectively. Three factors with greater effect on the potential safety were screened as follows: pre-application volume of the sore cavity, days of external application, and the total dosage of external application. PLSDA method could be used in analyzing bioinformation of clinical Chinese medicine. Urinary β2-MG and urinary NAG were two main safety monitoring indices. Days of external application and the total dosage of external application were main

  3. Bayesian nonparametric adaptive control using Gaussian processes.

    Science.gov (United States)

    Chowdhary, Girish; Kingravi, Hassan A; How, Jonathan P; Vela, Patricio A

    2015-03-01

    Most current model reference adaptive control (MRAC) methods rely on parametric adaptive elements, in which the number of parameters of the adaptive element are fixed a priori, often through expert judgment. An example of such an adaptive element is radial basis function networks (RBFNs), with RBF centers preallocated based on the expected operating domain. If the system operates outside of the expected operating domain, this adaptive element can become noneffective in capturing and canceling the uncertainty, thus rendering the adaptive controller only semiglobal in nature. This paper investigates a Gaussian process-based Bayesian MRAC architecture (GP-MRAC), which leverages the power and flexibility of GP Bayesian nonparametric models of uncertainty. The GP-MRAC does not require the centers to be preallocated, can inherently handle measurement noise, and enables MRAC to handle a broader set of uncertainties, including those that are defined as distributions over functions. We use stochastic stability arguments to show that GP-MRAC guarantees good closed-loop performance with no prior domain knowledge of the uncertainty. Online implementable GP inference methods are compared in numerical simulations against RBFN-MRAC with preallocated centers and are shown to provide better tracking and improved long-term learning.

  4. Nonparametric tests for equality of psychometric functions.

    Science.gov (United States)

    García-Pérez, Miguel A; Núñez-Antón, Vicente

    2017-12-07

    Many empirical studies measure psychometric functions (curves describing how observers' performance varies with stimulus magnitude) because these functions capture the effects of experimental conditions. To assess these effects, parametric curves are often fitted to the data and comparisons are carried out by testing for equality of mean parameter estimates across conditions. This approach is parametric and, thus, vulnerable to violations of the implied assumptions. Furthermore, testing for equality of means of parameters may be misleading: Psychometric functions may vary meaningfully across conditions on an observer-by-observer basis with no effect on the mean values of the estimated parameters. Alternative approaches to assess equality of psychometric functions per se are thus needed. This paper compares three nonparametric tests that are applicable in all situations of interest: The existing generalized Mantel-Haenszel test, a generalization of the Berry-Mielke test that was developed here, and a split variant of the generalized Mantel-Haenszel test also developed here. Their statistical properties (accuracy and power) are studied via simulation and the results show that all tests are indistinguishable as to accuracy but they differ non-uniformly as to power. Empirical use of the tests is illustrated via analyses of published data sets and practical recommendations are given. The computer code in MATLAB and R to conduct these tests is available as Electronic Supplemental Material.

  5. The role of critical ethnic awareness and social support in the discrimination-depression relationship among Asian Americans: path analysis.

    Science.gov (United States)

    Kim, Isok

    2014-01-01

    This study used a path analytic technique to examine associations among critical ethnic awareness, racial discrimination, social support, and depressive symptoms. Using a convenience sample from online survey of Asian American adults (N = 405), the study tested 2 main hypotheses: First, based on the empowerment theory, critical ethnic awareness would be positively associated with racial discrimination experience; and second, based on the social support deterioration model, social support would partially mediate the relationship between racial discrimination and depressive symptoms. The result of the path analysis model showed that the proposed path model was a good fit based on global fit indices, χ²(2) = 4.70, p = .10; root mean square error of approximation = 0.06; comparative fit index = 0.97; Tucker-Lewis index = 0.92; and standardized root mean square residual = 0.03. The examinations of study hypotheses demonstrated that critical ethnic awareness was directly associated (b = .11, p Asian Americans. This study highlights the usefulness of the critical ethnic awareness concept as a way to better understand how Asian Americans might perceive and recognize racial discrimination experiences in relation to its mental health consequences.

  6. Subclassification and Detection of New Markers for the Discrimination of Primary Liver Tumors by Gene Expression Analysis Using Oligonucleotide Arrays.

    Science.gov (United States)

    Hass, Holger G; Vogel, Ulrich; Scheurlen, Michael; Jobst, Jürgen

    2017-12-26

    The failure to correctly differentiate between intrahepatic cholangiocarcinoma [CC] and hepatocellular carcinoma [HCC] is a significant clinical problem, particularly in terms of the different treatment goals for both cancers. In this study a specific gene expression profile to discriminate these two subgroups of liver cancer was established and potential diagnostic markers for clinical use were analyzed. To evaluate the gene expression profiles of HCC and intrahepatic CC, Oligonucleotide arrays ( Affymetrix U133A) were used. Overexpressed genes were checked for their potential use as new markers for discrimination and their expression values were validated by reverse transcription polymerase chain reaction and immunohistochemistry analyses. 695 genes/expressed sequence tags (ESTs) in HCC (245 up-/450 down-regulated) and 552 genes/ESTs in CC (221 up-/331 down-regulated) were significantly dysregulated (p〈0.05, fold change >2, ≥70%). Using a supervised learning method, and one-way analysis of variance a specific 270-gene expression profile that enabled rapid, reproducible differentiation between both tumors and non-malignant liver tissues was established. A panel of 12 genes (e.g. HSP90β, ERG1, GPC3, TKT, ACLY, and NME1 for HCC; SPT2, T4S3, CNX43, TTD1, HBD01 for CC) were detected and partly described for the first time as potential discrimination markers. A specific gene expression profile for discrimination of primary liver cancer was identified and potential marker genes with feasible clinical impact were described.

  7. The use of kernel local Fisher discriminant analysis for the channelization of the Hotelling model observer

    Science.gov (United States)

    Wen, Gezheng; Markey, Mia K.

    2015-03-01

    It is resource-intensive to conduct human studies for task-based assessment of medical image quality and system optimization. Thus, numerical model observers have been developed as a surrogate for human observers. The Hotelling observer (HO) is the optimal linear observer for signal-detection tasks, but the high dimensionality of imaging data results in a heavy computational burden. Channelization is often used to approximate the HO through a dimensionality reduction step, but how to produce channelized images without losing significant image information remains a key challenge. Kernel local Fisher discriminant analysis (KLFDA) uses kernel techniques to perform supervised dimensionality reduction, which finds an embedding transformation that maximizes betweenclass separability and preserves within-class local structure in the low-dimensional manifold. It is powerful for classification tasks, especially when the distribution of a class is multimodal. Such multimodality could be observed in many practical clinical tasks. For example, primary and metastatic lesions may both appear in medical imaging studies, but the distributions of their typical characteristics (e.g., size) may be very different. In this study, we propose to use KLFDA as a novel channelization method. The dimension of the embedded manifold (i.e., the result of KLFDA) is a counterpart to the number of channels in the state-of-art linear channelization. We present a simulation study to demonstrate the potential usefulness of KLFDA for building the channelized HOs (CHOs) and generating reliable decision statistics for clinical tasks. We show that the performance of the CHO with KLFDA channels is comparable to that of the benchmark CHOs.

  8. DNA content analysis allows discrimination between Trypanosoma cruzi and Trypanosoma rangeli.

    Science.gov (United States)

    Naves, Lucila Langoni; da Silva, Marcos Vinícius; Fajardo, Emanuella Francisco; da Silva, Raíssa Bernardes; De Vito, Fernanda Bernadelli; Rodrigues, Virmondes; Lages-Silva, Eliane; Ramírez, Luis Eduardo; Pedrosa, André Luiz

    2017-01-01

    Trypanosoma cruzi, a human protozoan parasite, is the causative agent of Chagas disease. Currently the species is divided into six taxonomic groups. The genome of the CL Brener clone has been estimated to be 106.4-110.7 Mb, and DNA content analyses revealed that it is a diploid hybrid clone. Trypanosoma rangeli is a hemoflagellate that has the same reservoirs and vectors as T. cruzi; however, it is non-pathogenic to vertebrate hosts. The haploid genome of T. rangeli was previously estimated to be 24 Mb. The parasitic strains of T. rangeli are divided into KP1(+) and KP1(-). Thus, the objective of this study was to investigate the DNA content in different strains of T. cruzi and T. rangeli by flow cytometry. All T. cruzi and T. rangeli strains yielded cell cycle profiles with clearly identifiable G1-0 (2n) and G2-M (4n) peaks. T. cruzi and T. rangeli genome sizes were estimated using the clone CL Brener and the Leishmania major CC1 as reference cell lines because their genome sequences have been previously determined. The DNA content of T. cruzi strains ranged from 87,41 to 108,16 Mb, and the DNA content of T. rangeli strains ranged from 63,25 Mb to 68,66 Mb. No differences in DNA content were observed between KP1(+) and KP1(-) T. rangeli strains. Cultures containing mixtures of the epimastigote forms of T. cruzi and T. rangeli strains resulted in cell cycle profiles with distinct G1 peaks for strains of each species. These results demonstrate that DNA content analysis by flow cytometry is a reliable technique for discrimination between T. cruzi and T. rangeli isolated from different hosts.

  9. Multi-task linear programming discriminant analysis for the identification of progressive MCI individuals.

    Directory of Open Access Journals (Sweden)

    Guan Yu

    Full Text Available Accurately identifying mild cognitive impairment (MCI individuals who will progress to Alzheimer's disease (AD is very important for making early interventions. Many classification methods focus on integrating multiple imaging modalities such as magnetic resonance imaging (MRI and fluorodeoxyglucose positron emission tomography (FDG-PET. However, the main challenge for MCI classification using multiple imaging modalities is the existence of a lot of missing data in many subjects. For example, in the Alzheimer's Disease Neuroimaging Initiative (ADNI study, almost half of the subjects do not have PET images. In this paper, we propose a new and flexible binary classification method, namely Multi-task Linear Programming Discriminant (MLPD analysis, for the incomplete multi-source feature learning. Specifically, we decompose the classification problem into different classification tasks, i.e., one for each combination of available data sources. To solve all different classification tasks jointly, our proposed MLPD method links them together by constraining them to achieve the similar estimated mean difference between the two classes (under classification for those shared features. Compared with the state-of-the-art incomplete Multi-Source Feature (iMSF learning method, instead of constraining different classification tasks to choose a common feature subset for those shared features, MLPD can flexibly and adaptively choose different feature subsets for different classification tasks. Furthermore, our proposed MLPD method can be efficiently implemented by linear programming. To validate our MLPD method, we perform experiments on the ADNI baseline dataset with the incomplete MRI and PET images from 167 progressive MCI (pMCI subjects and 226 stable MCI (sMCI subjects. We further compared our method with the iMSF method (using incomplete MRI and PET images and also the single-task classification method (using only MRI or only subjects with both MRI and

  10. Identifikasi Huruf Kapital Tulisan Tangan Menggunakan Linear Discriminant Analysis dan Euclidean Distance

    Directory of Open Access Journals (Sweden)

    Septa Cahyani

    2018-04-01

    Full Text Available The human ability to recognize a variety of objects, however complex the object, is the special ability that humans possess. Any normal human will have no difficulty in recognizing handwriting objects between an author and another author. With the rapid development of digital technology, the human ability to recognize handwriting objects has been applied in a program known as Computer Vision. This study aims to create identification system different types of handwriting capital letters that have different sizes, thickness, shape, and tilt (distinctive features in handwriting using Linear Discriminant Analysis (LDA and Euclidean Distance methods. LDA is used to obtain characteristic characteristics of the image and provide the distance between the classes becomes larger, while the distance between training data in one class becomes smaller, so that the introduction time of digital image of handwritten capital letter using Euclidean Distance becomes faster computation time (by searching closest distance between training data and data testing. The results of testing the sample data showed that the image resolution of 50x50 pixels is the exact image resolution used for data as much as 1560 handwritten capital letter data compared to image resolution 25x25 pixels and 40x40 pixels. While the test data and training data testing using the method of 10-fold cross validation where 1404 for training data and 156 for data testing showed identification of digital image handwriting capital letter has an average effectiveness of the accuracy rate of 75.39% with the average time computing of 0.4199 seconds.

  11. Multi-task linear programming discriminant analysis for the identification of progressive MCI individuals.

    Science.gov (United States)

    Yu, Guan; Liu, Yufeng; Thung, Kim-Han; Shen, Dinggang

    2014-01-01

    Accurately identifying mild cognitive impairment (MCI) individuals who will progress to Alzheimer's disease (AD) is very important for making early interventions. Many classification methods focus on integrating multiple imaging modalities such as magnetic resonance imaging (MRI) and fluorodeoxyglucose positron emission tomography (FDG-PET). However, the main challenge for MCI classification using multiple imaging modalities is the existence of a lot of missing data in many subjects. For example, in the Alzheimer's Disease Neuroimaging Initiative (ADNI) study, almost half of the subjects do not have PET images. In this paper, we propose a new and flexible binary classification method, namely Multi-task Linear Programming Discriminant (MLPD) analysis, for the incomplete multi-source feature learning. Specifically, we decompose the classification problem into different classification tasks, i.e., one for each combination of available data sources. To solve all different classification tasks jointly, our proposed MLPD method links them together by constraining them to achieve the similar estimated mean difference between the two classes (under classification) for those shared features. Compared with the state-of-the-art incomplete Multi-Source Feature (iMSF) learning method, instead of constraining different classification tasks to choose a common feature subset for those shared features, MLPD can flexibly and adaptively choose different feature subsets for different classification tasks. Furthermore, our proposed MLPD method can be efficiently implemented by linear programming. To validate our MLPD method, we perform experiments on the ADNI baseline dataset with the incomplete MRI and PET images from 167 progressive MCI (pMCI) subjects and 226 stable MCI (sMCI) subjects. We further compared our method with the iMSF method (using incomplete MRI and PET images) and also the single-task classification method (using only MRI or only subjects with both MRI and PET images

  12. Differential discriminator

    International Nuclear Information System (INIS)

    Dukhanov, V.I.; Mazurov, I.B.

    1981-01-01

    A principal flowsheet of a differential discriminator intended for operation in a spectrometric circuit with statistical time distribution of pulses is described. The differential discriminator includes four integrated discriminators and a channel of piled-up signal rejection. The presence of the rejection channel enables the discriminator to operate effectively at loads of 14x10 3 pulse/s. The temperature instability of the discrimination thresholds equals 250 μV/ 0 C. The discrimination level changes within 0.1-5 V, the level shift constitutes 0.5% for the filling ratio of 1:10. The rejection coefficient is not less than 90%. Alpha spectrum of the 228 Th source is presented to evaluate the discriminator operation with the rejector. The rejector provides 50 ns time resolution

  13. Automated discrimination of lower and higher grade gliomas based on histopathological image analysis

    Directory of Open Access Journals (Sweden)

    Hojjat Seyed Mousavi

    2015-01-01

    Full Text Available Introduction: Histopathological images have rich structural information, are multi-channel in nature and contain meaningful pathological information at various scales. Sophisticated image analysis tools that can automatically extract discriminative information from the histopathology image slides for diagnosis remain an area of significant research activity. In this work, we focus on automated brain cancer grading, specifically glioma grading. Grading of a glioma is a highly important problem in pathology and is largely done manually by medical experts based on an examination of pathology slides (images. To complement the efforts of clinicians engaged in brain cancer diagnosis, we develop novel image processing algorithms and systems to automatically grade glioma tumor into two categories: Low-grade glioma (LGG and high-grade glioma (HGG which represent a more advanced stage of the disease. Results: We propose novel image processing algorithms based on spatial domain analysis for glioma tumor grading that will complement the clinical interpretation of the tissue. The image processing techniques are developed in close collaboration with medical experts to mimic the visual cues that a clinician looks for in judging of the grade of the disease. Specifically, two algorithmic techniques are developed: (1 A cell segmentation and cell-count profile creation for identification of Pseudopalisading Necrosis, and (2 a customized operation of spatial and morphological filters to accurately identify microvascular proliferation (MVP. In both techniques, a hierarchical decision is made via a decision tree mechanism. If either Pseudopalisading Necrosis or MVP is found present in any part of the histopathology slide, the whole slide is identified as HGG, which is consistent with World Health Organization guidelines. Experimental results on the Cancer Genome Atlas database are presented in the form of: (1 Successful detection rates of pseudopalisading necrosis

  14. Predicting The Type Of Pregnancy Using Flexible Discriminate Analysis And Artificial Neural Networks: A Comparison Study

    International Nuclear Information System (INIS)

    Hooman, A.; Mohammadzadeh, M.

    2008-01-01

    Some medical and epidemiological surveys have been designed to predict a nominal response variable with several levels. With regard to the type of pregnancy there are four possible states: wanted, unwanted by wife, unwanted by husband and unwanted by couple. In this paper, we have predicted the type of pregnancy, as well as the factors influencing it using three different models and comparing them. Regarding the type of pregnancy with several levels, we developed a multinomial logistic regression, a neural network and a flexible discrimination based on the data and compared their results using tow statistical indices: Surface under curve (ROC) and kappa coefficient. Based on these tow indices, flexible discrimination proved to be a better fit for prediction on data in comparison to other methods. When the relations among variables are complex, one can use flexible discrimination instead of multinomial logistic regression and neural network to predict the nominal response variables with several levels in order to gain more accurate predictions

  15. Early discrimination of nasopharyngeal carcinoma based on tissue deoxyribose nucleic acid surface-enhanced Raman spectroscopy analysis

    Science.gov (United States)

    Qiu, Sufang; Li, Chao; Lin, Jinyong; Xu, Yuanji; Lu, Jun; Huang, Qingting; Zou, Changyan; Chen, Chao; Xiao, Nanyang; Lin, Duo; Chen, Rong; Pan, Jianji; Feng, Shangyuan

    2016-12-01

    Surface-enhanced Raman spectroscopy (SERS) was employed to detect deoxyribose nucleic acid (DNA) variations associated with the development of nasopharyngeal carcinoma (NPC). Significant SERS spectral differences between the DNA extracted from early NPC, advanced NPC, and normal nasopharyngeal tissue specimens were observed at 678, 729, 788, 1337, 1421, 1506, and 1573 cm-1, which reflects the genetic variations in NPC. Principal component analysis combined with discriminant function analysis for early NPC discrimination yielded a diagnostic accuracy of 86.8%, 92.3%, and 87.9% for early NPC, advanced NPC, and normal nasopharyngeal tissue DNA, respectively. In this exploratory study, we demonstrated the potential of SERS for early detection of NPC based on the DNA molecular study of biopsy tissues.

  16. EVOLUTION OF NEUROENDOCRINE CELL POPULATION AND PEPTIDERGIC INNERVATION, ASSESSED BY DISCRIMINANT ANALYSIS, DURING POSTNATAL DEVELOPMENT OF THE RAT PROSTATE

    Directory of Open Access Journals (Sweden)

    Rosario Rodríguez

    2011-05-01

    Full Text Available Serotonin immunoreactive neuroendocrine cells and peptidergic nerves (NPY and VIP could have a role in prostate growth and function. In the present study, rats grouped by stages of postnatal development (prepubertal, pubertal, young and aged adults were employed in order to ascertain whether age causes changes in the number of serotoninergic neuroendocrine cells and the length of NPY and VIP fibres. Discriminant analysis was performed in order to ascertain the classificatory power of stereologic variables (absolute and relative measurements of cell number and fibre length on age groups. The following conclusions were drawn: a discriminant analysis confirms the androgen-dependence of both neuroendocrine cells and NPYVIP innervation during the postnatal development of the rat prostate; b periglandular innervation has more relevance than interglandular innervation in classifying the rats in age groups; and c peptidergic nerves from ventral, ampullar and periductal regions were more age-dependent than nerves from the dorso-lateral region.

  17. An automated land-use mapping comparison of the Bayesian maximum likelihood and linear discriminant analysis algorithms

    Science.gov (United States)

    Tom, C. H.; Miller, L. D.

    1984-01-01

    The Bayesian maximum likelihood parametric classifier has been tested against the data-based formulation designated 'linear discrimination analysis', using the 'GLIKE' decision and "CLASSIFY' classification algorithms in the Landsat Mapping System. Identical supervised training sets, USGS land use/land cover classes, and various combinations of Landsat image and ancilliary geodata variables, were used to compare the algorithms' thematic mapping accuracy on a single-date summer subscene, with a cellularized USGS land use map of the same time frame furnishing the ground truth reference. CLASSIFY, which accepts a priori class probabilities, is found to be more accurate than GLIKE, which assumes equal class occurrences, for all three mapping variable sets and both levels of detail. These results may be generalized to direct accuracy, time, cost, and flexibility advantages of linear discriminant analysis over Bayesian methods.

  18. Rotation and Noise Invariant Near-Infrared Face Recognition by means of Zernike Moments and Spectral Regression Discriminant Analysis

    Czech Academy of Sciences Publication Activity Database

    Farokhi, S.; Shamsuddin, S. M.; Flusser, Jan; Sheikh, U. U.; Khansari, M.; Jafari-Khouzani, K.

    2013-01-01

    Roč. 22, č. 1 (2013), s. 1-11 ISSN 1017-9909 R&D Projects: GA ČR GAP103/11/1552 Keywords : face recognition * infrared imaging * image moments Subject RIV: JD - Computer Applications, Robotics Impact factor: 0.850, year: 2013 http://library.utia.cas.cz/separaty/2013/ZOI/flusser-rotation and noise invariant near-infrared face recognition by means of zernike moments and spectral regression discriminant analysis.pdf

  19. A qualitative analysis of hate speech reported to the Romanian National Council for Combating Discrimination (2003‑2015)

    OpenAIRE

    Adriana Iordache

    2015-01-01

    The article analyzes the specificities of Romanian hate speech over a period of twelve years through a qualitative analysis of 384 Decisions of the National Council for Combating Discrimination. The study employs a coding methodology which allows one to separate decisions according to the group that was the victim of hate speech. The article finds that stereotypes employed are similar to those encountered in the international literature. The main target of hate speech is the Roma, who are ...

  20. Effects of measurement errors on psychometric measurements in ergonomics studies: Implications for correlations, ANOVA, linear regression, factor analysis, and linear discriminant analysis.

    Science.gov (United States)

    Liu, Yan; Salvendy, Gavriel

    2009-05-01

    This paper aims to demonstrate the effects of measurement errors on psychometric measurements in ergonomics studies. A variety of sources can cause random measurement errors in ergonomics studies and these errors can distort virtually every statistic computed and lead investigators to erroneous conclusions. The effects of measurement errors on five most widely used statistical analysis tools have been discussed and illustrated: correlation; ANOVA; linear regression; factor analysis; linear discriminant analysis. It has been shown that measurement errors can greatly attenuate correlations between variables, reduce statistical power of ANOVA, distort (overestimate, underestimate or even change the sign of) regression coefficients, underrate the explanation contributions of the most important factors in factor analysis and depreciate the significance of discriminant function and discrimination abilities of individual variables in discrimination analysis. The discussions will be restricted to subjective scales and survey methods and their reliability estimates. Other methods applied in ergonomics research, such as physical and electrophysiological measurements and chemical and biomedical analysis methods, also have issues of measurement errors, but they are beyond the scope of this paper. As there has been increasing interest in the development and testing of theories in ergonomics research, it has become very important for ergonomics researchers to understand the effects of measurement errors on their experiment results, which the authors believe is very critical to research progress in theory development and cumulative knowledge in the ergonomics field.

  1. A rapid method to screen for cell-wall mutants using discriminant analysis of Fourier transform infrared spectra

    International Nuclear Information System (INIS)

    Chen LiMei; Carpita, N.C.; Reiter, W.D.; Wilson, R.H.; Jeffries, C.; McCann, M.C.

    1998-01-01

    We have developed a rapid method to screen large numbers of mutant plants for a broad range of cell wall phenotypes using Fourier transform infrared (FTIR) microspectroscopy of leaves. We established and validated a model that can discriminate between the leaves of wild-type and a previously defined set of cell-wall mutants of Arabidopsis. Exploratory principal component analysis indicated that mutants deficient in different cell-wall sugars can be distinguished from each other. Discrimination of cell-wall mutants from wild-type was independent of variability in starch content or additional unrelated mutations that might be present in a heavily mutagenised population. We then developed an analysis of FTIR spectra of leaves obtained from over 1000 mutagenised flax plants, and selected 59 plants whose spectral variation from wild-type was significantly out of the range of a wild-type population, determined by Mahalanobis distance. Cell wall sugars from the leaves of selected putative mutants were assayed by gas chromatography-mass spectrometry and 42 showed significant differences in neutral sugar composition. The FTIR spectra indicated that six of the remaining 17 plants have altered ester or protein content. We conclude that linear discriminant analysis of FTIR spectra is a robust method to identify a broad range of structural and architectural alterations in cell walls, appearing as a consequence of developmental regulation, environmental adaptation or genetic modification. (author)

  2. Describing three-class task performance: three-class linear discriminant analysis and three-class ROC analysis

    Science.gov (United States)

    He, Xin; Frey, Eric C.

    2007-03-01

    Binary ROC analysis has solid decision-theoretic foundations and a close relationship to linear discriminant analysis (LDA). In particular, for the case of Gaussian equal covariance input data, the area under the ROC curve (AUC) value has a direct relationship to the Hotelling trace. Many attempts have been made to extend binary classification methods to multi-class. For example, Fukunaga extended binary LDA to obtain multi-class LDA, which uses the multi-class Hotelling trace as a figure-of-merit, and we have previously developed a three-class ROC analysis method. This work explores the relationship between conventional multi-class LDA and three-class ROC analysis. First, we developed a linear observer, the three-class Hotelling observer (3-HO). For Gaussian equal covariance data, the 3- HO provides equivalent performance to the three-class ideal observer and, under less strict conditions, maximizes the signal to noise ratio for classification of all pairs of the three classes simultaneously. The 3-HO templates are not the eigenvectors obtained from multi-class LDA. Second, we show that the three-class Hotelling trace, which is the figureof- merit in the conventional three-class extension of LDA, has significant limitations. Third, we demonstrate that, under certain conditions, there is a linear relationship between the eigenvectors obtained from multi-class LDA and 3-HO templates. We conclude that the 3-HO based on decision theory has advantages both in its decision theoretic background and in the usefulness of its figure-of-merit. Additionally, there exists the possibility of interpreting the two linear features extracted by the conventional extension of LDA from a decision theoretic point of view.

  3. Multi-sample nonparametric treatments comparison in medical ...

    African Journals Online (AJOL)

    Multi-sample nonparametric treatments comparison in medical follow-up study with unequal observation processes through simulation and bladder tumour case study. P. L. Tan, N.A. Ibrahim, M.B. Adam, J. Arasan ...

  4. Multiple endmember spectral-angle-mapper (SAM) analysis improves discrimination of Savanna tree species

    CSIR Research Space (South Africa)

    Cho, Moses A

    2009-08-01

    Full Text Available of this paper was to evaluate the classification performance of a multiple-endmember spectral angle mapper (SAM) classification approach in discriminating seven common African savanna tree species and to compare the results with the traditional SAM classifier...

  5. An Information Analysis of 2-, 3-, and 4-Word Verbal Discrimination Learning.

    Science.gov (United States)

    Arima, James K.; Gray, Francis D.

    Information theory was used to qualify the difficulty of verbal discrimination (VD) learning tasks and to measure VD performance. Words for VD items were selected with high background frequency and equal a priori probabilities of being selected as a first response. Three VD lists containing only 2-, 3-, or 4-word items were created and equated for…

  6. A Qualitative Analysis of Multiracial Students' Experiences with Prejudice and Discrimination in College

    Science.gov (United States)

    Museus, Samuel D.; Lambe Sariñana, Susan A.; Yee, April L.; Robinson, Thomas E.

    2016-01-01

    Mixed-race persons constitute a substantial and growing population in the United States. We examined multiracial college students' experiences with prejudice and discrimination in college with conducted focus group interviews with 12 mixed-race participants and individual interviews with 22 mixed-race undergraduates to understand how they…

  7. Speaker Linking and Applications using Non-Parametric Hashing Methods

    Science.gov (United States)

    2016-09-08

    nonparametric estimate of a multivariate density function,” The Annals of Math- ematical Statistics , vol. 36, no. 3, pp. 1049–1051, 1965. [9] E. A. Patrick...Speaker Linking and Applications using Non-Parametric Hashing Methods† Douglas Sturim and William M. Campbell MIT Lincoln Laboratory, Lexington, MA...with many approaches [1, 2]. For this paper, we focus on using i-vectors [2], but the methods apply to any embedding. For the task of speaker QBE and

  8. Gender-based discrimination in South Africa: A quantitative analysis of fairness of remuneration

    Directory of Open Access Journals (Sweden)

    Renier Steyn

    2015-05-01

    Full Text Available Equity is important to most individuals and its perceived absence  may impact negatively on individual and organisational performance. The concept of equity presupposes fair treatment, while discrimination implies unfair treatment. The perceptions of discrimination, or being treated unfairly, may result from psycho-social processes, or from data that justifies discrimination and is quantifiable. Objectives: To assess whether differences in post grading and remuneration for males and females are based on gender, rather than on quantifiable variables that could justify these differences. Method: Biographical information was gathered from 1740 employees representing 29 organisations. The data collected included self-reported post grading (dependent variable and 14 independent variables, which may predict the employees’ post gradings. The independent variables related primarily to education, tenure and family responsibility. Results: Males reported higher post gradings and higher salaries than those of females, but the difference was not statistically significant and the practical significance of this difference was slight. Qualification types, job specific training, and membership of professional bodies did not affect post grading along gender lines. The ways in which work experience was measured had no influence on post grading or salary for either males or females. Furthermore, family responsibility, union membership and the type of work the employees performed did not influence the employees’ post grading. The only difference found concerned the unfair treatment of males, particularly those who were well-qualified.   Conclusions: Objective evidence of unfair gender-based discrimination affecting post grading and salary is scarce, and the few differences that do occur have little statistical and practical significance. Perceptions of being discriminated against may therefore more often be seen as the result of psycho-social processes and

  9. Studies in genetic discrimination. Final progress report

    Energy Technology Data Exchange (ETDEWEB)

    1994-06-01

    We have screened 1006 respondents in a study of genetic discrimination. Analysis of these responses has produced evidence of the range of institutions engaged in genetic discrimination and demonstrates the impact of this discrimination on the respondents to the study. We have found that both ignorance and policy underlie genetic discrimination and that anti-discrimination laws are being violated.

  10. Otolith shape analysis for stock discrimination of two Collichthys genus croaker (Pieces: Sciaenidae,) from the northern Chinese coast

    Science.gov (United States)

    Zhao, Bo; Liu, Jinhu; Song, Junjie; Cao, Liang; Dou, Shuozeng

    2017-08-01

    The otolith morphology of two croaker species (Collichthys lucidus and Collichthys niveatus) from three areas (Liaodong Bay, LD; Huanghe (Yellow) River estuary, HRE; Jiaozhou Bay, JZ) along the northern Chinese coast were investigated for species identification and stock discrimination. The otolith contour shape described by elliptic Fourier coefficients (EFC) were analysed using principal components analysis (PCA) and stepwise canonical discriminant analysis (CDA) to identify species and stocks. The two species were well differentiated, with an overall classification success rate of 97.8%. And variations in the otolith shapes were significant enough to discriminate among the three geographical samples of C. lucidus (67.7%) or C. niveatus (65.2%). Relatively high mis-assignment occurred between the geographically adjacent LD and HRE samples, which implied that individual mixing may exist between the two samples. This study yielded information complementary to that derived from genetic studies and provided information for assessing the stock structure of C. lucidus and C. niveatus in the Bohai Sea and the Yellow Sea.

  11. Optimal Threshold Determination for Discriminating Driving Anger Intensity Based on EEG Wavelet Features and ROC Curve Analysis

    Directory of Open Access Journals (Sweden)

    Ping Wan

    2016-08-01

    Full Text Available Driving anger, called “road rage”, has become increasingly common nowadays, affecting road safety. A few researches focused on how to identify driving anger, however, there is still a gap in driving anger grading, especially in real traffic environment, which is beneficial to take corresponding intervening measures according to different anger intensity. This study proposes a method for discriminating driving anger states with different intensity based on Electroencephalogram (EEG spectral features. First, thirty drivers were recruited to conduct on-road experiments on a busy route in Wuhan, China where anger could be inducted by various road events, e.g., vehicles weaving/cutting in line, jaywalking/cyclist crossing, traffic congestion and waiting red light if they want to complete the experiments ahead of basic time for extra paid. Subsequently, significance analysis was used to select relative energy spectrum of β band (β% and relative energy spectrum of θ band (θ% for discriminating the different driving anger states. Finally, according to receiver operating characteristic (ROC curve analysis, the optimal thresholds (best cut-off points of β% and θ% for identifying none anger state (i.e., neutral were determined to be 0.2183 ≤ θ% < 1, 0 < β% < 0.2586; low anger state is 0.1539 ≤ θ% < 0.2183, 0.2586 ≤ β% < 0.3269; moderate anger state is 0.1216 ≤ θ% < 0.1539, 0.3269 ≤ β% < 0.3674; high anger state is 0 < θ% < 0.1216, 0.3674 ≤ β% < 1. Moreover, the discrimination performances of verification indicate that, the overall accuracy (Acc of the optimal thresholds of β% for discriminating the four driving anger states is 80.21%, while 75.20% for that of θ%. The results can provide theoretical foundation for developing driving anger detection or warning devices based on the relevant optimal thresholds.

  12. DPpackage: Bayesian Semi- and Nonparametric Modeling in R

    Directory of Open Access Journals (Sweden)

    Alejandro Jara

    2011-04-01

    Full Text Available Data analysis sometimes requires the relaxation of parametric assumptions in order to gain modeling flexibility and robustness against mis-specification of the probability model. In the Bayesian context, this is accomplished by placing a prior distribution on a function space, such as the space of all probability distributions or the space of all regression functions. Unfortunately, posterior distributions ranging over function spaces are highly complex and hence sampling methods play a key role. This paper provides an introduction to a simple, yet comprehensive, set of programs for the implementation of some Bayesian nonparametric and semiparametric models in R, DPpackage. Currently, DPpackage includes models for marginal and conditional density estimation, receiver operating characteristic curve analysis, interval-censored data, binary regression data, item response data, longitudinal and clustered data using generalized linear mixed models, and regression data using generalized additive models. The package also contains functions to compute pseudo-Bayes factors for model comparison and for eliciting the precision parameter of the Dirichlet process prior, and a general purpose Metropolis sampling algorithm. To maximize computational efficiency, the actual sampling for each model is carried out using compiled C, C++ or Fortran code.

  13. An Initial Analysis of LANDSAT-4 Thematic Mapper Data for the Discrimination of Agricultural, Forested Wetland, and Urban Land Covers

    Science.gov (United States)

    Quattrochi, D. A.

    1984-01-01

    An initial analysis of LANDSAT 4 Thematic Mapper (TM) data for the discrimination of agricultural, forested wetland, and urban land covers is conducted using a scene of data collected over Arkansas and Tennessee. A classification of agricultural lands derived from multitemporal LANDSAT Multispectral Scanner (MSS) data is compared with a classification of TM data for the same area. Results from this comparative analysis show that the multitemporal MSS classification produced an overall accuracy of 80.91% while the TM classification yields an overall classification accuracy of 97.06% correct.

  14. Morphometric analysis to discriminate between species: The case of the Megalobulimus leucostoma complex

    Directory of Open Access Journals (Sweden)

    Victor Borda

    2014-10-01

    Full Text Available Plasticity of conchological characters had led to erroneous descriptions and the accumulation of synonyms making difficult the discrimination among species. The land snail genus Megalobulimus is an example of this problem. Megalobulimus leucostoma (Sowerby, 1835 has three subspecies which are difficult to differentiate by using the original descriptions. The aim of this paper is to discriminate among the subspecies of M. leucostoma by using morphometric and distribution analyses. Both provide substantial differences between M. l. leucostoma and M. l lacunosus that would not support the subspecies status of the former. Megalobulimus leucostoma weyrauchi fits into the great conchological variability of M. l .leucostoma; also the sympatric status between these two subspecies would not support the subspecies status of the former, and M. l. weyrauchi should be considered as part of M. l. leucostoma.

  15. Micro-PIXE analysis of fish otoliths. Methodology and evaluation of first results for stock discrimination

    International Nuclear Information System (INIS)

    Sie, S.H.; Thresher, R.E.

    1992-01-01

    Micro-PIXE has been used to measure the trace element distribution in otoliths from several species of ocean fish, in order to investigate its possible use in stock discrimination. Trace elements detected include Sr, Fe, Mn, Ni, Zn, Cu, Se, Cd, Br, Hg and Pb. Trace elements Na, K, Cl, S and Cl were detected with the electron microprobe. The high sensitivity of PIXE demands a meticulous sample preparation procedure to avoid contamination problems. Practical problems associated with the application of the technique were investigated in detail. Preliminary results indicate that most trace elements except Sr, are present at close to the limits of detection at few ppm, but biologically significant data can be obtained for stock discrimination applications. (author)

  16. Scalable Bayesian nonparametric measures for exploring pairwise dependence via Dirichlet Process Mixtures.

    Science.gov (United States)

    Filippi, Sarah; Holmes, Chris C; Nieto-Barajas, Luis E

    2016-11-16

    In this article we propose novel Bayesian nonparametric methods using Dirichlet Process Mixture (DPM) models for detecting pairwise dependence between random variables while accounting for uncertainty in the form of the underlying distributions. A key criteria is that the procedures should scale to large data sets. In this regard we find that the formal calculation of the Bayes factor for a dependent-vs.-independent DPM joint probability measure is not feasible computationally. To address this we present Bayesian diagnostic measures for characterising evidence against a "null model" of pairwise independence. In simulation studies, as well as for a real data analysis, we show that our approach provides a useful tool for the exploratory nonparametric Bayesian analysis of large multivariate data sets.

  17. Analysis of Child Gender Discrimination Based on Adults' Consumption Patterns: Microdata Evidence from China

    OpenAIRE

    Feridoon Koohi-Kamali; R. Liu; Y. Liu

    2015-01-01

    The applications of the Rothbarth model of inferring child gender discrimination from the variations in parental living standard have consistently failed to uncover evidence for bias from surveys in countries with some of the world's worst welfare outcomes for girls. This paper demonstrates the importance of the remedies required for an effective implementation of that model with an application to a survey from urban China. The paper obtains econometric evidence for the presence of child gend...

  18. Post-Apartheid Trends in Gender Discrimination in South Africa: Analysis through Decomposition Techniques

    OpenAIRE

    Debra Shepherd

    2008-01-01

    Using appropriate econometric methods and 11 representative household surveys, this paper empirically assesses the extent and evolution of gender discrimination in the South African labour market over the post-apartheid period. Attention is also paid to the role that anti-discriminatory legislation has had to play in effecting change in the South African labour market. Much of the paper’s focus is placed on African women who would have benefited most from the new legislative environment. Afri...

  19. Prion strain discrimination based on rapid in vivo amplification and analysis by the cell panel assay.

    Directory of Open Access Journals (Sweden)

    Yervand Eduard Karapetyan

    Full Text Available Prion strain identification has been hitherto achieved using time-consuming incubation time determinations in one or more mouse lines and elaborate neuropathological assessment. In the present work, we make a detailed study of the properties of PrP-overproducing Tga20 mice. We show that in these mice the four prion strains examined are rapidly and faithfully amplified and can subsequently be discriminated by a cell-based procedure, the Cell Panel Assay.

  20. Pinpointing the classifiers of English language writing ability: A discriminant function analysis approach

    Directory of Open Access Journals (Sweden)

    Mohammad Ali Shams

    2013-02-01

    Full Text Available     The major aim of this paper was to investigate the validity of language and intelligence factors for classifying Iranian English learners` writing performance. Iranian participants of the study took three tests for grammar, breadth, and depth of vocabulary, and two tests for verbal and narrative intelligence. They also produced a corpus of argumentative writings in answer to IELTS specimen. Several runs of discriminant function analyses were used to examine the classifying power of the five variables for discriminating between low and high ability L2 writers. The results revealed that among language factors, depth of vocabulary (collocational knowledge produces the best discriminant function. In general, narrative intelligence was found to be the most reliable predictor for membership in low or high groups. It was also found that, among the five sub-abilities of narrative intelligence, emplotment carries the highest classifying value. Finally, the applications and implications of the results for second language researchers, cognitive scientists, and applied linguists were discussed.Â

  1. Sex determination using discriminant function analysis in Indigenous (Kurubas) children and adolescents of Coorg, Karnataka, India: A lateral cephalometric study.

    Science.gov (United States)

    Devang Divakar, Darshan; John, Jacob; Al Kheraif, Abdulaziz Abdullah; Mavinapalla, Seema; Ramakrishnaiah, Ravikumar; Vellappally, Sajith; Hashem, Mohamed Ibrahim; Dalati, M H N; Durgesh, B H; Safadi, Rima A; Anil, Sukumaran

    2016-11-01

    Aim: To test the validity of sex discrimination using lateral cephalometric radiograph and discriminant function analysis in Indigenous (Kuruba) children and adolescents of Coorg, Karnataka, India. Methods and materials: Six hundred and sixteen lateral cephalograms of 380 male and 236 females of age ranging from 6.5 to 18 years of Indigenous population of Coorg, Karnataka, India called Kurubas having a normal occlusion were included in the study. Lateral cephalograms were obtained in a standard position with teeth in centric occlusion and lips relaxed. Each radiograph was traced and cephalometric landmarks were measured using digital calliper. Calculations of 24 cephalometric measurements were performed. Results: Males exhibited significantly greater mean angular and linear cephalometric measurements as compared to females ( p  gender. The reliability of the derived discriminant function was assessed among study subjects; 100% of males and females were recognized correctly. Conclusion: The final outcome of this study validates the existence of sexual dimorphism in the skeleton as early as 6.5 years of age. There is a need for further research to determine other landmarks that can help in sex determination and norms for Indigenous (Kuruba) population and also other Indigenous population of Coorg, Karnataka, India.

  2. Genotypic and Phenotypic Analysis of Dairy Lactococcus lactis Biodiversity in Milk: Volatile Organic Compounds as Discriminating Markers

    Science.gov (United States)

    Dhaisne, Amandine; Guellerin, Maeva; Laroute, Valérie; Laguerre, Sandrine; Le Bourgeois, Pascal; Loubiere, Pascal

    2013-01-01

    The diversity of nine dairy strains of Lactococcus lactis subsp. lactis in fermented milk was investigated by both genotypic and phenotypic analyses. Pulsed-field gel electrophoresis and multilocus sequence typing were used to establish an integrated genotypic classification. This classification was coherent with discrimination of the L. lactis subsp. lactis bv. diacetylactis lineage and reflected clonal complex phylogeny and the uniqueness of the genomes of these strains. To assess phenotypic diversity, 82 variables were selected as important dairy features; they included physiological descriptors and the production of metabolites and volatile organic compounds (VOCs). Principal-component analysis (PCA) demonstrated the phenotypic uniqueness of each of these genetically closely related strains, allowing strain discrimination. A method of variable selection was developed to reduce the time-consuming experimentation. We therefore identified 20 variables, all associated with VOCs, as phenotypic markers allowing discrimination between strain groups. These markers are representative of the three metabolic pathways involved in flavor: lipolysis, proteolysis, and glycolysis. Despite great phenotypic diversity, the strains could be divided into four robust phenotypic clusters based on their metabolic orientations. Inclusion of genotypic diversity in addition to phenotypic characters in the classification led to five clusters rather than four being defined. However, genotypic characters make a smaller contribution than phenotypic variables (no genetic distances selected among the most contributory variables). This work proposes an original method for the phenotypic differentiation of closely related strains in milk and may be the first step toward a predictive classification for the manufacture of starters. PMID:23709512

  3. Transition redshift: new constraints from parametric and nonparametric methods

    Energy Technology Data Exchange (ETDEWEB)

    Rani, Nisha; Mahajan, Shobhit; Mukherjee, Amitabha [Department of Physics and Astrophysics, University of Delhi, New Delhi 110007 (India); Jain, Deepak [Deen Dayal Upadhyaya College, University of Delhi, New Delhi 110015 (India); Pires, Nilza, E-mail: nrani@physics.du.ac.in, E-mail: djain@ddu.du.ac.in, E-mail: shobhit.mahajan@gmail.com, E-mail: amimukh@gmail.com, E-mail: npires@dfte.ufrn.br [Departamento de Física Teórica e Experimental, UFRN, Campus Universitário, Natal, RN 59072-970 (Brazil)

    2015-12-01

    In this paper, we use the cosmokinematics approach to study the accelerated expansion of the Universe. This is a model independent approach and depends only on the assumption that the Universe is homogeneous and isotropic and is described by the FRW metric. We parametrize the deceleration parameter, q(z), to constrain the transition redshift (z{sub t}) at which the expansion of the Universe goes from a decelerating to an accelerating phase. We use three different parametrizations of q(z) namely, q{sub I}(z)=q{sub 1}+q{sub 2}z, q{sub II} (z) = q{sub 3} + q{sub 4} ln (1 + z) and q{sub III} (z)=½+q{sub 5}/(1+z){sup 2}. A joint analysis of the age of galaxies, strong lensing and supernovae Ia data indicates that the transition redshift is less than unity i.e. z{sub t} < 1. We also use a nonparametric approach (LOESS+SIMEX) to constrain z{sub t}. This too gives z{sub t} < 1 which is consistent with the value obtained by the parametric approach.

  4. Nonparametric Integrated Agrometeorological Drought Monitoring: Model Development and Application

    Science.gov (United States)

    Zhang, Qiang; Li, Qin; Singh, Vijay P.; Shi, Peijun; Huang, Qingzhong; Sun, Peng

    2018-01-01

    Drought is a major natural hazard that has massive impacts on the society. How to monitor drought is critical for its mitigation and early warning. This study proposed a modified version of the multivariate standardized drought index (MSDI) based on precipitation, evapotranspiration, and soil moisture, i.e., modified multivariate standardized drought index (MMSDI). This study also used nonparametric joint probability distribution analysis. Comparisons were done between standardized precipitation evapotranspiration index (SPEI), standardized soil moisture index (SSMI), MSDI, and MMSDI, and real-world observed drought regimes. Results indicated that MMSDI detected droughts that SPEI and/or SSMI failed to do. Also, MMSDI detected almost all droughts that were identified by SPEI and SSMI. Further, droughts detected by MMSDI were similar to real-world observed droughts in terms of drought intensity and drought-affected area. When compared to MMSDI, MSDI has the potential to overestimate drought intensity and drought-affected area across China, which should be attributed to exclusion of the evapotranspiration components from estimation of drought intensity. Therefore, MMSDI is proposed for drought monitoring that can detect agrometeorological droughts. Results of this study provide a framework for integrated drought monitoring in other regions of the world and can help to develop drought mitigation.

  5. Bayesian nonparametric clustering in phylogenetics: modeling antigenic evolution in influenza.

    Science.gov (United States)

    Cybis, Gabriela B; Sinsheimer, Janet S; Bedford, Trevor; Rambaut, Andrew; Lemey, Philippe; Suchard, Marc A

    2018-01-30

    Influenza is responsible for up to 500,000 deaths every year, and antigenic variability represents much of its epidemiological burden. To visualize antigenic differences across many viral strains, antigenic cartography methods use multidimensional scaling on binding assay data to map influenza antigenicity onto a low-dimensional space. Analysis of such assay data ideally leads to natural clustering of influenza strains of similar antigenicity that correlate with sequence evolution. To understand the dynamics of these antigenic groups, we present a framework that jointly models genetic and antigenic evolution by combining multidimensional scaling of binding assay data, Bayesian phylogenetic machinery and nonparametric clustering methods. We propose a phylogenetic Chinese restaurant process that extends the current process to incorporate the phylogenetic dependency structure between strains in the modeling of antigenic clusters. With this method, we are able to use the genetic information to better understand the evolution of antigenicity throughout epidemics, as shown in applications of this model to H1N1 influenza. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  6. Application of the exploratory analysis of data in the geographical discrimination of okra of Rio Grande do Norte and Pernambuco

    Directory of Open Access Journals (Sweden)

    Francisco Santos Panero

    2009-11-01

    Full Text Available The contents of Cu, Zn, Na, Fe, K, Ca, Mn, Mg, PO43-, Cl- and SO42- were determined in samples of okra of the municipal districts of Caruaru and Vitória de Santo Antão, in Pernambuco, as well as in the municipal districts of Ceará-Mirim, Macaíba and Extremoz in the state of Rio Grande do Norte. The objective of this work is the application of two methods of  exploratory analysis of data: Principal Component Analysis - PCA and Hierarquical Cluster Analysis - HCA in the geographical discrimination of okra originating in the states of Rio Grande do Norte and Pernambuco. The results showed that Cl- and Na were the main elements for the differentiation of the samples of Rio Grande do Norte and, the samples of Pernambuco presented the largest amount of Fe, Cu, Mn, Mg, Ca, Zn, K, PO43-, and SO42-. Boths the methods of exploratory analysis of data investigated are efficient for geographical discrimination of okra originating in Rio Grande do Norte and Pernambuco.

  7. Chemical discrimination of lubricant marketing types using direct analysis in real time time-of-flight mass spectrometry.

    Science.gov (United States)

    Maric, Mark; Harvey, Lauren; Tomcsak, Maren; Solano, Angelique; Bridge, Candice

    2017-06-30

    In comparison to other violent crimes, sexual assaults suffer from very low prosecution and conviction rates especially in the absence of DNA evidence. As a result, the forensic community needs to utilize other forms of trace contact evidence, like lubricant evidence, in order to provide a link between the victim and the assailant. In this study, 90 personal bottled and condom lubricants from the three main marketing types, silicone-based, water-based and condoms, were characterized by direct analysis in real time time of flight mass spectrometry (DART-TOFMS). The instrumental data was analyzed by multivariate statistics including hierarchal cluster analysis, principal component analysis, and linear discriminant analysis. By interpreting the mass spectral data with multivariate statistics, 12 discrete groupings were identified, indicating inherent chemical diversity not only between but within the three main marketing groups. A number of unique chemical markers, both major and minor, were identified, other than the three main chemical components (i.e. PEG, PDMS and nonoxynol-9) currently used for lubricant classification. The data was validated by a stratified 20% withheld cross-validation which demonstrated that there was minimal overlap between the groupings. Based on the groupings identified and unique features of each group, a highly discriminating statistical model was then developed that aims to provide the foundation for the development of a forensic lubricant database that may eventually be applied to casework. Copyright © 2017 John Wiley & Sons, Ltd. Copyright © 2017 John Wiley & Sons, Ltd.

  8. A discriminant analysis prediction model of non-syndromic cleft lip with or without cleft palate based on risk factors.

    Science.gov (United States)

    Li, Huixia; Luo, Miyang; Luo, Jiayou; Zheng, Jianfei; Zeng, Rong; Du, Qiyun; Fang, Junqun; Ouyang, Na

    2016-11-23

    A risk prediction model of non-syndromic cleft lip with or without cleft palate (NSCL/P) was established by a discriminant analysis to predict the individual risk of NSCL/P in pregnant women. A hospital-based case-control study was conducted with 113 cases of NSCL/P and 226 controls without NSCL/P. The cases and the controls were obtained from 52 birth defects' surveillance hospitals in Hunan Province, China. A questionnaire was administered in person to collect the variables relevant to NSCL/P by face to face interviews. Logistic regression models were used to analyze the influencing factors of NSCL/P, and a stepwise Fisher discriminant analysis was subsequently used to construct the prediction model. In the univariate analysis, 13 influencing factors were related to NSCL/P, of which the following 8 influencing factors as predictors determined the discriminant prediction model: family income, maternal occupational hazards exposure, premarital medical examination, housing renovation, milk/soymilk intake in the first trimester of pregnancy, paternal occupational hazards exposure, paternal strong tea drinking, and family history of NSCL/P. The model had statistical significance (lambda = 0.772, chi-square = 86.044, df = 8, P Self-verification showed that 83.8 % of the participants were correctly predicted to be NSCL/P cases or controls with a sensitivity of 74.3 % and a specificity of 88.5 %. The area under the receiver operating characteristic curve (AUC) was 0.846. The prediction model that was established using the risk factors of NSCL/P can be useful for predicting the risk of NSCL/P. Further research is needed to improve the model, and confirm the validity and reliability of the model.

  9. Discrimination of Wild Paris Based on Near Infrared Spectroscopy and High Performance Liquid Chromatography Combined with Multivariate Analysis

    Science.gov (United States)

    Zhao, Yanli; Zhang, Ji; Yuan, Tianjun; Shen, Tao; Li, Wei; Yang, Shihua; Hou, Ying; Wang, Yuanzhong; Jin, Hang

    2014-01-01

    Different geographical origins and species of Paris obtained from southwestern China were discriminated by near infrared (NIR) spectroscopy and high performance liquid chromatography (HPLC) combined with multivariate analysis. The NIR parameter settings were scanning (64 times), resolution (4 cm−1), scanning range (10000 cm−1∼4000 cm−1) and parallel collection (3 times). NIR spectrum was optimized by TQ 8.6 software, and the ranges 7455∼6852 cm−1 and 5973∼4007 cm−1 were selected according to the spectrum standard deviation. The contents of polyphyllin I, polyphyllin II, polyphyllin VI, and polyphyllin VII and total steroid saponins were detected by HPLC. The contents of chemical components data matrix and spectrum data matrix were integrated and analyzed by partial least squares discriminant analysis (PLS-DA). From the PLS-DA model of NIR spectrum, Paris samples were separated into three groups according to the different geographical origins. The R2X and Q2Y described accumulative contribution rates were 99.50% and 94.03% of the total variance, respectively. The PLS-DA model according to 12 species of Paris described 99.62% of the variation in X and predicted 95.23% in Y. The results of the contents of chemical components described differences among collections quantitatively. A multivariate statistical model of PLS-DA showed geographical origins of Paris had a much greater influence on Paris compared with species. NIR and HPLC combined with multivariate analysis could discriminate different geographical origins and different species. The quality of Paris showed regional dependence. PMID:24558477

  10. Spatial discrimination and visual discrimination

    DEFF Research Database (Denmark)

    Haagensen, Annika M. J.; Grand, Nanna; Klastrup, Signe

    2013-01-01

    Two methods investigating learning and memory in juvenile Gottingen minipigs were evaluated for potential use in preclinical toxicity testing. Twelve minipigs were tested using a spatial hole-board discrimination test including a learning phase and two memory phases. Five minipigs were tested...... in a visual discrimination test. The juvenile minipigs were able to learn the spatial hole-board discrimination test and showed improved working and reference memory during the learning phase. Performance in the memory phases was affected by the retention intervals, but the minipigs were able to remember...... the concept of the test in both memory phases. Working memory and reference memory were significantly improved in the last trials of the memory phases. In the visual discrimination test, the minipigs learned to discriminate between the three figures presented to them within 9-14 sessions. For the memory test...

  11. Study on non-linear bistable dynamics model based EEG signal discrimination analysis method.

    Science.gov (United States)

    Ying, Xiaoguo; Lin, Han; Hui, Guohua

    2015-01-01

    Electroencephalogram (EEG) is the recording of electrical activity along the scalp. EEG measures voltage fluctuations generating from ionic current flows within the neurons of the brain. EEG signal is looked as one of the most important factors that will be focused in the next 20 years. In this paper, EEG signal discrimination based on non-linear bistable dynamical model was proposed. EEG signals were processed by non-linear bistable dynamical model, and features of EEG signals were characterized by coherence index. Experimental results showed that the proposed method could properly extract the features of different EEG signals.

  12. Numerical experiment on different validation cases of water coolant flow in supercritical pressure test sections assisted by discriminated dimensional analysis part I: the dimensional analysis

    International Nuclear Information System (INIS)

    Kiss, A.; Aszodi, A.

    2011-01-01

    As recent studies prove in contrast to 'classical' dimensional analysis, whose application is widely described in heat transfer textbooks despite its poor results, the less well known and used discriminated dimensional analysis approach can provide a deeper insight into the physical problems involved and much better results in all cases where it is applied. As a first step of this ongoing research discriminated dimensional analysis has been performed on supercritical pressure water pipe flow heated through the pipe solid wall to identify the independent dimensionless groups (which play an independent role in the above mentioned thermal hydraulic phenomena) in order to serve a theoretical base to comparison between well known supercritical pressure water pipe heat transfer experiments and results of their validated CFD simulations. (author)

  13. Characterization and Discrimination of Gram-Positive Bacteria Using Raman Spectroscopy with the Aid of Principal Component Analysis

    Directory of Open Access Journals (Sweden)

    Alia Colniță

    2017-09-01

    Full Text Available Raman scattering and its particular effect, surface-enhanced Raman scattering (SERS, are whole-organism fingerprinting spectroscopic techniques that gain more and more popularity in bacterial detection. In this work, two relevant Gram-positive bacteria species, Lactobacillus casei (L. casei and Listeria monocytogenes (L. monocytogenes were characterized based on their Raman and SERS spectral fingerprints. The SERS spectra were used to identify the biochemical structures of the bacterial cell wall. Two synthesis methods of the SERS-active nanomaterials were used and the recorded spectra were analyzed. L. casei and L. monocytogenes were successfully discriminated by applying Principal Component Analysis (PCA to their specific spectral data.

  14. A multitemporal and non-parametric approach for assessing the impacts of drought on vegetation greenness

    DEFF Research Database (Denmark)

    Carrao, Hugo; Sepulcre, Guadalupe; Horion, Stéphanie Marie Anne F

    2013-01-01

    This study evaluates the relationship between the frequency and duration of meteorological droughts and the subsequent temporal changes on the quantity of actively photosynthesizing biomass (greenness) estimated from satellite imagery on rainfed croplands in Latin America. An innovative non-parametric...... and non-supervised approach, based on the Fisher-Jenks optimal classification algorithm, is used to identify multi-scale meteorological droughts on the basis of empirical cumulative distributions of 1, 3, 6, and 12-monthly precipitation totals. As input data for the classifier, we use the gridded GPCC...... for the period between 1998 and 2010. The time-series analysis of vegetation greenness is performed during the growing season with a non-parametric method, namely the seasonal Relative Greenness (RG) of spatially accumulated fAPAR. The Global Land Cover map of 2000 and the GlobCover maps of 2005/2006 and 2009...

  15. Contributions to sensitivity analysis and generalized discriminant analysis; Contributions a l'analyse de sensibilite et a l'analyse discriminante generalisee

    Energy Technology Data Exchange (ETDEWEB)

    Jacques, J

    2005-12-15

    Two topics are studied in this thesis: sensitivity analysis and generalized discriminant analysis. Global sensitivity analysis of a mathematical model studies how the output variables of this last react to variations of its inputs. The methods based on the study of the variance quantify the part of variance of the response of the model due to each input variable and each subset of input variables. The first subject of this thesis is the impact of a model uncertainty on results of a sensitivity analysis. Two particular forms of uncertainty are studied: that due to a change of the model of reference, and that due to the use of a simplified model with the place of the model of reference. A second problem was studied during this thesis, that of models with correlated inputs. Indeed, classical sensitivity indices not having significance (from an interpretation point of view) in the presence of correlation of the inputs, we propose a multidimensional approach consisting in expressing the sensitivity of the output of the model to groups of correlated variables. Applications in the field of nuclear engineering illustrate this work. Generalized discriminant analysis consists in classifying the individuals of a test sample in groups, by using information contained in a training sample, when these two samples do not come from the same population. This work extends existing methods in a Gaussian context to the case of binary data. An application in public health illustrates the utility of generalized discrimination models thus defined. (author)

  16. Contributions to sensitivity analysis and generalized discriminant analysis; Contributions a l'analyse de sensibilite et a l'analyse discriminante generalisee

    Energy Technology Data Exchange (ETDEWEB)

    Jacques, J

    2005-12-15

    Two topics are studied in this thesis: sensitivity analysis and generalized discriminant analysis. Global sensitivity analysis of a mathematical model studies how the output variables of this last react to variations of its inputs. The methods based on the study of the variance quantify the part of variance of the response of the model due to each input variable and each subset of input variables. The first subject of this thesis is the impact of a model uncertainty on results of a sensitivity analysis. Two particular forms of uncertainty are studied: that due to a change of the model of reference, and that due to the use of a simplified model with the place of the model of reference. A second problem was studied during this thesis, that of models with correlated inputs. Indeed, classical sensitivity indices not having significance (from an interpretation point of view) in the presence of correlation of the inputs, we propose a multidimensional approach consisting in expressing the sensitivity of the output of the model to groups of correlated variables. Applications in the field of nuclear engineering illustrate this work. Generalized discriminant analysis consists in classifying the individuals of a test sample in groups, by using information contained in a training sample, when these two samples do not come from the same population. This work extends existing methods in a Gaussian context to the case of binary data. An application in public health illustrates the utility of generalized discrimination models thus defined. (author)

  17. Discriminative analysis of Parkinson's disease based on whole-brain functional connectivity.

    Directory of Open Access Journals (Sweden)

    Yongbin Chen

    Full Text Available Recently, there has been an increasing emphasis on applications of pattern recognition and neuroimaging techniques in the effective and accurate diagnosis of psychiatric or neurological disorders. In the present study, we investigated the whole-brain resting-state functional connectivity patterns of Parkinson's disease (PD, which are expected to provide additional information for the clinical diagnosis and treatment of this disease. First, we computed the functional connectivity between each pair of 116 regions of interest derived from a prior atlas. The most discriminative features based on Kendall tau correlation coefficient were then selected. A support vector machine classifier was employed to classify 21 PD patients with 26 demographically matched healthy controls. This method achieved a classification accuracy of 93.62% using leave-one-out cross-validation, with a sensitivity of 90.47% and a specificity of 96.15%. The majority of the most discriminative functional connections were located within or across the default mode, cingulo-opercular and frontal-parietal networks and the cerebellum. These disease-related resting-state network alterations might play important roles in the pathophysiology of this disease. Our results suggest that analyses of whole-brain resting-state functional connectivity patterns have the potential to improve the clinical diagnosis and treatment evaluation of PD.

  18. Stock discrimination in Great Lakes Walleye using mitochondrial DNA restriction analysis

    International Nuclear Information System (INIS)

    Billington, N.; Hebert, P.D.N.

    1986-01-01

    Over the past two years it has become evident that because of its strict maternal inheritance and rapid rate of evolutionary differentiation, mitochondrial (mt) DNA diversity offers exceptional promise in the discrimination of fish stocks. The current project aims to determine the extent of mt DNA variation among stocks of walleye (Stizostedion vitreum) from the Great Lakes. At this point, mt DNA has been isolated from 68 walleye representing the Thames River stock and a reef breeding stock from western Lake Erie, as well as from individuals of S. canadense, a species which hybridizes with S. vitreum. Mitochondrial DNA was extracted from livers of these fish, purified by CsCl density gradient centrifugation and digested using 20 endonucleases. Polymorphisms were detected with 8 of the enzymes. There was a great deal of variation among fish from both spawning populations, so much so that individual fish could be identified by this technique. No single enzyme allowed discrimination of the two stocks, but restriction pattern variation following Dde I digestion permitted separation of 50% of Lake Erie fish from Thames River stock. Comparison of mt DNA restriction patterns of walleye and sauger showed that two species are easily separable, setting the stage for a more detailed study of hybridization between the taxa

  19. Predicting Market Impact Costs Using Nonparametric Machine Learning Models.

    Directory of Open Access Journals (Sweden)

    Saerom Park

    Full Text Available Market impact cost is the most significant portion of implicit transaction costs that can reduce the overall transaction cost, although it cannot be measured directly. In this paper, we employed the state-of-the-art nonparametric machine learning models: neural networks, Bayesian neural network, Gaussian process, and support vector regression, to predict market impact cost accurately and to provide the predictive model that is versatile in the number of variables. We collected a large amount of real single transaction data of US stock market from Bloomberg Terminal and generated three independent input variables. As a result, most nonparametric machine learning models outperformed a-state-of-the-art benchmark parametric model such as I-star model in four error measures. Although these models encounter certain difficulties in separating the permanent and temporary cost directly, nonparametric machine learning models can be good alternatives in reducing transaction costs by considerably improving in prediction performance.

  20. Predicting Market Impact Costs Using Nonparametric Machine Learning Models.

    Science.gov (United States)

    Park, Saerom; Lee, Jaewook; Son, Youngdoo

    2016-01-01

    Market impact cost is the most significant portion of implicit transaction costs that can reduce the overall transaction cost, although it cannot be measured directly. In this paper, we employed the state-of-the-art nonparametric machine learning models: neural networks, Bayesian neural network, Gaussian process, and support vector regression, to predict market impact cost accurately and to provide the predictive model that is versatile in the number of variables. We collected a large amount of real single transaction data of US stock market from Bloomberg Terminal and generated three independent input variables. As a result, most nonparametric machine learning models outperformed a-state-of-the-art benchmark parametric model such as I-star model in four error measures. Although these models encounter certain difficulties in separating the permanent and temporary cost directly, nonparametric machine learning models can be good alternatives in reducing transaction costs by considerably improving in prediction performance.

  1. A nonparametric spatial scan statistic for continuous data.

    Science.gov (United States)

    Jung, Inkyung; Cho, Ho Jin

    2015-10-20

    Spatial scan statistics are widely used for spatial cluster detection, and several parametric models exist. For continuous data, a normal-based scan statistic can be used. However, the performance of the model has not been fully evaluated for non-normal data. We propose a nonparametric spatial scan statistic based on the Wilcoxon rank-sum test statistic and compared the performance of the method with parametric models via a simulation study under various scenarios. The nonparametric method outperforms the normal-based scan statistic in terms of power and accuracy in almost all cases under consideration in the simulation study. The proposed nonparametric spatial scan statistic is therefore an excellent alternative to the normal model for continuous data and is especially useful for data following skewed or heavy-tailed distributions.

  2. Morphological evaluation of common bean diversity in Bosnia and Herzegovina using the discriminant analysis of principal components (DAPC multivariate method

    Directory of Open Access Journals (Sweden)

    Grahić Jasmin

    2013-01-01

    Full Text Available In order to analyze morphological characteristics of locally cultivated common bean landraces from Bosnia and Herzegovina (B&H, thirteen quantitative and qualitative traits of 40 P. vulgaris accessions, collected from four geographical regions (Northwest B&H, Northeast B&H, Central B&H and Sarajevo and maintained at the Gene bank of the Faculty of Agriculture and Food Sciences in Sarajevo, were examined. Principal component analysis (PCA showed that the proportion of variance retained in the first two principal components was 54.35%. The first principal component had high contributing factor loadings from seed width, seed height and seed weight, whilst the second principal component had high contributing factor loadings from the analyzed traits seed per pod and pod length. PCA plot, based on the first two principal components, displayed a high level of variability among the analyzed material. The discriminant analysis of principal components (DAPC created 3 discriminant functions (DF, whereby the first two discriminant functions accounted for 90.4% of the variance retained. Based on the retained DFs, DAPC provided group membership probabilities which showed that 70% of the accessions examined were correctly classified between the geographically defined groups. Based on the taxonomic distance, 40 common bean accessions analyzed in this study formed two major clusters, whereas two accessions Acc304 and Acc307 didn’t group in any of those. Acc360 and Acc362, as well as Acc324 and Acc371 displayed a high level of similarity and are probably the same landrace. The present diversity of Bosnia and Herzegovina’s common been landraces could be useful in future breeding programs.

  3. Mass discrimination

    Energy Technology Data Exchange (ETDEWEB)

    Broeckman, A. [Rijksuniversiteit Utrecht (Netherlands)

    1978-12-15

    In thermal ionization mass spectrometry the phenomenon of mass discrimination has led to the use of a correction factor for isotope ratio-measurements. The correction factor is defined as the measured ratio divided by the true or accepted value of this ratio. In fact this factor corrects for systematic errors of the whole procedure; however mass discrimination is often associated just with the mass spectrometer.

  4. Application of linear discriminant analysis and Attenuated Total Reflectance Fourier Transform Infrared microspectroscopy for diagnosis of colon cancer.

    Science.gov (United States)

    Khanmohammadi, Mohammadreza; Bagheri Garmarudi, Amir; Samani, Simin; Ghasemi, Keyvan; Ashuri, Ahmad

    2011-06-01

    Attenuated Total Reflectance Fourier Transform Infrared (ATR-FTIR) microspectroscopy was applied for detection of colon cancer according to the spectral features of colon tissues. Supervised classification models can be trained to identify the tissue type based on the spectroscopic fingerprint. A total of 78 colon tissues were used in spectroscopy studies. Major spectral differences were observed in 1,740-900 cm(-1) spectral region. Several chemometric methods such as analysis of variance (ANOVA), cluster analysis (CA) and linear discriminate analysis (LDA) were applied for classification of IR spectra. Utilizing the chemometric techniques, clear and reproducible differences were observed between the spectra of normal and cancer cases, suggesting that infrared microspectroscopy in conjunction with spectral data processing would be useful for diagnostic classification. Using LDA technique, the spectra were classified into cancer and normal tissue classes with an accuracy of 95.8%. The sensitivity and specificity was 100 and 93.1%, respectively.

  5. Quantitative analysis of the clinical data on leukemia, 5. Specificity of clinical features in acute myelocytic leukemia with 8; 21 translocation by multiple logistic discriminant analysis

    Energy Technology Data Exchange (ETDEWEB)

    Ueoka, Hiroshi; Kamada, Nanao; Yamamoto, Hisashi; Ohtaki, Megu; Takimoto, Yasuo; Kuramoto, Atsushi; Munaka, Masaki

    1984-11-01

    In order to determine the necessity of chromosome analysis required for the evaluation of 8;21 translocation, multiple logistic discriminant analysis was made on 124 patients with acute non-lymphocytic leukemia experienced in the authors' institution. Variables which showed positive correlation with the presence of 8;21 translocation were the presence of Auer body and granular abnormality of the cells, numbers of peripheral promyelocytes, myelocytes and metamyelocytes, and bone marrow promyelocytes, myelocytes, and the sum of rods and segments. Those which showed negative correlation with 8;21 translocation were peripheral platelet count, neutrocytealkaline phosphatase (N-AP) score, numbers of eosinocytes, monocytes and erythroblasts, and erythroblasts on myelogram. Auer body, four peripheral hematological features (platelet count, N-AP score, metamyelocytes and monocytes), and three myelogram features (myelocytes, reticular cells and granulocytes/eosionocytes) were used for the multiple logistic discriminant analysis. By the analysis, 2 of the 22 patients (9.1%) with translocation were judged not to have 8;21 translocation and 3 of the 102 patients (2.9%) without translocation were judged to have it. Therefore, this multiple logistic discriminant method has proved to be simple and useful in clinically evaluating acute non-lymphocytic leukemia. (Namekawa, K.).

  6. Like/dislike analysis using EEG: determination of most discriminative channels and frequencies.

    Science.gov (United States)

    Yılmaz, Bülent; Korkmaz, Sümeyye; Arslan, Dilek Betül; Güngör, Evrim; Asyalı, Musa H

    2014-02-01

    In this study, we have analyzed electroencephalography (EEG) signals to investigate the following issues, (i) which frequencies and EEG channels could be relatively better indicators of preference (like or dislike decisions) of consumer products, (ii) timing characteristic of "like" decisions during such mental processes. For this purpose, we have obtained multichannel EEG recordings from 15 subjects, during total of 16 epochs of 10 s long, while they were presented with some shoe photographs. When they liked a specific shoe, they pressed on a button and marked the time of this activity and the particular epoch was labeled as a LIKE case. No button press meant that the subject did not like the particular shoe that was displayed and corresponding epoch designated as a DISLIKE case. After preprocessing, power spectral density (PSD) of EEG data was estimated at different frequencies (4, 5, …, 40 Hz) using the Burg method, for each epoch corresponding to one shoe presentation. Each subject's data consisted of normalized PSD values (NPVs) from all LIKE and DISLIKE cases/epochs coming from all 19 EEG channels. In order to determine the most discriminative frequencies and channels, we have utilized logistic regression, where LIKE/DISLIKE status was used as a categorical (binary) response variable and corresponding NPVs were the continuously valued input variables or predictors. We observed that when all the NPVs (total of 37) are used as predictors, the regression problem was becoming ill-posed due to large number of predictors (compared to the number of samples) and high correlation among predictors. To circumvent this issue, we have divided the frequency band into low frequency (LF) 4-19 Hz and high frequency (HF) 20-40 Hz bands and analyzed the influence of the NPV in these bands separately. Then, using the p-values that indicate how significantly estimated predictor weights are different than zero, we have determined the NPVs and channels that are more influential

  7. Acromegaly determination using discriminant analysis of the three-dimensional facial classification in Taiwanese.

    Science.gov (United States)

    Wang, Ming-Hsu; Lin, Jen-Der; Chang, Chen-Nen; Chiou, Wen-Ko

    2017-08-01

    The aim of this study was to assess the size, angles and positional characteristics of facial anthropometry between "acromegalic" patients and control subjects. We also identify possible facial soft tissue measurements for generating discriminant functions toward acromegaly determination in males and females for acromegaly early self-awareness. This is a cross-sectional study. Subjects participating in this study included 70 patients diagnosed with acromegaly (35 females and 35 males) and 140 gender-matched control individuals. Three-dimensional facial images were collected via a camera system. Thirteen landmarks were selected. Eleven measurements from the three categories were selected and applied, including five frontal widths, three lateral depths and three lateral angular measurements. Descriptive analyses were conducted using means and standard deviations for each measurement. Univariate and multivariate discriminant function analyses were applied in order to calculate the accuracy of acromegaly detection. Patients with acromegaly exhibit soft-tissue facial enlargement and hypertrophy. Frontal widths as well as lateral depth and angle of facial changes were evident. The average accuracies of all functions for female patient detection ranged from 80.0-91.40%. The average accuracies of all functions for male patient detection were from 81.0-94.30%. The greatest anomaly observed was evidenced in the lateral angles, with greater enlargement of "nasofrontal" angles for females and greater "mentolabial" angles for males. Additionally, shapes of the lateral angles showed changes. The majority of the facial measurements proved dynamic for acromegaly patients; however, it is problematic to detect the disease with progressive body anthropometric changes. The discriminant functions of detection developed in this study could help patients, their families, medical practitioners and others to identify and track progressive facial change patterns before the possible patients

  8. Sensitivity of cognitive tests in four cognitive domains in discriminating MDD patients from healthy controls: a meta-analysis.

    Science.gov (United States)

    Lim, JaeHyoung; Oh, In Kyung; Han, Changsu; Huh, Yu Jeong; Jung, In-Kwa; Patkar, Ashwin A; Steffens, David C; Jang, Bo-Hyoung

    2013-09-01

    We performed a meta-analysis in order to determine which neuropsychological domains and tasks would be most sensitive for discriminating between patients with major depressive disorder (MDD) and healthy controls. Relevant articles were identified through a literature search of the PubMed and Cochrane Library databases for the period between January 1997 and May 2011. A meta-analysis was conducted using the standardized means of individual cognitive tests in each domain. The heterogeneity was assessed, and subgroup analyses according to age and medication status were performed to explore the sources of heterogeneity. A total of 22 trials involving 955 MDD patients and 7,664 healthy participants were selected for our meta-analysis. MDD patients showed significantly impaired results compared with healthy participants on the Digit Span and Continuous Performance Test in the attention domain; the Trail Making Test A (TMT-A) and the Digit Symbol Test in the processing speed domain; the Stroop Test, the Wisconsin Card Sorting Test, and Verbal Fluency in the executive function domain; and immediate verbal memory in the memory domain. The Finger Tapping Task, TMT-B, delayed verbal memory, and immediate and delayed visual memory failed to separate MDD patients from healthy controls. The results of subgroup analysis showed that performance of Verbal Fluency was significantly impaired in younger depressed patients (memory was significantly reduced in depressed patients using antidepressants. Our findings have inevitable limitations arising from methodological issues inherent in the meta-analysis and we could not explain high heterogeneity between studies. Despite such limitations, current study has the strength of being the first meta-analysis which tried to specify cognitive function of depressed patients compared with healthy participants. And our findings may provide clinicians with further evidences that some cognitive tests in specific cognitive domains have sensitivity

  9. Nonparametric regression using the concept of minimum energy

    International Nuclear Information System (INIS)

    Williams, Mike

    2011-01-01

    It has recently been shown that an unbinned distance-based statistic, the energy, can be used to construct an extremely powerful nonparametric multivariate two sample goodness-of-fit test. An extension to this method that makes it possible to perform nonparametric regression using multiple multivariate data sets is presented in this paper. The technique, which is based on the concept of minimizing the energy of the system, permits determination of parameters of interest without the need for parametric expressions of the parent distributions of the data sets. The application and performance of this new method is discussed in the context of some simple example analyses.

  10. Proteome comparison for discrimination between honeydew and floral honeys from botanical species Mimosa scabrella Bentham by principal component analysis.

    Science.gov (United States)

    Azevedo, Mônia Stremel; Valentim-Neto, Pedro Alexandre; Seraglio, Siluana Katia Tischer; da Luz, Cynthia Fernandes Pinto; Arisi, Ana Carolina Maisonnave; Costa, Ana Carolina Oliveira

    2017-10-01

    Due to the increasing valuation and appreciation of honeydew honey in many European countries and also to existing contamination among different types of honeys, authentication is an important aspect of quality control with regard to guaranteeing the origin in terms of source (honeydew or floral) and needs to be determined. Furthermore, proteins are minor components of the honey, despite the importance of their physiological effects, and can differ according to the source of the honey. In this context, the aims of this study were to carry out protein extraction from honeydew and floral honeys and to discriminate these honeys from the same botanical species, Mimosa scabrella Bentham, through proteome comparison using two-dimensional gel electrophoresis and principal component analysis. The results showed that the proteome profile and principal component analysis can be a useful tool for discrimination between these types of honey using matched proteins (45 matched spots). Also, the proteome profile showed 160 protein spots in honeydew honey and 84 spots in the floral honey. The protein profile can be a differential characteristic of this type of honey, in view of the importance of proteins as bioactive compounds in honey. © 2017 Society of Chemical Industry. © 2017 Society of Chemical Industry.

  11. HDclassif : An R Package for Model-Based Clustering and Discriminant Analysis of High-Dimensional Data

    Directory of Open Access Journals (Sweden)

    Laurent Berge

    2012-01-01

    Full Text Available This paper presents the R package HDclassif which is devoted to the clustering and the discriminant analysis of high-dimensional data. The classification methods proposed in the package result from a new parametrization of the Gaussian mixture model which combines the idea of dimension reduction and model constraints on the covariance matrices. The supervised classification method using this parametrization is called high dimensional discriminant analysis (HDDA. In a similar manner, the associated clustering method iscalled high dimensional data clustering (HDDC and uses the expectation-maximization algorithm for inference. In order to correctly t the data, both methods estimate the specific subspace and the intrinsic dimension of the groups. Due to the constraints on the covariance matrices, the number of parameters to estimate is significantly lower than other model-based methods and this allows the methods to be stable and efficient in high dimensions. Two introductory examples illustrated with R codes allow the user to discover the hdda and hddc functions. Experiments on simulated and real datasets also compare HDDC and HDDA with existing classification methods on high-dimensional datasets. HDclassif is a free software and distributed under the general public license, as part of the R software project.

  12. Discriminant analysis of characteristics determining acceptance or rejection of nuclear power

    International Nuclear Information System (INIS)

    Holsapple, C.W.; Whinston, A.B.

    1977-01-01

    This study utilizes the linear discriminant model to analyze demographic and attitudinal data concerning the construction of a nuclear power facility at the Bailly site in northern Indiana. The objective is to ascertain the extent to which various respondent characteristics are useful in distinguishing among respondent attitudes (opposed, in favor, unsure) toward the Bailly project. Examination of reduced space characteristics leads the authors to postulate an interpretation of its two dimensions as respondent uncertainty and respondent resistance. The largest contributor (positive) to uncertainty was found to be a divorced or separated marital status; the greatest contributer (negative) to resistance was found to be home ownership. Both of these respondent characteristics were significant in the univariate sense. A particularly striking trend was the reliance of the opposed group upon electronic media as the source of most local news, whereas the other two groups tended to rely most heavily on newspapers

  13. Combining generative and discriminative representation learning for lung CT analysis with convolutional restricted Boltzmann machines

    DEFF Research Database (Denmark)

    van Tulder, Gijs; de Bruijne, Marleen

    2016-01-01

    The choice of features greatly influences the performance of a tissue classification system. Despite this, many systems are built with standard, predefined filter banks that are not optimized for that particular application. Representation learning methods such as restricted Boltzmann machines may...... outperform these standard filter banks because they learn a feature description directly from the training data. Like many other representation learning methods, restricted Boltzmann machines are unsupervised and are trained with a generative learning objective; this allows them to learn representations from...... unlabeled data, but does not necessarily produce features that are optimal for classification. In this paper we propose the convolutional classification restricted Boltzmann machine, which combines a generative and a discriminative learning objective. This allows it to learn filters that are good both...

  14. Male-female discrimination: an analysis of gender gap and its determinants

    Directory of Open Access Journals (Sweden)

    Claudio Quintano

    2013-05-01

    Full Text Available In recent years, the occupational dynamics have brought in significant innovations in Italy, as the increased participation of women in the labour market, that have stimulated studies about the gender wage gap, concerning the different remuneration reserved to male and female workers. In this work the Authors, following Oaxaca and Blinder approach, estimate the gap for Italian employers and proceed to its decomposition, one part due to differences in individual characteristics (endowment effect and another part due to the different returns on the same characteristics (coefficient effect, related to discrimination. Then, the gender wage gap and its decomposition is analyzed with reference to Italian macro-areas considered separately with the aim to highlight the different fundamental dynamics. The model has also been modified using the Heckmann correction to eliminate the bias due to self-selection; i.e. the different propensity to work for men and women.

  15. Discriminating poststroke depression from stroke by nuclear magnetic resonance spectroscopy-based metabonomic analysis

    Directory of Open Access Journals (Sweden)

    Xiao J

    2016-08-01

    Full Text Available Jianqi Xiao,1,* Jie Zhang,2,* Dan Sun,3,* Lin Wang,4,* Lijun Yu,5 Hongjing Wu,5 Dan Wang,5 Xuerong Qiu5 1Department of Neurosurgery, The First Hospital of Qiqihar City, Qiqihar, 2Department of Internal Medicine, Central Hospital of Jiamusi City, Jiamusi, 3Department of Geriatrics, General Hospital of Daqing Oil Field, Daqing, 4Department of Nursing, 5Department of Neurology, The First Hospital of Qiqihar City, Qiqihar, Heilongjiang, People’s Republic of China *These authors contributed equally to this work Abstract: Poststroke depression (PSD, the most common psychiatric disease that stroke survivors face, is estimated to affect ~30% of poststroke patients. However, there are still no objective methods to diagnose PSD. In this study, to explore the differential metabolites in the urine of PSD subjects and to identify a potential biomarker panel for PSD diagnosis, the nuclear magnetic resonance-based metabonomic method was applied. Ten differential metabolites responsible for discriminating PSD subjects from healthy control (HC and stroke subjects were found, and five of these metabolites were identified as potential biomarkers (lactate, α-hydroxybutyrate, phenylalanine, formate, and arabinitol. The panel consisting of these five metabolites provided excellent performance in discriminating PSD subjects from HC and stroke subjects, achieving an area under the receiver operating characteristic curve of 0.946 in the training set (43 HC, 45 stroke, and 62 PSD subjects. Moreover, this panel could classify the blinded samples from the test set (31 HC, 33 stroke, and 32 PSD subjects with an area under the curve of 0.946. These results laid a foundation for the future development of urine-based objective methods for PSD diagnosis and investigation of PSD pathogenesis. Keywords: poststroke depression, PSD, stroke, nuclear magnetic resonance, NMR, metabonomic

  16. Generalized Hyperalgesia in Children and Adults Diagnosed With Hypermobility Syndrome and Ehlers-Danlos Syndrome Hypermobility Type: A Discriminative Analysis.

    Science.gov (United States)

    Scheper, M C; Pacey, V; Rombaut, L; Adams, R D; Tofts, L; Calders, P; Nicholson, L L; Engelbert, R H H

    2017-03-01

    Lowered pressure-pain thresholds have been demonstrated in adults with Ehlers-Danlos syndrome hypermobility type (EDS-HT), but whether these findings are also present in children is unclear. Therefore, the objectives of the study were to determine whether generalized hyperalgesia is present in children with hypermobility syndrome (HMS)/EDS-HT, explore potential differences in pressure-pain thresholds between children and adults with HMS/EDS-HT, and determine the discriminative value of generalized hyperalgesia. Patients were classified in 1 of 3 groups: HMS/EDS-HT, hypermobile (Beighton score ≥4 of 9), and healthy controls. Descriptive data of age, sex, body mass index, Beighton score, skin laxity, and medication usage were collected. Generalized hyperalgesia was quantified by the average pressure-pain thresholds collected from 12 locations. Confounders collected were pain locations/intensity, fatigue, and psychological distress. Comparisons between children with HMS/EDS-HT and normative values, between children and adults with HMS/EDS-HT, and corrected confounders were analyzed with multivariate analysis of covariance. The discriminative value of generalized hyperalgesia employed to differentiate between HMS/EDS-HT, hypermobility, and controls was quantified with logistic regression. Significantly lower pressure-pain thresholds were found in children with HMS/EDS-HT compared to normative values (range -22.0% to -59.0%; P ≤ 0.05). When applying a threshold of 30.8 N/cm 2 for males and 29.0 N/cm 2 for females, the presence of generalized hyperalgesia discriminated between individuals with HMS/EDS-HT, hypermobility, and healthy controls (odds ratio 6.0). Children and adults with HMS/EDS-HT are characterized by hypermobility, chronic pain, and generalized hyperalgesia. The presence of generalized hyperalgesia may indicate involvement of the central nervous system in the development of chronic pain. © 2016, American College of Rheumatology.

  17. A new powerful non-parametric two-stage approach for testing multiple phenotypes in family-based association studies

    NARCIS (Netherlands)

    Lange, C; Lyon, H; DeMeo, D; Raby, B; Silverman, EK; Weiss, ST

    2003-01-01

    We introduce a new powerful nonparametric testing strategy for family-based association studies in which multiple quantitative traits are recorded and the phenotype with the strongest genetic component is not known prior to the analysis. In the first stage, using a population-based test based on the

  18. Discrimination between Bacillus and Alicyclobacillus isolates in apple juice by Fourier transform infrared spectroscopy and multivariate analysis.

    Science.gov (United States)

    Al-Holy, Murad A; Lin, Mengshi; Alhaj, Omar A; Abu-Goush, Mahmoud H

    2015-02-01

    Alicyclobacillus is a causative agent of spoilage in pasteurized and heat-treated apple juice products. Differentiating between this genus and the closely related Bacillus is crucially important. In this study, Fourier transform infrared spectroscopy (FT-IR) was used to identify and discriminate between 4 Alicyclobacillus strains and 4 Bacillus isolates inoculated individually into apple juice. Loading plots over the range of 1350 and 1700 cm(-1) reflected the most distinctive biochemical features of Bacillus and Alicyclobacillus. Multivariate statistical methods (for example, principal component analysis and soft independent modeling of class analogy) were used to analyze the spectral data. Distinctive separation of spectral samples was observed. This study demonstrates that FT-IR spectroscopy in combination with multivariate analysis could serve as a rapid and effective tool for fruit juice industry to differentiate between Bacillus and Alicyclobacillus and to distinguish between species belonging to these 2 genera. © 2015 Institute of Food Technologists®

  19. Nonparametric Bayesian models for a spatial covariance.

    Science.gov (United States)

    Reich, Brian J; Fuentes, Montserrat

    2012-01-01

    A crucial step in the analysis of spatial data is to estimate the spatial correlation function that determines the relationship between a spatial process at two locations. The standard approach to selecting the appropriate correlation function is to use prior knowledge or exploratory analysis, such as a variogram analysis, to select the correct parametric correlation function. Rather that selecting a particular parametric correlation function, we treat the covariance function as an unknown function to be estimated from the data. We propose a flexible prior for the correlation function to provide robustness to the choice of correlation function. We specify the prior for the correlation function using spectral methods and the Dirichlet process prior, which is a common prior for an unknown distribution function. Our model does not require Gaussian data or spatial locations on a regular grid. The approach is demonstrated using a simulation study as well as an analysis of California air pollution data.

  20. Efficient nonparametric estimation of causal mediation effects

    OpenAIRE

    Chan, K. C. G.; Imai, K.; Yam, S. C. P.; Zhang, Z.

    2016-01-01

    An essential goal of program evaluation and scientific research is the investigation of causal mechanisms. Over the past several decades, causal mediation analysis has been used in medical and social sciences to decompose the treatment effect into the natural direct and indirect effects. However, all of the existing mediation analysis methods rely on parametric modeling assumptions in one way or another, typically requiring researchers to specify multiple regression models involving the treat...