Local kernel nonparametric discriminant analysis for adaptive extraction of complex structures
Li, Quanbao; Wei, Fajie; Zhou, Shenghan
2017-05-01
The linear discriminant analysis (LDA) is one of popular means for linear feature extraction. It usually performs well when the global data structure is consistent with the local data structure. Other frequently-used approaches of feature extraction usually require linear, independence, or large sample condition. However, in real world applications, these assumptions are not always satisfied or cannot be tested. In this paper, we introduce an adaptive method, local kernel nonparametric discriminant analysis (LKNDA), which integrates conventional discriminant analysis with nonparametric statistics. LKNDA is adept in identifying both complex nonlinear structures and the ad hoc rule. Six simulation cases demonstrate that LKNDA have both parametric and nonparametric algorithm advantages and higher classification accuracy. Quartic unilateral kernel function may provide better robustness of prediction than other functions. LKNDA gives an alternative solution for discriminant cases of complex nonlinear feature extraction or unknown feature extraction. At last, the application of LKNDA in the complex feature extraction of financial market activities is proposed.
Bayesian nonparametric data analysis
Müller, Peter; Jara, Alejandro; Hanson, Tim
2015-01-01
This book reviews nonparametric Bayesian methods and models that have proven useful in the context of data analysis. Rather than providing an encyclopedic review of probability models, the book’s structure follows a data analysis perspective. As such, the chapters are organized by traditional data analysis problems. In selecting specific nonparametric models, simpler and more traditional models are favored over specialized ones. The discussed methods are illustrated with a wealth of examples, including applications ranging from stylized examples to case studies from recent literature. The book also includes an extensive discussion of computational methods and details on their implementation. R code for many examples is included in on-line software pages.
非参数判别模型%Nonparametric discriminant model
Institute of Scientific and Technical Information of China (English)
谢斌锋; 梁飞豹
2011-01-01
提出了一类新的判别分析方法,主要思想是将非参数回归模型推广到判别分析中,形成相应的非参数判别模型.通过实例与传统判别法相比较,表明非参数判别法具有更广泛的适用性和较高的回代正确率.%In this paper, the author puts forth a new class of discriminant method, which the main idea is applied non- parametric regression model to discriminant analysis and forms the corresponding nonparametric discriminant model. Compared with the traditional discriminant methods by citing an example, the nonparametric discriminant method has more comprehensive adaptability and higher correct rate of back subsitution.
A Bayesian nonparametric meta-analysis model.
Karabatsos, George; Talbott, Elizabeth; Walker, Stephen G
2015-03-01
In a meta-analysis, it is important to specify a model that adequately describes the effect-size distribution of the underlying population of studies. The conventional normal fixed-effect and normal random-effects models assume a normal effect-size population distribution, conditionally on parameters and covariates. For estimating the mean overall effect size, such models may be adequate, but for prediction, they surely are not if the effect-size distribution exhibits non-normal behavior. To address this issue, we propose a Bayesian nonparametric meta-analysis model, which can describe a wider range of effect-size distributions, including unimodal symmetric distributions, as well as skewed and more multimodal distributions. We demonstrate our model through the analysis of real meta-analytic data arising from behavioral-genetic research. We compare the predictive performance of the Bayesian nonparametric model against various conventional and more modern normal fixed-effects and random-effects models.
Nonparametric Bayes analysis of social science data
Kunihama, Tsuyoshi
Social science data often contain complex characteristics that standard statistical methods fail to capture. Social surveys assign many questions to respondents, which often consist of mixed-scale variables. Each of the variables can follow a complex distribution outside parametric families and associations among variables may have more complicated structures than standard linear dependence. Therefore, it is not straightforward to develop a statistical model which can approximate structures well in the social science data. In addition, many social surveys have collected data over time and therefore we need to incorporate dynamic dependence into the models. Also, it is standard to observe massive number of missing values in the social science data. To address these challenging problems, this thesis develops flexible nonparametric Bayesian methods for the analysis of social science data. Chapter 1 briefly explains backgrounds and motivations of the projects in the following chapters. Chapter 2 develops a nonparametric Bayesian modeling of temporal dependence in large sparse contingency tables, relying on a probabilistic factorization of the joint pmf. Chapter 3 proposes nonparametric Bayes inference on conditional independence with conditional mutual information used as a measure of the strength of conditional dependence. Chapter 4 proposes a novel Bayesian density estimation method in social surveys with complex designs where there is a gap between sample and population. We correct for the bias by adjusting mixture weights in Bayesian mixture models. Chapter 5 develops a nonparametric model for mixed-scale longitudinal surveys, in which various types of variables can be induced through latent continuous variables and dynamic latent factors lead to flexibly time-varying associations among variables.
Local Component Analysis for Nonparametric Bayes Classifier
Khademi, Mahmoud; safayani, Meharn
2010-01-01
The decision boundaries of Bayes classifier are optimal because they lead to maximum probability of correct decision. It means if we knew the prior probabilities and the class-conditional densities, we could design a classifier which gives the lowest probability of error. However, in classification based on nonparametric density estimation methods such as Parzen windows, the decision regions depend on the choice of parameters such as window width. Moreover, these methods suffer from curse of dimensionality of the feature space and small sample size problem which severely restricts their practical applications. In this paper, we address these problems by introducing a novel dimension reduction and classification method based on local component analysis. In this method, by adopting an iterative cross-validation algorithm, we simultaneously estimate the optimal transformation matrices (for dimension reduction) and classifier parameters based on local information. The proposed method can classify the data with co...
A Nonparametric Analogy of Analysis of Covariance
Burnett, Thomas D.; Barr, Donald R.
1977-01-01
A nonparametric test of the hypothesis of no treatment effect is suggested for a situation where measures of the severity of the condition treated can be obtained and ranked both pre- and post-treatment. The test allows the pre-treatment rank to be used as a concomitant variable. (Author/JKS)
Lottery spending: a non-parametric analysis.
Garibaldi, Skip; Frisoli, Kayla; Ke, Li; Lim, Melody
2015-01-01
We analyze the spending of individuals in the United States on lottery tickets in an average month, as reported in surveys. We view these surveys as sampling from an unknown distribution, and we use non-parametric methods to compare properties of this distribution for various demographic groups, as well as claims that some properties of this distribution are constant across surveys. We find that the observed higher spending by Hispanic lottery players can be attributed to differences in education levels, and we dispute previous claims that the top 10% of lottery players consistently account for 50% of lottery sales.
Lottery spending: a non-parametric analysis.
Directory of Open Access Journals (Sweden)
Skip Garibaldi
Full Text Available We analyze the spending of individuals in the United States on lottery tickets in an average month, as reported in surveys. We view these surveys as sampling from an unknown distribution, and we use non-parametric methods to compare properties of this distribution for various demographic groups, as well as claims that some properties of this distribution are constant across surveys. We find that the observed higher spending by Hispanic lottery players can be attributed to differences in education levels, and we dispute previous claims that the top 10% of lottery players consistently account for 50% of lottery sales.
Investigating the cultural patterns of corruption: A nonparametric analysis
Halkos, George; Tzeremes, Nickolaos
2011-01-01
By using a sample of 77 countries our analysis applies several nonparametric techniques in order to reveal the link between national culture and corruption. Based on Hofstede’s cultural dimensions and the corruption perception index, the results reveal that countries with higher levels of corruption tend to have higher power distance and collectivism values in their society.
Unsupervised Linear Discriminant Analysis
Institute of Scientific and Technical Information of China (English)
无
2006-01-01
An algorithm for unsupervised linear discriminant analysis was presented. Optimal unsupervised discriminant vectors are obtained through maximizing covariance of all samples and minimizing covariance of local k-nearest neighbor samples. The experimental results show our algorithm is effective.
A Bayesian Nonparametric Meta-Analysis Model
Karabatsos, George; Talbott, Elizabeth; Walker, Stephen G.
2015-01-01
In a meta-analysis, it is important to specify a model that adequately describes the effect-size distribution of the underlying population of studies. The conventional normal fixed-effect and normal random-effects models assume a normal effect-size population distribution, conditionally on parameters and covariates. For estimating the mean overall…
Nonparametric inference procedures for multistate life table analysis.
Dow, M M
1985-01-01
Recent generalizations of the classical single state life table procedures to the multistate case provide the means to analyze simultaneously the mobility and mortality experience of 1 or more cohorts. This paper examines fairly general nonparametric combinatorial matrix procedures, known as quadratic assignment, as an analysis technic of various transitional patterns commonly generated by cohorts over the life cycle course. To some degree, the output from a multistate life table analysis suggests inference procedures. In his discussion of multstate life table construction features, the author focuses on the matrix formulation of the problem. He then presents several examples of the proposed nonparametric procedures. Data for the mobility and life expectancies at birth matrices come from the 458 member Cayo Santiago rhesus monkey colony. The author's matrix combinatorial approach to hypotheses testing may prove to be a useful inferential strategy in several multidimensional demographic areas.
Using non-parametric methods in econometric production analysis
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard; Henningsen, Arne
2012-01-01
by investigating the relationship between the elasticity of scale and the farm size. We use a balanced panel data set of 371~specialised crop farms for the years 2004-2007. A non-parametric specification test shows that neither the Cobb-Douglas function nor the Translog function are consistent with the "true......Econometric estimation of production functions is one of the most common methods in applied economic production analysis. These studies usually apply parametric estimation techniques, which obligate the researcher to specify a functional form of the production function of which the Cobb...... parameter estimates, but also in biased measures which are derived from the parameters, such as elasticities. Therefore, we propose to use non-parametric econometric methods. First, these can be applied to verify the functional form used in parametric production analysis. Second, they can be directly used...
Using non-parametric methods in econometric production analysis
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard; Henningsen, Arne
-Douglas function nor the Translog function are consistent with the “true” relationship between the inputs and the output in our data set. We solve this problem by using non-parametric regression. This approach delivers reasonable results, which are on average not too different from the results of the parametric......Econometric estimation of production functions is one of the most common methods in applied economic production analysis. These studies usually apply parametric estimation techniques, which obligate the researcher to specify the functional form of the production function. Most often, the Cobb...... results—including measures that are of interest of applied economists, such as elasticities. Therefore, we propose to use nonparametric econometric methods. First, they can be applied to verify the functional form used in parametric estimations of production functions. Second, they can be directly used...
Digital spectral analysis parametric, non-parametric and advanced methods
Castanié, Francis
2013-01-01
Digital Spectral Analysis provides a single source that offers complete coverage of the spectral analysis domain. This self-contained work includes details on advanced topics that are usually presented in scattered sources throughout the literature.The theoretical principles necessary for the understanding of spectral analysis are discussed in the first four chapters: fundamentals, digital signal processing, estimation in spectral analysis, and time-series models.An entire chapter is devoted to the non-parametric methods most widely used in industry.High resolution methods a
Using non-parametric methods in econometric production analysis
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard; Henningsen, Arne
2012-01-01
Econometric estimation of production functions is one of the most common methods in applied economic production analysis. These studies usually apply parametric estimation techniques, which obligate the researcher to specify a functional form of the production function of which the Cobb-Douglas a......Econometric estimation of production functions is one of the most common methods in applied economic production analysis. These studies usually apply parametric estimation techniques, which obligate the researcher to specify a functional form of the production function of which the Cobb...... parameter estimates, but also in biased measures which are derived from the parameters, such as elasticities. Therefore, we propose to use non-parametric econometric methods. First, these can be applied to verify the functional form used in parametric production analysis. Second, they can be directly used...... to estimate production functions without the specification of a functional form. Therefore, they avoid possible misspecification errors due to the use of an unsuitable functional form. In this paper, we use parametric and non-parametric methods to identify the optimal size of Polish crop farms...
Wesselink, Christiaan; Heeg, Govert P.; Jansonius, Nomdo M.
Objective: To compare prospectively 2 perimetric progression detection algorithms for glaucoma, the Early Manifest Glaucoma Trial algorithm (glaucoma progression analysis [GPA]) and a nonparametric algorithm applied to the mean deviation (MD) (nonparametric progression analysis [NPA]). Methods:
Using non-parametric methods in econometric production analysis
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard; Henningsen, Arne
Econometric estimation of production functions is one of the most common methods in applied economic production analysis. These studies usually apply parametric estimation techniques, which obligate the researcher to specify the functional form of the production function. Most often, the Cobb......-Douglas or the Translog production function is used. However, the specification of a functional form for the production function involves the risk of specifying a functional form that is not similar to the “true” relationship between the inputs and the output. This misspecification might result in biased estimation...... results—including measures that are of interest of applied economists, such as elasticities. Therefore, we propose to use nonparametric econometric methods. First, they can be applied to verify the functional form used in parametric estimations of production functions. Second, they can be directly used...
Multi-Directional Non-Parametric Analysis of Agricultural Efficiency
DEFF Research Database (Denmark)
Balezentis, Tomas
This thesis seeks to develop methodologies for assessment of agricultural efficiency and employ them to Lithuanian family farms. In particular, we focus on three particular objectives throughout the research: (i) to perform a fully non-parametric analysis of efficiency effects, (ii) to extend...... relative to labour, intermediate consumption and land (in some cases land was not treated as a discretionary input). These findings call for further research on relationships among financial structure, investment decisions, and efficiency in Lithuanian family farms. Application of different techniques...... of stochasticity associated with Lithuanian family farm performance. The former technique showed that the farms differed in terms of the mean values and variance of the efficiency scores over time with some clear patterns prevailing throughout the whole research period. The fuzzy Free Disposal Hull showed...
Bayesian nonparametric meta-analysis using Polya tree mixture models.
Branscum, Adam J; Hanson, Timothy E
2008-09-01
Summary. A common goal in meta-analysis is estimation of a single effect measure using data from several studies that are each designed to address the same scientific inquiry. Because studies are typically conducted in geographically disperse locations, recent developments in the statistical analysis of meta-analytic data involve the use of random effects models that account for study-to-study variability attributable to differences in environments, demographics, genetics, and other sources that lead to heterogeneity in populations. Stemming from asymptotic theory, study-specific summary statistics are modeled according to normal distributions with means representing latent true effect measures. A parametric approach subsequently models these latent measures using a normal distribution, which is strictly a convenient modeling assumption absent of theoretical justification. To eliminate the influence of overly restrictive parametric models on inferences, we consider a broader class of random effects distributions. We develop a novel hierarchical Bayesian nonparametric Polya tree mixture (PTM) model. We present methodology for testing the PTM versus a normal random effects model. These methods provide researchers a straightforward approach for conducting a sensitivity analysis of the normality assumption for random effects. An application involving meta-analysis of epidemiologic studies designed to characterize the association between alcohol consumption and breast cancer is presented, which together with results from simulated data highlight the performance of PTMs in the presence of nonnormality of effect measures in the source population.
2008-06-01
December 2006. 7. M. Fritz, B. Leibe, B. Caputo, and B. Schiele . Integrating representative and discriminant models for object category detection. 8... Schiele . Robust object detection with interleaved categorization and segmentation. Int. J. of Computer Vision, 2007 (in press). 18. D. G. Lowe. Distinctive
Nonparametric Cointegration Analysis of Fractional Systems With Unknown Integration Orders
DEFF Research Database (Denmark)
Nielsen, Morten Ørregaard
2009-01-01
In this paper a nonparametric variance ratio testing approach is proposed for determining the number of cointegrating relations in fractionally integrated systems. The test statistic is easily calculated without prior knowledge of the integration order of the data, the strength of the cointegrating...
Non-parametric analysis of rating transition and default data
DEFF Research Database (Denmark)
Fledelius, Peter; Lando, David; Perch Nielsen, Jens
2004-01-01
We demonstrate the use of non-parametric intensity estimation - including construction of pointwise confidence sets - for analyzing rating transition data. We find that transition intensities away from the class studied here for illustration strongly depend on the direction of the previous move b...... but that this dependence vanishes after 2-3 years....
Non-parametric analysis of rating transition and default data
DEFF Research Database (Denmark)
Fledelius, Peter; Lando, David; Perch Nielsen, Jens
2004-01-01
We demonstrate the use of non-parametric intensity estimation - including construction of pointwise confidence sets - for analyzing rating transition data. We find that transition intensities away from the class studied here for illustration strongly depend on the direction of the previous move...
Poverty and life cycle effects: A nonparametric analysis for Germany
Stich, Andreas
1996-01-01
Most empirical studies on poverty consider the extent of poverty either for the entire society or for separate groups like elderly people.However, these papers do not show what the situation looks like for persons of a certain age. In this paper poverty measures depending on age are derived using the joint density of income and age. The density is nonparametrically estimated by weighted Gaussian kernel density estimation. Applying the conditional density of income to several poverty measures ...
ANALYSIS OF TIED DATA: AN ALTERNATIVE NON-PARAMETRIC APPROACH
Directory of Open Access Journals (Sweden)
I. C. A. OYEKA
2012-02-01
Full Text Available This paper presents a non-parametric statistical method of analyzing two-sample data that makes provision for the possibility of ties in the data. A test statistic is developed and shown to be free of the effect of any possible ties in the data. An illustrative example is provided and the method is shown to compare favourably with its competitor; the Mann-Whitney test and is more powerful than the latter when there are ties.
A Bayesian nonparametric method for prediction in EST analysis
Directory of Open Access Journals (Sweden)
Prünster Igor
2007-09-01
Full Text Available Abstract Background Expressed sequence tags (ESTs analyses are a fundamental tool for gene identification in organisms. Given a preliminary EST sample from a certain library, several statistical prediction problems arise. In particular, it is of interest to estimate how many new genes can be detected in a future EST sample of given size and also to determine the gene discovery rate: these estimates represent the basis for deciding whether to proceed sequencing the library and, in case of a positive decision, a guideline for selecting the size of the new sample. Such information is also useful for establishing sequencing efficiency in experimental design and for measuring the degree of redundancy of an EST library. Results In this work we propose a Bayesian nonparametric approach for tackling statistical problems related to EST surveys. In particular, we provide estimates for: a the coverage, defined as the proportion of unique genes in the library represented in the given sample of reads; b the number of new unique genes to be observed in a future sample; c the discovery rate of new genes as a function of the future sample size. The Bayesian nonparametric model we adopt conveys, in a statistically rigorous way, the available information into prediction. Our proposal has appealing properties over frequentist nonparametric methods, which become unstable when prediction is required for large future samples. EST libraries, previously studied with frequentist methods, are analyzed in detail. Conclusion The Bayesian nonparametric approach we undertake yields valuable tools for gene capture and prediction in EST libraries. The estimators we obtain do not feature the kind of drawbacks associated with frequentist estimators and are reliable for any size of the additional sample.
Discriminant Analysis on Land Grading
Institute of Scientific and Technical Information of China (English)
LIU Yaolin; HOU Yajuan
2004-01-01
This paper proposes the discriminant analysis on land grading after analyzing the common methods and discussing the Fisher's discriminant in detail. Actually this method deduces the dimension from multi to single, thus it makes the feature vectors in n-dimension change to a scalar, and use this scalar to classify samples. This paper illustrates the result by giving an example of the residential land grading by the discriminant analysis.
Discriminant Incoherent Component Analysis.
Georgakis, Christos; Panagakis, Yannis; Pantic, Maja
2016-05-01
Face images convey rich information which can be perceived as a superposition of low-complexity components associated with attributes, such as facial identity, expressions, and activation of facial action units (AUs). For instance, low-rank components characterizing neutral facial images are associated with identity, while sparse components capturing non-rigid deformations occurring in certain face regions reveal expressions and AU activations. In this paper, the discriminant incoherent component analysis (DICA) is proposed in order to extract low-complexity components, corresponding to facial attributes, which are mutually incoherent among different classes (e.g., identity, expression, and AU activation) from training data, even in the presence of gross sparse errors. To this end, a suitable optimization problem, involving the minimization of nuclear-and l1 -norm, is solved. Having found an ensemble of class-specific incoherent components by the DICA, an unseen (test) image is expressed as a group-sparse linear combination of these components, where the non-zero coefficients reveal the class(es) of the respective facial attribute(s) that it belongs to. The performance of the DICA is experimentally assessed on both synthetic and real-world data. Emphasis is placed on face analysis tasks, namely, joint face and expression recognition, face recognition under varying percentages of training data corruption, subject-independent expression recognition, and AU detection by conducting experiments on four data sets. The proposed method outperforms all the methods that are compared with all the tasks and experimental settings.
Variable Selection in Discriminant Analysis.
Huberty, Carl J.; Mourad, Salah A.
Methods for ordering and selecting variables for discriminant analysis in multiple group comparison or group prediction studies include: univariate Fs, stepwise analysis, learning discriminant function (LDF) variable correlations, communalities, LDF standardized coefficients, and weighted standardized coefficients. Five indices based on distance,…
Variable Selection in Discriminant Analysis.
Huberty, Carl J.; Mourad, Salah A.
Methods for ordering and selecting variables for discriminant analysis in multiple group comparison or group prediction studies include: univariate Fs, stepwise analysis, learning discriminant function (LDF) variable correlations, communalities, LDF standardized coefficients, and weighted standardized coefficients. Five indices based on distance,…
Categorical and nonparametric data analysis choosing the best statistical technique
Nussbaum, E Michael
2014-01-01
Featuring in-depth coverage of categorical and nonparametric statistics, this book provides a conceptual framework for choosing the most appropriate type of test in various research scenarios. Class tested at the University of Nevada, the book's clear explanations of the underlying assumptions, computer simulations, and Exploring the Concept boxes help reduce reader anxiety. Problems inspired by actual studies provide meaningful illustrations of the techniques. The underlying assumptions of each test and the factors that impact validity and statistical power are reviewed so readers can explain
Non-parametric production analysis of pesticides use in the Netherlands
Oude Lansink, A.G.J.M.; Silva, E.
2004-01-01
Many previous empirical studies on the productivity of pesticides suggest that pesticides are under-utilized in agriculture despite the general held believe that these inputs are substantially over-utilized. This paper uses data envelopment analysis (DEA) to calculate non-parametric measures of the
Comparison of Rank Analysis of Covariance and Nonparametric Randomized Blocks Analysis.
Porter, Andrew C.; McSweeney, Maryellen
The relative power of three possible experimental designs under the condition that data is to be analyzed by nonparametric techniques; the comparison of the power of each nonparametric technique to its parametric analogue; and the comparison of relative powers using nonparametric and parametric techniques are discussed. The three nonparametric…
ks: Kernel Density Estimation and Kernel Discriminant Analysis for Multivariate Data in R
Directory of Open Access Journals (Sweden)
Tarn Duong
2007-09-01
Full Text Available Kernel smoothing is one of the most widely used non-parametric data smoothing techniques. We introduce a new R package ks for multivariate kernel smoothing. Currently it contains functionality for kernel density estimation and kernel discriminant analysis. It is a comprehensive package for bandwidth matrix selection, implementing a wide range of data-driven diagonal and unconstrained bandwidth selectors.
Spline Nonparametric Regression Analysis of Stress-Strain Curve of Confined Concrete
Directory of Open Access Journals (Sweden)
Tavio Tavio
2008-01-01
Full Text Available Due to enormous uncertainties in confinement models associated with the maximum compressive strength and ductility of concrete confined by rectilinear ties, the implementation of spline nonparametric regression analysis is proposed herein as an alternative approach. The statistical evaluation is carried out based on 128 large-scale column specimens of either normal-or high-strength concrete tested under uniaxial compression. The main advantage of this kind of analysis is that it can be applied when the trend of relation between predictor and response variables are not obvious. The error in the analysis can, therefore, be minimized so that it does not depend on the assumption of a particular shape of the curve. This provides higher flexibility in the application. The results of the statistical analysis indicates that the stress-strain curves of confined concrete obtained from the spline nonparametric regression analysis proves to be in good agreement with the experimental curves available in literatures
The Discriminant Analysis Flare Forecasting System (DAFFS)
Leka, K. D.; Barnes, Graham; Wagner, Eric; Hill, Frank; Marble, Andrew R.
2016-05-01
The Discriminant Analysis Flare Forecasting System (DAFFS) has been developed under NOAA/Small Business Innovative Research funds to quantitatively improve upon the NOAA/SWPC flare prediction. In the Phase-I of this project, it was demonstrated that DAFFS could indeed improve by the requested 25% most of the standard flare prediction data products from NOAA/SWPC. In the Phase-II of this project, a prototype has been developed and is presently running autonomously at NWRA.DAFFS uses near-real-time data from NOAA/GOES, SDO/HMI, and the NSO/GONG network to issue both region- and full-disk forecasts of solar flares, based on multi-variable non-parametric Discriminant Analysis. Presently, DAFFS provides forecasts which match those provided by NOAA/SWPC in terms of thresholds and validity periods (including 1-, 2-, and 3- day forecasts), although issued twice daily. Of particular note regarding DAFFS capabilities are the redundant system design, automatically-generated validation statistics and the large range of customizable options available. As part of this poster, a description of the data used, algorithm, performance and customizable options will be presented, as well as a demonstration of the DAFFS prototype.DAFFS development at NWRA is supported by NOAA/SBIR contracts WC-133R-13-CN-0079 and WC-133R-14-CN-0103, with additional support from NASA contract NNH12CG10C, plus acknowledgment to the SDO/HMI and NSO/GONG facilities and NOAA/SWPC personnel for data products, support, and feedback. DAFFS is presently ready for Phase-III development.
The Use of Nonparametric Kernel Regression Methods in Econometric Production Analysis
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard
This PhD thesis addresses one of the fundamental problems in applied econometric analysis, namely the econometric estimation of regression functions. The conventional approach to regression analysis is the parametric approach, which requires the researcher to specify the form of the regression...... to avoid this problem. The main objective is to investigate the applicability of the nonparametric kernel regression method in applied production analysis. The focus of the empirical analyses included in this thesis is the agricultural sector in Poland. Data on Polish farms are used to investigate...... practically and politically relevant problems and to illustrate how nonparametric regression methods can be used in applied microeconomic production analysis both in panel data and cross-section data settings. The thesis consists of four papers. The first paper addresses problems of parametric...
Multilevel Latent Class Analysis: Parametric and Nonparametric Models
Finch, W. Holmes; French, Brian F.
2014-01-01
Latent class analysis is an analytic technique often used in educational and psychological research to identify meaningful groups of individuals within a larger heterogeneous population based on a set of variables. This technique is flexible, encompassing not only a static set of variables but also longitudinal data in the form of growth mixture…
Applications of non-parametric statistics and analysis of variance on sample variances
Myers, R. H.
1981-01-01
Nonparametric methods that are available for NASA-type applications are discussed. An attempt will be made here to survey what can be used, to attempt recommendations as to when each would be applicable, and to compare the methods, when possible, with the usual normal-theory procedures that are avavilable for the Gaussion analog. It is important here to point out the hypotheses that are being tested, the assumptions that are being made, and limitations of the nonparametric procedures. The appropriateness of doing analysis of variance on sample variances are also discussed and studied. This procedure is followed in several NASA simulation projects. On the surface this would appear to be reasonably sound procedure. However, difficulties involved center around the normality problem and the basic homogeneous variance assumption that is mase in usual analysis of variance problems. These difficulties discussed and guidelines given for using the methods.
The Use of Nonparametric Kernel Regression Methods in Econometric Production Analysis
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard
This PhD thesis addresses one of the fundamental problems in applied econometric analysis, namely the econometric estimation of regression functions. The conventional approach to regression analysis is the parametric approach, which requires the researcher to specify the form of the regression...... function. However, the a priori specification of a functional form involves the risk of choosing one that is not similar to the “true” but unknown relationship between the regressors and the dependent variable. This problem, known as parametric misspecification, can result in biased parameter estimates...... and nonparametric estimations of production functions in order to evaluate the optimal firm size. The second paper discusses the use of parametric and nonparametric regression methods to estimate panel data regression models. The third paper analyses production risk, price uncertainty, and farmers' risk preferences...
Multi-Directional Non-Parametric Analysis of Agricultural Efficiency
DEFF Research Database (Denmark)
Balezentis, Tomas
the Multi-Directional Efficiency Analysis approach, (iii) to account for uncertainties via the use of probabilistic and fuzzy measures. Therefore, the thesis encompass six papers dedicated to (the combinations of) these objectives. One of the main contributions of this thesis is a number of extensions...... relative to labour, intermediate consumption and land (in some cases land was not treated as a discretionary input). These findings call for further research on relationships among financial structure, investment decisions, and efficiency in Lithuanian family farms. Application of different techniques...
Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data.
Tan, Qihua; Thomassen, Mads; Burton, Mark; Mose, Kristian Fredløv; Andersen, Klaus Ejner; Hjelmborg, Jacob; Kruse, Torben
2017-06-06
Modeling complex time-course patterns is a challenging issue in microarray study due to complex gene expression patterns in response to the time-course experiment. We introduce the generalized correlation coefficient and propose a combinatory approach for detecting, testing and clustering the heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that the non-parametric nature in the generalized correlation analysis could be an useful and efficient tool for analyzing microarray time-course data and for exploring the complex relationships in the omics data for studying their association with disease and health.
Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data
DEFF Research Database (Denmark)
Tan, Qihua; Thomassen, Mads; Burton, Mark
2017-01-01
Modeling complex time-course patterns is a challenging issue in microarray study due to complex gene expression patterns in response to the time-course experiment. We introduce the generalized correlation coefficient and propose a combinatory approach for detecting, testing and clustering...... the heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that the non-parametric nature in the generalized correlation analysis could be an useful and efficient tool for analyzing microarray...... time-course data and for exploring the complex relationships in the omics data for studying their association with disease and health....
Generalized Correlation Coefficient for Non-Parametric Analysis of Microarray Time-Course Data
DEFF Research Database (Denmark)
Tan, Qihua; Thomassen, Mads; Burton, Mark
2017-01-01
Modeling complex time-course patterns is a challenging issue in microarray study due to complex gene expression patterns in response to the time-course experiment. We introduce the generalized correlation coefficient and propose a combinatory approach for detecting, testing and clustering...... the heterogeneous time-course gene expression patterns. Application of the method identified nonlinear time-course patterns in high agreement with parametric analysis. We conclude that the non-parametric nature in the generalized correlation analysis could be an useful and efficient tool for analyzing microarray...
Nonparametric bootstrap analysis with applications to demographic effects in demand functions.
Gozalo, P L
1997-12-01
"A new bootstrap proposal, labeled smooth conditional moment (SCM) bootstrap, is introduced for independent but not necessarily identically distributed data, where the classical bootstrap procedure fails.... A good example of the benefits of using nonparametric and bootstrap methods is the area of empirical demand analysis. In particular, we will be concerned with their application to the study of two important topics: what are the most relevant effects of household demographic variables on demand behavior, and to what extent present parametric specifications capture these effects." excerpt
Nonparametric Bayesian Dictionary Learning for Analysis of Noisy and Incomplete Images
2010-04-01
OF EACH CELL ARE RESULTS OF KSVD AND BPFA, RESPECTIVELY. σ C.man House Peppers Lena Barbara Boats F.print Couple Hill 5 37.87 39.37 37.78 38.60 38.08...INTERPOLATION PSNR RESULTS, USING PATCH SIZE 8× 8. BOTTOM: BPFA RGB IMAGE INTERPOLATION PSNR RESULTS, USING PATCH SIZE 7× 7. data ratio C.man House Peppers Lena...of subspaces. IEEE Trans. Inform. Theory, 2009. [16] T. Ferguson . A Bayesian analysis of some nonparametric problems. Annals of Statistics, 1:209–230
An adaptive nonparametric method in benchmark analysis for bioassay and environmental studies.
Bhattacharya, Rabi; Lin, Lizhen
2010-12-01
We present a novel nonparametric method for bioassay and benchmark analysis in risk assessment, which averages isotonic MLEs based on disjoint subgroups of dosages. The asymptotic theory for the methodology is derived, showing that the MISEs (mean integrated squared error) of the estimates of both the dose-response curve F and its inverse F(-1) achieve the optimal rate O(N(-4/5)). Also, we compute the asymptotic distribution of the estimate ζ~p of the effective dosage ζ(p) = F(-1) (p) which is shown to have an optimally small asymptotic variance.
Evolution of the CMB Power Spectrum Across WMAP Data Releases: A Nonparametric Analysis
Aghamousa, Amir; Souradeep, Tarun
2011-01-01
We present a comparative analysis of the WMAP 1-, 3-, 5-, and 7-year data releases for the CMB angular power spectrum, with respect to the following three key questions: (a) How well is the angular power spectrum determined by the data alone? (b) How well is the Lambda-CDM model supported by a model-independent, data-driven analysis? (c) What are the realistic uncertainties on peak/dip locations and heights? Our analysis is based on a nonparametric function estimation methodology [1,2]. Our results show that the height of the power spectrum is well determined by data alone for multipole index l approximately less than 600 (1-year), 800 (3-year), and 900 (5- and 7-year data realizations). We also show that parametric fits based on the Lambda-CDM model are remarkably close to our nonparametric fit in l-regions where the data are sufficiently precise. A contrasting example is provided by an H-Lambda-CDM model: As the data become precise with successive data realizations, the H-Lambda-CDM angular power spectrum g...
Ryu, Duchwan
2010-09-28
We consider nonparametric regression analysis in a generalized linear model (GLM) framework for data with covariates that are the subject-specific random effects of longitudinal measurements. The usual assumption that the effects of the longitudinal covariate processes are linear in the GLM may be unrealistic and if this happens it can cast doubt on the inference of observed covariate effects. Allowing the regression functions to be unknown, we propose to apply Bayesian nonparametric methods including cubic smoothing splines or P-splines for the possible nonlinearity and use an additive model in this complex setting. To improve computational efficiency, we propose the use of data-augmentation schemes. The approach allows flexible covariance structures for the random effects and within-subject measurement errors of the longitudinal processes. The posterior model space is explored through a Markov chain Monte Carlo (MCMC) sampler. The proposed methods are illustrated and compared to other approaches, the "naive" approach and the regression calibration, via simulations and by an application that investigates the relationship between obesity in adulthood and childhood growth curves. © 2010, The International Biometric Society.
Ryu, Duchwan; Li, Erning; Mallick, Bani K
2011-06-01
We consider nonparametric regression analysis in a generalized linear model (GLM) framework for data with covariates that are the subject-specific random effects of longitudinal measurements. The usual assumption that the effects of the longitudinal covariate processes are linear in the GLM may be unrealistic and if this happens it can cast doubt on the inference of observed covariate effects. Allowing the regression functions to be unknown, we propose to apply Bayesian nonparametric methods including cubic smoothing splines or P-splines for the possible nonlinearity and use an additive model in this complex setting. To improve computational efficiency, we propose the use of data-augmentation schemes. The approach allows flexible covariance structures for the random effects and within-subject measurement errors of the longitudinal processes. The posterior model space is explored through a Markov chain Monte Carlo (MCMC) sampler. The proposed methods are illustrated and compared to other approaches, the "naive" approach and the regression calibration, via simulations and by an application that investigates the relationship between obesity in adulthood and childhood growth curves.
Szabo, Zoltan
2010-01-01
The goal of this paper is to extend independent subspace analysis (ISA) to the case of (i) nonparametric, not strictly stationary source dynamics and (ii) unknown source component dimensions. We make use of functional autoregressive (fAR) processes to model the temporal evolution of the hidden sources. An extension of the ISA separation principle--which states that the ISA problem can be solved by traditional independent component analysis (ICA) and clustering of the ICA elements--is derived for the solution of the defined fAR independent process analysis task (fAR-IPA): applying fAR identification we reduce the problem to ISA. A local averaging approach, the Nadaraya-Watson kernel regression technique is adapted to obtain strongly consistent fAR estimation. We extend the Amari-index to different dimensional components and illustrate the efficiency of the fAR-IPA approach by numerical examples.
A Level Set Analysis and A Nonparametric Regression on S&P 500 Daily Return
Directory of Open Access Journals (Sweden)
Yipeng Yang
2016-02-01
Full Text Available In this paper, a level set analysis is proposed which aims to analyze the S&P 500 return with a certain magnitude. It is found that the process of large jumps/drops of return tend to have negative serial correlation, and volatility clustering phenomenon can be easily seen. Then, a nonparametric analysis is performed and new patterns are discovered. An ARCH model is constructed based on the patterns we discovered and it is capable of manifesting the volatility skew in option pricing. A comparison of our model with the GARCH(1,1 model is carried out. The explanation of the validity on our model through prospect theory is provided, and, as a novelty, we linked the volatility skew phenomenon to the prospect theory in behavioral finance.
Takara, K. T.
2015-12-01
This paper describes a non-parametric frequency analysis method for hydrological extreme-value samples with a size larger than 100, verifying the estimation accuracy with a computer intensive statistics (CIS) resampling such as the bootstrap. Probable maximum values are also incorporated into the analysis for extreme events larger than a design level of flood control. Traditional parametric frequency analysis methods of extreme values include the following steps: Step 1: Collecting and checking extreme-value data; Step 2: Enumerating probability distributions that would be fitted well to the data; Step 3: Parameter estimation; Step 4: Testing goodness of fit; Step 5: Checking the variability of quantile (T-year event) estimates by the jackknife resampling method; and Step_6: Selection of the best distribution (final model). The non-parametric method (NPM) proposed here can skip Steps 2, 3, 4 and 6. Comparing traditional parameter methods (PM) with the NPM, this paper shows that PM often underestimates 100-year quantiles for annual maximum rainfall samples with records of more than 100 years. Overestimation examples are also demonstrated. The bootstrap resampling can do bias correction for the NPM and can also give the estimation accuracy as the bootstrap standard error. This NPM has advantages to avoid various difficulties in above-mentioned steps in the traditional PM. Probable maximum events are also incorporated into the NPM as an upper bound of the hydrological variable. Probable maximum precipitation (PMP) and probable maximum flood (PMF) can be a new parameter value combined with the NPM. An idea how to incorporate these values into frequency analysis is proposed for better management of disasters that exceed the design level. The idea stimulates more integrated approach by geoscientists and statisticians as well as encourages practitioners to consider the worst cases of disasters in their disaster management planning and practices.
Discriminant and Proximity Analysis in Intercultural Investigation.
Laveault, Dany
1982-01-01
Discriminant analysis is applied to data from previous research dealing with assessing the particularities of cognitive development in young (four to nine years old) Montagnais Indians and French Canadians. The most important future contribution of discriminant analysis to intercultural research will be its ability to conceptualize group…
Semisupervised Sparse Multilinear Discriminant Analysis
Institute of Scientific and Technical Information of China (English)
黄锴; 张丽清
2014-01-01
Various problems are encountered when adopting ordinary vector space algorithms for high-order tensor data input. Namely, one must overcome the Small Sample Size (SSS) and overfitting problems. In addition, the structural information of the original tensor signal is lost during the vectorization process. Therefore, comparable methods using a direct tensor input are more appropriate. In the case of electrocardiograms (ECGs), another problem must be overcome;the manual diagnosis of ECG data is expensive and time consuming, rendering it diﬃcult to acquire data with diagnosis labels. However, when effective features for classification in the original data are very sparse, we propose a semisupervised sparse multilinear discriminant analysis (SSSMDA) method. This method uses the distribution of both the labeled and the unlabeled data together with labels discovered through a label propagation algorithm. In practice, we use 12-lead ECGs collected from a remote diagnosis system and apply a short-time-fourier transformation (STFT) to obtain third-order tensors. The experimental results highlight the sparsity of the ECG data and the ability of our method to extract sparse and effective features that can be used for classification.
Takamizawa, Hisashi; Itoh, Hiroto; Nishiyama, Yutaka
2016-10-01
In order to understand neutron irradiation embrittlement in high fluence regions, statistical analysis using the Bayesian nonparametric (BNP) method was performed for the Japanese surveillance and material test reactor irradiation database. The BNP method is essentially expressed as an infinite summation of normal distributions, with input data being subdivided into clusters with identical statistical parameters, such as mean and standard deviation, for each cluster to estimate shifts in ductile-to-brittle transition temperature (DBTT). The clusters typically depend on chemical compositions, irradiation conditions, and the irradiation embrittlement. Specific variables contributing to the irradiation embrittlement include the content of Cu, Ni, P, Si, and Mn in the pressure vessel steels, neutron flux, neutron fluence, and irradiation temperatures. It was found that the measured shifts of DBTT correlated well with the calculated ones. Data associated with the same materials were subdivided into the same clusters even if neutron fluences were increased.
Trend Analysis of Golestan's Rivers Discharges Using Parametric and Non-parametric Methods
Mosaedi, Abolfazl; Kouhestani, Nasrin
2010-05-01
One of the major problems in human life is climate changes and its problems. Climate changes will cause changes in rivers discharges. The aim of this research is to investigate the trend analysis of seasonal and yearly rivers discharges of Golestan province (Iran). In this research four trend analysis method including, conjunction point, linear regression, Wald-Wolfowitz and Mann-Kendall, for analyzing of river discharges in seasonal and annual periods in significant level of 95% and 99% were applied. First, daily discharge data of 12 hydrometrics stations with a length of 42 years (1965-2007) were selected, after some common statistical tests such as, homogeneity test (by applying G-B and M-W tests), the four mentioned trends analysis tests were applied. Results show that in all stations, for summer data time series, there are decreasing trends with a significant level of 99% according to Mann-Kendall (M-K) test. For autumn time series data, all four methods have similar results. For other periods, the results of these four tests were more or less similar together. While, for some stations the results of tests were different. Keywords: Trend Analysis, Discharge, Non-parametric methods, Wald-Wolfowitz, The Mann-Kendall test, Golestan Province.
Directory of Open Access Journals (Sweden)
Andrea Furková
2007-06-01
Full Text Available This paper explores the aplication of parametric and non-parametric benchmarking methods in measuring cost efficiency of Slovak and Czech electricity distribution companies. We compare the relative cost efficiency of Slovak and Czech distribution companies using two benchmarking methods: the non-parametric Data Envelopment Analysis (DEA and the Stochastic Frontier Analysis (SFA as the parametric approach. The first part of analysis was based on DEA models. Traditional cross-section CCR and BCC model were modified to cost efficiency estimation. In further analysis we focus on two versions of stochastic frontier cost functioin using panel data: MLE model and GLS model. These models have been applied to an unbalanced panel of 11 (Slovakia 3 and Czech Republic 8 regional electricity distribution utilities over a period from 2000 to 2004. The differences in estimated scores, parameters and ranking of utilities were analyzed. We observed significant differences between parametric methods and DEA approach.
Ramajo, Julián; Cordero, José Manuel; Márquez, Miguel Ángel
2017-10-01
This paper analyses region-level technical efficiency in nine European countries over the 1995-2007 period. We propose the application of a nonparametric conditional frontier approach to account for the presence of heterogeneous conditions in the form of geographical externalities. Such environmental factors are beyond the control of regional authorities, but may affect the production function. Therefore, they need to be considered in the frontier estimation. Specifically, a spatial autoregressive term is included as an external conditioning factor in a robust order- m model. Thus we can test the hypothesis of non-separability (the external factor impacts both the input-output space and the distribution of efficiencies), demonstrating the existence of significant global interregional spillovers into the production process. Our findings show that geographical externalities affect both the frontier level and the probability of being more or less efficient. Specifically, the results support the fact that the spatial lag variable has an inverted U-shaped non-linear impact on the performance of regions. This finding can be interpreted as a differential effect of interregional spillovers depending on the size of the neighboring economies: positive externalities for small values, possibly related to agglomeration economies, and negative externalities for high values, indicating the possibility of production congestion. Additionally, evidence of the existence of a strong geographic pattern of European regional efficiency is reported and the levels of technical efficiency are acknowledged to have converged during the period under analysis.
Non-parametric seismic hazard analysis in the presence of incomplete data
Yazdani, Azad; Mirzaei, Sajjad; Dadkhah, Koroush
2017-01-01
The distribution of earthquake magnitudes plays a crucial role in the estimation of seismic hazard parameters. Due to the complexity of earthquake magnitude distribution, non-parametric approaches are recommended over classical parametric methods. The main deficiency of the non-parametric approach is the lack of complete magnitude data in almost all cases. This study aims to introduce an imputation procedure for completing earthquake catalog data that will allow the catalog to be used for non-parametric density estimation. Using a Monte Carlo simulation, the efficiency of introduced approach is investigated. This study indicates that when a magnitude catalog is incomplete, the imputation procedure can provide an appropriate tool for seismic hazard assessment. As an illustration, the imputation procedure was applied to estimate earthquake magnitude distribution in Tehran, the capital city of Iran.
Marmarelis, Vasilis Z; Shin, Dae C; Zhang, Yaping; Kautzky-Willer, Alexandra; Pacini, Giovanni; D'Argenio, David Z
2013-07-01
Modeling studies of the insulin-glucose relationship have mainly utilized parametric models, most notably the minimal model (MM) of glucose disappearance. This article presents results from the comparative analysis of the parametric MM and a nonparametric Laguerre based Volterra Model (LVM) applied to the analysis of insulin modified (IM) intravenous glucose tolerance test (IVGTT) data from a clinical study of gestational diabetes mellitus (GDM). An IM IVGTT study was performed 8 to 10 weeks postpartum in 125 women who were diagnosed with GDM during their pregnancy [population at risk of developing diabetes (PRD)] and in 39 control women with normal pregnancies (control subjects). The measured plasma glucose and insulin from the IM IVGTT in each group were analyzed via a population analysis approach to estimate the insulin sensitivity parameter of the parametric MM. In the nonparametric LVM analysis, the glucose and insulin data were used to calculate the first-order kernel, from which a diagnostic scalar index representing the integrated effect of insulin on glucose was derived. Both the parametric MM and nonparametric LVM describe the glucose concentration data in each group with good fidelity, with an improved measured versus predicted r² value for the LVM of 0.99 versus 0.97 for the MM analysis in the PRD. However, application of the respective diagnostic indices of the two methods does result in a different classification of 20% of the individuals in the PRD. It was found that the data based nonparametric LVM revealed additional insights about the manner in which infused insulin affects blood glucose concentration. © 2013 Diabetes Technology Society.
Talaska, Cara A.; Fiske, Susan T.; Chaiken, Shelly
2008-01-01
Investigations of racial bias have emphasized stereotypes and other beliefs as central explanatory mechanisms and as legitimating discrimination. In recent theory and research, emotional prejudices have emerged as another, more direct predictor of discrimination. A new comprehensive meta-analysis of 57 racial attitude-discrimination studies finds a moderate relationship between overall attitudes and discrimination. Emotional prejudices are twices as closely related to racial discrimination as...
Incremental Discriminant Analysis in Tensor Space
Chang, Liu; Weidong, Zhao; Tao, Yan; Qiang, Pu; Xiaodan, Du
2015-01-01
To study incremental machine learning in tensor space, this paper proposes incremental tensor discriminant analysis. The algorithm employs tensor representation to carry on discriminant analysis and combine incremental learning to alleviate the computational cost. This paper proves that the algorithm can be unified into the graph framework theoretically and analyzes the time and space complexity in detail. The experiments on facial image detection have shown that the algorithm not only achieves sound performance compared with other algorithms, but also reduces the computational issues apparently. PMID:26339229
Variable Selection Strategies in Discriminate Analysis.
Tanguma, Jesus
This paper presents three variable selection strategies in discriminate analysis (all variables in the model, use of stepwise methods, and all possible subsets). All three methods are illustrated through examples. Although the all variables in the model and the stepwise methods are the most widely used, B. Thompson (1996) and C. Huberty (1994)…
Efficient Global Programming Model for Discriminant Analysis
Directory of Open Access Journals (Sweden)
M.ANGULAKSHMI
2011-03-01
Full Text Available Conventional statistical analysis includes the capacity to systematically assign individuals to groups. We suggest alternative assignment procedures, utilizing a set of interrelated goal programming formulations. This paper represents an effort to suggest ways by which the discriminant problem might reasonably be addressed via straightforward linear goal programming formulations. Simple and direct, such formulations may ultimately compete with conventional approaches - free of the classical assumptions and possessing a stronger intuitive appeal. We further demonstrate via simple illustration the potential of these procedures to play a significant part in addressing the discriminant problem, and indicate fundamental ideas that lay the foundation for other more sophisticated approaches.
An exact predictive recursion for Bayesian nonparametric analysis of incomplete data
Garibaldi, Ubaldo; Viarengo, Paolo
2010-01-01
This paper presents a new derivation of nonparametric distribution estimation with right-censored data. It is based on an extension of the predictive inferences to compound evidence. The estimate is recursive and exact, and no stochastic approximation is needed: it simply requires that the censored data are processed in decreasing order. Only in this case the recursion provides exact posterior predictive distributions for subsequent samples under a Dirichlet process prior. The resulting estim...
Directory of Open Access Journals (Sweden)
Fernandez Ana
2010-05-01
Full Text Available Abstract Background Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF in men and women. Methods The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (Kernel-Smoothing implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves and their options (option characteristic curves in discriminating between changes in the underliying disability level. We examined composite DIF to know whether women had a higher probability than men of endorsing each item. Results Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.
Directory of Open Access Journals (Sweden)
Vangelis Sakkalis
2008-01-01
Full Text Available There is an important evidence of differences in the EEG frequency spectrum of control subjects as compared to epileptic subjects. In particular, the study of children presents difficulties due to the early stages of brain development and the various forms of epilepsy indications. In this study, we consider children that developed epileptic crises in the past but without any other clinical, psychological, or visible neurophysiological findings. The aim of the paper is to develop reliable techniques for testing if such controlled epilepsy induces related spectral differences in the EEG. Spectral features extracted by using nonparametric, signal representation techniques (Fourier and wavelet transform and a parametric, signal modeling technique (ARMA are compared and their effect on the classification of the two groups is analyzed. The subjects performed two different tasks: a control (rest task and a relatively difficult math task. The results show that spectral features extracted by modeling the EEG signals recorded from individual channels by an ARMA model give a higher discrimination between the two subject groups for the control task, where classification scores of up to 100% were obtained with a linear discriminant classifier.
CURRENT STATUS OF NONPARAMETRIC STATISTICS
Directory of Open Access Journals (Sweden)
Orlov A. I.
2015-02-01
Full Text Available Nonparametric statistics is one of the five points of growth of applied mathematical statistics. Despite the large number of publications on specific issues of nonparametric statistics, the internal structure of this research direction has remained undeveloped. The purpose of this article is to consider its division into regions based on the existing practice of scientific activity determination of nonparametric statistics and classify investigations on nonparametric statistical methods. Nonparametric statistics allows to make statistical inference, in particular, to estimate the characteristics of the distribution and testing statistical hypotheses without, as a rule, weakly proven assumptions about the distribution function of samples included in a particular parametric family. For example, the widespread belief that the statistical data are often have the normal distribution. Meanwhile, analysis of results of observations, in particular, measurement errors, always leads to the same conclusion - in most cases the actual distribution significantly different from normal. Uncritical use of the hypothesis of normality often leads to significant errors, in areas such as rejection of outlying observation results (emissions, the statistical quality control, and in other cases. Therefore, it is advisable to use nonparametric methods, in which the distribution functions of the results of observations are imposed only weak requirements. It is usually assumed only their continuity. On the basis of generalization of numerous studies it can be stated that to date, using nonparametric methods can solve almost the same number of tasks that previously used parametric methods. Certain statements in the literature are incorrect that nonparametric methods have less power, or require larger sample sizes than parametric methods. Note that in the nonparametric statistics, as in mathematical statistics in general, there remain a number of unresolved problems
Nonparametric variance estimation in the analysis of microarray data: a measurement error approach.
Carroll, Raymond J; Wang, Yuedong
2008-01-01
This article investigates the effects of measurement error on the estimation of nonparametric variance functions. We show that either ignoring measurement error or direct application of the simulation extrapolation, SIMEX, method leads to inconsistent estimators. Nevertheless, the direct SIMEX method can reduce bias relative to a naive estimator. We further propose a permutation SIMEX method which leads to consistent estimators in theory. The performance of both SIMEX methods depends on approximations to the exact extrapolants. Simulations show that both SIMEX methods perform better than ignoring measurement error. The methodology is illustrated using microarray data from colon cancer patients.
Non-parametric trend analysis of water quality data of rivers in Kansas
Yu, Y.-S.; Zou, S.; Whittemore, D.
1993-01-01
Surface water quality data for 15 sampling stations in the Arkansas, Verdigris, Neosho, and Walnut river basins inside the state of Kansas were analyzed to detect trends (or lack of trends) in 17 major constituents by using four different non-parametric methods. The results show that concentrations of specific conductance, total dissolved solids, calcium, total hardness, sodium, potassium, alkalinity, sulfate, chloride, total phosphorus, ammonia plus organic nitrogen, and suspended sediment generally have downward trends. Some of the downward trends are related to increases in discharge, while others could be caused by decreases in pollution sources. Homogeneity tests show that both station-wide trends and basinwide trends are non-homogeneous. ?? 1993.
Robust non-parametric one-sample tests for the analysis of recurrent events.
Rebora, Paola; Galimberti, Stefania; Valsecchi, Maria Grazia
2010-12-30
One-sample non-parametric tests are proposed here for inference on recurring events. The focus is on the marginal mean function of events and the basis for inference is the standardized distance between the observed and the expected number of events under a specified reference rate. Different weights are considered in order to account for various types of alternative hypotheses on the mean function of the recurrent events process. A robust version and a stratified version of the test are also proposed. The performance of these tests was investigated through simulation studies under various underlying event generation processes, such as homogeneous and nonhomogeneous Poisson processes, autoregressive and renewal processes, with and without frailty effects. The robust versions of the test have been shown to be suitable in a wide variety of event generating processes. The motivating context is a study on gene therapy in a very rare immunodeficiency in children, where a major end-point is the recurrence of severe infections. Robust non-parametric one-sample tests for recurrent events can be useful to assess efficacy and especially safety in non-randomized studies or in epidemiological studies for comparison with a standard population.
Direct Neighborhood Discriminant Analysis for Face Recognition
Directory of Open Access Journals (Sweden)
Miao Cheng
2008-01-01
Full Text Available Face recognition is a challenging problem in computer vision and pattern recognition. Recently, many local geometrical structure-based techiniques are presented to obtain the low-dimensional representation of face images with enhanced discriminatory power. However, these methods suffer from the small simple size (SSS problem or the high computation complexity of high-dimensional data. To overcome these problems, we propose a novel local manifold structure learning method for face recognition, named direct neighborhood discriminant analysis (DNDA, which separates the nearby samples of interclass and preserves the local within-class geometry in two steps, respectively. In addition, the PCA preprocessing to reduce dimension to a large extent is not needed in DNDA avoiding loss of discriminative information. Experiments conducted on ORL, Yale, and UMIST face databases show the effectiveness of the proposed method.
A Computational Discriminability Analysis on Twin Fingerprints
Liu, Yu; Srihari, Sargur N.
Sharing similar genetic traits makes the investigation of twins an important study in forensics and biometrics. Fingerprints are one of the most commonly found types of forensic evidence. The similarity between twins’ prints is critical establish to the reliability of fingerprint identification. We present a quantitative analysis of the discriminability of twin fingerprints on a new data set (227 pairs of identical twins and fraternal twins) recently collected from a twin population using both level 1 and level 2 features. Although the patterns of minutiae among twins are more similar than in the general population, the similarity of fingerprints of twins is significantly different from that between genuine prints of the same finger. Twins fingerprints are discriminable with a 1.5%~1.7% higher EER than non-twins. And identical twins can be distinguished by examine fingerprint with a slightly higher error rate than fraternal twins.
NParCov3: A SAS/IML Macro for Nonparametric Randomization-Based Analysis of Covariance
Directory of Open Access Journals (Sweden)
Richard C. Zink
2012-07-01
Full Text Available Analysis of covariance serves two important purposes in a randomized clinical trial. First, there is a reduction of variance for the treatment effect which provides more powerful statistical tests and more precise confidence intervals. Second, it provides estimates of the treatment effect which are adjusted for random imbalances of covariates between the treatment groups. The nonparametric analysis of covariance method of Koch, Tangen, Jung, and Amara (1998 defines a very general methodology using weighted least-squares to generate covariate-adjusted treatment effects with minimal assumptions. This methodology is general in its applicability to a variety of outcomes, whether continuous, binary, ordinal, incidence density or time-to-event. Further, its use has been illustrated in many clinical trial settings, such as multi-center, dose-response and non-inferiority trials.NParCov3 is a SAS/IML macro written to conduct the nonparametric randomization-based covariance analyses of Koch et al. (1998. The software can analyze a variety of outcomes and can account for stratification. Data from multiple clinical trials will be used for illustration.
Face Recognition Using Kernel Discriminant Analysis
Institute of Scientific and Technical Information of China (English)
无
2002-01-01
Linear Discrimiant Analysis (LDA) has demonstrated their success in face recognition. But LDA is difficult to handle the high nonlinear problems, such as changes of large viewpoint and illumination in face recognition. In order to overcome these problems, we investigate Kernel Discriminant Analysis (KDA) for face recognition. This approach adopts the kernel functions to replace the dot products of nonlinear mapping in the high dimensional feature space, and then the nonlinear problem can be solved in the input space conveniently without explicit mapping. Two face databases are used to test KDA approach. The results show that our approach outperforms the conventional PCA(Eigenface) and LDA(Fisherface) approaches.
Nonparametric statistical inference
Gibbons, Jean Dickinson
2014-01-01
Thoroughly revised and reorganized, the fourth edition presents in-depth coverage of the theory and methods of the most widely used nonparametric procedures in statistical analysis and offers example applications appropriate for all areas of the social, behavioral, and life sciences. The book presents new material on the quantiles, the calculation of exact and simulated power, multiple comparisons, additional goodness-of-fit tests, methods of analysis of count data, and modern computer applications using MINITAB, SAS, and STATXACT. It includes tabular guides for simplified applications of tests and finding P values and confidence interval estimates.
Nonparametric analysis of competing risks data with event category missing at random.
Gouskova, Natalia A; Lin, Feng-Chang; Fine, Jason P
2017-03-01
In competing risks setup, the data for each subject consist of the event time, censoring indicator, and event category. However, sometimes the information about the event category can be missing, as, for example, in a case when the date of death is known but the cause of death is not available. In such situations, treating subjects with missing event category as censored leads to the underestimation of the hazard functions. We suggest nonparametric estimators for the cumulative cause-specific hazards and the cumulative incidence functions which use the Nadaraya-Watson estimator to obtain the contribution of an event with missing category to each of the cause-specific hazards. We derive the propertied of the proposed estimators. Optimal bandwidth is determined, which minimizes the mean integrated squared errors of the proposed estimators over time. The methodology is illustrated using data on lung infections in patients from the United States Cystic Fibrosis Foundation Patient Registry. © 2016, The International Biometric Society.
Crainiceanu, Ciprian M; Caffo, Brian S; Di, Chong-Zhi; Punjabi, Naresh M
2009-06-01
We introduce methods for signal and associated variability estimation based on hierarchical nonparametric smoothing with application to the Sleep Heart Health Study (SHHS). SHHS is the largest electroencephalographic (EEG) collection of sleep-related data, which contains, at each visit, two quasi-continuous EEG signals for each subject. The signal features extracted from EEG data are then used in second level analyses to investigate the relation between health, behavioral, or biometric outcomes and sleep. Using subject specific signals estimated with known variability in a second level regression becomes a nonstandard measurement error problem. We propose and implement methods that take into account cross-sectional and longitudinal measurement error. The research presented here forms the basis for EEG signal processing for the SHHS.
Nonparametric analysis of the time structure of seismicity in a geographic region
Directory of Open Access Journals (Sweden)
A. Quintela-del-Río
2002-06-01
Full Text Available As an alternative to traditional parametric approaches, we suggest nonparametric methods for analyzing temporal data on earthquake occurrences. In particular, the kernel method for estimating the hazard function and the intensity function are presented. One novelty of our approaches is that we take into account the possible dependence of the data to estimate the distribution of time intervals between earthquakes, which has not been considered in most statistics studies on seismicity. Kernel estimation of hazard function has been used to study the occurrence process of cluster centers (main shocks. Kernel intensity estimation, on the other hand, has helped to describe the occurrence process of cluster members (aftershocks. Similar studies in two geographic areas of Spain (Granada and Galicia have been carried out to illustrate the estimation methods suggested.
Directory of Open Access Journals (Sweden)
Saeed Banihashemi
2015-12-01
Full Text Available In line with the growing global trend toward energy efficiency in buildings, this paper aims to first; investigate the energy performance of double-glazed windows in different climates and second; analyze the most dominant used parametric and non-parametric tests in dimension reduction for simulating this component. A four-story building representing the conventional type of residential apartments for four climates of cold, temperate, hot-arid and hot-humid was selected for simulation. 10 variables of U-factor, SHGC, emissivity, visible transmittance, monthly average dry bulb temperature, monthly average percent humidity, monthly average wind speed, monthly average direct solar radiation, monthly average diffuse solar radiation and orientation constituted the parameters considered in the calculation of cooling and heating loads of the case. Design of Experiment and Principal Component Analysis methods were applied to find the most significant factors and reduction dimension of initial variables. It was observed that in two climates of temperate and hot-arid, using double glazed windows was beneficial in both cold and hot months whereas in cold and hot-humid climates where heating and cooling loads are dominant respectively, they were advantageous in only those dominant months. Furthermore, an inconsistency was revealed between parametric and non-parametric tests in terms of identifying the most significant variables.
'nparACT' package for R: A free software tool for the non-parametric analysis of actigraphy data.
Blume, Christine; Santhi, Nayantara; Schabus, Manuel
2016-01-01
For many studies, participants' sleep-wake patterns are monitored and recorded prior to, during and following an experimental or clinical intervention using actigraphy, i.e. the recording of data generated by movements. Often, these data are merely inspected visually without computation of descriptive parameters, in part due to the lack of user-friendly software. To address this deficit, we developed a package for R Core Team [6], that allows computing several non-parametric measures from actigraphy data. Specifically, it computes the interdaily stability (IS), intradaily variability (IV) and relative amplitude (RA) of activity and gives the start times and average activity values of M10 (i.e. the ten hours with maximal activity) and L5 (i.e. the five hours with least activity). Two functions compute these 'classical' parameters and handle either single or multiple files. Two other functions additionally allow computing an L-value (i.e. the least activity value) for a user-defined time span termed 'Lflex' value. A plotting option is included in all functions. The package can be downloaded from the Comprehensive R Archives Network (CRAN). •The package 'nparACT' for R serves the non-parametric analysis of actigraphy data.•Computed parameters include interdaily stability (IS), intradaily variability (IV) and relative amplitude (RA) as well as start times and average activity during the 10 h with maximal and the 5 h with minimal activity (i.e. M10 and L5).
EM-63 Decay Curve Analysis for UXO Discrimination
2016-06-13
EM-63 Decay Curve Analysis for UXO Discrimination ESTCP Contract # 200035 Final Report NAEVA Geophysics September 7...07 SEP 2001 2. REPORT TYPE 3. DATES COVERED 00-00-2001 to 00-00-2001 4. TITLE AND SUBTITLE EM-63 Decay Curve Analysis for UXO Discrimination ... Discrimination ESTCP Contract # 200035 Final Report NAEVA Geophysics September 7, 2001 Table Of Contents 1 Introduction
Nonparametric statistics for social and behavioral sciences
Kraska-MIller, M
2013-01-01
Introduction to Research in Social and Behavioral SciencesBasic Principles of ResearchPlanning for ResearchTypes of Research Designs Sampling ProceduresValidity and Reliability of Measurement InstrumentsSteps of the Research Process Introduction to Nonparametric StatisticsData AnalysisOverview of Nonparametric Statistics and Parametric Statistics Overview of Parametric Statistics Overview of Nonparametric StatisticsImportance of Nonparametric MethodsMeasurement InstrumentsAnalysis of Data to Determine Association and Agreement Pearson Chi-Square Test of Association and IndependenceContingency
DISCRIMINANT ANALYSIS OF BANK PROFITABILITY LEVELS
Directory of Open Access Journals (Sweden)
Ante Rozga
2013-02-01
Full Text Available Discriminant analysis has been employed in this paper in order to identify and explain key features of bank profitability levels. Bank profitability is set up in the form of two categorical variables: profit or loss recorded and above or below average return on equity. Predictor variables are selected from various groups of financial indicators usually included in the empirical work on microeconomic determinants of bank profitability. The data from the Croatian banking sector is analyzed using the Enter method. General recommendations for a more profitable business of banking found in the bank management literature and existing empirical framework such as rationalization of overhead costs, asset growth, increase of non-interest income by expanding scale and scope of financial products proved to be important for classification of banks in different profitability levels. A higher market share may bring additional advantages. Classification results, canonical correlation and Wilks’ Lambda test confirm statistical significance of research results. Altogether, discriminant analysis turns out to be a suitable statistical method for solving presented research problem and moving forward from the bankruptcy, credit rating or default issues in finance.
Discriminant analysis with errors in variables
Loustau, Sébastien
2012-01-01
The effect of measurement error in discriminant analysis is investigated. Given observations $Z=X+\\epsilon$, where $\\epsilon$ denotes a random noise, the goal is to predict the density of $X$ among two possible candidates $f$ and $g$. We suppose that we have at our disposal two learning samples. The aim is to approach the best possible decision rule $G^*$ defined as a minimizer of the Bayes risk. In the free-noise case $(\\epsilon=0)$, minimax fast rates of convergence are well-known under the margin assumption in discriminant analysis (see \\cite{mammen}) or in the more general classification framework (see \\cite{tsybakov2004,AT}). In this paper we intend to establish similar results in the noisy case, i.e. when dealing with errors in variables. In particular, we discuss two possible complexity assumptions that can be set on the problem, which may alternatively concern the regularity of $f-g$ or the boundary of $G^*$. We prove minimax lower bounds for these both problems and explain how can these rates be atta...
Discrimination in Recruitment: An Empirical Analysis.
Newman, Jerry M.
1978-01-01
To investigate whether recruitment practices of companies with affirmative action programs discriminated against Blacks or resulted in reverse discrimination, qualifications and race of fictitious job applicants were manipulated on resumes sent to a sample of employers. Responses strongly indicate discrimination, with Black applicants favored…
Hussey, Michael A; Koch, Gary G; Preisser, John S; Saville, Benjamin R
2016-01-01
Time-to-event or dichotomous outcomes in randomized clinical trials often have analyses using the Cox proportional hazards model or conditional logistic regression, respectively, to obtain covariate-adjusted log hazard (or odds) ratios. Nonparametric Randomization-Based Analysis of Covariance (NPANCOVA) can be applied to unadjusted log hazard (or odds) ratios estimated from a model containing treatment as the only explanatory variable. These adjusted estimates are stratified population-averaged treatment effects and only require a valid randomization to the two treatment groups and avoid key modeling assumptions (e.g., proportional hazards in the case of a Cox model) for the adjustment variables. The methodology has application in the regulatory environment where such assumptions cannot be verified a priori. Application of the methodology is illustrated through three examples on real data from two randomized trials.
Ford, Eric B; Steffen, Jason H; Carter, Joshua A; Fressin, Francois; Holman, Matthew J; Lissauer, Jack J; Moorhead, Althea V; Morehead, Robert C; Ragozzine, Darin; Rowe, Jason F; Welsh, William F; Allen, Christopher; Batalha, Natalie M; Borucki, William J; Bryson, Stephen T; Buchhave, Lars A; Burke, Christopher J; Caldwell, Douglas A; Charbonneau, David; Clarke, Bruce D; Cochran, William D; Désert, Jean-Michel; Endl, Michael; Everett, Mark E; Fischer, Debra A; Gautier, Thomas N; Gilliland, Ron L; Jenkins, Jon M; Haas, Michael R; Horch, Elliott; Howell, Steve B; Ibrahim, Khadeejah A; Isaacson, Howard; Koch, David G; Latham, David W; Li, Jie; Lucas, Philip; MacQueen, Phillip J; Marcy, Geoffrey W; McCauliff, Sean; Mullally, Fergal R; Quinn, Samuel N; Quintana, Elisa; Shporer, Avi; Still, Martin; Tenenbaum, Peter; Thompson, Susan E; Torres, Guillermo; Twicken, Joseph D; Wohler, Bill
2012-01-01
We present a new method for confirming transiting planets based on the combination of transit timingn variations (TTVs) and dynamical stability. Correlated TTVs provide evidence that the pair of bodies are in the same physical system. Orbital stability provides upper limits for the masses of the transiting companions that are in the planetary regime. This paper describes a non-parametric technique for quantifying the statistical significance of TTVs based on the correlation of two TTV data sets. We apply this method to an analysis of the transit timing variations of two stars with multiple transiting planet candidates identified by Kepler. We confirm four transiting planets in two multiple planet systems based on their TTVs and the constraints imposed by dynamical stability. An additional three candidates in these same systems are not confirmed as planets, but are likely to be validated as real planets once further observations and analyses are possible. If all were confirmed, these systems would be near 4:6:...
Jiménez-Arenas, Juan Manuel; Esquivel, José Antonio
2013-05-10
This study assesses the performance of two analytical approaches to sex discrimination based on single linear variables: discriminant analysis and the Lubischew's test. Ninety individuals from an archaeological population (La Torrecilla-Arenas del Rey, Granada, southern Spain) and 17 craniometrical variables were included in the analyses. Most craniometrical variables were higher for men. The bizygomatic breadth enabled the highest level of discrimination: 87.5% and 88.5%, using discriminant analysis and Lubischew's test, respectively. Bizygomatic breadth proved highly dimorphic in comparison to other populations reported in the literature. Lubischew's test raised the discrimination percentage in specific craniometrical variables, while others showed a superior performance by means of the discriminant analysis. The inconsistent results across statistical methods resulted from the specific formulation of each procedure. Discriminant analysis accounts both for within-group and between-group variance, while Lubischew's test emphasizes between-group variation only. Therefore, both techniques are recommended, as they provide different means of achieving optimal discrimination percentages.
A Non-Parametric and Entropy Based Analysis of the Relationship between the VIX and S&P 500
Directory of Open Access Journals (Sweden)
Abhay K. Singh
2013-10-01
Full Text Available This paper features an analysis of the relationship between the S&P 500 Index and the VIX using daily data obtained from the CBOE website and SIRCA (The Securities Industry Research Centre of the Asia Pacific. We explore the relationship between the S&P 500 daily return series and a similar series for the VIX in terms of a long sample drawn from the CBOE from 1990 to mid 2011 and a set of returns from SIRCA’s TRTH datasets from March 2005 to-date. This shorter sample, which captures the behavior of the new VIX, introduced in 2003, is divided into four sub-samples which permit the exploration of the impact of the Global Financial Crisis. We apply a series of non-parametric based tests utilizing entropy based metrics. These suggest that the PDFs and CDFs of these two return distributions change shape in various subsample periods. The entropy and MI statistics suggest that the degree of uncertainty attached to these distributions changes through time and using the S&P 500 return as the dependent variable, that the amount of information obtained from the VIX changes with time and reaches a relative maximum in the most recent period from 2011 to 2012. The entropy based non-parametric tests of the equivalence of the two distributions and their symmetry all strongly reject their respective nulls. The results suggest that parametric techniques do not adequately capture the complexities displayed in the behavior of these series. This has practical implications for hedging utilizing derivatives written on the VIX.
Directory of Open Access Journals (Sweden)
César Merino Soto
2009-05-01
Full Text Available Resumen:La presente investigación hace un estudio psicométrico de un nuevo sistema de calificación de la Prueba Gestáltica del Bendermodificada para niños, que es el Sistema de Calificación Cualitativa (Brannigan y Brunner, 2002, en un muestra de 244 niñosingresantes a primer grado de primaria en cuatro colegios públicos, ubicados en Lima. El enfoque usado es un análisis noparamétrico de ítems mediante el programa Testgraf (Ramsay, 1991. Los resultados indican niveles apropiados deconsistencia interna, identificándose la unidimensionalidad, y el buen nivel discriminativo de las categorías de calificación deeste Sistema Cualitativo. No se hallaron diferencias demográficas respecto al género ni la edad. Se discuten los presenteshallazgos en el contexto del potencial uso del Sistema de Calificación Cualitativa y del análisis no paramétrico de ítems en lainvestigación psicométrica.AbstracThis research designs a psychometric study of a new scoring system of the Bender Gestalt test modified to children: it is theQualitative Scoring System (Brannigan & Brunner, 2002, in a sample of 244 first grade children of primary level, in four public school of Lima. The approach aplied is the nonparametric item analysis using The test graft computer program (Ramsay, 1991. Our findings point to good levels of internal consistency, unidimensionality and good discriminative level ofthe categories of scoring from the Qualitative Scoring System. There are not demographic differences between gender or age.We discuss our findings within the context of the potential use of the Qualitative Scoring System and of the nonparametricitem analysis approach in the psychometric research.
Stochastic model updating using distance discrimination analysis
Institute of Scientific and Technical Information of China (English)
Deng Zhongmin; Bi Sifeng; Sez Atamturktur
2014-01-01
This manuscript presents a stochastic model updating method, taking both uncertainties in models and variability in testing into account. The updated finite element (FE) models obtained through the proposed technique can aid in the analysis and design of structural systems. The authors developed a stochastic model updating method integrating distance discrimination analysis (DDA) and advanced Monte Carlo (MC) technique to (1) enable more efficient MC by using a response surface model, (2) calibrate parameters with an iterative test-analysis correlation based upon DDA, and (3) utilize and compare different distance functions as correlation metrics. Using DDA, the influence of distance functions on model updating results is analyzed. The proposed sto-chastic method makes it possible to obtain a precise model updating outcome with acceptable cal-culation cost. The stochastic method is demonstrated on a helicopter case study updated using both Euclidian and Mahalanobis distance metrics. It is observed that the selected distance function influ-ences the iterative calibration process and thus, the calibration outcome, indicating that an integra-tion of different metrics might yield improved results.
Nonparametric statistical inference
Gibbons, Jean Dickinson
2010-01-01
Overall, this remains a very fine book suitable for a graduate-level course in nonparametric statistics. I recommend it for all people interested in learning the basic ideas of nonparametric statistical inference.-Eugenia Stoimenova, Journal of Applied Statistics, June 2012… one of the best books available for a graduate (or advanced undergraduate) text for a theory course on nonparametric statistics. … a very well-written and organized book on nonparametric statistics, especially useful and recommended for teachers and graduate students.-Biometrics, 67, September 2011This excellently presente
Hsu, Yu-Han H; Ferl, Gregory Z; Ng, Chee M
2013-05-01
Dynamic contrast-enhanced magnetic resonance imaging (DCE-MRI) is often used to examine vascular function in malignant tumors and noninvasively monitor drug efficacy of antivascular therapies in clinical studies. However, complex numerical methods used to derive tumor physiological properties from DCE-MRI images can be time-consuming and computationally challenging. Recent advancement of computing technology in graphics processing unit (GPU) makes it possible to build an energy-efficient and high-power parallel computing platform for solving complex numerical problems. This study develops the first reported fast GPU-based method for nonparametric kinetic analysis of DCE-MRI data using clinical scans of glioblastoma patients treated with bevacizumab (Avastin®). In the method, contrast agent concentration-time profiles in arterial blood and tumor tissue are smoothed using a robust kernel-based regression algorithm in order to remove artifacts due to patient motion and then deconvolved to produce the impulse response function (IRF). The area under the curve (AUC) and mean residence time (MRT) of the IRF are calculated using statistical moment analysis, and two tumor physiological properties that relate to vascular permeability, volume transfer constant between blood plasma and extravascular extracellular space (K(trans)) and fractional interstitial volume (ve) are estimated using the approximations AUC/MRT and AUC. The most significant feature in this method is the use of GPU-computing to analyze data from more than 60,000 voxels in each DCE-MRI image in parallel fashion. All analysis steps have been automated in a single program script that requires only blood and tumor data as the sole input. The GPU-accelerated method produces K(trans) and ve estimates that are comparable to results from previous studies but reduces computational time by more than 80-fold compared to a previously reported central processing unit-based nonparametric method. Furthermore, it is at
Farcomeni, Alessio; Serranti, Silvia; Bonifazi, Giuseppe
2008-01-01
Glass ceramic detection in glass recycling plants represents a still unsolved problem, as glass ceramic material looks like normal glass and is usually detected only by specialized personnel. The presence of glass-like contaminants inside waste glass products, resulting from both industrial and differentiated urban waste collection, increases process production costs and reduces final product quality. In this paper an innovative approach for glass ceramic recognition, based on the non-parametric analysis of infrared spectra, is proposed and investigated. The work was specifically addressed to the spectral classification of glass and glass ceramic fragments collected in an actual recycling plant from three different production lines: flat glass, colored container-glass and white container-glass. The analyses, carried out in the near and mid-infrared (NIR-MIR) spectral field (1280-4480 nm), show that glass ceramic and glass fragments can be recognized by applying a wavelet transform, with a small classification error. Moreover, a method for selecting only a small subset of relevant wavelength ratios is suggested, allowing the conduct of a fast recognition of the two classes of materials. The results show how the proposed approach can be utilized to develop a classification engine to be integrated inside a hardware and software sorting architecture for fast "on-line" ceramic glass recognition and separation.
Directory of Open Access Journals (Sweden)
Kimihiro Noguchi
2012-09-01
Full Text Available Longitudinal data from factorial experiments frequently arise in various fields of study, ranging from medicine and biology to public policy and sociology. In most practical situations, the distribution of observed data is unknown and there may exist a number of atypical measurements and outliers. Hence, use of parametric and semi-parametric procedures that impose restrictive distributional assumptions on observed longitudinal samples becomes questionable. This, in turn, has led to a substantial demand for statistical procedures that enable us to accurately and reliably analyze longitudinal measurements in factorial experiments with minimal conditions on available data, and robust nonparametric methodology offering such a possibility becomes of particular practical importance. In this article, we introduce a new R package nparLD which provides statisticians and researchers from other disciplines an easy and user-friendly access to the most up-to-date robust rank-based methods for the analysis of longitudinal data in factorial settings. We illustrate the implemented procedures by case studies from dentistry, biology, and medicine.
Pataky, Todd C; Vanrenterghem, Jos; Robinson, Mark A
2015-05-01
Biomechanical processes are often manifested as one-dimensional (1D) trajectories. It has been shown that 1D confidence intervals (CIs) are biased when based on 0D statistical procedures, and the non-parametric 1D bootstrap CI has emerged in the Biomechanics literature as a viable solution. The primary purpose of this paper was to clarify that, for 1D biomechanics datasets, the distinction between 0D and 1D methods is much more important than the distinction between parametric and non-parametric procedures. A secondary purpose was to demonstrate that a parametric equivalent to the 1D bootstrap exists in the form of a random field theory (RFT) correction for multiple comparisons. To emphasize these points we analyzed six datasets consisting of force and kinematic trajectories in one-sample, paired, two-sample and regression designs. Results showed, first, that the 1D bootstrap and other 1D non-parametric CIs were qualitatively identical to RFT CIs, and all were very different from 0D CIs. Second, 1D parametric and 1D non-parametric hypothesis testing results were qualitatively identical for all six datasets. Last, we highlight the limitations of 1D CIs by demonstrating that they are complex, design-dependent, and thus non-generalizable. These results suggest that (i) analyses of 1D data based on 0D models of randomness are generally biased unless one explicitly identifies 0D variables before the experiment, and (ii) parametric and non-parametric 1D hypothesis testing provide an unambiguous framework for analysis when one׳s hypothesis explicitly or implicitly pertains to whole 1D trajectories.
Classification tree analysis for the discrimination of pleural exudates and transudates.
Esquerda, Aureli; Trujillano, Javier; López de Ullibarri, Ignacio; Bielsa, Silvia; Madroñero, Ana B; Porcel, José M
2007-01-01
Classification and regression tree (CART) analysis is a non-parametric technique suitable for the generation of clinical decision rules. We have studied the performance of CART analysis in the separation of pleural exudates and transudates. Basic demographic, radiologic and laboratory data were retrospectively evaluated in 1257 pleural effusions (204 transudates and 1053 exudates, according to standard clinical criteria) and submitted for CART analysis. The model's discriminative ability was compared with that of Light's criteria, in both the original formulation and an abbreviated version, i.e., deleting the pleural fluid (PF)/serum lactate dehydrogenase (LDH) ratio from the triad. A first CART model built starting from all available data identified PF/serum protein ratio and PF LDH ratios as the two best discriminatory parameters. This algorithm achieved a sensitivity of 96.8%, slightly lower than that of classical Light's criteria (98.5%) and comparable to that of the abbreviated Light's criteria (97.0%), and significantly better specificity (85.3%) compared to both classical (74.0%) and abbreviated (79.4%) Light's criteria. A second CART model developed after excluding serum measurements selected PF protein and PF LDH as the most discriminatory variables, and correctly classified 97.2% of exudates and 77.0% of transudates. CART-based algorithms can efficiently discriminate between pleural exudates and transudates.
National Research Council Canada - National Science Library
Midtbøen, Arnfinn H; Rogstad, Jon
2012-01-01
... of discrimination in the labour market as well as to the mechanisms involved in discriminatory hiring practices. The design has several advantages compared to -‘single-method’ approaches and provides a more substantial understanding of the processes leading to ethnic inequality in the labour market.
PRICE DISCRIMINATION AND MARKET POWER: A THEORETICAL ANALYSIS
Directory of Open Access Journals (Sweden)
Olga Smirnova
2015-07-01
Full Text Available This paper analyzes the contemporary theoretical and empirical research in the field of impact assessment of market power and conclusions about the possibilities of the company to implement price discrimination in different market structures. The results of the analysis allow to evaluate current approaches to antitrust regulation of price discrimination.
Guccio, Calogero; Martorana, Marco Ferdinando; Mazza, Isidoro
2017-01-01
The paper assesses the evolution of efficiency of Italian public universities for the period 2000-2010. It aims at investigating whether their levels of efficiency showed signs of convergence, and if the well-known disparity between northern and southern regions decreased. For this purpose, we use a refinement of data envelopment analysis, namely…
Non-parametric group-level statistics for source-resolved ERP analysis.
Lee, Clement; Miyakoshi, Makoto; Delorme, Arnaud; Cauwenberghs, Gert; Makeig, Scott
2015-01-01
We have developed a new statistical framework for group-level event-related potential (ERP) analysis in EEGLAB. The framework calculates the variance of scalp channel signals accounted for by the activity of homogeneous clusters of sources found by independent component analysis (ICA). When ICA data decomposition is performed on each subject's data separately, functionally equivalent ICs can be grouped into EEGLAB clusters. Here, we report a new addition (statPvaf) to the EEGLAB plug-in std_envtopo to enable inferential statistics on main effects and interactions in event related potentials (ERPs) of independent component (IC) processes at the group level. We demonstrate the use of the updated plug-in on simulated and actual EEG data.
Nonparametric Bayesian Inference for Mean Residual Life Functions in Survival Analysis
Poynor, Valerie; Kottas, Athanasios
2014-01-01
Modeling and inference for survival analysis problems typically revolves around different functions related to the survival distribution. Here, we focus on the mean residual life function which provides the expected remaining lifetime given that a subject has survived (i.e., is event-free) up to a particular time. This function is of direct interest in reliability, medical, and actuarial fields. In addition to its practical interpretation, the mean residual life function characterizes the sur...
A Bayesian Predictive Discriminant Analysis with Screened Data
Hea-Jung Kim
2015-01-01
In the application of discriminant analysis, a situation sometimes arises where individual measurements are screened by a multidimensional screening scheme. For this situation, a discriminant analysis with screened populations is considered from a Bayesian viewpoint, and an optimal predictive rule for the analysis is proposed. In order to establish a flexible method to incorporate the prior information of the screening mechanism, we propose a hierarchical screened scale mixture of normal (HSS...
Cloud type discrimination via multispectral textural analysis
Lamei, Niloufar; Crawford, Melba M.; Hutchison, Keith D.; Khazenie, Nahid
1993-09-01
One of the primary interests in digital image processing is the development of robust methods to perform feature detection, extraction, and classification. Until recently, classification methods for cloud discrimination were mainly based on the spectral information of the imagery. However, because of the spectral similarities of certain features (such as ice clouds and snow) and the effects of atmospheric attenuation, multi-spectral rule based classifications do not necessarily produce accurate feature discrimination. Spectral homogeneity of two different features within a scene can lead to misclassification. Furthermore, the opposite problem can occur when one feature exhibits different spectral signatures locally but is homogeneous in its cyclic spatial variation. The exploration of spatial information is often advantageous in these discrimination problems. A texture-based method for feature identification has been investigated. This method uses a set of localized spatial filters known as two dimensional Gabor functions. Gabor filters can be described as a sinusoidal plane wave within a two-dimensional Gaussian envelope. The frequency and orientation of the sine plane and the width of the Gaussian envelope are determined by the Gabor parameters. These tunable channels yield joint optimal information both in the spatial and the frequency domains. The new method has been applied to the thermal channels of the NOAA-advanced very high resolution radiometer (AVHRR) data for cloud type discrimination.
Energy Technology Data Exchange (ETDEWEB)
Ford, Eric B.; /Florida U.; Fabrycky, Daniel C.; /Lick Observ.; Steffen, Jason H.; /Fermilab; Carter, Joshua A.; /Harvard-Smithsonian Ctr. Astrophys.; Fressin, Francois; /Harvard-Smithsonian Ctr. Astrophys.; Holman, Matthew J.; /Harvard-Smithsonian Ctr. Astrophys.; Lissauer, Jack J.; /NASA, Ames; Moorhead, Althea V.; /Florida U.; Morehead, Robert C.; /Florida U.; Ragozzine, Darin; /Harvard-Smithsonian Ctr. Astrophys.; Rowe, Jason F.; /NASA, Ames /SETI Inst., Mtn. View /San Diego State U., Astron. Dept.
2012-01-01
We present a new method for confirming transiting planets based on the combination of transit timing variations (TTVs) and dynamical stability. Correlated TTVs provide evidence that the pair of bodies are in the same physical system. Orbital stability provides upper limits for the masses of the transiting companions that are in the planetary regime. This paper describes a non-parametric technique for quantifying the statistical significance of TTVs based on the correlation of two TTV data sets. We apply this method to an analysis of the transit timing variations of two stars with multiple transiting planet candidates identified by Kepler. We confirm four transiting planets in two multiple planet systems based on their TTVs and the constraints imposed by dynamical stability. An additional three candidates in these same systems are not confirmed as planets, but are likely to be validated as real planets once further observations and analyses are possible. If all were confirmed, these systems would be near 4:6:9 and 2:4:6:9 period commensurabilities. Our results demonstrate that TTVs provide a powerful tool for confirming transiting planets, including low-mass planets and planets around faint stars for which Doppler follow-up is not practical with existing facilities. Continued Kepler observations will dramatically improve the constraints on the planet masses and orbits and provide sensitivity for detecting additional non-transiting planets. If Kepler observations were extended to eight years, then a similar analysis could likely confirm systems with multiple closely spaced, small transiting planets in or near the habitable zone of solar-type stars.
Quantal Response: Nonparametric Modeling
2017-01-01
spline N−spline Fig. 3 Logistic regression 7 Approved for public release; distribution is unlimited. 5. Nonparametric QR Models Nonparametric linear ...stimulus and probability of response. The Generalized Linear Model approach does not make use of the limit distribution but allows arbitrary functional...7. Conclusions and Recommendations 18 8. References 19 Appendix A. The Linear Model 21 Appendix B. The Generalized Linear Model 33 Appendix C. B
Directory of Open Access Journals (Sweden)
Shanshan eLi
2016-01-01
Full Text Available Independent Component analysis (ICA is a widely used technique for separating signals that have been mixed together. In this manuscript, we propose a novel ICA algorithm using density estimation and maximum likelihood, where the densities of the signals are estimated via p-spline based histogram smoothing and the mixing matrix is simultaneously estimated using an optimization algorithm. The algorithm is exceedingly simple, easy to implement and blind to the underlying distributions of the source signals. To relax the identically distributed assumption in the density function, a modified algorithm is proposed to allow for different density functions on different regions. The performance of the proposed algorithm is evaluated in different simulation settings. For illustration, the algorithm is applied to a research investigation with a large collection of resting state fMRI datasets. The results show that the algorithm successfully recovers the established brain networks.
Directory of Open Access Journals (Sweden)
Ignacio Santamaría
2008-04-01
Full Text Available This paper treats the identification of nonlinear systems that consist of a cascade of a linear channel and a nonlinearity, such as the well-known Wiener and Hammerstein systems. In particular, we follow a supervised identification approach that simultaneously identifies both parts of the nonlinear system. Given the correct restrictions on the identification problem, we show how kernel canonical correlation analysis (KCCA emerges as the logical solution to this problem. We then extend the proposed identification algorithm to an adaptive version allowing to deal with time-varying systems. In order to avoid overfitting problems, we discuss and compare three possible regularization techniques for both the batch and the adaptive versions of the proposed algorithm. Simulations are included to demonstrate the effectiveness of the presented algorithm.
Score-moment combined linear discrimination analysis (SMC-LDA) as an improved discrimination method.
Han, Jintae; Chung, Hoeil; Han, Sung-Hwan; Yoon, Moon-Young
2007-01-01
A new discrimination method called the score-moment combined linear discrimination analysis (SMC-LDA) has been developed and its performance has been evaluated using three practical spectroscopic datasets. The key concept of SMC-LDA was to use not only the score from principal component analysis (PCA), but also the moment of the spectrum, as inputs for LDA to improve discrimination. Along with conventional score, moment is used in spectroscopic fields as an effective alternative for spectral feature representation. Three different approaches were considered. Initially, the score generated from PCA was projected onto a two-dimensional feature space by maximizing Fisher's criterion function (conventional PCA-LDA). Next, the same procedure was performed using only moment. Finally, both score and moment were utilized simultaneously for LDA. To evaluate discrimination performances, three different spectroscopic datasets were employed: (1) infrared (IR) spectra of normal and malignant stomach tissue, (2) near-infrared (NIR) spectra of diesel and light gas oil (LGO) and (3) Raman spectra of Chinese and Korean ginseng. For each case, the best discrimination results were achieved when both score and moment were used for LDA (SMC-LDA). Since the spectral representation character of moment was different from that of score, inclusion of both score and moment for LDA provided more diversified and descriptive information.
Introduction to nonparametric statistics for the biological sciences using R
MacFarland, Thomas W
2016-01-01
This book contains a rich set of tools for nonparametric analyses, and the purpose of this supplemental text is to provide guidance to students and professional researchers on how R is used for nonparametric data analysis in the biological sciences: To introduce when nonparametric approaches to data analysis are appropriate To introduce the leading nonparametric tests commonly used in biostatistics and how R is used to generate appropriate statistics for each test To introduce common figures typically associated with nonparametric data analysis and how R is used to generate appropriate figures in support of each data set The book focuses on how R is used to distinguish between data that could be classified as nonparametric as opposed to data that could be classified as parametric, with both approaches to data classification covered extensively. Following an introductory lesson on nonparametric statistics for the biological sciences, the book is organized into eight self-contained lessons on various analyses a...
Small visible energy scalar top iterative discriminant analysis
Indian Academy of Sciences (India)
A Sopczak; A Finch; A Freitas; C Milsténe; M Schimtt
2007-12-01
Light scalar top quarks with a small mass difference with respect to the neutralino mass are of particular cosmological interest. This study uses an iterative discriminant analysis method to optimize the expected selection efficiency at the international linear collider (ILC).
DEFF Research Database (Denmark)
Tan, Qihua; Zhao, J H; Iachine, I
2004-01-01
This report investigates the power issue in applying the non-parametric linkage analysis of affected sib-pairs (ASP) [Kruglyak and Lander, 1995: Am J Hum Genet 57:439-454] to localize genes that contribute to human longevity using long-lived sib-pairs. Data were simulated by introducing a recently...... developed statistical model for measuring marker-longevity associations [Yashin et al., 1999: Am J Hum Genet 65:1178-1193], enabling direct power comparison between linkage and association approaches. The non-parametric linkage (NPL) scores estimated in the region harboring the causal allele are evaluated...... in case of a dominant effect. Although the power issue may depend heavily on the true genetic nature in maintaining survival, our study suggests that results from small-scale sib-pair investigations should be referred with caution, given the complexity of human longevity....
Discrete Discriminant analysis based on tree-structured graphical models
DEFF Research Database (Denmark)
Perez de la Cruz, Gonzalo; Eslava, Guillermina
The purpose of this paper is to illustrate the potential use of discriminant analysis based on tree{structured graphical models for discrete variables. This is done by comparing its empirical performance using estimated error rates for real and simulated data. The results show that discriminant...... analysis based on tree{structured graphical models is a simple nonlinear method competitive with, and sometimes superior to, other well{known linear methods like those assuming mutual independence between variables and linear logistic regression....
Discrete Discriminant analysis based on tree-structured graphical models
DEFF Research Database (Denmark)
Perez de la Cruz, Gonzalo; Eslava, Guillermina
The purpose of this paper is to illustrate the potential use of discriminant analysis based on tree{structured graphical models for discrete variables. This is done by comparing its empirical performance using estimated error rates for real and simulated data. The results show that discriminant a...... analysis based on tree{structured graphical models is a simple nonlinear method competitive with, and sometimes superior to, other well{known linear methods like those assuming mutual independence between variables and linear logistic regression....
On discriminant analysis techniques and correlation structures in high dimensions
DEFF Research Database (Denmark)
Clemmensen, Line Katrine Harder
This paper compares several recently proposed techniques for performing discriminant analysis in high dimensions, and illustrates that the various sparse methods dier in prediction abilities depending on their underlying assumptions about the correlation structures in the data. The techniques...... generally focus on two things: Obtaining sparsity (variable selection) and regularizing the estimate of the within-class covariance matrix. For high-dimensional data, this gives rise to increased interpretability and generalization ability over standard linear discriminant analysis. Here, we group...
Directory of Open Access Journals (Sweden)
Archer Kellie J
2008-02-01
Full Text Available Abstract Background With the popularity of DNA microarray technology, multiple groups of researchers have studied the gene expression of similar biological conditions. Different methods have been developed to integrate the results from various microarray studies, though most of them rely on distributional assumptions, such as the t-statistic based, mixed-effects model, or Bayesian model methods. However, often the sample size for each individual microarray experiment is small. Therefore, in this paper we present a non-parametric meta-analysis approach for combining data from independent microarray studies, and illustrate its application on two independent Affymetrix GeneChip studies that compared the gene expression of biopsies from kidney transplant recipients with chronic allograft nephropathy (CAN to those with normal functioning allograft. Results The simulation study comparing the non-parametric meta-analysis approach to a commonly used t-statistic based approach shows that the non-parametric approach has better sensitivity and specificity. For the application on the two CAN studies, we identified 309 distinct genes that expressed differently in CAN. By applying Fisher's exact test to identify enriched KEGG pathways among those genes called differentially expressed, we found 6 KEGG pathways to be over-represented among the identified genes. We used the expression measurements of the identified genes as predictors to predict the class labels for 6 additional biopsy samples, and the predicted results all conformed to their pathologist diagnosed class labels. Conclusion We present a new approach for combining data from multiple independent microarray studies. This approach is non-parametric and does not rely on any distributional assumptions. The rationale behind the approach is logically intuitive and can be easily understood by researchers not having advanced training in statistics. Some of the identified genes and pathways have been
Kong, Xiangrong; Mas, Valeria; Archer, Kellie J
2008-02-26
With the popularity of DNA microarray technology, multiple groups of researchers have studied the gene expression of similar biological conditions. Different methods have been developed to integrate the results from various microarray studies, though most of them rely on distributional assumptions, such as the t-statistic based, mixed-effects model, or Bayesian model methods. However, often the sample size for each individual microarray experiment is small. Therefore, in this paper we present a non-parametric meta-analysis approach for combining data from independent microarray studies, and illustrate its application on two independent Affymetrix GeneChip studies that compared the gene expression of biopsies from kidney transplant recipients with chronic allograft nephropathy (CAN) to those with normal functioning allograft. The simulation study comparing the non-parametric meta-analysis approach to a commonly used t-statistic based approach shows that the non-parametric approach has better sensitivity and specificity. For the application on the two CAN studies, we identified 309 distinct genes that expressed differently in CAN. By applying Fisher's exact test to identify enriched KEGG pathways among those genes called differentially expressed, we found 6 KEGG pathways to be over-represented among the identified genes. We used the expression measurements of the identified genes as predictors to predict the class labels for 6 additional biopsy samples, and the predicted results all conformed to their pathologist diagnosed class labels. We present a new approach for combining data from multiple independent microarray studies. This approach is non-parametric and does not rely on any distributional assumptions. The rationale behind the approach is logically intuitive and can be easily understood by researchers not having advanced training in statistics. Some of the identified genes and pathways have been reported to be relevant to renal diseases. Further study on the
Face Recognition Using Double Sparse Local Fisher Discriminant Analysis
Directory of Open Access Journals (Sweden)
Zhan Wang
2015-01-01
Full Text Available Local Fisher discriminant analysis (LFDA was proposed for dealing with the multimodal problem. It not only combines the idea of locality preserving projections (LPP for preserving the local structure of the high-dimensional data but also combines the idea of Fisher discriminant analysis (FDA for obtaining the discriminant power. However, LFDA also suffers from the undersampled problem as well as many dimensionality reduction methods. Meanwhile, the projection matrix is not sparse. In this paper, we propose double sparse local Fisher discriminant analysis (DSLFDA for face recognition. The proposed method firstly constructs a sparse and data-adaptive graph with nonnegative constraint. Then, DSLFDA reformulates the objective function as a regression-type optimization problem. The undersampled problem is avoided naturally and the sparse solution can be obtained by adding the regression-type problem to a l1 penalty. Experiments on Yale, ORL, and CMU PIE face databases are implemented to demonstrate the effectiveness of the proposed method.
EEG based Autism Diagnosis Using Regularized Fisher Linear Discriminant Analysis
Directory of Open Access Journals (Sweden)
Mahmoud I. Kamel
2012-04-01
Full Text Available Diagnosis of autism is one of the difficult problems facing researchers. To reveal the discriminative pattern between autistic and normal children via electroencephalogram (EEG analysis is a big challenge. The feature extraction is averaged Fast Fourier Transform (FFT with the Regulated Fisher Linear Discriminant (RFLD classifier. Gaussinaty condition for the optimality of Regulated Fisher Linear Discriminant (RFLD has been achieved by a well-conditioned appropriate preprocessing of the data, as well as optimal shrinkage technique for the Lambda parameter. Winsorised Filtered Data gave the best result.
Discrimination analysis of mass spectrometry proteomics for ovarian cancer detection
Institute of Scientific and Technical Information of China (English)
Yan-jun HONG; Xiao-dan WANG; David SHEN; Su ZENG
2008-01-01
Aim:A discrimination analysis has been explored for the probabilistic classifica-tion of healthy versus ovarian cancer serum samples using proteomics data from mass spectrometry (MS).Methods:The method employs data normalization,clustering,and a linear discriminant analysis on surface-enhanced laser desorp-tion ionization (SELDI) time-of-flight MS data.The probabilistic classification method computes the optimal linear discriminant using the complex human blood serum SELDI spectra.Cross-validation and training/testing data-split experi-ments are conducted to verify the optimal discriminant and demonstrate the accu-racy and robustness of the method.Results:The cluster discrimination method achieves excellent performance.The sensitivity,specificity,and positive predic-tive values are above 97% on ovarian cancer.The protein fraction peaks,which significantly contribute to the classification,can be available from the analysis process.Conclusion:The discrimination analysis helps the molecular identities of differentially expressed proteins and peptides between the healthy and ovarian patients.
Kernel-based fisher discriminant analysis for hyperspectral target detection
Institute of Scientific and Technical Information of China (English)
GU Yan-feng; ZHANG Ye; YOU Di
2007-01-01
A new method based on kernel Fisher discriminant analysis (KFDA) is proposed for target detection of hyperspectral images. The KFDA combines kernel mapping derived from support vector machine and the classical linear Fisher discriminant analysis (LFDA), and it possesses good ability to process nonlinear data such as hyperspectral images. According to the Fisher rule that the ratio of the between-class and within-class scatters is maximized, the KFDA is used to obtain a set of optimal discriminant basis vectors in high dimensional feature space. All pixels in the hyperspectral images are projected onto the discriminant basis vectors and the target detection is performed according to the projection result. The numerical experiments are performed on hyperspectral data with 126 bands collected by Airborne Visible/Infrared Imaging Spectrometer (AVIRIS). The experimental results show the effectiveness of the proposed detection method and prove that this method has good ability to overcome small sample size and spectral variability in the hyperspectral target detection.
Ford, Eric B.; Fabrycky, Daniel C.; Steffen, Jason H.; Carter, Joshua A.; Fressin, Francois; Holman, Matthew Jon; Lissauer, Jack J.; Moorhead, Althea V.; Morehead, Robert C.; Ragozzine, Darin; Rowe, Jason F.; Welsh, William F.; Allen, Christopher; Batalha, Natalie M.; Borucki, William J.
2012-01-01
We present a new method for confirming transiting planets based on the combination of transit timingn variations (TTVs) and dynamical stability. Correlated TTVs provide evidence that the pair of bodies are in the same physical system. Orbital stability provides upper limits for the masses of the transiting companions that are in the planetary regime. This paper describes a non-parametric technique for quantifying the statistical significance of TTVs based on the correlation of two TTV data se...
Application of Discriminant Analysis on Romanian Insurance Market
Directory of Open Access Journals (Sweden)
Constantin Anghelache
2008-11-01
Full Text Available Discriminant analysis is a supervised learning technique that can be used in order to determine which variables are the best predictors of the classification of objects belonging to a population into predetermined classes. At the same time, discriminant analysis provides a powerful tool that enables researchers to make predictions regarding the classification of new objects into predefined classes. The main goal of discriminant analysis is to determine which of the N descriptive variables have the most discriminatory power, that is, which of them are the most relevant for the classification of objects into classes. In order to classify objects, we need a mathematical model that provides the rules for optimal allocation. This is the classifier. In this paper we will discuss three of the most important models of classification: the Bayesian criterion, the Mahalanobis criterion and the Fisher criterion. In this paper, we will use discriminant analysis to classify the insurance companies that operated on the Romanian market in 2006. We have selected a number of eigth (8 relevant variables: gross written premium (GR_WRI_PRE, net mathematical reserves (NET_M_PES, gross claims paid (GR_CL_PAID, net premium reserves (NET_PRE_RES, net claim reserves (NET_CL_RES, net income (NE—_INCOME, share capital (SHARE_CAP and gross written premium ceded in Reinsurance (GR_WRI_PRE_CED. Before proceeding to discriminant analysis, we performed cluster analysis on the initial data in order to identify classes (clusters that emerge from the data.
Nonparametric statistical methods
Hollander, Myles; Chicken, Eric
2013-01-01
Praise for the Second Edition"This book should be an essential part of the personal library of every practicing statistician."-Technometrics Thoroughly revised and updated, the new edition of Nonparametric Statistical Methods includes additional modern topics and procedures, more practical data sets, and new problems from real-life situations. The book continues to emphasize the importance of nonparametric methods as a significant branch of modern statistics and equips readers with the conceptual and technical skills necessary to select and apply the appropriate procedures for any given sit
High-Order Supervised Discriminant Analysis for Visual Data
Institute of Scientific and Technical Information of China (English)
Xiao-Ling Xia; Hang-Hui Huang
2014-01-01
In practical applications, we often have to deal with high-order data, for example, a grayscale image and a video clip are intrinsically a 2nd-order tensor and a 3rd-order tensor, respectively. In order to satisty these high-order data, it is conventional to vectorize these data in advance, which often destroys the intrinsic structures of the data and includes the curse of dimensionality. For this reason, we consider the problem of high-order data representation and classification, and propose a tensor based fisher discriminant analysis (FDA), which is a generalized version of FDA, named as GFDA. Experimental results show our GFDA outperforms the existing methods, such as the 2-directional 2-dimensional principal component analysis ((2D)2PCA), 2-directional 2-dimensional linear discriminant analysis ((2D)2LDA), and multilinear discriminant analysis (MDA), in high-order data classification under a lower compression ratio.
Discriminant analysis for repeated measures data: a review
Directory of Open Access Journals (Sweden)
Lisa Lix
2010-09-01
Full Text Available Discriminant analysis (DA encompasses procedures for classifying observations into groups (i.e., predictive discriminative analysis and describing the relative importance of variables for distinguishing amongst groups (i.e., descriptive discriminative analysis. In recent years, a number of developments have occurred in DA procedures for the analysis of data from repeated measures designs. Specifically, DA procedures have been developed for repeated measures data characterized by missing observations and/or unbalanced measurement occasions, as well as high-dimensional data in which measurements are collected repeatedly on two or more variables. This paper reviews the literature on DA procedures for univariate and multivariate repeated measures data, focusing on covariance pattern and linear mixed-effects models. A numeric example illustrates their implementation using SAS software.
Principal Component Clustering Approach to Teaching Quality Discriminant Analysis
Xian, Sidong; Xia, Haibo; Yin, Yubo; Zhai, Zhansheng; Shang, Yan
2016-01-01
Teaching quality is the lifeline of the higher education. Many universities have made some effective achievement about evaluating the teaching quality. In this paper, we establish the Students' evaluation of teaching (SET) discriminant analysis model and algorithm based on principal component clustering analysis. Additionally, we classify the SET…
A Bayesian Predictive Discriminant Analysis with Screened Data
Directory of Open Access Journals (Sweden)
Hea-Jung Kim
2015-09-01
Full Text Available In the application of discriminant analysis, a situation sometimes arises where individual measurements are screened by a multidimensional screening scheme. For this situation, a discriminant analysis with screened populations is considered from a Bayesian viewpoint, and an optimal predictive rule for the analysis is proposed. In order to establish a flexible method to incorporate the prior information of the screening mechanism, we propose a hierarchical screened scale mixture of normal (HSSMN model, which makes provision for flexible modeling of the screened observations. An Markov chain Monte Carlo (MCMC method using the Gibbs sampler and the Metropolis–Hastings algorithm within the Gibbs sampler is used to perform a Bayesian inference on the HSSMN models and to approximate the optimal predictive rule. A simulation study is given to demonstrate the performance of the proposed predictive discrimination procedure.
Facial Affect Recognition Using Regularized Discriminant Analysis-Based Algorithms
Directory of Open Access Journals (Sweden)
Cheng-Yuan Shih
2010-01-01
Full Text Available This paper presents a novel and effective method for facial expression recognition including happiness, disgust, fear, anger, sadness, surprise, and neutral state. The proposed method utilizes a regularized discriminant analysis-based boosting algorithm (RDAB with effective Gabor features to recognize the facial expressions. Entropy criterion is applied to select the effective Gabor feature which is a subset of informative and nonredundant Gabor features. The proposed RDAB algorithm uses RDA as a learner in the boosting algorithm. The RDA combines strengths of linear discriminant analysis (LDA and quadratic discriminant analysis (QDA. It solves the small sample size and ill-posed problems suffered from QDA and LDA through a regularization technique. Additionally, this study uses the particle swarm optimization (PSO algorithm to estimate optimal parameters in RDA. Experiment results demonstrate that our approach can accurately and robustly recognize facial expressions.
Energy Technology Data Exchange (ETDEWEB)
Peterson, James T.
1999-12-01
Natural resource professionals are increasingly required to develop rigorous statistical models that relate environmental data to categorical responses data. Recent advances in the statistical and computing sciences have led to the development of sophisticated methods for parametric and nonparametric analysis of data with categorical responses. The statistical software package CATDAT was designed to make some of these relatively new and powerful techniques available to scientists. The CATDAT statistical package includes 4 analytical techniques: generalized logit modeling; binary classification tree; extended K-nearest neighbor classification; and modular neural network.
Lee, L.; Helsel, D.
2007-01-01
Analysis of low concentrations of trace contaminants in environmental media often results in left-censored data that are below some limit of analytical precision. Interpretation of values becomes complicated when there are multiple detection limits in the data-perhaps as a result of changing analytical precision over time. Parametric and semi-parametric methods, such as maximum likelihood estimation and robust regression on order statistics, can be employed to model distributions of multiply censored data and provide estimates of summary statistics. However, these methods are based on assumptions about the underlying distribution of data. Nonparametric methods provide an alternative that does not require such assumptions. A standard nonparametric method for estimating summary statistics of multiply-censored data is the Kaplan-Meier (K-M) method. This method has seen widespread usage in the medical sciences within a general framework termed "survival analysis" where it is employed with right-censored time-to-failure data. However, K-M methods are equally valid for the left-censored data common in the geosciences. Our S-language software provides an analytical framework based on K-M methods that is tailored to the needs of the earth and environmental sciences community. This includes routines for the generation of empirical cumulative distribution functions, prediction or exceedance probabilities, and related confidence limits computation. Additionally, our software contains K-M-based routines for nonparametric hypothesis testing among an unlimited number of grouping variables. A primary characteristic of K-M methods is that they do not perform extrapolation and interpolation. Thus, these routines cannot be used to model statistics beyond the observed data range or when linear interpolation is desired. For such applications, the aforementioned parametric and semi-parametric methods must be used.
Discrimination and numerical analysis of human pathogenic ...
African Journals Online (AJOL)
SERVER
2008-02-19
Feb 19, 2008 ... Numerical analysis of whole-cell protein profiles of all strains revealed 2 .... average linkage method and correlation coefficient distance. ... distance yielded a dendrogam, consisting of two basic .... Candida glabrata: review of.
Visual category recognition using Spectral Regression and Kernel Discriminant Analysis
Tahir, M.A.; Kittler, J.; Mikolajczyk, K.; Yan, F.; van de Sande, K.E.A.; Gevers, T.
2009-01-01
Visual category recognition (VCR) is one of the most important tasks in image and video indexing. Spectral methods have recently emerged as a powerful tool for dimensionality reduction and manifold learning. Recently, Spectral Regression combined with Kernel Discriminant Analysis (SR-KDA) has been s
Nonparametric Predictive Regression
Ioannis Kasparis; Elena Andreou; Phillips, Peter C.B.
2012-01-01
A unifying framework for inference is developed in predictive regressions where the predictor has unknown integration properties and may be stationary or nonstationary. Two easily implemented nonparametric F-tests are proposed. The test statistics are related to those of Kasparis and Phillips (2012) and are obtained by kernel regression. The limit distribution of these predictive tests holds for a wide range of predictors including stationary as well as non-stationary fractional and near unit...
A Critical Analysis of Anti-Discrimination Law and Microaggressions in Academia
Lukes, Robin; Bangs, Joann
2014-01-01
This article provides a critical analysis of microaggressions and anti-discrimination law in academia. There are many challenges for faculty claiming discrimination under current civil rights laws. Examples of microaggressions that fall outside of anti-discrimination law will be provided. Traditional legal analysis of discrimination will not end…
Kernel-Based Nonlinear Discriminant Analysis for Face Recognition
Institute of Scientific and Technical Information of China (English)
LIU QingShan (刘青山); HUANG Rui (黄锐); LU HanQing (卢汉清); MA SongDe (马颂德)
2003-01-01
Linear subspace analysis methods have been successfully applied to extract features for face recognition. But they are inadequate to represent the complex and nonlinear variations of real face images, such as illumination, facial expression and pose variations, because of their linear properties. In this paper, a nonlinear subspace analysis method, Kernel-based Nonlinear Discriminant Analysis (KNDA), is presented for face recognition, which combines the nonlinear kernel trick with the linear subspace analysis method - Fisher Linear Discriminant Analysis (FLDA).First, the kernel trick is used to project the input data into an implicit feature space, then FLDA is performed in this feature space. Thus nonlinear discriminant features of the input data are yielded. In addition, in order to reduce the computational complexity, a geometry-based feature vectors selection scheme is adopted. Another similar nonlinear subspace analysis is Kernel-based Principal Component Analysis (KPCA), which combines the kernel trick with linear Principal Component Analysis (PCA). Experiments are performed with the polynomial kernel, and KNDA is compared with KPCA and FLDA. Extensive experimental results show that KNDA can give a higher recognition rate than KPCA and FLDA.
ODVBA: optimally-discriminative voxel-based analysis.
Zhang, Tianhao; Davatzikos, Christos
2011-08-01
Gaussian smoothing of images prior to applying voxel-based statistics is an important step in voxel-based analysis and statistical parametric mapping (VBA-SPM) and is used to account for registration errors, to Gaussianize the data and to integrate imaging signals from a region around each voxel. However, it has also become a limitation of VBA-SPM based methods, since it is often chosen empirically and lacks spatial adaptivity to the shape and spatial extent of the region of interest, such as a region of atrophy or functional activity. In this paper, we propose a new framework, named optimally-discriminative voxel-based analysis (ODVBA), for determining the optimal spatially adaptive smoothing of images, followed by applying voxel-based group analysis. In ODVBA, nonnegative discriminative projection is applied regionally to get the direction that best discriminates between two groups, e.g., patients and controls; this direction is equivalent to local filtering by an optimal kernel whose coefficients define the optimally discriminative direction. By considering all the neighborhoods that contain a given voxel, we then compose this information to produce the statistic for each voxel. Finally, permutation tests are used to obtain a statistical parametric map of group differences. ODVBA has been evaluated using simulated data in which the ground truth is known and with data from an Alzheimer's disease (AD) study. The experimental results have shown that the proposed ODVBA can precisely describe the shape and location of structural abnormality.
Gao, Oliver H.; Holmén, Britt A.; Niemeier, Debbie A.
The Ozone Weekend Effect (OWE) has become increasingly more frequent and widespread in southern California since the mid-1970s. Although a number of hypotheses have been suggested to explain the effect, there remains uncertainty associated with the root factors contributing to elevated weekend ozone concentrations. Targeting the time window of the 1997 Southern California Ozone Study (SCOS97), this paper examines traffic activity data for 14 vehicle classes at 27 weigh-in-motion (WIM) stations in southern California. Nonparametric factorial analyses of light-duty vehicle (LDV) and heavy-duty truck (HDT) traffic volumes indicate significant differences in daily volumes by day of week and between the weekly patterns of daily LDV and HDT volumes. Across WIM stations, the daily LDV volume was highest on Friday and decreased by 10% on weekends compared to that on midweek days. In contrast, daily HDT volumes showed dramatic weekend drops of 53% on Saturday and 64% on Sunday. As a result, LDV to HDT ratios increased by 145% on weekends. Nonparametric tests also suggest that weekly traffic patterns varied significantly between WIM stations located close to (central) and far from (peripheral) the Los Angeles Metro area. Weekend increases in LDV/HDT ratios were more pronounced at central WIM sites due to greater weekend declines of HDT relative to LDV traffic. The implications of these weekly traffic patterns for the OWE in southern California were investigated by estimating daily WIM traffic on-road running exhaust emissions of total organic gas (TOG) and oxides of nitrogen (NO x) using EMFAC2002 emission factors. The results support the California Air Resource Board's (CARB's) NO x reduction hypothesis that greater weekend NO x reductions relative to volatile organic compound (VOC) emissions, in combinations with the VOC-limited ozone system, contribute to the OWE observed in the region. The results from this study can be used to develop weekend on-road mobile emission
Insulation Defects Discrimination in GIS by Fisher Discriminant Analysis of Partial Discharge
Institute of Scientific and Technical Information of China (English)
DING Dengwei; GAO Wensheng; LIU Weidong
2013-01-01
To monitor the condition of online power apparatus accurately and provide appropriate guidance on their maintenance,a fundamental ultra-high frequency (UHF) database of partial discharges corresponding to different types of defects is presented for the observation of insulation state of gas insulated switchgear (GIS).In order to imitate the defects in a GIS online,five types of typical partial discharge (PD) sources (i.e.floating metal,protrusion,bouncing particles,void in spacer,and particle on the surface of solid insulation) were designed and fabricated.A wideband UHF sensor and an amplifier were applied to obtain UHF signals.Based on the characteristics of the five different defects,nine meaningful parameters which were independent of the phase of the applied voltage were extracted from the PD samples and then discussed.In this paper,Fisher discriminant analysis (FDA),a pattern recognition algorithm,was applied for the purpose of classifying the defects.In this way,there was no need to set complicated parameters and kernel function,and the total discrimination accuracy of test samples was 97.6％.It indicates that based on the nine especial parameters,the typical defects in GIS can be identified exactly by means of FDA.Hence the analysis method could be possibly regarded as a universal classification method.
Multi spectral imaging analysis for meat spoilage discrimination
DEFF Research Database (Denmark)
Christiansen, Asger Nyman; Carstensen, Jens Michael; Papadopoulou, Olga
) was performed in parallel with videometer image snapshots and sensory analysis. Odour and colour characteristics of meat were determined by a test panel and attributed into three pre-characterized quality classes, namely Fresh; Semi Fresh and Spoiled during the days of its shelf life. So far, different...... classification methods: Naive Bayes Classifier as a reference model, Canonical Discriminant Analysis (CDA) and Support Vector Classification (SVC). As the final step, generalization of the models was performed using k-fold validation (k=10). Results showed that image analysis provided good discrimination of meat...... samples regarding the spoilage process as evaluated from sensory as well as from microbiological data. The support vector classification (SVC) model outperformed other models. Specifically, the misclassification error rate (MER), derived from odour characteristics, was 18% for both aerobic and MAP meat...
Forensic discrimination of dyed hair color: II. Multivariate statistical analysis.
Barrett, Julie A; Siegel, Jay A; Goodpaster, John V
2011-01-01
This research is intended to assess the ability of UV-visible microspectrophotometry to successfully discriminate the color of dyed hair. Fifty-five red hair dyes were analyzed and evaluated using multivariate statistical techniques including agglomerative hierarchical clustering (AHC), principal component analysis (PCA), and discriminant analysis (DA). The spectra were grouped into three classes, which were visually consistent with different shades of red. A two-dimensional PCA observations plot was constructed, describing 78.6% of the overall variance. The wavelength regions associated with the absorbance of hair and dye were highly correlated. Principal components were selected to represent 95% of the overall variance for analysis with DA. A classification accuracy of 89% was observed for the comprehensive dye set, while external validation using 20 of the dyes resulted in a prediction accuracy of 75%. Significant color loss from successive washing of hair samples was estimated to occur within 3 weeks of dye application.
Non-parametric approach to the study of phenotypic stability.
Ferreira, D F; Fernandes, S B; Bruzi, A T; Ramalho, M A P
2016-02-19
The aim of this study was to undertake the theoretical derivations of non-parametric methods, which use linear regressions based on rank order, for stability analyses. These methods were extension different parametric methods used for stability analyses and the result was compared with a standard non-parametric method. Intensive computational methods (e.g., bootstrap and permutation) were applied, and data from the plant-breeding program of the Biology Department of UFLA (Minas Gerais, Brazil) were used to illustrate and compare the tests. The non-parametric stability methods were effective for the evaluation of phenotypic stability. In the presence of variance heterogeneity, the non-parametric methods exhibited greater power of discrimination when determining the phenotypic stability of genotypes.
Multiatlas segmentation as nonparametric regression.
Awate, Suyash P; Whitaker, Ross T
2014-09-01
This paper proposes a novel theoretical framework to model and analyze the statistical characteristics of a wide range of segmentation methods that incorporate a database of label maps or atlases; such methods are termed as label fusion or multiatlas segmentation. We model these multiatlas segmentation problems as nonparametric regression problems in the high-dimensional space of image patches. We analyze the nonparametric estimator's convergence behavior that characterizes expected segmentation error as a function of the size of the multiatlas database. We show that this error has an analytic form involving several parameters that are fundamental to the specific segmentation problem (determined by the chosen anatomical structure, imaging modality, registration algorithm, and label-fusion algorithm). We describe how to estimate these parameters and show that several human anatomical structures exhibit the trends modeled analytically. We use these parameter estimates to optimize the regression estimator. We show that the expected error for large database sizes is well predicted by models learned on small databases. Thus, a few expert segmentations can help predict the database sizes required to keep the expected error below a specified tolerance level. Such cost-benefit analysis is crucial for deploying clinical multiatlas segmentation systems.
Nonparametric correlation models for portfolio allocation
DEFF Research Database (Denmark)
Aslanidis, Nektarios; Casas, Isabel
2013-01-01
breaks in correlations. Only when correlations are constant does the parametric DCC model deliver the best outcome. The methodologies are illustrated by evaluating two interesting portfolios. The first portfolio consists of the equity sector SPDRs and the S&P 500, while the second one contains major......This article proposes time-varying nonparametric and semiparametric estimators of the conditional cross-correlation matrix in the context of portfolio allocation. Simulations results show that the nonparametric and semiparametric models are best in DGPs with substantial variability or structural...... currencies. Results show the nonparametric model generally dominates the others when evaluating in-sample. However, the semiparametric model is best for out-of-sample analysis....
A Censored Nonparametric Software Reliability Model
Institute of Scientific and Technical Information of China (English)
无
2006-01-01
This paper analyses the effct of censoring on the estimation of failure rate, and presents a framework of a censored nonparametric software reliability model. The model is based on nonparametric testing of failure rate monotonically decreasing and weighted kernel failure rate estimation under the constraint of failure rate monotonically decreasing. Not only does the model have the advantages of little assumptions and weak constraints, but also the residual defects number of the software system can be estimated. The numerical experiment and real data analysis show that the model performs well with censored data.
Multi spectral imaging analysis for meat spoilage discrimination
DEFF Research Database (Denmark)
Christiansen, Asger Nyman; Carstensen, Jens Michael; Papadopoulou, Olga
with corresponding sensory data would be of great interest. The purpose of this research was to produce a method capable of quantifying and/or predicting the spoilage status (e.g. express in TVC counts as well as on sensory evaluation) using a multi spectral image of a meat sample and thereby avoid any time...... classification methods: Naive Bayes Classifier as a reference model, Canonical Discriminant Analysis (CDA) and Support Vector Classification (SVC). As the final step, generalization of the models was performed using k-fold validation (k=10). Results showed that image analysis provided good discrimination of meat...... samples. In the case where all data were taken together the misclassification error amounted to 16%. When spoilage status was based on visual sensory data, the model produced a MER of 22% for the combined dataset. These results suggest that it is feasible to employ a multi spectral image...
Quark/gluon jet discrimination: a reproducible analysis using R
CERN. Geneva
2017-01-01
The power to discriminate between light-quark jets and gluon jets would have a huge impact on many searches for new physics at CERN and beyond. This talk will present a walk-through of the development of a prototype machine learning classifier for differentiating between quark and gluon jets at experiments like those at the Large Hadron Collider at CERN. A new fast feature selection method that combines information theory and graph analytics will be outlined. This method has found new variables that promise significant improvements in discrimination power. The prototype jet tagger is simple, interpretable, parsimonious, and computationally extremely cheap, and therefore might be suitable for use in trigger systems for real-time data processing. Nested stratified k-fold cross validation was used to generate robust estimates of model performance. The data analysis was performed entirely in the R statistical programming language, and is fully reproducible. The entire analysis workflow is data-driven, automated a...
Multiple Kernel Learning in Fisher Discriminant Analysis for Face Recognition
Directory of Open Access Journals (Sweden)
Xiao-Zhang Liu
2013-02-01
Full Text Available Recent applications and developments based on support vector machines (SVMs have shown that using multiple kernels instead of a single one can enhance classifier performance. However, there are few reports on performance of the kernel‐based Fisher discriminant analysis (kernel‐based FDA method with multiple kernels. This paper proposes a multiple kernel construction method for kernel‐based FDA. The constructed kernel is a linear combination of several base kernels with a constraint on their weights. By maximizing the margin maximization criterion (MMC, we present an iterative scheme for weight optimization. The experiments on the FERET and CMU PIE face databases show that, our multiple kernel Fisher discriminant analysis (MKFD achieves high recognition performance, compared with single‐kernel‐based FDA. The experiments also show that the constructed kernel relaxes parameter selection for kernel‐based FDA to some extent.
Discrimination and content analysis of fritillaria using near infrared spectroscopy.
Meng, Yu; Wang, Shisheng; Cai, Rui; Jiang, Bohai; Zhao, Weijie
2015-01-01
Fritillaria is a traditional Chinese herbal medicine which can be used to moisten the lungs. The objective of this study is to develop simple, accurate, and solvent-free methods to discriminate and quantify Fritillaria herbs from seven different origins. Near infrared spectroscopy (NIRS) methods are established for the rapid discrimination of seven different Fritillaria samples and quantitative analysis of their total alkaloids. The scaling to first range method and the partial least square (PLS) method are used for the establishment of qualitative and quantitative analysis models. As a result of evaluation for the qualitative NIR model, the selectivity values between groups are always above 2, and the mistaken judgment rate of fifteen samples in prediction sets was zero. This means that the NIR model can be used to distinguish different species of Fritillaria herbs. The established quantitative NIR model can accurately predict the content of total alkaloids from Fritillaria samples.
二维PCA非参数子空间分析的人脸识别算法%Face Recognition Algorithm of 2DPCA Nonparametric Subspace Analysis
Institute of Scientific and Technical Information of China (English)
王美; 梁久祯
2011-01-01
This paper proposes a novel face recognition algorithm of 2D Nonparametric Subspace Analysis(2DNSA) based on 2D Principal Componet Analysis(2DPCA) subspace. The original face matrices are performed to have feature dimension reduction, and the reduced feature matrices are used as a new training set, which can be conducted by 2D non-parametric subspace analysis. This method not only can reduce feature dimensions by 2DPCA, but also consider the impact of boundary samples for classification by taking full advantage of classification capacity of 2DNSA, which avoids the irrationality of using class centers to measure the distances of different classes. Experimental results on the two face databases(namely Yale and LARGE) show the improvements of the developed new algorithm over the traditional subspace methods such as (2D)2PCA, 2DPCA, (2D)2LDA, 2DLDA, 2DPCA+2DLDA, 2DNSA, etc.%提出一种结合二维PCA(2DPCA)的二维非参数子空间分析(2DNSA)人脸识别算法.利用2DPCA对原始图像矩阵进行特征降维,以降维后的特征为训练样本,进行二维非参数判别分析,并综合考虑类边界样本对分类的影响,采用2DNSA实现更合理的特征提取.基于Yale、LARGE人脸数据库的实验结果表明,与(2D)2pCA、2DPCA、(2D)2LDA、2DLDA、2DPCA+2DLDA、2DNSA算法相比,该算法性能更优.
BUSINESS FAILURE PREDICTION FOR ROMANIAN SMES USING MULTIVARIATE DISCRIMINANT ANALYSIS
2012-01-01
Business failure prediction is one of special importance for small and medium sized enterprises (SMEs) due to their increased vulnerability. Consequently, the purpose of this paper is to investigate the utility of financial ratios and other non-financial variables to predict business failure using a sample of Romanian SMEs and applying multiple discriminant analysis. The process that leads to failure is analyzed on a three year time horizon prior to failure and the results showed that failure...
A Comparison of Parametric and Non-Parametric Methods Applied to a Likert Scale.
Mircioiu, Constantin; Atkinson, Jeffrey
2017-05-10
A trenchant and passionate dispute over the use of parametric versus non-parametric methods for the analysis of Likert scale ordinal data has raged for the past eight decades. The answer is not a simple "yes" or "no" but is related to hypotheses, objectives, risks, and paradigms. In this paper, we took a pragmatic approach. We applied both types of methods to the analysis of actual Likert data on responses from different professional subgroups of European pharmacists regarding competencies for practice. Results obtained show that with "large" (>15) numbers of responses and similar (but clearly not normal) distributions from different subgroups, parametric and non-parametric analyses give in almost all cases the same significant or non-significant results for inter-subgroup comparisons. Parametric methods were more discriminant in the cases of non-similar conclusions. Considering that the largest differences in opinions occurred in the upper part of the 4-point Likert scale (ranks 3 "very important" and 4 "essential"), a "score analysis" based on this part of the data was undertaken. This transformation of the ordinal Likert data into binary scores produced a graphical representation that was visually easier to understand as differences were accentuated. In conclusion, in this case of Likert ordinal data with high response rates, restraining the analysis to non-parametric methods leads to a loss of information. The addition of parametric methods, graphical analysis, analysis of subsets, and transformation of data leads to more in-depth analyses.
Afshinpour, Babak; Hossein-Zadeh, Gholam-Ali; Soltanian-Zadeh, Hamid
2008-06-30
Unknown low frequency fluctuations called "trend" are observed in noisy time-series measured for different applications. In some disciplines, they carry primary information while in other fields such as functional magnetic resonance imaging (fMRI) they carry nuisance effects. In all cases, however, it is necessary to estimate them accurately. In this paper, a method for estimating trend in the presence of fractal noise is proposed and applied to fMRI time-series. To this end, a partly linear model (PLM) is fitted to each time-series. The parametric and nonparametric parts of PLM are considered as contributions of hemodynamic response and trend, respectively. Using the whitening property of wavelet transform, the unknown components of the model are estimated in the wavelet domain. The results of the proposed method are compared to those of other parametric trend-removal approaches such as spline and polynomial models. It is shown that the proposed method improves activation detection and decreases variance of the estimated parameters relative to the other methods.
Directory of Open Access Journals (Sweden)
Lars Ängquist
2008-01-01
Full Text Available In this article we try to discuss nonparametric linkage (NPL score functions within a broad and quite general framework. The main focus of the paper is the structure, derivation principles and interpretations of the score function entity itself. We define and discuss several families of one-locus score function definitions, i.e. the implicit, explicit and optimal ones. Some generalizations and comments to the two-locus, unconditional and conditional, cases are included as well. Although this article mainly aims at serving as an overview, where the concept of score functions are put into a covering context, we generalize the noncentrality parameter (NCP optimal score functions in Ängquist et al. (2007 to facilitate—through weighting—for incorporation of several plausible distinct genetic models. Since the genetic model itself most oftenly is to some extent unknown this facilitates weaker prior assumptions with respect to plausible true disease models without loosing the property of NCP-optimality. Moreover, we discuss general assumptions and properties of score functions in the above sense. For instance, the concept of identical by descent (IBD sharing structures and score function equivalence are discussed in some detail.
Multivariable Discriminant Analysis for the Differential Diagnosis of Microcytic Anemia
Directory of Open Access Journals (Sweden)
Eloísa Urrechaga
2013-01-01
Full Text Available Introduction. Iron deficiency anemia and thalassemia are the most common causes of microcytic anemia. Powerful statistical computer programming enables sensitive discriminant analyses to aid in the diagnosis. We aimed at investigating the performance of the multiple discriminant analysis (MDA to the differential diagnosis of microcytic anemia. Methods. The training group was composed of 200 β-thalassemia carriers, 65 α-thalassemia carriers, 170 iron deficiency anemia (IDA, and 45 mixed cases of thalassemia and acute phase response or iron deficiency. A set of potential predictor parameters that could detect differences among groups were selected: Red Blood Cells (RBC, hemoglobin (Hb, mean cell volume (MCV, mean cell hemoglobin (MCH, and RBC distribution width (RDW. The functions obtained with MDA analysis were applied to a set of 628 consecutive patients with microcytic anemia. Results. For classifying patients into two groups (genetic anemia and acquired anemia, only one function was needed; 87.9% β-thalassemia carriers, and 83.3% α-thalassemia carriers, and 72.1% in the mixed group were correctly classified. Conclusion. Linear discriminant functions based on hemogram data can aid in differentiating between IDA and thalassemia, so samples can be efficiently selected for further analysis to confirm the presence of genetic anemia.
Quantum discriminant analysis for dimensionality reduction and classification
Cong, Iris; Duan, Luming
2016-07-01
We present quantum algorithms to efficiently perform discriminant analysis for dimensionality reduction and classification over an exponentially large input data set. Compared with the best-known classical algorithms, the quantum algorithms show an exponential speedup in both the number of training vectors M and the feature space dimension N. We generalize the previous quantum algorithm for solving systems of linear equations (2009 Phys. Rev. Lett. 103 150502) to efficiently implement a Hermitian chain product of k trace-normalized N ×N Hermitian positive-semidefinite matrices with time complexity of O({log}(N)). Using this result, we perform linear as well as nonlinear Fisher discriminant analysis for dimensionality reduction over M vectors, each in an N-dimensional feature space, in time O(p {polylog}({MN})/{ε }3), where ɛ denotes the tolerance error, and p is the number of principal projection directions desired. We also present a quantum discriminant analysis algorithm for data classification with time complexity O({log}({MN})/{ε }3).
Rapid discrimination between four seagrass species using hybrid analysis.
Osathanunkul, M; Madesis, P; Ounjai, S; Suwannapoom, C; Jampeetong, A
2015-04-27
Biological species are traditionally identified based on their morphological features and the correct identification of species is critical in biological studies. However, some plant types, such as seagrass, are taxonomically problematic and difficult to identify. Furthermore, closely related seagrass species, such as Halophila spp, form a taxonomically unresolved complex. Although some seagrass taxa are easy to recognize, most species are difficult to identify without skilled taxonomic or molecular techniques. Barcoding coupled with High Resolution Melting analysis (BAR-HRM) offers a potentially reliable, rapid, and cost-effective method to confirm species. Here, DNA information of two chloroplast loci was used in combination with HRM analysis to discriminate four species of seagrass collected off the southern coast of Thailand. A distinct melting curve presenting one inflection point was generated for each species using rbcL primers. While the melting profiles of Cymodocea rotundata and Cymodocea serrulata were not statistically different, analysis of the normalized HRM curves produced with the rpoC primers allowed for their discrimination. The Bar-HRM technique showed promise in discriminating seagrass species and with further adaptations and improvements, could make for an effective and power tool for confirming seagrass species.
How Are Teachers Teaching? A Nonparametric Approach
De Witte, Kristof; Van Klaveren, Chris
2014-01-01
This paper examines which configuration of teaching activities maximizes student performance. For this purpose a nonparametric efficiency model is formulated that accounts for (1) self-selection of students and teachers in better schools and (2) complementary teaching activities. The analysis distinguishes both individual teaching (i.e., a…
Decompounding random sums: A nonparametric approach
DEFF Research Database (Denmark)
Hansen, Martin Bøgsted; Pitts, Susan M.
review a number of applications and consider the nonlinear inverse problem of inferring the cumulative distribution function of the components in the random sum. We review the existing literature on non-parametric approaches to the problem. The models amenable to the analysis are generalized considerably...
How Are Teachers Teaching? A Nonparametric Approach
De Witte, Kristof; Van Klaveren, Chris
2014-01-01
This paper examines which configuration of teaching activities maximizes student performance. For this purpose a nonparametric efficiency model is formulated that accounts for (1) self-selection of students and teachers in better schools and (2) complementary teaching activities. The analysis distinguishes both individual teaching (i.e., a…
Ding, Hao; Cao, Ming; DuPont, Andrew W.; Scott, Larry D.; Guha, Sushovan; Singhal, Shashideep; Younes, Mamoun; Pence, Isaac; Herline, Alan; Schwartz, David; Xu, Hua; Mahadevan-Jansen, Anita; Bi, Xiaohong
2016-03-01
Inflammatory bowel disease (IBD) is an idiopathic disease that is typically characterized by chronic inflammation of the gastrointestinal tract. Recently much effort has been devoted to the development of novel diagnostic tools that can assist physicians for fast, accurate, and automated diagnosis of the disease. Previous research based on Raman spectroscopy has shown promising results in differentiating IBD patients from normal screening cases. In the current study, we examined IBD patients in vivo through a colonoscope-coupled Raman system. Optical diagnosis for IBD discrimination was conducted based on full-range spectra using multivariate statistical methods. Further, we incorporated several feature selection methods in machine learning into the classification model. The diagnostic performance for disease differentiation was significantly improved after feature selection. Our results showed that improved IBD diagnosis can be achieved using Raman spectroscopy in combination with multivariate analysis and feature selection.
Directory of Open Access Journals (Sweden)
Meguro,Tadamichi
1978-10-01
Full Text Available With the parameters of a flow-volume and a volume-time curve, the discriminant analysis of bronchial asthma is described. The subjects were classified into three groups (healthy adults, mild asthmatic patients and moderates ones. The difference of the mean vectors of the parameters of the three groups was made clear by the selection methods of the discriminant analysis between any two of the groups both with 6 parameters (%FVC, FEV1.0%, peak flow rate (PF, flow rate at 50% of FVC (V50, flow rate at 25% of FVC (V25, and V50/V25 and with 8 (6 parameters mentioned above and V75, V10. Forced expiratory volume in 1 second percent (FEV1.0% or V50 was selected at the first step with 6 parameters, and V75 was selected at the first step with 8 parameters. Probabilities of misclassification with 8 parameters were lower than those with 6 ones and the probability of misclassification at the discriminant analysis between healthy adults and mild asthmatic patients with 8 parameters was 15.75% at the final step.
Use of discriminant analysis to identify propensity for purchasing properties
Directory of Open Access Journals (Sweden)
Ricardo Floriani
2015-03-01
Full Text Available Properties usually represent a milestone for people and families due to the high added-value when compared with family income. The objective of this study is the proposition of a discrimination model, by a discriminant analysis of people with characteristics (according to independent variables classified as potential buyers of properties, as well as to identify the interest in the use of such property, if it will be assigned to housing or leisure activities such as a cottage or beach house, and/or for investment. Thus, the following research question is proposed: What are the characteristics that better describe the profile of people which intend to acquire properties? The study justifies itself by its economic relevance in the real estate industry, as well as to the players of the real estate Market that may develop products based on the profile of potential customers. As a statistical technique, discriminant analysis was applied to the data gathered by questionnaire, which was sent via e-mail. Three hundred and thirty four responses were gathered. Based on this study, it was observed that it is possible to identify the intention for acquired properties, as well the purpose for acquiring it, for housing or investments.
Discriminating dysplasia: Optical tomographic texture analysis of colorectal polyps.
Li, Wenqi; Coats, Maria; Zhang, Jianguo; McKenna, Stephen J
2015-12-01
Optical projection tomography enables 3-D imaging of colorectal polyps at resolutions of 5-10 µm. This paper investigates the ability of image analysis based on 3-D texture features to discriminate diagnostic levels of dysplastic change from such images, specifically, low-grade dysplasia, high-grade dysplasia and invasive cancer. We build a patch-based recognition system and evaluate both multi-class classification and ordinal regression formulations on a 90 polyp dataset. 3-D texture representations computed with a hand-crafted feature extractor, random projection, and unsupervised image filter learning are compared using a bag-of-words framework. We measure performance in terms of error rates, F-measures, and ROC surfaces. Results demonstrate that randomly projected features are effective. Discrimination was improved by carefully manipulating various important aspects of the system, including class balancing, output calibration and approximation of non-linear kernels.
A Direct Estimation Approach to Sparse Linear Discriminant Analysis
Cai, Tony
2011-01-01
This paper considers sparse linear discriminant analysis of high-dimensional data. In contrast to the existing methods which are based on separate estimation of the precision matrix $\\O$ and the difference $\\de$ of the mean vectors, we introduce a simple and effective classifier by estimating the product $\\O\\de$ directly through constrained $\\ell_1$ minimization. The estimator can be implemented efficiently using linear programming and the resulting classifier is called the linear programming discriminant (LPD) rule. The LPD rule is shown to have desirable theoretical and numerical properties. It exploits the approximate sparsity of $\\O\\de$ and as a consequence allows cases where it can still perform well even when $\\O$ and/or $\\de$ cannot be estimated consistently. Asymptotic properties of the LPD rule are investigated and consistency and rate of convergence results are given. The LPD classifier has superior finite sample performance and significant computational advantages over the existing methods that req...
Sex determination by discriminant function analysis of lumbar vertebrae.
Ostrofsky, Kelly R; Churchill, Steven E
2015-01-01
Sex determination is critical for developing the biological profile of unidentified skeletal remains. When more commonly used elements (os coxa, cranium) for sexing are not available, methods utilizing other skeletal elements are needed. This study aims to assess the degree of sexual dimorphism of the lumbar vertebrae and develop discriminant functions for sex determination from them, using a sample of South African blacks from the Raymond A. Dart Collection (47 males, 51 females). Eleven variables at each lumbar level were subjected to univariate and multivariate discriminant function analyses. Univariate equations produced classification rates ranging from 57.7% to 83.5%, with the highest accuracies associated with dimensions of the vertebral body. Multivariate stepwise analysis generated classification rates ranging from 75.9% to 88.7%. These results are comparable to other methods for sexing the skeleton and indicate that measures of the lumbar vertebrae can be used as an effective tool for sex determination.
Sparse discriminant analysis for breast cancer biomarker identification and classification
Institute of Scientific and Technical Information of China (English)
Yu Shi; Daoqing Dai; Chaochun Liu; Hong Yan
2009-01-01
Biomarker identification and cancer classification are two important procedures in microarray data analysis. We propose a novel uni-fied method to carry out both tasks. We first preselect biomarker candidates by eliminating unrelated genes through the BSS/WSS ratio filter to reduce computational cost, and then use a sparse discriminant analysis method for simultaneous biomarker identification and cancer classification. Moreover, we give a mathematical justification about automatic biomarker identification. Experimental results show that the proposed method can identify key genes that have been verified in biochemical or biomedical research and classify the breast cancer type correctly.
Nonparametric Bayesian inference in biostatistics
Müller, Peter
2015-01-01
As chapters in this book demonstrate, BNP has important uses in clinical sciences and inference for issues like unknown partitions in genomics. Nonparametric Bayesian approaches (BNP) play an ever expanding role in biostatistical inference from use in proteomics to clinical trials. Many research problems involve an abundance of data and require flexible and complex probability models beyond the traditional parametric approaches. As this book's expert contributors show, BNP approaches can be the answer. Survival Analysis, in particular survival regression, has traditionally used BNP, but BNP's potential is now very broad. This applies to important tasks like arrangement of patients into clinically meaningful subpopulations and segmenting the genome into functionally distinct regions. This book is designed to both review and introduce application areas for BNP. While existing books provide theoretical foundations, this book connects theory to practice through engaging examples and research questions. Chapters c...
Nonparametric tests for censored data
Bagdonavicus, Vilijandas; Nikulin, Mikhail
2013-01-01
This book concerns testing hypotheses in non-parametric models. Generalizations of many non-parametric tests to the case of censored and truncated data are considered. Most of the test results are proved and real applications are illustrated using examples. Theories and exercises are provided. The incorrect use of many tests applying most statistical software is highlighted and discussed.
Visual Tracking via Feature Tensor Multimanifold Discriminate Analysis
Directory of Open Access Journals (Sweden)
Ting-quan Deng
2014-01-01
Full Text Available In the visual tracking scenarios, if there are multiple objects, due to the interference of similar objects, tracking may fail in the progress of occlusion to separation. To address this problem, this paper proposed a visual tracking algorithm with discrimination through multimanifold learning. Color-gradient-based feature tensor was used to describe object appearance for accommodation of partial occlusion. A prior multimanifold tensor dataset is established through the template matching tracking algorithm. For the purpose of discrimination, tensor distance was defined to determine the intramanifold and intermanifold neighborhood relationship in multimanifold space. Then multimanifold discriminate analysis was employed to construct multilinear projection matrices of submanifolds. Finally, object states were obtained by combining with sequence inference. Meanwhile, the multimanifold dataset and manifold learning embedded projection should be updated online. Experiments were conducted on two real visual surveillance sequences to evaluate the proposed algorithm with three state-of-the-art tracking methods qualitatively and quantitatively. Experimental results show that the proposed algorithm can achieve effective and robust effect in multi-similar-object mutual occlusion scenarios.
Ananiev, Jovan; Poposka, Zaneta
2013-01-01
Discrimination which is evident in in companies in the Republic of Macedonia is mostly done by the owners or management and on the ground of personal status, gender, ethnicity and age and mostly in the form of harassment and direct discrimination.
Isokinetic evaluation of knee muscles in soccer players: discriminant analysis
Directory of Open Access Journals (Sweden)
Bruno Fles Mazuquin
2015-10-01
Full Text Available ABSTRACTIntroduction:Muscle activity in soccer players can be measured by isokinetic dynamometer, which is a reliable tool for assessing human performance.Objectives:To perform isokinetic analyses and to determine which variables differentiate the under-17 (U17 soccer category from the professional (PRO.Methods:Thirty four players were assessed (n=17 for each category. The isokinetic variables used for the knee extension-flexion analysis were: peak torque (Nm, total work (J, average power (W, angle of peak torque (deg., agonist/ antagonist ratio (%, measured for three velocities (60°/s, 120°/s and 300°/s, with each series containing five repetitions. Three Wilks' Lambda discriminant analyses were performed, to identify which variables were more significant for the definition of each of the categories.Results:The discriminative variables at 60°/s in the PRO category were: extension peak torque, flexion total work, extension average power and agonist/antagonist ratio; and for the U17s were: extension total work, flexion peak torque and flexion average power. At 120°/s for the PRO category the discriminant variables were: flexion peak torque and extension average power; for the U17s they were: extension total work and flexion average power. Finally at 300°/s, the variables found in the PRO and U17 categories respectively were: extension average power and extension total work.Conclusion:Isokinetic variables for flexion and extension knee muscles were able to significantly discriminate between PRO and U17 soccer players.
Asymptotic theory of nonparametric regression estimates with censored data
Institute of Scientific and Technical Information of China (English)
施沛德; 王海燕; 张利华
2000-01-01
For regression analysis, some useful Information may have been lost when the responses are right censored. To estimate nonparametric functions, several estimates based on censored data have been proposed and their consistency and convergence rates have been studied in literat黵e, but the optimal rates of global convergence have not been obtained yet. Because of the possible Information loss, one may think that it is impossible for an estimate based on censored data to achieve the optimal rates of global convergence for nonparametric regression, which were established by Stone based on complete data. This paper constructs a regression spline estimate of a general nonparametric regression f unction based on right-censored response data, and proves, under some regularity condi-tions, that this estimate achieves the optimal rates of global convergence for nonparametric regression. Since the parameters for the nonparametric regression estimate have to be chosen based on a data driven criterion, we also obtai
2nd Conference of the International Society for Nonparametric Statistics
Manteiga, Wenceslao; Romo, Juan
2016-01-01
This volume collects selected, peer-reviewed contributions from the 2nd Conference of the International Society for Nonparametric Statistics (ISNPS), held in Cádiz (Spain) between June 11–16 2014, and sponsored by the American Statistical Association, the Institute of Mathematical Statistics, the Bernoulli Society for Mathematical Statistics and Probability, the Journal of Nonparametric Statistics and Universidad Carlos III de Madrid. The 15 articles are a representative sample of the 336 contributed papers presented at the conference. They cover topics such as high-dimensional data modelling, inference for stochastic processes and for dependent data, nonparametric and goodness-of-fit testing, nonparametric curve estimation, object-oriented data analysis, and semiparametric inference. The aim of the ISNPS 2014 conference was to bring together recent advances and trends in several areas of nonparametric statistics in order to facilitate the exchange of research ideas, promote collaboration among researchers...
A Comparative Study: Globality versus Locality for Graph Construction in Discriminant Analysis
Directory of Open Access Journals (Sweden)
Bo Yang
2014-01-01
Discriminant Analysis (GmGcDA just based on globality alone, GmLcDA, and LmGcDA, we suggest that the joint of locally constructed intraclass and globally constructed interclass graphs is more discriminant.
Gazmeh, Meisam; Bahreini, Maryam; Tavassoli, Seyed Hassan
2015-01-01
In the laser drilling of teeth, a microplasma is generated which may be utilized for elemental analysis of ablated tissue via a laser-induced breakdown spectroscopy (LIBS) technique. In this study, LIBS is used to investigate the possibility of discrimination of healthy and carious tooth tissues. This possibility is examined using multivariate statistical analysis called partial least square discriminant analysis (PLS-DA) based on atomic and ionic emission lines of teeth LIBS spectra belonging to P, Ca, Mg, Zn, K, Sr, C, Na, H, and O elements. Results show an excellent discrimination and prediction of unknown tooth tissues. It is shown that using the PLS-DA method, the spectroscopic analysis of plasma emission during the laser drilling, would be a promising technique for caries detection.
Wu, Quan-Jin; Dong, Qing-Hua; Sun, Wei-Jiang; Huang, Yan; Wang, Qiong-Qiong; Zhou, Wei-Long
2014-09-24
This study aimed to construct objective and accurate analytical models of tea categories based on their polyphenols and caffeine. A total of 522 tea samples of 4 commonly consumed teas with different fermentation degrees (green tea, white tea, oolong tea, and black tea) were analyzed by high-performance liquid chromatography (HPLC) coupled with spectrophotometry, utilizing ISO 14502, as analytical tools. The content of polyphenols and caffeine varied significantly according to differently fermented teas, indicating that these active constituents may discriminate fermentation degrees effectively. By principal component analysis (PCA) and stepwise linear discriminant analysis (S-LDA), the vast majority of tea samples could be successfully differentiated according to their chemical markers. This study yielded three discriminant functions with the capacity to simultaneously discriminate the four tea categories with a 97.8% correct rate. In classification of oolong and other teas, there were one discriminant function and two equations with best discriminant capacity. Furthermore, the classification of different degrees of fermentation of oolong and external validation achieved the desired results. It is suggested that polyphenols and caffeine are the distinct variables to establish internationally recognized models of teas.
Fluorometric Discrimination Technique of Phytoplankton Population Based on Wavelet Analysis
Institute of Scientific and Technical Information of China (English)
ZHANG Shanshan; SU Rongguo; DUAN Yali; ZHANG Cui; SONG Zhijie; WANG Xiulin
2012-01-01
The discrete excitation-emission-matrix fluorescence spectra(EEMS)at 12 excitation wavelengths (400,430,450,460,470,490,500,510,525,550,570,and 590 nm)and emission wavelengths ranging from 600-750 nm were determined for 43 phytoplankton species.A two-rank fluorescence spectra database was established by wavelet analysis and a fluorometric discrimination technique for determining phytoplankton population was developed.For laboratory simulatively mixed samples,the samples mixed from 43 algal species(the algae of one division accounted for 25％,50％,75％,85％,and 100％ of the gross biomass,respectively),the average discrimination rates at the level of division were 65.0％,87.5％,98.6％,99.0％,and 99.1％,with average relative contents of 18.9％,44.5％,68.9％,73.4％,and 82.9％,respectively;the samples mixed from 32 red tide algal species(the dominant species accounted for 60％,70％,80％,90％,and 100％ of the gross biomass,respectively),the average correct discrimination rates of the dominant species at the level of genus were 63.3％,74.2％,78.8％,83.4％,and 79.4％,respectively.For the 81 laboratory mixed samples with the dominant species accounting for 75％ of the gross biomass(chlorophyll),the discrimination rates of the dominant species were 95.1％ and 72.8％ at the level of division and genus,respectively.For the 12 samples collected from the mesocosm experiment in Maidao Bay of Qingdao in August 2007,the dominant species of the 11 samples were recognized at the division level and the dominant species of four of the five samples in which the dominant species accounted for more than 80％ of the gross biomass were discriminated at the genus level;for the 12 samples obtained from Jiaozhou Bay in August 2007,the dominant species of all the 12 samples were recognized at the division level.The technique can be directly applied to fluorescence spectrophotometers and to the developing of an in situ algae fluorescence auto-analyzer for
Nonparametric statistical methods using R
Kloke, John
2014-01-01
A Practical Guide to Implementing Nonparametric and Rank-Based ProceduresNonparametric Statistical Methods Using R covers traditional nonparametric methods and rank-based analyses, including estimation and inference for models ranging from simple location models to general linear and nonlinear models for uncorrelated and correlated responses. The authors emphasize applications and statistical computation. They illustrate the methods with many real and simulated data examples using R, including the packages Rfit and npsm.The book first gives an overview of the R language and basic statistical c
Garnett, Bernice Raveche; Masyn, Katherine E; Austin, S Bryn; Miller, Matthew; Williams, David R; Viswanath, Kasisomayajula
2014-08-01
Discrimination is commonly experienced among adolescents. However, little is known about the intersection of multiple attributes of discrimination and bullying. We used a latent class analysis (LCA) to illustrate the intersections of discrimination attributes and bullying, and to assess the associations of LCA membership to depressive symptoms, deliberate self harm and suicidal ideation among a sample of ethnically diverse adolescents. The data come from the 2006 Boston Youth Survey where students were asked whether they had experienced discrimination based on four attributes: race/ethnicity, immigration status, perceived sexual orientation and weight. They were also asked whether they had been bullied or assaulted for these attributes. A total of 965 (78%) students contributed to the LCA analytic sample (45% Non-Hispanic Black, 29% Hispanic, 58% Female). The LCA revealed that a 4-class solution had adequate relative and absolute fit. The 4-classes were characterized as: low discrimination (51%); racial discrimination (33%); sexual orientation discrimination (7%); racial and weight discrimination with high bullying (intersectional class) (7%). In multivariate models, compared to the low discrimination class, individuals in the sexual orientation discrimination class and the intersectional class had higher odds of engaging in deliberate self-harm. Students in the intersectional class also had higher odds of suicidal ideation. All three discrimination latent classes had significantly higher depressive symptoms compared to the low discrimination class. Multiple attributes of discrimination and bullying co-occur among adolescents. Research should consider the co-occurrence of bullying and discrimination.
Varabyova, Yauheniya; Schreyögg, Jonas
2013-09-01
There is a growing interest in the cross-country comparisons of the performance of national health care systems. The present work provides a comparison of the technical efficiency of the hospital sector using unbalanced panel data from OECD countries over the period 2000-2009. The estimation of the technical efficiency of the hospital sector is performed using nonparametric data envelopment analysis (DEA) and parametric stochastic frontier analysis (SFA). Internal and external validity of findings is assessed by estimating the Spearman rank correlations between the results obtained in different model specifications. The panel-data analyses using two-step DEA and one-stage SFA show that countries, which have higher health care expenditure per capita, tend to have a more technically efficient hospital sector. Whether the expenditure is financed through private or public sources is not related to the technical efficiency of the hospital sector. On the other hand, the hospital sector in countries with higher income inequality and longer average hospital length of stay is less technically efficient. Copyright © 2013 The Authors. Published by Elsevier Ireland Ltd.. All rights reserved.
Salim, Samir; Ly, Chun; Brinchmann, Jarle; Davé, Romeel; Dickinson, Mark; Salzer, John J; Charlot, Stéphane
2014-01-01
It has been proposed that the mass-metallicity relation of galaxies exhibits a secondary dependence on star formation rate (SFR), and that the resulting M-Z-SFR relation may be redshift-invariant, i.e., "fundamental." However, conflicting results on the character of the SFR dependence, and whether it exists, have been reported. To gain insight into the origins of the conflicting results, we (a) devise a non-parametric, astrophysically-motivated analysis framework based on the offset from the star-forming ("main") sequence at a given stellar mass (relative specific SFR), (b) apply this methodology and perform a comprehensive re-analysis of the local M-Z-SFR relation, based on SDSS, GALEX, and WISE data, and (c) study the impact of sample selection, and of using different metallicity and SFR indicators. We show that metallicity is anti-correlated with specific SFR regardless of the indicators used. We do not find that the relation is spurious due to correlations arising from biased metallicity measurements, or ...
Discrimination against Latina/os: A Meta-Analysis of Individual-Level Resources and Outcomes
Lee, Debbiesiu L.; Ahn, Soyeon
2012-01-01
This meta-analysis synthesizes the findings of 60 independent samples from 51 studies examining racial/ethnic discrimination against Latina/os in the United States. The purpose was to identify individual-level resources and outcomes that most strongly relate to discrimination. Discrimination against Latina/os significantly results in outcomes…
Discrimination against Latina/os: A Meta-Analysis of Individual-Level Resources and Outcomes
Lee, Debbiesiu L.; Ahn, Soyeon
2012-01-01
This meta-analysis synthesizes the findings of 60 independent samples from 51 studies examining racial/ethnic discrimination against Latina/os in the United States. The purpose was to identify individual-level resources and outcomes that most strongly relate to discrimination. Discrimination against Latina/os significantly results in outcomes…
Homothetic Efficiency and Test Power: A Non-Parametric Approach
J. Heufer (Jan); P. Hjertstrand (Per)
2015-01-01
markdownabstract__Abstract__ We provide a nonparametric revealed preference approach to demand analysis based on homothetic efficiency. Homotheticity is a useful restriction but data rarely satisfies testable conditions. To overcome this we provide a way to estimate homothetic efficiency of
Discriminant possibilities of the Hamilton depression scale: ROC analysis
Directory of Open Access Journals (Sweden)
Novović Zdenka
2005-01-01
Full Text Available The purpose of this study is to compare discrimination power of original and reconstructed version of Hamilton’s depression scale in separation of depressive vs. anxious patients and to suggest some possibilities which offer ROC analysis. The subjects of the study were 119 patients of Psychiatric clinic in Novi Sad. 67 of them were diagnosed with some of the forms of affective disorders and 52 with an anxious-phobic diagnosis. Results of ROC analysis suggest that both instruments can be used in distinguishing depressive from anxious patients, but reconstructed version shows greater sensitivity and specificity with optimal cut-off score. It also has more significant AUC, which refers to probability of prediction on the basis of the whole spectrum of the results. These data is commented in relation with current debates, between unitaristic and pluralistic oriented authors, about the nature of the anxious-depression relationship.
SEM-EDS analysis and discrimination of forensic soil.
Cengiz, Salih; Cengiz Karaca, Ali; Cakir, Ismail; Bülent Uner, H; Sevindik, Aytekin
2004-04-20
Soils vary among different areas, and have some characteristics because of the natural effects and transfers made by human and other living beings in time. So that forensic examination of soil is not only concerned with the analysis of naturally occurring rocks, minerals, vegetation, and animal matter. It also includes the detection of such manufactured materials such as ions from synthetic fertilizers and from different environments (e.g., nitrate, phosphate, and sulfate) as environmental artifacts (e.g., lead or objects as glass, paint chips, asphalt, brick fragments, and cinders) whose presence may impart soil with characteristics that will make it unique to a particular location. Many screening and analytical methods have been applied for determining the characteristics which differentiate and discriminate the forensic soil samples but none of them easily standardized. Some of the methods that applied in forensic laboratories in forensic soil discrimination are the color comparison of the normal air-dried (dehumidified) and overheated soil samples, macroscopic observation, and low-power stereo-microscopic observation, determination of anionic composition by capillary electrophoresis (CE), and the elemental composition by scanning electron microscope (SEM)-energy dispersive X-ray spectrometer (EDS) and other high sensitivity techniques. The objective of this study was to show the effect of the application of 9 tonnes/cm2 pressure on the elemental compositions obtained by SEM-EDS technique and comparing the discrimination power of the pressed-homogenized and not homogenized forensic soil samples. For this purpose soil samples from 17 different locations of Istanbul were collected. Aliquots of the well mixed samples were dried in an oven at 110-120 degrees C and sieved by using 0.5 mm sieve and then the undersieve fraction(JEO-JSM-5600 equipped with an energy dispersive X-ray spectrometer OXFORD Link-ISIS-300. The samples from top of the sieves were examined with
Semi-supervised learning for ordinal Kernel Discriminant Analysis.
Pérez-Ortiz, M; Gutiérrez, P A; Carbonero-Ruz, M; Hervás-Martínez, C
2016-12-01
Ordinal classification considers those classification problems where the labels of the variable to predict follow a given order. Naturally, labelled data is scarce or difficult to obtain in this type of problems because, in many cases, ordinal labels are given by a user or expert (e.g. in recommendation systems). Firstly, this paper develops a new strategy for ordinal classification where both labelled and unlabelled data are used in the model construction step (a scheme which is referred to as semi-supervised learning). More specifically, the ordinal version of kernel discriminant learning is extended for this setting considering the neighbourhood information of unlabelled data, which is proposed to be computed in the feature space induced by the kernel function. Secondly, a new method for semi-supervised kernel learning is devised in the context of ordinal classification, which is combined with our developed classification strategy to optimise the kernel parameters. The experiments conducted compare 6 different approaches for semi-supervised learning in the context of ordinal classification in a battery of 30 datasets, showing (1) the good synergy of the ordinal version of discriminant analysis and the use of unlabelled data and (2) the advantage of computing distances in the feature space induced by the kernel function.
Multitask linear discriminant analysis for view invariant action recognition.
Yan, Yan; Ricci, Elisa; Subramanian, Ramanathan; Liu, Gaowen; Sebe, Nicu
2014-12-01
Robust action recognition under viewpoint changes has received considerable attention recently. To this end, self-similarity matrices (SSMs) have been found to be effective view-invariant action descriptors. To enhance the performance of SSM-based methods, we propose multitask linear discriminant analysis (LDA), a novel multitask learning framework for multiview action recognition that allows for the sharing of discriminative SSM features among different views (i.e., tasks). Inspired by the mathematical connection between multivariate linear regression and LDA, we model multitask multiclass LDA as a single optimization problem by choosing an appropriate class indicator matrix. In particular, we propose two variants of graph-guided multitask LDA: 1) where the graph weights specifying view dependencies are fixed a priori and 2) where graph weights are flexibly learnt from the training data. We evaluate the proposed methods extensively on multiview RGB and RGBD video data sets, and experimental results confirm that the proposed approaches compare favorably with the state-of-the-art.
Discriminative Ocular Artifact Correction for Feature Learning in EEG Analysis.
Li, Xinyang; Guan, Cuntai; Zhang, Haihong; Ang, Kai Keng
2016-11-16
Electrooculogram (EOG) artifact contamination is a common critical issue in general electroencephalogram (EEG) studies as well as in brain computer interface (BCI) research. It is especially challenging when dedicated EOG channels are unavailable or when there are very few EEG channels available for ICA-based ocular artifact removal. It is even more challenging to avoid loss of the signal of interest during the artifact correction process, where the signal of interest can be multiple magnitudes weaker than the artifact. To address these issues, we propose a novel discriminative ocular artifact correction approach for feature learning in EEG analysis.Without extra ocular movement measurements, the artifact is extracted from raw EEG data, which is totally automatic and requires no visual inspection of artifacts. Then, artifact correction is optimized jointly with feature extraction by maximizing oscillatory correlations between trials from the same class and minimizing them between trials from different classes. We evaluate this approach on a real world EEG data set comprising 68 subjects performing cognitive tasks. The results showed that the approach is capable of not only suppressing the artifact components but also improving the discriminative power of a classifier with statistical significance. We also demonstrate that the proposed method addresses the confounding issues induced by ocular movements in cognitive EEG study.
Linear discriminant analysis of character sequences using occurrences of words
Dutta, Subhajit
2014-02-01
Classification of character sequences, where the characters come from a finite set, arises in disciplines such as molecular biology and computer science. For discriminant analysis of such character sequences, the Bayes classifier based on Markov models turns out to have class boundaries defined by linear functions of occurrences of words in the sequences. It is shown that for such classifiers based on Markov models with unknown orders, if the orders are estimated from the data using cross-validation, the resulting classifier has Bayes risk consistency under suitable conditions. Even when Markov models are not valid for the data, we develop methods for constructing classifiers based on linear functions of occurrences of words, where the word length is chosen by cross-validation. Such linear classifiers are constructed using ideas of support vector machines, regression depth, and distance weighted discrimination. We show that classifiers with linear class boundaries have certain optimal properties in terms of their asymptotic misclassification probabilities. The performance of these classifiers is demonstrated in various simulated and benchmark data sets.
Anti-discrimination Analysis Using Privacy Attack Strategies
Ruggieri, Salvatore
2014-09-15
Social discrimination discovery from data is an important task to identify illegal and unethical discriminatory patterns towards protected-by-law groups, e.g., ethnic minorities. We deploy privacy attack strategies as tools for discrimination discovery under hard assumptions which have rarely tackled in the literature: indirect discrimination discovery, privacy-aware discrimination discovery, and discrimination data recovery. The intuition comes from the intriguing parallel between the role of the anti-discrimination authority in the three scenarios above and the role of an attacker in private data publishing. We design strategies and algorithms inspired/based on Frèchet bounds attacks, attribute inference attacks, and minimality attacks to the purpose of unveiling hidden discriminatory practices. Experimental results show that they can be effective tools in the hands of anti-discrimination authorities.
Discriminating topology in galaxy distributions using network analysis
Hong, Sungryong; Coutinho, Bruno C.; Dey, Arjun; Barabási, Albert-L.; Vogelsberger, Mark; Hernquist, Lars; Gebhardt, Karl
2016-07-01
The large-scale distribution of galaxies is generally analysed using the two-point correlation function. However, this statistic does not capture the topology of the distribution, and it is necessary to resort to higher order correlations to break degeneracies. We demonstrate that an alternate approach using network analysis can discriminate between topologically different distributions that have similar two-point correlations. We investigate two galaxy point distributions, one produced by a cosmological simulation and the other by a Lévy walk. For the cosmological simulation, we adopt the redshift z = 0.58 slice from Illustris and select galaxies with stellar masses greater than 108 M⊙. The two-point correlation function of these simulated galaxies follows a single power law, ξ(r) ˜ r-1.5. Then, we generate Lévy walks matching the correlation function and abundance with the simulated galaxies. We find that, while the two simulated galaxy point distributions have the same abundance and two-point correlation function, their spatial distributions are very different; most prominently, filamentary structures, absent in Lévy fractals. To quantify these missing topologies, we adopt network analysis tools and measure diameter, giant component, and transitivity from networks built by a conventional friends-of-friends recipe with various linking lengths. Unlike the abundance and two-point correlation function, these network quantities reveal a clear separation between the two simulated distributions; therefore, the galaxy distribution simulated by Illustris is not a Lévy fractal quantitatively. We find that the described network quantities offer an efficient tool for discriminating topologies and for comparing observed and theoretical distributions.
Directory of Open Access Journals (Sweden)
Kühnast, Corinna
2008-04-01
Full Text Available Background: Although non-normal data are widespread in biomedical research, parametric tests unnecessarily predominate in statistical analyses. Methods: We surveyed five biomedical journals and – for all studies which contain at least the unpaired t-test or the non-parametric Wilcoxon-Mann-Whitney test – investigated the relationship between the choice of a statistical test and other variables such as type of journal, sample size, randomization, sponsoring etc. Results: The non-parametric Wilcoxon-Mann-Whitney was used in 30% of the studies. In a multivariable logistic regression the type of journal, the test object, the scale of measurement and the statistical software were significant. The non-parametric test was more common in case of non-continuous data, in high-impact journals, in studies in humans, and when the statistical software is specified, in particular when SPSS was used.
1980-05-01
kEgeering Psychology Programs a dOffice of Naval Research (Code. 455) Arlington, VA 22217 1 14 MONITORING AGENCY NAME A ADORESS(hI diferent 0000...STATEMENT (of the Abe~ag @amed tol 8110" 2. it Aliment *001 Repel) Approved for public release; distribution unlimited. If. SUPPLEMENTARY NOTES I...Structure ........ I i Introduction J Experimental psychology has developed a sophisticated set of experimental designs for the analysis of behaviour when
NONPARAMETRIC ESTIMATION OF CHARACTERISTICS OF PROBABILITY DISTRIBUTIONS
Directory of Open Access Journals (Sweden)
Orlov A. I.
2015-10-01
Full Text Available The article is devoted to the nonparametric point and interval estimation of the characteristics of the probabilistic distribution (the expectation, median, variance, standard deviation, variation coefficient of the sample results. Sample values are regarded as the implementation of independent and identically distributed random variables with an arbitrary distribution function having the desired number of moments. Nonparametric analysis procedures are compared with the parametric procedures, based on the assumption that the sample values have a normal distribution. Point estimators are constructed in the obvious way - using sample analogs of the theoretical characteristics. Interval estimators are based on asymptotic normality of sample moments and functions from them. Nonparametric asymptotic confidence intervals are obtained through the use of special output technology of the asymptotic relations of Applied Statistics. In the first step this technology uses the multidimensional central limit theorem, applied to the sums of vectors whose coordinates are the degrees of initial random variables. The second step is the conversion limit multivariate normal vector to obtain the interest of researcher vector. At the same considerations we have used linearization and discarded infinitesimal quantities. The third step - a rigorous justification of the results on the asymptotic standard for mathematical and statistical reasoning level. It is usually necessary to use the necessary and sufficient conditions for the inheritance of convergence. This article contains 10 numerical examples. Initial data - information about an operating time of 50 cutting tools to the limit state. Using the methods developed on the assumption of normal distribution, it can lead to noticeably distorted conclusions in a situation where the normality hypothesis failed. Practical recommendations are: for the analysis of real data we should use nonparametric confidence limits
König, Seth D; Buffalo, Elizabeth A
2014-04-30
Eye tracking is an important component of many human and non-human primate behavioral experiments. As behavioral paradigms have become more complex, including unconstrained viewing of natural images, eye movements measured in these paradigms have become more variable and complex as well. Accordingly, the common practice of using acceleration, dispersion, or velocity thresholds to segment viewing behavior into periods of fixations and saccades may be insufficient. Here we propose a novel algorithm, called Cluster Fix, which uses k-means cluster analysis to take advantage of the qualitative differences between fixations and saccades. The algorithm finds natural divisions in 4 state space parameters-distance, velocity, acceleration, and angular velocity-to separate scan paths into periods of fixations and saccades. The number and size of clusters adjusts to the variability of individual scan paths. Cluster Fix can detect small saccades that were often indistinguishable from noisy fixations. Local analysis of fixations helped determine the transition times between fixations and saccades. Because Cluster Fix detects natural divisions in the data, predefined thresholds are not needed. A major advantage of Cluster Fix is the ability to precisely identify the beginning and end of saccades, which is essential for studying neural activity that is modulated by or time-locked to saccades. Our data suggest that Cluster Fix is more sensitive than threshold-based algorithms but comes at the cost of an increase in computational time. Copyright © 2014 Elsevier B.V. All rights reserved.
DEFF Research Database (Denmark)
Han, Xixuan; Clemmensen, Line Katrine Harder
2015-01-01
techniques, for instance, 2D-Linear Discriminant Analysis (2D-LDA). Furthermore, an iterative algorithm based on the alternating direction method of multipliers is developed. The algorithm approximately solves RGED with monotonically decreasing convergence and at an acceptable speed for results of modest......We propose a general technique for obtaining sparse solutions to generalized eigenvalue problems, and call it Regularized Generalized Eigen-Decomposition (RGED). For decades, Fisher's discriminant criterion has been applied in supervised feature extraction and discriminant analysis...... accuracy. Numerical experiments based on four data sets of different types of images show that RGED has competitive classification performance with existing multidimensional and sparse techniques of discriminant analysis....
An Analysis of Discrimination by Real Estate Brokers.
Yinger, John
This paper focuses on designing policies to eliminate discrimination in the sale of single-family houses by analyzing the behavior of the agents who actually do most of the discriminating, namely real estate agents. Discriminatory practices are said to be supported by policies of house builders, lending institutions, and government, and by the…
Racial Discrimination and Asian Mental Health: A Meta-Analysis
Lee, Debbiesiu L.; Ahn, Soyeon
2011-01-01
Although research on racial discrimination and mental health has proliferated, findings are varied and dispersed. This study explored the critical question of how Asians, in particular, deal with discrimination and how this relates to Asian mental health. With 99 correlations from 23 independent studies, the overall relationship between racial…
Sabir, Aryani; Rafi, Mohamad; Darusman, Latifah K
2017-04-15
HPLC fingerprint analysis combined with chemometrics was developed to discriminate between the red and the white rice bran grown in Indonesia. The major component in rice bran is γ-oryzanol which consisted of 4 main compounds, namely cycloartenol ferulate, cyclobranol ferulate, campesterol ferulate and β-sitosterol ferulate. Separation of these four compounds along with other compounds was performed using C18 and methanol-acetonitrile with gradient elution system. By using these intensity variations, principal component and discriminant analysis were performed to discriminate the two samples. Discriminant analysis was successfully discriminated the red from the white rice bran with predictive ability of the model showed a satisfactory classification for the test samples. The results of this study indicated that the developed method was suitable as quality control method for rice bran in terms of identification and discrimination of the red and the white rice bran.
Semi- and Nonparametric ARCH Processes
Directory of Open Access Journals (Sweden)
Oliver B. Linton
2011-01-01
Full Text Available ARCH/GARCH modelling has been successfully applied in empirical finance for many years. This paper surveys the semiparametric and nonparametric methods in univariate and multivariate ARCH/GARCH models. First, we introduce some specific semiparametric models and investigate the semiparametric and nonparametrics estimation techniques applied to: the error density, the functional form of the volatility function, the relationship between mean and variance, long memory processes, locally stationary processes, continuous time processes and multivariate models. The second part of the paper is about the general properties of such processes, including stationary conditions, ergodic conditions and mixing conditions. The last part is on the estimation methods in ARCH/GARCH processes.
Sparse Regression by Projection and Sparse Discriminant Analysis
Qi, Xin
2015-04-03
© 2015, © American Statistical Association, Institute of Mathematical Statistics, and Interface Foundation of North America. Recent years have seen active developments of various penalized regression methods, such as LASSO and elastic net, to analyze high-dimensional data. In these approaches, the direction and length of the regression coefficients are determined simultaneously. Due to the introduction of penalties, the length of the estimates can be far from being optimal for accurate predictions. We introduce a new framework, regression by projection, and its sparse version to analyze high-dimensional data. The unique nature of this framework is that the directions of the regression coefficients are inferred first, and the lengths and the tuning parameters are determined by a cross-validation procedure to achieve the largest prediction accuracy. We provide a theoretical result for simultaneous model selection consistency and parameter estimation consistency of our method in high dimension. This new framework is then generalized such that it can be applied to principal components analysis, partial least squares, and canonical correlation analysis. We also adapt this framework for discriminant analysis. Compared with the existing methods, where there is relatively little control of the dependency among the sparse components, our method can control the relationships among the components. We present efficient algorithms and related theory for solving the sparse regression by projection problem. Based on extensive simulations and real data analysis, we demonstrate that our method achieves good predictive performance and variable selection in the regression setting, and the ability to control relationships between the sparse components leads to more accurate classification. In supplementary materials available online, the details of the algorithms and theoretical proofs, and R codes for all simulation studies are provided.
核诱导距离度量的鲁棒判别分析%Robust Discriminant Analysis Based on Kernel-Induced Measure
Institute of Scientific and Technical Information of China (English)
王嗣钧; 陈松灿
2012-01-01
This paper proposes a robust discriminant analysis based on kernel-induced distance measure (KI-RDA). KI-RDA not only extends the linear discriminant analysis (LDA), but also extends the newest and powerful algorithm called robust discriminant analysis based on nonparametric maximum entropy (MaxEnt-RDA). By using robust radial basis function (RBF) kernels, KI-RDA can effectively deal with the data mixed with noise as well as the non-Gaussian distributed nonlinear data. Its robustness is accredited to that KI-RDA makes use of the kernel-induced non-Euclidean distance instead of the Euclidean distance in LDA to characterize the within-class and between-class divergence respectively. With the aid of these divergences, the paper defines a discriminant criterion which is similar to LDA for feature extraction, but this leads to a corresponding nonlinear optimization problem. With the further help of approximation strategy, the problem is converted into a generalized eigenvalue problem which can be solved directly so as to get a closed-form solution of the dimensionality reduction matrix. At last, experiments on multifold datasets verify the effectiveness of KI-RDA. Because of the diversity of kernel functions, KI-RDA is actually a general discriminant analysis framework.%提出了基于核诱导距离度量的鲁棒判别分析算法(robust discriminant analysis based on kernel-induced distance measure,KI-RDA).KI-RDA不仅自然地推广了线性判别分析(linear discriminant analysis,LDA),而且推广了最近提出的强有力的基于非参数最大熵的鲁棒判别分析(robust discriminant analysis based on nonparametric maximum entropy,MaxEnt-RDA).通过采用鲁棒径向基核,KI-RDA不仅能有效处理含噪数据,而且也适合处理非高斯分布的非线性数据,其本质的鲁棒性归咎于KI-RDA通过核诱导的非欧距离代替LDA的欧氏距离来刻画类间散度和类内散度.借助这些散度,为特征提取定义类似LDA的判别准则,
Sex determination of the Acadian Flycatcher using discriminant analysis
Wilson, R.R.
1999-01-01
I used five morphometric variables from 114 individuals captured in Arkansas to develop a discriminant model to predict the sex of Acadian Flycatchers (Empidonax virescens). Stepwise discriminant function analyses selected wing chord and tail length as the most parsimonious subset of variables for discriminating sex. This two-variable model correctly classified 80% of females and 97% of males used to develop the model. Validation of the model using 19 individuals from Louisiana and Virginia resulted in 100% correct classification of males and females. This model provides criteria for sexing monomorphic Acadian Flycatchers during the breeding season and possibly during the winter.
Unbiased bootstrap error estimation for linear discriminant analysis.
Vu, Thang; Sima, Chao; Braga-Neto, Ulisses M; Dougherty, Edward R
2014-12-01
Convex bootstrap error estimation is a popular tool for classifier error estimation in gene expression studies. A basic question is how to determine the weight for the convex combination between the basic bootstrap estimator and the resubstitution estimator such that the resulting estimator is unbiased at finite sample sizes. The well-known 0.632 bootstrap error estimator uses asymptotic arguments to propose a fixed 0.632 weight, whereas the more recent 0.632+ bootstrap error estimator attempts to set the weight adaptively. In this paper, we study the finite sample problem in the case of linear discriminant analysis under Gaussian populations. We derive exact expressions for the weight that guarantee unbiasedness of the convex bootstrap error estimator in the univariate and multivariate cases, without making asymptotic simplifications. Using exact computation in the univariate case and an accurate approximation in the multivariate case, we obtain the required weight and show that it can deviate significantly from the constant 0.632 weight, depending on the sample size and Bayes error for the problem. The methodology is illustrated by application on data from a well-known cancer classification study.
Discriminating Topology in Galaxy Distributions using Network Analysis
Hong, Sungryong; Dey, Arjun; Barabási, Albert -L; Vogelsberger, Mark; Hernquist, Lars; Gebhardt, Karl
2016-01-01
(abridged) The large-scale distribution of galaxies is generally analyzed using the two-point correlation function. However, this statistic does not capture the topology of the distribution, and it is necessary to resort to higher order correlations to break degeneracies. We demonstrate that an alternate approach using network analysis can discriminate between topologically different distributions that have similar two-point correlations. We investigate two galaxy point distributions, one produced by a cosmological simulation and the other by a L\\'evy walk. For the cosmological simulation, we adopt the redshift $z = 0.58$ slice from Illustris (Vogelsberger et al. 2014A) and select galaxies with stellar masses greater than $10^8$$M_\\odot$. The two point correlation function of these simulated galaxies follows a single power-law, $\\xi(r) \\sim r^{-1.5}$. Then, we generate L\\'evy walks matching the correlation function and abundance with the simulated galaxies. We find that, while the two simulated galaxy point d...
Genetic mapping of complex discrete human diseases by discriminant analysis
Institute of Scientific and Technical Information of China (English)
无
2002-01-01
The objective of the present study is to propose and evaluate a novel multivariate approach for genetic mapping of complex categorical diseases. This approach results from an application of standard stepwise discriminant analysis to detect linkage based on the differential marker identity-by-descent (IBD) distributions among the different groups of sib pairs. Two major advantages of this method are that it allows for simultaneously testing all markers, together with other genetic and environmental factors in a single multivariate setting and it avoids explicitly modeling the complex relationship between the affection status of sib pairs and the underlying genetic determinants. The efficiency and properties of the method are demonstrated via simulations. The proposed multivariate approach has successfully located the true position(s) under various genetic scenarios. The more important finding is that using highly densely spaced markers (1～2 cM) leads to only a marginal loss of statistical efficiency of the proposed methods in terms of gene localization and statistical power. These results have well established its utility and advantages as a fine-mapping tool. A unique property of the proposed method is the ability to map multiple linked trait loci to their precise positions due to its sequential nature, as demonstrated via simulations.
Learning Discriminative Subspaces on Random Contrasts for Image Saliency Analysis.
Fang, Shu; Li, Jia; Tian, Yonghong; Huang, Tiejun; Chen, Xiaowu
2017-05-01
In visual saliency estimation, one of the most challenging tasks is to distinguish targets and distractors that share certain visual attributes. With the observation that such targets and distractors can sometimes be easily separated when projected to specific subspaces, we propose to estimate image saliency by learning a set of discriminative subspaces that perform the best in popping out targets and suppressing distractors. Toward this end, we first conduct principal component analysis on massive randomly selected image patches. The principal components, which correspond to the largest eigenvalues, are selected to construct candidate subspaces since they often demonstrate impressive abilities to separate targets and distractors. By projecting images onto various subspaces, we further characterize each image patch by its contrasts against randomly selected neighboring and peripheral regions. In this manner, the probable targets often have the highest responses, while the responses at background regions become very low. Based on such random contrasts, an optimization framework with pairwise binary terms is adopted to learn the saliency model that best separates salient targets and distractors by optimally integrating the cues from various subspaces. Experimental results on two public benchmarks show that the proposed approach outperforms 16 state-of-the-art methods in human fixation prediction.
Face Recognition Using Holistic Features and Simplified Linear Discriminant Analysis
Directory of Open Access Journals (Sweden)
Gou Koutaki
2012-08-01
Full Text Available This paper proposed an alternative approach to face recognition algorithm that is based on global/holistic features of face image and simplified Linear Discriminant Analysis (LDA. The proposed method can overcome main problems of the conventional LDA in terms of large processing time for retraining when a new class data was registered into the training data set. The holistic features of face image were proposed as dimensional reduction of raw face image. While, the simplified LDA which is the redefinition of between class scatter using constant global mean assignment was proposed to decrease time complexity of retraining process. In order to know the performance of the proposed method, several experiments were performed using several challenging face databases: ORL, YALE, ITS-Lab, INDIA, and FERET database. Furthermore, we compared the developed algorithm experimental results to the best traditional subspace methods such as DLDA, 2DLDA, (2D2DLDA, 2DPCA, and (2D22DPCA. The experimental results show that the proposed method can solve the retraining problem of the conventional LDA indicated by requiring short retraining time and stable recognition rate.
Sparse linear discriminant analysis by thresholding for high dimensional data
Shao, Jun; Deng, Xinwei; Wang, Sijian; 10.1214/10-AOS870
2011-01-01
In many social, economical, biological and medical studies, one objective is to classify a subject into one of several classes based on a set of variables observed from the subject. Because the probability distribution of the variables is usually unknown, the rule of classification is constructed using a training sample. The well-known linear discriminant analysis (LDA) works well for the situation where the number of variables used for classification is much smaller than the training sample size. Because of the advance in technologies, modern statistical studies often face classification problems with the number of variables much larger than the sample size, and the LDA may perform poorly. We explore when and why the LDA has poor performance and propose a sparse LDA that is asymptotically optimal under some sparsity conditions on the unknown parameters. For illustration of application, we discuss an example of classifying human cancer into two classes of leukemia based on a set of 7,129 genes and a training ...
Nonparametric estimation of ultrasound pulses
DEFF Research Database (Denmark)
Jensen, Jørgen Arendt; Leeman, Sidney
1994-01-01
An algorithm for nonparametric estimation of 1D ultrasound pulses in echo sequences from human tissues is derived. The technique is a variation of the homomorphic filtering technique using the real cepstrum, and the underlying basis of the method is explained. The algorithm exploits a priori...
Testing discontinuities in nonparametric regression
Dai, Wenlin
2017-01-19
In nonparametric regression, it is often needed to detect whether there are jump discontinuities in the mean function. In this paper, we revisit the difference-based method in [13 H.-G. Müller and U. Stadtmüller, Discontinuous versus smooth regression, Ann. Stat. 27 (1999), pp. 299–337. doi: 10.1214/aos/1018031100
Directory of Open Access Journals (Sweden)
Stochl Jan
2012-06-01
Full Text Available Abstract Background Mokken scaling techniques are a useful tool for researchers who wish to construct unidimensional tests or use questionnaires that comprise multiple binary or polytomous items. The stochastic cumulative scaling model offered by this approach is ideally suited when the intention is to score an underlying latent trait by simple addition of the item response values. In our experience, the Mokken model appears to be less well-known than for example the (related Rasch model, but is seeing increasing use in contemporary clinical research and public health. Mokken's method is a generalisation of Guttman scaling that can assist in the determination of the dimensionality of tests or scales, and enables consideration of reliability, without reliance on Cronbach's alpha. This paper provides a practical guide to the application and interpretation of this non-parametric item response theory method in empirical research with health and well-being questionnaires. Methods Scalability of data from 1 a cross-sectional health survey (the Scottish Health Education Population Survey and 2 a general population birth cohort study (the National Child Development Study illustrate the method and modeling steps for dichotomous and polytomous items respectively. The questionnaire data analyzed comprise responses to the 12 item General Health Questionnaire, under the binary recoding recommended for screening applications, and the ordinal/polytomous responses to the Warwick-Edinburgh Mental Well-being Scale. Results and conclusions After an initial analysis example in which we select items by phrasing (six positive versus six negatively worded items we show that all items from the 12-item General Health Questionnaire (GHQ-12 – when binary scored – were scalable according to the double monotonicity model, in two short scales comprising six items each (Bech’s “well-being” and “distress” clinical scales. An illustration of ordinal item analysis
Stochl, Jan; Jones, Peter B; Croudace, Tim J
2012-06-11
Mokken scaling techniques are a useful tool for researchers who wish to construct unidimensional tests or use questionnaires that comprise multiple binary or polytomous items. The stochastic cumulative scaling model offered by this approach is ideally suited when the intention is to score an underlying latent trait by simple addition of the item response values. In our experience, the Mokken model appears to be less well-known than for example the (related) Rasch model, but is seeing increasing use in contemporary clinical research and public health. Mokken's method is a generalisation of Guttman scaling that can assist in the determination of the dimensionality of tests or scales, and enables consideration of reliability, without reliance on Cronbach's alpha. This paper provides a practical guide to the application and interpretation of this non-parametric item response theory method in empirical research with health and well-being questionnaires. Scalability of data from 1) a cross-sectional health survey (the Scottish Health Education Population Survey) and 2) a general population birth cohort study (the National Child Development Study) illustrate the method and modeling steps for dichotomous and polytomous items respectively. The questionnaire data analyzed comprise responses to the 12 item General Health Questionnaire, under the binary recoding recommended for screening applications, and the ordinal/polytomous responses to the Warwick-Edinburgh Mental Well-being Scale. After an initial analysis example in which we select items by phrasing (six positive versus six negatively worded items) we show that all items from the 12-item General Health Questionnaire (GHQ-12)--when binary scored--were scalable according to the double monotonicity model, in two short scales comprising six items each (Bech's "well-being" and "distress" clinical scales). An illustration of ordinal item analysis confirmed that all 14 positively worded items of the Warwick-Edinburgh Mental
Optimal discrimination of multiple quantum systems: controllability analysis
Energy Technology Data Exchange (ETDEWEB)
Turinici, Gabriel [INRIA Rocquencourt, BP 105, 78153 Le Chesnay Cedex (France); Ramakhrishna, Viswanath [Department of Mathematical Sciences and Center for Signals, Systems and Communications, University of Texas at Dallas, PO Box 830688, Richardson, TX 75083 (United States); Li Baiqing [Department of Chemistry, Princeton University, Princeton, NJ 08544 (United States); Rabitz, Herschel [Department of Chemistry, Princeton University, Princeton, NJ 08544 (United States)
2004-01-09
A theoretical study is presented concerning the ability to dynamically discriminate between members of a set of different (but possibly similar) quantum systems. This discrimination is analysed in terms of independently and simultaneously steering about the wavefunction of each component system to a target state of interest using a tailored control (i.e. laser) field. Controllability criteria are revealed and their applicability is demonstrated in simple cases. Discussion is also presented in some uncontrollable cases.
Ajayi, Alex A; Syed, Moin
2014-10-01
This study used a person-oriented analytic approach to identify meaningful patterns of barriers-focused racial socialization and perceived racial discrimination experiences in a sample of 295 late adolescents. Using cluster analysis, three distinct groups were identified: Low Barrier Socialization-Low Discrimination, High Barrier Socialization-Low Discrimination, and High Barrier Socialization-High Discrimination clusters. These groups were substantively unique in terms of the frequency of racial socialization messages about bias preparation and out-group mistrust its members received and their actual perceived discrimination experiences. Further, individuals in the High Barrier Socialization-High Discrimination cluster reported significantly higher depressive symptoms than those in the Low Barrier Socialization-Low Discrimination and High Barrier Socialization-Low Discrimination clusters. However, no differences in adjustment were observed between the Low Barrier Socialization-Low Discrimination and High Barrier Socialization-Low Discrimination clusters. Overall, the findings highlight important individual differences in how young people of color experience their race and how these differences have significant implications on psychological adjustment. Copyright © 2014 The Foundation for Professionals in Services for Adolescents. Published by Elsevier Ltd. All rights reserved.
Directory of Open Access Journals (Sweden)
He Miao
2009-12-01
Full Text Available Abstract Background More studies based on gene expression data have been reported in great detail, however, one major challenge for the methodologists is the choice of classification methods. The main purpose of this research was to compare the performance of linear discriminant analysis (LDA and its modification methods for the classification of cancer based on gene expression data. Methods The classification performance of linear discriminant analysis (LDA and its modification methods was evaluated by applying these methods to six public cancer gene expression datasets. These methods included linear discriminant analysis (LDA, prediction analysis for microarrays (PAM, shrinkage centroid regularized discriminant analysis (SCRDA, shrinkage linear discriminant analysis (SLDA and shrinkage diagonal discriminant analysis (SDDA. The procedures were performed by software R 2.80. Results PAM picked out fewer feature genes than other methods from most datasets except from Brain dataset. For the two methods of shrinkage discriminant analysis, SLDA selected more genes than SDDA from most datasets except from 2-class lung cancer dataset. When comparing SLDA with SCRDA, SLDA selected more genes than SCRDA from 2-class lung cancer, SRBCT and Brain dataset, the result was opposite for the rest datasets. The average test error of LDA modification methods was lower than LDA method. Conclusions The classification performance of LDA modification methods was superior to that of traditional LDA with respect to the average error and there was no significant difference between theses modification methods.
Non-Parametric Inference in Astrophysics
Wasserman, L H; Nichol, R C; Genovese, C; Jang, W; Connolly, A J; Moore, A W; Schneider, J; Wasserman, Larry; Miller, Christopher J.; Nichol, Robert C.; Genovese, Chris; Jang, Woncheol; Connolly, Andrew J.; Moore, Andrew W.; Schneider, Jeff; group, the PICA
2001-01-01
We discuss non-parametric density estimation and regression for astrophysics problems. In particular, we show how to compute non-parametric confidence intervals for the location and size of peaks of a function. We illustrate these ideas with recent data on the Cosmic Microwave Background. We also briefly discuss non-parametric Bayesian inference.
稀疏判别分析%Sparse discriminant analysis
Institute of Scientific and Technical Information of China (English)
陈小冬; 林焕祥
2012-01-01
Methods for manifold embedding have the following issues: on one hand, neighborhood graph is constructed in such high-dimensionality of original space that it tends to work poorly; on the other hand, appropriate values for the neighborhood size and heat kernel parameter involved in graph construction are generally difficult to be assigned. To address these problems, a new semi-supervised dimensionality reduction algorithm called SparsE Discriminant Analysis (SEDA) was proposed. Firstly, SEDA set up a sparse graph to preserve the global information and geometric structure of the data based on sparse representation. Secondly, it applied both sparse graph and Fisher criterion to seek the optimal projection. The experimental results on a broad range of data sets show that SEDA is superior to many popular dimensionality reduction methods.%针对流形嵌入降维方法中在高维空间构建近邻图无益于后续工作,以及不容易给近邻大小和热核参数赋合适值的问题,提出一种稀疏判别分析算法(SEDA).首先使用稀疏表示构建稀疏图保持数据的全局信息和几何结构,以克服流形嵌入方法的不足；其次,将稀疏保持作为正则化项使用Fisher判别准则,能够得到最优的投影.在一组高维数据集上的实验结果表明,SEDA是非常有效的半监督降维方法.
Declining Bias and Gender Wage Discrimination? A Meta-Regression Analysis
Jarrell, Stephen B.; Stanley, T. D.
2004-01-01
The meta-regression analysis reveals that there is a strong tendency for discrimination estimates to fall and wage discrimination exist against the woman. The biasing effect of researchers' gender of not correcting for selection bias has weakened and changes in labor market have made it less important.
MR PROSTATE SEGMENTATION VIA DISTRIBUTED DISCRIMINATIVE DICTIONARY (DDD) LEARNING.
Guo, Yanrong; Zhan, Yiqiang; Gao, Yaozong; Jiang, Jianguo; Shen, Dinggang
2013-01-01
Segmenting prostate from MR images is important yet challenging. Due to non-Gaussian distribution of prostate appearances in MR images, the popular active appearance model (AAM) has its limited performance. Although the newly developed sparse dictionary learning method[1, 2] can model the image appearance in a non-parametric fashion, the learned dictionaries still lack the discriminative power between prostate and non-prostate tissues, which is critical for accurate prostate segmentation. In this paper, we propose to integrate deformable model with a novel learning scheme, namely the Distributed Discriminative Dictionary (DDD) learning, which can capture image appearance in a non-parametric and discriminative fashion. In particular, three strategies are designed to boost the tissue discriminative power of DDD. First, minimum Redundancy Maximum Relevance (mRMR) feature selection is performed to constrain the dictionary learning in a discriminative feature space. Second, linear discriminant analysis (LDA) is employed to assemble residuals from different dictionaries for optimal separation between prostate and non-prostate tissues. Third, instead of learning the global dictionaries, we learn a set of local dictionaries for the local regions (each with small appearance variations) along prostate boundary, thus achieving better tissue differentiation locally. In the application stage, DDDs will provide the appearance cues to robustly drive the deformable model onto the prostate boundary. Experiments on 50 MR prostate images show that our method can yield a Dice Ratio of 88% compared to the manual segmentations, and have 7% improvement over the conventional AAM.
Comparing parametric and nonparametric regression methods for panel data
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard; Henningsen, Arne
We investigate and compare the suitability of parametric and non-parametric stochastic regression methods for analysing production technologies and the optimal firm size. Our theoretical analysis shows that the most commonly used functional forms in empirical production analysis, Cobb-Douglas and......We investigate and compare the suitability of parametric and non-parametric stochastic regression methods for analysing production technologies and the optimal firm size. Our theoretical analysis shows that the most commonly used functional forms in empirical production analysis, Cobb......-Douglas and Translog, are unsuitable for analysing the optimal firm size. We show that the Translog functional form implies an implausible linear relationship between the (logarithmic) firm size and the elasticity of scale, where the slope is artificially related to the substitutability between the inputs...... rejects both the Cobb-Douglas and the Translog functional form, while a recently developed nonparametric kernel regression method with a fully nonparametric panel data specification delivers plausible results. On average, the nonparametric regression results are similar to results that are obtained from...
Nonparametric Maize TFP Measurement Analysis and Countermeasures%玉米全要素生产率非参数测算分析及对策
Institute of Scientific and Technical Information of China (English)
曲会朋; 李宁; 田玉英
2014-01-01
玉米生产受到诸多投入要素的限制与影响，如种子、秧苗、劳动力、土地、农药、化肥、农膜、机械设备、畜力和其他物质投入等，提高这些因素的投入-产出效率水平对于促进玉米高效持续增产至关重要。为此，在对中美两国玉米生产成本与单产时序比较的基础上，利用基于DEA 的非参数前沿面效率分解方法对全国主要地区的玉米全要素生产效率问题进行了实证分析，从纵向时间序列和横向不同区域两个视角研究了我国玉米生产效率和生产资源配置的演化过程及区域对比特征。最后，有针对性地提出了促进我国玉米生产资源有效配置、提高玉米生产效率的相应措施与途径。%Maize production is affected by the restrictions and lots of inputs , such as seed seedlings, Labour, land, pesti-cide , chemical fertilizer , agricultural film , machinery and equipment , animal and other material input , improve the effi-ciency of input-output level of these factors is very important to promote efficient continuous corn production .Based on corn production cost compared with the yield time series in China and the United States , on the basis of the nonparamet-ric frontier efficiency based on DEA decomposition methods for main parts of the country's corn crop total factor produc-tivity question has carried on the empirical analysis , from the perspective of two different regions of the horizontal and vertical time sequence to study the evolution of China's corn production efficiency and resource allocation process and re-gional correlation characteristics .Finally , puts forward the promotion our country maize production resources effectively configuration , corresponding measures and way to improve efficiency of corn production .
Parametric versus non-parametric simulation
Dupeux, Bérénice; Buysse, Jeroen
2014-01-01
Most of ex-ante impact assessment policy models have been based on a parametric approach. We develop a novel non-parametric approach, called Inverse DEA. We use non parametric efficiency analysis for determining the farm’s technology and behaviour. Then, we compare the parametric approach and the Inverse DEA models to a known data generating process. We use a bio-economic model as a data generating process reflecting a real world situation where often non-linear relationships exist. Results s...
Nonparametric Inference for Periodic Sequences
Sun, Ying
2012-02-01
This article proposes a nonparametric method for estimating the period and values of a periodic sequence when the data are evenly spaced in time. The period is estimated by a "leave-out-one-cycle" version of cross-validation (CV) and complements the periodogram, a widely used tool for period estimation. The CV method is computationally simple and implicitly penalizes multiples of the smallest period, leading to a "virtually" consistent estimator of integer periods. This estimator is investigated both theoretically and by simulation.We also propose a nonparametric test of the null hypothesis that the data have constantmean against the alternative that the sequence of means is periodic. Finally, our methodology is demonstrated on three well-known time series: the sunspots and lynx trapping data, and the El Niño series of sea surface temperatures. © 2012 American Statistical Association and the American Society for Quality.
Nonparametric Econometrics: The np Package
Directory of Open Access Journals (Sweden)
Tristen Hayﬁeld
2008-07-01
Full Text Available We describe the R np package via a series of applications that may be of interest to applied econometricians. The np package implements a variety of nonparametric and semiparametric kernel-based estimators that are popular among econometricians. There are also procedures for nonparametric tests of signiﬁcance and consistent model speciﬁcation tests for parametric mean regression models and parametric quantile regression models, among others. The np package focuses on kernel methods appropriate for the mix of continuous, discrete, and categorical data often found in applied settings. Data-driven methods of bandwidth selection are emphasized throughout, though we caution the user that data-driven bandwidth selection methods can be computationally demanding.
Kiso, Atsushi; Taniguchi, Yu; Seki, Hirokazu
2011-01-01
This paper describes an optimal measurement position estimation by the discriminant analysis based on Wilks' lambda for the myoelectric hand control. In the past studies, the myoelectric signals were measured from the same positions for the motions discrimination. However, the optimal measurement positions of the myoelectric signals for the motion discrimination are different according to the remaining muscle situation of amputees. Therefore the purpose of this study is to estimate the optimal and fewer measurement positions for the precise motion discrimination of the human forearm. This study proposes the estimation method of the optimal measurement positions by the discriminant analysis based on Wilks' lambda among the myoelectric signal measured from multiple positions. Some experiments on the myoelectric hand simulator show the effectiveness of the proposed optimal measurement position estimation method.
Astronomical Methods for Nonparametric Regression
Steinhardt, Charles L.; Jermyn, Adam
2017-01-01
I will discuss commonly used techniques for nonparametric regression in astronomy. We find that several of them, particularly running averages and running medians, are generically biased, asymmetric between dependent and independent variables, and perform poorly in recovering the underlying function, even when errors are present only in one variable. We then examine less-commonly used techniques such as Multivariate Adaptive Regressive Splines and Boosted Trees and find them superior in bias, asymmetry, and variance both theoretically and in practice under a wide range of numerical benchmarks. In this context the chief advantage of the common techniques is runtime, which even for large datasets is now measured in microseconds compared with milliseconds for the more statistically robust techniques. This points to a tradeoff between bias, variance, and computational resources which in recent years has shifted heavily in favor of the more advanced methods, primarily driven by Moore's Law. Along these lines, we also propose a new algorithm which has better overall statistical properties than all techniques examined thus far, at the cost of significantly worse runtime, in addition to providing guidance on choosing the nonparametric regression technique most suitable to any specific problem. We then examine the more general problem of errors in both variables and provide a new algorithm which performs well in most cases and lacks the clear asymmetry of existing non-parametric methods, which fail to account for errors in both variables.
Institute of Scientific and Technical Information of China (English)
无
2006-01-01
A kernel-based discriminant analysis method called kernel direct discriminant analysis is employed, which combines the merit of direct linear discriminant analysis with that of kernel trick. In order to demonstrate its better robustness to the complex and nonlinear variations of real face images, such as illumination, facial expression, scale and pose variations, experiments are carried out on the Olivetti Research Laboratory, Yale and self-built face databases. The results indicate that in contrast to kernel principal component analysis and kernel linear discriminant analysis, the method can achieve lower (7%) error rate using only a very small set of features. Furthermore, a new corrected kernel model is proposed to improve the recognition performance. Experimental results confirm its superiority (1% in terms of recognition rate) to other polynomial kernel models.
An approach for mechanical fault classification based on generalized discriminant analysis
Institute of Scientific and Technical Information of China (English)
LI Wei-hua; SHI Tie-lin; YANG Shu-zi
2006-01-01
To deal with pattern classification of complicated mechanical faults,an approach to multi-faults classification based on generalized discriminant analysis is presented.Compared with linear discriminant analysis (LDA),generalized discriminant analysis (GDA),one of nonlinear discriminant analysis methods,is more suitable for classifying the linear non-separable problem.The connection and difference between KPCA (Kernel Principal Component Analysis) and GDA is discussed.KPCA is good at detection of machine abnormality while GDA performs well in multi-faults classification based on the collection of historical faults symptoms.When the proposed method is applied to air compressor condition classification and gear fault classification,an excellent performance in complicated multi-faults classification is presented.
Block-diagonal discriminant analysis and its bias-corrected rules.
Pang, Herbert; Tong, Tiejun; Ng, Michael
2013-06-01
High-throughput expression profiling allows simultaneous measure of tens of thousands of genes at once. These data have motivated the development of reliable biomarkers for disease subtypes identification and diagnosis. Many methods have been developed in the literature for analyzing these data, such as diagonal discriminant analysis, support vector machines, and k-nearest neighbor methods. The diagonal discriminant methods have been shown to perform well for high-dimensional data with small sample sizes. Despite its popularity, the independence assumption is unlikely to be true in practice. Recently, a gene module based linear discriminant analysis strategy has been proposed by utilizing the correlation among genes in discriminant analysis. However, the approach can be underpowered when the samples of the two classes are unbalanced. In this paper, we propose to correct the biases in the discriminant scores of block-diagonal discriminant analysis. In simulation studies, our proposed method outperforms other approaches in various settings. We also illustrate our proposed discriminant analysis method for analyzing microarray data studies.
Cloud-type discrimination via multispectral textural analysis
Lamei, Niloufar; Hutchison, Keith D.; Crawford, Melba M.; Khazenie, Nahid
1994-04-01
In recent years, with the development of satellite and computer technology, Earth observation and atmospheric research have become highly dependent on digital imagery. One of the primary interests in digital image processing is the development of robust methods to perform feature detection, extraction, and classification. Until recently, classification methods for cloud discrimination were mainly based on the spectral information of the imagery. However, because of the spectral similarities of certain features (such as ice clouds and snow) and the effects of atmospheric attenuation, multispectral rule-based classifications do not necessarily produce accurate feature discrimination. Spectral homogeneity of two different features within a scene can lead to misclassification. Furthermore, the opposite problem can occur when one feature exhibits different spectral signatures locally but is homogeneous in its cyclic spatial variation. The exploration of spatial information is often advantageous in these discrimination problems. A texture- based method for feature identification has been investigated. This method uses a set of localized spatial filters known as 2-D Gabor functions. Gabor filters can be described as a sinusoidal plane wave within a 2-D Gaussian envelope. The frequency and orientation of the sine plane and the width of the Gaussian envelope are determine by the Gabor parameters. These tunable channels yield joint optimal information both in the spatial and the frequency domains. The new method has been applied to the thermal channels of the NOAA Advanced Very High Resolution Radiometer data for cloud-type discrimination. Results show that additional texture information improves discrimination between cloud types (especially thin cirrus).
Comparing parametric and nonparametric regression methods for panel data
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard; Henningsen, Arne
We investigate and compare the suitability of parametric and non-parametric stochastic regression methods for analysing production technologies and the optimal firm size. Our theoretical analysis shows that the most commonly used functional forms in empirical production analysis, Cobb......-Douglas and Translog, are unsuitable for analysing the optimal firm size. We show that the Translog functional form implies an implausible linear relationship between the (logarithmic) firm size and the elasticity of scale, where the slope is artificially related to the substitutability between the inputs....... The practical applicability of the parametric and non-parametric regression methods is scrutinised and compared by an empirical example: we analyse the production technology and investigate the optimal size of Polish crop farms based on a firm-level balanced panel data set. A nonparametric specification test...
Directory of Open Access Journals (Sweden)
Steven M Carr
-stepping-stone biogeographic models, but not a simple 1-step trans-Atlantic model. Plots of the cumulative pairwise sequence difference curves among seals in each of the four populations provide continuous proxies for phylogenetic diversification within each. Non-parametric Kolmogorov-Smirnov (K-S tests of maximum pairwise differences between these curves indicates that the Greenland Sea population has a markedly younger phylogenetic structure than either the White Sea population or the two Northwest Atlantic populations, which are of intermediate age and homogeneous structure. The Monte Carlo and K-S assessments provide sensitive quantitative tests of within-species mitogenomic phylogeography. This is the first study to indicate that the White Sea and Greenland Sea populations have different population genetic histories. The analysis supports the hypothesis that Harp Seals comprises three genetically distinguishable breeding populations, in the White Sea, Greenland Sea, and Northwest Atlantic. Implications for an ice-dependent species during ongoing climate change are discussed.
An Analysis of Visible Racial Discrimination In Invisible Man
Institute of Scientific and Technical Information of China (English)
LUO Hui
2016-01-01
Invisible Man by Ralph Ellison is regarded as one of the classical works in contemporary Black American Literature. By briefly analyzing the causes and effects of racial discrimination, the study aims to explore the blacks’responses to it. It is concluded that the hero, as an“invisible man”, should accept his black identity, admitting the blacks’tradition and cultural iden-tity, then he can find his self-belonging in the black community.
Directory of Open Access Journals (Sweden)
Subiakto Soekarno
2012-01-01
Full Text Available Insurance industry stands as a service business that plays a significant role in Indonesiaeconomical condition. The development of insurance industry in Indonesia, both of generalinsurance and life insurance, has increased very fast. The general insurance industry itselfdivided into two major players which are local private company and Joint Venture Company.Lately, the use of statistical techniques and financial ratios models to asses financial institutionsuch as insurance company have been used as one of the appropriate combination inpredicting the performance of an industry. This research aims to distinguish between JointVenture General Insurance Companies that have a good performance and those who are lessperforming well using Discriminant Analysis. Further, the findings led that DiscriminantAnalysis is able to distinguish Joint Venture General Insurance Companies that have a goodperformance and those who are not performing well. There are also six ratios which are RBC,Technical Reserve to Investment Ratio, Debt Ratio, Return on Equity, Loss Ratio, and ExpenseRatio that stand as the most influential ratios to distinguish the performance of joint venturegeneral insurance companies. In addition, the result suggest business people to be concernedtoward those six ratios, to increase their companies’ performance.Key words: general insurance, financial ratio, discriminant analysis
Directory of Open Access Journals (Sweden)
Subiakto Soekarno
2012-01-01
Full Text Available Insurance industry stands as a service business that plays a significant role in Indonesiaeconomical condition. The development of insurance industry in Indonesia, both of generalinsurance and life insurance, has increased very fast. The general insurance industry itselfdivided into two major players which are local private company and Joint Venture Company.Lately, the use of statistical techniques and financial ratios models to asses financial institutionsuch as insurance company have been used as one of the appropriate combination inpredicting the performance of an industry. This research aims to distinguish between JointVenture General Insurance Companies that have a good performance and those who are lessperforming well using Discriminant Analysis. Further, the findings led that DiscriminantAnalysis is able to distinguish Joint Venture General Insurance Companies that have a goodperformance and those who are not performing well. There are also six ratios which are RBC,Technical Reserve to Investment Ratio, Debt Ratio, Return on Equity, Loss Ratio, and ExpenseRatio that stand as the most influential ratios to distinguish the performance of joint venturegeneral insurance companies. In addition, the result suggest business people to be concernedtoward those six ratios, to increase their companies’ performance.Key words: general insurance, financial ratio, discriminant analysis
Mu opioid mediated discriminative-stimulus effects of tramadol: an individual subjects analysis.
Strickland, Justin C; Rush, Craig R; Stoops, William W
2015-03-01
Drug discrimination procedures use dose-dependent generalization, substitution, and pretreatment with selective agonists and antagonists to evaluate receptor systems mediating interoceptive effects of drugs. Despite the extensive use of these techniques in the nonhuman animal literature, few studies have used human participants. Specifically, human studies have not routinely used antagonist administration as a pharmacological tool to elucidate the mechanisms mediating the discriminative stimulus effects of drugs. This study evaluated the discriminative-stimulus effects of tramadol, an atypical analgesic with monoamine and mu opioid activity. Three human participants first learned to discriminate 100 mg tramadol from placebo. A range of tramadol doses (25 to 150 mg) and hydromorphone (4 mg) with and without naltrexone pretreatment (50 mg) were then administered to participants after they acquired the discrimination. Tramadol produced dose-dependent increases in drug-appropriate responding and hydromorphone partially or fully substituted for tramadol in all participants. These effects were attenuated by naltrexone. Individual participant records indicated a relationship between mu opioid activity (i.e., miosis) and drug discrimination performance. Our findings indicate that mu opioid activity may mediate the discriminative-stimulus effects of tramadol in humans. The correspondence of generalization, substitution, and pretreatment findings with the animal literature supports the neuropharmacological specificity of the drug discrimination procedure. © Society for the Experimental Analysis of Behavior.
Asymptotic theory of nonparametric regression estimates with censored data
Institute of Scientific and Technical Information of China (English)
无
2000-01-01
For regression analysis, some useful information may have been lost when the responses are right censored. To estimate nonparametric functions, several estimates based on censored data have been proposed and their consistency and convergence rates have been studied in literature, but the optimal rates of global convergence have not been obtained yet. Because of the possible information loss, one may think that it is impossible for an estimate based on censored data to achieve the optimal rates of global convergence for nonparametric regression, which were established by Stone based on complete data. This paper constructs a regression spline estimate of a general nonparametric regression function based on right_censored response data, and proves, under some regularity conditions, that this estimate achieves the optimal rates of global convergence for nonparametric regression. Since the parameters for the nonparametric regression estimate have to be chosen based on a data driven criterion, we also obtain the asymptotic optimality of AIC, AICC, GCV, Cp and FPE criteria in the process of selecting the parameters.
Rediscovery of Good-Turing estimators via Bayesian nonparametrics.
Favaro, Stefano; Nipoti, Bernardo; Teh, Yee Whye
2016-03-01
The problem of estimating discovery probabilities originated in the context of statistical ecology, and in recent years it has become popular due to its frequent appearance in challenging applications arising in genetics, bioinformatics, linguistics, designs of experiments, machine learning, etc. A full range of statistical approaches, parametric and nonparametric as well as frequentist and Bayesian, has been proposed for estimating discovery probabilities. In this article, we investigate the relationships between the celebrated Good-Turing approach, which is a frequentist nonparametric approach developed in the 1940s, and a Bayesian nonparametric approach recently introduced in the literature. Specifically, under the assumption of a two parameter Poisson-Dirichlet prior, we show that Bayesian nonparametric estimators of discovery probabilities are asymptotically equivalent, for a large sample size, to suitably smoothed Good-Turing estimators. As a by-product of this result, we introduce and investigate a methodology for deriving exact and asymptotic credible intervals to be associated with the Bayesian nonparametric estimators of discovery probabilities. The proposed methodology is illustrated through a comprehensive simulation study and the analysis of Expressed Sequence Tags data generated by sequencing a benchmark complementary DNA library.
Corvucci, Francesca; Nobili, Lara; Melucci, Dora; Grillenzoni, Francesca-Vittoria
2015-02-15
Honey traceability to food quality is required by consumers and food control institutions. Melissopalynologists traditionally use percentages of nectariferous pollens to discriminate the botanical origin and the entire pollen spectrum (presence/absence, type and quantities and association of some pollen types) to determinate the geographical origin of honeys. To improve melissopalynological routine analysis, principal components analysis (PCA) was used. A remarkable and innovative result was that the most significant pollens for the traditional discrimination of the botanical and geographical origin of honeys were the same as those individuated with the chemometric model. The reliability of assignments of samples to honey classes was estimated through explained variance (85%). This confirms that the chemometric model properly describes the melissopalynological data. With the aim to improve honey discrimination, FT-microRaman spectrography and multivariate analysis were also applied. Well performing PCA models and good agreement with known classes were achieved. Encouraging results were obtained for botanical discrimination.
Nonparametric regression with filtered data
Linton, Oliver; Nielsen, Jens Perch; Van Keilegom, Ingrid; 10.3150/10-BEJ260
2011-01-01
We present a general principle for estimating a regression function nonparametrically, allowing for a wide variety of data filtering, for example, repeated left truncation and right censoring. Both the mean and the median regression cases are considered. The method works by first estimating the conditional hazard function or conditional survivor function and then integrating. We also investigate improved methods that take account of model structure such as independent errors and show that such methods can improve performance when the model structure is true. We establish the pointwise asymptotic normality of our estimators.
Nonparametric identification of copula structures
Li, Bo
2013-06-01
We propose a unified framework for testing a variety of assumptions commonly made about the structure of copulas, including symmetry, radial symmetry, joint symmetry, associativity and Archimedeanity, and max-stability. Our test is nonparametric and based on the asymptotic distribution of the empirical copula process.We perform simulation experiments to evaluate our test and conclude that our method is reliable and powerful for assessing common assumptions on the structure of copulas, particularly when the sample size is moderately large. We illustrate our testing approach on two datasets. © 2013 American Statistical Association.
Gas Classification Using Combined Features Based on a Discriminant Analysis for an Electronic Nose
Directory of Open Access Journals (Sweden)
Sang-Il Choi
2016-01-01
Full Text Available This paper proposes a gas classification method for an electronic nose (e-nose system, for which combined features that have been configured through discriminant analysis are used. First, each global feature is extracted from the entire measurement section of the data samples, while the same process is applied to the local features of the section that corresponds to the stabilization, exposure, and purge stages. The discriminative information amounts in the individual features are then measured based on the discriminant analysis, and the combined features are subsequently composed by selecting the features that have a large amount of discriminative information. Regarding a variety of volatile organic compound data, the results of the experiment show that, in a noisy environment, the proposed method exhibits classification performance that is relatively excellent compared to the other feature types.
Directory of Open Access Journals (Sweden)
Zeyu Lin
2017-01-01
Full Text Available In this paper, the authors presented a study on the discrimination of handlebar grip samples, to provide effective forensic science service for hit and run traffic cases. 50 bicycle handlebar grip samples, 49 electric bike handlebar grip samples, and 96 motorcycle handlebar grip samples have been randomly collected by the local police in Beijing (China. Fourier transform infrared microspectroscopy (FTIR was utilized as analytical technology. Then, target absorption selection, data pretreatment, and discrimination of linked samples and unlinked samples were chosen as three steps to improve the discrimination of FTIR spectrums collected from different handlebar grip samples. Principal component analysis and receiver operating characteristic curve were utilized to evaluate different data selection methods and different data pretreatment methods, respectively. It is possible to explore the evidential value of handlebar grip residue evidence through instrumental analysis and statistical treatments. It will provide a universal discrimination method for other forensic science samples as well.
Equity and efficiency in private and public education: a nonparametric comparison
L. Cherchye; K. de Witte; E. Ooghe; I. Nicaise
2007-01-01
We present a nonparametric approach for the equity and efficiency evaluation of (private and public) primary schools in Flanders. First, we use a nonparametric (Data Envelopment Analysis) model that is specially tailored to assess educational efficiency at the pupil level. The model accounts for the
Non-parametric tests of productive efficiency with errors-in-variables
Kuosmanen, T.K.; Post, T.; Scholtes, S.
2007-01-01
We develop a non-parametric test of productive efficiency that accounts for errors-in-variables, following the approach of Varian. [1985. Nonparametric analysis of optimizing behavior with measurement error. Journal of Econometrics 30(1/2), 445-458]. The test is based on the general Pareto-Koopmans
Equity and efficiency in private and public education: a nonparametric comparison
Cherchye, L.; de Witte, K.; Ooghe, E.; Nicaise, I.
2007-01-01
We present a nonparametric approach for the equity and efficiency evaluation of (private and public) primary schools in Flanders. First, we use a nonparametric (Data Envelopment Analysis) model that is specially tailored to assess educational efficiency at the pupil level. The model accounts for the
A contingency table approach to nonparametric testing
Rayner, JCW
2000-01-01
Most texts on nonparametric techniques concentrate on location and linear-linear (correlation) tests, with less emphasis on dispersion effects and linear-quadratic tests. Tests for higher moment effects are virtually ignored. Using a fresh approach, A Contingency Table Approach to Nonparametric Testing unifies and extends the popular, standard tests by linking them to tests based on models for data that can be presented in contingency tables.This approach unifies popular nonparametric statistical inference and makes the traditional, most commonly performed nonparametric analyses much more comp
Ramos, M. Rosário; Carolino, E.; Viegas, Carla; Viegas, Sandra
2016-06-01
Health effects associated with occupational exposure to particulate matter have been studied by several authors. In this study were selected six industries of five different areas: Cork company 1, Cork company 2, poultry, slaughterhouse for cattle, riding arena and production of animal feed. The measurements tool was a portable device for direct reading. This tool provides information on the particle number concentration for six different diameters, namely 0.3 µm, 0.5 µm, 1 µm, 2.5 µm, 5 µm and 10 µm. The focus on these features is because they might be more closely related with adverse health effects. The aim is to identify the particles that better discriminate the industries, with the ultimate goal of classifying industries regarding potential negative effects on workers' health. Several methods of discriminant analysis were applied to data of occupational exposure to particulate matter and compared with respect to classification accuracy. The selected methods were linear discriminant analyses (LDA); linear quadratic discriminant analysis (QDA), robust linear discriminant analysis with selected estimators (MLE (Maximum Likelihood Estimators), MVE (Minimum Volume Elipsoid), "t", MCD (Minimum Covariance Determinant), MCD-A, MCD-B), multinomial logistic regression and artificial neural networks (ANN). The predictive accuracy of the methods was accessed through a simulation study. ANN yielded the highest rate of classification accuracy in the data set under study. Results indicate that the particle number concentration of diameter size 0.5 µm is the parameter that better discriminates industries.
Mixture subclass discriminant analysis link to restricted Gaussian model and other generalizations.
Gkalelis, Nikolaos; Mezaris, Vasileios; Kompatsiaris, Ioannis; Stathaki, Tania
2013-01-01
In this paper, a theoretical link between mixture subclass discriminant analysis (MSDA) and a restricted Gaussian model is first presented. Then, two further discriminant analysis (DA) methods, i.e., fractional step MSDA (FSMSDA) and kernel MSDA (KMSDA) are proposed. Linking MSDA to an appropriate Gaussian model allows the derivation of a new DA method under the expectation maximization (EM) framework (EM-MSDA), which simultaneously derives the discriminant subspace and the maximum likelihood estimates. The two other proposed methods generalize MSDA in order to solve problems inherited from conventional DA. FSMSDA solves the subclass separation problem, that is, the situation in which the dimensionality of the discriminant subspace is strictly smaller than the rank of the inter-between-subclass scatter matrix. This is done by an appropriate weighting scheme and the utilization of an iterative algorithm for preserving useful discriminant directions. On the other hand, KMSDA uses the kernel trick to separate data with nonlinearly separable subclass structure. Extensive experimentation shows that the proposed methods outperform conventional MSDA and other linear discriminant analysis variants.
Estimation of Spatial Dynamic Nonparametric Durbin Models with Fixed Effects
Qian, Minghui; Hu, Ridong; Chen, Jianwei
2016-01-01
Spatial panel data models have been widely studied and applied in both scientific and social science disciplines, especially in the analysis of spatial influence. In this paper, we consider the spatial dynamic nonparametric Durbin model (SDNDM) with fixed effects, which takes the nonlinear factors into account base on the spatial dynamic panel…
Non-parametric Bayesian inference for inhomogeneous Markov point processes
DEFF Research Database (Denmark)
Berthelsen, Kasper Klitgaard; Møller, Jesper
With reference to a specific data set, we consider how to perform a flexible non-parametric Bayesian analysis of an inhomogeneous point pattern modelled by a Markov point process, with a location dependent first order term and pairwise interaction only. A priori we assume that the first order term...
Homothetic Efficiency and Test Power: A Non-Parametric Approach
J. Heufer (Jan); P. Hjertstrand (Per)
2015-01-01
markdownabstract__Abstract__ We provide a nonparametric revealed preference approach to demand analysis based on homothetic efficiency. Homotheticity is a useful restriction but data rarely satisfies testable conditions. To overcome this we provide a way to estimate homothetic efficiency of consump
Directory of Open Access Journals (Sweden)
Haiyan Huang
2016-10-01
Full Text Available Biomass burning is a global phenomenon and systematic burned area mapping is of increasing importance for science and applications. With high spatial resolution and novelty in band design, the recently launched Sentinel-2A satellite provides a new opportunity for moderate spatial resolution burned area mapping. This study examines the performance of the Sentinel-2A Multi Spectral Instrument (MSI bands and derived spectral indices to differentiate between unburned and burned areas. For this purpose, five pairs of pre-fire and post-fire top of atmosphere (TOA reflectance and atmospherically corrected (surface reflectance images were studied. The pixel values of locations that were unburned in the first image and burned in the second image, as well as the values of locations that were unburned in both images which served as a control, were compared and the discrimination of individual bands and spectral indices were evaluated using parametric (transformed divergence and non-parametric (decision tree approaches. Based on the results, the most suitable MSI bands to detect burned areas are the 20 m near-infrared, short wave infrared and red-edge bands, while the performance of the spectral indices varied with location. The atmospheric correction only significantly influenced the separability of the visible wavelength bands. The results provide insights that are useful for developing Sentinel-2 burned area mapping algorithms.
Ocampo-Duque, William; Osorio, Carolina; Piamba, Christian; Schuhmacher, Marta; Domingo, José L
2013-02-01
The integration of water quality monitoring variables is essential in environmental decision making. Nowadays, advanced techniques to manage subjectivity, imprecision, uncertainty, vagueness, and variability are required in such complex evaluation process. We here propose a probabilistic fuzzy hybrid model to assess river water quality. Fuzzy logic reasoning has been used to compute a water quality integrative index. By applying a Monte Carlo technique, based on non-parametric probability distributions, the randomness of model inputs was estimated. Annual histograms of nine water quality variables were built with monitoring data systematically collected in the Colombian Cauca River, and probability density estimations using the kernel smoothing method were applied to fit data. Several years were assessed, and river sectors upstream and downstream the city of Santiago de Cali, a big city with basic wastewater treatment and high industrial activity, were analyzed. The probabilistic fuzzy water quality index was able to explain the reduction in water quality, as the river receives a larger number of agriculture, domestic, and industrial effluents. The results of the hybrid model were compared to traditional water quality indexes. The main advantage of the proposed method is that it considers flexible boundaries between the linguistic qualifiers used to define the water status, being the belongingness of water quality to the diverse output fuzzy sets or classes provided with percentiles and histograms, which allows classify better the real water condition. The results of this study show that fuzzy inference systems integrated to stochastic non-parametric techniques may be used as complementary tools in water quality indexing methodologies.
蜻蜒3属的非参数判别分析%Discriminant Analysis of 3 Genera of Odonata by Nonparametric Methods
Institute of Scientific and Technical Information of China (English)
蔡笃程; 刘桂清; 李大军
2005-01-01
以斑小蜻属Nannophipsis、灰蜻属Orthetrum和黄蜻属Pantala(蜻蜒目蜻科)成虫的腹部、后翅、后翅痣、上肛附器、内肛附器和第10腹节的长度作为数值变量,用非参数法进行判别分析.结果表明:非参数判别法能有效地区分斑小蜻属、灰蜻属和黄蜻属成虫,回判正确率和交叉确认正确率均为99.05%.
Electron - nuclear recoil discrimination by pulse shape analysis
Elbs, J; Collin, E; Godfrin, H; Suvorova, O
2007-01-01
In the framework of the ``ULTIMA'' project, we use ultra cold superfluid 3He bolometers for the direct detection of single particle events, aimed for a future use as a dark matter detector. One parameter of the pulse shape observed after such an event is the thermalization time constant. Until now it was believed that this parameter only depends on geometrical factors and superfluid 3He properties, and that it is independent of the nature of the incident particles. In this report we show new results which demonstrate that a difference for muon- and neutron events, as well as events simulated by heater pulses exist. The possibility to use this difference for event discrimination in a future dark matter detector will be discussed.
Moran, E. R.
2000-01-01
The prospect of a justification defence in cases of direct sex discrimination is universally criticised by academic commentators on the ground that it would subvert the goal of equality that underlies sex discrimination and equal treatment legislation. At the outset the thesis examines the differences between the sexes, how these differences can be used to explain the distinction between direct and indirect sex discrimination and considers various concepts of equality. Building...
Parametric and Nonparametric Discriminants for Regional Earthquakes and Explosions
1993-07-31
0 1... . .... .. 2 0 00 ......... 0015 .200E00O0 .2 2 . 7 ... .. -25000.:y ... ..e... .... ....... Figure. .... Eartquak 7 at... Statio ... on41/9...0 10-5 7 Wt 000 1250030(.025% M 0 U02& OWN ~ igr .8 Eartquak 8.. at.... Stat on .... on~* 5/18/92... wi..h local.... scmagiue27 n S.4E0
Institute of Scientific and Technical Information of China (English)
ZHANG Wei; LI Xi-bing; GONG Feng-qiang
2008-01-01
Based on the principle of Mahalanobis distance discriminant analysis (DDA) theory, a stability classification model for mine-lane surrounding rock was established, including six indexes of discriminant factors that reflect the engineering quality of surrounding rock: lane depth below surface, span of lane, ratio of directly top layer thickness to coal thickness, uniaxial comprehensive strength of surrounding rock, development degree coefficient of surrounding rock joint and range of broken surrounding rock zone. A DDA model was obtained through training 15 practical measuring samples. The re-substitution method was introduced to verify the stability of DDA model and the ratio of mis-discrimination is zero. The DDA model was used to discriminate3 new samples and the results are identical with actual rock kind. Compared with the artificial neural network method and support vector mechanic method, the results show that this model has high prediction accuracy and can be used in practical engineering.
Liu, Guofu; Yang, Jun; Lin, Cunbao; Hu, Qingqing; Peng, Jinxian
2013-01-01
Frequency gradient analysis (FGA) effectively discriminates neutrons and gamma rays by examining the frequency-domain features of the photomultiplier tube anode signal. This approach is insensitive to noise but is inevitably affected by the baseline drift, similar to other pulse shape discrimination methods. The baseline drift effect is attributed to the factors such as power line fluctuation, dark current, noise disturbances, hum, and pulse tail in front-end electronics. This effect needs to be elucidated and quantified before the baseline shift can be estimated and removed from the captured signal. Therefore, the effect of baseline shift on the discrimination performance of neutrons and gamma rays with organic scintillation detectors using FGA is investigated in this paper. The relationship between the baseline shift and discrimination parameters of FGA is derived and verified by an experimental system consisting of an americium-beryllium source, a BC501A liquid scintillator detector, and a 5 GSPS 8-bit osc...
Nonparametric Regression with Common Shocks
Directory of Open Access Journals (Sweden)
Eduardo A. Souza-Rodrigues
2016-09-01
Full Text Available This paper considers a nonparametric regression model for cross-sectional data in the presence of common shocks. Common shocks are allowed to be very general in nature; they do not need to be finite dimensional with a known (small number of factors. I investigate the properties of the Nadaraya-Watson kernel estimator and determine how general the common shocks can be while still obtaining meaningful kernel estimates. Restrictions on the common shocks are necessary because kernel estimators typically manipulate conditional densities, and conditional densities do not necessarily exist in the present case. By appealing to disintegration theory, I provide sufficient conditions for the existence of such conditional densities and show that the estimator converges in probability to the Kolmogorov conditional expectation given the sigma-field generated by the common shocks. I also establish the rate of convergence and the asymptotic distribution of the kernel estimator.
Directory of Open Access Journals (Sweden)
Farshad Vesali
2012-09-01
Full Text Available In present study we tried to recognizing weeds in potato fields to effective use from herbicides. As we know potato is one of the crops which is cultivated vastly all over the world and it is a major world food crop that is consumed by over one billion people world over, but it is threated by weed invade, because of row cropping system applied in potato tillage. Machine vision is used in this research for effective application of herbicides in field. About 300 color images from 3 potato farms of Qorveh city and 2 farms of Urmia University-Iran, was acquired. Images were acquired in different illumination condition from morning to evening in sunny and cloudy days. Because of overlap and shading of plants in farm condition it is hard to use morphologic parameters. In method used for classifying weeds and potato plants, primary color components of each plant were extracted and the relation between them was estimated for determining discriminant function and classifying plants using discrimination analysis. In addition the decision tree method was used to compare results with discriminant analysis. Three different classifications were applied: first, Classification was applied to discriminate potato plant from all other weeds (two groups, the rate of correct classification was 76.67% for discriminant analysis and 83.82% for decision tree; second classification was applied to discriminate potato plant from separate groups of each weed (6 groups, the rate of correct classification was 87%. And the third, Classification of potato plant versus weed species one by one. As the weeds were different, the results of classification were different in this composition. The decision tree in all conditions showed the better result than discriminant analysis.
Nonparametric Bayesian Modeling of Complex Networks
DEFF Research Database (Denmark)
Schmidt, Mikkel Nørgaard; Mørup, Morten
2013-01-01
Modeling structure in complex networks using Bayesian nonparametrics makes it possible to specify flexible model structures and infer the adequate model complexity from the observed data. This article provides a gentle introduction to nonparametric Bayesian modeling of complex networks: Using...... for complex networks can be derived and point out relevant literature....
An asymptotically optimal nonparametric adaptive controller
Institute of Scientific and Technical Information of China (English)
郭雷; 谢亮亮
2000-01-01
For discrete-time nonlinear stochastic systems with unknown nonparametric structure, a kernel estimation-based nonparametric adaptive controller is constructed based on truncated certainty equivalence principle. Global stability and asymptotic optimality of the closed-loop systems are established without resorting to any external excitations.
Determination of sex by discriminant function analysis of mandibles from a Central Indian population
Directory of Open Access Journals (Sweden)
Kanchankumar P Wankhede
2015-01-01
Full Text Available Context: Identification of sex from skeletal remains is one of the important forensic considerations. Discriminant function analysis is increasingly used to determine the sex from skeleton. Aims: To develop discriminant function to determine sex from mandible in a Central Indian population. Settings and Design: This was a prospective study done at the Department of Anatomy. Materials and Methods: The mandibles used in the present study were from the museum specimens. Only 82 adult mandibles (55 male and 27 female that had been preserved were selected. Ten mandibular parameters were measured. Statistical Analysis Used: Statistical analysis was conducted using Statistical Package for Social Sciences (SPSS for Windows, version 16. The level of statistical significance was set at P < 0.05. Results: Using stepwise discriminant function analysis, only six variables were selected as the best discriminant between sexes, with the projection length of corpus mandibulae being the most dimorphic. It was observed that sex classification accuracy of the discriminant functions ranged from 57.3 to 80.5% for the individual variables, 81.7% for the stepwise method, and 85.4% for the direct method. Conclusion: The results of the study show that mandibles can be used for determining sex and the results are comparable with other similar studies. The studied mandibular variables showed sexual dimorphism with an accuracy comparable with other skeletal remains, next to cranium and pelvis.
Directory of Open Access Journals (Sweden)
Santana Isabel
2011-08-01
Full Text Available Abstract Background Dementia and cognitive impairment associated with aging are a major medical and social concern. Neuropsychological testing is a key element in the diagnostic procedures of Mild Cognitive Impairment (MCI, but has presently a limited value in the prediction of progression to dementia. We advance the hypothesis that newer statistical classification methods derived from data mining and machine learning methods like Neural Networks, Support Vector Machines and Random Forests can improve accuracy, sensitivity and specificity of predictions obtained from neuropsychological testing. Seven non parametric classifiers derived from data mining methods (Multilayer Perceptrons Neural Networks, Radial Basis Function Neural Networks, Support Vector Machines, CART, CHAID and QUEST Classification Trees and Random Forests were compared to three traditional classifiers (Linear Discriminant Analysis, Quadratic Discriminant Analysis and Logistic Regression in terms of overall classification accuracy, specificity, sensitivity, Area under the ROC curve and Press'Q. Model predictors were 10 neuropsychological tests currently used in the diagnosis of dementia. Statistical distributions of classification parameters obtained from a 5-fold cross-validation were compared using the Friedman's nonparametric test. Results Press' Q test showed that all classifiers performed better than chance alone (p Conclusions When taking into account sensitivity, specificity and overall classification accuracy Random Forests and Linear Discriminant analysis rank first among all the classifiers tested in prediction of dementia using several neuropsychological tests. These methods may be used to improve accuracy, sensitivity and specificity of Dementia predictions from neuropsychological testing.
ALE meta-analysis reveals dissociable networks for affective and discriminative aspects of touch.
Morrison, India
2016-04-01
Emotionally-laden tactile stimulation-such as a caress on the skin or the feel of velvet-may represent a functionally distinct domain of touch, underpinned by specific cortical pathways. In order to determine whether, and to what extent, cortical functional neuroanatomy supports a distinction between affective and discriminative touch, an activation likelihood estimate (ALE) meta-analysis was performed. This meta-analysis statistically mapped reported functional magnetic resonance imaging (fMRI) activations from 17 published affective touch studies in which tactile stimulation was associated with positive subjective evaluation (n = 291, 34 experimental contrasts). A separate ALE meta-analysis mapped regions most likely to be activated by tactile stimulation during detection and discrimination tasks (n = 1,075, 91 experimental contrasts). These meta-analyses revealed dissociable regions for affective and discriminative touch, with posterior insula (PI) more likely to be activated for affective touch, and primary somatosensory cortices (SI) more likely to be activated for discriminative touch. Secondary somatosensory cortex had a high likelihood of engagement by both affective and discriminative touch. Further, meta-analytic connectivity (MCAM) analyses investigated network-level co-activation likelihoods independent of task or stimulus, across a range of domains and paradigms. Affective-related PI and discriminative-related SI regions co-activated with different networks, implicated in dissociable functions, but sharing somatosensory co-activations. Taken together, these meta-analytic findings suggest that affective and discriminative touch are dissociable both on the regional and network levels. However, their degree of shared activation likelihood in somatosensory cortices indicates that this dissociation reflects functional biases within tactile processing networks, rather than functionally and anatomically distinct pathways.
Liu, Rong; Lv, Guorong; He, Bin; Xu, Kexin
2011-03-01
Since the beginning of the 21st century, the issue of food safety is becoming a global concern. It is very important to develop a rapid, cost-effective, and widely available method for food adulteration detection. In this paper, near-infrared spectroscopy techniques and pattern recognition were applied to study the qualitative discriminant analysis method. The samples were prepared and adulterated with one of the three adulterants, urea, glucose and melamine with different concentrations. First, the spectral characteristics of milk and adulterant samples were analyzed. Then, pattern recognition methods were used for qualitative discriminant analysis of milk adulteration. Soft independent modeling of class analogy and partial least squares discriminant analysis (PLSDA) were used to construct discriminant models, respectively. Furthermore, the optimization method of the model was studied. The best spectral pretreatment methods and the optimal band were determined. In the optimal conditions, PLSDA models were constructed respectively for each type of adulterated sample sets (urea, melamine and glucose) and all the three types of adulterated sample sets. Results showed that, the discrimination accuracy of model achieved 93.2% in the classification of different adulterated and unadulterated milk samples. Thus, it can be concluded that near-infrared spectroscopy and PLSDA can be used to identify whether the milk has been adulterated or not and the type of adulterant used.
Sun, Hui; Wang, Huiyu; Zhang, Aihua; Yan, Guangli; Han, Ying; Li, Yuan; Wu, Xiuhong; Meng, Xiangcai; Wang, Xijun
2016-01-01
As herbal medicines have an important position in health care systems worldwide, their current assessment, and quality control are a major bottleneck. Cortex Phellodendri chinensis (CPC) and Cortex Phellodendri amurensis (CPA) are widely used in China, however, how to identify species of CPA and CPC has become urgent. In this study, multivariate analysis approach was performed to the investigation of chemical discrimination of CPA and CPC. Principal component analysis showed that two herbs could be separated clearly. The chemical markers such as berberine, palmatine, phellodendrine, magnoflorine, obacunone, and obaculactone were identified through the orthogonal partial least squared discriminant analysis, and were identified tentatively by the accurate mass of quadruple-time-of-flight mass spectrometry. A total of 29 components can be used as the chemical markers for discrimination of CPA and CPC. Of them, phellodenrine is significantly higher in CPC than that of CPA, whereas obacunone and obaculactone are significantly higher in CPA than that of CPC. The present study proves that multivariate analysis approach based chemical analysis greatly contributes to the investigation of CPA and CPC, and showed that the identified chemical markers as a whole should be used to discriminate the two herbal medicines, and simultaneously the results also provided chemical information for their quality assessment. Multivariate analysis approach was performed to the investigate the herbal medicineThe chemical markers were identified through multivariate analysis approachA total of 29 components can be used as the chemical markers. UPLC-Q/TOF-MS-based multivariate analysis method for the herbal medicine samples Abbreviations used: CPC: Cortex Phellodendri chinensis, CPA: Cortex Phellodendri amurensis, PCA: Principal component analysis, OPLS-DA: Orthogonal partial least squares discriminant analysis, BPI: Base peaks ion intensity.
Energy Technology Data Exchange (ETDEWEB)
Mann, L.K.; Shugart, H.H.; Kitchings, J.T.
1978-03-01
The purpose of this study was to evaluate the effectiveness of using discriminant analysis in assessing plant niches. As a component of research by the Environmental Research Park Project at Oak Ridge, Tennessee, five sites were inventoried for herbaceous species. From this inventory, four sympatric species of Galium and seventeen co-occurring herbaceous species were selected for discriminant analysis. The four species of Galium were treated as two data sets: one was composed of information collected at one site (a mesic hardwood area) and the other contained data from two cedar sites of shallow soil over limestone bedrock. The seventeen herbaceous species all occurred in the mesic hardwood area.
Martins, Angélica Rocha; Talhavini, Márcio; Vieira, Maurício Leite; Zacca, Jorge Jardim; Braga, Jez Willian Batista
2017-08-15
The discrimination of whisky brands and counterfeit identification were performed by UV-Vis spectroscopy combined with partial least squares for discriminant analysis (PLS-DA). In the proposed method all spectra were obtained with no sample preparation. The discrimination models were built with the employment of seven whisky brands: Red Label, Black Label, White Horse, Chivas Regal (12years), Ballantine's Finest, Old Parr and Natu Nobilis. The method was validated with an independent test set of authentic samples belonging to the seven selected brands and another eleven brands not included in the training samples. Furthermore, seventy-three counterfeit samples were also used to validate the method. Results showed correct classification rates for genuine and false samples over 98.6% and 93.1%, respectively, indicating that the method can be helpful for the forensic analysis of whisky samples. Copyright © 2017 Elsevier Ltd. All rights reserved.
Perrière, Guy; Thioulouse, Jean
2003-02-01
Correspondence discriminant analysis (CDA) is a multivariate statistical method derived from discriminant analysis which can be used on contingency tables. We have used CDA to separate Gram negative bacteria proteins according to their subcellular location. The high resolution of the discrimination obtained makes this method a good tool to predict subcellular location when this information is not known. The main advantage of this technique is its simplicity. Indeed, by computing two linear formulae on amino acid composition, it is possible to classify a protein into one of the three classes of subcellular location we have defined. The CDA itself can be computed with the ADE-4 software package that can be downloaded, as well as the data set used in this study, from the Pôle Bio-Informatique Lyonnais (PBIL) server at http://pbil.univ-lyon1.fr.
Directory of Open Access Journals (Sweden)
Anna Maria Stellacci
2012-07-01
Full Text Available Hyperspectral (HS data represents an extremely powerful means for rapidly detecting crop stress and then aiding in the rational management of natural resources in agriculture. However, large volume of data poses a challenge for data processing and extracting crucial information. Multivariate statistical techniques can play a key role in the analysis of HS data, as they may allow to both eliminate redundant information and identify synthetic indices which maximize differences among levels of stress. In this paper we propose an integrated approach, based on the combined use of Principal Component Analysis (PCA and Canonical Discriminant Analysis (CDA, to investigate HS plant response and discriminate plant status. The approach was preliminary evaluated on a data set collected on durum wheat plants grown under different nitrogen (N stress levels. Hyperspectral measurements were performed at anthesis through a high resolution field spectroradiometer, ASD FieldSpec HandHeld, covering the 325-1075 nm region. Reflectance data were first restricted to the interval 510-1000 nm and then divided into five bands of the electromagnetic spectrum [green: 510-580 nm; yellow: 581-630 nm; red: 631-690 nm; red-edge: 705-770 nm; near-infrared (NIR: 771-1000 nm]. PCA was applied to each spectral interval. CDA was performed on the extracted components to identify the factors maximizing the differences among plants fertilised with increasing N rates. Within the intervals of green, yellow and red only the first principal component (PC had an eigenvalue greater than 1 and explained more than 95% of total variance; within the ranges of red-edge and NIR, the first two PCs had an eigenvalue higher than 1. Two canonical variables explained cumulatively more than 81% of total variance and the first was able to discriminate wheat plants differently fertilised, as confirmed also by the significant correlation with aboveground biomass and grain yield parameters. The combined
Optimal Class Separation in Hyperspectral Image Data: Iterated Canonical Discriminant Analysis
DEFF Research Database (Denmark)
Nielsen, Allan Aasbjerg; Müller, Andreas
This paper describes canonical discriminant analysis and sketches an iterative version which is then applied to obtain optimal separation between a region, here examplified by either “water” or “wood/trees” and the rest of a HyMap image. We show that the iterative version greatly enhances the sep...
DEFF Research Database (Denmark)
Jensen, P O; Larsen, J K; Christensen, I J
1994-01-01
Bromodeoxyuridine (BrdUrd) labelled and unlabelled mitotic cells, respectively, can be discriminated from interphase cells using a new method, based on immunocytochemical staining of BrdUrd and flow cytometric four-parameter analysis of DNA content, BrdUrd incorporation, and forward and orthogona...
Nichols, K. E.
Safran Student Interest Inventory (SSII) data was gathered on 135 university students registered in five different faculties. A discriminant analysis of the data indicated that the SSII was a good test for separating students into faculties and therefore would make a good counselling instrument. Some results are also present using Differential…
Directory of Open Access Journals (Sweden)
U. Amato
2014-06-01
Full Text Available We introduce a classification method (Cumulative Discriminant Analysis of the Discriminant Analysis type to discriminate between cloudy and clear sky satellite observations in the thermal infrared. The tool is intended for the high spectral resolution infrared sounder (IRS planned for the geostationary METEOSAT (Meteorological Satellite Third Generation platform and uses IASI (Infrared Atmospheric Sounding Interferometer data as a proxy. The Cumulative Discriminant Analysis does not introduce biases intrinsic with the approximation of the probability density functions and is flexible enough to adapt to different strategies to optimize the cloud mask. The methodology is based on nine statistics computed from IASI spectral radiances, which exploit the high spectral resolution of the instrument and which effectively summarize information contained within the IASI spectrum. A Principal Component Analysis prior step is also introduced which makes the problem more consistent with the statistical assumptions of the methodology. An initial assessment of the scheme is performed based on global and regional IASI real data sets and cloud masks obtained from AVHRR (Advanced Very High Resolution Radiometer and SEVIRI (Spinning Enhanced Visible and Infrared Imager imagers. The agreement with these independent cloud masks is generally well above 80%, except at high latitudes in their winter seasons.
Amato, U.; Lavanant, L.; Liuzzi, G.; Masiello, G.; Serio, C.; Stuhlmann, R.; Tjemkes, S. A.
2014-10-01
We introduce a classification method (cumulative discriminant analysis) of the discriminant analysis type to discriminate between cloudy and clear-sky satellite observations in the thermal infrared. The tool is intended for the high-spectral-resolution infrared sounder (IRS) planned for the geostationary METEOSAT (Meteorological Satellite) Third Generation platform and uses IASI (Infrared Atmospheric Sounding Interferometer) data as a proxy. The cumulative discriminant analysis does not introduce biases intrinsic with the approximation of the probability density functions and is flexible enough to adapt to different strategies to optimize the cloud mask. The methodology is based on nine statistics computed from IASI spectral radiances, which exploit the high spectral resolution of the instrument and which effectively summarize information contained within the IASI spectrum. A principal component analysis prior step is also introduced, which makes the problem more consistent with the statistical assumptions of the methodology. An initial assessment of the scheme is performed based on global and regional IASI real data sets and cloud masks obtained from AVHRR (Advanced Very High Resolution Radiometer) and SEVIRI (Spinning Enhanced Visible and Infrared Imager) imagers. The agreement with these independent cloud masks is generally well above 80 %, except at high latitudes in the winter seasons.
On the Variable Selection Problem in Multiple Group Discriminant Analysis.
Huberty, Carl J.
This study was concerned with various schemes for reducing the number of variables in a multivariate analysis. Two sets of illustrative data were used; the numbers of criterion groups were 3 and 5. The proportion of correct classifications was employed as an index of discriminatory power of each subset of variables selected. Of the four procedures…
Energy Technology Data Exchange (ETDEWEB)
Balmer, Matthew J.I., E-mail: m.balmer@lancaster.ac.uk [Department of Engineering, Lancaster University, LA1 4YR (United Kingdom); Gamage, Kelum A.A. [Department of Engineering, Lancaster University, LA1 4YR (United Kingdom); Taylor, Graeme C. [Neutron Metrology Group, National Physical Laboratory, Teddington, TW11 0LW (United Kingdom)
2015-07-11
Three algorithms for discriminating between fast neutrons, thermal neutrons and gamma rays in a {sup 6}Li loaded plastic scintillator have been compared. Following a literature review of existing pulse shape discrimination techniques, the performance of the charge comparison method, triangular filtering and frequency gradient analysis were investigated in this work. The scintillator was exposed to three different mixed gamma/neutron radiation fields. The figure of merit of neutron/gamma separation was investigated over a broad energy range, as well as for the neutron capture energy region. After optimisation, all three methods were found to perform similarly in terms of neutron/gamma separation.
Nonparametric methods in actigraphy: An update
Directory of Open Access Journals (Sweden)
Bruno S.B. Gonçalves
2014-09-01
Full Text Available Circadian rhythmicity in humans has been well studied using actigraphy, a method of measuring gross motor movement. As actigraphic technology continues to evolve, it is important for data analysis to keep pace with new variables and features. Our objective is to study the behavior of two variables, interdaily stability and intradaily variability, to describe rest activity rhythm. Simulated data and actigraphy data of humans, rats, and marmosets were used in this study. We modified the method of calculation for IV and IS by modifying the time intervals of analysis. For each variable, we calculated the average value (IVm and ISm results for each time interval. Simulated data showed that (1 synchronization analysis depends on sample size, and (2 fragmentation is independent of the amplitude of the generated noise. We were able to obtain a significant difference in the fragmentation patterns of stroke patients using an IVm variable, while the variable IV60 was not identified. Rhythmic synchronization of activity and rest was significantly higher in young than adults with Parkinson׳s when using the ISM variable; however, this difference was not seen using IS60. We propose an updated format to calculate rhythmic fragmentation, including two additional optional variables. These alternative methods of nonparametric analysis aim to more precisely detect sleep–wake cycle fragmentation and synchronization.
Nonparametric methods in actigraphy: An update
Gonçalves, Bruno S.B.; Cavalcanti, Paula R.A.; Tavares, Gracilene R.; Campos, Tania F.; Araujo, John F.
2014-01-01
Circadian rhythmicity in humans has been well studied using actigraphy, a method of measuring gross motor movement. As actigraphic technology continues to evolve, it is important for data analysis to keep pace with new variables and features. Our objective is to study the behavior of two variables, interdaily stability and intradaily variability, to describe rest activity rhythm. Simulated data and actigraphy data of humans, rats, and marmosets were used in this study. We modified the method of calculation for IV and IS by modifying the time intervals of analysis. For each variable, we calculated the average value (IVm and ISm) results for each time interval. Simulated data showed that (1) synchronization analysis depends on sample size, and (2) fragmentation is independent of the amplitude of the generated noise. We were able to obtain a significant difference in the fragmentation patterns of stroke patients using an IVm variable, while the variable IV60 was not identified. Rhythmic synchronization of activity and rest was significantly higher in young than adults with Parkinson׳s when using the ISM variable; however, this difference was not seen using IS60. We propose an updated format to calculate rhythmic fragmentation, including two additional optional variables. These alternative methods of nonparametric analysis aim to more precisely detect sleep–wake cycle fragmentation and synchronization. PMID:26483921
Mishra, Spandan; Vanli, O. Arda; Huffer, Fred W.; Jung, Sungmoon
2016-04-01
In this study we propose a regularized linear discriminant analysis approach for damage detection which does not require an intermediate feature extraction step and therefore more efficient in handling data with high-dimensionality. A robust discriminant model is obtained by shrinking of the covariance matrix to a diagonal matrix and thresholding redundant predictors without hurting the predictive power of the model. The shrinking and threshold parameters of the discriminant function (decision boundary) are estimated to minimize the classification error. Furthermore, it is shown how the damage classification achieved by the proposed method can be extended to multiple sensors by following a Bayesian decision-fusion formulation. The detection probability of each sensor is used as a prior condition to estimate the posterior detection probability of the entire network and the posterior detection probability is used as a quantitative basis to make the final decision about the damage.
Shrinkage-based diagonal discriminant analysis and its applications in high-dimensional data.
Pang, Herbert; Tong, Tiejun; Zhao, Hongyu
2009-12-01
High-dimensional data such as microarrays have brought us new statistical challenges. For example, using a large number of genes to classify samples based on a small number of microarrays remains a difficult problem. Diagonal discriminant analysis, support vector machines, and k-nearest neighbor have been suggested as among the best methods for small sample size situations, but none was found to be superior to others. In this article, we propose an improved diagonal discriminant approach through shrinkage and regularization of the variances. The performance of our new approach along with the existing methods is studied through simulations and applications to real data. These studies show that the proposed shrinkage-based and regularization diagonal discriminant methods have lower misclassification rates than existing methods in many cases.
Parametric and Non-Parametric System Modelling
DEFF Research Database (Denmark)
Nielsen, Henrik Aalborg
1999-01-01
considered. It is shown that adaptive estimation in conditional parametric models can be performed by combining the well known methods of local polynomial regression and recursive least squares with exponential forgetting. The approach used for estimation in conditional parametric models also highlights how....... For this purpose non-parametric methods together with additive models are suggested. Also, a new approach specifically designed to detect non-linearities is introduced. Confidence intervals are constructed by use of bootstrapping. As a link between non-parametric and parametric methods a paper dealing with neural...... the focus is on combinations of parametric and non-parametric methods of regression. This combination can be in terms of additive models where e.g. one or more non-parametric term is added to a linear regression model. It can also be in terms of conditional parametric models where the coefficients...
Ding, Hang
2014-01-01
Structures in recurrence plots (RPs), preserving the rich information of nonlinear invariants and trajectory characteristics, have been increasingly analyzed in dynamic discrimination studies. The conventional analysis of RPs is mainly focused on quantifying the overall diagonal and vertical line structures through a method, called recurrence quantification analysis (RQA). This study extensively explores the information in RPs by quantifying local complex RP structures. To do this, an approach was developed to analyze the combination of three major RQA variables: determinism, laminarity, and recurrence rate (DLR) in a metawindow moving over a RP. It was then evaluated in two experiments discriminating (1) ideal nonlinear dynamic series emulated from the Lorenz system with different control parameters and (2) data sets of human heart rate regulations with normal sinus rhythms (n = 18) and congestive heart failure (n = 29). Finally, the DLR was compared with seven major RQA variables in terms of discriminatory power, measured by standardized mean difference (DSMD). In the two experiments, DLR resulted in the highest discriminatory power with DSMD = 2.53 and 0.98, respectively, which were 7.41 and 2.09 times the best performance from RQA. The study also revealed that the optimal RP structures for the discriminations were neither typical diagonal structures nor vertical structures. These findings indicate that local complex RP structures contain some rich information unexploited by RQA. Therefore, future research to extensively analyze complex RP structures would potentially improve the effectiveness of the RP analysis in dynamic discrimination studies.
Generalized Discriminant Analysis algorithm for feature reduction in Cyber Attack Detection System
Directory of Open Access Journals (Sweden)
Shailendra Singh
2009-10-01
Full Text Available This Generalized Discriminant Analysis (GDA has provided an extremely powerful approach to extracting non-linear features. The network traffic data provided for the design of intrusion detection system always are large with ineffective information, thus we need to remove the worthless information from the original high dimensional database. To improve the generalization ability, we usually generate a small set of features from the original input variables by feature extraction. The conventional Linear Discriminant Analysis (LDA feature reduction technique has its limitations. It is not suitable for non-linear dataset. Thus we propose an efficient algorithm based on the Generalized Discriminant Analysis (GDA feature reduction technique which is novel approach used in the area of cyber attack detection. This not only reduces the number of the input features but also increases the classification accuracy and reduces the training and testing time of the classifiers by selecting most discriminating features. We use Artificial Neural Network (ANN and C4.5 classifiers to compare the performance of the proposed technique. The result indicates the superiority of algorithm.
Ding, Hang
2014-01-01
Structures in recurrence plots (RPs), preserving the rich information of nonlinear invariants and trajectory characteristics, have been increasingly analyzed in dynamic discrimination studies. The conventional analysis of RPs is mainly focused on quantifying the overall diagonal and vertical line structures through a method, called recurrence quantification analysis (RQA). This study extensively explores the information in RPs by quantifying local complex RP structures. To do this, an approach was developed to analyze the combination of three major RQA variables: determinism, laminarity, and recurrence rate (DLR) in a metawindow moving over a RP. It was then evaluated in two experiments discriminating (1) ideal nonlinear dynamic series emulated from the Lorenz system with different control parameters and (2) data sets of human heart rate regulations with normal sinus rhythms (n = 18) and congestive heart failure (n = 29). Finally, the DLR was compared with seven major RQA variables in terms of discriminatory power, measured by standardized mean difference (DSMD). In the two experiments, DLR resulted in the highest discriminatory power with DSMD = 2.53 and 0.98, respectively, which were 7.41 and 2.09 times the best performance from RQA. The study also revealed that the optimal RP structures for the discriminations were neither typical diagonal structures nor vertical structures. These findings indicate that local complex RP structures contain some rich information unexploited by RQA. Therefore, future research to extensively analyze complex RP structures would potentially improve the effectiveness of the RP analysis in dynamic discrimination studies.
Byrd, Christy M; Carter Andrews, Dorinda J
2016-08-01
Although there exists a healthy body of literature related to discrimination in schools, this research has primarily focused on racial or ethnic discrimination as perceived and experienced by students of color. Few studies examine students' perceptions of discrimination from a variety of sources, such as adults and peers, their descriptions of the discrimination, or the frequency of discrimination in the learning environment. Middle and high school students in a Midwestern school district (N=1468) completed surveys identifying whether they experienced discrimination from seven sources (e.g., peers, teachers, administrators), for seven reasons (e.g., gender, race/ethnicity, religion), and in eight forms (e.g., punished more frequently, called names, excluded from social groups). The sample was 52% White, 15% Black/African American, 14% Multiracial, and 17% Other. Latent class analysis was used to cluster individuals based on reported sources of, reasons for, and forms of discrimination. Four clusters were found, and ANOVAs were used to test for differences between clusters on perceptions of school climate, relationships with teachers, perceptions that the school was a "good school," and engagement. The Low Discrimination cluster experienced the best outcomes, whereas an intersectional cluster experienced the most discrimination and the worst outcomes. The results confirm existing research on the negative effects of discrimination. Additionally, the paper adds to the literature by highlighting the importance of an intersectional approach to examining students' perceptions of in-school discrimination. Copyright © 2016 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.
Bayesian nonparametric duration model with censorship
Directory of Open Access Journals (Sweden)
Joseph Hakizamungu
2007-10-01
Full Text Available This paper is concerned with nonparametric i.i.d. durations models censored observations and we establish by a simple and unified approach the general structure of a bayesian nonparametric estimator for a survival function S. For Dirichlet prior distributions, we describe completely the structure of the posterior distribution of the survival function. These results are essentially supported by prior and posterior independence properties.
Bootstrap Estimation for Nonparametric Efficiency Estimates
1995-01-01
This paper develops a consistent bootstrap estimation procedure to obtain confidence intervals for nonparametric measures of productive efficiency. Although the methodology is illustrated in terms of technical efficiency measured by output distance functions, the technique can be easily extended to other consistent nonparametric frontier models. Variation in estimated efficiency scores is assumed to result from variation in empirical approximations to the true boundary of the production set. ...
Tahir, Haroon Elrasheid; Xiaobo, Zou; Xiaowei, Huang; Jiyong, Shi; Mariod, Abdalbasit Adam
2016-09-01
Aroma profiles of six honey varieties of different botanical origins were investigated using colorimetric sensor array, gas chromatography-mass spectrometry (GC-MS) and descriptive sensory analysis. Fifty-eight aroma compounds were identified, including 2 norisoprenoids, 5 hydrocarbons, 4 terpenes, 6 phenols, 7 ketones, 9 acids, 12 aldehydes and 13 alcohols. Twenty abundant or active compounds were chosen as key compounds to characterize honey aroma. Discrimination of the honeys was subsequently implemented using multivariate analysis, including hierarchical clustering analysis (HCA) and principal component analysis (PCA). Honeys of the same botanical origin were grouped together in the PCA score plot and HCA dendrogram. SPME-GC/MS and colorimetric sensor array were able to discriminate the honeys effectively with the advantages of being rapid, simple and low-cost. Moreover, partial least squares regression (PLSR) was applied to indicate the relationship between sensory descriptors and aroma compounds.
Free choice profiling sensory analysis to discriminate coffees
Directory of Open Access Journals (Sweden)
Cíntia Sorane Good Kitzberger
2016-12-01
Full Text Available Sensory attributes were evaluated from Arabica coffee genotypes growing in two places in Brazil, Mandaguari and Londrina. Post-harvest and roasted process was standardized. Free choice profiling sensory analysis was apply to investigate the influence of genetic variability and local cultivation (Londrina and Mandaguari, Brazil on the sensory characteristics of coffee genotypes. A sensory panel evaluated coffees from Mandaguari in two groups: one (Sarchimor derived, IPR100, IPR102, IPR105, IPR106 characterized by transparency, coffee colour, green aroma, taste (green, bitter, fermented, astringent and a watery texture, another group (Catuaí, Sarchimor derived, IPR101, IPR103 was characterized by coffee colour, brightness, aroma (coffee, acid, sweet, chocolate, acidity, bitterness, burnt aroma, sweetness and full-bodied. Coffees from Londrina presented brightness, coffee colour, sweet, green, burnt aroma, astringent, bitter, fermented, green taste; and watery texture (Catuaí, IPR97, IPR98, IPR100. Another group (Sarchimor derived, IPR101, IPR102, IPR103, IPR105, IPR106 were associated with turbidity, aroma (green, coffee, sweet, acidity, astringency, bitterness, sweetness and full-bodied. Catuaí, Iapar59, IPR99, IPR101, IPR103 and IPR108 exhibited positive attributes when grown in either locale. Edaphoclimatic conditions play a major role in the sensory profiles of coffee.
Over-excavation forecast of underground opening by using Bayes discriminant analysis method
Institute of Scientific and Technical Information of China (English)
GONG Feng-qiang; LI Xi-bing; ZHANG Wei
2008-01-01
A method to forecast the over-excavation of underground opening by using the Bayes discriminant analysis(BDA) theory was presented. The Bayes discriminant analysis theory was introduced. Based on an engineering example, the factors influencing the over-excavation of underground opening were taken into account to build a forecast BDA model, and the prior information about over-excavation of underground opening was also taken into consideration. Five parameters influencing the over-excavation of opening, including 2 groups of joints, 1 group of layer surface, extension and space between structure faces were selected as geometric parameters. Engineering data in an underground opening were used as the training samples. The cross-validation method was introduced to verify the stability of BDA model and the ratio of mistake-discrimination was equal to zero after the BDA model was trained. Data in an underground engineering were used to test the discriminant ability of BDA model. The results show that five forecast results are identical with the actual situation and BDA can be used in practical engineering.
Institute of Scientific and Technical Information of China (English)
WANG Qian-Qian; LIU Kai; ZHAO Hua
2012-01-01
A method to distinguish explosives from plastics using laser-induced breakdown spectroscopy is discussed. A model for classification with cross-validation theory is built based on the partial least-square discriminant analysis method. Seven types of plastics and one explosive are used as samples to test the model. The experimental results demonstrate that laser-induced breakdown spectroscopy has the capacity to discriminate explosives from plastics combined with chemometrics methods. The results could be useful for prospective research of laser-induced breakdown spectroscopy on the differentiation of explosives and other materials.%A method to distinguish explosives from plastics using laser-induced breakdown spectroscopy is discussed.A model for classification with cross-validation theory is built based on the partial least-square discriminant analysis method.Seven types of plastics and one explosive are used as samples to test the model.The experimental results demonstrate that laser-induced breakdown spectroscopy has the capacity to discriminate explosives from plastics combined with chemometrics methods.The results could be useful for prospective research of laser-induced breakdown spectroscopy on the differentiation of explosives and other materials.
Comparative analysis of cyanobacterial superoxide dismutases to discriminate canonical forms
Directory of Open Access Journals (Sweden)
Prabaharan Dharmar
2007-11-01
Full Text Available Abstract Background Superoxide dismutases (SOD are ubiquitous metalloenzymes that catalyze the disproportion of superoxide to peroxide and molecular oxygen through alternate oxidation and reduction of their metal ions. In general, SODs are classified into four forms by their catalytic metals namely; FeSOD, MnSOD, Cu/ZnSOD and NiSOD. In addition, a cambialistic form that uses Fe/Mn in its active site also exists. Cyanobacteria, the oxygen evolving photosynthetic prokaryotes, produce reactive oxygen species that can damage cellular components leading to cell death. Thus, the co-evolution of an antioxidant system was necessary for the survival of photosynthetic organisms with SOD as the initial enzyme evolved to alleviate the toxic effect. Cyanobacteria represent the first oxygenic photoautotrophs and their SOD sequences available in the databases lack clear annotation. Hence, the present study focuses on structure and sequence pattern of subsets of cyanobacterial superoxide dismutases. Result The sequence conservation and structural analysis of Fe (Thermosynechococcus elongatus BP1 and MnSOD (Anabaena sp. PCC7120 reveal the sharing of N and C terminal domains. At the C terminal domain, the metal binding motif in cyanoprokaryotes is DVWEHAYY while it is D-X-[WF]-E-H-[STA]-[FY]-[FY] in other pro- and eukaryotes. The cyanobacterial FeSOD differs from MnSOD at least in three ways viz. (i FeSOD has a metal specific signature F184X3A188Q189.......T280......F/Y303 while, in Mn it is R184X3G188G189......G280......W303, (ii aspartate ligand forms a hydrogen bond from the active site with the outer sphere residue of W243 in Fe where as it is Q262 in MnSOD; and (iii two unique lysine residues at positions 201 and 255 with a photosynthetic role, found only in FeSOD. Further, most of the cyanobacterial Mn metalloforms have a specific transmembrane hydrophobic pocket that distinguishes FeSOD from Mn isoform. Cyanobacterial Cu/ZnSOD has a copper domain and two
Nonparametric estimation for hazard rate monotonously decreasing system
Institute of Scientific and Technical Information of China (English)
Han Fengyan; Li Weisong
2005-01-01
Estimation of density and hazard rate is very important to the reliability analysis of a system. In order to estimate the density and hazard rate of a hazard rate monotonously decreasing system, a new nonparametric estimator is put forward. The estimator is based on the kernel function method and optimum algorithm. Numerical experiment shows that the method is accurate enough and can be used in many cases.
Non-parametric versus parametric methods in environmental sciences
Directory of Open Access Journals (Sweden)
Muhammad Riaz
2016-01-01
Full Text Available This current report intends to highlight the importance of considering background assumptions required for the analysis of real datasets in different disciplines. We will provide comparative discussion of parametric methods (that depends on distributional assumptions (like normality relative to non-parametric methods (that are free from many distributional assumptions. We have chosen a real dataset from environmental sciences (one of the application areas. The findings may be extended to the other disciplines following the same spirit.
Discriminative Nonlinear Analysis Operator Learning: When Cosparse Model Meets Image Classification
Wen, Zaidao; Hou, Biao; Jiao, Licheng
2017-07-01
Linear synthesis model based dictionary learning framework has achieved remarkable performances in image classification in the last decade. Behaved as a generative feature model, it however suffers from some intrinsic deficiencies. In this paper, we propose a novel parametric nonlinear analysis cosparse model (NACM) with which a unique feature vector will be much more efficiently extracted. Additionally, we derive a deep insight to demonstrate that NACM is capable of simultaneously learning the task adapted feature transformation and regularization to encode our preferences, domain prior knowledge and task oriented supervised information into the features. The proposed NACM is devoted to the classification task as a discriminative feature model and yield a novel discriminative nonlinear analysis operator learning framework (DNAOL). The theoretical analysis and experimental performances clearly demonstrate that DNAOL will not only achieve the better or at least competitive classification accuracies than the state-of-the-art algorithms but it can also dramatically reduce the time complexities in both training and testing phases.
Xie, Xufei; Zhang, Xing; Yuan, Xi; Chen, Jinxiang; Li, Xiangqing; Zhang, Guohui; Fan, Tieshuan; Yuan, Guoliang; Yang, Jinwei; Yang, Qingwei
2012-09-01
Digital discrimination of neutron and gamma-ray events in an organic scintillator has been investigated by moment analysis. Signals induced by an americium-beryllium (Am/Be) isotropic neutron source in a stilbene crystal detector have been sampled with a flash analogue-to-digital converter (ADC) of 1 GSamples/s sampling rate and 10-bit vertical resolution. Neutrons and gamma-rays have been successfully discriminated with a threshold corresponding to gamma-ray energy about 217 keV. Moment analysis has also been verified against the results assessed by a time-of-flight (TOF) measurement. It is shown that the classification of neutrons and gamma-rays afforded by moment analysis is consistent with that achieved by digital TOF measurement. This method has been applied to analyze the data acquired from the stilbene crystal detector in mixed radiation field of the HL-2A tokamak deuterium plasma discharges and the results are described.
Djozan, Djavanshir; Baheri, Tahmineh; Karimian, Ghader; Shahidi, Masomeh
2008-08-06
This article aims to provide a new and fast method for differentiation of inks on a questioned document. The data acquisition was carried out by designing specific image analysis software for evaluating thin layer chromatograms (TLC-IA). The ink spot was extracted from the document using methanol and separated by TLC using plastic sheet silica gel 60 without fluorescent indicator, and a mixture of ethyl acetate, ethanol, and water (70:35:30, v/v/v) as mobile phase. To discriminate between different pen inks, new software was designed on the basis of intensity profile of red, green, and blue (RGB) characteristic. In practice, after development of chromatogram, the chromatograms were scanned by ordinary office scanner, intensity profiles of RGB characteristics on the development straight of each sample were produced and compared with the mentioned software. RGB profiles of ballpoint inks from various manufacturers showed that the patterns in most cases were distinctly different from each other. This new method allowed discriminating among different pen inks with a high reliability and the discriminating power of 92.8%. Blue ballpoint pen inks of 41 different samples available on the local market were successfully analyzed and discriminated.
McReynolds, Naomi; Auñón Garcia, Juan M.; Guengerich, Zoe; Smith, Terry K.; Dholakia, Kishan
2017-02-01
We present an optical spectroscopic technique, making use of both Raman signals and fluorescence spectroscopy, for the identification of five brands of commercially available extra-virgin olive-oil (EVOO). We demonstrate our technique on both a `bulk-optics' free-space system and a compact device. Using the compact device, which is capable of recording both Raman and fluorescence signals, we achieved an average sensitivity and specificity of 98.4% and 99.6% for discrimination, respectively. Our approach demonstrates that both Raman and fluorescence spectroscopy can be used for portable discrimination of EVOOs which obviates the need to use centralised laboratories and opens up the prospect of in-field testing. This technique may enable detection of EVOO that has undergone counterfeiting or adulteration. One of the main challenges facing Raman spectroscopy for use in quality control of EVOOs is that the oxidation of EVOO, which naturally occurs due to aging, causes shifts in Raman spectra with time, which implies regular retraining would be necessary. We present a potential method of analysis to minimize the effect that aging has on discrimination efficiency; we show that by discarding the first principal component, which contains information on the variations due to oxidation, we can improve discrimination efficiency thus improving the robustness of our technique.
Multilinear Biased Discriminant Analysis: A Novel Method for Facial Action Unit Representation
Khademi, Mahmoud; Manzuri-Shalmani, Mohammad T
2010-01-01
In this paper a novel efficient method for representation of facial action units by encoding an image sequence as a fourth-order tensor is presented. The multilinear tensor-based extension of the biased discriminant analysis (BDA) algorithm, called multilinear biased discriminant analysis (MBDA), is first proposed. Then, we apply the MBDA and two-dimensional BDA (2DBDA) algorithms, as the dimensionality reduction techniques, to Gabor representations and the geometric features of the input image sequence respectively. The proposed scheme can deal with the asymmetry between positive and negative samples as well as curse of dimensionality dilemma. Extensive experiments on Cohn-Kanade database show the superiority of the proposed method for representation of the subtle changes and the temporal information involved in formation of the facial expressions. As an accurate tool, this representation can be applied to many areas such as recognition of spontaneous and deliberate facial expressions, multi modal/media huma...
Z-score linear discriminant analysis for EEG based brain-computer interfaces.
Directory of Open Access Journals (Sweden)
Rui Zhang
Full Text Available Linear discriminant analysis (LDA is one of the most popular classification algorithms for brain-computer interfaces (BCI. LDA assumes Gaussian distribution of the data, with equal covariance matrices for the concerned classes, however, the assumption is not usually held in actual BCI applications, where the heteroscedastic class distributions are usually observed. This paper proposes an enhanced version of LDA, namely z-score linear discriminant analysis (Z-LDA, which introduces a new decision boundary definition strategy to handle with the heteroscedastic class distributions. Z-LDA defines decision boundary through z-score utilizing both mean and standard deviation information of the projected data, which can adaptively adjust the decision boundary to fit for heteroscedastic distribution situation. Results derived from both simulation dataset and two actual BCI datasets consistently show that Z-LDA achieves significantly higher average classification accuracies than conventional LDA, indicating the superiority of the new proposed decision boundary definition strategy.
Khademi, Mahmoud; Manzuri, Mohammad T
2010-01-01
In this paper, a novel representation for facial expressions in two-dimensional image sequences is presented. We apply a variation of two-dimensional heteroscedastic linear discriminant analysis (2DHLDA) algorithm, as an efficient dimensionality reduction technique, to Gabor representation of the input sequence. 2DHLDA is an extension of the two-dimensional linear discriminant analysis (2DLDA) approach and removes the equal within-class covariance. By applying 2DHLDA in two directions, we eliminate the correlations between both image columns and image rows. Then, we perform a one-dimensional LDA on the new features. This combined method can alleviate the small sample size problem and instability encountered by HLDA. The proposed method is robust to illumination changes and can represent temporal information as well as subtle changes in facial muscles properly. Also, employing both geometric and appearance features and using support vector machines (SVMs) classifier, we provide experiments on Cohn-Kanade datab...
Shao, Zhenfeng; Zhang, Lei
2014-09-01
This paper presents a novel sparse dimensionality reduction method of hyperspectral image based on semi-supervised local Fisher discriminant analysis (SELF). The proposed method is designed to be especially effective for dealing with the out-of-sample extrapolation to realize advantageous complementarities between SELF and sparsity preserving projections (SPP). Compared to SELF and SPP, the method proposed herein offers highly discriminative ability and produces an explicit nonlinear feature mapping for the out-of-sample extrapolation. This is due to the fact that the proposed method can get an explicit feature mapping for dimensionality reduction and improve the classification performance of classifiers by performing dimensionality reduction. Experimental analysis on the sparsity and efficacy of low dimensional outputs shows that, sparse dimensionality reduction based on SELF can yield good classification results and interpretability in the field of hyperspectral remote sensing.
Z-score linear discriminant analysis for EEG based brain-computer interfaces.
Zhang, Rui; Xu, Peng; Guo, Lanjin; Zhang, Yangsong; Li, Peiyang; Yao, Dezhong
2013-01-01
Linear discriminant analysis (LDA) is one of the most popular classification algorithms for brain-computer interfaces (BCI). LDA assumes Gaussian distribution of the data, with equal covariance matrices for the concerned classes, however, the assumption is not usually held in actual BCI applications, where the heteroscedastic class distributions are usually observed. This paper proposes an enhanced version of LDA, namely z-score linear discriminant analysis (Z-LDA), which introduces a new decision boundary definition strategy to handle with the heteroscedastic class distributions. Z-LDA defines decision boundary through z-score utilizing both mean and standard deviation information of the projected data, which can adaptively adjust the decision boundary to fit for heteroscedastic distribution situation. Results derived from both simulation dataset and two actual BCI datasets consistently show that Z-LDA achieves significantly higher average classification accuracies than conventional LDA, indicating the superiority of the new proposed decision boundary definition strategy.
Using discriminant analysis to detect intrusions in external communication for self-driving vehicles
Directory of Open Access Journals (Sweden)
Khattab M.Ali Alheeti
2017-08-01
Full Text Available Security systems are a necessity for the deployment of smart vehicles in our society. Security in vehicular ad hoc networks is crucial to the reliable exchange of information and control data. In this paper, we propose an intelligent Intrusion Detection System (IDS to protect the external communication of self-driving and semi self-driving vehicles. This technology has the ability to detect Denial of Service (DoS and black hole attacks on vehicular ad hoc networks (VANETs. The advantage of the proposed IDS over existing security systems is that it detects attacks before they causes significant damage. The intrusion prediction technique is based on Linear Discriminant Analysis (LDA and Quadratic Discriminant Analysis (QDA which are used to predict attacks based on observed vehicle behavior. We perform simulations using Network Simulator 2 to demonstrate that the IDS achieves a low rate of false alarms and high accuracy in detection.
Fisher's Discriminant and Relevant Component Analysis for static facial expression classification
Sorci, Matteo; Antonini, Gianluca; Thiran, Jean-Philippe
2007-01-01
This paper addresses the issue of automatic classification of the six universal emotional categories (joy, surprise, fear, anger, disgust, sadness) in the case of static images. Appearance parameters are extracted by an active appearance model(AAM) representing the input for the classification step. We show how Relevant Component Analysis (RCA) in combination with Fisher's Linear Discriminant (FLD) provides a good "plug-\\&-play" classifier in the context of facial expression recognitio...
Origin of uncontrolled water emissions in Alicante: use of discriminant analysis
Chinchón Payá, Servando; Blas Bravo, Isabel de; Nueda Roldán, María José; García Andreu, Francisco
2010-01-01
A model has been developed to predict the origin of uncontrolled water flows. For this purpose, we analysed a total of 52 elements using ICP techniques on 72 water samples in all: 40 of them from leaks in the drinking water supply network and the remaining 32 from groundwater outcrops. The study focused on the cases registered within the Alicante town limits (Spain). The use of multivariate statistical classification tools such as discriminant analysis made it possible not only to reduce the ...
2D Face Recognition System Based on Selected Gabor Filters and Linear Discriminant Analysis LDA
Hafez, Samir F.; Selim, Mazen M.; Hala H. Zayed
2015-01-01
We present a new approach for face recognition system. The method is based on 2D face image features using subset of non-correlated and Orthogonal Gabor Filters instead of using the whole Gabor Filter Bank, then compressing the output feature vector using Linear Discriminant Analysis (LDA). The face image has been enhanced using multi stage image processing technique to normalize it and compensate for illumination variation. Experimental results show that the proposed system is effective for ...
Why preferring parametric forecasting to nonparametric methods?
Jabot, Franck
2015-05-07
A recent series of papers by Charles T. Perretti and collaborators have shown that nonparametric forecasting methods can outperform parametric methods in noisy nonlinear systems. Such a situation can arise because of two main reasons: the instability of parametric inference procedures in chaotic systems which can lead to biased parameter estimates, and the discrepancy between the real system dynamics and the modeled one, a problem that Perretti and collaborators call "the true model myth". Should ecologists go on using the demanding parametric machinery when trying to forecast the dynamics of complex ecosystems? Or should they rely on the elegant nonparametric approach that appears so promising? It will be here argued that ecological forecasting based on parametric models presents two key comparative advantages over nonparametric approaches. First, the likelihood of parametric forecasting failure can be diagnosed thanks to simple Bayesian model checking procedures. Second, when parametric forecasting is diagnosed to be reliable, forecasting uncertainty can be estimated on virtual data generated with the fitted to data parametric model. In contrast, nonparametric techniques provide forecasts with unknown reliability. This argumentation is illustrated with the simple theta-logistic model that was previously used by Perretti and collaborators to make their point. It should convince ecologists to stick to standard parametric approaches, until methods have been developed to assess the reliability of nonparametric forecasting. Copyright © 2015 Elsevier Ltd. All rights reserved.
Binary Classifier Calibration Using a Bayesian Non-Parametric Approach.
Naeini, Mahdi Pakdaman; Cooper, Gregory F; Hauskrecht, Milos
Learning probabilistic predictive models that are well calibrated is critical for many prediction and decision-making tasks in Data mining. This paper presents two new non-parametric methods for calibrating outputs of binary classification models: a method based on the Bayes optimal selection and a method based on the Bayesian model averaging. The advantage of these methods is that they are independent of the algorithm used to learn a predictive model, and they can be applied in a post-processing step, after the model is learned. This makes them applicable to a wide variety of machine learning models and methods. These calibration methods, as well as other methods, are tested on a variety of datasets in terms of both discrimination and calibration performance. The results show the methods either outperform or are comparable in performance to the state-of-the-art calibration methods.
Yu, Jianbo
2016-11-01
The vibration signals of faulty machine are generally non-stationary and nonlinear under those complicated working conditions. Thus, it is a big challenge to extract and select the effective features from vibration signals for machinery fault diagnosis. This paper proposes a new manifold learning algorithm, joint global and local/nonlocal discriminant analysis (GLNDA), which aims to extract effective intrinsic geometrical information from the given vibration data. Comparisons with other regular methods, principal component analysis (PCA), local preserving projection (LPP), linear discriminant analysis (LDA) and local LDA (LLDA), illustrate the superiority of GLNDA in machinery fault diagnosis. Based on the extracted information by GLNDA, a GLNDA-based Fisher discriminant rule (FDR) is put forward and applied to machinery fault diagnosis without additional recognizer construction procedure. By importing Bagging into GLNDA score-based feature selection and FDR, a novel manifold ensemble method (selective GLNDA ensemble, SE-GLNDA) is investigated for machinery fault diagnosis. The motivation for developing ensemble of manifold learning components is that it can achieve higher accuracy and applicability than single component in machinery fault diagnosis. The effectiveness of the SE-GLNDA-based fault diagnosis method has been verified by experimental results from bearing full life testers.
How Many Genes Are Needed for a Discriminant Microarray Data Analysis ?
Li, W; Li, Wentian; Yang, Yaning
2001-01-01
The analysis of the leukemia data from Whitehead/MIT group is a discriminant analysis (also called a supervised learning). Among thousands of genes whose expression levels are measured, not all are needed for discriminant analysis: a gene may either not contribute to the separation of two types of tissues/cancers, or it may be redundant because it is highly correlated with other genes. There are two theoretical frameworks in which variable selection (or gene selection in our case) can be addressed. The first is model selection, and the second is model averaging. We have carried out model selection using Akaike information criterion and Bayesian information criterion with logistic regression (discrimination, prediction, or classification) to determine the number of genes that provide the best model. These model selection criteria set upper limits of 22-25 and 12-13 genes for this data set with 38 samples, and the best model consists of only one (no.4847, zyxin) or two genes. We have also carried out model aver...
Stewart, C M; Newlands, S D; Perachio, A A
2004-12-01
Rapid and accurate discrimination of single units from extracellular recordings is a fundamental process for the analysis and interpretation of electrophysiological recordings. We present an algorithm that performs detection, characterization, discrimination, and analysis of action potentials from extracellular recording sessions. The program was entirely written in LabVIEW (National Instruments), and requires no external hardware devices or a priori information about action potential shapes. Waveform events are detected by scanning the digital record for voltages that exceed a user-adjustable trigger. Detected events are characterized to determine nine different time and voltage levels for each event. Various algebraic combinations of these waveform features are used as axis choices for 2-D Cartesian plots of events. The user selects axis choices that generate distinct clusters. Multiple clusters may be defined as action potentials by manually generating boundaries of arbitrary shape. Events defined as action potentials are validated by visual inspection of overlain waveforms. Stimulus-response relationships may be identified by selecting any recorded channel for comparison to continuous and average cycle histograms of binned unit data. The algorithm includes novel aspects of feature analysis and acquisition, including higher acquisition rates for electrophysiological data compared to other channels. The program confirms that electrophysiological data may be discriminated with high-speed and efficiency using algebraic combinations of waveform features derived from high-speed digital records.
Institute of Scientific and Technical Information of China (English)
LIU Yong-jian; DUAN Chuan; TIAN Meng-liang; HU Er-liang; HUANG Yu-bi
2010-01-01
Analysis of multi-environment trials (METs) of crops for the evaluation and recommendation of varieties is an important issue in plant breeding research. Evaluating on the both stability of performance and high yield is essential in MET analyses. The objective of the present investigation was to compare 11 nonparametric stability statistics and apply nonparametric tests for genotype-by-environment interaction (GEI) to 14 maize (Zea mays L.) genotypes grown at 25 locations in southwestern China during 2005. Results of nonparametric tests of GEI and a combined ANOVA across locations showed that both crossover and noncrossover GEI, and genotypes varied highly significantly for yield. The results of principal component analysis, correlation analysis of nonparametric statistics, and yield indicated the nonparametric statistics grouped as four distinct classes that corresponded to different agronomic and biological concepts of stability.Furthermore, high values of TOP and low values of rank-sum were associated with high mean yield, but the other nonparametric statistics were not positively correlated with mean yield. Therefore, only rank-sum and TOP methods would be useful for simultaneously selection for high yield and stability. These two statistics recommended JY686 and HX 168 as desirable and ND 108, CM 12, CN36, and NK6661 as undesirable genotypes.
Kiso, Atsushi; Taniguchi, Yu; Seki, Hirokazu
This paper describes the estimation of the optimal measurement position by discriminant analysis based on Wilks' lambda for myoelectric hand control. In previous studies, for motion discrimination, the myoelectric signals were measured at the same positions. However, the optimal measurement positions of the myoelectric signals for motion discrimination differ depending on the remaining muscles of amputees. Therefore, the purpose of this study is to estimate the optimal and fewer measurement positions for precise motion discrimination of a human forearm. This study proposes a method for estimating the optimal measurement positions by discriminant analysis based on Wilks' lambda, using the myoelectric signals measured at multiple positions. The results of some experiments on the myoelectric hand simulator show the effectiveness of the proposed optimal measurement position estimation method.
Generative Temporal Modelling of Neuroimaging - Decomposition and Nonparametric Testing
DEFF Research Database (Denmark)
Hald, Ditte Høvenhoff
The goal of this thesis is to explore two improvements for functional magnetic resonance imaging (fMRI) analysis; namely our proposed decomposition method and an extension to the non-parametric testing framework. Analysis of fMRI allows researchers to investigate the functional processes...... of the brain, and provides insight into neuronal coupling during mental processes or tasks. The decomposition method is a Gaussian process-based independent components analysis (GPICA), which incorporates a temporal dependency in the sources. A hierarchical model specification is used, featuring both...
Directory of Open Access Journals (Sweden)
Balloux François
2010-10-01
Full Text Available Abstract Background The dramatic progress in sequencing technologies offers unprecedented prospects for deciphering the organization of natural populations in space and time. However, the size of the datasets generated also poses some daunting challenges. In particular, Bayesian clustering algorithms based on pre-defined population genetics models such as the STRUCTURE or BAPS software may not be able to cope with this unprecedented amount of data. Thus, there is a need for less computer-intensive approaches. Multivariate analyses seem particularly appealing as they are specifically devoted to extracting information from large datasets. Unfortunately, currently available multivariate methods still lack some essential features needed to study the genetic structure of natural populations. Results We introduce the Discriminant Analysis of Principal Components (DAPC, a multivariate method designed to identify and describe clusters of genetically related individuals. When group priors are lacking, DAPC uses sequential K-means and model selection to infer genetic clusters. Our approach allows extracting rich information from genetic data, providing assignment of individuals to groups, a visual assessment of between-population differentiation, and contribution of individual alleles to population structuring. We evaluate the performance of our method using simulated data, which were also analyzed using STRUCTURE as a benchmark. Additionally, we illustrate the method by analyzing microsatellite polymorphism in worldwide human populations and hemagglutinin gene sequence variation in seasonal influenza. Conclusions Analysis of simulated data revealed that our approach performs generally better than STRUCTURE at characterizing population subdivision. The tools implemented in DAPC for the identification of clusters and graphical representation of between-group structures allow to unravel complex population structures. Our approach is also faster than
Recent Advances and Trends in Nonparametric Statistics
Akritas, MG
2003-01-01
The advent of high-speed, affordable computers in the last two decades has given a new boost to the nonparametric way of thinking. Classical nonparametric procedures, such as function smoothing, suddenly lost their abstract flavour as they became practically implementable. In addition, many previously unthinkable possibilities became mainstream; prime examples include the bootstrap and resampling methods, wavelets and nonlinear smoothers, graphical methods, data mining, bioinformatics, as well as the more recent algorithmic approaches such as bagging and boosting. This volume is a collection o
Correlated Non-Parametric Latent Feature Models
Doshi-Velez, Finale
2012-01-01
We are often interested in explaining data through a set of hidden factors or features. When the number of hidden features is unknown, the Indian Buffet Process (IBP) is a nonparametric latent feature model that does not bound the number of active features in dataset. However, the IBP assumes that all latent features are uncorrelated, making it inadequate for many realworld problems. We introduce a framework for correlated nonparametric feature models, generalising the IBP. We use this framework to generate several specific models and demonstrate applications on realworld datasets.
Nonparametric correlation models for portfolio allocation
DEFF Research Database (Denmark)
Aslanidis, Nektarios; Casas, Isabel
2013-01-01
This article proposes time-varying nonparametric and semiparametric estimators of the conditional cross-correlation matrix in the context of portfolio allocation. Simulations results show that the nonparametric and semiparametric models are best in DGPs with substantial variability or structural...... breaks in correlations. Only when correlations are constant does the parametric DCC model deliver the best outcome. The methodologies are illustrated by evaluating two interesting portfolios. The first portfolio consists of the equity sector SPDRs and the S&P 500, while the second one contains major...
Principal component analysis for neural electron/jet discrimination in highly segmented calorimeters
Vassali, M R
2001-01-01
A neural electron/jet discriminator based on calorimetry is developed for the second-level trigger system of the ATLAS detector. As preprocessing of the calorimeter information, a principal component analysis is performed on each segment of the two sections (electromagnetic and hadronic) of the calorimeter system, in order to reduce significantly the dimension of the input data space and fully explore the detailed energy deposition profile, which is provided by the highly-segmented calorimeter system. It is shown that projecting calorimeter data onto 33 segmented principal components, the discrimination efficiency of the neural classifier reaches 98.9% for electrons (with only 1% of false alarm probability). Furthermore, restricting data projection onto only 9 components, an electron efficiency of 99.1% is achieved (with 3% of false alarm), which confirms that a fast triggering system may be designed using few components. (6 refs).
UV-vis absorption spectroscopy and multivariate analysis as a method to discriminate tequila
Barbosa-García, O.; Ramos-Ortíz, G.; Maldonado, J. L.; Pichardo-Molina, J. L.; Meneses-Nava, M. A.; Landgrave, J. E. A.; Cervantes-Martínez, J.
2007-01-01
Based on the UV-vis absorption spectra of commercially bottled tequilas, and with the aid of multivariate analysis, it is proved that different brands of white tequila can be identified from such spectra, and that 100% agave and mixed tequilas can be discriminated as well. Our study was done with 60 tequilas, 58 of them purchased at liquor stores in various Mexican cities, and two directly acquired from a distillery. All the tequilas were of the "white" type, that is, no aged spirits were considered. For the purposes of discrimination and quality control of tequilas, the spectroscopic method that we present here offers an attractive alternative to the traditional methods, like gas chromatography, which is expensive and time-consuming.
Nonparametric Single-Trial EEG Feature Extraction and Classification of Driver's Cognitive Responses
Directory of Open Access Journals (Sweden)
I-Fang Chung
2008-05-01
Full Text Available We proposed an electroencephalographic (EEG signal analysis approach to investigate the driver's cognitive response to traffic-light experiments in a virtual-reality-(VR- based simulated driving environment. EEG signals are digitally sampled and then transformed by three different feature extraction methods including nonparametric weighted feature extraction (NWFE, principal component analysis (PCA, and linear discriminant analysis (LDA, which were also used to reduce the feature dimension and project the measured EEG signals to a feature space spanned by their eigenvectors. After that, the mapped data could be classified with fewer features and their classification results were compared by utilizing two different classifiers including k nearest neighbor classification (KNNC and naive bayes classifier (NBC. Experimental data were collected from 6 subjects and the results show that NWFE+NBC gives the best classification accuracy ranging from 71%Ã¢ÂˆÂ¼77%, which is over 10%Ã¢ÂˆÂ¼24% higher than LDA+KNN1. It also demonstrates the feasibility of detecting and analyzing single-trial EEG signals that represent operators' cognitive states and responses to task events.
Nonparametric Single-Trial EEG Feature Extraction and Classification of Driver's Cognitive Responses
Lin, Chin-Teng; Lin, Ken-Li; Ko, Li-Wei; Liang, Sheng-Fu; Kuo, Bor-Chen; Chung, I.-Fang
2008-12-01
We proposed an electroencephalographic (EEG) signal analysis approach to investigate the driver's cognitive response to traffic-light experiments in a virtual-reality-(VR-) based simulated driving environment. EEG signals are digitally sampled and then transformed by three different feature extraction methods including nonparametric weighted feature extraction (NWFE), principal component analysis (PCA), and linear discriminant analysis (LDA), which were also used to reduce the feature dimension and project the measured EEG signals to a feature space spanned by their eigenvectors. After that, the mapped data could be classified with fewer features and their classification results were compared by utilizing two different classifiers including [InlineEquation not available: see fulltext.] nearest neighbor classification (KNNC) and naive bayes classifier (NBC). Experimental data were collected from 6 subjects and the results show that NWFE+NBC gives the best classification accuracy ranging from [InlineEquation not available: see fulltext.], which is over [InlineEquation not available: see fulltext.] higher than LDA+KNN1. It also demonstrates the feasibility of detecting and analyzing single-trial EEG signals that represent operators' cognitive states and responses to task events.
Wang, Xiaojing; Zou, Zhihong; Zou, Hui
2013-10-01
Yongding New River has been polluted by polycyclic aromatic hydrocarbons (PAHs) which are carcinogenic and mutagenic. In three periods (the abundant water period, mean water period, dry water period), ten sites (totally 30 samples) in Yongding New River were clustered into four categories by hierarchical cluster analysis (hierarchical CA). In the same cluster, the samples had the same approximate contamination situation. In order to eliminate the dimensional differences, the data in each sample, containing 16 kinds of PAHs, were standardized with normal standardization and maximum difference standardization. According to the results of the cubic clustering criterion, pseudo F, and pseudo t (2) (PST2), the proper number of clustering for the 30 samples is 4. Before conducting hierarchical CA and K-means cluster analysis on the samples, we used principal component analysis to obtain another group data set. This data set was composed of the principal component scores which are uncorrelated variables. Hierarchical CA and K-means cluster analysis were used to classify the two data sets into four categories. With the classification results of hierarchical CA and K-means cluster analysis, discriminant analysis is applied to determine which method was better for normalization of the original data and which one was proper to cluster the samples and establish discriminant functions so that a new sample can be grouped into the right categories.
Theoretical analysis and experimental research on port/starboard discrimination in towed line array
Institute of Scientific and Technical Information of China (English)
DU Xuanmin; ZHU Daizhu; ZHAO Rongrong; YAO Lan
2001-01-01
The theoretical analysis and experimental research on Port/Starboard (P/S) discrimination in towed line array are proposed. Two methods resolving the P/S ambiguity with hydrophone triplets are introduced. By processing experimental data, the theoretical analysis is verified. The processing algorithm is extended to broadband signal. The research results show that the method based on optimum beamforming with triplets can be used to remove the port/starboard ambiguity. Also because of the simplicity of the method, it is expected to be implemented in practical towed line array sonar.
Uddin, Md; Lee, J J; Kim, T S
2008-01-01
In proactive computing, human activity recognition from image sequences is an active research area. This paper presents a novel approach of human activity recognition based on Linear Discriminant Analysis (LDA) of Independent Component (IC) features from shape information. With extracted features, Hidden Markov Model (HMM) is applied for training and recognition. The recognition performance using LDA of IC features has been compared to other approaches including Principle Component Analysis (PCA), LDA of PC, and ICA. The preliminary results show much improved performance in the recognition rate with our proposed method.
Luthria, Devanand L; Lin, Long-Ze; Robbins, Rebecca J; Finley, John W; Banuelos, Gary S; Harnly, James M
2008-11-12
Metabolite fingerprints, obtained with direct injection mass spectrometry (MS) with both positive and negative ionization, were used with analysis of variance-principal components analysis (ANOVA-PCA) to discriminate between cultivars and growing treatments of broccoli. The sample set consisted of two cultivars of broccoli, Majestic and Legacy, the first grown with four different levels of Se and the second grown organically and conventionally with two rates of irrigation. Chemical composition differences in the two cultivars and seven treatments produced patterns that were visually and statistically distinguishable using ANOVA-PCA. PCA loadings allowed identification of the molecular and fragment ions that provided the most significant chemical differences. A standardized profiling method for phenolic compounds showed that important discriminating ions were not phenolic compounds. The elution times of the discriminating ions and previous results suggest that they were common sugars and organic acids. ANOVA calculations of the positive and negative ionization MS fingerprints showed that 33% of the variance came from the cultivar, 59% from the growing treatment, and 8% from analytical uncertainty. Although the positive and negative ionization fingerprints differed significantly, there was no difference in the distribution of variance. High variance of individual masses with cultivars or growing treatment was correlated with high PCA loadings. The ANOVA data suggest that only variables with high variance for analytical uncertainty should be deleted. All other variables represent discriminating masses that allow separation of the samples with respect to cultivar and treatment.
Ni, Yongnian; Lai, Yanhua; Kokot, Serge
2012-07-01
An analytical method for the classification of complex real-world samples was researched and developed with the use of excitation-emission fluorescence matrix (EEFM) spectroscopy, using the medicinal herbs, Rhizoma corydalis decumbentis (RCD) and Rhizoma corydalis (RC) as example samples. The data set was obtained from various authentic RCD-A and RC-A, adulterated AD, and commercial RCD-C and RC-C samples. The spectra (range: λ(ex) = 215∼395 nm and λ(em) = 290∼560 nm), arranged in two- and three-way data matrix formats, were processed using principal component analysis (PCA) and parallel factor analysis (PARAFAC) to produce two-dimensional component-by-component plots for qualitative data classification. The RCD-A and RC-A object groups were clearly discriminated, but the AD and the RCD-C as well as RC-C samples were less well separated. PARAFAC analysis produced somewhat better discrimination, and loadings plots revealed the presence of the marker compound Protopine-a strongly fluorescing substance-as well as at least two other unidentified fluorescent components. Classification performance of the common K-nearest neighbors (KNN) and linear discrimination analysis (LDA) methods was relatively poor when compared with that of the back propagation- and radial basis function-artificial neural networks (BP-ANN and RBF-ANN) models on the basis of two- and three-way formatted data. The best results were obtained with the three-way fingerprints and the RBF-ANN model. Subsequently, the quality of the commercial samples (RCD-C and RC-C) was classified on the best optimized RBF-ANN model. Thus, EEFM spectroscopy, which provides three-way measured data, is potentially a powerful analytical technique for the analysis of complex real-world substances provided the classification is performed by the RBF-ANN or similar ANN methods.
Poage, J. L.
1975-01-01
A sequential nonparametric pattern classification procedure is presented. The method presented is an estimated version of the Wald sequential probability ratio test (SPRT). This method utilizes density function estimates, and the density estimate used is discussed, including a proof of convergence in probability of the estimate to the true density function. The classification procedure proposed makes use of the theory of order statistics, and estimates of the probabilities of misclassification are given. The procedure was tested on discriminating between two classes of Gaussian samples and on discriminating between two kinds of electroencephalogram (EEG) responses.
Gender identification of Caspian Terns using external morphology and discriminant function analysis
Ackerman, J.T.; Takekawa, J.Y.; Bluso, J.D.; Yee, J.L.; Eagles-Smith, C. A.
2008-01-01
Caspian Tern (Sterna caspia) plumage characteristics are sexually monochromatic and gender cannot easily be distinguished in the field without extensive behavioral observations. We assessed sexual size dimorphism and developed a discriminant function to assign gender in Caspian Terns based on external morphology. We collected and measured Caspian Terns in San Francisco Bay, California, and confirmed their gender based on necropsy and genetic analysis. Of the eight morphological measurements we examined, only bill depth at the gonys and head plus bill length differed between males and females with males being larger than females. A discriminant function using both bill depth at the gonys and head plus bill length accurately assigned gender of 83% of terns for which gender was known. We improved the accuracy of our discriminant function to 90% by excluding individuals that had less than a 75% posterior probability of correctly being assigned to gender. Caspian Terns showed little sexual size dimorphism in many morphometries, but our results indicate they can be reliably assigned to gender in the field using two morphological measurements.
Terrill, Philip Ian; Wilson, Stephen James; Suresh, Sadasivam; Cooper, David M; Dakin, Carolyn
2010-05-01
Breathing patterns are characteristically different between infant active sleep (AS) and quiet sleep (QS), and statistical quantifications of interbreath interval (IBI) data have previously been used to discriminate between infant sleep states. It has also been identified that breathing patterns are governed by a nonlinear controller. This study aims to investigate whether nonlinear quantifications of infant IBI data are characteristically different between AS and QS, and whether they may be used to discriminate between these infant sleep states. Polysomnograms were obtained from 24 healthy infants at six months of age. Periods of AS and QS were identified, and IBI data extracted. Recurrence quantification analysis (RQA) was applied to each period, and recurrence calculated for a fixed radius in the range of 0-8 in steps of 0.02, and embedding dimensions of 4, 6, 8, and 16. When a threshold classifier was trained, the RQA variable recurrence was able to correctly classify 94.3% of periods in a test dataset. It was concluded that RQA of IBI data is able to accurately discriminate between infant sleep states. This is a promising step toward development of a minimal-channel automatic sleep state classification system.
Shayan, Zahra; Mohammad Gholi Mezerji, Naser; Shayan, Leila; Naseri, Parisa
2015-11-03
Logistic regression (LR) and linear discriminant analysis (LDA) are two popular statistical models for prediction of group membership. Although they are very similar, the LDA makes more assumptions about the data. When categorical and continuous variables used simultaneously, the optimal choice between the two models is questionable. In most studies, classification error (CE) is used to discriminate between subjects in several groups, but this index is not suitable to predict the accuracy of the outcome. The present study compared LR and LDA models using classification indices. This cross-sectional study selected 243 cancer patients. Sample sets of different sizes (n = 50, 100, 150, 200, 220) were randomly selected and the CE, B, and Q classification indices were calculated by the LR and LDA models. CE revealed the a lack of superiority for one model over the other, but the results showed that LR performed better than LDA for the B and Q indices in all situations. No significant effect for sample size on CE was noted for selection of an optimal model. Assessment of the accuracy of prediction of real data indicated that the B and Q indices are appropriate for selection of an optimal model. The results of this study showed that LR performs better in some cases and LDA in others when based on CE. The CE index is not appropriate for classification, although the B and Q indices performed better and offered more efficient criteria for comparison and discrimination between groups.
Carter, Stephen R
2016-06-01
Background Confirmatory factory analysis (CFA) and structural equation modelling (SEM) are increasingly used in social pharmacy research. One of the key benefits of CFA is that it allows researchers to provide evidence for the validity of internal factor structure of measurement scales. In particular, CFA can be used to provide evidence for the validity of the assertion that a hypothesized multi-dimensional scale discriminates between sub-scales. Aim This manuscript aims to provide guidance for researchers who wish to use CFA to provide evidence for the internal factor structure of measurement scales. Methods The manuscript places discriminant validity in the context of providing overall validity evidence for measurement scales. Four examples from the recent social pharmacy literature are used to critically examine the various methods which are used to establish discriminant validity. Using a hypothetical scenario, the manuscript demonstrates how commonly used output from CFA computer programs can be used to provide evidence for separateness of sub-scales within a multi-dimensional scale. Conclusion The manuscript concludes with recommendations for the conduct and reporting of studies which use CFA to provide evidence of internal factor structure of measurement scales.
Directory of Open Access Journals (Sweden)
Charistos Leonidas
2014-06-01
Full Text Available Honey bees collected from 32 different localities in Greece were studied based on the geometric morphometrics approach using the coordinates of 19 landmarks located at wing vein intersections. Procrustes analysis, principal component analysis, and Canonical variate analysis (CVA detected population variability among the studied samples. According to the Principal component analysis (PCA of pooled data from each locality, the most differentiated populations were the populations from the Aegean island localities Astypalaia, Chios, and Kythira. However, the populations with the most distant according to the canonical variate analysis performed on all measurements were the populations from Heraklion and Chania (both from Crete island. These results can be used as a starting point for the use of geometric morphometrics in the discrimination of honey bee populations in Greece and the establishment of conservation areas for local honey bee populations.
Thirty years of nonparametric item response theory
Molenaar, W.
2001-01-01
Relationships between a mathematical measurement model and its real-world applications are discussed. A distinction is made between large data matrices commonly found in educational measurement and smaller matrices found in attitude and personality measurement. Nonparametric methods are evaluated fo
A Bayesian Nonparametric Approach to Test Equating
Karabatsos, George; Walker, Stephen G.
2009-01-01
A Bayesian nonparametric model is introduced for score equating. It is applicable to all major equating designs, and has advantages over previous equating models. Unlike the previous models, the Bayesian model accounts for positive dependence between distributions of scores from two tests. The Bayesian model and the previous equating models are…
Nonparametric confidence intervals for monotone functions
Groeneboom, P.; Jongbloed, G.
2015-01-01
We study nonparametric isotonic confidence intervals for monotone functions. In [Ann. Statist. 29 (2001) 1699–1731], pointwise confidence intervals, based on likelihood ratio tests using the restricted and unrestricted MLE in the current status model, are introduced. We extend the method to the trea
Nonparametric confidence intervals for monotone functions
Groeneboom, P.; Jongbloed, G.
2015-01-01
We study nonparametric isotonic confidence intervals for monotone functions. In [Ann. Statist. 29 (2001) 1699–1731], pointwise confidence intervals, based on likelihood ratio tests using the restricted and unrestricted MLE in the current status model, are introduced. We extend the method to the
Panel data specifications in nonparametric kernel regression
DEFF Research Database (Denmark)
Czekaj, Tomasz Gerard; Henningsen, Arne
parametric panel data estimators to analyse the production technology of Polish crop farms. The results of our nonparametric kernel regressions generally differ from the estimates of the parametric models but they only slightly depend on the choice of the kernel functions. Based on economic reasoning, we...
Kassouf, Amine; Maalouly, Jacqueline; Rutledge, Douglas N; Chebib, Hanna; Ducruet, Violette
2014-11-01
Plastic packaging wastes increased considerably in recent decades, raising a major and serious public concern on political, economical and environmental levels. Dealing with this kind of problems is generally done by landfilling and energy recovery. However, these two methods are becoming more and more expensive, hazardous to the public health and the environment. Therefore, recycling is gaining worldwide consideration as a solution to decrease the growing volume of plastic packaging wastes and simultaneously reduce the consumption of oil required to produce virgin resin. Nevertheless, a major shortage is encountered in recycling which is related to the sorting of plastic wastes. In this paper, a feasibility study was performed in order to test the potential of an innovative approach combining mid infrared (MIR) spectroscopy with independent components analysis (ICA), as a simple and fast approach which could achieve high separation rates. This approach (MIR-ICA) gave 100% discrimination rates in the separation of all studied plastics: polyethylene terephthalate (PET), polyethylene (PE), polypropylene (PP), polystyrene (PS) and polylactide (PLA). In addition, some more specific discriminations were obtained separating plastic materials belonging to the same polymer family e.g. high density polyethylene (HDPE) from low density polyethylene (LDPE). High discrimination rates were obtained despite the heterogeneity among samples especially differences in colors, thicknesses and surface textures. The reproducibility of the proposed approach was also tested using two spectrometers with considerable differences in their sensitivities. Discrimination rates were not affected proving that the developed approach could be extrapolated to different spectrometers. MIR combined with ICA is a promising tool for plastic waste separation that can help improve performance in this field; however further technological improvements and developments are required before it can be applied
Singh, Sartajvir; Talwar, Rajneesh
2016-12-01
Detection of snow cover changes is vital for avalanche hazard analysis and flood flashes that arise due to variation in temperature. Hence, multitemporal change detection is one of the practical mean to estimate the snow cover changes over larger area using remotely sensed data. There have been some previous studies that examined how accuracy of change detection analysis is affected by different topography effects over Northwestern Indian Himalayas. The present work emphases on the intercomparison of different topography effects on discrimination performance of fuzzy based change vector analysis (FCVA) as change detection algorithm that includes extraction of change-magnitude and change-direction from a specific pixel belongs multiple or partial membership. The qualitative and quantitative analysis of the proposed FCVA algorithm is performed under topographic conditions and topographic correction conditions. The experimental outcomes confirmed that in change category discrimination procedure, FCVA with topographic correction achieved 86.8% overall accuracy and 4.8% decay (82% of overall accuracy) is found in FCVA without topographic correction. This study suggests that by incorporating the topographic correction model over mountainous region satellite imagery, performance of FCVA algorithm can be significantly improved up to great extent in terms of determining actual change categories.
Taverna, Domenico; Di Donna, Leonardo; Mazzotti, Fabio; Tagarelli, Antonio; Napoli, Anna; Furia, Emilia; Sindona, Giovanni
2016-09-01
A novel approach for the rapid discrimination of bergamot essential oil from other citrus fruits oils is presented. The method was developed using paper spray mass spectrometry (PS-MS) allowing for a rapid molecular profiling coupled with a statistic tool for a precise and reliable discrimination between the bergamot complex matrix and other similar matrices, commonly used for its reconstitution. Ambient mass spectrometry possesses the ability to record mass spectra of ordinary samples, in their native environment, without sample preparation or pre-separation by creating ions outside the instrument. The present study reports a PS-MS method for the determination of oxygen heterocyclic compounds such as furocoumarins, psoralens and flavonoids present in the non-volatile fraction of citrus fruits essential oils followed by chemometric analysis. The volatile fraction of Bergamot is one of the most known and fashionable natural products, which found applications in flavoring industry as ingredient in beverages and flavored foodstuff. The development of the presented method employed bergamot, sweet orange, orange, cedar, grapefruit and mandarin essential oils. PS-MS measurements were carried out in full scan mode for a total run time of 2 min. The capability of PS-MS profiling to act as marker for the classification of bergamot essential oils was evaluated by using multivariate statistical analysis. Two pattern recognition techniques, linear discriminant analysis and soft independent modeling of class analogy, were applied to MS data. The cross-validation procedure has shown excellent results in terms of the prediction ability because both models have correctly classified all samples for each category. Copyright © 2016 John Wiley & Sons, Ltd.
Discriminative Sparse Representations in Hyperspectral Imagery
2010-03-01
classification , and unsupervised labeling (clustering) [2, 3, 4, 5, 6, 7, 8]. Recently, a non-parametric (Bayesian) approach to sparse modeling and com...DISCRIMINATIVE SPARSE REPRESENTATIONS IN HYPERSPECTRAL IMAGERY By Alexey Castrodad, Zhengming Xing John Greer, Edward Bosch Lawrence Carin and...00-00-2010 to 00-00-2010 4. TITLE AND SUBTITLE Discriminative Sparse Representations in Hyperspectral Imagery 5a. CONTRACT NUMBER 5b. GRANT
Directory of Open Access Journals (Sweden)
Zhigao Zeng
2016-01-01
Full Text Available This paper proposes a novel algorithm to solve the challenging problem of classifying error-diffused halftone images. We firstly design the class feature matrices, after extracting the image patches according to their statistics characteristics, to classify the error-diffused halftone images. Then, the spectral regression kernel discriminant analysis is used for feature dimension reduction. The error-diffused halftone images are finally classified using an idea similar to the nearest centroids classifier. As demonstrated by the experimental results, our method is fast and can achieve a high classification accuracy rate with an added benefit of robustness in tackling noise.
Discourse analysis of gender equality and non-discrimination laws and strategies
Directory of Open Access Journals (Sweden)
Antonijević Zorana
2017-01-01
Full Text Available Based on the contemporary research on gender and language, using the method of discourse analysis applied to the laws and policies, this article explains how certain linguistic practice, in the context of the administrative discourse, produces meaning that may or may not contribute to its better understanding and more efficient implementation. Through discourse analysis of gender equality and non-discrimination laws and strategies in Serbia, it has been shown how and with what consequences the socio-political and academic elites affect defining and promoting certain concepts (gender, sex, gender equality, discrimination in one social and historical moment. The paper is placed in the theoretical framework of three visions of gender equality: perspective of equal treatment, women‘s perspectives and gender perspective (Booth, Bennett 2002, that are corresponding to the three strategies for achieving gender equality: equal treatment, specific policy of gender equality and gender mainstreaming (Verloo, 2001. The discourse analysis of the Law on Gender Equality (2009, the National Strategy for the Improvement of the Position of Women and Advancement of Gender Equality (2009, the Law on Prohibition of Discrimination (2009 and the Strategy for Prevention and Protection against Discrimination (2013, has shown the context of use and meaning of terms gender and sex, as well as implications it has on their potential to change the existing paradigms and understanding of gender equality, and the implementation of policies in Serbia. Analysis of the use of terms sex and gender in the most important legal and strategic documents for achieving gender equality, showed that the choice of certain categories and terms is always a political choice. The authors show how these documents are written in the key of two gender perspectives and strategies: equal treatment and the specific policy of gender equality, while the third - introduction of a gender perspective
Sjogren, D.; Martin, Y. E.; Jagielko, L.
2010-12-01
Gimbarzevsky (1988) collected an exceptional landsliding inventory for the Haida Gwaii, British Columbia (formerly called the Queen Charlotte Islands). This data base includes more than 8 000 landsliding vectors, with an areal coverage of about 10 000 km2. Unfortunately, this landsliding inventory was never published in the referred literature, despite its regional significance. The data collection occurred prior to widespread use of GIS technologies in landsliding analysis, thus restricting the types of analyses that were undertaken at the time relative to what is possible today. Gimbarzevsky identified the landsliding events from 1:50 000 aerial photographs, and then transferred the landslide vectors to NTS map sheets. In this study, we digitized the landslide vectors from these original map sheets and connected each vector to a digital elevation model. Lengths of landslide vectors were then compared to results of Rood (1984), whose landsliding inventory for the Haida Gwaii relied on larger-scale aerial photographs (~ 1:13 000). A comparison of the two data bases shows that Rood’s inventory contains a more complete record of smaller landslides, whereas Gimbarzevsky’s inventory provides a much better statistical representation of less frequently occurring, medium to large landslide events. We then apply discriminant analysis to the Gimbarzevsky data base to assess which of a set of ten predictor variables, selected on the basis of mechanical theory, best predict failed vs. unfailed locations in the landscape (referred to as the grouping variable in discriminant analysis). Certain predictor variables may be cross-correlated, and any one particular variable may be related to several aspects of mechanical theory (for example, a particular variable may affect various components of shear stress and/or shear strength); it is important to recognize that the significance of particular groupings may reflect this information. Eight of the original variables were found
Estimating the causes of traffic accidents using logistic regression and discriminant analysis.
Karacasu, Murat; Ergül, Barış; Altin Yavuz, Arzu
2014-01-01
Factors that affect traffic accidents have been analysed in various ways. In this study, we use the methods of logistic regression and discriminant analysis to determine the damages due to injury and non-injury accidents in the Eskisehir Province. Data were obtained from the accident reports of the General Directorate of Security in Eskisehir; 2552 traffic accidents between January and December 2009 were investigated regarding whether they resulted in injury. According to the results, the effects of traffic accidents were reflected in the variables. These results provide a wealth of information that may aid future measures toward the prevention of undesired results.
Energy Technology Data Exchange (ETDEWEB)
Kassouf, Amine, E-mail: amine.kassouf@agroparistech.fr [ER004 “Lebanese Food Packaging”, Faculty of Sciences II, Lebanese University, 90656 Jdeideth El Matn, Fanar (Lebanon); INRA, UMR1145 Ingénierie Procédés Aliments, 1 Avenue des Olympiades, 91300 Massy (France); AgroParisTech, UMR1145 Ingénierie Procédés Aliments, 16 rue Claude Bernard, 75005 Paris (France); Maalouly, Jacqueline, E-mail: j_maalouly@hotmail.com [ER004 “Lebanese Food Packaging”, Faculty of Sciences II, Lebanese University, 90656 Jdeideth El Matn, Fanar (Lebanon); Rutledge, Douglas N., E-mail: douglas.rutledge@agroparistech.fr [INRA, UMR1145 Ingénierie Procédés Aliments, 1 Avenue des Olympiades, 91300 Massy (France); AgroParisTech, UMR1145 Ingénierie Procédés Aliments, 16 rue Claude Bernard, 75005 Paris (France); Chebib, Hanna, E-mail: hchebib@hotmail.com [ER004 “Lebanese Food Packaging”, Faculty of Sciences II, Lebanese University, 90656 Jdeideth El Matn, Fanar (Lebanon); Ducruet, Violette, E-mail: violette.ducruet@agroparistech.fr [INRA, UMR1145 Ingénierie Procédés Aliments, 1 Avenue des Olympiades, 91300 Massy (France); AgroParisTech, UMR1145 Ingénierie Procédés Aliments, 16 rue Claude Bernard, 75005 Paris (France)
2014-11-15
Highlights: • An innovative technique, MIR-ICA, was applied to plastic packaging separation. • This study was carried out on PE, PP, PS, PET and PLA plastic packaging materials. • ICA was applied to discriminate plastics and 100% separation rates were obtained. • Analyses performed on two spectrometers proved the reproducibility of the method. • MIR-ICA is a simple and fast technique allowing plastic identification/classification. - Abstract: Plastic packaging wastes increased considerably in recent decades, raising a major and serious public concern on political, economical and environmental levels. Dealing with this kind of problems is generally done by landfilling and energy recovery. However, these two methods are becoming more and more expensive, hazardous to the public health and the environment. Therefore, recycling is gaining worldwide consideration as a solution to decrease the growing volume of plastic packaging wastes and simultaneously reduce the consumption of oil required to produce virgin resin. Nevertheless, a major shortage is encountered in recycling which is related to the sorting of plastic wastes. In this paper, a feasibility study was performed in order to test the potential of an innovative approach combining mid infrared (MIR) spectroscopy with independent components analysis (ICA), as a simple and fast approach which could achieve high separation rates. This approach (MIR-ICA) gave 100% discrimination rates in the separation of all studied plastics: polyethylene terephthalate (PET), polyethylene (PE), polypropylene (PP), polystyrene (PS) and polylactide (PLA). In addition, some more specific discriminations were obtained separating plastic materials belonging to the same polymer family e.g. high density polyethylene (HDPE) from low density polyethylene (LDPE). High discrimination rates were obtained despite the heterogeneity among samples especially differences in colors, thicknesses and surface textures. The reproducibility of
Directory of Open Access Journals (Sweden)
Meguro,Tadamichi
1982-08-01
Full Text Available Volume-time (V-T and flow-volume (F-V curves were measured in all the subjects of nonsmoking young males (mean value 26.3 yrs. of age, healthy and asthmatics. Eleven parameters of pulmonary function tests composed of two V-T, six F-V, and three mean time constant (MTC parameters, were calculated from the curves. These parameters were used in the two analyses through the all possible selection procedure (APSP discriminating between healthy adults and mild asthmatics and also between healthy and moderate. Flow rate at 75% of FVC (V75 proved to be the most useful parameter and V50 the next best in both analyses. The probability of misclassification using all eleven parameters was 19.64% in the analysis of healthy adults and mild asthmatics, and 4.29% in the analysis of healthy adults and moderate asthmatics. There was a little difference in the parameters selected at every step. The discriminant analysis proved that the flow-volume patterns were different according to the severity of bronchial asthma. Thus flow-volume recognition was considered to be important in analyzing the severity of bronchial asthma.
Predicting groundwater redox status on a regional scale using linear discriminant analysis
Close, M. E.; Abraham, P.; Humphries, B.; Lilburne, L.; Cuthill, T.; Wilson, S.
2016-08-01
Reducing conditions are necessary for denitrification, thus the groundwater redox status can be used to identify subsurface zones where potentially significant nitrate reduction can occur. Groundwater chemistry in two contrasting regions of New Zealand was classified with respect to redox status and related to mappable factors, such as geology, topography and soil characteristics using discriminant analysis. Redox assignment was carried out for water sampled from 568 and 2223 wells in the Waikato and Canterbury regions, respectively. For the Waikato region 64% of wells sampled indicated oxic conditions in the water; 18% indicated reduced conditions and 18% had attributes indicating both reducing and oxic conditions termed "mixed". In Canterbury 84% of wells indicated oxic conditions; 10% were mixed; and only 5% indicated reduced conditions. The analysis was performed over three different well depths, 100 m. For both regions, the percentage of oxidised groundwater decreased with increasing well depth. Linear discriminant analysis was used to develop models to differentiate between the three redox states. Models were derived for each depth and region using 67% of the data, and then subsequently validated on the remaining 33%. The average agreement between predicted and measured redox status was 63% and 70% for the Waikato and Canterbury regions, respectively. The models were incorporated into GIS and the prediction of redox status was extended over the whole region, excluding mountainous land. This knowledge improves spatial prediction of reduced groundwater zones, and therefore, when combined with groundwater flow paths, improves estimates of denitrification.
Yan, Si-Min; Liu, Jun-Ping; Xu, Lu; Fu, Xian-Shu; Cui, Hai-Feng; Yun, Zhen-Yu; Yu, Xiao-Ping; Ye, Zi-Hong
2014-01-01
This paper focuses on a rapid and nondestructive way to discriminate the geographical origin of Anxi-Tieguanyin tea by near-infrared (NIR) spectroscopy and chemometrics. 450 representative samples were collected from Anxi County, the original producing area of Tieguanyin tea, and another 120 Tieguanyin samples with similar appearance were collected from unprotected producing areas in China. All these samples were measured by NIR. The Stahel-Donoho estimates (SDE) outlyingness diagnosis was used to remove the outliers. Partial least squares discriminant analysis (PLSDA) was performed to develop a classification model and predict the authenticity of unknown objects. To improve the sensitivity and specificity of classification, the raw data was preprocessed to reduce unwanted spectral variations by standard normal variate (SNV) transformation, taking second-order derivatives (D2) spectra, and smoothing. As the best model, the sensitivity and specificity reached 0.931 and 1.000 with SNV spectra. Combination of NIR spectrometry and statistical model selection can provide an effective and rapid method to discriminate the geographical producing area of Anxi-Tieguanyin.
Directory of Open Access Journals (Sweden)
Si-Min Yan
2014-01-01
Full Text Available This paper focuses on a rapid and nondestructive way to discriminate the geographical origin of Anxi-Tieguanyin tea by near-infrared (NIR spectroscopy and chemometrics. 450 representative samples were collected from Anxi County, the original producing area of Tieguanyin tea, and another 120 Tieguanyin samples with similar appearance were collected from unprotected producing areas in China. All these samples were measured by NIR. The Stahel-Donoho estimates (SDE outlyingness diagnosis was used to remove the outliers. Partial least squares discriminant analysis (PLSDA was performed to develop a classification model and predict the authenticity of unknown objects. To improve the sensitivity and specificity of classification, the raw data was preprocessed to reduce unwanted spectral variations by standard normal variate (SNV transformation, taking second-order derivatives (D2 spectra, and smoothing. As the best model, the sensitivity and specificity reached 0.931 and 1.000 with SNV spectra. Combination of NIR spectrometry and statistical model selection can provide an effective and rapid method to discriminate the geographical producing area of Anxi-Tieguanyin.
Directory of Open Access Journals (Sweden)
Corrado lo Storto
2013-12-01
Full Text Available This article reports the outcome of a performance study of the water service provision industry in Italy. The study evaluates the efficiency of 21 “private or public-private” equity and 32 “public” equity water service operators and investigates controlling factors. In particular, the influence that the operator typology and service management nature - private vs. public - has on efficiency is assessed. The study employed a two-stage Data Envelopment Analysis methodology. In the first stage, the operational efficiency of water supply operators is calculated by implementing a conventional BCC DEA model, that uses both physical infrastructure and financial input and output variables to explore economies of scale. In the second stage, bootstrapped DEA and Tobit regression are performed to estimate the influence that a number of environmental factors have on water supplier efficiency. The results show that the integrated water provision industry in Italy is characterized by operational inefficiencies of service operators, and scale and agglomeration economies may have a not negligible effect on efficiency. In addition, the operator typology and its geographical location affect efficiency.
Zhu, Ying; Tan, Tuck Lee
2016-04-01
An effective and simple analytical method using Fourier transform infrared (FTIR) spectroscopy to distinguish wild-grown high-quality Ganoderma lucidum (G. lucidum) from cultivated one is of essential importance for its quality assurance and medicinal value estimation. Commonly used chemical and analytical methods using full spectrum are not so effective for the detection and interpretation due to the complex system of the herbal medicine. In this study, two penalized discriminant analysis models, penalized linear discriminant analysis (PLDA) and elastic net (Elnet),using FTIR spectroscopy have been explored for the purpose of discrimination and interpretation. The classification performances of the two penalized models have been compared with two widely used multivariate methods, principal component discriminant analysis (PCDA) and partial least squares discriminant analysis (PLSDA). The Elnet model involving a combination of L1 and L2 norm penalties enabled an automatic selection of a small number of informative spectral absorption bands and gave an excellent classification accuracy of 99% for discrimination between spectra of wild-grown and cultivated G. lucidum. Its classification performance was superior to that of the PLDA model in a pure L1 setting and outperformed the PCDA and PLSDA models using full wavelength. The well-performed selection of informative spectral features leads to substantial reduction in model complexity and improvement of classification accuracy, and it is particularly helpful for the quantitative interpretations of the major chemical constituents of G. lucidum regarding its anti-cancer effects.
Zhu, Ying; Tan, Tuck Lee
2016-04-15
An effective and simple analytical method using Fourier transform infrared (FTIR) spectroscopy to distinguish wild-grown high-quality Ganoderma lucidum (G. lucidum) from cultivated one is of essential importance for its quality assurance and medicinal value estimation. Commonly used chemical and analytical methods using full spectrum are not so effective for the detection and interpretation due to the complex system of the herbal medicine. In this study, two penalized discriminant analysis models, penalized linear discriminant analysis (PLDA) and elastic net (Elnet),using FTIR spectroscopy have been explored for the purpose of discrimination and interpretation. The classification performances of the two penalized models have been compared with two widely used multivariate methods, principal component discriminant analysis (PCDA) and partial least squares discriminant analysis (PLSDA). The Elnet model involving a combination of L1 and L2 norm penalties enabled an automatic selection of a small number of informative spectral absorption bands and gave an excellent classification accuracy of 99% for discrimination between spectra of wild-grown and cultivated G. lucidum. Its classification performance was superior to that of the PLDA model in a pure L1 setting and outperformed the PCDA and PLSDA models using full wavelength. The well-performed selection of informative spectral features leads to substantial reduction in model complexity and improvement of classification accuracy, and it is particularly helpful for the quantitative interpretations of the major chemical constituents of G. lucidum regarding its anti-cancer effects.
Peerbhay, Kabir Yunus; Mutanga, Onisimo; Ismail, Riyad
2013-05-01
Discriminating commercial tree species using hyperspectral remote sensing techniques is critical in monitoring the spatial distributions and compositions of commercial forests. However, issues related to data dimensionality and multicollinearity limit the successful application of the technology. The aim of this study was to examine the utility of the partial least squares discriminant analysis (PLS-DA) technique in accurately classifying six exotic commercial forest species (Eucalyptus grandis, Eucalyptus nitens, Eucalyptus smithii, Pinus patula, Pinus elliotii and Acacia mearnsii) using airborne AISA Eagle hyperspectral imagery (393-900 nm). Additionally, the variable importance in the projection (VIP) method was used to identify subsets of bands that could successfully discriminate the forest species. Results indicated that the PLS-DA model that used all the AISA Eagle bands (n = 230) produced an overall accuracy of 80.61% and a kappa value of 0.77, with user's and producer's accuracies ranging from 50% to 100%. In comparison, incorporating the optimal subset of VIP selected wavebands (n = 78) in the PLS-DA model resulted in an improved overall accuracy of 88.78% and a kappa value of 0.87, with user's and producer's accuracies ranging from 70% to 100%. Bands located predominantly within the visible region of the electromagnetic spectrum (393-723 nm) showed the most capability in terms of discriminating between the six commercial forest species. Overall, the research has demonstrated the potential of using PLS-DA for reducing the dimensionality of hyperspectral datasets as well as determining the optimal subset of bands to produce the highest classification accuracies.
Local classification: Locally weighted-partial least squares-discriminant analysis (LW-PLS-DA).
Bevilacqua, Marta; Marini, Federico
2014-08-01
The possibility of devising a simple, flexible and accurate non-linear classification method, by extending the locally weighted partial least squares (LW-PLS) approach to the cases where the algorithm is used in a discriminant way (partial least squares discriminant analysis, PLS-DA), is presented. In particular, to assess which category an unknown sample belongs to, the proposed algorithm operates by identifying which training objects are most similar to the one to be predicted and building a PLS-DA model using these calibration samples only. Moreover, the influence of the selected training samples on the local model can be further modulated by adopting a not uniform distance-based weighting scheme which allows the farthest calibration objects to have less impact than the closest ones. The performances of the proposed locally weighted-partial least squares-discriminant analysis (LW-PLS-DA) algorithm have been tested on three simulated data sets characterized by a varying degree of non-linearity: in all cases, a classification accuracy higher than 99% on external validation samples was achieved. Moreover, when also applied to a real data set (classification of rice varieties), characterized by a high extent of non-linearity, the proposed method provided an average correct classification rate of about 93% on the test set. By the preliminary results, showed in this paper, the performances of the proposed LW-PLS-DA approach have proved to be comparable and in some cases better than those obtained by other non-linear methods (k nearest neighbors, kernel-PLS-DA and, in the case of rice, counterpropagation neural networks).
交通拥堵持续时间的非参数生存分析%Nonparametric survival analysis of traffic congestion duration time
Institute of Scientific and Technical Information of China (English)
杨小宝; 周映雪
2013-01-01
The hazard-based traffic congestion duration model was established through survival analysis method. Based on the empirical traffic flow data on the Third Ring Road in Beijing, the traffic congestion duration of the Third Ring Road was estimated. The results show that 56% of the congestion durations of road segments on the Third Ring Road are not larger than four minutes. 90% of the congestion durations are not larger than 12 minutes. The hazard rate is less than 10% when the duration is larger than 12 minutes. The occurrence frequency of congestion on the outer ring is larger than the inner ring while the congestion duration on the inner ring is longer than the duration on the outer ring.%采用生存分析的非参数方法,建立基于风险的交通拥堵持续时间模型,根据北京市三环快速路的交通流数据,对三环道路交通拥堵持续时间进行了估计.结果表明:三环各路段的拥堵持续时间56％在4 min以内,90％在12 min之内；当拥堵持续时间超过12 min之后拥堵结束的可能性小于10％.外环比内环更容易发生拥堵,但当拥堵发生后内环的拥堵持续时间更长.
DEFF Research Database (Denmark)
Ribeiro, Alexandra B.; Nielsen, Allan Aasbjerg
1997-01-01
qualitative microprobe results: present elements Al, Si, Cr, Fe, As (associated with others). Selected groups of calibrated images (same light conditions and magnification) submitted to discriminant analysis, in order to find a pattern of recognition in the soil features corresponding to contamination already...... in the soluble and exchangeable phase, these elements being associated primarily with amorphous-crystalline Fe-oxides, organic matter and/or resistant phases. The results obtained with sequential extraction were the prerequisite to the attempt to identify the Cr and As distribution in the solid phase. If high...... concentrations of contaminants are indicated by chemical wet analysis, these contaminants must occur directly in the solid phase. Thin sections of soil aggregates were scanned for Cu, Cr and As using an electron microprobe, and qualitative analysis was made on selected areas. Microphotographs of thin sections...
Directory of Open Access Journals (Sweden)
Deni Memić
2015-01-01
Full Text Available This article has an aim to assess credit default prediction on the banking market in Bosnia and Herzegovina nationwide as well as on its constitutional entities (Federation of Bosnia and Herzegovina and Republika Srpska. Ability to classify companies info different predefined groups or finding an appropriate tool which would replace human assessment in classifying companies into good and bad buckets has been one of the main interests on risk management researchers for a long time. We investigated the possibility and accuracy of default prediction using traditional statistical methods logistic regression (logit and multiple discriminant analysis (MDA and compared their predictive abilities. The results show that the created models have high predictive ability. For logit models, some variables are more influential on the default prediction than the others. Return on assets (ROA is statistically significant in all four periods prior to default, having very high regression coefficients, or high impact on the model's ability to predict default. Similar results are obtained for MDA models. It is also found that predictive ability differs between logistic regression and multiple discriminant analysis.
Predicting ethnic and racial discrimination: a meta-analysis of IAT criterion studies.
Oswald, Frederick L; Mitchell, Gregory; Blanton, Hart; Jaccard, James; Tetlock, Philip E
2013-08-01
This article reports a meta-analysis of studies examining the predictive validity of the Implicit Association Test (IAT) and explicit measures of bias for a wide range of criterion measures of discrimination. The meta-analysis estimates the heterogeneity of effects within and across 2 domains of intergroup bias (interracial and interethnic), 6 criterion categories (interpersonal behavior, person perception, policy preference, microbehavior, response time, and brain activity), 2 versions of the IAT (stereotype and attitude IATs), 3 strategies for measuring explicit bias (feeling thermometers, multi-item explicit measures such as the Modern Racism Scale, and ad hoc measures of intergroup attitudes and stereotypes), and 4 criterion-scoring methods (computed majority-minority difference scores, relative majority-minority ratings, minority-only ratings, and majority-only ratings). IATs were poor predictors of every criterion category other than brain activity, and the IATs performed no better than simple explicit measures. These results have important implications for the construct validity of IATs, for competing theories of prejudice and attitude-behavior relations, and for measuring and modeling prejudice and discrimination.
Djupsjöbacka, Mats; Domkin, Dmitry
2005-01-01
In order to plan and control movements the central nervous system (CNS) needs to continuously keep track of the state of the musculoskeletal system. Therefore the CNS constantly uses sensory input from mechanoreceptors in muscles, joints and skin to update information about body configuration on different levels of the CNS. On the conscious level, such representations constitute proprioception. Different tests for assessment of proprioceptive acuity have been described. However, it is unclear if the proprioceptive acuity measurements in these tests correlate within subjects. By using both uni- and multivariate analysis we compared proprioceptive acuity in different variants of ipsilateral active and passive limb position-matching and ipsilateral passive limb movement velocity-discrimination in a group of healthy subjects. The analysis of the position-matching data revealed a higher acuity of matching for active movements in comparison to passive ones. The acuity of matching was negatively correlated to movement extent. There was a lack of correlation between proprioceptive acuity measurements in position-matching and velocity-discrimination.
Prostate lesion detection and localization based on locality alignment discriminant analysis
Lin, Mingquan; Chen, Weifu; Zhao, Mingbo; Gibson, Eli; Bastian-Jordan, Matthew; Cool, Derek W.; Kassam, Zahra; Chow, Tommy W. S.; Ward, Aaron; Chiu, Bernard
2017-03-01
Prostatic adenocarcinoma is one of the most commonly occurring cancers among men in the world, and it also the most curable cancer when it is detected early. Multiparametric MRI (mpMRI) combines anatomic and functional prostate imaging techniques, which have been shown to produce high sensitivity and specificity in cancer localization, which is important in planning biopsies and focal therapies. However, in previous investigations, lesion localization was achieved mainly by manual segmentation, which is time-consuming and prone to observer variability. Here, we developed an algorithm based on locality alignment discriminant analysis (LADA) technique, which can be considered as a version of linear discriminant analysis (LDA) localized to patches in the feature space. Sensitivity, specificity and accuracy generated by the proposed algorithm in five prostates by LADA were 52.2%, 89.1% and 85.1% respectively, compared to 31.3%, 85.3% and 80.9% generated by LDA. The delineation accuracy attainable by this tool has a potential in increasing the cancer detection rate in biopsies and in minimizing collateral damage of surrounding tissues in focal therapies.
Stucky, Brian D; Gottfredson, Nisha C; Panter, A T; Daye, Charles E; Allen, Walter R; Wightman, Linda F
2011-04-01
The Everyday Discrimination Scale (EDS), a widely used measure of daily perceived discrimination, is purported to be unidimensional, to function well among African Americans, and to have adequate construct validity. Two separate studies and data sources were used to examine and cross-validate the psychometric properties of the EDS. In Study 1, an exploratory factor analysis was conducted on a sample of African American law students (N = 589), providing strong evidence of local dependence, or nuisance multidimensionality within the EDS. In Study 2, a separate nationally representative community sample (N = 3,527) was used to model the identified local dependence in an item factor analysis (i.e., bifactor model). Next, item response theory (IRT) calibrations were conducted to obtain item parameters. A five-item, revised-EDS was then tested for gender differential item functioning (in an IRT framework). Based on these analyses, a summed score to IRT-scaled score translation table is provided for the revised-EDS. Our results indicate that the revised-EDS is unidimensional, with minimal differential item functioning, and retains predictive validity consistent with the original scale.
Directory of Open Access Journals (Sweden)
Hervé Abdi
2012-01-01
Full Text Available We present a new discriminant analysis (DA method called Multiple Subject Barycentric Discriminant Analysis (MUSUBADA suited for analyzing fMRI data because it handles datasets with multiple participants that each provides different number of variables (i.e., voxels that are themselves grouped into regions of interest (ROIs. Like DA, MUSUBADA (1 assigns observations to predefined categories, (2 gives factorial maps displaying observations and categories, and (3 optimally assigns observations to categories. MUSUBADA handles cases with more variables than observations and can project portions of the data table (e.g., subtables, which can represent participants or ROIs on the factorial maps. Therefore MUSUBADA can analyze datasets with different voxel numbers per participant and, so does not require spatial normalization. MUSUBADA statistical inferences are implemented with cross-validation techniques (e.g., jackknife and bootstrap, its performance is evaluated with confusion matrices (for fixed and random models and represented with prediction, tolerance, and confidence intervals. We present an example where we predict the image categories (houses, shoes, chairs, and human, monkey, dog, faces, of images watched by participants whose brains were scanned. This example corresponds to a DA question in which the data table is made of subtables (one per subject and with more variables than observations.
Willard, Melissa A Bodnar; McGuffin, Victoria L; Smith, Ruth Waddell
2012-01-01
Salvia divinorum is a hallucinogenic herb that is internationally regulated. In this study, salvinorin A, the active compound in S. divinorum, was extracted from S. divinorum plant leaves using a 5-min extraction with dichloromethane. Four additional Salvia species (Salvia officinalis, Salvia guaranitica, Salvia splendens, and Salvia nemorosa) were extracted using this procedure, and all extracts were analyzed by gas chromatography-mass spectrometry. Differentiation of S. divinorum from other Salvia species was successful based on visual assessment of the resulting chromatograms. To provide a more objective comparison, the total ion chromatograms (TICs) were subjected to principal components analysis (PCA). Prior to PCA, the TICs were subjected to a series of data pretreatment procedures to minimize non-chemical sources of variance in the data set. Successful discrimination of S. divinorum from the other four Salvia species was possible based on visual assessment of the PCA scores plot. To provide a numerical assessment of the discrimination, a series of statistical procedures such as Euclidean distance measurement, hierarchical cluster analysis, Student's t tests, Wilcoxon rank-sum tests, and Pearson product moment correlation were also applied to the PCA scores. The statistical procedures were then compared to determine the advantages and disadvantages for forensic applications.
Ghiasi-Freez, Javad; Soleimanpour, Iman; Kadkhodaie-Ilkhchi, Ali; Ziaii, Mansur; Sedighi, Mahdi; Hatampour, Amir
2012-08-01
Identification of different types of porosity within a reservoir rock is a functional parameter for reservoir characterization since various pore types play different roles in fluid transport and also, the pore spaces determine the fluid storage capacity of the reservoir. The present paper introduces a model for semi-automatic identification of porosity types within thin section images. To get this goal, a pattern recognition algorithm is followed. Firstly, six geometrical shape parameters of sixteen largest pores of each image are extracted using image analysis techniques. The extracted parameters and their corresponding pore types of 294 pores are used for training two intelligent discriminant classifiers, namely linear and quadratic discriminant analysis. The trained classifiers take the geometrical features of the pores to identify the type and percentage of five types of porosity, including interparticle, intraparticle, oomoldic, biomoldic, and vuggy in each image. The accuracy of classifiers is determined from two standpoints. Firstly, the predicted and measured percentages of each type of porosity are compared with each other. The results indicate reliable performance for predicting percentage of each type of porosity. In the second step, the precisions of classifiers for categorizing the pore spaces are analyzed. The classifiers also took a high acceptance score when used for individual recognition of pore spaces. The proposed methodology is a further promising application for petroleum geologists allowing statistical study of pore types in a rapid and accurate way.
Classification of hand preshaping in persons with stroke using Linear Discriminant Analysis.
Puthenveettil, Saumya; Fluet, Gerard; Qiu, Qinyin; Adamovich, Sergei
2012-01-01
This study describes the analysis of hand preshaping using Linear Discriminant Analysis (LDA) to predict hand formation during reaching and grasping tasks of the hemiparetic hand, following a series of upper extremity motor intervention treatments. The purpose of this study is to use classification of hand posture as an additional tool for evaluating the effectiveness of therapies for upper extremity rehabilitation such as virtual reality (VR) therapy and conventional physical therapy. Classification error for discriminating between two objects during hand preshaping is obtained for the hemiparetic and unimpaired hands pre and post training. Eight subjects post stroke participated in a two-week training session consisting of upper extremity motor training. Four subjects trained with interactive VR computer games and four subjects trained with clinical physical therapy procedures of similar intensity. Subjects' finger joint angles were measured during a kinematic reach to grasp test using CyberGlove® and arm joint angles were measured using the trackSTAR™ system prior to training and after training. The unimpaired hand of subjects preshape into the target object with greater accuracy than the hemiparetic hand as indicated by lower classification errors. Hemiparetic hand improved in preshaping accuracy and time to reach minimum error. Classification of hand preshaping may provide insight into improvements in motor performance elicited by robotically facilitated virtually simulated training sessions or conventional physical therapy.
Shape-based discriminative analysis of combined bilateral hippocampi using multiple object alignment
Shen, Li; Makedon, Fillia; Saykin, Andrew
2004-05-01
Shape analysis of hippocampi in schizophrenia has been preformed previously using the spherical harmonic SPHARM description. In these studies, the left and right hippocampi are aligned independently and the spatial relation between them is not explored. This paper presents a new SPHARM-based technique which examines not only the individual shape information of the two hippocampi but also the spatial relation between them. The left and right hippocampi are treated as a single shape configuration. A ploy-shape alignment algorithm is developed for aligning configurations of multiple SPHARM surfaces as follows: (1) the total volume is normalized; (2) the parameter space is aligned for creating the surface correspondence; (3) landmarks are created by a uniform sampling of multiple surfaces for each configuration; (4) a quaternion-based algorithm is employed to align each landmark representation to the mean configuration through the least square rotation and translation iteratively until the mean converges. After applying the poly-shape alignment algorithm, a point distribution model is applied to aligned landmarks for feature extraction. Classification is performed using Fisher's linear discriminant with an effective feature selection scheme. Applying the above procedure to our hippocampal data (14 controls versus 25 schizophrenics, all right-handed males), we achieve the best cross-validation accuracy of 92%, supporting the idea that the whole shape configuration of the two hippocampi provides valuable information in detecting schizophrenia. The results of an ROC analysis and a visualization of discriminative patterns are also included.
The differentiation of camel breeds based on meat measurements using discriminant analysis.
Al-Atiyat, Raed Mahmoud; Suliman, Gamal; AlSuhaibani, Entissar; El-Waziry, Ahmad; Al-Owaimer, Abdullah; Basmaeil, Saeid
2016-06-01
The meat productivity of camel in the tropics is still under investigation for identification of better meat breed or type. Therefore, four one-humped Saudi Arabian (SA) camel breeds, Majaheem, Maghateer, Hamrah, and Safrah were experimented in order to differentiate them from each other based on meat measurements. The measurements were biometrical meat traits measured on six intact males from each breed. The results showed higher values of the Majaheem breed than that obtained for the other breeds except few cases such dressing percentage and rib-eye area. In differentiation analysis, the most discriminating meat variables were myofibrillar protein index, meat color components (L* and a*, b*), and cooking loss. Consequently, the Safrah and the Majaheem breeds presented the largest dissimilarity as evidenced by their multivariate means. The canonical discriminant analysis allowed an additional understanding of the differentiation between breeds. Furthermore, two large clusters, one formed by Hamrah and Maghateer in one group along with Safrah. These classifications may assign each breed into one cluster considering they are better as meat producers. The Majaheem was clustered alone in another cluster that might be a result of being better as milk producers. Nevertheless, the productivity type of the camel breeds of SA needs further morphology and genetic descriptions.
Hayashi, Hideaki; Shibanoki, Taro; Shima, Keisuke; Kurita, Yuichi; Tsuji, Toshio
2015-12-01
This paper proposes a probabilistic neural network (NN) developed on the basis of time-series discriminant component analysis (TSDCA) that can be used to classify high-dimensional time-series patterns. TSDCA involves the compression of high-dimensional time series into a lower dimensional space using a set of orthogonal transformations and the calculation of posterior probabilities based on a continuous-density hidden Markov model with a Gaussian mixture model expressed in the reduced-dimensional space. The analysis can be incorporated into an NN, which is named a time-series discriminant component network (TSDCN), so that parameters of dimensionality reduction and classification can be obtained simultaneously as network coefficients according to a backpropagation through time-based learning algorithm with the Lagrange multiplier method. The TSDCN is considered to enable high-accuracy classification of high-dimensional time-series patterns and to reduce the computation time taken for network training. The validity of the TSDCN is demonstrated for high-dimensional artificial data and electroencephalogram signals in the experiments conducted during the study.
Directory of Open Access Journals (Sweden)
Andrade Adriano O
2009-11-01
Full Text Available Abstract Background The human body adopts a number of strategies to maintain an upright position. The analysis of the human balance allows for the understanding and identification of such strategies. The displacement of the centre of pressure (COP is a measure that has been successfully employed in studies regarding the postural control. Most of these investigations are related to the analysis of individuals suffering from neuromuscular disorders. Recent studies have shown that the elderly population is growing very fast in many countries all over the world, and therefore, researches that try to understand changes in this group are required. In this context, this study proposes the analysis of the postural control, measured by the displacement of the COP, in groups of young and elderly adults. Methods In total 59 subjects participated of this study. They were divided into seven groups according to their age. The displacement of the COP was collected for each subject standing on a force plate. Two experimental conditions, of 30 seconds each, were investigated: opened eyes and closed eyes. Traditional and recent digital signal processing tools were employed for feature computation from the displacement of the COP. Statistical analyses were carried out in order to identify significant differences between the features computed from the distinct groups that could allow for their discrimination. Results Our results showed that Linear Discrimination Analysis (LDA, which is one of the most popular feature extraction and classifier design techniques, could be successfully employed as a linear transformation, based on the linear combination of standard features for COP analysis, capable of estimating a unique feature, so-called LDA-value, from which it was possible to discriminate the investigated groups and show a high correlation between this feature and age. Conclusion These results show that the analysis of features computed from the displacement of
Kim, Isok
2014-01-01
This study used a path analytic technique to examine associations among critical ethnic awareness, racial discrimination, social support, and depressive symptoms. Using a convenience sample from online survey of Asian American adults (N = 405), the study tested 2 main hypotheses: First, based on the empowerment theory, critical ethnic awareness would be positively associated with racial discrimination experience; and second, based on the social support deterioration model, social support would partially mediate the relationship between racial discrimination and depressive symptoms. The result of the path analysis model showed that the proposed path model was a good fit based on global fit indices, χ²(2) = 4.70, p = .10; root mean square error of approximation = 0.06; comparative fit index = 0.97; Tucker-Lewis index = 0.92; and standardized root mean square residual = 0.03. The examinations of study hypotheses demonstrated that critical ethnic awareness was directly associated (b = .11, p racial discrimination experience, whereas social support had a significant indirect effect (b = .48; bias-corrected 95% confidence interval [0.02, 1.26]) between the racial discrimination experience and depressive symptoms. The proposed path model illustrated that both critical ethnic awareness and social support are important mechanisms for explaining the relationship between racial discrimination and depressive symptoms among this sample of Asian Americans. This study highlights the usefulness of the critical ethnic awareness concept as a way to better understand how Asian Americans might perceive and recognize racial discrimination experiences in relation to its mental health consequences.
Melucci, Dora; Bendini, Alessandra; Tesini, Federica; Barbieri, Sara; Zappi, Alessandro; Vichi, Stefania; Conte, Lanfranco; Gallina Toschi, Tullia
2016-08-01
At present, the geographical origin of extra virgin olive oils can be ensured by documented traceability, although chemical analysis may add information that is useful for possible confirmation. This preliminary study investigated the effectiveness of flash gas chromatography electronic nose and multivariate data analysis to perform rapid screening of commercial extra virgin olive oils characterized by a different geographical origin declared in the label. A comparison with solid phase micro extraction coupled to gas chromatography mass spectrometry was also performed. The new method is suitable to verify the geographic origin of extra virgin olive oils based on principal components analysis and discriminant analysis applied to the volatile profile of the headspace as a fingerprint. The selected variables were suitable in discriminating between "100% Italian" and "non-100% Italian" oils. Partial least squares discriminant analysis also allowed prediction of the degree of membership of unknown samples to the classes examined.
Laursen, K H; Mihailova, A; Kelly, S D; Epov, V N; Bérail, S; Schjoerring, J K; Donard, O F X; Larsen, E H; Pedentchouk, N; Marca-Bell, A D; Halekoh, U; Olesen, J E; Husted, S
2013-12-01
Novel procedures for analytical authentication of organic plant products are urgently needed. Here we present the first study encompassing stable isotopes of hydrogen, carbon, nitrogen, oxygen, magnesium and sulphur as well as compound-specific nitrogen and oxygen isotope analysis of nitrate for discrimination of organically and conventionally grown plants. The study was based on wheat, barley, faba bean and potato produced in rigorously controlled long-term field trials comprising 144 experimental plots. Nitrogen isotope analysis revealed the use of animal manure, but was unable to discriminate between plants that were fertilised with synthetic nitrogen fertilisers or green manures from atmospheric nitrogen fixing legumes. This limitation was bypassed using oxygen isotope analysis of nitrate in potato tubers, while hydrogen isotope analysis allowed complete discrimination of organic and conventional wheat and barley grains. It is concluded, that multi-isotopic analysis has the potential to disclose fraudulent substitutions of organic with conventionally cultivated plants.
a Multivariate Downscaling Model for Nonparametric Simulation of Daily Flows
Molina, J. M.; Ramirez, J. A.; Raff, D. A.
2011-12-01
A multivariate, stochastic nonparametric framework for stepwise disaggregation of seasonal runoff volumes to daily streamflow is presented. The downscaling process is conditional on volumes of spring runoff and large-scale ocean-atmosphere teleconnections and includes a two-level cascade scheme: seasonal-to-monthly disaggregation first followed by monthly-to-daily disaggregation. The non-parametric and assumption-free character of the framework allows consideration of the random nature and nonlinearities of daily flows, which parametric models are unable to account for adequately. This paper examines statistical links between decadal/interannual climatic variations in the Pacific Ocean and hydrologic variability in US northwest region, and includes a periodicity analysis of climate patterns to detect coherences of their cyclic behavior in the frequency domain. We explore the use of such relationships and selected signals (e.g., north Pacific gyre oscillation, southern oscillation, and Pacific decadal oscillation indices, NPGO, SOI and PDO, respectively) in the proposed data-driven framework by means of a combinatorial approach with the aim of simulating improved streamflow sequences when compared with disaggregated series generated from flows alone. A nearest neighbor time series bootstrapping approach is integrated with principal component analysis to resample from the empirical multivariate distribution. A volume-dependent scaling transformation is implemented to guarantee the summability condition. In addition, we present a new and simple algorithm, based on nonparametric resampling, that overcomes the common limitation of lack of preservation of historical correlation between daily flows across months. The downscaling framework presented here is parsimonious in parameters and model assumptions, does not generate negative values, and produces synthetic series that are statistically indistinguishable from the observations. We present evidence showing that both
Nonparametric tests for pathwise properties of semimartingales
Cont, Rama; 10.3150/10-BEJ293
2011-01-01
We propose two nonparametric tests for investigating the pathwise properties of a signal modeled as the sum of a L\\'{e}vy process and a Brownian semimartingale. Using a nonparametric threshold estimator for the continuous component of the quadratic variation, we design a test for the presence of a continuous martingale component in the process and a test for establishing whether the jumps have finite or infinite variation, based on observations on a discrete-time grid. We evaluate the performance of our tests using simulations of various stochastic models and use the tests to investigate the fine structure of the DM/USD exchange rate fluctuations and SPX futures prices. In both cases, our tests reveal the presence of a non-zero Brownian component and a finite variation jump component.
Nonparametric Transient Classification using Adaptive Wavelets
Varughese, Melvin M; Stephanou, Michael; Bassett, Bruce A
2015-01-01
Classifying transients based on multi band light curves is a challenging but crucial problem in the era of GAIA and LSST since the sheer volume of transients will make spectroscopic classification unfeasible. Here we present a nonparametric classifier that uses the transient's light curve measurements to predict its class given training data. It implements two novel components: the first is the use of the BAGIDIS wavelet methodology - a characterization of functional data using hierarchical wavelet coefficients. The second novelty is the introduction of a ranked probability classifier on the wavelet coefficients that handles both the heteroscedasticity of the data in addition to the potential non-representativity of the training set. The ranked classifier is simple and quick to implement while a major advantage of the BAGIDIS wavelets is that they are translation invariant, hence they do not need the light curves to be aligned to extract features. Further, BAGIDIS is nonparametric so it can be used for blind ...
The role of discriminant analysis in the refinement of customer satisfaction assessment
Directory of Open Access Journals (Sweden)
Verdessi BD
2000-01-01
Full Text Available OBJECTIVE: To test discriminant analysis as a method of turning the information of a routine customer satisfaction survey (CSS into a more accurate decision-making tool. METHODS: A 7-question, 10-multiple choice, self-applied questionnaire was used to study a sample of patients seen in two outpatient care units in Valparaíso, Chile, one of primary care (n=100 and the other of secondary care (n=249. Two cutting points were considered in the dependent variable (final satisfaction score: satisfied versus unsatisfied, and very satisfied versus all others. Results were compared with empirical measures (proportion of satisfied individuals, proportion of unsatisfied individuals and size of the median. RESULTS: The response rate was very high, over 97.0% in both units. A new variable, medical attention, was revealed, as explaining satisfaction at the primary care unit. The proportion of the total variability explained by the model was very high (over 99.4% in both units, when comparing satisfied with unsatisfied customers. In the analysis of very satisfied versus all other customers, significant relationship was identified only in the case of the primary care unit, which explained a small proportion of the variability (41.9%. CONCLUSIONS: Discriminant analysis identified relationships not revealed by the previous analysis. It provided information about the proportion of the variability explained by the model. It identified non-significant relationships suggested by empirical analysis (e.g. the case of the relation very satisfied versus others in the secondary care unit. It measured the contribution of each independent variable to the explanation of the variation of the dependent one.
The role of discriminant analysis in the refinement of customer satisfaction assessment
Directory of Open Access Journals (Sweden)
BD Verdessi
2000-12-01
Full Text Available OBJECTIVE: To test discriminant analysis as a method of turning the information of a routine customer satisfaction survey (CSS into a more accurate decision-making tool. METHODS: A 7-question, 10-multiple choice, self-applied questionnaire was used to study a sample of patients seen in two outpatient care units in Valparaíso, Chile, one of primary care (n=100 and the other of secondary care (n=249. Two cutting points were considered in the dependent variable (final satisfaction score: satisfied versus unsatisfied, and very satisfied versus all others. Results were compared with empirical measures (proportion of satisfied individuals, proportion of unsatisfied individuals and size of the median. RESULTS: The response rate was very high, over 97.0% in both units. A new variable, medical attention, was revealed, as explaining satisfaction at the primary care unit. The proportion of the total variability explained by the model was very high (over 99.4% in both units, when comparing satisfied with unsatisfied customers. In the analysis of very satisfied versus all other customers, significant relationship was identified only in the case of the primary care unit, which explained a small proportion of the variability (41.9%. CONCLUSIONS: Discriminant analysis identified relationships not revealed by the previous analysis. It provided information about the proportion of the variability explained by the model. It identified non-significant relationships suggested by empirical analysis (e.g. the case of the relation very satisfied versus others in the secondary care unit. It measured the contribution of each independent variable to the explanation of the variation of the dependent one.
Mallory, Christy; Sears, Brad
2014-01-01
This article documents evidence of recent discrimination against lesbian, gay, bisexual, and transgender (LGBT) public sector workers by analyzing employment discrimination complaints filed with state and local administrative agencies. We present information about 589 complaints of sexual orientation and gender identity discrimination filed by public sector workers in 123 jurisdictions. We find that discrimination against LGBT people in the public sector is pervasive and occurs nearly as freq...
Bayesian nonparametric estimation for Quantum Homodyne Tomography
Naulet, Zacharie; Barat, Eric
2016-01-01
We estimate the quantum state of a light beam from results of quantum homodyne tomography noisy measurements performed on identically prepared quantum systems. We propose two Bayesian nonparametric approaches. The first approach is based on mixture models and is illustrated through simulation examples. The second approach is based on random basis expansions. We study the theoretical performance of the second approach by quantifying the rate of contraction of the posterior distribution around ...
A Non-Parametric Item Response Theory Evaluation of the CAGE Instrument Among Older Adults.
Abdin, Edimansyah; Sagayadevan, Vathsala; Vaingankar, Janhavi Ajit; Picco, Louisa; Chong, Siow Ann; Subramaniam, Mythily
2017-08-04
The validity of the CAGE using item response theory (IRT) has not yet been examined in older adult population. This study aims to investigate the psychometric properties of the CAGE using both non-parametric and parametric IRT models, assess whether there is any differential item functioning (DIF) by age, gender and ethnicity and examine the measurement precision at the cut-off scores. We used data from the Well-being of the Singapore Elderly study to conduct Mokken scaling analysis (MSA), dichotomous Rasch and 2-parameter logistic IRT models. The measurement precision at the cut-off scores were evaluated using classification accuracy (CA) and classification consistency (CC). The MSA showed the overall scalability H index was 0.459, indicating a medium performing instrument. All items were found to be homogenous, measuring the same construct and able to discriminate well between respondents with high levels of the construct and the ones with lower levels. The item discrimination ranged from 1.07 to 6.73 while the item difficulty ranged from 0.33 to 2.80. Significant DIF was found for 2-item across ethnic group. More than 90% (CC and CA ranged from 92.5% to 94.3%) of the respondents were consistently and accurately classified by the CAGE cut-off scores of 2 and 3. The current study provides new evidence on the validity of the CAGE from the IRT perspective. This study provides valuable information of each item in the assessment of the overall severity of alcohol problem and the precision of the cut-off scores in older adult population.
Directory of Open Access Journals (Sweden)
I. Crawford
2015-07-01
Full Text Available In this paper we present improved methods for discriminating and quantifying Primary Biological Aerosol Particles (PBAP by applying hierarchical agglomerative cluster analysis to multi-parameter ultra violet-light induced fluorescence (UV-LIF spectrometer data. The methods employed in this study can be applied to data sets in excess of 1×106 points on a desktop computer, allowing for each fluorescent particle in a dataset to be explicitly clustered. This reduces the potential for misattribution found in subsampling and comparative attribution methods used in previous approaches, improving our capacity to discriminate and quantify PBAP meta-classes. We evaluate the performance of several hierarchical agglomerative cluster analysis linkages and data normalisation methods using laboratory samples of known particle types and an ambient dataset. Fluorescent and non-fluorescent polystyrene latex spheres were sampled with a Wideband Integrated Bioaerosol Spectrometer (WIBS-4 where the optical size, asymmetry factor and fluorescent measurements were used as inputs to the analysis package. It was found that the Ward linkage with z-score or range normalisation performed best, correctly attributing 98 and 98.1 % of the data points respectively. The best performing methods were applied to the BEACHON-RoMBAS ambient dataset where it was found that the z-score and range normalisation methods yield similar results with each method producing clusters representative of fungal spores and bacterial aerosol, consistent with previous results. The z-score result was compared to clusters generated with previous approaches (WIBS AnalysiS Program, WASP where we observe that the subsampling and comparative attribution method employed by WASP results in the overestimation of the fungal spore concentration by a factor of 1.5 and the underestimation of bacterial aerosol concentration by a factor of 5. We suggest that this likely due to errors arising from misatrribution
Directory of Open Access Journals (Sweden)
I. Crawford
2015-11-01
Full Text Available In this paper we present improved methods for discriminating and quantifying primary biological aerosol particles (PBAPs by applying hierarchical agglomerative cluster analysis to multi-parameter ultraviolet-light-induced fluorescence (UV-LIF spectrometer data. The methods employed in this study can be applied to data sets in excess of 1 × 106 points on a desktop computer, allowing for each fluorescent particle in a data set to be explicitly clustered. This reduces the potential for misattribution found in subsampling and comparative attribution methods used in previous approaches, improving our capacity to discriminate and quantify PBAP meta-classes. We evaluate the performance of several hierarchical agglomerative cluster analysis linkages and data normalisation methods using laboratory samples of known particle types and an ambient data set. Fluorescent and non-fluorescent polystyrene latex spheres were sampled with a Wideband Integrated Bioaerosol Spectrometer (WIBS-4 where the optical size, asymmetry factor and fluorescent measurements were used as inputs to the analysis package. It was found that the Ward linkage with z-score or range normalisation performed best, correctly attributing 98 and 98.1 % of the data points respectively. The best-performing methods were applied to the BEACHON-RoMBAS (Bio–hydro–atmosphere interactions of Energy, Aerosols, Carbon, H2O, Organics and Nitrogen–Rocky Mountain Biogenic Aerosol Study ambient data set, where it was found that the z-score and range normalisation methods yield similar results, with each method producing clusters representative of fungal spores and bacterial aerosol, consistent with previous results. The z-score result was compared to clusters generated with previous approaches (WIBS AnalysiS Program, WASP where we observe that the subsampling and comparative attribution method employed by WASP results in the overestimation of the fungal spore concentration by a factor of 1.5 and the
Two simple fingerprinting methods, flow-injection UV spectroscopy (FIUV) and 1H nuclear magnetic resonance (NMR), for discrimination of Aurantii FructusImmaturus and Fructus Poniciri TrifoliataeImmaturususing were described. Both methods were combined with partial least-squares discriminant analysis...
Akhtar, Naveed; Mian, Ajmal
2017-10-03
We present a principled approach to learn a discriminative dictionary along a linear classifier for hyperspectral classification. Our approach places Gaussian Process priors over the dictionary to account for the relative smoothness of the natural spectra, whereas the classifier parameters are sampled from multivariate Gaussians. We employ two Beta-Bernoulli processes to jointly infer the dictionary and the classifier. These processes are coupled under the same sets of Bernoulli distributions. In our approach, these distributions signify the frequency of the dictionary atom usage in representing class-specific training spectra, which also makes the dictionary discriminative. Due to the coupling between the dictionary and the classifier, the popularity of the atoms for representing different classes gets encoded into the classifier. This helps in predicting the class labels of test spectra that are first represented over the dictionary by solving a simultaneous sparse optimization problem. The labels of the spectra are predicted by feeding the resulting representations to the classifier. Our approach exploits the nonparametric Bayesian framework to automatically infer the dictionary size--the key parameter in discriminative dictionary learning. Moreover, it also has the desirable property of adaptively learning the association between the dictionary atoms and the class labels by itself. We use Gibbs sampling to infer the posterior probability distributions over the dictionary and the classifier under the proposed model, for which, we derive analytical expressions. To establish the effectiveness of our approach, we test it on benchmark hyperspectral images. The classification performance is compared with the state-of-the-art dictionary learning-based classification methods.
portfolio optimization based on nonparametric estimation methods
Directory of Open Access Journals (Sweden)
mahsa ghandehari
2017-03-01
Full Text Available One of the major issues investors are facing with in capital markets is decision making about select an appropriate stock exchange for investing and selecting an optimal portfolio. This process is done through the risk and expected return assessment. On the other hand in portfolio selection problem if the assets expected returns are normally distributed, variance and standard deviation are used as a risk measure. But, the expected returns on assets are not necessarily normal and sometimes have dramatic differences from normal distribution. This paper with the introduction of conditional value at risk ( CVaR, as a measure of risk in a nonparametric framework, for a given expected return, offers the optimal portfolio and this method is compared with the linear programming method. The data used in this study consists of monthly returns of 15 companies selected from the top 50 companies in Tehran Stock Exchange during the winter of 1392 which is considered from April of 1388 to June of 1393. The results of this study show the superiority of nonparametric method over the linear programming method and the nonparametric method is much faster than the linear programming method.
Onishi, Akinari; Natsume, Kiyohisa
2013-01-01
This paper demonstrates a better classification performance of an ensemble classifier using a regularized linear discriminant analysis (LDA) for P300-based brain-computer interface (BCI). The ensemble classifier with an LDA is sensitive to the lack of training data because covariance matrices are estimated imprecisely. One of the solution against the lack of training data is to employ a regularized LDA. Thus we employed the regularized LDA for the ensemble classifier of the P300-based BCI. The principal component analysis (PCA) was used for the dimension reduction. As a result, an ensemble regularized LDA classifier showed significantly better classification performance than an ensemble un-regularized LDA classifier. Therefore the proposed ensemble regularized LDA classifier is robust against the lack of training data.
Energy Technology Data Exchange (ETDEWEB)
Vila, Anna [Department de Pintura, Conservacio-Restauracio, Facultat de Belles Arts, Universitat de Barcelona, C/Pau Gargallo 4, 08028 Barcelona (Spain)]. E-mail: avila@sct.ub.es; Ferrer, Nuria [Serveis Cientificotecnics, Universitat de Barcelona, C/Lluis Sole i Sabaris 1, 08028 Barcelona (Spain)]. E-mail: nferrer@sctub.es; Garcia, Jose F. [Department de Pintura, Conservacio-Restauracio, Facultat de Belles Arts, Universitat de Barcelona, C/Pau Gargallo 4, 08028 Barcelona (Spain)]. E-mail: ifgarcia@ub.edu
2007-04-04
Prints are the most popular artistic technique. Due to their manufacturing procedure, they are also one of the most frequently falsified types of artwork. In terms of their economic and historic value, the chemical analysis and characterisation of coloured inks and their principal constituent materials (pigments), together with the historical and aesthetic information available in the Catalogues Raisonees, are important tools in distinguishing originals from non-original prints. The chemical characterisation and discrimination of coloured inks has test in this study. Analysis using Fourier transform infrared spectroscopy (FTIR), Scanning electron microscopy (SEM) and X-ray diffraction (XRD) has been done on blue pigments and inks, due to this colour is one of the most representative for the presence of organic and inorganic materials in their composition. Conclusion obtained for this colour would demonstrate the capability of the approach when it is applied to any other coloured set of inks.
Hayashi, Hideaki; Shima, Keisuke; Shibanoki, Taro; Kurita, Yuichi; Tsuji, Toshio
2013-01-01
This paper outlines a probabilistic neural network developed on the basis of time-series discriminant component analysis (TSDCA) that can be used to classify high-dimensional time-series patterns. TSDCA involves the compression of high-dimensional time series into a lower-dimensional space using a set of orthogonal transformations and the calculation of posterior probabilities based on a continuous-density hidden Markov model that incorporates a Gaussian mixture model expressed in the reduced-dimensional space. The analysis can be incorporated into a neural network so that parameters can be obtained appropriately as network coefficients according to backpropagation-through-time-based training algorithm. The network is considered to enable high-accuracy classification of high-dimensional time-series patterns and to reduce the computation time taken for network training. In the experiments conducted during the study, the validity of the proposed network was demonstrated for EEG signals.
DEFF Research Database (Denmark)
Askjær, Sune; Langgård, Morten
2008-01-01
fingerprints proved superior to the TGT and TGD fingerprints. Examples of SAR elucidation based on PLS-DA model interpretation of model coefficients using a reversible pharmacophore fingerprint are given. In addition, we tested the hypothesis that feature combinations coming from the analysis of two...... the lead optimization toward a final drug candidate. This paper presents a combined approach to solving these two problems of ligand-based virtual screening and elucidation of SAR based on interplay between pharmacophore fingerprints and interpretation of PLS-discriminant analysis (PLS-DA) models....... The virtual screening capability of the PLS-DA method is compared to group fusion maximum similarity searching in a test using four graph-based pharmacophore fingerprints over a range of 10 diverse targets. The PLS-DA method was generally found to do better than the Smax method. The GpiDAPH3 and PCH...
A Unified Factors Analysis Framework for Discriminative Feature Extraction and Object Recognition
Directory of Open Access Journals (Sweden)
Ningbo Hao
2016-01-01
Full Text Available Various methods for feature extraction and dimensionality reduction have been proposed in recent decades, including supervised and unsupervised methods and linear and nonlinear methods. Despite the different motivations of these methods, we present in this paper a general formulation known as factor analysis to unify them within a common framework. During factor analysis, an object can be seen as being comprised of content and style factors, and the objective of feature extraction and dimensionality reduction is to obtain the content factor without style factor. There are two vital steps in factor analysis framework; one is the design of factor separating objective function, including the design of partition and weight matrix, and the other is the design of space mapping function. In this paper, classical Linear Discriminant Analysis (LDA and Locality Preserving Projection (LPP algorithms are improved based on factor analysis framework, and LDA based on factor analysis (FA-LDA and LPP based on factor analysis (FA-LPP are proposed. Experimental results show the superiority of our proposed approach in classification performance compared to classical LDA and LPP algorithms.
Mass discrimination in elastic recoil detection analysis and its application to Al2O3 on MoS2
Laricchiuta, G.; Vandervorst, W.; Meersschaut, J.
2017-09-01
A time of flight-energy (TOF-E) telescope is often used to detect the scattered and recoiled atoms in elastic recoil detection analysis. The experimental two-dimensional TOF-E histogram may be numerically transformed into a time of flight-mass (TOF-M) histogram. The limited mass resolution in the TOF-M histogram, which results from the limited energy resolution of the energy detector, makes it sometimes difficult to discriminate elements with a small difference in atomic mass. We describe a mass discrimination procedure to numerically discriminate the elements in the TOF-M histogram. The procedure is illustrated on a sample consisting of an Al and a Si layer deposited on a MgO substrate. Besides, we apply the procedure to discriminate Al and Si in a sample consisting of Al2O3 deposited on MoS2/SiO2/Si.
DEFF Research Database (Denmark)
Laursen, K.H.; Mihailova, A.; Kelly, S.D.
2013-01-01
for discrimination of organically and conventionally grown plants. The study was based on wheat, barley, faba bean and potato produced in rigorously controlled long-term field trials comprising 144 experimental plots. Nitrogen isotope analysis revealed the use of animal manure, but was unable to discriminate between......Novel procedures for analytical authentication of organic plant products are urgently needed. Here we present the first study encompassing stable isotopes of hydrogen, carbon, nitrogen, oxygen, magnesium and sulphur as well as compound-specific nitrogen and oxygen isotope analysis of nitrate...... plants that were fertilised with synthetic nitrogen fertilisers or green manures from atmospheric nitrogen fixing legumes. This limitation was bypassed using oxygen isotope analysis of nitrate in potato tubers, while hydrogen isotope analysis allowed complete discrimination of organic and conventional...
Tang, Liying; Wu, Hongwei; Zhou, Xidan; Xu, Yilong; Zhou, Guohong; Wang, Ting; Kou, Zhenzhen; Wang, Zhuju
2015-07-01
A simple and efficient high-performance liquid chromatography fingerprint method was developed to discriminate Semen cassiae from two related species: Cassia obtusifolia L. (CO) and Cassia tora L. (CT), the seeds of which are abbreviated as COS and CTS, respectively. 22 major bioactive ingredients in 42 samples (20 COS and 22 CTS) collected from different provinces of China were identified. The statistical methods included similarity analysis and partial least-squares discriminant analysis. The pattern analysis method was specific and could be readily used for the comprehensive evaluation of Semen cassiae samples. Therefore, high-performance liquid chromatography fingerprint in combination with pattern analysis provided a simple and reliable method for discriminating between COS and CTS. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.
... in Genetics Archive Regulation of Genetic Tests Genetic Discrimination Overview Many Americans fear that participating in research ... I) and employment (Title II). Read more Genetic Discrimination and Other Laws Genetic Discrimination and Other Laws ...
Directory of Open Access Journals (Sweden)
Shekhar Suraj Kushe
2015-12-01
Full Text Available Packaging which is often called as the ‘silent salesman’ is an important component of marketing. Today the importance of packaging has risen to such an extent that product packaging is rightly called as the fifth ‘P’ of marketing mix. FMCG are products which are utilized by large number of people. The present study examined the discriminating power of five selected FMCG packaging variables namely ‘picture’, ‘colour’, ‘size’, ‘shape’ and ‘material’ amidst those who purchased FMCG based on these packaging variables and for those who purchased FMCG not based on these packaging variables. Descriptive research was carried out in the study. Respondents (students were asked to rate four packaging variable on a five point Likert’s scale. Discriminant analysis showed that only two variables namely ‘Colour’ (.706 and ‘Shape’ (–.527 were good predictors. Variables ‘Picture’, ‘size’ and ‘material’ were considered as poor predictors as far as the student communities were considered. The cross validated classification showed that out of the 240 samples drawn, 91.8% of the cases were correctly classified.
Peckmann, Tanya R; Orr, Kayla; Meek, Susan; Manolis, Sotiris K
2015-07-01
The determination of sex is an important part of building the biological profile for unknown human remains. Many of the bones traditionally used for the determination of sex are often found fragmented or incomplete in forensic and archaeological cases. The goal of the present research was to derive discriminant function equations from the talus, a preservationally favoured bone, for sexing skeletons from a contemporary Greek population. Nine parameters were measured on 182 individuals (96 males and 86 females) from the University of Athens Human Skeletal Reference Collection. The individuals ranged in age from 20 to 99 years old. The statistical analyses showed that all measured parameters were sexually dimorphic. Discriminant function score equations were generated for use in sex determination. The average accuracy of sex classification ranged from 65.2% to 93.4% for the univariate analysis, 90%-96.5% for the direct method and 86.7% for the stepwise method. Comparisons to other populations were made. Overall, the cross-validated accuracies ranged from 65.5% to 83.2% and males were most often correctly identified. The talus was shown to be useful for sex determination in the modern Greek population.
Peckmann, Tanya R; Orr, Kayla; Meek, Susan; Manolis, Sotiris K
2015-12-01
The skull and post-cranium have been used for the determination of sex for unknown human remains. However, in forensic cases where skeletal remains often exhibit postmortem damage and taphonomic changes the calcaneus may be used for the determination of sex as it is a preservationally favored bone. The goal of the present research was to derive discriminant function equations from the calcaneus for estimation of sex from a contemporary Greek population. Nine parameters were measured on 198 individuals (103 males and 95 females), ranging in age from 20 to 99 years old, from the University of Athens Human Skeletal Reference Collection. The statistical analyses showed that all variables were sexually dimorphic. Discriminant function score equations were generated for use in sex determination. The average accuracy of sex classification ranged from 70% to 90% for the univariate analysis, 82.9% to 87.5% for the direct method, and 86.2% for the stepwise method. Comparisons to other populations were made. Overall, the cross-validated accuracies ranged from 48.6% to 56.1% with males most often identified correctly and females most often misidentified. The calcaneus was shown to be useful for sex determination in the twentieth century Greek population.
Malo, Sergio; Fateri, Sina; Livadas, Makis; Mares, Cristinel; Gan, Tat-Hean
2017-07-01
Ultrasonic guided waves testing is a technique successfully used in many industrial scenarios worldwide. For many complex applications, the dispersive nature and multimode behavior of the technique still poses a challenge for correct defect detection capabilities. In order to improve the performance of the guided waves, a 2-D compressed pulse analysis is presented in this paper. This novel technique combines the use of pulse compression and dispersion compensation in order to improve the signal-to-noise ratio (SNR) and temporal-spatial resolution of the signals. The ability of the technique to discriminate different wave modes is also highlighted. In addition, an iterative algorithm is developed to identify the wave modes of interest using adaptive peak detection to enable automatic wave mode discrimination. The employed algorithm is developed in order to pave the way for further in situ applications. The performance of Barker-coded and chirp waveforms is studied in a multimodal scenario where longitudinal and flexural wave packets are superposed. The technique is tested in both synthetic and experimental conditions. The enhancements in SNR and temporal resolution are quantified as well as their ability to accurately calculate the propagation distance for different wave modes.
Directory of Open Access Journals (Sweden)
S. Kordrostami
2013-06-01
Full Text Available The appropriate choice of input-output weights is necessary to have a successful DEA model. Generally, if the number of DMUs i.e., n, is less than number of inputs and outputs i.e., m+s, then many of DMUs are introduced as efficient then the discrimination between DMUs is not possible. Besides, DEA models are free to choose the best weights. For resolving the problems that are resulted from freedom of weights, some constraints are set on the input-output weights. Symmetric weight constraints are a kind of weight constrains. In this paper, we represent a new model based on a multi-criterion data envelopment analysis (MCDEA are developed to moderate the homogeneity of weights distribution by using symmetric weight constrains.Consequently, we show that the improvement of the dispersal of unrealistic input-output weights and the increasing discrimination power for our suggested models. Finally, as an application of the new model, we use this model to evaluate and ranking guilan selected hospitals.
Institute of Scientific and Technical Information of China (English)
无
2006-01-01
At different times over the past 30 years in Zhejiang Province, China, the coastal tidelands have been successively enclosed and reclaimed for agricultural land use. The purpose of this work was to evaluate whether laboratory hyperspectral data might be used to estimate the physicochemical characteristics of these reclaimed saline soils. A coastal region of Shangyu City (Zhejiang Province), which was grouped into four subzones according to reclamation history, was used as the study area, and soil samples were collected in each subzone. Physicochemical analyses showed that the soils were characterized by high electrical conductivity and sand content with low organic matter; the longer the saline lands had been reclaimed, the lower were the electrical conductivity and sand content and the higher the organic matter content.These changing trends of soil chemical and physical properties were found in laboratory reflectance spectra of soil samples and their first-order derivative curves. Stepwise discriminant analysis (SDA) identified six salient spectral bands at 488,530, 670, 880, 1 400, and 1 900 nm. Using derived discriminant functions for saline lands with different historical years of reclamation, classification revealed an overall accuracy from a self-test of 86.6% and from cross-validation of 89.3%.Therefore, as opposed to time-consuming field investigations, this study suggested that remotely sensed hyperspectral data could serve as a promising measure to assess the reclamation levels of coastal saline lands.
Sugimoto, K; Hashimoto, Y; Fukuike, C; Kodama, N; Minagi, S
2014-03-01
Because food texture is regarded as an important factor for smooth deglutition, identification of objective parameters that could provide a basis for food texture selection for elderly or dysphagic patients is of great importance. We aimed to develop an objective evaluation method of mastication using a mixed test food comprising foodstuffs, simulating daily dietary life. The particle size distribution (>2 mm in diameter) in a bolus was analysed using a digital image under dark-field illumination. Ten female participants (mean age ± s.d., 27·6 ± 2·6 years) masticated a mixed test food comprising prescribed amounts of rice, sausage, hard omelette, raw cabbage and raw cucumber with 100%, 75%, 50% and 25% of the number of their masticatory strokes. A single set of coefficient thresholds of 0·10 for the homogeneity index and 1·62 for the particle size index showed excellent discrimination of deficient masticatory conditions with high sensitivity (0·90) and specificity (0·77). Based on the results of this study, normal mastication was discriminated from deficient masticatory conditions using a large particle analysis of mixed foodstuffs, thus showing the possibility of future application of this method for objective decision-making regarding the properties of meals served to dysphagic patients.
Discriminative analysis of brain functional connectivity patterns for mental fatigue classification.
Sun, Yu; Lim, Julian; Meng, Jianjun; Kwok, Kenneth; Thakor, Nitish; Bezerianos, Anastasios
2014-10-01
Mental fatigue is a commonly experienced state that can be induced by placing heavy demands on cognitive systems. This often leads to lowered productivity and increased safety risks. In this study, we developed a functional-connectivity based mental fatigue monitoring method. Twenty-six subjects underwent a 20-min mentally demanding test of sustained attention with high-resolution EEG monitoring. Functional connectivity patterns were obtained on the cortical surface via source localization of cortical activities in the first and last 5-min quartiles of the experiment. Multivariate pattern analysis was then adopted to extract the highly discriminative functional connectivity information. The algorithm used in the present study demonstrated an overall accuracy of 81.5% (p fatigue classification through leave-one-out cross validation. Moreover, we found that the most discriminative connectivity features were located in or across middle frontal gyrus and several motor areas, in agreement with the important role that these cortical regions play in the maintenance of sustained attention. This work therefore demonstrates the feasibility of a functional-connectivity-based mental fatigue assessment method, opening up a new avenue for modeling natural brain dynamics under different mental states. Our method has potential applications in several domains, including traffic and industrial safety.
Li, Liang; Ding, Wu
2010-05-01
In order to find out a fast measure method of adulterated milk based on near infrared spectroscopy, milk adulterated with plant butter, vegetable protein and starch was collected respectively. Using Fourier transform near infrared spectroscopy to scan the samples, the spectrum data were obtained. The samples were scanned in the spectral region between 4 000 and 12 000 cm(-1) by FT-NIR spectrometer with an optic fiber of 2 mm path-length and an InGaAs detector. Then all data were analyzed by principal component analysis combined with Fisher line discriminant analysis (FLDA) and partial least squares (PLS). Results show that the accumulative reliabilities of the first six components were more than 99%, so the first six components were applied as FLDA inputs and the values of the type of milk were applied as the outputs. An adulterated milk qualitative discriminant model based on Fisher line discriminant analysis was developed finally. The result indicated that the accuracy of detection of calibration samples is 97.78%. The unknown test samples were tested by this model and the correct identification rate is 94.44%. Partial least square models for detecting the content of material added to raw milk were set up with good veracity. The predictive correlation coefficient (R2) of calibration sets of milk adulterated with plant butter, vegetable protein and starch are 99.08%, 99.96% and 99.39%, respectively, while the root mean square errors of cross validation (RMSECV) of the three calibration sets are 0.304%, 0.013 5% and 0.060%, respectively. The R2 of validation sets of the three kinds of adulterated milk are 98.50%, 99.94% and 98.50%, respectively, while the root mean square errors of prediction (RMSEP) of the three validation sets are 0.323%, 0.028 8% and 0.068%, respectively. All of these suggested that near infrared spectroscopy has good potential for rapid qualitative and quantitative detection of milk adulterated with botanical filling material.
Nonparametric Analyses of Log-Periodic Precursors to Financial Crashes
Zhou, Wei-Xing; Sornette, Didier
We apply two nonparametric methods to further test the hypothesis that log-periodicity characterizes the detrended price trajectory of large financial indices prior to financial crashes or strong corrections. The term "parametric" refers here to the use of the log-periodic power law formula to fit the data; in contrast, "nonparametric" refers to the use of general tools such as Fourier transform, and in the present case the Hilbert transform and the so-called (H, q)-analysis. The analysis using the (H, q)-derivative is applied to seven time series ending with the October 1987 crash, the October 1997 correction and the April 2000 crash of the Dow Jones Industrial Average (DJIA), the Standard & Poor 500 and Nasdaq indices. The Hilbert transform is applied to two detrended price time series in terms of the ln(tc-t) variable, where tc is the time of the crash. Taking all results together, we find strong evidence for a universal fundamental log-frequency f=1.02±0.05 corresponding to the scaling ratio λ=2.67±0.12. These values are in very good agreement with those obtained in earlier works with different parametric techniques. This note is extracted from a long unpublished report with 58 figures available at , which extensively describes the evidence we have accumulated on these seven time series, in particular by presenting all relevant details so that the reader can judge for himself or herself the validity and robustness of the results.
Wavelet Estimators in Nonparametric Regression: A Comparative Simulation Study
Directory of Open Access Journals (Sweden)
Anestis Antoniadis
2001-06-01
Full Text Available Wavelet analysis has been found to be a powerful tool for the nonparametric estimation of spatially-variable objects. We discuss in detail wavelet methods in nonparametric regression, where the data are modelled as observations of a signal contaminated with additive Gaussian noise, and provide an extensive review of the vast literature of wavelet shrinkage and wavelet thresholding estimators developed to denoise such data. These estimators arise from a wide range of classical and empirical Bayes methods treating either individual or blocks of wavelet coefficients. We compare various estimators in an extensive simulation study on a variety of sample sizes, test functions, signal-to-noise ratios and wavelet filters. Because there is no single criterion that can adequately summarise the behaviour of an estimator, we use various criteria to measure performance in finite sample situations. Insight into the performance of these estimators is obtained from graphical outputs and numerical tables. In order to provide some hints of how these estimators should be used to analyse real data sets, a detailed practical step-by-step illustration of a wavelet denoising analysis on electrical consumption is provided. Matlab codes are provided so that all figures and tables in this paper can be reproduced.
Novák, Márton; Palya, Dóra; Bodai, Zsolt; Nyiri, Zoltán; Magyar, Norbert; Kovács, József; Eke, Zsuzsanna
2017-01-01
Combined cluster and discriminant analysis (CCDA) as a chemometric tool in compound specific isotope analysis of diesel fuels was studied. The stable carbon isotope ratios (δ(13)C) of n-alkanes in diesel fuel can be used to characterize or differentiate diesels originating from different sources. We investigated 25 diesel fuel samples representing 20 different brands. The samples were collected from 25 different service stations in 11 European countries over a 2 year period. The n-alkane fraction of diesel fuels was separated using solid-state urea clathrate formation combined with silica gel fractionation. The stable carbon isotope ratios of C10-C24 n-alkanes were measured with gas chromatography-isotope ratio mass spectrometry (GC-IRMS) using perdeuterated n-alkanes as internal standards. Beside the 25 samples one additional diesel fuel was prepared and measured three times to get totally homogenous samples in order to test the performance of our analytical and statistical routine. Stable isotope ratio data were evaluated with hierarchical cluster analysis (HCA), principal component analysis (PCA) and CCDA. CCDA combines two multivariate data analysis methods hierarchical cluster analysis with linear discriminant analysis (LDA). The main idea behind CCDA is to compare the goodness of preconceived (based on the sample origins) and random groupings. In CCDA all the samples were compared pairwise. The results for the parallel sample preparations showed that the analytical procedure does not have any significant effect on the δ(13)C values of n-alkanes. The three parallels proved to be totally homogenous with CCDA. HCA and PCA can be useful tools when the examining of the relationship among several samples is in question. However, these two techniques cannot be always decisive on the origin of similar samples. The initial hypothesis that all diesel fuel samples are considered chemically unique was verified by CCDA. The main advantage of CCDA is that it gives an
Murphy, Thomas Brendan; Dean, Nema; Raftery, Adrian E
2010-03-01
Food authenticity studies are concerned with determining if food samples have been correctly labelled or not. Discriminant analysis methods are an integral part of the methodology for food authentication. Motivated by food authenticity applications, a model-based discriminant analysis method that includes variable selection is presented. The discriminant analysis model is fitted in a semi-supervised manner using both labeled and unlabeled data. The method is shown to give excellent classification performance on several high-dimensional multiclass food authenticity datasets with more variables than observations. The variables selected by the proposed method provide information about which variables are meaningful for classification purposes. A headlong search strategy for variable selection is shown to be efficient in terms of computation and achieves excellent classification performance. In applications to several food authenticity datasets, our proposed method outperformed default implementations of Random Forests, AdaBoost, transductive SVMs and Bayesian Multinomial Regression by substantial margins.
Agasisti, Tommaso
2011-01-01
The objective of this paper is an efficiency analysis concerning higher education systems in European countries. Data have been extracted from OECD data-sets (Education at a Glance, several years), using a non-parametric technique--data envelopment analysis--to calculate efficiency scores. This paper represents the first attempt to conduct such an…
Maroco, João; Silva, Dina; Rodrigues, Ana; Guerreiro, Manuela; Santana, Isabel; de Mendonça, Alexandre
2011-08-17
Dementia and cognitive impairment associated with aging are a major medical and social concern. Neuropsychological testing is a key element in the diagnostic procedures of Mild Cognitive Impairment (MCI), but has presently a limited value in the prediction of progression to dementia. We advance the hypothesis that newer statistical classification methods derived from data mining and machine learning methods like Neural Networks, Support Vector Machines and Random Forests can improve accuracy, sensitivity and specificity of predictions obtained from neuropsychological testing. Seven non parametric classifiers derived from data mining methods (Multilayer Perceptrons Neural Networks, Radial Basis Function Neural Networks, Support Vector Machines, CART, CHAID and QUEST Classification Trees and Random Forests) were compared to three traditional classifiers (Linear Discriminant Analysis, Quadratic Discriminant Analysis and Logistic Regression) in terms of overall classification accuracy, specificity, sensitivity, Area under the ROC curve and Press'Q. Model predictors were 10 neuropsychological tests currently used in the diagnosis of dementia. Statistical distributions of classification parameters obtained from a 5-fold cross-validation were compared using the Friedman's nonparametric test. Press' Q test showed that all classifiers performed better than chance alone (p Machines showed the larger overall classification accuracy (Median (Me) = 0.76) an area under the ROC (Me = 0.90). However this method showed high specificity (Me = 1.0) but low sensitivity (Me = 0.3). Random Forest ranked second in overall accuracy (Me = 0.73) with high area under the ROC (Me = 0.73) specificity (Me = 0.73) and sensitivity (Me = 0.64). Linear Discriminant Analysis also showed acceptable overall accuracy (Me = 0.66), with acceptable area under the ROC (Me = 0.72) specificity (Me = 0.66) and sensitivity (Me = 0.64). The remaining classifiers showed overall classification accuracy above a
Lorenzo-Blanco, Elma I; Unger, Jennifer B; Ritt-Olson, Anamara; Soto, Daniel; Baezconde-Garbanati, Lourdes
2013-05-01
Risk for smoking initiation increases as Hispanic youth acculturate to U.S. society, and this association seems to be stronger for Hispanic girls than boys. To better understand the influence of culture, family, and everyday discrimination on cigarette smoking, we tested a process-oriented model of acculturation and cigarette smoking. Data came from Project RED (Reteniendo y Entendiendo Diversidad para Salud), which included 1,436 Hispanic students (54% girls) from Southern California. We used data from 9th to 11th grade (85% were 14 years old, and 86% were U.S. born) to test the influence of acculturation-related experiences on smoking over time. Multigroup structural equation analysis suggested that acculturation was associated with increased familismo and lower traditional gender roles, and enculturation was linked more with familismo and respeto. Familismo, respeto, and traditional gender roles were linked with lower family conflict and increased family cohesion, and these links were stronger for girls. Familismo and respeto were further associated with lower discrimination. Conversely, fatalismo was linked with worse family functioning (especially for boys) and increased discrimination in both the groups. Discrimination was the only predictor of smoking for boys and girls. In all, the results of the current study indicate that reducing discrimination and helping youth cope with discrimination may prevent or reduce smoking in Hispanic boys and girls. This may be achieved by promoting familismo and respeto and by discouraging fatalistic beliefs.
Pagliaro, Antonio; D'Alí Staiti, G.; D'Anna, F.
2011-03-01
We present a new method for the identification of extensive air showers initiated by different primaries. The method uses the multiscale concept and is based on the analysis of multifractal behaviour and lacunarity of secondary particle distributions together with a properly designed and trained artificial neural network. In the present work the method is discussed and applied to a set of fully simulated vertical showers, in the experimental framework of ARGO-YBJ, to obtain hadron to gamma primary separation. We show that the presented approach gives very good results, leading, in the 1-10 TeV energy range, to a clear improvement of the discrimination power with respect to the existing figures for extended shower detectors.
Energy Technology Data Exchange (ETDEWEB)
Pagliaro, Antonio, E-mail: pagliaro@ifc.inaf.it [Istituto di Astrofisica Spaziale e Fisica Cosmica di Palermo - Istituto Nazionale di Astrofisica, Via Ugo La Malfa 153, 90146 Palermo (Italy); Istituto Nazionale di Fisica Nucleare, Sezione di Catania, Viale A. Doria 6, 95125 Catania (Italy); D' Ali Staiti, G. [Universita degli Studi di Palermo, Dipartimento di Fisica e Tecnologie Relative, Viale delle Scienze, Edificio 18, 90128 Palermo (Italy); Istituto Nazionale di Fisica Nucleare, Sezione di Catania, Viale A. Doria 6, 95125 Catania (Italy); D' Anna, F. [Istituto di Astrofisica Spaziale e Fisica Cosmica di Palermo - Istituto Nazionale di Astrofisica, Via Ugo La Malfa 153, 90146 Palermo (Italy)
2011-03-15
We present a new method for the identification of extensive air showers initiated by different primaries. The method uses the multiscale concept and is based on the analysis of multifractal behaviour and lacunarity of secondary particle distributions together with a properly designed and trained artificial neural network. In the present work the method is discussed and applied to a set of fully simulated vertical showers, in the experimental framework of ARGO-YBJ, to obtain hadron to gamma primary separation. We show that the presented approach gives very good results, leading, in the 1-10 TeV energy range, to a clear improvement of the discrimination power with respect to the existing figures for extended shower detectors.
GSK-3β polymorphism discriminates bipolar disorder and schizophrenia: a systematic meta-analysis.
Tang, Hui; Shen, Na; Jin, Huijuan; Liu, Dan; Miao, Xiaoping; Zhu, Ling-Qiang
2013-12-01
Glycogen synthase kinase 3 (GSK-3) is a well-known conserved and ubiquitous protein kinase and playing a pivotal role in neurodevelopment, neurogenesis, learning/memory, and neuronal cell death. Dysfunction of GSK-3 had been seen in multiple neurodegenerative and psychiatric diseases. Bipolar disorder and schizophrenia are two common psychiatric diseases first occur in adolescence or young adulthood. They share similar risk genes as well as clinical symptoms, which make it is difficult to be discriminated from each other. Here, by using meta-analysis we reported that glycogen synthase kinase 3β promoter inactive mutant rs334558 may contribute to the development of schizophrenia not bipolar disorder. This might be used to distinguish these two diseases.
A Multimodal Biometric System Using Linear Discriminant Analysis For Improved Performance
Khan, Aamir; Khurshid, Aasim; Akram, Adeel
2012-01-01
Essentially a biometric system is a pattern recognition system which recognizes a user by determining the authenticity of a specific anatomical or behavioral characteristic possessed by the user. With the ever increasing integration of computers and Internet into daily life style, it has become necessary to protect sensitive and personal data. This paper proposes a multimodal biometric system which incorporates more than one biometric trait to attain higher security and to handle failure to enroll situations for some users. This paper is aimed at investigating a multimodal biometric identity system using Linear Discriminant Analysis as backbone to both facial and speech recognition and implementing such system in real-time using SignalWAVE.
Directory of Open Access Journals (Sweden)
K.Vijayarekha
2012-12-01
Full Text Available Linear Discriminant Analysis (LDA is one technique for transforming raw data into a new feature space in which classification can be carried out more robustly. It is useful where the within-class frequencies are unequal. This method maximizes the ratio of between-class variance to the within-class variance in any particular data set and the maximal separability is guaranteed. LDA clustering models are used to classify object into different category. This study makes use of LDA for clustering the features obtained for the citrus fruit images taken in five different domains. Sub-windows of size 40x40 are cropped from the citrus fruit images having defects such as pitting, splitting and stem end rot. Features are extracted in four domains such as statistical features, fourier transform based features, discrete wavelet transform based features and stationary wavelet transform based features. The results of clustering and classification using LDA and ANN classifiers are reported
Directory of Open Access Journals (Sweden)
R. Rajalakshmi
2015-03-01
Full Text Available Face recognition is the process of categorizing a person in an image by evaluating with a known face image library. The pose and illumination variations are two main practical confronts for an automatic face recognition system. This study proposes a novel face recognition algorithm known as Efficient Discriminant Component Analysis (EDCA for face recognition under varying poses and illumination conditions. This EDCA algorithm overcomes the high dimensionality problem in the feature space by extracting features from the low dimensional frequency band of the image. It combines the features of both LDA and PCA algorithms and these features are used in the training set and is classified using Support Vector Machine classifier. The experiments were performed on the CMU-PIE datasets. The experimental results show that the proposed algorithm produces a higher recognition rate than the existing LDA and PCA based face recognition techniques.
DEFF Research Database (Denmark)
Garcia, Emanuel; Klaas, Ilka Christine; Amigo Rubio, Jose Manuel;
2014-01-01
. Eighty variables retrieved from AMS were summarized week-wise and used to predict 2 defined classes: nonlame and clinically lame cows. Variables were represented with 2 transformations of the week summarized variables, using 2-wk data blocks before gait scoring, totaling 320 variables (2 × 2 × 80......). The reference gait scoring error was estimated in the first week of the study and was, on average, 15%. Two partial least squares discriminant analysis models were fitted to parity 1 and parity 2 groups, respectively, to assign the lameness class according to the predicted probability of being lame (score 3......Lameness is prevalent in dairy herds. It causes decreased animal welfare and leads to higher production costs. This study explored data from an automatic milking system (AMS) to model on-farm gait scoring from a commercial farm. A total of 88 cows were gait scored once per week, for 2 5-wk periods...
Zollanvari, Amin
2013-05-24
We provide a fundamental theorem that can be used in conjunction with Kolmogorov asymptotic conditions to derive the first moments of well-known estimators of the actual error rate in linear discriminant analysis of a multivariate Gaussian model under the assumption of a common known covariance matrix. The estimators studied in this paper are plug-in and smoothed resubstitution error estimators, both of which have not been studied before under Kolmogorov asymptotic conditions. As a result of this work, we present an optimal smoothing parameter that makes the smoothed resubstitution an unbiased estimator of the true error. For the sake of completeness, we further show how to utilize the presented fundamental theorem to achieve several previously reported results, namely the first moment of the resubstitution estimator and the actual error rate. We provide numerical examples to show the accuracy of the succeeding finite sample approximations in situations where the number of dimensions is comparable or even larger than the sample size.
Zollanvari, Amin; Genton, Marc G
2013-08-01
We provide a fundamental theorem that can be used in conjunction with Kolmogorov asymptotic conditions to derive the first moments of well-known estimators of the actual error rate in linear discriminant analysis of a multivariate Gaussian model under the assumption of a common known covariance matrix. The estimators studied in this paper are plug-in and smoothed resubstitution error estimators, both of which have not been studied before under Kolmogorov asymptotic conditions. As a result of this work, we present an optimal smoothing parameter that makes the smoothed resubstitution an unbiased estimator of the true error. For the sake of completeness, we further show how to utilize the presented fundamental theorem to achieve several previously reported results, namely the first moment of the resubstitution estimator and the actual error rate. We provide numerical examples to show the accuracy of the succeeding finite sample approximations in situations where the number of dimensions is comparable or even larger than the sample size.
Institute of Scientific and Technical Information of China (English)
CHEN Xinyi; YAN Xuefeng
2013-01-01
Fault diagnosis and monitoring are very important for complex chemical process.There are numerous methods that have been studied in this field,in which the effective visualization method is still challenging.In order to get a better visualization effect,a novel fault diagnosis method which combines self-organizing map (SOM) with Fisher discriminant analysis (FDA) is proposed.FDA can reduce the dimension of the data in terms of maximizing the separability of the classes.After feature extraction by FDA,SOM can distinguish the different states on the output map clearly and it can also be employed to monitor abnormal states.Tennessee Eastman (TE) process is employed to illustrate the fault diagnosis and monitoring performance of the proposed method.The result shows that the SOM integrated with FDA method is efficient and capable for real-time monitoring and fault diagnosis in complex chemical process.
Huerta-Díaz-Regañon, M D; Salinas Fernández, M R; Masoud, T
1997-12-01
The major volatiles of eighty eight wines (white, rosè and red) from Madrid were studied. The samples came from the three districts forming the "Vinos de Madrid" DO (Denominación de Origen) region: Arganda, Navalcarnero and San Martín, and were analyzed by gas chromatography. The resulting data were treated by Stepwise Discriminant Analysis (SDA) in order to ascertain the efficacity of these compounds in classifying the wines according to their geographical origin. The results confirm that the above components were of little use in classifying the red and white wines and, although a correct classification percentage of 90.91% was obtained for the rosés when all the variables were used, this too was considered unsatisfactory.
Kim, Deog-Im; Kwak, Dai-Soon; Han, Seung-Ho
2013-12-10
The proximal and distal parts of the femur show the differences between the sexes. Head diameter and the breadth of the epicondyle of the femur are known to distinguish males from females. The proximal end of the femur is studied to determine sex using discriminant analysis but; its distal end is not done. This study aims to develop an equation specific to Koreans by using the medial and lateral condyles of the femur, and to demonstrate the usefulness of equations for specific population groups. We used three-dimensional images from 202 Korean femurs. Twelve variables were measured with a computer program after the femurs were in alignment. Eleven variables showed a statistically significant difference between the sexes (Psex determination in situations where the skull and pelvis are missing and part of the femur is available. The study also demonstrates the need for different equations for different population groups.
The Importance of the Discriminant Analysis for the Evolution of the Equity Prices
Directory of Open Access Journals (Sweden)
Dinca G.
2014-12-01
Full Text Available This paper aims to show the correlation between the results obtained using the discriminant analysis and the evolution of stock prices of listed Romanian companies. For this purpose, we have carried out the research on a sample of 32 issuers from categories I and II of the Bucharest Stock Exchange, pertaining to nine economic sectors, for the period 2010- 2012. Our study is based on the Anghel prediction model of bankruptcy, using the stock prices of the 32 listed companies from the first and the last day of trading for each year examined. The results obtained by applying the prediction model allow the classification of issuers into potential bankrupt and non-bankrupt firms and help investors take appropriate decisions on the stock market.
DEFF Research Database (Denmark)
Garcia, Emanuel; Klaas, Ilka Christine; Amigo Rubio, Jose Manuel;
2014-01-01
Lameness is prevalent in dairy herds. It causes decreased animal welfare and leads to higher production costs. This study explored data from an automatic milking system (AMS) to model on-farm gait scoring from a commercial farm. A total of 88 cows were gait scored once per week, for 2 5-wk periods....... Eighty variables retrieved from AMS were summarized week-wise and used to predict 2 defined classes: nonlame and clinically lame cows. Variables were represented with 2 transformations of the week summarized variables, using 2-wk data blocks before gait scoring, totaling 320 variables (2 × 2 × 80......). The reference gait scoring error was estimated in the first week of the study and was, on average, 15%. Two partial least squares discriminant analysis models were fitted to parity 1 and parity 2 groups, respectively, to assign the lameness class according to the predicted probability of being lame (score 3...
Directory of Open Access Journals (Sweden)
A. Ali
2016-01-01
Full Text Available A responsive laser induced breakdown spectroscopic system was developed and improved for utilizing it as a sensor for the classification of quartz samples on the basis of trace elements present in the acquired samples. Laser induced breakdown spectroscopy (LIBS in conjunction with discriminant function analysis (DFA was applied for the classification of five different types of quartz samples. The quartz plasmas were produced at ambient pressure using Nd:YAG laser at fundamental harmonic mode (1064 nm. We optimized the detection system by finding the suitable delay time of the laser excitation. This is the first study, where the developed technique (LIBS+DFA was successfully employed to probe and confirm the elemental composition of quartz samples.
Cross View Gait Recognition Using Joint-Direct Linear Discriminant Analysis
Portillo-Portillo, Jose; Leyva, Roberto; Sanchez, Victor; Sanchez-Perez, Gabriel; Perez-Meana, Hector; Olivares-Mercado, Jesus; Toscano-Medina, Karina; Nakano-Miyatake, Mariko
2016-01-01
This paper proposes a view-invariant gait recognition framework that employs a unique view invariant model that profits from the dimensionality reduction provided by Direct Linear Discriminant Analysis (DLDA). The framework, which employs gait energy images (GEIs), creates a single joint model that accurately classifies GEIs captured at different angles. Moreover, the proposed framework also helps to reduce the under-sampling problem (USP) that usually appears when the number of training samples is much smaller than the dimension of the feature space. Evaluation experiments compare the proposed framework’s computational complexity and recognition accuracy against those of other view-invariant methods. Results show improvements in both computational complexity and recognition accuracy. PMID:28025484
Cross View Gait Recognition Using Joint-Direct Linear Discriminant Analysis
Directory of Open Access Journals (Sweden)
Jose Portillo-Portillo
2016-12-01
Full Text Available This paper proposes a view-invariant gait recognition framework that employs a unique view invariant model that profits from the dimensionality reduction provided by Direct Linear Discriminant Analysis (DLDA. The framework, which employs gait energy images (GEIs, creates a single joint model that accurately classifies GEIs captured at different angles. Moreover, the proposed framework also helps to reduce the under-sampling problem (USP that usually appears when the number of training samples is much smaller than the dimension of the feature space. Evaluation experiments compare the proposed framework’s computational complexity and recognition accuracy against those of other view-invariant methods. Results show improvements in both computational complexity and recognition accuracy.
A Multimodal Biometric System Using Linear Discriminant Analysis For Improved Performance
Directory of Open Access Journals (Sweden)
Aamir Khan
2011-11-01
Full Text Available Essentially a biometric system is a pattern recognition system which recognizes a user by determining the authenticity of a specific anatomical or behavioral characteristic possessed by the user. With the ever increasing integration of computers and Internet into daily life style, it has become necessary to protect sensitive and personal data. This paper proposes a multimodal biometric system which incorporates more than one biometric trait to attain higher security and to handle failure to enroll situations for some users. This paper is aimed at investigating a multimodal biometric identity system using Linear Discriminant Analysis as backbone to both facial and speech recognition and implementing such system in real-time using SignalWAVE.
Hargrove, Levi J; Scheme, Erik J; Englehart, Kevin B; Hudgins, Bernard S
2010-02-01
This paper describes a novel pattern recognition based myoelectric control system that uses parallel binary classification and class specific thresholds. The system was designed with an intuitive configuration interface, similar to existing conventional myoelectric control systems. The system was assessed quantitatively with a classification error metric and functionally with a clothespin test implemented in a virtual environment. For each case, the proposed system was compared to a state-of-the-art pattern recognition system based on linear discriminant analysis and a conventional myoelectric control scheme with mode switching. These assessments showed that the proposed control system had a higher classification error ( p myoelectric control system ( p myoelectric control system which is robust, easily configured, and highly usable.
Acquah, Gifty E; Via, Brian K; Billor, Nedret; Fasina, Oladiran O; Eckhardt, Lori G
2016-08-27
As new markets, technologies and economies evolve in the low carbon bioeconomy, forest logging residue, a largely untapped renewable resource will play a vital role. The feedstock can however be variable depending on plant species and plant part component. This heterogeneity can influence the physical, chemical and thermochemical properties of the material, and thus the final yield and quality of products. Although it is challenging to control compositional variability of a batch of feedstock, it is feasible to monitor this heterogeneity and make the necessary changes in process parameters. Such a system will be a first step towards optimization, quality assurance and cost-effectiveness of processes in the emerging biofuel/chemical industry. The objective of this study was therefore to qualitatively classify forest logging residue made up of different plant parts using both near infrared spectroscopy (NIRS) and Fourier transform infrared spectroscopy (FTIRS) together with linear discriminant analysis (LDA). Forest logging residue harvested from several Pinus taeda (loblolly pine) plantations in Alabama, USA, were classified into three plant part components: clean wood, wood and bark and slash (i.e., limbs and foliage). Five-fold cross-validated linear discriminant functions had classification accuracies of over 96% for both NIRS and FTIRS based models. An extra factor/principal component (PC) was however needed to achieve this in FTIRS modeling. Analysis of factor loadings of both NIR and FTIR spectra showed that, the statistically different amount of cellulose in the three plant part components of logging residue contributed to their initial separation. This study demonstrated that NIR or FTIR spectroscopy coupled with PCA and LDA has the potential to be used as a high throughput tool in classifying the plant part makeup of a batch of forest logging residue feedstock. Thus, NIR/FTIR could be employed as a tool to rapidly probe/monitor the variability of forest
Neighbor Class Linear Discriminate Analysis%近邻类鉴别分析方法
Institute of Scientific and Technical Information of China (English)
王言伟; 丁晓青; 刘长松
2012-01-01
提出一种近邻类鉴别分析方法,线性鉴别分析是该方法的一个特例.线性鉴别分析通过最大化类间散度同时最小化类内散度寻找最佳投影,其中类间散度是所有类之间散度的总体平均；而近邻类鉴别分析中类间散度定义为各个类与其k个近邻类之间的平均散度.该方法通过选取适当的近邻类数,能够缓解线性鉴别降维后造成的部分类的重叠.实验结果表明近邻类鉴别分析方法性能稳定且优于传统的线性鉴别分析.%A method of neighbor class linear discriminant analysis (NCLDA) is proposed. Linear discriminant analysis ( LDA) is a special case of this method. LDA finds the optimal projections by maximum between-class scatter while by minimum within-class scatter. The between-class scatter is an average over divergences among all classes. In NCLDA, between-class scatter is defined as average divergences between one class and its k nearest neighbor classes. By selecting proper numbers of neighbor class, NCLDA alleviates overlaps among classes caused by LDA. The experimental results show that the proposed NCLDA is robust and outperforms LDA.
Jeffery, Debra; Clement, Sarah; Corker, Elizabeth; Howard, Louise M; Murray, Joanna; Thornicroft, Graham
2013-04-20
Experienced discrimination refers to an individual's perception that they have been treated unfairly due to an attribute and is an important recent focus within stigma research. A significant proportion of mental health service users report experiencing mental illness-based discrimination in relation to parenthood. Existing studies in this area have not gone beyond prevalence, therefore little is known about the nature of experienced discrimination in relation to parenthood, and how is it constituted. This study aims to generate a typology of community psychiatric service users' reports of mental illness-based discrimination in relation to becoming or being a parent. A secondary aim is to assess the prevalence of these types of experienced discrimination. In a telephone survey 2026 community psychiatric service users in ten UK Mental Health service provider organisations (Trusts) were asked about discrimination experienced in the previous 12 months using the Discrimination and Stigma Scale (DISC). The sample were asked if, due to their mental health problem, they had been treated unfairly in starting a family, or in their role as a parent, and gave examples of this. Prevalence is reported and the examples of experienced discrimination in relation to parenthood were analysed using the framework method of qualitative analysis. Three hundred and four participants (73% female) reported experienced discrimination, with prevalences of 22.5% and 28.3% for starting a family and for the parenting role respectively. Participants gave 89 examples of discrimination about starting a family and 228 about parenting, and these occurred in social and professional contexts. Ten themes were identified. These related to being seen as an unfit parent; people not being understanding; being stopped from having children; not being allowed to see their children; not getting the support needed; children being affected; children avoiding their parents; children's difficulties being blamed
Karoui, Romdhane; Hammami, Moncef; Rouissi, Hamadi; Blecker, Christophe
2011-01-01
Mid infrared spectroscopy (MIR) combined with multivariate data analysis was used to discriminate between ewes milk samples according to their feeding systems (controls, ewes fed scotch bean and ewes fed soybean). The MIR spectra were scanned throughout the first 11 weeks of the lactation stage. When factorial discriminant analysis (FDA) with leave one-out cross-validation was applied, separately, to the three spectral regions in the MIR (i.e. 3000-2800, 1700-1500 and 1500-900 cm(-1)), the cl...
Morrissey, L. A.; Weinstock, K. J.; Mouat, D. A.; Card, D. H.
1984-01-01
An evaluation of Thematic Mapper Simulator (TMS) data for the geobotanical discrimination of rock types based on vegetative cover characteristics is addressed in this research. A methodology for accomplishing this evaluation utilizing univariate and multivariate techniques is presented. TMS data acquired with a Daedalus DEI-1260 multispectral scanner were integrated with vegetation and geologic information for subsequent statistical analyses, which included a chi-square test, an analysis of variance, stepwise discriminant analysis, and Duncan's multiple range test. Results indicate that ultramafic rock types are spectrally separable from nonultramafics based on vegetative cover through the use of statistical analyses.
Gonzalez-Escalona, Narjol; Timme, Ruth; Raphael, Brian H; Zink, Donald; Sharma, Shashi K
2014-04-01
Clostridium botulinum is a genetically diverse Gram-positive bacterium producing extremely potent neurotoxins (botulinum neurotoxins A through G [BoNT/A-G]). The complete genome sequences of three strains harboring only the BoNT/A1 nucleotide sequence are publicly available. Although these strains contain a toxin cluster (HA(+) OrfX(-)) associated with hemagglutinin genes, little is known about the genomes of subtype A1 strains (termed HA(-) OrfX(+)) that lack hemagglutinin genes in the toxin gene cluster. We sequenced the genomes of three BoNT/A1-producing C. botulinum strains: two strains with the HA(+) OrfX(-) cluster (69A and 32A) and one strain with the HA(-) OrfX(+) cluster (CDC297). Whole-genome phylogenic single-nucleotide-polymorphism (SNP) analysis of these strains along with other publicly available C. botulinum group I strains revealed five distinct lineages. Strains 69A and 32A clustered with the C. botulinum type A1 Hall group, and strain CDC297 clustered with the C. botulinum type Ba4 strain 657. This study reports the use of whole-genome SNP sequence analysis for discrimination of C. botulinum group I strains and demonstrates the utility of this analysis in quickly differentiating C. botulinum strains harboring identical toxin gene subtypes. This analysis further supports previous work showing that strains CDC297 and 657 likely evolved from a common ancestor and independently acquired separate BoNT/A1 toxin gene clusters at distinct genomic locations.
Rapid discrimination of Salmonella isolates by single-strand conformation polymorphism analysis.
Al-Adhami, Batol H; Huby-Chilton, Florence; Blais, Burton W; Martinez-Perez, Amalia; Chilton, Neil B; Gajadhar, Alvin A
2008-10-01
A molecular typing technique was developed for the differentiation of Salmonella isolates based on single-strand conformation polymorphism (SSCP) analysis of amplicons generated by PCR. Amplicons from parts of the fimA (both the 5' and 3' ends), mdh, invA, and atpD genes were generated separately from a panel of Salmonella strains representing Salmonella bongori, and four subspecies and 17 serovars of Salmonella enterica. These amplicons were subjected to SSCP analysis for differentiation of the salmonellae on the basis of different conformational forms arising due to nucleotide sequence variations in the target genes. Several distinct SSCP banding patterns (a maximum of 14 each for atpD and fimA 3' end) were observed with this panel of Salmonella strains for amplicons generated from each target gene. The best discrimination of Salmonella subspecies and serovar was achieved from the SSCP analysis of a combination of at least three gene targets: atpD, invA, and either mdh or fimA 3' end. This demonstrates the applicability of SSCP analysis as an important additional method to classical typing approaches for the differentiation of foodborne Salmonella isolates. SSCP is simple to perform and should be readily transferable to food microbiology laboratories with basic PCR capability.
The Analysis of the Ethnical Discrimination on the Manpower’s Market under the Economical Crisis
Directory of Open Access Journals (Sweden)
Mihaela Hrisanta DOBRE
2012-06-01
Full Text Available Discrimination means any difference, exclusion, restriction, preference or different treatment that brings forth disadvantages for a person or a group as compared to other ones that are in similar situations. The reasons on which discrimination is based can be various, such as race, nationality, ethnics, religion, gender, sexual orientation, language, age, disabilities etc. and in this case we talk about multiple discrimination. In Romania the main forms of discrimination are linked to ethnics and to sexual appurtenance. Within this column we analysed the discrimination amongst the Romany ethnics people, according to a statistical investigation (Access onto the Labour Market – A Chance for You, the research goal being to identify the answer to the following questions: Is there any discrimination inside the Romany ethnic group? What is the correlation between their level of education and their income? What is the correlation between the level of education of the parents and the respondent’s?
Kwon, Yong-Kook; Ahn, Myung Suk; Park, Jong Suk; Liu, Jang Ryol; In, Dong Su; Min, Byung Whan; Kim, Suk Weon
2013-01-01
To determine whether Fourier transform (FT)-IR spectral analysis combined with multivariate analysis of whole-cell extracts from ginseng leaves can be applied as a high-throughput discrimination system of cultivation ages and cultivars, a total of total 480 leaf samples belonging to 12 categories corresponding to four different cultivars (Yunpung, Kumpung, Chunpung, and an open-pollinated variety) and three different cultivation ages (1 yr, 2 yr, and 3 yr) were subjected to FT-IR. The spectral data were analyzed by principal component analysis and partial least squares-discriminant analysis. A dendrogram based on hierarchical clustering analysis of the FT-IR spectral data on ginseng leaves showed that leaf samples were initially segregated into three groups in a cultivation age-dependent manner. Then, within the same cultivation age group, leaf samples were clustered into four subgroups in a cultivar-dependent manner. The overall prediction accuracy for discrimination of cultivars and cultivation ages was 94.8% in a cross-validation test. These results clearly show that the FT-IR spectra combined with multivariate analysis from ginseng leaves can be applied as an alternative tool for discriminating of ginseng cultivars and cultivation ages. Therefore, we suggest that this result could be used as a rapid and reliable F1 hybrid seed-screening tool for accelerating the conventional breeding of ginseng. PMID:24558311
Demographic Consequences of Gender Discrimination in China: Simulation Analysis of Policy Options
Quanbao, Jiang; Shuzhuo, Li; Marcus W., Feldman
2011-01-01
The large number of missing females in China, a consequence of gender discrimination, is having and will continue to have a profound effect on the country's population development. In this paper, we analyze the causes of this gender discrimination in terms of institutions, culture and, economy, and suggest public policies that might help eliminate gender discrimination. Using a population simulation model, we study the effect of public policies on the sex ratio at birth and excess female chil...
A nonparametric and diversified portfolio model
Shirazi, Yasaman Izadparast; Sabiruzzaman, Md.; Hamzah, Nor Aishah
2014-07-01
Traditional portfolio models, like mean-variance (MV) suffer from estimation error and lack of diversity. Alternatives, like mean-entropy (ME) or mean-variance-entropy (MVE) portfolio models focus independently on the issue of either a proper risk measure or the diversity. In this paper, we propose an asset allocation model that compromise between risk of historical data and future uncertainty. In the new model, entropy is presented as a nonparametric risk measure as well as an index of diversity. Our empirical evaluation with a variety of performance measures shows that this model has better out-of-sample performances and lower portfolio turnover than its competitors.
Non-Parametric Estimation of Correlation Functions
DEFF Research Database (Denmark)
Brincker, Rune; Rytter, Anders; Krenk, Steen
In this paper three methods of non-parametric correlation function estimation are reviewed and evaluated: the direct method, estimation by the Fast Fourier Transform and finally estimation by the Random Decrement technique. The basic ideas of the techniques are reviewed, sources of bias are pointed...... out, and methods to prevent bias are presented. The techniques are evaluated by comparing their speed and accuracy on the simple case of estimating auto-correlation functions for the response of a single degree-of-freedom system loaded with white noise....
Nonparametric inferences for kurtosis and conditional kurtosis
Institute of Scientific and Technical Information of China (English)
XIE Xiao-heng; HE You-hua
2009-01-01
Under the assumption of strictly stationary process, this paper proposes a nonparametric model to test the kurtosis and conditional kurtosis for risk time series. We apply this method to the daily returns of S&P500 index and the Shanghai Composite Index, and simulate GARCH data for verifying the efficiency of the presented model. Our results indicate that the risk series distribution is heavily tailed, but the historical information can make its future distribution light-tailed. However the far future distribution's tails are little affected by the historical data.
Preliminary results on nonparametric facial occlusion detection
Directory of Open Access Journals (Sweden)
Daniel LÓPEZ SÁNCHEZ
2016-10-01
Full Text Available The problem of face recognition has been extensively studied in the available literature, however, some aspects of this field require further research. The design and implementation of face recognition systems that can efficiently handle unconstrained conditions (e.g. pose variations, illumination, partial occlusion... is still an area under active research. This work focuses on the design of a new nonparametric occlusion detection technique. In addition, we present some preliminary results that indicate that the proposed technique might be useful to face recognition systems, allowing them to dynamically discard occluded face parts.
Cresswell, Scott L; Eklund, Robert C
2006-02-01
Athlete burnout research has been hampered by the lack of an adequate measurement tool. The Athlete Burnout Questionnaire (ABQ) and the Maslach Burnout Inventory General Survey (MBI-GS) are two recently developed self-report instruments designed to assess burnout. The convergent and discriminant validity of the ABQ and MBI-GS were assessed through multi-trait/multi-method analysis with a sporting population. Overall, the ABQ and the MBI-GS displayed acceptable convergent validity with matching subscales highly correlated, and satisfactory internal discriminant validity with lower correlations between non-matching subscales. Both scales also indicated an adequate discrimination between the concepts of burnout and depression. These findings add support to previous findings in non-sporting populations that depression and burnout are separate constructs. Based on the psychometric results, construct validity analysis and practical considerations, the results support the use of the ABQ to assess athlete burnout.
Institute of Scientific and Technical Information of China (English)
曹志勇
2014-01-01
It is widely acknowledged that sociolinguistics is the study of language in certain context concerned with our soci-ety. Sociolinguistics and linguistics are intrinsically related to each other, but there has been difference as well. Linguistics research deals with language system itself, which belongs to the micro lev-el on the one hand; many phenomena reflect discrimination in language classroom, these discrimination are caused by social fac-tors to a certain degree. This paper makes a brief analysis of dis-crimination in language classroom from the perspective of socio-linguistics, which deals with many issues such as depiction of lan-guage discrimination、analysis of phenomenon and accordingly-solved measures.
Morton, Kenneth D., Jr.; Torrione, Peter A.; Collins, Leslie
2010-04-01
Time domain ground penetrating radar (GPR) has been shown to be a powerful sensing phenomenology for detecting buried objects such as landmines. Landmine detection with GPR data typically utilizes a feature-based pattern classification algorithm to discriminate buried landmines from other sub-surface objects. In high-fidelity GPR, the time-frequency characteristics of a landmine response should be indicative of the physical construction and material composition of the landmine and could therefore be useful for discrimination from other non-threatening sub-surface objects. In this research we propose modeling landmine time-domain responses with a nonparametric Bayesian time-series model and we perform clustering of these time-series models with a hierarchical nonparametric Bayesian model. Each time-series is modeled as a hidden Markov model (HMM) with autoregressive (AR) state densities. The proposed nonparametric Bayesian prior allows for automated learning of the number of states in the HMM as well as the AR order within each state density. This creates a flexible time-series model with complexity determined by the data. Furthermore, a hierarchical non-parametric Bayesian prior is used to group landmine responses with similar HMM model parameters, thus learning the number of distinct landmine response models within a data set. Model inference is accomplished using a fast variational mean field approximation that can be implemented for on-line learning.
DEFF Research Database (Denmark)
Ramirez, José Rangel; Sørensen, John Dalsgaard
2011-01-01
This work illustrates the updating and incorporation of information in the assessment of fatigue reliability for offshore wind turbine. The new information, coming from external and condition monitoring can be used to direct updating of the stochastic variables through a non-parametric Bayesian...... updating approach and be integrated in the reliability analysis by a third-order polynomial chaos expansion approximation. Although Classical Bayesian updating approaches are often used because of its parametric formulation, non-parametric approaches are better alternatives for multi-parametric updating...... with a non-conjugating formulation. The results in this paper show the influence on the time dependent updated reliability when non-parametric and classical Bayesian approaches are used. Further, the influence on the reliability of the number of updated parameters is illustrated....
Classification of the long-QT syndrome based on discriminant analysis of T-wave morphology
DEFF Research Database (Denmark)
Struijk, Johannes; Kanters, J.K.; Andersen, Mads Peter
2006-01-01
been shown to be useful discriminators, but no single ECG parameter has been sufficient to solve the diagnostic problem. In this study we present a method for discrimination among persons with a normal genotype and those with mutations in the KCNQ1 (KvLQT1 or LQT1) and KCNH2 (HERG or LQT2) genes...... on the basis of parameters describing T-wave morphology in terms of duration, asymmetry, flatness and amplitude. Discriminant analyses based on 4 or 5 parameters both resulted in perfect discrimination in a learning set of 36 subjects. In both cases cross-validation of the resulting classifiers showed...
Energy Technology Data Exchange (ETDEWEB)
Awasthi, Rishi [Sanjay Gandhi Post Graduate Institute of Medical Sciences, Department of Radiodiagnosis, Lucknow (India); Rathore, Ram K.S.; Sahoo, Prativa [Indian Institute of Technology, Department of Mathematics and Statistics, Kanpur (India); Soni, Priyanka; Husain, Nuzhat [Chhatrapati Sahuji Maharj Medical University, Department of Pathology, Lucknow (India); Awasthi, Ashish; Pandey, Chandra M. [Sanjay Gandhi Post Graduate Institute of Medical Sciences, Department of Biostatistics, Lucknow (India); Behari, Sanjay; Singh, Rohit K. [Sanjay Gandhi Post Graduate Institute of Medical Sciences, Department of Neurosurgery, Lucknow (India); Gupta, Rakesh K. [Sanjay Gandhi Post Graduate Institute of Medical Sciences, MR Section, Department of Radiodiagnosis, Lucknow, UP (India)
2012-03-15
The purpose of the present study was to look for the possible predictors which might discriminate between high- and low-grade gliomas by pooling dynamic contrast-enhanced (DCE)-perfusion derived indices and immunohistochemical markers. DCE-MRI was performed in 76 patients with different grades of gliomas. Perfusion indices, i.e., relative cerebral blood volume (rCBV), relative cerebral blood flow (rCBF), permeability (k{sup trans} and k{sub ep}), and leakage (v{sub e}) were quantified. MMP-9-, PRL-3-, HIF-1{alpha}-, and VEGF-expressing cells were quantified from the excised tumor tissues. Discriminant function analysis using these markers was used to identify discriminatory variables using a stepwise procedure. To look for correlations between immunohistochemical parameters and DCE metrics, Pearson's correlation coefficient was also used. A discriminant function for differentiating between high- and low-grade tumors was constructed using DCE-MRI-derived rCBV, k{sub ep}, and v{sub e}. The form of the functions estimated are ''D{sub 1} = 0.642 x rCBV + 0.591 x k{sub ep} - 1.501 x v{sub e} - 1.550'' and ''D{sub 2} = 1.608 x rCBV + 3.033 x k{sub ep} + 5.508 x v{sub e} - 8.784'' for low- and high-grade tumors, respectively. This function classified overall 92.1% of the cases correctly (89.1% high-grade tumors and 100% low-grade tumors). In addition, VEGF expression correlated with rCBV and rCBF, whereas MMP-9 expression correlated with k{sub ep}. A significant positive correlation of HIF-1{alpha} with rCBV and VEGF expression was also found. DCE-MRI may be used to differentiate between high-grade and low-grade brain tumors non-invasively, which may be helpful in appropriate treatment planning and management of these patients. The correlation of its indices with immunohistochemical markers suggests that this imaging technique is useful in tissue characterization of gliomas. (orig.)
Multi-class ERP-based BCI data analysis using a discriminant space self-organizing map.
Onishi, Akinari; Natsume, Kiyohisa
2014-01-01
Emotional or non-emotional image stimulus is recently applied to event-related potential (ERP) based brain computer interfaces (BCI). Though the classification performance is over 80% in a single trial, a discrimination between those ERPs has not been considered. In this research we tried to clarify the discriminability of four-class ERP-based BCI target data elicited by desk, seal, spider images and letter intensifications. A conventional self organizing map (SOM) and newly proposed discriminant space SOM (ds-SOM) were applied, then the discriminabilites were visualized. We also classify all pairs of those ERPs by stepwise linear discriminant analysis (SWLDA) and verify the visualization of discriminabilities. As a result, the ds-SOM showed understandable visualization of the data with a shorter computational time than the traditional SOM. We also confirmed the clear boundary between the letter cluster and the other clusters. The result was coherent with the classification performances by SWLDA. The method might be helpful not only for developing a new BCI paradigm, but also for the big data analysis.
Directory of Open Access Journals (Sweden)
Salim Aijaz Bhat
2014-01-01
Full Text Available Multivariate techniques, discriminant analysis, and WQI were applied to analyze a water quality data set including 27 parameters at 5 sites of the Lake Wular in Kashmir Himalaya from 2011 to 2013 to investigate spatiotemporal variations and identify potential pollution sources. Spatial and temporal variations in water quality parameters were evaluated through stepwise discriminant analysis (DA. The first spatial discriminant function (DF accounted for 76.5% of the total spatial variance, and the second DF accounted for 19.1%. The mean values of water temperature, EC, total-N, K, and silicate showed a strong contribution to discriminate the five sampling sites. The mean concentration of NO2-N, total-N, and sulphate showed a strong contribution to discriminate the four sampling seasons and accounted for most of the expected seasonal variations. The order of major cations and anions was Ca2+>Mg2+> Na+>K+ and Cl->SO42->SiO22- respectively. The results of water quality index, employing thirteen core parameters vital for drinking water purposes, showed values of 49.2, 46.5, 47.3, 40.6, and 37.1 for sites I, II, III, IV, and V, respectively. These index values reflect that the water of lake is in good condition for different purposes but increased values alarm us about future repercussions.
Tang, M; Chang, C Q; Fung, P C W; Chau, K T; Chan, F H Y
2005-01-01
The discrimination of ECG signals using nonlinear dynamic parameters is of crucial importance in the cardiac disease therapy and chaos control for arrhythmia defibrillation in the cardiac system. However, the discrimination results of previous studies using features such as maximal Lyapunov exponent (λmax) and correlation dimension (D2) alone are somewhat limited in recognition rate. In this paper, improved methods for computing λmaxand D2are purposed. Another parameter from recurrence quantification analysis is incorporated to the new multi-feature Bayesian classifier with λmaxand D2so as to improve the discrimination power. Experimental results have verified the prediction using Fisher discriminant that the maximal vertical line length (Vmax) from recurrence quantification analysis is the best to distinguish different ECG classes. Experimental results using the MIT-BIH Arrhythmia Database show improved and excellent overall accuracy (96.3%), average sensitivity (96.3%) and average specificity (98.15%) for discriminating sinus, premature ventricular contraction and ventricular flutter signals.
Directory of Open Access Journals (Sweden)
Rabia Ece OMAY
2013-06-01
Full Text Available In this study, relationship between gross domestic product (GDP per capita and sulfur dioxide (SO2 and particulate matter (PM10 per capita is modeled for Turkey. Nonparametric fixed effect panel data analysis is used for the modeling. The panel data covers 12 territories, in first level of Nomenclature of Territorial Units for Statistics (NUTS, for period of 1990-2001. Modeling of the relationship between GDP and SO2 and PM10 for Turkey, the non-parametric models have given good results.
Directory of Open Access Journals (Sweden)
Wakita,Yoshiharu
1988-06-01
Full Text Available We studied the dermatoglyphics of 353 severe mental retardates (excluding those with chromosomal abnormalities and major limb malformations, using multivariate analysis, to determine how early intrauterine factors are related to the etiology of mental retardation. First, dermatoglyphics were compared between 140 individuals with undefined prenatal factors and 700 normal controls. After 6 and 9 dermatoglyphic traits were chosen as discriminative variables for males and females, respectively, the data were subjected separately for each sex to the constellation graphical method for discriminant analysis. The same formula as obtained in the idiopathic group was subsequently applied to data from cases in other etiological categories. When the misclassification rate was 0.03, the rates of correct classification of the male patients into the etiological categories of undefined prenatal, defined prenatal, perinatal, postnatal and unknown (no anamnestic data available categories were 19.7% (13/66, 20.0% (3/15, 8.8% (5/57, 5.0% (1/20 and 7.7% (2/26, while the correct classification rates of females were 24.3% (18/74, 42.1% (8/19, 18.9% (7/37, 5.1% (1/16 and 13.0% (3/23, respectively. The results suggest that early intrauterine factors such as those producing dermatoglyphic deviations may contribute to the pathogenesis of severe mental retardation not only in patients with undefined prenatal etiological factors but also in those with perinatal factors, especially those of the female sex.
Bayesian Nonparametric Clustering for Positive Definite Matrices.
Cherian, Anoop; Morellas, Vassilios; Papanikolopoulos, Nikolaos
2016-05-01
Symmetric Positive Definite (SPD) matrices emerge as data descriptors in several applications of computer vision such as object tracking, texture recognition, and diffusion tensor imaging. Clustering these data matrices forms an integral part of these applications, for which soft-clustering algorithms (K-Means, expectation maximization, etc.) are generally used. As is well-known, these algorithms need the number of clusters to be specified, which is difficult when the dataset scales. To address this issue, we resort to the classical nonparametric Bayesian framework by modeling the data as a mixture model using the Dirichlet process (DP) prior. Since these matrices do not conform to the Euclidean geometry, rather belongs to a curved Riemannian manifold,existing DP models cannot be directly applied. Thus, in this paper, we propose a novel DP mixture model framework for SPD matrices. Using the log-determinant divergence as the underlying dissimilarity measure to compare these matrices, and further using the connection between this measure and the Wishart distribution, we derive a novel DPM model based on the Wishart-Inverse-Wishart conjugate pair. We apply this model to several applications in computer vision. Our experiments demonstrate that our model is scalable to the dataset size and at the same time achieves superior accuracy compared to several state-of-the-art parametric and nonparametric clustering algorithms.
Analyzing single-molecule time series via nonparametric Bayesian inference.
Hines, Keegan E; Bankston, John R; Aldrich, Richard W
2015-02-03
The ability to measure the properties of proteins at the single-molecule level offers an unparalleled glimpse into biological systems at the molecular scale. The interpretation of single-molecule time series has often been rooted in statistical mechanics and the theory of Markov processes. While existing analysis methods have been useful, they are not without significant limitations including problems of model selection and parameter nonidentifiability. To address these challenges, we introduce the use of nonparametric Bayesian inference for the analysis of single-molecule time series. These methods provide a flexible way to extract structure from data instead of assuming models beforehand. We demonstrate these methods with applications to several diverse settings in single-molecule biophysics. This approach provides a well-constrained and rigorously grounded method for determining the number of biophysical states underlying single-molecule data. Copyright © 2015 Biophysical Society. Published by Elsevier Inc. All rights reserved.
Brody, Gene H.; Lei, Man-Kit; Chae, David H.; Yu, Tianyi; Kogan, Steven M.; Beach, Steven R. H.
2014-01-01
This study was designed to examine the prospective relations of perceived racial discrimination with allostatic load (AL), along with a possible buffer of the association. A sample of 331 African Americans in the rural South provided assessments of perceived discrimination from ages 16 to 18 years. When youth were 18 years, caregivers reported…
Brody, Gene H.; Lei, Man-Kit; Chae, David H.; Yu, Tianyi; Kogan, Steven M.; Beach, Steven R. H.
2014-01-01
This study was designed to examine the prospective relations of perceived racial discrimination with allostatic load (AL), along with a possible buffer of the association. A sample of 331 African Americans in the rural South provided assessments of perceived discrimination from ages 16 to 18 years. When youth were 18 years, caregivers reported…
Qualitative analysis of mental health service users' reported experiences of discrimination.
Hamilton, S; Pinfold, V; Cotney, J; Couperthwaite, L; Matthews, J; Barret, K; Warren, S; Corker, E; Rose, D; Thornicroft, G; Henderson, C
2016-08-01
To better understand mental health service users' experiences of stigma and discrimination in different settings. An annual telephone survey of people with a mental health diagnosis conducted to evaluate the Time to Change antistigma campaign in England. Of 985 people who participated in 2013, 84 took part in a qualitative interview which was audio recorded. Of these, 50 interviews were transcribed and thematically analysed to explore accounts of discrimination. We analysed common types of behaviour; motivations ascribed to the discriminators; expectations of what fair treatment would have been; and the impact of discrimination on participants. Discrimination was most common in five contexts: welfare benefits, mental health care, physical health care, family and friends. Participants often found it hard to assess whether a behaviour was discriminatory or not. Lack of support, whether by public services or by friends and family, was often experienced as discrimination, reflecting an expectation that positive behaviours and reasonable adjustments should be offered in response to mental health needs. The impact of discrimination across different settings was often perceived by participants as aggravating their mental health, and there is thus a need to treat discrimination as a health issue, not just a social justice issue. © 2016 The Authors. Acta Psychiatrica Scandinavica Published by John Wiley & Sons Ltd.
Directory of Open Access Journals (Sweden)
ZHU Binghua
2017-06-01
Full Text Available ObjectiveTo establish a nutritional risk screening model for patients with liver cirrhosis using discriminant analysis. MethodsThe clinical data of 273 patients with liver cirrhosis who were admitted to Shuguang Hospital Affiliated to Shanghai University of Traditional Chinese Medicine from August 2015 to March 2016 were collected. Body height, body weight, upper arm circumference, triceps skinfold thickness, subscapular skinfold thickness, and hand grip strength were measured and recorded, and then body mass index (BMI and upper arm muscle circumference were calculated. Laboratory markers including liver function parameters, renal function parameters, and vitamins were measured. The patients were asked to complete Nutritional Risk Screening 2002 and Malnutrition Universal Screening Tool (MUST, and a self-developed nutritional risk screening pathway was used for nutritional risk classification. Observation scales of the four diagnostic methods in traditional Chinese medicine were used to collect patients′ symptoms and signs. Continuous data were expressed as mean±SD (x±s; an analysis of variance was used for comparison between multiple groups, and the least significant difference t-test was used for further comparison between two groups. Discriminant analysis was used for model establishment, and cross validation was used for model verification. ResultsThe nutritional risk screening pathway for patients with liver cirrhosis was used for the screening of respondents, and there were 49 patients (17.95% in non-risk group, 49 (17.95% in possible-risk group, and 175 (64.10% in risk group. The distance criterion function was used to establish the nutritional risk screening model for patients with liver cirrhosis: D1=-11.885+0.310×BMI+0150×MAC+0.005×P-Alb-0.001×Vit B12+0.103×Vit D-0.89×ascites-0.404×weakness-0.560×hypochondriac pain+0035×dysphoria with feverish sensation (note: if a patient has ascites, weakness, hypochondriac pain
Lacunarity analysis of spaceborne radar image texture for rock unit discrimination
Dong, Pinliang
Fractal geometry has led to new understanding of many natural objects and phenomena. As a scale-dependent measure, lacunarity can be used to discriminate different textures that may not be differentiated by fractal dimension. Based on a differential box counting method and a gliding-box algorithm, a new lacunarity estimation method is developed for texture analysis of digital images, and a "Lacunarity Analysis" extension built for ArcView (ESRI) geographical information system software. To reveal the directional properties of textures, the directionality of lacunarity is also defined. The new lacunarity measure is evaluated through quantitative comparison with the Voss lacunarity, the binary lacunarity, the grey level cooccurrence matrix (GLCM) based texture measures (homogeneity, contrast, dissimilarity, entropy), the fractal dimension, and the min-max operator using Brodatz textures. The results from Brodatz textures suggest that the new lacunarity estimation method for grey-scale images provides more accurate texture measurements than the above-mentioned fractal-based and statistical texture measures. In comparison with the Voss lacunarity, the fractal dimension, and the GLCM-based texture measures, the new lacunarity measure is then applied to dual-band (L and C) and dual-polarization (HH and HV) Shuttle Imaging Radar (SIR-C), and C-band HH polarization Radarsat images of two imaging modes for rock unit discrimination in a study area between California and Arizona, USA. Using textural analysis of 36 SIR-C and Radarsat sub-images and classification accuracy assessment of the combined Landsat Thematic Mapper (TM) images and spaceborne radar textural feature images, it has been demonstrated that the new lacunarity measure outperformed other texture measures in comparison, and the L-band HH polarization SIR-C image provides more textural information of the rock units compared with the Radarsat and other SIR-C radar images used in this study. The study shows that
Regularized discriminate analysis for breast mass detection on full field digital mammograms
Wei, Jun; Sahiner, Berkman; Zhang, Yiheng; Chan, Heang-Ping; Hadjiiski, Lubomir M.; Zhou, Chuan; Ge, Jun; Wu, Yi-Ta
2006-03-01
In computer-aided detection (CAD) applications, an important step is to design a classifier for the differentiation of the abnormal from the normal structures. We have previously developed a stepwise linear discriminant analysis (LDA) method with simplex optimization for this purpose. In this study, our goal was to investigate the performance of a regularized discriminant analysis (RDA) classifier in combination with a feature selection method for classification of the masses and normal tissues detected on full field digital mammograms (FFDM). The feature selection scheme combined a forward stepwise feature selection process and a backward stepwise feature elimination process to obtain the best feature subset. An RDA classifier and an LDA classifier in combination with this new feature selection method were compared to an LDA classifier with stepwise feature selection. A data set of 130 patients containing 260 mammograms with 130 biopsy-proven masses was used. All cases had two mammographic views. The true locations of the masses were identified by experienced radiologists. To evaluate the performance of the classifiers, we randomly divided the data set into two independent sets of approximately equal size for training and testing. The training and testing were performed using the 2-fold cross validation method. The detection performance of the CAD system was assessed by free response receiver operating characteristic (FROC) analysis. The average test FROC curve was obtained by averaging the FP rates at the same sensitivity along the two corresponding test FROC curves from the 2-fold cross validation. At the case-based sensitivities of 90%, 80% and 70% on the test set, our RDA classifier with the new feature selection scheme achieved an FP rate of 1.8, 1.1, and 0.6 FPs/image, respectively, compared to 2.1, 1.4, and 0.8 FPs/image with stepwise LDA with simplex optimization. Our results indicate that RDA in combination with the sequential forward inclusion
Yamauchi, Moritaka; Okada, Tomohisa; Okada, Tsutomu; Yamamoto, Akira; Fushimi, Yasutaka; Arakawa, Yoshiki; Miyamoto, Susumu; Togashi, Kaori
2017-08-01
This study investigated the combined capability of thallium-201 (Tl)-SPECT and fluorine-18-fluoro-deoxy-glucose (FDG)-PET for differential diagnosis of posterior fossa brain tumors using multiple discriminant analysis.This retrospective study was conducted under approval of the institutional review board. In the hospital information system, 27 patients with posterior fossa intra-axial tumor between January 2009 and June 2015 were enrolled and grouped as the following 7 entities: low grade glioma (LGG) 6, anaplastic astrocytoma (AA) 2, glioblastoma (GBM) 3, medulloblastoma (MB) 3, hemangioblastoma (HB) 6, metastatic tumor (Mets) 3, and malignant lymphoma (ML) 4. Tl and FDG uptakes were measured at the tumors and control areas, and several indexes were derived. Using indexes selected by the stepwise method, discriminant analysis was conducted with leave-one-out cross-validation.The predicted accuracy for tumor classification was 70.4% at initial analysis and 55.6% at cross-validation to differentiate 7 tumor entities. HB, LGG, and ML were well-discriminated, but AA was located next to LGG. GBM, MB, and Mets largely overlapped and could not be well distinguished even applying multiple discriminant analysis. Correct classification in the original and cross-validation analyses was 44.4% and 33.3% for Tl-SPECT and 55.6% and 48.1% for FDG-PET.
Kerschbamer, Rudolf
2015-05-01
This paper proposes a geometric delineation of distributional preference types and a non-parametric approach for their identification in a two-person context. It starts with a small set of assumptions on preferences and shows that this set (i) naturally results in a taxonomy of distributional archetypes that nests all empirically relevant types considered in previous work; and (ii) gives rise to a clean experimental identification procedure - the Equality Equivalence Test - that discriminates between archetypes according to core features of preferences rather than properties of specific modeling variants. As a by-product the test yields a two-dimensional index of preference intensity.
Improvement Distance Discriminant Analysis Method%改进的距离判别分析法
Institute of Scientific and Technical Information of China (English)
黄利文
2011-01-01
在不对判别变量进行处理的条件下,对传统的距离判别方法进行改进,提出一种新的判别方法,试图解决复杂球形数据的判别问题,以提高判别的正确率.通过实例表明,该方法的判别效果良好,能较好地处理复杂球形数据的判别问题.%This paper presents a new discriminant method through improving the traditional distance discriminant method and tries to improve the accuracy of discriminant problems of complex spherical data in condition of not reducing variables. The example shows that this method has good discriminant effect, and can effectively deal with the discriminant problems of complex spherical data.
Directory of Open Access Journals (Sweden)
Grzegorz Gołębiowski
2008-07-01
Full Text Available The article presents the result of research on en effectiveness of discriminant models on the example of selected Polish joint-stock companies which declared bankruptcy. Aside from general results describing an effectiveness of discriminant models on the base of the own research of the authors, a comparison of the received findings with others economists results of research was made. Moreover, an analysis how the relation of an enterprise with the economic situation impacts on an effectiveness of considered models concerning the forecast of a risk of bankruptcy was carried out.
Nonparametric estimation of population density for line transect sampling using FOURIER series
Crain, B.R.; Burnham, K.P.; Anderson, D.R.; Lake, J.L.
1979-01-01
A nonparametric, robust density estimation method is explored for the analysis of right-angle distances from a transect line to the objects sighted. The method is based on the FOURIER series expansion of a probability density function over an interval. With only mild assumptions, a general population density estimator of wide applicability is obtained.
A non-parametric peak finder algorithm and its application in searches for new physics
Chekanov, S
2011-01-01
We have developed an algorithm for non-parametric fitting and extraction of statistically significant peaks in the presence of statistical and systematic uncertainties. Applications of this algorithm for analysis of high-energy collision data are discussed. In particular, we illustrate how to use this algorithm in general searches for new physics in invariant-mass spectra using pp Monte Carlo simulations.
DISCRIMINATIVE ANALYSIS OF MORPHOLOGIC AND MOTORIC PARAMETER TO JUDO AND KARATE SPORTIEST BOYS
Directory of Open Access Journals (Sweden)
Lulzim Ibri
2013-07-01
Full Text Available In sample from 160 boys from secondary schools of Prizren 16-17 age, separated in two groups were implicated 18 tests, from them 10 test for valuation morphologic characteristic and 8 test, for valuation motoric abilities. Group (A is component from 80 judo athletes’ boys and group (B from 80 karate athletes’ boys. Purpose of this investigation is to verify changes between judo and karate athletes’ boys in morphologic characteristic and motoric abilities. The problem of investigation was to investigate if there are changes between judo and karate athletes’ boys in morphologic characteristic that represent longitudinal dimensionality, body measure and adipose tissue, and in motoric abilities (used is eurofit battery tests. For global analysis of dimension to some changes and variable system (which contribute in changes between judo and karate athletes’ boys were implicated t-test for small independent sample and, canonic discriminative analysis. The results of this study show that judo and karate athletes significantly differ among themselves in motoric abilities, judo athletes are better in the tests: long jump from place (LOJU, squeeze palm (SQPA and support the knuckle (SUKN, while the karate athletes are better in the tests: taping for hands (TAHE, reach sitting down position (RSDP and run there-hire 10x5 meters (R10x5M, but these changes were not noticed and morphological variables.
Discriminant Context Information Analysis for Post-Ranking Person Re-Identification.
Garcia, Jorge; Martinel, Niki; Gardel, Alfredo; Bravo, Ignacio; Foresti, Gian Luca; Micheloni, Christian
2017-01-16
Existing approaches for person re-identification are mainly based on creating distinctive representations or on learning optimal metrics. The achieved results are then provided in form of a list of ranked matching persons. It often happens that the true match is not ranked first but it is in the first positions. This is mostly due to the visual ambiguities shared between the true match and other "similar" persons. At the current state, there is a lack of a study of such visual ambiguities which limit the re-identification performance within the first ranks. We believe that an analysis of the similar appearances of the first ranks can be helpful in detecting, hence removing, such visual ambiguities. We propose to achieve such a goal by introducing an unsupervised post-ranking framework. Once the initial ranking is available, content and context sets are extracted. Then, these are exploited to remove the visual ambiguities and to obtain the discriminant feature space which is finally exploited to compute the new ranking. An in-depth analysis of the performance achieved on three public benchmark datasets support our believes. For every dataset, the proposed method remarkably improves the first ranks results and outperforms state-of-the-art approaches.
Ciucci, Sara; Ge, Yan; Durán, Claudio; Palladini, Alessandra; Jiménez-Jiménez, Víctor; Martínez-Sánchez, Luisa María; Wang, Yuting; Sales, Susanne; Shevchenko, Andrej; Poser, Steven W.; Herbig, Maik; Otto, Oliver; Androutsellis-Theotokis, Andreas; Guck, Jochen; Gerl, Mathias J.; Cannistraci, Carlo Vittorio
2017-01-01
Omic science is rapidly growing and one of the most employed techniques to explore differential patterns in omic datasets is principal component analysis (PCA). However, a method to enlighten the network of omic features that mostly contribute to the sample separation obtained by PCA is missing. An alternative is to build correlation networks between univariately-selected significant omic features, but this neglects the multivariate unsupervised feature compression responsible for the PCA sample segregation. Biologists and medical researchers often prefer effective methods that offer an immediate interpretation to complicated algorithms that in principle promise an improvement but in practice are difficult to be applied and interpreted. Here we present PC-corr: a simple algorithm that associates to any PCA segregation a discriminative network of features. Such network can be inspected in search of functional modules useful in the definition of combinatorial and multiscale biomarkers from multifaceted omic data in systems and precision biomedicine. We offer proofs of PC-corr efficacy on lipidomic, metagenomic, developmental genomic, population genetic, cancer promoteromic and cancer stem-cell mechanomic data. Finally, PC-corr is a general functional network inference approach that can be easily adopted for big data exploration in computer science and analysis of complex systems in physics. PMID:28287094
Ultrasonic analysis to discriminate bread dough of different types of flour
García-Álvarez, J.; Rosell, C. M.; García-Hernández, M. J.; Chávez, J. A.; Turó, A.; Salazar, J.
2012-12-01
Many varieties of bread are prepared using flour coming from wheat. However, there are other types of flours milled from rice, legumes and some fruits and vegetables that are also suitable for baking purposes, used alone or in combination with wheat flour. The type of flour employed strongly influences the dough consistency, which is a relevant property for determining the dough potential for breadmaking purposes. Traditional methods for dough testing are relatively expensive, time-consuming, off-line and often require skilled operators. In this work, ultrasonic analysis are performed in order to obtain acoustic properties of bread dough samples prepared using two different types of flour, wheat flour and rice flour. The dough acoustic properties can be related to its viscoelastic characteristics, which in turn determine the dough feasibility for baking. The main advantages of the ultrasonic dough testing can be, among others, its low cost, fast, hygienic and on-line performance. The obtained results point out the potential of the ultrasonic analysis to discriminate doughs of different types of flour.
Bertram, S; Moriggl, A; Neunteufel, N; Rudisch, A; Emshoff, R
2012-02-01
To assess whether in patients with temporomandibular joint (TMJ) arthralgia cephalometric variables of mandibular morphology may discriminate among the magnetic resonance (MR) imaging-based TMJ groups of 'bilateral presence of disk displacement without reduction (DDwoR) and osteoarthrosis (OA)' and 'bilateral absence of bilateral DDwoR and OA'. Bilateral MR imaging of the TMJ was performed in 45 consecutive TMJ arthralgia patients to identify individuals with the specific structural characteristics of bilateral TMJ DDwoR associated with OA. Linear and angular cephalometric measurements were taken from lateral cephalograms to apply selected criteria of mandibular morphology. A discriminant function analysis was used to investigate how cephalometric parameters discriminate among the TMJ groups of 'bilateral presence of DDwoR with OA' and 'bilateral absence of DDwoR and OA'. Ramus height (Ar-Go) and effective mandibular length (Ar-Pog) produced a significant discriminant function that predicted TMJ group membership (P < 0·001). This function correctly classified 80·2% of original and cross-validated grouped cases. This study supports the concept that cephalometric variables of mandibular morphology discriminate among subjects with and without bilateral TMJ DDwoR and OA.
Werf, M.J. van der; Pieterse, B.; Luijk, N. van; Schuren, F.; Werff van der - Vat, B. van der; Overkamp, K.; Jellema, R.H.
2006-01-01
The value of the multivariate data analysis tools principal component analysis (PCA) and principal component discriminant analysis (PCDA) for prioritizing leads generated by microarrays was evaluated. To this end, Pseudomonas putida S12 was grown in independent triplicate fermentations on four
Werf, M.J. van der; Pieterse, B.; Luijk, N. van; Schuren, F.; Werff van der - Vat, B. van der; Overkamp, K.; Jellema, R.H.
2006-01-01
The value of the multivariate data analysis tools principal component analysis (PCA) and principal component discriminant analysis (PCDA) for prioritizing leads generated by microarrays was evaluated. To this end, Pseudomonas putida S12 was grown in independent triplicate fermentations on four diffe
Directory of Open Access Journals (Sweden)
Gifty E. Acquah
2016-08-01
Full Text Available As new markets, technologies and economies evolve in the low carbon bioeconomy, forest logging residue, a largely untapped renewable resource will play a vital role. The feedstock can however be variable depending on plant species and plant part component. This heterogeneity can influence the physical, chemical and thermochemical properties of the material, and thus the final yield and quality of products. Although it is challenging to control compositional variability of a batch of feedstock, it is feasible to monitor this heterogeneity and make the necessary changes in process parameters. Such a system will be a first step towards optimization, quality assurance and cost-effectiveness of processes in the emerging biofuel/chemical industry. The objective of this study was therefore to qualitatively classify forest logging residue made up of different plant parts using both near infrared spectroscopy (NIRS and Fourier transform infrared spectroscopy (FTIRS together with linear discriminant analysis (LDA. Forest logging residue harvested from several Pinus taeda (loblolly pine plantations in Alabama, USA, were classified into three plant part components: clean wood, wood and bark and slash (i.e., limbs and foliage. Five-fold cross-validated linear discriminant functions had classification accuracies of over 96% for both NIRS and FTIRS based models. An extra factor/principal component (PC was however needed to achieve this in FTIRS modeling. Analysis of factor loadings of both NIR and FTIR spectra showed that, the statistically different amount of cellulose in the three plant part components of logging residue contributed to their initial separation. This study demonstrated that NIR or FTIR spectroscopy coupled with PCA and LDA has the potential to be used as a high throughput tool in classifying the plant part makeup of a batch of forest logging residue feedstock. Thus, NIR/FTIR could be employed as a tool to rapidly probe/monitor the variability
Analyzing multiple spike trains with nonparametric Granger causality.
Nedungadi, Aatira G; Rangarajan, Govindan; Jain, Neeraj; Ding, Mingzhou
2009-08-01
Simultaneous recordings of spike trains from multiple single neurons are becoming commonplace. Understanding the interaction patterns among these spike trains remains a key research area. A question of interest is the evaluation of information flow between neurons through the analysis of whether one spike train exerts causal influence on another. For continuous-valued time series data, Granger causality has proven an effective method for this purpose. However, the basis for Granger causality estimation is autoregressive data modeling, which is not directly applicable to spike trains. Various filtering options distort the properties of spike trains as point processes. Here we propose a new nonparametric approach to estimate Granger causality directly from the Fourier transforms of spike train data. We validate the method on synthetic spike trains generated by model networks of neurons with known connectivity patterns and then apply it to neurons simultaneously recorded from the thalamus and the primary somatosensory cortex of a squirrel monkey undergoing tactile stimulation.
Nonparametric Estimation of Distributions in Random Effects Models
Hart, Jeffrey D.
2011-01-01
We propose using minimum distance to obtain nonparametric estimates of the distributions of components in random effects models. A main setting considered is equivalent to having a large number of small datasets whose locations, and perhaps scales, vary randomly, but which otherwise have a common distribution. Interest focuses on estimating the distribution that is common to all datasets, knowledge of which is crucial in multiple testing problems where a location/scale invariant test is applied to every small dataset. A detailed algorithm for computing minimum distance estimates is proposed, and the usefulness of our methodology is illustrated by a simulation study and an analysis of microarray data. Supplemental materials for the article, including R-code and a dataset, are available online. © 2011 American Statistical Association.
Nonparametric estimation of stochastic differential equations with sparse Gaussian processes
García, Constantino A.; Otero, Abraham; Félix, Paulo; Presedo, Jesús; Márquez, David G.
2017-08-01
The application of stochastic differential equations (SDEs) to the analysis of temporal data has attracted increasing attention, due to their ability to describe complex dynamics with physically interpretable equations. In this paper, we introduce a nonparametric method for estimating the drift and diffusion terms of SDEs from a densely observed discrete time series. The use of Gaussian processes as priors permits working directly in a function-space view and thus the inference takes place directly in this space. To cope with the computational complexity that requires the use of Gaussian processes, a sparse Gaussian process approximation is provided. This approximation permits the efficient computation of predictions for the drift and diffusion terms by using a distribution over a small subset of pseudosamples. The proposed method has been validated using both simulated data and real data from economy and paleoclimatology. The application of the method to real data demonstrates its ability to capture the behavior of complex systems.
Nonparametric dark energy reconstruction from supernova data.
Holsclaw, Tracy; Alam, Ujjaini; Sansó, Bruno; Lee, Herbert; Heitmann, Katrin; Habib, Salman; Higdon, David
2010-12-10
Understanding the origin of the accelerated expansion of the Universe poses one of the greatest challenges in physics today. Lacking a compelling fundamental theory to test, observational efforts are targeted at a better characterization of the underlying cause. If a new form of mass-energy, dark energy, is driving the acceleration, the redshift evolution of the equation of state parameter w(z) will hold essential clues as to its origin. To best exploit data from observations it is necessary to develop a robust and accurate reconstruction approach, with controlled errors, for w(z). We introduce a new, nonparametric method for solving the associated statistical inverse problem based on Gaussian process modeling and Markov chain Monte Carlo sampling. Applying this method to recent supernova measurements, we reconstruct the continuous history of w out to redshift z=1.5.
Nonparametric k-nearest-neighbor entropy estimator.
Lombardi, Damiano; Pant, Sanjay
2016-01-01
A nonparametric k-nearest-neighbor-based entropy estimator is proposed. It improves on the classical Kozachenko-Leonenko estimator by considering nonuniform probability densities in the region of k-nearest neighbors around each sample point. It aims to improve the classical estimators in three situations: first, when the dimensionality of the random variable is large; second, when near-functional relationships leading to high correlation between components of the random variable are present; and third, when the marginal variances of random variable components vary significantly with respect to each other. Heuristics on the error of the proposed and classical estimators are presented. Finally, the proposed estimator is tested for a variety of distributions in successively increasing dimensions and in the presence of a near-functional relationship. Its performance is compared with a classical estimator, and a significant improvement is demonstrated.
Nonparametric estimation of location and scale parameters
Potgieter, C.J.
2012-12-01
Two random variables X and Y belong to the same location-scale family if there are constants μ and σ such that Y and μ+σX have the same distribution. In this paper we consider non-parametric estimation of the parameters μ and σ under minimal assumptions regarding the form of the distribution functions of X and Y. We discuss an approach to the estimation problem that is based on asymptotic likelihood considerations. Our results enable us to provide a methodology that can be implemented easily and which yields estimators that are often near optimal when compared to fully parametric methods. We evaluate the performance of the estimators in a series of Monte Carlo simulations. © 2012 Elsevier B.V. All rights reserved.
Nonparametric Maximum Entropy Estimation on Information Diagrams
Martin, Elliot A; Meinke, Alexander; Děchtěrenko, Filip; Davidsen, Jörn
2016-01-01
Maximum entropy estimation is of broad interest for inferring properties of systems across many different disciplines. In this work, we significantly extend a technique we previously introduced for estimating the maximum entropy of a set of random discrete variables when conditioning on bivariate mutual informations and univariate entropies. Specifically, we show how to apply the concept to continuous random variables and vastly expand the types of information-theoretic quantities one can condition on. This allows us to establish a number of significant advantages of our approach over existing ones. Not only does our method perform favorably in the undersampled regime, where existing methods fail, but it also can be dramatically less computationally expensive as the cardinality of the variables increases. In addition, we propose a nonparametric formulation of connected informations and give an illustrative example showing how this agrees with the existing parametric formulation in cases of interest. We furthe...
Nonparametric estimation of employee stock options
Institute of Scientific and Technical Information of China (English)
FU Qiang; LIU Li-an; LIU Qian
2006-01-01
We proposed a new model to price employee stock options (ESOs). The model is based on nonparametric statistical methods with market data. It incorporates the kernel estimator and employs a three-step method to modify BlackScholes formula. The model overcomes the limits of Black-Scholes formula in handling option prices with varied volatility. It disposes the effects of ESOs self-characteristics such as non-tradability, the longer term for expiration, the early exercise feature, the restriction on shorting selling and the employee's risk aversion on risk neutral pricing condition, and can be applied to ESOs valuation with the explanatory variable in no matter the certainty case or random case.
On Parametric (and Non-Parametric Variation
Directory of Open Access Journals (Sweden)
Neil Smith
2009-11-01
Full Text Available This article raises the issue of the correct characterization of ‘Parametric Variation’ in syntax and phonology. After specifying their theoretical commitments, the authors outline the relevant parts of the Principles–and–Parameters framework, and draw a three-way distinction among Universal Principles, Parameters, and Accidents. The core of the contribution then consists of an attempt to provide identity criteria for parametric, as opposed to non-parametric, variation. Parametric choices must be antecedently known, and it is suggested that they must also satisfy seven individually necessary and jointly sufficient criteria. These are that they be cognitively represented, systematic, dependent on the input, deterministic, discrete, mutually exclusive, and irreversible.
Armstrong, Mark
2008-01-01
This paper surveys recent economic research on price discrimination, both in monopoly and oligopoly markets. Topics include static and dynamic forms of price discrimination, and both final and input markets are considered. Potential antitrust aspects of price discrimination are highlighted throughout the paper. The paper argues that the informational requirements to make accurate policy are very great, and with most forms of price discrimination a laissez-faire policy may be the best availabl...