WorldWideScience

Sample records for richness multivariate analysis

  1. Robust multivariate analysis

    CERN Document Server

    Olive, David J

    2017-01-01

    This text presents methods that are robust to the assumption of a multivariate normal distribution or methods that are robust to certain types of outliers. Instead of using exact theory based on the multivariate normal distribution, the simpler and more applicable large sample theory is given. The text develops some of the first practical robust regression and robust multivariate location and dispersion estimators backed by theory. The robust techniques are illustrated for methods such as principal component analysis, canonical correlation analysis, and factor analysis. A simple way to bootstrap confidence regions is also provided. Much of the research on robust multivariate analysis in this book is being published for the first time. The text is suitable for a first course in Multivariate Statistical Analysis or a first course in Robust Statistics. This graduate text is also useful for people who are familiar with the traditional multivariate topics, but want to know more about handling data sets with...

  2. Multivariate analysis with LISREL

    CERN Document Server

    Jöreskog, Karl G; Wallentin, Fan Y

    2016-01-01

    This book traces the theory and methodology of multivariate statistical analysis and shows how it can be conducted in practice using the LISREL computer program. It presents not only the typical uses of LISREL, such as confirmatory factor analysis and structural equation models, but also several other multivariate analysis topics, including regression (univariate, multivariate, censored, logistic, and probit), generalized linear models, multilevel analysis, and principal component analysis. It provides numerous examples from several disciplines and discusses and interprets the results, illustrated with sections of output from the LISREL program, in the context of the example. The book is intended for masters and PhD students and researchers in the social, behavioral, economic and many other sciences who require a basic understanding of multivariate statistical theory and methods for their analysis of multivariate data. It can also be used as a textbook on various topics of multivariate statistical analysis.

  3. Methods of Multivariate Analysis

    CERN Document Server

    Rencher, Alvin C

    2012-01-01

    Praise for the Second Edition: "This book is a systematic, well-written, well-organized text on multivariate analysis packed with intuition and insight . . . There is much practical wisdom in this book that is hard to find elsewhere." (IIE Transactions) Filled with new and timely content, Methods of Multivariate Analysis, Third Edition provides examples and exercises based on more than sixty real data sets from a wide variety of scientific fields. It takes a "methods" approach to the subject, placing an emphasis on how students and practitioners can employ multivariate analysis in real-life sit

  4. Multivariate Analysis for the Processing of Signals

    Directory of Open Access Journals (Sweden)

    Beattie J.R.

    2014-01-01

    Real-world experiments are becoming increasingly complex, needing techniques capable of tracking this complexity. Signal-based measurements are often used to capture this complexity, where a signal is a record of a sample’s response to a parameter (e.g. time, displacement, voltage, wavelength) that is varied over a range of values. In signals the responses at each value of the varied parameter are related to each other, depending on the composition or state of the sample being measured. Since signals contain multiple information points, they have rich information content but are generally complex to comprehend. Multivariate Analysis (MA) has profoundly transformed their analysis by allowing gross simplification of the tangled web of variation. In addition, MA has provided the advantage of being much more robust to the influence of noise than univariate methods of analysis. In recent years, there has been a growing awareness that the nature of the multivariate methods allows exploitation of their benefits for purposes other than data analysis, such as pre-processing of signals with the aim of eliminating irrelevant variations prior to analysis of the signal of interest. It has been shown that exploiting multivariate data reduction in an appropriate way can allow high-fidelity denoising (removal of irreproducible non-signals), consistent and reproducible noise-insensitive correction of baseline distortions (removal of reproducible non-signals), accurate elimination of interfering signals (removal of reproducible but unwanted signals) and the standardisation of signal amplitude fluctuations. At present, the field is relatively small but the possibilities for much wider application are considerable. Where signal properties are suitable for MA (such as the signal being stationary along the x-axis), these signal-based corrections have the potential to be highly reproducible and highly adaptable, and are applicable in situations where the data is noisy or

  5. Multivariate analysis: models and method

    International Nuclear Information System (INIS)

    Sanz Perucha, J.

    1990-01-01

    Data treatment techniques are increasingly used as computer methods become more widely accessible. Multivariate analysis consists of a group of statistical methods that are applied to study objects or samples characterized by multiple values. A final goal is decision making. The paper describes the models and methods of multivariate analysis.

  6. Applied multivariate statistical analysis

    CERN Document Server

    Härdle, Wolfgang Karl

    2015-01-01

    Focusing on high-dimensional applications, this 4th edition presents the tools and concepts used in multivariate data analysis in a style that is also accessible for non-mathematicians and practitioners. It surveys the basic principles and emphasizes both exploratory and inferential statistics; a new chapter on Variable Selection (Lasso, SCAD and Elastic Net) has also been added. All chapters include practical exercises that highlight applications in different multivariate data analysis fields: in quantitative financial studies, where the joint dynamics of assets are observed; in medicine, where recorded observations of subjects in different locations form the basis for reliable diagnoses and medication; and in quantitative marketing, where consumers’ preferences are collected in order to construct models of consumer behavior. All of these examples involve high to ultra-high dimensions and represent a number of major fields in big data analysis. The fourth edition of this book on Applied Multivariate ...

  7. Multivariate meta-analysis: Potential and promise

    Science.gov (United States)

    Jackson, Dan; Riley, Richard; White, Ian R

    2011-01-01

    The multivariate random effects model is a generalization of the standard univariate model. Multivariate meta-analysis is becoming more commonly used and the techniques and related computer software, although continually under development, are now in place. In order to raise awareness of the multivariate methods, and discuss their advantages and disadvantages, we organized a one-day ‘Multivariate meta-analysis’ event at the Royal Statistical Society. In addition to disseminating the most recent developments, we also received an abundance of comments, concerns, insights, critiques and encouragement. This article provides a balanced account of the day's discourse. By giving others the opportunity to respond to our assessment, we hope to ensure that the various viewpoints and opinions are aired before multivariate meta-analysis simply becomes another widely used de facto method without any proper consideration of it by the medical statistics community. We describe the areas of application that multivariate meta-analysis has found, the methods available, the difficulties typically encountered and the arguments for and against the multivariate methods, using four representative but contrasting examples. We conclude that the multivariate methods can be useful, and in particular can provide estimates with better statistical properties, but also that these benefits come at the price of making more assumptions which do not result in better inference in every case. Although there is evidence that multivariate meta-analysis has considerable potential, it must be even more carefully applied than its univariate counterpart in practice. Copyright © 2011 John Wiley & Sons, Ltd. PMID:21268052
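For readers unfamiliar with the model named in the first sentence of this abstract, the usual textbook form of the multivariate random-effects meta-analysis model can be written as below. The notation is an assumption chosen here for illustration and is not taken from the article: study $i$ reports a vector $\mathbf{y}_i$ of $p$ effect estimates with within-study covariance matrix $\mathbf{S}_i$ (treated as known), $\boldsymbol{\mu}$ is the vector of average effects, and $\boldsymbol{\Sigma}$ is the between-study covariance matrix.

```latex
\begin{aligned}
\mathbf{y}_i \mid \boldsymbol{\theta}_i &\sim N_p\!\left(\boldsymbol{\theta}_i,\ \mathbf{S}_i\right), \\
\boldsymbol{\theta}_i &\sim N_p\!\left(\boldsymbol{\mu},\ \boldsymbol{\Sigma}\right), \\
\text{so that marginally}\quad \mathbf{y}_i &\sim N_p\!\left(\boldsymbol{\mu},\ \mathbf{S}_i + \boldsymbol{\Sigma}\right).
\end{aligned}
```

With $p = 1$ this reduces to the standard univariate random-effects model $y_i \sim N(\mu,\ s_i^2 + \tau^2)$, which is the sense in which the multivariate model generalizes the univariate one.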

  8. Multivariate analysis methods in physics

    International Nuclear Information System (INIS)

    Wolter, M.

    2007-01-01

    A review of multivariate methods based on statistical training is given. Several multivariate methods useful in high-energy physics analysis are discussed. Selected examples from current research in particle physics are discussed, both from the on-line trigger selection and from the off-line analysis. Statistical training methods are also presented and some new applications are suggested.

  9. Exploratory multivariate analysis by example using R

    CERN Document Server

    Husson, Francois; Pages, Jerome

    2010-01-01

    Full of real-world case studies and practical advice, Exploratory Multivariate Analysis by Example Using R focuses on four fundamental methods of multivariate exploratory data analysis that are most suitable for applications. It covers principal component analysis (PCA) when variables are quantitative, correspondence analysis (CA) and multiple correspondence analysis (MCA) when variables are categorical, and hierarchical cluster analysis. The authors take a geometric point of view that provides a unified vision for exploring multivariate data tables. Within this framework, they present the prin

  10. Multivariate survival analysis and competing risks

    CERN Document Server

    Crowder, Martin J

    2012-01-01

    Multivariate Survival Analysis and Competing Risks introduces univariate survival analysis and extends it to the multivariate case. It covers competing risks and counting processes and provides many real-world examples, exercises, and R code. The text discusses survival data, survival distributions, frailty models, parametric methods, multivariate data and distributions, copulas, continuous failure, parametric likelihood inference, and non- and semi-parametric methods. There are many books covering survival analysis, but very few that cover the multivariate case in any depth. Written for a graduate-level audience in statistics/biostatistics, this book includes practical exercises and R code for the examples. The author is renowned for his clear writing style, and this book continues that trend. It is an excellent reference for graduate students and researchers looking for grounding in this burgeoning field of research.

  11. Methods for statistical data analysis of multivariate observations

    CERN Document Server

    Gnanadesikan, R

    1997-01-01

    A practical guide for multivariate statistical techniques, now updated and revised. In recent years, innovations in computer technology and statistical methodologies have dramatically altered the landscape of multivariate data analysis. This new edition of Methods for Statistical Data Analysis of Multivariate Observations explores current multivariate concepts and techniques while retaining the same practical focus of its predecessor. It integrates methods and data-based interpretations relevant to multivariate analysis in a way that addresses real-world problems arising in many areas of inte

  12. Matrix-based introduction to multivariate data analysis

    CERN Document Server

    Adachi, Kohei

    2016-01-01

    This book enables readers who may not be familiar with matrices to understand a variety of multivariate analysis procedures in matrix forms. Another feature of the book is that it emphasizes what model underlies a procedure and what objective function is optimized for fitting the model to data. The author believes that the matrix-based learning of such models and objective functions is the fastest way to comprehend multivariate data analysis. The text is arranged so that readers can intuitively capture the purposes for which multivariate analysis procedures are utilized: plain explanations of the purposes with numerical examples precede mathematical descriptions in almost every chapter. This volume is appropriate for undergraduate students who already have studied introductory statistics. Graduate students and researchers who are not familiar with matrix-intensive formulations of multivariate data analysis will also find the book useful, as it is based on modern matrix formulations with a special emphasis on ...

  13. Multivariate Regression Analysis and Slaughter Livestock

    Science.gov (United States)

    (AGRICULTURE, *ECONOMICS), (*MEAT, PRODUCTION), MULTIVARIATE ANALYSIS, REGRESSION ANALYSIS, ANIMALS, WEIGHT, COSTS, PREDICTIONS, STABILITY, MATHEMATICAL MODELS, STORAGE, BEEF, PORK, FOOD, STATISTICAL DATA, ACCURACY

  14. Multivariate refined composite multiscale entropy analysis

    International Nuclear Information System (INIS)

    Humeau-Heurtier, Anne

    2016-01-01

    Multiscale entropy (MSE) has become a prevailing method to quantify signal complexity. MSE relies on sample entropy. However, MSE may yield imprecise complexity estimation at large scales, because sample entropy does not give a precise estimation of entropy when short signals are processed. A refined composite multiscale entropy (RCMSE) has therefore recently been proposed. Nevertheless, RCMSE is for univariate signals only. The simultaneous analysis of multi-channel (multivariate) data often outperforms studies based on univariate signals. We therefore introduce an extension of RCMSE to multivariate data. Applications of multivariate RCMSE to simulated processes reveal its better performance compared with the standard multivariate MSE. Highlights: • Multiscale entropy quantifies data complexity but may be inaccurate at large scales. • A refined composite multiscale entropy (RCMSE) has therefore recently been proposed. • Nevertheless, RCMSE is adapted to univariate time series only. • We herein introduce an extension of RCMSE to multivariate data. • It shows better performance than the standard multivariate multiscale entropy.
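As background to this abstract, the two building blocks it mentions (coarse-graining a signal and computing sample entropy at each scale) can be sketched as follows. This is a minimal illustration of the standard univariate MSE only, not the RCMSE or the multivariate extension proposed in the paper. The function names, the defaults m = 2 and tolerance 0.2 times the standard deviation, and the use of "less than or equal" for template matches are assumptions made here; conventions vary between implementations.

```python
import math
import statistics


def coarse_grain(x, scale):
    """Average consecutive non-overlapping windows of length `scale`."""
    n_windows = len(x) // scale
    return [sum(x[i * scale:(i + 1) * scale]) / scale for i in range(n_windows)]


def sample_entropy(x, m=2, r=0.2):
    """SampEn(m, r) = -ln(A / B), where B and A count pairs of templates of
    length m and m + 1 whose Chebyshev distance is at most r (assumed rule)."""
    n = len(x)

    def count_matches(length):
        # Use the same n - m starting points for both template lengths.
        templates = [x[i:i + length] for i in range(n - m)]
        matches = 0
        for i in range(len(templates)):
            for j in range(i + 1, len(templates)):
                dist = max(abs(a - b) for a, b in zip(templates[i], templates[j]))
                if dist <= r:
                    matches += 1
        return matches

    b = count_matches(m)
    a = count_matches(m + 1)
    if a == 0 or b == 0:
        # Too few matches: the estimate is undefined for short signals.
        return float("inf")
    return -math.log(a / b)


def multiscale_entropy(x, scales, m=2, r_factor=0.2):
    """Sample entropy of the coarse-grained signal at each scale.
    The tolerance is fixed from the original signal's standard deviation."""
    r = r_factor * statistics.pstdev(x)
    return [sample_entropy(coarse_grain(x, s), m, r) for s in scales]


# A perfectly periodic toy signal (made up for illustration).
periodic = [0.0, 1.0] * 20
mse_curve = multiscale_entropy(periodic, scales=[1, 2])
```

Because a coarse-grained series at scale s has only about 1/s as many points as the original, the match counts A and B shrink quickly at large scales. That is the short-signal imprecision the abstract refers to.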

  15. A MULTIVARIATE ANALYSIS OF CROATIAN COUNTIES ENTREPRENEURSHIP

    Directory of Open Access Journals (Sweden)

    Elza Jurun

    2012-12-01

    The focus of this paper is a multivariate analysis of entrepreneurship in Croatian counties. The complete database made available by official statistical institutions at national and regional level is used. Modern econometric methodology, ranging from comparative analysis via multiple regression to multivariate cluster analysis, is applied, together with an analysis of successful or inefficacious entrepreneurship measured by indicators of efficiency, profitability and productivity. The time horizons of the comparative analysis are 2004 and 2010. Accelerators of socio-economic development (number of entrepreneur investors, investment in fixed assets and current assets ratio) are analytically filtered in the multiple regression model from among twenty-six independent variables as the variables with dominant influence on GDP per capita in 2010, the dependent variable. Results of the multivariate cluster analysis of twenty-one Croatian counties are also interpreted in terms of the three Croatian NUTS 2 regions according to the European nomenclature of regional territorial division.

  16. Multivariate Methods for Meta-Analysis of Genetic Association Studies.

    Science.gov (United States)

    Dimou, Niki L; Pantavou, Katerina G; Braliou, Georgia G; Bagos, Pantelis G

    2018-01-01

    Multivariate meta-analysis of genetic association studies and genome-wide association studies has received remarkable attention as it improves the precision of the analysis. Here, we review, summarize and present in a unified framework methods for multivariate meta-analysis of genetic association studies and genome-wide association studies. Starting with the statistical methods used for robust analysis and genetic model selection, we present in brief univariate methods for meta-analysis and we then scrutinize multivariate methodologies. Multivariate models of meta-analysis for single gene-disease association studies, including models for haplotype association studies, multiple linked polymorphisms and multiple outcomes, are discussed. The popular Mendelian randomization approach and special cases of meta-analysis addressing issues such as the assumption of the mode of inheritance, deviation from Hardy-Weinberg Equilibrium and gene-environment interactions are also presented. All available methods are enriched with practical applications, and methodologies that could be developed in the future are discussed. Links for all available software implementing multivariate meta-analysis methods are also provided.

  17. Multivariate analysis of 2-DE protein patterns - Practical approaches

    DEFF Research Database (Denmark)

    Jacobsen, Charlotte; Jacobsen, Susanne; Grove, H.

    2007-01-01

    Practical approaches to the use of multivariate data analysis of 2-DE protein patterns are demonstrated by three independent strategies for the image analysis and the multivariate analysis on the same set of 2-DE data. Four wheat varieties were selected on the basis of their baking quality. Two of the varieties were of strong baking quality and hard wheat kernel and two were of weak baking quality and soft kernel. Gliadins at different stages of grain development were analyzed by the application of multivariate data analysis on images of 2-DEs. Patterns related to the wheat varieties, harvest times...

  18. Multivariate Analysis and Machine Learning in Cerebral Palsy Research.

    Science.gov (United States)

    Zhang, Jing

    2017-01-01

    Cerebral palsy (CP), a common pediatric movement disorder, causes the most severe physical disability in children. Early diagnosis in high-risk infants is critical for early intervention and possible early recovery. In recent years, multivariate analytic and machine learning (ML) approaches have been increasingly used in CP research. This paper aims to identify such multivariate studies and provide an overview of this relatively young field. Studies reviewed in this paper have demonstrated that multivariate analytic methods are useful in identification of risk factors, detection of CP, movement assessment for CP prediction, and outcome assessment, and ML approaches have made it possible to automatically identify movement impairments in high-risk infants. In addition, outcome predictors for surgical treatments have been identified by multivariate outcome studies. To make the multivariate and ML approaches useful in clinical settings, further research with large samples is needed to verify and improve these multivariate methods in risk factor identification, CP detection, movement assessment, and outcome evaluation or prediction. As multivariate analysis, ML and data processing technologies advance in the era of Big Data of this century, it is expected that multivariate analysis and ML will play a bigger role in improving the diagnosis and treatment of CP to reduce mortality and morbidity rates, and enhance patient care for children with CP.

  19. Multivariate statistical analysis: a high-dimensional approach

    CERN Document Server

    Serdobolskii, V

    2000-01-01

    In the last few decades the accumulation of large amounts of information in numerous applications has stimulated an increased interest in multivariate analysis. Computer technologies allow one to use multi-dimensional and multi-parametric models successfully. At the same time, an interest arose in statistical analysis with a deficiency of sample data. Nevertheless, it is difficult to describe the recent state of affairs in applied multivariate methods as satisfactory. Unimprovable (dominating) statistical procedures are still unknown except for a few specific cases. The simplest problem of estimating the mean vector with minimum quadratic risk is unsolved, even for normal distributions. Commonly used standard linear multivariate procedures based on the inversion of sample covariance matrices can lead to unstable results or provide no solution, depending on the data. Programs included in standard statistical packages cannot process 'multi-collinear data' and there are no theoretical recommen...

  20. Multivariate Analysis and Machine Learning in Cerebral Palsy Research

    Directory of Open Access Journals (Sweden)

    Jing Zhang

    2017-12-01

    Cerebral palsy (CP), a common pediatric movement disorder, causes the most severe physical disability in children. Early diagnosis in high-risk infants is critical for early intervention and possible early recovery. In recent years, multivariate analytic and machine learning (ML) approaches have been increasingly used in CP research. This paper aims to identify such multivariate studies and provide an overview of this relatively young field. Studies reviewed in this paper have demonstrated that multivariate analytic methods are useful in identification of risk factors, detection of CP, movement assessment for CP prediction, and outcome assessment, and ML approaches have made it possible to automatically identify movement impairments in high-risk infants. In addition, outcome predictors for surgical treatments have been identified by multivariate outcome studies. To make the multivariate and ML approaches useful in clinical settings, further research with large samples is needed to verify and improve these multivariate methods in risk factor identification, CP detection, movement assessment, and outcome evaluation or prediction. As multivariate analysis, ML and data processing technologies advance in the era of Big Data of this century, it is expected that multivariate analysis and ML will play a bigger role in improving the diagnosis and treatment of CP to reduce mortality and morbidity rates, and enhance patient care for children with CP.

  1. EXPLORATORY DATA ANALYSIS AND MULTIVARIATE STRATEGIES FOR REVEALING MULTIVARIATE STRUCTURES IN CLIMATE DATA

    Directory of Open Access Journals (Sweden)

    2016-12-01

    This paper is on data analysis strategy in a complex, multidimensional, and dynamic domain. The focus is on the use of data mining techniques to explore the importance of multivariate structures, using climate variables which influence climate change. Techniques involved in a data mining exercise vary according to the data structures. The multivariate analysis strategy considered here involved choosing an appropriate tool to analyze a process. Factor analysis is introduced into the data mining technique in order to reveal the influencing impacts of the factors involved as well as to resolve the multicollinearity effect among the variables. The temporal nature and multidimensionality of the target variables are revealed in the model using multidimensional regression estimates. The strategy of integrating several statistical techniques, using climate variables in Nigeria, was employed. An R² of 0.518 was obtained from the ordinary least squares regression analysis carried out, and the test was not significant at the 5% level of significance. However, the factor analysis regression strategy gave a good fit with an R² of 0.811, and the test was significant at the 5% level of significance. Based on this study, model building should go beyond the usual confirmatory data analysis (CDA); rather, it should be complemented with exploratory data analysis (EDA) in order to achieve a desired result.

  2. imDEV: a graphical user interface to R multivariate analysis tools in Microsoft Excel.

    Science.gov (United States)

    Grapov, Dmitry; Newman, John W

    2012-09-01

    Interactive modules for Data Exploration and Visualization (imDEV) is a Microsoft Excel spreadsheet embedded application providing an integrated environment for the analysis of omics data through a user-friendly interface. Individual modules enable interactive and dynamic analyses of large data by interfacing R's multivariate statistics and highly customizable visualizations with the spreadsheet environment, aiding robust inferences and generating information-rich data visualizations. This tool provides access to multiple comparisons with false discovery correction, hierarchical clustering, principal and independent component analyses, partial least squares regression and discriminant analysis, through an intuitive interface for creating high-quality two- and three-dimensional visualizations including scatter plot matrices, distribution plots, dendrograms, heat maps, biplots, trellis biplots and correlation networks. Freely available for download at http://sourceforge.net/projects/imdev/. Implemented in R and VBA and supported by Microsoft Excel (2003, 2007 and 2010).

  3. Multivariate Analysis of Schools and Educational Policy.

    Science.gov (United States)

    Kiesling, Herbert J.

    This report describes a multivariate analysis technique that approaches the problems of educational production function analysis by (1) using comparable measures of output across large experiments, (2) accounting systematically for differences in socioeconomic background, and (3) treating the school as a complete system in which different…

  4. Bayesian inference for multivariate meta-analysis Box-Cox transformation models for individual patient data with applications to evaluation of cholesterol-lowering drugs.

    Science.gov (United States)

    Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G; Shah, Arvind K; Lin, Jianxin

    2013-10-15

    In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the deviance information criterion is used to select the best transformation model. Because the model is quite complex, we develop a novel Monte Carlo Markov chain sampling scheme to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol-lowering drugs where the goal is to jointly model the three-dimensional response consisting of low density lipoprotein cholesterol (LDL-C), high density lipoprotein cholesterol (HDL-C), and triglycerides (TG) (LDL-C, HDL-C, TG). Because the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately; however, a multivariate approach would be more appropriate because these variables are correlated with each other. We carry out a detailed analysis of these data by using the proposed methodology. Copyright © 2013 John Wiley & Sons, Ltd.
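To make the transformation named in this abstract concrete, the sketch below applies the standard one-parameter Box-Cox transform with a separate parameter for each response. It only illustrates the transform itself and assumes nothing about the authors' Bayesian model or sampler. The function names, the lipid values and the lambda values are all made up here for illustration.

```python
import math


def box_cox(y, lam):
    """Standard Box-Cox transform of a positive value y:
    (y**lam - 1) / lam for lam != 0, and log(y) for lam == 0."""
    if y <= 0:
        raise ValueError("Box-Cox requires y > 0")
    if abs(lam) < 1e-12:
        return math.log(y)
    return (y ** lam - 1.0) / lam


def transform_responses(rows, lambdas):
    """Apply a separate lambda to each response column, mirroring the idea
    that each response is allowed its own transformation."""
    return [[box_cox(v, lam) for v, lam in zip(row, lambdas)] for row in rows]


# Hypothetical (LDL-C, HDL-C, TG) values in mg/dL, invented for illustration.
rows = [(130.0, 45.0, 150.0), (160.0, 38.0, 310.0)]
# Assumed lambdas, not estimates from the paper: identity-like, square-root-like, log.
lambdas = (1.0, 0.5, 0.0)
transformed = transform_responses(rows, lambdas)
```

In the paper the transformation parameters are given priors and estimated jointly with the regression coefficients; here they are simply fixed by hand.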

  5. Bayesian inference for multivariate meta-analysis Box-Cox transformation models for individual patient data with applications to evaluation of cholesterol lowering drugs

    Science.gov (United States)

    Kim, Sungduk; Chen, Ming-Hui; Ibrahim, Joseph G.; Shah, Arvind K.; Lin, Jianxin

    2013-01-01

    In this paper, we propose a class of Box-Cox transformation regression models with multidimensional random effects for analyzing multivariate responses for individual patient data (IPD) in meta-analysis. Our modeling formulation uses a multivariate normal response meta-analysis model with multivariate random effects, in which each response is allowed to have its own Box-Cox transformation. Prior distributions are specified for the Box-Cox transformation parameters as well as the regression coefficients in this complex model, and the Deviance Information Criterion (DIC) is used to select the best transformation model. Since the model is quite complex, a novel Monte Carlo Markov chain (MCMC) sampling scheme is developed to sample from the joint posterior of the parameters. This model is motivated by a very rich dataset comprising 26 clinical trials involving cholesterol lowering drugs where the goal is to jointly model the three dimensional response consisting of Low Density Lipoprotein Cholesterol (LDL-C), High Density Lipoprotein Cholesterol (HDL-C), and Triglycerides (TG) (LDL-C, HDL-C, TG). Since the joint distribution of (LDL-C, HDL-C, TG) is not multivariate normal and in fact quite skewed, a Box-Cox transformation is needed to achieve normality. In the clinical literature, these three variables are usually analyzed univariately; however, a multivariate approach would be more appropriate since these variables are correlated with each other. A detailed analysis of these data is carried out using the proposed methodology. PMID:23580436

  6. PIXE-quantified AXSIA: Elemental mapping by multivariate spectral analysis

    International Nuclear Information System (INIS)

    Doyle, B.L.; Provencio, P.P.; Kotula, P.G.; Antolak, A.J.; Ryan, C.G.; Campbell, J.L.; Barrett, K.

    2006-01-01

    Automated, nonbiased, multivariate statistical analysis techniques are useful for converting very large amounts of data into a smaller, more manageable number of chemical components (spectra and images) that are needed to describe the measurement. We report the first use of the multivariate spectral analysis program AXSIA (Automated eXpert Spectral Image Analysis) developed at Sandia National Laboratories to quantitatively analyze micro-PIXE data maps. AXSIA implements a multivariate curve resolution technique that reduces the spectral image data sets into a limited number of physically realizable and easily interpretable components (including both spectra and images). We show that the principal component spectra can be further analyzed using conventional PIXE programs to convert the weighting images into quantitative concentration maps. A common elemental data set has been analyzed using three different PIXE analysis codes and the results compared to the cases when each of these codes is used to separately analyze the associated AXSIA principal component spectral data. We find that these comparisons are in good quantitative agreement with each other.

  7. The analysis of multivariate group differences using common principal components

    NARCIS (Netherlands)

    Bechger, T.M.; Blanca, M.J.; Maris, G.

    2014-01-01

    Although it is simple to determine whether multivariate group differences are statistically significant or not, such differences are often difficult to interpret. This article is about common principal components analysis as a tool for the exploratory investigation of multivariate group differences.

  8. Particulate characterization by PIXE multivariate spectral analysis

    International Nuclear Information System (INIS)

    Antolak, Arlyn J.; Morse, Daniel H.; Grant, Patrick G.; Kotula, Paul G.; Doyle, Barney L.; Richardson, Charles B.

    2007-01-01

    Obtaining particulate compositional maps from scanned PIXE (proton-induced X-ray emission) measurements is extremely difficult due to the complexity of analyzing spectroscopic data collected with low signal-to-noise at each scan point (pixel). Multivariate spectral analysis has the potential to analyze such data sets by reducing the PIXE data to a limited number of physically realizable and easily interpretable components (that include both spectral and image information). We have adapted the AXSIA (automated expert spectral image analysis) program, originally developed by Sandia National Laboratories to quantify electron-excited X-ray spectroscopy data, for this purpose. Samples consisting of particulates with known compositions and sizes were loaded onto Mylar and paper filter substrates and analyzed by scanned micro-PIXE. The data sets were processed by AXSIA and the associated principal component spectral data were quantified by converting the weighting images into concentration maps. The results indicate automated, nonbiased, multivariate statistical analysis is useful for converting very large amounts of data into a smaller, more manageable number of compositional components needed for locating individual particles-of-interest on large area collection media.

  9. Determination of wheat quality by mass spectrometry and multivariate data analysis

    DEFF Research Database (Denmark)

    Gottlieb, D.M.; Schultz, J.; Petersen, M.

    2002-01-01

    Multivariate analysis has been applied as support to proteome analysis in order to implement an easier and faster way of data handling based on separation by matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry. The characterisation phase in proteome analysis by means of simple visual inspection is a demanding process and also unreliable because subjectivity is the controlling element. Multivariate analysis offers, to a considerable extent, objectivity and must therefore be regarded as a neutral way to evaluate results obtained by proteome analysis. Proteome analysis...

  10. Multivariate analysis of attenuated total reflection Fourier transform infrared (ATR FT-IR) spectroscopic data to confirm phase partitioning in methacrylate-based dentin adhesive.

    Science.gov (United States)

    Ye, Qiang; Parthasarathy, Ranganathan; Abedin, Farhana; Laurence, Jennifer S; Misra, Anil; Spencer, Paulette

    2013-12-01

    Water is ubiquitous in the mouths of healthy individuals and is a major interfering factor in the development of a durable seal between the tooth and composite restoration. Water leads to the formation of a variety of defects in dentin adhesives; these defects undermine the tooth-composite bond. Our group recently analyzed phase partitioning of dentin adhesives using high-performance liquid chromatography (HPLC). The concentration measurements provided by HPLC offered a more thorough representation of current adhesive performance and elucidated directions to be taken for further improvement. The sample preparation and instrument analysis using HPLC are, however, time-consuming and labor-intensive. The objective of this work was to develop a methodology for rapid, reliable, and accurate quantitative analysis of near-equilibrium phase partitioning in adhesives exposed to conditions simulating the wet oral environment. Fourier transform infrared (FT-IR) spectroscopy, in combination with multivariate statistical methods including partial least squares (PLS) regression and principal component regression (PCR), was used for multivariate calibration to quantify the compositions in separated phases. Excellent predictions were achieved when either the hydrophobic-rich phase or the hydrophilic-rich phase mixtures were analyzed. These results indicate that FT-IR spectroscopy has excellent potential as a rapid method of detection and quantification of dentin adhesives that experience phase separation under conditions that simulate the wet oral environment.

  11. Multivariate Analysis of Industrial Scale Fermentation Data

    DEFF Research Database (Denmark)

    Mears, Lisa; Nørregård, Rasmus; Stocks, Stuart M.

    2015-01-01

    Multivariate analysis allows process understanding to be gained from the vast and complex datasets recorded from fermentation processes; however, the application of such techniques to this field can be limited by the data pre-processing requirements and data handling. In this work many iterations...

  12. Multivariate spectral-analysis of movement-related EEG data

    International Nuclear Information System (INIS)

    Andrew, C. M.

    1997-01-01

    The univariate method of event-related desynchronization (ERD) analysis, which quantifies the temporal evolution of power within specific frequency bands from electroencephalographic (EEG) data recorded during a task or event, is extended to an event-related multivariate spectral analysis method. With this method, time courses of cross-spectra, phase spectra, coherence spectra, band-averaged coherence values (event-related coherence, ERCoh), partial power spectra and partial coherence spectra are estimated from an ensemble of multivariate event-related EEG trials. This provides a means of investigating relationships between EEG signals recorded over different scalp areas during the performance of a task or the occurrence of an event. The multivariate spectral analysis method is applied to EEG data recorded during three different movement-related studies involving discrete right index finger movements. The first study investigates the impact of the EEG derivation type on the temporal evolution of interhemispheric coherence between activity recorded at electrodes overlying the left and right sensorimotor hand areas during cued finger movement. This raises the question of whether changes in coherence necessarily reflect changes in functional coupling of the cortical structures underlying the recording electrodes. In the second study, the method is applied to data recorded during voluntary finger movement, and a hypothesis, based on an existing global/local model of neocortical dynamics, is formulated to explain the coherence results. The third study applies partial spectral analysis to, and investigates phase relationships of, movement-related data recorded from a full head montage, thereby providing further results strengthening the global/local hypothesis. (author)

  13. Essentials of multivariate data analysis

    CERN Document Server

    Spencer, Neil H

    2013-01-01

    "… this text provides an overview at an introductory level of several methods in multivariate data analysis. It contains in-depth examples from one data set woven throughout the text, and a free [Excel] Add-In to perform the analyses in Excel, with step-by-step instructions provided for each technique. … could be used as a text (possibly supplemental) for courses in other fields where researchers wish to apply these methods without delving too deeply into the underlying statistics." - The American Statistician, February 2015

  14. Principal Feature Analysis: A Multivariate Feature Selection Method for fMRI Data

    Directory of Open Access Journals (Sweden)

    Lijun Wang

    2013-01-01

    Brain decoding with functional magnetic resonance imaging (fMRI) requires analysis of complex, multivariate data. Multivoxel pattern analysis (MVPA) has been widely used in recent years. MVPA treats the activation of multiple voxels from fMRI data as a pattern and decodes brain states using pattern classification methods. Feature selection is a critical procedure of MVPA because it decides which features will be included in the classification analysis of fMRI data, thereby improving the performance of the classifier. Features can be selected by limiting the analysis to specific anatomical regions or by computing univariate (voxel-wise) or multivariate statistics. However, these methods either discard some informative features or select features with redundant information. This paper introduces principal feature analysis as a novel multivariate feature selection method for fMRI data processing. This multivariate approach aims to remove features with redundant information, thereby selecting fewer features while retaining the most information.

  15. Multivariate Meta-Analysis Using Individual Participant Data

    Science.gov (United States)

    Riley, R. D.; Price, M. J.; Jackson, D.; Wardle, M.; Gueyffier, F.; Wang, J.; Staessen, J. A.; White, I. R.

    2015-01-01

    When combining results across related studies, a multivariate meta-analysis allows the joint synthesis of correlated effect estimates from multiple outcomes. Joint synthesis can improve efficiency over separate univariate syntheses, may reduce selective outcome reporting biases, and enables joint inferences across the outcomes. A common issue is…

  16. TMVA(Toolkit for Multivariate Analysis) new architectures design and implementation.

    CERN Document Server

    Zapata Mesa, Omar Andres

    2016-01-01

    Toolkit for Multivariate Analysis (TMVA) is a package in ROOT providing machine learning algorithms for classification and regression of events in detectors. In TMVA, we are developing new high-level algorithms to perform multivariate analysis, such as cross validation, hyperparameter optimization, variable importance, etc. Almost all the algorithms are expensive and designed to process a huge amount of data. It is very important to implement the new technologies on parallel computing to reduce the processing times.

  17. Robust methods for multivariate data analysis

    DEFF Research Database (Denmark)

    Frosch, Stina; Von Frese, J.; Bro, Rasmus

    2005-01-01

    Outliers may hamper proper classical multivariate analysis, and lead to incorrect conclusions. To remedy the problem of outliers, robust methods are developed in statistics and chemometrics. Robust methods reduce or remove the effect of outlying data points and allow the "good" data to primarily determine the result. This article reviews the most commonly used robust multivariate regression and exploratory methods that have appeared since 1996 in the field of chemometrics. Special emphasis is put on the robust versions of chemometric standard tools like PCA and PLS and the corresponding robust...

  18. Multivariate Generalized Multiscale Entropy Analysis

    Directory of Open Access Journals (Sweden)

    Anne Humeau-Heurtier

    2016-11-01

    Multiscale entropy (MSE) was introduced in the 2000s to quantify systems' complexity. MSE relies on (i) a coarse-graining procedure to derive a set of time series representing the system dynamics on different time scales; (ii) the computation of the sample entropy for each coarse-grained time series. A refined composite MSE (rcMSE), based on the same steps as MSE, also exists. Compared to MSE, rcMSE increases the accuracy of entropy estimation and reduces the probability of inducing undefined entropy for short time series. The multivariate versions of MSE (MMSE) and rcMSE (MrcMSE) have also been introduced. In the coarse-graining step used in MSE, rcMSE, MMSE, and MrcMSE, the mean value is used to derive representations of the original data at different resolutions. A generalization of MSE was recently published, using the computation of different moments in the coarse-graining procedure. However, so far, this generalization only exists for univariate signals. We therefore herein propose an extension of this generalized MSE to multivariate data. The multivariate generalized algorithms of MMSE and MrcMSE presented herein (MGMSE and MGrcMSE, respectively) are first analyzed through the processing of synthetic signals. We reveal that MGrcMSE shows better performance than MGMSE for short multivariate data. We then study the performance of MGrcMSE on two sets of short multivariate electroencephalograms (EEG) available in the public domain. We report that MGrcMSE may show better performance than MrcMSE in distinguishing different types of multivariate EEG data. MGrcMSE could therefore supplement MMSE or MrcMSE in the processing of multivariate datasets.
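
    The coarse-graining step described in this abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the function names, the use of non-overlapping windows, and the choice of the population variance as the alternative moment are assumptions made here for illustration, and the sample-entropy step is not shown.

```python
from statistics import mean, pvariance


def coarse_grain(series, scale, moment="mean"):
    """Summarise non-overlapping windows of length `scale`.

    moment="mean" mirrors the classic MSE coarse-graining;
    moment="variance" is one possible generalized choice (an assumption
    here, using the population variance). Trailing samples that do not
    fill a complete window are dropped.
    """
    if scale < 1:
        raise ValueError("scale must be >= 1")
    if moment == "mean":
        summarise = mean
    elif moment == "variance":
        summarise = pvariance
    else:
        raise ValueError("moment must be 'mean' or 'variance'")
    n_windows = len(series) // scale
    return [summarise(series[i * scale:(i + 1) * scale])
            for i in range(n_windows)]


def coarse_grain_multivariate(channels, scale, moment="mean"):
    """Apply the same coarse-graining to every channel of a multivariate signal."""
    return [coarse_grain(channel, scale, moment) for channel in channels]
```

    For example, `coarse_grain([1, 2, 3, 4, 5, 6], 2)` averages the pairs (1, 2), (3, 4) and (5, 6). Note that a variance-based coarse-graining at scale 1 yields all zeros, so in this sketch it is only informative for scales of 2 or more.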

  19. Multivariate calibration applied to the quantitative analysis of infrared spectra

    Energy Technology Data Exchange (ETDEWEB)

    Haaland, D.M.

    1991-01-01

    Multivariate calibration methods are very useful for improving the precision, accuracy, and reliability of quantitative spectral analyses. Spectroscopists can more effectively use these sophisticated statistical tools if they have a qualitative understanding of the techniques involved. A qualitative picture of the factor analysis multivariate calibration methods of partial least squares (PLS) and principal component regression (PCR) is presented using infrared calibrations based upon spectra of phosphosilicate glass thin films on silicon wafers. Comparisons of the relative prediction abilities of four different multivariate calibration methods are given based on Monte Carlo simulations of spectral calibration and prediction data. The success of multivariate spectral calibrations is demonstrated for several quantitative infrared studies. The infrared absorption and emission spectra of thin-film dielectrics used in the manufacture of microelectronic devices demonstrate rapid, nondestructive at-line and in-situ analyses using PLS calibrations. Finally, the application of multivariate spectral calibrations to reagentless analysis of blood is presented. We have found that the determination of glucose in whole blood taken from diabetics can be precisely monitored from the PLS calibration of either mid- or near-infrared spectra of the blood. Progress toward the non-invasive determination of glucose levels in diabetics is an ultimate goal of this research. 13 refs., 4 figs.

  20. Multivariate Meta-Analysis of Genetic Association Studies: A Simulation Study.

    Directory of Open Access Journals (Sweden)

    Binod Neupane

    In a meta-analysis with multiple end points of interest that are correlated between or within studies, a multivariate approach to meta-analysis has the potential to produce more precise estimates of effects by exploiting the correlation structure between end points. However, under the random-effects assumption, multivariate estimation is more complex (as it involves estimation of more parameters simultaneously) than univariate estimation, and can sometimes produce unrealistic parameter estimates. The usefulness of the multivariate approach to meta-analysis of the effects of a genetic variant on two or more correlated traits is not well understood in the area of genetic association studies. In such studies, genetic variants are expected to roughly maintain Hardy-Weinberg equilibrium within studies, and their effects on complex traits are generally very small to modest and could be heterogeneous across studies for genuine reasons. We carried out extensive simulation to explore the comparative performance of the multivariate approach with the most commonly used univariate inverse-variance weighted approach under the random-effects assumption in various realistic meta-analytic scenarios of genetic association studies of correlated end points. We evaluated the performance with respect to relative mean bias percentage and root mean square error (RMSE) of the estimate, and coverage probability of the corresponding 95% confidence interval of the effect for each end point. Our simulation results suggest that the multivariate approach performs similarly to or better than the univariate method when correlations between end points within or between studies are at least moderate and between-study variation is similar to or larger than average within-study variation for meta-analyses of 10 or more genetic studies. The multivariate approach produces estimates with smaller bias and RMSE, especially for the end point that has randomly or informatively missing summary data in some individual studies, when

  1. A "Model" Multivariable Calculus Course.

    Science.gov (United States)

    Beckmann, Charlene E.; Schlicker, Steven J.

    1999-01-01

    Describes a rich, investigative approach to multivariable calculus. Introduces a project in which students construct physical models of surfaces that represent real-life applications of their choice. The models, along with student-selected datasets, serve as vehicles to study most of the concepts of the course from both continuous and discrete…

  2. Power Estimation in Multivariate Analysis of Variance

    Directory of Open Access Journals (Sweden)

    Jean François Allaire

    2007-09-01

    Power is often overlooked in designing multivariate studies for the simple reason that it is believed to be too complicated. In this paper, it is shown that power estimation in multivariate analysis of variance (MANOVA) can be approximated using an F distribution for the three popular statistics (Hotelling-Lawley trace, Pillai-Bartlett trace, Wilks' likelihood ratio). Consequently, the same procedure, as in any statistical test, can be used: computation of the critical F value, computation of the noncentral parameter (as a function of the effect size) and finally estimation of power using a noncentral F distribution. Various numerical examples are provided which help to understand and to apply the method. Problems related to post hoc power estimation are discussed.
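
    The three-step procedure named in this abstract can be written compactly as follows. This is only a generic sketch: the symbols are assumptions introduced here, and the abstract does not give the degrees of freedom or the noncentrality parameter for each of the three statistics, so they are left unspecified.

```latex
% Assumed notation: \alpha is the significance level, \nu_1 and \nu_2 are the
% numerator and denominator degrees of freedom of the approximating F statistic,
% \lambda is the noncentrality parameter (a function of the effect size),
% F^{-1}_{\nu_1,\nu_2} is the quantile function of the central F distribution,
% and G_{\nu_1,\nu_2,\lambda} is the CDF of the noncentral F distribution.
\begin{align}
  F_{\mathrm{crit}} &= F^{-1}_{\nu_1,\nu_2}(1-\alpha), \\
  \text{power} &= \Pr\!\left[ F'_{\nu_1,\nu_2}(\lambda) > F_{\mathrm{crit}} \right]
               = 1 - G_{\nu_1,\nu_2,\lambda}\!\left(F_{\mathrm{crit}}\right).
\end{align}
```

    With a noncentrality parameter of zero the noncentral distribution reduces to the central one, so the expression returns the significance level itself, which is a convenient sanity check.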

  3. Some developments in multivariate image analysis

    DEFF Research Database (Denmark)

    Kucheryavskiy, Sergey

    Multivariate image analysis (MIA), one of the successful chemometric applications, is now widely used in different areas of science and industry. Introduced in the late 1980s, it has become very popular with hyperspectral imaging, where MIA is one of the most efficient tools for exploratory analysis... be up to several million. The main MIA tool for exploratory analysis is the score density plot: all pixels are projected into principal component space and, on the corresponding score plots, are colorized according to their density (how many pixels are crowded in the unit area of the plot). Looking for and analyzing patterns on these plots and the original image allows one to perform interactive analysis, extract hidden information, build a supervised classification model, and much more. In the present work several alternative methods to original principal component analysis (PCA) for building the projection...

  4. A comparison between multivariate and bivariate analysis used in marketing research

    Directory of Open Access Journals (Sweden)

    Constantin, C.

    2012-01-01

    This paper reports instrumental research conducted to compare the information given by two multivariate data analysis methods with that given by the usual bivariate analysis. The outcomes of the research reveal that sometimes the multivariate methods use more information from a certain variable, but sometimes they use only the part of the information considered the most important for certain associations. For this reason, a researcher should use both categories of data analysis in order to obtain entirely useful information.

  5. Multivariate Volatility Impulse Response Analysis of GFC News Events

    NARCIS (Netherlands)

    D.E. Allen (David); M.J. McAleer (Michael); R.J. Powell (Robert)

    2015-01-01

    This paper applies the Hafner and Herwartz (2006) (hereafter HH) approach to the analysis of multivariate GARCH models using volatility impulse response analysis. The data set features ten years of daily returns series for the New York Stock Exchange Index and the

  6. imDEV: a graphical user interface to R multivariate analysis tools in Microsoft Excel

    Science.gov (United States)

    Grapov, Dmitry; Newman, John W.

    2012-01-01

    Summary: Interactive modules for Data Exploration and Visualization (imDEV) is a Microsoft Excel spreadsheet embedded application providing an integrated environment for the analysis of omics data through a user-friendly interface. Individual modules enable interactive and dynamic analyses of large data by interfacing R's multivariate statistics and highly customizable visualizations with the spreadsheet environment, aiding robust inferences and generating information-rich data visualizations. This tool provides access to multiple comparisons with false discovery correction, hierarchical clustering, principal and independent component analyses, partial least squares regression and discriminant analysis, through an intuitive interface for creating high-quality two- and three-dimensional visualizations including scatter plot matrices, distribution plots, dendrograms, heat maps, biplots, trellis biplots and correlation networks. Availability and implementation: Freely available for download at http://sourceforge.net/projects/imdev/. Implemented in R and VBA and supported by Microsoft Excel (2003, 2007 and 2010). Contact: John.Newman@ars.usda.gov Supplementary Information: Installation instructions, tutorials and user's manual are available at http://sourceforge.net/projects/imdev/. PMID:22815358

  7. Multivariant design and multiple criteria analysis of building refurbishments

    Energy Technology Data Exchange (ETDEWEB)

    Kaklauskas, A.; Zavadskas, E. K.; Raslanas, S. [Faculty of Civil Engineering, Vilnius Gediminas Technical University, Vilnius (Lithuania)

    2005-07-01

    In order to design and realize an efficient building refurbishment, it is necessary to carry out an exhaustive investigation of all solutions that form it. The efficiency level of the considered building's refurbishment depends on a great many factors, including: cost of refurbishment, annual fuel economy after refurbishment, tentative pay-back time, harmfulness to health of the materials used, aesthetics, maintenance properties, functionality, comfort, sound insulation and longevity, etc. Solutions of an alternative character allow for a more rational and realistic assessment of economic, ecological, legislative, climatic, social and political conditions and traditions, and for better satisfaction of customer requirements. They also enable one to cut down on refurbishment costs. In carrying out the multivariant design and multiple criteria analysis of a building refurbishment, much data was processed and evaluated. Feasible alternatives could be as many as 100,000. How to perform a multivariant design and multiple criteria analysis of alternatives based on this enormous amount of information became the problem. A method of multivariant design and multiple criteria analysis of a building refurbishment was developed by the authors to solve the above problems. In order to demonstrate the developed method, a practical example is presented in this paper. (author)

  8. Data classification and MTBF prediction with a multivariate analysis approach

    International Nuclear Information System (INIS)

    Braglia, Marcello; Carmignani, Gionata; Frosolini, Marco; Zammori, Francesco

    2012-01-01

    The paper presents a multivariate statistical approach that supports the classification of mechanical components, subjected to specific operating conditions, in terms of the Mean Time Between Failure (MTBF). Assessing the influence of working conditions and/or environmental factors on the MTBF is a prerequisite for the development of an effective preventive maintenance plan. However, this task may be demanding and it is generally performed with ad-hoc experimental methods lacking statistical rigor. To solve this common problem, a step-by-step multivariate data classification technique is proposed. Specifically, a set of structured failure data is classified in a meaningful way by means of: (i) cluster analysis, (ii) multivariate analysis of variance, (iii) feature extraction and (iv) predictive discriminant analysis. This makes it possible not only to define the MTBF of the analyzed components, but also to identify the working parameters that explain most of the variability of the observed data. The approach is finally demonstrated on 126 centrifugal pumps installed in an oil refinery plant; the results obtained demonstrate the quality of the final discrimination, in terms of data classification and failure prediction.

  9. Multivariate Volatility Impulse Response Analysis of GFC News Events

    NARCIS (Netherlands)

    D.E. Allen (David); M.J. McAleer (Michael); R.J. Powell (Robert); A.K. Singh (Abhay)

    2015-01-01

    This paper applies the Hafner and Herwartz (2006) (hereafter HH) approach to the analysis of multivariate GARCH models using volatility impulse response analysis. The data set features ten years of daily returns series for the New York Stock Exchange Index and the FTSE 100 index from the

  10. A Study of Effects of MultiCollinearity in the Multivariable Analysis.

    Science.gov (United States)

    Yoo, Wonsuk; Mayberry, Robert; Bae, Sejong; Singh, Karan; Peter He, Qinghua; Lillard, James W

    2014-10-01

    A multivariable analysis is the most popular approach when investigating associations between risk factors and disease. However, the efficiency of multivariable analysis highly depends on the correlation structure among predictive variables. When the covariates in the model are not independent of one another, collinearity/multicollinearity problems arise in the analysis, which leads to biased estimation. This work aims to perform a simulation study with various scenarios of different collinearity structures to investigate the effects of collinearity under various correlation structures amongst predictive and explanatory variables and to compare these results with existing guidelines to decide harmful collinearity. Three correlation scenarios among predictor variables are considered: (1) bivariate collinear structure as the simplest collinearity case, (2) multivariate collinear structure where an explanatory variable is correlated with two other covariates, (3) a more realistic scenario when an independent variable can be expressed by various functions including the other variables.
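
    As an illustration of the simplest scenario above (two collinear predictors), the following Python sketch computes the Pearson correlation between two predictors and the corresponding variance inflation factor (VIF). The VIF is a widely used collinearity diagnostic, but its use here is an assumption: the abstract does not say which guidelines or diagnostics the authors examined, and the function names are hypothetical.

```python
import math


def pearson_r(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    if sxx == 0 or syy == 0:
        raise ValueError("a constant sequence has no defined correlation")
    return sxy / math.sqrt(sxx * syy)


def vif_two_predictors(x1, x2):
    """VIF when a model contains exactly two predictors.

    With two predictors, the R^2 from regressing one on the other equals
    the squared Pearson correlation, so VIF = 1 / (1 - r^2).
    Perfectly collinear predictors give an infinite VIF.
    """
    r = pearson_r(x1, x2)
    tolerance = 1.0 - r * r
    return math.inf if tolerance <= 0 else 1.0 / tolerance
```

    Uncorrelated predictors give a VIF of 1, and the value grows without bound as the correlation approaches plus or minus 1; with more than two predictors the squared correlation would have to be replaced by the R-squared from regressing each predictor on all the others.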

  11. Multivariate data analysis

    DEFF Research Database (Denmark)

    Hansen, Michael Adsetts Edberg

    Interest in statistical methodology is increasing so rapidly in the astronomical community that accessible introductory material in this area is long overdue. This book fills the gap by providing a presentation of the most useful techniques in multivariate statistics. A wide-ranging annotated set...

  12. Multivariate Analysis and Prediction of Dioxin-Furan ...

    Science.gov (United States)

    Peer Review Draft of Regional Methods Initiative Final Report. Dioxins, which are bioaccumulative and environmentally persistent, pose an ongoing risk to human and ecosystem health. Fish constitute a significant source of dioxin exposure for humans and fish-eating wildlife. Current dioxin analytical methods are costly, time-consuming, and produce hazardous by-products. A Danish team developed a novel, multivariate statistical methodology based on the covariance of dioxin-furan congener Toxic Equivalences (TEQs) and fatty acid methyl esters (FAMEs) and applied it to North Atlantic Ocean fishmeal samples. The goal of the current study was to attempt to extend this Danish methodology to 77 whole and composite fish samples from three trophic groups: predator (whole largemouth bass), benthic (whole flathead and channel catfish) and forage fish (composite bluegill, pumpkinseed and green sunfish) from two dioxin-contaminated rivers (Pocatalico R. and Kanawha R.) in West Virginia, USA. Multivariate statistical analyses, including Principal Components Analysis (PCA), Hierarchical Clustering, and Partial Least Squares Regression (PLS), were used to assess the relationship between the FAMEs and TEQs in these dioxin-contaminated freshwater fish from the Kanawha and Pocatalico Rivers. These three multivariate statistical methods all confirm that the pattern of Fatty Acid Methyl Esters (FAMEs) in these freshwater fish covaries with and is predictive of the WHO TE

  13. Global Sensitivity Analysis for multivariate output using Polynomial Chaos Expansion

    International Nuclear Information System (INIS)

    Garcia-Cabrejo, Oscar; Valocchi, Albert

    2014-01-01

    Many mathematical and computational models used in engineering produce multivariate output that shows some degree of correlation. However, conventional approaches to Global Sensitivity Analysis (GSA) assume that the output variable is scalar. These approaches are applied to each output variable, leading to a large number of sensitivity indices that show a high degree of redundancy, making the interpretation of the results difficult. Two approaches have been proposed for GSA in the case of multivariate output: the output decomposition approach [9] and the covariance decomposition approach [14], but they are computationally intensive for most practical problems. In this paper, Polynomial Chaos Expansion (PCE) is used for an efficient GSA with multivariate output. The results indicate that PCE allows efficient estimation of the covariance matrix and GSA on the coefficients in the approach defined by Campbell et al. [9], and the development of analytical expressions for the multivariate sensitivity indices defined by Gamboa et al. [14]. - Highlights: • PCE increases computational efficiency in 2 approaches of GSA of multivariate output. • Efficient estimation of covariance matrix of output from coefficients of PCE. • Efficient GSA on coefficients of orthogonal decomposition of the output using PCE. • Analytical expressions of multivariate sensitivity indices from coefficients of PCE

  14. Search for the top quark using multivariate analysis techniques

    International Nuclear Information System (INIS)

    Bhat, P.C.

    1994-08-01

    The D0 collaboration is developing top search strategies using multivariate analysis techniques. We report here on applications of the H-matrix method to the eμ channel and neural networks to the e+jets channel

  15. metaCCA: summary statistics-based multivariate meta-analysis of genome-wide association studies using canonical correlation analysis.

    Science.gov (United States)

    Cichonska, Anna; Rousu, Juho; Marttinen, Pekka; Kangas, Antti J; Soininen, Pasi; Lehtimäki, Terho; Raitakari, Olli T; Järvelin, Marjo-Riitta; Salomaa, Veikko; Ala-Korpela, Mika; Ripatti, Samuli; Pirinen, Matti

    2016-07-01

    A dominant approach to genetic association studies is to perform univariate tests between genotype-phenotype pairs. However, analyzing related traits together increases statistical power, and certain complex associations become detectable only when several variants are tested jointly. Currently, modest sample sizes of individual cohorts, and restricted availability of individual-level genotype-phenotype data across the cohorts limit conducting multivariate tests. We introduce metaCCA, a computational framework for summary statistics-based analysis of a single or multiple studies that allows multivariate representation of both genotype and phenotype. It extends the statistical technique of canonical correlation analysis to the setting where original individual-level records are not available, and employs a covariance shrinkage algorithm to achieve robustness. Multivariate meta-analysis of two Finnish studies of nuclear magnetic resonance metabolomics by metaCCA, using standard univariate output from the program SNPTEST, shows an excellent agreement with the pooled individual-level analysis of original data. Motivated by strong multivariate signals in the lipid genes tested, we envision that multivariate association testing using metaCCA has a great potential to provide novel insights from already published summary statistics from high-throughput phenotyping technologies. Code is available at https://github.com/aalto-ics-kepaco. Contact: anna.cichonska@helsinki.fi or matti.pirinen@helsinki.fi. Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.

  16. Dittrichia graveolens (L.) Greuter Essential Oil: Chemical Composition, Multivariate Analysis, and Antimicrobial Activity.

    Science.gov (United States)

    Mitic, Violeta; Stankov Jovanovic, Vesna; Ilic, Marija; Jovanovic, Olga; Djordjevic, Aleksandra; Stojanovic, Gordana

    2016-01-01

    The chemical composition and in vitro antimicrobial activities of Dittrichia graveolens (L.) Greuter essential oil were studied. Moreover, using agglomerative hierarchical cluster (AHC) and principal component analyses (PCA), the interrelationships of the D. graveolens essential-oil profiles characterized so far (including the sample from this study) were investigated. To evaluate the chemical composition of the essential oil, GC-FID and GC/MS analyses were performed. Altogether, 54 compounds were identified, accounting for 92.9% of the total oil composition. The D. graveolens oil belongs to the monoterpenoid chemotype, with monoterpenoids comprising 87.4% of the total identified compounds. The major components were borneol (43.6%) and bornyl acetate (38.3%). Multivariate analysis showed that the compounds borneol and bornyl acetate exerted the greatest influence on the spatial differences in the composition of the reported oils. The antimicrobial activity against five bacterial strains and one fungal strain was determined using a disk-diffusion assay. The studied essential oil was active only against Gram-positive bacteria. Copyright © 2016 Verlag Helvetica Chimica Acta AG, Zürich.

  17. Multivariate mixed linear model analysis of longitudinal data: an information-rich statistical technique for analyzing disease resistance data

    Science.gov (United States)

    The mixed linear model (MLM) is currently among the most advanced and flexible statistical modeling techniques and its use in tackling problems in plant pathology has begun surfacing in the literature. The longitudinal MLM is a multivariate extension that handles repeatedly measured data, such as r...

  18. Multivariate techniques of analysis for ToF-E recoil spectrometry data

    Energy Technology Data Exchange (ETDEWEB)

    Whitlow, H.J.; Bouanani, M.E.; Persson, L.; Hult, M.; Jonsson, P.; Johnston, P.N. [Lund Institute of Technology, Solvegatan, (Sweden), Department of Nuclear Physics; Andersson, M. [Uppsala Univ. (Sweden). Dept. of Organic Chemistry; Ostling, M.; Zaring, C. [Royal institute of Technology, Electrum, Kista, (Sweden), Department of Electronics; Johnston, P.N.; Bubb, I.F.; Walker, B.R.; Stannard, W.B. [Royal Melbourne Inst. of Tech., VIC (Australia); Cohen, D.D.; Dytlewski, N. [Australian Nuclear Science and Technology Organisation, Lucas Heights, NSW (Australia)

    1996-12-31

    Multivariate statistical methods are being developed by the Australian-Swedish Recoil Spectrometry Collaboration for quantitative analysis of the wealth of information in Time of Flight (ToF) and energy dispersive Recoil Spectrometry. An overview is presented of progress made in the use of multivariate techniques for energy calibration, separation of mass-overlapped signals and simulation of ToF-E data. 6 refs., 5 figs.

  19. Multivariate techniques of analysis for ToF-E recoil spectrometry data

    Energy Technology Data Exchange (ETDEWEB)

    Whitlow, H J; Bouanani, M E; Persson, L; Hult, M; Jonsson, P; Johnston, P N [Lund Institute of Technology, Solvegatan, (Sweden), Department of Nuclear Physics; Andersson, M [Uppsala Univ. (Sweden). Dept. of Organic Chemistry; Ostling, M; Zaring, C [Royal institute of Technology, Electrum, Kista, (Sweden), Department of Electronics; Johnston, P N; Bubb, I F; Walker, B R; Stannard, W B [Royal Melbourne Inst. of Tech., VIC (Australia); Cohen, D D; Dytlewski, N [Australian Nuclear Science and Technology Organisation, Lucas Heights, NSW (Australia)

    1997-12-31

    Multivariate statistical methods are being developed by the Australian-Swedish Recoil Spectrometry Collaboration for quantitative analysis of the wealth of information in Time of Flight (ToF) and energy dispersive Recoil Spectrometry. An overview is presented of progress made in the use of multivariate techniques for energy calibration, separation of mass-overlapped signals and simulation of ToF-E data. 6 refs., 5 figs.

  20. Advanced event reweighting using multivariate analysis

    International Nuclear Information System (INIS)

    Martschei, D; Feindt, M; Honc, S; Wagner-Kuhr, J

    2012-01-01

    Multivariate analysis (MVA) methods, especially discrimination techniques such as neural networks, are key ingredients in modern data analysis and play an important role in high energy physics. They are usually trained on simulated Monte Carlo (MC) samples to discriminate so-called 'signal' from 'background' events and are then applied to data to select real events of signal type. We here address procedures that improve this workflow. The first is the enhancement of data/MC agreement by reweighting MC samples on a per-event basis. Then training MVAs on real data using the sPlot technique will be discussed. Finally, we will address the construction of MVAs whose discriminator is independent of a certain control variable, i.e. cuts on this variable will not change the discriminator shape.
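
    As an illustration of what reweighting MC samples on a per-event basis means, the sketch below uses a plain one-dimensional histogram ratio. This is a deliberately simpler stand-in, not the MVA-based procedure of the record above; the function names, the binning, and the toy values are all hypothetical, and events outside the binning are assumed to keep weight 1.

```python
from bisect import bisect_right

def histogram(values, edges, weights=None):
    """Return weighted counts per bin; values outside the edges are ignored."""
    counts = [0.0] * (len(edges) - 1)
    if weights is None:
        weights = [1.0] * len(values)
    for v, w in zip(values, weights):
        i = bisect_right(edges, v) - 1
        if 0 <= i < len(counts):
            counts[i] += w
    return counts

def per_event_weights(mc, data, edges):
    """Weight each MC event by the data/MC ratio of the normalised bin contents."""
    h_mc = histogram(mc, edges)
    h_data = histogram(data, edges)
    n_mc, n_data = sum(h_mc), sum(h_data)
    weights = []
    for v in mc:
        i = bisect_right(edges, v) - 1
        if 0 <= i < len(h_mc) and h_mc[i] > 0:
            weights.append((h_data[i] / n_data) / (h_mc[i] / n_mc))
        else:
            # Assumption: events outside the binning are left unweighted.
            weights.append(1.0)
    return weights

# Toy values (hypothetical): one observable for simulated and real events.
edges = [0.0, 1.0, 2.0, 3.0]
mc = [0.5, 0.6, 1.5, 2.5]
data = [0.5, 1.2, 1.7, 1.9, 2.1, 2.2]

w = per_event_weights(mc, data, edges)
reweighted = histogram(mc, edges, w)  # MC shape after reweighting
```

    After reweighting, the normalised MC histogram has the same shape as the data histogram in the chosen observable, while the total MC yield is unchanged. A histogram ratio only corrects the variables it is binned in, which is one reason multivariate reweighting is of interest.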

  1. Estimating an Effect Size in One-Way Multivariate Analysis of Variance (MANOVA)

    Science.gov (United States)

    Steyn, H. S., Jr.; Ellis, S. M.

    2009-01-01

    When two or more univariate population means are compared, the proportion of variation in the dependent variable accounted for by population group membership is eta-squared. This effect size can be generalized by using multivariate measures of association, based on the multivariate analysis of variance (MANOVA) statistics, to establish whether…
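
    For the univariate case described above, the sample version of eta-squared is the between-group sum of squares divided by the total sum of squares. The following minimal sketch computes it for made-up data; the function name and the numbers are hypothetical, and the multivariate generalization discussed in the record is not shown.

```python
def eta_squared(groups):
    """Proportion of total variation explained by group membership (SS_between / SS_total)."""
    all_values = [x for g in groups for x in g]
    grand_mean = sum(all_values) / len(all_values)
    ss_total = sum((x - grand_mean) ** 2 for x in all_values)
    ss_between = sum(len(g) * (sum(g) / len(g) - grand_mean) ** 2 for g in groups)
    return ss_between / ss_total

# Three hypothetical groups of three observations each.
groups = [[1.0, 2.0, 3.0], [2.0, 3.0, 4.0], [6.0, 7.0, 8.0]]
eta2 = eta_squared(groups)  # SS_between = 42, SS_total = 48, so eta2 = 0.875
```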

  2. Point defect characterization in HAADF-STEM images using multivariate statistical analysis

    International Nuclear Information System (INIS)

    Sarahan, Michael C.; Chi, Miaofang; Masiel, Daniel J.; Browning, Nigel D.

    2011-01-01

    Quantitative analysis of point defects is demonstrated through the use of multivariate statistical analysis. This analysis consists of principal component analysis for dimensional estimation and reduction, followed by independent component analysis to obtain physically meaningful, statistically independent factor images. Results from these analyses are presented in the form of factor images and scores. Factor images show characteristic intensity variations corresponding to physical structure changes, while scores relate how much those variations are present in the original data. The application of this technique is demonstrated on a set of experimental images of dislocation cores along a low-angle tilt grain boundary in strontium titanate. A relationship between chemical composition and lattice strain is highlighted in the analysis results, with picometer-scale shifts in several columns measurable from compositional changes in a separate column. Research highlights: multivariate analysis of HAADF-STEM images; distinct structural variations among SrTiO3 dislocation cores; picometer atomic column shifts correlated with atomic column population changes.

  3. Species richness and soil properties in Pinus ponderosa forests: A structural equation modeling analysis

    Science.gov (United States)

    Laughlin, D.C.; Abella, S.R.; Covington, W.W.; Grace, J.B.

    2007-01-01

    Question: How are the effects of mineral soil properties on understory plant species richness propagated through a network of processes involving the forest overstory, soil organic matter, soil nitrogen, and understory plant abundance? Location: North-central Arizona, USA. Methods: We sampled 75 0.05-ha plots across a broad soil gradient in a Pinus ponderosa (ponderosa pine) forest ecosystem. We evaluated multivariate models of plant species richness using structural equation modeling. Results: Richness was highest at intermediate levels of understory plant cover, suggesting that both colonization success and competitive exclusion can limit richness in this system. We did not detect a reciprocal positive effect of richness on plant cover. Richness was strongly related to soil nitrogen in the model, with evidence for both a direct negative effect and an indirect non-linear relationship mediated through understory plant cover. Soil organic matter appeared to have a positive influence on understory richness that was independent of soil nitrogen. Richness was lowest where the forest overstory was densest, which can be explained through indirect effects on soil organic matter, soil nitrogen and understory cover. Finally, model results suggest a variety of direct and indirect processes whereby mineral soil properties can influence richness. Conclusions: Understory plant species richness and plant cover in P. ponderosa forests appear to be significantly influenced by soil organic matter and nitrogen, which are, in turn, related to overstory density and composition and mineral soil properties. Thus, soil properties can impose direct and indirect constraints on local species diversity in ponderosa pine forests. © IAVS; Opulus Press.

  4. Multivariate analysis of risk factors for long-term urethroplasty outcome.

    Science.gov (United States)

    Breyer, Benjamin N; McAninch, Jack W; Whitson, Jared M; Eisenberg, Michael L; Mehdizadeh, Jennifer F; Myers, Jeremy B; Voelzke, Bryan B

    2010-02-01

    We studied the patient risk factors that promote urethroplasty failure. Records of patients who underwent urethroplasty at the University of California, San Francisco Medical Center between 1995 and 2004 were reviewed. Cox proportional hazards regression analysis was used to identify multivariate predictors of urethroplasty outcome. Between 1995 and 2004, 443 patients of 495 who underwent urethroplasty had complete comorbidity data and were included in analysis. Median patient age was 41 years (range 18 to 90). Median followup was 5.8 years (range 1 month to 10 years). Stricture recurred in 93 patients (21%). Primary estimated stricture-free survival at 1, 3 and 5 years was 88%, 82% and 79%. After multivariate analysis smoking (HR 1.8, 95% CI 1.0-3.1, p = 0.05), prior direct vision internal urethrotomy (HR 1.7, 95% CI 1.0-3.0, p = 0.04) and prior urethroplasty (HR 1.8, 95% CI 1.1-3.1, p = 0.03) were predictive of treatment failure. On multivariate analysis diabetes mellitus showed a trend toward prediction of urethroplasty failure (HR 2.0, 95% CI 0.8-4.9, p = 0.14). Length of urethral stricture (greater than 4 cm), prior urethroplasty and failed endoscopic therapy are predictive of failure after urethroplasty. Smoking and diabetes mellitus also may predict failure potentially secondary to microvascular damage. Copyright 2010 American Urological Association. Published by Elsevier Inc. All rights reserved.
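
    The hazard ratios, confidence intervals and p-values quoted above are linked through the usual Wald approximation on the log scale: the standard error of log(HR) is the width of the log-scale 95% interval divided by 2 × 1.96, and the two-sided p-value follows from z = log(HR) / SE. The sketch below applies this as a consistency check; it assumes the published intervals are Wald intervals, and the function name is hypothetical.

```python
import math

def p_from_hr_ci(hr, lower, upper, z_crit=1.959964):
    """Approximate two-sided Wald p-value from a hazard ratio and its 95% CI."""
    # Standard error of log(HR), recovered from the width of the log-scale interval.
    se = (math.log(upper) - math.log(lower)) / (2 * z_crit)
    z = math.log(hr) / se
    # Two-sided normal tail probability: 2 * (1 - Phi(|z|)) = erfc(|z| / sqrt(2)).
    return math.erfc(abs(z) / math.sqrt(2))

# Values quoted in the abstract above.
p_prior_urethroplasty = p_from_hr_ci(1.8, 1.1, 3.1)  # reported p = 0.03
p_diabetes = p_from_hr_ci(2.0, 0.8, 4.9)             # reported p = 0.14
```

    Because the published hazard ratios and interval bounds are rounded to one decimal place, the recovered p-values can only be expected to land near the reported ones, not to match them exactly.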

  5. Using Interactive Graphics to Teach Multivariate Data Analysis to Psychology Students

    Science.gov (United States)

    Valero-Mora, Pedro M.; Ledesma, Ruben D.

    2011-01-01

    This paper discusses the use of interactive graphics to teach multivariate data analysis to Psychology students. Three techniques are explored through separate activities: parallel coordinates/boxplots; principal components/exploratory factor analysis; and cluster analysis. With interactive graphics, students may perform important parts of the…

  6. TMVA - Toolkit for Multivariate Data Analysis with ROOT Users guide

    CERN Document Server

    Höcker, A; Tegenfeldt, F; Voss, H; Voss, K; Christov, A; Henrot-Versillé, S; Jachowski, M; Krasznahorkay, A; Mahalalel, Y; Prudent, X; Speckmayer, P

    2007-01-01

    Multivariate machine learning techniques for the classification of data from high-energy physics (HEP) experiments have become standard tools in most HEP analyses. The multivariate classifiers themselves have significantly evolved in recent years, also driven by developments in other areas inside and outside science. TMVA is a toolkit integrated in ROOT which hosts a large variety of multivariate classification algorithms. They range from rectangular cut optimisation (using a genetic algorithm) and likelihood estimators, over linear and non-linear discriminants (neural networks), to sophisticated recent developments like boosted decision trees and rule ensemble fitting. TMVA organises the simultaneous training, testing, and performance evaluation of all these classifiers with a user-friendly interface, and expedites the application of the trained classifiers to the analysis of data sets with unknown sample composition.

  7. Application of Multivariable Statistical Techniques in Plant-wide WWTP Control Strategies Analysis

    DEFF Research Database (Denmark)

    Flores Alsina, Xavier; Comas, J.; Rodríguez-Roda, I.

    2007-01-01

    The main objective of this paper is to present the application of selected multivariable statistical techniques in plant-wide wastewater treatment plant (WWTP) control strategies analysis. In this study, cluster analysis (CA), principal component analysis/factor analysis (PCA/FA) and discriminant analysis (DA) are applied to the evaluation matrix data set obtained by simulation of several control strategies applied to the plant-wide IWA Benchmark Simulation Model No 2 (BSM2). These techniques allow i) to determine natural groups or clusters of control strategies with a similar behaviour, ii) to find and interpret hidden, complex and causal relation features in the data set and iii) to identify important discriminant variables within the groups found by the cluster analysis. This study illustrates the usefulness of multivariable statistical techniques for both analysis and interpretation...

  8. Metal concentration at surface water using multivariate analysis and ...

    African Journals Online (AJOL)

    Metal concentration at surface water using multivariate analysis and human health risk assessment. F Azaman, H Juahir, K Yunus, A Azid, S.I. Khalit, A.D. Mustafa, M.A. Amran, C.N.C. Hasnam, M.Z.A.Z. Abidin, M.A.M. Yusri ...

  9. Multivariate statistical analysis of atom probe tomography data

    International Nuclear Information System (INIS)

    Parish, Chad M.; Miller, Michael K.

    2010-01-01

    The application of spectrum imaging multivariate statistical analysis methods, specifically principal component analysis (PCA), to atom probe tomography (APT) data has been investigated. The mathematical method of analysis is described and the results for two example datasets are analyzed and presented. The first dataset is from the analysis of a PM 2000 Fe-Cr-Al-Ti steel containing two different ultrafine precipitate populations. PCA properly describes the matrix and precipitate phases in a simple and intuitive manner. A second APT example is from the analysis of an irradiated reactor pressure vessel steel. Fine, nm-scale Cu-enriched precipitates having a core-shell structure were identified and qualitatively described by PCA. Advantages, disadvantages, and future prospects for implementing these data analysis methodologies for APT datasets, particularly with regard to quantitative analysis, are also discussed.

  10. Micro-Raman Imaging for Biology with Multivariate Spectral Analysis

    KAUST Repository

    Malvaso, Federica

    2015-05-05

    Raman spectroscopy is a noninvasive technique that can provide complex information on the vibrational state of molecules. It defines the unique fingerprint that allows the identification of the various chemical components within a given sample. The aim of this thesis is to analyze Raman maps of three pairs of different cells, highlighting differences and similarities through multivariate algorithms. The first pair of analyzed cells are human embryonic stem cells (hESCs), while the other two pairs are induced pluripotent stem cells (iPSCs) derived from T lymphocytes and keratinocytes, respectively. Although two different multivariate techniques were employed, i.e. Principal Component Analysis and Cluster Analysis, the same results were achieved: the iPSCs derived from T lymphocytes show a higher content of genetic material compared with both the iPSCs derived from keratinocytes and the hESCs. On the other hand, it was equally evident that iPSCs derived from keratinocytes assume a molecular distribution very similar to that of hESCs.

  11. Analysis of preservative-treated wood by multivariate analysis of laser-induced breakdown spectroscopy spectra

    International Nuclear Information System (INIS)

    Martin, Madhavi Z.; Labbe, Nicole; Rials, Timothy G.; Wullschleger, Stan D.

    2005-01-01

    In this work, multivariate statistical analysis (MVA) techniques are coupled with laser-induced breakdown spectroscopy (LIBS) to identify preservative types (chromated copper arsenate, ammoniacal copper zinc or alkaline copper quat), and to predict elemental content in preservative-treated wood. The elemental composition of the samples was measured with a standard laboratory method of digestion followed by atomic absorption spectroscopy analysis. The elemental composition was then correlated with the LIBS spectra using projection to latent structures (PLS) models. The correlations for the different elements introduced by different treatments were very strong, with the correlation coefficients generally above 0.9. Additionally, principal component analysis (PCA) was used to differentiate the samples treated with different preservative formulations. The research has focused not only on demonstrating the application of LIBS as a tool for use in the forest products industry, but also considered sampling errors, limits of detection, reproducibility, and accuracy of measurements as they relate to multivariate analysis of this complex wood substrate.

  12. Analysis of preservative-treated wood by multivariate analysis of laser-induced breakdown spectroscopy spectra

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Madhavi Z. [Environmental Sciences Division Oak Ridge National Laboratory, P.O. Box 2008 MS 6422, Oak Ridge TN 37831-6422 (United States); Labbe, Nicole [Forest Products Center, University of Tennessee, 2506 Jacob Drive, Knoxville, TN 37996-4570 (United States)]. E-mail: nlabbe@utk.edu; Rials, Timothy G. [Forest Products Center, University of Tennessee, 2506 Jacob Drive, Knoxville, TN 37996-4570 (United States); Wullschleger, Stan D. [Environmental Sciences Division Oak Ridge National Laboratory, P.O. Box 2008 MS 6422, Oak Ridge TN 37831-6422 (United States)

    2005-08-31

    In this work, multivariate statistical analysis (MVA) techniques are coupled with laser-induced breakdown spectroscopy (LIBS) to identify preservative types (chromated copper arsenate, ammoniacal copper zinc or alkaline copper quat), and to predict elemental content in preservative-treated wood. The elemental composition of the samples was measured with a standard laboratory method of digestion followed by atomic absorption spectroscopy analysis. The elemental composition was then correlated with the LIBS spectra using projection to latent structures (PLS) models. The correlations for the different elements introduced by different treatments were very strong, with the correlation coefficients generally above 0.9. Additionally, principal component analysis (PCA) was used to differentiate the samples treated with different preservative formulations. The research has focused not only on demonstrating the application of LIBS as a tool for use in the forest products industry, but also considered sampling errors, limits of detection, reproducibility, and accuracy of measurements as they relate to multivariate analysis of this complex wood substrate.

  13. Multivariate Statistical Methods as a Tool of Financial Analysis of Farm Business

    Czech Academy of Sciences Publication Activity Database

    Novák, J.; Sůvová, H.; Vondráček, Jiří

    2002-01-01

    Vol. 48, No. 1 (2002), pp. 9-12. ISSN 0139-570X. Institutional research plan: AV0Z1030915. Keywords: financial analysis * financial ratios * multivariate statistical methods * correlation analysis * discriminant analysis * cluster analysis. Subject RIV: BB - Applied Statistics, Operational Research

  14. Multivariate meta-analysis: a robust approach based on the theory of U-statistic.

    Science.gov (United States)

    Ma, Yan; Mazumdar, Madhu

    2011-10-30

    Meta-analysis is the methodology for combining findings from similar research studies asking the same question. When the question of interest involves multiple outcomes, multivariate meta-analysis is used to synthesize the outcomes simultaneously, taking into account the correlation between the outcomes. Likelihood-based approaches, in particular the restricted maximum likelihood (REML) method, are commonly utilized in this context. REML assumes a multivariate normal distribution for the random-effects model. This assumption is difficult to verify, especially for meta-analyses with a small number of component studies. The use of REML also requires iterative estimation between parameters, needing moderately high computation time, especially when the dimension of outcomes is large. A multivariate method of moments (MMM) is available and is shown to perform as well as REML. However, there is a lack of information on the performance of these two methods when the true data distribution is far from normality. In this paper, we propose a new nonparametric and non-iterative method for multivariate meta-analysis on the basis of the theory of U-statistics and compare the properties of these three procedures under both normal and skewed data through simulation studies. It is shown that the effect on estimates from REML because of non-normal data distribution is marginal and that the estimates from MMM and U-statistic-based approaches are very similar. Therefore, we conclude that for performing multivariate meta-analysis, the U-statistic estimation procedure is a viable alternative to REML and MMM. Easy implementation of all three methods is illustrated by their application to data from two published meta-analyses from the fields of hip fracture and periodontal disease. We discuss ideas for future research based on U-statistics for testing significance of between-study heterogeneity and for extending the work to the meta-regression setting. Copyright © 2011 John Wiley & Sons, Ltd.

  15. Instrumental Neutron Activation Analysis and Multivariate Statistics for Pottery Provenance

    Science.gov (United States)

    Glascock, M. D.; Neff, H.; Vaughn, K. J.

    2004-06-01

    The application of instrumental neutron activation analysis and multivariate statistics to archaeological studies of ceramics and clays is described. A small pottery data set from the Nasca culture in southern Peru is presented for illustration.

  16. Instrumental Neutron Activation Analysis and Multivariate Statistics for Pottery Provenance

    International Nuclear Information System (INIS)

    Glascock, M. D.; Neff, H.; Vaughn, K. J.

    2004-01-01

    The application of instrumental neutron activation analysis and multivariate statistics to archaeological studies of ceramics and clays is described. A small pottery data set from the Nasca culture in southern Peru is presented for illustration.

  17. Instrumental Neutron Activation Analysis and Multivariate Statistics for Pottery Provenance

    Energy Technology Data Exchange (ETDEWEB)

    Glascock, M. D.; Neff, H. [University of Missouri, Research Reactor Center (United States); Vaughn, K. J. [Pacific Lutheran University, Department of Anthropology (United States)

    2004-06-15

    The application of instrumental neutron activation analysis and multivariate statistics to archaeological studies of ceramics and clays is described. A small pottery data set from the Nasca culture in southern Peru is presented for illustration.

  18. Multivariate statistics exercises and solutions

    CERN Document Server

    Härdle, Wolfgang Karl

    2015-01-01

    The authors present tools and concepts of multivariate data analysis by means of exercises and their solutions. The first part is devoted to graphical techniques. The second part deals with multivariate random variables and presents the derivation of estimators and tests for various practical situations. The last part introduces a wide variety of exercises in applied multivariate data analysis. The book demonstrates the application of simple calculus and basic multivariate methods in real life situations. It contains altogether more than 250 solved exercises which can assist a university teacher in setting up a modern multivariate analysis course. All computer-based exercises are available in the R language. All R codes and data sets may be downloaded via the quantlet download center  www.quantlet.org or via the Springer webpage. For interactive display of low-dimensional projections of a multivariate data set, we recommend GGobi.

  19. Multivariate Meta-Analysis of Brain-Mass Correlations in Eutherian Mammals

    Directory of Open Access Journals (Sweden)

    Charlene Steinhausen

    2016-09-01

    The general assumption that brain size differences are an adequate proxy for subtler differences in brain organization turned neurobiologists towards the question of why some groups of mammals such as primates, elephants, and whales have such remarkably large brains. In this meta-analysis, an extensive sample of eutherian mammals (115 species distributed in 14 orders) provided data about several different biological traits and measures of brain size such as absolute brain mass (AB), relative brain mass (RB; quotient from AB and body mass), and encephalization quotient (EQ). These data were analyzed by established multivariate statistics without taking specific phylogenetic information into account. Species with high AB tend to (1) feed on protein-rich nutrition, (2) have a long lifespan, (3) delay sexual maturity, and (4) have long and rare pregnancies with small litter sizes. Animals with high RB usually (1) have a short lifespan, (2) reach sexual maturity early, and (3) have short and frequent gestations. Moreover, males of species with high RB also have few potential sexual partners. In contrast, animals with high EQs have (1) a high number of potential sexual partners, (2) delayed sexual maturity, and (3) rare gestations with small litter sizes. Based on these correlations, we conclude that Eutheria with either high AB or high EQ occupy high positions in the network of food chains (high trophic levels). Eutheria of low trophic levels can develop a high RB only if they have small body masses.

  20. Study of ionically modified water performance in carbonate reservoir system by multivariate data analysis

    DEFF Research Database (Denmark)

    Sohal, Muhammad Adeel Nassar; Kucheryavskiy, Sergey V.; Thyne, Geoffrey

    2017-01-01

    the critical mechanisms at the pore scale. Better pore scale physico-chemical understanding will guide to formulate accurate reservoir-scale models. This paper presents a comprehensive meta-analysis of the proposed mechanisms using multivariate data analysis. Detailed review of the subject, including mechanisms with supporting and contradictory evidence has been presented by Sohal et al. (2016). In this study, the significance of each contributing factor to EOR was quantified and subjected to rigorous multivariate statistical analysis. The analysis was limited because there is no uniform methodology...

  1. The studies of post-medieval glass by multivariate and X-ray fluorescence analysis

    International Nuclear Information System (INIS)

    Kierzek, J.; Kunicki-Goldfinger, J.

    2002-01-01

    Multivariate statistical analysis of the results obtained by energy dispersive X-ray fluorescence analysis has been used in the study of baroque vessel glasses originating from central Europe. X-ray spectrometry can be applied as a completely non-destructive, non-sampling and multi-element method. It is very useful in the study of valuable historical artefacts. In recent years, multivariate statistical analysis has been developed as an important tool for archaeometric purposes. Cluster, principal component and discriminant analysis were applied for the classification of the examined objects. The obtained results show that these statistical tools are very useful and complementary in the study of historical objects. (author)

  2. Multivariable analysis: a practical guide for clinicians and public health researchers

    National Research Council Canada - National Science Library

    Katz, Mitchell H

    2011-01-01

    .... It is the perfect introduction for all clinical researchers. It describes how to perform and interpret multivariable analysis, using plain language rather than complex derivations and mathematical formulae...

  3. Comprehensive drought characteristics analysis based on a nonlinear multivariate drought index

    Science.gov (United States)

    Yang, Jie; Chang, Jianxia; Wang, Yimin; Li, Yunyun; Hu, Hui; Chen, Yutong; Huang, Qiang; Yao, Jun

    2018-02-01

    It is vital to identify drought events and to evaluate multivariate drought characteristics based on a composite drought index for better drought risk assessment and sustainable development of water resources. However, most composite drought indices are constructed by linear combination, principal component analysis or the entropy weight method, assuming a linear relationship among different drought indices. In this study, a multidimensional copula function was applied to construct a nonlinear multivariate drought index (NMDI) to handle the complicated and nonlinear relationship, owing to its dependence structure and flexibility. The NMDI was constructed by combining meteorological, hydrological, and agricultural variables (precipitation, runoff, and soil moisture) to better reflect the multiple variables simultaneously. Based on the constructed NMDI and run theory, drought events for a particular area were identified in terms of three drought characteristics: duration, peak, and severity. Finally, multivariate drought risk was analyzed as a tool for providing reliable support in drought decision-making. The results indicate that: (1) multidimensional copulas can effectively handle the complicated and nonlinear relationship among multiple variables; (2) compared with single and other composite drought indices, the NMDI is slightly more sensitive in capturing recorded drought events; and (3) drought risk shows a spatial variation; out of the five partitions studied, the Jing River Basin as well as the upstream and midstream of the Wei River Basin are characterized by a higher multivariate drought risk. In general, multidimensional copulas provide a reliable way to handle the nonlinear relationship when constructing a comprehensive drought index and evaluating multivariate drought characteristics.
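
    The event-identification step mentioned above can be sketched as follows: a drought event is a consecutive run of time steps in which the index stays below a threshold, and duration, severity and peak are read off each run. This is a minimal sketch under stated assumptions, not the authors' implementation: the threshold of -0.5, the index values, and the function name are hypothetical, and severity and peak are taken here as the cumulative and the largest deficit below the threshold (other conventions exist).

```python
def drought_events(index, threshold):
    """Split a drought-index series into runs below `threshold`.

    Returns one dict per event with its start position, duration (time steps),
    severity (cumulative deficit below the threshold) and peak (largest deficit).
    """
    events, current = [], None
    for t, value in enumerate(index):
        if value < threshold:
            deficit = threshold - value
            if current is None:
                # A new run starts at the first time step below the threshold.
                current = {"start": t, "duration": 0, "severity": 0.0, "peak": 0.0}
            current["duration"] += 1
            current["severity"] += deficit
            current["peak"] = max(current["peak"], deficit)
        elif current is not None:
            # The run ends as soon as the index recovers to the threshold or above.
            events.append(current)
            current = None
    if current is not None:
        events.append(current)  # series ended while still in drought
    return events

# Hypothetical monthly index values and threshold.
series = [0.2, -0.6, -1.4, -0.9, 0.1, 0.4, -0.7, -0.5, 0.3]
events = drought_events(series, threshold=-0.5)
```

    With these toy values the function finds two events: one lasting three time steps starting at position 1, and one lasting a single time step at position 6. The resulting duration, severity and peak values are the kind of per-event characteristics that a joint (for example copula-based) risk analysis would take as input.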

  4. Use of multivariate extensions of generalized linear models in the analysis of data from clinical trials

    OpenAIRE

    ALONSO ABAD, Ariel; Rodriguez, O.; TIBALDI, Fabian; CORTINAS ABRAHANTES, Jose

    2002-01-01

    In medical studies, categorical endpoints are quite common. Even though some models for handling these multicategorical variables have now been developed, their use is not common. This work shows an application of Multivariate Generalized Linear Models to the analysis of clinical trial data. After a theoretical introduction, models for ordinal and nominal responses are applied and the main results are discussed. multivariate analysis; multivariate logistic regression; multicategor...

  5. Advanced multivariate analysis to assess remediation of hydrocarbons in soils.

    Science.gov (United States)

    Lin, Deborah S; Taylor, Peter; Tibbett, Mark

    2014-10-01

    Accurate monitoring of degradation levels in soils is essential in order to understand and achieve complete degradation of petroleum hydrocarbons in contaminated soils. We aimed to develop the use of multivariate methods for the monitoring of biodegradation of diesel in soils and to determine if diesel contaminated soils could be remediated to a chemical composition similar to that of an uncontaminated soil. An incubation experiment was set up with three contrasting soil types. Each soil was exposed to diesel at varying stages of degradation and then analysed for key hydrocarbons throughout 161 days of incubation. Hydrocarbon distributions were analysed by Principal Coordinate Analysis and similar samples grouped by cluster analysis. Variation and differences between samples were determined using permutational multivariate analysis of variance. It was found that all soils followed trajectories approaching the chemical composition of the unpolluted soil. Some contaminated soils were no longer significantly different to that of uncontaminated soil after 161 days of incubation. The use of cluster analysis allows the assignment of a percentage chemical similarity of a diesel contaminated soil to an uncontaminated soil sample. This will aid in the monitoring of hydrocarbon contaminated sites and the establishment of potential endpoints for successful remediation.

  6. Multivariate statistical analysis of precipitation chemistry in Northwestern Spain

    International Nuclear Information System (INIS)

    Prada-Sanchez, J.M.; Garcia-Jurado, I.; Gonzalez-Manteiga, W.; Fiestras-Janeiro, M.G.; Espada-Rios, M.I.; Lucas-Dominguez, T.

    1993-01-01

    149 samples of rainwater were collected in the proximity of a power station in northwestern Spain at three rainwater monitoring stations. The resulting data are analyzed using multivariate statistical techniques. Firstly, the Principal Component Analysis shows that there are three main sources of pollution in the area (a marine source, a rural source and an acid source). The impact from pollution from these sources on the immediate environment of the stations is studied using Factorial Discriminant Analysis. 8 refs., 7 figs., 11 tabs

  7. Multivariate statistical analysis of precipitation chemistry in Northwestern Spain

    Energy Technology Data Exchange (ETDEWEB)

    Prada-Sanchez, J.M.; Garcia-Jurado, I.; Gonzalez-Manteiga, W.; Fiestras-Janeiro, M.G.; Espada-Rios, M.I.; Lucas-Dominguez, T. (University of Santiago, Santiago (Spain). Faculty of Mathematics, Dept. of Statistics and Operations Research)

    1993-07-01

    149 samples of rainwater were collected in the proximity of a power station in northwestern Spain at three rainwater monitoring stations. The resulting data are analyzed using multivariate statistical techniques. Firstly, the Principal Component Analysis shows that there are three main sources of pollution in the area (a marine source, a rural source and an acid source). The impact from pollution from these sources on the immediate environment of the stations is studied using Factorial Discriminant Analysis. 8 refs., 7 figs., 11 tabs.

  8. Classification of adulterated honeys by multivariate analysis.

    Science.gov (United States)

    Amiry, Saber; Esmaiili, Mohsen; Alizadeh, Mohammad

    2017-06-01

    In this research, honey samples were adulterated with date syrup (DS) and invert sugar syrup (IS) at three concentrations (7%, 15% and 30%). 102 adulterated samples were prepared in six batches with 17 replications for each batch. For each sample, 32 parameters including color indices, rheological, physical, and chemical parameters were determined. To classify the samples, based on type and concentrations of adulterant, a multivariate analysis was applied using principal component analysis (PCA) followed by a linear discriminant analysis (LDA). Then, 21 principal components (PCs) were selected in five sets. Approximately two-thirds were identified correctly using color indices (62.75%) or rheological properties (67.65%). Strong discrimination was obtained using physical properties (97.06%), and the best separations were achieved using two sets of chemical properties (set 1: lactone, diastase activity, sucrose - 100%) (set 2: free acidity, HMF, ash - 95%). Copyright © 2016 Elsevier Ltd. All rights reserved.

  9. Multivariate reference technique for quantitative analysis of fiber-optic tissue Raman spectroscopy.

    Science.gov (United States)

    Bergholt, Mads Sylvest; Duraipandian, Shiyamala; Zheng, Wei; Huang, Zhiwei

    2013-12-03

    We report a novel method making use of multivariate reference signals of fused silica and sapphire Raman signals generated from a ball-lens fiber-optic Raman probe for quantitative analysis of in vivo tissue Raman measurements in real time. Partial least-squares (PLS) regression modeling is applied to extract the characteristic internal reference Raman signals (e.g., shoulder of the prominent fused silica boson peak (~130 cm⁻¹); distinct sapphire ball-lens peaks (380, 417, 646, and 751 cm⁻¹)) from the ball-lens fiber-optic Raman probe for quantitative analysis of fiber-optic Raman spectroscopy. To evaluate the analytical value of this novel multivariate reference technique, a rapid Raman spectroscopy system coupled with a ball-lens fiber-optic Raman probe is used for in vivo oral tissue Raman measurements (n = 25 subjects) under 785 nm laser excitation powers ranging from 5 to 65 mW. An accurate linear relationship (R² = 0.981) with a root-mean-square error of cross validation (RMSECV) of 2.5 mW can be obtained for predicting the laser excitation power changes based on a leave-one-subject-out cross-validation, which is superior to the normal univariate reference method (RMSE = 6.2 mW). A root-mean-square error of prediction (RMSEP) of 2.4 mW (R² = 0.985) can also be achieved for laser power prediction in real time when we applied the multivariate method independently on the five new subjects (n = 166 spectra). We further apply the multivariate reference technique for quantitative analysis of gelatin tissue phantoms that gives rise to an RMSEP of ~2.0% (R² = 0.998) independent of laser excitation power variations. This work demonstrates that multivariate reference technique can be advantageously used to monitor and correct the variations of laser excitation power and fiber coupling efficiency in situ for standardizing the tissue Raman intensity to realize quantitative analysis of tissue Raman measurements in vivo, which is particularly appealing in

  10. Denial-of-service attack detection based on multivariate correlation analysis

    NARCIS (Netherlands)

    Tan, Zhiyuan; Jamdagni, Aruna; He, Xiangjian; Nanda, Priyadarsi; Liu, Ren Ping; Lu, Bao-Liang; Zhang, Liqing; Kwok, James

    2011-01-01

    The reliability and availability of network services are being threatened by the growing number of Denial-of-Service (DoS) attacks. Effective mechanisms for DoS attack detection are demanded. Therefore, we propose a multivariate correlation analysis approach to investigate and extract second-order

  11. Hierarchical multivariate covariance analysis of metabolic connectivity.

    Science.gov (United States)

    Carbonell, Felix; Charil, Arnaud; Zijdenbos, Alex P; Evans, Alan C; Bedell, Barry J

    2014-12-01

    Conventional brain connectivity analysis is typically based on the assessment of interregional correlations. Given that correlation coefficients are derived from both covariance and variance, group differences in covariance may be obscured by differences in the variance terms. To facilitate a comprehensive assessment of connectivity, we propose a unified statistical framework that interrogates the individual terms of the correlation coefficient. We have evaluated the utility of this method for metabolic connectivity analysis using [18F]2-fluoro-2-deoxyglucose (FDG) positron emission tomography (PET) data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) study. As an illustrative example of the utility of this approach, we examined metabolic connectivity in angular gyrus and precuneus seed regions of mild cognitive impairment (MCI) subjects with low and high β-amyloid burdens. This new multivariate method allowed us to identify alterations in the metabolic connectome, which would not have been detected using classic seed-based correlation analysis. Ultimately, this novel approach should be extensible to brain network analysis and broadly applicable to other imaging modalities, such as functional magnetic resonance imaging (MRI).
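
    The point that a correlation coefficient mixes covariance and variance terms can be made concrete with a small numerical sketch. It is not the authors' framework, only the textbook identity r = cov(x, y) / sqrt(var(x) var(y)); the function names and the data values are hypothetical.

```python
def covariance(x, y):
    """Unbiased sample covariance of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    return sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)


def correlation_terms(x, y):
    """Return (covariance, variance of x, variance of y, correlation)."""
    cov_xy = covariance(x, y)
    var_x = covariance(x, x)
    var_y = covariance(y, y)
    r = cov_xy / (var_x * var_y) ** 0.5
    return cov_xy, var_x, var_y, r


# Hypothetical regional values for two groups; group B is group A with
# every value doubled.
group_a = ([1.0, 2.0, 3.0, 4.0], [2.0, 1.0, 4.0, 3.0])
group_b = ([2.0, 4.0, 6.0, 8.0], [4.0, 2.0, 8.0, 6.0])

for name, (x, y) in (("A", group_a), ("B", group_b)):
    cov_xy, var_x, var_y, r = correlation_terms(x, y)
    print(name, round(cov_xy, 3), round(var_x, 3), round(var_y, 3), round(r, 3))
```

    In this toy case both groups have a correlation of 0.6, yet the covariance of group B is four times that of group A, because the variances grew by the same factor. A comparison based on correlation alone would report no group difference.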

  12. Changes in cod muscle proteins during frozen storage revealed by proteome analysis and multivariate data analysis

    DEFF Research Database (Denmark)

    Kjærsgård, Inger Vibeke Holst; Nørrelykke, M.R.; Jessen, Flemming

    2006-01-01

    Multivariate data analysis has been combined with proteomics to enhance the recovery of information from 2-DE of cod muscle proteins during different storage conditions. Proteins were extracted according to 11 different storage conditions and samples were resolved by 2-DE. Data generated by 2-DE was subjected to principal component analysis (PCA) and discriminant partial least squares regression (DPLSR). Applying PCA to 2-DE data revealed the samples to form groups according to frozen storage time, whereas differences due to different storage temperatures or chilled storage in modified atmosphere ... light chain 1, 2 and 3, triose-phosphate isomerase, glyceraldehyde-3-phosphate dehydrogenase, aldolase A and two α-actin fragments, and a nuclease diphosphate kinase B fragment to change in concentration, during frozen storage. Application of proteomics, multivariate data analysis and MS/MS to analyse...

  13. Multivariate statistical analysis of major and trace element data for ...

    African Journals Online (AJOL)

    PO Ogunleye, EC Ike, I Garba. No abstract available. Journal of Mining and Geology Vol. 40(2) 2004: 107-117.

  14. Dissecting the polysaccharide-rich grape cell wall matrix using recombinant pectinases during winemaking

    DEFF Research Database (Denmark)

    Gao, Yu; Fangel, Jonatan Ulrik; Willats, William George Tycho

    2016-01-01

    The effectiveness of enzyme-mediated-maceration in red winemaking relies on the use of an optimum combination of specific enzymes. A lack of information on the relevant enzyme activities and the corresponding polysaccharide-rich berry cell wall structure is a major limitation. This study used different combinations of purified recombinant pectinases with cell wall profiling tools to follow the deconstruction process during winemaking. Multivariate data analysis of the glycan microarray (CoMPP) and gas chromatography (GC) results revealed that pectin lyase performed almost as effectively in de-pectination as certain commercial enzyme mixtures. Surprisingly the combination of endo-polygalacturonase and pectin-methyl-esterase only unraveled the cell walls without de-pectination. Datasets from the various combinations used confirmed pectin-rich and xyloglucan-rich layers within the grape pomace. These data...

  15. Multivariate Analysis of Multiple Datasets: a Practical Guide for Chemical Ecology.

    Science.gov (United States)

    Hervé, Maxime R; Nicolè, Florence; Lê Cao, Kim-Anh

    2018-03-01

    Chemical ecology has strong links with metabolomics, the large-scale study of all metabolites detectable in a biological sample. Consequently, chemical ecologists are often challenged by the statistical analyses of such large datasets. This holds especially true when the purpose is to integrate multiple datasets to obtain a holistic view and a better understanding of a biological system under study. The present article provides a comprehensive resource to analyze such complex datasets using multivariate methods. It starts from the necessary pre-treatment of data including data transformations and distance calculations, to the application of both gold standard and novel multivariate methods for the integration of different omics data. We illustrate the process of analysis along with detailed results interpretations for six issues representative of the different types of biological questions encountered by chemical ecologists. We provide the necessary knowledge and tools with reproducible R codes and chemical-ecological datasets to practice and teach multivariate methods.

  16. Interpretability of Multivariate Brain Maps in Linear Brain Decoding: Definition, and Heuristic Quantification in Multivariate Analysis of MEG Time-Locked Effects.

    Science.gov (United States)

    Kia, Seyed Mostafa; Vega Pons, Sandro; Weisz, Nathan; Passerini, Andrea

    2016-01-01

    Brain decoding is a popular multivariate approach for hypothesis testing in neuroimaging. Linear classifiers are widely employed in the brain decoding paradigm to discriminate among experimental conditions. Then, the derived linear weights are visualized in the form of multivariate brain maps to further study spatio-temporal patterns of underlying neural activities. It is well known that the brain maps derived from weights of linear classifiers are hard to interpret because of high correlations between predictors, low signal to noise ratios, and the high dimensionality of neuroimaging data. Therefore, improving the interpretability of brain decoding approaches is of primary interest in many neuroimaging studies. Despite extensive studies of this type, at present, there is no formal definition for interpretability of multivariate brain maps. As a consequence, there is no quantitative measure for evaluating the interpretability of different brain decoding methods. In this paper, first, we present a theoretical definition of interpretability in brain decoding; we show that the interpretability of multivariate brain maps can be decomposed into their reproducibility and representativeness. Second, as an application of the proposed definition, we exemplify a heuristic for approximating the interpretability in multivariate analysis of evoked magnetoencephalography (MEG) responses. Third, we propose to combine the approximated interpretability and the generalization performance of the brain decoding into a new multi-objective criterion for model selection. Our results, for the simulated and real MEG data, show that optimizing the hyper-parameters of the regularized linear classifier based on the proposed criterion results in more informative multivariate brain maps. More importantly, the presented definition provides the theoretical background for quantitative evaluation of interpretability, and hence, facilitates the development of more effective brain decoding algorithms.

  17. Cardiovascular reactivity patterns and pathways to hypertension: a multivariate cluster analysis

    NARCIS (Netherlands)

    Brindle, R. C.; Ginty, A. T.; Jones, A.; Phillips, A. C.; Roseboom, T. J.; Carroll, D.; Painter, R. C.; de Rooij, S. R.

    2016-01-01

    Substantial evidence links exaggerated mental stress induced blood pressure reactivity to future hypertension, but the results for heart rate reactivity are less clear. For this reason multivariate cluster analysis was carried out to examine the relationship between heart rate and blood pressure

  18. Multivariate analysis between air pollutants and meteorological variables in Seoul

    International Nuclear Information System (INIS)

    Kim, J.; Lim, J.

    2005-01-01

    Multivariate analysis was conducted to analyze the relationship between air pollutants and meteorological variables measured in Seoul from January 1 to December 31, 1999. The first principal component showed the contrast effect between O₃ and the other pollutants. The second principal component showed the contrast effect between CO, SO₂, NO₂, and O₃, PM₁₀, TSP. Based on the cluster analysis, three clusters represented different air pollution levels, seasonal characteristics of air pollutants, and meteorological conditions. Discriminant analysis with air environment index (AEI) was carried out to develop an air pollution index function. (orig.)

  19. Enhancing e-waste estimates: Improving data quality by multivariate Input–Output Analysis

    Energy Technology Data Exchange (ETDEWEB)

    Wang, Feng, E-mail: fwang@unu.edu [Institute for Sustainability and Peace, United Nations University, Hermann-Ehler-Str. 10, 53113 Bonn (Germany); Design for Sustainability Lab, Faculty of Industrial Design Engineering, Delft University of Technology, Landbergstraat 15, 2628CE Delft (Netherlands); Huisman, Jaco [Institute for Sustainability and Peace, United Nations University, Hermann-Ehler-Str. 10, 53113 Bonn (Germany); Design for Sustainability Lab, Faculty of Industrial Design Engineering, Delft University of Technology, Landbergstraat 15, 2628CE Delft (Netherlands); Stevels, Ab [Design for Sustainability Lab, Faculty of Industrial Design Engineering, Delft University of Technology, Landbergstraat 15, 2628CE Delft (Netherlands); Baldé, Cornelis Peter [Institute for Sustainability and Peace, United Nations University, Hermann-Ehler-Str. 10, 53113 Bonn (Germany); Statistics Netherlands, Henri Faasdreef 312, 2492 JP Den Haag (Netherlands)

    2013-11-15

    Highlights: • A multivariate Input–Output Analysis method for e-waste estimates is proposed. • Applying multivariate analysis to consolidate data can enhance e-waste estimates. • We examine the influence of model selection and data quality on e-waste estimates. • Datasets of all e-waste related variables in a Dutch case study have been provided. • Accurate modeling of time-variant lifespan distributions is critical for estimate. - Abstract: Waste electrical and electronic equipment (or e-waste) is one of the fastest growing waste streams, which encompasses a wide and increasing spectrum of products. Accurate estimation of e-waste generation is difficult, mainly due to lack of high quality data referred to market and socio-economic dynamics. This paper addresses how to enhance e-waste estimates by providing techniques to increase data quality. An advanced, flexible and multivariate Input–Output Analysis (IOA) method is proposed. It links all three pillars in IOA (product sales, stock and lifespan profiles) to construct mathematical relationships between various data points. By applying this method, the data consolidation steps can generate more accurate time-series datasets from available data pool. This can consequently increase the reliability of e-waste estimates compared to the approach without data processing. A case study in the Netherlands is used to apply the advanced IOA model. As a result, for the first time ever, complete datasets of all three variables for estimating all types of e-waste have been obtained. The result of this study also demonstrates significant disparity between various estimation models, arising from the use of data under different conditions. It shows the importance of applying multivariate approach and multiple sources to improve data quality for modelling, specifically using appropriate time-varying lifespan parameters. Following the case study, a roadmap with a procedural guideline is provided to enhance e
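
    The way sales, stock and lifespan profiles constrain one another can be sketched with a simple discrete-time balance. This is not the paper's IOA model: it assumes a single product, a lifespan profile that does not change over time, and hypothetical names (`waste_and_stock`, `discard_prob`) and numbers.

```python
def waste_and_stock(sales, discard_prob):
    """Link sales, lifespan profile and stock in a simple discrete model.

    sales[t]: units put on the market in year t.
    discard_prob[k]: assumed share of a sales cohort discarded k years
    after sale (a time-invariant lifespan profile).
    Returns (waste, stock), both lists indexed by year.
    """
    waste, stock = [], []
    in_use = 0.0
    for t in range(len(sales)):
        # Waste in year t is the sum of discards from all earlier cohorts.
        generated = sum(
            sales[s] * discard_prob.get(t - s, 0.0) for s in range(t + 1)
        )
        in_use += sales[t] - generated  # stock balance for year t
        waste.append(generated)
        stock.append(in_use)
    return waste, stock


# Hypothetical data: constant sales, half of each cohort discarded after
# one year and the rest after two.
waste, stock = waste_and_stock([100, 100, 100], {1: 0.5, 2: 0.5})
print(waste)  # [0.0, 50.0, 100.0]
print(stock)  # [100.0, 150.0, 150.0]
```

    Because any two of the three series determine the third under such a balance, an observed stock figure can be used to cross-check sales and lifespan data. The abstract stresses time-varying lifespan parameters, which this sketch deliberately leaves out.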

  20. Enhancing e-waste estimates: Improving data quality by multivariate Input–Output Analysis

    International Nuclear Information System (INIS)

    Wang, Feng; Huisman, Jaco; Stevels, Ab; Baldé, Cornelis Peter

    2013-01-01

    Highlights: • A multivariate Input–Output Analysis method for e-waste estimates is proposed. • Applying multivariate analysis to consolidate data can enhance e-waste estimates. • We examine the influence of model selection and data quality on e-waste estimates. • Datasets of all e-waste related variables in a Dutch case study have been provided. • Accurate modeling of time-variant lifespan distributions is critical for estimate. - Abstract: Waste electrical and electronic equipment (or e-waste) is one of the fastest growing waste streams, which encompasses a wide and increasing spectrum of products. Accurate estimation of e-waste generation is difficult, mainly due to lack of high quality data referred to market and socio-economic dynamics. This paper addresses how to enhance e-waste estimates by providing techniques to increase data quality. An advanced, flexible and multivariate Input–Output Analysis (IOA) method is proposed. It links all three pillars in IOA (product sales, stock and lifespan profiles) to construct mathematical relationships between various data points. By applying this method, the data consolidation steps can generate more accurate time-series datasets from available data pool. This can consequently increase the reliability of e-waste estimates compared to the approach without data processing. A case study in the Netherlands is used to apply the advanced IOA model. As a result, for the first time ever, complete datasets of all three variables for estimating all types of e-waste have been obtained. The result of this study also demonstrates significant disparity between various estimation models, arising from the use of data under different conditions. It shows the importance of applying multivariate approach and multiple sources to improve data quality for modelling, specifically using appropriate time-varying lifespan parameters. Following the case study, a roadmap with a procedural guideline is provided to enhance e

  1. Multivariate analysis of data in sensory science

    CERN Document Server

    Naes, T; Risvik, E

    1996-01-01

    The state-of-the-art of multivariate analysis in sensory science is described in this volume. Both methods for aggregated and individual sensory profiles are discussed. Processes and results are presented in such a way that they can be understood not only by statisticians but also by experienced sensory panel leaders and users of sensory analysis. The techniques presented are focused on examples and interpretation rather than on the technical aspects, with an emphasis on new and important methods which are possibly not so well known to scientists in the field. Important features of the book are discussions on the relationship among the methods with a strong accent on the connection between problems and methods. All procedures presented are described in relation to sensory data and not as completely general statistical techniques. Sensory scientists, applied statisticians, chemometricians, those working in consumer science, food scientists and agronomers will find this book of value.

  2. Clinical patch test data evaluated by multivariate analysis. Danish Contact Dermatitis Group

    DEFF Research Database (Denmark)

    Christophersen, J; Menné, T; Tanghøj, P

    1989-01-01

    The aim of the present study was to evaluate the influence of individual explanatory factors, such as sex, age, atopy, test time and presence of diseased skin, on clinical patch test results, by application of multivariate statistical analysis. The study population was 2166 consecutive patients patch tested with the standard series of the International Contact Dermatitis Research Group (ICDRG) by members of the Danish Contact Dermatitis Group (DCDG) over a period of 6 months. For the 8 test allergens most often found positive (nickel, fragrance-mix, cobalt, chromate, balsam of Peru, carba-mix, colophony, and formaldehyde), one or more individual factors were of significance for the risk of being sensitized, except for chromate and formaldehyde. It is concluded that patch test results can be compared only after stratification of the material or by multivariate analysis.

  3. Objective classification of ecological status in marine water bodies using ecotoxicological information and multivariate analysis.

    Science.gov (United States)

    Beiras, Ricardo; Durán, Iria

    2014-12-01

    Some relevant shortcomings have been identified in the current approach for the classification of ecological status in marine water bodies, leading to delays in the fulfillment of the Water Framework Directive objectives. Natural variability makes it difficult to set fixed reference values and boundary values for the Ecological Quality Ratios (EQR) for the biological quality elements. Biological responses to environmental degradation are frequently of nonmonotonic nature, hampering the EQR approach. Community structure traits respond only once ecological damage has already been done and do not provide early warning signals. An alternative methodology for the classification of ecological status integrating chemical measurements, ecotoxicological bioassays and community structure traits (species richness and diversity), and using multivariate analyses (multidimensional scaling and cluster analysis), is proposed. This approach does not depend on the arbitrary definition of fixed reference values and EQR boundary values, and it is suitable to integrate nonlinear, sensitive signals of ecological degradation. As a disadvantage, this approach demands the inclusion of sampling sites representing the full range of ecological status in each monitoring campaign. National or international agencies in charge of coastal pollution monitoring have comprehensive data sets available to overcome this limitation.

  4. Decomposition of multivariate phenotypic means in multigroup genetic covariance structure analysis

    NARCIS (Netherlands)

    Dolan, C.V.; Molenaar, P.C.M.; Boomsma, D.I.

    1992-01-01

    Uses D. Sorbom's (1974) method to study differences in latent means in multivariate twin data. By restricting the analysis to a comparison between groups, the results pertain only to the additive contributions of common genetic and environmental factors to the deviation of the group means from what

  5. Multivariate data analysis approach to understand magnetic properties of perovskite manganese oxides

    International Nuclear Information System (INIS)

    Imamura, N.; Mizoguchi, T.; Yamauchi, H.; Karppinen, M.

    2008-01-01

    Here we apply statistical multivariate data analysis techniques to obtain some insights into the complex structure-property relations in antiferromagnetic (AFM) and ferromagnetic (FM) manganese perovskite systems, AMnO₃. The 131 samples included in the present analyses are described by 21 crystal-structure or crystal-chemical (CS/CC) parameters. Principal component analysis (PCA), carried out separately for the AFM and FM compounds, is used to model and evaluate the various relationships among the magnetic properties and the various CS/CC parameters. Moreover, for the AFM compounds, PLS (partial least squares projections to latent structures) analysis is performed so as to predict the magnitude of the Néel temperature on the basis of the CS/CC parameters. Finally, the so-called PLS-DA (PLS discriminant analysis) method is employed to find the most influential/characteristic CS/CC parameters that differentiate the two classes of compounds from each other. - Graphical abstract: Statistical multivariate data analysis techniques are applied to detect structure-property relations in antiferromagnetic (AFM) and ferromagnetic (FM) manganese perovskites. For AFM compounds, partial least squares projections to latent structures analysis predicts the magnitude of the Néel temperature on the basis of structural parameters only. Moreover, AFM and FM compounds are well separated by means of the so-called partial least squares discriminant analysis method.

  6. Identification of multivariate models for noise analysis of nuclear plant

    International Nuclear Information System (INIS)

    Zwingelstein, G.C.; Upadhyaya, B.R.

    1979-01-01

    During the normal operation of a pressurized water reactor, neutron noise analysis with multivariate autoregressive procedures is a valuable diagnostic tool to extract dynamic characteristics for incipient failure detection. The first part of the paper will describe in detail the equations for estimating the multivariate autoregressive model matrices and the structure of various matrices. The matrices are estimated by solving a set of matrix equations, called Yule-Walker equations. The selection of optimal model order will also be discussed. Once the optimal parameter set is obtained, simple and fast calculations are used to determine the auto power spectral density, cross spectra, coherence function, and phase. In addition, the spectra may be decomposed into components contributed by different noise sources. An application using neutron flux data collected at a nuclear plant will illustrate the efficiency of the method.
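
    For the simplest case, a two-channel autoregressive model of order 1, the Yule-Walker equations reduce to C(1) = A C(0), where C(k) is the lag-k covariance matrix, so A = C(1) C(0)⁻¹. The sketch below estimates A from simulated data in plain Python. The order, the coefficient matrix, the noise level and all function names are assumptions chosen for illustration; the paper's own equations and model-order selection are not reproduced here.

```python
import random


def simulate_var1(coef, n_samples, seed=0):
    """Simulate a bivariate AR(1) process x_t = coef @ x_{t-1} + noise."""
    rng = random.Random(seed)
    x = [0.0, 0.0]
    series = []
    for _ in range(n_samples):
        noise = [rng.gauss(0.0, 1.0), rng.gauss(0.0, 1.0)]
        x = [
            coef[0][0] * x[0] + coef[0][1] * x[1] + noise[0],
            coef[1][0] * x[0] + coef[1][1] * x[1] + noise[1],
        ]
        series.append(x)
    return series


def lagged_covariance(series, lag):
    """Sample estimate of C(lag) = E[x_t x_{t-lag}^T] for a 2-channel series."""
    n = len(series)
    mean = [sum(row[i] for row in series) / n for i in range(2)]
    cov = [[0.0, 0.0], [0.0, 0.0]]
    for t in range(lag, n):
        for i in range(2):
            for j in range(2):
                cov[i][j] += (series[t][i] - mean[i]) * (series[t - lag][j] - mean[j])
    return [[cov[i][j] / (n - lag) for j in range(2)] for i in range(2)]


def yule_walker_var1(series):
    """Solve the order-1 Yule-Walker equation C(1) = A C(0) for A."""
    c0 = lagged_covariance(series, 0)
    c1 = lagged_covariance(series, 1)
    det = c0[0][0] * c0[1][1] - c0[0][1] * c0[1][0]
    c0_inv = [
        [c0[1][1] / det, -c0[0][1] / det],
        [-c0[1][0] / det, c0[0][0] / det],
    ]
    return [
        [sum(c1[i][k] * c0_inv[k][j] for k in range(2)) for j in range(2)]
        for i in range(2)
    ]


true_coef = [[0.5, 0.2], [-0.1, 0.3]]  # hypothetical, chosen to be stable
estimate = yule_walker_var1(simulate_var1(true_coef, 20000))
for row in estimate:
    print([round(value, 2) for value in row])
```

    With a long enough record the printed estimate should lie close to `true_coef`. Higher model orders lead to a block system of the same form, which is usually solved with a linear algebra library rather than by hand.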

  7. Handbook of univariate and multivariate data analysis with IBM SPSS

    CERN Document Server

    Ho, Robert

    2013-01-01

    Using the same accessible, hands-on approach as its best-selling predecessor, the Handbook of Univariate and Multivariate Data Analysis with IBM SPSS, Second Edition explains how to apply statistical tests to experimental findings, identify the assumptions underlying the tests, and interpret the findings. This second edition now covers more topics and has been updated with the SPSS statistical package for Windows. New to the Second Edition: three new chapters on multiple discriminant analysis, logistic regression, and canonical correlation; new section on how to deal with missing data; coverage of te

  8. A kernel version of multivariate alteration detection

    DEFF Research Database (Denmark)

    Nielsen, Allan Aasbjerg; Vestergaard, Jacob Schack

    2013-01-01

    Based on the established methods kernel canonical correlation analysis and multivariate alteration detection we introduce a kernel version of multivariate alteration detection. A case study with SPOT HRV data shows that the kMAD variates focus on extreme change observations.

  9. Integrated GIS and multivariate statistical analysis for regional scale assessment of heavy metal soil contamination: A critical review

    International Nuclear Information System (INIS)

    Hou, Deyi; O'Connor, David; Nathanail, Paul; Tian, Li; Ma, Yan

    2017-01-01

    Heavy metal soil contamination is associated with potential toxicity to humans or ecotoxicity. Scholars have increasingly used a combination of geographical information science (GIS) with geostatistical and multivariate statistical analysis techniques to examine the spatial distribution of heavy metals in soils at a regional scale. A review of such studies showed that most soil sampling programs were based on grid patterns and composite sampling methodologies. Many programs intended to characterize various soil types and land use types. The most often used sampling depth intervals were 0–0.10 m, or 0–0.20 m, below surface; and the sampling densities used ranged from 0.0004 to 6.1 samples per km², with a median of 0.4 samples per km². The most widely used spatial interpolators were inverse distance weighted interpolation and ordinary kriging; and the most often used multivariate statistical analysis techniques were principal component analysis and cluster analysis. The review also identified several determining and correlating factors in heavy metal distribution in soils, including soil type, soil pH, soil organic matter, land use type, Fe, Al, and heavy metal concentrations. The major natural and anthropogenic sources of heavy metals were found to derive from lithogenic origin, roadway and transportation, atmospheric deposition, wastewater and runoff from industrial and mining facilities, fertilizer application, livestock manure, and sewage sludge. This review argues that the full potential of integrated GIS and multivariate statistical analysis for assessing heavy metal distribution in soils on a regional scale has not yet been fully realized. It is proposed that future research be conducted to map multivariate results in GIS to pinpoint specific anthropogenic sources, to analyze temporal trends in addition to spatial patterns, to optimize modeling parameters, and to expand the use of different multivariate analysis tools beyond principal component
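
    Inverse distance weighted interpolation, named above as one of the most widely used interpolators, estimates a value at an unsampled location as a weighted mean of nearby samples, with weights proportional to 1/dᵖ. The sketch below is a minimal version under assumed settings: the exponent p = 2, the function name `idw_interpolate` and the sample coordinates and concentrations are illustrative and are not taken from the review.

```python
import math


def idw_interpolate(samples, x, y, power=2.0):
    """Inverse distance weighted estimate at (x, y).

    samples: list of (x_i, y_i, value_i) tuples.
    power: distance exponent; 2 is a common default, assumed here.
    """
    weighted_sum = 0.0
    weight_total = 0.0
    for sx, sy, value in samples:
        distance = math.hypot(x - sx, y - sy)
        if distance == 0.0:
            return value  # query coincides with a sample location
        weight = 1.0 / distance ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total


# Hypothetical lead concentrations (mg/kg) at three sampling locations (km).
samples = [(0.0, 0.0, 10.0), (2.0, 0.0, 20.0), (1.0, 5.0, 60.0)]
print(idw_interpolate(samples, 2.0, 0.0))  # 20.0
print(idw_interpolate(samples, 1.0, 0.0))
```

    Unlike ordinary kriging, this estimator uses no model of spatial covariance, so it gives no estimate of prediction uncertainty; the exponent controls how quickly the influence of distant samples decays.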

  10. Multivariate statistical methods a first course

    CERN Document Server

    Marcoulides, George A

    2014-01-01

    Multivariate statistics refer to an assortment of statistical methods that have been developed to handle situations in which multiple variables or measures are involved. Any analysis of more than two variables or measures can loosely be considered a multivariate statistical analysis. An introductory text for students learning multivariate statistical methods for the first time, this book keeps mathematical details to a minimum while conveying the basic principles. One of the principal strategies used throughout the book--in addition to the presentation of actual data analyses--is poin

  11. An Introduction to Applied Multivariate Analysis

    CERN Document Server

    Raykov, Tenko

    2008-01-01

    Focuses on the core multivariate statistics topics that are fundamental to understanding the field. The book emphasizes the topics that are critical to those in the behavioral, social, and educational sciences.

  12. Multivariate time series analysis with R and financial applications

    CERN Document Server

    Tsay, Ruey S

    2013-01-01

    Since the publication of his first book, Analysis of Financial Time Series, Ruey Tsay has become one of the most influential and prominent experts on the topic of time series. Different from the traditional and oftentimes complex approach to multivariate (MV) time series, this sequel book emphasizes structural specification, which results in simplified parsimonious VARMA modeling and, hence, eases comprehension. Through a fundamental balance between theory and applications, the book supplies readers with an accessible approach to financial econometric models and their applications to real-worl

  13. Testing Mean Differences among Groups: Multivariate and Repeated Measures Analysis with Minimal Assumptions.

    Science.gov (United States)

    Bathke, Arne C; Friedrich, Sarah; Pauly, Markus; Konietschke, Frank; Staffen, Wolfgang; Strobl, Nicolas; Höller, Yvonne

    2018-03-22

    To date, there is a lack of satisfactory inferential techniques for the analysis of multivariate data in factorial designs, when only minimal assumptions on the data can be made. Presently available methods are limited to very particular study designs or assume either multivariate normality or equal covariance matrices across groups, or they do not allow for an assessment of the interaction effects across within-subjects and between-subjects variables. We propose and methodologically validate a parametric bootstrap approach that does not suffer from any of the above limitations, and thus provides a rather general and comprehensive methodological route to inference for multivariate and repeated measures data. As an example application, we consider data from two different Alzheimer's disease (AD) examination modalities that may be used for precise and early diagnosis, namely, single-photon emission computed tomography (SPECT) and electroencephalogram (EEG). These data violate the assumptions of classical multivariate methods, and indeed classical methods would not have yielded the same conclusions with regard to some of the factors involved.

  14. Estimation of failure criteria in multivariate sensory shelf life testing using survival analysis.

    Science.gov (United States)

    Giménez, Ana; Gagliardi, Andrés; Ares, Gastón

    2017-09-01

    For most food products, shelf life is determined by changes in their sensory characteristics. A predetermined increase or decrease in the intensity of a sensory characteristic has frequently been used to signal that a product has reached the end of its shelf life. Considering that all attributes change simultaneously, the concept of multivariate shelf life allows a single measurement of deterioration that takes into account all these sensory changes at a certain storage time. The aim of the present work was to apply survival analysis to estimate failure criteria in multivariate sensory shelf life testing using two case studies, hamburger buns and orange juice, by modelling the relationship between consumers' rejection of the product and the deterioration index estimated using PCA. In both studies, a panel of 13 trained assessors evaluated the samples using descriptive analysis whereas a panel of 100 consumers answered a "yes" or "no" question regarding intention to buy or consume the product. PC1 explained the great majority of the variance, indicating all sensory characteristics evolved similarly with storage time. Thus, PC1 could be regarded as an index of sensory deterioration and a single failure criterion could be estimated through survival analysis for 25 and 50% consumers' rejection. The proposed approach based on multivariate shelf life testing may increase the accuracy of shelf life estimations. Copyright © 2017 Elsevier Ltd. All rights reserved.
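
    The idea of using PC1 as a single deterioration index can be sketched as follows. This is a minimal illustration, not the authors' procedure: the attribute values, storage times and the helper name first_pc_scores are all hypothetical, and the survival-analysis step that links the index to consumer rejection is not shown.

```python
import numpy as np

def first_pc_scores(X):
    """Return the PC1 scores of X and the fraction of variance PC1 explains.

    X has one row per storage time and one column per sensory attribute.
    """
    Xc = X - X.mean(axis=0)                      # centre each attribute
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    explained = s**2 / np.sum(s**2)              # variance share per component
    scores = Xc @ Vt[0]                          # projection onto PC1
    return scores, explained[0]

# Hypothetical descriptive-panel means (assumed data, for illustration only)
storage_days = np.array([0.0, 3.0, 6.0, 9.0, 12.0])
X = np.column_stack([
    1.0 + 0.50 * storage_days,   # an attribute that increases with storage
    8.0 - 0.40 * storage_days,   # an attribute that decreases with storage
    2.0 + 0.30 * storage_days,   # another increasing attribute
])

scores, frac = first_pc_scores(X)

# The sign of a principal component is arbitrary; orient the index so that
# larger values mean more deterioration (longer storage).
if np.corrcoef(scores, storage_days)[0, 1] < 0:
    scores = -scores
```

    When every attribute drifts together with storage time, as in this toy data, frac is close to 1 and scores increases with storage time, which is the situation the abstract describes.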

  15. Present state and multivariate analysis of a few juniper forests of Baluchistan, Pakistan

    International Nuclear Information System (INIS)

    Ahmed, M.; Siddiqui, M.F.

    2015-01-01

    Quantitative multivariate investigations were carried out to explore the various forms of Juniper trees resulting from human disturbance and natural phenomena. Thirty stands were sampled by the point-centered quarter method, and the data were analysed using Ward's cluster analysis and Bray-Curtis ordination. On the basis of multivariate analysis, eight forms were recognized: healthy, unhealthy, over-mature, disturbed, dieback, standing dead, logs and cut stems. Structural attributes were computed. The highest numbers of logs (130-133 stems ha⁻¹) were recorded from the Cautair and Khunk forests. The highest density of healthy plants (229 ha⁻¹) was estimated for the Tangi Top area, while the lowest (24 ha⁻¹) was found in the Saraghara area. Multivariate analysis showed five groups in the cluster and ordination diagrams. These groups are characterized on the basis of healthy, over-mature, disturbed and logged Juniper trees. Higher numbers of disturbed trees (115, 96, 84 and 80 ha⁻¹) were found at Speena Sukher, Srag Kazi, Prang Shella and Tangi Top, respectively. Overall density does not show any significant relation with basal area (m² ha⁻¹), degree of slope or elevation of the sampling stands. The present study shows that all Juniper stands are highly disturbed, mostly due to human influence; therefore, prompt conservation steps should be taken to save these forests. (author)

  16. UNCOVERING THE FORMATION OF ULTRACOMPACT DWARF GALAXIES BY MULTIVARIATE STATISTICAL ANALYSIS

    International Nuclear Information System (INIS)

    Chattopadhyay, Tanuka; Sharina, Margarita; Davoust, Emmanuel; De, Tuli; Chattopadhyay, Asis Kumar

    2012-01-01

    We present a statistical analysis of the properties of a large sample of dynamically hot old stellar systems, from globular clusters (GCs) to giant ellipticals, which was performed in order to investigate the origin of ultracompact dwarf galaxies (UCDs). The data were mostly drawn from Forbes et al. We recalculated some of the effective radii, computed mean surface brightnesses and mass-to-light ratios, and estimated ages and metallicities. We completed the sample with GCs of M31. We used a multivariate statistical technique (K-Means clustering), together with a new algorithm (Gap Statistics) for finding the optimum number of homogeneous sub-groups in the sample, using a total of six parameters (absolute magnitude, effective radius, virial mass-to-light ratio, stellar mass-to-light ratio, and metallicity). We found six groups. FK1 and FK5 are composed of high- and low-mass elliptical galaxies, respectively. FK3 and FK6 are composed of high-metallicity and low-metallicity objects, respectively, and both include GCs and UCDs. Two very small groups, FK2 and FK4, are composed of Local Group dwarf spheroidals. Our groups differ in their mean masses and virial mass-to-light ratios. The relations between these two parameters are also different for the various groups. The probability density distributions of metallicity for the four groups of galaxies are similar to those of the GCs and UCDs. The brightest low-metallicity GCs and UCDs tend to follow the mass-metallicity relation like elliptical galaxies. The objects of FK3 are more metal-rich per unit effective luminosity density than high-mass ellipticals.

  17. UNCOVERING THE FORMATION OF ULTRACOMPACT DWARF GALAXIES BY MULTIVARIATE STATISTICAL ANALYSIS

    Energy Technology Data Exchange (ETDEWEB)

    Chattopadhyay, Tanuka [Department of Applied Mathematics, Calcutta University, 92 A.P.C. Road, Calcutta 700009 (India); Sharina, Margarita [Special Astrophysical Observatory, Russian Academy of Sciences, N. Arkhyz, KCh R 369167 (Russian Federation); Davoust, Emmanuel [IRAP, Universite de Toulouse, CNRS, 14 Avenue Edouard Belin, 31400 Toulouse (France); De, Tuli; Chattopadhyay, Asis Kumar, E-mail: tanuka@iucaa.ernet.in, E-mail: sme@sao.ru, E-mail: davoust@ast.obs-mip.fr, E-mail: akcstat@caluniv.ac.in [Department of Statistics, Calcutta University, 35 B.C. Road, Calcutta 700019 (India)

    2012-05-10

    We present a statistical analysis of the properties of a large sample of dynamically hot old stellar systems, from globular clusters (GCs) to giant ellipticals, which was performed in order to investigate the origin of ultracompact dwarf galaxies (UCDs). The data were mostly drawn from Forbes et al. We recalculated some of the effective radii, computed mean surface brightnesses and mass-to-light ratios, and estimated ages and metallicities. We completed the sample with GCs of M31. We used a multivariate statistical technique (K-Means clustering), together with a new algorithm (Gap Statistics) for finding the optimum number of homogeneous sub-groups in the sample, using a total of six parameters (absolute magnitude, effective radius, virial mass-to-light ratio, stellar mass-to-light ratio, and metallicity). We found six groups. FK1 and FK5 are composed of high- and low-mass elliptical galaxies, respectively. FK3 and FK6 are composed of high-metallicity and low-metallicity objects, respectively, and both include GCs and UCDs. Two very small groups, FK2 and FK4, are composed of Local Group dwarf spheroidals. Our groups differ in their mean masses and virial mass-to-light ratios. The relations between these two parameters are also different for the various groups. The probability density distributions of metallicity for the four groups of galaxies are similar to those of the GCs and UCDs. The brightest low-metallicity GCs and UCDs tend to follow the mass-metallicity relation like elliptical galaxies. The objects of FK3 are more metal-rich per unit effective luminosity density than high-mass ellipticals.

  18. Multivariate covariance generalized linear models

    DEFF Research Database (Denmark)

    Bonat, W. H.; Jørgensen, Bent

    2016-01-01

    We propose a general framework for non-normal multivariate data analysis called multivariate covariance generalized linear models, designed to handle multivariate response variables, along with a wide range of temporal and spatial correlation structures defined in terms of a covariance link function combined with a matrix linear predictor involving known matrices. The method is motivated by three data examples that are not easily handled by existing methods. The first example concerns multivariate count data, the second involves response variables of mixed types, combined with repeated... are fitted by using an efficient Newton scoring algorithm based on quasi-likelihood and Pearson estimating functions, using only second-moment assumptions. This provides a unified approach to a wide variety of types of response variables and covariance structures, including multivariate extensions...

  19. Multivariate analysis of eigenvalues and eigenvectors in tensor based morphometry

    Science.gov (United States)

    Rajagopalan, Vidya; Schwartzman, Armin; Hua, Xue; Leow, Alex; Thompson, Paul; Lepore, Natasha

    2015-01-01

    We develop a new algorithm to compute voxel-wise shape differences in tensor-based morphometry (TBM). As in standard TBM, we non-linearly register brain T1-weighted MRI data from a patient and control group to a template, and compute the Jacobian of the deformation fields. In standard TBM, the determinants of the Jacobian matrix at each voxel are statistically compared between the two groups. More recently, a multivariate extension of the statistical analysis involving the deformation tensors derived from the Jacobian matrices has been shown to improve statistical detection power [7]. However, multivariate methods comprising large numbers of variables are computationally intensive and may be subject to noise. In addition, the anatomical interpretation of results is sometimes difficult. Here instead, we analyze the eigenvalues and the eigenvectors of the Jacobian matrices. Our method is validated on brain MRI data from Alzheimer's patients and healthy elderly controls from the Alzheimer's Disease Neuro Imaging Database.
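
    The quantities named in this abstract can be illustrated at a single voxel. The sketch below is an assumption-laden example, not the authors' algorithm: the Jacobian J is made up, and the deformation tensor is taken to be S = (JᵀJ)^(1/2), a common choice that the abstract itself does not specify.

```python
import numpy as np

# Hypothetical Jacobian of the deformation field at one voxel (assumed values)
J = np.array([[1.10, 0.05, 0.00],
              [0.02, 0.95, 0.01],
              [0.00, 0.03, 1.02]])

# Standard TBM statistic: the Jacobian determinant (local volume change)
det_J = np.linalg.det(J)

# Eigen-decomposition of the symmetric matrix J^T J.
# eigh returns eigenvalues in ascending order and orthonormal eigenvectors
# as the columns of `directions`.
w, directions = np.linalg.eigh(J.T @ J)

# Principal stretches: eigenvalues of the assumed deformation tensor S
stretches = np.sqrt(w)

# Assumed deformation tensor S = (J^T J)^(1/2), rebuilt from its eigen-pairs
S = directions @ np.diag(stretches) @ directions.T
```

    The product of the stretches equals |det J|, so the eigenvalues carry the usual volume-change information, while the eigenvectors add the directions along which the tissue is stretched or compressed.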

  20. Multivariate Survival Mixed Models for Genetic Analysis of Longevity Traits

    DEFF Research Database (Denmark)

    Pimentel Maia, Rafael; Madsen, Per; Labouriau, Rodrigo

    2014-01-01

    A class of multivariate mixed survival models for continuous and discrete time with a complex covariance structure is introduced in a context of quantitative genetic applications. The methods introduced can be used in many applications in quantitative genetics although the discussion presented concentrates on longevity studies. The framework presented allows to combine models based on continuous time with models based on discrete time in a joint analysis. The continuous time models are approximations of the frailty model in which the hazard function will be assumed to be piece-wise constant... applications. The methods presented are implemented in such a way that large and complex quantitative genetic data can be analyzed...

  1. Multivariate Survival Mixed Models for Genetic Analysis of Longevity Traits

    DEFF Research Database (Denmark)

    Pimentel Maia, Rafael; Madsen, Per; Labouriau, Rodrigo

    2013-01-01

    A class of multivariate mixed survival models for continuous and discrete time with a complex covariance structure is introduced in a context of quantitative genetic applications. The methods introduced can be used in many applications in quantitative genetics although the discussion presented concentrates on longevity studies. The framework presented allows to combine models based on continuous time with models based on discrete time in a joint analysis. The continuous time models are approximations of the frailty model in which the hazard function will be assumed to be piece-wise constant... applications. The methods presented are implemented in such a way that large and complex quantitative genetic data can be analyzed...

  2. A robust multivariate long run analysis of European electricity prices

    OpenAIRE

    Bruno Bosco; Lucia Parisio; Matteo Pelagatti; Fabio Baldi

    2007-01-01

    This paper analyses the interdependencies existing in wholesale electricity prices in six major European countries. The results of our robust multivariate long run dynamic analysis reveal the presence of four highly integrated central European markets (France, Germany, the Netherlands and Austria). The trend shared by these four electricity markets appears to be common also to gas prices, but not to oil prices. The existence of long term dynamics among electricity prices and between electrici...

  3. Oxidative stability of frozen mackerel batches ― A multivariate data analysis approach

    DEFF Research Database (Denmark)

    Helbo Ekgreen, M.; Frosch, Stina; Baron, Caroline Pascale

    2011-01-01

    ...deterioration and texture changes. The aim was to investigate the correlation between the raw material history and the quality loss observed during frozen storage using relevant multivariate data analysis such as Principal Component Analysis (PCA) and Partial Least Square Analysis (PLS). Preliminary results showed that it was possible to differentiate between the different batches depending on their history and that some batches were more oxidised than others. Furthermore, based on the results from the data analysis, critical control points in the entire production chain will be identified and strategies...

  4. Authentication of Trappist beers by LC-MS fingerprints and multivariate data analysis.

    Science.gov (United States)

    Mattarucchi, Elia; Stocchero, Matteo; Moreno-Rojas, José Manuel; Giordano, Giuseppe; Reniero, Fabiano; Guillou, Claude

    2010-12-08

    The aim of this study was to assess the applicability of LC-MS profiling to authenticate a selected Trappist beer as part of a program on traceability funded by the European Commission. A total of 232 beers were fingerprinted and classified through multivariate data analysis. The selected beer was clearly distinguished from beers of different brands, while only 3 samples (3.5% of the test set) were wrongly classified when compared with other types of beer of the same Trappist brewery. The fingerprints were further analyzed to extract the most discriminating variables, which proved to be sufficient for classification, even using a simplified unsupervised model. This reduced fingerprint allowed us to study the influence of batch-to-batch variability on the classification model. Our results can easily be applied to different matrices and they confirmed the effectiveness of LC-MS profiling in combination with multivariate data analysis for the characterization of food products.

  5. Multivariable nonlinear analysis of foreign exchange rates

    Science.gov (United States)

    Suzuki, Tomoya; Ikeguchi, Tohru; Suzuki, Masuo

    2003-05-01

    We analyze multivariable time series of foreign exchange rates. The variables are price movements, which have often been analyzed, together with dealing time intervals and spreads between bid and ask prices. Treating dealing time intervals as event timings, like the firings of neurons, we use raster plots (RPs) and peri-stimulus time histograms (PSTHs), which are popular methods in the field of neurophysiology. By introducing special processing to obtain RPs and PSTHs for exchange rate time series, we discover that there exists dynamical interaction among the three variables. We also find that adopting multiple variables improves prediction accuracy.

  6. Multivariate methods for analysis of environmental reference materials using laser-induced breakdown spectroscopy

    Directory of Open Access Journals (Sweden)

    Shikha Awasthi

    2017-06-01

    Analysis of emission from laser-induced plasma has a unique capability for quantifying the major and minor elements present in any type of sample under optimal analysis conditions. Chemometric techniques are very effective and reliable tools for quantification of multiple components in complex matrices. The feasibility of laser-induced breakdown spectroscopy (LIBS) in combination with multivariate analysis was investigated for the analysis of environmental reference materials (RMs). In the present work, different (Certified/Standard) Reference Materials of soil and plant origin were analyzed using LIBS, and Al, Ca, Mg, Fe, K, Mn and Si were identified in the LIBS spectra of these materials. Multivariate statistical methods (Partial Least Square Regression and Partial Least Square Discriminant Analysis) were employed for quantitative analysis of the constituent elements using the LIBS spectral data. Calibration models were used to predict the concentrations of the different elements in test samples, and the predicted concentrations were then compared with certified concentrations to check the authenticity of the models. The non-destructive analytical method Instrumental Neutron Activation Analysis (INAA), using high-flux reactor neutrons and high-resolution gamma-ray spectrometry, was also used for intercomparison with the LIBS results for two RMs.

  7. Decomposition and Simplification of Multivariate Data using Pareto Sets.

    Science.gov (United States)

    Huettenberger, Lars; Heine, Christian; Garth, Christoph

    2014-12-01

    Topological and structural analysis of multivariate data is aimed at improving the understanding and usage of such data through identification of intrinsic features and structural relationships among multiple variables. We present two novel methods for simplifying so-called Pareto sets that describe such structural relationships. Such simplification is a precondition for meaningful visualization of structurally rich or noisy data. As a framework for simplification operations, we introduce a decomposition of the data domain into regions of equivalent structural behavior and the reachability graph that describes global connectivity of Pareto extrema. Simplification is then performed as a sequence of edge collapses in this graph; to determine a suitable sequence of such operations, we describe and utilize a comparison measure that reflects the changes to the data that each operation represents. We demonstrate and evaluate our methods on synthetic and real-world examples.

  8. Spatial compression algorithm for the analysis of very large multivariate images

    Science.gov (United States)

    Keenan, Michael R [Albuquerque, NM]

    2008-07-15

    A method for spatially compressing data sets enables the efficient analysis of very large multivariate images. The spatial compression algorithms use a wavelet transformation to map an image into a compressed image containing a smaller number of pixels that retain the original image's information content. Image analysis can then be performed on a compressed data matrix consisting of a reduced number of significant wavelet coefficients. Furthermore, a block algorithm can be used for performing common operations more efficiently. The spatial compression algorithms can be combined with spectral compression algorithms to provide further computational efficiencies.
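
    A single level of spatial wavelet compression can be sketched with the Haar wavelet. This is a simplified illustration under stated assumptions, not the patented method: it keeps only the Haar approximation coefficients and discards all detail coefficients, whereas the abstract refers to retaining the significant wavelet coefficients. The function name haar_approx and the test image are hypothetical.

```python
import numpy as np

def haar_approx(img):
    """One level of 2-D Haar approximation for a multivariate image.

    img has shape (rows, cols, channels) with even rows and cols. Each
    2x2 spatial block is replaced by its orthonormal Haar approximation
    coefficient (block sum divided by 2), channel by channel, so the
    result has one quarter as many pixels.
    """
    rows, cols, channels = img.shape
    if rows % 2 or cols % 2:
        raise ValueError("rows and cols must be even")
    blocks = img.reshape(rows // 2, 2, cols // 2, 2, channels)
    return blocks.sum(axis=(1, 3)) / 2.0

# Hypothetical 4x4 image with 3 spectral channels (assumed data)
image = np.ones((4, 4, 3))
compressed = haar_approx(image)

# Unfold to a (pixels x channels) data matrix for later multivariate analysis
data_matrix = compressed.reshape(-1, compressed.shape[-1])
```

    For a spatially smooth image almost all of the signal energy sits in these approximation coefficients, which is why a multivariate analysis of the much smaller data matrix can stand in for an analysis of the full image.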

  9. Dissecting the polysaccharide-rich grape cell wall matrix using recombinant pectinases during winemaking.

    Science.gov (United States)

    Gao, Yu; Fangel, Jonatan U; Willats, William G T; Vivier, Melané A; Moore, John P

    2016-11-05

    The effectiveness of enzyme-mediated maceration in red winemaking relies on the use of an optimum combination of specific enzymes. A lack of information on the relevant enzyme activities and the corresponding polysaccharide-rich berry cell wall structure is a major limitation. This study used different combinations of purified recombinant pectinases with cell wall profiling tools to follow the deconstruction process during winemaking. Multivariate data analysis of the glycan microarray (CoMPP) and gas chromatography (GC) results revealed that pectin lyase performed almost as effectively in de-pectination as certain commercial enzyme mixtures. Surprisingly, the combination of endo-polygalacturonase and pectin-methyl-esterase only unraveled the cell walls without de-pectination. Datasets from the various combinations used confirmed pectin-rich and xyloglucan-rich layers within the grape pomace. These data support a proposed grape cell wall model which can serve as a foundation to evaluate testable hypotheses in future studies aimed at developing tailor-made enzymes for winemaking scenarios. Copyright © 2016 Elsevier Ltd. All rights reserved.

  10. Multivariate stochastic analysis for Monthly hydrological time series at Cuyahoga River Basin

    Science.gov (United States)

    Zhang, L.

    2011-12-01

    Copulas have become a very powerful statistical and stochastic methodology for multivariate analysis in environmental and water resources engineering. In recent years, the popular one-parameter Archimedean copulas (e.g. the Gumbel-Hougaard, Cook-Johnson and Frank copulas) and the meta-elliptical copulas (e.g. the Gaussian and Student-t copulas) have been applied in multivariate hydrological analyses, e.g. multivariate rainfall (rainfall intensity, duration and depth), flood (peak discharge, duration and volume), and drought analyses (drought length, mean and minimum SPI values, and drought mean areal extent). Copulas have also been applied in flood frequency analysis at the confluences of river systems by taking into account the dependence among upstream gauge stations rather than by using hydrological routing techniques. In most of the studies above, the annual time series have been treated as stationary signals, with the observations assumed to be independent identically distributed (i.i.d.) random variables. In reality, however, hydrological time series, especially daily and monthly series, cannot be considered i.i.d. random variables because of the periodicity present in the data. The stationarity assumption is also in question because of climate change and land use and land cover (LULC) change in recent years. It is therefore necessary to re-evaluate the classic approach to the study of hydrological time series by relaxing the stationarity assumption through a nonstationary approach. As for the dependence structure of hydrological time series, the assumption that all variables follow the same type of univariate distribution also needs to be relaxed by adopting copula theory. In this paper, the univariate monthly hydrological time series will be studied through the nonstationary time series analysis approach. The dependence structure of the multivariate monthly hydrological time series will be

  11. Multivariate cluster analysis of dynamic iodine-123 iodobenzamide SPET dopamine D2 receptor images in schizophrenia

    International Nuclear Information System (INIS)

    Acton, P.D.; Pilowsky, L.S.; Costa, D.C.; Ell, P.J.

    1997-01-01

    This paper describes the application of a multivariate statistical technique to investigate striatal dopamine D2 receptor concentrations measured by iodine-123 iodobenzamide (¹²³I-IBZM) single-photon emission tomography (SPET). This technique enables the automatic segmentation of dynamic nuclear medicine images based on the underlying time-activity curves present in the data. Once the time-activity curves have been extracted, each pixel can be mapped back on to the underlying distribution, considerably reducing image noise. Cluster analysis has been verified using computer simulations and phantom studies. The technique has been applied to SPET images of dopamine D2 receptors in a total of 20 healthy and 20 schizophrenic volunteers (22 male, 18 female), using the ligand ¹²³I-IBZM. Following automatic image segmentation, the concentration of striatal dopamine D2 receptors shows a significant left-sided asymmetry in male schizophrenics compared with male controls. The mean left-minus-right laterality index for controls is -1.52 (95% CI -3.72 to 0.66) and for patients 4.04 (95% CI 1.07 to 7.01). Analysis of variance shows a case-by-sex-by-side interaction, with F=10.01, P=0.005. We can now demonstrate that the previously observed male sex-specific D2 receptor asymmetry in schizophrenia, which had failed to attain statistical significance, is valid. Cluster analysis of dynamic nuclear medicine studies provides a powerful tool for automatic segmentation and noise reduction of the images, removing much of the subjectivity inherent in region-of-interest analysis. The observed striatal D2 asymmetry could reflect long hypothesized disruptions in dopamine-rich cortico-striatal-limbic circuits in schizophrenic males. (orig.). With 4 figs., 2 tabs

  12. Multivariate statistical analysis for x-ray photoelectron spectroscopy spectral imaging: Effect of image acquisition time

    International Nuclear Information System (INIS)

    Peebles, D.E.; Ohlhausen, J.A.; Kotula, P.G.; Hutton, S.; Blomfield, C.

    2004-01-01

    The acquisition of spectral images for x-ray photoelectron spectroscopy (XPS) is a relatively new approach, although it has been used with other analytical spectroscopy tools for some time. This technique provides full spectral information at every pixel of an image, in order to provide a complete chemical mapping of the imaged surface area. Multivariate statistical analysis techniques applied to the spectral image data allow the determination of chemical component species, and their distribution and concentrations, with minimal data acquisition and processing times. Some of these statistical techniques have proven to be very robust and efficient methods for deriving physically realistic chemical components without input by the user other than the spectral matrix itself. The benefits of multivariate analysis of the spectral image data include significantly improved signal to noise, improved image contrast and intensity uniformity, and improved spatial resolution - which are achieved due to the effective statistical aggregation of the large number of often noisy data points in the image. This work demonstrates the improvements in chemical component determination and contrast, signal-to-noise level, and spatial resolution that can be obtained by the application of multivariate statistical analysis to XPS spectral images

  13. Sparse multivariate factor analysis regression models and its applications to integrative genomics analysis.

    Science.gov (United States)

    Zhou, Yan; Wang, Pei; Wang, Xianlong; Zhu, Ji; Song, Peter X-K

    2017-01-01

    The multivariate regression model is a useful tool to explore complex associations between two kinds of molecular markers, which enables the understanding of the biological pathways underlying disease etiology. For a set of correlated response variables, accounting for such dependency can increase statistical power. Motivated by integrative genomic data analyses, we propose a new methodology, the sparse multivariate factor analysis regression model (smFARM), in which correlations of response variables are assumed to follow a factor analysis model with latent factors. This proposed method not only allows us to address the challenge that the number of association parameters is larger than the sample size, but also to adjust for unobserved genetic and/or nongenetic factors that potentially conceal the underlying response-predictor associations. The proposed smFARM is implemented by the EM algorithm and the blockwise coordinate descent algorithm. The proposed methodology is evaluated and compared to the existing methods through extensive simulation studies. Our results show that accounting for latent factors through the proposed smFARM can improve sensitivity of signal detection and accuracy of sparse association map estimation. We illustrate smFARM by two integrative genomics analysis examples, a breast cancer dataset, and an ovarian cancer dataset, to assess the relationship between DNA copy numbers and gene expression arrays to understand genetic regulatory patterns relevant to the disease. We identify two trans-hub regions: one in cytoband 17q12 whose amplification influences the RNA expression levels of important breast cancer genes, and the other in cytoband 9q21.32-33, which is associated with chemoresistance in ovarian cancer. © 2016 WILEY PERIODICALS, INC.

  14. A primer of multivariate statistics

    CERN Document Server

    Harris, Richard J

    2014-01-01

    Drawing upon more than 30 years of experience in working with statistics, Dr. Richard J. Harris has updated A Primer of Multivariate Statistics to provide a model of balance between how-to and why. This classic text covers multivariate techniques with a taste of latent variable approaches. Throughout the book there is a focus on the importance of describing and testing one's interpretations of the emergent variables that are produced by multivariate analysis. This edition retains its conversational writing style while focusing on classical techniques. The book gives the reader a feel for why

  15. Rich Language Analysis for Counterterrorism

    Science.gov (United States)

    Guidère, Mathieu; Howard, Newton; Argamon, Shlomo

    Accurate and relevant intelligence is critical for effective counterterrorism. Too much irrelevant information is as bad or worse than not enough information. Modern computational tools promise to provide better search and summarization capabilities to help analysts filter and select relevant and key information. However, to do this task effectively, such tools must have access to levels of meaning beyond the literal. Terrorists operating in context-rich cultures like fundamentalist Islam use messages with multiple levels of interpretation, which are easily misunderstood by non-insiders. This chapter discusses several kinds of such “encryption” used by terrorists and insurgents in the Arabic language, and how knowledge of such methods can be used to enhance computational text analysis techniques for use in counterterrorism.

  16. An overview of multivariate gamma distributions as seen from a (multivariate) matrix exponential perspective

    DEFF Research Database (Denmark)

    Bladt, Mogens; Nielsen, Bo Friis

    2012-01-01

    Laplace transform. In a longer perspective stochastic and statistical analysis for MVME will in particular apply to any of the previously defined distributions. Multivariate gamma distributions have been used in a variety of fields like hydrology, [11], [10], [6], space (wind modeling) [9] reliability [3......Numerous definitions of multivariate exponential and gamma distributions can be retrieved from the literature [4]. These distributions belong to the class of Multivariate Matrix-Exponential Distributions (MVME) whenever their joint Laplace transform is a rational function. The majority...... of these distributions further belongs to an important subclass of MVME distributions [5, 1] where the multivariate random vector can be interpreted as a number of simultaneously collected rewards during sojourns in the states of a Markov chain with one absorbing state, the rest of the states being transient. We...

  17. Multivariate statistical analysis of wildfires in Portugal

    Science.gov (United States)

    Costa, Ricardo; Caramelo, Liliana; Pereira, Mário

    2013-04-01

    Several studies demonstrate that wildfires in Portugal present high temporal and spatial variability as well as cluster behavior (Pereira et al., 2005, 2011). This study aims to contribute to the characterization of the fire regime in Portugal with the multivariate statistical analysis of the time series of number of fires and area burned in Portugal during the 1980-2009 period. The data used in the analysis is an extended version of the Rural Fire Portuguese Database (PRFD) (Pereira et al., 2011), provided by the National Forest Authority (Autoridade Florestal Nacional, AFN), the Portuguese Forest Service, which includes information for more than 500,000 fire records. There are many advanced techniques for examining the relationships among multiple time series at the same time (e.g., canonical correlation analysis, principal components analysis, factor analysis, path analysis, multiple analyses of variance, clustering systems). This study compares and discusses the results obtained with these different techniques. Pereira, M.G., Trigo, R.M., DaCamara, C.C., Pereira, J.M.C., Leite, S.M., 2005: "Synoptic patterns associated with large summer forest fires in Portugal". Agricultural and Forest Meteorology. 129, 11-25. Pereira, M. G., Malamud, B. D., Trigo, R. M., and Alves, P. I.: The history and characteristics of the 1980-2005 Portuguese rural fire database, Nat. Hazards Earth Syst. Sci., 11, 3343-3358, doi:10.5194/nhess-11-3343-2011, 2011. This work is supported by European Union Funds (FEDER/COMPETE - Operational Competitiveness Programme) and by national funds (FCT - Portuguese Foundation for Science and Technology) under the project FCOMP-01-0124-FEDER-022692, the project FLAIR (PTDC/AAC-AMB/104702/2008) and the EU 7th Framework Program through FUME (contract number 243888).

  18. BioIMAX: A Web 2.0 approach for easy exploratory and collaborative access to multivariate bioimage data

    Directory of Open Access Journals (Sweden)

    Khan Michael

    2011-07-01

    Full Text Available Abstract. Background: Innovations in biological and biomedical imaging produce complex high-content and multivariate image data. For decision-making and generation of hypotheses, scientists need novel information technology tools that enable them to visually explore and analyze the data and to discuss and communicate results or findings with collaborating experts from various places. Results: In this paper, we present a novel Web 2.0 approach, BioIMAX, for the collaborative exploration and analysis of multivariate image data by combining the web's collaboration and distribution architecture with the interface interactivity and computation power of desktop applications, a combination recently called rich internet application. Conclusions: BioIMAX allows scientists to discuss and share data or results with collaborating experts and to visualize, annotate, and explore multivariate image data within one web-based platform from any location via a standard web browser requiring only a username and a password. BioIMAX can be accessed at http://ani.cebitec.uni-bielefeld.de/BioIMAX with the username "test" and the password "test1" for testing purposes.

  19. Integrated environmental monitoring and multivariate data analysis-A case study.

    Science.gov (United States)

    Eide, Ingvar; Westad, Frank; Nilssen, Ingunn; de Freitas, Felipe Sales; Dos Santos, Natalia Gomes; Dos Santos, Francisco; Cabral, Marcelo Montenegro; Bicego, Marcia Caruso; Figueira, Rubens; Johnsen, Ståle

    2017-03-01

    The present article describes integration of environmental monitoring and discharge data and interpretation using multivariate statistics, principal component analysis (PCA), and partial least squares (PLS) regression. The monitoring was carried out at the Peregrino oil field off the coast of Brazil. One sensor platform and 3 sediment traps were placed on the seabed. The sensors measured current speed and direction, turbidity, temperature, and conductivity. The sediment trap samples were used to determine suspended particulate matter that was characterized with respect to a number of chemical parameters (26 alkanes, 16 PAHs, N, C, calcium carbonate, and Ba). Data on discharges of drill cuttings and water-based drilling fluid were provided on a daily basis. The monitoring was carried out during 7 campaigns from June 2010 to October 2012, each lasting 2 to 3 months due to the capacity of the sediment traps. The data from the campaigns were preprocessed, combined, and interpreted using multivariate statistics. No systematic difference could be observed between campaigns or traps despite the fact that the first campaign was carried out before drilling, and 1 of 3 sediment traps was located in an area not expected to be influenced by the discharges. There was a strong covariation between suspended particulate matter and total N and organic C suggesting that the majority of the sediment samples had a natural and biogenic origin. Furthermore, the multivariate regression showed no correlation between discharges of drill cuttings and sediment trap or turbidity data taking current speed and direction into consideration. Because of this lack of correlation with discharges from the drilling location, a more detailed evaluation of chemical indicators providing information about origin was carried out in addition to numerical modeling of dispersion and deposition. 
    The chemical indicators and the modeling of dispersion and deposition support the conclusions from the multivariate

  20. Structural analysis and design of multivariable control systems: An algebraic approach

    Science.gov (United States)

    Tsay, Yih Tsong; Shieh, Leang-San; Barnett, Stephen

    1988-01-01

    The application of algebraic system theory to the design of controllers for multivariable (MV) systems is explored analytically using an approach based on state-space representations and matrix-fraction descriptions. Chapters are devoted to characteristic lambda matrices and canonical descriptions of MIMO systems; spectral analysis, divisors, and spectral factors of nonsingular lambda matrices; feedback control of MV systems; and structural decomposition theories and their application to MV control systems.

  1. Characterization of Land Transitions Patterns from Multivariate Time Series Using Seasonal Trend Analysis and Principal Component Analysis

    Directory of Open Access Journals (Sweden)

    Benoit Parmentier

    2014-12-01

    Full Text Available Characterizing biophysical changes in land change areas over large regions with short and noisy multivariate time series and multiple temporal parameters remains a challenging task. Most studies focus on detection rather than characterization, i.e., the manner by which surface state variables are altered by the process of change. In this study, a procedure is presented to extract and characterize simultaneous temporal changes in MODIS multivariate time series from three surface state variables: the Normalized Difference Vegetation Index (NDVI), land surface temperature (LST) and albedo (ALB). The analysis involves conducting a seasonal trend analysis (STA) to extract three seasonal shape parameters (Amplitude 0, Amplitude 1 and Amplitude 2) and using principal component analysis (PCA) to contrast trends in change and no-change areas. We illustrate the method by characterizing trends in burned and unburned pixels in Alaska over the 2001–2009 time period. Findings show consistent and meaningful extraction of temporal patterns related to fire disturbances. The first principal component (PC1) is characterized by a decrease in mean NDVI (Amplitude 0) with a concurrent increase in albedo (the mean and the annual amplitude) and an increase in LST annual variability (Amplitude 1). These results provide systematic empirical evidence of surface changes associated with one type of land change, fire disturbances, and suggest that STA with PCA may be used to characterize many other types of land transitions over large landscape areas using multivariate Earth observation time series.
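
    As a rough illustration of the seasonal shape parameters mentioned in this abstract, the sketch below computes a mean term and two harmonic amplitudes for one year of evenly spaced observations. It covers only a per-year harmonic step, not the trend estimation or the PCA. The reading of Amplitude 2 as a semi-annual harmonic, the Fourier-projection fit, the function name `harmonic_amplitudes` and the synthetic NDVI-like values are all assumptions made for the example, not details taken from the paper.

```python
import math


def harmonic_amplitudes(series, n_harmonics=2):
    """Return [A0, A1, ..., An] for one year of evenly spaced samples.

    A0 is the annual mean; Ak is the amplitude of the harmonic with
    k cycles per year, found by projecting onto cosine and sine terms.
    """
    n = len(series)
    amplitudes = [sum(series) / n]
    for k in range(1, n_harmonics + 1):
        a = 2.0 / n * sum(x * math.cos(2 * math.pi * k * t / n)
                          for t, x in enumerate(series))
        b = 2.0 / n * sum(x * math.sin(2 * math.pi * k * t / n)
                          for t, x in enumerate(series))
        amplitudes.append(math.hypot(a, b))
    return amplitudes


# Hypothetical NDVI-like year: 23 composites with mean 0.5,
# an annual swing of 0.3 and a semi-annual swing of 0.1.
N = 23
ndvi = [0.5
        + 0.3 * math.cos(2 * math.pi * t / N - 0.4)
        + 0.1 * math.cos(4 * math.pi * t / N)
        for t in range(N)]
amp0, amp1, amp2 = harmonic_amplitudes(ndvi)
```

    For complete, evenly spaced years this projection gives the same coefficients as a least-squares harmonic regression; series with gaps would need an explicit regression instead.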

  2. Multivariate analysis of prognostic factors for idiopathic sudden sensorineural hearing loss in children.

    Science.gov (United States)

    Chung, Jae Ho; Cho, Seok Hyun; Jeong, Jin Hyeok; Park, Chul Won; Lee, Seung Hwan

    2015-09-01

    To evaluate clinical characteristics and possible associated factors of idiopathic sudden sensorineural hearing loss (ISSNHL) in children using univariate and multivariate analyses. A retrospective case series with comparisons. From January 2007 to December 2013, medical records of 37 pediatric ISSNHL patients were reviewed to assess hearing recovery rate and examine factors associated with prognosis (gender; side of hearing loss; opposite side hearing loss; treatment onset; presence of vertigo, tinnitus, and ear fullness; initial hearing threshold), using univariate and multivariate analysis, and compare them with 276 adult ISSNHL patients. Pediatric patients comprised only 6.6% of pediatric/adult cases of ISSNHL, and those below 10 years old were only 0.7%. The overall recovery rates (complete and partial) of the pediatric and adult patients were 57.4% and 47.2%, respectively. The complete recovery rate of the pediatric group (46.6%) was higher than that of the adult group (30.8%, P = .040). According to multivariate analysis, absence of tinnitus, later onset of treatment, and higher hearing threshold at initial presentation were associated with a poor prognosis in pediatric ISSNHL. The recovery rate of ISSNHL in pediatric patients is higher than in adults, and the presence of tinnitus and earlier treatment onset are associated with favorable outcomes. 4. © 2015 The American Laryngological, Rhinological and Otological Society, Inc.

  3. Multivariate calibration in Laser-Induced Breakdown Spectroscopy quantitative analysis: The dangers of a 'black box' approach and how to avoid them

    Science.gov (United States)

    Safi, A.; Campanella, B.; Grifoni, E.; Legnaioli, S.; Lorenzetti, G.; Pagnotta, S.; Poggialini, F.; Ripoll-Seguer, L.; Hidalgo, M.; Palleschi, V.

    2018-06-01

    The introduction of the multivariate calibration curve approach in Laser-Induced Breakdown Spectroscopy (LIBS) quantitative analysis has led to a general improvement in LIBS analytical performance, since a multivariate approach makes it possible to exploit the redundancy of elemental information typically present in a LIBS spectrum. Software packages implementing multivariate methods are available in the most widely used commercial and open source analytical programs; in most cases, the multivariate algorithms are robust against noise and operate in unsupervised mode. The flip side of the availability and ease of use of such packages is the (perceived) difficulty in assessing the reliability of the results obtained, which often leads users to regard the multivariate algorithms as 'black boxes' whose inner mechanism is supposed to remain hidden. In this paper, we discuss the dangers of a 'black box' approach in LIBS multivariate analysis and how to overcome them using the chemical-physical knowledge that underlies any LIBS quantitative analysis.

  4. Multivariate analysis of quantitative traits can effectively classify rapeseed germplasm

    Directory of Open Access Journals (Sweden)

    Jankulovska Mirjana

    2014-01-01

    Full Text Available In this study, the use of different multivariate approaches to classify rapeseed genotypes based on quantitative traits is presented. Tree regression analysis, PCA and two-way cluster analysis were applied in order to describe and understand the extent of genetic variability in spring rapeseed genotype-by-trait data. The traits which highly influenced seed and oil yield in rapeseed were successfully identified by the tree regression analysis. The principal predictor for both response variables was number of pods per plant (NP). NP and 1000 seed weight could help in the selection of high yielding genotypes. High values for both traits and oil content could lead to high oil yielding genotypes. These traits may serve as indirect selection criteria and can lead to improvement of seed and oil yield in rapeseed. Quantitative traits that explained most of the variability in the studied germplasm were classified using principal component analysis. In this data set, five PCs were identified, out of which the first three PCs explained 63% of the total variance. It helped in facilitating the choice of variables based on which the genotypes’ clustering could be performed. The two-way cluster analysis simultaneously clustered genotypes and quantitative traits. The final number of clusters was determined using a bootstrapping technique. This approach provided a clear overview of the variability of the analyzed genotypes. The genotypes that have similar performance regarding the traits included in this study can be easily detected on the heatmap. Genotypes grouped in clusters 1 and 8 had high values for seed and oil yield and a relatively short vegetative growth duration period, and those in cluster 9 combined moderate to low values for vegetative growth duration and moderate to high seed and oil yield. These genotypes should be further exploited and implemented in the rapeseed breeding program. The combined application of these multivariate methods

  5. Power analysis for multivariate and repeated measures designs: a flexible approach using the SPSS MANOVA procedure.

    Science.gov (United States)

    D'Amico, E J; Neilands, T B; Zambarano, R

    2001-11-01

    Although power analysis is an important component in the planning and implementation of research designs, it is often ignored. Computer programs for performing power analysis are available, but most have limitations, particularly for complex multivariate designs. An SPSS procedure is presented that can be used for calculating power for univariate, multivariate, and repeated measures models with and without time-varying and time-constant covariates. Three examples provide a framework for calculating power via this method: an ANCOVA, a MANOVA, and a repeated measures ANOVA with two or more groups. The benefits and limitations of this procedure are discussed.

  6. Batch-to-batch quality consistency evaluation of botanical drug products using multivariate statistical analysis of the chromatographic fingerprint.

    Science.gov (United States)

    Xiong, Haoshu; Yu, Lawrence X; Qu, Haibin

    2013-06-01

    Botanical drug products have batch-to-batch quality variability due to botanical raw materials and the current manufacturing process. The rational evaluation and control of product quality consistency are essential to ensure efficacy and safety. Chromatographic fingerprinting is an important and widely used tool to characterize the chemical composition of botanical drug products. Multivariate statistical analysis has shown its efficacy and applicability in the quality evaluation of many kinds of industrial products. In this paper, the combined use of multivariate statistical analysis and chromatographic fingerprinting is presented to evaluate batch-to-batch quality consistency of botanical drug products. A typical botanical drug product in China, Shenmai injection, was selected as the example to demonstrate the feasibility of this approach. The high-performance liquid chromatographic fingerprint data of historical batches were collected from a traditional Chinese medicine manufacturing factory. Characteristic peaks were weighted by their variability among production batches. A principal component analysis model was established after outliers were modified or removed. Multivariate (Hotelling T² and DModX) control charts were then successfully applied to evaluate the quality consistency. The results suggest useful applications for a combination of multivariate statistical analysis with chromatographic fingerprinting in batch-to-batch quality consistency evaluation for the manufacture of botanical drug products.
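
    The Hotelling T² statistic named in this abstract can be sketched for the simplest case of two measured variables. This is a simplified illustration only: the abstract applies T² to a principal component model together with DModX, whereas the sketch works directly on two raw peak areas and sets no control limit. The function name `hotelling_t2` and all batch values are hypothetical.

```python
def hotelling_t2(reference, sample):
    """Hotelling T^2 of a two-variable sample against reference batches."""
    n = len(reference)
    mean_x = sum(r[0] for r in reference) / n
    mean_y = sum(r[1] for r in reference) / n
    # Entries of the sample covariance matrix (denominator n - 1)
    sxx = sum((r[0] - mean_x) ** 2 for r in reference) / (n - 1)
    syy = sum((r[1] - mean_y) ** 2 for r in reference) / (n - 1)
    sxy = sum((r[0] - mean_x) * (r[1] - mean_y) for r in reference) / (n - 1)
    det = sxx * syy - sxy ** 2
    dx, dy = sample[0] - mean_x, sample[1] - mean_y
    # d^T S^-1 d, using the closed-form inverse of a 2x2 matrix
    return (syy * dx * dx - 2 * sxy * dx * dy + sxx * dy * dy) / det


# Hypothetical peak areas (two characteristic peaks) for six historical batches
historical = [(10.1, 5.0), (9.8, 5.2), (10.3, 4.9),
              (10.0, 5.1), (9.9, 5.0), (10.2, 4.8)]
t2_typical = hotelling_t2(historical, (10.0, 5.0))
t2_unusual = hotelling_t2(historical, (11.5, 4.0))
```

    A batch far from the historical mean, relative to the historical covariance, yields a much larger T² than a typical batch; in a control chart that value would be compared against a limit chosen by the analyst.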

  7. Beer fermentation: monitoring of process parameters by FT-NIR and multivariate data analysis.

    Science.gov (United States)

    Grassi, Silvia; Amigo, José Manuel; Lyndgaard, Christian Bøge; Foschino, Roberto; Casiraghi, Ernestina

    2014-07-15

    This work investigates the capability of Fourier-Transform near infrared (FT-NIR) spectroscopy to monitor and assess process parameters in beer fermentation at different operative conditions. For this purpose, the fermentation of wort with two different yeast strains and at different temperatures was monitored for nine days by FT-NIR. To correlate the collected spectra with °Brix, pH and biomass, different multivariate data methodologies were applied. Principal component analysis (PCA), partial least squares (PLS) and locally weighted regression (LWR) were used to assess the relationship between FT-NIR spectra and the abovementioned process parameters that define the beer fermentation. The accuracy and robustness of the obtained results clearly show the suitability of FT-NIR spectroscopy, combined with multivariate data analysis, to be used as a quality control tool in the beer fermentation process. FT-NIR spectroscopy, when combined with LWR, proves to be a highly suitable quantitative method to implement in the production of beer. Copyright © 2014 Elsevier Ltd. All rights reserved.

  8. What makes a pattern? Matching decoding methods to data in multivariate pattern analysis

    Directory of Open Access Journals (Sweden)

    Philip A Kragel

    2012-11-01

    Full Text Available Research in neuroscience faces the challenge of integrating information across different spatial scales of brain function. A promising technique for harnessing information at a range of spatial scales is multivariate pattern analysis (MVPA) of functional magnetic resonance imaging (fMRI) data. While the prevalence of MVPA has increased dramatically in recent years, its typical implementations for classification of mental states utilize only a subset of the information encoded in local fMRI signals. We review published studies employing multivariate pattern classification since the technique’s introduction, which reveal an extensive focus on the improved detection power that linear classifiers provide over traditional analysis techniques. We demonstrate using simulations and a searchlight approach, however, that nonlinear classifiers are capable of extracting distinct information about interactions within a local region. We conclude that for spatially localized analyses, such as searchlight and region of interest, multiple classification approaches should be compared in order to match fMRI analyses to the properties of local circuits.

  9. Integrated GIS and multivariate statistical analysis for regional scale assessment of heavy metal soil contamination: A critical review.

    Science.gov (United States)

    Hou, Deyi; O'Connor, David; Nathanail, Paul; Tian, Li; Ma, Yan

    2017-12-01

    Heavy metal soil contamination is associated with potential toxicity to humans or ecotoxicity. Scholars have increasingly used a combination of geographical information science (GIS) with geostatistical and multivariate statistical analysis techniques to examine the spatial distribution of heavy metals in soils at a regional scale. A review of such studies showed that most soil sampling programs were based on grid patterns and composite sampling methodologies. Many programs intended to characterize various soil types and land use types. The most often used sampling depth intervals were 0-0.10 m, or 0-0.20 m, below surface; and the sampling densities used ranged from 0.0004 to 6.1 samples per km², with a median of 0.4 samples per km². The most widely used spatial interpolators were inverse distance weighted interpolation and ordinary kriging; and the most often used multivariate statistical analysis techniques were principal component analysis and cluster analysis. The review also identified several determining and correlating factors in heavy metal distribution in soils, including soil type, soil pH, soil organic matter, land use type, Fe, Al, and heavy metal concentrations. The major natural and anthropogenic sources of heavy metals were found to derive from lithogenic origin, roadway and transportation, atmospheric deposition, wastewater and runoff from industrial and mining facilities, fertilizer application, livestock manure, and sewage sludge. This review argues that the full potential of integrated GIS and multivariate statistical analysis for assessing heavy metal distribution in soils on a regional scale has not yet been fully realized. It is proposed that future research be conducted to map multivariate results in GIS to pinpoint specific anthropogenic sources, to analyze temporal trends in addition to spatial patterns, to optimize modeling parameters, and to expand the use of different multivariate analysis tools beyond principal component analysis
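
    Inverse distance weighted interpolation, one of the two spatial interpolators this review reports as most widely used, can be sketched in a few lines. The function name `idw`, the power parameter of 2 and the sample coordinates and concentrations are assumptions chosen for illustration; none of them comes from the reviewed studies.

```python
import math


def idw(points, target, power=2.0):
    """Inverse distance weighted estimate at target from (x, y, value) points."""
    weighted_sum = 0.0
    weight_total = 0.0
    for x, y, value in points:
        distance = math.hypot(x - target[0], y - target[1])
        if distance == 0.0:
            return value  # target coincides with a sampled location
        weight = 1.0 / distance ** power
        weighted_sum += weight * value
        weight_total += weight
    return weighted_sum / weight_total


# Hypothetical soil samples: (x in km, y in km, metal concentration in mg/kg)
samples = [(0.0, 0.0, 20.0), (2.0, 0.0, 40.0),
           (0.0, 2.0, 60.0), (2.0, 2.0, 80.0)]
center = idw(samples, (1.0, 1.0))
near_corner = idw(samples, (0.1, 0.1))
```

    Because every weight is positive, the estimate always lies between the smallest and largest sampled values; a larger power makes nearby samples dominate more strongly.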

  10. Early prediction of wheat quality: analysis during grain development using mass spectrometry and multivariate data analysis

    DEFF Research Database (Denmark)

    Ghirardo, A.; Sørensen, Helle Aagaard; Petersen, M.

    2005-01-01

    Matrix-assisted laser desorption/ionisation time-of-flight mass spectrometry and multivariate data analysis have been used for the determination of wheat quality at different stages of grain development. Wheat varieties with one of two different end-use qualities (i.e. suitable or not suitable...... data analysis, offers a method that can replace the traditional rather time-consuming ones such as gel electrophoresis. This study focused on the determination of wheat quality at 15 dpa, when the grain is due for harvest 1 month later....

  11. PyMVPA: A python toolbox for multivariate pattern analysis of fMRI data.

    Science.gov (United States)

    Hanke, Michael; Halchenko, Yaroslav O; Sederberg, Per B; Hanson, Stephen José; Haxby, James V; Pollmann, Stefan

    2009-01-01

    Decoding patterns of neural activity onto cognitive states is one of the central goals of functional brain imaging. Standard univariate fMRI analysis methods, which correlate cognitive and perceptual function with the blood oxygenation-level dependent (BOLD) signal, have proven successful in identifying anatomical regions based on signal increases during cognitive and perceptual tasks. Recently, researchers have begun to explore new multivariate techniques that have proven to be more flexible, more reliable, and more sensitive than standard univariate analysis. Drawing on the field of statistical learning theory, these new classifier-based analysis techniques possess explanatory power that could provide new insights into the functional properties of the brain. However, unlike the wealth of software packages for univariate analyses, there are few packages that facilitate multivariate pattern classification analyses of fMRI data. Here we introduce a Python-based, cross-platform, and open-source software toolbox, called PyMVPA, for the application of classifier-based analysis techniques to fMRI datasets. PyMVPA makes use of Python's ability to access libraries written in a large variety of programming languages and computing environments to interface with the wealth of existing machine learning packages. We present the framework in this paper and provide illustrative examples on its usage, features, and programmability.

  12. Multivariate missing data in hydrology - Review and applications

    Science.gov (United States)

    Ben Aissia, Mohamed-Aymen; Chebana, Fateh; Ouarda, Taha B. M. J.

    2017-12-01

    Water resources planning and management require complete data sets of a number of hydrological variables, such as flood peaks and volumes. However, hydrologists are often faced with the problem of missing data (MD) in hydrological databases. Several methods are used to deal with the imputation of MD. During the last decade, multivariate approaches have gained popularity in the field of hydrology, especially in hydrological frequency analysis (HFA). However, treating the MD remains neglected in the multivariate HFA literature whereas the focus has been mainly on the modeling component. For a complete analysis and in order to optimize the use of data, MD should also be treated in the multivariate setting prior to modeling and inference. Imputation of MD in the multivariate hydrological framework can have direct implications on the quality of the estimation. Indeed, the dependence between the series represents important additional information that can be included in the imputation process. The objective of the present paper is to highlight the importance of treating MD in multivariate hydrological frequency analysis by reviewing and applying multivariate imputation methods and by comparing univariate and multivariate imputation methods. An application is carried out for multiple flood attributes on three sites in order to evaluate the performance of the different methods based on the leave-one-out procedure. The results indicate that the performance of imputation methods can be improved by adopting the multivariate setting, compared to mean substitution and interpolation methods, especially when using the copula-based approach.
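
    The leave-one-out comparison described in this abstract can be sketched as follows: each flood volume is hidden in turn, imputed from the remaining records, and scored against the true value. The sketch contrasts mean substitution with a simple regression on flood peak, which stands in here for a multivariate method; it is not the copula-based approach of the paper. The function names and the flood records are hypothetical.

```python
import math


def loo_rmse(peaks, volumes, impute):
    """Hide each volume in turn, impute it, and return the RMSE of the errors."""
    squared_errors = []
    for i in range(len(volumes)):
        rest_peaks = peaks[:i] + peaks[i + 1:]
        rest_volumes = volumes[:i] + volumes[i + 1:]
        estimate = impute(rest_peaks, rest_volumes, peaks[i])
        squared_errors.append((estimate - volumes[i]) ** 2)
    return math.sqrt(sum(squared_errors) / len(squared_errors))


def mean_substitution(rest_peaks, rest_volumes, peak):
    # Univariate: ignores the companion variable entirely
    return sum(rest_volumes) / len(rest_volumes)


def regression_on_peak(rest_peaks, rest_volumes, peak):
    # Uses the dependence between series: least-squares line fitted
    # on the remaining records, then evaluated at the observed peak
    mean_p = sum(rest_peaks) / len(rest_peaks)
    mean_v = sum(rest_volumes) / len(rest_volumes)
    slope = (sum((p - mean_p) * (v - mean_v)
                 for p, v in zip(rest_peaks, rest_volumes))
             / sum((p - mean_p) ** 2 for p in rest_peaks))
    return mean_v + slope * (peak - mean_p)


# Hypothetical flood records: peak flow and flood volume, arbitrary units
peaks = [120.0, 95.0, 210.0, 160.0, 80.0, 185.0, 140.0, 230.0]
volumes = [250.0, 185.0, 425.0, 315.0, 165.0, 375.0, 275.0, 455.0]
rmse_mean = loo_rmse(peaks, volumes, mean_substitution)
rmse_regression = loo_rmse(peaks, volumes, regression_on_peak)
```

    With strongly dependent series such as these, the method that uses the companion variable gives a much smaller leave-one-out error than mean substitution.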

  13. Principal Angle Enrichment Analysis (PAEA): Dimensionally Reduced Multivariate Gene Set Enrichment Analysis Tool.

    Science.gov (United States)

    Clark, Neil R; Szymkiewicz, Maciej; Wang, Zichen; Monteiro, Caroline D; Jones, Matthew R; Ma'ayan, Avi

    2015-11-01

    Gene set analysis of differential expression, which identifies collectively differentially expressed gene sets, has become an important tool for biology. The power of this approach lies in its reduction of the dimensionality of the statistical problem and its incorporation of biological interpretation by construction. Many approaches to gene set analysis have been proposed, but benchmarking their performance in the setting of real biological data is difficult due to the lack of a gold standard. In a previously published work we proposed a geometrical approach to differential expression which performed highly in benchmarking tests and compared well to the most popular methods of differential gene expression. As reported, this approach has a natural extension to gene set analysis which we call Principal Angle Enrichment Analysis (PAEA). PAEA employs dimensionality reduction and a multivariate approach for gene set enrichment analysis. However, the performance of this method has not been assessed, nor has it been implemented as a web-based tool. Here we describe new benchmarking protocols for gene set analysis methods and find that PAEA performs highly. The PAEA method is implemented as a user-friendly web-based tool, which contains 70 gene set libraries and is freely available to the community.

  14. Reduction of the dimensionality and comparative analysis of multivariate radiological data

    International Nuclear Information System (INIS)

    Seddeek, M.K.; Kozae, A.M.; Sharshar, T.; Badran, H.M.

    2009-01-01

    Computational methods were used to reduce the dimensionality and to find clusters of multivariate data. The variables were the natural radioactivity contents and the texture characteristics of sand samples. The application of discriminant analysis revealed that samples with high negative values of the former score have the highest contamination with black sand. Principal component analysis (PCA) revealed that radioactivity concentrations alone are sufficient for the classification. Rough set analysis (RSA) showed that the concentration of ²³⁸U, ²²⁶Ra or ²³²Th, combined with the concentration of ⁴⁰K, can specify the clusters and characteristics of the sand. Both PCA and RSA show that ²³⁸U, ²²⁶Ra and ²³²Th behave similarly. RSA revealed that one or two of them can be omitted without degrading predictions.

  15. A Novel and Effective Multivariate Method for Compositional Analysis using Laser Induced Breakdown Spectroscopy

    International Nuclear Information System (INIS)

    Wang, W; Qi, H; Ayhan, B; Kwan, C; Vance, S

    2014-01-01

    Compositional analysis is important to interrogate spectral samples for direct analysis of materials in agriculture, environment and archaeology, etc. In this paper, multivariate analysis (MVA) techniques are coupled with laser induced breakdown spectroscopy (LIBS) to estimate quantitative elemental compositions and determine the type of the sample. In particular, we present a new multivariate analysis method for composition analysis, referred to as spectral unmixing. The LIBS spectrum of a testing sample is considered as a linear mixture of more than one constituent signature, each corresponding to a chemical element. The signature library is derived from regression analysis using training samples or is manually set up with the information from an elemental LIBS spectral database. A calibration step is used to make all the signatures in the library homogeneous with the testing sample so as to avoid inhomogeneous signatures that might be caused by different sampling conditions. To demonstrate the feasibility of the proposed method, we compare it with the traditional partial least squares (PLS) method and the univariate method using a standard soil data set with elemental concentration measured a priori. The experimental results show that the proposed method holds great potential for reliable and effective elemental concentration estimation
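
    The linear-mixture idea in this abstract can be illustrated with the smallest possible case: a spectrum modeled as a weighted sum of two known signatures, with the weights recovered by ordinary least squares. This is a sketch under stated assumptions, not the authors' algorithm: it omits the calibration step and any constraint on the weights, and the function name `unmix_two` and the five-channel signatures are hypothetical.

```python
def unmix_two(spectrum, sig_a, sig_b):
    """Least-squares weights (w_a, w_b) with spectrum ~ w_a*sig_a + w_b*sig_b."""
    aa = sum(a * a for a in sig_a)
    bb = sum(b * b for b in sig_b)
    ab = sum(a * b for a, b in zip(sig_a, sig_b))
    ya = sum(y * a for y, a in zip(spectrum, sig_a))
    yb = sum(y * b for y, b in zip(spectrum, sig_b))
    det = aa * bb - ab * ab  # zero only if the two signatures are collinear
    # Solve the 2x2 normal equations in closed form
    return ((ya * bb - yb * ab) / det, (yb * aa - ya * ab) / det)


# Hypothetical five-channel signatures for two elements
sig_first = [0.0, 1.0, 0.2, 0.0, 0.5]
sig_second = [0.8, 0.1, 0.0, 0.6, 0.0]
# Synthetic test spectrum mixed with known weights 0.7 and 0.3
mixed = [0.7 * a + 0.3 * b for a, b in zip(sig_first, sig_second)]
w_first, w_second = unmix_two(mixed, sig_first, sig_second)
```

    With more than two signatures the same normal equations become a larger linear system, and real spectra would usually call for a non-negativity constraint on the weights.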

  16. Multivariate factor analysis of Girgentana goat milk composition

    Directory of Open Access Journals (Sweden)

    Pietro Giaccone

    2010-01-01

    The interpretation of the several variables that contribute to defining milk quality is difficult due to the high degree of correlation among them. In this case, one of the best methods of statistical processing is factor analysis, which belongs to the multivariate group; this statistical approach was employed in our study. A total of 1485 individual goat milk samples from 117 Girgentana goats were collected fortnightly from January to July and analysed for physical and chemical composition and clotting properties. Milk pH and titratable acidity were within the normal range for fresh goat milk. Morning milk yield was 704 ± 323 g, with 3.93 ± 1.23% and 3.48 ± 0.38% for fat and protein percentages, respectively. The milk urea content was 43.70 ± 8.28 mg/dl. The clotting ability of Girgentana milk was quite good, with a renneting time equal to 16.96 ± 3.08 minutes, a rate of curd formation of 2.01 ± 1.63 minutes and a curd firmness of 25.08 ± 7.67 millimetres. Factor analysis was performed by applying axis orthogonal rotation (rotation type VARIMAX); the analysis grouped the milk components into three latent or common factors. The first, which explained 51.2% of the total covariance, was defined as “slow milks”, because it was linked to r and pH. The second latent factor, which explained 36.2% of the total covariance, was defined as “milk yield”, because it is positively correlated to the morning milk yield and to the urea content, whilst negatively correlated to the fat percentage. The third latent factor, which explained 12.6% of the total covariance, was defined as “curd firmness”, because it is linked to protein percentage, a30 and titratable acidity. With the aim of evaluating the influence of environmental effects (stage of kidding, parity and type of kidding), factor scores were analysed with the mixed linear model. Results showed significant effects of the season of

  17. A multivariate analysis of Antarctic sea ice since 1979

    Energy Technology Data Exchange (ETDEWEB)

    Magalhaes Neto, Newton de; Evangelista, Heitor [Universidade do Estado do Rio de Janeiro (Uerj), LARAMG - Laboratorio de Radioecologia e Mudancas Globais, Maracana, Rio de Janeiro, RJ (Brazil); Tanizaki-Fonseca, Kenny [Universidade do Estado do Rio de Janeiro (Uerj), LARAMG - Laboratorio de Radioecologia e Mudancas Globais, Maracana, Rio de Janeiro, RJ (Brazil); Universidade Federal Fluminense (UFF), Dept. Analise Geoambiental, Inst. de Geociencias, Niteroi, RJ (Brazil); Penello Meirelles, Margareth Simoes [Universidade do Estado do Rio de Janeiro (UERJ)/Geomatica, Maracana, Rio de Janeiro, RJ (Brazil); Garcia, Carlos Eiras [Universidade Federal do Rio Grande (FURG), Laboratorio de Oceanografia Fisica, Rio Grande, RS (Brazil)

    2012-03-15

    Recent satellite observations have shown an increase in the total extent of Antarctic sea ice, during periods when the atmosphere and oceans tend to be warmer surrounding a significant part of the continent. Despite an increase in total sea ice, regional analyses depict negative trends in the Bellingshausen-Amundsen Sea and positive trends in the Ross Sea. Although several climate parameters are believed to drive the formation of Antarctic sea ice and the local atmosphere, a descriptive mechanism that could trigger such differences in trends is still unknown. In this study we employed a multivariate analysis in order to identify the response of the Antarctic sea ice with respect to commonly utilized climate forcings/parameters, as follows: (1) The global air surface temperature, (2) The global sea surface temperature, (3) The atmospheric CO2 concentration, (4) The South Annular Mode, (5) The Nino 3, (6) The Nino 3 + 4, (7) The Nino 4, (8) The Southern Oscillation Index, (9) The Multivariate ENSO Index, (10) The Total Solar Irradiance, (11) The maximum O3 depletion area, and (12) The minimum O3 concentration over Antarctica. Our results indicate that western Antarctic sea ice is simultaneously impacted by several parameters, and that the minimum, mean, and maximum sea ice extent may respond to a separate set of climatic/geochemical parameters. (orig.)

  18. Regression Analysis for Multivariate Dependent Count Data Using Convolved Gaussian Processes

    OpenAIRE

    Sofro, A'yunin; Shi, Jian Qing; Cao, Chunzheng

    2017-01-01

    Research on Poisson regression analysis for dependent data has developed rapidly in the last decade. One of the difficult problems in the multivariate case is how to construct a cross-correlation structure and at the same time make sure that the covariance matrix is positive definite. To address the issue, we propose to use a convolved Gaussian process (CGP) in this paper. The approach provides a semi-parametric model and offers a natural framework for modeling common mean structure and covarianc...

  19. pH-Modulated Watson-Crick duplex-quadruplex equilibria of guanine-rich and cytosine-rich DNA sequences 140 base pairs upstream of the c-kit transcription initiation site.

    Science.gov (United States)

    Bucek, Pavel; Jaumot, Joaquim; Aviñó, Anna; Eritja, Ramon; Gargallo, Raimundo

    2009-11-23

    Guanine-rich regions of DNA are sequences capable of forming G-quadruplex structures. The formation of a G-quadruplex structure in a region 140 base pairs (bp) upstream of the c-kit transcription initiation site was recently proposed (Fernando et al., Biochemistry, 2006, 45, 7854). In the present study, the acid-base equilibria and the thermally induced unfolding of the structures formed by a guanine-rich region and by its complementary cytosine-rich strand in c-kit were studied by means of circular dichroism and molecular absorption spectroscopies. In addition, competition between the Watson-Crick duplex and the isolated structures was studied as a function of pH value and temperature. Multivariate data analysis methods based on both hard and soft modeling were used to allow accurate quantification of the various acid-base species present in the mixtures. Results showed that the G-quadruplex and i-motif coexist with the Watson-Crick duplex over the pH range from 3.0 to 6.5, approximately, under the experimental conditions tested in this study. At pH 7.0, the duplex is practically the only species present.

  20. A comparison of multivariate genome-wide association methods

    DEFF Research Database (Denmark)

    Galesloot, Tessel E; Van Steen, Kristel; Kiemeney, Lambertus A L M

    2014-01-01

    Joint association analysis of multiple traits in a genome-wide association study (GWAS), i.e. a multivariate GWAS, offers several advantages over analyzing each trait in a separate GWAS. In this study we directly compared a number of multivariate GWAS methods using simulated data. We focused on six...... methods that are implemented in the software packages PLINK, SNPTEST, MultiPhen, BIMBAM, PCHAT and TATES, and also compared them to standard univariate GWAS, analysis of the first principal component of the traits, and meta-analysis of univariate results. We simulated data (N = 1000) for three...... for scenarios with an opposite sign of genetic and residual correlation. All multivariate analyses resulted in a higher power than univariate analyses, even when only one of the traits was associated with the QTL. Hence, use of multivariate GWAS methods can be recommended, even when genetic correlations between...

  1. Multivariate two-part statistics for analysis of correlated mass spectrometry data from multiple biological specimens.

    Science.gov (United States)

    Taylor, Sandra L; Ruhaak, L Renee; Weiss, Robert H; Kelly, Karen; Kim, Kyoungmi

    2017-01-01

    High-throughput mass spectrometry (MS) is now being used to profile small molecular compounds across multiple biological sample types from the same subjects with the goal of leveraging information across biospecimens. Multivariate statistical methods that combine information from all biospecimens could be more powerful than the usual univariate analyses. However, missing values are common in MS data and imputation can impact between-biospecimen correlation and multivariate analysis results. We propose two multivariate two-part statistics that accommodate missing values and combine data from all biospecimens to identify differentially regulated compounds. Statistical significance is determined using a multivariate permutation null distribution. Relative to univariate tests, the multivariate procedures detected more significant compounds in three biological datasets. In a simulation study, we showed that multi-biospecimen testing procedures were more powerful than single-biospecimen methods when compounds are differentially regulated in multiple biospecimens, but univariate methods can be more powerful if compounds are differentially regulated in only one biospecimen. We provide R functions to implement and illustrate our method as supplementary information. Contact: sltaylor@ucdavis.edu. Supplementary information: Supplementary data are available at Bioinformatics online. © The Author 2016. Published by Oxford University Press.
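    The permutation null distribution mentioned in this abstract can be sketched as follows. This is only an illustration of the general permutation idea, not the authors' two-part statistics: the combined statistic (a sum of absolute mean differences), the function names, and the data are all assumptions, and the sketch ignores missing values. Group labels are shuffled per subject, so each subject's measurements across biospecimens stay together and their correlation is preserved under the null.

```python
import random


def mean(values):
    return sum(values) / len(values)


def combined_statistic(rows, labels):
    """Sum over biospecimens of |mean(group 1) - mean(group 0)|."""
    total = 0.0
    for j in range(len(rows[0])):
        group0 = [row[j] for row, lab in zip(rows, labels) if lab == 0]
        group1 = [row[j] for row, lab in zip(rows, labels) if lab == 1]
        total += abs(mean(group1) - mean(group0))
    return total


def permutation_p_value(rows, labels, n_perm=2000, seed=0):
    """Estimate a p-value by shuffling subject labels (whole rows stay intact)."""
    rng = random.Random(seed)
    observed = combined_statistic(rows, labels)
    shuffled = list(labels)
    hits = 0
    for _ in range(n_perm):
        rng.shuffle(shuffled)
        if combined_statistic(rows, shuffled) >= observed:
            hits += 1
    # The +1 terms count the observed labelling itself as one permutation.
    return (hits + 1) / (n_perm + 1)


# Made-up abundances of one compound in two biospecimens per subject.
rows = [
    (1.0, 2.0), (1.1, 2.2), (1.2, 2.1), (1.3, 2.4), (1.4, 2.3), (1.5, 2.5),  # controls
    (3.0, 4.0), (3.1, 4.2), (3.2, 4.1), (3.3, 4.4), (3.4, 4.3), (3.5, 4.5),  # cases
]
labels = [0] * 6 + [1] * 6
p_value = permutation_p_value(rows, labels)
```

    With groups this well separated, very few shuffled labellings reach the observed statistic, so the estimated p-value is small.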

  2. Multivariate statistical pattern recognition system for reactor noise analysis

    International Nuclear Information System (INIS)

    Gonzalez, R.C.; Howington, L.C.; Sides, W.H. Jr.; Kryter, R.C.

    1976-01-01

    A multivariate statistical pattern recognition system for reactor noise analysis was developed. The basis of the system is a transformation for decoupling correlated variables and algorithms for inferring probability density functions. The system is adaptable to a variety of statistical properties of the data, and it has learning, tracking, and updating capabilities. System design emphasizes control of the false-alarm rate. The ability of the system to learn normal patterns of reactor behavior and to recognize deviations from these patterns was evaluated by experiments at the ORNL High-Flux Isotope Reactor (HFIR). Power perturbations of less than 0.1 percent of the mean value in selected frequency ranges were detected by the system.
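    The abstract does not say which decoupling transformation the system uses. As one common possibility, the sketch below decorrelates two variables with a Cholesky-based whitening transform; the function names and the data are hypothetical.

```python
import math


def covariance(xs, ys):
    """Sample covariance with the n - 1 denominator."""
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    return sum((x - mx) * (y - my) for x, y in zip(xs, ys)) / (len(xs) - 1)


def decorrelate(xs, ys):
    """Map two correlated variables to uncorrelated, unit-variance ones.

    The 2x2 covariance matrix [[a, b], [b, c]] is factored as L L^T
    (Cholesky) and the centred data are multiplied by the inverse of L.
    """
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    a = covariance(xs, xs)
    b = covariance(xs, ys)
    c = covariance(ys, ys)
    l11 = math.sqrt(a)
    l21 = b / l11
    l22 = math.sqrt(c - l21 * l21)
    z1 = [(x - mx) / l11 for x in xs]
    z2 = [((y - my) - l21 * u) / l22 for y, u in zip(ys, z1)]
    return z1, z2


# Made-up, strongly correlated signals.
signal_a = [1.0, 2.0, 3.0, 4.0, 5.0]
signal_b = [2.1, 3.9, 6.2, 7.8, 10.1]
z1, z2 = decorrelate(signal_a, signal_b)
```

    After the transform the sample covariance matrix of `z1` and `z2` is the identity, so a deviation in one transformed variable can be judged without reference to the other.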

  3. Multivariate statistical pattern recognition system for reactor noise analysis

    International Nuclear Information System (INIS)

    Gonzalez, R.C.; Howington, L.C.; Sides, W.H. Jr.; Kryter, R.C.

    1975-01-01

    A multivariate statistical pattern recognition system for reactor noise analysis was developed. The basis of the system is a transformation for decoupling correlated variables and algorithms for inferring probability density functions. The system is adaptable to a variety of statistical properties of the data, and it has learning, tracking, and updating capabilities. System design emphasizes control of the false-alarm rate. The ability of the system to learn normal patterns of reactor behavior and to recognize deviations from these patterns was evaluated by experiments at the ORNL High-Flux Isotope Reactor (HFIR). Power perturbations of less than 0.1 percent of the mean value in selected frequency ranges were detected by the system. 19 references

  4. Multivariate survivorship analysis using two cross-sectional samples.

    Science.gov (United States)

    Hill, M E

    1999-11-01

    As an alternative to survival analysis with longitudinal data, I introduce a method that can be applied when one observes the same cohort in two cross-sectional samples collected at different points in time. The method allows for the estimation of log-probability survivorship models that estimate the influence of multiple time-invariant factors on survival over a time interval separating two samples. This approach can be used whenever the survival process can be adequately conceptualized as an irreversible single-decrement process (e.g., mortality, the transition to first marriage among a cohort of never-married individuals). Using data from the Integrated Public Use Microdata Series (Ruggles and Sobek 1997), I illustrate the multivariate method through an investigation of the effects of race, parity, and educational attainment on the survival of older women in the United States.

  5. Evaluation of functional outcome of the floating knee injury using multivariate analysis.

    Science.gov (United States)

    Yokoyama, Kazuhiko; Tsukamoto, Tatsuro; Aoki, Shinichi; Wakita, Ryuji; Uchino, Masataka; Noumi, Takashi; Fukushima, Nobuaki; Itoman, Moritoshi

    2002-11-01

    The objective of this study is to evaluate significant contributing factors affecting the functional prognosis of floating knee injuries using multivariate analysis. A total of 68 floating knee injuries (67 patients) were treated at Kitasato University Hospital from 1986 to 1999. Both the femoral fractures and the tibial fractures were managed surgically by various methods. The functional results of these injuries were evaluated using the grading system of Karlström and Olerud. Follow-up periods ranged from 2 to 19 years (mean 50.2 months) after the original injury. We defined satisfactory (S) outcomes as those cases with excellent or good results and unsatisfactory (US) outcomes as those cases with acceptable or poor results. Logistic regression analysis was used as a multivariate analysis, and the dependent variables were defined as a satisfactory outcome or as an unsatisfactory outcome. The explanatory variables were predicting factors influencing the functional outcome such as age at trauma, gender, severity of soft-tissue injury in the femur and the tibia, AO fracture grade in the femur and the tibia, Fraser type (type I or type II), Injury Severity Score (ISS), and fixation time after injury (less than 1 week or more than 1 week) in the femur and the tibia. The final functional results were as follows: 25 cases had excellent results, 15 cases good results, 16 cases acceptable results, and 12 cases poor results. The predictive logistic regression equation was as follows: log[(1 - p)/p] = 3.12 - 1.52 × (Fraser type) - 1.65 × (severity of soft-tissue injury in the tibia) - 1.31 × (fixation time after injury in the tibia) - 0.821 × (AO fracture grade in the tibia) + 1.025 × (fixation time after injury in the femur) - 0.687 × (AO fracture grade in the femur) (p = 0.01). Among the variables, Fraser type and the severity of soft-tissue injury in the tibia were significantly related to the final result. The multivariate analysis showed that both the involvement of the knee joint and

  6. Application of instrumental neutron activation analysis and multivariate statistical methods to archaeological Syrian ceramics

    International Nuclear Information System (INIS)

    Bakraji, E. H.; Othman, I.; Sarhil, A.; Al-Somel, N.

    2002-01-01

    Instrumental neutron activation analysis (INAA) has been utilized in the analysis of thirty-seven archaeological ceramic fragment samples collected from the Tal Al-Wardiate site, Missiaf town, Hamma city, Syria. Thirty-six chemical elements were determined. These elemental concentrations have been processed using two multivariate statistical methods, cluster and factor analysis, in order to determine similarities and correlation between the various samples. Factor analysis confirms that samples were correctly classified by cluster analysis. The results showed that samples can be considered to be manufactured using three different sources of raw material. (author)

  7. Multivariate analysis of heavy metal contamination using river sediment cores of Nankan River, northern Taiwan

    Science.gov (United States)

    Lee, An-Sheng; Lu, Wei-Li; Huang, Jyh-Jaan; Chang, Queenie; Wei, Kuo-Yen; Lin, Chin-Jung; Liou, Sofia Ya Hsuan

    2016-04-01

    Owing to the geology and climate of Taiwan, rivers generally carry large amounts of suspended particles. After these particles settle, they become sediments, which are good sorbents for heavy metals in river systems. Consequently, sediments in low flow energy regions, such as estuaries, can record the footprint of contamination. Seven sediment cores were collected along the Nankan River, northern Taiwan, which is seriously contaminated by factory, household and agricultural inputs. Physico-chemical properties of these cores were derived from an Itrax-XRF Core Scanner and grain size analysis. In order to interpret these complex data matrices, multivariate statistical techniques (cluster analysis, factor analysis and discriminant analysis) were introduced to this study. The statistical analysis indicates four types of sediment. One of them represents contamination events and shows high concentrations of Cu, Zn, Pb, Ni and Fe, and low concentrations of Si and Zr. Furthermore, three possible contamination sources of this type of sediment were revealed by factor analysis. The combination of sediment analysis and multivariate statistical techniques provides new insights into the contamination depositional history of the Nankan River and could be similarly applied to other river systems to determine the scale of anthropogenic contamination.

  8. A method of signal transmission path analysis for multivariate random processes

    International Nuclear Information System (INIS)

    Oguma, Ritsuo

    1984-04-01

    A method for noise analysis called "STP (signal transmission path) analysis" is presented as a tool to identify noise sources and their propagation paths in multivariate random processes. The basic idea of the analysis is to identify, via time series analysis, the effective network for signal power transmission among variables in the system and to apply that information to the noise analysis. In the present paper, we accomplish this through two signal-processing steps: first, we estimate, using noise power contribution analysis, the variables that contribute most to the power spectrum of interest, and then we evaluate the STPs for each pair of variables to identify the STPs that play a significant role in transmitting the generated noise to the variable under evaluation. The latter part of the analysis is executed through comparison of the partial coherence function and a newly introduced partial noise power contribution function. This paper presents the procedure of the STP analysis and demonstrates, using simulation data as well as Borssele PWR noise data, its effectiveness for investigation of noise generation and propagation mechanisms. (author)

  9. Implementing a Java Based GUI for RICH Detector Analysis

    Science.gov (United States)

    Lendacky, Andrew; Voloshin, Andrew; Benmokhtar, Fatiha

    2016-09-01

    The CLAS12 detector at Thomas Jefferson National Accelerator Facility (TJNAF) is undergoing an upgrade. One of the improvements is the addition of a Ring Imaging Cherenkov (RICH) detector to improve particle identification in the 3-8 GeV/c momentum range. Approximately 400 multi-anode photomultiplier tubes (MAPMTs) are going to be used to detect Cherenkov radiation in the single photoelectron spectra (SPS). The SPS of each pixel of all MAPMTs have been fitted to a mathematical model of roughly 45 parameters for 4 HVs, 3 OD. Out of those parameters, 9 can be used to evaluate a PMT's performance and placement in the detector. To help analyze data when the RICH is operational, a GUI application was written in Java using Swing and detector packages from TJNAF. To store and retrieve the data, a MySQL database program was written in Java using the JDBC package. Using the database, the GUI pulls the values and produces histograms and graphs for a selected PMT at a specific HV and OD. The GUI will allow researchers to easily view a PMT's performance and efficiency to help with data analysis and ring reconstruction when the RICH is finished.

  10. Biodegradable blends of starch/polyvinyl alcohol/glycerol: multivariate analysis of the mechanical properties

    Directory of Open Access Journals (Sweden)

    Juliano Zanela

    The aim of the work was to study the mechanical properties of extruded starch/polyvinyl alcohol (PVA)/glycerol biodegradable blends using multivariate analysis. The blends were produced as cylindrical strands by extrusion using PVAs with different hydrolysis degrees and viscosities, at two extrusion temperature profiles (90/170/170/170/170 °C and 90/170/200/200/200 °C) and three conditioning relative humidities of the samples (33, 53, and 75%). The mechanical properties showed great variability according to PVA type, extrusion temperature profile and conditioning relative humidity; the tensile strength ranged from 0.42 to 5.40 MPa, elongation at break ranged from 10 to 404% and Young’s modulus ranged from 0.93 to 13.81 MPa. Multivariate analysis was a useful methodology for studying the mechanical behavior of starch/PVA/glycerol blends, and it can be used as an exploratory technique to select the more suitable PVA type and extrusion temperature to produce biodegradable materials.

  11. Dynamic factor analysis in the frequency domain: causal modeling of multivariate psychophysiological time series

    NARCIS (Netherlands)

    Molenaar, P.C.M.

    1987-01-01

    Outlines a frequency domain analysis of the dynamic factor model and proposes a solution to the problem of constructing a causal filter of lagged factor loadings. The method is illustrated with applications to simulated and real multivariate time series. The latter applications involve topographic

  12. Principal response curves: analysis of time-dependent multivariate responses of biological community to stress

    NARCIS (Netherlands)

    Brink, van den P.J.; Braak, ter C.J.F.

    1999-01-01

    In this paper a novel multivariate method is proposed for the analysis of community response data from designed experiments repeatedly sampled in time. The long-term effects of the insecticide chlorpyrifos on the invertebrate community and the dissolved oxygen (DO)–pH–alkalinity–conductivity

  13. Morphological analysis of enlarged ventricle on CT image, using multivariate analysis

    International Nuclear Information System (INIS)

    Iwasaki, Satoru; Kichikawa, Kimihiko; Otsuji, Hideyuki; Fukusumi, Akio; Kobayashi, Yasuo.

    1983-01-01

    Multivariate analysis of enlarged cerebral ventricle on CT was undertaken to study the characteristics of ventricular morphology. Several ventricular segments of enlarged ventricle, defined on the basis of the study of normal group, were linearly measured on CT image. Then the discriminant analysis with the increase and decrease of variable was applied. The following are the results obtained. The error ratio of discrimination between pressure hydrocephalus and cerebral atrophy was 8.4 %, and between obstructive hydrocephalus and communicating hydrocephalus was 11.3 %. Ventricular segments were divided into three groups according to their character of enlargement: (1) the temporal horn and trigone are large in pressure hydrocephalus; (2) the hypothalamic segment of the third ventricle and the body of lateral ventricle are larger in obstructive hydrocephalus than in communicating hydrocephalus; (3) the anterior horn, cellae mediae at the level of the head of caudate nuclei and thalamic segment of the third ventricle are relatively large in cerebral atrophy and communicating hydrocephalus. The hypothalamic segment of the third ventricle assumes a round or oval shape in pressure hydrocephalus but a rectangular or teardrop shape in cerebral atrophy. These findings are contributory to pathological evaluation of ventricular enlargement. (author)

  14. Multivariate analysis for customer segmentation based on RFM

    Directory of Open Access Journals (Sweden)

    Álvaro Julio Cuadros López

    2018-02-01

    Context: To build successful customer relationship management (CRM), companies must start with the identification of the true value of customers, as this provides basic information to implement more targeted and customized marketing strategies. The RFM methodology, a classic analysis tool that uses three evaluation parameters, allows companies to understand customer behavior and to establish customer segments. The addition of a new parameter to the traditional technique is an opportunity to refine the possible outcomes of a customer segmentation, since it not only provides a new element of evaluation to identify the most valuable customers, but also makes it possible to differentiate and get to know customers even better. Method: The article presents a methodology that makes it possible to establish customer segments using an extended RFM method with new variables, selected through multivariate analysis. Results: The proposed implementation was applied in a company in which variables such as profit, profit percentage, and billing due date were tested. It was therefore possible to establish a more detailed customer segmentation than with the classic RFM. Conclusions: The RFM analysis is a method widely used in industry for its easy understanding and applicability. However, it can be improved with the use of statistical procedures and new variables, which will allow companies to have deeper information about the behavior of their clients and will facilitate the design of specific marketing strategies.

  15. Multivariate pattern analysis of MEG and EEG: A comparison of representational structure in time and space.

    Science.gov (United States)

    Cichy, Radoslaw Martin; Pantazis, Dimitrios

    2017-09-01

    Multivariate pattern analysis of magnetoencephalography (MEG) and electroencephalography (EEG) data can reveal the rapid neural dynamics underlying cognition. However, MEG and EEG have systematic differences in sampling neural activity. This poses the question of the degree to which such measurement differences consistently bias the results of multivariate analysis applied to MEG and EEG activation patterns. To investigate, we conducted a concurrent MEG/EEG study while participants viewed images of everyday objects. We applied multivariate classification analyses to MEG and EEG data, and compared the resulting time courses to each other, and to fMRI data for an independent evaluation in space. We found that both MEG and EEG revealed the millisecond spatio-temporal dynamics of visual processing with largely equivalent results. Beyond yielding convergent results, we found that MEG and EEG also captured partly unique aspects of visual representations. Those unique components emerged earlier in time for MEG than for EEG. Identifying the sources of those unique components with fMRI, we found the locus for both MEG and EEG in high-level visual cortex, and in addition for MEG in low-level visual cortex. Together, our results show that multivariate analyses of MEG and EEG data offer a convergent and complementary view on neural processing, and motivate the wider adoption of these methods in both MEG and EEG research.

  16. Multivariate Analysis Approach to the Serum Peptide Profile of Morbidly Obese Patients

    Directory of Open Access Journals (Sweden)

    M. Agostini

    2013-01-01

    Background: Obesity is currently epidemic in many countries worldwide and is strongly related to diabetes and cardiovascular disease. Mass spectrometry, in particular matrix-assisted laser desorption/ionization time of flight (MALDI-TOF), is currently used for detecting different patterns of expressed proteins. This study investigated the differences in low molecular weight (LMW) peptide profiles between obese and normal-weight subjects in combination with multivariate statistical analysis.

  17. Probing beer aging chemistry by nuclear magnetic resonance and multivariate analysis

    Energy Technology Data Exchange (ETDEWEB)

    Rodrigues, J.A. [CICECO-Department of Chemistry, University of Aveiro, Campus de Santiago, 3810-193 Aveiro (Portugal); Barros, A.S. [QOPNA-Department of Chemistry, University of Aveiro, Campus de Santiago, 3810-193 Aveiro (Portugal); Carvalho, B.; Brandao, T. [UNICER, Bebidas de Portugal, Leca do Balio, 4466-955, S. Mamede de Infesta (Portugal); Gil, Ana M., E-mail: agil@ua.pt [CICECO-Department of Chemistry, University of Aveiro, Campus de Santiago, 3810-193 Aveiro (Portugal)

    2011-09-30

    Graphical abstract: The use of nuclear magnetic resonance (NMR) metabonomics for monitoring the chemical changes occurring in beer exposed to forced aging (at 45 °C for up to 18 days) is described. Both principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) were applied to the NMR spectra of beer recorded as a function of aging and an aging trend was observed. Inspection of PLS-DA loadings and peak integration revealed the importance of well known markers (e.g. 5-HMF) as well as of other compounds: amino acids, higher alcohols, organic acids, dextrins and some still unassigned spin systems. 2D correlation analysis enabled relevant compound variations to be confirmed and inter-compound correlations to be assessed, thus offering improved insight into the chemical aspects of beer aging. Highlights: · Use of NMR metabonomics for monitoring the chemical changes occurring in beer exposed to forced aging. · Compositional variations evaluated by principal component analysis and partial least squares-discriminant analysis. · Results reveal importance of known markers and other compounds: amino and organic acids, higher alcohols, dextrins. · 2D correlation analysis reveals inter-compound relationships, offering insight into beer aging chemistry. - Abstract: This paper describes the use of nuclear magnetic resonance (NMR) spectroscopy, in tandem with multivariate analysis (MVA), for monitoring the chemical changes occurring in a lager beer exposed to forced aging (at 45 °C for up to 18 days). To evaluate the resulting compositional variations, both principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) were applied to the NMR spectra of beer recorded as a function of aging and a clear aging trend was observed. Inspection of PLS-DA loadings and peak integration enabled the changing compounds to be identified, revealing the importance of well known

  18. Probing beer aging chemistry by nuclear magnetic resonance and multivariate analysis

    International Nuclear Information System (INIS)

    Rodrigues, J.A.; Barros, A.S.; Carvalho, B.; Brandao, T.; Gil, Ana M.

    2011-01-01

    Graphical abstract: The use of nuclear magnetic resonance (NMR) metabonomics for monitoring the chemical changes occurring in beer exposed to forced aging (at 45 °C for up to 18 days) is described. Both principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) were applied to the NMR spectra of beer recorded as a function of aging and an aging trend was observed. Inspection of PLS-DA loadings and peak integration revealed the importance of well known markers (e.g. 5-HMF) as well as of other compounds: amino acids, higher alcohols, organic acids, dextrins and some still unassigned spin systems. 2D correlation analysis enabled relevant compound variations to be confirmed and inter-compound correlations to be assessed, thus offering improved insight into the chemical aspects of beer aging. Highlights: · Use of NMR metabonomics for monitoring the chemical changes occurring in beer exposed to forced aging. · Compositional variations evaluated by principal component analysis and partial least squares-discriminant analysis. · Results reveal importance of known markers and other compounds: amino and organic acids, higher alcohols, dextrins. · 2D correlation analysis reveals inter-compound relationships, offering insight into beer aging chemistry. - Abstract: This paper describes the use of nuclear magnetic resonance (NMR) spectroscopy, in tandem with multivariate analysis (MVA), for monitoring the chemical changes occurring in a lager beer exposed to forced aging (at 45 °C for up to 18 days). To evaluate the resulting compositional variations, both principal component analysis (PCA) and partial least squares-discriminant analysis (PLS-DA) were applied to the NMR spectra of beer recorded as a function of aging and a clear aging trend was observed. Inspection of PLS-DA loadings and peak integration enabled the changing compounds to be identified, revealing the importance of well known markers such as 5-hydroxymethylfurfural (5

  19. Synthetic environmental indicators: A conceptual approach from the multivariate statistics

    International Nuclear Information System (INIS)

    Escobar J, Luis A

    2008-01-01

    This paper presents a general description of multivariate statistical analysis and shows two methodologies: analysis of principal components and analysis of distance, DP2. Both methods use techniques of multivariate analysis to define the true dimension of data, which is useful to estimate indicators of environmental quality.

  20. Motivation and Self-Regulated Learning: A Multivariate Multilevel Analysis

    Directory of Open Access Journals (Sweden)

    Wondimu Ahmed

    2017-09-01

    This study investigated the relationship between motivation and self-regulated learning (SRL) in a nationally representative sample of 5,245 15-year-old students in the USA. A multivariate multilevel analysis was conducted to examine the role of three motivational variables (self-efficacy, intrinsic value, and instrumental value) in predicting three SRL strategies (memorization, elaboration, and control). The results showed that, compared to self-efficacy, intrinsic value and instrumental value of math were stronger predictors of memorization, elaboration and control strategies. None of the motivational variables had a stronger effect on one strategy than the other. The findings suggest that the development of self-regulatory skills in math can be greatly enhanced by helping students develop positive value of and realistic expectancy for success in math.

  1. Multivariate co-integration analysis of the Kaya factors in Ghana.

    Science.gov (United States)

    Asumadu-Sarkodie, Samuel; Owusu, Phebe Asantewaa

    2016-05-01

    The fundamental goal of the Government of Ghana's development agenda as enshrined in the Growth and Poverty Reduction Strategy to grow the economy to a middle income status of US$1000 per capita by the end of 2015 could be met by increasing the labour force, increasing energy supplies and expanding the energy infrastructure in order to achieve the sustainable development targets. In this study, a multivariate co-integration analysis of the Kaya factors namely carbon dioxide, total primary energy consumption, population and GDP was investigated in Ghana using vector error correction model with data spanning from 1980 to 2012. Our research results show an existence of long-run causality running from population, GDP and total primary energy consumption to carbon dioxide emissions. However, there is evidence of short-run causality running from population to carbon dioxide emissions. There was a bi-directional causality running from carbon dioxide emissions to energy consumption and vice versa. In other words, decreasing the primary energy consumption in Ghana will directly reduce carbon dioxide emissions. In addition, a bi-directional causality running from GDP to energy consumption and vice versa exists in the multivariate model. It is plausible that access to energy has a relationship with increasing economic growth and productivity in Ghana.
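    For context, the four Kaya factors named in this abstract are usually tied together by the Kaya identity. The identity is standard background and is not stated in the abstract itself; the symbols below are chosen here for illustration ($C$ for carbon dioxide emissions, $P$ for population, $G$ for GDP, $E$ for total primary energy consumption).

```latex
C \;=\; P \times \frac{G}{P} \times \frac{E}{G} \times \frac{C}{E}
\qquad\Longrightarrow\qquad
\ln C \;=\; \ln P + \ln\frac{G}{P} + \ln\frac{E}{G} + \ln\frac{C}{E}
```

    The right-hand terms cancel pairwise, so the identity always holds by construction. The logarithmic form is additive, which is one reason the factors lend themselves to linear time-series models; whether the study used exactly this form is not stated in the abstract.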

  2. Causal networks clarify productivity-richness interrelations, bivariate plots do not

    Science.gov (United States)

    Grace, James B.; Adler, Peter B.; Harpole, W. Stanley; Borer, Elizabeth T.; Seabloom, Eric W.

    2014-01-01

    Perhaps no other pair of variables in ecology has generated as much discussion as species richness and ecosystem productivity, as illustrated by the reactions by Pierce (2013) and others to Adler et al.'s (2011) report that empirical patterns are weak and inconsistent. Adler et al. (2011) argued we need to move beyond a focus on simplistic bivariate relationships and test mechanistic, multivariate causal hypotheses. We feel the continuing debate over productivity–richness relationships (PRRs) provides a focused context for illustrating the fundamental difficulties of using bivariate relationships to gain scientific understanding.

  3. Multivariate cluster analysis of dynamic iodine-123 iodobenzamide SPET dopamine D2 receptor images in schizophrenia

    Energy Technology Data Exchange (ETDEWEB)

    Acton, P.D. [Inst. of Nuclear Medicine, Univ. Coll. London Medical School, London (United Kingdom); Pilowsky, L.S. [Institute of Psychiatry, London (United Kingdom); Costa, D.C. [Inst. of Nuclear Medicine, Univ. Coll. London Medical School, London (United Kingdom); Ell, P.J. [Inst. of Nuclear Medicine, Univ. Coll. London Medical School, London (United Kingdom)

    1997-02-01

    This paper describes the application of a multivariate statistical technique to investigate striatal dopamine D2 receptor concentrations measured by iodine-123 iodobenzamide (123I-IBZM) single-photon emission tomography (SPET). This technique enables the automatic segmentation of dynamic nuclear medicine images based on the underlying time-activity curves present in the data. Once the time-activity curves have been extracted, each pixel can be mapped back on to the underlying distribution, considerably reducing image noise. Cluster analysis has been verified using computer simulations and phantom studies. The technique has been applied to SPET images of dopamine D2 receptors in a total of 20 healthy and 20 schizophrenic volunteers (22 male, 18 female), using the ligand 123I-IBZM. Following automatic image segmentation, the concentration of striatal dopamine D2 receptors shows a significant left-sided asymmetry in male schizophrenics compared with male controls. The mean left-minus-right laterality index for controls is -1.52 (95% CI -3.72 to 0.66) and for patients 4.04 (95% CI 1.07 to 7.01). Analysis of variance shows a case-by-sex-by-side interaction, with F=10.01, P=0.005. We can now demonstrate that the previously observed male sex-specific D2 receptor asymmetry in schizophrenia, which had failed to attain statistical significance, is valid. Cluster analysis of dynamic nuclear medicine studies provides a powerful tool for automatic segmentation and noise reduction of the images, removing much of the subjectivity inherent in region-of-interest analysis. The observed striatal D2 asymmetry could reflect long-hypothesized disruptions in dopamine-rich cortico-striatal-limbic circuits in schizophrenic males. (orig.). With 4 figs., 2 tabs.

  4. Multivariate Max-Stable Spatial Processes

    KAUST Repository

    Genton, Marc G.

    2014-01-06

    Analysis of spatial extremes is currently based on univariate processes. Max-stable processes allow the spatial dependence of extremes to be modelled and explicitly quantified; they are therefore widely adopted in applications. For a better understanding of extreme events of real processes, such as environmental phenomena, it may be useful to study several spatial variables simultaneously. To this end, we extend some theoretical results and applications of max-stable processes to the multivariate setting to analyze extreme events of several variables observed across space. In particular, we study the maxima of independent replicates of multivariate processes, both in the Gaussian and Student-t cases. Then, we define a Poisson process construction in the multivariate setting and introduce multivariate versions of the Smith Gaussian extreme-value, the Schlather extremal-Gaussian and extremal-t, and the Brown-Resnick models. Inferential aspects of those models based on composite likelihoods are developed. We present results of various Monte Carlo simulations and of an application to a dataset of summer daily temperature maxima and minima in Oklahoma, U.S.A., highlighting the utility of working with multivariate models in contrast to the univariate case. Based on joint work with Simone Padoan and Huiyan Sang.

  5. Multivariate Max-Stable Spatial Processes

    KAUST Repository

    Genton, Marc G.

    2014-01-01

    Analysis of spatial extremes is currently based on univariate processes. Max-stable processes allow the spatial dependence of extremes to be modelled and explicitly quantified; they are therefore widely adopted in applications. For a better understanding of extreme events of real processes, such as environmental phenomena, it may be useful to study several spatial variables simultaneously. To this end, we extend some theoretical results and applications of max-stable processes to the multivariate setting to analyze extreme events of several variables observed across space. In particular, we study the maxima of independent replicates of multivariate processes, both in the Gaussian and Student-t cases. Then, we define a Poisson process construction in the multivariate setting and introduce multivariate versions of the Smith Gaussian extreme-value, the Schlather extremal-Gaussian and extremal-t, and the Brown-Resnick models. Inferential aspects of those models based on composite likelihoods are developed. We present results of various Monte Carlo simulations and of an application to a dataset of summer daily temperature maxima and minima in Oklahoma, U.S.A., highlighting the utility of working with multivariate models in contrast to the univariate case. Based on joint work with Simone Padoan and Huiyan Sang.

  6. Late rectal toxicity after conformal radiotherapy of prostate cancer (I): multivariate analysis and dose-response

    International Nuclear Information System (INIS)

    Skwarchuk, Mark W.; Jackson, Andrew; Zelefsky, Michael J.; Venkatraman, Ennapadam S.; Cowen, Didier M.; Levegruen, Sabine; Burman, Chandra M.; Fuks, Zvi; Leibel, Steven A.; Ling, C. Clifton

    2000-01-01

    Purpose: The purpose of this paper is to use the outcome of a dose escalation protocol for three-dimensional conformal radiation therapy (3D-CRT) of prostate cancer to study the dose-response for late rectal toxicity and to identify anatomic, dosimetric, and clinical factors that correlate with late rectal bleeding in multivariate analysis. Methods and Materials: Seven hundred forty-three patients with T1c-T3 prostate cancer were treated with 3D-CRT with prescribed doses of 64.8 to 81.0 Gy. The 5-year actuarial rate of late rectal toxicity was assessed using Kaplan-Meier statistics. A retrospective dosimetric analysis was performed for patients treated to 70.2 Gy (52 patients) or 75.6 Gy (119 patients) who either exhibited late rectal bleeding (RTOG Grade 2/3) within 30 months after treatment (i.e., 70.2 Gy--13 patients, 75.6 Gy--36 patients) or were nonbleeding for at least 30 months (i.e., 70.2 Gy--39 patients, 75.6 Gy--83 patients). Univariate and multivariate logistic regression was performed to correlate late rectal bleeding with several anatomic, dosimetric, and clinical variables. Results: A dose response for ≥ Grade 2 late rectal toxicity was observed. By multivariate analysis, the following factors were significantly correlated with ≥ Grade 2 late rectal bleeding for patients prescribed 70.2 Gy: 1) enclosure of the outer rectal contour by the 50% isodose on the isocenter slice (i.e., Iso50) (p max (p max

  7. Multivariate methods in nuclear waste remediation: Needs and applications

    International Nuclear Information System (INIS)

    Pulsipher, B.A.

    1992-05-01

    The United States Department of Energy (DOE) has developed a strategy for nuclear waste remediation and environmental restoration at several major sites across the country. Nuclear and hazardous wastes are found in underground storage tanks, containment drums, soils, and facilities. Due to the many possible contaminants and complexities of sampling and analysis, multivariate methods are directly applicable. However, effective application of multivariate methods will require greater ability to communicate methods and results to a non-statistician community. Moreover, more flexible multivariate methods may be required to accommodate inherent sampling and analysis limitations. This paper outlines multivariate applications in the context of select DOE environmental restoration activities and identifies several perceived needs.

  8. Productivity is a poor predictor of plant species richness

    Science.gov (United States)

    Adler, Peter B.; Seabloom, Eric W.; Borer, Elizabeth T.; Hillebrand, Helmut; Hautier, Yann; Hector, Andy; Harpole, W. Stanley; O'Halloran, Lydia R.; Grace, James B.; Anderson, T. Michael; Bakker, Jonathan D.; Biederman, Lori A.; Brown, Cynthia S.; Buckley, Yvonne M.; Calabrese, Laura B.; Chu, Cheng-Jin; Cleland, Elsa E.; Collins, Scott L.; Cottingham, Kathryn L.; Crawley, Michael J.; Damschen, Ellen Ingman; Davies, Kendi F.; DeCrappeo, Nicole M.; Fay, Philip A.; Firn, Jennifer; Frater, Paul; Gasarch, Eve I.; Gruner, Daneil S.; Hagenah, Nicole; Lambers, Janneke Hille Ris; Humphries, Hope; Jin, Virginia L.; Kay, Adam D.; Kirkman, Kevin P.; Klein, Julia A.; Knops, Johannes M.H.; La Pierre, Kimberly J.; Lambrinos, John G.; Li, Wei; MacDougall, Andrew S.; McCulley, Rebecca L.; Melbourne, Brett A.; Mitchell, Charles E.; Moore, Joslin L.; Morgan, John W.; Mortensen, Brent; Orrock, John L.; Prober, Suzanne M.; Pyke, David A.; Risch, Anita C.; Schuetz, Martin; Smith, Melinda D.; Stevens, Carly J.; Sullivan, Lauren L.; Wang, Gang; Wragg, Peter D.; Wright, Justin P.; Yang, Louie H.

    2011-01-01

    For more than 30 years, the relationship between net primary productivity and species richness has generated intense debate in ecology about the processes regulating local diversity. The original view, which is still widely accepted, holds that the relationship is hump-shaped, with richness first rising and then declining with increasing productivity. Although recent meta-analyses questioned the generality of hump-shaped patterns, these syntheses have been criticized for failing to account for methodological differences among studies. We addressed such concerns by conducting standardized sampling in 48 herbaceous-dominated plant communities on five continents. We found no clear relationship between productivity and fine-scale (m^-2) richness within sites, within regions, or across the globe. Ecologists should focus on fresh, mechanistic approaches to understanding the multivariate links between productivity an

  9. Multivariate data analysis as a tool in advanced quality monitoring in the food production chain

    DEFF Research Database (Denmark)

    Bro, R.; van den Berg, F.; Thybo, A.

    2002-01-01

    This paper summarizes some recent advances in mathematical modeling of relevance in advanced quality monitoring in the food production chain. Using chemometrics (multivariate data analysis), it is illustrated how to tackle problems in food science more efficiently and, moreover, solve problems...

  10. Multivariate quantitative structure-pharmacokinetic relationships (QSPKR) analysis of adenosine A(1) receptor agonists in rat

    NARCIS (Netherlands)

    Van der Graaf, PH; Nilsson, J; Van Schaick, EA; Danhof, M

    The aim of this study was to investigate the feasibility of a quantitative structure-pharmacokinetic relationships (QSPKR) method based on contemporary three-dimensional (3D) molecular characterization and multivariate statistical analysis. For this purpose, the programs SYBYL/CoMFA, GRID, and

  11. Computed ABC Analysis for Rational Selection of Most Informative Variables in Multivariate Data.

    Science.gov (United States)

    Ultsch, Alfred; Lötsch, Jörn

    2015-01-01

    Multivariate data sets often differ in several factors or derived statistical parameters, which have to be selected for a valid interpretation. Basing this selection on traditional statistical limits occasionally leads to the perception of losing information from a data set. This paper proposes a novel method for calculating precise limits for the selection of parameter sets. The algorithm is based on an ABC analysis and calculates these limits on the basis of the mathematical properties of the distribution of the analyzed items. The limits implement the aim of any ABC analysis, i.e., comparing the increase in yield to the required additional effort. In particular, the limit for set A, the "important few", is optimized so that both the effort and the yield for the other sets (B and C) are minimized and the additional gain is optimized. As a typical example from biomedical research, the feasibility of the ABC analysis as an objective replacement for classical subjective limits to select highly relevant variance components of pain thresholds is presented. The proposed method improved the biological interpretation of the results and increased the fraction of valid information that was obtained from the experimental data. The method is applicable to many further biomedical problems including the creation of diagnostic complex biomarkers or short screening tests from comprehensive test batteries. Thus, the ABC analysis can be proposed as a mathematically valid replacement for traditional limits to maximize the information obtained from multivariate research data.
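    The yield-versus-effort comparison described in this abstract can be illustrated with a cumulative ABC curve. The sketch below is a simplified heuristic and is not claimed to be the authors' algorithm: it takes as set A the top items up to the curve point closest to the ideal corner (no effort, full yield). The function names and the example contributions are hypothetical.

```python
def abc_curve(values):
    """Return (effort, yield) points of the cumulative ABC curve.

    Items are sorted from largest to smallest. Effort is the fraction of
    items used so far; yield is the fraction of the total they cover.
    """
    ordered = sorted(values, reverse=True)
    total = sum(ordered)
    points = []
    running = 0.0
    for i, v in enumerate(ordered, start=1):
        running += v
        points.append((i / len(ordered), running / total))
    return points


def pareto_cut(values):
    """Number of top items at the curve point closest to (effort=0, yield=1)."""
    points = abc_curve(values)
    # Squared Euclidean distance to the ideal corner (0, 1).
    distances = [e ** 2 + (1.0 - y) ** 2 for e, y in points]
    return distances.index(min(distances)) + 1


# Hypothetical contributions of ten variables (illustrative numbers only).
contributions = [40, 25, 10, 8, 5, 4, 3, 2, 2, 1]
n_top = pareto_cut(contributions)
print(n_top, sorted(contributions, reverse=True)[:n_top])
```

    With these made-up numbers the cut falls after the three largest items, which together cover 75% of the total for 30% of the effort.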

  12. Multivariate data analysis as a fast tool in evaluation of solid state phenomena

    DEFF Research Database (Denmark)

    Jørgensen, Anna Cecilia; Miroshnyk, Inna; Karjalainen, Milja

    2006-01-01

    of information generated can be overwhelming and the need for more effective data analysis tools is well recognized. The aim of this study was to investigate the use of multivariate data analysis, in particular principal component analysis (PCA), for fast analysis of solid state information. The data sets...... the molecular level interpretation of the structural changes related to the loss of water, as well as interpretation of the phenomena related to the crystallization. The critical temperatures or critical time points were identified easily using the principal component analysis. The variables (diffraction angles...... or wavenumbers) that changed could be identified by the careful interpretation of the loadings plots. The PCA approach provides an effective tool for fast screening of solid state information....

  13. Recent applications of multivariate data analysis methods in the authentication of rice and the most analyzed parameters: A review.

    Science.gov (United States)

    Maione, Camila; Barbosa, Rommel Melgaço

    2018-01-24

    Rice is one of the most important staple foods around the world. Authentication of rice is one of the most addressed concerns in the present literature, which includes recognition of its geographical origin and variety, certification of organic rice and many other issues. Good results have been achieved by multivariate data analysis and data mining techniques when combined with specific parameters for ascertaining authenticity and many other useful characteristics of rice, such as quality, yield and others. This paper brings a review of the recent research projects on discrimination and authentication of rice using multivariate data analysis and data mining techniques. We found that data obtained from image processing, molecular and atomic spectroscopy, elemental fingerprinting, genetic markers, molecular content and others are promising sources of information regarding geographical origin, variety and other aspects of rice, being widely used combined with multivariate data analysis techniques. Principal component analysis and linear discriminant analysis are the preferred methods, but several other data classification techniques such as support vector machines, artificial neural networks and others are also frequently present in some studies and show high performance for discrimination of rice.

  14. Multi-variable systems in nuclear power plant

    International Nuclear Information System (INIS)

    Collins, G.B.; Howell, J.

    1982-01-01

    Nuclear power plants are complex multi-variable dynamically interactive systems which employ many facets of systems and control theory in their analysis and design. Whole plant mathematical models must be developed and validated and, in addition to their obvious role in control system synthesis and design, they are also widely used for operational constraint and plant malfunction analysis. The need for and scope of an integrated power plant control system is discussed and, as a specific example, the design of an integrated feedwater regulator is reviewed. The multi-variable frequency response analysis employed in the design is described in detail. (author)

  15. Multivariate statistics high-dimensional and large-sample approximations

    CERN Document Server

    Fujikoshi, Yasunori; Shimizu, Ryoichi

    2010-01-01

    A comprehensive examination of high-dimensional analysis of multivariate methods and their real-world applications. Multivariate Statistics: High-Dimensional and Large-Sample Approximations is the first book of its kind to explore how classical multivariate methods can be revised and used in place of conventional statistical tools. Written by prominent researchers in the field, the book focuses on high-dimensional and large-scale approximations and details the many basic multivariate methods used to achieve high levels of accuracy. The authors begin with a fundamental presentation of the basic

  16. Analysis of the stability and accuracy of the discrete least-squares approximation on multivariate polynomial spaces

    KAUST Repository

    Migliorati, Giovanni

    2016-01-01

    We review the main results achieved in the analysis of the stability and accuracy of the discrete least-squares approximation on multivariate polynomial spaces, with noiseless evaluations at random points, noiseless evaluations at low

  17. Structural brain connectivity and cognitive ability differences: A multivariate distance matrix regression analysis.

    Science.gov (United States)

    Ponsoda, Vicente; Martínez, Kenia; Pineda-Pardo, José A; Abad, Francisco J; Olea, Julio; Román, Francisco J; Barbey, Aron K; Colom, Roberto

    2017-02-01

    Neuroimaging research involves analyses of huge amounts of biological data that might or might not be related to cognition. This relationship is usually approached using univariate methods, and, therefore, correction methods are mandatory for reducing false positives. Nevertheless, the probability of false negatives is also increased. Multivariate frameworks have been proposed for helping to alleviate this balance. Here we apply multivariate distance matrix regression for the simultaneous analysis of biological and cognitive data, namely, structural connections among 82 brain regions and several latent factors estimating cognitive performance. We tested whether cognitive differences predict distances among individuals regarding their connectivity pattern. Beginning with 3,321 connections among regions, the 36 edges better predicted by the individuals' cognitive scores were selected. Cognitive scores were related to connectivity distances in both the full (3,321) and reduced (36) connectivity patterns. The selected edges connect regions distributed across the entire brain and the network defined by these edges supports high-order cognitive processes such as (a) (fluid) executive control, (b) (crystallized) recognition, learning, and language processing, and (c) visuospatial processing. This multivariate study suggests that a widespread but limited number of regions in the human brain supports high-level cognitive ability differences. Hum Brain Mapp 38:803-816, 2017. © 2016 Wiley Periodicals, Inc.

  18. Synchrotron-Based Microspectroscopic Analysis of Molecular and Biopolymer Structures Using Multivariate Techniques and Advanced Multi-Components Modeling

    International Nuclear Information System (INIS)

    Yu, P.

    2008-01-01

    More recently, an advanced synchrotron radiation-based bioanalytical technique (SRFTIRM) has been applied as a novel non-invasive analysis tool to study molecular, functional group and biopolymer chemistry, nutrient make-up and structural conformation in biomaterials. This novel synchrotron technique, taking advantage of bright synchrotron light (which is a million times brighter than sunlight), is capable of exploring the biomaterials at molecular and cellular levels. However, with the synchrotron RFTIRM technique, a large number of molecular spectral data are usually collected. The objective of this article was to illustrate how to use two multivariate statistical techniques, (1) agglomerative hierarchical cluster analysis (AHCA) and (2) principal component analysis (PCA), and two advanced multicomponent modeling methods, (1) Gaussian and (2) Lorentzian multi-component peak modeling, for molecular spectrum analysis of bio-tissues. The studies indicated that the two multivariate analyses (AHCA, PCA) are able to create molecular spectral corrections by including not just one intensity or frequency point of a molecular spectrum, but by utilizing the entire spectral information. Gaussian and Lorentzian modeling techniques are able to quantify spectral component peaks of molecular structure, functional group and biopolymer. By application of these four statistical methods of the multivariate techniques and Gaussian and Lorentzian modeling, inherent molecular structures, functional group and biopolymer conformation between and among biological samples can be quantified, discriminated and classified with great efficiency.
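    The Gaussian and Lorentzian multi-component peak modeling mentioned in this abstract rests on two standard line shapes. The sketch below only evaluates those shapes and sums them into a band; it does not fit anything, and the parameter names, peak positions and widths are hypothetical rather than taken from the article.

```python
import math


def gaussian(x, amplitude, center, sigma):
    """Gaussian line shape with peak height `amplitude` at `center`."""
    return amplitude * math.exp(-((x - center) ** 2) / (2.0 * sigma ** 2))


def lorentzian(x, amplitude, center, gamma):
    """Lorentzian line shape; `gamma` is the half width at half maximum."""
    return amplitude * gamma ** 2 / ((x - center) ** 2 + gamma ** 2)


def multi_peak(x, peaks):
    """Sum of component peaks; each peak is (shape, amplitude, center, width)."""
    return sum(shape(x, a, c, w) for shape, a, c, w in peaks)


# Hypothetical two-component band (positions in arbitrary wavenumber units).
peaks = [
    (gaussian, 1.0, 1650.0, 10.0),
    (lorentzian, 0.5, 1630.0, 8.0),
]
model_at_1650 = multi_peak(1650.0, peaks)
print(round(model_at_1650, 4))
```

    In practice the amplitudes, centers and widths would be estimated from a measured spectrum by a least-squares fit. The Lorentzian has heavier tails than the Gaussian, so the choice of shape changes how much each component contributes away from its center.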

  19. Web-Based Tools for Modelling and Analysis of Multivariate Data: California Ozone Pollution Activity

    Science.gov (United States)

    Dinov, Ivo D.; Christou, Nicolas

    2011-01-01

    This article presents a hands-on web-based activity motivated by the relation between human health and ozone pollution in California. This case study is based on multivariate data collected monthly at 20 locations in California between 1980 and 2006. Several strategies and tools for data interrogation and exploratory data analysis, model fitting…

  20. RICH High Voltages & PDF Analysis @ LHCb

    CERN Multimedia

    Fanchini, E

    2009-01-01

    In the LHCb experiment, an important issue is the identification of the hadrons in the final states of B meson decays. Two RICH subdetectors are devoted to this task, and Hybrid Photon Detectors (HPDs) are the photodetectors used to detect Cherenkov light. This poster describes how the stability of the very high voltage (-18 kV) supply used to power the HPDs is monitored. It also presents the basics of a study that can be done with the first collision data: the analysis of dimuons from the Drell-Yan process. This process is well known, and the acceptance of the LHCb detector in terms of pseudorapidity will be very useful to improve the knowledge of the proton structure functions or, alternatively, to try to estimate the luminosity from it.

  1. Ischemic risk stratification by means of multivariate analysis of the heart rate variability

    International Nuclear Information System (INIS)

    Valencia, José F; Vallverdú, Montserrat; Caminal, Pere; Porta, Alberto; Voss, Andreas; Schroeder, Rico; Vázquez, Rafael; Bayés de Luna, Antonio

    2013-01-01

    In this work, a univariate and multivariate statistical analysis of indexes derived from heart rate variability (HRV) was conducted to stratify patients with ischemic dilated cardiomyopathy (IDC) in cardiac risk groups. Indexes derived from conditional entropy, refined multiscale entropy (RMSE), detrended fluctuation analysis, and time and frequency analysis were applied to the RR interval series (beat-to-beat series), for single and multiscale complexity analysis of the HRV in IDC patients. Clinical parameters were also considered. Two different end-points after a follow-up of three years were considered: (i) analysis A, with 151 survivor patients as a low risk group and 13 patients that suffered sudden cardiac death as a high risk group; (ii) analysis B, with 192 survivor patients as a low risk group and 30 patients that suffered cardiac mortality as a high risk group. A univariate and multivariate linear discriminant analysis was used as a statistical technique for classifying patients in risk groups. Sensitivity (Sen) and specificity (Spe) were calculated as diagnostic criteria in order to evaluate the performance of the indexes and their linear combinations. Sen and Spe values of 80.0% and 72.9%, respectively, were obtained during daytime by combining one clinical parameter and one index from RMSE, and during nighttime Sen = 80% and Spe = 73.4% were attained by combining one clinical factor and two indexes from RMSE. In particular, relatively long time scales were more relevant for classifying patients into risk groups during nighttime, while during daytime shorter scales performed better. The results suggest that the left atrial size indexed to body surface, together with RMSE indexes, allows enhanced classification of ischemic patients in their respective risk groups, confirming that a single measurement is not enough to fully characterize ischemic risk patients and the clinical relevance of HRV complexity measures. (paper)
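    Sensitivity and specificity, the two criteria reported in this abstract, are simple ratios from a two-class confusion table. The sketch below shows the arithmetic on made-up labels; the function name, the "high"/"low" labels and the toy data are assumptions for illustration and are unrelated to the study's patients.

```python
def sensitivity_specificity(true_labels, predicted_labels, positive="high"):
    """Return (sensitivity, specificity) for a two-class problem.

    Sensitivity = TP / (TP + FN): share of true high-risk cases detected.
    Specificity = TN / (TN + FP): share of true low-risk cases kept low.
    """
    tp = fn = tn = fp = 0
    for truth, pred in zip(true_labels, predicted_labels):
        if truth == positive:
            if pred == positive:
                tp += 1
            else:
                fn += 1
        else:
            if pred == positive:
                fp += 1
            else:
                tn += 1
    return tp / (tp + fn), tn / (tn + fp)


# Made-up toy labels: 5 high-risk and 10 low-risk cases.
truth = ["high"] * 5 + ["low"] * 10
pred = ["high"] * 4 + ["low"] + ["low"] * 7 + ["high"] * 3
sen, spe = sensitivity_specificity(truth, pred)
print(f"Sen = {sen:.1%}, Spe = {spe:.1%}")
```

    When the high-risk group is small, as with the 13 and 30 patients in the analyses above, a single reclassified patient shifts sensitivity by several percentage points, so the two figures are best read together.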

  2. Multivariate analysis of microarray data: differential expression and differential connection.

    Science.gov (United States)

    Kiiveri, Harri T

    2011-02-01

    Typical analysis of microarray data ignores the correlation between gene expression values. In this paper we present a model for microarray data which specifically allows for correlation between genes. As a result we combine gene network ideas with linear models and differential expression. We use sparse inverse covariance matrices and their associated graphical representation to capture the notion of gene networks. An important issue in using these models is the identification of the pattern of zeroes in the inverse covariance matrix. The limitations of existing methods for doing this are discussed and we provide a workable solution for determining the zero pattern. We then consider a method for estimating the parameters in the inverse covariance matrix which is suitable for very high dimensional matrices. We also show how to construct multivariate tests of hypotheses. These overall multivariate tests can be broken down into two components, the first one being similar to tests for differential expression and the second involving the connections between genes. The methods in this paper enable the extraction of a wealth of information concerning the relationships between genes which can be conveniently represented in graphical form. Differentially expressed genes can be placed in the context of the gene network and places in the gene network where unusual or interesting patterns have emerged can be identified, leading to the formulation of hypotheses for future experimentation.
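    The link between zeroes in the inverse covariance matrix and missing edges in a network can be seen on a three-variable toy case. Under Gaussian assumptions, a zero in the inverse covariance (precision) matrix means the two variables are conditionally independent given the rest. The sketch below is only an illustration with a hypothetical chain of three genes and a hand-written 3x3 inverse; it is not the estimation method of the paper.

```python
def inverse_3x3(m):
    """Invert a 3x3 matrix with the cofactor (adjugate) formula."""
    (a, b, c), (d, e, f), (g, h, i) = m
    det = a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)
    if det == 0:
        raise ValueError("matrix is singular")
    adj = [
        [e * i - f * h, c * h - b * i, b * f - c * e],
        [f * g - d * i, a * i - c * g, c * d - a * f],
        [d * h - e * g, b * g - a * h, a * e - b * d],
    ]
    return [[v / det for v in row] for row in adj]


# Hypothetical chain gene1 - gene2 - gene3 with correlation r between
# neighbours; gene1 and gene3 are correlated only through gene2 (r * r).
r = 0.5
covariance = [
    [1.0, r, r * r],
    [r, 1.0, r],
    [r * r, r, 1.0],
]
precision = inverse_3x3(covariance)
for row in precision:
    print([round(v, 4) for v in row])
```

    Every entry of the covariance matrix is non-zero, yet the precision matrix has a zero in the gene1/gene3 position, which corresponds to the missing edge in the chain. In high dimensions the zero pattern has to be estimated from data, which is the problem the abstract refers to.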

  3. Analysis of multi-species point patterns using multivariate log Gaussian Cox processes

    DEFF Research Database (Denmark)

    Waagepetersen, Rasmus; Guan, Yongtao; Jalilian, Abdollah

    Multivariate log Gaussian Cox processes are flexible models for multivariate point patterns. However, they have so far only been applied in bivariate cases. In this paper we move beyond the bivariate case in order to model multi-species point patterns of tree locations. In particular we address the problems of identifying parsimonious models and of extracting biologically relevant information from the fitted models. The latent multivariate Gaussian field is decomposed into components given in terms of random fields common to all species and components which are species specific. This allows... of the data. The selected number of common latent fields provides an index of complexity of the multivariate covariance structure. Hierarchical clustering is used to identify groups of species with similar patterns of dependence on the common latent fields.

  4. Multivariate pattern dependence.

    Directory of Open Access Journals (Sweden)

    Stefano Anzellotti

    2017-11-01

    Full Text Available When we perform a cognitive task, multiple brain regions are engaged. Understanding how these regions interact is a fundamental step to uncover the neural bases of behavior. Most research on the interactions between brain regions has focused on the univariate responses in the regions. However, fine grained patterns of response encode important information, as shown by multivariate pattern analysis. In the present article, we introduce and apply multivariate pattern dependence (MVPD): a technique to study the statistical dependence between brain regions in humans in terms of the multivariate relations between their patterns of responses. MVPD characterizes the responses in each brain region as trajectories in region-specific multidimensional spaces, and models the multivariate relationship between these trajectories. We applied MVPD to the posterior superior temporal sulcus (pSTS) and to the fusiform face area (FFA), using a searchlight approach to reveal interactions between these seed regions and the rest of the brain. Across two different experiments, MVPD identified significant statistical dependence not detected by standard functional connectivity. Additionally, MVPD outperformed univariate connectivity in its ability to explain independent variance in the responses of individual voxels. In the end, MVPD uncovered different connectivity profiles associated with different representational subspaces of FFA: the first principal component of FFA shows differential connectivity with occipital and parietal regions implicated in the processing of low-level properties of faces, while the second and third components show differential connectivity with anterior temporal regions implicated in the processing of invariant representations of face identity.

  5. Beneath the veil: Plant growth form influences the strength of species richness-productivity relationships in forests

    Science.gov (United States)

    Oberle, B.; Grace, J.B.; Chase, J.M.

    2009-01-01

    Aim: Species richness has been observed to increase with productivity at large spatial scales, though the strength of this relationship varies among functional groups. In forests, canopy trees shade understorey plants, and for this reason we hypothesize that species richness of canopy trees will depend on macroclimate, while species richness of shorter growth forms will additionally be affected by shading from the canopy. In this study we test for differences in species richness-productivity relationships (SRPRs) among growth forms (canopy trees, shrubs, herbaceous species) in small forest plots. Location: We analysed 231 plots ranging from 34.0° to 48.3° N latitude and from 75.0° to 124.2° W longitude in the United States. Methods: We analysed data collected by the USDA Forest Inventory and Analysis program for plant species richness partitioned into different growth forms, in small plots. We used actual evapotranspiration as a macroclimatic estimate of regional productivity and calculated the area of light-blocking tissue in the immediate area surrounding plots for an estimate of the intensity of local shading. We estimated and compared SRPRs for different partitions of the species richness dataset using generalized linear models and we incorporated the possible indirect effects of shading using a structural equation model. Results: Canopy tree species richness increased strongly with regional productivity, while local shading primarily explained the variation in herbaceous plant richness. Shrub species richness was related to both regional productivity and local shading. Main conclusions: The relationship between total forest plant species richness and productivity at large scales belies strong effects of local interactions. Counter to the pattern for overall richness, we found that understorey herbaceous plant species richness does not respond to regional productivity gradients, and instead is strongly influenced by canopy density, while shrub species

  6. Multivariate Statistical Analysis of Water Quality data in Indian River Lagoon, Florida

    Science.gov (United States)

    Sayemuzzaman, M.; Ye, M.

    2015-12-01

    The Indian River Lagoon, part of the longest barrier island complex in the United States, is a region of particular concern to environmental scientists because of the rapid rate of human development throughout the region and its geographical position between the colder temperate zone and the warmer sub-tropical zone. Surface water quality analysis in this region therefore continues to yield new information. In the present study, multivariate statistical procedures were applied to analyze the spatial and temporal water quality in the Indian River Lagoon over the period 1998-2013. Twelve parameters were analyzed at twelve key water monitoring stations in and beside the lagoon using monthly datasets (a total of 27,648 observations). The dataset was treated using cluster analysis (CA), principal component analysis (PCA) and non-parametric trend analysis. The CA was used to cluster the twelve monitoring stations into four groups, with stations having similar surrounding characteristics placed in the same group. The PCA was then applied to each group to find the important water quality parameters. The principal components (PCs) PC1 to PC5 were retained based on explained cumulative variances of 75% to 85% in each cluster group. Nutrient species (phosphorus and nitrogen), salinity, specific conductivity and erosion factors (TSS, turbidity) were the major variables involved in the construction of the PCs. Statistically significant positive or negative trends and abrupt trend shifts were detected by applying the Mann-Kendall trend test and the Sequential Mann-Kendall (SQMK) test to each individual station for the important water quality parameters. Land use and land cover change patterns, local anthropogenic activities and extreme climate events such as drought might be associated with these trends. This study presents a multivariate statistical assessment intended to provide better information about the quality of surface water.
Thus, effective pollution control/management of the surface

  7. Environmental Performance in Countries Worldwide: Determinant Factors and Multivariate Analysis

    Directory of Open Access Journals (Sweden)

    Isabel Gallego-Alvarez

    2014-11-01

    Full Text Available The aim of this study is to analyze the environmental performance of countries and the variables that can influence it. At the same time, we performed a multivariate analysis using the HJ-biplot, an exploratory method that looks for hidden patterns in the data, obtained from the usual singular value decomposition (SVD) of the data matrix, to contextualize the countries grouped by geographical areas and the variables relating to environmental indicators included in the environmental performance index. The sample used comprises 149 countries of different geographic areas. The findings obtained from the empirical analysis emphasize that socioeconomic factors, such as economic wealth and education, as well as institutional factors represented by the style of public administration, in particular control of corruption, are determinant factors of environmental performance in the countries analyzed. In contrast, no effect on environmental performance was found for factors relating to the internal characteristics of a country or political factors.

  8. Multivariate statistical analysis of radioactive variables in two phosphate ores from Sudan

    International Nuclear Information System (INIS)

    Adam, Abdel Majid A.; Eltayeb, Mohamed Ahmed H.

    2012-01-01

    Multivariate statistical techniques are efficient ways to display complex relationships among many objects. An attempt was made to study the radioactive data in two types of Sudanese phosphate deposits, Kurun and Uro phosphate, using several multivariate statistical methods. The Pearson correlation coefficient revealed that the U-238 distribution in Kurun phosphate is controlled by the variation of K-40 concentration, whereas in Uro phosphate it is controlled by the variation of U-235 and U-234 concentration. Histograms and normal Q–Q plots clearly show that the radioactive variables did not follow a normal distribution. This non-normality may be attributed to the complicating influence of geological factors. The principal components analysis (PCA) gives a model of five components for representing the acquired data from Kurun phosphate, where 89.5% of the total variance is explained. A model of four components was sufficient to represent the acquired data from Uro phosphate, where 87.5% of the total data variance is explained. The hierarchical cluster analysis (HCA) indicates that U-238 behaves in the same manner in the two types of phosphates; it is associated with a group of four radionuclides, U-234, Po-210, Ra-226 and Th-230, which are the most abundant radionuclides and all belong to the uranium-238 decay series. Two parameters have been adopted for direct differentiation between the two phosphates. Firstly, U-238 in Uro phosphate has shown a higher degree of mobility (CV% = 82.6) than that in Kurun phosphate (CV% = 64.7), and secondly, the activity ratio of Th-230/Th-232 in Uro phosphate is nine times that in Kurun phosphate. - Highlights: ► Multivariate statistical techniques were used to characterize radioactive data. ► U-238 in Uro phosphate shows higher degree of mobility (CV% = 82.6). ► U-238 in Kurun phosphate shows lower degree of mobility (CV% = 64.7). ► The radioactive variables did not follow a normal distribution. ► The ratio of Th

  9. The application of ATR-FTIR spectroscopy and multivariate data analysis to study drug crystallisation in the stratum corneum.

    Science.gov (United States)

    Goh, Choon Fu; Craig, Duncan Q M; Hadgraft, Jonathan; Lane, Majella E

    2017-02-01

    Drug permeation through the intercellular lipids, which pack around and between corneocytes, may be enhanced by increasing the thermodynamic activity of the active in a formulation. However, this may also result in unwanted drug crystallisation on and in the skin. In this work, we explore the combination of ATR-FTIR spectroscopy and multivariate data analysis to study drug crystallisation in the skin. Ex vivo permeation studies of saturated solutions of diclofenac sodium (DF Na) in two vehicles, propylene glycol (PG) and dimethyl sulphoxide (DMSO), were carried out in porcine ear skin. Tape stripping and ATR-FTIR spectroscopy were conducted simultaneously to collect spectral data as a function of skin depth. Multivariate data analysis was applied to visualise and categorise the spectral data in the region of interest (1700-1500 cm⁻¹) containing the carboxylate (COO⁻) asymmetric stretching vibrations of DF Na. Spectral data showed the redshifts of the COO⁻ asymmetric stretching vibrations for DF Na in the solution compared with solid drug. Similar shifts were evident following application of saturated solutions of DF Na to porcine skin samples. Multivariate data analysis categorised the spectral data based on the spectral differences and drug crystallisation was found to be confined to the upper layers of the skin. This proof-of-concept study highlights the utility of ATR-FTIR spectroscopy in combination with multivariate data analysis as a simple and rapid approach in the investigation of drug deposition in the skin. The approach described here will be extended to the study of other actives for topical application to the skin. Copyright © 2016 Elsevier B.V. All rights reserved.

  10. An uncertain journey around the tails of multivariate hydrological distributions

    Science.gov (United States)

    Serinaldi, Francesco

    2013-10-01

    Moving from univariate to multivariate frequency analysis, this study extends the Klemeš' critique of the widespread belief that the increasingly refined mathematical structures of probability functions increase the accuracy and credibility of the extrapolated upper tails of the fitted distribution models. In particular, we discuss key aspects of multivariate frequency analysis applied to hydrological data such as the selection of multivariate design events (i.e., appropriate subsets or scenarios of multiplets that exhibit the same joint probability to be used in design applications) and the assessment of the corresponding uncertainty. Since these problems are often overlooked or treated separately, and sometimes confused, we attempt to clarify properties, advantages, shortcomings, and reliability of results of frequency analysis. We suggest a selection method of multivariate design events with prescribed joint probability based on simple Monte Carlo simulations that accounts for the uncertainty affecting the inference results and the multivariate extreme quantiles. It is also shown that the exploration of the p-level probability regions of a joint distribution returns a set of events that is a subset of the p-level scenarios resulting from an appropriate assessment of the sampling uncertainty, thus tending to overlook more extreme and potentially dangerous events with the same (uncertain) joint probability. Moreover, a quantitative assessment of the uncertainty of multivariate quantiles is provided by introducing the concept of joint confidence intervals. From an operational point of view, the simulated event sets describing the distribution of the multivariate p-level quantiles can be used to perform multivariate risk analysis under sampling uncertainty. As an example of the practical implications of this study, we analyze two case studies already presented in the literature.

  11. Multivariate analysis of remote LIBS spectra using partial least squares, principal component analysis, and related techniques

    Energy Technology Data Exchange (ETDEWEB)

    Clegg, Samuel M [Los Alamos National Laboratory; Barefield, James E [Los Alamos National Laboratory; Wiens, Roger C [Los Alamos National Laboratory; Sklute, Elizabeth [MT HOLYOKE COLLEGE; Dyare, Melinda D [MT HOLYOKE COLLEGE

    2008-01-01

    Quantitative analysis with LIBS traditionally employs calibration curves that are complicated by the chemical matrix effects. These chemical matrix effects influence the LIBS plasma and the ratio of elemental composition to elemental emission line intensity. Consequently, LIBS calibration typically requires a priori knowledge of the unknown, in order for a series of calibration standards similar to the unknown to be employed. In this paper, three new Multivariate Analysis (MVA) techniques are employed to analyze the LIBS spectra of 18 disparate igneous and highly-metamorphosed rock samples. Partial Least Squares (PLS) analysis is used to generate a calibration model from which unknown samples can be analyzed. Principal Components Analysis (PCA) and Soft Independent Modeling of Class Analogy (SIMCA) are employed to generate a model and predict the rock type of the samples. These MVA techniques appear to exploit the matrix effects associated with the chemistries of these 18 samples.

  12. A Review of Multivariate Distributions for Count Data Derived from the Poisson Distribution.

    Science.gov (United States)

    Inouye, David; Yang, Eunho; Allen, Genevera; Ravikumar, Pradeep

    2017-01-01

    The Poisson distribution has been widely studied and used for modeling univariate count-valued data. Multivariate generalizations of the Poisson distribution that permit dependencies, however, have been far less popular. Yet, real-world high-dimensional count-valued data found in word counts, genomics, and crime statistics, for example, exhibit rich dependencies, and motivate the need for multivariate distributions that can appropriately model this data. We review multivariate distributions derived from the univariate Poisson, categorizing these models into three main classes: 1) where the marginal distributions are Poisson, 2) where the joint distribution is a mixture of independent multivariate Poisson distributions, and 3) where the node-conditional distributions are derived from the Poisson. We discuss the development of multiple instances of these classes and compare the models in terms of interpretability and theory. Then, we empirically compare multiple models from each class on three real-world datasets that have varying data characteristics from different domains, namely traffic accident data, biological next generation sequencing data, and text data. These empirical experiments develop intuition about the comparative advantages and disadvantages of each class of multivariate distribution that was derived from the Poisson. Finally, we suggest new research directions as explored in the subsequent discussion section.
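
    As a rough illustration of the first class named in the abstract (multivariate models whose marginals are Poisson), the sketch below uses the classical common-shock construction: each count is the sum of its own independent Poisson term and one shared Poisson term. The function names, rates, seed and sample size are illustrative assumptions and are not taken from the reviewed paper.

```python
import math
import random


def sample_poisson(rate, rng):
    """Draw one Poisson variate with Knuth's multiplication method (fine for small rates)."""
    limit = math.exp(-rate)
    count, product = 0, rng.random()
    while product > limit:
        count += 1
        product *= rng.random()
    return count


def sample_common_shock(rate1, rate2, shared_rate, rng):
    """Bivariate draw with Poisson marginals: X1 = Y1 + Y0 and X2 = Y2 + Y0."""
    shared = sample_poisson(shared_rate, rng)
    return (sample_poisson(rate1, rng) + shared,
            sample_poisson(rate2, rng) + shared)


# Hypothetical rates: marginal means are 2 + 1 and 3 + 1, covariance is the shared rate 1.
rng = random.Random(42)
draws = [sample_common_shock(2.0, 3.0, 1.0, rng) for _ in range(20000)]

mean1 = sum(a for a, _ in draws) / len(draws)
mean2 = sum(b for _, b in draws) / len(draws)
cov = sum((a - mean1) * (b - mean2) for a, b in draws) / (len(draws) - 1)
```

    In this construction the covariance equals the shared rate, so it can only represent non-negative dependence, which is one of the limitations that motivates the other model classes.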

  13. Evaluation of Co-rich manganese deposits by image analysis and photogrammetric techniques

    Digital Repository Service at National Institute of Oceanography (India)

    Yamazaki, T.; Sharma, R.; Tsurusaki, K.

    Stereo-seabed photographs of Co-rich manganese deposits on a mid-Pacific seamount, were analysed using an image analysis software for coverage estimation and size classification of nodules, and a photogrammetric software for calculation of height...

  14. A model to explain plant growth promotion traits: a multivariate analysis of 2,211 bacterial isolates.

    Directory of Open Access Journals (Sweden)

    Pedro Beschoren da Costa

    Full Text Available Plant growth-promoting bacteria can greatly assist sustainable farming by improving plant health and biomass while reducing fertilizer use. The plant-microorganism-environment interaction is an open and complex system, and despite the active research in the area, patterns in root ecology are elusive. Here, we simultaneously analyzed the plant growth-promoting bacteria datasets from seven independent studies that shared a methodology for bioprospection and phenotype screening. The soil richness of the isolate's origin was classified by a Principal Component Analysis. A Categorical Principal Component Analysis was used to classify the soil richness according to isolate's indolic compound production, siderophores production and phosphate solubilization abilities, and bacterial genera composition. Multiple patterns and relationships were found and verified with nonparametric hypothesis testing. Including niche colonization in the analysis, we proposed a model to explain the expression of bacterial plant growth-promoting traits according to the soil nutritional status. Our model shows that plants favor interaction with growth hormone producers under rich nutrient conditions but favor nutrient solubilizers under poor conditions. We also performed several comparisons among the different genera, highlighting interesting ecological interactions and limitations. Our model could be used to direct plant growth-promoting bacteria bioprospection and metagenomic sampling.

  15. Multivariate Sensitivity Analysis of Time-of-Flight Sensor Fusion

    Science.gov (United States)

    Schwarz, Sebastian; Sjöström, Mårten; Olsson, Roger

    2014-09-01

    Obtaining three-dimensional scenery data is an essential task in computer vision, with diverse applications in various areas such as manufacturing and quality control, security and surveillance, or user interaction and entertainment. Dedicated Time-of-Flight sensors can provide detailed scenery depth in real-time and overcome shortcomings of traditional stereo analysis. Nonetheless, they do not provide texture information and have limited spatial resolution. Therefore such sensors are typically combined with high resolution video sensors. Time-of-Flight Sensor Fusion is a highly active field of research. In recent years, there have been multiple proposals addressing important topics such as texture-guided depth upsampling and depth data denoising. In this article we take a step back and look at the underlying principles of ToF sensor fusion. We derive the ToF sensor fusion error model and evaluate its sensitivity to inaccuracies in camera calibration and depth measurements. In accordance with our findings, we propose certain courses of action to ensure high quality fusion results. With this multivariate sensitivity analysis of the ToF sensor fusion model, we provide an important guideline for designing, calibrating and running sophisticated Time-of-Flight sensor fusion capture systems.

  16. Multivariate analysis of attachment of biofouling organisms in response to material surface characteristics.

    Science.gov (United States)

    Gatley-Montross, Caitlyn M; Finlay, John A; Aldred, Nick; Cassady, Harrison; Destino, Joel F; Orihuela, Beatriz; Hickner, Michael A; Clare, Anthony S; Rittschof, Daniel; Holm, Eric R; Detty, Michael R

    2017-12-29

    Multivariate analyses were used to investigate the influence of selected surface properties (Owens-Wendt surface energy and its dispersive and polar components, static water contact angle, conceptual sign of the surface charge, zeta potentials) on the attachment patterns of five biofouling organisms (Amphibalanus amphitrite, Amphibalanus improvisus, Bugula neritina, Ulva linza, and Navicula incerta) to better understand what surface properties drive attachment across multiple fouling organisms. A library of ten xerogel coatings and a glass standard provided a range of values for the selected surface properties to compare to biofouling attachment patterns. Results from the surface characterization and biological assays were analyzed separately and in combination using multivariate statistical methods. Principal coordinate analysis of the surface property characterization and the biological assays resulted in different groupings of the xerogel coatings. In particular, the biofouling organisms were able to distinguish four coatings that were not distinguishable by the surface properties of this study. The authors used canonical analysis of principal coordinates (CAP) to identify surface properties governing attachment across all five biofouling species. The CAP pointed to surface energy and surface charge as important drivers of patterns in biological attachment, but also suggested that differentiation of the surfaces was influenced to a comparable or greater extent by the dispersive component of surface energy.

  17. Multivariate genetic analysis of brain structure in an extended twin design

    DEFF Research Database (Denmark)

    Posthuma, D; de Geus, E.J.; Neale, M.C.

    2000-01-01

    quantitative scale and thus can be assessed in affected and unaffected individuals. Continuous measures increase the statistical power to detect genetic effects (Neale et al., 1994), and allow studies to be designed to collect data from informative subjects such as extreme concordant or discordant pairs....... Intermediate phenotypes for discrete traits, such as psychiatric disorders, can be neurotransmitter levels, brain function, or structure. In this paper we conduct a multivariate analysis of data from 111 twin pairs and 34 additional siblings on cerebellar volume, intracranial space, and body height....... The analysis is carried out on the raw data and specifies a model for the mean and the covariance structure. Results suggest that cerebellar volume and intracranial space vary with age and sex. Brain volumes tend to decrease slightly with age, and males generally have a larger brain volume than females...

  18. Multivariate analysis of microarray data: differential expression and differential connection

    Directory of Open Access Journals (Sweden)

    Kiiveri Harri T

    2011-02-01

    Full Text Available Abstract Background Typical analysis of microarray data ignores the correlation between gene expression values. In this paper we present a model for microarray data which specifically allows for correlation between genes. As a result we combine gene network ideas with linear models and differential expression. Results We use sparse inverse covariance matrices and their associated graphical representation to capture the notion of gene networks. An important issue in using these models is the identification of the pattern of zeroes in the inverse covariance matrix. The limitations of existing methods for doing this are discussed and we provide a workable solution for determining the zero pattern. We then consider a method for estimating the parameters in the inverse covariance matrix which is suitable for very high dimensional matrices. We also show how to construct multivariate tests of hypotheses. These overall multivariate tests can be broken down into two components, the first one being similar to tests for differential expression and the second involving the connections between genes. Conclusion The methods in this paper enable the extraction of a wealth of information concerning the relationships between genes which can be conveniently represented in graphical form. Differentially expressed genes can be placed in the context of the gene network and places in the gene network where unusual or interesting patterns have emerged can be identified, leading to the formulation of hypotheses for future experimentation.
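
    The link between zeroes in the inverse covariance matrix and the gene network can be sketched as follows. For Gaussian data, a zero off-diagonal entry of the inverse covariance (precision) matrix means the two genes are conditionally independent given the others, so no edge is drawn; non-zero entries give partial correlations. The matrix, gene names and tolerance below are hypothetical, and this is not the estimation method of the paper, only a reading of an already estimated matrix.

```python
from math import sqrt


def partial_correlations(precision):
    """Partial correlation rho_ij = -theta_ij / sqrt(theta_ii * theta_jj)."""
    size = len(precision)
    return [
        [
            1.0 if i == j
            else -precision[i][j] / sqrt(precision[i][i] * precision[j][j])
            for j in range(size)
        ]
        for i in range(size)
    ]


def network_edges(precision, names, tol=1e-12):
    """List gene pairs whose precision entry is non-zero (an edge in the graph)."""
    edges = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):
            if abs(precision[i][j]) > tol:
                edges.append((names[i], names[j]))
    return edges


# Hypothetical sparse precision matrix for three genes (positive definite).
precision = [
    [2.0, -1.0, 0.0],
    [-1.0, 2.0, -1.0],
    [0.0, -1.0, 2.0],
]
names = ["geneA", "geneB", "geneC"]

edges = network_edges(precision, names)
rho = partial_correlations(precision)
```

    Here geneA and geneC are each connected to geneB but not to each other, which is the kind of zero pattern the abstract refers to.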

  19. Pleiotropy analysis of quantitative traits at gene level by multivariate functional linear models.

    Science.gov (United States)

    Wang, Yifan; Liu, Aiyi; Mills, James L; Boehnke, Michael; Wilson, Alexander F; Bailey-Wilson, Joan E; Xiong, Momiao; Wu, Colin O; Fan, Ruzong

    2015-05-01

    In genetics, pleiotropy describes the genetic effect of a single gene on multiple phenotypic traits. A common approach is to analyze the phenotypic traits separately using univariate analyses and combine the test results through multiple comparisons. This approach may lead to low power. Multivariate functional linear models are developed to connect genetic variant data to multiple quantitative traits adjusting for covariates for a unified analysis. Three types of approximate F-distribution tests based on Pillai-Bartlett trace, Hotelling-Lawley trace, and Wilks's Lambda are introduced to test for association between multiple quantitative traits and multiple genetic variants in one genetic region. The approximate F-distribution tests provide much more significant results than those of F-tests of univariate analysis and optimal sequence kernel association test (SKAT-O). Extensive simulations were performed to evaluate the false positive rates and power performance of the proposed models and tests. We show that the approximate F-distribution tests control the type I error rates very well. Overall, simultaneous analysis of multiple traits can increase power performance compared to an individual test of each trait. The proposed methods were applied to analyze (1) four lipid traits in eight European cohorts, and (2) three biochemical traits in the Trinity Students Study. The approximate F-distribution tests provide much more significant results than those of F-tests of univariate analysis and SKAT-O for the three biochemical traits. The approximate F-distribution tests of the proposed functional linear models are more sensitive than those of the traditional multivariate linear models that in turn are more sensitive than SKAT-O in the univariate case. The analysis of the four lipid traits and the three biochemical traits detects more association than SKAT-O in the univariate case. © 2015 WILEY PERIODICALS, INC.
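
    For readers unfamiliar with the three test statistics named above, the sketch below computes them from the eigenvalues of E⁻¹H, where H and E are the hypothesis and error sums-of-squares-and-cross-products matrices of a multivariate linear model. The eigenvalues are made-up inputs, and the conversion to approximate F statistics used in the paper is deliberately not reproduced here.

```python
from math import prod


def manova_statistics(eigenvalues):
    """Return (Wilks's Lambda, Pillai-Bartlett trace, Hotelling-Lawley trace)."""
    wilks = prod(1.0 / (1.0 + lam) for lam in eigenvalues)
    pillai = sum(lam / (1.0 + lam) for lam in eigenvalues)
    hotelling_lawley = sum(eigenvalues)
    return wilks, pillai, hotelling_lawley


# Hypothetical eigenvalues of E^-1 H for a model with two response traits.
wilks, pillai, hotelling_lawley = manova_statistics([1.0, 0.25])
```

    Smaller values of Wilks's Lambda and larger values of the two traces indicate stronger evidence of association.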

  20. Multivariate analysis of the volatile components in tobacco based on infrared-assisted extraction coupled to headspace solid-phase microextraction and gas chromatography-mass spectrometry.

    Science.gov (United States)

    Yang, Yanqin; Pan, Yuanjiang; Zhou, Guojun; Chu, Guohai; Jiang, Jian; Yuan, Kailong; Xia, Qian; Cheng, Changhe

    2016-11-01

    A novel infrared-assisted extraction coupled to headspace solid-phase microextraction followed by gas chromatography with mass spectrometry method has been developed for the rapid determination of the volatile components in tobacco. The optimal extraction conditions for maximizing the extraction efficiency were as follows: 65 μm polydimethylsiloxane-divinylbenzene fiber, extraction time of 20 min, infrared power of 175 W, and distance between the infrared lamp and the headspace vial of 2 cm. Under the optimum conditions, 50 components were found to exist in all ten tobacco samples from different geographical origins. Compared with conventional water-bath heating and nonheating extraction methods, the extraction efficiency of infrared-assisted extraction was greatly improved. Furthermore, multivariate analyses including principal component analysis, hierarchical cluster analysis, and similarity analysis were performed to evaluate the chemical information of these samples and to divide them into three classes: rich, moderate, and fresh flavors. These classification results were consistent with the sensory evaluation, which was pivotal and meaningful for tobacco discrimination. As a simple, fast, cost-effective, and highly efficient method, the infrared-assisted extraction coupled to headspace solid-phase microextraction technique, combined with suitable chemometrics, is powerful and promising for distinguishing the geographical origins of tobacco samples. © 2016 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  1. Multivariate statistical analysis of a multi-step industrial processes

    DEFF Research Database (Denmark)

    Reinikainen, S.P.; Høskuldsson, Agnar

    2007-01-01

    Monitoring and quality control of industrial processes often produce information on how the data have been obtained. In batch processes, for instance, the process is carried out in stages; some process or control parameters are set at each stage. However, the obtained data might not be utilized efficiently, even if this information may reveal significant knowledge about process dynamics or ongoing phenomena. When studying the process data, it may be important to analyse the data in the light of the physical or time-wise development of each process step. In this paper, a unified approach to analyse multivariate multi-step processes, where results from each step are used to evaluate future results, is presented. The methods presented are based on Priority PLS Regression. The basic idea is to compute the weights in the regression analysis for given steps, but adjust all data by the resulting score vectors...

  2. Intelligent multivariate process supervision

    International Nuclear Information System (INIS)

    Visuri, Pertti.

    1986-01-01

    This thesis addresses the difficulties encountered in managing large amounts of data in supervisory control of complex systems. Some previous alarm and disturbance analysis concepts are reviewed and a method for improving the supervision of complex systems is presented. The method, called multivariate supervision, is based on adding low level intelligence to the process control system. By using several measured variables linked together by means of deductive logic, the system can take into account the overall state of the supervised system. Thus, it can present to the operators fewer messages with higher information content than the conventional control systems which are based on independent processing of each variable. In addition, the multivariate method contains a special information presentation concept for improving the man-machine interface. (author)

  3. Analysis of the stability and accuracy of the discrete least-squares approximation on multivariate polynomial spaces

    KAUST Repository

    Migliorati, Giovanni

    2016-01-05

    We review the main results achieved in the analysis of the stability and accuracy of the discrete least-squares approximation on multivariate polynomial spaces, with noiseless evaluations at random points, noiseless evaluations at low-discrepancy point sets, and noisy evaluations at random points.
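
    A minimal sketch of the first setting mentioned above, discrete least squares on a multivariate polynomial space with noiseless evaluations at random points, is given below. The polynomial space (total degree at most 2 in two variables), the target function, the uniform sampling on [-1, 1]² and the number of points are all illustrative assumptions; the normal equations are solved directly only because the example is tiny.

```python
import random


def basis(x, y):
    """Monomials of total degree <= 2 in two variables (an illustrative space)."""
    return [1.0, x, y, x * x, x * y, y * y]


def solve(matrix, rhs):
    """Solve a small dense linear system by Gaussian elimination with partial pivoting."""
    n = len(rhs)
    m = [row[:] + [b] for row, b in zip(matrix, rhs)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= factor * m[col][c]
    solution = [0.0] * n
    for i in range(n - 1, -1, -1):
        tail = sum(m[i][j] * solution[j] for j in range(i + 1, n))
        solution[i] = (m[i][n] - tail) / m[i][i]
    return solution


def discrete_least_squares(points, values):
    """Fit basis coefficients by minimising the sum of squared residuals at the points."""
    rows = [basis(x, y) for x, y in points]
    k = len(rows[0])
    gram = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    moment = [sum(r[i] * v for r, v in zip(rows, values)) for i in range(k)]
    return solve(gram, moment)


def target(x, y):
    """Hypothetical function that happens to lie in the polynomial space."""
    return 1.0 + 2.0 * x - 3.0 * x * y + 0.5 * y * y


rng = random.Random(0)
points = [(rng.uniform(-1, 1), rng.uniform(-1, 1)) for _ in range(60)]
values = [target(x, y) for x, y in points]
coeffs = discrete_least_squares(points, values)
```

    With 60 points for a 6-dimensional space the Gram matrix is comfortably well conditioned; stability questions of the kind reviewed above arise when the number of points is small relative to the dimension of the polynomial space.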

  4. Use of multivariate analysis to research career advancement of academic librarians

    Directory of Open Access Journals (Sweden)

    Filiberto Felipe Martínez Arellano

    2004-01-01

    Diverse variables dealing with credential factors, bureaucratic factors, organizational and disciplinary achievements, academic culture factors, socially ascribed factors, and institutional factors were stated as explanatory elements of promotion, tenure status, and earnings. A survey was the research instrument for collecting data to test these variables in relation to academic librarians' rewards and earnings. Since the study attempted to analyze variables in a multivariate context, variable interactions were tested using multiple regression analysis. Findings of this study contribute to a better understanding of those factors influencing career advancement of academic librarians. Likewise, the research methodology of this study could be used in Library and Information Science (LIS) research.

  5. Multivariate analysis of TOF-SIMS spectra of monolayers on scribed silicon.

    Science.gov (United States)

    Yang, Li; Lua, Yit-Yian; Jiang, Guilin; Tyler, Bonnie J; Linford, Matthew R

    2005-07-15

    Static time-of-flight secondary ion mass spectrometry (TOF-SIMS) was performed on monolayers on scribed silicon (Si(scr)) derived from 1-alkenes, 1-alkynes, 1-haloalkanes, aldehydes, and acid chlorides. To rapidly determine the variation in the data without introducing user bias, a multivariate analysis was performed. First, principal components analysis (PCA) was done on data obtained from silicon scribed with homologous series of aldehydes and acid chlorides. For this study, the positive ion spectra, the negative ion spectra, and the concatenated (linked) positive and negative ion spectra were preprocessed by normalization, mean centering, and autoscaling. The mean centered data consistently showed the best correlations between the scores on PC1 and the number of carbon atoms in the adsorbate. These correlations were not as strong for the normalized and autoscaled data. After reviewing these methods, it was concluded that mean centering is the best preprocessing method for TOF-SIMS spectra of monolayers on Si(scr). A PCA analysis of all of the positive ion spectra revealed a good correlation between the number of carbon atoms in all of the adsorbates and the scores on PC1. PCA of all of the negative ion spectra and the concatenated positive and negative ion spectra showed a correlation based on the number of carbon atoms in the adsorbate and the class of the adsorbate. These results imply that the positive ion spectra are most sensitive to monolayer thickness, while the negative ion spectra are sensitive to the nature of the substrate-monolayer interface and the monolayer thickness. Loadings show an inverse relationship between (inorganic) fragments that are expected from the substrate and (organic) fragments expected from the monolayer. Multivariate peak intensity ratios were derived. It is also suggested that PCA can be used to detect outlier surfaces. Partial least squares showed a strong correlation between the number of carbon atoms in the adsorbate and the...
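
    The three preprocessing options compared in this abstract can be sketched as follows. This is a minimal illustration using only the Python standard library; the function names and the small peak-intensity table are hypothetical and are not taken from the study.

```python
from statistics import mean, stdev

def normalize(spectrum):
    """Scale one spectrum so that its peak intensities sum to 1."""
    total = sum(spectrum)
    return [x / total for x in spectrum]

def mean_center(rows):
    """Subtract each column (peak) mean, computed across all spectra."""
    col_means = [mean(col) for col in zip(*rows)]
    return [[x - m for x, m in zip(row, col_means)] for row in rows]

def autoscale(rows):
    """Mean-center, then divide each column by its sample standard deviation."""
    cols = list(zip(*rows))
    mus = [mean(c) for c in cols]
    sds = [stdev(c) for c in cols]
    return [[(x - m) / s for x, m, s in zip(row, mus, sds)] for row in rows]

# Hypothetical peak-intensity table: rows are spectra, columns are peaks.
spectra = [[10.0, 200.0, 5.0], [12.0, 260.0, 4.0], [14.0, 320.0, 3.0]]
centered = mean_center(spectra)   # keeps the original intensity scale of each peak
scaled = autoscale(spectra)       # gives every peak unit variance
```

    Mean centering leaves intense peaks with more weight in a subsequent PCA, whereas autoscaling weights all peaks equally; which is preferable depends on the data, and the abstract reports that mean centering worked best for these spectra.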

  6. Fast Detection of Copper Content in Rice by Laser-Induced Breakdown Spectroscopy with Uni- and Multivariate Analysis

    Science.gov (United States)

    Ye, Lanhan; Song, Kunlin; Shen, Tingting

    2018-01-01

    Fast detection of heavy metals is very important for ensuring the quality and safety of crops. Laser-induced breakdown spectroscopy (LIBS), coupled with uni- and multivariate analysis, was applied for quantitative analysis of copper in three kinds of rice (Jiangsu rice, regular rice, and Simiao rice). For univariate analysis, three pre-processing methods were applied to reduce fluctuations, including background normalization, the internal standard method, and the standard normal variate (SNV). Linear regression models showed a strong correlation between spectral intensity and Cu content, with an R² of more than 0.97. The limit of detection (LOD) was around 5 ppm, lower than the tolerance limit of copper in foods. For multivariate analysis, partial least squares regression (PLSR) showed its advantage in extracting effective information for prediction, and its sensitivity reached 1.95 ppm, while support vector machine regression (SVMR) performed better in both calibration and prediction sets, where Rc² and Rp² reached 0.9979 and 0.9879, respectively. This study showed that LIBS could be considered as a constructive tool for the quantification of copper contamination in rice. PMID:29495445
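
    The univariate part of this workflow (SNV pre-processing, a linear calibration line, and a limit of detection) can be sketched as below. The calibration data and the blank standard deviation are hypothetical, and the LOD is computed with the common 3σ/slope convention, which is an assumption here because the abstract does not state the formula used.

```python
from statistics import mean, stdev

def snv(spectrum):
    """Standard normal variate: center and scale one spectrum by its own mean and SD."""
    m, s = mean(spectrum), stdev(spectrum)
    return [(x - m) / s for x in spectrum]

def fit_line(x, y):
    """Ordinary least-squares slope, intercept and R^2 for y = slope * x + intercept."""
    mx, my = mean(x), mean(y)
    sxx = sum((xi - mx) ** 2 for xi in x)
    sxy = sum((xi - mx) * (yi - my) for xi, yi in zip(x, y))
    slope = sxy / sxx
    intercept = my - slope * mx
    ss_res = sum((yi - (slope * xi + intercept)) ** 2 for xi, yi in zip(x, y))
    ss_tot = sum((yi - my) ** 2 for yi in y)
    return slope, intercept, 1.0 - ss_res / ss_tot

# Hypothetical calibration set (not data from the study).
conc_ppm = [0.0, 10.0, 20.0, 40.0, 80.0]
intensity = [1.0, 21.0, 41.0, 81.0, 161.0]
blank_sd = 0.5  # assumed standard deviation of repeated blank measurements

slope, intercept, r2 = fit_line(conc_ppm, intensity)
lod_ppm = 3 * blank_sd / slope  # assumed 3-sigma convention
```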

  7. Multivariate recurrence network analysis for characterizing horizontal oil-water two-phase flow.

    Science.gov (United States)

    Gao, Zhong-Ke; Zhang, Xin-Wang; Jin, Ning-De; Marwan, Norbert; Kurths, Jürgen

    2013-09-01

    Characterizing complex patterns arising from horizontal oil-water two-phase flows is a contemporary and challenging problem of paramount importance. We design a new multisector conductance sensor and systematically carry out horizontal oil-water two-phase flow experiments for measuring multivariate signals of different flow patterns. We then infer multivariate recurrence networks from these experimental data and investigate local cross-network properties for each constructed network. Our results demonstrate that a cross-clustering coefficient from a multivariate recurrence network is very sensitive to transitions among different flow patterns and recovers quantitative insights into the flow behavior underlying horizontal oil-water flows. These properties render multivariate recurrence networks particularly powerful for investigating a horizontal oil-water two-phase flow system and its complex interacting components from a network perspective.

  8. SPICE: exploration and analysis of post-cytometric complex multivariate datasets.

    Science.gov (United States)

    Roederer, Mario; Nozzi, Joshua L; Nason, Martha C

    2011-02-01

    Polychromatic flow cytometry results in complex, multivariate datasets. To date, tools for the aggregate analysis of these datasets across multiple specimens grouped by different categorical variables, such as demographic information, have not been optimized. Often, the exploration of such datasets is accomplished by visualization of patterns with pie charts or bar charts, without easy access to statistical comparisons of measurements that comprise multiple components. Here we report on algorithms and a graphical interface we developed for these purposes. In particular, we discuss thresholding necessary for accurate representation of data in pie charts, the implications for display and comparison of normalized versus unnormalized data, and the effects of averaging when samples with significant background noise are present. Finally, we define a statistic for the nonparametric comparison of complex distributions to test for difference between groups of samples based on multi-component measurements. While originally developed to support the analysis of T cell functional profiles, these techniques are amenable to a broad range of datatypes. Published 2011 Wiley-Liss, Inc.

  9. Energy-dispersive X-ray fluorescence analysis of organic-rich soils and sediments

    International Nuclear Information System (INIS)

    Parekh, P.P.

    1981-01-01

    A method has been developed for elemental analysis of environmental samples of soils and sediments rich in organic matter by energy-dispersive X-ray fluorescence spectrometry. It consists of three steps: (i) determining the apparent concentration of elements by using calibration coefficients based on geochemical standards, (ii) subsequent assay of the total organic matter (TOM) from loss on ignition at 550 °C, and (iii) evaluating the correct elemental concentration by normalizing for transparency from an empirical relationship. The main feature of the method is the sample analysis prior to ignition, which avoids any loss of trace elements, especially the volatile toxic elements such as Zn, As, Se, and Pb, during heating. The method was tested on two organic-rich lake sediments (TOM > 30%). Concentrations of five elements (K, Mn, Fe, Zn, and Pb) determined by the present method and by atomic absorption spectrometry agreed within about ±10%. (author)

  10. Monitoring Quality of Biotherapeutic Products Using Multivariate Data Analysis.

    Science.gov (United States)

    Rathore, Anurag S; Pathak, Mili; Jain, Renu; Jadaun, Gaurav Pratap Singh

    2016-07-01

    Monitoring the quality of pharmaceutical products is a global challenge, heightened by the implications of letting subquality drugs come to the market on public safety. Regulatory agencies do their due diligence at the time of approval as per their prescribed regulations. However, product quality needs to be monitored post-approval as well to ensure patient safety throughout the product life cycle. This is particularly complicated for biotechnology-based therapeutics where seemingly minor changes in process and/or raw material attributes have been shown to have a significant effect on clinical safety and efficacy of the product. This article provides a perspective on the topic of monitoring the quality of biotech therapeutics. In the backdrop of challenges faced by the regulatory agencies, the potential use of multivariate data analysis as a tool for effective monitoring has been proposed. Case studies using data from several insulin biosimilars have been used to illustrate the key concepts.

  11. Multivariate geomorphic analysis of forest streams: Implications for assessment of land use impacts on channel condition

    Science.gov (United States)

    Richard D. Wood-Smith; John M. Buffington

    1996-01-01

    Multivariate statistical analyses of geomorphic variables from 23 forest stream reaches in southeast Alaska result in successful discrimination between pristine streams and those disturbed by land management, specifically timber harvesting and associated road building. Results of discriminant function analysis indicate that a three-variable model discriminates 10...

  12. Pain in diagnostic hysteroscopy: a multivariate analysis after a randomized, controlled trial.

    Science.gov (United States)

    Mazzon, Ivan; Favilli, Alessandro; Grasso, Mario; Horvath, Stefano; Bini, Vittorio; Di Renzo, Gian Carlo; Gerli, Sandro

    2014-11-01

    To study which variables are able to influence women's experience of pain during diagnostic hysteroscopy. Multivariate analysis (phase II) after a randomized, controlled trial (phase I). Endoscopic gynecologic center. In phase I, 392 patients were analyzed. Group A: 197 women with carbon dioxide (CO2); group B: 195 women with normal saline. In phase II, 392 patients were assigned to two different groups according to their pain experience as measured by a visual analogue scale (VAS): group VAS>3 (170 patients); group VAS≤3 (222 patients). Free-anesthesia diagnostic hysteroscopy performed using CO2 or normal saline as distension media. Procedure time, VAS score, image quality, and side effects during and after diagnostic hysteroscopy. In phase I the median pain score in group A was 2, whereas in group B it was 3. In phase II the duration of the procedure, nulliparity, and the use of normal saline were significantly correlated with VAS>3. A higher presence of cervical synechiae was observed in the group VAS>3. The multivariate analysis revealed an inverse correlation between parity and a VAS>3, whereas the use of normal saline, the presence of synechiae in the cervical canal, and the duration of the hysteroscopy were all directly correlated to a VAS score>3. Pain in hysteroscopy is significantly related to the presence of cervical synechiae, to the duration of the procedure, and to the use of normal saline; conversely, parity seems to have a protective role. NCT01873391. Copyright © 2014 American Society for Reproductive Medicine. Published by Elsevier Inc. All rights reserved.

  13. Drunk driving detection based on classification of multivariate time series.

    Science.gov (United States)

    Li, Zhenlong; Jin, Xue; Zhao, Xiaohua

    2015-09-01

    This paper addresses the problem of detecting drunk driving based on classification of multivariate time series. First, driving performance measures were collected from a test in a driving simulator located in the Traffic Research Center, Beijing University of Technology. Lateral position and steering angle were used to detect drunk driving. Second, multivariate time series analysis was performed to extract the features. A piecewise linear representation was used to represent multivariate time series. A bottom-up algorithm was then employed to separate multivariate time series. The slope and time interval of each segment were extracted as the features for classification. Third, a support vector machine classifier was used to classify driver's state into two classes (normal or drunk) according to the extracted features. The proposed approach achieved an accuracy of 80.0%. Drunk driving detection based on the analysis of multivariate time series is feasible and effective. The approach has implications for drunk driving detection. Copyright © 2015 Elsevier Ltd and National Safety Council. All rights reserved.
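
    The feature-extraction step described above (bottom-up piecewise linear segmentation, then slope and time interval per segment) can be sketched as follows. This is a generic bottom-up merge under a hypothetical error threshold `max_error`; the abstract does not give the authors' cost function or stopping rule, and the toy signal is invented for illustration.

```python
def fit_error(t, y):
    """Sum of squared residuals of the least-squares line through the points (t, y)."""
    n = len(t)
    mt, my = sum(t) / n, sum(y) / n
    stt = sum((a - mt) ** 2 for a in t)
    slope = sum((a - mt) * (b - my) for a, b in zip(t, y)) / stt if stt else 0.0
    return sum((b - (my + slope * (a - mt))) ** 2 for a, b in zip(t, y))

def bottom_up(t, y, max_error):
    """Start from the finest segments and repeatedly merge the cheapest adjacent pair."""
    segs = [(i, i + 1) for i in range(len(t) - 1)]  # (start, end) indices, inclusive
    while len(segs) > 1:
        costs = [fit_error(t[a:d + 1], y[a:d + 1])
                 for (a, _), (_, d) in zip(segs, segs[1:])]
        best = min(range(len(costs)), key=costs.__getitem__)
        if costs[best] > max_error:
            break  # every remaining merge would exceed the allowed error
        segs[best:best + 2] = [(segs[best][0], segs[best + 1][1])]
    return segs

def segment_features(t, y, segs):
    """Return (slope, time interval) for each segment, computed from its end points."""
    return [((y[b] - y[a]) / (t[b] - t[a]), t[b] - t[a]) for a, b in segs]

# Toy signal standing in for one channel such as steering angle (hypothetical values).
t = [float(i) for i in range(9)]
y = [0.0, 1.0, 2.0, 3.0, 4.0, 3.0, 2.0, 1.0, 0.0]
segs = bottom_up(t, y, max_error=0.1)
feats = segment_features(t, y, segs)
```

    The resulting (slope, interval) pairs would then be passed to a classifier; the abstract names a support vector machine for that step, which is not reproduced here.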

  14. Classification of Malaysia aromatic rice using multivariate statistical analysis

    Energy Technology Data Exchange (ETDEWEB)

    Abdullah, A. H.; Adom, A. H.; Shakaff, A. Y. Md; Masnan, M. J.; Zakaria, A.; Rahim, N. A. [School of Mechatronic Engineering, Universiti Malaysia Perlis, Kampus Pauh Putra, 02600 Arau, Perlis (Malaysia); Omar, O. [Malaysian Agriculture Research and Development Institute (MARDI), Persiaran MARDI-UPM, 43400 Serdang, Selangor (Malaysia)

    2015-05-15

    Aromatic rice (Oryza sativa L.) is considered the best-quality premium rice. These varieties are preferred by consumers for criteria such as shape, colour, distinctive aroma and flavour. The price of aromatic rice is higher than that of ordinary rice because of its special growing requirements, for instance a specific climate and soil. Presently, aromatic rice quality is identified using its key elements and isotopic variables. The rice can also be classified via Gas Chromatography Mass Spectrometry (GC-MS) or human sensory panels. However, human sensory panels have significant drawbacks: they require lengthy training, become fatigued as the number of samples increases, and are inconsistent. GC-MS analysis, on the other hand, requires detailed procedures and lengthy analysis and is quite costly. This paper presents the application of an in-house developed Electronic Nose (e-nose) to classify new aromatic rice varieties. The e-nose classifies the variety of aromatic rice based on the odour of the samples. The samples were taken from the rice varieties. The instrument utilizes multivariate statistical data analysis, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and K-Nearest Neighbours (KNN), to classify the unknown rice samples. The Leave-One-Out (LOO) validation approach is applied to evaluate the ability of KNN to recognize and classify the unspecified samples. Visual inspection of the PCA and LDA plots shows that the instrument was able to separate the samples into distinct clusters. The low misclassification errors of LDA and KNN support these findings, and we may conclude that the e-nose was successfully applied to the classification of the aromatic rice varieties.
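
    The validation step mentioned in this abstract, K-Nearest Neighbours evaluated by Leave-One-Out, can be sketched as below. The two-dimensional "sensor responses" and the class labels are hypothetical, and k = 3 is an assumed setting; the abstract does not report the feature set or the value of k.

```python
from collections import Counter
from math import dist

def knn_predict(train_x, train_y, query, k=3):
    """Majority vote among the k training points closest to the query (Euclidean)."""
    nearest = sorted(zip(train_x, train_y), key=lambda p: dist(p[0], query))[:k]
    return Counter(label for _, label in nearest).most_common(1)[0][0]

def loo_error(xs, ys, k=3):
    """Leave-one-out misclassification rate: hold out each sample once and predict it."""
    wrong = 0
    for i in range(len(xs)):
        rest_x = xs[:i] + xs[i + 1:]
        rest_y = ys[:i] + ys[i + 1:]
        if knn_predict(rest_x, rest_y, xs[i], k) != ys[i]:
            wrong += 1
    return wrong / len(xs)

# Hypothetical two-sensor responses for two rice varieties "A" and "B".
xs = [(0.0, 0.1), (0.1, 0.0), (0.2, 0.1), (0.1, 0.2),
      (5.0, 5.1), (5.1, 5.0), (5.2, 5.1), (5.1, 5.2)]
ys = ["A"] * 4 + ["B"] * 4
error_rate = loo_error(xs, ys, k=3)
```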

  15. Classification of Malaysia aromatic rice using multivariate statistical analysis

    Science.gov (United States)

    Abdullah, A. H.; Adom, A. H.; Shakaff, A. Y. Md; Masnan, M. J.; Zakaria, A.; Rahim, N. A.; Omar, O.

    2015-05-01

    Aromatic rice (Oryza sativa L.) is considered the best-quality premium rice. These varieties are preferred by consumers for criteria such as shape, colour, distinctive aroma and flavour. The price of aromatic rice is higher than that of ordinary rice because of its special growing requirements, for instance a specific climate and soil. Presently, aromatic rice quality is identified using its key elements and isotopic variables. The rice can also be classified via Gas Chromatography Mass Spectrometry (GC-MS) or human sensory panels. However, human sensory panels have significant drawbacks: they require lengthy training, become fatigued as the number of samples increases, and are inconsistent. GC-MS analysis, on the other hand, requires detailed procedures and lengthy analysis and is quite costly. This paper presents the application of an in-house developed Electronic Nose (e-nose) to classify new aromatic rice varieties. The e-nose classifies the variety of aromatic rice based on the odour of the samples. The samples were taken from the rice varieties. The instrument utilizes multivariate statistical data analysis, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and K-Nearest Neighbours (KNN), to classify the unknown rice samples. The Leave-One-Out (LOO) validation approach is applied to evaluate the ability of KNN to recognize and classify the unspecified samples. Visual inspection of the PCA and LDA plots shows that the instrument was able to separate the samples into distinct clusters. The low misclassification errors of LDA and KNN support these findings, and we may conclude that the e-nose was successfully applied to the classification of the aromatic rice varieties.

  16. Classification of Malaysia aromatic rice using multivariate statistical analysis

    International Nuclear Information System (INIS)

    Abdullah, A. H.; Adom, A. H.; Shakaff, A. Y. Md; Masnan, M. J.; Zakaria, A.; Rahim, N. A.; Omar, O.

    2015-01-01

    Aromatic rice (Oryza sativa L.) is considered the best-quality premium rice. These varieties are preferred by consumers for criteria such as shape, colour, distinctive aroma and flavour. The price of aromatic rice is higher than that of ordinary rice because of its special growing requirements, for instance a specific climate and soil. Presently, aromatic rice quality is identified using its key elements and isotopic variables. The rice can also be classified via Gas Chromatography Mass Spectrometry (GC-MS) or human sensory panels. However, human sensory panels have significant drawbacks: they require lengthy training, become fatigued as the number of samples increases, and are inconsistent. GC-MS analysis, on the other hand, requires detailed procedures and lengthy analysis and is quite costly. This paper presents the application of an in-house developed Electronic Nose (e-nose) to classify new aromatic rice varieties. The e-nose classifies the variety of aromatic rice based on the odour of the samples. The samples were taken from the rice varieties. The instrument utilizes multivariate statistical data analysis, including Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) and K-Nearest Neighbours (KNN), to classify the unknown rice samples. The Leave-One-Out (LOO) validation approach is applied to evaluate the ability of KNN to recognize and classify the unspecified samples. Visual inspection of the PCA and LDA plots shows that the instrument was able to separate the samples into distinct clusters. The low misclassification errors of LDA and KNN support these findings, and we may conclude that the e-nose was successfully applied to the classification of the aromatic rice varieties.

  17. Multivariate statistical analysis - an application to lunar materials

    International Nuclear Information System (INIS)

    Deb, M.

    1978-01-01

    The compositional characteristics of clinopyroxenes and spinels, two minerals considered to be very useful in deciphering lunar history, have been studied using the multivariate statistical method of principal component analysis. The mineral-chemical data used are from certain lunar rocks and fines collected by the Apollo 11, 12, 14 and 15 and Luna 16 and 20 missions, representing mainly the mare basalts and also non-mare basalts, breccia and rock fragments from the highland regions, in which a large number of these minerals have been analyzed. The correlations noted in the mineral compositions, indicating substitutional relationships, have been interpreted on the basis of available crystal-chemical and petrological information. Compositional trends for individual specimens have been delineated and compared by producing "principal latent vector diagrams". The percent variance of the principal components, denoted by the eigenvalues, has been evaluated in terms of the crystallization history of the samples. Some of the major petrogenetic implications of this study concern the role of early formed cumulate phases in the near-surface fractionation of mare basalts, mixing of mineral compositions in the highland regolith and the subsolidus reduction trends in lunar spinels. (auth.)

  18. Multivariate regression analysis for determining short-term values of radon and its decay products from filter measurements

    International Nuclear Information System (INIS)

    Kraut, W.; Schwarz, W.; Wilhelm, A.

    1994-01-01

    A multivariate regression analysis is applied to decay measurements of α- or β-filter activity. Activity concentrations for Po-218, Pb-214 and Bi-214, or for the Rn-222 equilibrium-equivalent concentration, respectively, are obtained explicitly. The regression analysis properly takes into account the variances of the measured count rates and their influence on the resulting activity concentrations. (orig.)

  19. Analysis and assessment on heavy metal sources in the coastal soils developed from alluvial deposits using multivariate statistical methods.

    Science.gov (United States)

    Li, Jinling; He, Ming; Han, Wei; Gu, Yifan

    2009-05-30

    An investigation of the sources of heavy metals (Cu, Zn, Ni, Pb, Cr, and Cd) in the coastal soils of Shanghai, China, was conducted using multivariate statistical methods (principal component analysis, clustering analysis, and correlation analysis). All the results of the multivariate analysis showed that: (i) Cu, Ni, Pb, and Cd had anthropogenic sources (e.g., overuse of chemical fertilizers and pesticides, industrial and municipal discharges, animal wastes, sewage irrigation, etc.); (ii) Zn and Cr were associated with parent materials and therefore had natural sources (e.g., the weathering process of parent materials and subsequent pedogenesis due to the alluvial deposits). The effect of heavy metals in the soils was greatly affected by soil formation, atmospheric deposition, and human activities. These findings provided essential information on the possible sources of heavy metals, which would contribute to the monitoring and assessment of agricultural soils worldwide.

  20. Application of multivariate curve resolution for the study of folding processes of DNA monitored by fluorescence resonance energy transfer

    International Nuclear Information System (INIS)

    Kumar, Praveen; Kanchan, Kajal; Gargallo, Raimundo; Chowdhury, Shantanu

    2005-01-01

    The study described in the present article used fluorescence resonance energy transfer (FRET) to monitor the folding of a 31-mer cytosine-rich DNA segment, from the promoter region of the human c-myc oncogene. Spectroscopic FRET data recorded during experiments carried out in different experimental conditions were individually and simultaneously analyzed by multivariate curve resolution. The simultaneous analysis of several data matrices allowed the resolution of the system, removing most of the ambiguities related to factor analysis. From the results obtained, we report the evidence of the formation of two ordered conformations in acidic and neutral pH values, in addition to the disordered structure found at high temperatures. These ordered conformations could be related to cytosine-tetraplex structures showing different degrees of protonation in cytosine bases

  1. Mulch materials in processing tomato: a multivariate approach

    Directory of Open Access Journals (Sweden)

    Marta María Moreno

    2013-08-01

    Mulch materials of different origins have been introduced into the agricultural sector in recent years as alternatives to standard polyethylene because of its environmental impact. This study aimed to evaluate the multivariate response of mulch materials over three consecutive years in a processing tomato (Solanum lycopersicon L.) crop in Central Spain. Two biodegradable plastic mulches (BD1, BD2), one oxo-biodegradable material (OB), two types of paper (PP1, PP2), and one barley straw cover (BS) were compared using two control treatments (standard black polyethylene [PE] and manual weed control [MW]). A total of 17 variables relating to yield, fruit quality, and weed control were investigated. Several multivariate statistical techniques were applied, including principal component analysis, cluster analysis, and discriminant analysis. A group of mulch materials comprising OB and BD2 was found to be comparable to black polyethylene regarding all the variables considered. The weed control variables were found to be an important source of discrimination. The two paper mulches tested did not share the same treatment group membership in any case: PP2 presented a multivariate response more similar to the biodegradable plastics, while PP1 was more similar to BS and MW. Based on our multivariate approach, the materials OB and BD2 can be used as an effective, more environmentally friendly alternative to polyethylene mulches.

  2. Fast Detection of Copper Content in Rice by Laser-Induced Breakdown Spectroscopy with Uni- and Multivariate Analysis

    Directory of Open Access Journals (Sweden)

    Fei Liu

    2018-02-01

    Fast detection of heavy metals is very important for ensuring the quality and safety of crops. Laser-induced breakdown spectroscopy (LIBS), coupled with uni- and multivariate analysis, was applied for quantitative analysis of copper in three kinds of rice (Jiangsu rice, regular rice, and Simiao rice). For univariate analysis, three pre-processing methods were applied to reduce fluctuations, including background normalization, the internal standard method, and the standard normal variate (SNV). Linear regression models showed a strong correlation between spectral intensity and Cu content, with an R² of more than 0.97. The limit of detection (LOD) was around 5 ppm, lower than the tolerance limit of copper in foods. For multivariate analysis, partial least squares regression (PLSR) showed its advantage in extracting effective information for prediction, and its sensitivity reached 1.95 ppm, while support vector machine regression (SVMR) performed better in both calibration and prediction sets, where Rc² and Rp² reached 0.9979 and 0.9879, respectively. This study showed that LIBS could be considered as a constructive tool for the quantification of copper contamination in rice.

  3. Multivariate hydrological frequency analysis for extreme events using Archimedean copula. Case study: Lower Tunjuelo River basin (Colombia)

    Science.gov (United States)

    Gómez, Wilmar

    2017-04-01

    By analyzing the spatial and temporal variability of extreme precipitation events, we can prevent or reduce the associated threat and risk. Many water resources projects require joint probability distributions of random variables such as precipitation intensity and duration, which cannot be assumed independent of each other. The problem of defining a probability model for observations of several dependent variables is greatly simplified by expressing the joint distribution in terms of its marginals by means of copulas. This document presents a general framework for bivariate and multivariate frequency analysis using Archimedean copulas for extreme hydroclimatological events such as severe storms. This analysis was conducted in the lower Tunjuelo River basin in Colombia for precipitation events. The results show that, in a joint study of intensity, duration and frequency, IDF curves can be obtained through copulas, yielding more accurate and reliable information on design storms and associated risks. It shows how the use of copulas greatly simplifies the study of multivariate distributions and introduces the concept of joint return period, used to properly represent the needs of hydrological design in frequency analysis.
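
    The joint return period mentioned above can be illustrated with one member of the Archimedean family. The sketch below assumes the Gumbel-Hougaard copula, which the abstract does not name, together with hypothetical values for the dependence parameter `theta` and the mean inter-arrival time of events; `u` and `v` are the marginal non-exceedance probabilities of, for example, intensity and duration.

```python
from math import exp, log

def gumbel_copula(u, v, theta):
    """Gumbel-Hougaard copula C(u, v); theta >= 1, and theta = 1 means independence."""
    return exp(-(((-log(u)) ** theta + (-log(v)) ** theta) ** (1.0 / theta)))

def joint_return_periods(u, v, theta, mean_interarrival=1.0):
    """Return (T_or, T_and): either variable exceeds its level, or both exceed."""
    c = gumbel_copula(u, v, theta)
    t_or = mean_interarrival / (1.0 - c)
    t_and = mean_interarrival / (1.0 - u - v + c)
    return t_or, t_and

# Hypothetical example: both marginals at the 0.9 non-exceedance level.
independent = joint_return_periods(0.9, 0.9, theta=1.0)
dependent = joint_return_periods(0.9, 0.9, theta=2.0)
```

    With positive dependence (theta above 1), the "both exceed" return period becomes shorter than under independence, which is why ignoring the dependence between intensity and duration can understate the risk of compound extremes.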

  4. The Covariance Adjustment Approaches for Combining Incomparable Cox Regressions Caused by Unbalanced Covariates Adjustment: A Multivariate Meta-Analysis Study

    Directory of Open Access Journals (Sweden)

    Tania Dehesh

    2015-01-01

    Background. The univariate meta-analysis (UM) procedure, as a technique that provides a single overall result, has become increasingly popular. Neglecting the existence of other concomitant covariates in the models leads to loss of treatment efficiency. Our aim was to propose four new approximation approaches for the covariance matrix of the coefficients, which is not readily available for the multivariate generalized least square (MGLS) method as a multivariate meta-analysis approach. Methods. We evaluated the efficiency of four new approaches, including zero correlation (ZC), common correlation (CC), estimated correlation (EC), and multivariate multilevel correlation (MMC), on the estimation bias, mean square error (MSE), and 95% probability coverage of the confidence interval (CI) in the synthesis of Cox proportional hazard model coefficients in a simulation study. Result. Comparing the results of the simulation study on the MSE, bias, and CI of the estimated coefficients indicated that the MMC approach was the most accurate procedure compared to the EC, CC, and ZC procedures. The precision ranking of the four approaches according to all above settings was MMC ≥ EC ≥ CC ≥ ZC. Conclusion. This study highlights advantages of MGLS meta-analysis over the UM approach. The results suggested the use of the MMC procedure to overcome the lack of information for having a complete covariance matrix of the coefficients.

  5. The Covariance Adjustment Approaches for Combining Incomparable Cox Regressions Caused by Unbalanced Covariates Adjustment: A Multivariate Meta-Analysis Study.

    Science.gov (United States)

    Dehesh, Tania; Zare, Najaf; Ayatollahi, Seyyed Mohammad Taghi

    2015-01-01

    The univariate meta-analysis (UM) procedure, as a technique that provides a single overall result, has become increasingly popular. Neglecting the existence of other concomitant covariates in the models leads to loss of treatment efficiency. Our aim was to propose four new approximation approaches for the covariance matrix of the coefficients, which is not readily available for the multivariate generalized least square (MGLS) method as a multivariate meta-analysis approach. We evaluated the efficiency of four new approaches, including zero correlation (ZC), common correlation (CC), estimated correlation (EC), and multivariate multilevel correlation (MMC), on the estimation bias, mean square error (MSE), and 95% probability coverage of the confidence interval (CI) in the synthesis of Cox proportional hazard model coefficients in a simulation study. Comparing the results of the simulation study on the MSE, bias, and CI of the estimated coefficients indicated that the MMC approach was the most accurate procedure compared to the EC, CC, and ZC procedures. The precision ranking of the four approaches according to all above settings was MMC ≥ EC ≥ CC ≥ ZC. This study highlights advantages of MGLS meta-analysis over the UM approach. The results suggested the use of the MMC procedure to overcome the lack of information for having a complete covariance matrix of the coefficients.

  6. Micro-Raman Imaging for Biology with Multivariate Spectral Analysis

    KAUST Repository

    Malvaso, Federica

    2015-01-01

    The aim of the following thesis work is to analyze Raman maps related to three pairs of different cells, highlighting differences and similarities through multivariate algorithms. The first pair of analyzed cells are human embryonic stem cells (h

  7. Network structure of multivariate time series.

    Science.gov (United States)

    Lacasa, Lucas; Nicosia, Vincenzo; Latora, Vito

    2015-10-21

    Our understanding of a variety of phenomena in physics, biology and economics crucially depends on the analysis of multivariate time series. While a wide range of tools and techniques for time series analysis already exist, the increasing availability of massive data structures calls for new approaches for multidimensional signal processing. We present here a non-parametric method to analyse multivariate time series, based on the mapping of a multidimensional time series into a multilayer network, which allows one to extract information on a high dimensional dynamical system through the analysis of the structure of the associated multiplex network. The method is simple to implement, general, scalable, does not require ad hoc phase space partitioning, and is thus suitable for the analysis of large, heterogeneous and non-stationary time series. We show that simple structural descriptors of the associated multiplex networks allow one to extract and quantify nontrivial properties of coupled chaotic maps, including the transition between different dynamical phases and the onset of various types of synchronization. As a concrete example we then study financial time series, showing that a multiplex network analysis can efficiently discriminate crises from periods of financial stability, where standard methods based on time-series symbolization often fail.
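
    The abstract does not say how each component series becomes a network layer. The sketch below assumes a horizontal visibility graph per variable (two time points are linked when every value strictly between them is lower than both) and stacks one such graph per variable over the shared time indices. The function names, the Jaccard overlap used as a "structural descriptor", and the sample data are hypothetical choices for illustration.

```python
def horizontal_visibility_edges(series):
    """Edge set of the horizontal visibility graph of one series.

    Nodes are time indices; i < j are linked when every value strictly
    between them is smaller than min(series[i], series[j]).
    """
    edges = set()
    n = len(series)
    for i in range(n - 1):
        highest_between = float("-inf")
        for j in range(i + 1, n):
            if highest_between < min(series[i], series[j]):
                edges.add((i, j))
            highest_between = max(highest_between, series[j])
            # Once a value at least as tall as series[i] appears,
            # no later point can be seen from i.
            if highest_between >= series[i]:
                break
    return edges


def multiplex_from_multivariate(series_by_variable):
    """One layer (edge set) per variable, all on the same time indices."""
    return {name: horizontal_visibility_edges(values)
            for name, values in series_by_variable.items()}


def edge_overlap(layer_a, layer_b):
    """Jaccard overlap of two layers' edge sets (an illustrative descriptor)."""
    union = layer_a | layer_b
    return len(layer_a & layer_b) / len(union) if union else 0.0


# Hypothetical two-variable series.
layers = multiplex_from_multivariate({"x": [3, 1, 2, 4], "y": [1, 3, 2, 4]})
overlap_xy = edge_overlap(layers["x"], layers["y"])
```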

  8. Multivariate methods and forecasting with IBM SPSS statistics

    CERN Document Server

    Aljandali, Abdulkader

    2017-01-01

    This is the second of a two-part guide to quantitative analysis using the IBM SPSS Statistics software package; this volume focuses on multivariate statistical methods and advanced forecasting techniques. More often than not, regression models involve more than one independent variable. For example, forecasting methods are commonly applied to aggregates such as inflation rates, unemployment, exchange rates, etc., that have complex relationships with determining variables. This book introduces multivariate regression models and provides examples to help understand theory underpinning the model. The book presents the fundamentals of multivariate regression and then moves on to examine several related techniques that have application in business-orientated fields such as logistic and multinomial regression. Forecasting tools such as the Box-Jenkins approach to time series modeling are introduced, as well as exponential smoothing and naïve techniques. This part also covers hot topics such as Factor Analysis, Dis...

  9. Determination of sulfamethoxazole and trimethoprim mixtures by multivariate electronic spectroscopy

    OpenAIRE

    Cordeiro, Gilcélia A.; Peralta-Zamora, Patricio; Nagata, Noemi; Pontarollo, Roberto

    2008-01-01

    In this work a multivariate spectroscopic methodology is proposed for quantitative determination of sulfamethoxazole and trimethoprim in pharmaceutical associations. The multivariate model was developed by partial least-squares regression, using twenty synthetic mixtures and the spectral region between 190 and 350 nm. In the validation stage, which involved the analysis of five synthetic mixtures, prediction errors lower than 3% were observed. The predictive capacity of the multivariate model...

  10. Banach frames for multivariate alpha-modulation spaces

    DEFF Research Database (Denmark)

    Borup, Lasse; Nielsen, Morten

    2006-01-01

    The α-modulation spaces [$Mathematical Term$] form a family of spaces that include the Besov and modulation spaces as special cases. This paper is concerned with construction of Banach frames for α-modulation spaces in the multivariate setting. The frames constructed are unions of independent Riesz sequences based on tensor products of univariate brushlet functions, which simplifies the analysis of the full frame. We show that the multivariate α-modulation spaces can be completely characterized by the Banach frames constructed.

  11. A Morphometric Survey among Three Iranian Horse Breeds with Multivariate Analysis

    Directory of Open Access Journals (Sweden)

    M. Hosseini

    2016-12-01

    Turkoman, Caspian, and Kurdish are the most important Iranian horse breeds and are well known around the world for their beauty, versatility, great stamina, and intelligence. Phenotypic characterization was used to identify and document the diversity within and between distinct breeds, based on their observable attributes. Twenty-three phenotypic and body biometric traits were measured in 191 purebred horses belonging to three breeds, i.e. Turkoman (70 horses), Kurdish (77 horses), and Caspian (44 horses). The Caspian breed was sampled from the Provinces of Alborz and Gilan. The Kurdish breed was sampled from the Provinces of Kurdistan, Kermanshah, and Hamadan. The Turkoman breed was sampled from the Provinces of Golestan, Markazi, and Isfahan. Multivariate analysis of variance (MANOVA) was implemented. In addition, Canonical Discriminant Analysis (CDA), Principal Component Analysis (PCA), and cluster analysis were executed to assess the relationship among the breeds. All statistical analyses were executed with the SAS statistical program. The results classified the breeds into 3 different classes (Caspian, Turkoman, and Kurdish) based on different morphometric traits. The Caspian breed, with smaller size in most variables, was clearly separated from the others, at a greater distance than that between the Kurdish and Turkoman breeds. The results showed that the most variable trait for classification was Hind Hoof Length. Adaptation to different environments causes differences in morphology and differences among breeds. Domestic populations can be identified and classified using PCA, CDA, and cluster analysis.

  12. Multivariate analysis in provenance studies: Cerrillos obsidians case, Peru

    International Nuclear Information System (INIS)

    Bustamante, A.; Delgado, M.; Latini, R. M.; Bellido, A. V. B.

    2007-01-01

    We present the preliminary results of a provenance study of obsidian samples from Cerrillos (ca. 800-100 b.c.) using Moessbauer Spectroscopy. The Cerrillos archaeological site, located in the Upper Ica Valley, Peru, is the only Paracas ceremonial center excavated so far. The archaeological data collected suggest the existence of a complex social and economic organization on the south coast of Peru. Provenance research of obsidian provides valuable information about the selection of lithic resources by our ancestors and eventually about the existence of communication routes and exchange networks. We characterized 18 obsidian artifact samples from Cerrillos by Moessbauer spectroscopy. The spectra, recorded at room temperature using different velocities, are mainly composed of broad asymmetric doublets due to the superposition of at least two quadrupole doublets corresponding to Fe 2+ in two different sites (species A and B), one weak Fe 3+ doublet (species C) and magnetic components associated with the presence of small particles of magnetite. Multivariate statistical analysis of the Moessbauer data (hyperfine parameters) allows two main groups of obsidians to be defined, reflecting different geographical origins.

  13. Multivariate analysis in provenance studies: Cerrillos obsidians case, Peru

    Science.gov (United States)

    Bustamante, A.; Delgado, M.; Latini, R. M.; Bellido, A. V. B.

    2007-02-01

    We present the preliminary results of a provenance study of obsidian samples from Cerrillos (ca. 800-100 b.c.) using Mössbauer Spectroscopy. The Cerrillos archaeological site, located in the Upper Ica Valley, Peru, is the only Paracas ceremonial center excavated so far. The archaeological data collected suggest the existence of a complex social and economic organization on the south coast of Peru. Provenance research of obsidian provides valuable information about the selection of lithic resources by our ancestors and eventually about the existence of communication routes and exchange networks. We characterized 18 obsidian artifact samples from Cerrillos by Mössbauer spectroscopy. The spectra, recorded at room temperature using different velocities, are mainly composed of broad asymmetric doublets due to the superposition of at least two quadrupole doublets corresponding to Fe2+ in two different sites (species A and B), one weak Fe3+ doublet (species C) and magnetic components associated with the presence of small particles of magnetite. Multivariate statistical analysis of the Mössbauer data (hyperfine parameters) allows two main groups of obsidians to be defined, reflecting different geographical origins.

  14. Discrimination between Bacillus and Alicyclobacillus isolates in apple juice by Fourier transform infrared spectroscopy and multivariate analysis.

    Science.gov (United States)

    Al-Holy, Murad A; Lin, Mengshi; Alhaj, Omar A; Abu-Goush, Mahmoud H

    2015-02-01

    Alicyclobacillus is a causative agent of spoilage in pasteurized and heat-treated apple juice products. Differentiating between this genus and the closely related Bacillus is crucially important. In this study, Fourier transform infrared spectroscopy (FT-IR) was used to identify and discriminate between 4 Alicyclobacillus strains and 4 Bacillus isolates inoculated individually into apple juice. Loading plots over the range of 1350 to 1700 cm⁻¹ reflected the most distinctive biochemical features of Bacillus and Alicyclobacillus. Multivariate statistical methods (for example, principal component analysis and soft independent modeling of class analogy) were used to analyze the spectral data. Distinctive separation of spectral samples was observed. This study demonstrates that FT-IR spectroscopy in combination with multivariate analysis could serve as a rapid and effective tool for the fruit juice industry to differentiate between Bacillus and Alicyclobacillus and to distinguish between species belonging to these 2 genera. © 2015 Institute of Food Technologists®

  15. Study on loss detection algorithms for tank monitoring data using multivariate statistical analysis

    International Nuclear Information System (INIS)

    Suzuki, Mitsutoshi; Burr, Tom

    2009-01-01

    Evaluation of solution monitoring data to support material balance evaluation was proposed about a decade ago because of concerns regarding the large throughput planned at Rokkasho Reprocessing Plant (RRP). A numerical study using the simulation code (FACSIM) was done and significant increases in the detection probabilities (DP) for certain types of losses were shown. To be accepted internationally, it is very important to verify such claims using real solution monitoring data. However, a demonstrative study with real tank data has not been carried out due to the confidentiality of the tank data. This paper describes an experimental study that has been started using actual data from the Solution Measurement and Monitoring System (SMMS) in the Tokai Reprocessing Plant (TRP) and the Savannah River Site (SRS). Multivariate statistical methods, such as a vector cumulative sum and a multi-scale statistical analysis, have been applied to the real tank data that have superimposed simulated loss. Although quantitative conclusions have not been derived for the moment due to the difficulty of baseline evaluation, the multivariate statistical methods remain promising for abrupt and some types of protracted loss detection. (author)
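
    To make the cumulative-sum idea concrete, here is a minimal one-sided CUSUM on a single residual series. It is a scalar simplification, not the vector cumulative sum the abstract refers to, and the residual convention (expected minus measured inventory, so a loss is positive), the allowance, the threshold, and the data are all assumptions chosen for illustration.

```python
def cusum_alarm_index(residuals, allowance=0.5, threshold=4.0):
    """Return the first index at which a one-sided CUSUM exceeds threshold.

    The running score accumulates (residual - allowance) and is reset to
    zero whenever it would go negative. Returns None if no alarm is raised.
    """
    score = 0.0
    for t, residual in enumerate(residuals):
        score = max(0.0, score + residual - allowance)
        if score > threshold:
            return t
    return None


# Hypothetical residuals: five in-control points, then a sustained loss.
with_loss = [0.0] * 5 + [2.0] * 5
no_loss = [0.0] * 10

alarm_with_loss = cusum_alarm_index(with_loss)
alarm_no_loss = cusum_alarm_index(no_loss)
```

    With these made-up numbers the score grows by 1.5 per step once the loss starts, so the alarm fires three steps after onset; real monitoring data would first need the baseline evaluation the abstract describes as difficult.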

  16. Contaminants reduce the richness and evenness of marine communities: A review and meta-analysis

    International Nuclear Information System (INIS)

    Johnston, Emma L.; Roberts, David A.

    2009-01-01

    Biodiversity of marine ecosystems is integral to their stability and function and is threatened by anthropogenic processes. We conducted a literature review and meta-analysis of 216 studies to understand the effects of common contaminants upon diversity in various marine communities. The most common diversity measures were species richness, the Shannon-Wiener index (H') and Pielou evenness (J). Largest effect sizes were observed for species richness, which tended to be the most sensitive index. Pollution was associated with marine communities containing fewer species or taxa than their pristine counterparts. Marine habitats did not vary in their susceptibility to contamination, rather a ∼40% reduction in richness occurred across all habitats. No class of contaminant was associated with significantly greater impacts on diversity than any other. Survey studies identified larger effects than laboratory or field experiments. Anthropogenic contamination is strongly associated with reductions in the species richness and evenness of marine habitats. - Contamination substantially reduces the biodiversity of marine communities in all major habitat types and across all major contaminant classes.
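
    The three diversity measures named in this abstract can be computed from a vector of per-species abundance counts. The sketch below uses the usual definitions (richness S as the number of species present, H' as the negative sum of p ln p, and J as H' divided by ln S); the natural-log base and the two sample communities are assumptions for illustration, not data from the review.

```python
import math


def species_richness(counts):
    """Number of species with at least one individual."""
    return sum(1 for c in counts if c > 0)


def shannon_index(counts):
    """Shannon-Wiener index H' using natural logarithms."""
    total = sum(counts)
    proportions = [c / total for c in counts if c > 0]
    return sum(-p * math.log(p) for p in proportions)


def pielou_evenness(counts):
    """Pielou's J = H' / ln(S); returned as 0.0 when fewer than two species."""
    richness = species_richness(counts)
    if richness < 2:
        return 0.0
    return shannon_index(counts) / math.log(richness)


# Hypothetical abundance counts for four species at two sites.
reference_site = [25, 25, 25, 25]
contaminated_site = [70, 20, 10, 0]

reference_summary = (species_richness(reference_site),
                     shannon_index(reference_site),
                     pielou_evenness(reference_site))
contaminated_summary = (species_richness(contaminated_site),
                        shannon_index(contaminated_site),
                        pielou_evenness(contaminated_site))
```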

  17. A Framework for Establishing Standard Reference Scale of Texture by Multivariate Statistical Analysis Based on Instrumental Measurement and Sensory Evaluation.

    Science.gov (United States)

    Zhi, Ruicong; Zhao, Lei; Xie, Nan; Wang, Houyin; Shi, Bolin; Shi, Jingye

    2016-01-13

    A framework for establishing a standard reference scale (texture) is proposed, using multivariate statistical analysis of instrumental measurement and sensory evaluation. Multivariate statistical analysis is conducted to rapidly select typical reference samples with characteristics of universality, representativeness, stability, substitutability, and traceability. The reasonableness of the framework method is verified by establishing a standard reference scale of a texture attribute (hardness) with well-known Chinese foods. More than 100 food products in 16 categories were tested using instrumental measurement (TPA test), and the result was analyzed with clustering analysis, principal component analysis, relative standard deviation, and analysis of variance. As a result, nine kinds of foods were determined to construct the hardness standard reference scale. The results indicate that the regression coefficient between the estimated sensory value and the instrumentally measured value is significant (R² = 0.9765), which fits well with Stevens's theory. The research provides a reliable theoretical basis and practical guide for quantitative standard reference scale establishment on food texture characteristics.

  18. A review of multivariate analyses in imaging genetics

    Directory of Open Access Journals (Sweden)

    Jingyu Liu

    2014-03-01

    Recent advances in neuroimaging technology and molecular genetics provide the unique opportunity to investigate genetic influence on the variation of brain attributes. Since the year 2000, when the initial publication on brain imaging and genetics was released, imaging genetics has been a rapidly growing research approach with increasing publications every year. Several reviews have been offered to the research community focusing on various study designs. In addition to study design, analytic tools and their proper implementation are also critical to the success of a study. In this review, we survey recent publications using data from neuroimaging and genetics, focusing on methods capturing multivariate effects accommodating the large number of variables from both imaging data and genetic data. We group the analyses of genetic or genomic data into either an a priori driven or a data driven approach, including gene-set enrichment analysis, multifactor dimensionality reduction, principal component analysis, independent component analysis (ICA), and clustering. For the analyses of imaging data, ICA and extensions of ICA are the most widely used multivariate methods. Given detailed reviews of multivariate analyses of imaging data available elsewhere, we provide a brief summary here that includes a recently proposed method known as independent vector analysis. Finally, we review methods focused on bridging the imaging and genetic data by establishing multivariate and multiple genotype-phenotype associations, including sparse partial least squares, sparse canonical correlation analysis, sparse reduced rank regression and parallel ICA. These methods are designed to extract latent variables from both genetic and imaging data, which become new genotypes and phenotypes, and the links between the new genotype-phenotype pairs are maximized using different cost functions. The relationship between these methods along with their assumptions, advantages, and

  19. Multivariate Multi-Scale Permutation Entropy for Complexity Analysis of Alzheimer’s Disease EEG

    Directory of Open Access Journals (Sweden)

    Isabella Palamara

    2012-07-01

    An original multivariate multi-scale methodology for assessing the complexity of physiological signals is proposed. The technique is able to incorporate the simultaneous analysis of multi-channel data as a unique block within a multi-scale framework. The basic complexity measure is computed using Permutation Entropy, a methodology for time series processing based on ordinal analysis. Permutation Entropy is conceptually simple, structurally robust to noise and artifacts, and computationally very fast, which is relevant for designing portable diagnostics. Since time series derived from biological systems show structures on multiple spatial-temporal scales, the proposed technique can be useful for other types of biomedical signal analysis. In this work, the possibility of distinguishing the brain states of Alzheimer’s disease patients and Mild Cognitive Impaired subjects from those of normal healthy elderly subjects is checked on a real, although quite limited, experimental database.
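
    For readers unfamiliar with the building blocks, the sketch below computes single-channel permutation entropy from ordinal patterns and shows a simple coarse-graining step of the kind commonly used in multi-scale analyses. It does not reproduce the multivariate combination proposed in the paper; the non-overlapping-mean coarse-graining, the default order and delay, and the sample series are assumptions.

```python
import math
from collections import Counter


def permutation_entropy(series, order=3, delay=1, normalize=True):
    """Permutation entropy of a one-dimensional series.

    Each window of `order` samples is replaced by the permutation that
    sorts it; the entropy of the pattern frequencies is returned
    (divided by ln(order!) when normalize is True).
    """
    n_windows = len(series) - (order - 1) * delay
    if n_windows <= 0:
        raise ValueError("series is too short for this order and delay")
    patterns = Counter()
    for start in range(n_windows):
        window = [series[start + k * delay] for k in range(order)]
        pattern = tuple(sorted(range(order), key=window.__getitem__))
        patterns[pattern] += 1
    probabilities = [count / n_windows for count in patterns.values()]
    entropy = sum(-p * math.log(p) for p in probabilities)
    if normalize:
        entropy /= math.log(math.factorial(order))
    return entropy


def coarse_grain(series, scale):
    """Average non-overlapping blocks of `scale` samples (remainder dropped)."""
    n_blocks = len(series) // scale
    return [sum(series[b * scale:(b + 1) * scale]) / scale
            for b in range(n_blocks)]


# Hypothetical short signal.
signal = [4, 7, 9, 10, 6, 11, 3]
pe_raw = permutation_entropy(signal, order=3, normalize=False)
pe_trend = permutation_entropy(list(range(10)), order=3)
```

    A multi-scale profile would then be obtained by calling `permutation_entropy(coarse_grain(signal, s))` for several scales `s` on a sufficiently long recording.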

  20. A comparative multivariate analysis of household energy requirements in Australia, Brazil, Denmark, India and Japan

    Energy Technology Data Exchange (ETDEWEB)

    Lenzen, M. [University of Sydney (Australia). School of Physics; Wier, M. [Royal Veterinary and Agricultural University, Copenhagen (Denmark). Danish Research Institute of Food Economics; Cohen, C. [Universidade Federal Fluminense, Rio de Janeiro (Brazil). Faculdade de Economia; Hayami, Hitoshi [Keio University, Tokyo (Japan). Keio Economic Observatory; Pachauri, S. [Swiss Federal Institutes of Technology, Zurich (Switzerland). Centre for Energy Policy and Economics; Schaeffer, R. [Universidade Federal do Rio de Janeiro (Brazil). COPPE

    2006-03-01

    In this paper, we appraise sustainable household consumption from a global perspective. Using per capita energy requirements as an indicator of environmental pressure, we focus on the importance of income growth in a cross-country analysis. Our analysis is supported by a detailed within-country analysis encompassing five countries, in which we assess the importance of various socioeconomic-demographic characteristics of household energy requirements. We bring together family expenditure survey data, input-output tables, and energy statistics in a multivariate analysis. Instead of a uniform Kuznets curve, we find that the effect of increasing income varies considerably across countries, even when controlling for socioeconomic and demographic variations. The latter variables show similar influences, but differing importance across countries. (author)

  1. The classification of secondary colorectal liver cancer in human biopsy samples using angular dispersive x-ray diffraction and multivariate analysis

    International Nuclear Information System (INIS)

    Theodorakou, Chrysoula; Farquharson, Michael J

    2009-01-01

    The motivation behind this study is to assess whether angular dispersive x-ray diffraction (ADXRD) data, processed using multivariate analysis techniques, can be used for classifying secondary colorectal liver cancer tissue and normal surrounding liver tissue in human liver biopsy samples. The ADXRD profiles from a total of 60 samples of normal liver tissue and colorectal liver metastases were measured using a synchrotron radiation source. The data were analysed for 56 samples using nonlinear peak-fitting software. Four peaks were fitted to all of the ADXRD profiles, and the amplitude, area, amplitude and area ratios for three of the four peaks were calculated and used for the statistical and multivariate analysis. The statistical analysis showed that there are significant differences between all the peak-fitting parameters and ratios between the normal and the diseased tissue groups. The technique of soft independent modelling of class analogy (SIMCA) was used to classify normal liver tissue and colorectal liver metastases resulting in 67% of the normal tissue samples and 60% of the secondary colorectal liver tissue samples being classified correctly. This study has shown that the ADXRD data of normal and secondary colorectal liver cancer are statistically different and x-ray diffraction data analysed using multivariate analysis have the potential to be used as a method of tissue classification.

  2. Application of Multivariate Statistical Analysis to Biomarkers in Se-Turkey Crude Oils

    Science.gov (United States)

    Gürgey, K.; Canbolat, S.

    2017-11-01

    Twenty-four crude oil samples were collected from 24 oil fields distributed in different districts of SE-Turkey. API and Sulphur content (%), Stable Carbon Isotope, Gas Chromatography (GC), and Gas Chromatography-Mass Spectrometry (GC-MS) data were used to construct a geochemical data matrix. The aim of this study is to examine the genetic grouping or correlations in the crude oil samples, and hence the number of source rocks present in SE-Turkey. To achieve these aims, two multivariate statistical analysis techniques (Principal Component Analysis [PCA] and Cluster Analysis) were applied to a data matrix of 24 samples and 8 source-specific biomarker variables/parameters. The results showed that there are 3 genetically different oil groups: Batman-Nusaybin Oils, Adıyaman-Kozluk Oils and Diyarbakir Oils, in addition to one mixed group. These groupings imply that at least three different source rocks are present in South-Eastern (SE) Turkey. Grouping of the crude oil samples appears to be consistent with the geographic locations of the oil fields, the subsurface stratigraphy and the geology of the area.
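
    As a reminder of what PCA does to a samples-by-variables matrix like the one described here, the sketch below extracts the first principal component by power iteration on the sample covariance matrix and projects each sample onto it. It is a bare-bones illustration in plain Python: it omits the variable standardization usually applied when parameters have different units, it finds only one component, and the tiny data matrix is invented.

```python
import math


def first_principal_component(rows, iterations=200):
    """Return (loading vector, sample scores) for the first principal component."""
    n, p = len(rows), len(rows[0])
    means = [sum(row[j] for row in rows) / n for j in range(p)]
    centred = [[row[j] - means[j] for j in range(p)] for row in rows]
    cov = [[sum(c[i] * c[j] for c in centred) / (n - 1) for j in range(p)]
           for i in range(p)]
    # Asymmetric start vector, to lower the chance of starting
    # orthogonal to the leading eigenvector.
    vec = [1.0 / (j + 1) for j in range(p)]
    for _ in range(iterations):
        nxt = [sum(cov[i][j] * vec[j] for j in range(p)) for i in range(p)]
        norm = math.sqrt(sum(v * v for v in nxt))
        if norm == 0.0:
            raise ValueError("power iteration collapsed; data may be constant")
        vec = [v / norm for v in nxt]
    scores = [sum(c[j] * vec[j] for j in range(p)) for c in centred]
    return vec, scores


# Hypothetical matrix: 4 samples, 2 perfectly correlated variables (y = 2x).
data = [(1.0, 2.0), (2.0, 4.0), (3.0, 6.0), (4.0, 8.0)]
loadings, scores = first_principal_component(data)
```

    Samples with similar scores on the leading components are the ones a subsequent cluster analysis would tend to group together.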

  3. APPLICATION OF MULTIVARIATE STATISTICAL ANALYSIS TO BIOMARKERS IN SE-TURKEY CRUDE OILS

    Directory of Open Access Journals (Sweden)

    K. Gürgey

    2017-11-01

    Twenty-four crude oil samples were collected from 24 oil fields distributed in different districts of SE-Turkey. API and Sulphur content (%), Stable Carbon Isotope, Gas Chromatography (GC), and Gas Chromatography-Mass Spectrometry (GC-MS) data were used to construct a geochemical data matrix. The aim of this study is to examine the genetic grouping or correlations in the crude oil samples, and hence the number of source rocks present in SE-Turkey. To achieve these aims, two multivariate statistical analysis techniques (Principal Component Analysis [PCA] and Cluster Analysis) were applied to a data matrix of 24 samples and 8 source-specific biomarker variables/parameters. The results showed that there are 3 genetically different oil groups: Batman-Nusaybin Oils, Adıyaman-Kozluk Oils and Diyarbakir Oils, in addition to one mixed group. These groupings imply that at least three different source rocks are present in South-Eastern (SE) Turkey. Grouping of the crude oil samples appears to be consistent with the geographic locations of the oil fields, the subsurface stratigraphy and the geology of the area.

  4. Response to comments on "Productivity is a poor predictor of plant species richness"

    Science.gov (United States)

    Grace, James B.; Adler, Peter B.; Seabloom, Eric W.; Borer, Elizabeth T.; Hillebrand, Helmut; Hautier, Yann; Hector, Andy; Harpole, W. Stanley; O'Halloran, Lydia R.; Anderson, T. Michael; Bakker, Jonathan D.; Brown, Cynthia S.; Buckley, Yvonne M.; Collins, Scott L.; Cottingham, Kathryn L.; Crawley, Michael J.; Damschen, Ellen Ingman; Davies, Kendi F.; DeCrappeo, Nicole M.; Fay, Philip A.; Firn, Jennifer; Gruner, Daniel S.; Hagenah, Nicole; Jin, Virginia L.; Kirkman, Kevin P.; Knops, Johannes M.H.; La Pierre, Kimberly J.; Lambrinos, John G.; Melbourne, Brett A.; Mitchell, Charles E.; Moore, Joslin L.; Morgan, John W.; Orrock, John L.; Prover, Suzanne M.; Stevens, Carly J.; Wragg, Peter D.; Yang, Louie H.

    2012-01-01

    Pan et al. claim that our results actually support a strong linear positive relationship between productivity and richness, whereas Fridley et al. contend that the data support a strong humped relationship. These responses illustrate how preoccupation with bivariate patterns distracts from a deeper understanding of the multivariate mechanisms that control these important ecosystem properties.

  5. IR spectroscopy together with multivariate data analysis as a process analytical tool for in-line monitoring of crystallization process and solid-state analysis of crystalline product

    DEFF Research Database (Denmark)

    Pöllänen, Kati; Häkkinen, Antti; Reinikainen, Satu-Pia

    2005-01-01

    Crystalline product should exist in optimal polymorphic form. Robust and reliable method for polymorph characterization is of great importance. In this work, infra red (IR) spectroscopy is applied for monitoring of crystallization process in situ. The results show that attenuated total reflection Fourier transform infra red (ATR-FTIR) spectroscopy provides valuable information on process, which can be utilized for more controlled crystallization processes. Diffuse reflectance Fourier transform infra red (DRIFT-IR) is applied for polymorphic characterization of crystalline product using X-ray powder diffraction (XRPD) as a reference technique. In order to fully utilize DRIFT, the application of multivariate techniques are needed, e.g., multivariate statistical process control (MSPC), principal component analysis (PCA) and partial least squares (PLS). The results demonstrate that multivariate...

  6. Searching for New Biomarkers and the Use of Multivariate Analysis in Gastric Cancer Diagnostics.

    Science.gov (United States)

    Kucera, Radek; Smid, David; Topolcan, Ondrej; Karlikova, Marie; Fiala, Ondrej; Slouka, David; Skalicky, Tomas; Treska, Vladislav; Kulda, Vlastimil; Simanek, Vaclav; Safanda, Martin; Pesta, Martin

    2016-04-01

    The first aim of this study was to search for new biomarkers to be used in gastric cancer diagnostics. The second aim was to verify the findings presented in literature on a sample of the local population and investigate the risk of gastric cancer in that population using a multivariate statistical analysis. We assessed a group of 36 patients with gastric cancer and 69 healthy individuals. We determined carcinoembryonic antigen, cancer antigen 19-9, cancer antigen 72-4, matrix metalloproteinases (-1, -2, -7, -8 and -9), osteoprotegerin, osteopontin, prothrombin induced by vitamin K absence-II, pepsinogen I, pepsinogen II, gastrin and Helicobacter pylori for each sample. The multivariate stepwise logistic regression identified the following biomarkers as the best gastric cancer predictors: CEA, CA72-4, pepsinogen I, Helicobacter pylori presence and MMP7. CEA and CA72-4 remain the best markers for gastric cancer diagnostics. We suggest a mathematical model for the assessment of risk of gastric cancer. Copyright© 2016 International Institute of Anticancer Research (Dr. John G. Delinassios), All rights reserved.

  7. Conversion from laparoscopic to open cholecystectomy: Multivariate analysis of preoperative risk factors

    Directory of Open Access Journals (Sweden)

    Khan M

    2005-01-01

    BACKGROUND: Laparoscopic cholecystectomy has become the gold standard in the treatment of symptomatic cholelithiasis. Some patients require conversion to open surgery and several preoperative variables have been identified as risk factors that are helpful in predicting the probability of conversion. However, there is a need to devise a risk-scoring system based on the identified risk factors to (a) predict the risk of conversion preoperatively for selected patients, (b) prepare the patient psychologically, (c) arrange operating schedules accordingly, and (d) minimize the procedure-related cost and help overcome financial constraints, which is a significant problem in developing countries. AIM: This study aimed to evaluate preoperative risk factors for conversion from laparoscopic to open cholecystectomy in our setting. SETTINGS AND DESIGNS: A case control study of patients who underwent laparoscopic surgery from January 1997 to December 2001 was conducted at the Aga Khan University Hospital, Karachi, Pakistan. MATERIALS AND METHODS: All those patients who were converted to open surgery (n = 73) were enrolled as cases. Two controls who had successful laparoscopic surgery (n = 146) were matched with each case for operating surgeon and closest date of surgery. STATISTICAL ANALYSIS USED: Descriptive statistics were computed, and univariate and multivariate analyses were done through multiple logistic regression. RESULTS: The final multivariate model identified two risk factors for conversion: ultrasonographic signs of inflammation (adjusted odds ratio [aOR] = 8.5; 95% confidence interval [CI]: 3.3, 21.9) and age > 60 years (aOR = 8.1; 95% CI: 2.9, 22.2) after adjusting for physical signs, alkaline phosphatase and BMI levels. CONCLUSION: Preoperative risk factors evaluated by the present study confirm the likelihood of conversion. Recognition of these factors is important for understanding the characteristics of patients at a higher risk of conversion.
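
    The reported odds ratios and confidence intervals are linked through the standard error of the log odds ratio. As a small worked check, the sketch below backs out that standard error from the first reported interval and rebuilds the interval from the point estimate. It assumes the intervals are Wald intervals (symmetric on the log scale, with z = 1.96), which the abstract does not state; the function names are illustrative.

```python
import math


def log_or_standard_error(ci_low, ci_high, z=1.96):
    """Standard error of ln(OR) implied by a Wald confidence interval."""
    return (math.log(ci_high) - math.log(ci_low)) / (2 * z)


def wald_interval(odds_ratio, std_error, z=1.96):
    """Confidence interval for an odds ratio from ln(OR) and its SE."""
    centre = math.log(odds_ratio)
    return (math.exp(centre - z * std_error),
            math.exp(centre + z * std_error))


# Values reported for ultrasonographic signs of inflammation.
se_inflammation = log_or_standard_error(3.3, 21.9)
rebuilt_low, rebuilt_high = wald_interval(8.5, se_inflammation)
```

    The rebuilt limits agree with the published 3.3 and 21.9 to rounding, which is consistent with (though not proof of) a Wald-type interval.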

  8. DTW-APPROACH FOR UNCORRELATED MULTIVARIATE TIME SERIES IMPUTATION

    OpenAIRE

    Phan , Thi-Thu-Hong; Poisson Caillault , Emilie; Bigand , André; Lefebvre , Alain

    2017-01-01

    Missing data are inevitable in almost all domains of applied sciences. Data analysis with missing values can lead to a loss of efficiency and unreliable results, especially for large missing sub-sequence(s). Some well-known methods for multivariate time series imputation require high correlations between series or their features. In this paper, we propose an approach based on the shape-behaviour relation in low/un-correlated multivariate time series under an assumption of...
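
    The title refers to dynamic time warping (DTW). A minimal sketch of the classic DTW distance follows, together with a hypothetical helper that finds the window of a series most similar to a query segment. The helper only illustrates how a DTW score could guide the search for a fill-in pattern; it is not the authors' imputation algorithm, whose details are cut off in this abstract.

```python
def dtw_distance(a, b):
    """Classic dynamic time warping distance with absolute-difference cost."""
    inf = float("inf")
    n, m = len(a), len(b)
    cost = [[inf] * (m + 1) for _ in range(n + 1)]
    cost[0][0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            step = abs(a[i - 1] - b[j - 1])
            cost[i][j] = step + min(cost[i - 1][j],      # repeat b[j-1]
                                    cost[i][j - 1],      # repeat a[i-1]
                                    cost[i - 1][j - 1])  # advance both
    return cost[n][m]


def most_similar_window(query, series):
    """Start index of the window of len(query) in series closest to query.

    Hypothetical helper for illustration only.
    """
    width = len(query)
    starts = range(len(series) - width + 1)
    return min(starts, key=lambda s: dtw_distance(query, series[s:s + width]))


# Hypothetical data.
d_warped = dtw_distance([1, 2, 3], [1, 2, 2, 3])
d_offset = dtw_distance([0, 0], [1, 1])
best_start = most_similar_window([1, 2, 3], [9, 9, 1, 2, 3, 9])
```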

  9. Exploring the Structure of Library and Information Science Web Space Based on Multivariate Analysis of Social Tags

    Science.gov (United States)

    Joo, Soohyung; Kipp, Margaret E. I.

    2015-01-01

    Introduction: This study examines the structure of Web space in the field of library and information science using multivariate analysis of social tags from the Website, Delicious.com. A few studies have examined mathematical modelling of tags, mainly examining tagging in terms of tripartite graphs, pattern tracing and descriptive statistics. This…

  10. Comparative urine analysis by liquid chromatography-mass spectrometry and multivariate statistics : Method development, evaluation, and application to proteinuria

    NARCIS (Netherlands)

    Kemperman, Ramses F. J.; Horvatovich, Peter L.; Hoekman, Berend; Reijmers, Theo H.; Muskiet, Frits A. J.; Bischoff, Rainer

    2007-01-01

    We describe a platform for the comparative profiling of urine using reversed-phase liquid chromatography-mass spectrometry (LC-MS) and multivariate statistical data analysis. Urinary compounds were separated by gradient elution and subsequently detected by electrospray Ion-Trap MS. The lower limit

  11. Elemental content of Vietnamese rice. Part 2. Multivariate data analysis.

    Science.gov (United States)

    Kokot, S; Phuong, T D

    1999-04-01

    Rice samples were obtained from the Red River region and some other parts of Vietnam as well as from Yanco, Australia. These samples were analysed for 14 elements (P, K, Mg, Ca, Mn, Zn, Fe, Cu, Al, Na, Ni, As, Mo and Cd) by ICP-AES, ICP-MS and FAAS as described in Part 1. This data matrix was then submitted to multivariate data analysis by principal component analysis to investigate the influences of environmental and crop cultivation variables on the elemental content of rice. Results revealed that geographical location, grain variety, seasons and soil conditions are the most likely significant factors causing changes in the elemental content between the rice samples. To assess rice quality according to its elemental content and physio-biological properties, a multicriteria decision making method (PROMETHEE) was applied. With the Vietnamese rice, the sticky rice appeared to contain somewhat higher levels of nutritionally significant elements such as P, K and Mg than the non-sticky rice. Also, rice samples grown during the wet season have better levels of nutritionally significant mineral elements than those of the dry season, but in general, the wet season seemed to provide better overall elemental and physio-biological rice quality.

  12. Optimization of Interior Permanent Magnet Motor by Quality Engineering and Multivariate Analysis

    Science.gov (United States)

    Okada, Yukihiro; Kawase, Yoshihiro

    This paper describes an optimization method based on the finite element method (FEM). Quality engineering and multivariate analysis are used as the optimization techniques. The method consists of two steps. In Step 1, the influence of the parameters on the output is obtained quantitatively; in Step 2, the number of FEM calculations is cut down. That is, the optimal combination of the design parameters that satisfies the required characteristic can be searched for efficiently. In addition, the method is applied to the design of an IPM motor to reduce the torque ripple. The final shape maintains the average torque and cuts the torque ripple by 65%. Furthermore, the amount of permanent magnets can be reduced.

  13. HORIZONTAL BRANCH MORPHOLOGY OF GLOBULAR CLUSTERS: A MULTIVARIATE STATISTICAL ANALYSIS

    International Nuclear Information System (INIS)

    Jogesh Babu, G.; Chattopadhyay, Tanuka; Chattopadhyay, Asis Kumar; Mondal, Saptarshi

    2009-01-01

    The proper interpretation of horizontal branch (HB) morphology is crucial to the understanding of the formation history of stellar populations. In the present study, a multivariate analysis (principal component analysis) is used for the selection of an appropriate HB morphology parameter, which, in our case, is the logarithm of the effective temperature extent of the HB ($\log T_{\mathrm{effHB}}$). Then this parameter is expressed in terms of the most significant observed independent parameters of Galactic globular clusters (GGCs) separately for coherent groups, obtained in a previous work, through a stepwise multiple regression technique. It is found that metallicity ([Fe/H]), central surface brightness ($\mu_v$), and core radius ($r_c$) are the significant parameters to explain most of the variations in HB morphology (multiple $R^2 \sim 0.86$) for GGCs belonging to the bulge/disk, while metallicity ([Fe/H]) and absolute magnitude ($M_v$) are responsible for GGCs belonging to the inner halo (multiple $R^2 \sim 0.52$). The robustness is tested by taking 1000 bootstrap samples. A cluster analysis is performed for the red giant branch (RGB) stars of the GGCs belonging to the Galactic inner halo (Cluster 2). A multi-episodic star formation is preferred for RGB stars of GGCs belonging to this group. It supports the asymptotic giant branch (AGB) model in three episodes instead of two, as suggested by Carretta et al. for halo GGCs, while the AGB model is suggested to be revisited for bulge/disk GGCs.

  14. A multivariate analysis of factors affecting adoption of improved ...

    African Journals Online (AJOL)

    This paper analyzes the synergies/tradeoffs involved in the adoption of improved varieties of multiple crops in the mixed crop-livestock production systems of the highlands of Ethiopia. A multivariate probit (MVP) model involving a system of four equations for the adoption decision of improved varieties of barley, potatoes, ...

  15. Multivariate fault isolation of batch processes via variable selection in partial least squares discriminant analysis.

    Science.gov (United States)

    Yan, Zhengbing; Kuang, Te-Hui; Yao, Yuan

    2017-09-01

    In recent years, multivariate statistical monitoring of batch processes has become a popular research topic, wherein multivariate fault isolation is an important step aiming at the identification of the faulty variables contributing most to the detected process abnormality. Although contribution plots have been commonly used in statistical fault isolation, such methods suffer from the smearing effect between correlated variables. In particular, in batch process monitoring, the high autocorrelations and cross-correlations that exist in variable trajectories make the smearing effect unavoidable. To address such a problem, a variable selection-based fault isolation method is proposed in this research, which transforms the fault isolation problem into a variable selection problem in partial least squares discriminant analysis and solves it by calculating a sparse partial least squares model. As different from the traditional methods, the proposed method emphasizes the relative importance of each process variable. Such information may help process engineers in conducting root-cause diagnosis. Copyright © 2017 ISA. Published by Elsevier Ltd. All rights reserved.

  16. Discrimination of Wild Paris Based on Near Infrared Spectroscopy and High Performance Liquid Chromatography Combined with Multivariate Analysis

    Science.gov (United States)

    Zhao, Yanli; Zhang, Ji; Yuan, Tianjun; Shen, Tao; Li, Wei; Yang, Shihua; Hou, Ying; Wang, Yuanzhong; Jin, Hang

    2014-01-01

    Different geographical origins and species of Paris obtained from southwestern China were discriminated by near infrared (NIR) spectroscopy and high performance liquid chromatography (HPLC) combined with multivariate analysis. The NIR parameter settings were scanning (64 times), resolution (4 cm−1), scanning range (10000 cm−1∼4000 cm−1) and parallel collection (3 times). NIR spectrum was optimized by TQ 8.6 software, and the ranges 7455∼6852 cm−1 and 5973∼4007 cm−1 were selected according to the spectrum standard deviation. The contents of polyphyllin I, polyphyllin II, polyphyllin VI, and polyphyllin VII and total steroid saponins were detected by HPLC. The contents of chemical components data matrix and spectrum data matrix were integrated and analyzed by partial least squares discriminant analysis (PLS-DA). From the PLS-DA model of NIR spectrum, Paris samples were separated into three groups according to the different geographical origins. The R2X and Q2Y described accumulative contribution rates were 99.50% and 94.03% of the total variance, respectively. The PLS-DA model according to 12 species of Paris described 99.62% of the variation in X and predicted 95.23% in Y. The results of the contents of chemical components described differences among collections quantitatively. A multivariate statistical model of PLS-DA showed geographical origins of Paris had a much greater influence on Paris compared with species. NIR and HPLC combined with multivariate analysis could discriminate different geographical origins and different species. The quality of Paris showed regional dependence. PMID:24558477

  17. A cross-species socio-emotional behaviour development revealed by a multivariate analysis.

    Science.gov (United States)

    Koshiba, Mamiko; Senoo, Aya; Mimura, Koki; Shirakawa, Yuka; Karino, Genta; Obara, Saya; Ozawa, Shinpei; Sekihara, Hitomi; Fukushima, Yuta; Ueda, Toyotoshi; Kishino, Hirohisa; Tanaka, Toshihisa; Ishibashi, Hidetoshi; Yamanouchi, Hideo; Yui, Kunio; Nakamura, Shun

    2013-01-01

    Recent progress in affective neuroscience and social neurobiology has been propelled by neuro-imaging technology and epigenetic approaches in the neurobiology of animal behaviour. However, quantitative measurements of socio-emotional development remain lacking, though sensory-motor development has been extensively studied in terms of digitised imaging analysis. Here, we developed a method for socio-emotional behaviour measurement that is based on video recordings under well-defined social contexts, using animal models with various social sensory interactions during development. The behaviour features digitised from the video recordings were visualised in a multivariate statistical space using principal component analysis. The clustering of the behaviour parameters suggested the existence of species- and stage-specific as well as cross-species behaviour modules. These modules were used to characterise the behaviour of children with or without autism spectrum disorders (ASDs). We found that socio-emotional behaviour is highly dependent on social context and that the cross-species behaviour modules may predict the neurobiological basis of ASDs.

  18. Noise source analysis of nuclear ship Mutsu plant using multivariate autoregressive model

    International Nuclear Information System (INIS)

    Hayashi, K.; Shimazaki, J.; Shinohara, Y.

    1996-01-01

    The present study is concerned with the noise sources in the N.S. Mutsu reactor plant. The noise experiments on the Mutsu plant were performed in order to investigate the plant dynamics and the effect of sea condition and ship motion on the plant. The reactor noise signals as well as the ship motion signals were analyzed by a multivariable autoregressive (MAR) modeling method to clarify the noise sources in the reactor plant. It was confirmed from the analysis results that most of the plant variables were affected mainly by a horizontal component of the ship motion, that is, the sway, through vibrations of the plant structures. Furthermore, the effect of ship motion on the reactor power was evaluated through the analysis of wave components extracted by a geometrical transform method. It was concluded that the amplitude of the reactor power oscillation was about 0.15% in normal sea condition, which was small enough for safe operation of the reactor plant. (authors)

  19. Improved accuracy in estimation of left ventricular function parameters from QGS software with Tc-99m tetrofosmin gated-SPECT. A multivariate analysis

    International Nuclear Information System (INIS)

    Okizaki, Atsutaka; Shuke, Noriyuki; Sato, Junichi; Ishikawa, Yukio; Yamamoto, Wakako; Kikuchi, Kenjiro; Aburano, Tamio

    2003-01-01

    The purpose of this study was to verify whether the accuracy of left ventricular parameters related to left ventricular function from gated-SPECT improved or not, using multivariate analysis. Ninety-six patients with cardiovascular diseases were studied. Gated-SPECT with the quantitative gated SPECT (QGS) software and left ventriculography (LVG) were performed to obtain left ventricular ejection fraction (LVEF), end-diastolic volume (EDV) and end-systolic volume (ESV). Then, multivariate analyses were performed to determine empirical formulas for predicting these parameters. The calculated values of left ventricular parameters were compared with those obtained directly from the QGS software and LVG. Multivariate analyses were able to improve accuracy in estimation of LVEF, EDV and ESV. Statistically significant improvement was seen in LVEF (from r=0.6965 to r=0.8093, p<0.05). Although not statistically significant, improvements in correlation coefficients were seen in EDV (from r=0.7199 to r=0.7595, p=0.2750) and ESV (from r=0.5694 to r=0.5871, p=0.4281). The empirical equations with multivariate analysis improved the accuracy in estimating LVEF from gated-SPECT with the QGS software. (author)

  20. Multivariate analyses of small theropod dinosaur teeth and implications for paleoecological turnover through time.

    Directory of Open Access Journals (Sweden)

    Derek W Larson

    Full Text Available Isolated small theropod teeth are abundant in vertebrate microfossil assemblages, and are frequently used in studies of species diversity in ancient ecosystems. However, determining the taxonomic affinities of these teeth is problematic due to an absence of associated diagnostic skeletal material. Species such as Dromaeosaurus albertensis, Richardoestesia gilmorei, and Saurornitholestes langstoni are known from skeletal remains that have been recovered exclusively from the Dinosaur Park Formation (Campanian). It is therefore likely that teeth from different formations widely disparate in age or geographic position are not referable to these species. Tooth taxa without any associated skeletal material, such as Paronychodon lacustris and Richardoestesia isosceles, have also been identified from multiple localities of disparate ages throughout the Late Cretaceous. To address this problem, a dataset of measurements of 1183 small theropod teeth (the most specimen-rich theropod tooth dataset ever constructed) from North America ranging in age from Santonian through Maastrichtian were analyzed using multivariate statistical methods: canonical variate analysis, pairwise discriminant function analysis, and multivariate analysis of variance. The results indicate that teeth referred to the same taxon from different formations are often quantitatively distinct. In contrast, isolated teeth found in time equivalent formations are not quantitatively distinguishable from each other. These results support the hypothesis that small theropod taxa, like other dinosaurs in the Late Cretaceous, tend to be exclusive to discrete host formations. The methods outlined have great potential for future studies of isolated teeth worldwide, and may be the most useful non-destructive technique known of extracting the most data possible from isolated and fragmentary specimens. The ability to accurately assess species diversity and turnover through time based on isolated teeth

  1. Multivariate Analyses of Small Theropod Dinosaur Teeth and Implications for Paleoecological Turnover through Time

    Science.gov (United States)

    Larson, Derek W.; Currie, Philip J.

    2013-01-01

    Isolated small theropod teeth are abundant in vertebrate microfossil assemblages, and are frequently used in studies of species diversity in ancient ecosystems. However, determining the taxonomic affinities of these teeth is problematic due to an absence of associated diagnostic skeletal material. Species such as Dromaeosaurus albertensis, Richardoestesia gilmorei, and Saurornitholestes langstoni are known from skeletal remains that have been recovered exclusively from the Dinosaur Park Formation (Campanian). It is therefore likely that teeth from different formations widely disparate in age or geographic position are not referable to these species. Tooth taxa without any associated skeletal material, such as Paronychodon lacustris and Richardoestesia isosceles, have also been identified from multiple localities of disparate ages throughout the Late Cretaceous. To address this problem, a dataset of measurements of 1183 small theropod teeth (the most specimen-rich theropod tooth dataset ever constructed) from North America ranging in age from Santonian through Maastrichtian were analyzed using multivariate statistical methods: canonical variate analysis, pairwise discriminant function analysis, and multivariate analysis of variance. The results indicate that teeth referred to the same taxon from different formations are often quantitatively distinct. In contrast, isolated teeth found in time equivalent formations are not quantitatively distinguishable from each other. These results support the hypothesis that small theropod taxa, like other dinosaurs in the Late Cretaceous, tend to be exclusive to discrete host formations. The methods outlined have great potential for future studies of isolated teeth worldwide, and may be the most useful non-destructive technique known of extracting the most data possible from isolated and fragmentary specimens. 
    The ability to accurately assess species diversity and turnover through time based on isolated teeth will help illuminate

  2. MULTIVARIATE CURVE RESOLUTION OF NMR SPECTROSCOPY METABONOMIC DATA

    Science.gov (United States)

    Sandia National Laboratories is working with the EPA to evaluate and develop mathematical tools for analysis of the collected NMR spectroscopy data. Initially, we have focused on the use of Multivariate Curve Resolution (MCR) also known as molecular factor analysis (MFA), a tech...

  3. Multivariate analysis of nutritional information of foodstuff of plant origin for the selection of representative matrices for the analysis of pesticide residues

    International Nuclear Information System (INIS)

    Neves Bettencourt da Silva, Ricardo Jorge; Gomes Ferreira Crujo Camoes, Maria Filomena

    2010-01-01

    Testing the safety of foodstuffs of plant origin involves the analysis of hundreds of pesticide residues. This control is only cost-effective through the use of methods validated for the analysis of many thousands of analyte/matrix combinations. Several documents propose representative matrices of groups of matrices from which the validity of the analytical method can be extrapolated to the represented matrices after a summarised experimental check of within-group method performance homogeneity. Those groups are based on an evolved expert consensus drawn from empirical knowledge of the current analytical procedures; they are not exhaustive, they are not objectively defined and they propose a large list of representative matrices, which makes their application difficult. This work proposes grouping 240 matrices, based on the nutritional composition pattern equivalence of the analytical portion right after hydration and before solvent extraction, aiming at defining groups that observe method performance homogeneity. This grouping was based on the combined outcome of three multivariate tools, namely: Principal Component Analysis, Hierarchical Cluster Analysis and K-Means Cluster Analysis. These tools allowed the selection of eight groups for which representative matrices with average characteristics and objective criteria to test inclusion of new matrices were established. The proposed matrix groups are homogeneous with respect to nutritional data not considered in their definition but correlated with the studied multivariate nutritional pattern. The developed grouping, which must be checked with experimental tests before use, was tested against small deviations in food composition and for the integration of new matrices.

  4. Multivariate Welch t-test on distances

    OpenAIRE

    Alekseyenko, Alexander V.

    2016-01-01

    Motivation: Permutational non-Euclidean analysis of variance, PERMANOVA, is routinely used in exploratory analysis of multivariate datasets to draw conclusions about the significance of patterns visualized through dimension reduction. This method recognizes that pairwise distance matrix between observations is sufficient to compute within and between group sums of squares necessary to form the (pseudo) F statistic. Moreover, not only Euclidean, but arbitrary distances can be used. This method...
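
The statement that a pairwise distance matrix is sufficient to form the (pseudo) F statistic can be illustrated with a short sketch. It uses the standard PERMANOVA sums-of-squares identities; the function name `pseudo_f` and the toy data are hypothetical and are not taken from the paper, and the permutation step that turns the statistic into a p-value is omitted.

```python
from itertools import combinations


def pseudo_f(dist, labels):
    """PERMANOVA pseudo-F computed from a symmetric distance matrix.

    dist   : square list-of-lists of pairwise distances
    labels : group label for each observation (same order as dist)
    """
    n = len(labels)
    groups = sorted(set(labels))
    a = len(groups)

    # Total sum of squares: squared distances over all pairs, divided by N.
    ss_total = sum(dist[i][j] ** 2 for i, j in combinations(range(n), 2)) / n

    # Within-group sum of squares: the same quantity inside each group,
    # divided by that group's size.
    ss_within = 0.0
    for g in groups:
        idx = [i for i, lab in enumerate(labels) if lab == g]
        ss_within += sum(dist[i][j] ** 2 for i, j in combinations(idx, 2)) / len(idx)

    ss_between = ss_total - ss_within
    return (ss_between / (a - 1)) / (ss_within / (n - a))


# Toy example (hypothetical data): four 1-D points in two groups,
# with Euclidean distances |x_i - x_j|.
points = [0.0, 2.0, 10.0, 12.0]
labels = ["A", "A", "B", "B"]
dist = [[abs(p - q) for q in points] for p in points]
print(pseudo_f(dist, labels))
```

With Euclidean distances this reduces to the classical one-way ANOVA F ratio, which is why the toy example can be checked by hand; with other distances the same arithmetic applies, but significance has to be assessed by permuting the labels.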

  5. Cloud point extraction for determination of lead in blood samples of children, using different ligands prior to analysis by flame atomic absorption spectrometry: A multivariate study

    Energy Technology Data Exchange (ETDEWEB)

    Shah, Faheem, E-mail: shah_ceac@yahoo.com [National Center of Excellence in Analytical Chemistry, University of Sindh, Jamshoro 76080 (Pakistan); Kazi, Tasneem Gul, E-mail: tgkazi@yahoo.com [National Center of Excellence in Analytical Chemistry, University of Sindh, Jamshoro 76080 (Pakistan); Afridi, Hassan Imran, E-mail: hassanimranafridi@yahoo.com [National Center of Excellence in Analytical Chemistry, University of Sindh, Jamshoro 76080 (Pakistan); Naeemullah, E-mail: khannaeemullah@ymail.com [National Center of Excellence in Analytical Chemistry, University of Sindh, Jamshoro 76080 (Pakistan); Arain, Muhammad Balal, E-mail: bilal_ku2004@yahoo.com [Department of Chemistry, University of Science and Technology, Bannu, KPK (Pakistan); Baig, Jameel Ahmed, E-mail: jab_mughal@yahoo.com [National Center of Excellence in Analytical Chemistry, University of Sindh, Jamshoro 76080 (Pakistan)

    2011-09-15

    Highlights: • Trace levels of lead in blood samples of healthy children and children with different kidney disorders. • Pre-concentration of Pb²⁺ in acid-digested blood samples after chelating with two complexing reagents. • A multivariate technique was used for screening of significant factors that influence the CPE of Pb²⁺. • The level of Pb²⁺ in diseased children was significantly higher than in referents of the same age group. Abstract: The phase-separation phenomenon of non-ionic surfactants occurring in aqueous solution was used for the extraction of lead (Pb²⁺) from digested blood samples after simultaneous complexation with ammonium pyrrolidinedithiocarbamate (APDC) and diethyldithiocarbamate (DDTC) separately. The complexed analyte was quantitatively extracted with octylphenoxypolyethoxyethanol (Triton X-114). The multivariate strategy was applied to estimate the optimum values of experimental factors. Acidic ethanol was added to the surfactant-rich phase prior to its analysis by flame atomic absorption spectrometer (FAAS). The detection limit value of Pb²⁺ for the preconcentration of 10 mL of acid-digested blood sample was 1.14 µg L⁻¹. The accuracy of the proposed methods was assessed by analyzing certified reference material (whole blood). Under the optimized conditions of both CPE methods, 10 mL of Pb²⁺ standards (10 µg L⁻¹) complexed with APDC and DDTC permitted enhancement factors of 56 and 42, respectively. The proposed method was used for determination of Pb²⁺ in blood samples of children with kidney disorders and healthy controls.

  6. Multivariate analysis of marketing data - applications for bricolage market

    Directory of Open Access Journals (Sweden)

    FANARU Mihai

    2017-01-01

    Full Text Available By using concepts and analytical tools for computing, marketing is directly related to the quantitative methods of economic research and other areas where the efficiency of system performance is studied. Any activity of the company must be programmed and carried out taking into account the consumer. Providing a complete success in business requires the entrepreneur to see the company and its products through the consumer's eyes, to act as representative of its clients in order to acquire and satisfy their desires. Through its complex specific activities, marketing aims to provide the goods and services consumers require, or the right merchandise in the right quantity at the right price at the right time and place. An important consideration in capturing the link between marketing and multivariate statistical analysis is that the latter provides more powerful instruments that allow researchers to discover relationships between multiple configurations of variables, configurations that would otherwise remain hidden or barely visible. In addition, most methods can do this with good accuracy, with the possibility of testing the statistical significance by calculating the level of confidence associated with validating the link for the entire population and not just the investigated sample.

  7. Effect of carbo-nitride-rich and oxide-rich inclusions on the pitting susceptibility of depleted uranium

    International Nuclear Information System (INIS)

    Pu, Zhen; Chen, Xianglin; Meng, Xiandong; Wu, Yanping; Shen, Liang; Wang, Qingfu; Liu, Tianwei; Shuai, Maobing

    2017-01-01

    Highlights: • The Volta potential differences relative to the matrix are positive for both types of inclusions. • Both types of inclusions are cathodic in the “inclusion/matrix” microgalvanic couples. • The oxide-rich inclusions show a Volta potential about 115 mV larger than that of the carbo-nitride-rich inclusions. • The oxide-rich inclusions give stronger local galvanic coupling with the matrix. • The oxide-rich inclusions are more predisposed to initiate pitting corrosion. Abstract: The effects of carbo-nitride-rich and oxide-rich inclusions on the pitting susceptibility of depleted uranium were investigated by electrochemical corrosion measurements, optical microscopy, scanning Kelvin probe force microscopy (SKPFM), and SEM. The results of the potentiodynamic polarization tests suggest that oxide-rich inclusions are more likely to induce pitting corrosion than carbo-nitride-rich inclusions. This enhanced corrosion may be explained by the strong local galvanic coupling between the oxide-rich inclusions and the surrounding matrix: according to the SKPFM analysis, the oxide-rich inclusions exhibit a Volta potential about 115 mV higher than that of the carbo-nitride-rich inclusions.

  8. A Multivariant Stream Analysis Approach to Detect and Mitigate DDoS Attacks in Vehicular Ad Hoc Networks

    Directory of Open Access Journals (Sweden)

    Raenu Kolandaisamy

    2018-01-01

    Full Text Available Vehicular Ad Hoc Networks (VANETs) are rapidly gaining attention due to the diversity of services that they can potentially offer. However, VANET communication is vulnerable to numerous security threats such as Distributed Denial of Service (DDoS) attacks. Dealing with these attacks in VANETs is a challenging problem. Most of the existing DDoS detection techniques suffer from poor accuracy and high computational overhead. To cope with these problems, we present a novel Multivariant Stream Analysis (MVSA) approach. The proposed MVSA approach maintains multiple stages for detecting DDoS attacks in the network. The Multivariant Stream Analysis gives a unique result based on Vehicle-to-Vehicle communication through the Road Side Unit. The approach observes the traffic in different situations and time frames and maintains different rules for various traffic classes in various time windows. The performance of the MVSA is evaluated using an NS2 simulator. Simulation results demonstrate the effectiveness and efficiency of the MVSA regarding detection accuracy and reducing the impact on VANET communication.

  9. ANALYSIS AND CHARACTERIZATION OF OZONE-RICH EPISODES IN NORTHEAST PORTUGAL

    Science.gov (United States)

    Carvalho, A.; Monteiro, A.; Ribeiro, I.; Tchepel, O.; Miranda, A.; Borrego, C.; Saavedra, S.; Souto, J. A.; Casares, J. J.

    2009-12-01

    Each summer, extremely high ozone levels are registered at the rural background station of Lamas d’Olo, located in the Northeast of Portugal. On average, 30% of the total alert threshold registered in Portugal is detected at this site. The main purpose of this study is to characterize the atmospheric conditions that lead to the ozone-rich episodes. Synoptic pattern anomalies and back-trajectory cluster analysis were examined for 76 days, between 2004 and 2007, on which ozone maximum concentrations were above 200 µg m⁻³. The obtained anomaly fields show a positive temperature anomaly above the Iberian Peninsula. In addition, a strong wind flow pattern from the NE is visible in the North of Portugal and Galicia, in Spain. These two features may lead to an enhancement of the photochemical production and to the transport of pollutants from Spain to Portugal. In addition, the 3D mean back trajectories associated with the ozone episode days were analysed, and a clustering method was applied to them. Four main clusters of ozone-rich episodes were identified, with different frequencies of occurrence: north-westerly flows (11%), north-easterly flows (45%), southern flow (4%) and westerly flows (40%). Both analyses highlight the NE flow as a dominant pattern over the North of Portugal. The analysis of the ozone concentrations for each selected cluster indicates that this northeast circulation pattern, together with the southern flow, is responsible for the highest ozone peak episodes. This also suggests that long-range transport of atmospheric pollutants may be the main contributor to the ozone levels registered at Lamas d’Olo. This is also highlighted by the correlation of the ozone time series with the meteorological parameters analysed in the frequency domain.

  10. Characterization of ionizing radiation effects on bone using Fourier Transform Infrared Spectroscopy and multivariate analysis of spectra

    Energy Technology Data Exchange (ETDEWEB)

    Castro, Pedro Arthur Augusto de; Dias, Derly Augusto; Zezell, Denise Maria, E-mail: zezell@usp.br [Instituto de Pesquisas Energeticas e Nucleares (IPEN/CNEN-SP), Sao Paulo, SP (Brazil)

    2017-11-01

    Ionizing radiation has been used as an important treatment and diagnostic method for several diseases. Optical techniques provide an efficient clinical diagnostic to support an accurate evaluation of the interaction of radiation with molecules. Fourier-transform infrared spectroscopy coupled with attenuated total reflectance (ATR-FTIR) is a label-free and nondestructive optical technique that can recognize functional groups in biological samples. In this work, 30 fragments of bone were collected from bovine femur diaphysis. Samples were cut and polished to 1 cm x 1 cm x 1 mm and then stored properly in a refrigerated environment. Sample irradiation was performed with a Cobalt-60 Gammacell irradiator source at doses of 0.1 kGy and 1 kGy, whereas the fragments exposed to a dose of 15 kGy were irradiated in a multipurpose Cobalt-60 irradiator. Spectral data were submitted to principal component analysis (PCA) followed by linear discriminant analysis (LDA) using MATLAB R2015a software (The Mathworks Inc., Natick, MA, USA). We demonstrated the feasibility of using ATR-FTIR spectroscopy associated with the PCA-LDA multivariate technique to evaluate the molecular changes in bone matrix caused by different doses: 0.1 kGy, 1 kGy and 15 kGy. These alterations between the groups are mainly reported in the phosphate region. Our results open up new possibilities for protein monitoring relating to dose responses. (author)

  11. Characterization of ionizing radiation effects on bone using Fourier Transform Infrared Spectroscopy and multivariate analysis of spectra

    International Nuclear Information System (INIS)

    Castro, Pedro Arthur Augusto de; Dias, Derly Augusto; Zezell, Denise Maria

    2017-01-01

    Ionizing radiation has been used as an important treatment and diagnostic method for several diseases. Optical techniques provide an efficient clinical diagnostic to support an accurate evaluation of the interaction of radiation with molecules. Fourier-transform infrared spectroscopy coupled with attenuated total reflectance (ATR-FTIR) is a label-free and nondestructive optical technique that can recognize functional groups in biological samples. In this work, 30 fragments of bone were collected from bovine femur diaphysis. Samples were cut and polished to 1 cm x 1 cm x 1 mm and then stored properly in a refrigerated environment. Sample irradiation was performed with a Cobalt-60 Gammacell irradiator source at doses of 0.1 kGy and 1 kGy, whereas the fragments exposed to a dose of 15 kGy were irradiated in a multipurpose Cobalt-60 irradiator. Spectral data were submitted to principal component analysis (PCA) followed by linear discriminant analysis (LDA) using MATLAB R2015a software (The Mathworks Inc., Natick, MA, USA). We demonstrated the feasibility of using ATR-FTIR spectroscopy associated with the PCA-LDA multivariate technique to evaluate the molecular changes in bone matrix caused by different doses: 0.1 kGy, 1 kGy and 15 kGy. These alterations between the groups are mainly reported in the phosphate region. Our results open up new possibilities for protein monitoring relating to dose responses. (author)

  12. Are platelet-rich products necessary during the arthroscopic repair of full-thickness rotator cuff tears: a meta-analysis.

    Directory of Open Access Journals (Sweden)

    Qiang Zhang

    BACKGROUND: Platelet-rich products (PRP) are widely used for rotator cuff tears. However, whether platelet-rich products produce superior clinical or radiological outcomes is controversial. This study aims to use meta-analysis to compare clinical and radiological outcomes between groups with or without platelet-rich products. METHODS: The Pubmed, Embase, and Cochrane library databases were searched for relevant studies published before April 20, 2013. Studies were selected that clearly reported a comparison between the use or not of platelet-rich products. The Constant, ASES, UCLA, and SST scale systems and the rotator cuff retear rate were evaluated. The weighted mean differences and relative risks were calculated using a fixed-effects model. RESULTS: Seven studies were enrolled in this meta-analysis. No significant differences were found for the Constant scale (0.73, 95% CI, -1.82 to 3.27, P=0.58), ASES scale (-2.89, 95% CI, -6.31 to 0.53, P=0.1), UCLA scale (-0.79, 95% CI, -2.20 to 0.63, P=0.28), SST scale (0.34, 95% CI, -0.01 to 0.69, P=0.05), and the overall rotator cuff retear rate (0.71, 95% CI, 0.48 to 1.05, P=0.08). Subgroup analysis according to the initial tear size showed a lower retear rate in small- and medium-sized tears (0.33, 95% CI, 0.12 to 0.91, P=0.03) after platelet-rich product application but no difference for large- and massive-sized tears (0.86, 95% CI, 0.60 to 1.23, P=0.42). CONCLUSION: The meta-analysis suggests that platelet-rich products have no benefit for the overall clinical outcomes and retear rate in the arthroscopic repair of full-thickness rotator cuff tears. However, the retear rate decreased among patients treated with PRP for small- and medium-sized rotator cuff tears but not for large- and massive-sized tears. LEVEL OF EVIDENCE: Level II.
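
The fixed-effects pooling mentioned in the METHODS can be illustrated with a minimal sketch of inverse-variance weighting. The function name `fixed_effect_pool` and the per-study values are hypothetical, chosen only for illustration; they are not data from the seven studies, and the abstract does not say which software or exact weighting scheme the authors used.

```python
import math

def fixed_effect_pool(estimates, std_errors):
    """Pool per-study effect estimates with inverse-variance weights.

    Each study is weighted by 1 / SE^2, so more precise studies count more.
    Returns the pooled estimate, its standard error, and a 95% interval
    based on the normal approximation.
    """
    weights = [1.0 / (se ** 2) for se in std_errors]
    total_weight = sum(weights)
    pooled = sum(w * e for w, e in zip(weights, estimates)) / total_weight
    pooled_se = math.sqrt(1.0 / total_weight)
    interval = (pooled - 1.96 * pooled_se, pooled + 1.96 * pooled_se)
    return pooled, pooled_se, interval

# Hypothetical mean differences and standard errors for three studies.
mean_differences = [1.0, -0.5, 2.0]
standard_errors = [1.0, 2.0, 1.0]

pooled, pooled_se, (ci_low, ci_high) = fixed_effect_pool(
    mean_differences, standard_errors
)
print(f"pooled difference = {pooled:.2f} (95% CI {ci_low:.2f} to {ci_high:.2f})")
```

A fixed-effects model assumes every study estimates the same underlying effect; when that assumption is doubtful, a random-effects model is the usual alternative.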

  13. Provenance Study of Archaeological Ceramics from Syria Using XRF Multivariate Statistical Analysis and Thermoluminescence Dating

    OpenAIRE

    Bakraji, Elias Hanna; Abboud, Rana; Issa, Haissm

    2014-01-01

    Thermoluminescence (TL) dating and multivariate statistical methods based on radioisotope X-ray fluorescence analysis have been utilized to date and classify Syrian archaeological ceramics fragment from Tel Jamous site. 54 samples were analyzed by radioisotope X-ray fluorescence; 51 of them come from Tel Jamous archaeological site in Sahel Akkar region, Syria, which fairly represent ceramics belonging to the Middle Bronze Age (2150 to 1600 B.C.) and the remaining three samples come from Mar-T...

  14. A multivariate statistical study with a factor analysis of recent planktonic foraminiferal distribution in the Coromandel Coast of India

    Digital Repository Service at National Institute of Oceanography (India)

    Jayalakshmy, K.V.; Rao, K.K.

    A study of planktonic foraminiferal assemblages from 19 stations in the neritic and oceanic regions off the Coromandel Coast, Bay of Bengal has been made using a multivariate statistical method termed as factor analysis. On the basis of abundance...

  15. Spatial and Temporal Assessment on Drug Addiction Using Multivariate Analysis and GIS

    International Nuclear Information System (INIS)

    Mohd Ekhwan Toriman; Siti Nor Fazillah Abdullah; Izwan Arif Azizan; Mohd Khairul Amri Kamarudin; Roslan Umar; Nasir Mohamad

    2015-01-01

    There is a need for managing and displaying drug addiction phenomena and trends at both spatial and temporal scales. A spatial and temporal assessment of drug addiction in Terengganu was undertaken to identify districts that fall in the same cluster, to identify hot spot areas, and to analyse the trend of drug addiction. The data used were a topographic map of Terengganu and the number of drug-addicted persons in Terengganu by district over 10 years (2004-2013). The number of drug-addicted persons by district was mapped using a Geographic Information System and analysed with cluster analysis, a multivariate technique, which was applied to the database in order to validate the correlation between data in the same cluster. The cluster analysis of the number of drug-addicted persons by district generated three clusters: Besut and Kuala Terengganu in cluster 1, named moderate drug addicted person (MDA); Dungun, Marang, Setiu and Hulu Terengganu in cluster 2, named lower drug addicted person (LDA); and Kemaman in cluster 3, named high drug addicted person (HDA). This analysis indicates that cluster 3, Kemaman, is a hot spot area. These results are useful for stakeholders monitoring and managing this problem, especially in the hot spot area, which needs particular emphasis. (author)

  16. Multivariate statistical methods a primer

    CERN Document Server

    Manly, Bryan FJ

    2004-01-01

    THE MATERIAL OF MULTIVARIATE ANALYSIS: Examples of Multivariate Data; Preview of Multivariate Methods; The Multivariate Normal Distribution; Computer Programs; Graphical Methods; Chapter Summary; References. MATRIX ALGEBRA: The Need for Matrix Algebra; Matrices and Vectors; Operations on Matrices; Matrix Inversion; Quadratic Forms; Eigenvalues and Eigenvectors; Vectors of Means and Covariance Matrices; Further Reading; Chapter Summary; References. DISPLAYING MULTIVARIATE DATA: The Problem of Displaying Many Variables in Two Dimensions; Plotting Index Variables; The Draftsman's Plot; The Representation of Individual Data Points; Profiles o

  17. Multivariate analysis of flow cytometric data using decision trees.

    Science.gov (United States)

    Simon, Svenja; Guthke, Reinhard; Kamradt, Thomas; Frey, Oliver

    2012-01-01

    Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents now allow the simultaneous assessment of multiple markers at the single-cell level, generating multidimensional data sets that require multivariate statistical analysis. We explored the explanatory power of the supervised machine learning method called "induction of decision trees" in flow cytometric data. In order to examine whether the production of a certain cytokine depends on other cytokines, datasets from intracellular staining for six cytokines with complex patterns of co-expression were analyzed by induction of decision trees. After weighting the data according to their class probabilities, we created a total of 13,392 different decision trees for each given cytokine with different parameter settings. For a more realistic estimation of the decision trees' quality, we used stratified fivefold cross validation and chose the "best" tree according to a combination of different quality criteria. While some of the decision trees reflected previously known co-expression patterns, we found that the expression of some cytokines was not only dependent on the co-expression of others per se, but was also dependent on the intensity of expression. Thus, for the first time we successfully used induction of decision trees for the analysis of high-dimensional flow cytometric data and demonstrated the feasibility of this method to reveal structural patterns in such data sets.

  18. Immediate versus delayed intramedullary nailing for open fractures of the tibial shaft: a multivariate analysis of factors affecting deep infection and fracture healing.

    Science.gov (United States)

    Yokoyama, Kazuhiko; Itoman, Moritoshi; Uchino, Masataka; Fukushima, Kensuke; Nitta, Hiroshi; Kojima, Yoshiaki

    2008-10-01

    The purpose of this study was to evaluate contributing factors affecting deep infection and fracture healing of open tibia fractures treated with locked intramedullary nailing (IMN) by multivariate analysis. We examined 99 open tibial fractures (98 patients) treated with immediate or delayed locked IMN in static fashion from 1991 to 2002. Multivariate analyses following univariate analyses were performed to determine predictors of deep infection, nonunion, and healing time to union. The following predictive variables of deep infection were selected for analysis: age, sex, Gustilo type, fracture grade by AO type, fracture location, timing or method of IMN, reamed or unreamed nailing, debridement time (6 h), method of soft-tissue management, skin closure time (1 week), existence of polytrauma (ISS or =18), existence of floating knee injury, and existence of superficial/pin site infection. The predictive variables of nonunion selected for analysis were the same as those for deep infection, with deep infection substituted for pin site infection. The predictive variables of union time selected for analysis were the same as those for nonunion, excluding location, debridement time, and existence of floating knee and superficial infection. Six (6.1%; type II Gustilo n=1, type IIIB Gustilo n=5) of the 99 open tibial fractures developed deep infections. Multivariate analysis revealed that timing or method of IMN, debridement time, method of soft-tissue management, and existence of superficial or pin site infection significantly correlated with the occurrence of deep infection (Prate in type IIIB + IIIC was significantly higher than those in type I + II and IIIA (P = 0.016). Nonunion occurred in 17 fractures (20.3%, 17/84). Multivariate analysis revealed that Gustilo type, skin closure time, and existence of deep infection significantly correlated with occurrence of nonunion (P < 0.05). Gustilo type and existence of deep infection were significantly correlated

  19. Linear Stability Analysis of Laminar Premixed Fuel-Rich Double-Spray Flames

    Directory of Open Access Journals (Sweden)

    Noam Weinberg

    2014-03-01

    This paper considers the stability of a double-spray premixed flame formed when both fuel and oxidizer are initially present in the form of sprays of evaporating liquid droplets. To simplify the inherent complexity that characterizes the analytic solution of multi-phase combustion processes, the analysis is restricted to fuel-rich laminar premixed double-spray flames, and assumes a single-step global chemical reaction mechanism. Steady-state solutions are obtained and the sensitivity of the flame temperature and the flame propagating velocity to the initial liquid fuel and/or oxidizer loads are established. The stability analysis revealed an increased proneness to cellular instability induced by the presence of the two sprays, and for the fuel-rich case considered here the influence of the liquid oxidizer was found to be more pronounced than that of the liquid fuel. Similar effects were noted for the neutral pulsating stability boundaries. The impact of unequal latent heats of vaporization is also investigated and found to be in keeping with the destabilizing influence of heat loss due to droplet evaporation. It should be noted that as far as the authors are aware no experimental evidence is available for (at least) validation of the predictions. However, they do concur in a general and reasonable fashion with independent experimental evidence in the literature of the behavior of single fuel spray laminar premixed flames.

  20. Multivariate Analysis of Some Pine Forested Areas of Azad Kashmir-Pakistan

    International Nuclear Information System (INIS)

    Bokhari, T.Z.; Liu, Y.; Li, Q.; Malik, S.A.; Ahmed, M.; Siddiqui, M.F.; Khan, Z.U.

    2016-01-01

    Floristic composition and communities in the Azad Kashmir area of Pakistan were studied using multivariate analysis. Quantitative sampling at thirty-one sites was carried out in different coniferous forests of Azad Kashmir in order to analyze the effects of past earthquakes and landslides on the vegetation of these areas. Because the coniferous forests were highly disturbed, either naturally or by anthropogenic activities, sampling favored forests near the fault line. Trees were sampled using the Point Centered Quarter (PCQ) method. Cluster analysis (using Ward's method) yielded six groups dominated by different conifer species. Groups I and V were dominated by Pinus wallichiana, while this species was co-dominant in group III. Other groups showed the dominance of different conifer species, i.e. Cedrus deodara, Pinus roxburghii, Picea smithiana and Abies pindrow. Both the cluster analysis and the ordination technique (two-dimensional non-metric multidimensional scaling) classify and ordinate the structure of the various groups, indicating interrelationships among different species. The groups of trees could readily be superimposed on the NMS ordination axes; they were well classified and well separated in the ordination. The present research revealed that these forests had a diverse and asymmetric structure due to anthropogenic disturbances and overgrazing, which were key factors in addition to natural disturbances. However, some of the forests showed a considerably stable structure due to less human interference. (author)

  1. Understanding the groundwater dynamics in the Southern Rift Valley Lakes Basin (Ethiopia). Multivariate statistical analysis method, oxygen (δ¹⁸O) and deuterium (δ²H)

    International Nuclear Information System (INIS)

    Girum Admasu Nadew; Zebene Lakew Tefera

    2013-01-01

    Multivariate statistical analysis is very important for classifying waters of different hydrochemical groups. Statistical techniques, such as cluster analysis, can provide a powerful tool for analyzing water chemistry data. This method is used to test water quality data and determine if samples can be grouped into distinct populations that may be significant in the geologic context, as well as from a statistical point of view. A multivariate statistical analysis method is applied to the geochemical data in combination with δ¹⁸O and δ²H isotopes with the objective of understanding the dynamics of groundwater using hierarchical clustering and isotope analyses. The geochemical and isotope data of the central and southern rift valley lakes have been collected and analyzed from different works. Isotope analysis shows that most springs and boreholes are recharged by July and August rainfalls. The different hydrochemical groups that resulted from the multivariate analysis are described and correlated with the geology of the area and whether it has any interaction with a system or not. (author)

  2. Performance of an iterative two-stage bayesian technique for population pharmacokinetic analysis of rich data sets

    NARCIS (Netherlands)

    Proost, Johannes H.; Eleveld, Douglas J.

    2006-01-01

    Purpose. To test the suitability of an Iterative Two-Stage Bayesian (ITSB) technique for population pharmacokinetic analysis of rich data sets, and to compare ITSB with Standard Two-Stage (STS) analysis and nonlinear Mixed Effect Modeling (MEM). Materials and Methods. Data from a clinical study with

  3. Models of alien species richness show moderate predictive accuracy and poor transferability

    Directory of Open Access Journals (Sweden)

    César Capinha

    2018-06-01

    Robust predictions of alien species richness are useful to assess global biodiversity change. Nevertheless, the capacity to predict spatial patterns of alien species richness remains largely unassessed. Using 22 data sets of alien species richness from diverse taxonomic groups and covering various parts of the world, we evaluated whether different statistical models were able to provide useful predictions of absolute and relative alien species richness, as a function of explanatory variables representing geographical, environmental and socio-economic factors. Five state-of-the-art count data modelling techniques were used and compared: Poisson and negative binomial generalised linear models (GLMs), multivariate adaptive regression splines (MARS), random forests (RF) and boosted regression trees (BRT). We found that predictions of absolute alien species richness had a low to moderate accuracy in the region where the models were developed and a consistently poor accuracy in new regions. Predictions of relative richness performed in a superior manner in both geographical settings, but still were not good. Flexible tree ensemble-type techniques (RF and BRT) were shown to be significantly better in modelling alien species richness than parametric linear models (such as GLM), despite the latter being more commonly applied for this purpose. Importantly, the poor spatial transferability of models also warrants caution in assuming the generality of the relationships they identify, e.g. by applying projections under future scenario conditions. Ultimately, our results strongly suggest that predictability of spatial variation in alien species richness is limited. The somewhat more robust ability to rank regions according to the number of aliens they have (i.e. relative richness) suggests that models of alien species richness may be useful for prioritising and comparing regions, but not for predicting exact species numbers.

  4. Multivariate phase type distributions - Applications and parameter estimation

    DEFF Research Database (Denmark)

    Meisch, David

    The best known univariate probability distribution is the normal distribution. It is used throughout the literature in a broad field of applications. In cases where it is not sensible to use the normal distribution alternative distributions are at hand and well understood, many of these belonging...... and statistical inference, is the multivariate normal distribution. Unfortunately only little is known about the general class of multivariate phase type distribution. Considering the results concerning parameter estimation and inference theory of univariate phase type distributions, the class of multivariate...... projects and depend on reliable cost estimates. The Successive Principle is a group analysis method primarily used for analyzing medium to large projects in relation to cost or duration. We believe that the mathematical modeling used in the Successive Principle can be improved. We suggested a novel...

  5. Systematic wavelength selection for improved multivariate spectral analysis

    Science.gov (United States)

    Thomas, Edward V.; Robinson, Mark R.; Haaland, David M.

    1995-01-01

    Methods and apparatus for determining in a biological material one or more unknown values of at least one known characteristic (e.g. the concentration of an analyte such as glucose in blood or the concentration of one or more blood gas parameters) with a model based on a set of samples with known values of the known characteristics and a multivariate algorithm using several wavelength subsets. The method includes selecting multiple wavelength subsets, from the electromagnetic spectral region appropriate for determining the known characteristic, for use by an algorithm wherein the selection of wavelength subsets improves the model's fitness of the determination for the unknown values of the known characteristic. The selection process utilizes multivariate search methods that select both predictive and synergistic wavelengths within the range of wavelengths utilized. The fitness of the wavelength subsets is determined by the fitness function F = f(cost, performance). The method includes the steps of: (1) using one or more applications of a genetic algorithm to produce one or more count spectra, with multiple count spectra then combined to produce a combined count spectrum; (2) smoothing the count spectrum; (3) selecting a threshold count from a count spectrum to select these wavelength subsets which optimize the fitness function; and (4) eliminating a portion of the selected wavelength subsets. The determination of the unknown values can be made: (1) noninvasively and in vivo; (2) invasively and in vivo; or (3) in vitro.
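
Steps (1) to (3) of the method can be sketched roughly as follows, with the genetic algorithm itself left out. The sketch assumes each genetic-algorithm run has already produced a count spectrum (how often each wavelength index was selected). The moving-average smoother, the threshold value, and all function names are illustrative assumptions, not details taken from the patent abstract.

```python
def combine_counts(count_spectra):
    """Sum per-run count spectra into one combined count spectrum."""
    return [sum(column) for column in zip(*count_spectra)]

def moving_average(values, window=3):
    """Smooth with a centered moving average, shrinking the window at the edges."""
    half = window // 2
    smoothed = []
    for i in range(len(values)):
        lo = max(0, i - half)
        hi = min(len(values), i + half + 1)
        smoothed.append(sum(values[lo:hi]) / (hi - lo))
    return smoothed

def select_wavelengths(smoothed_counts, threshold):
    """Keep the wavelength indices whose smoothed count reaches the threshold."""
    return [i for i, count in enumerate(smoothed_counts) if count >= threshold]

# Hypothetical count spectra from two genetic-algorithm runs over 5 wavelengths.
runs = [
    [0, 1, 3, 1, 0],
    [0, 2, 3, 0, 0],
]
combined = combine_counts(runs)
smoothed = moving_average(combined, window=3)
selected = select_wavelengths(smoothed, threshold=3.0)
print(combined, selected)
```

In the method as described, the threshold would be chosen to optimize the fitness function F rather than fixed by hand as it is here.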

  6. Multivariate Empirical Mode Decomposition Based Signal Analysis and Efficient-Storage in Smart Grid

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Lu [University of Tennessee, Knoxville (UTK); Albright, Austin P [ORNL; Rahimpour, Alireza [University of Tennessee, Knoxville (UTK); Guo, Jiandong [University of Tennessee, Knoxville (UTK); Qi, Hairong [University of Tennessee, Knoxville (UTK); Liu, Yilu [University of Tennessee (UTK) and Oak Ridge National Laboratory (ORNL)

    2017-01-01

    Wide-area-measurement systems (WAMSs) are used in smart grid systems to enable the efficient monitoring of grid dynamics. However, the overwhelming amount of data and the severe contamination from noise often impede the effective and efficient data analysis and storage of WAMS generated measurements. To solve this problem, we propose a novel framework that takes advantage of Multivariate Empirical Mode Decomposition (MEMD), a fully data-driven approach to analyzing non-stationary signals, dubbed MEMD based Signal Analysis (MSA). The frequency measurements are considered as a linear superposition of different oscillatory components and noise. The low-frequency components, corresponding to the long-term trend and inter-area oscillations, are grouped and compressed by MSA using the mean shift clustering algorithm, whereas higher-frequency components, mostly noise and potentially part of high-frequency inter-area oscillations, are analyzed using Hilbert spectral analysis and delineated by statistical behavior. By conducting experiments on both synthetic and real-world data, we show that the proposed framework can capture the characteristics, such as trends and inter-area oscillation, while reducing the data storage requirements.

  7. Application of multivariate techniques to analytical data on Aegean ceramics

    International Nuclear Information System (INIS)

    Bieber, A.M.; Brooks, D.W.; Harbottle, G.; Sayre, E.V.

    1976-01-01

    The general problems of data collection and handling for multivariate elemental analyses of ancient pottery are considered, including such specific questions as the level of analytical precision required, the number and type of elements to be determined and the need for comprehensive multivariate statistical analysis of the collected data in contrast to element by element statistical analysis. The multivariate statistical procedures of clustering in a multidimensional space and determination of the numerical probabilities of specimens belonging to a group through calculation of the Mahalanobis distances for these specimens in multicomponent space are described together with supporting univariate statistical procedures used at Brookhaven. The application of these techniques to the data on Late Bronze Age Aegean pottery (largely previously analysed at Oxford and Brookhaven with some new specimens considered) has resulted in meaningful subdivisions of previously established groups. (author)
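
As an illustration of the Mahalanobis-distance step, the sketch below computes the squared distance of a specimen from a reference group in the two-element case. The element values are hypothetical and the helper names are assumptions. The probability shown is the chi-square tail for two variables, which treats the group mean and covariance as known; the abstract does not give the exact procedure used at Brookhaven.

```python
import math

def mean_and_cov_2d(points):
    """Return the mean vector and sample covariance terms (sxx, sxy, syy)."""
    n = len(points)
    mx = sum(p[0] for p in points) / n
    my = sum(p[1] for p in points) / n
    sxx = sum((p[0] - mx) ** 2 for p in points) / (n - 1)
    syy = sum((p[1] - my) ** 2 for p in points) / (n - 1)
    sxy = sum((p[0] - mx) * (p[1] - my) for p in points) / (n - 1)
    return (mx, my), (sxx, sxy, syy)

def mahalanobis_sq(x, mean, cov):
    """Squared Mahalanobis distance using the closed-form 2x2 inverse."""
    sxx, sxy, syy = cov
    det = sxx * syy - sxy ** 2
    dx, dy = x[0] - mean[0], x[1] - mean[1]
    return (syy * dx * dx - 2.0 * sxy * dx * dy + sxx * dy * dy) / det

# Hypothetical two-element concentrations for a reference group of sherds.
group = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
group_mean, group_cov = mean_and_cov_2d(group)

d2_center = mahalanobis_sq((1.0, 1.0), group_mean, group_cov)
d2_outlier = mahalanobis_sq((3.0, 1.0), group_mean, group_cov)

# Approximate membership probability: chi-square tail with 2 degrees of freedom.
p_outlier = math.exp(-d2_outlier / 2.0)
print(d2_center, d2_outlier, p_outlier)
```

With small groups, a Hotelling T² correction is generally more appropriate than the plain chi-square tail, since the covariance is itself estimated from few specimens.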

  8. Multivariate Analysis of Hemicelluloses in Bleached Kraft Pulp Using Infrared Spectroscopy.

    Science.gov (United States)

    Chen, Zhiwen; Hu, Thomas Q; Jang, Ho Fan; Grant, Edward

    2016-12-01

    The hemicellulose composition of a pulp significantly affects its chemical and physical properties and thus represents an important process control variable. However, complicated steps of sample preparation make standard methods for the carbohydrate analysis of pulp samples, such as high performance liquid chromatography (HPLC), expensive and time-consuming. In contrast, pulp analysis by attenuated total internal reflection Fourier transform infrared spectroscopy (ATR FT-IR) requires little sample preparation. Here we show that ATR FT-IR with discrete wavelet transform (DWT) and standard normal variate (SNV) spectral preprocessing offers a convenient means for the qualitative and quantitative analysis of hemicelluloses in bleached kraft pulp and alkaline treated kraft pulp. The pulp samples investigated include bleached softwood kraft pulps, bleached hardwood kraft pulps, and their mixtures, as obtained from Canadian industry mills or blended in a lab, and bleached kraft pulp samples treated with 0-6% NaOH solutions. In the principal component analysis (PCA) of these spectra, we find the potential both to differentiate all pulps on the basis of hemicellulose compositions and to distinguish bleached hardwood pulps by species. Partial least squares (PLS) multivariate analysis gives a root mean square error of prediction (RMSEP) of 0.442 wt% for xylan content and 0.233 wt% for mannan content. These data all support the idea that ATR FT-IR has great potential to rapidly and accurately predict the content of xylan and mannan for bleached kraft pulps (softwood, hardwood, and their mixtures) in industry. However, the prediction of xylan and mannan concentrations presented a difficulty for pulp samples with modified cellulose crystalline structure. © The Author(s) 2016.
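
Two quantities named in this abstract, SNV preprocessing and RMSEP, have simple definitions that can be sketched directly. The spectrum and the predicted and reference values below are hypothetical, and the function names are illustrative; the wavelet step and the PLS model itself are not reproduced here.

```python
import math

def snv(spectrum):
    """Standard normal variate: center one spectrum and scale it by its own
    sample standard deviation, which reduces scatter-related offsets."""
    n = len(spectrum)
    mean = sum(spectrum) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in spectrum) / (n - 1))
    return [(v - mean) / sd for v in spectrum]

def rmsep(predicted, reference):
    """Root mean square error of prediction over a validation set."""
    squared = [(p - r) ** 2 for p, r in zip(predicted, reference)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical absorbance values for one spectrum.
scaled = snv([1.0, 2.0, 3.0])

# Hypothetical predicted vs. reference contents in wt%.
error = rmsep([1.0, 2.0, 3.0], [1.0, 2.0, 5.0])
print(scaled, error)
```

SNV is applied to each spectrum independently, so it needs no reference spectrum or calibration set.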

  9. A simple ergonomic measure reduces fluoroscopy time during ERCP: A multivariate analysis.

    Science.gov (United States)

    Jowhari, Fahd; Hopman, Wilma M; Hookey, Lawrence

    2017-03-01

    Background and study aims: Endoscopic retrograde cholangiopancreatography (ERCP) carries a radiation risk to patients undergoing the procedure and the team performing it. Fluoroscopy time (FT) has been shown to have a linear relationship with radiation exposure during ERCP. Recent modifications to our ERCP suite design were felt to impact fluoroscopy time and ergonomics. This multivariate analysis was therefore undertaken to investigate these effects, and to identify and validate various clinical, procedural and ergonomic factors influencing the total fluoroscopy time during ERCP. This would better assist clinicians in predicting prolonged fluoroscopic durations and in taking relevant precautions accordingly. Patients and methods: A retrospective analysis of 299 ERCPs performed by 4 endoscopists over an 18-month period at a single tertiary care center was conducted. All inpatients/outpatients (121 males, 178 females) undergoing ERCP for any clinical indication from January 2012 to June 2013 in the chosen ERCP suite were included in the study. Various predetermined clinical, procedural and ergonomic factors were obtained via chart review. Univariate analyses identified factors to be included in the multivariate regression model with FT as the dependent variable. Results: Bringing the endoscopy and fluoroscopy screens next to each other was associated with a significantly shorter FT than when the screens were separated further (-1.4 min, P = 0.026). Other significant factors associated with a prolonged FT included having a prior ERCP (+1.4 min, P = 0.031), and more difficult procedures (+4.2 min for each level of difficulty, P < 0.001). ERCPs performed by high-volume endoscopists used less FT than those performed by low-volume endoscopists (-1.82, P = 0.015). Conclusions: Our study has identified and validated various factors that affect the total fluoroscopy time during ERCP. This is the first study to show that decreasing the distance

  10. Seizure-Onset Mapping Based on Time-Variant Multivariate Functional Connectivity Analysis of High-Dimensional Intracranial EEG: A Kalman Filter Approach.

    Science.gov (United States)

    Lie, Octavian V; van Mierlo, Pieter

    2017-01-01

    The visual interpretation of intracranial EEG (iEEG) is the standard method used in complex epilepsy surgery cases to map the regions of seizure onset targeted for resection. Still, visual iEEG analysis is labor-intensive and biased due to interpreter dependency. Multivariate parametric functional connectivity measures using adaptive autoregressive (AR) modeling of the iEEG signals based on the Kalman filter algorithm have been used successfully to localize the electrographic seizure onsets. Due to their high computational cost, these methods have been applied to a limited number of iEEG time-series (Kalman filter implementations, a well-known multivariate adaptive AR model (Arnold et al. 1998) and a simplified, computationally efficient derivation of it, for their potential application to connectivity analysis of high-dimensional (up to 192 channels) iEEG data. When used on simulated seizures together with a multivariate connectivity estimator, the partial directed coherence, the two AR models were compared for their ability to reconstitute the designed seizure signal connections from noisy data. Next, focal seizures from iEEG recordings (73-113 channels) in three patients rendered seizure-free after surgery were mapped with the outdegree, a graph-theory index of outward directed connectivity. Simulation results indicated high levels of mapping accuracy for the two models in the presence of low-to-moderate noise cross-correlation. Accordingly, both AR models correctly mapped the real seizure onset to the resection volume. This study supports the possibility of conducting fully data-driven multivariate connectivity estimations on high-dimensional iEEG datasets using the Kalman filter approach.

  11. Correlation of aqueous solubility of salts of benzylamine with experimentally and theoretically derived parameters. A multivariate data analysis approach

    DEFF Research Database (Denmark)

    Parshad, Henrik; Frydenvang, Karla Andrea; Liljefors, Tommy

    2002-01-01

    Twenty two salts of benzylamine and p-substituted benzoic acids were prepared and characterized. The p-substituent was varied with regard to electronic, hydrophobic, and steric effects as well as hydrogen bonding potential. A multivariate data analysis was used to describe the relationship between...

  12. Oil price and financial markets: Multivariate dynamic frequency analysis

    International Nuclear Information System (INIS)

    Creti, Anna; Ftiti, Zied; Guesmi, Khaled

    2014-01-01

    The aim of this paper is to study the degree of interdependence between oil price and stock market index into two groups of countries: oil-importers and oil-exporters. To this end, we propose a new empirical methodology allowing a time-varying dynamic correlation measure between the stock market index and the oil price series. We use the frequency approach proposed by Priestley and Tong (1973), that is the evolutionary co-spectral analysis. This method allows us to distinguish between short-run and medium-run dependence. In order to complete our study by analysing long-run dependence, we use the cointegration procedure developed by Engle and Granger (1987). We find that interdependence between the oil price and the stock market is stronger in exporters' markets than in the importers' ones.
    Highlights:
    • A new time-varying measure for the stock markets and oil price relationship in different horizons.
    • We propose a new empirical methodology: multivariate frequency approach.
    • We propose a comparison between oil importing and exporting countries.
    • We show that oil is not always countercyclical with respect to stock markets.
    • When high oil prices originate from supply shocks, oil is countercyclical with stock markets

  13. Multivariate analysis techniques

    Energy Technology Data Exchange (ETDEWEB)

    Bendavid, Josh [European Organization for Nuclear Research (CERN), Geneva (Switzerland); Fisher, Wade C. [Michigan State Univ., East Lansing, MI (United States); Junk, Thomas R. [Fermi National Accelerator Lab. (FNAL), Batavia, IL (United States)

    2016-01-01

    The end products of experimental data analysis are designed to be simple and easy to understand: hypothesis tests and measurements of parameters. But, the experimental data themselves are voluminous and complex. Furthermore, in modern collider experiments, many petabytes of data must be processed in search of rare new processes which occur together with much more copious background processes that are of less interest to the task at hand. The systematic uncertainties on the background may be larger than the expected signal in many cases. The statistical power of an analysis and its sensitivity to systematic uncertainty can therefore usually both be improved by separating signal events from background events with higher efficiency and purity.

  14. FIA data and species diversity—successes and failures using multivariate analysis techniques, spatial lag and error models and hot-spot analysis

    Science.gov (United States)

    Andrew J. Hartsell

    2015-01-01

    This study will investigate how global and local predictors differ with varying spatial scale in relation to species evenness and richness in the gulf coastal plain. Particularly, all-live trees >= one-inch d.b.h. Forest Inventory and Analysis (FIA) data was used as the basis for the study. Watersheds are defined by the USGS 12 digit hydrologic units. The...

  15. Replacement tunnelled dialysis catheters for haemodialysis access: Same site, new site, or exchange — A multivariate analysis and risk score

    International Nuclear Information System (INIS)

    Tapping, C.R.; Scott, P.M.; Lakshminarayan, R.; Ettles, D.F.; Robinson, G.J.

    2012-01-01

    Aim: To identify variables related to complications following tunnelled dialysis catheter (TDC) replacement and to stratify the risk in order to reduce morbidity in patients with end-stage renal disease. Materials and methods: One hundred and forty TDCs (Split Cath, medCOMP) were replaced in 140 patients over a 5 year period. Multiple variables were retrospectively collected and analysed to stratify the risk and to predict which patients were more likely to suffer complications. Multivariate regression analysis was used to identify variables predictive of complications. Results: There were six immediate complications, 42 early complications, and 37 late complications. Multivariate analysis revealed that the variables significantly associated with complications were: female sex (p = 0.003; OR 2.9); previous TDC in the same anatomical position in the past (p = 0.014; OR 4.1); catheter exchange (p = 0.038; OR 3.8); haemoglobin 15 s (p = 0.002; OR 4.1); and C-reactive protein >50 mg/l (p = 0.007; OR 4.6). A high-risk score, which used the values from the multivariate analysis, predicted 100% of the immediate complications, 95% of the early complications, and 68% of the late complications. Conclusion: Patients can now be scored prior to TDC replacement. A patient with a high-risk score can be optimized to reduce the chance of complications. Further prospective studies to confirm that rotating the site of TDC reduces complications are warranted as this has implications for current guidelines.

  16. Defining climate zones in México City using multivariate analysis

    NARCIS (Netherlands)

    Estrada, Feporrua; Martínez-Arroyo, A.; Fernández-Eguiarte, A.; Luyando, E.; Gay, C.

    2009-01-01

    Spatial variability in the climate of México City was studied using multivariate methods to analyze 30 years of meteorological data from 37 stations (from the Servicio Meteorológico Nacional) located within the city. Although it covers a relatively small area, México City encompasses considerable

  17. Platelet-rich fibrin: the benefits.

    Science.gov (United States)

    Kumar, Yuvika Raj; Mohanty, Sujata; Verma, Mahesh; Kaur, Raunaq Reet; Bhatia, Priyanka; Kumar, Varun Raj; Chaudhary, Zainab

    2016-01-01

    Current published data presents confusing results about the effects of platelet-rich fibrin on bone, and there is a need for studies that throw light on its effect. Our main objective therefore was to evaluate (by fractal analysis) osseous regeneration in extraction sockets with and without platelet-rich fibrin in a study with a substantial sample and a reliable technique to calibrate its effects on bone cells. We also assessed the soft tissue response. Thirty-four patients had their bilaterally impacted third molars (68 surgical sites) extracted in this split-mouth study, following which platelet-rich fibrin was placed in one of the sockets. Patients were followed up clinically and radiographically, and a pain score and fractal analysis were used to evaluate healing of soft tissue and bone, respectively. We conclude that platelet-rich fibrin improves healing of both soft and hard tissues. Although osseous healing did not differ significantly between the groups, healing of soft tissue as judged by the pain score was significantly better in the experimental group. Copyright © 2015 The British Association of Oral and Maxillofacial Surgeons. Published by Elsevier Ltd. All rights reserved.

  18. 1H NMR and Multivariate Analysis for Geographic Characterization of Commercial Extra Virgin Olive Oil: A Possible Correlation with Climate Data

    Directory of Open Access Journals (Sweden)

    Domenico Rongai

    2017-11-01

    1H Nuclear Magnetic Resonance (NMR) spectroscopy coupled with multivariate analysis has been applied to investigate the metabolomic profiles of more than 200 extra virgin olive oils (EVOOs) collected over a period of four years (2009–2012) from different geographic areas. In particular, commercially blended EVOO samples originating from different Italian regions (Tuscany, Sicily and Apulia), as well as from European (Spain and Portugal) and non-European (Tunisia, Turkey, Chile and Australia) countries, were analyzed. Multivariate statistical analysis (Principal Component Analysis (PCA) and Orthogonal Partial Least Squares Discriminant Analysis (OPLS-DA)) applied to the NMR data revealed marked differences between Italian (in particular from the Tuscany, Sicily and Apulia regions) and foreign (in particular Tunisian) EVOO samples. A possible correlation with available climate data has also been investigated. These results aim to develop a powerful NMR-based tool able to protect Italian olive oil production.

  19. Search for the Higgs Boson in the H→ ZZ(*)→4μ Channel in CMS Using a Multivariate Analysis

    International Nuclear Information System (INIS)

    Alonso Diaz, A.

    2007-01-01

    This note presents a Higgs boson search analysis in the CMS detector of the LHC accelerator (CERN, Geneva, Switzerland) in the H→ ZZ(*)→4μ channel, using a multivariate method. This analysis, based on a Higgs boson mass dependent likelihood constructed from discriminant variables, provides a significant improvement of the Higgs boson discovery potential over a wide mass range with respect to the official analysis published by CMS, which is based on orthogonal cuts independent of the Higgs boson mass. (Author) 8 refs

  20. Multivariate analysis of structure and contribution per shares made by potential risk factors at malignant neoplasms in trachea, bronchial tubes and lung

    Directory of Open Access Journals (Sweden)

    G.T. Aydinov

    2017-03-01

    The article gives the results of a multivariate analysis of the structure and the share contributed by potential risk factors for malignant neoplasms of the trachea, bronchial tubes and lung. The authors used specialized databases comprising individual records on oncologic diseases in Taganrog, Rostov region, over 1986-2015 (30,684 registered cases of malignant neoplasms, including 3,480 cases of trachea, bronchial tube and lung cancer). In the analytical research we applied both multivariate statistical techniques (factor analysis and hierarchical cluster correlation analysis) and conventional techniques of epidemiologic analysis, including calculation of the etiologic fraction (EF), as well as an original technique for assessing actual (epidemiologic) risk. Average long-term morbidity from trachea, bronchial tube and lung cancer over 2011-2015 amounts to 46.64 ⁰/₀₀₀₀ (per 100,000). Over the last 15 years a stable decreasing trend has formed, with an average annual growth rate of −1.22%. This localization ranks 3rd in the structure of oncologic morbidity, with a specific weight of 10.02%. We determined the etiologic fraction (EF) for smoking as a priority risk factor causing trachea, bronchial tube and lung cancer; this fraction amounts to 76.19% for people aged 40 and older, and to 81.99% for those aged 60 and older. Applying multivariate statistical techniques (factor analysis and cluster correlation analysis) in this research enabled us to simplify the factor structure; namely, to highlight, interpret, quantitatively estimate the informativeness of, and rank four group (latent) potential risk factors causing lung cancer.

  1. Immediate versus delayed intramedullary nailing for open fractures of the tibial shaft: A multivariate analysis of factors affecting deep infection and fracture healing

    Directory of Open Access Journals (Sweden)

    Yokoyama Kazuhiko

    2008-01-01

    Background: The purpose of this study was to evaluate, by multivariate analysis, contributing factors affecting deep infection and fracture healing of open tibia fractures treated with locked intramedullary nailing (IMN). Materials and Methods: We examined 99 open tibial fractures (98 patients) treated with immediate or delayed locked IMN in static fashion from 1991 to 2002. Multivariate analyses following univariate analyses were performed to determine predictors of deep infection, nonunion, and healing time to union. The following predictive variables of deep infection were selected for analysis: age, sex, Gustilo type, fracture grade by AO type, fracture location, timing or method of IMN, reamed or unreamed nailing, debridement time (≤6 h or >6 h), method of soft-tissue management, skin closure time (≤1 week or >1 week), existence of polytrauma (ISS <18 or ISS ≥18), existence of floating knee injury, and existence of superficial/pin site infection. The predictive variables of nonunion selected for analysis were the same as those for deep infection, with deep infection replacing pin site infection. The predictive variables of union time selected for analysis were the same as those for nonunion, excluding location, debridement time, and existence of floating knee and superficial infection. Results: Six (6.1%; type II Gustilo n=1, type IIIB Gustilo n=5) of the 99 open tibial fractures developed deep infections. Multivariate analysis revealed that timing or method of IMN, debridement time, method of soft-tissue management, and existence of superficial or pin site infection significantly correlated with the occurrence of deep infection (P < 0.0001). In the immediate nailing group alone, the deep infection rate in type IIIB + IIIC was significantly higher than those in type I + II and IIIA (P = 0.016). Nonunion occurred in 17 fractures (20.3%, 17/84). Multivariate analysis revealed that Gustilo type, skin closure time, and

  2. Copula Multivariate analysis of Gross primary production and its hydro-environmental driver; A BIOME-BGC model applied to the Antisana páramos

    Science.gov (United States)

    Minaya, Veronica; Corzo, Gerald; van der Kwast, Johannes; Galarraga, Remigio; Mynett, Arthur

    2014-05-01

    Simulations of carbon cycling are prone to uncertainties from different sources, which in general are related to input data, parameters and the model representation capacities itself. The gross carbon uptake in the cycle is represented by the gross primary production (GPP), which deals with the spatio-temporal variability of the precipitation and the soil moisture dynamics. This variability associated with uncertainty of the parameters can be modelled by multivariate probabilistic distributions. Our study presents a novel methodology that uses multivariate Copulas analysis to assess the GPP. Multi-species and elevations variables are included in a first scenario of the analysis. Hydro-meteorological conditions that might generate a change in the next 50 or more years are included in a second scenario of this analysis. The biogeochemical model BIOME-BGC was applied in the Ecuadorian Andean region in elevations greater than 4000 masl with the presence of typical vegetation of páramo. The change of GPP over time is crucial for climate scenarios of the carbon cycling in this type of ecosystem. The results help to improve our understanding of the ecosystem function and clarify the dynamics and the relationship with the change of climate variables. Keywords: multivariate analysis, Copula, BIOME-BGC, NPP, páramos

  3. An analysis of longitudinal data with nonignorable dropout using the truncated multivariate normal distribution

    NARCIS (Netherlands)

    Jolani, Shahab

    2014-01-01

    For a vector of multivariate normal when some elements, but not necessarily all, are truncated, we derive the moment generating function and obtain expressions for the first two moments involving the multivariate hazard gradient. To show one of many applications of these moments, we then extend the

  4. Study of archaeological coins of different dynasties using libs coupled with multivariate analysis

    Science.gov (United States)

    Awasthi, Shikha; Kumar, Rohit; Rai, G. K.; Rai, A. K.

    2016-04-01

    Laser Induced Breakdown Spectroscopy (LIBS) is an atomic emission spectroscopic technique with the unique capability of serving as an in-situ monitoring tool for detection and quantification of elements present in different artifacts. Archaeological coins collected from the G.R. Sharma Memorial Museum, University of Allahabad, India, have been analyzed using the LIBS technique. These coins were obtained from the excavation of Kausambi, Uttar Pradesh, India. A LIBS system assembled in the laboratory (Nd:YAG laser, 532 nm, 4 ns pulse width FWHM, with an Ocean Optics LIBS 2000+ spectrometer) is employed for spectral acquisition. The spectral lines of Ag, Cu, Ca, Sn, Si, Fe and Mg are identified in the LIBS spectra of different coins. LIBS along with multivariate analysis plays an effective role in classifying the coins and in identifying the contribution of spectral lines in different coins. The discrimination between five coins of archaeological interest has been carried out using Principal Component Analysis (PCA). The results show the potential relevance of the methodology for the elemental identification and classification of artifacts with high accuracy and robustness.

  5. An iterative method for the analysis of Cherenkov rings in the HERA-B RICH

    International Nuclear Information System (INIS)

    Staric, M.; Krizan, P.

    1999-01-01

    A new method is presented for the analysis of data recorded with a Ring Imaging Cherenkov (RICH) counter. The method, an iterative sorting of hits on the photon detector, is particularly useful for events where rings overlap considerably. The algorithm was tested on simulated data for the HERA-B experiment

  6. Quality by design case study: an integrated multivariate approach to drug product and process development.

    Science.gov (United States)

    Huang, Jun; Kaul, Goldi; Cai, Chunsheng; Chatlapalli, Ramarao; Hernandez-Abad, Pedro; Ghosh, Krishnendu; Nagi, Arwinder

    2009-12-01

    To facilitate an in-depth process understanding, and offer opportunities for developing control strategies to ensure product quality, a combination of experimental design, optimization and multivariate techniques was integrated into the process development of a drug product. A process DOE was used to evaluate effects of the design factors on manufacturability and final product CQAs, and establish design space to ensure desired CQAs. Two types of analyses were performed to extract maximal information, DOE effect & response surface analysis and multivariate analysis (PCA and PLS). The DOE effect analysis was used to evaluate the interactions and effects of three design factors (water amount, wet massing time and lubrication time), on response variables (blend flow, compressibility and tablet dissolution). The design space was established by the combined use of DOE, optimization and multivariate analysis to ensure desired CQAs. Multivariate analysis of all variables from the DOE batches was conducted to study relationships between the variables and to evaluate the impact of material attributes/process parameters on manufacturability and final product CQAs. The integrated multivariate approach exemplifies application of QbD principles and tools to drug product and process development.

  7. Categorical speech processing in Broca's area: an fMRI study using multivariate pattern-based analysis.

    Science.gov (United States)

    Lee, Yune-Sang; Turkeltaub, Peter; Granger, Richard; Raizada, Rajeev D S

    2012-03-14

    Although much effort has been directed toward understanding the neural basis of speech processing, the neural processes involved in the categorical perception of speech have been relatively less studied, and many questions remain open. In this functional magnetic resonance imaging (fMRI) study, we probed the cortical regions mediating categorical speech perception using an advanced brain-mapping technique, whole-brain multivariate pattern-based analysis (MVPA). Normal healthy human subjects (native English speakers) were scanned while they listened to 10 consonant-vowel syllables along the /ba/-/da/ continuum. Outside of the scanner, individuals' own category boundaries were measured to divide the fMRI data into /ba/ and /da/ conditions per subject. The whole-brain MVPA revealed that Broca's area and the left pre-supplementary motor area evoked distinct neural activity patterns between the two perceptual categories (/ba/ vs /da/). Broca's area was also found when the same analysis was applied to another dataset (Raizada and Poldrack, 2007), which previously yielded the supramarginal gyrus using a univariate adaptation-fMRI paradigm. The consistent MVPA findings from two independent datasets strongly indicate that Broca's area participates in categorical speech perception, with a possible role of translating speech signals into articulatory codes. The difference in results between univariate and multivariate pattern-based analyses of the same data suggests that processes in different cortical areas along the dorsal speech perception stream are distributed on different spatial scales.

  8. Inferring species richness and turnover by statistical multiresolution texture analysis of satellite imagery.

    Directory of Open Access Journals (Sweden)

    Matteo Convertino

    BACKGROUND: The quantification of species-richness and species-turnover is essential to effective monitoring of ecosystems. Wetland ecosystems are particularly in need of such monitoring due to their sensitivity to rainfall, water management and other external factors that affect hydrology, soil, and species patterns. A key challenge for environmental scientists is determining the linkage between natural and human stressors, and the effect of that linkage at the species level in space and time. We propose pixel intensity based Shannon entropy for estimating species-richness, and introduce a method based on statistical wavelet multiresolution texture analysis to quantitatively assess interseasonal and interannual species turnover. METHODOLOGY/PRINCIPAL FINDINGS: We model satellite images of regions of interest as textures. We define a texture in an image as a spatial domain where the variations in pixel intensity across the image are both stochastic and multiscale. To compare two textures quantitatively, we first obtain a multiresolution wavelet decomposition of each. Either an appropriate probability density function (pdf) model for the coefficients at each subband is selected and its parameters estimated, or a non-parametric approach using histograms is adopted. We choose the former, where the wavelet coefficients of the multiresolution decomposition at each subband are modeled as samples from the generalized Gaussian pdf. We then obtain the joint pdf for the coefficients of all subbands, assuming independence across subbands, an approximation that reduces the computational burden significantly without sacrificing the ability to statistically distinguish textures. We measure the difference between two textures' representative pdfs via the Kullback-Leibler (KL) divergence. Species turnover, or [Formula: see text] diversity, is estimated using both this KL divergence and the difference in Shannon entropy. Additionally, we predict species
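
    The two quantities named in this abstract can be sketched in a few lines. This is a minimal illustration, not the authors' implementation: it assumes zero-mean generalized Gaussian densities of the form p(x) ∝ exp(-(|x|/alpha)^beta), uses the standard closed-form KL divergence for that family, and takes the per-subband (alpha, beta) pairs as already estimated. The function names are hypothetical.

```python
import math
from collections import Counter


def shannon_entropy(pixels):
    """Shannon entropy (bits) of the empirical pixel-intensity distribution."""
    counts = Counter(pixels)
    n = len(pixels)
    return -sum((c / n) * math.log2(c / n) for c in counts.values())


def ggd_kl(alpha1, beta1, alpha2, beta2):
    """KL(p1 || p2) for two zero-mean generalized Gaussians.

    Assumed density: p(x) = beta / (2 * alpha * Gamma(1/beta)) * exp(-(|x|/alpha)**beta),
    with scale alpha > 0 and shape beta > 0.
    """
    log_term = (
        math.log((beta1 * alpha2) / (beta2 * alpha1))
        + math.lgamma(1.0 / beta2)
        - math.lgamma(1.0 / beta1)
    )
    ratio_term = (alpha1 / alpha2) ** beta2 * math.exp(
        math.lgamma((beta2 + 1.0) / beta1) - math.lgamma(1.0 / beta1)
    )
    return log_term + ratio_term - 1.0 / beta1


def texture_kl(subbands_a, subbands_b):
    """Sum per-subband KL terms; valid under the independence-across-subbands assumption."""
    return sum(
        ggd_kl(a1, b1, a2, b2)
        for (a1, b1), (a2, b2) in zip(subbands_a, subbands_b)
    )
```

    With beta = 2 and alpha = sigma * sqrt(2) the generalized Gaussian reduces to a normal density, which gives a convenient sanity check of `ggd_kl` against the familiar Gaussian KL formula.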

  9. MODEL APPLICATION MULTIVARIATE ANALYSIS OF STATISTICAL TECHNIQUES PCA AND HCA ASSESSMENT QUESTIONNAIRE ON CUSTOMER SATISFACTION: CASE STUDY IN A METALLURGICAL COMPANY OF METAL CONTAINERS

    Directory of Open Access Journals (Sweden)

    Cláudio Roberto Rosário

    2012-07-01

    The purpose of this research is to improve the practice of customer satisfaction analysis. The article presents a model for analyzing the answers to a customer satisfaction evaluation in a systematic way with the aid of multivariate statistical techniques, specifically exploratory analysis with PCA (Principal Components Analysis) and HCA (Hierarchical Cluster Analysis). The study evaluated the applicability of the model as a tool to help the company in question identify the value chain perceived by the customer when the customer satisfaction questionnaire is applied. With the assistance of multivariate statistical analysis, similar behavior was observed among customers. The model also allowed the company to review the questions in the questionnaires by analyzing the degree of correlation between them, which was not a company practice before this research.

  10. Multivariate qualitative analysis of banned additives in food safety using surface enhanced Raman scattering spectroscopy

    Science.gov (United States)

    He, Shixuan; Xie, Wanyi; Zhang, Wei; Zhang, Liqun; Wang, Yunxia; Liu, Xiaoling; Liu, Yulong; Du, Chunlei

    2015-02-01

    A novel strategy which combines iteratively cubic spline fitting baseline correction method with discriminant partial least squares qualitative analysis is employed to analyze the surface enhanced Raman scattering (SERS) spectroscopy of banned food additives, such as Sudan I dye and Rhodamine B in food, Malachite green residues in aquaculture fish. Multivariate qualitative analysis methods, using the combination of spectra preprocessing iteratively cubic spline fitting (ICSF) baseline correction with principal component analysis (PCA) and discriminant partial least squares (DPLS) classification respectively, are applied to investigate the effectiveness of SERS spectroscopy for predicting the class assignments of unknown banned food additives. PCA cannot be used to predict the class assignments of unknown samples. However, the DPLS classification can discriminate the class assignment of unknown banned additives using the information of differences in relative intensities. The results demonstrate that SERS spectroscopy combined with ICSF baseline correction method and exploratory analysis methodology DPLS classification can be potentially used for distinguishing the banned food additives in field of food safety.

  11. Surface immobilized antibody orientation determined using ToF-SIMS and multivariate analysis.

    Science.gov (United States)

    Welch, Nicholas G; Madiona, Robert M T; Payten, Thomas B; Easton, Christopher D; Pontes-Braz, Luisa; Brack, Narelle; Scoble, Judith A; Muir, Benjamin W; Pigram, Paul J

    2017-06-01

    Antibody orientation at solid phase interfaces plays a critical role in the sensitive detection of biomolecules during immunoassays. Correctly oriented antibodies with solution-facing antigen binding regions have improved antigen capture as compared to their randomly oriented counterparts. Direct characterization of oriented proteins with surface analysis methods still remains a challenge; however, surface sensitive techniques such as Time-of-Flight Secondary Ion Mass Spectrometry (ToF-SIMS) provide information-rich data that can be used to probe antibody orientation. Diethylene glycol dimethyl ether plasma polymers (DGpp) functionalized with chromium (DGpp+Cr) have improved immunoassay performance that is indicative of preferential antibody orientation. Herein, ToF-SIMS data from proteolytic fragments of anti-EGFR antibody bound to DGpp and DGpp+Cr are used to construct artificial neural network (ANN) and principal component analysis (PCA) models indicative of correctly oriented systems. Whole antibody samples (IgG) tested against each of the models indicated preferential antibody orientation on DGpp+Cr. Cross-referencing the ANN and PCA models yielded 20 mass fragments associated with the F(ab')2 region, representing correct orientation, and 23 mass fragments associated with the Fc region, representing incorrect orientation. Mass fragments were then compared to amino acid fragments and amino acid composition in the F(ab')2 and Fc regions. A ratio of the sum of the ToF-SIMS ion intensities from the F(ab')2 fragments to the Fc fragments demonstrated a 50% increase in intensity for IgG on DGpp+Cr as compared to DGpp. The systematic data analysis methodology employed herein offers a new approach for the investigation of antibody orientation applicable to a range of substrates. Controlled orientation of antibodies at solid phases is critical for maximizing antigen detection in biosensors and immunoassays. Surface-sensitive techniques (such as ToF-SIMS), capable of direct

  12. Analysis, Simulation and Prediction of Multivariate Random Fields with Package RandomFields

    Directory of Open Access Journals (Sweden)

    Martin Schlather

    2015-02-01

    Modeling of and inference on multivariate data that have been measured in space, such as temperature and pressure, are challenging tasks in environmental sciences, physics and materials science. We give an overview of and some background on modeling with cross-covariance models. The R package RandomFields supports simulation, parameter estimation and prediction, in particular for the linear model of coregionalization, the multivariate Matérn models, the delay model, and a spectrum of physically motivated vector valued models. An example on weather data is considered, illustrating the use of RandomFields for parameter estimation and prediction.

  13. Local richness along gradients in the Siskiyou herb flora: R.H. Whittaker revisited

    Science.gov (United States)

    Grace, James B.; Harrison, Susan; Damschen, Ellen Ingman

    2011-01-01

    In his classic study in the Siskiyou Mountains (Oregon, USA), one of the most botanically rich forested regions in North America, R. H. Whittaker (1960) foreshadowed many modern ideas on the multivariate control of local species richness along environmental gradients related to productivity. Using a structural equation model to analyze his data, which were never previously statistically analyzed, we demonstrate that Whittaker was remarkably accurate in concluding that local herb richness in these late-seral forests is explained to a large extent by three major abiotic gradients (soils, topography, and elevation), and in turn, by the effects of these gradients on tree densities and the numbers of individual herbs. However, while Whittaker also clearly appreciated the significance of large-scale evolutionary and biogeographic influences on community composition, he did not fully articulate the more recent concept that variation in the species richness of local communities could be explained in part by variation in the sizes of regional species pools. Our model of his data is among the first to use estimates of regional species pool size to explain variation in local community richness along productivity-related gradients. We find that regional pool size, combined with a modest number of other interacting abiotic and biotic factors, explains most of the variation in local herb richness in the Siskiyou biodiversity hotspot.

  14. The iron bars from the ‘Gresham Ship’: employing multivariate statistics to further slag inclusion analysis of ferrous objects

    DEFF Research Database (Denmark)

    Birch, Thomas; Martinón-Torres, Marcos

    2015-01-01

    An assemblage of post-medieval iron bars was found with the Princes Channel wreck, salvaged from the Thames Estuary in 2003. They were recorded and studied, with a focus on metallography and slag inclusion analysis. The investigation provided an opportunity to explore the use of multivariate...... statistical techniques to analyse slag inclusion data. Cluster analysis supplemented by principal components analysis revealed two groups of iron, probably originating from different smelting systems, which were compared to those observed macroscopically and through metallography. The analyses reveal...

  15. Combined data preprocessing and multivariate statistical analysis characterizes fed-batch culture of mouse hybridoma cells for rational medium design.

    Science.gov (United States)

    Selvarasu, Suresh; Kim, Do Yun; Karimi, Iftekhar A; Lee, Dong-Yup

    2010-10-01

    We present an integrated framework for characterizing fed-batch cultures of mouse hybridoma cells producing monoclonal antibody (mAb). This framework systematically combines data preprocessing, elemental balancing and statistical analysis technique. Initially, specific rates of cell growth, glucose/amino acid consumptions and mAb/metabolite productions were calculated via curve fitting using logistic equations, with subsequent elemental balancing of the preprocessed data indicating the presence of experimental measurement errors. Multivariate statistical analysis was then employed to understand physiological characteristics of the cellular system. The results from principal component analysis (PCA) revealed three major clusters of amino acids with similar trends in their consumption profiles: (i) arginine, threonine and serine, (ii) glycine, tyrosine, phenylalanine, methionine, histidine and asparagine, and (iii) lysine, valine and isoleucine. Further analysis using partial least square (PLS) regression identified key amino acids which were positively or negatively correlated with the cell growth, mAb production and the generation of lactate and ammonia. Based on these results, the optimal concentrations of key amino acids in the feed medium can be inferred, potentially leading to an increase in cell viability and productivity, as well as a decrease in toxic waste production. The study demonstrated how the current methodological framework using multivariate statistical analysis techniques can serve as a potential tool for deriving rational medium design strategies. Copyright © 2010 Elsevier B.V. All rights reserved.
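
    The preprocessing step, deriving specific rates from a fitted logistic curve, can be illustrated as follows. This is a sketch under stated assumptions: the abstract does not give the exact logistic form, so a three-parameter logistic X(t) = K / (1 + exp(-r (t - t0))) is assumed here, and the function and parameter names are hypothetical. For that form, dX/dt = r X (1 - X/K), so the specific growth rate (1/X) dX/dt equals r (1 - X/K).

```python
import math


def logistic(t, k, r, t0):
    """Assumed three-parameter logistic curve for viable cell density X(t)."""
    return k / (1.0 + math.exp(-r * (t - t0)))


def specific_growth_rate(t, k, r, t0):
    """Specific growth rate mu(t) = (1/X) dX/dt = r * (1 - X/K) for the curve above."""
    x = logistic(t, k, r, t0)
    return r * (1.0 - x / k)
```

    At the inflection point t = t0 the curve sits at K/2, so the specific rate is exactly r/2; the rate decays toward zero as the culture approaches the plateau K.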

  16. New multivariable capabilities of the INCA program

    Science.gov (United States)

    Bauer, Frank H.; Downing, John P.; Thorpe, Christopher J.

    1989-01-01

    The INteractive Controls Analysis (INCA) program was developed at NASA's Goddard Space Flight Center to provide a user friendly, efficient environment for the design and analysis of control systems, specifically spacecraft control systems. Since its inception, INCA has found extensive use in the design, development, and analysis of control systems for spacecraft, instruments, robotics, and pointing systems. The INCA program was initially developed as a comprehensive classical design analysis tool for small and large order control systems. The latest version of INCA, expected to be released in February of 1990, was expanded to include the capability to perform multivariable controls analysis and design.

  17. Processing data collected from radiometric experiments by multivariate technique

    International Nuclear Information System (INIS)

    Urbanski, P.; Kowalska, E.; Machaj, B.; Jakowiuk, A.

    2005-01-01

    Multivariate techniques applied to processing data collected from radiometric experiments can provide more efficient extraction of the information contained in the spectra. Several techniques are considered: (i) multivariate calibration using Partial Least Squares Regression and Artificial Neural Networks, (ii) standardization of the spectra, (iii) smoothing of collected spectra, where the autocorrelation function and bootstrap were used for the assessment of the processed data, (iv) image processing using Principal Component Analysis. Application of these techniques is illustrated with examples from industrial applications. (author)

  18. Multivariate image analysis of laser-induced photothermal imaging used for detection of caries tooth

    Science.gov (United States)

    El-Sherif, Ashraf F.; Abdel Aziz, Wessam M.; El-Sharkawy, Yasser H.

    2010-08-01

    Time-resolved photothermal imaging has been investigated to characterize teeth for the purpose of discriminating between normal and carious areas of the hard tissue using a thermal camera. Ultrasonic thermoelastic waves were generated in hard tissue by the absorption of fiber-coupled Q-switched Nd:YAG laser pulses operating at 1064 nm in conjunction with a laser-induced photothermal technique used to detect the thermal radiation waves for diagnosis of human teeth. The concepts behind the use of photothermal techniques for off-line detection of carious tooth features were presented by our group in earlier work. This paper illustrates the application of multivariate image analysis (MIA) techniques to detect the presence of carious teeth. MIA is used to rapidly detect the presence and quantity of common carious tooth features as they are scanned by high-resolution color (RGB) thermal cameras. Multivariate principal component analysis is used to decompose the acquired three-channel tooth images into a two-dimensional principal component (PC) space. Masking score point clusters in the score space and highlighting corresponding pixels in the image space of the two dominant PCs enables isolation of caries defect pixels based on contrast and color information. The technique provides a qualitative result that can be used for early-stage caries detection. The proposed technique can potentially be used on-line or in real time to prescreen for the existence of caries through vision-based systems such as a real-time thermal camera. Experimental results on a large number of extracted teeth, as well as one thermal image panorama of a human volunteer's teeth, are presented.

  19. Effects of Video Games and Online Chat on Mathematics Performance in High School: An Approach of Multivariate Data Analysis

    OpenAIRE

    Lina Wu; Wenyi Lu; Ye Li

    2016-01-01

    Regarding heavy video game players for boys and super online chat lovers for girls as a symbolic phrase in the current adolescent culture, this project of data analysis verifies the displacement effect on deteriorating mathematics performance. To evaluate correlation or regression coefficients between a factor of playing video games or chatting online and mathematics performance compared with other factors, we use multivariate analysis technique and take gender difference into account. We fin...

  20. Analysis of Regularly and Irregularly Sampled Spatial, Multivariate, and Multi-temporal Data

    DEFF Research Database (Denmark)

    Nielsen, Allan Aasbjerg

    1994-01-01

    This thesis describes different methods that are useful in the analysis of multivariate data. Some methods focus on spatial data (sampled regularly or irregularly), others focus on multitemporal data or data from multiple sources. The thesis covers selected and not all aspects of relevant data......-variograms are described. As a new way of setting up a well-balanced kriging support the Delaunay triangulation is suggested. Two case studies show the usefulness of 2-D semivariograms of geochemical data from areas in central Spain (with a geologist's comment) and South Greenland, and kriging/cokriging of an undersampled...... are considered as repetitions. Three case studies show the strength of the methods; one uses SPOT High Resolution Visible (HRV) multispectral (XS) data covering economically important pineapple and coffee plantations near Thika, Kiambu District, Kenya, the other two use Landsat Thematic Mapper (TM) data covering...

  1. Variation of heavy metals in recent sediments from Piratininga Lagoon (Brazil): interpretation of geochemical data with the aid of multivariate analysis

    Science.gov (United States)

    Huang, W.; Campredon, R.; Abrao, J. J.; Bernat, M.; Latouche, C.

    1994-06-01

    In the last decade, the Atlantic coast of south-eastern Brazil has been affected by increasing deforestation and anthropogenic effluents. Sediments in the coastal lagoons have recorded the process of such environmental change. Thirty-seven sediment samples from three cores in Piratininga Lagoon, Rio de Janeiro, were analyzed for their major components and minor element concentrations in order to examine geochemical characteristics and the depositional environment and to investigate the variation of heavy metals of environmental concern. Two multivariate analysis methods, principal component analysis and cluster analysis, were performed on the analytical data set to help visualize the sample clusters and the element associations. On the whole, the sediment samples from each core are similar and the sample clusters corresponding to the three cores are clearly separated, as a result of the different conditions of sedimentation. Some changes in the depositional environment are recognized using the results of multivariate analysis. The enrichment of Pb, Cu, and Zn in the upper parts of cores is in agreement with increasing anthropogenic influx (pollution).

  2. Barnyard millet global core collection evaluation in the submontane Himalayan region of India using multivariate analysis

    Directory of Open Access Journals (Sweden)

    Salej Sood

    2015-12-01

    Barnyard millet (Echinochloa spp.) is one of the most underresearched crops with respect to characterization of genetic resources and genetic enhancement. A total of 95 germplasm lines representing the global collection were evaluated in two rainy seasons at Almora, Uttarakhand, India for qualitative and quantitative traits and the data were subjected to multivariate analysis. High variation was observed for days to maturity, five-ear grain weight, and yield components. The first three principal component axes explained 73% of the total multivariate variation. Three major groups were detected by projection of the accessions on the first two principal components. The separation of accessions was based mainly on trait morphology. Almost all Indian and origin-unknown accessions grouped together to form an Echinochloa frumentacea group. Japanese accessions grouped together except for a few outliers to form an Echinochloa esculenta group. The third group contained accessions from Russia, Japan, Cameroon, and Egypt. They formed a separate group on the scatterplot and represented accessions with lower values for all traits except basal tiller number. The interrelationships between the traits indicated that accessions with tall plants, long and broad leaves, longer inflorescences, and greater numbers of racemes should be given priority as donors or parents in varietal development initiatives. Cluster analysis identified two main clusters based on agro-morphological characters.

  3. Application of the techniques of Multivariate analysis in the characterization of germplasm of Quinua

    International Nuclear Information System (INIS)

    Garcia A, J.M.; Torres de la Cruz, E.

    2004-01-01

    Twenty lines of Chenopodium quinoa were evaluated for characters of agronomic interest. Nine lines outperformed the control, most notably lines 20R1-41, 20R1-10 and 20R2-27, which yielded close to 1.5 ton/ha. The multivariate principal component analysis generated a dendrogram in which, at a Euclidean distance of 0.75, seven groups formed according to their morphological and yield characteristics. Two large groups also formed at a distance of 1.125, separated according to the radiation dose (200 and 250 Gy). (Author)

  4. Dynamics analysis of a boiling water reactor based on multivariable autoregressive modeling

    International Nuclear Information System (INIS)

    Oguma, Ritsuo; Matsubara, Kunihiko

    1980-01-01

    The establishment of a highly reliable mathematical model for the dynamic characteristics of a reactor is indispensable for the achievement of safe operation in reactor plants. The authors have tried to model the dynamic characteristics of a reactor based on the identification technique, taking the JPDR (Japan Power Demonstration Reactor) as the object, as one of the technical studies for diagnosing BWR anomalies, and employed multivariable autoregressive modeling (MAR method) as one of the useful methods for advancing the analysis. In this paper, the outline of the system analysis by MAR modeling is explained, and the identification experiments performed in phase 4 of the power increase test of the JPDR, together with their analysis results, are described. The authors evaluated the results of identification based on reactor noise only, making reference to the results of identification obtained when the system was excited by an artificial irregular disturbance, in order to clarify the extent to which modeling is possible from reactor noise alone. However, some difficulties were encountered. The largest problem concerns separating and identifying the noise sources exciting the variables apart from the dynamic characteristics among the variables. If an effective technique for this problem can be found, the approach by the identification technique based on the probability model might be a powerful tool in the field of reactor noise analysis and the development of diagnostic techniques. (Wakatsuki, Y.)

  5. Multivariate analysis of magnetic resonance imaging of focal hepatic lesions

    International Nuclear Information System (INIS)

    Fujishima, Mamoru; Suemitsu, Ichizou; Sei, Tetsurou; Takeda, Yoshihiro; Hiraki, Yoshio

    1993-01-01

    A total of 124 lesions from 1 to 6 cm in diameter, including 31 cavernous hemangiomas, 32 metastases and 61 hepatocellular carcinomas (HCC), were analyzed to study the usefulness of magnetic resonance imaging (MRI) at 0.5 Tesla to differentiate focal hepatic lesions on the basis of qualitative criteria. Each focal hepatic lesion was assessed for shape, internal architecture and signal intensity relative to normal liver parenchyma. While all cavernous hemangiomas and metastases except one lesion could be detected, the detection rate of HCC was significantly inferior to that of the other two diseases. A tumor capsule and a hyperintense focus on T1-weighted images were demonstrated only in HCC lesions, in strong contrast with the other two diseases; however, metastases with slow-growing characteristics or subacute hematoma may produce similar images. Cavernous hemangiomas appeared markedly hyperintense on T2-weighted images in 23 of 31 lesions, but one metastasis and one HCC had similar images. A multivariate analysis of several MRI findings resulted in the following mean discriminant scores: cavernous hemangioma, -1.2652; metastasis, 0.1830; and HCC, 0.7138. It appeared to be possible to differentiate the three diseases with 84.4 percent accuracy. (author)

  6. Recent trends in application of multivariate curve resolution approaches for improving gas chromatography-mass spectrometry analysis of essential oils.

    Science.gov (United States)

    Jalali-Heravi, Mehdi; Parastar, Hadi

    2011-08-15

    Essential oils (EOs) are valuable natural products that are popular nowadays in the world due to their effects on the health conditions of human beings and their role in preventing and curing diseases. In addition, EOs have a broad range of applications in foods, perfumes, cosmetics and human nutrition. Among different techniques for analysis of EOs, gas chromatography-mass spectrometry (GC-MS) has been the most important one in recent years. However, there are some fundamental problems in GC-MS analysis including baseline drift, spectral background, noise, low S/N (signal to noise) ratio, changes in the peak shapes and co-elution. Multivariate curve resolution (MCR) approaches cope with ongoing challenges and are able to handle these problems. This review focuses on the application of MCR techniques for improving GC-MS analysis of EOs published between January 2000 and December 2010. In the first part, the importance of EOs in human life and their relevance in analytical chemistry is discussed. In the second part, an insight into some basics needed to understand prospects and limitations of the MCR techniques is given. In the third part, the significance of the combination of the MCR approaches with GC-MS analysis of EOs is highlighted. Furthermore, the commonly used algorithms for preprocessing, chemical rank determination, local rank analysis and multivariate resolution in the field of EOs analysis are reviewed. Copyright © 2011 Elsevier B.V. All rights reserved.

  7. Quantitative Evaluation of Hybrid Aspen Xylem and Immunolabeling Patterns Using Image Analysis and Multivariate Statistics

    Directory of Open Access Journals (Sweden)

    David Sandquist

    2015-06-01

    A new method is presented for quantitative evaluation of hybrid aspen genotype xylem morphology and immunolabeling micro-distribution. This method can be used as an aid in assessing differences in genotypes from classic tree breeding studies, as well as genetically engineered plants. The method is based on image analysis, multivariate statistical evaluation of light, and immunofluorescence microscopy images of wood xylem cross sections. The selected immunolabeling antibodies targeted five different epitopes present in aspen xylem cell walls. Twelve down-regulated hybrid aspen genotypes were included in the method development. The 12 knock-down genotypes were selected based on pre-screening by pyrolysis-IR of global chemical content. The multivariate statistical evaluations successfully identified comparative trends for modifications in the down-regulated genotypes compared to the unmodified control, even when no definitive conclusions could be drawn from individual studied variables alone. Of the 12 genotypes analyzed, three genotypes showed significant trends for modifications in both morphology and immunolabeling. Six genotypes showed significant trends for modifications in either morphology or immunocoverage. The remaining three genotypes did not show any significant trends for modification.

  8. A multivariate tobit analysis of highway accident-injury-severity rates.

    Science.gov (United States)

    Anastasopoulos, Panagiotis Ch; Shankar, Venky N; Haddock, John E; Mannering, Fred L

    2012-03-01

    Relatively recent research has illustrated the potential that tobit regression has in studying factors that affect vehicle accident rates (accidents per distance traveled) on specific roadway segments. Tobit regression has been used because accident rates on specific roadway segments are continuous data that are left-censored at zero (they are censored because accidents may not be observed on all roadway segments during the period over which data are collected). This censoring may arise from a number of sources, one of which being the possibility that less severe crashes may be under-reported and thus may be less likely to appear in crash databases. Traditional tobit-regression analyses have dealt with the overall accident rate (all crashes regardless of injury severity), so the issue of censoring by the severity of crashes has not been addressed. However, a tobit-regression approach that considers accident rates by injury-severity level, such as the rate of no-injury, possible injury and injury accidents per distance traveled (as opposed to all accidents regardless of injury-severity), can potentially provide new insights, and address the possibility that censoring may vary by crash-injury severity. Using five-year data from highways in Washington State, this paper estimates a multivariate tobit model of accident-injury-severity rates that addresses the possibility of differential censoring across injury-severity levels, while also accounting for the possible contemporaneous error correlation resulting from commonly shared unobserved characteristics across roadway segments. The empirical results show that the multivariate tobit model outperforms its univariate counterpart, is practically equivalent to the multivariate negative binomial model, and has the potential to provide a fuller understanding of the factors determining accident-injury-severity rates on specific roadway segments. Published by Elsevier Ltd.
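
    The left-censoring at zero described above can be made concrete with the log-likelihood of a tobit model. The following is a minimal sketch of the univariate building block only, assuming a single covariate and a normal error term; the function and parameter names (tobit_loglik, beta0, beta1, sigma) are illustrative and do not come from the paper, and the cross-equation error correlation of the multivariate model is not shown.

```python
import math


def norm_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def norm_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def tobit_loglik(y, x, beta0, beta1, sigma):
    """Log-likelihood of a univariate tobit model left-censored at zero.

    Latent rate: y* = beta0 + beta1 * x + e, with e ~ N(0, sigma^2).
    Observed rate: y = max(0, y*).
    (Hypothetical single-covariate form, for illustration only.)
    """
    total = 0.0
    for yi, xi in zip(y, x):
        mu = beta0 + beta1 * xi
        if yi <= 0.0:
            # Censored segment: contributes P(y* <= 0).
            total += math.log(norm_cdf(-mu / sigma))
        else:
            # Uncensored segment: contributes the normal density of y.
            total += math.log(norm_pdf((yi - mu) / sigma) / sigma)
    return total
```

    Maximizing this function over the parameters would give the univariate tobit estimates; a segment with no recorded accidents contributes a probability mass rather than a density, which is what distinguishes the model from ordinary least squares.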

  9. The Inappropriate Symmetries of Multivariate Statistical Analysis in Geometric Morphometrics.

    Science.gov (United States)

    Bookstein, Fred L

    In today's geometric morphometrics the commonest multivariate statistical procedures, such as principal component analysis or regressions of Procrustes shape coordinates on Centroid Size, embody a tacit roster of symmetries (axioms concerning the homogeneity of the multiple spatial domains or descriptor vectors involved) that do not correspond to actual biological fact. These techniques are hence inappropriate for any application regarding which we have a-priori biological knowledge to the contrary (e.g., genetic/morphogenetic processes common to multiple landmarks, the range of normal in anatomy atlases, the consequences of growth or function for form). But nearly every morphometric investigation is motivated by prior insights of this sort. We therefore need new tools that explicitly incorporate these elements of knowledge, should they be quantitative, to break the symmetries of the classic morphometric approaches. Some of these are already available in our literature but deserve to be known more widely: deflated (spatially adaptive) reference distributions of Procrustes coordinates, Sewall Wright's century-old variant of factor analysis, the geometric algebra of importing explicit biomechanical formulas into Procrustes space. Other methods, not yet fully formulated, might involve parameterized models for strain in idealized forms under load, principled approaches to the separation of functional from Brownian aspects of shape variation over time, and, in general, a better understanding of how the formalism of landmarks interacts with the many other approaches to quantification of anatomy. To more powerfully organize inferences from the high-dimensional measurements that characterize so much of today's organismal biology, tomorrow's toolkit must rely neither on principal component analysis nor on the Procrustes distance formula, but instead on sound prior biological knowledge as expressed in formulas whose coefficients are not all the same. I describe the problems

  10. Multivariate analysis of factors influencing the effect of radiosynovectomy

    International Nuclear Information System (INIS)

    Farahati, J.; Schulz, G.; Koerber, C.; Geling, M.; Schmeider, P.; Reiners, Chr.; Wendler, J.; Kenn, W.; Reidemeister, C.

    2002-01-01

    Objective: In this prospective study, the time to remission after radiosynovectomy (RSV) was analyzed and the influence of age, sex, underlying disease, type of joint, and duration of illness on the success rate of RSV was determined. Methods: A total of 57 patients with rheumatoid arthritis (n = 33) and arthrosis (n = 21) with a total of 130 treated joints (36 knee, 66 small and 28 medium-size joints) were monitored using visual analogue scales (VAS) from one week before RSV up to four to six months after RSV. Patients rated the pain intensity of the treated joint three times daily. The time until remission was determined according to the Kaplan-Meier survivorship function. The influence of the prognostic parameters on the outcome of RSV was determined by multivariate discriminant analysis. Results: After six months, the probability of pain relief of more than 20% amounted to 78% and was significantly dependent on the age of the patient (p = 0.02) and the duration of illness (p = 0.05), but not on sex (p = 0.17), underlying disease (p = 0.23), or type of joint (p = 0.69). Conclusion: Irrespective of sex, type of joint and underlying disease, a measurable pain relief can be achieved with RSV in 78% of the patients with synovitis, with effectiveness decreasing with increasing age and progress of illness. (orig.)
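
    The Kaplan-Meier survivorship function used for the time to remission can be sketched in a few lines. This is a minimal illustration under the assumption that each treated joint contributes a follow-up time and a flag saying whether remission was observed (1) or the joint was censored (0); the function name and the toy data in the usage note are hypothetical and are not taken from the study.

```python
def kaplan_meier(times, events):
    """Return [(t, S(t))] at each distinct time where an event occurred.

    times:  follow-up time per joint
    events: 1 if remission was observed at that time, 0 if censored
    S(t) is the estimated probability of NOT yet being in remission at t,
    so the probability of remission by time t is 1 - S(t).
    """
    data = sorted(zip(times, events))
    n_at_risk = len(data)
    surv = 1.0
    curve = []
    i = 0
    while i < len(data):
        t = data[i][0]
        n_events = 0   # remissions observed exactly at t
        n_leaving = 0  # events plus censorings at t
        while i < len(data) and data[i][0] == t:
            n_events += data[i][1]
            n_leaving += 1
            i += 1
        if n_events > 0:
            # Product-limit step: multiply by the fraction "surviving" t.
            surv *= 1.0 - n_events / n_at_risk
            curve.append((t, surv))
        n_at_risk -= n_leaving
    return curve
```

    For example, kaplan_meier([1, 2, 2, 3, 4], [1, 1, 0, 1, 0]) steps down at times 1, 2 and 3; censored joints leave the risk set without producing a step.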

  11. Application of multivariate analysis to investigate the trace element contamination in top soil of coal mining district in Jorong, South Kalimantan, Indonesia

    Science.gov (United States)

    Pujiwati, Arie; Nakamura, K.; Watanabe, N.; Komai, T.

    2018-02-01

    Multivariate analysis is applied to investigate the geochemistry of several trace elements in top soils and their relation with the contamination source under the influence of coal mines in Jorong, South Kalimantan. The total concentration of Cd, V, Co, Ni, Cr, Zn, As, Pb, Sb, Cu and Ba was determined in 20 soil samples by bulk analysis. Pearson correlation is applied to specify the linear correlation among the elements. Principal Component Analysis (PCA) and Cluster Analysis (CA) were applied to observe the classification of trace elements and contamination sources. The results suggest that contamination loading is contributed by Cr, Cu, Ni, Zn, As, and Pb. The elemental loading mostly affects the non-coal mining area, for instance the area near settlements and agricultural land use. Moreover, the contamination source is classified into the areas that are influenced by the coal mining activity, the agricultural types, and the river mixing zone. Multivariate analysis could elucidate the elemental loading and the contamination sources of trace elements in the vicinity of the coal mine area.

  12. Multivariate Statistical Process Control Charts: An Overview

    OpenAIRE

    Bersimis, Sotiris; Psarakis, Stelios; Panaretos, John

    2006-01-01

    In this paper we discuss the basic procedures for the implementation of multivariate statistical process control via control charting. Furthermore, we review multivariate extensions for all kinds of univariate control charts, such as multivariate Shewhart-type control charts, multivariate CUSUM control charts and multivariate EWMA control charts. In addition, we review unique procedures for the construction of multivariate control charts, based on multivariate statistical techniques such as p...
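
    As an illustration of a multivariate Shewhart-type chart, the sketch below computes Hotelling's T² statistic for a bivariate observation. The abstract is truncated and does not name this statistic, so its choice here is an assumption; the in-control mean and covariance are taken as known, and the function names are hypothetical.

```python
import math


def hotelling_t2(obs, mean, cov):
    """T^2 = (x - mean)^T cov^{-1} (x - mean) for a bivariate observation.

    obs, mean: pairs (x1, x2); cov: 2x2 matrix as ((a, b), (c, d)).
    """
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("singular covariance matrix")
    dx = obs[0] - mean[0]
    dy = obs[1] - mean[1]
    # The inverse of a 2x2 matrix is (1/det) * [[d, -b], [-c, a]].
    return (dx * (d * dx - b * dy) + dy * (-c * dx + a * dy)) / det


def chi2_ucl_2df(alpha):
    """Upper control limit for T^2 with two variables and known parameters.

    With known mean and covariance, T^2 follows a chi-square distribution
    with 2 degrees of freedom, whose upper-alpha quantile is -2 * ln(alpha).
    """
    return -2.0 * math.log(alpha)
```

    A point would be flagged as out of control when hotelling_t2(...) exceeds chi2_ucl_2df(alpha). When the mean and covariance are estimated from data, a different control limit applies, which this sketch does not cover.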

  13. Qualitative and quantitative analysis of complex temperature-programmed desorption data by multivariate curve resolution

    Science.gov (United States)

    Rodríguez-Reyes, Juan Carlos F.; Teplyakov, Andrew V.; Brown, Steven D.

    2010-10-01

    The substantial amount of information carried in temperature-programmed desorption (TPD) experiments is often difficult to mine due to the occurrence of competing reaction pathways that produce compounds with similar mass spectrometric features. Multivariate curve resolution (MCR) is introduced as a tool capable of overcoming this problem by mathematically detecting spectral variations and correlations between several m/z traces, which is later translated into the extraction of the cracking pattern and the desorption profile for each desorbate. Different from the elegant (though complex) methods currently available to analyze TPD data, MCR analysis is applicable even when no information regarding the specific surface reaction/desorption process or the nature of the desorbing species is available. However, when available, any information can be used as constraints that guide the outcome, increasing the accuracy of the resolution. This approach is especially valuable when the compounds desorbing are different from what would be expected based on a chemical intuition, when the cracking pattern of the model test compound is difficult or impossible to obtain (because it could be unstable or very rare), and when knowing major components desorbing from the surface could in more traditional methods actually bias the quantification of minor components. The enhanced level of understanding of thermal processes achieved through MCR analysis is demonstrated by analyzing three phenomena: i) the cryogenic desorption of vinyltrimethylsilane from silicon, an introductory system where the known multilayer and monolayer components are resolved; ii) acrolein hydrogenation on a bimetallic Pt-Ni-Pt catalyst, where a rapid identification of hydrogenated products as well as other desorbing species is achieved, and iii) the thermal reaction of Ti[N(CH3)2]4 on Si(100), where the products of surface decomposition are identified and an estimation of the surface composition after the

  14. Continuous multivariate exponential extension

    International Nuclear Information System (INIS)

    Block, H.W.

    1975-01-01

    The Freund-Weinman multivariate exponential extension is generalized to the case of nonidentically distributed marginal distributions. A fatal shock model is given for the resulting distribution. Results in the bivariate case and the concept of constant multivariate hazard rate lead to a continuous distribution related to the multivariate exponential distribution (MVE) of Marshall and Olkin. This distribution is shown to be a special case of the extended Freund-Weinman distribution. A generalization of the bivariate model of Proschan and Sullo leads to a distribution which contains both the extended Freund-Weinman distribution and the MVE

  15. Evaluation of Extraction Protocols for Simultaneous Polar and Non-Polar Yeast Metabolite Analysis Using Multivariate Projection Methods

    Directory of Open Access Journals (Sweden)

    Nicolas P. Tambellini

    2013-07-01

    Metabolomic and lipidomic approaches aim to measure metabolites or lipids in the cell. Metabolite extraction is a key step in obtaining useful and reliable data for successful metabolite studies. Significant efforts have been made to identify the optimal extraction protocol for various platforms and biological systems, for both polar and non-polar metabolites. Here we report an approach utilizing chemoinformatics for systematic comparison of protocols to extract both from a single sample of the model yeast organism Saccharomyces cerevisiae. Three chloroform/methanol/water partitioning based extraction protocols found in literature were evaluated for their effectiveness at reproducibly extracting both polar and non-polar metabolites. Fatty acid methyl esters and methoxyamine/trimethylsilyl derivatized aqueous compounds were analyzed by gas chromatography mass spectrometry to evaluate non-polar or polar metabolite analysis. The comparative breadth and amount of recovered metabolites was evaluated using multivariate projection methods. This approach identified an optimal protocol consisting of 64 identified polar metabolites from 105 ion hits and 12 fatty acids recovered, and will potentially attenuate the error and variation associated with combining metabolite profiles from different samples for untargeted analysis with both polar and non-polar analytes. It also confirmed the value of using multivariate projection methods to compare established extraction protocols.

  16. Classification of Ilex species based on metabolomic fingerprinting using nuclear magnetic resonance and multivariate data analysis.

    Science.gov (United States)

    Choi, Young Hae; Sertic, Sarah; Kim, Hye Kyong; Wilson, Erica G; Michopoulos, Filippos; Lefeber, Alfons W M; Erkelens, Cornelis; Prat Kricun, Sergio D; Verpoorte, Robert

    2005-02-23

    The metabolomic analysis of 11 Ilex species, I. argentina, I. brasiliensis, I. brevicuspis, I. dumosa var. dumosa, I. dumosa var. guaranina, I. integerrima, I. microdonta, I. paraguariensis var. paraguariensis, I. pseudobuxus, I. taubertiana, and I. theezans, was carried out by NMR spectroscopy and multivariate data analysis. The analysis using principal component analysis and classification of the 1H NMR spectra showed a clear discrimination of those samples based on the metabolites present in the organic and aqueous fractions. The major metabolites that contribute to the discrimination are arbutin, caffeine, phenylpropanoids, and theobromine. Among those metabolites, arbutin, which has not been reported yet as a constituent of Ilex species, was found to be a biomarker for I. argentina, I. brasiliensis, I. brevicuspis, I. integerrima, I. microdonta, I. pseudobuxus, I. taubertiana, and I. theezans. This reliable method based on the determination of a large number of metabolites makes the chemotaxonomical analysis of Ilex species possible.

  17. Association of Cysteine-Rich Secretory Protein 3 and β-Microseminoprotein with Outcome after Radical Prostatectomy

    Science.gov (United States)

    Bjartell, Anders S.; Al-Ahmadie, Hikmat; Serio, Angel M.; Eastham, James A.; Eggener, Scott E.; Fine, Samson W.; Udby, Lene; Gerald, William L.; Vickers, Andrew J.; Lilja, Hans; Reuter, Victor E.; Scardino, Peter T.

    2009-01-01

    Purpose: It has been suggested that cysteine-rich secretory protein 3 (CRISP-3) and β-microseminoprotein (MSP) are associated with outcome in prostate cancer. We investigated whether these markers are related to biochemical recurrence and whether addition of the markers improves prediction of recurring disease. Experimental Design: Tissue microarrays of radical prostatectomy specimens were analyzed for CRISP-3 and MSP by immunohistochemistry. Associations between marker positivity and postprostatectomy biochemical recurrence [prostate-specific antigen (PSA) >0.2 ng/mL with a confirmatory level] were evaluated by univariate and multivariable Cox proportional hazards regression. Multivariable analyses controlled for preoperative PSA and pathologic stage and grade. Results: Among 945 patients, 224 had recurrence. Median follow-up for survivors was 6.0 years. Patients positive for CRISP-3 had smaller recurrence-free probabilities, whereas MSP-positive patients had larger recurrence-free probabilities. On univariate analysis, the hazard ratio for patients positive versus negative for CRISP-3 was 1.53 (P = 0.010) and for MSP was 0.63 (P = 0.004). On multivariable analysis, both CRISP-3 (P = 0.007) and MSP (P = 0.002) were associated with recurrence. The hazard ratio among CRISP-3-positive/MSP-negative patients compared with CRISP-3-negative/MSP-positive patients was 2.38. Adding CRISP-3 to a base model that included PSA and pathologic stage and grade did not enhance the prediction of recurrence, but adding MSP increased the concordance index minimally from 0.778 to 0.781. Conclusion: We report evidence that CRISP-3 and MSP are independent predictors of recurrence after radical prostatectomy for localized prostate cancer. However, addition of the markers does not importantly improve the performance of existing predictive models. Further research should aim to elucidate the functions of CRISP-3 and MSP in prostate cancer cells. PMID:17634540
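
    The concordance index quoted above (0.778 versus 0.781) measures how well a model's risk scores rank patients by time to recurrence. The sketch below shows one common form, Harrell's C, under the assumption that each patient has a follow-up time, an event flag (1 for recurrence, 0 for censored) and a model risk score; it is an illustration, not the authors' implementation, and the names are hypothetical.

```python
def concordance_index(times, events, risk_scores):
    """Harrell's C for right-censored data.

    A pair (i, j) is comparable when patient i had an observed event
    strictly before patient j's follow-up time. The pair is concordant
    when patient i also has the higher risk score; ties in the score
    count as half.
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            if events[i] == 1 and times[i] < times[j]:
                comparable += 1
                if risk_scores[i] > risk_scores[j]:
                    concordant += 1.0
                elif risk_scores[i] == risk_scores[j]:
                    concordant += 0.5
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

    A value of 0.5 corresponds to random ranking and 1.0 to perfect ranking, which is why a change from 0.778 to 0.781 is described as minimal.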

  18. The Removal of EOG Artifacts From EEG Signals Using Independent Component Analysis and Multivariate Empirical Mode Decomposition.

    Science.gov (United States)

    Wang, Gang; Teng, Chaolin; Li, Kuo; Zhang, Zhonglin; Yan, Xiangguo

    2016-09-01

    The recorded electroencephalography (EEG) signals are usually contaminated by electrooculography (EOG) artifacts. In this paper, by using independent component analysis (ICA) and multivariate empirical mode decomposition (MEMD), the ICA-based MEMD method was proposed to remove EOG artifacts (EOAs) from multichannel EEG signals. First, the EEG signals were decomposed by the MEMD into multiple multivariate intrinsic mode functions (MIMFs). The EOG-related components were then extracted by reconstructing the MIMFs corresponding to EOAs. After performing the ICA of EOG-related signals, the EOG-linked independent components were distinguished and rejected. Finally, the clean EEG signals were reconstructed by implementing the inverse transform of ICA and MEMD. The results of simulated and real data suggested that the proposed method could successfully eliminate EOAs from EEG signals and preserve useful EEG information with little loss. By comparing with other existing techniques, the proposed method achieved much improvement in terms of the increase of signal-to-noise and the decrease of mean square error after removing EOAs.

  19. Volumetric analysis of coronary plaque characterization in patients with metabolic syndrome using 64-slice multi-detector computed tomography

    International Nuclear Information System (INIS)

    Arai, Kosuke; Ishii, Hideki; Amano, Tetasuya

    2010-01-01

    Metabolic syndrome (MetS) is associated with adverse cardiovascular events and mortality, where acute coronary syndrome significantly impacts on mortality and morbidity. In contrast, evidence has accumulated that the lipid-rich plaque might play a critical role in acute coronary syndrome. The study population consisted of 94 patients with suspected angina pectoris who underwent multi-detector computed tomography (MDCT). Of those, we identified 41 with MetS. In MDCT analysis, low-density plaque volume (LDPV) (42±28 vs 24±18 mm³, P=0.0003), moderate-density plaque volume (105±41 vs 82±33 mm³, P=0.003), total plaque volume (164±70 vs 118±59 mm³, P=0.0008) and %LDPV (24.2±10.0 vs 18.3±7.1%, P=0.01) were significantly increased in the MetS group compared to the non-MetS group. Multivariate linear regression analysis after adjusting for confounding variables revealed that MetS was significantly correlated with an increase in %LDPV (β=0.48, P=0.0001). Multivariate logistic regression analysis for lipid-rich plaque after adjusting for confounding variables indicated that MetS was significantly associated with lipid-rich plaque (odds ratio: 5.99, 95% confidence intervals: 1.94-18.6, P=0.002). Patients with MetS were strongly related to having a lipid-rich composition in their coronary plaque, as detected by MDCT. (author)

  20. Multivariate analysis of flow cytometric data using decision trees

    Directory of Open Access Journals (Sweden)

    Svenja Simon

    2012-04-01

    Characterization of the response of the host immune system is important in understanding the bidirectional interactions between the host and microbial pathogens. For research on the host site, flow cytometry has become one of the major tools in immunology. Advances in technology and reagents now allow the simultaneous assessment of multiple markers on a single cell level generating multidimensional data sets that require multivariate statistical analysis. We explored the explanatory power of the supervised machine learning method called 'induction of decision trees' in flow cytometric data. In order to examine whether the production of a certain cytokine is dependent on other cytokines, datasets from intracellular staining for six cytokines with complex patterns of co-expression were analyzed by induction of decision trees. After weighting the data according to their class probabilities, we created a total of 13,392 different decision trees for each given cytokine with different parameter settings. For a more realistic estimation of the decision trees' quality, we used stratified 5-fold cross-validation and chose the 'best' tree according to a combination of different quality criteria. While some of the decision trees reflected previously known co-expression patterns, we found that the expression of some cytokines was not only dependent on the co-expression of others per se, but was also dependent on the intensity of expression. Thus, for the first time we successfully used induction of decision trees for the analysis of high dimensional flow cytometric data and demonstrated the feasibility of this method to reveal structural patterns in such data sets.
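
Stratified 5-fold cross-validation, as mentioned in this abstract, keeps the class proportions roughly equal in every fold. The following is a minimal sketch of the fold assignment only. The function name and the toy labels are hypothetical, and the abstract does not say how the authors built their folds.

```python
from collections import defaultdict


def stratified_folds(labels, k=5):
    """Assign sample indices to k folds, class by class (illustrative sketch)."""
    by_class = defaultdict(list)
    for index, label in enumerate(labels):
        by_class[label].append(index)

    folds = [[] for _ in range(k)]
    for indices in by_class.values():
        # Deal each class round-robin so every fold gets a similar share of it.
        for position, index in enumerate(indices):
            folds[position % k].append(index)
    return folds


# Hypothetical labels: 10 cytokine-positive and 5 cytokine-negative cells.
labels = ["pos"] * 10 + ["neg"] * 5
folds = stratified_folds(labels, k=5)
```

Each fold then serves once as the held-out set while a tree is trained on the other four. In practice the indices within each class would usually be shuffled first, which this sketch omits to stay deterministic.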

  1. Comparison of multivariate and univariate statistical process control and monitoring methods

    International Nuclear Information System (INIS)

    Leger, R.P.; Garland, WM.J.; Macgregor, J.F.

    1996-01-01

    Work in recent years has led to the development of multivariate process monitoring schemes which use Principal Component Analysis (PCA). This research compares the performance of a univariate scheme and a multivariate PCA scheme used for monitoring a simple process with 11 measured variables. The multivariate PCA scheme was able to adequately represent the process using two principal components. This resulted in a PCA monitoring scheme which used two charts as opposed to 11 charts for the univariate scheme and therefore had distinct advantages in terms of data representation, presentation, and fault diagnosis capabilities. (author)

  2. CoSMoMVPA: multi-modal multivariate pattern analysis of neuroimaging data in Matlab / GNU Octave

    Directory of Open Access Journals (Sweden)

    Nikolaas N Oosterhof

    2016-07-01

    Recent years have seen an increase in the popularity of multivariate pattern (MVP) analysis of functional magnetic resonance imaging (fMRI) data, and, to a much lesser extent, magneto- and electro-encephalography (M/EEG) data. We present CoSMoMVPA, a lightweight MVPA (MVP analysis) toolbox implemented in the intersection of the Matlab and GNU Octave languages, that treats both fMRI and M/EEG data as first-class citizens. CoSMoMVPA supports all state-of-the-art MVP analysis techniques, including searchlight analyses, classification, correlations, representational similarity analysis, and the time generalization method. These can be used to address both data-driven and hypothesis-driven questions about neural organization and representations, both within and across: space, time, frequency bands, neuroimaging modalities, individuals, and species. It uses a uniform data representation of fMRI data in the volume or on the surface, and of M/EEG data at the sensor and source level. Through various external toolboxes, it directly supports reading and writing a variety of fMRI and M/EEG neuroimaging formats, and, where applicable, can convert between them. As a result, it can be integrated readily in existing pipelines and used with existing preprocessed datasets. CoSMoMVPA overloads the traditional volumetric searchlight concept to support neighborhoods for M/EEG and surface-based fMRI data, which supports localization of multivariate effects of interest across space, time, and frequency dimensions. CoSMoMVPA also provides a generalized approach to multiple comparison correction across these dimensions using Threshold-Free Cluster Enhancement with state-of-the-art clustering and permutation techniques. CoSMoMVPA is highly modular and uses abstractions to provide a uniform interface for a variety of MVP measures. Typical analyses require a few lines of code, making it accessible to beginner users. At the same time, expert programmers can easily extend its functionality.

  3. A guide to statistical analysis in microbial ecology: a community-focused, living review of multivariate data analyses

    OpenAIRE

    Buttigieg, Pier Luigi; Ramette, Alban Nicolas

    2014-01-01

    The application of multivariate statistical analyses has become a consistent feature in microbial ecology. However, many microbial ecologists are still in the process of developing a deep understanding of these methods and appreciating their limitations. As a consequence, staying abreast of progress and debate in this arena poses an additional challenge to many microbial ecologists. To address these issues, we present the GUide to STatistical Analysis in Microbial Ecology (GUSTA ME): a dynami...

  4. Multivariate statistical evaluation of trace elements in groundwater in a coastal area in Shenzhen, China

    International Nuclear Information System (INIS)

    Chen Kouping; Jiao, Jiu J.; Huang Jianmin; Huang Runqiu

    2007-01-01

    Multivariate statistical techniques are efficient ways to display complex relationships among many objects. An attempt was made to study the data of trace elements in groundwater using multivariate statistical techniques such as principal component analysis (PCA), Q-mode factor analysis and cluster analysis. The original matrix consisted of 17 trace elements estimated from 55 groundwater samples collected in 27 wells located in a coastal area in Shenzhen, China. PCA results show that trace elements of V, Cr, As, Mo, W, and U with greatest positive loadings typically occur as soluble oxyanions in oxidizing waters, while Mn and Co with greatest negative loadings are generally more soluble within oxygen depleted groundwater. Cluster analyses demonstrate that most groundwater samples collected from the same well in the study area during summer and winter still fall into the same group. This study also demonstrates the usefulness of multivariate statistical analysis in hydrochemical studies. - Multivariate statistical analysis was used to investigate relationships among trace elements and factors controlling trace element distribution in groundwater

  5. Univariate and multivariate skewness and kurtosis for measuring nonnormality: Prevalence, influence and estimation.

    Science.gov (United States)

    Cain, Meghan K; Zhang, Zhiyong; Yuan, Ke-Hai

    2017-10-01

    Nonnormality of univariate data has been extensively examined previously (Blanca et al., Methodology: European Journal of Research Methods for the Behavioral and Social Sciences, 9(2), 78-84, 2013; Micceri, Psychological Bulletin, 105(1), 156, 1989). However, less is known of the potential nonnormality of multivariate data although multivariate analysis is commonly used in psychological and educational research. Using univariate and multivariate skewness and kurtosis as measures of nonnormality, this study examined 1,567 univariate distributions and 254 multivariate distributions collected from authors of articles published in Psychological Science and the American Education Research Journal. We found that 74 % of univariate distributions and 68 % of multivariate distributions deviated from normal distributions. In a simulation study using typical values of skewness and kurtosis that we collected, we found that the resulting type I error rates were 17 % in a t-test and 30 % in a factor analysis under some conditions. Hence, we argue that it is time to routinely report skewness and kurtosis along with other summary statistics such as means and variances. To facilitate future report of skewness and kurtosis, we provide a tutorial on how to compute univariate and multivariate skewness and kurtosis by SAS, SPSS, R and a newly developed Web application.
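
The abstract points to tutorials in SAS, SPSS, R and a Web application. As a rough companion, the sketch below computes moment-based univariate skewness and excess kurtosis and Mardia's multivariate skewness and kurtosis in plain Python. The choice of estimators (divisor n throughout) and all function names are assumptions, since the abstract does not state which formulas the authors use, and statistical packages often apply small-sample corrections that give slightly different values.

```python
def central_moment(values, order):
    n = len(values)
    mean = sum(values) / n
    return sum((v - mean) ** order for v in values) / n


def skewness(values):
    """Moment-based skewness g1 = m3 / m2^(3/2)."""
    return central_moment(values, 3) / central_moment(values, 2) ** 1.5


def excess_kurtosis(values):
    """Moment-based excess kurtosis g2 = m4 / m2^2 - 3."""
    return central_moment(values, 4) / central_moment(values, 2) ** 2 - 3.0


def invert(matrix):
    """Gauss-Jordan inverse of a small square matrix."""
    n = len(matrix)
    aug = [list(row) + [1.0 if i == j else 0.0 for j in range(n)]
           for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(aug[r][col]))
        if abs(aug[pivot][col]) < 1e-12:
            raise ValueError("singular covariance matrix")
        aug[col], aug[pivot] = aug[pivot], aug[col]
        pivot_value = aug[col][col]
        aug[col] = [v / pivot_value for v in aug[col]]
        for r in range(n):
            if r != col:
                factor = aug[r][col]
                aug[r] = [a - factor * b for a, b in zip(aug[r], aug[col])]
    return [row[n:] for row in aug]


def mardia(rows):
    """Mardia's multivariate skewness and kurtosis (covariance divisor n)."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[k] for r in rows) / n for k in range(p)]
    centered = [[r[k] - means[k] for k in range(p)] for r in rows]
    cov = [[sum(c[a] * c[b] for c in centered) / n for b in range(p)]
           for a in range(p)]
    inv = invert(cov)

    def quad(u, v):
        return sum(u[a] * inv[a][b] * v[b] for a in range(p) for b in range(p))

    skew = sum(quad(u, v) ** 3 for u in centered for v in centered) / n ** 2
    kurt = sum(quad(u, u) ** 2 for u in centered) / n
    return skew, kurt
```

Under multivariate normality with p variables, Mardia's skewness is expected to be near 0 and the kurtosis near p(p + 2), so large departures from those values suggest nonnormality.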

  6. Multivariate Analysis for Animal Selection in Experimental Research

    Directory of Open Access Journals (Sweden)

    Renan Mercuri Pinto

    2015-02-01

    Background: Several researchers seek methods for the selection of homogeneous groups of animals in experimental studies, a fact justified because homogeneity is an indispensable prerequisite for randomization of treatments. The lack of robust methods that comply with statistical and biological principles is the reason why researchers use empirical or subjective methods, influencing their results. Objective: To develop a multivariate statistical model for the selection of a homogeneous group of animals for experimental research and to elaborate a computational package to use it. Methods: The set of echocardiographic data of 115 male Wistar rats with supravalvular aortic stenosis (AoS) was used as an example of model development. Initially, the data were standardized, and became dimensionless. Then, the variance matrix of the set was submitted to principal components analysis (PCA), aiming at reducing the parametric space and at retaining the relevant variability. That technique established a new Cartesian system into which the animals were allocated, and finally the confidence region (ellipsoid) was built for the profile of the animals’ homogeneous responses. The animals located inside the ellipsoid were considered as belonging to the homogeneous batch; those outside the ellipsoid were considered spurious. Results: The PCA established eight descriptive axes that together represented 88.71% of the variance of the data set. The allocation of the animals in the new system and the construction of the confidence region revealed six spurious animals as compared to the homogeneous batch of 109 animals. Conclusion: The biometric criterion presented proved to be effective, because it considers the animal as a whole, analyzing jointly all parameters measured, in addition to having a small discard rate.
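
The steps described in this abstract (standardize, rotate to principal axes, keep the animals inside a confidence ellipsoid) can be sketched for the special case of two measured variables, where the principal axes of the correlation matrix have a closed form. This is a simplified illustration and not the authors' package: the study retained eight axes, whereas the two-variable restriction, the chi-square cutoff for the ellipse and every function name below are assumptions.

```python
import math


def standardize(values):
    """Center and scale to unit sample standard deviation (dimensionless)."""
    n = len(values)
    mean = sum(values) / n
    sd = math.sqrt(sum((v - mean) ** 2 for v in values) / (n - 1))
    return [(v - mean) / sd for v in values]


def ellipse_distances(x, y):
    """Squared distance of each animal from the center, in principal axes."""
    z1, z2 = standardize(x), standardize(y)
    n = len(z1)
    r = sum(a * b for a, b in zip(z1, z2)) / (n - 1)  # sample correlation
    if abs(r) >= 1.0 - 1e-12:
        raise ValueError("variables are perfectly correlated")
    distances = []
    for a, b in zip(z1, z2):
        # For two standardized variables the principal axes are (1, 1) and
        # (1, -1), scaled by 1/sqrt(2), with variances 1 + r and 1 - r.
        pc1 = (a + b) / math.sqrt(2.0)
        pc2 = (a - b) / math.sqrt(2.0)
        distances.append(pc1 ** 2 / (1.0 + r) + pc2 ** 2 / (1.0 - r))
    return distances


def homogeneous_mask(x, y, coverage=0.95):
    """True for animals inside the assumed chi-square confidence ellipse."""
    # With 2 degrees of freedom the chi-square quantile is -2 ln(1 - coverage).
    threshold = -2.0 * math.log(1.0 - coverage)
    return [d <= threshold for d in ellipse_distances(x, y)]
```

Animals flagged False would correspond to the "spurious" ones in the abstract. With more than two variables the same idea needs a numerical eigendecomposition and a chi-square quantile with more degrees of freedom.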

  7. Opportunities for multivariate analysis of open spatial datasets to characterize urban flooding risks

    Science.gov (United States)

    Gaitan, S.; ten Veldhuis, J. A. E.

    2015-06-01

    Cities worldwide are challenged by increasing urban flood risks. Precise and realistic measures are required to reduce flooding impacts. However, currently implemented sewer and topographic models do not provide realistic predictions of local flooding occurrence during heavy rain events. Assessing other factors such as spatially distributed rainfall, socioeconomic characteristics, and social sensing, may help to explain probability and impacts of urban flooding. Several spatial datasets have been recently made available in the Netherlands, including rainfall-related incident reports made by citizens, spatially distributed rain depths, semidistributed socioeconomic information, and building age. The potential of these data to explain the occurrence of rainfall-related incidents has not yet been inspected. Multivariate analysis tools for describing communities and environmental patterns have been previously developed and used in the field of ecology. The objective of this paper is to outline opportunities for these tools to explore urban flooding risk patterns in the mentioned datasets. To that end, a cluster analysis is performed. Results indicate that incidence of rainfall-related impacts is higher in areas characterized by older infrastructure and higher population density.

  8. Research Update: Spatially resolved mapping of electronic structure on atomic level by multivariate statistical analysis

    International Nuclear Information System (INIS)

    Belianinov, Alex; Ganesh, Panchapakesan; Lin, Wenzhi; Jesse, Stephen; Pan, Minghu; Kalinin, Sergei V.; Sales, Brian C.; Sefat, Athena S.

    2014-01-01

    Atomic level spatial variability of electronic structure in Fe-based superconductor FeTe₀.₅₅Se₀.₄₅ (Tc = 15 K) is explored using current-imaging tunneling-spectroscopy. Multivariate statistical analysis of the data differentiates regions of dissimilar electronic behavior that can be identified with the segregation of chalcogen atoms, as well as boundaries between terminations and near neighbor interactions. Subsequent clustering analysis allows identification of the spatial localization of these dissimilar regions. Similar statistical analysis of modeled calculated density of states of chemically inhomogeneous FeTe₁₋ₓSeₓ structures further confirms that the two types of chalcogens, i.e., Te and Se, can be identified by their electronic signature and differentiated by their local chemical environment. This approach allows detailed chemical discrimination of the scanning tunneling microscopy data including separation of atomic identities, proximity, and local configuration effects and can be universally applicable to chemically and electronically inhomogeneous surfaces.

  9. Rapid thyroid dysfunction screening based on serum surface-enhanced Raman scattering and multivariate statistical analysis

    Science.gov (United States)

    Tian, Dayong; Lü, Guodong; Zhai, Zhengang; Du, Guoli; Mo, Jiaqing; Lü, Xiaoyi

    2018-01-01

    In this paper, serum surface-enhanced Raman scattering and multivariate statistical analysis are used to investigate a rapid screening technique for thyroid function diseases. At present, the detection of thyroid function has become increasingly important, and it is urgently necessary to develop a rapid and portable method for the detection of thyroid function. Our experimental results show that, by using the Silmeco-based enhanced Raman signal, the signal strength greatly increases and the characteristic peak appears obviously. It is also observed that the Raman spectra of normal and anomalous thyroid function human serum are significantly different. Principal component analysis (PCA) combined with linear discriminant analysis (LDA) was used to diagnose thyroid dysfunction, and the diagnostic accuracy was 87.4%. The use of serum surface-enhanced Raman scattering technology combined with PCA-LDA shows good diagnostic performance for the rapid detection of thyroid function. By means of Raman technology, it is expected that a portable device for the rapid detection of thyroid function will be developed.

  10. Evaluation of herbicides photodegradation by photo-Fenton process using multivariate analysis

    Energy Technology Data Exchange (ETDEWEB)

    Paterlini, W.C.; Nogueira, R.F.P. [Inst. of Chemistry, Sao Paulo State Univ., R. Prof. Francisco Degni s/n, Araraquara, SP (Brazil)

    2003-07-01

    The photodegradation of herbicides in aqueous medium by photo-Fenton process using ferrioxalate complex (FeOx) as a source of Fe²⁺ was evaluated under blacklight irradiation. The commercial products of the herbicides tebuthiuron, 2,4-D and diuron were used. Multivariate analysis was used to evaluate the role of two variables in the photodegradation process, FeOx and hydrogen peroxide concentrations, and to define the concentration ranges that result in the most efficient photodegradation of the herbicides. The photodegradation of the herbicides was followed by monitoring the decrease of the original compounds concentration by HPLC, by the determination of remaining total organic carbon content (TOC), and by the chloride ion release. Under optimised conditions, 20 minutes irradiation was enough to remove 92.7% of TOC for 2,4-D and 89.5% for diuron. Complete dechlorination of these compounds was achieved after 10 minutes of irradiation. It was observed that the initial concentration of these compounds and tebuthiuron was reduced to less than 15% after only 1 minute of irradiation. (orig.)

  11. Multivariate approaches for stability control of the olive oil reference materials for sensory analysis - part II: applications.

    Science.gov (United States)

    Valverde-Som, Lucia; Ruiz-Samblás, Cristina; Rodríguez-García, Francisco P; Cuadros-Rodríguez, Luis

    2018-02-09

    The organoleptic quality of virgin olive oil depends on positive and negative sensory attributes. These attributes are related to volatile organic compounds and phenolic compounds that represent the aroma and taste (flavour) of the virgin olive oil. The flavour is the characteristic that can be measured by a taster panel. However, as for any analytical measuring device, the tasters, individually, and the panel, as a whole, should be harmonized and validated and proper olive oil standards are needed. In the present study, multivariate approaches are put into practice together with the rules for building a multivariate control chart from chromatographic volatile fingerprinting and chemometrics. Fingerprinting techniques provide analytical information without identifying and quantifying the analytes. This methodology is used to monitor the stability of sensory reference materials. Similarity indices were calculated to build a multivariate control chart for two olive oil certified reference materials, used as examples to monitor their stability. This methodology with chromatographic data could be applied in parallel with the 'panel test' sensory method to reduce the work of sensory analysis. © 2018 Society of Chemical Industry.

  12. Multivariate qualitative analysis of banned additives in food safety using surface enhanced Raman scattering spectroscopy.

    Science.gov (United States)

    He, Shixuan; Xie, Wanyi; Zhang, Wei; Zhang, Liqun; Wang, Yunxia; Liu, Xiaoling; Liu, Yulong; Du, Chunlei

    2015-02-25

    A novel strategy which combines iteratively cubic spline fitting baseline correction method with discriminant partial least squares qualitative analysis is employed to analyze the surface enhanced Raman scattering (SERS) spectroscopy of banned food additives, such as Sudan I dye and Rhodamine B in food, Malachite green residues in aquaculture fish. Multivariate qualitative analysis methods, using the combination of spectra preprocessing iteratively cubic spline fitting (ICSF) baseline correction with principal component analysis (PCA) and discriminant partial least squares (DPLS) classification respectively, are applied to investigate the effectiveness of SERS spectroscopy for predicting the class assignments of unknown banned food additives. PCA cannot be used to predict the class assignments of unknown samples. However, the DPLS classification can discriminate the class assignment of unknown banned additives using the information of differences in relative intensities. The results demonstrate that SERS spectroscopy combined with ICSF baseline correction method and exploratory analysis methodology DPLS classification can be potentially used for distinguishing the banned food additives in field of food safety. Copyright © 2014 Elsevier B.V. All rights reserved.

  13. Climate patterns as predictors of amphibians species richness and indicators of potential stress

    Science.gov (United States)

    Battaglin, W.; Hay, L.; McCabe, G.; Nanjappa, P.; Gallant, Alisa L.

    2005-01-01

    Amphibians occupy a range of habitats throughout the world, but species richness is greatest in regions with moist, warm climates. We modeled the statistical relations of anuran and urodele species richness with mean annual climate for the conterminous United States, and compared the strength of these relations at national and regional levels. Model variables were calculated for county and subcounty mapping units, and included 40-year (1960-1999) annual mean and mean annual climate statistics, mapping unit average elevation, mapping unit land area, and estimates of anuran and urodele species richness. Climate data were derived from more than 7,500 first-order and cooperative meteorological stations and were interpolated to the mapping units using multiple linear regression models. Anuran and urodele species richness were calculated from the United States Geological Survey's Amphibian Research and Monitoring Initiative (ARMI) National Atlas for Amphibian Distributions. The national multivariate linear regression (MLR) model of anuran species richness had an adjusted coefficient of determination (R²) value of 0.64 and the national MLR model for urodele species richness had an R² value of 0.45. Stratifying the United States by coarse-resolution ecological regions provided models for anurans that ranged in R² values from 0.15 to 0.78. Regional models for urodeles had R² values ranging from 0.27 to 0.74. In general, regional models for anurans were more strongly influenced by temperature variables, whereas precipitation variables had a larger influence on urodele models.

  14. Origin Discrimination of Osmanthus fragrans var. thunbergii Flowers using GC-MS and UPLC-PDA Combined with Multivariable Analysis Methods.

    Science.gov (United States)

    Zhou, Fei; Zhao, Yajing; Peng, Jiyu; Jiang, Yirong; Li, Maiquan; Jiang, Yuan; Lu, Baiyi

    2017-07-01

    Osmanthus fragrans flowers are used as folk medicine and additives for teas, beverages and foods. The metabolites of O. fragrans flowers from different geographical origins were inconsistent to some extent. Chromatography and mass spectrometry combined with multivariable analysis methods provides an approach for discriminating the origin of O. fragrans flowers. To discriminate the Osmanthus fragrans var. thunbergii flowers from different origins with the identified metabolites. GC-MS and UPLC-PDA were conducted to analyse the metabolites in O. fragrans var. thunbergii flowers (in total 150 samples). Principal component analysis (PCA), soft independent modelling of class analogy analysis (SIMCA) and random forest (RF) analysis were applied to group the GC-MS and UPLC-PDA data. GC-MS identified 32 compounds common to all samples while UPLC-PDA/QTOF-MS identified 16 common compounds. PCA of the UPLC-PDA data generated a better clustering than PCA of the GC-MS data. Ten metabolites (six from GC-MS and four from UPLC-PDA) were selected as effective compounds for discrimination by PCA loadings. SIMCA and RF analysis were used to build classification models, and the RF model, based on the four effective compounds (caffeic acid derivative, acteoside, ligustroside and compound 15), yielded better results with the classification rate of 100% in the calibration set and 97.8% in the prediction set. GC-MS and UPLC-PDA combined with multivariable analysis methods can discriminate the origin of Osmanthus fragrans var. thunbergii flowers. Copyright © 2017 John Wiley & Sons, Ltd.

  15. Multivariate temporal pattern analysis applied to the study of rat behavior in the elevated plus maze: methodological and conceptual highlights.

    Science.gov (United States)

    Casarrubea, M; Magnusson, M S; Roy, V; Arabo, A; Sorbera, F; Santangelo, A; Faulisi, F; Crescimanno, G

    2014-08-30

    The aim of this article is to illustrate the application of a multivariate approach known as t-pattern analysis in the study of rat behavior in the elevated plus maze. By means of this multivariate approach, significant relationships among behavioral events in the course of time can be described. Both quantitative and t-pattern analyses were utilized to analyze data obtained from fifteen male Wistar rats following a trial 1-trial 2 protocol. In trial 2, in comparison with the initial exposure, mean occurrences of behavioral elements performed in protected zones of the maze showed a significant increase counterbalanced by a significant decrease of mean occurrences of behavioral elements in unprotected zones. Multivariate t-pattern analysis, in trial 1, revealed the presence of 134 t-patterns of different composition. In trial 2, the temporal structure of behavior became simpler, with only 32 different t-patterns present. Behavioral strings and stripes (i.e. graphical representation of each t-pattern onset) of all t-patterns were presented both for trial 1 and trial 2 as well. Finally, percent distributions in the three zones of the maze show a clear-cut increase of t-patterns in the closed arm and a significant reduction in the remaining zones. Results show that previous experience deeply modifies the temporal structure of rat behavior in the elevated plus maze. In addition, this article, by highlighting several conceptual, methodological and illustrative aspects of the utilization of t-pattern analysis, could represent a useful background for employing such a refined approach in the study of rat behavior in the elevated plus maze. Copyright © 2014 Elsevier B.V. All rights reserved.

  16. Multivariate Statistical Analysis: a tool for groundwater quality assessment in the hydrogeologic region of the Ring of Cenotes, Yucatan, Mexico.

    Science.gov (United States)

    Ye, M.; Pacheco Castro, R. B.; Pacheco Avila, J.; Cabrera Sansores, A.

    2014-12-01

    The karstic aquifer of Yucatan is a vulnerable and complex system. The first fifteen meters of this aquifer have been polluted; protecting this resource is therefore important because it is the only source of potable water for the entire State. Through the assessment of groundwater quality we can gain some knowledge about the main processes governing water chemistry as well as spatial patterns which are important for establishing protection zones. In this work, multivariate statistical techniques are used to assess the groundwater quality of the supply wells (30 to 40 meters deep) in the hydrogeologic region of the Ring of Cenotes, located in Yucatan, Mexico. Cluster analysis and principal component analysis are applied to groundwater chemistry data from the study area. Results of principal component analysis show that the main sources of variation in the data are sea water intrusion, the interaction of the water with the carbonate rocks of the system, and some pollution processes. The cluster analysis shows that the data can be divided into four clusters. The spatial distribution of the clusters seems to be random, but is consistent with sea water intrusion and pollution with nitrates. The overall results show that multivariate statistical analysis can be successfully applied in the groundwater quality assessment of this karstic aquifer.

  17. Constructing ordinal partition transition networks from multivariate time series.

    Science.gov (United States)

    Zhang, Jiayang; Zhou, Jie; Tang, Ming; Guo, Heng; Small, Michael; Zou, Yong

    2017-08-10

    A growing number of algorithms have been proposed to map a scalar time series into ordinal partition transition networks. However, most observable phenomena in the empirical sciences are of a multivariate nature. We construct ordinal partition transition networks for multivariate time series. This approach yields weighted directed networks representing the pattern transition properties of time series in velocity space, which hence provides dynamic insights into the underlying system. Furthermore, we propose a measure of entropy to characterize ordinal partition transition dynamics, which is sensitive to capturing the possible local geometric changes of phase space trajectories. We demonstrate the applicability of pattern transition networks to capture phase coherence to non-coherence transitions, and to characterize paths to phase synchronizations. Therefore, we conclude that the ordinal partition transition network approach provides complementary insight to the traditional symbolic analysis of nonlinear multivariate time series.
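
One plausible reading of "pattern transition properties in velocity space" is sketched below: each time step is labelled by the signs of the increments of every variable, consecutive labels form weighted directed edges, and a Shannon entropy summarizes the edge weights. The abstract does not give the exact construction, so the sign-based patterns, the entropy definition and all names here are assumptions for illustration.

```python
import math
from collections import Counter


def increment_patterns(series):
    """Label each step by the increment signs of all variables (1 = not falling)."""
    patterns = []
    for previous, current in zip(series, series[1:]):
        patterns.append(tuple(1 if c >= p else 0
                              for p, c in zip(previous, current)))
    return patterns


def transition_network(patterns):
    """Weighted directed edges: relative frequency of each pattern transition."""
    counts = Counter(zip(patterns, patterns[1:]))
    total = sum(counts.values())
    return {edge: count / total for edge, count in counts.items()}


def transition_entropy(weights):
    """Shannon entropy (in bits) of the transition frequencies."""
    return -sum(w * math.log2(w) for w in weights.values() if w > 0)


# Hypothetical two-variable series: x rises steadily while y oscillates.
series = [(0, 0), (1, 1), (2, 0), (3, 1), (4, 0)]
patterns = increment_patterns(series)
weights = transition_network(patterns)
entropy = transition_entropy(weights)
```

A series with few distinct transitions gives low entropy, while irregular dynamics spread the weight over many edges and raise it.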

  18. Newly Graduated Nurses' Competence and Individual and Organizational Factors: A Multivariate Analysis.

    Science.gov (United States)

    Numminen, Olivia; Leino-Kilpi, Helena; Isoaho, Hannu; Meretoja, Riitta

    2015-09-01

    To study the relationships between newly graduated nurses' (NGNs') perceptions of their professional competence, and individual and organizational work-related factors. A multivariate, quantitative, descriptive, correlation design was applied. Data collection took place in November 2012 with a national convenience sample of 318 NGNs representing all main healthcare settings in Finland. Five instruments measured NGNs' perceptions of their professional competence, occupational commitment, empowerment, practice environment, and its ethical climate, with additional questions on turnover intentions, job satisfaction, and demographics. Descriptive statistics summarized the demographic data, and inferential statistics multivariate path analysis modeling estimated the relationships between the variables. The strongest relationship was found between professional competence and empowerment, competence explaining 20% of the variance of empowerment. The explanatory power of competence regarding practice environment, ethical climate of the work unit, and occupational commitment, and competence's associations with turnover intentions, job satisfaction, and age, were statistically significant but considerably weaker. Higher competence and satisfaction with quality of care were associated with more positive perceptions of practice environment and its ethical climate as well as higher empowerment and occupational commitment. Apart from its association with empowerment, competence seems to be a rather independent factor in relation to the measured work-related factors. Further exploration would deepen the knowledge of this relationship, providing support for planning educational and developmental programs. Research on other individual and organizational factors is warranted to shed light on factors associated with professional competence in providing high-quality and safe care as well as retaining new nurses in the workforce. The study sheds light on the strength and direction of

  19. Intelligent Prediction of Soccer Technical Skill on Youth Soccer Player's Relative Performance Using Multivariate Analysis and Artificial Neural Network Techniques

    OpenAIRE

    Abdullah, M. R; Maliki, A. B. H. M; Musa, R. M; Kosni, N. A; Juahir, H

    2016-01-01

    This study aims to predict the potential pattern of soccer technical skill in relation to the relative performance of Malaysian youth soccer players using multivariate analysis and artificial neural network techniques. A total of 184 male youth soccer players recruited from a Malaysian soccer academy (average age = 15.2±2.0) underwent physical fitness, anthropometric, maturity, motivation, and soccer-related skill tests. Unsupervised pattern recognition by principal component analysis (PCA) was used to ident...

  20. [Temporary employment and health: a multivariate analysis of occupational injury risk by job tenure].

    Science.gov (United States)

    Bena, Antonella; Giraudo, Massimiliano

    2013-01-01

    To study the relationship between job tenure and injury risk, controlling for individual factors and company characteristics. Analysis of incidence and injury risk by job tenure, controlling for gender, age, nationality, economic activity, and firm size. Sample of 7% of Italian workers registered in the INPS (National Institute of Social Insurance) database. Private sector employees who worked as blue-collar workers or apprentices. First-time occupational injuries, all occupational injuries, serious occupational injuries. Our findings show an increase in injury risk among those who start a new job and an inverse relationship between job tenure and injury risk. Multivariate analysis confirms these results. Recommendations for improving this situation include the adoption of organizational models that provide periods of mentoring by colleagues already in the company and the assignment of simple, low-hazard tasks. The economic crisis may exacerbate this problem: it is important for Italy to improve its systems for monitoring the relationship between temporary employment and health.

  1. Multivariate Regression of Liver on Intestine of Mice: A ...

    African Journals Online (AJOL)

    Multivariate Regression of Liver on Intestine of Mice: A Chemotherapeutic Evaluation of Plant ... Using an analysis of covariance model, the effects ... The findings revealed, with the aid of likelihood-ratio statistic, a marked improvement in

  2. Multivariate Birkhoff interpolation

    CERN Document Server

    Lorentz, Rudolph A

    1992-01-01

    The subject of this book is Lagrange, Hermite and Birkhoff (lacunary Hermite) interpolation by multivariate algebraic polynomials. It unifies and extends a new algorithmic approach to this subject which was introduced and developed by G.G. Lorentz and the author. One particularly interesting feature of this algorithmic approach is that it obviates the necessity of finding a formula for the Vandermonde determinant of a multivariate interpolation in order to determine its regularity (which formulas are practically unknown anyways) by determining the regularity through simple geometric manipulations in the Euclidean space. Although interpolation is a classical problem, it is surprising how little is known about its basic properties in the multivariate case. The book therefore starts by exploring its fundamental properties and its limitations. The main part of the book is devoted to a complete and detailed elaboration of the new technique. A chapter with an extensive selection of finite elements follows as well a...

  3. Multivariate innovative approaches to the treatment of the emission of LIBS plasmas. Application to chemical online analysis in a nuclear environment

    International Nuclear Information System (INIS)

    El-Rakwe, Maria

    2016-01-01

    Online and in situ analysis is now a strategic development for analytical chemistry. This is especially true in the nuclear field, where the security constraints related to the radioactivity of samples and the need to minimize waste from analyses argue for remote measurement techniques without sampling or sample preparation. Laser-Induced Breakdown Spectroscopy (LIBS), a technique for elemental analysis of materials based on laser ablation and optical emission spectroscopy, has these qualities. It is a technique of choice for online analysis. However, the processes involved in LIBS, namely laser ablation, atomization, plasma formation and emission, are quite complex and difficult to control because the underlying physical phenomena are coupled and nonlinear. In addition, the analytical performance of the LIBS technique depends strongly on the choice of experimental conditions. Finally, an online analysis system should be as robust as possible in the face of uncontrolled variations in measurement conditions. The objective of this thesis is to improve the control and performance of quantitative analysis by LIBS using multivariate methods capable of handling multi-dimensionality, nonlinearity and the coupling between parameters and data. For this, the work is divided into two parts. First, the optimization is carried out using a central composite design to model the relationship between the experimental parameters of laser ablation (pulse energy and beam focusing parameters) and signal detection (delay time) and the physical characteristics of the plasma (ablated mass, temperature) and the analytical performance (intensity and repeatability of the signal). The resulting set of optimized parameters is then interpreted as the best compromise for quantitative analysis between efficiency of laser ablation and plasma heating. Secondly, a multivariate methodology based on MCR-ALS, ICA and PLS techniques was developed to quantify certain elements in different metallic matrices

  4. NIR and Py-mbms coupled with multivariate data analysis as a high-throughput biomass characterization technique : a review

    Directory of Open Access Journals (Sweden)

    Li Xiao

    2014-08-01

    Optimizing the use of lignocellulosic biomass as the feedstock for renewable energy production is currently being developed globally. Biomass is a complex mixture of cellulose, hemicelluloses, lignins, extractives, and proteins, as well as inorganic salts. Cell wall compositional analysis for biomass characterization is laborious and time consuming. In order to characterize biomass quickly and efficiently, several high-throughput technologies have been successfully developed. Among them, near infrared spectroscopy (NIR) and pyrolysis-molecular beam mass spectrometry (Py-mbms) are complementary tools capable of evaluating a large number of raw or modified biomass samples in a short period of time. NIR shows vibrations associated with specific chemical structures whereas Py-mbms depicts the full range of fragments from the decomposition of biomass. Both NIR vibrations and Py-mbms peaks are assigned to possible chemical functional groups and molecular structures. They provide complementary chemical insight into biomaterials. However, it is challenging to interpret the results because of the large number of overlapping bands or decomposition fragments contained in the spectra. In order to improve the efficiency of data analysis, multivariate analysis tools have been adapted to define the significant correlations among data variables, so that the large number of bands/peaks can be replaced by a small number of reconstructed variables representing the original variation. Reconstructed data variables are used for sample comparison (principal component analysis) and for building regression models (partial least squares regression) between biomass chemical structures and properties of interest. In this review, the important biomass chemical structures measured by NIR and Py-mbms are summarized. The advantages and disadvantages of conventional data analysis methods and multivariate data analysis methods are introduced, compared and evaluated.

  5. Multivariate multiscale entropy of financial markets

    Science.gov (United States)

    Lu, Yunfan; Wang, Jun

    2017-11-01

    In quantifying the dynamical properties of complex phenomena in financial market systems, multivariate financial time series have attracted wide attention. In this work, considering the shortcomings and limitations of univariate multiscale entropy in analyzing multivariate time series, the multivariate multiscale sample entropy (MMSE), which can evaluate the complexity in multiple data channels over different timescales, is applied to quantify the complexity of financial markets. Its effectiveness and advantages are verified by numerical simulations with two well-known synthetic noise signals. For the first time, the complexity of four generated trivariate return series for each stock trading hour in China stock markets is quantified thanks to the interdisciplinary application of this method. We find that the complexity of the trivariate return series in each hour shows a significant decreasing trend as the stock trading time progresses. Further, the shuffled multivariate return series and the absolute multivariate return series are also analyzed. As another new attempt, the complexity of global stock markets (Asia, Europe and America) is quantified by analyzing their multivariate returns. Finally, we utilize the multivariate multiscale entropy to assess the relative complexity of normalized multivariate return volatility series with different degrees.
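
    The abstract above does not spell out how the "different timescales" are obtained. Multiscale entropy methods conventionally start by coarse-graining each channel, that is, averaging non-overlapping windows of a given length before an entropy estimate is computed. The sketch below illustrates only that coarse-graining step, under the assumption that the conventional procedure applies here; the function names and the toy input are hypothetical, and the sample entropy calculation itself is not shown.

```python
def coarse_grain(series, scale):
    """Average non-overlapping windows of length `scale` (assumed conventional step)."""
    if scale < 1:
        raise ValueError("scale must be a positive integer")
    n_windows = len(series) // scale  # trailing samples that do not fill a window are dropped
    return [
        sum(series[i * scale:(i + 1) * scale]) / scale
        for i in range(n_windows)
    ]


def coarse_grain_multivariate(channels, scale):
    """Apply the same coarse-graining to every channel of a multivariate series."""
    return [coarse_grain(channel, scale) for channel in channels]


if __name__ == "__main__":
    # Hypothetical two-channel toy series, not data from the study.
    toy = [[1, 2, 3, 4, 5, 6, 7], [2, 2, 4, 4, 6, 6, 8]]
    print(coarse_grain_multivariate(toy, 2))
```

    At scale 1 the series is returned unchanged (as floats), so the entropy at scale 1 corresponds to the original data.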

  6. Breast tissue classification using x-ray scattering measurements and multivariate data analysis

    Science.gov (United States)

    Ryan, Elaine A.; Farquharson, Michael J.

    2007-11-01

    This study utilized two radiation scatter interactions in order to differentiate malignant from non-malignant breast tissue. These two interactions were Compton scatter, used to measure the electron density of the tissues, and coherent scatter to obtain a measure of structure. Measurements of these parameters were made using a laboratory experimental set-up comprising an x-ray tube and HPGe detector. The breast tissue samples investigated comprise five different tissue classifications: adipose, malignancy, fibroadenoma, normal fibrous tissue and tissue that had undergone fibrocystic change. The coherent scatter spectra were analysed using a peak fitting routine, and a technique involving multivariate analysis was used to combine the peak fitted scatter profile spectra and the electron density values into a tissue classification model. The number of variables used in the model was refined by finding the sensitivity and specificity of each model and concentrating on differentiating between two tissues at a time. The best model that was formulated had a sensitivity of 54% and a specificity of 100%.
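
    For readers unfamiliar with the two figures quoted at the end of this abstract, sensitivity is the fraction of truly positive samples that a model flags as positive, and specificity is the fraction of truly negative samples that it flags as negative. A minimal sketch of that calculation follows; the function name and the label vectors are hypothetical illustrations, not the study's data or code.

```python
def sensitivity_specificity(y_true, y_pred):
    """Return (sensitivity, specificity) for binary labels, where 1 means positive."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("both classes must be present in y_true")
    return tp / (tp + fn), tn / (tn + fp)


if __name__ == "__main__":
    # Hypothetical labels: 4 positive and 4 negative samples.
    truth = [1, 1, 1, 1, 0, 0, 0, 0]
    predicted = [1, 1, 0, 0, 0, 0, 0, 0]
    print(sensitivity_specificity(truth, predicted))
```

    A model with perfect specificity but moderate sensitivity, as reported above, never raises a false alarm but misses a share of the positive cases.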

  7. Breast tissue classification using x-ray scattering measurements and multivariate data analysis

    Energy Technology Data Exchange (ETDEWEB)

    Ryan, Elaine A; Farquharson, Michael J [School of Allied Health Sciences, City University, Charterhouse Square, London EC1M 6PA (United Kingdom)

    2007-11-21

    This study utilized two radiation scatter interactions in order to differentiate malignant from non-malignant breast tissue. These two interactions were Compton scatter, used to measure the electron density of the tissues, and coherent scatter to obtain a measure of structure. Measurements of these parameters were made using a laboratory experimental set-up comprising an x-ray tube and HPGe detector. The breast tissue samples investigated comprise five different tissue classifications: adipose, malignancy, fibroadenoma, normal fibrous tissue and tissue that had undergone fibrocystic change. The coherent scatter spectra were analysed using a peak fitting routine, and a technique involving multivariate analysis was used to combine the peak fitted scatter profile spectra and the electron density values into a tissue classification model. The number of variables used in the model was refined by finding the sensitivity and specificity of each model and concentrating on differentiating between two tissues at a time. The best model that was formulated had a sensitivity of 54% and a specificity of 100%.

  8. Batch-to-Batch Quality Consistency Evaluation of Botanical Drug Products Using Multivariate Statistical Analysis of the Chromatographic Fingerprint

    OpenAIRE

    Xiong, Haoshu; Yu, Lawrence X.; Qu, Haibin

    2013-01-01

    Botanical drug products have batch-to-batch quality variability due to botanical raw materials and the current manufacturing process. The rational evaluation and control of product quality consistency are essential to ensure the efficacy and safety. Chromatographic fingerprinting is an important and widely used tool to characterize the chemical composition of botanical drug products. Multivariate statistical analysis has shown its efficacy and applicability in the quality evaluation of many ...

  9. Analysis of Surface Water Pollution in the Kinta River Using Multivariate Technique

    International Nuclear Information System (INIS)

    Hamza Ahmad Isiyaka; Hafizan Juahir

    2015-01-01

    This study aims to investigate the spatial variation in the characteristics of water quality monitoring sites, identify the most significant parameters and the major possible sources of pollution, and apportion the source category in the Kinta River. Thirty-one parameters collected from eight monitoring sites over eight years (2006-2013) were employed. The eight monitoring stations were spatially grouped into three independent clusters in a dendrogram. A drastic reduction in the number of monitored parameters, from 31 to eight and nine significant parameters (P<0.05), was achieved using forward stepwise and backward stepwise discriminant analysis (DA). Principal component analysis (PCA) accounted for more than 76% of the total variance and attributed the source of pollution to anthropogenic and natural processes. The source apportionment using combined multiple linear regression and principal component scores indicates that 41% of the total pollution load is from rock weathering and untreated waste water, 26% from waste discharge, 24% from surface runoff and 7% from faecal waste. This study proposes a reduction in the number of monitoring stations and parameters for cost-effective and time-efficient monitoring, and shows that multivariate techniques can provide a simple representation of complex and dynamic water quality characteristics. (author)

  10. Applied Statistics: From Bivariate through Multivariate Techniques [with CD-ROM]

    Science.gov (United States)

    Warner, Rebecca M.

    2007-01-01

    This book provides a clear introduction to widely used topics in bivariate and multivariate statistics, including multiple regression, discriminant analysis, MANOVA, factor analysis, and binary logistic regression. The approach is applied and does not require formal mathematics; equations are accompanied by verbal explanations. Students are asked…

  11. Plasma metabolic profiling analysis of nephrotoxicity induced by acyclovir using metabonomics coupled with multivariate data analysis.

    Science.gov (United States)

    Zhang, Xiuxiu; Li, Yubo; Zhou, Huifang; Fan, Simiao; Zhang, Zhenzhu; Wang, Lei; Zhang, Yanjun

    2014-08-01

    Acyclovir (ACV) is an antiviral agent. However, its use is limited by adverse side effects, particularly its nephrotoxicity. Metabonomics technology can provide essential information on the metabolic profiles of biofluids and organs upon drug administration. Therefore, in this study, mass spectrometry-based metabonomics coupled with multivariate data analysis was used to identify the plasma metabolites and metabolic pathways related to nephrotoxicity caused by intraperitoneal injection of low (50mg/kg) and high (100mg/kg) doses of acyclovir. Sixteen biomarkers were identified by metabonomics, and nephrotoxicity results revealed the dose-dependent effect of acyclovir on kidney tissues. The present study showed that the top four metabolic pathways disrupted by acyclovir included the metabolism of arachidonic acid, tryptophan, arginine and proline, and glycerophospholipid. This research shows that the established metabonomic approach can provide information on changes in metabolites and metabolic pathways, which can be applied to in-depth research on the mechanism of acyclovir-induced kidney injury. Copyright © 2014 Elsevier B.V. All rights reserved.

  12. G-rich, a Drosophila selenoprotein, is a Golgi-resident type III membrane protein

    International Nuclear Information System (INIS)

    Chen, Chang Lan; Shim, Myoung Sup; Chung, Jiyeol; Yoo, Hyun-Seung; Ha, Ji Min; Kim, Jin Young; Choi, Jinmi; Zang, Shu Liang; Hou, Xiao; Carlson, Bradley A.; Hatfield, Dolph L.; Lee, Byeong Jae

    2006-01-01

    G-rich is a Drosophila melanogaster selenoprotein, which is a homologue of human and mouse SelK. Subcellular localization analysis using GFP-tagged G-rich showed that G-rich was localized in the Golgi apparatus. The fusion protein was co-localized with the Golgi marker proteins but not with an endoplasmic reticulum (ER) marker protein in Drosophila SL2 cells. Bioinformatic analysis of G-rich suggests that this protein is either a type II or a type III transmembrane protein. To determine the type of transmembrane protein experimentally, GFP-G-rich, in which GFP was tagged at the N-terminus of G-rich, or G-rich-GFP, in which GFP was tagged at the C-terminus of G-rich, was expressed in SL2 cells. The tagged proteins were then digested with trypsin and analyzed by Western blot analysis. The results showed that the C-terminus of the G-rich protein was exposed to the cytoplasm, indicating that it is a type III microsomal membrane protein. G-rich is the first selenoprotein identified in the Golgi apparatus

  13. Multivariate data analysis

    Digital Repository Service at National Institute of Oceanography (India)

    Fernandes, A.A.; Antony, M.K.; Somayajulu, Y.K.; Sarma, Y.V.B.; Almeida, A.M.; Mahadevan, R.

    , Head Applied Statistics Unit, Indian Statistical Institute, Calcutta for going through the section on Canonical Correlation Analysis and offering his comments on the same. This report has been prepared using "Latex" on a "Linux" platform, viz...., the personal computer Kapila. I wish to thank Mr. Dattaram Shivji for installing "Latex" and "GMT" packages on the personal computer. The style file used for preparing this report, has been hacked by me from a Goa University, Ph. D style file prepared by Dr. D...

  14. Multivariate Multiscale Analysis

    Science.gov (United States)

    1990-11-08

    The conditions on k in the second half of the statement of the proposition can be somewhat relaxed. In the cases n = 2 and n = 3 the details are given...of Mathematical Functions, Dover, New York, N.Y., 1965. [2] Bray and D. C. Solmon, The horocycle transform and harmonic analysis on the Poincare disk...H. Izen, Inversion of the k-plane transform by orthogonal function series expansions, Inverse Problems, 5 (1989), 181-202. [20] J. V. Leahy, K. T

  15. Decoding the complex brain: multivariate and multimodal analyses of neuroimaging data

    International Nuclear Information System (INIS)

    Salami, Alireza

    2012-01-01

    Functional brain images are extraordinarily rich data sets that reveal distributed brain networks engaged in a wide variety of cognitive operations. It is a substantial challenge both to create models of cognition that mimic behavior and underlying cognitive processes and to choose a suitable analytic method to identify underlying brain networks. Most of the contemporary techniques used in analyses of functional neuroimaging data are based on univariate approaches in which single image elements (i.e. voxels) are considered to be computationally independent measures. Beyond univariate methods (e.g. statistical parametric mapping), multivariate approaches, which identify a network across all regions of the brain rather than a tessellation of regions, are potentially well suited for analyses of brain imaging data. A multivariate method (e.g. partial least squares) is a computational strategy that determines time-varying distributed patterns of the brain (as a function of a cognitive task). Compared to its univariate counterparts, a multivariate approach provides greater levels of sensitivity and reflects cooperative interactions among brain regions. Thus, by considering information across more than one measuring point, additional information on brain function can be revealed. Similarly, by considering information across more than one measuring technique, the nature of underlying cognitive processes can become better understood. Cognitive processes have been investigated in conjunction with multiple neuroimaging modalities (e.g. fMRI, sMRI, EEG, DTI), whereas the typical method has been to analyze each modality separately. Accordingly, little work has been carried out to examine the relation between different modalities. Indeed, due to the interconnected nature of brain processing, it is plausible that changes in one modality locally or distally modulate changes in another modality. This thesis focuses on multivariate and multimodal methods of image analysis applied to

  16. Decoding the complex brain: multivariate and multimodal analyses of neuroimaging data

    Energy Technology Data Exchange (ETDEWEB)

    Salami, Alireza

    2012-07-01

    Functional brain images are extraordinarily rich data sets that reveal distributed brain networks engaged in a wide variety of cognitive operations. It is a substantial challenge both to create models of cognition that mimic behavior and underlying cognitive processes and to choose a suitable analytic method to identify underlying brain networks. Most of the contemporary techniques used in analyses of functional neuroimaging data are based on univariate approaches in which single image elements (i.e. voxels) are considered to be computationally independent measures. Beyond univariate methods (e.g. statistical parametric mapping), multivariate approaches, which identify a network across all regions of the brain rather than a tessellation of regions, are potentially well suited for analyses of brain imaging data. A multivariate method (e.g. partial least squares) is a computational strategy that determines time-varying distributed patterns of the brain (as a function of a cognitive task). Compared to its univariate counterparts, a multivariate approach provides greater levels of sensitivity and reflects cooperative interactions among brain regions. Thus, by considering information across more than one measuring point, additional information on brain function can be revealed. Similarly, by considering information across more than one measuring technique, the nature of underlying cognitive processes can become better understood. Cognitive processes have been investigated in conjunction with multiple neuroimaging modalities (e.g. fMRI, sMRI, EEG, DTI), whereas the typical method has been to analyze each modality separately. Accordingly, little work has been carried out to examine the relation between different modalities. Indeed, due to the interconnected nature of brain processing, it is plausible that changes in one modality locally or distally modulate changes in another modality. This thesis focuses on multivariate and multimodal methods of image analysis applied to

  17. Development of methodology for identification the nature of the polyphenolic extracts by FTIR associated with multivariate analysis

    Science.gov (United States)

    Grasel, Fábio dos Santos; Ferrão, Marco Flôres; Wolf, Carlos Rodolfo

    2016-01-01

    Tannins are polyphenolic compounds of complex structures formed by secondary metabolism in several plants. These polyphenolic compounds have different applications, such as drugs, anti-corrosion agents, flocculants, and tanning agents. This study analyses six different types of polyphenolic extracts by Fourier transform infrared spectroscopy (FTIR) combined with multivariate analysis. Through both principal component analysis (PCA) and hierarchical cluster analysis (HCA), we observed well-defined separation between condensed (quebracho and black wattle) and hydrolysable (valonea, chestnut, myrobalan, and tara) tannins. For hydrolysable tannins, it was also possible to observe the formation of two different subgroups, one containing the chestnut and valonea samples and the other the tara and myrobalan samples. Among all samples analysed, chestnut and valonea showed the greatest similarity, indicating that these extracts have equivalent chemical compositions and structures and, therefore, similar properties.

  18. Studies on multivariate autoregressive analysis using synthesized reactor noise-like data for optimal modelling

    Energy Technology Data Exchange (ETDEWEB)

    Ciftcioglu, O.; Hoogenboom, J.E.; Dam, H. van

    1988-01-01

    Studies on multivariate autoregressive (MAR) analysis are carried out to choose the parameters for optimally modelling data obtained from various sensors. Accordingly, the roles of the parameters in the analysis results are identified and the related ambiguities are reduced. Experimental investigations are carried out by means of synthesized reactor noise-like data obtained from a digital simulator that provides simulated stochastic signals of an operating nuclear reactor, which makes the simulator a favourable tool for the present studies. As the system is well defined with a known structure, the MAR analysis results are compared precisely with the true values. With the help of the information gained through these studies, the conditions to be observed for optimal signal processing in MAR modelling are determined. Although the parameters involved are interrelated and must be given values suited to the particular application in hand, some criteria, namely memory-time and sample length-time, play an essential role in AR modelling and are found to apply to each individual case for establishing optimality.

  19. Studies on multivariate autoregressive analysis using synthesized reactor noise-like data for optimal modelling

    International Nuclear Information System (INIS)

    Ciftcioglu, O.

    1988-01-01

    Studies on multivariate autoregressive (MAR) analysis are carried out to choose the parameters for optimally modelling data obtained from various sensors. Accordingly, the roles of the parameters in the analysis results are identified and the related ambiguities are reduced. Experimental investigations are carried out by means of synthesized reactor noise-like data obtained from a digital simulator that provides simulated stochastic signals of an operating nuclear reactor, which makes the simulator a favourable tool for the present studies. As the system is well defined with a known structure, the MAR analysis results are compared precisely with the true values. With the help of the information gained through these studies, the conditions to be observed for optimal signal processing in MAR modelling are determined. Although the parameters involved are interrelated and must be given values suited to the particular application in hand, some criteria, namely memory-time and sample length-time, play an essential role in AR modelling and are found to apply to each individual case for establishing optimality. (author)

  20. Multivariate Variables Recognition using Hotelling’s T2 and MEWMA via ANN’s

    Directory of Open Access Journals (Sweden)

    Chiñas-Sánchez Pamela

    2014-01-01

    In this article, a method for multivariate pattern recognition using artificial neural networks (ANN) is proposed. The method is useful for monitoring multiple variables during statistical process control. It employs descriptive statistics and multivariate control techniques. Three different ANNs are evaluated to identify the network with the highest efficiency in pattern recognition tasks on multivariate variables from databases. Two databases are analyzed; the first was generated by simulation using the Monte Carlo method, and the second was obtained from a public database repository. The method consists of three stages: multivariate variable generation, multivariate analysis, and pattern recognition using ANNs. Several multivariate scenarios were generated using combinations of 2, 3 and 4 patterns in multivariate variables, and the Hotelling's T² and MEWMA statistics were analyzed to understand their behavior and determine their statistical characteristics. The pattern recognition task was evaluated using the ANN. In both case studies, experimental results showed improved efficiency when using the Perceptron and the Backpropagation networks compared to the RBF network.
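
    As background to the statistics named in this abstract, Hotelling's T² for a single observation x with known in-control mean μ and covariance Σ is the quadratic form (x − μ)ᵀ Σ⁻¹ (x − μ). The sketch below evaluates it for two variables only; the function name and the numbers are hypothetical, the in-control parameters are assumed known rather than estimated, and the MEWMA statistic and the neural-network stage of the article are not reproduced.

```python
def hotelling_t2_2d(x, mean, cov):
    """Hotelling's T^2 = (x - mean)^T cov^{-1} (x - mean) for two variables."""
    d0 = x[0] - mean[0]
    d1 = x[1] - mean[1]
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    # The inverse of [[a, b], [c, d]] is (1 / det) * [[d, -b], [-c, a]].
    return (d0 * (d * d0 - b * d1) + d1 * (-c * d0 + a * d1)) / det


if __name__ == "__main__":
    # Hypothetical in-control parameters and one new observation.
    in_control_mean = (0.0, 0.0)
    in_control_cov = ((2.0, 0.0), (0.0, 2.0))
    print(hotelling_t2_2d((3.0, 4.0), in_control_mean, in_control_cov))
```

    In a control chart, each new T² value would be compared against an upper control limit; how that limit was chosen in the article is not stated in the abstract.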

  1. In situ photobiology of corals over large depth ranges: A multivariate analysis on the roles of environment, host, and algal symbiont

    NARCIS (Netherlands)

    Frade, P.R.; Bongaerts, P.; Winkelhagen, A.J.S.; Tonk, L.; Bak, R.P.M.

    2008-01-01

    We applied a multivariate analysis to investigate the roles of host and symbiont on the in situ physiological response of genus Madracis holobionts towards light. Across a large depth gradient (5-40 m) and for four Madracis species and three symbiont genotypes, we assessed several variables by

  2. Sparse Linear Identifiable Multivariate Modeling

    DEFF Research Database (Denmark)

    Henao, Ricardo; Winther, Ole

    2011-01-01

    In this paper we consider sparse and identifiable linear latent variable (factor) and linear Bayesian network models for parsimonious analysis of multivariate data. We propose a computationally efficient method for joint parameter and model inference, and model comparison. It consists of a fully... and benchmarked on artificial and real biological data sets. SLIM is closest in spirit to LiNGAM (Shimizu et al., 2006), but differs substantially in inference, Bayesian network structure learning and model comparison. Experimentally, SLIM performs equally well or better than LiNGAM with comparable...

  3. Evaluation of genetic diversity among soybean (Glycine max) genotypes using univariate and multivariate analysis.

    Science.gov (United States)

    Oliveira, M M; Sousa, L B; Reis, M C; Silva Junior, E G; Cardoso, D B O; Hamawaki, O T; Nogueira, A P O

    2017-05-31

    The study of genetic diversity is of paramount importance in breeding programs because it allows the selection of genetically divergent parents that have the agronomic traits desired by the breeder. This study aimed to characterize the genetic divergence among 24 soybean genotypes through their agronomic traits, using multivariate clustering methods to select potential parents for promising hybrid combinations. The six agronomic traits evaluated were number of days to flowering and to maturity, plant height at flowering and at maturity, insertion height of the first pod, and yield. Genetic divergence was evaluated by multivariate analysis, first estimating the Mahalanobis generalized distance (D²) and then clustering by Tocher's optimization method and by the unweighted pair group method with arithmetic average (UPGMA). Tocher's optimization method and UPGMA agreed on the constitution of the groups, forming eight distinct groups according to Tocher's method and seven using UPGMA. The number of days to flowering (45.66%) was the trait that contributed most to explaining dissimilarity between genotypes and should be one of the main traits considered by the breeder when choosing parents in soybean breeding programs. The genetic variability allowed the identification of dissimilar genotypes with superior performance. The hybridizations UFU 18 x UFUS CARAJÁS, UFU 15 x UFU 13, and UFU 13 x UFUS CARAJÁS are promising for obtaining superior segregating populations, which enable the development of more productive genotypes.
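
    The UPGMA step mentioned in this abstract can be illustrated with a short sketch: starting from a matrix of pairwise distances (for example, Mahalanobis D² values), the two closest clusters are merged repeatedly, and the distance between two clusters is the plain average of all pairwise distances between their members. The function name, the genotype labels G1 to G3 and the distances below are hypothetical and are not taken from the study; Tocher's method is not shown.

```python
from itertools import combinations


def upgma(labels, dist):
    """Average-linkage (UPGMA) clustering.

    labels: list of item names.
    dist:   dict mapping frozenset({a, b}) to the distance between items a and b.
    Returns the merges in order as (cluster_1, cluster_2, average_distance).
    """
    clusters = [(label,) for label in labels]
    merges = []

    def average_distance(c1, c2):
        # Unweighted mean over every pair with one member in each cluster.
        total = sum(dist[frozenset((a, b))] for a in c1 for b in c2)
        return total / (len(c1) * len(c2))

    while len(clusters) > 1:
        c1, c2 = min(combinations(clusters, 2),
                     key=lambda pair: average_distance(*pair))
        merges.append((c1, c2, average_distance(c1, c2)))
        clusters = [c for c in clusters if c not in (c1, c2)] + [c1 + c2]
    return merges


if __name__ == "__main__":
    # Hypothetical pairwise distances between three genotypes.
    d = {
        frozenset(("G1", "G2")): 2.0,
        frozenset(("G1", "G3")): 6.0,
        frozenset(("G2", "G3")): 8.0,
    }
    for merge in upgma(["G1", "G2", "G3"], d):
        print(merge)
```

    The sketch reports the average inter-cluster distance at each merge; dendrogram software often plots half of that value as the branch height. Cutting the resulting tree at a chosen distance yields groups such as the seven UPGMA groups reported above.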

  4. Qupe--a Rich Internet Application to take a step forward in the analysis of mass spectrometry-based quantitative proteomics experiments.

    Science.gov (United States)

    Albaum, Stefan P; Neuweger, Heiko; Fränzel, Benjamin; Lange, Sita; Mertens, Dominik; Trötschel, Christian; Wolters, Dirk; Kalinowski, Jörn; Nattkemper, Tim W; Goesmann, Alexander

    2009-12-01

    The goal of present -omics sciences is to understand biological systems as a whole in terms of interactions of the individual cellular components. One of the main building blocks in this field of study is proteomics, where tandem mass spectrometry (LC-MS/MS) in combination with isotopic labelling techniques provides a common way to obtain a direct insight into regulation at the protein level. Methods to identify and quantify the peptides contained in a sample are well established, and their output usually results in lists of identified proteins and calculated relative abundance values. The next step is to move ahead from these abstract lists and apply statistical inference methods to compare measurements, to identify genes that are significantly up- or down-regulated, or to detect clusters of proteins with similar expression profiles. We introduce the Rich Internet Application (RIA) Qupe, which provides comprehensive data management and analysis functions for LC-MS/MS experiments. Starting with the import of mass spectra data, the system guides the experimenter through the process of protein identification by database search, the calculation of protein abundance ratios, and, in particular, the statistical evaluation of the quantification results, including multivariate analysis methods such as analysis of variance or hierarchical cluster analysis. While a data model to store these results has been developed, a well-defined programming interface facilitates the integration of novel approaches. A compute cluster is utilized to distribute computationally intensive calculations, and a web service allows information to be interchanged with other -omics software applications. To demonstrate that Qupe represents a step forward in quantitative proteomics analysis, an application study on Corynebacterium glutamicum has been carried out. Qupe is implemented in Java utilizing Hibernate, Echo2, R and the Spring framework.
We encourage the usage of the RIA in the sense of the 'software as a

  5. A guide to statistical analysis in microbial ecology: a community-focused, living review of multivariate data analyses.

    Science.gov (United States)

    Buttigieg, Pier Luigi; Ramette, Alban

    2014-12-01

    The application of multivariate statistical analyses has become a consistent feature in microbial ecology. However, many microbial ecologists are still in the process of developing a deep understanding of these methods and appreciating their limitations. As a consequence, staying abreast of progress and debate in this arena poses an additional challenge to many microbial ecologists. To address these issues, we present the GUide to STatistical Analysis in Microbial Ecology (GUSTA ME): a dynamic, web-based resource providing accessible descriptions of numerous multivariate techniques relevant to microbial ecologists. A combination of interactive elements allows users to discover and navigate between methods relevant to their needs and examine how they have been used by others in the field. We have designed GUSTA ME to become a community-led and -curated service, which we hope will provide a common reference and forum to discuss and disseminate analytical techniques relevant to the microbial ecology community. © 2014 The Authors. FEMS Microbiology Ecology published by John Wiley & Sons Ltd on behalf of Federation of European Microbiological Societies.

  6. Application of multivariate statistical techniques in microbial ecology.

    Science.gov (United States)

    Paliy, O; Shankar, V

    2016-03-01

    Recent advances in high-throughput methods of molecular analyses have led to an explosion of studies generating large-scale ecological data sets. In particular, noticeable effect has been attained in the field of microbial ecology, where new experimental approaches provided in-depth assessments of the composition, functions and dynamic changes of complex microbial communities. Because even a single high-throughput experiment produces large amount of data, powerful statistical techniques of multivariate analysis are well suited to analyse and interpret these data sets. Many different multivariate techniques are available, and often it is not clear which method should be applied to a particular data set. In this review, we describe and compare the most widely used multivariate statistical techniques including exploratory, interpretive and discriminatory procedures. We consider several important limitations and assumptions of these methods, and we present examples of how these approaches have been utilized in recent studies to provide insight into the ecology of the microbial world. Finally, we offer suggestions for the selection of appropriate methods based on the research question and data set structure. © 2016 John Wiley & Sons Ltd.

  7. Butterfly Species Richness in Selected West Albertine Rift Forests

    Directory of Open Access Journals (Sweden)

    Patrice Kasangaki

    2012-01-01

    The butterfly species richness of 17 forests located in the western arm of the Albertine Rift in Uganda was compared using cluster analysis and principal components analysis (PCA) to assess similarities among the forests. The objective was to compare the butterfly species richness of the forests. A total of 630 butterfly species in 5 main families were collected. The species fell into 7 ecological groupings, with the closed forest group having the most species and the swamp/wetland group the fewest. Three clusters were obtained. The first cluster contained forests characterized by relatively high altitude and low species richness, despite the large area in the case of Rwenzori and being close to the supposed Pleistocene refugium. The second cluster contained forests far from the supposed refugium (except Kisangi), with moderate species richness and small areas, whereas the third cluster contained the more disturbed forests, with high species richness, low altitude, and large areas.

  8. Multivariate Analysis of the Predictors of Survival for Patients with Hepatocellular Carcinoma Undergoing Transarterial Chemoembolization: Focusing on Superselective Chemoembolization

    International Nuclear Information System (INIS)

    Ji, Suk Kyeong; Cho, Yun Ku; Ahn, Yong Sik; Kim, Mi Young; Park, Yoon Ok; Kim, Jae Kyun; Kim, Wan Tae

    2008-01-01

    While the prognostic factors of survival for patients with hepatocellular carcinoma (HCC) who underwent transarterial chemoembolization (TACE) are well known, the clinical significance of performing selective TACE for HCC patients has not been clearly documented. We tried to analyze the potential factors of disease-free survival for these patients, including the performance of selective TACE. A total of 151 patients with HCC who underwent TACE were retrospectively analyzed for their disease-free survival (median follow-up of 23 months, range: 1-88 months). Univariate and multivariate analyses were performed for 20 potential factors by using the Cox proportional hazard model, including 19 baseline factors and one procedure-related factor (conventional versus selective TACE). The parameters that proved to be significant on the univariate analysis were subsequently tested with the multivariate model. Conventional or selective TACE was performed for 40 and 111 patients, respectively. Univariate and multivariate analyses revealed that tumor multiplicity, venous tumor thrombosis and selective TACE were the only three independent significant prognostic factors of disease-free survival (p = 0.002, 0.015 and 0.019, respectively). In our study, selective TACE was a favorable prognostic factor for the disease-free survival of patients with HCC who underwent TACE.

  9. Time-series panel analysis (TSPA): multivariate modeling of temporal associations in psychotherapy process.

    Science.gov (United States)

    Ramseyer, Fabian; Kupper, Zeno; Caspar, Franz; Znoj, Hansjörg; Tschacher, Wolfgang

    2014-10-01

    Processes occurring in the course of psychotherapy are characterized by the simple fact that they unfold in time and that the multiple factors engaged in change processes vary highly between individuals (idiographic phenomena). Previous research, however, has neglected the temporal perspective by its traditional focus on static phenomena, which were mainly assessed at the group level (nomothetic phenomena). To support a temporal approach, the authors introduce time-series panel analysis (TSPA), a statistical methodology explicitly focusing on the quantification of temporal, session-to-session aspects of change in psychotherapy. TSPA-models are initially built at the level of individuals and are subsequently aggregated at the group level, thus allowing the exploration of prototypical models. TSPA is based on vector auto-regression (VAR), an extension of univariate auto-regression models to multivariate time-series data. The application of TSPA is demonstrated in a sample of 87 outpatient psychotherapy patients who were monitored by postsession questionnaires. Prototypical mechanisms of change were derived from the aggregation of individual multivariate models of psychotherapy process. In a 2nd step, the associations between mechanisms of change (TSPA) and pre- to postsymptom change were explored. TSPA allowed a prototypical process pattern to be identified, where patient's alliance and self-efficacy were linked by a temporal feedback-loop. Furthermore, therapist's stability over time in both mastery and clarification interventions was positively associated with better outcomes. TSPA is a statistical tool that sheds new light on temporal mechanisms of change. Through this approach, clinicians may gain insight into prototypical patterns of change in psychotherapy. PsycINFO Database Record (c) 2014 APA, all rights reserved.
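
Since TSPA is described as building on vector auto-regression, a first-order VAR fitted to one individual's series may help make the idea concrete. The sketch below is a minimal least-squares VAR(1) on simulated data; the two variables, the coefficient matrix `a_true`, and the series length are hypothetical choices for illustration, and the group-level aggregation step of TSPA is not shown.

```python
import numpy as np

def fit_var1(series):
    """Fit y[t] = c + A @ y[t-1] + e[t] by ordinary least squares.

    Returns the intercept vector c and the lag-1 coefficient matrix A.
    """
    y = np.asarray(series, dtype=float)
    # Design matrix: a column of ones plus the lagged observations.
    design = np.hstack([np.ones((len(y) - 1, 1)), y[:-1]])
    coef, *_ = np.linalg.lstsq(design, y[1:], rcond=None)
    return coef[0], coef[1:].T

rng = np.random.default_rng(0)

# Hypothetical lag-1 dynamics for two session-level variables; the non-zero
# off-diagonal entries encode a feedback loop between them.
a_true = np.array([[0.5, 0.2],
                   [0.3, 0.4]])

# Simulate a long series (far longer than a real therapy, so that the
# estimate is visibly close to a_true).
y = np.zeros((2000, 2))
for t in range(1, len(y)):
    y[t] = a_true @ y[t - 1] + rng.normal(size=2)

intercept, a_hat = fit_var1(y)
print(np.round(a_hat, 2))
```

In `a_hat`, entry `[i, j]` estimates how variable `j` at one session relates to variable `i` at the next, which is the kind of session-to-session association the abstract refers to.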

  10. Multivariate erosion risk assessment of lateritic badlands of Birbhum ...

    Indian Academy of Sciences (India)

    Erosion risk; soil erosion; sediment yield; multivariate analysis; GIS. J. Earth Syst. Sci. 121, No. ... ers are threatened by excessive soil loss by water. To reach that goal the ... nacle erosion, bare soil cover, barren waste land, tunnels and ...

  11. Characterization of metal pollution in soils under two landuse patterns in the Angouran region, NW Iran; a study based on multivariate data analysis

    International Nuclear Information System (INIS)

    Qishlaqi, Afshin; Moore, Farid; Forghani, Giti

    2009-01-01

    The study presents the application of selected multivariate statistical methods (multivariate analysis of variance, discriminant analysis, principal component analysis) and geostatistical techniques to evaluate soil pollution status in arable lands of the Angouran region, NW Iran. Two representative landuse patterns, cropland and grassland, were selected for the purpose of this study. Seventy soil samples (35 topsoils and 35 subsoils) were collected from the two landuse types and 21 soil parameters including total element content and physicochemical properties were also determined. Results from application of the multivariate analysis of variance showed that the two landuse patterns were not statistically differentiated by subsoil variables, whereas significant differences existed between the two landuse patterns with respect to topsoil variables. Discriminant analysis rendered seven variables (Cu, As, Cd, OM, P, K and total N) as indicator parameters responsible for the discrimination between the two landuse types. Using the principal component analysis (PCA), two main components (PCs) explaining 71.71% of total variance were extracted. PC1, with a high contribution of Ni, Cr, Fe, Mn and clay content, was hypothesized as a lithogenic component, and PC2, with high loadings for the seven discerning variables (Cu, As, Cd, OM, P, K and total N), was considered an agrogenic component. Geostatistical analyses, including the calculation of semivariogram parameters and model fitting, further supported the PCA results. PC1 was generally characterized by moderate spatial dependence and long-range spatial variation (8000 m) influenced by soil parent material composition, while PC2 was modelled by a pure nugget effect, probably reflecting the influences of agrogenic activities.
The findings of this study could not only expand our knowledge regarding the soil pollution status in the study area, but would also provide decision makers with the information to manage the agrochemical

  12. Opportunities for multivariate analysis of open spatial datasets to characterize urban flooding risks

    Directory of Open Access Journals (Sweden)

    S. Gaitan

    2015-06-01

    Cities worldwide are challenged by increasing urban flood risks. Precise and realistic measures are required to reduce flooding impacts. However, currently implemented sewer and topographic models do not provide realistic predictions of local flooding occurrence during heavy rain events. Assessing other factors, such as spatially distributed rainfall, socioeconomic characteristics, and social sensing, may help to explain the probability and impacts of urban flooding. Several spatial datasets have recently been made available in the Netherlands, including rainfall-related incident reports made by citizens, spatially distributed rain depths, semidistributed socioeconomic information, and building age. The potential of these data to explain the occurrence of rainfall-related incidents has not yet been inspected. Multivariate analysis tools for describing communities and environmental patterns have previously been developed and used in the field of ecology. The objective of this paper is to outline opportunities for these tools to explore urban flooding risk patterns in the mentioned datasets. To that end, a cluster analysis is performed. Results indicate that the incidence of rainfall-related impacts is higher in areas characterized by older infrastructure and higher population density.

  13. Practical multivariate analysis

    CERN Document Server

    Afifi, Abdelmonem; Clark, Virginia A

    2011-01-01

    ""First of all, it is very easy to read. … The authors manage to introduce and (at least partially) explain even quite complex concepts, e.g. eigenvalues, in an easy and pedagogical way that I suppose is attractive to readers without deeper statistical knowledge. The text is also sprinkled with references for those who want to probe deeper into a certain topic. Secondly, I personally find the book's emphasis on practical data handling very appealing. … Thirdly, the book gives very nice coverage of regression analysis. … this is a nicely written book that gives a good overview of a large number

  14. Multivariate analysis of behavioural response experiments in humpback whales (Megaptera novaeangliae).

    Science.gov (United States)

    Dunlop, Rebecca A; Noad, Michael J; Cato, Douglas H; Kniest, Eric; Miller, Patrick J O; Smith, Joshua N; Stokes, M Dale

    2013-03-01

    The behavioural response study (BRS) is an experimental design used by field biologists to determine the function and/or behavioural effects of conspecific, heterospecific or anthropogenic stimuli. When carrying out these studies in marine mammals, it is difficult to make basic observations and achieve sufficient sample sizes because of the high cost and logistical difficulties. Rarely are other factors such as social context or the physical environment considered in the analysis because of these difficulties. This paper presents results of a BRS carried out in humpback whales to test the response of groups to one recording of conspecific social sounds and an artificially generated tone stimulus. Experiments were carried out in September/October 2004 and 2008 during the humpback whale southward migration along the east coast of Australia. In total, 13 'tone' experiments, 15 'social sound' experiments (using one recording of social sounds) and three silent controls were carried out over two field seasons. The results (using a mixed model statistical analysis) suggested that humpback whales responded differently to the two stimuli, measured by changes in course travelled and dive behaviour. Although the response to 'tones' was consistent, in that groups moved offshore and surfaced more often (suggesting an aversion to the stimulus), the response to 'social sounds' was highly variable and dependent upon the composition of the social group. The change in course and dive behaviour in response to 'tones' was found to be related to proximity to the source, the received signal level and signal-to-noise ratio (SNR). This study demonstrates that the behavioural responses of marine mammals to acoustic stimuli are complex. In order to tease out such multifaceted interactions, the number of replicates and factors measured must be sufficient for multivariate analysis.

  15. Complex numbers in chemometrics: examples from multivariate impedance measurements on lipid monolayers.

    Science.gov (United States)

    Geladi, Paul; Nelson, Andrew; Lindholm-Sethson, Britta

    2007-07-09

    Electrical impedance gives multivariate complex number data as results. Two examples of multivariate electrical impedance data measured on lipid monolayers in different solutions give rise to matrices (16x50 and 38x50) of complex numbers. Multivariate data analysis by principal component analysis (PCA) or singular value decomposition (SVD) can be used for complex data and the necessary equations are given. The scores and loadings obtained are vectors of complex numbers. It is shown that the complex number PCA and SVD are better at concentrating information in a few components than the naïve juxtaposition method and that Argand diagrams can replace score and loading plots. Different concentrations of Magainin and Gramicidin A give different responses and also the role of the electrolyte medium can be studied. An interaction of Gramicidin A in the solution with the monolayer over time can be observed.
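
The claim that PCA/SVD carries over to complex data can be illustrated with NumPy, whose `numpy.linalg.svd` accepts complex matrices directly. The sketch below is only an illustration: the 16x50 matrix is random placeholder data with the same shape as one of the matrices mentioned in the abstract, and column centring before the decomposition is an assumption rather than something the abstract states.

```python
import numpy as np

rng = np.random.default_rng(1)

# Hypothetical 16 x 50 complex matrix standing in for impedance spectra
# (random placeholders, not measured data).
z = rng.normal(size=(16, 50)) + 1j * rng.normal(size=(16, 50))

# Assumption: centre each column before decomposing.
zc = z - z.mean(axis=0)

# SVD of a complex matrix: zc = U @ diag(s) @ Vh, with Vh the conjugate
# transpose of the loading matrix.
u, s, vh = np.linalg.svd(zc, full_matrices=False)

scores = u * s            # complex scores, one row per sample
loadings = vh.conj().T    # complex loadings, one row per variable
explained = s**2 / np.sum(s**2)

# The conjugate transpose (not the plain transpose) reconstructs the data.
reconstructed = scores @ loadings.conj().T
print(np.allclose(reconstructed, zc), np.round(explained[:3], 3))
```

Plotting the real part of a score or loading vector against its imaginary part gives the Argand-diagram view that the abstract proposes in place of ordinary score and loading plots.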

  16. CoSMoMVPA: Multi-Modal Multivariate Pattern Analysis of Neuroimaging Data in Matlab/GNU Octave.

    Science.gov (United States)

    Oosterhof, Nikolaas N; Connolly, Andrew C; Haxby, James V

    2016-01-01

    Recent years have seen an increase in the popularity of multivariate pattern (MVP) analysis of functional magnetic resonance (fMRI) data, and, to a much lesser extent, magneto- and electro-encephalography (M/EEG) data. We present CoSMoMVPA, a lightweight MVPA (MVP analysis) toolbox implemented in the intersection of the Matlab and GNU Octave languages, that treats both fMRI and M/EEG data as first-class citizens. CoSMoMVPA supports all state-of-the-art MVP analysis techniques, including searchlight analyses, classification, correlations, representational similarity analysis, and the time generalization method. These can be used to address both data-driven and hypothesis-driven questions about neural organization and representations, both within and across: space, time, frequency bands, neuroimaging modalities, individuals, and species. It uses a uniform data representation of fMRI data in the volume or on the surface, and of M/EEG data at the sensor and source level. Through various external toolboxes, it directly supports reading and writing a variety of fMRI and M/EEG neuroimaging formats, and, where applicable, can convert between them. As a result, it can be integrated readily in existing pipelines and used with existing preprocessed datasets. CoSMoMVPA overloads the traditional volumetric searchlight concept to support neighborhoods for M/EEG and surface-based fMRI data, which supports localization of multivariate effects of interest across space, time, and frequency dimensions. CoSMoMVPA also provides a generalized approach to multiple comparison correction across these dimensions using Threshold-Free Cluster Enhancement with state-of-the-art clustering and permutation techniques. CoSMoMVPA is highly modular and uses abstractions to provide a uniform interface for a variety of MVP measures. Typical analyses require a few lines of code, making it accessible to beginner users. At the same time, expert programmers can easily extend its functionality. Co

  17. Measures of precision for dissimilarity-based multivariate analysis of ecological communities.

    Science.gov (United States)

    Anderson, Marti J; Santana-Garcon, Julia

    2015-01-01

    Ecological studies require key decisions regarding the appropriate size and number of sampling units. No methods currently exist to measure precision for multivariate assemblage data when dissimilarity-based analyses are intended to follow. Here, we propose a pseudo multivariate dissimilarity-based standard error (MultSE) as a useful quantity for assessing sample-size adequacy in studies of ecological communities. Based on sums of squared dissimilarities, MultSE measures variability in the position of the centroid in the space of a chosen dissimilarity measure under repeated sampling for a given sample size. We describe a novel double resampling method to quantify uncertainty in MultSE values with increasing sample size. For more complex designs, values of MultSE can be calculated from the pseudo residual mean square of a permanova model, with the double resampling done within appropriate cells in the design. R code functions for implementing these techniques, along with ecological examples, are provided. © 2014 The Authors. Ecology Letters published by John Wiley & Sons Ltd and CNRS.
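
A rough sketch of the quantity described here, for a single group of n sampling units, is given below. The formula used (sum of squared pairwise dissimilarities divided by n, converted to a pseudo variance with n - 1, then to a standard error with n) is my reading of "based on sums of squared dissimilarities" and should be treated as an assumption to check against the paper; the double resampling step and the PERMANOVA-based version are not shown.

```python
import numpy as np

def mult_se(dissimilarity):
    """Pseudo multivariate standard error from a square dissimilarity matrix.

    Assumed definition (verify against the original paper):
        SS     = (1 / n) * sum over pairs i < j of d_ij ** 2
        V      = SS / (n - 1)
        MultSE = sqrt(V / n)
    """
    d = np.asarray(dissimilarity, dtype=float)
    n = d.shape[0]
    upper = np.triu_indices(n, k=1)      # each pair counted once
    ss = np.sum(d[upper] ** 2) / n
    pseudo_variance = ss / (n - 1)
    return np.sqrt(pseudo_variance / n)

# Sanity check on hypothetical univariate data: with Euclidean distance the
# result should equal the ordinary standard error of the mean.
x = np.array([1.0, 2.0, 3.0, 4.0, 5.0])
d = np.abs(x[:, None] - x[None, :])
print(mult_se(d), np.std(x, ddof=1) / np.sqrt(len(x)))
```

The univariate check works because, for Euclidean distances, the sum of squared pairwise distances divided by n equals the sum of squared deviations from the mean.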

  18. Leachate/domestic wastewater aerobic co-treatment: A pilot-scale study using multivariate analysis.

    Science.gov (United States)

    Ferraz, F M; Bruni, A T; Povinelli, J; Vieira, E M

    2016-01-15

    Multivariate analysis was used to identify the variables affecting the performance of pilot-scale activated sludge (AS) reactors treating old leachate from a landfill and from domestic wastewater. Raw leachate was pre-treated using air stripping to partially remove the total ammoniacal nitrogen (TAN). The control AS reactor (AS-0%) was loaded only with domestic wastewater, whereas the other reactor was loaded with mixtures containing leachate at volumetric ratios of 2 and 5%. The best removal efficiencies were obtained for a ratio of 2%, as follows: 70 ± 4% for total suspended solids (TSS), 70 ± 3% for soluble chemical oxygen demand (SCOD), 70 ± 4% for dissolved organic carbon (DOC), and 51 ± 9% for the leachate slowly biodegradable organic matter (SBOM). Fourier transform infrared (FTIR) spectroscopic analysis confirmed that most of the SBOM was removed by partial biodegradation rather than dilution or adsorption of organics in the sludge. Nitrification was approximately 80% in the AS-0% and AS-2% reactors. No significant accumulation of heavy metals was observed for any of the tested volumetric ratios. Principal component analysis (PCA) and partial least squares (PLS) indicated that the data dimension could be reduced and that TAN, SCOD, DOC and nitrification efficiency were the main variables that affected the performance of the AS reactors. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Compositional differences among Chinese soy sauce types studied by (13)C NMR spectroscopy coupled with multivariate statistical analysis.

    Science.gov (United States)

    Kamal, Ghulam Mustafa; Wang, Xiaohua; Bin Yuan; Wang, Jie; Sun, Peng; Zhang, Xu; Liu, Maili

    2016-09-01

    Soy sauce, a well-known seasoning all over the world, especially in Asia, is available on the global market in a wide range of types based on its purpose and processing methods. Its composition varies with the fermentation process and the addition of additives, preservatives and flavor enhancers. Comprehensive (1)H NMR based study of the metabonomic variations that differentiate the types of soy sauce available on the global market has been limited by the complexity of the mixture. In the present study, (13)C NMR spectroscopy coupled with multivariate statistical data analysis, such as principal component analysis (PCA) and orthogonal partial least squares-discriminant analysis (OPLS-DA), was applied to investigate metabonomic variations among different types of soy sauce, namely super light, super dark, red cooking and mushroom soy sauce. The main additives in soy sauce, such as glutamate, sucrose and glucose, were easily distinguished and quantified using (13)C NMR spectroscopy; they are otherwise difficult to assign and quantify because of serious signal overlaps in (1)H NMR spectra. The significantly higher concentration of sucrose in dark, red cooking and mushroom flavored soy sauce can be directly linked to the addition of caramel to soy sauce. Similarly, the significantly higher level of glutamate in super light compared with super dark and mushroom flavored soy sauce may come from the addition of monosodium glutamate. The study highlights the potential of (13)C NMR based metabonomics coupled with multivariate statistical data analysis for differentiating between types of soy sauce on the basis of the level of additives, raw materials and fermentation procedures. Copyright © 2016 Elsevier B.V. All rights reserved.

  20. Quantitative analysis and prediction of curvature in leucine-rich repeat proteins.

    Science.gov (United States)

    Hindle, K Lauren; Bella, Jordi; Lovell, Simon C

    2009-11-01

    Leucine-rich repeat (LRR) proteins form a large and diverse family. They have a wide range of functions most of which involve the formation of protein-protein interactions. All known LRR structures form curved solenoids, although there is large variation in their curvature. It is this curvature that determines the shape and dimensions of the inner space available for ligand binding. Unfortunately, large-scale parameters such as the overall curvature of a protein domain are extremely difficult to predict. Here, we present a quantitative analysis of determinants of curvature of this family. Individual repeats typically range in length between 20 and 30 residues and have a variety of secondary structures on their convex side. The observed curvature of the LRR domains correlates poorly with the lengths of their individual repeats. We have, therefore, developed a scoring function based on the secondary structure of the convex side of the protein that allows prediction of the overall curvature with a high degree of accuracy. We also demonstrate the effectiveness of this method in selecting a suitable template for comparative modeling. We have developed an automated, quantitative protocol that can be used to predict accurately the curvature of leucine-rich repeat proteins of unknown structure from sequence alone. This protocol is available as an online resource at http://www.bioinf.manchester.ac.uk/curlrr/.

  1. Differences in chewing sounds of dry-crisp snacks by multivariate data analysis

    Science.gov (United States)

    De Belie, N.; Sivertsvik, M.; De Baerdemaeker, J.

    2003-09-01

    Chewing sounds of different types of dry-crisp snacks (two types of potato chips, prawn crackers, cornflakes and low calorie snacks from extruded starch) were analysed to assess differences in sound emission patterns. The emitted sounds were recorded by a microphone placed over the ear canal. The first bite and the first subsequent chew were selected from the time signal and a fast Fourier transformation provided the power spectra. Different multivariate analysis techniques were used for classification of the snack groups. This included principal component analysis (PCA) and unfold partial least-squares (PLS) algorithms, as well as multi-way techniques such as three-way PLS, three-way PCA (Tucker3), and parallel factor analysis (PARAFAC) on the first bite and subsequent chew. The models were evaluated by calculating the classification errors and the root mean square error of prediction (RMSEP) for independent validation sets. It appeared that the logarithm of the power spectra obtained from the chewing sounds could be used successfully to distinguish the different snack groups. When different chewers were used, recalibration of the models was necessary. Multi-way models distinguished better between chewing sounds of different snack groups than PCA on bite or chew separately and than unfold PLS. From all three-way models applied, N-PLS with three components showed the best classification capabilities, resulting in classification errors of 14-18%. The major amount of incorrect classifications was due to one type of potato chips that had a very irregular shape, resulting in a wide variation of the emitted sounds.
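
The preprocessing step mentioned in this abstract (a fast Fourier transform followed by the logarithm of the power spectrum) can be sketched as below. This is an assumption-laden illustration: the sampling rate, the Hann window, the small constant added before the logarithm, and the synthetic tone used in place of a recorded bite are all hypothetical choices, and the multi-way models themselves are not shown.

```python
import numpy as np

def log_power_spectrum(signal, sample_rate):
    """Return frequencies (Hz) and log10 power of a real-valued signal."""
    signal = np.asarray(signal, dtype=float)
    # Hann window to reduce spectral leakage (an assumption, not stated
    # in the abstract).
    windowed = signal * np.hanning(len(signal))
    spectrum = np.fft.rfft(windowed)
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    power = np.abs(spectrum) ** 2
    # Small constant avoids log10(0) for empty bins.
    return freqs, np.log10(power + 1e-12)

# Hypothetical stand-in for a recorded bite: a 1 kHz tone sampled at 8 kHz.
sample_rate = 8000
t = np.arange(1024) / sample_rate
tone = np.sin(2 * np.pi * 1000 * t)

freqs, log_power = log_power_spectrum(tone, sample_rate)
peak_hz = freqs[np.argmax(log_power)]
print(peak_hz)
```

Stacking one such log-spectrum per recording as the rows of a matrix gives the kind of table that PCA or PLS would then take as input.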

  2. Intensive removal of signal crayfish (Pacifastacus leniusculus) from rivers increases numbers and taxon richness of macroinvertebrate species.

    Science.gov (United States)

    Moorhouse, Tom P; Poole, Alison E; Evans, Laura C; Bradley, David C; Macdonald, David W

    2014-02-01

    Invasive species are a major cause of species extinction in freshwater ecosystems, and crayfish species are particularly pervasive. The invasive American signal crayfish Pacifastacus leniusculus has impacts over a range of trophic levels, but particularly on benthic aquatic macroinvertebrates. Our study examined the effect on the macroinvertebrate community of removal trapping of signal crayfish from UK rivers. Crayfish were intensively trapped and removed from two tributaries of the River Thames to test the hypothesis that lowering signal crayfish densities would result in increases in macroinvertebrate numbers and taxon richness. We removed 6181 crayfish over four sessions, resulting in crayfish densities that decreased toward the center of the removal sections. Conversely in control sections (where crayfish were trapped and returned), crayfish density increased toward the center of the section. Macroinvertebrate numbers and taxon richness were inversely correlated with crayfish densities. Multivariate analysis of the abundance of each taxon yielded similar results and indicated that crayfish removals had positive impacts on macroinvertebrate numbers and taxon richness but did not alter the composition of the wider macroinvertebrate community. Synthesis and applications: Our results demonstrate that non-eradication-oriented crayfish removal programmes may lead to increases in the total number of macroinvertebrates living in the benthos. This represents the first evidence that removing signal crayfish from riparian systems, at intensities feasible during control attempts or commercial crayfishing, may be beneficial for a range of sympatric aquatic macroinvertebrates.

  3. Multivariate statistical treatment of PIXE analysis of some traditional Chinese medicines

    International Nuclear Information System (INIS)

    Xiaofeng Zhang; Jianguo Ma; Junfa Qin; Lun Xiao

    1991-01-01

    Elements in two kinds of 30 traditional Chinese medicines were analyzed by PIXE method, and the data were treated by multivariate statistical methods. The results show that these two kinds of traditional Chinese medicines are almost separable according to their elemental contents. The results are congruous with the traditional Chinese medicine practice. (author) 7 refs.; 2 figs.; 2 tabs

  4. Factors that impact the outcome of endoscopic correction of vesicoureteral reflux: a multivariate analysis.

    Science.gov (United States)

    Kajbafzadeh, Abdol-Mohammad; Tourchi, Ali; Aryan, Zahra

    2013-02-01

    To identify independent factors that may predict vesicoureteral reflux (VUR) resolution after endoscopic treatment using dextranomer/hyaluronic acid copolymer (Deflux) in children free of anatomical anomalies. A retrospective study was conducted in our pediatric referral center from 1998 to 2011 on children with primary VUR who underwent endoscopic injection of Deflux with or without concomitant autologous blood injection (called HABIT or HIT, respectively). Children with secondary VUR or incomplete records were excluded from the study. Potential factors were divided into three categories: preoperative, intraoperative and postoperative. Success was defined as no sign of VUR on postoperative voiding cystourethrogram. Univariate and multivariate logistic regression models were constructed to identify independent factors that may predict success. Odds ratio (OR) and 95% confidence interval (95% CI) for prediction of success were estimated for each factor. Of 485 children who received Deflux injection, a total of 372 with a mean age of 3.10 years (range: 6 months to 12 years) were included in the study, and endoscopic management was successful in 322 (86.6%) of them. Of the patients, 185 (49.7%) underwent the HIT and 187 (50.3%) the HABIT technique. On univariate analysis, VUR grade from the preoperative category (OR = 4.79, 95% CI = 2.22-10.30, p = 0.000), operation technique (OR = 0.33, 95% CI = 0.17-0.64, p = 0.001) and presence of a mound on postoperative sonography (OR = 0.06, 95% CI = 0.02-0.16, p = 0.000) were associated with success. On multivariate analysis, preoperative VUR grade (OR = 4.85, 95% CI = 2.49-8.96, p = 0.000) and identification of a mound on postoperative sonography (OR = 0.07, 95% CI = 0.01-0.18, p = 0.000) remained as independent success predictors.
Based on this study, successful VUR correction after the endoscopic injection of Deflux can be predicted with respect to preoperative VUR grade and presence of mound after operation.
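
    The odds ratios and 95 % confidence intervals quoted above come from logistic regression models. As a minimal sketch of what such a figure means, the unadjusted (univariate) odds ratio for a single binary factor can be computed from a 2x2 table with a Wald interval on the log scale. The function name and the counts below are hypothetical and are not taken from the study; a multivariate model would adjust each odds ratio for the other factors, which this sketch does not do.

```python
import math

def odds_ratio_ci(a, b, c, d, z=1.96):
    """Unadjusted odds ratio and Wald confidence interval for a 2x2 table.

    a: factor present, success     b: factor present, failure
    c: factor absent, success      d: factor absent, failure
    """
    odds_ratio = (a * d) / (b * c)
    # Standard error of log(OR) under the usual large-sample approximation.
    se_log_or = math.sqrt(1 / a + 1 / b + 1 / c + 1 / d)
    lower = math.exp(math.log(odds_ratio) - z * se_log_or)
    upper = math.exp(math.log(odds_ratio) + z * se_log_or)
    return odds_ratio, lower, upper

# Hypothetical counts for illustration only.
or_value, ci_low, ci_high = odds_ratio_ci(40, 10, 20, 30)
print(f"OR = {or_value:.2f}, 95% CI = {ci_low:.2f}-{ci_high:.2f}")
```

    An interval that excludes 1 corresponds to a factor that is associated with the outcome at roughly the 5 % level.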

  5. Multivariate Analysis for Quantification of Plutonium(IV) in Nitric Acid Based on Absorption Spectra

    Energy Technology Data Exchange (ETDEWEB)

    Lines, Amanda M. [Energy and Environment Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States; Adami, Susan R. [Energy and Environment Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States; Sinkov, Sergey I. [Energy and Environment Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States; Lumetta, Gregg J. [Energy and Environment Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States; Bryan, Samuel A. [Energy and Environment Directorate, Pacific Northwest National Laboratory, Richland, Washington 99352, United States

    2017-08-09

    Development of more effective, reliable, and fast methods for monitoring process streams is a growing opportunity for analytical applications. Many fields can benefit from on-line monitoring, including the nuclear fuel cycle where improved methods for monitoring radioactive materials will facilitate maintenance of proper safeguards and ensure safe and efficient processing of materials. On-line process monitoring with a focus on optical spectroscopy can provide a fast, non-destructive method for monitoring chemical species. However, identification and quantification of species can be hindered by the complexity of the solutions if bands overlap or show condition-dependent spectral features. Plutonium (IV) is one example of a species which displays significant spectral variation with changing nitric acid concentration. Single variate analysis (i.e. Beer’s Law) is difficult to apply to the quantification of Pu(IV) unless the nitric acid concentration is known and separate calibration curves have been made for all possible acid strengths. Multivariate, or chemometric, analysis is an approach that allows for the accurate quantification of Pu(IV) without a priori knowledge of nitric acid concentration.
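
    To make the contrast with the single-variate approach concrete, the sketch below applies Beer's law, A = ε·l·c, at several wavelengths and solves for the concentration by least squares. It assumes the molar absorptivities are fixed and known, which is exactly the assumption the abstract says breaks down when the nitric acid concentration changes. The function name and all numbers are hypothetical; this is not the chemometric model used by the authors.

```python
def beer_lambert_concentration(absorbances, absorptivities, path_length_cm=1.0):
    """Least-squares concentration from A_i = eps_i * l * c over several wavelengths."""
    numerator = sum(eps * a for eps, a in zip(absorptivities, absorbances))
    denominator = path_length_cm * sum(eps ** 2 for eps in absorptivities)
    return numerator / denominator

# Hypothetical molar absorptivities (L mol^-1 cm^-1) at three wavelengths.
epsilon = [50.0, 70.0, 20.0]
true_concentration = 0.01  # mol/L, illustrative only

# Simulate a noise-free spectrum for a 1 cm path length.
spectrum = [eps * 1.0 * true_concentration for eps in epsilon]

estimate = beer_lambert_concentration(spectrum, epsilon)
print(f"estimated concentration: {estimate:.4f} mol/L")
```

    If the absorptivities drift with acid strength, the same spectrum would map to a different concentration, which is why a separate calibration per acid strength would be needed under this approach.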

  6. Determination of volatile organic compounds pollution sources in Malaysian drinking water using multivariate analysis.

    Science.gov (United States)

    Soh, Shiau-Chian; Abdullah, Md Pauzi

    2007-01-01

    A field investigation was conducted at all water treatment plants throughout 11 states and Federal Territory in Peninsular Malaysia. The sampling points in this study include treatment plant operation, service reservoir outlet and auxiliary outlet point at the water pipelines. Analysis was performed by solid phase micro-extraction technique with a 100 µm polydimethylsiloxane fibre using gas chromatography with mass spectrometry detection to analyse 54 volatile organic compounds (VOCs) of different chemical families in drinking water. The concentration of VOCs ranged from undetectable to 230.2 µg/L. Among all of the VOC species, chloroform had the highest concentration and was detected in all drinking water samples. Average concentrations of total trihalomethanes (THMs) were similar among all states, in the range of 28.4–33.0 µg/L. Apart from THMs, other abundant compounds detected were cis- and trans-1,2-dichloroethylene, trichloroethylene, 1,2-dibromoethane, benzene, toluene, ethylbenzene, chlorobenzene, 1,4-dichlorobenzene and 1,2-dichlorobenzene. Principal component analysis (PCA) with the aid of varimax rotation, and parallel factor analysis (PARAFAC) method were used to statistically verify the correlation between VOCs and the source of pollution. The multivariate analysis pointed out that the maintenance of auxiliary pipelines in the distribution systems is vital as they can become a significant point source of pollution of Malaysian drinking water.

  7. Multivariate stochastic simulation with subjective multivariate normal distributions

    Science.gov (United States)

    P. J. Ince; J. Buongiorno

    1991-01-01

    In many applications of Monte Carlo simulation in forestry or forest products, it may be known that some variables are correlated. However, for simplicity, in most simulations it has been assumed that random variables are independently distributed. This report describes an alternative Monte Carlo simulation technique for subjectively assessed multivariate normal...
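
    The general idea of drawing correlated rather than independent normal variables can be sketched for two variables with a Cholesky-style construction. This is a generic illustration, not the procedure of the report: the function names, the means, standard deviations, and correlation are all hypothetical stand-ins for subjectively assessed values.

```python
import math
import random

def correlated_normal_pair(mean_x, mean_y, sd_x, sd_y, rho, rng):
    """Draw one (x, y) pair from a bivariate normal with correlation rho."""
    z1 = rng.gauss(0.0, 1.0)
    z2 = rng.gauss(0.0, 1.0)
    x = mean_x + sd_x * z1
    # Mixing z1 into y induces the requested correlation with x.
    y = mean_y + sd_y * (rho * z1 + math.sqrt(1.0 - rho ** 2) * z2)
    return x, y

def sample_correlation(pairs):
    """Pearson correlation of a list of (x, y) pairs."""
    n = len(pairs)
    mx = sum(x for x, _ in pairs) / n
    my = sum(y for _, y in pairs) / n
    sxy = sum((x - mx) * (y - my) for x, y in pairs)
    sxx = sum((x - mx) ** 2 for x, _ in pairs)
    syy = sum((y - my) ** 2 for _, y in pairs)
    return sxy / math.sqrt(sxx * syy)

rng = random.Random(42)  # fixed seed so the run is repeatable
draws = [correlated_normal_pair(100.0, 50.0, 10.0, 5.0, 0.7, rng)
         for _ in range(20000)]
r = sample_correlation(draws)
print(f"sample correlation: {r:.3f}")
```

    Setting rho to zero recovers the independence assumption that the abstract describes as the common simplification.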

  8. Model Checking Multivariate State Rewards

    DEFF Research Database (Denmark)

    Nielsen, Bo Friis; Nielson, Flemming; Nielson, Hanne Riis

    2010-01-01

    We consider continuous stochastic logics with state rewards that are interpreted over continuous time Markov chains. We show how results from multivariate phase type distributions can be used to obtain higher-order moments for multivariate state rewards (including covariance). We also generalise...

  9. Social Cognitive and Planned Behavior Variables Associated with Stages of Change for Physical Activity in Spinal Cord Injury: A Multivariate Analysis

    Science.gov (United States)

    Keegan, John; Ditchman, Nicole; Dutta, Alo; Chiu, Chung-Yi; Muller, Veronica; Chan, Fong; Kundu, Madan

    2016-01-01

    Purpose: To apply the constructs of social cognitive theory (SCT) and the theory of planned behavior (TPB) to understand the stages of change (SOC) for physical activities among individuals with a spinal cord injury (SCI). Method: Ex post facto design using multivariate analysis of variance (MANOVA). The participants were 144 individuals with SCI…

  10. Impact of Secreted Protein Acidic and Rich in Cysteine (SPARC) Expression on Prognosis After Surgical Resection for Biliary Carcinoma.

    Science.gov (United States)

    Toyota, Kazuhiro; Murakami, Yoshiaki; Kondo, Naru; Uemura, Kenichiro; Nakagawa, Naoya; Takahashi, Shinya; Sueda, Taijiro

    2017-06-01

    Secreted protein acidic and rich in cysteine (SPARC) is a matricellular protein that influences chemotherapy effectiveness and prognosis. The aim of this study was to investigate whether SPARC expression correlates with the postoperative survival of patients treated with surgical resection for biliary carcinoma. SPARC expression in resected biliary carcinoma specimens was investigated immunohistochemically in 175 patients. The relationship between SPARC expression and prognosis after surgery was evaluated using univariate and multivariate analyses. High SPARC expression in peritumoral stroma was found in 61 (35%) patients. In all patients, stromal SPARC expression was significantly associated with overall survival (OS) (P = 0.006). Multivariate analysis revealed that high stromal SPARC expression was an independent risk factor for poor OS (HR 1.81, P = 0.006). Moreover, high stromal SPARC expression was independently associated with poor prognosis in a subset of 118 patients treated with gemcitabine-based adjuvant chemotherapy (HR 2.04, P = 0.010) but not in the 57 patients who did not receive adjuvant chemotherapy (P = 0.21). Stromal SPARC expression correlated with the prognosis of patients with resectable biliary carcinoma, and its significance was enhanced in patients treated with adjuvant gemcitabine-based chemotherapy.

  11. Multivariate analysis of correlation between electrophysiological and hemodynamic responses during cognitive processing

    Science.gov (United States)

    Kujala, Jan; Sudre, Gustavo; Vartiainen, Johanna; Liljeström, Mia; Mitchell, Tom; Salmelin, Riitta

    2014-01-01

    Animal and human studies have frequently shown that in primary sensory and motor regions the BOLD signal correlates positively with high-frequency and negatively with low-frequency neuronal activity. However, recent evidence suggests that this relationship may also vary across cortical areas. Detailed knowledge of the possible spectral diversity between electrophysiological and hemodynamic responses across the human cortex would be essential for neural-level interpretation of fMRI data and for informative multimodal combination of electromagnetic and hemodynamic imaging data, especially in cognitive tasks. We applied multivariate partial least squares correlation analysis to MEG–fMRI data recorded in a reading paradigm to determine the correlation patterns between the data types, at once, across the cortex. Our results revealed heterogeneous patterns of high-frequency correlation between MEG and fMRI responses, with marked dissociation between lower and higher order cortical regions. The low-frequency range showed substantial variance, with negative and positive correlations manifesting at different frequencies across cortical regions. These findings demonstrate the complexity of the neurophysiological counterparts of hemodynamic fluctuations in cognitive processing. PMID:24518260

  12. Hierarchy of temporal responses of multivariate self-excited epidemic processes

    Science.gov (United States)

    Saichev, Alexander; Maillart, Thomas; Sornette, Didier

    2013-04-01

    Many natural and social systems are characterized by bursty dynamics, for which past events trigger future activity. These systems can be modelled by so-called self-excited Hawkes conditional Poisson processes. It is generally assumed that all events have similar triggering abilities. However, some systems exhibit heterogeneity and clusters with possibly different intra- and inter-triggering, which can be accounted for by generalization into the "multivariate" self-excited Hawkes conditional Poisson processes. We develop the general formalism of the multivariate moment generating function for the cumulative number of first-generation and of all generation events triggered by a given mother event (the "shock") as a function of the current time t. This corresponds to studying the response function of the process. A variety of different systems have been analyzed. In particular, for systems in which triggering between events of different types proceeds through a one-dimension directed or symmetric chain of influence in type space, we report a novel hierarchy of intermediate asymptotic power law decays $\sim 1/t^{1-(m+1)\theta}$ of the rate of triggered events as a function of the distance m of the events to the initial shock in the type space, where 0 < θ < 1 for the relevant long-memory processes characterizing many natural and social systems. The richness of the generated time dynamics comes from the cascades of intermediate events of possibly different kinds, unfolding via random changes of types genealogy.

  13. Comprehensive analysis of Polygoni Multiflori Radix of different geographical origins using ultra-high-performance liquid chromatography fingerprints and multivariate chemometric methods

    Directory of Open Access Journals (Sweden)

    Li-Li Sun

    2018-01-01

    Polygoni Multiflori Radix (PMR) is increasingly being used not just as a traditional herbal medicine but also as a popular functional food. In this study, multivariate chemometric methods and mass spectrometry were combined to analyze the ultra-high-performance liquid chromatograph (UPLC) fingerprints of PMR from six different geographical origins. A chemometric strategy based on multivariate curve resolution–alternating least squares (MCR–ALS) and three classification methods is proposed to analyze the UPLC fingerprints obtained. Common chromatographic problems, including the background contribution, baseline contribution, and peak overlap, were handled by the established MCR–ALS model. A total of 22 components were resolved. Moreover, relative species concentrations were obtained from the MCR–ALS model, which was used for multivariate classification analysis. Principal component analysis (PCA) and Ward's method have been applied to classify 72 PMR samples from six different geographical regions. The PCA score plot showed that the PMR samples fell into four clusters, which related to the geographical location and climate of the source areas. The results were then corroborated by Ward's method. In addition, according to the variance-weighted distance between cluster centers obtained from Ward's method, five components were identified as the most significant variables (chemical markers) for cluster discrimination. A counter-propagation artificial neural network has been applied to confirm and predict the effects of chemical markers on different samples. Finally, the five chemical markers were identified by UPLC–quadrupole time-of-flight mass spectrometer. Components 3, 12, 16, 18, and 19 were identified as 2,3,5,4′-tetrahydroxy-stilbene-2-O-β-d-glucoside, emodin-8-O-β-d-glucopyranoside, emodin-8-O-(6′-O-acetyl)-β-d-glucopyranoside, emodin, and physcion, respectively. In conclusion, the proposed method can be applied for the

  14. Floral diversity increases beneficial arthropod richness and decreases variability in arthropod community composition.

    Science.gov (United States)

    Bennett, Ashley B; Gratton, Claudio

    2013-01-01

    Declines in species diversity resulting from anthropogenic alterations of the environment heighten the need to develop management strategies that conserve species and ecosystem services. This study examined how native plant species and their diversity influence the abundance and richness of beneficial arthropods, a functionally important group that provides ecosystem services such as pollination and natural pest suppression. Beneficial arthropods were sampled in replicated study plots containing native perennials planted in one-, two-, and seven-species mixtures. We found plant diversity had a positive impact on arthropod richness but not on arthropod abundance. An analysis of arthropod community composition revealed that each flower species attracted a different assemblage of beneficial arthropods. In addition, the full seven-species mixture also attracted a distinct arthropod community compared to single-species monocultures. Using a multivariate approach, we determined whether arthropod assemblages in two- and seven-species plots were additive and could be predicted based on assemblages from their component single-species plots. On average, assemblages in diverse plots were nonadditive when compared to assemblages predicted using single-species plots. Arthropod assemblages in two-species plots most closely resembled those of only one of the flower species in the mixture. However, the arthropod assemblages in seven-species plots, although statistically deviating from the expectation of an additive model, more closely resembled predicted communities compared to the assemblages found in two-species plots, suggesting that variability in arthropod community composition decreased as planting diversity increased. 
Our study demonstrates that careful selection of plants in managed landscapes can augment beneficial arthropod richness and support a more predictable arthropod community, suggesting that planning and design efforts could shape arthropod assemblages in natural

  15. Determination of boiling point of petrochemicals by gas chromatography-mass spectrometry and multivariate regression analysis of structural activity relationship.

    Science.gov (United States)

    Fakayode, Sayo O; Mitchell, Breanna S; Pollard, David A

    2014-08-01

    Accurate understanding of analyte boiling points (BP) is of critical importance in gas chromatographic (GC) separation and crude oil refinery operation in petrochemical industries. This study reported the first combined use of GC separation and partial-least-square (PLS1) multivariate regression analysis (MRA) of petrochemical structural activity relationship (SAR) for accurate BP determination of two commercially available (D3710 and MA VHP) calibration gas mix samples. The results of the BP determination using PLS1 multivariate regression were further compared with the results of the traditional simulated distillation method of BP determination. The developed PLS1 regression was able to correctly predict analyte BPs in D3710 and MA VHP calibration gas mix samples, with a root-mean-square-%-relative-error (RMS%RE) of 6.4% and 10.8%, respectively. In contrast, the overall RMS%RE values of 32.9% and 40.4%, respectively, obtained for BP determination in D3710 and MA VHP using a traditional simulated distillation method were approximately four times larger than the corresponding RMS%RE of BP prediction using MRA, demonstrating the better predictive ability of MRA. The reported method is rapid, robust, and promising, and can be potentially used routinely for fast analysis, pattern recognition, and analyte BP determination in petrochemical industries. Copyright © 2014 Elsevier B.V. All rights reserved.
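
    The abstract does not spell out how RMS%RE is computed. A common reading, assumed here, is the square root of the mean squared percent relative error between predicted and reference values. The sketch below follows that assumption; the function name and the boiling points are hypothetical and are not data from the study.

```python
import math

def rms_percent_relative_error(predicted, reference):
    """Root-mean-square of the percent relative errors (assumed definition)."""
    if len(predicted) != len(reference) or not reference:
        raise ValueError("inputs must be non-empty and of equal length")
    squared = [(100.0 * (p - r) / r) ** 2 for p, r in zip(predicted, reference)]
    return math.sqrt(sum(squared) / len(squared))

# Hypothetical boiling points in degrees Celsius, for illustration only.
reference_bp = [36.1, 68.7, 98.4, 125.7]
predicted_bp = [38.0, 66.0, 101.0, 120.0]

error = rms_percent_relative_error(predicted_bp, reference_bp)
print(f"RMS%RE = {error:.1f}%")
```

    Because each error is scaled by its reference value, low-boiling analytes contribute as much to the figure as high-boiling ones for the same relative miss.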

  16. Control-group feature normalization for multivariate pattern analysis of structural MRI data using the support vector machine.

    Science.gov (United States)

    Linn, Kristin A; Gaonkar, Bilwaj; Satterthwaite, Theodore D; Doshi, Jimit; Davatzikos, Christos; Shinohara, Russell T

    2016-05-15

    Normalization of feature vector values is a common practice in machine learning. Generally, each feature value is standardized to the unit hypercube or by normalizing to zero mean and unit variance. Classification decisions based on support vector machines (SVMs) or by other methods are sensitive to the specific normalization used on the features. In the context of multivariate pattern analysis using neuroimaging data, standardization effectively up- and down-weights features based on their individual variability. Since the standard approach uses the entire data set to guide the normalization, it utilizes the total variability of these features. This total variation is inevitably dependent on the amount of marginal separation between groups. Thus, such a normalization may attenuate the separability of the data in high dimensional space. In this work we propose an alternate approach that uses an estimate of the control-group standard deviation to normalize features before training. We study our proposed approach in the context of group classification using structural MRI data. We show that control-based normalization leads to better reproducibility of estimated multivariate disease patterns and improves the classifier performance in many cases. Copyright © 2016 Elsevier Inc. All rights reserved.

  17. Experimental analysis of multivariate female choice in gray treefrogs (Hyla versicolor): evidence for directional and stabilizing selection.

    Science.gov (United States)

    Gerhardt, H Carl; Brooks, Robert

    2009-10-01

    Even simple biological signals vary in several measurable dimensions. Understanding their evolution requires, therefore, a multivariate understanding of selection, including how different properties interact to determine the effectiveness of the signal. We combined experimental manipulation with multivariate selection analysis to assess female mate choice on the simple trilled calls of male gray treefrogs. We independently and randomly varied five behaviorally relevant acoustic properties in 154 synthetic calls. We compared response times of each of 154 females to one of these calls with its response to a standard call that had mean values of the five properties. We found directional and quadratic selection on two properties indicative of the amount of signaling, pulse number, and call rate. Canonical rotation of the fitness surface showed that these properties, along with pulse rate, contributed heavily to a major axis of stabilizing selection, a result consistent with univariate studies showing diminishing effects of increasing pulse number well beyond the mean. Spectral properties contributed to a second major axis of stabilizing selection. The single major axis of disruptive selection suggested that a combination of two temporal and two spectral properties with values differing from the mean should be especially attractive.

  18. Detecting phase separation of freeze-dried binary amorphous systems using pair-wise distribution function and multivariate data analysis

    DEFF Research Database (Denmark)

    Chieng, Norman; Trnka, Hjalte; Boetker, Johan

    2013-01-01

    The purpose of this study is to investigate the use of multivariate data analysis for powder X-ray diffraction-pair-wise distribution function (PXRD-PDF) data to detect phase separation in freeze-dried binary amorphous systems. Polymer-polymer and polymer-sugar binary systems at various ratios were...... freeze-dried. All samples were analyzed by PXRD, transformed to PDF and analyzed by principal component analysis (PCA). These results were validated by differential scanning calorimetry (DSC) through characterization of glass transition of the maximally freeze-concentrate solute (Tg'). Analysis of PXRD......-PDF data using PCA provides a more clear 'miscible' or 'phase separated' interpretation through the distribution pattern of samples on a score plot presentation compared to residual plot method. In a phase separated system, samples were found to be evenly distributed around the theoretical PDF profile...

  19. Modelling lecturer performance index of private university in Tulungagung by using survival analysis with multivariate adaptive regression spline

    Science.gov (United States)

    Hasyim, M.; Prastyo, D. D.

    2018-03-01

    Survival analysis models the relationship between independent variables and survival time as the dependent variable. In practice, not all survival data can be recorded completely, for various reasons; such data are called censored data. Moreover, several models for survival analysis require assumptions. One approach to survival analysis is nonparametric, which relaxes these assumptions. In this research, the nonparametric approach employed is multivariate adaptive regression splines (MARS). This study aims to measure the performance of private university lecturers. The survival time in this study is the duration needed by a lecturer to obtain their professional certificate. The results show that research activity is a significant factor, along with developing course materials, good publication in international or national journals, and activity in research collaboration.
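
    To show how censored observations enter a nonparametric survival estimate, the sketch below uses the Kaplan-Meier estimator. This is a different and simpler technique than the MARS model of the study, swapped in purely for illustration; the function name and the durations are hypothetical.

```python
def kaplan_meier(durations, observed):
    """Return [(time, survival probability)] at each time with an observed event.

    durations: follow-up time per subject.
    observed: True if the event occurred, False if the subject was censored.
    """
    event_times = sorted({t for t, e in zip(durations, observed) if e})
    survival = 1.0
    curve = []
    for t in event_times:
        # Censored subjects still count as at risk up to their last follow-up.
        at_risk = sum(1 for d in durations if d >= t)
        events = sum(1 for d, e in zip(durations, observed) if d == t and e)
        survival *= 1.0 - events / at_risk
        curve.append((t, survival))
    return curve

# Hypothetical years until certification; False marks a censored lecturer.
durations = [2, 3, 3, 5, 6, 8]
observed = [True, True, False, True, False, True]

curve = kaplan_meier(durations, observed)
for time, probability in curve:
    print(f"t = {time}: S(t) = {probability:.3f}")
```

    Here "survival" means not yet certified, so a censored lecturer contributes to the at-risk count without ever contributing an event.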

  20. Determination of geographic provenance of cotton fibres using multi-isotope profiles and multivariate statistical analysis

    Science.gov (United States)

    Daeid, N. Nic; Meier-Augenstein, W.; Kemp, H. F.

    2012-04-01

    The analysis of cotton fibres can be particularly challenging within a forensic science context where discrimination of one fibre from another is of importance. Normally cotton fibre analysis examines the morphological structure of the recovered material and compares this with that of a known fibre from a particular source of interest. However, the conventional microscopic and chemical analysis of fibres and any associated dyes is generally unsuccessful because of the similar morphology of the fibres. Analysis of the dyes which may have been applied to the cotton fibre can also be undertaken though this can be difficult and unproductive in terms of discriminating one fibre from another. In the study presented here we have explored the potential for Isotope Ratio Mass Spectrometry (IRMS) to be utilised as an additional tool for cotton fibre analysis in an attempt to reveal further discriminatory information. This work has concentrated on un-dyed cotton fibres of known origin in order to expose the potential of the analytical technique. We report the results of a pilot study aimed at testing the hypothesis that multi-element stable isotope analysis of cotton fibres in conjunction with multivariate statistical analysis of the resulting isotopic abundance data using well established chemometric techniques permits sample provenancing based on the determination of where the cotton was grown and as such will facilitate sample discrimination. To date there is no recorded literature of this type of application of IRMS to cotton samples, which may be of forensic science relevance.

  1. Functional diversity supports the physiological tolerance hypothesis for plant species richness along climatic gradients

    Science.gov (United States)

    Spasojevic, Marko J.; Grace, James B.; Harrison, Susan; Damschen, Ellen Ingman

    2013-01-01

    1. The physiological tolerance hypothesis proposes that plant species richness is highest in warm and/or wet climates because a wider range of functional strategies can persist under such conditions. Functional diversity metrics, combined with statistical modeling, offer new ways to test whether diversity-environment relationships are consistent with this hypothesis. 2. In a classic study by R. H. Whittaker (1960), herb species richness declined from mesic (cool, moist, northerly) slopes to xeric (hot, dry, southerly) slopes. Building on this dataset, we measured four plant functional traits (plant height, specific leaf area, leaf water content and foliar C:N) and used them to calculate three functional diversity metrics (functional richness, evenness, and dispersion). We then used a structural equation model to ask if ‘functional diversity’ (modeled as the joint responses of richness, evenness, and dispersion) could explain the observed relationship of topographic climate gradients to species richness. We then repeated our model examining the functional diversity of each of the four traits individually. 3. Consistent with the physiological tolerance hypothesis, we found that functional diversity was higher in more favorable climatic conditions (mesic slopes), and that multivariate functional diversity mediated the relationship of the topographic climate gradient to plant species richness. We found similar patterns for models focusing on individual trait functional diversity of leaf water content and foliar C:N. 4. Synthesis. Our results provide trait-based support for the physiological tolerance hypothesis, suggesting that benign climates support more species because they allow for a wider range of functional strategies.
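
    Of the three metrics named above, functional dispersion is the simplest to illustrate: in its unweighted form it is the mean distance of species from the community centroid in trait space. The sketch below makes that simplification explicit (no abundance weighting, traits assumed already standardized); the function name and trait values are hypothetical and do not reproduce the study's calculation.

```python
import math

def functional_dispersion(traits):
    """Mean Euclidean distance of species to the unweighted trait centroid.

    traits: list of rows, one row of standardized trait values per species.
    """
    n_species = len(traits)
    n_traits = len(traits[0])
    centroid = [sum(species[j] for species in traits) / n_species
                for j in range(n_traits)]
    distances = [
        math.sqrt(sum((species[j] - centroid[j]) ** 2 for j in range(n_traits)))
        for species in traits
    ]
    return sum(distances) / n_species

# Hypothetical standardized values of two traits for four species.
community = [[0.0, 0.0], [2.0, 0.0], [0.0, 2.0], [2.0, 2.0]]
dispersion = functional_dispersion(community)
print(f"functional dispersion: {dispersion:.3f}")
```

    A community whose species share similar trait values would give a dispersion near zero, which matches the idea that harsher conditions permit a narrower range of functional strategies.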

  2. Combining microwave resonance technology to multivariate data analysis as a novel PAT tool to improve process understanding in fluid bed granulation.

    Science.gov (United States)

    Lourenço, Vera; Herdling, Thorsten; Reich, Gabriele; Menezes, José C; Lochmann, Dirk

    2011-08-01

    A set of 192 fluid bed granulation batches at industrial scale were monitored in-line using microwave resonance technology (MRT) to determine moisture, temperature and density of the granules. Multivariate data analysis techniques such as multiway partial least squares (PLS), multiway principal component analysis (PCA) and multivariate batch control charts were applied to the collected batch data sets. The combination of all these techniques, along with off-line particle size measurements, led to significantly increased process understanding. A seasonality effect was identified that affected further processing through its influence on the final granule size. Moreover, it was demonstrated by means of a PLS model that a relation between the particle size and the MRT measurements can be quantitatively defined, highlighting a potential ability of the MRT sensor to predict information about the final granule size. This study has contributed to improving a fluid bed granulation process, and the process knowledge obtained shows that product quality can be built into process design, following Quality by Design (QbD) and Process Analytical Technology (PAT) principles. Copyright © 2011. Published by Elsevier B.V.

  3. Geostatistics and multivariate analysis as a tool to characterize volcaniclastic deposits: Application to Nevado de Toluca volcano, Mexico

    Science.gov (United States)

    Bellotti, F.; Capra, L.; Sarocchi, D.; D'Antonio, M.

    2010-03-01

    Grain size analysis of volcaniclastic deposits is mainly used to study flow transport and depositional processes, in most cases by comparing some statistical parameters and how they change with distance from the source. In this work, geospatial and multivariate analyses are presented as a robust, adaptable geostatistical tool applied to volcaniclastic deposits in order to provide an effective and relatively simple methodology for texture description, deposit discrimination and interpretation of depositional processes. We chose the case of Nevado de Toluca volcano (Mexico) because of existing knowledge of its geological evolution, stratigraphic succession and spatial distribution of volcaniclastic units. Grain size analyses and frequency distribution curves have been carried out to characterize and compare the 28-ka block-and-ash flow deposit associated with a dome destruction episode, and the El Morral debris avalanche deposit, which originated from the collapse of the south-eastern sector of the volcano. The geostatistical interpolation of sedimentological data makes it possible to produce two-dimensional maps draped over the volcano topography, showing the granulometric distribution, sorting and fine material concentration across the whole deposit with respect to topographic changes. In this way, it is possible to analyze a continuous surface of the grain size distribution of volcaniclastic deposits and better understand flow transport processes. The application of multivariate statistical analysis (discriminant function) indicates that this methodology could be useful in discriminating deposits with different origins or different depositional lithofacies within the same deposit. The proposed methodology could be an interesting approach to support more classical analyses of volcaniclastic deposits, especially where a clear field classification appears problematic because of a homogeneous texture of the deposits or their scarce and discontinuous outcrops. Our study is an example of the

  4. Defining critical habitats of threatened and endemic reef fishes with a multivariate approach.

    Science.gov (United States)

    Purcell, Steven W; Clarke, K Robert; Rushworth, Kelvin; Dalton, Steven J

    2014-12-01

    Understanding critical habitats of threatened and endemic animals is essential for mitigating extinction risks, developing recovery plans, and siting reserves, but assessment methods are generally lacking. We evaluated critical habitats of 8 threatened or endemic fish species on coral and rocky reefs of subtropical eastern Australia, by measuring physical and substratum-type variables of habitats at fish sightings. We used nonmetric and metric multidimensional scaling (nMDS, mMDS), Analysis of similarities (ANOSIM), similarity percentages analysis (SIMPER), permutational analysis of multivariate dispersions (PERMDISP), and other multivariate tools to distinguish critical habitats. Niche breadth was widest for 2 endemic wrasses, and reef inclination was important for several species, often found in relatively deep microhabitats. Critical habitats of mainland reef species included small caves or habitat-forming hosts such as gorgonian corals and black coral trees. Hard corals appeared important for reef fishes at Lord Howe Island, and red algae for mainland reef fishes. A wide range of habitat variables are required to assess critical habitats owing to varied affinities of species to different habitat features. We advocate assessments of critical habitats matched to the spatial scale used by the animals and a combination of multivariate methods. Our multivariate approach furnishes a general template for assessing the critical habitats of species, understanding how these vary among species, and determining differences in the degree of habitat specificity. © 2014 Society for Conservation Biology.

  5. Application of multivariate statistical methods to classify archaeological pottery from Tel-Alramad site, Syria, based on x-ray fluorescence analysis

    International Nuclear Information System (INIS)

    Bakraji, E. H.

    2007-01-01

    Radioisotopic x-ray fluorescence (XRF) analysis has been utilized to determine the elemental composition of 55 archaeological pottery samples by the determination of 17 chemical elements. Fifty-four of them came from the Tel-Alramad Site in Katana town, near Damascus city, Syria, and one sample came from Brazil. The XRF results have been processed using two multivariate statistical methods, cluster and factor analysis, in order to determine similarities and correlation between the selected samples based on their elemental composition. The methodology successfully separated the samples into four distinct chemical groups. (author)

  6. Linear models of coregionalization for multivariate lattice data: Order-dependent and order-free cMCARs.

    Science.gov (United States)

    MacNab, Ying C

    2016-08-01

    This paper is concerned with multivariate conditional autoregressive models defined by linear combinations of independent or correlated underlying spatial processes. Known as linear models of coregionalization, the method offers a systematic and unified approach for formulating multivariate extensions to a broad range of univariate conditional autoregressive models. The resulting multivariate spatial models represent classes of coregionalized multivariate conditional autoregressive models that enable flexible modelling of multivariate spatial interactions, yielding coregionalization models with symmetric or asymmetric cross-covariances of different spatial variation and smoothness. In the context of multivariate disease mapping, for example, they facilitate borrowing strength both over space and across variables, allowing for more flexible multivariate spatial smoothing. Specifically, we present a broadened coregionalization framework to include order-dependent, order-free, and order-robust multivariate models; a new class of order-free coregionalized multivariate conditional autoregressive models is introduced. We tackle computational challenges and present solutions that are integral for Bayesian analysis of these models. We also discuss two ways of computing the deviance information criterion for comparison among competing hierarchical models with or without unidentifiable prior parameters. The models and related methodology are developed in the broad context of modelling multivariate data on spatial lattices and illustrated in the context of multivariate disease mapping. The coregionalization framework and related methods also present a general approach for building spatially structured cross-covariance functions for multivariate geostatistics. © The Author(s) 2016.

  7. [Multivariate ordinal logistic regression analysis on the association between consumption of fried food and both esophageal cancer and precancerous lesions].

    Science.gov (United States)

    Guo, L W; Liu, S Z; Zhang, M; Chen, Q; Zhang, S K; Sun, X B

    2017-12-10

    Objective: To investigate the effect of fried food intake on the pathogenesis of esophageal cancer and precancerous lesions. Methods: From 2005 to 2013, all the residents aged 40-69 years from 11 counties (cities) where cancer screening of upper gastrointestinal cancer had been conducted in rural areas of Henan province, were recruited as the subjects of study. Information on demography and lifestyle was collected. The residents under study were screened with iodine staining endoscopic examination and biopsy samples were diagnosed pathologically, under standardized criteria. Subjects with high risk were divided into the groups based on their different pathological degrees. Multivariate ordinal logistic regression analysis was used to analyze the relationship between the frequency of fried food intake and esophageal cancer and precancerous lesions. Results: A total number of 8 792 cases with normal esophagus, 3 680 with mild hyperplasia, 972 with moderate hyperplasia, 413 with severe hyperplasia carcinoma in situ, and 336 cases of esophageal cancer were recruited. Results from multivariate logistic regression analysis showed that, when compared with those who did not eat fried food, the intake of fried food (food appeared a risk factor for both esophageal cancer and precancerous lesions.

  8. Antioxidant activity of Costa Rican propolis: a multivariate analysis approach

    International Nuclear Information System (INIS)

    Umana Rojas, Eduardo; Solado, Godofredo; Tamayo-Castillo, Giselle

    2013-01-01

    Propolis is produced by Apis mellifera bees from resins of plants found around the apiary. Its chemical composition is highly variable, and no characterization studies defining the types of propolis in Costa Rica have been reported. A total of 119 samples were collected from beekeeping areas of the country. The ¹H-NMR spectrum of each sample and its antioxidant activity against the DPPH radical were measured. The spectra were divided into 243 blocks of 0.04 ppm and processed with the Minitab software for multivariate analysis. Ninety-nine of the samples were used to construct the models; their predictive ability was assessed using the coefficient of determination (R²) of prediction reported by the software and the remaining 20 samples. Principal component analysis (PCA) revealed three types of propolis with chemically different metabolomes. A prediction model was constructed by partial least squares (PLS) analysis. The model classifies a propolis sample according to its level of antioxidant activity (AAO), high (types I and II) or low (type III), from its ¹H-NMR spectrum. R² was 0.88 and the prediction R² was 0.718 for new samples. Two antioxidants that discriminate between groups I and II, nconiferyl benzoate (group I) and nemorosone (group II), were isolated, and the high concentrations of these compounds differentiate these groups from type III. This allowed the construction of a linear discriminant model with a success rate of 100% for the samples used to build it and 92.9% for the prediction of different samples. The classification systems could be applied to standardize the quality of propolis from Costa Rica for future medicinal or cosmetic applications that take advantage of its antioxidant properties. Also, the methylated derivative of nconiferyl benzoate has been isolated and identified from the same propolis from which its counterpart was obtained

  9. Multivariate analysis of dopaminergic gene variants as risk factors of heroin dependence.

    Directory of Open Access Journals (Sweden)

    Andrea Vereczkei

    Full Text Available BACKGROUND: Heroin dependence is a debilitating psychiatric disorder with complex inheritance. Since the dopaminergic system has a key role in the rewarding mechanism of the brain, which is directly or indirectly targeted by most drugs of abuse, we focus on the effects and interactions among dopaminergic gene variants. OBJECTIVE: To study the potential association between allelic variants of the dopamine D2 receptor (DRD2), ANKK1 (ankyrin repeat and kinase domain containing 1), dopamine D4 receptor (DRD4), catechol-O-methyl transferase (COMT) and dopamine transporter (SLC6A3) genes and heroin dependence in Hungarian patients. METHODS: 303 heroin dependent subjects and 555 healthy controls were genotyped for 7 single nucleotide polymorphisms (SNPs): rs4680 of the COMT gene; rs1079597 and rs1800498 of the DRD2 gene; rs1800497 of the ANKK1 gene; rs1800955, rs936462 and rs747302 of the DRD4 gene. Four variable number of tandem repeats (VNTRs) were also genotyped: 120 bp duplication and 48 bp VNTR in exon 3 of DRD4 and 40 bp VNTR and intron 8 VNTR of SLC6A3. We also perform a multivariate analysis of associations using Bayesian networks in Bayesian multilevel analysis (BN-BMLA). FINDINGS AND CONCLUSIONS: In single marker analysis the TaqIA (rs1800497) and TaqIB (rs1079597) variants were associated with heroin dependence. Moreover, the -521 C/T SNP (rs1800955) of the DRD4 gene showed nominal association with a possible protective effect of the C allele. After applying the Bonferroni correction TaqIB was still significant, suggesting that the minor (A) allele of the TaqIB SNP is a risk component in the genetic background of heroin dependence. The findings of the additional multiple marker analysis are consistent with the results of the single marker analysis, but this method was able to reveal an indirect effect of a promoter polymorphism (rs936462) of the DRD4 gene and this effect is mediated through the -521 C/T (rs1800955) polymorphism in the promoter.

  10. Multivariate analysis in relation to breeding system in opium poppy, Papaver somniferum L.

    Directory of Open Access Journals (Sweden)

    Singh S.P.

    2004-01-01

    Full Text Available The opium poppy (Papaver somniferum L.) is an important medicinal plant of great pharmacopoeial use. A total of 101 germplasm lines of different eco-geographical origin maintained at the National Botanical Research Institute, Lucknow were evaluated to study the genetic divergence for seed yield/plant, opium yield/plant and its 8 component traits using multivariate and canonical analysis. The genotypes were grouped into 13 clusters, and the grouping was confirmed by canonical analysis. Sixty-eight percent of the genotypes (69/101) were genetically close to each other and grouped in 6 clusters (II, III, IV, V, VIII, XII), while apparent diversity was noticed for 32 percent (32/101) of the genotypes, which diverged into the remaining 7 clusters (I, VI, VII, IX, X, XI, XIII). Inter-cluster distance ranged from 47.28 to 234.55. The maximum was between IX and X, followed by VII and IX (208.30) and IX and XI (205.53). The genotypes in clusters IX, X, XI, and XII had greater potential as breeding stock by virtue of high mean values of one or more component characters and high statistical distance among them. Based on the findings of high cluster means of component traits and inter-cluster distances, a breeding plan is discussed.

  11. Mini-DIAL system measurements coupled with multivariate data analysis to identify TIC and TIM simulants: preliminary absorption database analysis

    International Nuclear Information System (INIS)

    Gaudio, P; Malizia, A; Gelfusa, M; Poggi, L.A.; Martinelli, E.; Di Natale, C.; Bellecci, C.

    2017-01-01

    Toxic Industrial Components (TICs) and Toxic Industrial Materials (TIMs) are among the most dangerous and widespread vehicles of contamination in urban and industrial areas. Academia, industry and the military are working on innovative solutions to monitor the diffusion of such pollutants in the atmosphere. At present the most common commercial sensors are based on “point detection” technology, but such instruments clearly cannot satisfy the needs of smart cities. The new challenge is to develop stand-off systems that continuously monitor the atmosphere. The Quantum Electronics and Plasma Physics (QEP) research group has long experience in laser system development and has built two demonstrators, based on DIAL (Differential Absorption of Light) technology, that may be able to identify chemical agents in the atmosphere. In this work the authors present one of those DIAL systems, the miniaturized one, together with the preliminary results of an experimental campaign conducted on TIC and TIM simulants in a cell, with the aim of using the resulting absorption database for further atmospheric analysis with the same DIAL system. The experimental results are analysed with a standard multivariate data analysis technique, Principal Component Analysis (PCA), to develop a classification model aimed at identifying organic chemical compounds in the atmosphere. The preliminary absorption coefficients of some chemical compounds are shown together with the pre-PCA analysis. (paper)

  12. Multivariate statistical analysis of electron energy-loss spectroscopy in anisotropic materials

    International Nuclear Information System (INIS)

    Hu Xuerang; Sun Yuekui; Yuan Jun

    2008-01-01

    Recently, an expression has been developed to take into account the complex dependence of the fine structure in core-level electron energy-loss spectroscopy (EELS) in anisotropic materials on specimen orientation and spectral collection conditions [Y. Sun, J. Yuan, Phys. Rev. B 71 (2005) 125109]. One application of this expression is the development of a phenomenological theory of magic-angle electron energy-loss spectroscopy (MAEELS), which can be used to extract the isotropically averaged spectral information for materials with arbitrary anisotropy. Here we use this expression to extract not only the isotropically averaged spectral information, but also the anisotropic spectral components, without the restriction of MAEELS. The application is based on a multivariate statistical analysis of core-level EELS for anisotropic materials. To demonstrate the applicability of this approach, we have conducted a study on a set of carbon K-edge spectra of multi-wall carbon nanotube (MWCNT) acquired with energy-loss spectroscopic profiling (ELSP) technique and successfully extracted both the averaged and dichroic spectral components of the wrapped graphite-like sheets. Our result shows that this can be a practical alternative to MAEELS for the study of electronic structure of anisotropic materials, in particular for those nanostructures made of layered materials

  13. [Association between hip fractures and risk factors for osteoporosis. Multivariate analysis].

    Science.gov (United States)

    Masoni, Ana; Morosano, Mario; Tomat, María Florencia; Pezzotto, Stella M; Sánchez, Ariel

    2007-01-01

    In this observational, case-control study, 376 inpatients were evaluated in order to determine the association of risk factors (RF) and hip fracture; 151 patients had osteoporotic hip fracture (cases); the remaining were controls. Data were obtained from medical charts, and through a standardized questionnaire about RF. Mean age of the sample (+/- SD) was 80.6 +/- 8.1 years, without statistically significant difference between cases and controls; the female:male ratio was 3:1 in both groups. Fractured women were older than men (82.5 +/- 8.1 vs. 79.7 +/- 7.2 years, respectively; p household duties was a RF (p = 0.007), which was absent in males. In multivariate analysis, the following RF were significantly more frequent: Cognitive impairment (p = 0.001), and previous falls (p < 0.0001); whereas the following protective factors were significantly different from controls: Calcium intake during youth (p < 0.0001), current calcium intake (p < 0.0001), and mechanical aid for walking (p < 0.0001). Evaluation of RF and protective factors may contribute to diminish the probability of hip fracture, through a modification of personal habits, and measures to prevent falls among elderly adults. Present information can help to develop local and national population-based strategies to diminish the burden of hip fractures for the health system.

  14. Population structure of the Korean gizzard shad, Konosirus punctatus (Clupeiformes, Clupeidae) using multivariate morphometric analysis

    Science.gov (United States)

    Myoung, Se Hun; Kim, Jin-Koo

    2016-03-01

    The gizzard shad, Konosirus punctatus, is one of the most important fish species in Korea, China, Japan and Taiwan, and therefore the implementation of an appropriate population structure analysis is both necessary and fitting. In order to clarify the current distribution range for the two lineages of the Korean gizzard shad (Myoung and Kim 2014), we conducted a multivariate morphometric analysis by locality and lineage. We analyzed 17 morphometric and 5 meristic characters of 173 individuals, which were sampled from eight localities in the East Sea, the Yellow Sea and the Korean Strait. Unlike population genetics studies, the canonical discriminant analysis (CDA) results showed that the two morphotypes were clearly segregated by the center value "0" of CAN1, of which morphotype A occurred from the Yellow Sea to the western Korean Strait with negative values, and morphotype B occurred from the East Sea to the eastern Korean Strait with positive values even though there exists an admixture zone in the eastern Korean Strait. Further studies using more sensitive markers such as microsatellite DNA are required in order to define the true relationship between the two lineages.

  15. Factor analysis of multivariate data

    Digital Repository Service at National Institute of Oceanography (India)

    Fernandes, A.A.; Mahadevan, R.

    A brief introduction to factor analysis is presented. A FORTRAN program, which can perform the Q-mode and R-mode factor analysis and the singular value decomposition of a given data matrix, is presented in Appendix B. This computer program uses...

  16. Comparative multivariate analysis of biometric traits of West African Dwarf and Red Sokoto goats.

    Science.gov (United States)

    Yakubu, Abdulmojeed; Salako, Adebowale E; Imumorin, Ikhide G

    2011-03-01

    The population structure of 302 randomly selected West African Dwarf (WAD) and Red Sokoto (RS) goats was examined using multivariate morphometric analyses. This was to make the case for conservation, rational management and genetic improvement of these two most important Nigerian goat breeds. Fifteen morphometric measurements were made on each individual animal. RS goats were superior (Pgoats, three components were obtained for their RS counterparts with variation in the loading traits of each component for each breed. The Mahalanobis distance of 72.28 indicated a high degree of spatial racial separation in morphology between the genotypes. The Ward's option of the cluster analysis consolidated the morphometric distinctness of the two breeds. Application of selective breeding to genetic improvement would benefit from the detected phenotypic differentiation. Other implications for management and conservation of the goats are highlighted.
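
    The Mahalanobis distance quoted in this record measures the separation between two group centroids after scaling by the within-group covariance. The following is a minimal sketch of that calculation for two variables, using only the Python standard library. The function names and the toy measurements are hypothetical, the pooled-covariance form is an assumption, and the record does not state whether the reported value of 72.28 is the squared or unsquared distance.

```python
def mean_vector(rows):
    """Column-wise mean of a list of equal-length measurement tuples."""
    n = len(rows)
    return [sum(r[j] for r in rows) / n for j in range(len(rows[0]))]


def pooled_covariance(group_a, group_b):
    """Pooled within-group covariance matrix (divisor n_a + n_b - 2)."""
    p = len(group_a[0])
    pooled = [[0.0] * p for _ in range(p)]
    for rows in (group_a, group_b):
        m = mean_vector(rows)
        for r in rows:
            for i in range(p):
                for j in range(p):
                    pooled[i][j] += (r[i] - m[i]) * (r[j] - m[j])
    dof = len(group_a) + len(group_b) - 2
    return [[v / dof for v in row] for row in pooled]


def invert_2x2(m):
    """Closed-form inverse of a 2x2 matrix (this sketch handles two traits only)."""
    det = m[0][0] * m[1][1] - m[0][1] * m[1][0]
    return [[m[1][1] / det, -m[0][1] / det],
            [-m[1][0] / det, m[0][0] / det]]


def mahalanobis_squared(group_a, group_b):
    """Squared Mahalanobis distance between the centroids of two groups."""
    diff = [a - b for a, b in zip(mean_vector(group_a), mean_vector(group_b))]
    inv = invert_2x2(pooled_covariance(group_a, group_b))
    return sum(diff[i] * inv[i][j] * diff[j] for i in range(2) for j in range(2))


# Hypothetical toy data: two traits measured on four animals per breed.
breed_a = [(0.0, 0.0), (2.0, 0.0), (0.0, 2.0), (2.0, 2.0)]
breed_b = [(3.0, 4.0), (5.0, 4.0), (3.0, 6.0), (5.0, 6.0)]
d2 = mahalanobis_squared(breed_a, breed_b)
```

    With fifteen morphometric traits, as in the study, the same formula applies but a general matrix inverse would replace the 2x2 closed form.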

  17. Comparative multivariate analyses of transient otoacoustic emissions and distorsion products in normal and impaired hearing.

    Science.gov (United States)

    Stamate, Mirela Cristina; Todor, Nicolae; Cosgarea, Marcel

    2015-01-01

    The clinical utility of otoacoustic emissions as a noninvasive objective test of cochlear function has long been studied. Both transient otoacoustic emissions and distortion products can be used to identify hearing loss, but to what extent they can be used as predictors for hearing loss is still debated. Most studies agree that multivariate analyses have better test performances than univariate analyses. The aim of the study was to determine the performance of transient otoacoustic emissions and distortion products in identifying normal and impaired hearing, using the pure tone audiogram as a gold standard procedure and different multivariate statistical approaches. The study included 105 adult subjects with normal hearing and hearing loss who underwent the same test battery: pure-tone audiometry, tympanometry, otoacoustic emission tests. We chose to use logistic regression as the multivariate statistical technique. Three logistic regression models were developed to characterize the relations between different risk factors (age, sex, tinnitus, demographic features, cochlear status defined by otoacoustic emissions) and hearing status defined by pure-tone audiometry. The multivariate analyses allow the calculation of the logistic score, which is a combination of the inputs, weighted by coefficients, calculated within the analyses. The accuracy of each model was assessed using receiver operating characteristic curve analysis. We used the logistic score to generate receiver operating characteristic curves and to estimate the areas under the curves in order to compare different multivariate analyses. We compared the performance of each otoacoustic emission (transient, distortion product) using three different multivariate analyses for each ear, when multi-frequency gold standards were used. We demonstrated that all multivariate analyses provided high values of the area under the curve, proving the performance of the otoacoustic emissions. Each otoacoustic emission test presented high
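
    The logistic score and the area under the receiver operating characteristic curve described in this record can be sketched as follows. This is an illustrative example only: the function names, coefficients and toy scores are hypothetical and are not taken from the study, and the area is computed with the rank-based (pairwise comparison) definition rather than by tracing the curve.

```python
import math


def logistic_score(inputs, coefficients, intercept):
    """Weighted combination of the inputs passed through the logistic function."""
    z = intercept + sum(c * x for c, x in zip(coefficients, inputs))
    return 1.0 / (1.0 + math.exp(-z))


def roc_auc(scores, labels):
    """Area under the ROC curve.

    Computed as the probability that a randomly chosen positive case
    (label 1) receives a higher score than a randomly chosen negative
    case (label 0); tied scores count as half.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(positives) * len(negatives))


# Hypothetical scores for four ears; label 1 = hearing loss on the audiogram.
example_scores = [0.1, 0.4, 0.35, 0.8]
example_labels = [0, 0, 1, 1]
example_auc = roc_auc(example_scores, example_labels)
```

    Comparing models then amounts to comparing their areas on the same subjects against the same gold standard.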

  18. Analysis of ASTER data for mapping bauxite rich pockets within high altitude lateritic bauxite, Jharkhand, India

    Science.gov (United States)

    Guha, Arindam; Singh, Vivek Kr.; Parveen, Reshma; Kumar, K. Vinod; Jeyaseelan, A. T.; Dhanamjaya Rao, E. N.

    2013-04-01

    Bauxite deposits of Jharkhand in India result from the lateritization process and therefore are often associated with the laterites. In the present study, an ASTER (Advanced Spaceborne Thermal Emission and Reflection Radiometer) image is processed to delineate bauxite rich pockets within the laterites. In this regard, spectral signatures of lateritic bauxite samples are analyzed in the laboratory with reference to the spectral features of gibbsite (main mineral constituent of bauxite) and goethite (main mineral constituent of laterite) in the VNIR-SWIR (visible-near infrared and short wave infrared) electromagnetic domain. The analysis of spectral signatures of lateritic bauxite samples helps in understanding the differences in the spectral features of bauxites and laterites. Based on these differences, ASTER data based relative band depth and simple ratio images are derived for spatial mapping of the bauxites developed within the lateritic province. In order to integrate the complementary information of the different index images, an index based principal component (IPC) image is derived to incorporate the correlative information of these indices to delineate bauxite rich pockets. The occurrences of bauxite rich pockets derived from the density sliced IPC image are further delimited by topographic controls, as it has been observed that the major bauxite occurrences of the area are controlled by slope and altitude. In addition, the IPC image is draped over the digital elevation model (DEM) to illustrate how bauxite rich pockets are distributed with reference to the topographic variability of the terrain. Bauxite rich pockets delineated in the IPC image are also validated against the known mine occurrences and the existing geological map of the bauxite. They are also conceptually validated based on the spectral similarity of the bauxite pixels delineated in the IPC image with the ASTER convolved laboratory spectra of bauxite samples.

  19. Rich analysis and rational models: Inferring individual behavior from infant looking data

    Science.gov (United States)

    Piantadosi, Steven T.; Kidd, Celeste; Aslin, Richard

    2013-01-01

    Studies of infant looking times over the past 50 years have provided profound insights about cognitive development, but their dependent measures and analytic techniques are quite limited. In the context of infants' attention to discrete sequential events, we show how a Bayesian data analysis approach can be combined with a rational cognitive model to create a rich data analysis framework for infant looking times. We formalize (i) a statistical learning model, (ii) a parametric linking between the learning model's beliefs and infants' looking behavior, and (iii) a data analysis model that infers parameters of the cognitive model and linking function for groups and individuals. Using this approach, we show that recent findings from Kidd, Piantadosi, and Aslin (2012) of a U-shaped relationship between look-away probability and stimulus complexity hold even within infants and are not due to averaging subjects with different types of behavior. Our results indicate that individual infants prefer stimuli of intermediate complexity, reserving attention for events that are moderately predictable given their probabilistic expectations about the world. PMID:24750256

  20. Multivariate analysis of factors predicting prostate dose in intensity-modulated radiotherapy

    Energy Technology Data Exchange (ETDEWEB)

    Tomita, Tsuneyuki [Division of Radiology, Osaka Red Cross Hospital, Osaka (Japan); Nakamura, Mitsuhiro, E-mail: m_nkmr@kuhp.kyoto-u.ac.jp [Department of Radiation Oncology and Image-applied Therapy, Graduate School of Medicine, Kyoto University, Kyoto (Japan); Hirose, Yoshinori; Kitsuda, Kenji; Notogawa, Takuya; Miki, Katsuhito [Division of Radiology, Osaka Red Cross Hospital, Osaka (Japan); Nakamura, Kiyonao; Ishigaki, Takashi [Department of Radiation Oncology, Osaka Red Cross Hospital, Osaka (Japan)

    2014-01-01

    We conducted a multivariate analysis to determine relationships between prostate radiation dose and the state of surrounding organs, including organ volumes and the internal angle of the levator ani muscle (LAM), based on cone-beam computed tomography (CBCT) images after bone matching. We analyzed 270 CBCT data sets from 30 consecutive patients receiving intensity-modulated radiation therapy for prostate cancer. With patients in the supine position on a couch with the HipFix system, data for center of mass (COM) displacement of the prostate and the state of individual organs were acquired and compared between planning CT and CBCT scans. Dose distributions were then recalculated based on CBCT images. The relative effects of factors on the variance in COM, dose covering 95% of the prostate volume (D_95%), and percentage of prostate volume covered by the 100% isodose line (V_100%) were evaluated by a backward stepwise multiple regression analysis. COM displacement in the anterior-posterior direction (COM_AP) correlated significantly with the rectum volume (δVr) and the internal LAM angle (δθ; R = 0.63). Weak correlations were seen for COM in the left-right (R = 0.18) and superior-inferior directions (R = 0.31). Strong correlations between COM_AP and prostate D_95% and V_100% were observed (R ≥ 0.69). Additionally, the change ratios in δVr and δθ remained as predictors of prostate D_95% and V_100%. This study shows statistically that maintaining the same rectum volume and LAM state for both the planning CT simulation and treatment is important to ensure the correct prostate dose in the supine position with bone matching.

  1. Pre-processing of Fourier transform infrared spectra by means of multivariate analysis implemented in the R environment.

    Science.gov (United States)

    Banas, Krzysztof; Banas, Agnieszka; Gajda, Mariusz; Pawlicki, Bohdan; Kwiatek, Wojciech M; Breese, Mark B H

    2015-04-21

    Pre-processing of Fourier transform infrared (FTIR) spectra is typically the first and crucial step in data analysis. Very often hyperspectral datasets include the regions characterized by the spectra of very low intensity, for example two-dimensional (2D) maps where the areas with only support materials (like mylar foil) are present. In that case segmentation of the complete dataset is required before subsequent evaluation. The method proposed in this contribution is based on a multivariate approach (hierarchical cluster analysis), and shows its superiority when compared to the standard method of cutting-off by using only the mean spectral intensity. Both techniques were implemented and their performance was tested in the R statistical environment - open-source platform - that is a favourable solution if the repeatability and transparency are the key aspects.

  2. Role Of Family Planning Practices In The Control And Prevention of Uterine Cervical Cancer- A Multivariate Analysis

    Directory of Open Access Journals (Sweden)

    Sharma S

    1995-01-01

    Full Text Available Research Question: Does acceptance of family planning reduce the risk of uterine cervical cancer? Objective: To study the association between usage of contraceptive methods and cervical carcinogenesis. Study design: Case control study. Settings: Urban Area - Hospital Based. Participants: 160 women having different degrees of dysplasia and 173 women having normal pap smears. Statistical Analysis: Multivariate Analysis. Results: None of the three widely prevalent Family Planning practices, viz. IUD, condoms and tubectomy, turned out to be significant in the development of dysplasia; however, age at consummation of marriage before 18 years and illiteracy were significant. Use of IUD offered protection against carcinoma in situ (CIS) and disease of invasive nature. Non-users of condoms were also at risk, marginally failing to attain statistical significance.

  3. Multivariate strategies in functional magnetic resonance imaging

    DEFF Research Database (Denmark)

    Hansen, Lars Kai

    2007-01-01

    We discuss aspects of multivariate fMRI modeling, including the statistical evaluation of multivariate models and means for dimensional reduction. In a case study we analyze linear and non-linear dimensional reduction tools in the context of a `mind reading' predictive multivariate fMRI model....

  4. Clustering of samples and elements based on multi-variable chemical data

    International Nuclear Information System (INIS)

    Op de Beeck, J.

    1984-01-01

    Clustering and classification are defined in the context of multivariable chemical analysis data. Classical multivariate techniques, commonly used to interpret such data, are shown to be based on probabilistic and geometrical principles which are not justified for analytical data, since in that case one assumes or expects a system of more or less systematically related objects (samples) as defined by measurements on more or less systematically interdependent variables (elements). For the specific analytical problem of a data set concerning a large number of trace elements determined in a large number of samples, a deterministic cluster analysis can be used to develop the underlying classification structure. Three main steps can be distinguished: diagnostic evaluation and preprocessing of the raw input data; computation of a symmetric matrix with pairwise standardized dissimilarity values between all possible pairs of samples and/or elements; and an ultrametric clustering strategy to produce the final classification as a dendrogram. The software packages designed to perform these tasks are discussed and final results are given. Conclusions are formulated concerning the dangers of using multivariate, clustering and classification software packages as a black box.
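
    The three steps named in this record (preprocessing, a symmetric pairwise dissimilarity matrix, and an ultrametric clustering that yields a dendrogram) can be sketched with the standard library alone. This is a minimal illustration under stated assumptions: column standardization, Euclidean dissimilarity and single linkage are choices made here for the example, the function names are hypothetical, and the record does not say which measures or linkage the author's software uses.

```python
import math


def standardize(data):
    """Step 1: scale each variable (column) to zero mean and unit standard deviation."""
    columns = list(zip(*data))
    means = [sum(c) / len(c) for c in columns]
    # A constant column has zero spread; divide by 1.0 instead to avoid dividing by zero.
    sds = [math.sqrt(sum((v - m) ** 2 for v in c) / len(c)) or 1.0
           for c, m in zip(columns, means)]
    return [[(v - m) / s for v, m, s in zip(row, means, sds)] for row in data]


def dissimilarity_matrix(data):
    """Step 2: symmetric matrix of pairwise Euclidean dissimilarities between samples."""
    n = len(data)
    return [[math.dist(data[i], data[j]) for j in range(n)] for i in range(n)]


def single_linkage(dist):
    """Step 3: agglomerative single-linkage clustering.

    Returns the dendrogram as a list of merges (members_a, members_b, height).
    Merge heights never decrease, so they define an ultrametric on the samples.
    """
    clusters = [frozenset([i]) for i in range(len(dist))]
    merges = []
    while len(clusters) > 1:
        best = None
        for a in range(len(clusters)):
            for b in range(a + 1, len(clusters)):
                height = min(dist[i][j] for i in clusters[a] for j in clusters[b])
                if best is None or height < best[0]:
                    best = (height, a, b)
        height, a, b = best
        merges.append((sorted(clusters[a]), sorted(clusters[b]), height))
        merged = clusters[a] | clusters[b]
        clusters = [c for k, c in enumerate(clusters) if k not in (a, b)] + [merged]
    return merges


# Hypothetical single-element concentrations for four samples.
toy_samples = [[0.0], [1.0], [5.0], [6.0]]
toy_merges = single_linkage(dissimilarity_matrix(toy_samples))
```

    Clustering elements rather than samples only requires transposing the data matrix before step 2.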

  5. Hydrogeochemistry and quality of surface water and groundwater in the vicinity of Lake Monoun, West Cameroon: approach from multivariate statistical analysis and stable isotopic characterization.

    Science.gov (United States)

    Kamtchueng, Brice T; Fantong, Wilson Y; Wirmvem, Mengnjo J; Tiodjio, Rosine E; Takounjou, Alain F; Ndam Ngoupayou, Jules R; Kusakabe, Minoru; Zhang, Jing; Ohba, Takeshi; Tanyileke, Gregory; Hell, Joseph V; Ueda, Akira

    2016-09-01

    With the use of conventional hydrogeochemical techniques, multivariate statistical analysis, and stable isotope approaches, this paper investigates for the first time surface water and groundwater from the surrounding areas of Lake Monoun (LM), West Cameroon. The results reveal that waters are generally slightly acidic to neutral. The relative abundances of major dissolved species are Ca²⁺ > Mg²⁺ > Na⁺ > K⁺ for cations and HCO₃⁻ ≫ NO₃⁻ > Cl⁻ > SO₄²⁻ for anions. The main water type is Ca-Mg-HCO₃. Observed salinity is related to water-rock interaction, ion exchange process, and anthropogenic activities. Nitrate and chloride have been identified as the most common pollutants. These pollutants are attributed to the chlorination of wells and leaching from pit latrines and refuse dumps. The stable isotopic compositions in the investigated water sources suggest evidence of evaporation before recharge. Four major groups of waters were identified by salinity and NO₃ concentrations using the Q-mode hierarchical cluster analysis (HCA). Consistent with the isotopic results, group 1 represents fresh unpolluted water occurring near the recharge zone in the general flow regime; groups 2 and 3 are mixed water whose composition is controlled by both weathering of rock-forming minerals and anthropogenic activities; group 4 represents water under high vulnerability of anthropogenic pollution. Moreover, the isotopic results and the HCA showed that the CO₂-rich bottom water of LM belongs to an isolated hydrological system within the Foumbot plain. Except for some springs, groundwater in the area is inappropriate for drinking and domestic purposes but good to excellent for irrigation.

  6. Multivariate Bonferroni-type inequalities theory and applications

    CERN Document Server

    Chen, John

    2014-01-01

    Multivariate Bonferroni-Type Inequalities: Theory and Applications presents a systematic account of research discoveries on multivariate Bonferroni-type inequalities published in the past decade. The emergence of new bounding approaches pushes the conventional definitions of optimal inequalities and demands new insights into linear and Fréchet optimality. The book explores these advances in bounding techniques with corresponding innovative applications. It presents the method of linear programming for multivariate bounds, multivariate hybrid bounds, sub-Markovian bounds, and bounds using Hamil

  7. Multivariate Matrix-Exponential Distributions

    DEFF Research Database (Denmark)

    Bladt, Mogens; Nielsen, Bo Friis

    2010-01-01

    be written as linear combinations of the elements in the exponential of a matrix. For this reason we shall refer to multivariate distributions with rational Laplace transform as multivariate matrix-exponential distributions (MVME). The marginal distributions of an MVME are univariate matrix-exponential distributions. We prove a characterization that states that a distribution is an MVME distribution if and only if all non-negative, non-null linear combinations of the coordinates have a univariate matrix-exponential distribution. This theorem is analogous to a well-known characterization theorem...

  8. Multivariate PAT solutions for biopharmaceutical cultivation: current progress and limitations

    NARCIS (Netherlands)

    Mercier, S.M.; Diepenbroek, B.; Wijffels, R.H.; Streefland, M.

    2014-01-01

    Increasingly elaborate and voluminous datasets are generated by the (bio)pharmaceutical industry and are a major challenge for application of PAT and QbD principles. Multivariate data analysis (MVDA) is required to delineate relevant process information from large multi-factorial and multi-collinear

  9. Comparative analysis of chicken chromosome 28 provides new clues to the evolutionary fragility of gene-rich vertebrate regions

    NARCIS (Netherlands)

    Gordon, L.; Yang, S.; Tran-Gyamfi, M.; Baggott, D.; Christensen, M.; Hamilton, A.; Crooijmans, R.P.M.A.; Groenen, M.A.M.; Lucas, S.; Ovcharenko, I.; Stubbs, L.

    2007-01-01

    The chicken genome draft sequence has provided a valuable resource for studies of an important agricultural and experimental model species and an important data set for comparative analysis. However, some of the most gene-rich segments are missing from chicken genome draft assemblies, limiting the

  10. Preliminary Multi-Variable Parametric Cost Model for Space Telescopes

    Science.gov (United States)

    Stahl, H. Philip; Hendrichs, Todd

    2010-01-01

    This slide presentation reviews the creation of a preliminary multi-variable cost model for the contract costs of making a space telescope. It discusses the methodology for collecting the data, the definition of the statistical analysis methodology, single-variable model results, and testing of historical models, and introduces the multi-variable models.

  11. Multivariate analysis of historical data (2004-2013) in assessing the possible environmental impact of the Bellolampo landfill (Palermo).

    Science.gov (United States)

    Indelicato, Serena; Bongiorno, David; Tuzzolino, Nicola; Mannino, Maria Rosaria; Muscarella, Rosalia; Fradella, Pasquale; Gargano, Maria Elena; Nicosia, Salvatore; Ceraulo, Leopoldo

    2018-03-14

    Multivariate analysis was performed on a large data set of groundwater and leachate samples collected during 9 years of operation of the Bellolampo municipal solid waste landfill (located above Palermo, Italy). The aim was to obtain the most likely correlations among the data. The analysis results are presented. Groundwater samples were collected in the period 2004-2013, whereas the leachate analysis refers to the period 2006-2013. For groundwater, statistical data evaluation revealed notable differences among the samples taken from the numerous wells located around the landfill. Characteristic parameters revealed by principal component analysis (PCA) were more deeply investigated, and corresponding thematic maps were drawn. The composition of the leachate was also thoroughly investigated. Several chemical macro-descriptors were calculated, and the results are presented. A comparison of PCA results for the leachate and groundwater data clearly reveals that the groundwater's main components substantially differ from those of the leachate. This outcome strongly suggests excluding leachate permeation through the multiple landfill lining.
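
    To make the PCA step in this abstract concrete, the following sketch extracts the first principal component of a small table and reports the share of variance it explains. The well measurements are invented, the function names are hypothetical, and power iteration on the covariance matrix is used only to keep the example dependency-free; it is not claimed to be the authors' procedure.

```python
import math

def covariance_matrix(rows):
    """Sample covariance matrix of the mean-centered columns."""
    n, p = len(rows), len(rows[0])
    means = [sum(r[j] for r in rows) / n for j in range(p)]
    centered = [[r[j] - means[j] for j in range(p)] for r in rows]
    return [[sum(c[a] * c[b] for c in centered) / (n - 1) for b in range(p)]
            for a in range(p)]

def first_principal_component(cov, iterations=500):
    """Power iteration: largest eigenvalue and its unit-length eigenvector."""
    p = len(cov)
    v = [1.0 / math.sqrt(p)] * p
    for _ in range(iterations):
        w = [sum(cov[a][b] * v[b] for b in range(p)) for a in range(p)]
        norm = math.sqrt(sum(x * x for x in w))
        v = [x / norm for x in w]
    # Rayleigh quotient gives the eigenvalue for the converged vector.
    eigenvalue = sum(v[a] * sum(cov[a][b] * v[b] for b in range(p))
                     for a in range(p))
    return eigenvalue, v

# Hypothetical well measurements: (chloride, sodium, nitrate), arbitrary units
wells = [(10.0, 8.0, 1.0), (20.0, 17.0, 1.2), (30.0, 24.0, 0.9), (40.0, 33.0, 1.1)]
cov = covariance_matrix(wells)
eigenvalue, loadings = first_principal_component(cov)
# Share of total variance carried by the first component.
explained = eigenvalue / sum(cov[i][i] for i in range(len(cov)))
print(round(explained, 3), [round(x, 3) for x in loadings])
```

    Comparing the loadings obtained separately for groundwater and leachate tables is one way to express the kind of contrast the abstract reports. With variables on different scales, the columns would usually be standardized before this step.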

  12. Extracting bb Higgs Decay Signals using Multivariate Techniques

    Energy Technology Data Exchange (ETDEWEB)

    Smith, W Clarke; /George Washington U. /SLAC

    2012-08-28

    For low-mass Higgs boson production at ATLAS at $\sqrt{s} = 7$ TeV, the hard subprocess $gg \to h^0 \to b\bar{b}$ dominates but is in turn drowned out by background. We seek to exploit the intrinsic few-MeV mass width of the Higgs boson to observe it above the background in $b\bar{b}$-dijet mass plots. The mass resolution of existing mass-reconstruction algorithms is insufficient for this purpose due to jet combinatorics, that is, the algorithms cannot identify every jet that results from $b\bar{b}$ Higgs decay. We combine these algorithms using the neural net (NN) and boosted regression tree (BDT) multivariate methods in an attempt to improve the mass resolution. Events involving $gg \to h^0 \to b\bar{b}$ are generated using Monte Carlo methods with Pythia and then the Toolkit for Multivariate Analysis (TMVA) is used to train and test NNs and BDTs. For a 120 GeV Standard Model Higgs boson, the $m_{h^0}$-reconstruction width is reduced from 8.6 to 6.5 GeV. Most importantly, however, the methods used here allow for more advanced $m_{h^0}$-reconstructions to be created in the future using multivariate methods.

  13. Multivariate moment closure techniques for stochastic kinetic models

    International Nuclear Information System (INIS)

    Lakatos, Eszter; Ale, Angelique; Kirk, Paul D. W.; Stumpf, Michael P. H.

    2015-01-01

    Stochastic effects dominate many chemical and biochemical processes. Their analysis, however, can be computationally prohibitively expensive and a range of approximation schemes have been proposed to lighten the computational burden. These, notably the increasingly popular linear noise approximation and the more general moment expansion methods, perform well for many dynamical regimes, especially linear systems. At higher levels of nonlinearity, it comes to an interplay between the nonlinearities and the stochastic dynamics, which is much harder to capture correctly by such approximations to the true stochastic processes. Moment-closure approaches promise to address this problem by capturing higher-order terms of the temporally evolving probability distribution. Here, we develop a set of multivariate moment-closures that allows us to describe the stochastic dynamics of nonlinear systems. Multivariate closure captures the way that correlations between different molecular species, induced by the reaction dynamics, interact with stochastic effects. We use multivariate Gaussian, gamma, and lognormal closure and illustrate their use in the context of two models that have proved challenging to the previous attempts at approximating stochastic dynamics: oscillations in p53 and Hes1. In addition, we consider a larger system, Erk-mediated mitogen-activated protein kinases signalling, where conventional stochastic simulation approaches incur unacceptably high computational costs

  14. Multivariate moment closure techniques for stochastic kinetic models

    Energy Technology Data Exchange (ETDEWEB)

    Lakatos, Eszter, E-mail: e.lakatos13@imperial.ac.uk; Ale, Angelique; Kirk, Paul D. W.; Stumpf, Michael P. H., E-mail: m.stumpf@imperial.ac.uk [Department of Life Sciences, Centre for Integrative Systems Biology and Bioinformatics, Imperial College London, London SW7 2AZ (United Kingdom)

    2015-09-07

    Stochastic effects dominate many chemical and biochemical processes. Their analysis, however, can be computationally prohibitively expensive and a range of approximation schemes have been proposed to lighten the computational burden. These, notably the increasingly popular linear noise approximation and the more general moment expansion methods, perform well for many dynamical regimes, especially linear systems. At higher levels of nonlinearity, it comes to an interplay between the nonlinearities and the stochastic dynamics, which is much harder to capture correctly by such approximations to the true stochastic processes. Moment-closure approaches promise to address this problem by capturing higher-order terms of the temporally evolving probability distribution. Here, we develop a set of multivariate moment-closures that allows us to describe the stochastic dynamics of nonlinear systems. Multivariate closure captures the way that correlations between different molecular species, induced by the reaction dynamics, interact with stochastic effects. We use multivariate Gaussian, gamma, and lognormal closure and illustrate their use in the context of two models that have proved challenging to the previous attempts at approximating stochastic dynamics: oscillations in p53 and Hes1. In addition, we consider a larger system, Erk-mediated mitogen-activated protein kinases signalling, where conventional stochastic simulation approaches incur unacceptably high computational costs.

  15. Correlations among behavior, performance and environment in broiler breeders using multivariate analysis

    Directory of Open Access Journals (Sweden)

    DF Pereira

    2007-12-01

    Animal welfare issues have received much attention, not only to meet the requirements of farmed animals but also in response to ethical and cultural public concerns. Daily collected information, as well as the systematic follow-up of production stages, produces important statistical data for production assessment and control, as well as for improvement possibilities. In this scenario, this research study analyzed behavioral, production, and environmental data using Main Component Multivariable Analysis, which correlated observed behaviors, recorded using video cameras and electronic identification, with performance parameters of female broiler breeders. The aim was to start building a system to support decision-making in broiler breeder housing, based on bird behavioral parameters. Birds were housed in an environmental chamber, with three pens with different controlled environments. Bird sensitivity to environmental conditions was indicated by their behaviors, stressing the importance of behavioral observations for modern poultry management. A strong association was found between performance parameters and the behavior "at the nest", suggesting that this behavior may be used to predict productivity. The behaviors of "ruffling feathers", "opening wings", "preening", and "at the drinker" were negatively correlated with environmental temperature, suggesting that an increase in the frequency of these behaviors indicates improvement of thermal welfare.

  16. The value of multivariate model sophistication

    DEFF Research Database (Denmark)

    Rombouts, Jeroen; Stentoft, Lars; Violante, Francesco

    2014-01-01

    We assess the predictive accuracies of a large number of multivariate volatility models in terms of pricing options on the Dow Jones Industrial Average. We measure the value of model sophistication in terms of dollar losses by considering a set of 444 multivariate models that differ in their spec... In addition to investigating the value of model sophistication in terms of dollar losses directly, we also use the model confidence set approach to statistically infer the set of models that delivers the best pricing performances.

  17. Fuel prices scenario generation based on a multivariate GARCH model for risk analysis in a wholesale electricity market

    International Nuclear Information System (INIS)

    Batlle, C.; Barquin, J.

    2004-01-01

    This paper presents a fuel price scenario generator in the frame of a simulation tool developed to support risk analysis in a competitive electricity environment. The tool feeds different exogenous risk factors to a wholesale electricity market model to perform a statistical analysis of the results. As the different fuel series that are studied, such as oil or gas, present stochastic volatility and strong correlation among them, a multivariate Generalized Autoregressive Conditional Heteroskedastic (GARCH) model has been designed in order to allow the generation of future fuel price paths. The model makes use of a decomposition method to simplify the consideration of the multidimensional conditional covariance. An example of its application with real data is also presented. (author)

  18. Sparse multivariate measures of similarity between intra-modal neuroimaging datasets

    Directory of Open Access Journals (Sweden)

    Maria J. Rosa

    2015-10-01

    An increasing number of neuroimaging studies are now based on either combining more than one data modality (inter-modal) or combining more than one measurement from the same modality (intra-modal). To date, most intra-modal studies using multivariate statistics have focused on differences between datasets, for instance relying on classifiers to differentiate between effects in the data. However, to fully characterize these effects, multivariate methods able to measure similarities between datasets are needed. One classical technique for estimating the relationship between two datasets is canonical correlation analysis (CCA). However, in the context of high-dimensional data the application of CCA is extremely challenging. A recent extension of CCA, sparse CCA (SCCA), overcomes this limitation by regularizing the model parameters while yielding a sparse solution. In this work, we modify SCCA with the aim of facilitating its application to high-dimensional neuroimaging data and finding meaningful multivariate image-to-image correspondences in intra-modal studies. In particular, we show how the optimal subset of variables can be estimated independently and we look at the information encoded in more than one set of SCCA transformations. We illustrate our framework using Arterial Spin Labelling data to investigate multivariate similarities between the effects of two antipsychotic drugs on cerebral blood flow.

  19. Multivariate Statistical Analysis Software Technologies for Astrophysical Research Involving Large Data Bases

    Science.gov (United States)

    Djorgovski, S. G.

    1994-01-01

    We developed a package to process and analyze the data from the digital version of the Second Palomar Sky Survey. This system, called SKICAT, incorporates the latest in machine learning and expert systems software technology, in order to classify the detected objects objectively and uniformly, and facilitate handling of the enormous data sets from digital sky surveys and other sources. The system provides a powerful, integrated environment for the manipulation and scientific investigation of catalogs from virtually any source. It serves three principal functions: image catalog construction, catalog management, and catalog analysis. Through use of the GID3* Decision Tree artificial induction software, SKICAT automates the process of classifying objects within CCD and digitized plate images. To exploit these catalogs, the system also provides tools to merge them into a large, complex database which may be easily queried and modified when new data or better methods of calibrating or classifying become available. The most innovative feature of SKICAT is the facility it provides to experiment with and apply the latest in machine learning technology to the tasks of catalog construction and analysis. SKICAT provides a unique environment for implementing these tools for any number of future scientific purposes. Initial scientific verification and performance tests have been made using galaxy counts and measurements of galaxy clustering from small subsets of the survey data, and a search for very high redshift quasars. All of the tests were successful and produced new and interesting scientific results. Attachments to this report give detailed accounts of the technical aspects of the SKICAT system, and of some of the scientific results achieved to date. We also developed a user-friendly package for multivariate statistical analysis of small and moderate-size data sets, called STATPROG. The package was tested extensively on a number of real scientific applications and has

  20. Multivariate statistical analysis software technologies for astrophysical research involving large data bases

    Science.gov (United States)

    Djorgovski, S. George

    1994-01-01

    We developed a package to process and analyze the data from the digital version of the Second Palomar Sky Survey. This system, called SKICAT, incorporates the latest in machine learning and expert systems software technology, in order to classify the detected objects objectively and uniformly, and facilitate handling of the enormous data sets from digital sky surveys and other sources. The system provides a powerful, integrated environment for the manipulation and scientific investigation of catalogs from virtually any source. It serves three principal functions: image catalog construction, catalog management, and catalog analysis. Through use of the GID3* Decision Tree artificial induction software, SKICAT automates the process of classifying objects within CCD and digitized plate images. To exploit these catalogs, the system also provides tools to merge them into a large, complete database which may be easily queried and modified when new data or better methods of calibrating or classifying become available. The most innovative feature of SKICAT is the facility it provides to experiment with and apply the latest in machine learning technology to the tasks of catalog construction and analysis. SKICAT provides a unique environment for implementing these tools for any number of future scientific purposes. Initial scientific verification and performance tests have been made using galaxy counts and measurements of galaxy clustering from small subsets of the survey data, and a search for very high redshift quasars. All of the tests were successful, and produced new and interesting scientific results. Attachments to this report give detailed accounts of the technical aspects of the SKICAT system, and of some of the scientific results achieved to date. We also developed a user-friendly package for multivariate statistical analysis of small and moderate-size data sets, called STATPROG. The package was tested extensively on a number of real scientific applications, and has produced real, published results.

  1. Multivariate data analysis of two-dimensional gel electrophoresis protein patterns from few samples

    DEFF Research Database (Denmark)

    Jensen, Kristina Nedenskov; Jessen, Flemming; Jørgensen, Bo

    2008-01-01

    One application of 2D gel electrophoresis is to reveal differences in protein pattern between two or more groups of individuals, attributable to their group membership. Multivariate data analytical methods are useful in pinpointing the spots relevant for discrimination by focusing not only on single spot differences, but on the covariance structure between proteins. However, their outcome is dependent on data scaling, and they may fail in producing valid multivariate models due to the much higher number of "irrelevant" spots present in the gels. The case where only few gels are available and where the aim is to find as many as possible of the group-dependent proteins seems particularly difficult to handle. The present paper investigates such a case regarding the effect of scaling and of prefiltering by univariate nonparametric statistics on the selection of spots. Besides, a modified...

  2. Multivariate analysis of the influences of oceanic and meteorological processes on suspended particulate matter distributions in Mississippi coastal waters

    Science.gov (United States)

    O'Brien, S. J.; Fitzpatrick, P. J.; Dzwonkowski, B.; Dykstra, S. L.; Wallace, D. J.; Church, I.; Wiggert, J. D.

    2016-02-01

    The Mississippi Sound is influenced by a high volume of sediment discharge from the Biloxi River, Mobile Bay via Pas aux Herons, Pascagoula River, Pearl River, Wolf River, and Lake Pontchartrain through the Rigolets. The river discharge, variable wind speed, wind direction and tides have a significant impact on the turbidity and transport of sediments in the Sound. Level 1 Moderate Resolution Imaging Spectroradiometer (MODIS) data is processed to extract the remote sensing reflectance at the wavelength of 645 nm and binned into an 8-day composite at a resolution of 500 m. The study uses a regional ocean color algorithm to compute suspended particulate matter (SPM) concentration based on these 8-day composite images. Multivariate analysis is applied between the SPM and time series of tides, wind, turbidity and river discharge measured at federal and academic institutions' stations and moorings. The multivariate analysis also includes in situ measurements of suspended sediment concentration and advective exchanges through the Mississippi Sound's tidal inlets between the coastal shelf and the nearshore estuarine waters. Mechanisms underlying the observed spatiotemporal distribution of SPM, including material exchange between the Sound and adjacent shelf waters, will be explored. The results of this study will contribute to current understanding of exchange mechanisms and pathways with the Mississippi Bight via the Mississippi Sound's tidal inlets.

  3. Sensitivity equation for quantitative analysis with multivariate curve resolution-alternating least-squares: theoretical and experimental approach.

    Science.gov (United States)

    Bauza, María C; Ibañez, Gabriela A; Tauler, Romà; Olivieri, Alejandro C

    2012-10-16

    A new equation is derived for estimating the sensitivity when the multivariate curve resolution-alternating least-squares (MCR-ALS) method is applied to second-order multivariate calibration data. The validity of the expression is substantiated by extensive Monte Carlo noise addition simulations. The multivariate selectivity can be derived from the new sensitivity expression. Other important figures of merit, such as limit of detection, limit of quantitation, and concentration uncertainty of MCR-ALS quantitative estimations can be easily estimated from the proposed sensitivity expression and the instrumental noise. An experimental example involving the determination of an analyte in the presence of uncalibrated interfering agents is described in detail, involving second-order time-decaying sensitized lanthanide luminescence excitation spectra. The estimated figures of merit are reasonably correlated with the analytical features of the analyzed experimental system.

  4. SOFTWARE SUPPORT FOR RICH PICTURES

    DEFF Research Database (Denmark)

    Valente, Andrea; Marchetti, Emanuela

    2010-01-01

    Rich pictures (RP) are common in object-oriented analysis and design courses, but students seem to have problems in integrating them in their projects' workflow. A new software tool is being developed, specific for RP authoring. To better understand students' issues and working practice with RP...

  5. Multivariate return periods in hydrology: a critical and practical review focusing on synthetic design hydrograph estimation

    Directory of Open Access Journals (Sweden)

    B. Gräler

    2013-04-01

    Most hydrological and hydraulic studies refer to the notion of a return period to quantify design variables. When dealing with multiple design variables, the well-known univariate statistical analysis is no longer satisfactory, and several issues challenge the practitioner. How should one incorporate the dependence between variables? How should a multivariate return period be defined and applied in order to yield a proper design event? In this study an overview of the state of the art for estimating multivariate design events is given and the different approaches are compared. The construction of multivariate distribution functions is done through the use of copulas, given their practicality in multivariate frequency analyses and their ability to model numerous types of dependence structures in a flexible way. A synthetic case study is used to generate a large data set of simulated discharges that is used for illustrating the effect of different modelling choices on the design events. Based on different uni- and multivariate approaches, the design hydrograph characteristics of a 3-D phenomenon composed of annual maximum peak discharge, its volume, and duration are derived. These approaches are based on regression analysis, bivariate conditional distributions, bivariate joint distributions and Kendall distribution functions, highlighting theoretical and practical issues of multivariate frequency analysis. An ensemble-based approach is also presented. For a given design return period, the approach chosen clearly affects the calculated design event, and much attention should be given to the choice of the approach used, as this depends on the real-world problem at hand.
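
    The point that the chosen definition changes the design event can be illustrated with a bivariate sketch. The example below compares three common return-period definitions ("OR", "AND", and Kendall) under a Gumbel-Hougaard copula. The copula family, the dependence parameter theta, the marginal probabilities, and the function names are all assumptions for illustration; the study itself treats a 3-D case and is not reproduced here.

```python
import math

def gumbel_copula(u, v, theta):
    """Gumbel-Hougaard copula C(u, v); theta >= 1, with theta = 1 meaning independence."""
    s = (-math.log(u)) ** theta + (-math.log(v)) ** theta
    return math.exp(-s ** (1.0 / theta))

def return_period_or(u, v, theta, mu=1.0):
    """Return period of {X > x OR Y > y}; mu is the mean inter-arrival time (years)."""
    return mu / (1.0 - gumbel_copula(u, v, theta))

def return_period_and(u, v, theta, mu=1.0):
    """Return period of {X > x AND Y > y}."""
    return mu / (1.0 - u - v + gumbel_copula(u, v, theta))

def return_period_kendall(t, theta, mu=1.0):
    """Kendall return period at copula level t, with K_C(t) = t - t*ln(t)/theta."""
    k = t - t * math.log(t) / theta
    return mu / (1.0 - k)

u = v = 0.99   # marginal non-exceedance probabilities (100-year univariate events)
theta = 2.0    # hypothetical dependence parameter
t_or = return_period_or(u, v, theta)
t_and = return_period_and(u, v, theta)
t_ken = return_period_kendall(gumbel_copula(u, v, theta), theta)
print(t_or, t_and, t_ken)
```

    For the same pair of 100-year marginal quantiles, the "OR" definition gives a joint return period below 100 years and the "AND" definition one above it, which is why the definition has to follow from the failure mechanism of the structure being designed.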

  6. Study of groundwater arsenic pollution in Lanyang Plain using multivariate statistical analysis

    Science.gov (United States)

    Chan, S.

    2013-12-01

    The study area, Lanyang Plain in eastern Taiwan, has highly developed agriculture and aquaculture, which consume over 70% of the water supplies. Groundwater is frequently considered as an alternative water source. However, the serious arsenic pollution of groundwater in Lanyang Plain should be well studied to ensure the safety of groundwater usage. In this study, 39 groundwater samples were collected. The hydrochemistry results demonstrate two major trends in the Piper diagram. The major trend, which covers most of the groundwater samples, is characterized by water types between Ca+Mg-HCO3 and Na+K-HCO3. This can be explained by cation exchange reactions. The minor trend clearly corresponds to seawater intrusion, with a water type of Na+K-Cl, because these samples are all located in the coastal area. Multivariate statistical analysis of the hydrochemical data was conducted to further explore the mechanism of arsenic contamination. Two major factors can be extracted with factor analysis. The major factor includes Ca, Mg and Sr while the minor factor includes Na, K and As. This reconfirms that cation exchange mainly controls the groundwater hydrochemistry in the study area. It is worth noting that arsenic is positively related to Na and K. The result of cluster analysis shows that groundwater samples with high arsenic concentration can be grouped with those with high Na, K and HCO3. This supports the view that cation exchange enhances the release of arsenic and excludes the effect of seawater intrusion. In other words, the water-rock reaction time is key to obtaining higher arsenic content. In general, the major sources of arsenic in sediments include exchangeable, reducible and oxidizable phases, which are adsorbed ions, Fe-Mn oxides and organic matter/pyrite, respectively. However, the results of factor analysis do not show an apparent correlation between arsenic and Fe/Mn. This may exclude Fe-Mn oxides as a major source of arsenic. The other sources

  7. Multivariate analysis of chromatographic retention data as a supplementary means for grouping structurally related compounds.

    Science.gov (United States)

    Fasoula, S; Zisi, Ch; Sampsonidis, I; Virgiliou, Ch; Theodoridis, G; Gika, H; Nikitas, P; Pappa-Louisi, A

    2015-03-27

    In the present study a series of 45 metabolite standards belonging to four chemically similar metabolite classes (sugars, amino acids, nucleosides and nucleobases, and amines) was subjected to LC analysis on three HILIC columns under 21 different gradient conditions with the aim of exploring whether the retention properties of these analytes are determined by the chemical group to which they belong. Two multivariate techniques, principal component analysis (PCA) and discriminant analysis (DA), were used for statistical evaluation of the chromatographic data and extraction of similarities between chemically related compounds. The total variance explained by the first two principal components of PCA was found to be about 98%, whereas both statistical analyses indicated that all analytes are successfully grouped into four clusters of chemical structure based on the retention obtained in four or at least three chromatographic runs, which, however, should be performed on two different HILIC columns. Moreover, leave-one-out cross-validation of the above retention data set showed that the chemical group to which an analyte belongs can be predicted correctly in 95.6% of cases when the analyte is subjected to LC analysis under the same four or three experimental conditions under which the whole set of analytes was run beforehand. That, in turn, may assist with disambiguation of analyte identification in complex biological extracts.
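
    The leave-one-out cross-validation described in this abstract can be sketched as follows: each analyte is held out once, a classifier is fitted to the rest, and the held-out prediction is scored. Note the substitution: a simple nearest-centroid rule stands in for the discriminant analysis used in the study, and the retention times, class labels, and function names are hypothetical.

```python
import math

def nearest_centroid_predict(train_x, train_y, x):
    """Assign x to the class whose mean retention vector is closest (Euclidean)."""
    sums, counts = {}, {}
    for features, label in zip(train_x, train_y):
        acc = sums.setdefault(label, [0.0] * len(features))
        for k, value in enumerate(features):
            acc[k] += value
        counts[label] = counts.get(label, 0) + 1
    best_label, best_dist = None, math.inf
    for label, acc in sums.items():
        centroid = [s / counts[label] for s in acc]
        dist = math.dist(centroid, x)
        if dist < best_dist:
            best_label, best_dist = label, dist
    return best_label

def leave_one_out_accuracy(xs, ys):
    """Hold out each analyte once, train on the rest, and score the prediction."""
    hits = 0
    for i in range(len(xs)):
        train_x = xs[:i] + xs[i + 1:]
        train_y = ys[:i] + ys[i + 1:]
        hits += nearest_centroid_predict(train_x, train_y, xs[i]) == ys[i]
    return hits / len(xs)

# Hypothetical retention times (min) of six analytes in three chromatographic runs
retention = [(2.1, 3.0, 2.5), (2.3, 3.2, 2.6), (2.2, 3.1, 2.4),
             (6.0, 7.5, 6.8), (6.2, 7.7, 7.0), (5.9, 7.4, 6.9)]
classes = ["sugar", "sugar", "sugar", "amine", "amine", "amine"]
accuracy = leave_one_out_accuracy(retention, classes)
print(accuracy)
```

    Because every prediction is made for an analyte the classifier has not seen, the resulting rate is a fairer estimate of how well a new compound's chemical group could be predicted than the fit on the full data set.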

  8. Quantitative Outline-based Shape Analysis and Classification of Planetary Craterforms using Supervised Learning Models

    Science.gov (United States)

    Slezak, Thomas Joseph; Radebaugh, Jani; Christiansen, Eric

    2017-10-01

    The shapes of craterforms on planetary surfaces provide rich information about their origins and evolution. While morphologic information provides rich visual clues to geologic processes and properties, the ability to quantitatively communicate this information is less easily accomplished. This study examines the morphology of craterforms using the quantitative outline-based shape methods of geometric morphometrics, commonly used in biology and paleontology. We examine and compare landforms on planetary surfaces using shape, a property of morphology that is invariant to translation, rotation, and size. We quantify the shapes of paterae on Io, martian calderas, terrestrial basaltic shield calderas, terrestrial ash-flow calderas, and lunar impact craters using elliptic Fourier analysis (EFA) and the Zahn and Roskies (Z-R) shape function, or tangent angle approach, to produce multivariate shape descriptors. These shape descriptors are subjected to multivariate statistical analysis including canonical variate analysis (CVA), a multiple-comparison variant of discriminant analysis, to investigate the link between craterform shape and classification. Paterae on Io are most similar in shape to terrestrial ash-flow calderas and the shapes of terrestrial basaltic shield volcanoes are most similar to martian calderas. The shapes of lunar impact craters, including simple, transitional, and complex morphology, are classified with a 100% rate of success in all models. Multiple CVA models effectively predict and classify different craterforms using shape-based identification and demonstrate significant potential for use in the analysis of planetary surfaces.

  9. Prognostic factors in nodular lymphomas: a multivariate analysis based on the Princess Margaret Hospital experience

    International Nuclear Information System (INIS)

    Gospodarowicz, M.K.; Bush, R.S.; Brown, T.C.; Chua, T.

    1984-01-01

    A total of 1,394 patients with non-Hodgkin's lymphoma were treated at the Princess Margaret Hospital between January 1, 1967 and December 31, 1978. Overall actuarial survival of 525 patients with nodular lymphomas was 40% at 12 years; survival of patients with localized (Stage I and III) nodular lymphomas treated with radical radiation therapy was 58%. Significant prognostic factors defined by multivariate analysis included patient's age, stage, histology, tumor bulk, and presence of B symptoms. By combining prognostic factors, distinct prognostic groups have been identified within the overall population. Patients with Stage I and II disease, small or medium bulk, and age under 70 years achieved 92% 12-year actuarial survival and a 73% relapse-free rate in 12 years of follow-up. These patients represent groups highly curable with irradiation.

  10. Multivariate analysis and extraction of parameters in resistive RAMs using the Quantum Point Contact model

    Science.gov (United States)

    Roldán, J. B.; Miranda, E.; González-Cordero, G.; García-Fernández, P.; Romero-Zaliz, R.; González-Rodelas, P.; Aguilera, A. M.; González, M. B.; Jiménez-Molinos, F.

    2018-01-01

    A multivariate analysis of the parameters that characterize the reset process in Resistive Random Access Memory (RRAM) has been performed. The different correlations obtained can help to shed light on the current components that contribute in the Low Resistance State (LRS) of the technology considered. In addition, a screening method for the Quantum Point Contact (QPC) current component is presented. For this purpose, the second derivative of the current has been obtained using a novel numerical method which allows determining the QPC model parameters. Once the procedure is completed, a whole Resistive Switching (RS) series of thousands of curves is studied by means of a genetic algorithm. The extracted QPC parameter distributions are characterized in depth to get information about the filamentary pathways associated with LRS in the low voltage conduction regime.

  11. MULTIVARIATE ANALYSIS OF SUPERMARKET SECTOR DATA FROM THE TOPS ABRAS THE STATE OF SÃO PAULO (2010)

    Directory of Open Access Journals (Sweden)

    Paulo Rogério Alves Brene

    2014-06-01

    The objective of this paper is to propose a methodology that illustrates the applicability and importance of multivariate analysis, using the data set published by ABRAS for São Paulo for the year 2010. New indicators were developed with the aid of factor analysis (FA), which condensed 14 items of information extracted from ABRAS into 2 factors: Size and Efficiency. The application of FA was successful because it reduced the number of variables without losing much information, and the resulting grouping was consistent. Finally, there is a direct relationship between billing classification and classification by revenue size (Spearman correlation coefficient of 0.99), and there is a possibility that the data related to the efficiency of the three largest markets are underestimated.
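
    The Spearman coefficient cited in this abstract is the Pearson correlation computed on ranks. The following is a minimal sketch of that calculation; the function names and the two short rankings are hypothetical and are not taken from the ABRAS data.

```python
def average_ranks(values):
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks

def spearman(x, y):
    """Spearman rank correlation: Pearson correlation of the rank vectors."""
    rx, ry = average_ranks(x), average_ranks(y)
    n = len(rx)
    mx, my = sum(rx) / n, sum(ry) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    var_x = sum((a - mx) ** 2 for a in rx)
    var_y = sum((b - my) ** 2 for b in ry)
    return cov / (var_x * var_y) ** 0.5

# Hypothetical positions of five firms under two classifications.
by_billing = [1, 2, 3, 4, 5]
by_size = [2, 1, 4, 3, 5]
rho = spearman(by_billing, by_size)
```

    A value near 1, such as the 0.99 reported above, means the two classifications order the firms almost identically.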

  12. Multivariable control in nuclear power stations - survey of design methods

    International Nuclear Information System (INIS)

    Mcmorran, P.D.

    1979-12-01

    The development of larger nuclear generating stations increases the importance of dynamic interaction between controllers, because each control action may affect several plant outputs. Multivariable control provides the techniques to design controllers which perform well under these conditions. This report is a foundation for further work on the application of multivariable control in AECL. It covers the requirements of control and the fundamental mathematics used, then reviews the most important linear methods, based on both state-space and frequency-response concepts. State-space methods are derived from analysis of the system differential equations, while frequency-response methods use the input-output transfer function. State-space methods covered include linear-quadratic optimal control, pole shifting, and the theory of state observers and estimators. Frequency-response methods include the inverse Nyquist array method, and classical non-interactive techniques. Transfer-function methods are particularly emphasized since they can incorporate ill-defined design criteria. The underlying concepts, and the application strengths and weaknesses of each design method are presented. A review of significant applications is also given. It is concluded that the inverse Nyquist array method, a frequency-response technique based on inverse transfer-function matrices, is preferred for the design of multivariable controllers for nuclear power plants. This method may be supplemented by information obtained from a modal analysis of the plant model. (auth)

  13. Chemiluminescence-based multivariate sensing of local equivalence ratios in premixed atmospheric methane-air flames

    Energy Technology Data Exchange (ETDEWEB)

    Tripathi, Markandey M.; Krishnan, Sundar R.; Srinivasan, Kalyan K.; Yueh, Fang-Yu; Singh, Jagdish P.

    2011-09-07

    Chemiluminescence emissions from OH*, CH*, C2, and CO2 formed within the reaction zone of premixed flames depend upon the fuel-air equivalence ratio in the burning mixture. In the present paper, a new partial least squares regression (PLS-R) based multivariate sensing methodology is investigated and compared with an OH*/CH* intensity ratio-based calibration model for sensing equivalence ratio in atmospheric methane-air premixed flames. Five replications of spectral data at nine different equivalence ratios ranging from 0.73 to 1.48 were used in the calibration of both models. During model development, the PLS-R model was initially validated with the calibration data set using the leave-one-out cross validation technique. Since the PLS-R model used the entire raw spectral intensities, it did not need the nonlinear background subtraction of CO2 emission that is required for typical OH*/CH* intensity ratio calibrations. An unbiased spectral data set (not used in the PLS-R model development), for 28 different equivalence ratio conditions ranging from 0.71 to 1.67, was used to predict equivalence ratios using the PLS-R and the intensity ratio calibration models. It was found that the equivalence ratios predicted with the PLS-R based multivariate calibration model matched the experimentally measured equivalence ratios within 7%, whereas the OH*/CH* intensity ratio calibration grossly underpredicted equivalence ratios in comparison to measured equivalence ratios, especially under rich conditions (equivalence ratio > 1.2). The practical implications of the chemiluminescence-based multivariate equivalence ratio sensing methodology are also discussed.

  14. A multivariate time series approach to modeling and forecasting demand in the emergency department.

    Science.gov (United States)

    Jones, Spencer S; Evans, R Scott; Allen, Todd L; Thomas, Alun; Haug, Peter J; Welch, Shari J; Snow, Gregory L

    2009-02-01

    The goals of this investigation were to study the temporal relationships between the demands for key resources in the emergency department (ED) and the inpatient hospital, and to develop multivariate forecasting models. Hourly data were collected from three diverse hospitals for the year 2006. Descriptive analysis and model fitting were carried out using graphical and multivariate time series methods. Multivariate models were compared to a univariate benchmark model in terms of their ability to provide out-of-sample forecasts of ED census and the demands for diagnostic resources. Descriptive analyses revealed little temporal interaction between the demand for inpatient resources and the demand for ED resources at the facilities considered. Multivariate models provided more accurate forecasts of ED census and of the demands for diagnostic resources. Our results suggest that multivariate time series models can be used to reliably forecast ED patient census; however, forecasts of the demands for diagnostic resources were not sufficiently reliable to be useful in the clinical setting.

  15. Environmental heterogeneity–species richness relationships from a global perspective

    Directory of Open Access Journals (Sweden)

    Anke Stein

    2016-01-01

    Spatial environmental heterogeneity (EH) is considered one of the most important factors promoting species richness, but no general consensus about the EH–richness relationship exists so far. This is because research methods and study settings vary widely, and because non-significant and negative associations have also been reported. My thesis provides a comprehensive review of the different measurements and terminologies of EH used in the literature, and presents strong quantitative evidence of a generally positive relationship between biotic and abiotic EH and species richness of terrestrial plants and animals from landscape to global extents. In a meta-analysis and a subsequent case study comparing multiple EH measures and their association with mammal species richness worldwide, I furthermore reveal that the outcome of EH–richness studies depends strongly on study design, including both the EH measure chosen and spatial scale. My research contributes to a better understanding of the EH–richness relationship, while identifying future research needs.

  16. Multivariable control in nuclear power stations

    International Nuclear Information System (INIS)

    Parent, M.; McMorran, P.D.

    1982-11-01

    Multivariable methods have the potential to improve the control of large systems such as nuclear power stations. Linear-quadratic optimal control is a multivariable method based on the minimization of a cost function. A related technique leads to the Kalman filter for estimation of plant state from noisy measurements. A design program for optimal control and Kalman filtering has been developed as part of a computer-aided design package for multivariable control systems. The method is demonstrated on a model of a nuclear steam generator, and simulated results are presented.
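
    The idea of estimating a plant state from noisy measurements can be shown with a minimal scalar Kalman filter. The sketch assumes a one-state model x[k+1] = a*x[k] + w with a direct measurement z[k] = x[k] + v. The function name, the parameter values, and the constant measurement sequence are hypothetical and unrelated to the steam generator model or the design program described in the report.

```python
def kalman_1d(measurements, a=1.0, q=1e-4, r=0.25, x0=0.0, p0=1.0):
    """Scalar Kalman filter (illustrative assumptions only).

    a: state transition coefficient, q: process noise variance,
    r: measurement noise variance, x0/p0: initial estimate and variance.
    Returns the filtered state estimate after each measurement.
    """
    x, p = x0, p0
    estimates = []
    for z in measurements:
        # Predict: propagate the estimate and its variance through the model.
        x = a * x
        p = a * p * a + q
        # Update: blend prediction and measurement using the Kalman gain.
        gain = p / (p + r)
        x = x + gain * (z - x)
        p = (1.0 - gain) * p
        estimates.append(x)
    return estimates

# Hypothetical data: fifty identical readings of 5.0.
estimates = kalman_1d([5.0] * 50)
```

    With these assumed settings the gain starts near 0.8, because the initial uncertainty is large compared with the measurement noise, and then falls as the estimate settles toward the measured value.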

  17. Trochanteric entry femoral nails yield better femoral version and lower revision rates-A large cohort multivariate regression analysis.

    Science.gov (United States)

    Yoon, Richard S; Gage, Mark J; Galos, David K; Donegan, Derek J; Liporace, Frank A

    2017-06-01

    Intramedullary nailing (IMN) has become the standard of care for the treatment of most femoral shaft fractures. Different IMN options include trochanteric and piriformis entry as well as retrograde nails, which may result in varying degrees of femoral rotation. The objective of this study was to analyze postoperative femoral version between three types of nails and to delineate any significant differences in femoral version (DFV) and revision rates. Over a 10-year period, 417 patients underwent IMN of a diaphyseal femur fracture (AO/OTA 32A-C). Of these patients, 316 met inclusion criteria and obtained postoperative computed tomography (CT) scanograms to calculate femoral version and were thus included in the study. In this study, our main outcome measure was the difference in femoral version (DFV) between the uninjured limb and the injured limb. The effects of the following variables on DFV and revision rates were determined via univariate, multivariate, and ordinal regression analyses: gender, age, BMI, ethnicity, mechanism of injury, operative side, open fracture, and table type/position. Statistical significance was set at p [...] regression analysis revealed that a lower BMI was significantly associated with a lower DFV (p=0.006). Controlling for possible covariables, multivariate analysis yielded a significantly lower DFV for trochanteric entry nails than piriformis or retrograde nails (7.9±6.10 vs. 9.5±7.4 vs. 9.4±7.8°, p [...] regression analysis. However, this is not to state that the other nail types exhibited abnormal DFV. The clinical impact of a few degrees of DFV is also unknown. Future in-depth studies of the intricacies of femoral version may lead to improved technology and potentially improved clinical outcomes. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Multivariate analysis in the frequency domain applied to the Laguna Verde power plant

    International Nuclear Information System (INIS)

    Castillo D, R.; Ortiz V, J.; Calleros M, G.

    2006-01-01

    Noise analysis is an auxiliary tool for detecting abnormal operating conditions of equipment, instruments, or systems that affect the dynamic behavior of the reactor. The normalized power spectral density (NPSD) has usually been used to monitor the behavior of reactor components such as the jet pumps, the recirculation pumps, and the flow control valves in the recirculation loops. A change in behavior is identified by analyzing the NPSD of each component signal individually. An alternative that can provide more information about the component under surveillance is multivariate autoregressive (MAR) analysis, which reveals the relationships among different signals of the reactor systems in the time domain. In the frequency domain, the relative power contribution (RPC) quantifies the influence of the system variables on a variable of interest. For a peak in the NPSD of a variable, the RPC therefore makes it possible to determine the influence of other variables at that frequency. In principle, this makes it easier to track the important physical parameters during an event and to study their interrelation. In this work, as an example of the application of the RPC, two events that occurred at the Laguna Verde power plant are analyzed: the high-scale rod block alarms in the average power monitors, during which a power peak of 12% peak to peak appeared, and the power oscillation event. The main result of the analysis of the control rod block alarm event was that the power peak observed in the average power monitor signals was caused by the movement of the recirculation flow control valve of loop B. For the power oscillation event, the results show the mechanism of the oscillation of
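
    For orientation, the relative power contribution described in this abstract is commonly written as follows. The notation (model order p, coefficient matrices A_k, sampling interval Δt, noise variances σ_j²) is assumed here for illustration and is not taken from the paper. The decomposition also assumes mutually uncorrelated noise sources, and the spectrum is stated up to a constant scale factor.

```latex
% MAR model for the n-dimensional signal vector x(t), with noise sources e(t)
x(t) = \sum_{k=1}^{p} A_k \, x(t-k) + e(t),
\qquad \operatorname{Cov}[e(t)] = \operatorname{diag}(\sigma_1^2, \dots, \sigma_n^2)

% Transfer matrix from the noise sources to the signals at frequency f
H(f) = \Bigl( I - \sum_{k=1}^{p} A_k \, e^{-i 2\pi f k \Delta t} \Bigr)^{-1}

% Power spectrum of signal i as a sum of contributions from each noise source j
P_{ii}(f) = \sum_{j=1}^{n} \lvert H_{ij}(f) \rvert^{2} \, \sigma_j^{2}

% Relative power contribution of variable j to signal i at frequency f
\mathrm{RPC}_{i \leftarrow j}(f) = \frac{\lvert H_{ij}(f) \rvert^{2} \, \sigma_j^{2}}{P_{ii}(f)},
\qquad \sum_{j=1}^{n} \mathrm{RPC}_{i \leftarrow j}(f) = 1
```

    Under these assumptions, a large value of the ratio at the frequency of an NPSD peak points to variable j as the dominant contributor to that peak in signal i.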

  19. The relationship between species richness and aboveground biomass in a primary Pinus kesiya forest of Yunnan, southwestern China.

    Science.gov (United States)

    Li, Shuaifeng; Lang, Xuedong; Liu, Wande; Ou, Guanglong; Xu, Hui; Su, Jianrong

    2018-01-01

    The relationship between biodiversity and biomass is an essential element of natural ecosystem functioning. Our research aims to assess the effects of species richness on aboveground biomass and the ecological driver of this relationship in a primary Pinus kesiya forest. We sampled 112 plots of primary P. kesiya forest in Yunnan Province. A general linear model and a structural equation model were used to estimate the relative effects of multivariate factors among aboveground biomass, species richness and the other explanatory variables, including climate moisture index, soil nutrient regime and stand age. Ordinary least squares regression showed a positive linear relationship between species richness and aboveground biomass. Species richness and soil nutrient regime had no significant direct effect on aboveground biomass, whereas the climate moisture index and stand age did have direct effects. The climate moisture index could be a better link to mediate the relationship between species richness and aboveground biomass: the effect of species richness on aboveground biomass was mediated by the climate moisture index. Stand age had direct effects on aboveground biomass and indirect effects through the climate moisture index. Our results revealed that the climate moisture index had a positive feedback in the relationship between species richness and aboveground biomass, which played an important role in linking biodiversity maintenance and ecosystem functioning. Meanwhile, the climate moisture index affected aboveground biomass positively, both directly and indirectly through species richness. This information should help in understanding the biodiversity-aboveground biomass relationship of a primary P. kesiya forest and in forest management.

  20. Measures of dependence for multivariate Lévy distributions

    Science.gov (United States)

    Boland, J.; Hurd, T. R.; Pivato, M.; Seco, L.

    2001-02-01

    Recent statistical analysis of a number of financial databases is summarized. Increasing agreement is found that logarithmic equity returns show a certain type of asymptotic behavior of the largest events, namely that the probability density functions have power law tails with an exponent α≈3.0. This behavior does not vary much over different stock exchanges or over time, despite large variations in trading environments. The present paper proposes a class of multivariate distributions which generalizes the observed qualities of univariate time series. A new consequence of the proposed class is the "spectral measure" which completely characterizes the multivariate dependences of the extreme tails of the distribution. This measure on the unit sphere in M-dimensions, in principle completely general, can be determined empirically by looking at extreme events. If it can be observed and determined, it will prove to be of importance for scenario generation in portfolio risk management.