WorldWideScience

Sample records for model ridge regression

  1. Normalization Ridge Regression in Practice I: Comparisons Between Ordinary Least Squares, Ridge Regression and Normalization Ridge Regression.

    Science.gov (United States)

    Bulcock, J. W.

    The problem of model estimation when the data are collinear was examined. Though the ridge regression (RR) outperforms ordinary least squares (OLS) regression in the presence of acute multicollinearity, it is not a problem free technique for reducing the variance of the estimates. It is a stochastic procedure when it should be nonstochastic and it…

  2. Model selection in kernel ridge regression

    DEFF Research Database (Denmark)

    Exterkate, Peter

    2013-01-01

    Kernel ridge regression is a technique to perform ridge regression with a potentially infinite number of nonlinear transformations of the independent variables as regressors. This method is gaining popularity as a data-rich nonlinear forecasting tool, which is applicable in many different contexts....... The influence of the choice of kernel and the setting of tuning parameters on forecast accuracy is investigated. Several popular kernels are reviewed, including polynomial kernels, the Gaussian kernel, and the Sinc kernel. The latter two kernels are interpreted in terms of their smoothing properties......, and the tuning parameters associated to all these kernels are related to smoothness measures of the prediction function and to the signal-to-noise ratio. Based on these interpretations, guidelines are provided for selecting the tuning parameters from small grids using cross-validation. A Monte Carlo study...

  3. Ridge regression estimator: combining unbiased and ordinary ridge regression methods of estimation

    Directory of Open Access Journals (Sweden)

    Sharad Damodar Gore

    2009-10-01

    Full Text Available Statistical literature has several methods for coping with multicollinearity. This paper introduces a new shrinkage estimator, called modified unbiased ridge (MUR. This estimator is obtained from unbiased ridge regression (URR in the same way that ordinary ridge regression (ORR is obtained from ordinary least squares (OLS. Properties of MUR are derived. Results on its matrix mean squared error (MMSE are obtained. MUR is compared with ORR and URR in terms of MMSE. These results are illustrated with an example based on data generated by Hoerl and Kennard (1975.

  4. Ridge Regression Signal Processing

    Science.gov (United States)

    Kuhl, Mark R.

    1990-01-01

    The introduction of the Global Positioning System (GPS) into the National Airspace System (NAS) necessitates the development of Receiver Autonomous Integrity Monitoring (RAIM) techniques. In order to guarantee a certain level of integrity, a thorough understanding of modern estimation techniques applied to navigational problems is required. The extended Kalman filter (EKF) is derived and analyzed under poor geometry conditions. It was found that the performance of the EKF is difficult to predict, since the EKF is designed for a Gaussian environment. A novel approach is implemented which incorporates ridge regression to explain the behavior of an EKF in the presence of dynamics under poor geometry conditions. The basic principles of ridge regression theory are presented, followed by the derivation of a linearized recursive ridge estimator. Computer simulations are performed to confirm the underlying theory and to provide a comparative analysis of the EKF and the recursive ridge estimator.

  5. A simulation study on Bayesian Ridge regression models for several collinearity levels

    Science.gov (United States)

    Efendi, Achmad; Effrihan

    2017-12-01

    When analyzing data with multiple regression model if there are collinearities, then one or several predictor variables are usually omitted from the model. However, there sometimes some reasons, for instance medical or economic reasons, the predictors are all important and should be included in the model. Ridge regression model is not uncommon in some researches to use to cope with collinearity. Through this modeling, weights for predictor variables are used for estimating parameters. The next estimation process could follow the concept of likelihood. Furthermore, for the estimation nowadays the Bayesian version could be an alternative. This estimation method does not match likelihood one in terms of popularity due to some difficulties; computation and so forth. Nevertheless, with the growing improvement of computational methodology recently, this caveat should not at the moment become a problem. This paper discusses about simulation process for evaluating the characteristic of Bayesian Ridge regression parameter estimates. There are several simulation settings based on variety of collinearity levels and sample sizes. The results show that Bayesian method gives better performance for relatively small sample sizes, and for other settings the method does perform relatively similar to the likelihood method.

  6. Significance testing in ridge regression for genetic data

    Directory of Open Access Journals (Sweden)

    De Iorio Maria

    2011-09-01

    Full Text Available Abstract Background Technological developments have increased the feasibility of large scale genetic association studies. Densely typed genetic markers are obtained using SNP arrays, next-generation sequencing technologies and imputation. However, SNPs typed using these methods can be highly correlated due to linkage disequilibrium among them, and standard multiple regression techniques fail with these data sets due to their high dimensionality and correlation structure. There has been increasing interest in using penalised regression in the analysis of high dimensional data. Ridge regression is one such penalised regression technique which does not perform variable selection, instead estimating a regression coefficient for each predictor variable. It is therefore desirable to obtain an estimate of the significance of each ridge regression coefficient. Results We develop and evaluate a test of significance for ridge regression coefficients. Using simulation studies, we demonstrate that the performance of the test is comparable to that of a permutation test, with the advantage of a much-reduced computational cost. We introduce the p-value trace, a plot of the negative logarithm of the p-values of ridge regression coefficients with increasing shrinkage parameter, which enables the visualisation of the change in p-value of the regression coefficients with increasing penalisation. We apply the proposed method to a lung cancer case-control data set from EPIC, the European Prospective Investigation into Cancer and Nutrition. Conclusions The proposed test is a useful alternative to a permutation test for the estimation of the significance of ridge regression coefficients, at a much-reduced computational cost. The p-value trace is an informative graphical tool for evaluating the results of a test of significance of ridge regression coefficients as the shrinkage parameter increases, and the proposed test makes its production computationally feasible.

  7. Forecasting Model for IPTV Service in Korea Using Bootstrap Ridge Regression Analysis

    Science.gov (United States)

    Lee, Byoung Chul; Kee, Seho; Kim, Jae Bum; Kim, Yun Bae

    The telecom firms in Korea are taking new step to prepare for the next generation of convergence services, IPTV. In this paper we described our analysis on the effective method for demand forecasting about IPTV broadcasting. We have tried according to 3 types of scenarios based on some aspects of IPTV potential market and made a comparison among the results. The forecasting method used in this paper is the multi generation substitution model with bootstrap ridge regression analysis.

  8. Model Selection in Kernel Ridge Regression

    DEFF Research Database (Denmark)

    Exterkate, Peter

    Kernel ridge regression is gaining popularity as a data-rich nonlinear forecasting tool, which is applicable in many different contexts. This paper investigates the influence of the choice of kernel and the setting of tuning parameters on forecast accuracy. We review several popular kernels......, including polynomial kernels, the Gaussian kernel, and the Sinc kernel. We interpret the latter two kernels in terms of their smoothing properties, and we relate the tuning parameters associated to all these kernels to smoothness measures of the prediction function and to the signal-to-noise ratio. Based...... on these interpretations, we provide guidelines for selecting the tuning parameters from small grids using cross-validation. A Monte Carlo study confirms the practical usefulness of these rules of thumb. Finally, the flexible and smooth functional forms provided by the Gaussian and Sinc kernels makes them widely...

  9. Nonlinear Forecasting With Many Predictors Using Kernel Ridge Regression

    DEFF Research Database (Denmark)

    Exterkate, Peter; Groenen, Patrick J.F.; Heij, Christiaan

    This paper puts forward kernel ridge regression as an approach for forecasting with many predictors that are related nonlinearly to the target variable. In kernel ridge regression, the observed predictor variables are mapped nonlinearly into a high-dimensional space, where estimation of the predi...

  10. The Current and Future Use of Ridge Regression for Prediction in Quantitative Genetics

    Directory of Open Access Journals (Sweden)

    Ronald de Vlaming

    2015-01-01

    Full Text Available In recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for prediction in quantitative genetics using single-nucleotide polymorphism data is discussed. In particular, we consider (i the theoretical foundations of ridge regression, (ii its link to commonly used methods in animal breeding, (iii the computational feasibility, and (iv the scope for constructing prediction models with nonlinear effects (e.g., dominance and epistasis. Based on a simulation study we gauge the current and future potential of ridge regression for prediction of human traits using genome-wide SNP data. We conclude that, for outcomes with a relatively simple genetic architecture, given current sample sizes in most cohorts (i.e., N<10,000 the predictive accuracy of ridge regression is slightly higher than the classical genome-wide association study approach of repeated simple regression (i.e., one regression per SNP. However, both capture only a small proportion of the heritability. Nevertheless, we find evidence that for large-scale initiatives, such as biobanks, sample sizes can be achieved where ridge regression compared to the classical approach improves predictive accuracy substantially.

  11. The comparison between several robust ridge regression estimators in the presence of multicollinearity and multiple outliers

    Science.gov (United States)

    Zahari, Siti Meriam; Ramli, Norazan Mohamed; Moktar, Balkiah; Zainol, Mohammad Said

    2014-09-01

    In the presence of multicollinearity and multiple outliers, statistical inference of linear regression model using ordinary least squares (OLS) estimators would be severely affected and produces misleading results. To overcome this, many approaches have been investigated. These include robust methods which were reported to be less sensitive to the presence of outliers. In addition, ridge regression technique was employed to tackle multicollinearity problem. In order to mitigate both problems, a combination of ridge regression and robust methods was discussed in this study. The superiority of this approach was examined when simultaneous presence of multicollinearity and multiple outliers occurred in multiple linear regression. This study aimed to look at the performance of several well-known robust estimators; M, MM, RIDGE and robust ridge regression estimators, namely Weighted Ridge M-estimator (WRM), Weighted Ridge MM (WRMM), Ridge MM (RMM), in such a situation. Results of the study showed that in the presence of simultaneous multicollinearity and multiple outliers (in both x and y-direction), the RMM and RIDGE are more or less similar in terms of superiority over the other estimators, regardless of the number of observation, level of collinearity and percentage of outliers used. However, when outliers occurred in only single direction (y-direction), the WRMM estimator is the most superior among the robust ridge regression estimators, by producing the least variance. In conclusion, the robust ridge regression is the best alternative as compared to robust and conventional least squares estimators when dealing with simultaneous presence of multicollinearity and outliers.

  12. The efficiency of modified jackknife and ridge type regression estimators: a comparison

    Directory of Open Access Journals (Sweden)

    Sharad Damodar Gore

    2008-09-01

    Full Text Available A common problem in multiple regression models is multicollinearity, which produces undesirable effects on the least squares estimator. To circumvent this problem, two well known estimation procedures are often suggested in the literature. They are Generalized Ridge Regression (GRR estimation suggested by Hoerl and Kennard iteb8 and the Jackknifed Ridge Regression (JRR estimation suggested by Singh et al. iteb13. The GRR estimation leads to a reduction in the sampling variance, whereas, JRR leads to a reduction in the bias. In this paper, we propose a new estimator namely, Modified Jackknife Ridge Regression Estimator (MJR. It is based on the criterion that combines the ideas underlying both the GRR and JRR estimators. We have investigated standard properties of this new estimator. From a simulation study, we find that the new estimator often outperforms the LASSO, and it is superior to both GRR and JRR estimators, using the mean squared error criterion. The conditions under which the MJR estimator is better than the other two competing estimators have been investigated.

  13. The effect of high leverage points on the logistic ridge regression estimator having multicollinearity

    Science.gov (United States)

    Ariffin, Syaiba Balqish; Midi, Habshah

    2014-06-01

    This article is concerned with the performance of logistic ridge regression estimation technique in the presence of multicollinearity and high leverage points. In logistic regression, multicollinearity exists among predictors and in the information matrix. The maximum likelihood estimator suffers a huge setback in the presence of multicollinearity which cause regression estimates to have unduly large standard errors. To remedy this problem, a logistic ridge regression estimator is put forward. It is evident that the logistic ridge regression estimator outperforms the maximum likelihood approach for handling multicollinearity. The effect of high leverage points are then investigated on the performance of the logistic ridge regression estimator through real data set and simulation study. The findings signify that logistic ridge regression estimator fails to provide better parameter estimates in the presence of both high leverage points and multicollinearity.

  14. A robust ridge regression approach in the presence of both multicollinearity and outliers in the data

    Science.gov (United States)

    Shariff, Nurul Sima Mohamad; Ferdaos, Nur Aqilah

    2017-08-01

    Multicollinearity often leads to inconsistent and unreliable parameter estimates in regression analysis. This situation will be more severe in the presence of outliers it will cause fatter tails in the error distributions than the normal distributions. The well-known procedure that is robust to multicollinearity problem is the ridge regression method. This method however is expected to be affected by the presence of outliers due to some assumptions imposed in the modeling procedure. Thus, the robust version of existing ridge method with some modification in the inverse matrix and the estimated response value is introduced. The performance of the proposed method is discussed and comparisons are made with several existing estimators namely, Ordinary Least Squares (OLS), ridge regression and robust ridge regression based on GM-estimates. The finding of this study is able to produce reliable parameter estimates in the presence of both multicollinearity and outliers in the data.

  15. A Comparative Study of Pairwise Learning Methods Based on Kernel Ridge Regression.

    Science.gov (United States)

    Stock, Michiel; Pahikkala, Tapio; Airola, Antti; De Baets, Bernard; Waegeman, Willem

    2018-06-12

    Many machine learning problems can be formulated as predicting labels for a pair of objects. Problems of that kind are often referred to as pairwise learning, dyadic prediction, or network inference problems. During the past decade, kernel methods have played a dominant role in pairwise learning. They still obtain a state-of-the-art predictive performance, but a theoretical analysis of their behavior has been underexplored in the machine learning literature. In this work we review and unify kernel-based algorithms that are commonly used in different pairwise learning settings, ranging from matrix filtering to zero-shot learning. To this end, we focus on closed-form efficient instantiations of Kronecker kernel ridge regression. We show that independent task kernel ridge regression, two-step kernel ridge regression, and a linear matrix filter arise naturally as a special case of Kronecker kernel ridge regression, implying that all these methods implicitly minimize a squared loss. In addition, we analyze universality, consistency, and spectral filtering properties. Our theoretical results provide valuable insights into assessing the advantages and limitations of existing pairwise learning methods.

  16. Local Prediction Models on Mid-Atlantic Ridge MORB by Principal Component Regression

    Science.gov (United States)

    Ling, X.; Snow, J. E.; Chin, W.

    2017-12-01

    The isotopic compositions of the daughter isotopes of long-lived radioactive systems (Sr, Nd, Hf and Pb ) can be used to map the scale and history of mantle heterogeneities beneath mid-ocean ridges. Our goal is to relate the multidimensional structure in the existing isotopic dataset with an underlying physical reality of mantle sources. The numerical technique of Principal Component Analysis is useful to reduce the linear dependence of the data to a minimum set of orthogonal eigenvectors encapsulating the information contained (cf Agranier et al 2005). The dataset used for this study covers almost all the MORBs along mid-Atlantic Ridge (MAR), from 54oS to 77oN and 8.8oW to -46.7oW, including replicating the dataset of Agranier et al., 2005 published plus 53 basalt samples dredged and analyzed since then (data from PetDB). The principal components PC1 and PC2 account for 61.56% and 29.21%, respectively, of the total isotope ratios variability. The samples with similar compositions to HIMU and EM and DM are identified to better understand the PCs. PC1 and PC2 are accountable for HIMU and EM whereas PC2 has limited control over the DM source. PC3 is more strongly controlled by the depleted mantle source than PC2. What this means is that all three principal components have a high degree of significance relevant to the established mantle sources. We also tested the relationship between mantle heterogeneity and sample locality. K-means clustering algorithm is a type of unsupervised learning to find groups in the data based on feature similarity. The PC factor scores of each sample are clustered into three groups. Cluster one and three are alternating on the north and south MAR. Cluster two exhibits on 45.18oN to 0.79oN and -27.9oW to -30.40oW alternating with cluster one. The ridge has been preliminarily divided into 16 sections considering both the clusters and ridge segments. The principal component regression models the section based on 6 isotope ratios and PCs. The

  17. Using the Ridge Regression Procedures to Estimate the Multiple Linear Regression Coefficients

    Science.gov (United States)

    Gorgees, HazimMansoor; Mahdi, FatimahAssim

    2018-05-01

    This article concerns with comparing the performance of different types of ordinary ridge regression estimators that have been already proposed to estimate the regression parameters when the near exact linear relationships among the explanatory variables is presented. For this situations we employ the data obtained from tagi gas filling company during the period (2008-2010). The main result we reached is that the method based on the condition number performs better than other methods since it has smaller mean square error (MSE) than the other stated methods.

  18. Ridge regression for predicting elastic moduli and hardness of calcium aluminosilicate glasses

    Science.gov (United States)

    Deng, Yifan; Zeng, Huidan; Jiang, Yejia; Chen, Guorong; Chen, Jianding; Sun, Luyi

    2018-03-01

    It is of great significance to design glasses with satisfactory mechanical properties predictively through modeling. Among various modeling methods, data-driven modeling is such a reliable approach that can dramatically shorten research duration, cut research cost and accelerate the development of glass materials. In this work, the ridge regression (RR) analysis was used to construct regression models for predicting the compositional dependence of CaO-Al2O3-SiO2 glass elastic moduli (Shear, Bulk, and Young’s moduli) and hardness based on the ternary diagram of the compositions. The property prediction over a large glass composition space was accomplished with known experimental data of various compositions in the literature, and the simulated results are in good agreement with the measured ones. This regression model can serve as a facile and effective tool for studying the relationship between the compositions and the property, enabling high-efficient design of glasses to meet the requirements for specific elasticity and hardness.

  19. The current and future use of ridge regression for prediction in quantitative genetics

    OpenAIRE

    Vlaming, Ronald; Groenen, Patrick

    2015-01-01

    textabstractIn recent years, there has been a considerable amount of research on the use of regularization methods for inference and prediction in quantitative genetics. Such research mostly focuses on selection of markers and shrinkage of their effects. In this review paper, the use of ridge regression for prediction in quantitative genetics using single-nucleotide polymorphism data is discussed. In particular, we consider (i) the theoretical foundations of ridge regression, (ii) its link to...

  20. An application of robust ridge regression model in the presence of outliers to real data problem

    Science.gov (United States)

    Shariff, N. S. Md.; Ferdaos, N. A.

    2017-09-01

    Multicollinearity and outliers are often leads to inconsistent and unreliable parameter estimates in regression analysis. The well-known procedure that is robust to multicollinearity problem is the ridge regression method. This method however is believed are affected by the presence of outlier. The combination of GM-estimation and ridge parameter that is robust towards both problems is on interest in this study. As such, both techniques are employed to investigate the relationship between stock market price and macroeconomic variables in Malaysia due to curiosity of involving the multicollinearity and outlier problem in the data set. There are four macroeconomic factors selected for this study which are Consumer Price Index (CPI), Gross Domestic Product (GDP), Base Lending Rate (BLR) and Money Supply (M1). The results demonstrate that the proposed procedure is able to produce reliable results towards the presence of multicollinearity and outliers in the real data.

  1. A generalization of voxel-wise procedures for highdimensional statistical inference using ridge regression

    DEFF Research Database (Denmark)

    Sjöstrand, Karl; Cardenas, Valerie A.; Larsen, Rasmus

    2008-01-01

    regression to address this issue, allowing for a gradual introduction of correlation information into the model. We make the connections between ridge regression and voxel-wise procedures explicit and discuss relations to other statistical methods. Results are given on an in-vivo data set of deformation......Whole-brain morphometry denotes a group of methods with the aim of relating clinical and cognitive measurements to regions of the brain. Typically, such methods require the statistical analysis of a data set with many variables (voxels and exogenous variables) paired with few observations (subjects...

  2. IMPROVING CORRELATION FUNCTION FITTING WITH RIDGE REGRESSION: APPLICATION TO CROSS-CORRELATION RECONSTRUCTION

    International Nuclear Information System (INIS)

    Matthews, Daniel J.; Newman, Jeffrey A.

    2012-01-01

    Cross-correlation techniques provide a promising avenue for calibrating photometric redshifts and determining redshift distributions using spectroscopy which is systematically incomplete (e.g., current deep spectroscopic surveys fail to obtain secure redshifts for 30%-50% or more of the galaxies targeted). In this paper, we improve on the redshift distribution reconstruction methods from our previous work by incorporating full covariance information into our correlation function fits. Correlation function measurements are strongly covariant between angular or spatial bins, and accounting for this in fitting can yield substantial reduction in errors. However, frequently the covariance matrices used in these calculations are determined from a relatively small set (dozens rather than hundreds) of subsamples or mock catalogs, resulting in noisy covariance matrices whose inversion is ill-conditioned and numerically unstable. We present here a method of conditioning the covariance matrix known as ridge regression which results in a more well behaved inversion than other techniques common in large-scale structure studies. We demonstrate that ridge regression significantly improves the determination of correlation function parameters. We then apply these improved techniques to the problem of reconstructing redshift distributions. By incorporating full covariance information, applying ridge regression, and changing the weighting of fields in obtaining average correlation functions, we obtain reductions in the mean redshift distribution reconstruction error of as much as ∼40% compared to previous methods. We provide a description of POWERFIT, an IDL code for performing power-law fits to correlation functions with ridge regression conditioning that we are making publicly available.

  3. Discriminating between adaptive and carcinogenic liver hypertrophy in rat studies using logistic ridge regression analysis of toxicogenomic data: The mode of action and predictive models

    Energy Technology Data Exchange (ETDEWEB)

    Liu, Shujie; Kawamoto, Taisuke; Morita, Osamu [R& D, Safety Science Research, Kao Corporation, Tochigi (Japan); Yoshinari, Kouichi [Department of Molecular Toxicology, School of Pharmaceutical Sciences, University of Shizuoka, Shizuoka (Japan); Honda, Hiroshi, E-mail: honda.hiroshi@kao.co.jp [R& D, Safety Science Research, Kao Corporation, Tochigi (Japan)

    2017-03-01

    Chemical exposure often results in liver hypertrophy in animal tests, characterized by increased liver weight, hepatocellular hypertrophy, and/or cell proliferation. While most of these changes are considered adaptive responses, there is concern that they may be associated with carcinogenesis. In this study, we have employed a toxicogenomic approach using a logistic ridge regression model to identify genes responsible for liver hypertrophy and hypertrophic hepatocarcinogenesis and to develop a predictive model for assessing hypertrophy-inducing compounds. Logistic regression models have previously been used in the quantification of epidemiological risk factors. DNA microarray data from the Toxicogenomics Project-Genomics Assisted Toxicity Evaluation System were used to identify hypertrophy-related genes that are expressed differently in hypertrophy induced by carcinogens and non-carcinogens. Data were collected for 134 chemicals (72 non-hypertrophy-inducing chemicals, 27 hypertrophy-inducing non-carcinogenic chemicals, and 15 hypertrophy-inducing carcinogenic compounds). After applying logistic ridge regression analysis, 35 genes for liver hypertrophy (e.g., Acot1 and Abcc3) and 13 genes for hypertrophic hepatocarcinogenesis (e.g., Asns and Gpx2) were selected. The predictive models built using these genes were 94.8% and 82.7% accurate, respectively. Pathway analysis of the genes indicates that, aside from a xenobiotic metabolism-related pathway as an adaptive response for liver hypertrophy, amino acid biosynthesis and oxidative responses appear to be involved in hypertrophic hepatocarcinogenesis. Early detection and toxicogenomic characterization of liver hypertrophy using our models may be useful for predicting carcinogenesis. In addition, the identified genes provide novel insight into discrimination between adverse hypertrophy associated with carcinogenesis and adaptive hypertrophy in risk assessment. - Highlights: • Hypertrophy (H) and hypertrophic

  4. Discriminating between adaptive and carcinogenic liver hypertrophy in rat studies using logistic ridge regression analysis of toxicogenomic data: The mode of action and predictive models

    International Nuclear Information System (INIS)

    Liu, Shujie; Kawamoto, Taisuke; Morita, Osamu; Yoshinari, Kouichi; Honda, Hiroshi

    2017-01-01

    Chemical exposure often results in liver hypertrophy in animal tests, characterized by increased liver weight, hepatocellular hypertrophy, and/or cell proliferation. While most of these changes are considered adaptive responses, there is concern that they may be associated with carcinogenesis. In this study, we have employed a toxicogenomic approach using a logistic ridge regression model to identify genes responsible for liver hypertrophy and hypertrophic hepatocarcinogenesis and to develop a predictive model for assessing hypertrophy-inducing compounds. Logistic regression models have previously been used in the quantification of epidemiological risk factors. DNA microarray data from the Toxicogenomics Project-Genomics Assisted Toxicity Evaluation System were used to identify hypertrophy-related genes that are expressed differently in hypertrophy induced by carcinogens and non-carcinogens. Data were collected for 134 chemicals (72 non-hypertrophy-inducing chemicals, 27 hypertrophy-inducing non-carcinogenic chemicals, and 15 hypertrophy-inducing carcinogenic compounds). After applying logistic ridge regression analysis, 35 genes for liver hypertrophy (e.g., Acot1 and Abcc3) and 13 genes for hypertrophic hepatocarcinogenesis (e.g., Asns and Gpx2) were selected. The predictive models built using these genes were 94.8% and 82.7% accurate, respectively. Pathway analysis of the genes indicates that, aside from a xenobiotic metabolism-related pathway as an adaptive response for liver hypertrophy, amino acid biosynthesis and oxidative responses appear to be involved in hypertrophic hepatocarcinogenesis. Early detection and toxicogenomic characterization of liver hypertrophy using our models may be useful for predicting carcinogenesis. In addition, the identified genes provide novel insight into discrimination between adverse hypertrophy associated with carcinogenesis and adaptive hypertrophy in risk assessment. - Highlights: • Hypertrophy (H) and hypertrophic

  5. Interval ridge regression (iRR) as a fast and robust method for quantitative prediction and variable selection applied to edible oil adulteration.

    Science.gov (United States)

    Jović, Ozren; Smrečki, Neven; Popović, Zora

    2016-04-01

    A novel quantitative prediction and variable selection method called interval ridge regression (iRR) is studied in this work. The method is performed on six data sets of FTIR, two data sets of UV-vis and one data set of DSC. The obtained results show that models built with ridge regression on optimal variables selected with iRR significantly outperfom models built with ridge regression on all variables in both calibration (6 out of 9 cases) and validation (2 out of 9 cases). In this study, iRR is also compared with interval partial least squares regression (iPLS). iRR outperfomed iPLS in validation (insignificantly in 6 out of 9 cases and significantly in one out of 9 cases for poil, a well known health beneficial nutrient, is studied in this work by mixing it with cheap and widely used oils such as soybean (So) oil, rapeseed (R) oil and sunflower (Su) oil. Binary mixture sets of hempseed oil with these three oils (HSo, HR and HSu) and a ternary mixture set of H oil, R oil and Su oil (HRSu) were considered. The obtained accuracy indicates that using iRR on FTIR and UV-vis data, each particular oil can be very successfully quantified (in all 8 cases RMSEPoil (R(2)>0.99). Copyright © 2015 Elsevier B.V. All rights reserved.

  6. Using Ridge Regression Models to Estimate Grain Yield from Field Spectral Data in Bread Wheat (Triticum Aestivum L. Grown under Three Water Regimes

    Directory of Open Access Journals (Sweden)

    Javier Hernandez

    2015-02-01

    Full Text Available Plant breeding based on grain yield (GY is an expensive and time-consuming method, so new indirect estimation techniques to evaluate the performance of crops represent an alternative method to improve grain yield. The present study evaluated the ability of canopy reflectance spectroscopy at the range from 350 to 2500 nm to predict GY in a large panel (368 genotypes of wheat (Triticum aestivum L. through multivariate ridge regression models. Plants were treated under three water regimes in the Mediterranean conditions of central Chile: severe water stress (SWS, rain fed, mild water stress (MWS; one irrigation event around booting and full irrigation (FI with mean GYs of 1655, 4739, and 7967 kg∙ha−1, respectively. Models developed from reflectance data during anthesis and grain filling under all water regimes explained between 77% and 91% of the GY variability, with the highest values in SWS condition. When individual models were used to predict yield in the rest of the trials assessed, models fitted during anthesis under MWS performed best. Combined models using data from different water regimes and each phenological stage were used to predict grain yield, and the coefficients of determination (R2 increased to 89.9% and 92.0% for anthesis and grain filling, respectively. The model generated during anthesis in MWS was the best at predicting yields when it was applied to other conditions. Comparisons against conventional reflectance indices were made, showing lower predictive abilities. It was concluded that a Ridge Regression Model using a data set based on spectral reflectance at anthesis or grain filling represents an effective method to predict grain yield in genotypes under different water regimes.

  7. Real estate value prediction using multivariate regression models

    Science.gov (United States)

    Manjula, R.; Jain, Shubham; Srivastava, Sharad; Rajiv Kher, Pranav

    2017-11-01

    The real estate market is one of the most competitive in terms of pricing and the same tends to vary significantly based on a lot of factors, hence it becomes one of the prime fields to apply the concepts of machine learning to optimize and predict the prices with high accuracy. Therefore in this paper, we present various important features to use while predicting housing prices with good accuracy. We have described regression models, using various features to have lower Residual Sum of Squares error. While using features in a regression model some feature engineering is required for better prediction. Often a set of features (multiple regressions) or polynomial regression (applying a various set of powers in the features) is used for making better model fit. For these models are expected to be susceptible towards over fitting ridge regression is used to reduce it. This paper thus directs to the best application of regression models in addition to other techniques to optimize the result.

  8. Comparison between linear and non-parametric regression models for genome-enabled prediction in wheat.

    Science.gov (United States)

    Pérez-Rodríguez, Paulino; Gianola, Daniel; González-Camacho, Juan Manuel; Crossa, José; Manès, Yann; Dreisigacker, Susanne

    2012-12-01

    In genome-enabled prediction, parametric, semi-parametric, and non-parametric regression models have been used. This study assessed the predictive ability of linear and non-linear models using dense molecular markers. The linear models were linear on marker effects and included the Bayesian LASSO, Bayesian ridge regression, Bayes A, and Bayes B. The non-linear models (this refers to non-linearity on markers) were reproducing kernel Hilbert space (RKHS) regression, Bayesian regularized neural networks (BRNN), and radial basis function neural networks (RBFNN). These statistical models were compared using 306 elite wheat lines from CIMMYT genotyped with 1717 diversity array technology (DArT) markers and two traits, days to heading (DTH) and grain yield (GY), measured in each of 12 environments. It was found that the three non-linear models had better overall prediction accuracy than the linear regression specification. Results showed a consistent superiority of RKHS and RBFNN over the Bayesian LASSO, Bayesian ridge regression, Bayes A, and Bayes B models.

  9. Discriminating between adaptive and carcinogenic liver hypertrophy in rat studies using logistic ridge regression analysis of toxicogenomic data: The mode of action and predictive models.

    Science.gov (United States)

    Liu, Shujie; Kawamoto, Taisuke; Morita, Osamu; Yoshinari, Kouichi; Honda, Hiroshi

    2017-03-01

    Chemical exposure often results in liver hypertrophy in animal tests, characterized by increased liver weight, hepatocellular hypertrophy, and/or cell proliferation. While most of these changes are considered adaptive responses, there is concern that they may be associated with carcinogenesis. In this study, we have employed a toxicogenomic approach using a logistic ridge regression model to identify genes responsible for liver hypertrophy and hypertrophic hepatocarcinogenesis and to develop a predictive model for assessing hypertrophy-inducing compounds. Logistic regression models have previously been used in the quantification of epidemiological risk factors. DNA microarray data from the Toxicogenomics Project-Genomics Assisted Toxicity Evaluation System were used to identify hypertrophy-related genes that are expressed differently in hypertrophy induced by carcinogens and non-carcinogens. Data were collected for 134 chemicals (72 non-hypertrophy-inducing chemicals, 27 hypertrophy-inducing non-carcinogenic chemicals, and 15 hypertrophy-inducing carcinogenic compounds). After applying logistic ridge regression analysis, 35 genes for liver hypertrophy (e.g., Acot1 and Abcc3) and 13 genes for hypertrophic hepatocarcinogenesis (e.g., Asns and Gpx2) were selected. The predictive models built using these genes were 94.8% and 82.7% accurate, respectively. Pathway analysis of the genes indicates that, aside from a xenobiotic metabolism-related pathway as an adaptive response for liver hypertrophy, amino acid biosynthesis and oxidative responses appear to be involved in hypertrophic hepatocarcinogenesis. Early detection and toxicogenomic characterization of liver hypertrophy using our models may be useful for predicting carcinogenesis. In addition, the identified genes provide novel insight into discrimination between adverse hypertrophy associated with carcinogenesis and adaptive hypertrophy in risk assessment. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Variational Ridging in Sea Ice Models

    Science.gov (United States)

    Roberts, A.; Hunke, E. C.; Lipscomb, W. H.; Maslowski, W.; Kamal, S.

    2017-12-01

    This work presents the results of a new development to make basin-scale sea ice models aware of the shape, porosity and extent of individual ridges within the pack. We have derived an analytic solution for the Euler-Lagrange equation of individual ridges that accounts for non-conservative forces, and therefore the compressive strength of individual ridges. Because a region of the pack is simply a collection of paths of individual ridges, we are able to solve the Euler-Lagrange equation for a large-scale sea ice field also, and therefore the compressive strength of a region of the pack that explicitly accounts for the macro-porosity of ridged debris. We make a number of assumptions that have simplified the problem, such as treating sea ice as a granular material in ridges, and assuming that bending moments associated with ridging are perturbations around an isostatic state. Regardless of these simplifications, the ridge model is remarkably predictive of macro-porosity and ridge shape, and, because our equations are analytic, they do not require costly computations to solve the Euler-Lagrange equation of ridges on the large scale. The new ridge model is therefore applicable to large-scale sea ice models. We present results from this theoretical development, as well as plans to apply it to the Regional Arctic System Model and a community sea ice code. Most importantly, the new ridging model is particularly useful for pinpointing gaps in our observational record of sea ice ridges, and points to the need for improved measurements of the evolution of porosity of deformed ice in the Arctic and Antarctic. Such knowledge is not only useful for improving models, but also for improving estimates of sea ice volume derived from altimetric measurements of sea ice freeboard.

  11. DRREP: deep ridge regressed epitope predictor.

    Science.gov (United States)

    Sher, Gene; Zhi, Degui; Zhang, Shaojie

    2017-10-03

    The ability to predict epitopes plays an enormous role in vaccine development in terms of our ability to zero in on where to do a more thorough in-vivo analysis of the protein in question. Though for the past decade there have been numerous advancements and improvements in epitope prediction, on average the best benchmark prediction accuracies are still only around 60%. New machine learning algorithms have arisen within the domain of deep learning, text mining, and convolutional networks. This paper presents a novel analytically trained and string kernel using deep neural network, which is tailored for continuous epitope prediction, called: Deep Ridge Regressed Epitope Predictor (DRREP). DRREP was tested on long protein sequences from the following datasets: SARS, Pellequer, HIV, AntiJen, and SEQ194. DRREP was compared to numerous state of the art epitope predictors, including the most recently published predictors called LBtope and DMNLBE. Using area under ROC curve (AUC), DRREP achieved a performance improvement over the best performing predictors on SARS (13.7%), HIV (8.9%), Pellequer (1.5%), and SEQ194 (3.1%), with its performance being matched only on the AntiJen dataset, by the LBtope predictor, where both DRREP and LBtope achieved an AUC of 0.702. DRREP is an analytically trained deep neural network, thus capable of learning in a single step through regression. By combining the features of deep learning, string kernels, and convolutional networks, the system is able to perform residue-by-residue prediction of continues epitopes with higher accuracy than the current state of the art predictors.

  12. Graphical evaluation of the ridge-type robust regression estimators in mixture experiments.

    Science.gov (United States)

    Erkoc, Ali; Emiroglu, Esra; Akay, Kadri Ulas

    2014-01-01

    In mixture experiments, estimation of the parameters is generally based on ordinary least squares (OLS). However, in the presence of multicollinearity and outliers, OLS can result in very poor estimates. In this case, effects due to the combined outlier-multicollinearity problem can be reduced to certain extent by using alternative approaches. One of these approaches is to use biased-robust regression techniques for the estimation of parameters. In this paper, we evaluate various ridge-type robust estimators in the cases where there are multicollinearity and outliers during the analysis of mixture experiments. Also, for selection of biasing parameter, we use fraction of design space plots for evaluating the effect of the ridge-type robust estimators with respect to the scaled mean squared error of prediction. The suggested graphical approach is illustrated on Hald cement data set.

  13. Prediction of CO2 Emission in China’s Power Generation Industry with Gauss Optimized Cuckoo Search Algorithm and Wavelet Neural Network Based on STIRPAT model with Ridge Regression

    Directory of Open Access Journals (Sweden)

    Weibo Zhao

    2017-12-01

    Full Text Available Power generation industry is the key industry of carbon dioxide (CO2 emission in China. Assessing its future CO2 emissions is of great significance to the formulation and implementation of energy saving and emission reduction policies. Based on the Stochastic Impacts by Regression on Population, Affluence and Technology model (STIRPAT, the influencing factors analysis model of CO2 emission of power generation industry is established. The ridge regression (RR method is used to estimate the historical data. In addition, a wavelet neural network (WNN prediction model based on Cuckoo Search algorithm optimized by Gauss (GCS is put forward to predict the factors in the STIRPAT model. Then, the predicted values are substituted into the regression model, and the CO2 emission estimation values of the power generation industry in China are obtained. It’s concluded that population, per capita Gross Domestic Product (GDP, standard coal consumption and thermal power specific gravity are the key factors affecting the CO2 emission from the power generation industry. Besides, the GCS-WNN prediction model has higher prediction accuracy, comparing with other models. Moreover, with the development of science and technology in the future, the CO2 emission growth in the power generation industry will gradually slow down according to the prediction results.

  14. Ridge Regression and Other Kernels for Genomic Selection with R Package rrBLUP

    Directory of Open Access Journals (Sweden)

    Jeffrey B. Endelman

    2011-11-01

    Full Text Available Many important traits in plant breeding are polygenic and therefore recalcitrant to traditional marker-assisted selection. Genomic selection addresses this complexity by including all markers in the prediction model. A key method for the genomic prediction of breeding values is ridge regression (RR, which is equivalent to best linear unbiased prediction (BLUP when the genetic covariance between lines is proportional to their similarity in genotype space. This additive model can be broadened to include epistatic effects by using other kernels, such as the Gaussian, which represent inner products in a complex feature space. To facilitate the use of RR and nonadditive kernels in plant breeding, a new software package for R called rrBLUP has been developed. At its core is a fast maximum-likelihood algorithm for mixed models with a single variance component besides the residual error, which allows for efficient prediction with unreplicated training data. Use of the rrBLUP software is demonstrated through several examples, including the identification of optimal crosses based on superior progeny value. In cross-validation tests, the prediction accuracy with nonadditive kernels was significantly higher than RR for wheat ( L. grain yield but equivalent for several maize ( L. traits.

  15. Intelligent optimization models based on hard-ridge penalty and RBF for forecasting global solar radiation

    International Nuclear Information System (INIS)

    Jiang, He; Dong, Yao; Wang, Jianzhou; Li, Yuqin

    2015-01-01

    Highlights: • CS-hard-ridge-RBF and DE-hard-ridge-RBF are proposed to forecast solar radiation. • Pearson and Apriori algorithm are used to analyze correlations between the data. • Hard-ridge penalty is added to reduce the number of nodes in the hidden layer. • CS algorithm and DE algorithm are used to determine the optimal parameters. • Proposed two models have higher forecasting accuracy than RBF and hard-ridge-RBF. - Abstract: Due to the scarcity of equipment and the high costs of maintenance, far fewer observations of solar radiation are made than observations of temperature, precipitation and other weather factors. Therefore, it is increasingly important to study several relevant meteorological factors to accurately forecast solar radiation. For this research, monthly average global solar radiation and 12 meteorological parameters from 1998 to 2010 at four sites in the United States were collected. Pearson correlation coefficients and Apriori association rules were successfully used to analyze correlations between the data, which provided a basis for these relative parameters as input variables. Two effective and innovative methods were developed to forecast monthly average global solar radiation by converting a RBF neural network into a multiple linear regression problem, adding a hard-ridge penalty to reduce the number of nodes in the hidden layer, and applying intelligent optimization algorithms, such as the cuckoo search algorithm (CS) and differential evolution (DE), to determine the optimal center and scale parameters. The experimental results show that the proposed models produce much more accurate forecasts than other models

  16. Modified Regression Correlation Coefficient for Poisson Regression Model

    Science.gov (United States)

    Kaengthong, Nattacha; Domthong, Uthumporn

    2017-09-01

    This study gives attention to indicators in predictive power of the Generalized Linear Model (GLM) which are widely used; however, often having some restrictions. We are interested in regression correlation coefficient for a Poisson regression model. This is a measure of predictive power, and defined by the relationship between the dependent variable (Y) and the expected value of the dependent variable given the independent variables [E(Y|X)] for the Poisson regression model. The dependent variable is distributed as Poisson. The purpose of this research was modifying regression correlation coefficient for Poisson regression model. We also compare the proposed modified regression correlation coefficient with the traditional regression correlation coefficient in the case of two or more independent variables, and having multicollinearity in independent variables. The result shows that the proposed regression correlation coefficient is better than the traditional regression correlation coefficient based on Bias and the Root Mean Square Error (RMSE).

  17. A Solution to Separation and Multicollinearity in Multiple Logistic Regression.

    Science.gov (United States)

    Shen, Jianzhao; Gao, Sujuan

    2008-10-01

    In dementia screening tests, item selection for shortening an existing screening test can be achieved using multiple logistic regression. However, maximum likelihood estimates for such logistic regression models often experience serious bias or even non-existence because of separation and multicollinearity problems resulting from a large number of highly correlated items. Firth (1993, Biometrika, 80(1), 27-38) proposed a penalized likelihood estimator for generalized linear models and it was shown to reduce bias and the non-existence problems. The ridge regression has been used in logistic regression to stabilize the estimates in cases of multicollinearity. However, neither solves the problems for each other. In this paper, we propose a double penalized maximum likelihood estimator combining Firth's penalized likelihood equation with a ridge parameter. We present a simulation study evaluating the empirical performance of the double penalized likelihood estimator in small to moderate sample sizes. We demonstrate the proposed approach using a current screening data from a community-based dementia study.

  18. Investigating the effects of climate variations on bacillary dysentery incidence in northeast China using ridge regression and hierarchical cluster analysis

    Directory of Open Access Journals (Sweden)

    Guo Junqiao

    2008-09-01

    Full Text Available Abstract Background The effects of climate variations on bacillary dysentery incidence have gained more recent concern. However, the multi-collinearity among meteorological factors affects the accuracy of correlation with bacillary dysentery incidence. Methods As a remedy, a modified method to combine ridge regression and hierarchical cluster analysis was proposed for investigating the effects of climate variations on bacillary dysentery incidence in northeast China. Results All weather indicators, temperatures, precipitation, evaporation and relative humidity have shown positive correlation with the monthly incidence of bacillary dysentery, while air pressure had a negative correlation with the incidence. Ridge regression and hierarchical cluster analysis showed that during 1987–1996, relative humidity, temperatures and air pressure affected the transmission of the bacillary dysentery. During this period, all meteorological factors were divided into three categories. Relative humidity and precipitation belonged to one class, temperature indexes and evaporation belonged to another class, and air pressure was the third class. Conclusion Meteorological factors have affected the transmission of bacillary dysentery in northeast China. Bacillary dysentery prevention and control would benefit from by giving more consideration to local climate variations.

  19. Multiattribute shopping models and ridge regression analysis

    NARCIS (Netherlands)

    Timmermans, H.J.P.

    1981-01-01

    Policy decisions regarding retailing facilities essentially involve multiple attributes of shopping centres. If mathematical shopping models are to contribute to these decision processes, their structure should reflect the multiattribute character of retailing planning. Examination of existing

  20. Some improved classification-based ridge parameter of Hoerl and ...

    African Journals Online (AJOL)

    Some improved classification-based ridge parameter of Hoerl and Kennard estimation techniques. ... This assumption is often violated and Ridge Regression estimator introduced by [2]has been identified to be more efficient than ordinary least square (OLS) in handling it. However, it requires a ridge parameter, K, of which ...

  1. On the mean squared error of the ridge estimator of the covariance and precision matrix

    NARCIS (Netherlands)

    van Wieringen, Wessel N.

    2017-01-01

    For a suitably chosen ridge penalty parameter, the ridge regression estimator uniformly dominates the maximum likelihood regression estimator in terms of the mean squared error. Analogous results for the ridge maximum likelihood estimators of covariance and precision matrix are presented.

  2. Modeling of flow through fractured tuff at Fran Ridge

    International Nuclear Information System (INIS)

    Eaton, R.R.; Ho, C.K.; Glass, R.J.; Nicholl, M.J.; Arnold, B.W.

    1996-01-01

    Numerical studies have modeled an infiltration experiment at Fran Ridge, using the TOUGH2 code, to aid in the selection of computational models for waste repository performance assessment. This study investigates the capabilities of TOUGH2 to simulate transient flows through highly fractured tuff, and provides a possible means of calibrating hydrologic parameters such as effective fracture aperture and fracture-matrix connectivity. Two distinctly different conceptual models were used in the TOUGH2 code, the dual permeability model and the equivalent continuum model. The field experiments involved the infiltration of dyed ponded water in highly fractured tuff. The infiltration observed in the experiment was subsequently modeled using Fran Ridge fracture frequencies, obtained during post-experiment site excavation. Comparison of the TOUGH2 results obtained using the two conceptual models gives insight into their relative strengths and weaknesses

  3. Testing models for the formation of the equatorial ridge on Iapetus via crater counting

    Science.gov (United States)

    Damptz, Amanda L.; Dombard, Andrew J.; Kirchoff, Michelle R.

    2018-03-01

    Iapetus's equatorial ridge, visible in global views of the moon, is unique in the Solar System. The formation of this feature is likely attributed to a key event in the evolution of Iapetus, and various models have been proposed as the source of the ridge. By surveying imagery from the Cassini and Voyager missions, this study aims to compile a database of the impact crater population on and around Iapetus's equatorial ridge, assess the relative age of the ridge from differences in cratering between on ridge and off ridge, and test the various models of ridge formation. This work presents a database that contains 7748 craters ranging from 0.83 km to 591 km in diameter. The database includes the study area in which the crater is located, the latitude and longitude of the crater, the major and minor axis lengths, and the azimuthal angle of orientation of the major axis. Analysis of crater orientation over the entire study area reveals that there is no preference for long-axis orientation, particularly in the area with the highest resolution. Comparison of the crater size-frequency distributions show that the crater distribution on the ridge appears to be depleted in craters larger than 16 km with an abruptly enhanced crater population less than 16 km in diameter up to saturation. One possible interpretation is that the ridge is a relatively younger surface with an enhanced small impactor population. Finally, the compiled results are used to examine each ridge formation hypothesis. Based on these results, a model of ridge formation via a tidally disrupted sub-satellite appears most consistent with our interpretation of a younger ridge with an enhanced small impactor population.

  4. Output-Only Modal Parameter Recursive Estimation of Time-Varying Structures via a Kernel Ridge Regression FS-TARMA Approach

    Directory of Open Access Journals (Sweden)

    Zhi-Sai Ma

    2017-01-01

    Full Text Available Modal parameter estimation plays an important role in vibration-based damage detection and is worth more attention and investigation, as changes in modal parameters are usually being used as damage indicators. This paper focuses on the problem of output-only modal parameter recursive estimation of time-varying structures based upon parameterized representations of the time-dependent autoregressive moving average (TARMA. A kernel ridge regression functional series TARMA (FS-TARMA recursive identification scheme is proposed and subsequently employed for the modal parameter estimation of a numerical three-degree-of-freedom time-varying structural system and a laboratory time-varying structure consisting of a simply supported beam and a moving mass sliding on it. The proposed method is comparatively assessed against an existing recursive pseudolinear regression FS-TARMA approach via Monte Carlo experiments and shown to be capable of accurately tracking the time-varying dynamics in a recursive manner.

  5. Extracting valley-ridge lines from point-cloud-based 3D fingerprint models.

    Science.gov (United States)

    Pang, Xufang; Song, Zhan; Xie, Wuyuan

    2013-01-01

    3D fingerprinting is an emerging technology with the distinct advantage of touchless operation. More important, 3D fingerprint models contain more biometric information than traditional 2D fingerprint images. However, current approaches to fingerprint feature detection usually must transform the 3D models to a 2D space through unwrapping or other methods, which might introduce distortions. A new approach directly extracts valley-ridge features from point-cloud-based 3D fingerprint models. It first applies the moving least-squares method to fit a local paraboloid surface and represent the local point cloud area. It then computes the local surface's curvatures and curvature tensors to facilitate detection of the potential valley and ridge points. The approach projects those points to the most likely valley-ridge lines, using statistical means such as covariance analysis and cross correlation. To finally extract the valley-ridge lines, it grows the polylines that approximate the projected feature points and removes the perturbations between the sampled points. Experiments with different 3D fingerprint models demonstrate this approach's feasibility and performance.

  6. Exploring tectonomagmatic controls on mid-ocean ridge faulting and morphology with 3-D numerical models

    Science.gov (United States)

    Howell, S. M.; Ito, G.; Behn, M. D.; Olive, J. A. L.; Kaus, B.; Popov, A.; Mittelstaedt, E. L.; Morrow, T. A.

    2016-12-01

    Previous two-dimensional (2-D) modeling studies of abyssal-hill scale fault generation and evolution at mid-ocean ridges have predicted that M, the ratio of magmatic to total extension, strongly influences the total slip, spacing, and rotation of large faults, as well as the morphology of the ridge axis. Scaling relations derived from these 2-D models broadly explain the globally observed decrease in abyssal hill spacing with increasing ridge spreading rate, as well as the formation of large-offset faults close to the ends of slow-spreading ridge segments. However, these scaling relations do not explain some higher resolution observations of segment-scale variability in fault spacing along the Chile Ridge and the Mid-Atlantic Ridge, where fault spacing shows no obvious correlation with M. This discrepancy between observations and 2-D model predictions illuminates the need for three-dimensional (3-D) numerical models that incorporate the effects of along-axis variations in lithospheric structure and magmatic accretion. To this end, we use the geodynamic modeling software LaMEM to simulate 3-D tectono-magmatic interactions in a visco-elasto-plastic lithosphere under extension. We model a single ridge segment subjected to an along-axis gradient in the rate of magma injection, which is simulated by imposing a mass source in a plane of model finite volumes beneath the ridge axis. Outputs of interest include characteristic fault offset, spacing, and along-axis gradients in seafloor morphology. We also examine the effects of along-axis variations in lithospheric thickness and off-axis thickening rate. The main objectives of this study are to quantify the relative importance of the amount of magmatic extension and the local lithospheric structure at a given along-axis location, versus the importance of along-axis communication of lithospheric stresses on the 3-D fault evolution and morphology of intermediate-spreading-rate ridges.

  7. Global Models of Ridge-Push Force, Geoid, and Lithospheric Strength of Oceanic plates

    Science.gov (United States)

    Mahatsente, Rezene

    2017-12-01

    An understanding of the transmission of ridge-push related stresses in the interior of oceanic plates is important because ridge-push force is one of the principal forces driving plate motion. Here, I assess the transmission of ridge-push related stresses in oceanic plates by comparing the magnitude of the ridge-push force to the integrated strength of oceanic plates. The strength is determined based on plate cooling and rheological models. The strength analysis includes low-temperature plasticity (LTP) in the upper mantle and assumes a range of possible tectonic conditions and rheology in the plates. The ridge-push force has been derived from the thermal state of oceanic lithosphere, seafloor depth and crustal age data. The results of modeling show that the transmission of ridge-push related stresses in oceanic plates mainly depends on rheology and predominant tectonic conditions. If a lithosphere has dry rheology, the estimated strength is higher than the ridge-push force at all ages for compressional tectonics and at old ages (>75 Ma) for extension. Therefore, under such conditions, oceanic plates may not respond to ridge-push force by intraplate deformation. Instead, the plates may transmit the ridge-push related stress in their interior. For a wet rheology, however, the strength of young lithosphere (stress may dissipate in the interior of oceanic plates and diffuses by intraplate deformation. The state of stress within a plate depends on the balance of far-field and intraplate forces.

  8. THE STATISTICAL MODEL OF PRESSURE RIDGE MORPHOMETRY ON THE NORTHEAST SHELF OF SAKHALIN ISLAND

    Directory of Open Access Journals (Sweden)

    E. U. Mironov

    2012-01-01

    Full Text Available The work presents characteristics on geometry and inner structure of ice ridges investigated at offshore the northeast coast of SakhalinIsland. A formula was obtained which allows one to calculate the ice ridge keel depth by the height of its sail. Plots of the probability distribution density for ice ridge characteristics are given. A model of morphometry of a mean statistical ice ridge was constructed, and its mass is determined. Factors influencing the hydrostatic ice ridge equilibrium are considered.

  9. Europan double ridge morphometry as a test of formation models

    Science.gov (United States)

    Dameron, Ashley C.; Burr, Devon M.

    2018-05-01

    Double ridges on the Jovian satellite Europa consist of two parallel ridges with a central trough. Although these features are nearly ubiquitous on Europa, their formation mechanism(s) is (are) not yet well-understood. Previous hypotheses for their formation can be divided into two groups based on 1) the expected interior slope angles and 2) the magnitude of interior/exterior slope symmetry. The published hypotheses in the first ("fracture") group entail brittle deformation of the crust, either by diapirism, shear heating, or buckling due to compression. Because these mechanisms imply uplift of near-vertical fractures, their predicted interior slopes are steeper than the angle of repose (AOR) with shallower exterior slopes. The second ("flow") group includes cryosedimentary and cryovolcanic processes - explosive or effusive cryovolcanism and tidal squeezing -, which are predicted to form ridge slopes at or below the AOR. Explosive cryovolcanism would form self-symmetric ridges, whereas effusive cryolavas and cryo-sediments deposited during tidal squeezing would likely not exhibit slope symmetry. To distinguish between these two groups of hypothesized formation mechanisms, we derived measurements of interior slope angle and interior/exterior slope symmetry at multiple locations on Europa through analysis of data from the Galileo Solid State Imaging (SSI) camera. Two types of data were used: i) elevation data from five stereo-pair digital elevation models (DEMs) covering four ridges (580 individual measurements), and ii) ridge shadow length measurements taken on individual images over 40 ridges (200 individual measurements). Our results shows that slopes measured on our DEMs, located in the Cilix and Banded Plains regions, typically fall below the AOR, and slope symmetry is dominant. Two different shadow measurement techniques implemented to calculate interior slopes yielded slope angles that also fall below the AOR. The shallow interior slopes derived from both

  10. Learning a peptide-protein binding affinity predictor with kernel ridge regression

    Science.gov (United States)

    2013-01-01

    Background The cellular function of a vast majority of proteins is performed through physical interactions with other biomolecules, which, most of the time, are other proteins. Peptides represent templates of choice for mimicking a secondary structure in order to modulate protein-protein interaction. They are thus an interesting class of therapeutics since they also display strong activity, high selectivity, low toxicity and few drug-drug interactions. Furthermore, predicting peptides that would bind to a specific MHC alleles would be of tremendous benefit to improve vaccine based therapy and possibly generate antibodies with greater affinity. Modern computational methods have the potential to accelerate and lower the cost of drug and vaccine discovery by selecting potential compounds for testing in silico prior to biological validation. Results We propose a specialized string kernel for small bio-molecules, peptides and pseudo-sequences of binding interfaces. The kernel incorporates physico-chemical properties of amino acids and elegantly generalizes eight kernels, comprised of the Oligo, the Weighted Degree, the Blended Spectrum, and the Radial Basis Function. We provide a low complexity dynamic programming algorithm for the exact computation of the kernel and a linear time algorithm for it’s approximation. Combined with kernel ridge regression and SupCK, a novel binding pocket kernel, the proposed kernel yields biologically relevant and good prediction accuracy on the PepX database. For the first time, a machine learning predictor is capable of predicting the binding affinity of any peptide to any protein with reasonable accuracy. The method was also applied to both single-target and pan-specific Major Histocompatibility Complex class II benchmark datasets and three Quantitative Structure Affinity Model benchmark datasets. Conclusion On all benchmarks, our method significantly (p-value ≤ 0.057) outperforms the current state-of-the-art methods at predicting

  11. Patch behaviour and predictability properties of modelled finite-amplitude sand ridges on the inner shelf

    Directory of Open Access Journals (Sweden)

    N. C. Vis-Star

    2008-12-01

    Full Text Available The long-term evolution of shoreface-connected sand ridges is investigated with a nonlinear spectral model which governs the dynamics of waves, currents, sediment transport and the bed level on the inner shelf. Wave variables are calculated with a shoaling-refraction model instead of using a parameterisation. The spectral model describes the time evolution of amplitudes of known eigenmodes of the linearised system. Bottom pattern formation occurs if the transverse bottom slope of the inner shelf, β, exceeds a critical value βc. For fixed model parameters the sensitivity of the properties of modelled sand ridges to changes in the number (N−1 of resolved subharmonics (of the initially fastest growing mode is investigated. For any N the model shows the growth and subsequent saturation of the height of the sand ridges. The saturation time scale is several thousands of years, which suggests that observed sand ridges have not reached their saturated stage yet. The migration speed of the ridges and the average longshore spacing between successive crests in the saturated state differ from those in the initial state. Analysis of the potential energy balance of the ridges reveals that bed slope-induced sediment transport is crucial for the saturation process. In the transient stage the shoreface-connected ridges occur in patches. The overall characteristics of the bedforms (saturation time, final maximum height, average longshore spacing, migration speed hardly vary with N. However, individual time series of modal amplitudes and bottom patterns strongly depend on N, thereby implying that the detailed evolution of sand ridges can only be predicted over a limited time interval. Additional experiments show that the critical bed slope βc increases with larger offshore angles of wave incidence, larger offshore wave heights and longer wave periods, and that the corresponding maximum height of the ridges

  12. The Collinearity Free and Bias Reduced Regression Estimation Project: The Theory of Normalization Ridge Regression. Report No. 2.

    Science.gov (United States)

    Bulcock, J. W.; And Others

    Multicollinearity refers to the presence of highly intercorrelated independent variables in structural equation models, that is, models estimated by using techniques such as least squares regression and maximum likelihood. There is a problem of multicollinearity in both the natural and social sciences where theory formulation and estimation is in…

  13. Contaminant transport model validation: The Oak Ridge Reservation

    International Nuclear Information System (INIS)

    Lee, R.R.; Ketelle, R.H.

    1988-09-01

    In the complex geologic setting on the Oak Ridge Reservation, hydraulic conductivity is anisotropic and flow is strongly influenced by an extensive and largely discontinuous fracture network. Difficulties in describing and modeling the aquifer system prompted a study to obtain aquifer property data to be used in a groundwater flow model validation experiment. Characterization studies included the performance of an extensive suite of aquifer test within a 600-square-meter area to obtain aquifer property values to describe the flow field in detail. Following aquifer test, a groundwater tracer test was performed under ambient conditions to verify the aquifer analysis. Tracer migration data in the near-field were used in model calibration to predict tracer arrival time and concentration in the far-field. Despite the extensive aquifer testing, initial modeling inaccurately predicted tracer migration direction. Initial tracer migration rates were consistent with those predicted by the model; however, changing environmental conditions resulted in an unanticipated decay in tracer movement. Evaluation of the predictive accuracy of groundwater flow and contaminant transport models on the Oak Ridge Reservation depends on defining the resolution required, followed by field testing and model grid definition at compatible scales. The use of tracer tests, both as a characterization method and to verify model results, provides the highest level of resolution of groundwater flow characteristics. 3 refs., 4 figs

  14. Regression modeling of ground-water flow

    Science.gov (United States)

    Cooley, R.L.; Naff, R.L.

    1985-01-01

    Nonlinear multiple regression methods are developed to model and analyze groundwater flow systems. Complete descriptions of regression methodology as applied to groundwater flow models allow scientists and engineers engaged in flow modeling to apply the methods to a wide range of problems. Organization of the text proceeds from an introduction that discusses the general topic of groundwater flow modeling, to a review of basic statistics necessary to properly apply regression techniques, and then to the main topic: exposition and use of linear and nonlinear regression to model groundwater flow. Statistical procedures are given to analyze and use the regression models. A number of exercises and answers are included to exercise the student on nearly all the methods that are presented for modeling and statistical analysis. Three computer programs implement the more complex methods. These three are a general two-dimensional, steady-state regression model for flow in an anisotropic, heterogeneous porous medium, a program to calculate a measure of model nonlinearity with respect to the regression parameters, and a program to analyze model errors in computed dependent variables such as hydraulic head. (USGS)

  15. Regression Models for Market-Shares

    DEFF Research Database (Denmark)

    Birch, Kristina; Olsen, Jørgen Kai; Tjur, Tue

    2005-01-01

    On the background of a data set of weekly sales and prices for three brands of coffee, this paper discusses various regression models and their relation to the multiplicative competitive-interaction model (the MCI model, see Cooper 1988, 1993) for market-shares. Emphasis is put on the interpretat......On the background of a data set of weekly sales and prices for three brands of coffee, this paper discusses various regression models and their relation to the multiplicative competitive-interaction model (the MCI model, see Cooper 1988, 1993) for market-shares. Emphasis is put...... on the interpretation of the parameters in relation to models for the total sales based on discrete choice models.Key words and phrases. MCI model, discrete choice model, market-shares, price elasitcity, regression model....

  16. Two-dimensional chronostratigraphic modelling of OSL ages from recent beach-ridge deposits, SE Australia

    DEFF Research Database (Denmark)

    Tamura, Toru; Cunningham, Alastair C.; Oliver, Thomas S.N.

    2018-01-01

    Optically-stimulated luminesecne (OSL) dating, in concert with two-dimensional ground-penetrating radar (GPR) profiling, has contributed to significant advances in our understanding of beach-ridge systems and other sedimentary landforms in various settings. For recent beach-ridges, the good OSL...... samples may be larger than the difference in sample ages. Age inversions can be avoided, however, if the stratigraphic constraints are included in the age estimation process. Here, we create a custom Bayesian chronological model for a recent (..., for direct comparison with a GPR profile. The model includes a full ‘burial-dose model’ for each sample and a dose rate term with the modelled ages constrained by the vertical and shore-normal sample order. The modelled ages are visualized by plotting isochrones on the beach-ridge cross section...

  17. A Seemingly Unrelated Poisson Regression Model

    OpenAIRE

    King, Gary

    1989-01-01

    This article introduces a new estimator for the analysis of two contemporaneously correlated endogenous event count variables. This seemingly unrelated Poisson regression model (SUPREME) estimator combines the efficiencies created by single equation Poisson regression model estimators and insights from "seemingly unrelated" linear regression models.

  18. Panel Smooth Transition Regression Models

    DEFF Research Database (Denmark)

    González, Andrés; Terasvirta, Timo; Dijk, Dick van

    We introduce the panel smooth transition regression model. This new model is intended for characterizing heterogeneous panels, allowing the regression coefficients to vary both across individuals and over time. Specifically, heterogeneity is allowed for by assuming that these coefficients are bou...

  19. Interpretation of commonly used statistical regression models.

    Science.gov (United States)

    Kasza, Jessica; Wolfe, Rory

    2014-01-01

    A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.

  20. Three-dimensional modeling of flow through fractured tuff at Fran Ridge

    International Nuclear Information System (INIS)

    Eaton, R.R.; Ho, C.K.; Glass, RJ.; Nicholl, M.J.; Arnold, B.W.

    1996-09-01

    Numerical studies have been made of an infiltration experiment at Fran Ridge using the TOUGH2 code to aid in the selection of computational models for performance assessment. The exercise investigates the capabilities of TOUGH2 to model transient flows through highly fractured tuff and provides a possible means of calibration. Two distinctly different conceptual models were used in the TOUGH2 code, the dual permeability model and the equivalent continuum model. The infiltration test modeled involved the infiltration of dyed ponded water for 36 minutes. The 205 gallon infiltration of water observed in the experiment was subsequently modeled using measured Fran Ridge fracture frequencies, and a specified fracture aperture of 285 microm. The dual permeability formulation predicted considerable infiltration along the fracture network, which was in agreement with the experimental observations. As expected, al fracture penetration of the infiltrating water was calculated using the equivalent continuum model, thus demonstrating that this model is not appropriate for modeling the highly transient experiment. It is therefore recommended that the dual permeability model be given priority when computing high-flux infiltration for use in performance assessment studies

  1. Three-dimensional modeling of flow through fractured tuff at Fran Ridge

    International Nuclear Information System (INIS)

    Eaton, R.R.; Ho, C.K.; Glass, R.J.; Nicholl, M.J.; Arnold, B.W.

    1996-01-01

    Numerical studies have been made of an infiltration experiment at Fran Ridge using the TOUGH2 code to aid in the selection of computational models for performance assessment. The exercise investigates the capabilities of TOUGH2 to model transient flows through highly fractured tuff and provides a possible means of calibration. Two distinctly different conceptual models were used in the TOUGH2 code, the dual permeability model and the equivalent continuum model. The infiltration test modeled involved the infiltration of dyed ponded water for 36 minutes. The 205 gallon filtration of water observed in the experiment was subsequently modeled using measured Fran Ridge fracture frequencies, and a specified fracture aperture of 285 μm. The dual permeability formulation predicted considerable infiltration along the fracture network, which was in agreement with the experimental observations. As expected, minimal fracture penetration of the infiltrating water was calculated using the equivalent continuum model, thus demonstrating that this model is not appropriate for modeling the highly transient experiment. It is therefore recommended that the dual permeability model be given priority when computing high-flux infiltration for use in performance assessment studies

  2. Poisson Mixture Regression Models for Heart Disease Prediction.

    Science.gov (United States)

    Mufudza, Chipo; Erol, Hamza

    2016-01-01

    Early heart disease control can be achieved by high disease prediction and diagnosis efficiency. This paper focuses on the use of model based clustering techniques to predict and diagnose heart disease via Poisson mixture regression models. Analysis and application of Poisson mixture regression models is here addressed under two different classes: standard and concomitant variable mixture regression models. Results show that a two-component concomitant variable Poisson mixture regression model predicts heart disease better than both the standard Poisson mixture regression model and the ordinary general linear Poisson regression model due to its low Bayesian Information Criteria value. Furthermore, a Zero Inflated Poisson Mixture Regression model turned out to be the best model for heart prediction over all models as it both clusters individuals into high or low risk category and predicts rate to heart disease componentwise given clusters available. It is deduced that heart disease prediction can be effectively done by identifying the major risks componentwise using Poisson mixture regression model.

  3. Poisson Mixture Regression Models for Heart Disease Prediction

    Science.gov (United States)

    Erol, Hamza

    2016-01-01

    Early heart disease control can be achieved by high disease prediction and diagnosis efficiency. This paper focuses on the use of model based clustering techniques to predict and diagnose heart disease via Poisson mixture regression models. Analysis and application of Poisson mixture regression models is here addressed under two different classes: standard and concomitant variable mixture regression models. Results show that a two-component concomitant variable Poisson mixture regression model predicts heart disease better than both the standard Poisson mixture regression model and the ordinary general linear Poisson regression model due to its low Bayesian Information Criteria value. Furthermore, a Zero Inflated Poisson Mixture Regression model turned out to be the best model for heart prediction over all models as it both clusters individuals into high or low risk category and predicts rate to heart disease componentwise given clusters available. It is deduced that heart disease prediction can be effectively done by identifying the major risks componentwise using Poisson mixture regression model. PMID:27999611

  4. Effects of sea level rise on the formation and drowning of shoreface-connected sand ridges, a model study

    Science.gov (United States)

    Nnafie, A.; de Swart, H. E.; Calvete, D.; Garnier, R.

    2014-06-01

    Shoreface-connected sand ridges occur on many storm-dominated inner shelves. These rhythmic features have an along-shelf spacing of 2-10 km, a height of 1-12 m, they evolve on timescales of centuries and they migrate several meters per year. An idealized model is used to study the impact of sea level rise on the characteristics of the sand ridges during their initial and long-term evolution. Different scenarios (rates of sea level rise, geometry of inner shelf) are examined. Results show that with increasing sea level the height of sand ridges increases and their migration decreases until they eventually drown. This latter occurs when the near-bed wave orbital velocity drops below the critical velocity for erosion of sediment. In contrast, in the absence of sea level rise, the model simulates shoreface-connected sand ridges with constant heights and migration rates. Model results furthermore indicate that sand ridges do not form if the rate of sea level rise is too high, or if the initial depth of the inner shelf is too small. A larger transverse bottom slope enhances growth and height of sand ridges and they drown quicker. When shoreface retreat due to sea level rise is considered, new ridges form in the landward part of the inner shelf, while ridges on the antecedent part of the shelf become less active and ultimately drown. Only if sea level rise is accounted for, merging of ridges is reduced such that multiple ridges occur in the end state, thereby yielding a better agreement with observations. The physical mechanisms responsible for these findings are also explained.

  5. Comparison of some biased estimation methods (including ordinary subset regression) in the linear model

    Science.gov (United States)

    Sidik, S. M.

    1975-01-01

    Ridge, Marquardt's generalized inverse, shrunken, and principal components estimators are discussed in terms of the objectives of point estimation of parameters, estimation of the predictive regression function, and hypothesis testing. It is found that as the normal equations approach singularity, more consideration must be given to estimable functions of the parameters as opposed to estimation of the full parameter vector; that biased estimators all introduce constraints on the parameter space; that adoption of mean squared error as a criterion of goodness should be independent of the degree of singularity; and that ordinary least-squares subset regression is the best overall method.

  6. Mixture of Regression Models with Single-Index

    OpenAIRE

    Xiang, Sijia; Yao, Weixin

    2016-01-01

    In this article, we propose a class of semiparametric mixture regression models with single-index. We argue that many recently proposed semiparametric/nonparametric mixture regression models can be considered special cases of the proposed model. However, unlike existing semiparametric mixture regression models, the new pro- posed model can easily incorporate multivariate predictors into the nonparametric components. Backfitting estimates and the corresponding algorithms have been proposed for...

  7. An Additive-Multiplicative Cox-Aalen Regression Model

    DEFF Research Database (Denmark)

    Scheike, Thomas H.; Zhang, Mei-Jie

    2002-01-01

    Aalen model; additive risk model; counting processes; Cox regression; survival analysis; time-varying effects......Aalen model; additive risk model; counting processes; Cox regression; survival analysis; time-varying effects...

  8. [From clinical judgment to linear regression model.

    Science.gov (United States)

    Palacios-Cruz, Lino; Pérez, Marcela; Rivas-Ruiz, Rodolfo; Talavera, Juan O

    2013-01-01

    When we think about mathematical models, such as linear regression model, we think that these terms are only used by those engaged in research, a notion that is far from the truth. Legendre described the first mathematical model in 1805, and Galton introduced the formal term in 1886. Linear regression is one of the most commonly used regression models in clinical practice. It is useful to predict or show the relationship between two or more variables as long as the dependent variable is quantitative and has normal distribution. Stated in another way, the regression is used to predict a measure based on the knowledge of at least one other variable. Linear regression has as it's first objective to determine the slope or inclination of the regression line: Y = a + bx, where "a" is the intercept or regression constant and it is equivalent to "Y" value when "X" equals 0 and "b" (also called slope) indicates the increase or decrease that occurs when the variable "x" increases or decreases in one unit. In the regression line, "b" is called regression coefficient. The coefficient of determination (R 2 ) indicates the importance of independent variables in the outcome.

  9. Regression models of reactor diagnostic signals

    International Nuclear Information System (INIS)

    Vavrin, J.

    1989-01-01

    The application is described of an autoregression model as the simplest regression model of diagnostic signals in experimental analysis of diagnostic systems, in in-service monitoring of normal and anomalous conditions and their diagnostics. The method of diagnostics is described using a regression type diagnostic data base and regression spectral diagnostics. The diagnostics is described of neutron noise signals from anomalous modes in the experimental fuel assembly of a reactor. (author)

  10. Forecasting with Dynamic Regression Models

    CERN Document Server

    Pankratz, Alan

    2012-01-01

    One of the most widely used tools in statistical forecasting, single equation regression models is examined here. A companion to the author's earlier work, Forecasting with Univariate Box-Jenkins Models: Concepts and Cases, the present text pulls together recent time series ideas and gives special attention to possible intertemporal patterns, distributed lag responses of output to input series and the auto correlation patterns of regression disturbance. It also includes six case studies.

  11. Categorical regression dose-response modeling

    Science.gov (United States)

    The goal of this training is to provide participants with training on the use of the U.S. EPA’s Categorical Regression soft¬ware (CatReg) and its application to risk assessment. Categorical regression fits mathematical models to toxicity data that have been assigned ord...

  12. Mixed-effects regression models in linguistics

    CERN Document Server

    Heylen, Kris; Geeraerts, Dirk

    2018-01-01

    When data consist of grouped observations or clusters, and there is a risk that measurements within the same group are not independent, group-specific random effects can be added to a regression model in order to account for such within-group associations. Regression models that contain such group-specific random effects are called mixed-effects regression models, or simply mixed models. Mixed models are a versatile tool that can handle both balanced and unbalanced datasets and that can also be applied when several layers of grouping are present in the data; these layers can either be nested or crossed.  In linguistics, as in many other fields, the use of mixed models has gained ground rapidly over the last decade. This methodological evolution enables us to build more sophisticated and arguably more realistic models, but, due to its technical complexity, also introduces new challenges. This volume brings together a number of promising new evolutions in the use of mixed models in linguistics, but also addres...

  13. Moderation analysis using a two-level regression model.

    Science.gov (United States)

    Yuan, Ke-Hai; Cheng, Ying; Maxwell, Scott

    2014-10-01

    Moderation analysis is widely used in social and behavioral research. The most commonly used model for moderation analysis is moderated multiple regression (MMR) in which the explanatory variables of the regression model include product terms, and the model is typically estimated by least squares (LS). This paper argues for a two-level regression model in which the regression coefficients of a criterion variable on predictors are further regressed on moderator variables. An algorithm for estimating the parameters of the two-level model by normal-distribution-based maximum likelihood (NML) is developed. Formulas for the standard errors (SEs) of the parameter estimates are provided and studied. Results indicate that, when heteroscedasticity exists, NML with the two-level model gives more efficient and more accurate parameter estimates than the LS analysis of the MMR model. When error variances are homoscedastic, NML with the two-level model leads to essentially the same results as LS with the MMR model. Most importantly, the two-level regression model permits estimating the percentage of variance of each regression coefficient that is due to moderator variables. When applied to data from General Social Surveys 1991, NML with the two-level model identified a significant moderation effect of race on the regression of job prestige on years of education while LS with the MMR model did not. An R package is also developed and documented to facilitate the application of the two-level model.

  14. FEAST 3.1: finite-element modeling of sheath deformation such as longitudinal ridging and collapse into axial gap

    Energy Technology Data Exchange (ETDEWEB)

    Wang, X.; Xu, Z.; Kim, Y-S.; Lai, L.; Cheng, G.; Xu, S. [Atomic Energy of Canada Limited, Mississauga, Ontario (Canada)

    2010-07-01

    During normal operation, the collapsible CANDU® fuel sheath deforms, especially, it may deform into longitudinal ridges or collapse instantaneously into the axial gaps between the end pellet and endcap or between two neighbouring pellets. These phenomena occur under certain conditions, such as the coolant pressure exceeding critical pressures for longitudinal ridging or axial collapse. Both longitudinal ridging and axial collapse phenomena result from plastic instability in the sheath under coolant pressure. Longitudinal ridging features one or multiple lobes or 'ridges' (outward from the sheath surface) formed along the sheath in the longitudinal direction. Axial collapse features a 'valley' around the sheath circumference. Both phenomena can lead to sheath overstrain, which in turn potentially leads to sheath failure. The LONGER code, which contains empirical correlations, has been used to predict the critical pressures for these two sheath deformation phenomena. To study fuel behaviour outside of the application ranges of the LONGER empirical correlations, a mechanistic model is needed. FEAST (Finite Element Analysis for Stresses) is an AECL computer code used to assess the structural integrity of the CANDU fuel element. The FEAST code has recently been developed (to Version 3.1) to model processes occurring during longitudinal ridge formation and instantaneous collapse into the axial gap. The new models include those for geometric non-linearity (large deformation, large material rotation), non-linear stress-strain curve for plastic deformation, Zr-4 sheath creep law, and variable Young’s Modulus etc. This paper describes the mechanistic model (FEAST 3.1) development for analyses of longitudinal ridging and instantaneous collapse into axial gap, and the comparison with the results from empirical correlations in LONGER. (author)

  15. FEAST 3.1: finite-element modeling of sheath deformation such as longitudinal ridging and collapse into axial gap

    International Nuclear Information System (INIS)

    Wang, X.; Xu, Z.; Kim, Y-S.; Lai, L.; Cheng, G.; Xu, S.

    2010-01-01

    During normal operation, the collapsible CANDU® fuel sheath deforms, especially, it may deform into longitudinal ridges or collapse instantaneously into the axial gaps between the end pellet and endcap or between two neighbouring pellets. These phenomena occur under certain conditions, such as the coolant pressure exceeding critical pressures for longitudinal ridging or axial collapse. Both longitudinal ridging and axial collapse phenomena result from plastic instability in the sheath under coolant pressure. Longitudinal ridging features one or multiple lobes or 'ridges' (outward from the sheath surface) formed along the sheath in the longitudinal direction. Axial collapse features a 'valley' around the sheath circumference. Both phenomena can lead to sheath overstrain, which in turn potentially leads to sheath failure. The LONGER code, which contains empirical correlations, has been used to predict the critical pressures for these two sheath deformation phenomena. To study fuel behaviour outside of the application ranges of the LONGER empirical correlations, a mechanistic model is needed. FEAST (Finite Element Analysis for Stresses) is an AECL computer code used to assess the structural integrity of the CANDU fuel element. The FEAST code has recently been developed (to Version 3.1) to model processes occurring during longitudinal ridge formation and instantaneous collapse into the axial gap. The new models include those for geometric non-linearity (large deformation, large material rotation), non-linear stress-strain curve for plastic deformation, Zr-4 sheath creep law, and variable Young’s Modulus etc. This paper describes the mechanistic model (FEAST 3.1) development for analyses of longitudinal ridging and instantaneous collapse into axial gap, and the comparison with the results from empirical correlations in LONGER. (author)

  16. A comparison of regression algorithms for wind speed forecasting at Alexander Bay

    CSIR Research Space (South Africa)

    Botha, Nicolene

    2016-12-01

    Full Text Available to forecast 1 to 24 hours ahead, in hourly intervals. Predictions are performed on a wind speed time series with three machine learning regression algorithms, namely support vector regression, ordinary least squares and Bayesian ridge regression. The resulting...

  17. Variable selection and model choice in geoadditive regression models.

    Science.gov (United States)

    Kneib, Thomas; Hothorn, Torsten; Tutz, Gerhard

    2009-06-01

    Model choice and variable selection are issues of major concern in practical regression analyses, arising in many biometric applications such as habitat suitability analyses, where the aim is to identify the influence of potentially many environmental conditions on certain species. We describe regression models for breeding bird communities that facilitate both model choice and variable selection, by a boosting algorithm that works within a class of geoadditive regression models comprising spatial effects, nonparametric effects of continuous covariates, interaction surfaces, and varying coefficients. The major modeling components are penalized splines and their bivariate tensor product extensions. All smooth model terms are represented as the sum of a parametric component and a smooth component with one degree of freedom to obtain a fair comparison between the model terms. A generic representation of the geoadditive model allows us to devise a general boosting algorithm that automatically performs model choice and variable selection.

  18. Regression tools for CO2 inversions: application of a shrinkage estimator to process attribution

    International Nuclear Information System (INIS)

    Shaby, Benjamin A.; Field, Christopher B.

    2006-01-01

    In this study we perform an atmospheric inversion based on a shrinkage estimator. This method is used to estimate surface fluxes of CO 2 , first partitioned according to constituent geographic regions, and then according to constituent processes that are responsible for the total flux. Our approach differs from previous approaches in two important ways. The first is that the technique of linear Bayesian inversion is recast as a regression problem. Seen as such, standard regression tools are employed to analyse and reduce errors in the resultant estimates. A shrinkage estimator, which combines standard ridge regression with the linear 'Bayesian inversion' model, is introduced. This method introduces additional bias into the model with the aim of reducing variance such that errors are decreased overall. Compared with standard linear Bayesian inversion, the ridge technique seems to reduce both flux estimation errors and prediction errors. The second divergence from previous studies is that instead of dividing the world into geographically distinct regions and estimating the CO 2 flux in each region, the flux space is divided conceptually into processes that contribute to the total global flux. Formulating the problem in this manner adds to the interpretability of the resultant estimates and attempts to shed light on the problem of attributing sources and sinks to their underlying mechanisms

  19. The MIDAS Touch: Mixed Data Sampling Regression Models

    OpenAIRE

    Ghysels, Eric; Santa-Clara, Pedro; Valkanov, Rossen

    2004-01-01

    We introduce Mixed Data Sampling (henceforth MIDAS) regression models. The regressions involve time series data sampled at different frequencies. Technically speaking MIDAS models specify conditional expectations as a distributed lag of regressors recorded at some higher sampling frequencies. We examine the asymptotic properties of MIDAS regression estimation and compare it with traditional distributed lag models. MIDAS regressions have wide applicability in macroeconomics and �nance.

  20. Waste management systems model for energy systems sites on the Oak Ridge Reservation

    International Nuclear Information System (INIS)

    Rodgers, B.R.; Nehls, J.W. Jr.; Rivera, A.L.; Pechin, W.H.; Genung, R.K.

    1987-01-01

    The Oak Ridge Model (ORM) is a DOE/Oak Ridge Operations (DOE/ORO) initiative which involves regulators and the public at the problem definition stage to attempt to reach a consensus on acceptable approaches to the solution of waste management problems. Once the approaches have been defined, the private sector participates in identifying and demonstrating the technologies to be employed. The Waste Management Systems Model (WSM), a major part of the ORM, is discussed in this paper. It can be generically described as employing a number of computer models to: 1. determine waste management requirements (e.g., capabilities and capacities); 2. develop scenarios that respond to changes in requirements and that evaluate alternatives as they become available; 3. compare these scenarios technically, operationally, and financially; and 4. select the most promising technologies for demonstration. 12 figures

  1. A deep structural ridge beneath central India

    Science.gov (United States)

    Agrawal, P. K.; Thakur, N. K.; Negi, J. G.

    A joint-inversion of magnetic satellite (MAGSAT) and free air gravity data has been conducted to quantitatively investigate the cause for Bouguer gravity anomaly over Central Indian plateaus and possible fold consequences beside Himalayan zone in the Indian sub-continent due to collision between Indian and Eurasian plates. The appropriate inversion with 40 km crustal depth model has delineated after discriminating high density and magnetisation models, for the first time, about 1500 km long hidden ridge structure trending NW-SE. The structure is parallel to Himalayan fold axis and the Indian Ocean ridge in the Arabian Sea. A quantitative relief model across a representative anomaly profile confirms the ridge structure with its highest point nearly 6 km higher than the surrounding crustal level in peninsular India. The ridge structure finds visible support from the astro-geoidal contours.

  2. Introduction to the use of regression models in epidemiology.

    Science.gov (United States)

    Bender, Ralf

    2009-01-01

    Regression modeling is one of the most important statistical techniques used in analytical epidemiology. By means of regression models the effect of one or several explanatory variables (e.g., exposures, subject characteristics, risk factors) on a response variable such as mortality or cancer can be investigated. From multiple regression models, adjusted effect estimates can be obtained that take the effect of potential confounders into account. Regression methods can be applied in all epidemiologic study designs so that they represent a universal tool for data analysis in epidemiology. Different kinds of regression models have been developed in dependence on the measurement scale of the response variable and the study design. The most important methods are linear regression for continuous outcomes, logistic regression for binary outcomes, Cox regression for time-to-event data, and Poisson regression for frequencies and rates. This chapter provides a nontechnical introduction to these regression models with illustrating examples from cancer research.

  3. Development of a hydrological model for simulation of runoff from catchments unbounded by ridge lines

    Science.gov (United States)

    Vema, Vamsikrishna; Sudheer, K. P.; Chaubey, I.

    2017-08-01

    Watershed hydrological models are effective tools for simulating the hydrological processes in the watershed. Although there are a plethora of hydrological models, none of them can be directly applied to make water conservation decisions in irregularly bounded areas that do not confirm to topographically defined ridge lines. This study proposes a novel hydrological model that can be directly applied to any catchment, with or without ridge line boundaries. The model is based on the water balance concept, and a linear function concept to approximate the cross-boundary flow from upstream areas to the administrative catchment under consideration. The developed model is tested in 2 watersheds - Riesel Experimental Watershed and a sub-basin of Cedar Creek Watershed in Texas, USA. Hypothetical administrative catchments that did not confirm to the location of ridge lines were considered for verifying the efficacy of the model for hydrologic simulations. The linear function concept used to account the cross boundary flow was based on the hypothesis that the flow coming from outside the boundary to administrative area was proportional to the flow generated in the boundary grid cell. The model performance was satisfactory with an NSE and r2 of ≥0.80 and a PBIAS of administrative catchments of the watersheds were in good agreement with the observed hydrographs, indicating a satisfactory performance of the model in the administratively bounded areas.

  4. Model-based Quantile Regression for Discrete Data

    KAUST Repository

    Padellini, Tullia

    2018-04-10

    Quantile regression is a class of methods voted to the modelling of conditional quantiles. In a Bayesian framework quantile regression has typically been carried out exploiting the Asymmetric Laplace Distribution as a working likelihood. Despite the fact that this leads to a proper posterior for the regression coefficients, the resulting posterior variance is however affected by an unidentifiable parameter, hence any inferential procedure beside point estimation is unreliable. We propose a model-based approach for quantile regression that considers quantiles of the generating distribution directly, and thus allows for a proper uncertainty quantification. We then create a link between quantile regression and generalised linear models by mapping the quantiles to the parameter of the response variable, and we exploit it to fit the model with R-INLA. We extend it also in the case of discrete responses, where there is no 1-to-1 relationship between quantiles and distribution\\'s parameter, by introducing continuous generalisations of the most common discrete variables (Poisson, Binomial and Negative Binomial) to be exploited in the fitting.

  5. Origin of lavas from the Ninetyeast Ridge, Eastern Indian Ocean: An evaluation of fractional crystallization models

    Energy Technology Data Exchange (ETDEWEB)

    Ludden, J.N.; Thompson, G.; Bryan, W.B.; Frey, F.A.

    1980-08-10

    Ferrobasalts from DSDP sites 214 and 216 on the Ninetyeast Ridge are characterized by high absolute iron (FeO>12.9 wt %), FeO/MgO>1.9, and TiO/sub 2/>2.0 wt %. Their trace element abundances indicate a tholeiitic affinity; however, they are distinct from midocean ridge incompatible element-depleted tholeiites owing to higher contents of Ba, Zr, and Sr and flat to slightly light-REE-enriched, chondrite-normalized REE patterns. Calculations using major and trace element abundances and phase compositions are generally consistent with a model relating most major elements and phase compositions in site 214 and 216 ferrobasalts by fractionation of clinopyroxene and plagio-class. However, some incompatible element abundances for site 216 basalts are not consistent with the fractional crystallization models. Baslats from site 214 can be related to andesitic rocks from the same site by fractionating clinopyroxene, plagioclase and titanomagnetite. Site 254 basalts, at the southern end of the Ninetyeast Ridge, and island tholeiites in the southern Indian Ocean (Amsterdam-St. Paul or Kerguelen-Heard volcanic provinces) possibly represent the most recent activity associated with a hot spot forming the Ninetyeast Ridge. These incompatible-element-enriched tholeiites have major element compositions consistent with those expected for a parental liquid for the site 214 and 216 ferrobasalts. However, differences in the trace element contents of the basalts from the Ninetyeast Ridge sites are not consistent with simple fractional crystallization derivation but require either a complex melting model or a heterogeneous mantle source.

  6. Variable importance in latent variable regression models

    NARCIS (Netherlands)

    Kvalheim, O.M.; Arneberg, R.; Bleie, O.; Rajalahti, T.; Smilde, A.K.; Westerhuis, J.A.

    2014-01-01

    The quality and practical usefulness of a regression model are a function of both interpretability and prediction performance. This work presents some new graphical tools for improved interpretation of latent variable regression models that can also assist in improved algorithms for variable

  7. A simple approach to power and sample size calculations in logistic regression and Cox regression models.

    Science.gov (United States)

    Vaeth, Michael; Skovlund, Eva

    2004-06-15

    For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.

  8. Regression modeling methods, theory, and computation with SAS

    CERN Document Server

    Panik, Michael

    2009-01-01

    Regression Modeling: Methods, Theory, and Computation with SAS provides an introduction to a diverse assortment of regression techniques using SAS to solve a wide variety of regression problems. The author fully documents the SAS programs and thoroughly explains the output produced by the programs.The text presents the popular ordinary least squares (OLS) approach before introducing many alternative regression methods. It covers nonparametric regression, logistic regression (including Poisson regression), Bayesian regression, robust regression, fuzzy regression, random coefficients regression,

  9. A regression approach for Zircaloy-2 in-reactor creep constitutive equations

    International Nuclear Information System (INIS)

    Yung Liu, Y.; Bement, A.L.

    1977-01-01

    In this paper the methodology of multiple regressions as applied to Zircaloy-2 in-reactor creep data analysis and construction of constitutive equation are illustrated. While the resulting constitutive equation can be used in creep analysis of in-reactor Zircaloy structural components, the methodology itself is entirely general and can be applied to any creep data analysis. The promising aspects of multiple regression creep data analysis are briefly outlined as follows: (1) When there are more than one variable involved, there is no need to make the assumption that each variable affects the response independently. No separate normalizations are required either and the estimation of parameters is obtained by solving many simultaneous equations. The number of simultaneous equations is equal to the number of data sets. (2) Regression statistics such as R 2 - and F-statistics provide measures of the significance of regression creep equation in correlating the overall data. The relative weights of each variable on the response can also be obtained. (3) Special regression techniques such as step-wise, ridge, and robust regressions and residual plots, etc., provide diagnostic tools for model selections. Multiple regression analysis performed on a set of carefully selected Zircaloy-2 in-reactor creep data leads to a model which provides excellent correlations for the data. (Auth.)

  10. Regression Models For Multivariate Count Data.

    Science.gov (United States)

    Zhang, Yiwen; Zhou, Hua; Zhou, Jin; Sun, Wei

    2017-01-01

    Data with multivariate count responses frequently occur in modern applications. The commonly used multinomial-logit model is limiting due to its restrictive mean-variance structure. For instance, analyzing count data from the recent RNA-seq technology by the multinomial-logit model leads to serious errors in hypothesis testing. The ubiquity of over-dispersion and complicated correlation structures among multivariate counts calls for more flexible regression models. In this article, we study some generalized linear models that incorporate various correlation structures among the counts. Current literature lacks a treatment of these models, partly due to the fact that they do not belong to the natural exponential family. We study the estimation, testing, and variable selection for these models in a unifying framework. The regression models are compared on both synthetic and real RNA-seq data.

  11. Nonparametric Mixture of Regression Models.

    Science.gov (United States)

    Huang, Mian; Li, Runze; Wang, Shaoli

    2013-07-01

    Motivated by an analysis of US house price index data, we propose nonparametric finite mixture of regression models. We study the identifiability issue of the proposed models, and develop an estimation procedure by employing kernel regression. We further systematically study the sampling properties of the proposed estimators, and establish their asymptotic normality. A modified EM algorithm is proposed to carry out the estimation procedure. We show that our algorithm preserves the ascent property of the EM algorithm in an asymptotic sense. Monte Carlo simulations are conducted to examine the finite sample performance of the proposed estimation procedure. An empirical analysis of the US house price index data is illustrated for the proposed methodology.

  12. Gaussian Process Regression Model in Spatial Logistic Regression

    Science.gov (United States)

    Sofro, A.; Oktaviarina, A.

    2018-01-01

    Spatial analysis has developed very quickly in the last decade. One of the favorite approaches is based on the neighbourhood of the region. Unfortunately, there are some limitations such as difficulty in prediction. Therefore, we offer Gaussian process regression (GPR) to accommodate the issue. In this paper, we will focus on spatial modeling with GPR for binomial data with logit link function. The performance of the model will be investigated. We will discuss the inference of how to estimate the parameters and hyper-parameters and to predict as well. Furthermore, simulation studies will be explained in the last section.

  13. Regression and regression analysis time series prediction modeling on climate data of quetta, pakistan

    International Nuclear Information System (INIS)

    Jafri, Y.Z.; Kamal, L.

    2007-01-01

    Various statistical techniques was used on five-year data from 1998-2002 of average humidity, rainfall, maximum and minimum temperatures, respectively. The relationships to regression analysis time series (RATS) were developed for determining the overall trend of these climate parameters on the basis of which forecast models can be corrected and modified. We computed the coefficient of determination as a measure of goodness of fit, to our polynomial regression analysis time series (PRATS). The correlation to multiple linear regression (MLR) and multiple linear regression analysis time series (MLRATS) were also developed for deciphering the interdependence of weather parameters. Spearman's rand correlation and Goldfeld-Quandt test were used to check the uniformity or non-uniformity of variances in our fit to polynomial regression (PR). The Breusch-Pagan test was applied to MLR and MLRATS, respectively which yielded homoscedasticity. We also employed Bartlett's test for homogeneity of variances on a five-year data of rainfall and humidity, respectively which showed that the variances in rainfall data were not homogenous while in case of humidity, were homogenous. Our results on regression and regression analysis time series show the best fit to prediction modeling on climatic data of Quetta, Pakistan. (author)

  14. Robust mislabel logistic regression without modeling mislabel probabilities.

    Science.gov (United States)

    Hung, Hung; Jou, Zhi-Yu; Huang, Su-Yun

    2018-03-01

    Logistic regression is among the most widely used statistical methods for linear discriminant analysis. In many applications, we only observe possibly mislabeled responses. Fitting a conventional logistic regression can then lead to biased estimation. One common resolution is to fit a mislabel logistic regression model, which takes into consideration of mislabeled responses. Another common method is to adopt a robust M-estimation by down-weighting suspected instances. In this work, we propose a new robust mislabel logistic regression based on γ-divergence. Our proposal possesses two advantageous features: (1) It does not need to model the mislabel probabilities. (2) The minimum γ-divergence estimation leads to a weighted estimating equation without the need to include any bias correction term, that is, it is automatically bias-corrected. These features make the proposed γ-logistic regression more robust in model fitting and more intuitive for model interpretation through a simple weighting scheme. Our method is also easy to implement, and two types of algorithms are included. Simulation studies and the Pima data application are presented to demonstrate the performance of γ-logistic regression. © 2017, The International Biometric Society.

  15. Mixed Frequency Data Sampling Regression Models: The R Package midasr

    Directory of Open Access Journals (Sweden)

    Eric Ghysels

    2016-08-01

    Full Text Available When modeling economic relationships it is increasingly common to encounter data sampled at different frequencies. We introduce the R package midasr which enables estimating regression models with variables sampled at different frequencies within a MIDAS regression framework put forward in work by Ghysels, Santa-Clara, and Valkanov (2002. In this article we define a general autoregressive MIDAS regression model with multiple variables of different frequencies and show how it can be specified using the familiar R formula interface and estimated using various optimization methods chosen by the researcher. We discuss how to check the validity of the estimated model both in terms of numerical convergence and statistical adequacy of a chosen regression specification, how to perform model selection based on a information criterion, how to assess forecasting accuracy of the MIDAS regression model and how to obtain a forecast aggregation of different MIDAS regression models. We illustrate the capabilities of the package with a simulated MIDAS regression model and give two empirical examples of application of MIDAS regression.

  16. Impact of multicollinearity on small sample hydrologic regression models

    Science.gov (United States)

    Kroll, Charles N.; Song, Peter

    2013-06-01

    Often hydrologic regression models are developed with ordinary least squares (OLS) procedures. The use of OLS with highly correlated explanatory variables produces multicollinearity, which creates highly sensitive parameter estimators with inflated variances and improper model selection. It is not clear how to best address multicollinearity in hydrologic regression models. Here a Monte Carlo simulation is developed to compare four techniques to address multicollinearity: OLS, OLS with variance inflation factor screening (VIF), principal component regression (PCR), and partial least squares regression (PLS). The performance of these four techniques was observed for varying sample sizes, correlation coefficients between the explanatory variables, and model error variances consistent with hydrologic regional regression models. The negative effects of multicollinearity are magnified at smaller sample sizes, higher correlations between the variables, and larger model error variances (smaller R2). The Monte Carlo simulation indicates that if the true model is known, multicollinearity is present, and the estimation and statistical testing of regression parameters are of interest, then PCR or PLS should be employed. If the model is unknown, or if the interest is solely on model predictions, is it recommended that OLS be employed since using more complicated techniques did not produce any improvement in model performance. A leave-one-out cross-validation case study was also performed using low-streamflow data sets from the eastern United States. Results indicate that OLS with stepwise selection generally produces models across study regions with varying levels of multicollinearity that are as good as biased regression techniques such as PCR and PLS.

  17. Applied Regression Modeling A Business Approach

    CERN Document Server

    Pardoe, Iain

    2012-01-01

    An applied and concise treatment of statistical regression techniques for business students and professionals who have little or no background in calculusRegression analysis is an invaluable statistical methodology in business settings and is vital to model the relationship between a response variable and one or more predictor variables, as well as the prediction of a response value given values of the predictors. In view of the inherent uncertainty of business processes, such as the volatility of consumer spending and the presence of market uncertainty, business professionals use regression a

  18. Testing homogeneity in Weibull-regression models.

    Science.gov (United States)

    Bolfarine, Heleno; Valença, Dione M

    2005-10-01

    In survival studies with families or geographical units it may be of interest testing whether such groups are homogeneous for given explanatory variables. In this paper we consider score type tests for group homogeneity based on a mixing model in which the group effect is modelled as a random variable. As opposed to hazard-based frailty models, this model presents survival times that conditioned on the random effect, has an accelerated failure time representation. The test statistics requires only estimation of the conventional regression model without the random effect and does not require specifying the distribution of the random effect. The tests are derived for a Weibull regression model and in the uncensored situation, a closed form is obtained for the test statistic. A simulation study is used for comparing the power of the tests. The proposed tests are applied to real data sets with censored data.

  19. Model-based Quantile Regression for Discrete Data

    KAUST Repository

    Padellini, Tullia; Rue, Haavard

    2018-01-01

    Quantile regression is a class of methods voted to the modelling of conditional quantiles. In a Bayesian framework quantile regression has typically been carried out exploiting the Asymmetric Laplace Distribution as a working likelihood. Despite

  20. Detection of epistatic effects with logic regression and a classical linear regression model.

    Science.gov (United States)

    Malina, Magdalena; Ickstadt, Katja; Schwender, Holger; Posch, Martin; Bogdan, Małgorzata

    2014-02-01

    To locate multiple interacting quantitative trait loci (QTL) influencing a trait of interest within experimental populations, usually methods as the Cockerham's model are applied. Within this framework, interactions are understood as the part of the joined effect of several genes which cannot be explained as the sum of their additive effects. However, if a change in the phenotype (as disease) is caused by Boolean combinations of genotypes of several QTLs, this Cockerham's approach is often not capable to identify them properly. To detect such interactions more efficiently, we propose a logic regression framework. Even though with the logic regression approach a larger number of models has to be considered (requiring more stringent multiple testing correction) the efficient representation of higher order logic interactions in logic regression models leads to a significant increase of power to detect such interactions as compared to a Cockerham's approach. The increase in power is demonstrated analytically for a simple two-way interaction model and illustrated in more complex settings with simulation study and real data analysis.

  1. Privacy-Preserving Distributed Linear Regression on High-Dimensional Data

    Directory of Open Access Journals (Sweden)

    Gascón Adrià

    2017-10-01

    Full Text Available We propose privacy-preserving protocols for computing linear regression models, in the setting where the training dataset is vertically distributed among several parties. Our main contribution is a hybrid multi-party computation protocol that combines Yao’s garbled circuits with tailored protocols for computing inner products. Like many machine learning tasks, building a linear regression model involves solving a system of linear equations. We conduct a comprehensive evaluation and comparison of different techniques for securely performing this task, including a new Conjugate Gradient Descent (CGD algorithm. This algorithm is suitable for secure computation because it uses an efficient fixed-point representation of real numbers while maintaining accuracy and convergence rates comparable to what can be obtained with a classical solution using floating point numbers. Our technique improves on Nikolaenko et al.’s method for privacy-preserving ridge regression (S&P 2013, and can be used as a building block in other analyses. We implement a complete system and demonstrate that our approach is highly scalable, solving data analysis problems with one million records and one hundred features in less than one hour of total running time.

  2. Migrating Toward Fully 4-D Geodynamical Models of Asthenospheric Circulation and Melt Production at Mid-Ocean Ridges

    Science.gov (United States)

    van Dam, L.; Kincaid, C. R.; Pockalny, R. A.; Sylvia, R. T.; Hall, P. S.

    2017-12-01

    Lateral migration of mid-ocean ridge spreading centers is a well-documented phenomenon leading to asymmetric melt production and the surficial expressions thereof. This form of plate motion has been difficult to incorporate into both numerical and analogue geodynamical models, and consequently, current estimates of time-dependent flow, material transport, and melting in the mantle beneath ridges are lacking. To address this, we have designed and built an innovative research apparatus that allows for precise and repeatable simulations of mid-ocean ridge spreading and migration. Three pairs of counter-rotating belts with adjustable lateral orientations are scaled to simulate spreading at, and flow beneath, three 600km wide ridge segments with up to 300km transform offsets. This apparatus is attached to a drive system that allows us to test a full range of axis-parallel to axis-normal migration directions, and is suspended above a reservoir of viscous glucose syrup, a scaled analogue for the upper mantle, and neutrally buoyant tracers. We image plate-driven flow in the syrup with high-resolution digital cameras and use particle image velocimetry methods to obtain information about transport pathlines and flow-induced anisotropy. Suites of experiments are run with and without ridge migration to determine the overall significance of migration on spatial and temporal characteristics of shallow mantle flow. Our experiments cover an expansive parameter space by including various spreading rates, migration speeds and directions, degrees of spreading asymmetry, transform-offset lengths, and upper mantle viscosity conditions. Preliminary results highlight the importance of modeling migratory plate forces. Mantle material exhibits a significant degree of lateral transport, particularly between ridge segments and towards the melt triangle. Magma supply to the melting region is highly complex; parcels of material do not necessarily move along fixed streamlines, rather, they can

  3. AN APPLICATION OF FUNCTIONAL MULTIVARIATE REGRESSION MODEL TO MULTICLASS CLASSIFICATION

    OpenAIRE

    Krzyśko, Mirosław; Smaga, Łukasz

    2017-01-01

    In this paper, the scale response functional multivariate regression model is considered. By using the basis functions representation of functional predictors and regression coefficients, this model is rewritten as a multivariate regression model. This representation of the functional multivariate regression model is used for multiclass classification for multivariate functional data. Computational experiments performed on real labelled data sets demonstrate the effectiveness of the proposed ...

  4. Regularized principal covariates regression and its application to finding coupled patterns in climate fields

    Science.gov (United States)

    Fischer, M. J.

    2014-02-01

    There are many different methods for investigating the coupling between two climate fields, which are all based on the multivariate regression model. Each different method of solving the multivariate model has its own attractive characteristics, but often the suitability of a particular method for a particular problem is not clear. Continuum regression methods search the solution space between the conventional methods and thus can find regression model subspaces that mix the attractive characteristics of the end-member subspaces. Principal covariates regression is a continuum regression method that is easily applied to climate fields and makes use of two end-members: principal components regression and redundancy analysis. In this study, principal covariates regression is extended to additionally span a third end-member (partial least squares or maximum covariance analysis). The new method, regularized principal covariates regression, has several attractive features including the following: it easily applies to problems in which the response field has missing values or is temporally sparse, it explores a wide range of model spaces, and it seeks a model subspace that will, for a set number of components, have a predictive skill that is the same or better than conventional regression methods. The new method is illustrated by applying it to the problem of predicting the southern Australian winter rainfall anomaly field using the regional atmospheric pressure anomaly field. Regularized principal covariates regression identifies four major coupled patterns in these two fields. The two leading patterns, which explain over half the variance in the rainfall field, are related to the subtropical ridge and features of the zonally asymmetric circulation.

  5. Dynamical instability produces transform faults at mid-ocean ridges.

    Science.gov (United States)

    Gerya, Taras

    2010-08-27

    Transform faults at mid-ocean ridges--one of the most striking, yet enigmatic features of terrestrial plate tectonics--are considered to be the inherited product of preexisting fault structures. Ridge offsets along these faults therefore should remain constant with time. Here, numerical models suggest that transform faults are actively developing and result from dynamical instability of constructive plate boundaries, irrespective of previous structure. Boundary instability from asymmetric plate growth can spontaneously start in alternate directions along successive ridge sections; the resultant curved ridges become transform faults within a few million years. Fracture-related rheological weakening stabilizes ridge-parallel detachment faults. Offsets along the transform faults change continuously with time by asymmetric plate growth and discontinuously by ridge jumps.

  6. Parameters Estimation of Geographically Weighted Ordinal Logistic Regression (GWOLR) Model

    Science.gov (United States)

    Zuhdi, Shaifudin; Retno Sari Saputro, Dewi; Widyaningsih, Purnami

    2017-06-01

    A regression model is the representation of relationship between independent variable and dependent variable. The dependent variable has categories used in the logistic regression model to calculate odds on. The logistic regression model for dependent variable has levels in the logistics regression model is ordinal. GWOLR model is an ordinal logistic regression model influenced the geographical location of the observation site. Parameters estimation in the model needed to determine the value of a population based on sample. The purpose of this research is to parameters estimation of GWOLR model using R software. Parameter estimation uses the data amount of dengue fever patients in Semarang City. Observation units used are 144 villages in Semarang City. The results of research get GWOLR model locally for each village and to know probability of number dengue fever patient categories.

  7. A regression approach for zircaloy-2 in-reactor creep constitutive equations

    International Nuclear Information System (INIS)

    Yung Liu, Y.; Bement, A.L.

    1977-01-01

    In this paper the methodology of multiple regressions as applied to zircaloy-2 in-reactor creep data analysis and construction of constitutive equation are illustrated. While the resulting constitutive equation can be used in creep analysis of in-reactor zircaloy structural components, the methodology itself is entirely general and can be applied to any creep data analysis. From data analysis and model development point of views, both the assumption of independence and prior committment to specific model forms are unacceptable. One would desire means which can not only estimate the required parameters directly from data but also provide basis for model selections, viz., one model against others. Basic understanding of the physics of deformation is important in choosing the forms of starting physical model equations, but the justifications must rely on their abilities in correlating the overall data. The promising aspects of multiple regression creep data analysis are briefly outlined as follows: (1) when there are more than one variable involved, there is no need to make the assumption that each variable affects the response independently. No separate normalizations are required either and the estimation of parameters is obtained by solving many simultaneous equations. The number of simultaneous equations is equal to the number of data sets, (2) regression statistics such as R 2 - and F-statistics provide measures of the significance of regression creep equation in correlating the overall data. The relative weights of each variable on the response can also be obtained. (3) Special regression techniques such as step-wise, ridge, and robust regressions and residual plots, etc., provide diagnostic tools for model selections

  8. Environmental baseline survey report for West Black Oak Ridge, East Black Oak Ridge, McKinney Ridge, West Pine Ridge and parcel 21D in the vicinity of the East Technology Park, Oak Ridge, Tennessee

    Energy Technology Data Exchange (ETDEWEB)

    King, David A. [Oak Ridge Institute for Science and Education (ORISE), Oak Ridge, TN (United States). Independent Environmental Assessment and Verification Program

    2012-11-29

    This environmental baseline survey (EBS) report documents the baseline environmental conditions of five land parcels located near the U.S. Department of Energy?s (DOE?s) East Tennessee Technology Park (ETTP), including West Black Oak Ridge, East Black Oak Ridge, McKinney Ridge, West Pine Ridge, and Parcel 21d. Preparation of this report included the detailed search of federal government records, title documents, aerial photos that may reflect prior uses, and visual inspections of the property and adjacent properties. Interviews with current employees involved in, or familiar with, operations on the real property were also conducted to identify any areas on the property where hazardous substances and petroleum products, or their derivatives, and acutely hazardous wastes may have been released or disposed. In addition, a search was made of reasonably obtainable federal, state, and local government records of each adjacent facility where there has been a release of any hazardous substance or any petroleum product or their derivatives, including aviation fuel and motor oil, and which is likely to cause or contribute to a release of any hazardous substance or any petroleum product or its derivatives, including aviation fuel or motor oil, on the real property. A radiological survey and soil/sediment sampling was conducted to assess baseline conditions of Parcel 21d that were not addressed by the soils-only no-further-investigation (NFI) reports. Groundwater sampling was also conducted to support a Parcel 21d decision. Based on available data West Black Oak Ridge, East Black Oak Ridge, McKinney Ridge, and West Pine Ridge are not impacted by site operations and are not subject to actions per the Federal Facility Agreement (FFA). This determination is supported by visual inspections, records searches and interviews, groundwater conceptual modeling, approved NFI reports, analytical data, and risk analysis results. Parcel 21d data, however, demonstrate impacts from site

  9. Environmental baseline survey report for West Black Oak Ridge, East Black Oak Ridge, McKinney Ridge, West Pine Ridge and parcel 21D in the vicinity of the East Technology Park, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    King, David A.

    2012-01-01

    This environmental baseline survey (EBS) report documents the baseline environmental conditions of five land parcels located near the U.S. Department of Energy's (DOE's) East Tennessee Technology Park (ETTP), including West Black Oak Ridge, East Black Oak Ridge, McKinney Ridge, West Pine Ridge, and Parcel 21d. Preparation of this report included the detailed search of federal government records, title documents, aerial photos that may reflect prior uses, and visual inspections of the property and adjacent properties. Interviews with current employees involved in, or familiar with, operations on the real property were also conducted to identify any areas on the property where hazardous substances and petroleum products, or their derivatives, and acutely hazardous wastes may have been released or disposed. In addition, a search was made of reasonably obtainable federal, state, and local government records of each adjacent facility where there has been a release of any hazardous substance or any petroleum product or their derivatives, including aviation fuel and motor oil, and which is likely to cause or contribute to a release of any hazardous substance or any petroleum product or its derivatives, including aviation fuel or motor oil, on the real property. A radiological survey and soil/sediment sampling was conducted to assess baseline conditions of Parcel 21d that were not addressed by the soils-only no-further-investigation (NFI) reports. Groundwater sampling was also conducted to support a Parcel 21d decision. Based on available data West Black Oak Ridge, East Black Oak Ridge, McKinney Ridge, and West Pine Ridge are not impacted by site operations and are not subject to actions per the Federal Facility Agreement (FFA). This determination is supported by visual inspections, records searches and interviews, groundwater conceptual modeling, approved NFI reports, analytical data, and risk analysis results. Parcel 21d data, however, demonstrate impacts from site

  10. Two-Dimensional Heat Transfer Modeling of the Formosa Ridge Offshore SW Taiwan: Implication for Fluid Migrating Paths of a Cold Seep Site

    Science.gov (United States)

    Tsai, Y.; Chi, W.; Liu, C.; Shyu, C.

    2011-12-01

    The Formosa Ridge, a small ridge located on the passive China continental slope offshore southwestern Taiwan, is an active cold seep site. Large and dense chemosynthetic communities were found there by the ROV Hyper-Dolphin during the 2007 NT0705 cruise. A vertical blank zone is clearly observed on all the seismic profiles across the cold seep site. This narrow zone is interpreted to be the fluid conduit of the seep site. Previous studies suggest that cold sea water carrying large amount of sulfate could flow into the fluid system from flanks of the ridge, and forms a very effective fluid circulation system that emits both methane and hydrogen sulfide to feed the unusual chemosynthetic communities observed at the Formosa Ridge cold seep site. Here we use thermal signals to study possible fluid flow migration paths. In 2008 and 2010, we have collected vdense thermal probe data at this site. We also study the temperatures at Bottom-Simulating Reflectors (BSRs) based on methane hydrate phase diagram. We perform 2D finite element thermal conductive simulations to study the effects of bathymetry on the temperature field in the ridge, and compare the simulation result with thermal probe and BSR-derived datasets. The boundary conditions include insulated boundaries on both sides, and we assign a fix temperature at the bottom of the model using an average regional geothermal gradient. Sensitivity tests and thermal probe data from a nearby region give a regional background geothermal gradient of 0.04 to 0.05 °C/m. The outputs of the simulation runs include geothermal gradient and temperature at different parts of the model. The model can fit the geothermal gradient at a distance away from the ridge where there is less geophysics evidence of fluid flow. However our model over-predicts the geothermal gradient by 50% at the ridge top. We also compare simulated temperature field and found that under the flanks of the ridge the temperature is cooled by 2 °C compared with the

  11. Alternative regression models to assess increase in childhood BMI.

    Science.gov (United States)

    Beyerlein, Andreas; Fahrmeir, Ludwig; Mansmann, Ulrich; Toschke, André M

    2008-09-08

    Body mass index (BMI) data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations. Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs), quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS). We analyzed data of 4967 children participating in the school entry health examination in Bavaria, Germany, from 2001 to 2002. TV watching, meal frequency, breastfeeding, smoking in pregnancy, maternal obesity, parental social class and weight gain in the first 2 years of life were considered as risk factors for obesity. GAMLSS showed a much better fit regarding the estimation of risk factors effects on transformed and untransformed BMI data than common GLMs with respect to the generalized Akaike information criterion. In comparison with GAMLSS, quantile regression allowed for additional interpretation of prespecified distribution quantiles, such as quantiles referring to overweight or obesity. The variables TV watching, maternal BMI and weight gain in the first 2 years were directly, and meal frequency was inversely significantly associated with body composition in any model type examined. In contrast, smoking in pregnancy was not directly, and breastfeeding and parental social class were not inversely significantly associated with body composition in GLM models, but in GAMLSS and partly in quantile regression models. Risk factor specific BMI percentile curves could be estimated from GAMLSS and quantile regression models. GAMLSS and quantile regression seem to be more appropriate than common GLMs for risk factor modeling of BMI data.

  12. Volcanism and hydrothermalism on a hotspot-influenced ridge: Comparing Reykjanes Peninsula and Reykjanes Ridge, Iceland

    Science.gov (United States)

    Pałgan, Dominik; Devey, Colin W.; Yeo, Isobel A.

    2017-12-01

    Current estimates indicate that the number of high-temperature vents (one of the primary pathways for the heat extraction from the Earth's mantle) - at least 1 per 100 km of axial length - scales with spreading rate and should scale with crustal thickness. But up to present, shallow ridge axes underlain by thick crust show anomalously low incidences of high-temperature activity. Here we compare the Reykjanes Ridge, an abnormally shallow ridge with thick crust and only one high-temperature vent known over 900 km axial length, to the adjacent subaerial Reykjanes Peninsula (RP), which is characterized by high-temperature geothermal sites confined to four volcanic systems transected by fissure swarms with young (Holocene) volcanic activity, multiple faults, cracks and fissures, and continuous seismic activity. New high-resolution bathymetry (gridded at 60 m) of the Reykjanes Ridge between 62°30‧N and 63°30‧N shows seven Axial Volcanic Ridges (AVR) that, based on their morphology, geometry and tectonic regime, are analogues for the volcanic systems and fissure swarms on land. We investigate in detail the volcano-tectonic features of all mapped AVRs and show that they do not fit with the previously suggested 4-stage evolution model for AVR construction. Instead, we suggest that AVR morphology reflects the robust or weak melt supply to the system and two (or more) eruption mechanisms may co-exist on one AVR (in contrast to 4-stage evolution model). Our interpretations indicate that, unlike on the Reykjanes Peninsula, faults on and around AVRs do not cluster in orientation domains but all are subparallel to the overall strike of AVRs (orthogonal to spreading direction). High abundance of seamounts shows that the region centered at 62°47‧N and 25°04‧W (between AVR-5 and -6) is volcanically robust while the highest fault density implies that AVR-1 and southern part of AVR-6 rather undergo period of melt starvation. Based on our observations and interpretations we

  13. Alternative regression models to assess increase in childhood BMI

    OpenAIRE

    Beyerlein, Andreas; Fahrmeir, Ludwig; Mansmann, Ulrich; Toschke, André M

    2008-01-01

    Abstract Background Body mass index (BMI) data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations. Methods Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs), quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS). We analyzed data of 4967 childre...

  14. A comparative study of some robust ridge and liu estimators ...

    African Journals Online (AJOL)

    In multiple linear regression analysis, multicollinearity and outliers are two main problems. When multicollinearity exists, biased estimation techniques such as Ridge and Liu Estimators are preferable to Ordinary Least Square. On the other hand, when outliers exist in the data, robust estimators like M, MM, LTS and S ...

  15. Patch behaviour and predictability properties of modelled finite-amplitude sand ridges on the inner shelf

    NARCIS (Netherlands)

    Vis-star, N.C.; de Swart, H.E.; Calvete, D.

    2008-01-01

    The long-term evolution of shoreface-connected sand ridges is investigated with a nonlinear spectral model which governs the dynamics of waves, currents, sediment transport and the bed level on the inner shelf. Wave variables are calculated with a shoaling-refraction model instead of using a

  16. Thermal Efficiency Degradation Diagnosis Method Using Regression Model

    International Nuclear Information System (INIS)

    Jee, Chang Hyun; Heo, Gyun Young; Jang, Seok Won; Lee, In Cheol

    2011-01-01

    This paper proposes an idea for thermal efficiency degradation diagnosis in turbine cycles, which is based on turbine cycle simulation under abnormal conditions and a linear regression model. The correlation between the inputs for representing degradation conditions (normally unmeasured but intrinsic states) and the simulation outputs (normally measured but superficial states) was analyzed with the linear regression model. The regression models can inversely response an associated intrinsic state for a superficial state observed from a power plant. The diagnosis method proposed herein is classified into three processes, 1) simulations for degradation conditions to get measured states (referred as what-if method), 2) development of the linear model correlating intrinsic and superficial states, and 3) determination of an intrinsic state using the superficial states of current plant and the linear regression model (referred as inverse what-if method). The what-if method is to generate the outputs for the inputs including various root causes and/or boundary conditions whereas the inverse what-if method is the process of calculating the inverse matrix with the given superficial states, that is, component degradation modes. The method suggested in this paper was validated using the turbine cycle model for an operating power plant

  17. Random regression models for detection of gene by environment interaction

    Directory of Open Access Journals (Sweden)

    Meuwissen Theo HE

    2007-02-01

    Full Text Available Abstract Two random regression models, where the effect of a putative QTL was regressed on an environmental gradient, are described. The first model estimates the correlation between intercept and slope of the random regression, while the other model restricts this correlation to 1 or -1, which is expected under a bi-allelic QTL model. The random regression models were compared to a model assuming no gene by environment interactions. The comparison was done with regards to the models ability to detect QTL, to position them accurately and to detect possible QTL by environment interactions. A simulation study based on a granddaughter design was conducted, and QTL were assumed, either by assigning an effect independent of the environment or as a linear function of a simulated environmental gradient. It was concluded that the random regression models were suitable for detection of QTL effects, in the presence and absence of interactions with environmental gradients. Fixing the correlation between intercept and slope of the random regression had a positive effect on power when the QTL effects re-ranked between environments.

  18. Alternative regression models to assess increase in childhood BMI

    Directory of Open Access Journals (Sweden)

    Mansmann Ulrich

    2008-09-01

    Full Text Available Abstract Background Body mass index (BMI data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations. Methods Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs, quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS. We analyzed data of 4967 children participating in the school entry health examination in Bavaria, Germany, from 2001 to 2002. TV watching, meal frequency, breastfeeding, smoking in pregnancy, maternal obesity, parental social class and weight gain in the first 2 years of life were considered as risk factors for obesity. Results GAMLSS showed a much better fit regarding the estimation of risk factors effects on transformed and untransformed BMI data than common GLMs with respect to the generalized Akaike information criterion. In comparison with GAMLSS, quantile regression allowed for additional interpretation of prespecified distribution quantiles, such as quantiles referring to overweight or obesity. The variables TV watching, maternal BMI and weight gain in the first 2 years were directly, and meal frequency was inversely significantly associated with body composition in any model type examined. In contrast, smoking in pregnancy was not directly, and breastfeeding and parental social class were not inversely significantly associated with body composition in GLM models, but in GAMLSS and partly in quantile regression models. Risk factor specific BMI percentile curves could be estimated from GAMLSS and quantile regression models. Conclusion GAMLSS and quantile regression seem to be more appropriate than common GLMs for risk factor modeling of BMI data.

  19. The microcomputer scientific software series 2: general linear model--regression.

    Science.gov (United States)

    Harold M. Rauscher

    1983-01-01

    The general linear model regression (GLMR) program provides the microcomputer user with a sophisticated regression analysis capability. The output provides a regression ANOVA table, estimators of the regression model coefficients, their confidence intervals, confidence intervals around the predicted Y-values, residuals for plotting, a check for multicollinearity, a...

  20. Wavelet regression model in forecasting crude oil price

    Science.gov (United States)

    Hamid, Mohd Helmie; Shabri, Ani

    2017-05-01

    This study presents the performance of wavelet multiple linear regression (WMLR) technique in daily crude oil forecasting. WMLR model was developed by integrating the discrete wavelet transform (DWT) and multiple linear regression (MLR) model. The original time series was decomposed to sub-time series with different scales by wavelet theory. Correlation analysis was conducted to assist in the selection of optimal decomposed components as inputs for the WMLR model. The daily WTI crude oil price series has been used in this study to test the prediction capability of the proposed model. The forecasting performance of WMLR model were also compared with regular multiple linear regression (MLR), Autoregressive Moving Average (ARIMA) and Generalized Autoregressive Conditional Heteroscedasticity (GARCH) using root mean square errors (RMSE) and mean absolute errors (MAE). Based on the experimental results, it appears that the WMLR model performs better than the other forecasting technique tested in this study.

  1. Predicting and Modelling of Survival Data when Cox's Regression Model does not hold

    DEFF Research Database (Denmark)

    Scheike, Thomas H.; Zhang, Mei-Jie

    2002-01-01

    Aalen model; additive risk model; counting processes; competing risk; Cox regression; flexible modeling; goodness of fit; prediction of survival; survival analysis; time-varying effects......Aalen model; additive risk model; counting processes; competing risk; Cox regression; flexible modeling; goodness of fit; prediction of survival; survival analysis; time-varying effects...

  2. Spatial stochastic regression modelling of urban land use

    International Nuclear Information System (INIS)

    Arshad, S H M; Jaafar, J; Abiden, M Z Z; Latif, Z A; Rasam, A R A

    2014-01-01

    Urbanization is very closely linked to industrialization, commercialization or overall economic growth and development. This results in innumerable benefits of the quantity and quality of the urban environment and lifestyle but on the other hand contributes to unbounded development, urban sprawl, overcrowding and decreasing standard of living. Regulation and observation of urban development activities is crucial. The understanding of urban systems that promotes urban growth are also essential for the purpose of policy making, formulating development strategies as well as development plan preparation. This study aims to compare two different stochastic regression modeling techniques for spatial structure models of urban growth in the same specific study area. Both techniques will utilize the same datasets and their results will be analyzed. The work starts by producing an urban growth model by using stochastic regression modeling techniques namely the Ordinary Least Square (OLS) and Geographically Weighted Regression (GWR). The two techniques are compared to and it is found that, GWR seems to be a more significant stochastic regression model compared to OLS, it gives a smaller AICc (Akaike's Information Corrected Criterion) value and its output is more spatially explainable

  3. Physics constrained nonlinear regression models for time series

    International Nuclear Information System (INIS)

    Majda, Andrew J; Harlim, John

    2013-01-01

    A central issue in contemporary science is the development of data driven statistical nonlinear dynamical models for time series of partial observations of nature or a complex physical model. It has been established recently that ad hoc quadratic multi-level regression (MLR) models can have finite-time blow up of statistical solutions and/or pathological behaviour of their invariant measure. Here a new class of physics constrained multi-level quadratic regression models are introduced, analysed and applied to build reduced stochastic models from data of nonlinear systems. These models have the advantages of incorporating memory effects in time as well as the nonlinear noise from energy conserving nonlinear interactions. The mathematical guidelines for the performance and behaviour of these physics constrained MLR models as well as filtering algorithms for their implementation are developed here. Data driven applications of these new multi-level nonlinear regression models are developed for test models involving a nonlinear oscillator with memory effects and the difficult test case of the truncated Burgers–Hopf model. These new physics constrained quadratic MLR models are proposed here as process models for Bayesian estimation through Markov chain Monte Carlo algorithms of low frequency behaviour in complex physical data. (paper)

  4. Logistic regression modelling: procedures and pitfalls in developing and interpreting prediction models

    Directory of Open Access Journals (Sweden)

    Nataša Šarlija

    2017-01-01

    Full Text Available This study sheds light on the most common issues related to applying logistic regression in prediction models for company growth. The purpose of the paper is 1 to provide a detailed demonstration of the steps in developing a growth prediction model based on logistic regression analysis, 2 to discuss common pitfalls and methodological errors in developing a model, and 3 to provide solutions and possible ways of overcoming these issues. Special attention is devoted to the question of satisfying logistic regression assumptions, selecting and defining dependent and independent variables, using classification tables and ROC curves, for reporting model strength, interpreting odds ratios as effect measures and evaluating performance of the prediction model. Development of a logistic regression model in this paper focuses on a prediction model of company growth. The analysis is based on predominantly financial data from a sample of 1471 small and medium-sized Croatian companies active between 2009 and 2014. The financial data is presented in the form of financial ratios divided into nine main groups depicting following areas of business: liquidity, leverage, activity, profitability, research and development, investing and export. The growth prediction model indicates aspects of a business critical for achieving high growth. In that respect, the contribution of this paper is twofold. First, methodological, in terms of pointing out pitfalls and potential solutions in logistic regression modelling, and secondly, theoretical, in terms of identifying factors responsible for high growth of small and medium-sized companies.

  5. The ESASSI-08 cruise in the South Scotia Ridge region: An inverse model property-transport analysis over the Ridge

    Science.gov (United States)

    Palmer, Margarita; Gomis, Damià; Del Mar Flexas, Maria; Jordà, Gabriel; Naveira-Garabato, Alberto; Jullion, Loic; Tsubouchi, Takamasa

    2010-05-01

    The ESASSI-08 oceanographic cruise carried out in January 2008 was the most significant milestone of the ESASSI project. ESASSI is the Spanish component of the Synoptic Antarctic Shelf-Slope Interactions (SASSI) study, one of the core projects of the International Polar Year. Hydrographical and biochemical (oxygen, CFCs, nutrients, chlorophyll content, alkalinity, pH, DOC) data were obtained along 11 sections in the South Scotia Ridge (SSR) region, between Elephant and South Orkney Islands. One of the aims of the ESASSI project is to determine the northward outflow of cold and ventilated waters from the Weddell Sea into the Scotia Sea. For that purpose, the accurate estimation of mass, heat, salt, and oxygen transport over the Ridge is requested. An initial analysis of transports across the different sections was first obtained from CTD and ADCP data. The following step has been the application of an inverse method, in order to obtain a better estimation of the net flow for the different water masses present in the region. The set of property-conservation equations considered by the inverse model includes mass, heat and salinity fluxes. The "box" is delimited by the sections along the northern flank of the SSR, between Elephant Island and 50°W, the southern flank of the Ridge, between 51.5°W and 50°W, the 50°W meridian and a diagonal line between Elephant Island and 51.5°W, 61.75°S. Results show that the initial calculations of transports suffered of a significant volume imbalance, due to the inherent errors of ship-ADCP data, the complicated topography and the presence of strong tidal currents in some sections. We present the post-inversion property transports across the rim of the box (and their error bars) for the different water masses.

  6. Regression Models for Repairable Systems

    Czech Academy of Sciences Publication Activity Database

    Novák, Petr

    2015-01-01

    Roč. 17, č. 4 (2015), s. 963-972 ISSN 1387-5841 Institutional support: RVO:67985556 Keywords : Reliability analysis * Repair models * Regression Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.782, year: 2015 http://library.utia.cas.cz/separaty/2015/SI/novak-0450902.pdf

  7. Remedial Investigation Work Plan for Chestnut Ridge Operable Unit 1 (Chestnut Ridge Security Pits) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1994-03-01

    This document outlines the activities necessary to conduct a Remedial Investigation (RI) of the Chestnut Ridge Security Pits (CRSP) at the Oak Ridge Y-12 Plant. The CRSP, also designated Chestnut Ridge Operable Unit (OU) 1, is one of four OUs along Chestnut Ridge on the Oak Ridge Reservation (ORR). The purpose of the RI is to collect data to (1) evaluate the nature and extent of known and suspected contaminants, (2) support an Ecological Risk Assessment (ERA) and a Human Health Risk Assessment (HHRA), (3) support the feasibility study in the development and analysis of remedial alternatives, and (4) ultimately, develop a Record of Decision (ROD) for the site. This chapter summarizes the regulatory background of environmental investigation on the ORR and the approach currently being followed and provides an overview of the RI to be conducted at the CRSP. Subsequent chapters provide details on site history, sampling activities, procedures and methods, quality assurance (QA), health and safety, and waste management related to the RI

  8. Remedial Investigation Work Plan for Chestnut Ridge Operable Unit 1 (Chestnut Ridge Security Pits) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee

    Energy Technology Data Exchange (ETDEWEB)

    1994-03-01

    This document outlines the activities necessary to conduct a Remedial Investigation (RI) of the Chestnut Ridge Security Pits (CRSP) at the Oak Ridge Y-12 Plant. The CRSP, also designated Chestnut Ridge Operable Unit (OU) 1, is one of four OUs along Chestnut Ridge on the Oak Ridge Reservation (ORR). The purpose of the RI is to collect data to (1) evaluate the nature and extent of known and suspected contaminants, (2) support an Ecological Risk Assessment (ERA) and a Human Health Risk Assessment (HHRA), (3) support the feasibility study in the development and analysis of remedial alternatives, and (4) ultimately, develop a Record of Decision (ROD) for the site. This chapter summarizes the regulatory background of environmental investigation on the ORR and the approach currently being followed and provides an overview of the RI to be conducted at the CRSP. Subsequent chapters provide details on site history, sampling activities, procedures and methods, quality assurance (QA), health and safety, and waste management related to the RI.

  9. An aerial radiological survey of the Oak Ridge Reservation, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    Maurer, R.J.

    1993-04-01

    An aerial radiological survey of the Oak Ridge Reservation (ORR) and surrounding area in Oak Ridge, Tennessee, was conducted during the period March 30 to April 14,1992. The purpose of the survey was to measure and document the terrestrial radiological environment of the Oak Ridge Reservation for use in environmental management programs and emergency response planning. The aerial survey was flown at an altitude of 150 feet (46 meters) along a series of parallel lines 250 feet (76 meters) apart and included X-10 (Oak Ridge National Laboratory), K-25 (former Gaseous Diffusion Plant), Y-12 (Weapons Production Plant), the Freels Bend Area and Oak Ridge Institute for Science and Education, the East Fork Poplar Creek (100-year floodplain extending from K-25 to Y-12), Elza Gate (former uranium ore storage site located in the city of Oak Ridge), Parcel A, the Clinch River (river banks extending from Melton Hill Dam to the city of Kingston), and the CSX Railroad Tracks (extending from Y-12 to the city of Oak Ridge). The survey encompassed approximately 55 square miles (1 41 square kilometers) of the Oak Ridge Reservation and surrounding area

  10. Petrological systematics of mid-ocean ridge basalts: Constraints on melt generation beneath ocean ridges

    Science.gov (United States)

    Langmuir, Charles H.; Klein, Emily M.; Plank, Terry

    ridges, and still more enriched basalts can erupt sporadically along the entire length of the EPR. This leads to very different histograms of distribution for the data sets as a whole, and a very different distribution of chemistry along strike for the two ridges. Despite these differences, the mean Ce/Sm ratios from the two ridges are identical. Existing methods for calculating the major element compositions of mantle melts [Klein and Langmuir, 1987; McKenzie and Bickle, 1988; Niu and Batiza, 1991] are critically examined. New quantitative methods for mantle melting and high pressure fractionation are developed to evaluate the chemical consequences of melting and fractionation processes and mantle heterogeneity. The new methods rely on new equations for partition coefficients for the major elements between mantle minerals and melts. The melting calculations can be used to investigate the chemical compositions produced by small extents of melting or high pressures of melting that cannot yet be determined experimentally. Application of the new models to the observations described above leads to two major conclusions: (1) The global correlations for normal ridges are caused by variations in mantle temperature, as suggested by Klein and Langmuir [1987] and not by mantle heterogeneity. (2) Local variations are caused by melting processes, but are not yet quantitatively accounted for. On slower spreading ridges, local variations are controlled by the melting regime in the mantle. On the EPR, local variations are predominantly controlled by ubiquitous, small scale heterogeneites. Volatile content may be an important and as yet undetermined factor in affecting the observed variations in major elements. We propose a hypothesis, similar to one proposed by Allegre et al [1984] for isotopic data, to explain the differences between the Atlantic and Pacific local trends, and the trace element systematics of the two ocean basins, as consequences of spreading rate and a different

  11. Ridge, Bulk, and Medium Response: How to Kill Models and Learn Something in the Process

    International Nuclear Information System (INIS)

    Nagle, J.L.

    2009-01-01

    In these proceedings, we highlight experimental data (published and preliminary) related to jet quenching and the response of the medium to this deposited energy. Signatures in two- and three-particle hadron correlations indicate interesting structures near the trigger particle in azimuth and over a broad range in pseudo-rapidity, often termed 'the ridge', and conical-like structures separated in azimuth opposite to the trigger particle. We review numerous theoretical interpretations of the ridge in particular with a critical eye for the key properties that allow one to discriminate between, or rule out, certain physical pictures and models (and hopefully learn something in the process).

  12. Geographically weighted regression model on poverty indicator

    Science.gov (United States)

    Slamet, I.; Nugroho, N. F. T. A.; Muslich

    2017-12-01

    In this research, we applied geographically weighted regression (GWR) for analyzing the poverty in Central Java. We consider Gaussian Kernel as weighted function. The GWR uses the diagonal matrix resulted from calculating kernel Gaussian function as a weighted function in the regression model. The kernel weights is used to handle spatial effects on the data so that a model can be obtained for each location. The purpose of this paper is to model of poverty percentage data in Central Java province using GWR with Gaussian kernel weighted function and to determine the influencing factors in each regency/city in Central Java province. Based on the research, we obtained geographically weighted regression model with Gaussian kernel weighted function on poverty percentage data in Central Java province. We found that percentage of population working as farmers, population growth rate, percentage of households with regular sanitation, and BPJS beneficiaries are the variables that affect the percentage of poverty in Central Java province. In this research, we found the determination coefficient R2 are 68.64%. There are two categories of district which are influenced by different of significance factors.

  13. Mantle Convection beneath the Aegir Ridge, a Shadow in the Iceland Hotspot

    Science.gov (United States)

    Howell, S. M.; Ito, G.; Breivik, A. J.; Hanan, B. B.; Mjelde, R.; Sayit, K.; Vogt, P. R.

    2012-12-01

    The Iceland Hotspot has produced extensive volcanism spanning much of the ocean basin between Greenland and Norway, forming one of the world's largest igneous provinces. However, an apparent igneous "shadow" in hotspot activity is located at the fossil Aegir Ridge, which formed anomalously thin crust, despite this ridge being near the Iceland hotspot when it was active. The Aegir Ridge accommodated seafloor spreading northeast of present-day Iceland from the time of continental breakup at ~55 Ma until ~25 Ma, at which point spreading shifted west to the Kolbeinsey Ridge. To address the cause of the anomalously thin crust produced by the Aegir Ridge, we use three-dimensional numerical models to simulate the interaction between a mantle plume beneath the Iceland hotspot, rifting continental lithosphere, and the time-evolving North Atlantic ridge system. Two end-member hypotheses were investigated: (1) Material emanating from the Iceland mantle plume was blocked from reaching the Aegir Ridge by the thick lithosphere of the Jan Mayen Microcontinent as the Kolbeinsey Ridge began rifting it from Greenland at ~30 Ma, just east of the plume center; (2) Plume material was not blocked and did reach the Aegir Ridge, but had already experienced partial melting closer to the hotspot. This material was then unable to produce melt volumes at the Aegir Ridge comparable to those of pristine mantle. To test these hypotheses, we vary the volume flux and viscosity of the plume, and identify which conditions do and do not lead to the Aegir Ridge forming anomalously thin crust. Results show that the combination of plume material being drawn into the lithospheric channels beneath the Reykjanes Ridge and Kolbeinsey Ridge after their respective openings, and the impedance of plume flow by the Jan Mayen Microcontinent (hypothesis 1), can deprive the Aegir Ridge of plume influence. This leads to low crustal thicknesses that are comparable to those observed. We have yet to produce a model

  14. On a Robust MaxEnt Process Regression Model with Sample-Selection

    Directory of Open Access Journals (Sweden)

    Hea-Jung Kim

    2018-04-01

    Full Text Available In a regression analysis, a sample-selection bias arises when a dependent variable is partially observed as a result of the sample selection. This study introduces a Maximum Entropy (MaxEnt process regression model that assumes a MaxEnt prior distribution for its nonparametric regression function and finds that the MaxEnt process regression model includes the well-known Gaussian process regression (GPR model as a special case. Then, this special MaxEnt process regression model, i.e., the GPR model, is generalized to obtain a robust sample-selection Gaussian process regression (RSGPR model that deals with non-normal data in the sample selection. Various properties of the RSGPR model are established, including the stochastic representation, distributional hierarchy, and magnitude of the sample-selection bias. These properties are used in the paper to develop a hierarchical Bayesian methodology to estimate the model. This involves a simple and computationally feasible Markov chain Monte Carlo algorithm that avoids analytical or numerical derivatives of the log-likelihood function of the model. The performance of the RSGPR model in terms of the sample-selection bias correction, robustness to non-normality, and prediction, is demonstrated through results in simulations that attest to its good finite-sample performance.

  15. On concurvity in nonlinear and nonparametric regression models

    Directory of Open Access Journals (Sweden)

    Sonia Amodio

    2014-12-01

    Full Text Available When data are affected by multicollinearity in the linear regression framework, then concurvity will be present in fitting a generalized additive model (GAM. The term concurvity describes nonlinear dependencies among the predictor variables. As collinearity results in inflated variance of the estimated regression coefficients in the linear regression model, the result of the presence of concurvity leads to instability of the estimated coefficients in GAMs. Even if the backfitting algorithm will always converge to a solution, in case of concurvity the final solution of the backfitting procedure in fitting a GAM is influenced by the starting functions. While exact concurvity is highly unlikely, approximate concurvity, the analogue of multicollinearity, is of practical concern as it can lead to upwardly biased estimates of the parameters and to underestimation of their standard errors, increasing the risk of committing type I error. We compare the existing approaches to detect concurvity, pointing out their advantages and drawbacks, using simulated and real data sets. As a result, this paper will provide a general criterion to detect concurvity in nonlinear and non parametric regression models.

  16. Semiparametric Mixtures of Regressions with Single-index for Model Based Clustering

    OpenAIRE

    Xiang, Sijia; Yao, Weixin

    2017-01-01

    In this article, we propose two classes of semiparametric mixture regression models with single-index for model based clustering. Unlike many semiparametric/nonparametric mixture regression models that can only be applied to low dimensional predictors, the new semiparametric models can easily incorporate high dimensional predictors into the nonparametric components. The proposed models are very general, and many of the recently proposed semiparametric/nonparametric mixture regression models a...

  17. Short-term electricity prices forecasting based on support vector regression and Auto-regressive integrated moving average modeling

    International Nuclear Information System (INIS)

    Che Jinxing; Wang Jianzhou

    2010-01-01

    In this paper, we present the use of different mathematical models to forecast electricity price under deregulated power. A successful prediction tool of electricity price can help both power producers and consumers plan their bidding strategies. Inspired by that the support vector regression (SVR) model, with the ε-insensitive loss function, admits of the residual within the boundary values of ε-tube, we propose a hybrid model that combines both SVR and Auto-regressive integrated moving average (ARIMA) models to take advantage of the unique strength of SVR and ARIMA models in nonlinear and linear modeling, which is called SVRARIMA. A nonlinear analysis of the time-series indicates the convenience of nonlinear modeling, the SVR is applied to capture the nonlinear patterns. ARIMA models have been successfully applied in solving the residuals regression estimation problems. The experimental results demonstrate that the model proposed outperforms the existing neural-network approaches, the traditional ARIMA models and other hybrid models based on the root mean square error and mean absolute percentage error.

  18. A Comparison of Advanced Regression Algorithms for Quantifying Urban Land Cover

    Directory of Open Access Journals (Sweden)

    Akpona Okujeni

    2014-07-01

    Full Text Available Quantitative methods for mapping sub-pixel land cover fractions are gaining increasing attention, particularly with regard to upcoming hyperspectral satellite missions. We evaluated five advanced regression algorithms combined with synthetically mixed training data for quantifying urban land cover from HyMap data at 3.6 and 9 m spatial resolution. Methods included support vector regression (SVR, kernel ridge regression (KRR, artificial neural networks (NN, random forest regression (RFR and partial least squares regression (PLSR. Our experiments demonstrate that both kernel methods SVR and KRR yield high accuracies for mapping complex urban surface types, i.e., rooftops, pavements, grass- and tree-covered areas. SVR and KRR models proved to be stable with regard to the spatial and spectral differences between both images and effectively utilized the higher complexity of the synthetic training mixtures for improving estimates for coarser resolution data. Observed deficiencies mainly relate to known problems arising from spectral similarities or shadowing. The remaining regressors either revealed erratic (NN or limited (RFR and PLSR performances when comprehensively mapping urban land cover. Our findings suggest that the combination of kernel-based regression methods, such as SVR and KRR, with synthetically mixed training data is well suited for quantifying urban land cover from imaging spectrometer data at multiple scales.

  19. Modeling oil production based on symbolic regression

    International Nuclear Information System (INIS)

    Yang, Guangfei; Li, Xianneng; Wang, Jianliang; Lian, Lian; Ma, Tieju

    2015-01-01

    Numerous models have been proposed to forecast the future trends of oil production and almost all of them are based on some predefined assumptions with various uncertainties. In this study, we propose a novel data-driven approach that uses symbolic regression to model oil production. We validate our approach on both synthetic and real data, and the results prove that symbolic regression could effectively identify the true models beneath the oil production data and also make reliable predictions. Symbolic regression indicates that world oil production will peak in 2021, which broadly agrees with other techniques used by researchers. Our results also show that the rate of decline after the peak is almost half the rate of increase before the peak, and it takes nearly 12 years to drop 4% from the peak. These predictions are more optimistic than those in several other reports, and the smoother decline will provide the world, especially the developing countries, with more time to orchestrate mitigation plans. -- Highlights: •A data-driven approach has been shown to be effective at modeling the oil production. •The Hubbert model could be discovered automatically from data. •The peak of world oil production is predicted to appear in 2021. •The decline rate after peak is half of the increase rate before peak. •Oil production projected to decline 4% post-peak

  20. Genomic-Enabled Prediction Based on Molecular Markers and Pedigree Using the Bayesian Linear Regression Package in R

    Directory of Open Access Journals (Sweden)

    Paulino Pérez

    2010-09-01

    Full Text Available The availability of dense molecular markers has made possible the use of genomic selection in plant and animal breeding. However, models for genomic selection pose several computational and statistical challenges and require specialized computer programs, not always available to the end user and not implemented in standard statistical software yet. The R-package BLR (Bayesian Linear Regression implements several statistical procedures (e.g., Bayesian Ridge Regression, Bayesian LASSO in a unified framework that allows including marker genotypes and pedigree data jointly. This article describes the classes of models implemented in the BLR package and illustrates their use through examples. Some challenges faced when applying genomic-enabled selection, such as model choice, evaluation of predictive ability through cross-validation, and choice of hyper-parameters, are also addressed.

  1. Wrinkle Ridges and Young Fresh Crater

    Science.gov (United States)

    2002-01-01

    (Released 10 May 2002) The Science Wrinkle ridges are a very common landform on Mars, Mercury, Venus, and the Moon. These ridges are linear to arcuate asymmetric topographic highs commonly found on smooth plains. The origin of wrinkle ridges is not certain and two leading hypotheses have been put forth by scientists over the past 40 years. The volcanic model calls for the extrusion of high viscosity lavas along linear conduits. This thick lava accumulated over these conduits and formed the ridges. The other model is tectonic and advocates that the ridges are formed by compressional faulting and folding. Today's THEMIS image is of the ridged plains of Lunae Planum located between Kasei Valles and Valles Marineris in the northern hemisphere of the planet. Wrinkle ridges are found mostly along the eastern side of the image. The broadest wrinkle ridges in this image are up to 2 km wide. A 3 km diameter young fresh crater is located near the bottom of the image. The crater's ejecta blanket is also clearly seen surrounding the sharp well-defined crater rim. These features are indicative of a very young crater that has not been subjected to erosional processes. The Story The great thing about the solar system is that planets are both alike and different. They're all foreign enough to be mysterious and intriguing, and yet familiar enough to be seen as planetary 'cousins.' By comparing them, we can learn a lot about how planets form and then evolve geologically over time. Crinkled over smooth plains, the long, wavy raised landforms seen here are called 'wrinkle ridges,' and they've been found on Mars, Mercury, Venus, and the Moon - that is, on rocky bodies that are a part of our inner solar system. We know from this observation that planets (and large-enough moons) follow similar processes. What we don't know for sure is HOW these processes work. Scientists have been trying to understand how wrinkle ridges form for 40 years, and they still haven't reached a conclusion. That

  2. The Oak Ridge Competitive Electricity Dispatch (ORCED) Model Version 9

    Energy Technology Data Exchange (ETDEWEB)

    Hadley, Stanton W. [Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States); Baek, Young Sun [Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States)

    2016-11-01

    The Oak Ridge Competitive Electricity Dispatch (ORCED) model dispatches power plants in a region to meet the electricity demands for any single given year up to 2030. It uses publicly available sources of data describing electric power units such as the National Energy Modeling System and hourly demands from utility submittals to the Federal Energy Regulatory Commission that are projected to a future year. The model simulates a single region of the country for a given year, matching generation to demands and predefined net exports from the region, assuming no transmission constraints within the region. ORCED can calculate a number of key financial and operating parameters for generating units and regional market outputs including average and marginal prices, air emissions, and generation adequacy. By running the model with and without changes such as generation plants, fuel prices, emission costs, plug-in hybrid electric vehicles, distributed generation, or demand response, the marginal impact of these changes can be found.

  3. Model performance analysis and model validation in logistic regression

    Directory of Open Access Journals (Sweden)

    Rosa Arboretti Giancristofaro

    2007-10-01

    Full Text Available In this paper a new model validation procedure for a logistic regression model is presented. At first, we illustrate a brief review of different techniques of model validation. Next, we define a number of properties required for a model to be considered "good", and a number of quantitative performance measures. Lastly, we describe a methodology for the assessment of the performance of a given model by using an example taken from a management study.

  4. Model building strategy for logistic regression: purposeful selection.

    Science.gov (United States)

    Zhang, Zhongheng

    2016-03-01

    Logistic regression is one of the most commonly used models to account for confounders in medical literature. The article introduces how to perform purposeful selection model building strategy with R. I stress on the use of likelihood ratio test to see whether deleting a variable will have significant impact on model fit. A deleted variable should also be checked for whether it is an important adjustment of remaining covariates. Interaction should be checked to disentangle complex relationship between covariates and their synergistic effect on response variable. Model should be checked for the goodness-of-fit (GOF). In other words, how the fitted model reflects the real data. Hosmer-Lemeshow GOF test is the most widely used for logistic regression model.

  5. The APT model as reduced-rank regression

    NARCIS (Netherlands)

    Bekker, P.A.; Dobbelstein, P.; Wansbeek, T.J.

    Integrating the two steps of an arbitrage pricing theory (APT) model leads to a reduced-rank regression (RRR) model. So the results on RRR can be used to estimate APT models, making estimation very simple. We give a succinct derivation of estimation of RRR, derive the asymptotic variance of RRR

  6. Modelling subject-specific childhood growth using linear mixed-effect models with cubic regression splines.

    Science.gov (United States)

    Grajeda, Laura M; Ivanescu, Andrada; Saito, Mayuko; Crainiceanu, Ciprian; Jaganath, Devan; Gilman, Robert H; Crabtree, Jean E; Kelleher, Dermott; Cabrera, Lilia; Cama, Vitaliano; Checkley, William

    2016-01-01

    Childhood growth is a cornerstone of pediatric research. Statistical models need to consider individual trajectories to adequately describe growth outcomes. Specifically, well-defined longitudinal models are essential to characterize both population and subject-specific growth. Linear mixed-effect models with cubic regression splines can account for the nonlinearity of growth curves and provide reasonable estimators of population and subject-specific growth, velocity and acceleration. We provide a stepwise approach that builds from simple to complex models, and account for the intrinsic complexity of the data. We start with standard cubic splines regression models and build up to a model that includes subject-specific random intercepts and slopes and residual autocorrelation. We then compared cubic regression splines vis-à-vis linear piecewise splines, and with varying number of knots and positions. Statistical code is provided to ensure reproducibility and improve dissemination of methods. Models are applied to longitudinal height measurements in a cohort of 215 Peruvian children followed from birth until their fourth year of life. Unexplained variability, as measured by the variance of the regression model, was reduced from 7.34 when using ordinary least squares to 0.81 (p linear mixed-effect models with random slopes and a first order continuous autoregressive error term. There was substantial heterogeneity in both the intercept (p modeled with a first order continuous autoregressive error term as evidenced by the variogram of the residuals and by a lack of association among residuals. The final model provides a parametric linear regression equation for both estimation and prediction of population- and individual-level growth in height. We show that cubic regression splines are superior to linear regression splines for the case of a small number of knots in both estimation and prediction with the full linear mixed effect model (AIC 19,352 vs. 19

  7. Influence diagnostics in meta-regression model.

    Science.gov (United States)

    Shi, Lei; Zuo, ShanShan; Yu, Dalei; Zhou, Xiaohua

    2017-09-01

    This paper studies the influence diagnostics in meta-regression model including case deletion diagnostic and local influence analysis. We derive the subset deletion formulae for the estimation of regression coefficient and heterogeneity variance and obtain the corresponding influence measures. The DerSimonian and Laird estimation and maximum likelihood estimation methods in meta-regression are considered, respectively, to derive the results. Internal and external residual and leverage measure are defined. The local influence analysis based on case-weights perturbation scheme, responses perturbation scheme, covariate perturbation scheme, and within-variance perturbation scheme are explored. We introduce a method by simultaneous perturbing responses, covariate, and within-variance to obtain the local influence measure, which has an advantage of capable to compare the influence magnitude of influential studies from different perturbations. An example is used to illustrate the proposed methodology. Copyright © 2017 John Wiley & Sons, Ltd.

  8. Logistic Regression Modeling of Diminishing Manufacturing Sources for Integrated Circuits

    National Research Council Canada - National Science Library

    Gravier, Michael

    1999-01-01

    .... The research identified logistic regression as a powerful tool for analysis of DMSMS and further developed twenty models attempting to identify the "best" way to model and predict DMSMS using logistic regression...

  9. Geophysical studies of aseismic ridges in northern Indian Ocean-crustal structure and isostatic models

    Digital Repository Service at National Institute of Oceanography (India)

    Sreejith, K.M.

    . The important conclusions derived from this study are as follows. 1. The Comorin Ridge extends for about 500 km in NNW-SSE direction in north central Indian Ocean, close to the southern tip of India and western continental margin of Sri Lanka. The ridge... topography has variable reliefs along its strike and uneven gradients on both sides of the ridge flanks. The southern part of the ridge (between 1.5°N and 3°N) has an elevation of up to 0.5 km from the adjacent seafloor of about 4 km deep; in the central...

  10. A correlated-cluster model and the ridge phenomenon in hadron–hadron collisions

    Energy Technology Data Exchange (ETDEWEB)

    Sanchis-Lozano, Miguel-Angel, E-mail: Miguel.Angel.Sanchis@ific.uv.es [Instituto de Física Corpuscular (IFIC) and Departamento de Física Teórica, Centro Mixto Universitat de València-CSIC, Dr. Moliner 50, E-46100 Burjassot, Valencia (Spain); Sarkisyan-Grinbaum, Edward, E-mail: sedward@cern.ch [Experimental Physics Department, CERN, 1211 Geneva 23 (Switzerland); Department of Physics, The University of Texas at Arlington, Arlington, TX 76019 (United States)

    2017-03-10

    A study of the near-side ridge phenomenon in hadron–hadron collisions based on a cluster picture of multiparticle production is presented. The near-side ridge effect is shown to have a natural explanation in this context provided that clusters are produced in a correlated manner in the collision transverse plane.

  11. A correlated-cluster model and the ridge phenomenon in hadron-hadron collisions

    CERN Document Server

    Sanchis-Lozano, Miguel-Angel

    2017-03-10

    A study of the near-side ridge phenomenon in hadron-hadron collisions based on a cluster picture of multiparticle production is presented. The near-side ridge effect is shown to have a natural explanation in this context provided that clusters are produced in a correlated manner in the collision transverse plane.

  12. Analysis of dental caries using generalized linear and count regression models

    Directory of Open Access Journals (Sweden)

    Javali M. Phil

    2013-11-01

    Full Text Available Generalized linear models (GLM are generalization of linear regression models, which allow fitting regression models to response data in all the sciences especially medical and dental sciences that follow a general exponential family. These are flexible and widely used class of such models that can accommodate response variables. Count data are frequently characterized by overdispersion and excess zeros. Zero-inflated count models provide a parsimonious yet powerful way to model this type of situation. Such models assume that the data are a mixture of two separate data generation processes: one generates only zeros, and the other is either a Poisson or a negative binomial data-generating process. Zero inflated count regression models such as the zero-inflated Poisson (ZIP, zero-inflated negative binomial (ZINB regression models have been used to handle dental caries count data with many zeros. We present an evaluation framework to the suitability of applying the GLM, Poisson, NB, ZIP and ZINB to dental caries data set where the count data may exhibit evidence of many zeros and over-dispersion. Estimation of the model parameters using the method of maximum likelihood is provided. Based on the Vuong test statistic and the goodness of fit measure for dental caries data, the NB and ZINB regression models perform better than other count regression models.

  13. Thickness of Knox Group overburden on Central Chestnut Ridge, Oak Ridge Reservation

    International Nuclear Information System (INIS)

    Staub, W.P.; Hopkins, R.A.

    1984-05-01

    The thickness of residual soil overlying the Knox Group along Central Chestnut Ridge was estimated by a conventional seismic refraction survey. The purpose of this survey was to identify sites on the Department of Energy's Oak Ridge Reservation where ample overburden exists above the water table for the shallow land burial of low-level radioactive waste. The results of the survey suggest that the upper slopes of the higher ridges in the area have a minimum of 16 to 26 m (52 to 85 ft) of overburden and that the crests of these ridges may have more than 30 m (100 ft). Therefore, it is unlikely that sound bedrock would be encountered during trench excavation [maximum of 10 m (32 ft)] along Central Chestnut Ridge. Also, the relatively low seismic wave velocities measured in the overburden suggest that the water table is generally deep. On the basis of these preliminary results, Central Chestnut Ridge appears to be suitable for further site characterization for the shallow land burial of low-level radioactive waste. 3 references, 5 figures, 1 table

  14. AIRLINE ACTIVITY FORECASTING BY REGRESSION MODELS

    Directory of Open Access Journals (Sweden)

    Н. Білак

    2012-04-01

    Full Text Available Proposed linear and nonlinear regression models, which take into account the equation of trend and seasonality indices for the analysis and restore the volume of passenger traffic over the past period of time and its prediction for future years, as well as the algorithm of formation of these models based on statistical analysis over the years. The desired model is the first step for the synthesis of more complex models, which will enable forecasting of passenger (income level airline with the highest accuracy and time urgency.

  15. Inconsistency of Bayesian inference for misspecified linear models, and a proposal for repairing it

    NARCIS (Netherlands)

    P.D. Grünwald (Peter); T. van Ommen (Thijs)

    2017-01-01

    textabstractWe empirically show that Bayesian inference can be inconsistent under misspecification in simple linear regression problems, both in a model averaging/selection and in a Bayesian ridge regression setting. We use the standard linear model, which assumes homoskedasticity, whereas the data

  16. Inconsistency of Bayesian Inference for Misspecified Linear Models, and a Proposal for Repairing It

    NARCIS (Netherlands)

    Grünwald, P.; van Ommen, T.

    2017-01-01

    We empirically show that Bayesian inference can be inconsistent under misspecification in simple linear regression problems, both in a model averaging/selection and in a Bayesian ridge regression setting. We use the standard linear model, which assumes homoskedasticity, whereas the data are

  17. [Application of detecting and taking overdispersion into account in Poisson regression model].

    Science.gov (United States)

    Bouche, G; Lepage, B; Migeot, V; Ingrand, P

    2009-08-01

    Researchers often use the Poisson regression model to analyze count data. Overdispersion can occur when a Poisson regression model is used, resulting in an underestimation of variance of the regression model parameters. Our objective was to take overdispersion into account and assess its impact with an illustration based on the data of a study investigating the relationship between use of the Internet to seek health information and number of primary care consultations. Three methods, overdispersed Poisson, a robust estimator, and negative binomial regression, were performed to take overdispersion into account in explaining variation in the number (Y) of primary care consultations. We tested overdispersion in the Poisson regression model using the ratio of the sum of Pearson residuals over the number of degrees of freedom (chi(2)/df). We then fitted the three models and compared parameter estimation to the estimations given by Poisson regression model. Variance of the number of primary care consultations (Var[Y]=21.03) was greater than the mean (E[Y]=5.93) and the chi(2)/df ratio was 3.26, which confirmed overdispersion. Standard errors of the parameters varied greatly between the Poisson regression model and the three other regression models. Interpretation of estimates from two variables (using the Internet to seek health information and single parent family) would have changed according to the model retained, with significant levels of 0.06 and 0.002 (Poisson), 0.29 and 0.09 (overdispersed Poisson), 0.29 and 0.13 (use of a robust estimator) and 0.45 and 0.13 (negative binomial) respectively. Different methods exist to solve the problem of underestimating variance in the Poisson regression model when overdispersion is present. The negative binomial regression model seems to be particularly accurate because of its theorical distribution ; in addition this regression is easy to perform with ordinary statistical software packages.

  18. Variable Selection for Regression Models of Percentile Flows

    Science.gov (United States)

    Fouad, G.

    2017-12-01

    Percentile flows describe the flow magnitude equaled or exceeded for a given percent of time, and are widely used in water resource management. However, these statistics are normally unavailable since most basins are ungauged. Percentile flows of ungauged basins are often predicted using regression models based on readily observable basin characteristics, such as mean elevation. The number of these independent variables is too large to evaluate all possible models. A subset of models is typically evaluated using automatic procedures, like stepwise regression. This ignores a large variety of methods from the field of feature (variable) selection and physical understanding of percentile flows. A study of 918 basins in the United States was conducted to compare an automatic regression procedure to the following variable selection methods: (1) principal component analysis, (2) correlation analysis, (3) random forests, (4) genetic programming, (5) Bayesian networks, and (6) physical understanding. The automatic regression procedure only performed better than principal component analysis. Poor performance of the regression procedure was due to a commonly used filter for multicollinearity, which rejected the strongest models because they had cross-correlated independent variables. Multicollinearity did not decrease model performance in validation because of a representative set of calibration basins. Variable selection methods based strictly on predictive power (numbers 2-5 from above) performed similarly, likely indicating a limit to the predictive power of the variables. Similar performance was also reached using variables selected based on physical understanding, a finding that substantiates recent calls to emphasize physical understanding in modeling for predictions in ungauged basins. The strongest variables highlighted the importance of geology and land cover, whereas widely used topographic variables were the weakest predictors. Variables suffered from a high

  19. Microgravity survey of the Oak Ridge K-25 Site, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    Kaufmann, R.D.

    1996-05-01

    Karst features are known to exist within the carbonate bedrock of the Oak Ridge K-25 Site and may play an important role in groundwater flow and contaminant migration. This report discusses the results of a microgravity survey of the Oak Ridge K-25 Site. The main objective of the survey is to identify areas containing bedrock cavities. Secondary objectives included correlating the observed gravity to the geology and to variations in overburden thickness. The analysis includes 11 profile lines that are oriented perpendicular to the geologic strike and major structures throughout the K-25 Site. The profile lines are modeled in an effort to relate gravity anomalies to karst features such as concentrations of mud-filled cavities. Regolith thickness and density data provided by boreholes constrain the models. Areally distributed points are added to the profile lines to produce a gravity contour map of the site. In addition, data from the K-901 area are combined with data from previous surveys to produce a high resolution map of that site. The K-25 Site is located in an area of folded and faulted sedimentary rocks within the Appalachian Valley and Ridge physiographic province. Paleozoic age rocks of the Rome Formation, Knox Group, and Chickamauga Supergroup underlie the K-25 Site and contain structures that include the Whiteoak Mountain Fault, the K-25 Fault, a syncline, and an anticline. The mapped locations of the rock units and complex structures are currently derived from outcrop and well log analysis

  20. [Evaluation of estimation of prevalence ratio using bayesian log-binomial regression model].

    Science.gov (United States)

    Gao, W L; Lin, H; Liu, X N; Ren, X W; Li, J S; Shen, X P; Zhu, S L

    2017-03-10

    To evaluate the estimation of prevalence ratio ( PR ) by using bayesian log-binomial regression model and its application, we estimated the PR of medical care-seeking prevalence to caregivers' recognition of risk signs of diarrhea in their infants by using bayesian log-binomial regression model in Openbugs software. The results showed that caregivers' recognition of infant' s risk signs of diarrhea was associated significantly with a 13% increase of medical care-seeking. Meanwhile, we compared the differences in PR 's point estimation and its interval estimation of medical care-seeking prevalence to caregivers' recognition of risk signs of diarrhea and convergence of three models (model 1: not adjusting for the covariates; model 2: adjusting for duration of caregivers' education, model 3: adjusting for distance between village and township and child month-age based on model 2) between bayesian log-binomial regression model and conventional log-binomial regression model. The results showed that all three bayesian log-binomial regression models were convergence and the estimated PRs were 1.130(95 %CI : 1.005-1.265), 1.128(95 %CI : 1.001-1.264) and 1.132(95 %CI : 1.004-1.267), respectively. Conventional log-binomial regression model 1 and model 2 were convergence and their PRs were 1.130(95 % CI : 1.055-1.206) and 1.126(95 % CI : 1.051-1.203), respectively, but the model 3 was misconvergence, so COPY method was used to estimate PR , which was 1.125 (95 %CI : 1.051-1.200). In addition, the point estimation and interval estimation of PRs from three bayesian log-binomial regression models differed slightly from those of PRs from conventional log-binomial regression model, but they had a good consistency in estimating PR . Therefore, bayesian log-binomial regression model can effectively estimate PR with less misconvergence and have more advantages in application compared with conventional log-binomial regression model.

  1. Explorative methods in linear models

    DEFF Research Database (Denmark)

    Høskuldsson, Agnar

    2004-01-01

    The author has developed the H-method of mathematical modeling that builds up the model by parts, where each part is optimized with respect to prediction. Besides providing with better predictions than traditional methods, these methods provide with graphic procedures for analyzing different feat...... features in data. These graphic methods extend the well-known methods and results of Principal Component Analysis to any linear model. Here the graphic procedures are applied to linear regression and Ridge Regression....

  2. Geographically Weighted Logistic Regression Applied to Credit Scoring Models

    Directory of Open Access Journals (Sweden)

    Pedro Henrique Melo Albuquerque

    Full Text Available Abstract This study used real data from a Brazilian financial institution on transactions involving Consumer Direct Credit (CDC, granted to clients residing in the Distrito Federal (DF, to construct credit scoring models via Logistic Regression and Geographically Weighted Logistic Regression (GWLR techniques. The aims were: to verify whether the factors that influence credit risk differ according to the borrower’s geographic location; to compare the set of models estimated via GWLR with the global model estimated via Logistic Regression, in terms of predictive power and financial losses for the institution; and to verify the viability of using the GWLR technique to develop credit scoring models. The metrics used to compare the models developed via the two techniques were the AICc informational criterion, the accuracy of the models, the percentage of false positives, the sum of the value of false positive debt, and the expected monetary value of portfolio default compared with the monetary value of defaults observed. The models estimated for each region in the DF were distinct in their variables and coefficients (parameters, with it being concluded that credit risk was influenced differently in each region in the study. The Logistic Regression and GWLR methodologies presented very close results, in terms of predictive power and financial losses for the institution, and the study demonstrated viability in using the GWLR technique to develop credit scoring models for the target population in the study.

  3. Modeling maximum daily temperature using a varying coefficient regression model

    Science.gov (United States)

    Han Li; Xinwei Deng; Dong-Yum Kim; Eric P. Smith

    2014-01-01

    Relationships between stream water and air temperatures are often modeled using linear or nonlinear regression methods. Despite a strong relationship between water and air temperatures and a variety of models that are effective for data summarized on a weekly basis, such models did not yield consistently good predictions for summaries such as daily maximum temperature...

  4. Lepini Mountains Carbonatic Ridge: try of springs recharge areas verification and water exchange quantification with Pontina Plain by use of a numerical model (Central Italy

    Directory of Open Access Journals (Sweden)

    Pamela Teoli

    2014-03-01

    Full Text Available The study area of this work is represented by the Lepini Mountains carbonatic ridge and by the Pontina Plain foothills area, on which in the past, within quantitative hydrogeological characterizations, models were developed for calculating the groundwater flow, but only referred to the ridge. The most recent studies (Teoli, 2012 have done their best, instead, to represent the underground water exchanges between the ridge and the Pontina Plain foothill area. The new model (developed using computer code MODFLOW 2005 has been implemented to simulate steady-state underground flow using equivalent porous media approach even for the ridge; attention has been particularly directed to the proper tectonic ridge schematic, which the authors had previously defined, together with others (Alimonti et al., 2010, on detailed structural-geological survey basis, integrated by hydrogeological analysis. So, it’s been possible to determine partitioning effects on groundwater flowpaths and on springs recharge areas extent, whose total average discharge is about 10m3/s. Model calibration main goal has been the recharge areas permeability definition, posing the correspondence of calculated flows with measured springs’ flows; as a consequence, it’s been possible to improve the model reliability (uncertainty reduction quantifying the flow residuals’ standard deviation offset.

  5. Observation of pressure ridges in SAR images of sea ice: Scattering theory and comparison with observations

    Science.gov (United States)

    Vesecky, J. F.; Daida, J. M.; Shuchman, R. A.; Onstott, R. H.; Camiso, J. C.

    1993-01-01

    Ridges and keels (hummocks and bummocks) in sea ice flows are important in sea ice research for both scientific and practical reasons. Sea ice movement and deformation is driven by internal and external stresses on the ice. Ridges and keels play important roles in both cases because they determine the external wind and current stresses via drag coefficients. For example, the drag coefficient over sea ice can vary by a factor of several depending on the fluid mechanical roughness length of the surface. This roughness length is thought to be strongly dependent on the ridge structures present. Thus, variations in ridge and keel structure can cause gradients in external stresses which must be balanced by internal stresses and possibly fracture of the ice. Ridging in sea ice is also a sign of fracture. In a practical sense, large ridges form the biggest impediment to surface travel over the ice or penetration through sea ice by ice-strengthened ships. Ridges also play an important role in the damage caused by sea ice to off-shore structures. Hence, observation and measurement of sea ice ridges is an important component of sea ice remote sensing. The research reported here builds on previous work, estimating the characteristics of ridges and leads in sea ice from SAR images. Our objective is to develop methods for quantitative measurement of sea ice ridges from SAR images. To make further progress, in particular, to estimate ridge height, a scattering model for ridges is needed. Our research approach for a ridge scattering model begins with a survey of the geometrical properties of ridges and a comparison with the characteristics of the surrounding ice. For this purpose we have used airborne optical laser (AOL) data collected during the 1987 Greenland Sea Experiment. These data were used to generate a spatial wavenumber spectrum for height variance for a typical ridge - the typical ridge is the average over 10 large ridges. Our first-order model radar scattering includes

  6. Measurement of pressure ridges in SAR images of sea ice - Preliminary results on scattering theory

    Science.gov (United States)

    Vesecky, J. F.; Smith, M. P.; Daida, J. M.; Samadani, R.; Camiso, J. C.

    1992-01-01

    Sea ice ridges and keels (hummocks and bummocks) are important in sea ice research for both scientific and practical reasons. A long-term objective is to make quantitative measurements of sea ice ridges using synthetic aperture radar (SAR) images. The preliminary results of a scattering model for sea ice ridge are reported. The approach is through the ridge height variance spectrum Psi(K), where K is the spatial wavenumber, and the two-scale scattering model. The height spectrum model is constructed to mimic height statistics observed with an airborne optical laser. The spectrum model is used to drive a two-scale scattering model. Model results for ridges observed at C- and X-band yield normalized radar cross sections that are 10 to 15 dB larger than the observed cross sections of multiyear ice over the range of angles of incidence from 10 to 70 deg.

  7. Prediction of facial height, width, and ratio from thumbprints ridge count and its possible applications

    Directory of Open Access Journals (Sweden)

    Lawan Hassan Adamu

    2017-01-01

    Full Text Available The fingerprints and face recognition are two biometric processes that comprise methods for uniquely recognizing humans based on certain number of intrinsic physical or behavioral traits. The objectives of the study were to predict the facial height (FH, facial width, and ratios from thumbprints ridge count and its possible applications. This was a cross-sectional study. A total of 457 participants were recruited. A fingerprint live scanner was used to capture the plain thumbprint. The facial photograph was captured using a digital camera. Pearson's correlation analysis was used for the relationship between thumbprint ridge density and facial linear dimensions. Step-wise linear multiple regression analysis was used to predict facial distances from thumbprint ridge density. The result showed that in males the right ulnar ridge count correlates negatively with lower facial width (LFW, upper facial width/upper FH (UFW/UFH, lower FH/FH (LFH/FH, and positively with UFH and UFW/LFW. The right and left proximal ridge counts correlate with LFW and UFH, respectively. In males, the right ulnar ridge count predicts LFW, UFW/LFW, UFW/UFH, and LFH/FH. Special upper face height I, LFW, height of lower third of the face, UFW/LFW was predicted by right radial ridge counts. LFH, height of lower third of the face, and LFH/FH were predicted from left ulnar ridge count whereas left proximal ridge count predicted LFW. In females only, the special upper face height I was predicted by right ulnar ridge count. In conclusion, thumbprint ridge counts can be used to predict FH, width, ratios among Hausa population. The possible application of fingerprints in facial characterization for used in human biology, paleodemography, and forensic science was demonstrated.

  8. Modeling the morphodynamics of shoreface-connected sand ridges

    NARCIS (Netherlands)

    Vis-star, N.C.

    2008-01-01

    The focus of this thesis is on the morphodynamics of shoreface-connected sand ridges, which are large-scale bedforms observed on the inner shelf of coastal seas where storms occur frequently. The main aim was to explore which physical processes control the formation, long-term evolution and main

  9. Grain-Size Dynamics Beneath Mid-Ocean Ridges: Implications for Permeability and Melt Extraction

    Science.gov (United States)

    Turner, A. J.; Katz, R. F.; Behn, M. D.

    2014-12-01

    The permeability structure of the sub-ridge mantle plays an important role in how melt is focused and extracted at mid-ocean ridges. Permeability is controlled by porosity and the grain size of the solid mantle matrix, which is in turn controlled by the deformation conditions. To date, models of grain size evolution and mantle deformation have not been coupled to determine the influence of spatial variations in grain-size on the permeability structure at mid-ocean ridges. Rather, current models typically assume a constant grain size for the whole domain [1]. Here, we use 2-D numerical models to evaluate the influence of grain-size variability on the permeability structure beneath a mid-ocean ridge and use these results to speculate on the consequences for melt focusing and extraction. We construct a two-dimensional, single phase model for the steady-state grain size beneath a mid-ocean ridge. The model employs a composite rheology of diffusion creep, dislocation creep, dislocation accommodated grain boundary sliding, and a brittle stress limiter. Grain size is calculated using the "wattmeter" model of Austin and Evans [2]. We investigate the sensitivity of the model to global variations in grain growth exponent, potential temperature, spreading-rate, and grain boundary sliding parameters [3,4]. Our model predicts that permeability varies by two orders of magnitude due to the spatial variability of grain size within the expected melt region of a mid-ocean ridge. The predicted permeability structure suggests grain size may promote focusing of melt towards the ridge axis. Furthermore, the calculated grain size structure should focus melt from a greater depth than models that exclude grain-size variability. Future work will involve evaluating this hypothesis by implementing grain-size dynamics within a two-phase mid-ocean ridge model. The developments of such a model will be discussed. References: [1] R. F. Katz, Journal of Petrology, volume 49, issue 12, page 2099

  10. Posterior Mandibular Ridge Resorption Associated with Different Retentive Systems for Overdentures: A 7-Year Retrospective Preliminary Study.

    Science.gov (United States)

    Elsyad, Moustafa Abdou; Mohamed, Shahinaz Sayed; Shawky, Ahmad Fathalla

    This retrospective study compared posterior mandibular residual ridge resorption with two different retentive mechanisms for overdentures after 7 years. A convenience sample of 18 edentulous men was assigned to one of two equal groups. Two implants were placed in the mandibular canine areas for each patient using the conventional two-stage surgical protocol, and the implants were splinted with a round bar 3 months later. New mandibular overdentures were then connected to the bars with clips (clip-retained overdentures, CR group) or resilient liners (resilient liner-retained overdentures, RR group). Posterior mandibular ridge resorption (PMRR) was recorded using proportional measurements and posterior area index (PAI) on panoramic radiographs taken immediately after overdenture insertion (T 0 ) and 7 years later (T 7 ). A linear regression model was used to verify the relationship between PAI and the following considerations: attachment type, age, initial mandibular ridge height, period of mandibular edentulism, number of previously worn dentures, and relining events. After 7 years, the RR group demonstrated a significantly (P = .014) higher change in PAI (0.11 ± 0.02) than the CR group (0.06 ± 0.04). The average PMRR for each mm of posterior mandibular ridge was 0.79 mm (0.11 mm/year) in the CR group and 1.4 mm (0.2 mm/year) in the RR group. Attachment type, initial mandibular ridge height, and relining times were significantly correlated with change in the PAI (P = .004, P = .035, and P = .045, respectively). Within the limitations of this preliminary study's design, it was observed that following a 7-year period of use, resilient liner attachments for bar/implant-retained overdentures appear to be associated with greater posterior mandibular ridge resorption when compared to clip attachments.

  11. Structured Additive Regression Models: An R Interface to BayesX

    Directory of Open Access Journals (Sweden)

    Nikolaus Umlauf

    2015-02-01

    Full Text Available Structured additive regression (STAR models provide a flexible framework for model- ing possible nonlinear effects of covariates: They contain the well established frameworks of generalized linear models and generalized additive models as special cases but also allow a wider class of effects, e.g., for geographical or spatio-temporal data, allowing for specification of complex and realistic models. BayesX is standalone software package providing software for fitting general class of STAR models. Based on a comprehensive open-source regression toolbox written in C++, BayesX uses Bayesian inference for estimating STAR models based on Markov chain Monte Carlo simulation techniques, a mixed model representation of STAR models, or stepwise regression techniques combining penalized least squares estimation with model selection. BayesX not only covers models for responses from univariate exponential families, but also models from less-standard regression situations such as models for multi-categorical responses with either ordered or unordered categories, continuous time survival data, or continuous time multi-state models. This paper presents a new fully interactive R interface to BayesX: the R package R2BayesX. With the new package, STAR models can be conveniently specified using Rs formula language (with some extended terms, fitted using the BayesX binary, represented in R with objects of suitable classes, and finally printed/summarized/plotted. This makes BayesX much more accessible to users familiar with R and adds extensive graphics capabilities for visualizing fitted STAR models. Furthermore, R2BayesX complements the already impressive capabilities for semiparametric regression in R by a comprehensive toolbox comprising in particular more complex response types and alternative inferential procedures such as simulation-based Bayesian inference.

  12. Multiple regression and beyond an introduction to multiple regression and structural equation modeling

    CERN Document Server

    Keith, Timothy Z

    2014-01-01

    Multiple Regression and Beyond offers a conceptually oriented introduction to multiple regression (MR) analysis and structural equation modeling (SEM), along with analyses that flow naturally from those methods. By focusing on the concepts and purposes of MR and related methods, rather than the derivation and calculation of formulae, this book introduces material to students more clearly, and in a less threatening way. In addition to illuminating content necessary for coursework, the accessibility of this approach means students are more likely to be able to conduct research using MR or SEM--and more likely to use the methods wisely. Covers both MR and SEM, while explaining their relevance to one another Also includes path analysis, confirmatory factor analysis, and latent growth modeling Figures and tables throughout provide examples and illustrate key concepts and techniques For additional resources, please visit: http://tzkeith.com/.

  13. Time series regression model for infectious disease and weather.

    Science.gov (United States)

    Imai, Chisato; Armstrong, Ben; Chalabi, Zaid; Mangtani, Punam; Hashizume, Masahiro

    2015-10-01

    Time series regression has been developed and long used to evaluate the short-term associations of air pollution and weather with mortality or morbidity of non-infectious diseases. The application of the regression approaches from this tradition to infectious diseases, however, is less well explored and raises some new issues. We discuss and present potential solutions for five issues often arising in such analyses: changes in immune population, strong autocorrelations, a wide range of plausible lag structures and association patterns, seasonality adjustments, and large overdispersion. The potential approaches are illustrated with datasets of cholera cases and rainfall from Bangladesh and influenza and temperature in Tokyo. Though this article focuses on the application of the traditional time series regression to infectious diseases and weather factors, we also briefly introduce alternative approaches, including mathematical modeling, wavelet analysis, and autoregressive integrated moving average (ARIMA) models. Modifications proposed to standard time series regression practice include using sums of past cases as proxies for the immune population, and using the logarithm of lagged disease counts to control autocorrelation due to true contagion, both of which are motivated from "susceptible-infectious-recovered" (SIR) models. The complexity of lag structures and association patterns can often be informed by biological mechanisms and explored by using distributed lag non-linear models. For overdispersed models, alternative distribution models such as quasi-Poisson and negative binomial should be considered. Time series regression can be used to investigate dependence of infectious diseases on weather, but may need modifying to allow for features specific to this context. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  14. Linear regression crash prediction models : issues and proposed solutions.

    Science.gov (United States)

    2010-05-01

    The paper develops a linear regression model approach that can be applied to : crash data to predict vehicle crashes. The proposed approach involves novice data aggregation : to satisfy linear regression assumptions; namely error structure normality ...

  15. Identification of Influential Points in a Linear Regression Model

    Directory of Open Access Journals (Sweden)

    Jan Grosz

    2011-03-01

    Full Text Available The article deals with the detection and identification of influential points in the linear regression model. Three methods of detection of outliers and leverage points are described. These procedures can also be used for one-sample (independentdatasets. This paper briefly describes theoretical aspects of several robust methods as well. Robust statistics is a powerful tool to increase the reliability and accuracy of statistical modelling and data analysis. A simulation model of the simple linear regression is presented.

  16. Testing Predictions of a Landscape Evolution Model Using the Dragon’s Back Pressure Ridge as a Natural Experiment

    Science.gov (United States)

    Perignon, M. C.; Tucker, G. E.; Hilley, G. E.; Arrowsmith, R.

    2009-12-01

    Landscape evolution models use mass transport rules to simulate the temporal development of topography over timescales too long for humans to observe. As such, these models are difficult to test using the decadal time-scale observations of topographic change that can be directly measured. In contrast, natural systems in which driving forces, boundary conditions, and timing of landscape evolution over millennial time-scales can be well constrained may be used to test the ability of mathematical models to reproduce various attributes of the observed topography. The Dragon’s Back pressure ridge, a 4km x 0.5 km x 100 m high area of elevated topography elongate parallel to the south-central San Andreas fault (SAF) in California, serves as a natural laboratory for studying how the timing and spatial distribution of uplift affects patterns of erosion and topography. Geologic mapping and geophysical studies show that, at this location, the Pacific plate is forced over a relatively stationary shallow discontinuity in the SAF, resulting in local uplift. Continued right-lateral motion along the fault results in the movement of material though the uplift zone at the SAF slip rate of 35 mm/yr. This allows for the substitution of space for time when observing topographic change, and can be used to constrain the tectonic conditions to which the surface processes responded and developed the resulting landscape. We used the CHILD model of landscape evolution to recreate the Dragon’s Back pressure ridge system in order to test the reliability of the model predictions and determine the necessary and sufficient conditions to explain the observed topography. To do this, we first ran a Monte Carlo simulation in which we varied the model inputs within a range of plausible values. We then compared the model results with LiDAR topography from the Dragon’s Back pressure ridge to determine which combinations of input parameters best reproduced the observed topography and how well it

  17. Unfaulting the Sardarapat Ridge, Southwest Armenia

    Science.gov (United States)

    Wetmore, P.; Connor, C.; Connor, L. J.; Savov, I. P.; Karakhanyan, A.

    2012-12-01

    Armenia is located near the core of contractional deformation associated with the collision between the Arabian and Eurasian tectonic plates. Several studies of this region, including portions of adjacent Georgia, Iran, and Turkey, have indicated that 1-2 mm/yr of intra-plate, north-south shortening is primarily accommodated by a network of E-W trending thrust faults, and NW-trending (dextral) and NE-trending (sinistral) strike-slip faults. One proposed fault in this network, the Sardarapat Fault (SF), was investigated as part of a regional seismic hazard assessment ahead of the installation of a replacement reactor at the Armenian Nuclear Power Plant (ANPP). The SF is primarily defined by the Sardarapat Ridge (SR), which is a WNW-trending, 40-70 m high topographic feature located just north of the Arax River and the Turkey-Armenia border. The stratigraphy comprising this ridge includes alluvium overlying several meters of lacustrine deposits above a crystal-rich basaltic lava flow that yields an Ar-Ar age of 0.9 +/- 0.02 Ma. The alluvial sediments on the ridge contain early Bronze age (3832-3470 BP) artifacts at an elevation 25 m above those of the surrounding alluvial plane. This has lead to the suggestion that the SR is bound to the south (the steepest side) by the SF, which is uplifting the ridge at a rate of 0.7 mm/yr. However, despite the prominence and trend of the ridge there are no unequivocal observations, such as scarps or exposures of fault rocks, to support the existence of the SF. The goal of the investigation of the SR area was to test various models for the formation of the ridge including faulting and combined volcanic and erosional processes. We therefore collected gravimetric, magnetic, magneto-tellurics (MT), and transient electromagnetic (TEM) data across an area of ~400 km2, and used correlations of stratigraphic data from coreholes drilled proximal to the study area to define the geometry of the contact between the basement and basin fill to

  18. The art of regression modeling in road safety

    CERN Document Server

    Hauer, Ezra

    2015-01-01

    This unique book explains how to fashion useful regression models from commonly available data to erect models essential for evidence-based road safety management and research. Composed from techniques and best practices presented over many years of lectures and workshops, The Art of Regression Modeling in Road Safety illustrates that fruitful modeling cannot be done without substantive knowledge about the modeled phenomenon. Class-tested in courses and workshops across North America, the book is ideal for professionals, researchers, university professors, and graduate students with an interest in, or responsibilities related to, road safety. This book also: · Presents for the first time a powerful analytical tool for road safety researchers and practitioners · Includes problems and solutions in each chapter as well as data and spreadsheets for running models and PowerPoint presentation slides · Features pedagogy well-suited for graduate courses and workshops including problems, solutions, and PowerPoint p...

  19. Support Vector Regression Model Based on Empirical Mode Decomposition and Auto Regression for Electric Load Forecasting

    Directory of Open Access Journals (Sweden)

    Hong-Juan Li

    2013-04-01

    Full Text Available Electric load forecasting is an important issue for a power utility, associated with the management of daily operations such as energy transfer scheduling, unit commitment, and load dispatch. Inspired by strong non-linear learning capability of support vector regression (SVR, this paper presents a SVR model hybridized with the empirical mode decomposition (EMD method and auto regression (AR for electric load forecasting. The electric load data of the New South Wales (Australia market are employed for comparing the forecasting performances of different forecasting models. The results confirm the validity of the idea that the proposed model can simultaneously provide forecasting with good accuracy and interpretability.

  20. Robust geographically weighted regression of modeling the Air Polluter Standard Index (APSI)

    Science.gov (United States)

    Warsito, Budi; Yasin, Hasbi; Ispriyanti, Dwi; Hoyyi, Abdul

    2018-05-01

    The Geographically Weighted Regression (GWR) model has been widely applied to many practical fields for exploring spatial heterogenity of a regression model. However, this method is inherently not robust to outliers. Outliers commonly exist in data sets and may lead to a distorted estimate of the underlying regression model. One of solution to handle the outliers in the regression model is to use the robust models. So this model was called Robust Geographically Weighted Regression (RGWR). This research aims to aid the government in the policy making process related to air pollution mitigation by developing a standard index model for air polluter (Air Polluter Standard Index - APSI) based on the RGWR approach. In this research, we also consider seven variables that are directly related to the air pollution level, which are the traffic velocity, the population density, the business center aspect, the air humidity, the wind velocity, the air temperature, and the area size of the urban forest. The best model is determined by the smallest AIC value. There are significance differences between Regression and RGWR in this case, but Basic GWR using the Gaussian kernel is the best model to modeling APSI because it has smallest AIC.

  1. Flexible competing risks regression modeling and goodness-of-fit

    DEFF Research Database (Denmark)

    Scheike, Thomas; Zhang, Mei-Jie

    2008-01-01

    In this paper we consider different approaches for estimation and assessment of covariate effects for the cumulative incidence curve in the competing risks model. The classic approach is to model all cause-specific hazards and then estimate the cumulative incidence curve based on these cause...... models that is easy to fit and contains the Fine-Gray model as a special case. One advantage of this approach is that our regression modeling allows for non-proportional hazards. This leads to a new simple goodness-of-fit procedure for the proportional subdistribution hazards assumption that is very easy...... of the flexible regression models to analyze competing risks data when non-proportionality is present in the data....

  2. Topography and vegetation as predictors of snow water equivalent across the alpine treeline ecotone at Lee Ridge, Glacier National Park, Montana, U.S.A.

    Science.gov (United States)

    Geddes, C.A.; Brown, D.G.; Fagre, D.B.

    2005-01-01

    We derived and implemented two spatial models of May snow water equivalent (SWE) at Lee Ridge in Glacier National Park, Montana. We used the models to test the hypothesis that vegetation structure is a control on snow redistribution at the alpine treeline ecotone (ATE). The statistical models were derived using stepwise and "best" subsets regression techniques. The first model was derived from field measurements of SWE, topography, and vegetation taken at 27 sample points. The second model was derived using GIS-based measures of topography and vegetation. Both the field- (R² = 0.93) and GIS-based models (R² = 0.69) of May SWE included the following variables: site type (based on vegetation), elevation, maximum slope, and general slope aspect. Site type was identified as the most important predictor of SWE in both models, accounting for 74.0% and 29.5% of the variation, respectively. The GIS-based model was applied to create a predictive map of SWE across Lee Ridge, predicting little snow accumulation on the top of the ridge where vegetation is scarce. The GIS model failed in large depressions, including ephemeral stream channels. The models supported the hypothesis that upright vegetation has a positive effect on accumulation of SWE above and beyond the effects of topography. Vegetation, therefore, creates a positive feedback in which it modifies its, environment and could affect the ability of additional vegetation to become established.

  3. Ridge Orientations of the Ridge-Forming Unit, Sinus Meridiani, Mars-A Fluvial Explanation

    Science.gov (United States)

    Wilkinson, M. Justin; Herridge, A.

    2013-01-01

    Imagery and MOLA data were used in an analysis of the ridge-forming rock unit (RFU) exposed in Sinus Meridiani (SM). This unit shows parallels at different scales with fluvial sedimentary bodies. We propose the terrestrial megafan as the prime analog for the RFU, and likely for other members of the layered units. Megafans are partial cones of fluvial sediment, with radii up to hundreds of km. Although recent reviews of hypotheses for the RFU units exclude fluvial hypotheses [1], inverted ridges in the deserts of Oman have been suggested as putative analogs for some ridges [2], apparently without appreciating The wider context in which these ridges have formed is a series of megafans [3], a relatively unappreciated geomorphic feature. It has been argued that these units conform to the megafan model at the regional, subregional and local scales [4]. At the regional scale suites of terrestrial megafans are known to cover large areas at the foot of uplands on all continents - a close parallel with the setting of the Meridiani sediments at the foot of the southern uplands of Mars, with its incised fluvial systems leading down the regional NW slope [2, 3] towards the sedimentary units. At the subregional scale the layering and internal discontinuities of the Meridiani rocks are consistent, inter alia, with stacked fluvial units [4]. Although poorly recognized as such, the prime geomorphic environment in which stream channel networks cover large areas, without intervening hillslopes, is the megafan [see e.g. 4]. Single megafans can reach 200,000 km2 [5]. Megafans thus supply an analog for areas where channel-like ridges (as a palimpsest of a prior landscape) cover the intercrater plains of Meridiani [6]. At the local, or river-reach scale, the numerous sinuous features of the RFU are suggestive of fluvial channels. Cross-cutting relationships, a common feature of channels on terrestrial megafans, are ubiquitous. Desert megafans show cemented paleo-channels as inverted

  4. Bioinspired design of a ridging shovel with anti-adhesive and drag reducing

    Directory of Open Access Journals (Sweden)

    Zhijun Zhang

    2015-03-01

    Full Text Available Learning from the microstructure of the convex (concave and ridging (triangle and arc-shaped shapes of fresh lotus leaves and shark skin, bionic ridging shovels was designed with the characteristics of adhesion and resistance reduction. Ten ridging shovel models were established, and the interaction process with the soil by ANSYS is discussed. Stress analysis results showed that the bionic ridging shovel was more obvious in visbreaking and in the resistance reduction effect. An indoor soil bin experiment with the bionic ridging shovel and the prototype ridging shovel was operated as follows: the ridging resistance of the three types of ridging shovel was tested under the condition of two soil moistures (18.61% and 20.9% and three different ridging speeds (0.68, 0.87, and 1.11 m/s. In this article, the structure, the mechanism, and their relationship to the functions are discussed. The results of this study will be useful in practical application in the field of agricultural machinery toward practical use and industrialization.

  5. Hydrodynamic role of longitudinal ridges in a leatherback turtle swimming

    Science.gov (United States)

    Bang, Kyeongtae; Kim, Jooha; Lee, Sang-Im; Choi, Haecheon

    2015-11-01

    The leatherback sea turtle (Dermochelys coriacea), the fastest swimmer and the deepest diver among marine turtles, has five longitudinal ridges on its carapace. These ridges are the most remarkable morphological features distinguished from other marine turtles. To investigate the hydrodynamic role of these ridges in the leatherback turtle swimming, we model a carapace with and without ridges by using three dimensional surface data of a stuffed leatherback turtle in the National Science Museum, Korea. The experiment is conducted in a wind tunnel in the ranges of the real leatherback turtle's Reynolds number (Re) and angle of attack (α). The longitudinal ridges function differently according to the flow condition (i.e. Re and α). At low Re and negative α that represent the swimming condition of hatchlings and juveniles, the ridges significantly decrease the drag by generating streamwise vortices and delaying the main separation. On the other hand, at high Re and positive α that represent the swimming condition of adults, the ridges suppress the laminar separation bubble near the front part by generating streamwise vortices and enhance the lift and lift-to-drag ratio. Supported by the NRF program (2011-0028032).

  6. Maximum Entropy Discrimination Poisson Regression for Software Reliability Modeling.

    Science.gov (United States)

    Chatzis, Sotirios P; Andreou, Andreas S

    2015-11-01

    Reliably predicting software defects is one of the most significant tasks in software engineering. Two of the major components of modern software reliability modeling approaches are: 1) extraction of salient features for software system representation, based on appropriately designed software metrics and 2) development of intricate regression models for count data, to allow effective software reliability data modeling and prediction. Surprisingly, research in the latter frontier of count data regression modeling has been rather limited. More specifically, a lack of simple and efficient algorithms for posterior computation has made the Bayesian approaches appear unattractive, and thus underdeveloped in the context of software reliability modeling. In this paper, we try to address these issues by introducing a novel Bayesian regression model for count data, based on the concept of max-margin data modeling, effected in the context of a fully Bayesian model treatment with simple and efficient posterior distribution updates. Our novel approach yields a more discriminative learning technique, making more effective use of our training data during model inference. In addition, it allows of better handling uncertainty in the modeled data, which can be a significant problem when the training data are limited. We derive elegant inference algorithms for our model under the mean-field paradigm and exhibit its effectiveness using the publicly available benchmark data sets.

  7. Modelling fourier regression for time series data- a case study: modelling inflation in foods sector in Indonesia

    Science.gov (United States)

    Prahutama, Alan; Suparti; Wahyu Utami, Tiani

    2018-03-01

    Regression analysis is an analysis to model the relationship between response variables and predictor variables. The parametric approach to the regression model is very strict with the assumption, but nonparametric regression model isn’t need assumption of model. Time series data is the data of a variable that is observed based on a certain time, so if the time series data wanted to be modeled by regression, then we should determined the response and predictor variables first. Determination of the response variable in time series is variable in t-th (yt), while the predictor variable is a significant lag. In nonparametric regression modeling, one developing approach is to use the Fourier series approach. One of the advantages of nonparametric regression approach using Fourier series is able to overcome data having trigonometric distribution. In modeling using Fourier series needs parameter of K. To determine the number of K can be used Generalized Cross Validation method. In inflation modeling for the transportation sector, communication and financial services using Fourier series yields an optimal K of 120 parameters with R-square 99%. Whereas if it was modeled by multiple linear regression yield R-square 90%.

  8. Bayesian Inference of a Multivariate Regression Model

    Directory of Open Access Journals (Sweden)

    Marick S. Sinay

    2014-01-01

    Full Text Available We explore Bayesian inference of a multivariate linear regression model with use of a flexible prior for the covariance structure. The commonly adopted Bayesian setup involves the conjugate prior, multivariate normal distribution for the regression coefficients and inverse Wishart specification for the covariance matrix. Here we depart from this approach and propose a novel Bayesian estimator for the covariance. A multivariate normal prior for the unique elements of the matrix logarithm of the covariance matrix is considered. Such structure allows for a richer class of prior distributions for the covariance, with respect to strength of beliefs in prior location hyperparameters, as well as the added ability, to model potential correlation amongst the covariance structure. The posterior moments of all relevant parameters of interest are calculated based upon numerical results via a Markov chain Monte Carlo procedure. The Metropolis-Hastings-within-Gibbs algorithm is invoked to account for the construction of a proposal density that closely matches the shape of the target posterior distribution. As an application of the proposed technique, we investigate a multiple regression based upon the 1980 High School and Beyond Survey.

  9. General regression and representation model for classification.

    Directory of Open Access Journals (Sweden)

    Jianjun Qian

    Full Text Available Recently, the regularized coding-based classification methods (e.g. SRC and CRC show a great potential for pattern classification. However, most existing coding methods assume that the representation residuals are uncorrelated. In real-world applications, this assumption does not hold. In this paper, we take account of the correlations of the representation residuals and develop a general regression and representation model (GRR for classification. GRR not only has advantages of CRC, but also takes full use of the prior information (e.g. the correlations between representation residuals and representation coefficients and the specific information (weight matrix of image pixels to enhance the classification performance. GRR uses the generalized Tikhonov regularization and K Nearest Neighbors to learn the prior information from the training data. Meanwhile, the specific information is obtained by using an iterative algorithm to update the feature (or image pixel weights of the test sample. With the proposed model as a platform, we design two classifiers: basic general regression and representation classifier (B-GRR and robust general regression and representation classifier (R-GRR. The experimental results demonstrate the performance advantages of proposed methods over state-of-the-art algorithms.

  10. A test for the parameters of multiple linear regression models ...

    African Journals Online (AJOL)

    A test for the parameters of multiple linear regression models is developed for conducting tests simultaneously on all the parameters of multiple linear regression models. The test is robust relative to the assumptions of homogeneity of variances and absence of serial correlation of the classical F-test. Under certain null and ...

  11. Exploring Microbial Processes with Thermal-Hydrological Models of the Eastern Flank of the Juan de Fuca Ridge

    Science.gov (United States)

    Weathers, T. S.; Fisher, A. T.; Winslow, D. M.; Stauffer, P. H.; Gable, C. W.

    2017-12-01

    The flanks of mid-ocean ridges experience coupled flows of fluid, heat, and solutes that are critical for a wide range of global processes, including the cycling of carbon and nutrients, which supports a vast crustal biosphere. Only a few ridge-flank sites have been studied in detail; hydrogeologic conditions and processes in the volcanic crust are best understood on the eastern flank of the Juan de Fuca Ridge. This area has been extensively explored with decades of drilling, submersible, observatory, and survey expeditions and experiments, including the first hole-to-hole tracer injection experiment in the ocean crust. This study describes the development of reactive transport simulations for this ridge-flank setting using three-dimensional coupled (thermal-hydrological) models of crustal-scale circulation, beginning with the exploration of tracer transport. The prevailing flow direction is roughly south to north as a result of outcrop-to-outcrop flow, with a bulk flow rate in the range of meters/year. However, tracer was detected 500 m south ("upstream") from the injection borehole during the first year following injection. This may be explained by local mixing and/or formation fluid discharge from the southern borehole during and after injection. The constraints and parameters required to fit the observed tracer behavior can be used as a basis for modeling reactive transport processes such as nutrient delivery or microbial community evolution as a function of fluid flow. For example, the sulfate concentration in fluid samples from Baby Bare outcrop ( 8 km south of the tracer transport experiment) was 17.8 mmol/kg, whereas at Mama Bare outcrop ( 8 km to north of the tracer transport experiment) the sulfate concentration was 16.3 mmol/mg. By integrating laboratory-derived sulfate reduction rates from microbial samples originating from Juan de Fuca borehole observatories into reactive transport models, we can explore the range of microbial activity that supports

  12. Predicting recycling behaviour: Comparison of a linear regression model and a fuzzy logic model.

    Science.gov (United States)

    Vesely, Stepan; Klöckner, Christian A; Dohnal, Mirko

    2016-03-01

    In this paper we demonstrate that fuzzy logic can provide a better tool for predicting recycling behaviour than the customarily used linear regression. To show this, we take a set of empirical data on recycling behaviour (N=664), which we randomly divide into two halves. The first half is used to estimate a linear regression model of recycling behaviour, and to develop a fuzzy logic model of recycling behaviour. As the first comparison, the fit of both models to the data included in estimation of the models (N=332) is evaluated. As the second comparison, predictive accuracy of both models for "new" cases (hold-out data not included in building the models, N=332) is assessed. In both cases, the fuzzy logic model significantly outperforms the regression model in terms of fit. To conclude, when accurate predictions of recycling and possibly other environmental behaviours are needed, fuzzy logic modelling seems to be a promising technique. Copyright © 2015 Elsevier Ltd. All rights reserved.

  13. A History of Regression and Related Model-Fitting in the Earth Sciences (1636?-2000)

    International Nuclear Information System (INIS)

    Howarth, Richard J.

    2001-01-01

    roots in meeting the evident need for improved estimators in spatial interpolation. Technical advances in regression analysis during the 1970s embraced the development of regression diagnostics and consequent attention to outliers; the recognition of problems caused by correlated predictors, and the subsequent introduction of ridge regression to overcome them; and techniques for fitting errors-in-variables and mixture models. Improvements in computational power have enabled ever more computer-intensive methods to be applied. These include algorithms which are robust in the presence of outliers, for example Rousseeuw's 1984 Least Median Squares; nonparametric smoothing methods, such as kernel-functions, splines and Cleveland's 1979 LOcally WEighted Scatterplot Smoother (LOWESS); and the Classification and Regression Tree (CART) technique of Breiman and others in 1984. Despite a continuing improvement in the rate of technology-transfer from the statistical to the earth-science community, despite an abrupt drop to a time-lag of about 10 years following the introduction of digital computers, these more recent developments are only just beginning to penetrate beyond the research community of earth scientists. Examples of applications to problem-solving in the earth sciences are given

  14. Logistic regression models

    CERN Document Server

    Hilbe, Joseph M

    2009-01-01

    This book really does cover everything you ever wanted to know about logistic regression … with updates available on the author's website. Hilbe, a former national athletics champion, philosopher, and expert in astronomy, is a master at explaining statistical concepts and methods. Readers familiar with his other expository work will know what to expect-great clarity.The book provides considerable detail about all facets of logistic regression. No step of an argument is omitted so that the book will meet the needs of the reader who likes to see everything spelt out, while a person familiar with some of the topics has the option to skip "obvious" sections. The material has been thoroughly road-tested through classroom and web-based teaching. … The focus is on helping the reader to learn and understand logistic regression. The audience is not just students meeting the topic for the first time, but also experienced users. I believe the book really does meet the author's goal … .-Annette J. Dobson, Biometric...

  15. Linking Simple Economic Theory Models and the Cointegrated Vector AutoRegressive Model

    DEFF Research Database (Denmark)

    Møller, Niels Framroze

    This paper attempts to clarify the connection between simple economic theory models and the approach of the Cointegrated Vector-Auto-Regressive model (CVAR). By considering (stylized) examples of simple static equilibrium models, it is illustrated in detail, how the theoretical model and its stru....... Further fundamental extensions and advances to more sophisticated theory models, such as those related to dynamics and expectations (in the structural relations) are left for future papers......This paper attempts to clarify the connection between simple economic theory models and the approach of the Cointegrated Vector-Auto-Regressive model (CVAR). By considering (stylized) examples of simple static equilibrium models, it is illustrated in detail, how the theoretical model and its......, it is demonstrated how other controversial hypotheses such as Rational Expectations can be formulated directly as restrictions on the CVAR-parameters. A simple example of a "Neoclassical synthetic" AS-AD model is also formulated. Finally, the partial- general equilibrium distinction is related to the CVAR as well...

  16. Multiple Response Regression for Gaussian Mixture Models with Known Labels.

    Science.gov (United States)

    Lee, Wonyul; Du, Ying; Sun, Wei; Hayes, D Neil; Liu, Yufeng

    2012-12-01

    Multiple response regression is a useful regression technique to model multiple response variables using the same set of predictor variables. Most existing methods for multiple response regression are designed for modeling homogeneous data. In many applications, however, one may have heterogeneous data where the samples are divided into multiple groups. Our motivating example is a cancer dataset where the samples belong to multiple cancer subtypes. In this paper, we consider modeling the data coming from a mixture of several Gaussian distributions with known group labels. A naive approach is to split the data into several groups according to the labels and model each group separately. Although it is simple, this approach ignores potential common structures across different groups. We propose new penalized methods to model all groups jointly in which the common and unique structures can be identified. The proposed methods estimate the regression coefficient matrix, as well as the conditional inverse covariance matrix of response variables. Asymptotic properties of the proposed methods are explored. Through numerical examples, we demonstrate that both estimation and prediction can be improved by modeling all groups jointly using the proposed methods. An application to a glioblastoma cancer dataset reveals some interesting common and unique gene relationships across different cancer subtypes.

  17. Extending the linear model with R generalized linear, mixed effects and nonparametric regression models

    CERN Document Server

    Faraway, Julian J

    2005-01-01

    Linear models are central to the practice of statistics and form the foundation of a vast range of statistical methodologies. Julian J. Faraway''s critically acclaimed Linear Models with R examined regression and analysis of variance, demonstrated the different methods available, and showed in which situations each one applies. Following in those footsteps, Extending the Linear Model with R surveys the techniques that grow from the regression model, presenting three extensions to that framework: generalized linear models (GLMs), mixed effect models, and nonparametric regression models. The author''s treatment is thoroughly modern and covers topics that include GLM diagnostics, generalized linear mixed models, trees, and even the use of neural networks in statistics. To demonstrate the interplay of theory and practice, throughout the book the author weaves the use of the R software environment to analyze the data of real examples, providing all of the R commands necessary to reproduce the analyses. All of the ...

  18. Bayesian approach to errors-in-variables in regression models

    Science.gov (United States)

    Rozliman, Nur Aainaa; Ibrahim, Adriana Irawati Nur; Yunus, Rossita Mohammad

    2017-05-01

    In many applications and experiments, data sets are often contaminated with error or mismeasured covariates. When at least one of the covariates in a model is measured with error, Errors-in-Variables (EIV) model can be used. Measurement error, when not corrected, would cause misleading statistical inferences and analysis. Therefore, our goal is to examine the relationship of the outcome variable and the unobserved exposure variable given the observed mismeasured surrogate by applying the Bayesian formulation to the EIV model. We shall extend the flexible parametric method proposed by Hossain and Gustafson (2009) to another nonlinear regression model which is the Poisson regression model. We shall then illustrate the application of this approach via a simulation study using Markov chain Monte Carlo sampling methods.

  19. Modeling Fire Occurrence at the City Scale: A Comparison between Geographically Weighted Regression and Global Linear Regression.

    Science.gov (United States)

    Song, Chao; Kwan, Mei-Po; Zhu, Jiping

    2017-04-08

    An increasing number of fires are occurring with the rapid development of cities, resulting in increased risk for human beings and the environment. This study compares geographically weighted regression-based models, including geographically weighted regression (GWR) and geographically and temporally weighted regression (GTWR), which integrates spatial and temporal effects and global linear regression models (LM) for modeling fire risk at the city scale. The results show that the road density and the spatial distribution of enterprises have the strongest influences on fire risk, which implies that we should focus on areas where roads and enterprises are densely clustered. In addition, locations with a large number of enterprises have fewer fire ignition records, probably because of strict management and prevention measures. A changing number of significant variables across space indicate that heterogeneity mainly exists in the northern and eastern rural and suburban areas of Hefei city, where human-related facilities or road construction are only clustered in the city sub-centers. GTWR can capture small changes in the spatiotemporal heterogeneity of the variables while GWR and LM cannot. An approach that integrates space and time enables us to better understand the dynamic changes in fire risk. Thus governments can use the results to manage fire safety at the city scale.

  20. Direction of Effects in Multiple Linear Regression Models.

    Science.gov (United States)

    Wiedermann, Wolfgang; von Eye, Alexander

    2015-01-01

    Previous studies analyzed asymmetric properties of the Pearson correlation coefficient using higher than second order moments. These asymmetric properties can be used to determine the direction of dependence in a linear regression setting (i.e., establish which of two variables is more likely to be on the outcome side) within the framework of cross-sectional observational data. Extant approaches are restricted to the bivariate regression case. The present contribution extends the direction of dependence methodology to a multiple linear regression setting by analyzing distributional properties of residuals of competing multiple regression models. It is shown that, under certain conditions, the third central moments of estimated regression residuals can be used to decide upon direction of effects. In addition, three different approaches for statistical inference are discussed: a combined D'Agostino normality test, a skewness difference test, and a bootstrap difference test. Type I error and power of the procedures are assessed using Monte Carlo simulations, and an empirical example is provided for illustrative purposes. In the discussion, issues concerning the quality of psychological data, possible extensions of the proposed methods to the fourth central moment of regression residuals, and potential applications are addressed.

  1. Waste management systems model for energy systems sites on the Oak Ridge Reservation

    International Nuclear Information System (INIS)

    Rodgers, B.R.; Nehls, J.W. Jr.; Rivera, A.L.; Pechin, W.H.; Genung, R.K.

    1986-01-01

    There is a model on the Oak Ridge Reservation which provides requirements for determining capacities and capabilities related to low-level, hazardous, and mixed wastes. In FY 1987, the model will be sufficiently advanced to provide various waste management scenarios. These scenarios will be compared technically, operationally, and financially by use of waste characterization data and process simulators that are currently under development. The results of the process simulations will be used to help identify waste treatment, storage, and disposal technologies that need to be demonstrated prior to full-scale development for DOE use. The information derived from this effort will be made available to all DOE facilities

  2. The Effects of Ridge Axis Width on Mantle Melting at Mid-Ocean Ridges

    Science.gov (United States)

    Montesi, L.; Magni, V.; Gaina, C.

    2017-12-01

    Mantle upwelling in response to plate divergence produces melt at mid-ocean ridges. Melt starts when the solidus is crossed and stops when conductive cooling overcomes heat advection associated with the upwelling. Most mid-ocean ridge models assume that divergence takes place only in a narrow zone that defines the ridge axis, resulting in a single upwelling. However, more complex patterns of divergence are occasionally observed. The rift axis can be 20 km wide at ultraslow spreading center. Overlapping spreading center contain two parallel axes. Rifting in backarc basins is sometimes organized as a series of parallel spreading centers. Distributing plate divergence over several rifts reduces the intensity of upwelling and limits melting. Can this have a significant effect on the expected crustal thickness and on the mode of melt delivery at the seafloor? We address this question by modeling mantle flow and melting underneath two spreading centers separated by a rigid block. We adopt a non-linear rheology that includes dislocation creep, diffusion creep and yielding and include hydrothermal cooling by enhancing thermal conductivity where yielding takes place. The crustal thickness decreases if the rifts are separated by 30 km or more but only if the half spreading rate is between 1 and 2 cm/yr. At melting depth, a single upwelling remains the norm until the separation of the rifts exceeds a critical value ranging from 15 km in the fastest ridges to more than 50 km at ultraslow spreading centers. The stability of the central upwelling is due to hydrothermal cooling, which prevents hot mantle from reaching the surface at each spreading center. When hydrothermal cooling is suppressed, or the spreading centers are sufficiently separated, the rigid block becomes extremely cold and separates two distinct, highly asymmetric upwellings that may focus melt beyond the spreading center. In that case, melt delivery might drive further and further the divergence centers, whereas

  3. Large fault fabric of the Ninetyeast Ridge implies near-spreading ridge formation

    Digital Repository Service at National Institute of Oceanography (India)

    Sager, W.W.; Paul, C.F.; Krishna, K.S.; Pringle, M.S.; Eisin, A.E.; Frey, F.A.; Rao, D.G.; Levchenko, O.V.

    of the high ridge. At 26°S, prominent NE-SW 97 oriented lineations extend southwest from the ridge. Some appear to connect with N-S fracture 98 zone troughs east of NER, implying that the NE-SW features are fracture zone scars formed after 99 the change... to the 105 ridge (Fig. 3). This is especially true for NER south of ~4°S. Where KNOX06RR crossed a 106 gravity lineation, negative gradient features correspond to troughs whereas positive gradient 107 features result from igneous basement highs (Fig. 3...

  4. A Monte Carlo simulation study comparing linear regression, beta regression, variable-dispersion beta regression and fractional logit regression at recovering average difference measures in a two sample design.

    Science.gov (United States)

    Meaney, Christopher; Moineddin, Rahim

    2014-01-24

    In biomedical research, response variables are often encountered which have bounded support on the open unit interval--(0,1). Traditionally, researchers have attempted to estimate covariate effects on these types of response data using linear regression. Alternative modelling strategies may include: beta regression, variable-dispersion beta regression, and fractional logit regression models. This study employs a Monte Carlo simulation design to compare the statistical properties of the linear regression model to that of the more novel beta regression, variable-dispersion beta regression, and fractional logit regression models. In the Monte Carlo experiment we assume a simple two sample design. We assume observations are realizations of independent draws from their respective probability models. The randomly simulated draws from the various probability models are chosen to emulate average proportion/percentage/rate differences of pre-specified magnitudes. Following simulation of the experimental data we estimate average proportion/percentage/rate differences. We compare the estimators in terms of bias, variance, type-1 error and power. Estimates of Monte Carlo error associated with these quantities are provided. If response data are beta distributed with constant dispersion parameters across the two samples, then all models are unbiased and have reasonable type-1 error rates and power profiles. If the response data in the two samples have different dispersion parameters, then the simple beta regression model is biased. When the sample size is small (N0 = N1 = 25) linear regression has superior type-1 error rates compared to the other models. Small sample type-1 error rates can be improved in beta regression models using bias correction/reduction methods. In the power experiments, variable-dispersion beta regression and fractional logit regression models have slightly elevated power compared to linear regression models. Similar results were observed if the

  5. Tutorial on Using Regression Models with Count Outcomes Using R

    Directory of Open Access Journals (Sweden)

    A. Alexander Beaujean

    2016-02-01

    Full Text Available Education researchers often study count variables, such as times a student reached a goal, discipline referrals, and absences. Most researchers that study these variables use typical regression methods (i.e., ordinary least-squares either with or without transforming the count variables. In either case, using typical regression for count data can produce parameter estimates that are biased, thus diminishing any inferences made from such data. As count-variable regression models are seldom taught in training programs, we present a tutorial to help educational researchers use such methods in their own research. We demonstrate analyzing and interpreting count data using Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial regression models. The count regression methods are introduced through an example using the number of times students skipped class. The data for this example are freely available and the R syntax used run the example analyses are included in the Appendix.

  6. Remedial investigation report on Waste Area Group 5 at Oak Ridge National Laboratory, Oak Ridge, Tennessee. Volume 1: Technical summary

    International Nuclear Information System (INIS)

    1995-03-01

    A remedial investigation (RI) was performed to support environmental restoration activities for Waste Area Grouping (WAG) 5 at the Oak Ridge National Laboratory (ORNL) in Oak Ridge, Tennessee. The WAG 5 RI made use of the observational approach, which concentrates on collecting only information needed to assess site risks and support future cleanup work. This information was interpreted and is presented using the framework of the site conceptual model, which relates contaminant sources and release mechanisms to migration pathways and exposure points that are keyed to current and future environmental risks for both human and ecological receptors. The site conceptual model forms the basis of the WAG 5 remedial action strategy and remedial action objectives. The RI provided the data necessary to verify this model and allows recommendations to be made to accomplish those objectives

  7. Remedial investigation report on Waste Area Grouping 5 at Oak Ridge National Laboratory, Oak Ridge, Tennessee. Volume 1: Technical summary

    International Nuclear Information System (INIS)

    1995-09-01

    A remedial investigation (RI) was performed to support environmental restoration activities for Waste Area Grouping (WAG) 5 at the Oak Ridge National Laboratory (ORNL) in Oak Ridge, Tennessee. The WAG 5 RI made use of the observational approach, which concentrates on collecting only information needed to assess site risks and support future cleanup work. This information was interpreted and is presented using the framework of the site conceptual model, which relates contaminant sources and release mechanisms to migration pathways and exposure points that are keyed to current and future environmental risks for both human and ecological receptors. The site conceptual model forms the basis of the WAG 5 remedial action strategy and remedial action objectives. The RI provided the data necessary to verify this model and allows recommendations to be made to accomplish those objectives.

  8. Statistical approach for selection of regression model during validation of bioanalytical method

    Directory of Open Access Journals (Sweden)

    Natalija Nakov

    2014-06-01

    Full Text Available The selection of an adequate regression model is the basis for obtaining accurate and reproducible results during the bionalytical method validation. Given the wide concentration range, frequently present in bioanalytical assays, heteroscedasticity of the data may be expected. Several weighted linear and quadratic regression models were evaluated during the selection of the adequate curve fit using nonparametric statistical tests: One sample rank test and Wilcoxon signed rank test for two independent groups of samples. The results obtained with One sample rank test could not give statistical justification for the selection of linear vs. quadratic regression models because slight differences between the error (presented through the relative residuals were obtained. Estimation of the significance of the differences in the RR was achieved using Wilcoxon signed rank test, where linear and quadratic regression models were treated as two independent groups. The application of this simple non-parametric statistical test provides statistical confirmation of the choice of an adequate regression model.

  9. Linear regression models for quantitative assessment of left ...

    African Journals Online (AJOL)

    Changes in left ventricular structures and function have been reported in cardiomyopathies. No prediction models have been established in this environment. This study established regression models for prediction of left ventricular structures in normal subjects. A sample of normal subjects was drawn from a large urban ...

  10. Reduced Rank Regression

    DEFF Research Database (Denmark)

    Johansen, Søren

    2008-01-01

    The reduced rank regression model is a multivariate regression model with a coefficient matrix with reduced rank. The reduced rank regression algorithm is an estimation procedure, which estimates the reduced rank regression model. It is related to canonical correlations and involves calculating...

  11. Poisson regression for modeling count and frequency outcomes in trauma research.

    Science.gov (United States)

    Gagnon, David R; Doron-LaMarca, Susan; Bell, Margret; O'Farrell, Timothy J; Taft, Casey T

    2008-10-01

    The authors describe how the Poisson regression method for analyzing count or frequency outcome variables can be applied in trauma studies. The outcome of interest in trauma research may represent a count of the number of incidents of behavior occurring in a given time interval, such as acts of physical aggression or substance abuse. Traditional linear regression approaches assume a normally distributed outcome variable with equal variances over the range of predictor variables, and may not be optimal for modeling count outcomes. An application of Poisson regression is presented using data from a study of intimate partner aggression among male patients in an alcohol treatment program and their female partners. Results of Poisson regression and linear regression models are compared.

  12. Sedimentary characteristics and controlling factors of shelf sand ridges in the Pearl River Mouth Basin, northeast of South China Sea

    Directory of Open Access Journals (Sweden)

    Xiangtao Zhang

    2017-04-01

    Full Text Available Shelf sand ridge is a significant type of reservoir in the continental marginal basin, and it has drawn so much attention from sedimentologists and petroleum geologists. We were able to investigate the morphology, distribution, and sedimentary structures of shelf sand ridges systematically in this study based on the integration of high-resolution 3D seismic data, well logging, and cores. These shelf sand ridges are an asymmetrical mound-like structure in profiles, and they developed on an ancient uplift in the forced regression system tract and are onlapped by the overlying strata. In the plane, shelf sand ridges present as linear-shaped, which is different from the classical radial pattern; not to mention, they are separated into two parts by low amplitude tidal muddy channels. Corrugated bedding, tidal bedding, and scouring features are distinguished in cores of shelf sand ridges together with the coarsening up in lithology. All of these sedimentary characteristics indicate that shelf sand ridges deposited in the Pearl River Mouth Basin are reconstructed by the tidal and coastal current.

  13. Regression Models and Fuzzy Logic Prediction of TBM Penetration Rate

    Directory of Open Access Journals (Sweden)

    Minh Vu Trieu

    2017-03-01

    Full Text Available This paper presents statistical analyses of rock engineering properties and the measured penetration rate of tunnel boring machine (TBM based on the data of an actual project. The aim of this study is to analyze the influence of rock engineering properties including uniaxial compressive strength (UCS, Brazilian tensile strength (BTS, rock brittleness index (BI, the distance between planes of weakness (DPW, and the alpha angle (Alpha between the tunnel axis and the planes of weakness on the TBM rate of penetration (ROP. Four (4 statistical regression models (two linear and two nonlinear are built to predict the ROP of TBM. Finally a fuzzy logic model is developed as an alternative method and compared to the four statistical regression models. Results show that the fuzzy logic model provides better estimations and can be applied to predict the TBM performance. The R-squared value (R2 of the fuzzy logic model scores the highest value of 0.714 over the second runner-up of 0.667 from the multiple variables nonlinear regression model.

  14. Regression Models and Fuzzy Logic Prediction of TBM Penetration Rate

    Science.gov (United States)

    Minh, Vu Trieu; Katushin, Dmitri; Antonov, Maksim; Veinthal, Renno

    2017-03-01

    This paper presents statistical analyses of rock engineering properties and the measured penetration rate of tunnel boring machine (TBM) based on the data of an actual project. The aim of this study is to analyze the influence of rock engineering properties including uniaxial compressive strength (UCS), Brazilian tensile strength (BTS), rock brittleness index (BI), the distance between planes of weakness (DPW), and the alpha angle (Alpha) between the tunnel axis and the planes of weakness on the TBM rate of penetration (ROP). Four (4) statistical regression models (two linear and two nonlinear) are built to predict the ROP of TBM. Finally a fuzzy logic model is developed as an alternative method and compared to the four statistical regression models. Results show that the fuzzy logic model provides better estimations and can be applied to predict the TBM performance. The R-squared value (R2) of the fuzzy logic model scores the highest value of 0.714 over the second runner-up of 0.667 from the multiple variables nonlinear regression model.

  15. Deep ensemble learning of sparse regression models for brain disease diagnosis.

    Science.gov (United States)

    Suk, Heung-Il; Lee, Seong-Whan; Shen, Dinggang

    2017-04-01

    Recent studies on brain imaging analysis witnessed the core roles of machine learning techniques in computer-assisted intervention for brain disease diagnosis. Of various machine-learning techniques, sparse regression models have proved their effectiveness in handling high-dimensional data but with a small number of training samples, especially in medical problems. In the meantime, deep learning methods have been making great successes by outperforming the state-of-the-art performances in various applications. In this paper, we propose a novel framework that combines the two conceptually different methods of sparse regression and deep learning for Alzheimer's disease/mild cognitive impairment diagnosis and prognosis. Specifically, we first train multiple sparse regression models, each of which is trained with different values of a regularization control parameter. Thus, our multiple sparse regression models potentially select different feature subsets from the original feature set; thereby they have different powers to predict the response values, i.e., clinical label and clinical scores in our work. By regarding the response values from our sparse regression models as target-level representations, we then build a deep convolutional neural network for clinical decision making, which thus we call 'Deep Ensemble Sparse Regression Network.' To our best knowledge, this is the first work that combines sparse regression models with deep neural network. In our experiments with the ADNI cohort, we validated the effectiveness of the proposed method by achieving the highest diagnostic accuracies in three classification tasks. We also rigorously analyzed our results and compared with the previous studies on the ADNI cohort in the literature. Copyright © 2017 Elsevier B.V. All rights reserved.

  16. A computational approach to compare regression modelling strategies in prediction research.

    Science.gov (United States)

    Pajouheshnia, Romin; Pestman, Wiebe R; Teerenstra, Steven; Groenwold, Rolf H H

    2016-08-25

    It is often unclear which approach to fit, assess and adjust a model will yield the most accurate prediction model. We present an extension of an approach for comparing modelling strategies in linear regression to the setting of logistic regression and demonstrate its application in clinical prediction research. A framework for comparing logistic regression modelling strategies by their likelihoods was formulated using a wrapper approach. Five different strategies for modelling, including simple shrinkage methods, were compared in four empirical data sets to illustrate the concept of a priori strategy comparison. Simulations were performed in both randomly generated data and empirical data to investigate the influence of data characteristics on strategy performance. We applied the comparison framework in a case study setting. Optimal strategies were selected based on the results of a priori comparisons in a clinical data set and the performance of models built according to each strategy was assessed using the Brier score and calibration plots. The performance of modelling strategies was highly dependent on the characteristics of the development data in both linear and logistic regression settings. A priori comparisons in four empirical data sets found that no strategy consistently outperformed the others. The percentage of times that a model adjustment strategy outperformed a logistic model ranged from 3.9 to 94.9 %, depending on the strategy and data set. However, in our case study setting the a priori selection of optimal methods did not result in detectable improvement in model performance when assessed in an external data set. The performance of prediction modelling strategies is a data-dependent process and can be highly variable between data sets within the same clinical domain. A priori strategy comparison can be used to determine an optimal logistic regression modelling strategy for a given data set before selecting a final modelling approach.

  17. Interactions Between Mantle Plumes and Mid-Ocean Ridges: Constraints from Geophysics, Geochemistry, and Geodynamical Modeling

    National Research Council Canada - National Science Library

    Georgen, Jennifer

    2001-01-01

    This thesis studies interactions between mid-ocean ridges and mantle plumes. Chapter 1 investigates the effects of the Marion and Bouvet hotspots on the ultra-slow spreading, highly-segmented Southwest Indian Ridge (SWIR...

  18. Geology along the Blue Ridge Parkway in Virginia

    Science.gov (United States)

    Carter, Mark W.; Southworth, C. Scott; Tollo, Richard P.; Merschat, Arthur J.; Wagner, Sara; Lazor, Ava; Aleinikoff, John N.

    2017-01-01

    Back Formations. These rocks are bound by numerous faults, including the Rock Castle Creek fault that separates Ashe Formation rocks from Alligator Back Formation rocks in the core of the Ararat River synclinorium. The lack of unequivocal paleontologic or geochronologic ages for any of these rock sequences, combined with fundamental and conflicting differences in tectonogenetic models, compound the problem of regional correlation with Blue Ridge cover rocks to the north.The geologic transition from the central to southern Appalachians is also marked by a profound change in landscape and surficial deposits. In central Virginia, the Blue Ridge consists of narrow ridges that are held up by resistant but contrasting basement and cover lithologies. These ridges have shed eroded material from their crests to the base of the mountain fronts in the form of talus slopes, debris flows, and alluvial-colluvial fans for perhaps 10 m.y. South of Roanoke, however, ridges transition into a broad hilly plateau, flanked on the east by the Blue Ridge escarpment and the eastern Continental Divide. Here, deposits of rounded pebbles, cobbles, and boulders preserve remnants of ancestral west-flowing drainage systems.Both bedrock and surficial geologic processes provide an array of economic deposits along the length of the Blue Ridge Parkway corridor in Virginia, including base and precious metals and industrial minerals. However, common stone was the most important commodity for creating the Blue Ridge Parkway, which yielded building stone for overlooks and tunnels, or crushed stone for road base and pavement.

  19. A generalized right truncated bivariate Poisson regression model with applications to health data.

    Science.gov (United States)

    Islam, M Ataharul; Chowdhury, Rafiqul I

    2017-01-01

    A generalized right truncated bivariate Poisson regression model is proposed in this paper. Estimation and tests for goodness of fit and over or under dispersion are illustrated for both untruncated and right truncated bivariate Poisson regression models using marginal-conditional approach. Estimation and test procedures are illustrated for bivariate Poisson regression models with applications to Health and Retirement Study data on number of health conditions and the number of health care services utilized. The proposed test statistics are easy to compute and it is evident from the results that the models fit the data very well. A comparison between the right truncated and untruncated bivariate Poisson regression models using the test for nonnested models clearly shows that the truncated model performs significantly better than the untruncated model.

  20. Prediction of Agriculture Drought Using Support Vector Regression Incorporating with Climatology Indices

    Science.gov (United States)

    Tian, Y.; Xu, Y. P.

    2017-12-01

    In this paper, the Support Vector Regression (SVR) model incorporating climate indices and drought indices are developed to predict agriculture drought in Xiangjiang River basin, Central China. The agriculture droughts are presented with the Precipitation-Evapotranspiration Index (SPEI). According to the analysis of the relationship between SPEI with different time scales and soil moisture, it is found that SPEI of six months time scales (SPEI-6) could reflect the soil moisture better than that of three and one month time scale from the drought features including drought duration, severity and peak. Climate forcing like El Niño Southern Oscillation and western Pacific subtropical high (WPSH) are represented by climate indices such as MEI and series indices of WPSH. Ridge Point of WPSH is found to be the key factor that influences the agriculture drought mainly through the control of temperature. Based on the climate indices analysis, the predictions of SPEI-6 are conducted using the SVR model. The results show that the SVR model incorperating climate indices, especially ridge point of WPSH, could improve the prediction accuracy compared to that using drought index only. The improvement was more significant for the prediction of one month lead time than that of three months lead time. However, it needs to be cautious in selection of the input parameters, since adding more useless information could have a counter effect in attaining a better prediction.

  1. Improved voltage performance of the Oak Ridge 25URC tandem accelerator

    International Nuclear Information System (INIS)

    Meigs, M.J.; Jones, C.M.; Haynes, D.L.; Juras, R.C.; Ziegler, N.F.; Roatz, J.E.; Rathmell, R.D.

    1989-01-01

    This paper reports on the Oak Ridge 25URC tandem electrostatic accelerator one of two accelerators operated by the Holifield Heavy Ion Research Facility (HHIRF) at the Oak Ridge National Laboratory. Placed into routine service in 1982, the accelerator has provided a wide range of heavy ion beams for research in nuclear and atomic physics. These beams have been provided both directly and after further acceleration by the Oak Ridge Isochronous Cyclotron (ORIC). Show schematically in this paper, the tandem accelerator is a model 25URC Pelletron accelerator

  2. Linear regression metamodeling as a tool to summarize and present simulation model results.

    Science.gov (United States)

    Jalal, Hawre; Dowd, Bryan; Sainfort, François; Kuntz, Karen M

    2013-10-01

    Modelers lack a tool to systematically and clearly present complex model results, including those from sensitivity analyses. The objective was to propose linear regression metamodeling as a tool to increase transparency of decision analytic models and better communicate their results. We used a simplified cancer cure model to demonstrate our approach. The model computed the lifetime cost and benefit of 3 treatment options for cancer patients. We simulated 10,000 cohorts in a probabilistic sensitivity analysis (PSA) and regressed the model outcomes on the standardized input parameter values in a set of regression analyses. We used the regression coefficients to describe measures of sensitivity analyses, including threshold and parameter sensitivity analyses. We also compared the results of the PSA to deterministic full-factorial and one-factor-at-a-time designs. The regression intercept represented the estimated base-case outcome, and the other coefficients described the relative parameter uncertainty in the model. We defined simple relationships that compute the average and incremental net benefit of each intervention. Metamodeling produced outputs similar to traditional deterministic 1-way or 2-way sensitivity analyses but was more reliable since it used all parameter values. Linear regression metamodeling is a simple, yet powerful, tool that can assist modelers in communicating model characteristics and sensitivity analyses.

  3. Hierarchical Neural Regression Models for Customer Churn Prediction

    Directory of Open Access Journals (Sweden)

    Golshan Mohammadi

    2013-01-01

    Full Text Available As customers are the main assets of each industry, customer churn prediction is becoming a major task for companies to remain in competition with competitors. In the literature, the better applicability and efficiency of hierarchical data mining techniques has been reported. This paper considers three hierarchical models by combining four different data mining techniques for churn prediction, which are backpropagation artificial neural networks (ANN, self-organizing maps (SOM, alpha-cut fuzzy c-means (α-FCM, and Cox proportional hazards regression model. The hierarchical models are ANN + ANN + Cox, SOM + ANN + Cox, and α-FCM + ANN + Cox. In particular, the first component of the models aims to cluster data in two churner and nonchurner groups and also filter out unrepresentative data or outliers. Then, the clustered data as the outputs are used to assign customers to churner and nonchurner groups by the second technique. Finally, the correctly classified data are used to create Cox proportional hazards model. To evaluate the performance of the hierarchical models, an Iranian mobile dataset is considered. The experimental results show that the hierarchical models outperform the single Cox regression baseline model in terms of prediction accuracy, Types I and II errors, RMSE, and MAD metrics. In addition, the α-FCM + ANN + Cox model significantly performs better than the two other hierarchical models.

  4. LINEAR REGRESSION MODEL ESTİMATİON FOR RIGHT CENSORED DATA

    Directory of Open Access Journals (Sweden)

    Ersin Yılmaz

    2016-05-01

    Full Text Available In this study, firstly we will define a right censored data. If we say shortly right-censored data is censoring values that above the exact line. This may be related with scaling device. And then  we will use response variable acquainted from right-censored explanatory variables. Then the linear regression model will be estimated. For censored data’s existence, Kaplan-Meier weights will be used for  the estimation of the model. With the weights regression model  will be consistent and unbiased with that.   And also there is a method for the censored data that is a semi parametric regression and this method also give  useful results  for censored data too. This study also might be useful for the health studies because of the censored data used in medical issues generally.

  5. Developing and testing a global-scale regression model to quantify mean annual streamflow

    Science.gov (United States)

    Barbarossa, Valerio; Huijbregts, Mark A. J.; Hendriks, A. Jan; Beusen, Arthur H. W.; Clavreul, Julie; King, Henry; Schipper, Aafke M.

    2017-01-01

    Quantifying mean annual flow of rivers (MAF) at ungauged sites is essential for assessments of global water supply, ecosystem integrity and water footprints. MAF can be quantified with spatially explicit process-based models, which might be overly time-consuming and data-intensive for this purpose, or with empirical regression models that predict MAF based on climate and catchment characteristics. Yet, regression models have mostly been developed at a regional scale and the extent to which they can be extrapolated to other regions is not known. In this study, we developed a global-scale regression model for MAF based on a dataset unprecedented in size, using observations of discharge and catchment characteristics from 1885 catchments worldwide, measuring between 2 and 106 km2. In addition, we compared the performance of the regression model with the predictive ability of the spatially explicit global hydrological model PCR-GLOBWB by comparing results from both models to independent measurements. We obtained a regression model explaining 89% of the variance in MAF based on catchment area and catchment averaged mean annual precipitation and air temperature, slope and elevation. The regression model performed better than PCR-GLOBWB for the prediction of MAF, as root-mean-square error (RMSE) values were lower (0.29-0.38 compared to 0.49-0.57) and the modified index of agreement (d) was higher (0.80-0.83 compared to 0.72-0.75). Our regression model can be applied globally to estimate MAF at any point of the river network, thus providing a feasible alternative to spatially explicit process-based global hydrological models.

  6. Resolving issues at the Department of Energy/Oak Ridge operations facilities

    International Nuclear Information System (INIS)

    Row, T.H.; Adams, W.D.

    1988-01-01

    The development of the US Department of Energy Oak Ridge Operations Office's model for waste management and its application in the Oak Ridge Reservation are discussed. The concept simply stated is to assure that all stakeholders in waste management decisions have the opportunity to be participants from the first step. The paper discusses the advisory committees involved in the process, subcontracting support, college and university relation, technology demonstrations and planning, other federal agency interaction, and the model meeting

  7. Remedial Investigation Work Plan for Chestnut Ridge Operable Unit 1 (Chestnut Ridge Security Pits) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee

    Energy Technology Data Exchange (ETDEWEB)

    1993-09-01

    This Remedial Investigation (RI) Work Plan specifically addresses Chestnut Ridge Operable Unit 1, (OU1) which consists of the Chestnut Ridge Security Pits (CRSP). The CRSP are located {approximately}800 ft southeast of the central portion of the Y-12 Plant atop Chestnut Ridge, which is bounded to the northwest by Bear Creek Valley and to the southeast by Bethel Valley. Operated from 1973 to 1988, the CRSP consisted of a series of trenches used for the disposal of classified hazardous and nonhazardous waste materials. Disposal of hazardous waste materials was discontinued in December 1984, while nonhazardous waste disposal ended on November 8, 1988. An RI is being conducted at this site in response to CERCLA regulations. The overall objectives of the RI are to collect data necessary to evaluate the nature and extent of contaminants of concern (COC), support an ecological risk assessment (ERA) and a human health risk assessment (HHRA), support the evaluation of remedial alternatives, and ultimately develop a Record of Decision for the site. The purpose of this Work Plan is to outline RI activities necessary to define the nature and extent of suspected contaminants at Chestnut Ridge OU1. Potential migration pathways also will be investigated. Data collected during the RI will be used to evaluate the overall risk posed to human health and the environment by OU1.

  8. Remedial Investigation Work Plan for Chestnut Ridge Operable Unit 1 (Chestnut Ridge Security Pits) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1993-09-01

    This Remedial Investigation (RI) Work Plan specifically addresses Chestnut Ridge Operable Unit 1, (OU1) which consists of the Chestnut Ridge Security Pits (CRSP). The CRSP are located ∼800 ft southeast of the central portion of the Y-12 Plant atop Chestnut Ridge, which is bounded to the northwest by Bear Creek Valley and to the southeast by Bethel Valley. Operated from 1973 to 1988, the CRSP consisted of a series of trenches used for the disposal of classified hazardous and nonhazardous waste materials. Disposal of hazardous waste materials was discontinued in December 1984, while nonhazardous waste disposal ended on November 8, 1988. An RI is being conducted at this site in response to CERCLA regulations. The overall objectives of the RI are to collect data necessary to evaluate the nature and extent of contaminants of concern (COC), support an ecological risk assessment (ERA) and a human health risk assessment (HHRA), support the evaluation of remedial alternatives, and ultimately develop a Record of Decision for the site. The purpose of this Work Plan is to outline RI activities necessary to define the nature and extent of suspected contaminants at Chestnut Ridge OU1. Potential migration pathways also will be investigated. Data collected during the RI will be used to evaluate the overall risk posed to human health and the environment by OU1

  9. A land use regression model for ambient ultrafine particles in Montreal, Canada: A comparison of linear regression and a machine learning approach.

    Science.gov (United States)

    Weichenthal, Scott; Ryswyk, Keith Van; Goldstein, Alon; Bagg, Scott; Shekkarizfard, Maryam; Hatzopoulou, Marianne

    2016-04-01

    Existing evidence suggests that ambient ultrafine particles (UFPs) (regression model for UFPs in Montreal, Canada using mobile monitoring data collected from 414 road segments during the summer and winter months between 2011 and 2012. Two different approaches were examined for model development including standard multivariable linear regression and a machine learning approach (kernel-based regularized least squares (KRLS)) that learns the functional form of covariate impacts on ambient UFP concentrations from the data. The final models included parameters for population density, ambient temperature and wind speed, land use parameters (park space and open space), length of local roads and rail, and estimated annual average NOx emissions from traffic. The final multivariable linear regression model explained 62% of the spatial variation in ambient UFP concentrations whereas the KRLS model explained 79% of the variance. The KRLS model performed slightly better than the linear regression model when evaluated using an external dataset (R(2)=0.58 vs. 0.55) or a cross-validation procedure (R(2)=0.67 vs. 0.60). In general, our findings suggest that the KRLS approach may offer modest improvements in predictive performance compared to standard multivariable linear regression models used to estimate spatial variations in ambient UFPs. However, differences in predictive performance were not statistically significant when evaluated using the cross-validation procedure. Crown Copyright © 2015. Published by Elsevier Inc. All rights reserved.

  10. Adaptive regression for modeling nonlinear relationships

    CERN Document Server

    Knafl, George J

    2016-01-01

    This book presents methods for investigating whether relationships are linear or nonlinear and for adaptively fitting appropriate models when they are nonlinear. Data analysts will learn how to incorporate nonlinearity in one or more predictor variables into regression models for different types of outcome variables. Such nonlinear dependence is often not considered in applied research, yet nonlinear relationships are common and so need to be addressed. A standard linear analysis can produce misleading conclusions, while a nonlinear analysis can provide novel insights into data, not otherwise possible. A variety of examples of the benefits of modeling nonlinear relationships are presented throughout the book. Methods are covered using what are called fractional polynomials based on real-valued power transformations of primary predictor variables combined with model selection based on likelihood cross-validation. The book covers how to formulate and conduct such adaptive fractional polynomial modeling in the s...

  11. Confidence bands for inverse regression models

    International Nuclear Information System (INIS)

    Birke, Melanie; Bissantz, Nicolai; Holzmann, Hajo

    2010-01-01

    We construct uniform confidence bands for the regression function in inverse, homoscedastic regression models with convolution-type operators. Here, the convolution is between two non-periodic functions on the whole real line rather than between two periodic functions on a compact interval, since the former situation arguably arises more often in applications. First, following Bickel and Rosenblatt (1973 Ann. Stat. 1 1071–95) we construct asymptotic confidence bands which are based on strong approximations and on a limit theorem for the supremum of a stationary Gaussian process. Further, we propose bootstrap confidence bands based on the residual bootstrap and prove consistency of the bootstrap procedure. A simulation study shows that the bootstrap confidence bands perform reasonably well for moderate sample sizes. Finally, we apply our method to data from a gel electrophoresis experiment with genetically engineered neuronal receptor subunits incubated with rat brain extract

  12. Electricity consumption forecasting in Italy using linear regression models

    Energy Technology Data Exchange (ETDEWEB)

    Bianco, Vincenzo; Manca, Oronzio; Nardini, Sergio [DIAM, Seconda Universita degli Studi di Napoli, Via Roma 29, 81031 Aversa (CE) (Italy)

    2009-09-15

    The influence of economic and demographic variables on the annual electricity consumption in Italy has been investigated with the intention to develop a long-term consumption forecasting model. The time period considered for the historical data is from 1970 to 2007. Different regression models were developed, using historical electricity consumption, gross domestic product (GDP), gross domestic product per capita (GDP per capita) and population. A first part of the paper considers the estimation of GDP, price and GDP per capita elasticities of domestic and non-domestic electricity consumption. The domestic and non-domestic short run price elasticities are found to be both approximately equal to -0.06, while long run elasticities are equal to -0.24 and -0.09, respectively. On the contrary, the elasticities of GDP and GDP per capita present higher values. In the second part of the paper, different regression models, based on co-integrated or stationary data, are presented. Different statistical tests are employed to check the validity of the proposed models. A comparison with national forecasts, based on complex econometric models, such as Markal-Time, was performed, showing that the developed regressions are congruent with the official projections, with deviations of {+-}1% for the best case and {+-}11% for the worst. These deviations are to be considered acceptable in relation to the time span taken into account. (author)

  13. Electricity consumption forecasting in Italy using linear regression models

    International Nuclear Information System (INIS)

    Bianco, Vincenzo; Manca, Oronzio; Nardini, Sergio

    2009-01-01

    The influence of economic and demographic variables on the annual electricity consumption in Italy has been investigated with the intention to develop a long-term consumption forecasting model. The time period considered for the historical data is from 1970 to 2007. Different regression models were developed, using historical electricity consumption, gross domestic product (GDP), gross domestic product per capita (GDP per capita) and population. A first part of the paper considers the estimation of GDP, price and GDP per capita elasticities of domestic and non-domestic electricity consumption. The domestic and non-domestic short run price elasticities are found to be both approximately equal to -0.06, while long run elasticities are equal to -0.24 and -0.09, respectively. On the contrary, the elasticities of GDP and GDP per capita present higher values. In the second part of the paper, different regression models, based on co-integrated or stationary data, are presented. Different statistical tests are employed to check the validity of the proposed models. A comparison with national forecasts, based on complex econometric models, such as Markal-Time, was performed, showing that the developed regressions are congruent with the official projections, with deviations of ±1% for the best case and ±11% for the worst. These deviations are to be considered acceptable in relation to the time span taken into account. (author)

  14. Improving sub-pixel imperviousness change prediction by ensembling heterogeneous non-linear regression models

    Directory of Open Access Journals (Sweden)

    Drzewiecki Wojciech

    2016-12-01

    Full Text Available In this work nine non-linear regression models were compared for sub-pixel impervious surface area mapping from Landsat images. The comparison was done in three study areas both for accuracy of imperviousness coverage evaluation in individual points in time and accuracy of imperviousness change assessment. The performance of individual machine learning algorithms (Cubist, Random Forest, stochastic gradient boosting of regression trees, k-nearest neighbors regression, random k-nearest neighbors regression, Multivariate Adaptive Regression Splines, averaged neural networks, and support vector machines with polynomial and radial kernels was also compared with the performance of heterogeneous model ensembles constructed from the best models trained using particular techniques.

  15. Preliminary results from the first InRidge cruise to the central Indian Ridge

    Digital Repository Service at National Institute of Oceanography (India)

    Mukhopadhyay, R.; Murthy, K.S.R.; Iyer, S.D.; Rao, M.M.M.; Banerjee, R.; Subrahmanyam, A.S.; Shirodkar, P.V.; Ghose, I.; Ganesan, P.; Rao, A.K.; Suribabu, A.; Ganesh, C.; Naik, G.P.

    stream_size 1 stream_content_type text/plain stream_name Inter_Ridge_News_7_40.pdf.txt stream_source_info Inter_Ridge_News_7_40.pdf.txt Content-Encoding ISO-8859-1 Content-Type text/plain; charset=ISO-8859-1 ...

  16. Modelling infant mortality rate in Central Java, Indonesia use generalized poisson regression method

    Science.gov (United States)

    Prahutama, Alan; Sudarno

    2018-05-01

    The infant mortality rate is the number of deaths under one year of age occurring among the live births in a given geographical area during a given year, per 1,000 live births occurring among the population of the given geographical area during the same year. This problem needs to be addressed because it is an important element of a country’s economic development. High infant mortality rate will disrupt the stability of a country as it relates to the sustainability of the population in the country. One of regression model that can be used to analyze the relationship between dependent variable Y in the form of discrete data and independent variable X is Poisson regression model. Recently The regression modeling used for data with dependent variable is discrete, among others, poisson regression, negative binomial regression and generalized poisson regression. In this research, generalized poisson regression modeling gives better AIC value than poisson regression. The most significant variable is the Number of health facilities (X1), while the variable that gives the most influence to infant mortality rate is the average breastfeeding (X9).

  17. Augmented Beta rectangular regression models: A Bayesian perspective.

    Science.gov (United States)

    Wang, Jue; Luo, Sheng

    2016-01-01

    Mixed effects Beta regression models based on Beta distributions have been widely used to analyze longitudinal percentage or proportional data ranging between zero and one. However, Beta distributions are not flexible to extreme outliers or excessive events around tail areas, and they do not account for the presence of the boundary values zeros and ones because these values are not in the support of the Beta distributions. To address these issues, we propose a mixed effects model using Beta rectangular distribution and augment it with the probabilities of zero and one. We conduct extensive simulation studies to assess the performance of mixed effects models based on both the Beta and Beta rectangular distributions under various scenarios. The simulation studies suggest that the regression models based on Beta rectangular distributions improve the accuracy of parameter estimates in the presence of outliers and heavy tails. The proposed models are applied to the motivating Neuroprotection Exploratory Trials in Parkinson's Disease (PD) Long-term Study-1 (LS-1 study, n = 1741), developed by The National Institute of Neurological Disorders and Stroke Exploratory Trials in Parkinson's Disease (NINDS NET-PD) network. © 2015 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  18. Bayesian semiparametric regression models to characterize molecular evolution

    Directory of Open Access Journals (Sweden)

    Datta Saheli

    2012-10-01

    Full Text Available Abstract Background Statistical models and methods that associate changes in the physicochemical properties of amino acids with natural selection at the molecular level typically do not take into account the correlations between such properties. We propose a Bayesian hierarchical regression model with a generalization of the Dirichlet process prior on the distribution of the regression coefficients that describes the relationship between the changes in amino acid distances and natural selection in protein-coding DNA sequence alignments. Results The Bayesian semiparametric approach is illustrated with simulated data and the abalone lysin sperm data. Our method identifies groups of properties which, for this particular dataset, have a similar effect on evolution. The model also provides nonparametric site-specific estimates for the strength of conservation of these properties. Conclusions The model described here is distinguished by its ability to handle a large number of amino acid properties simultaneously, while taking into account that such data can be correlated. The multi-level clustering ability of the model allows for appealing interpretations of the results in terms of properties that are roughly equivalent from the standpoint of molecular evolution.

  19. Regression analysis of a chemical reaction fouling model

    International Nuclear Information System (INIS)

    Vasak, F.; Epstein, N.

    1996-01-01

    A previously reported mathematical model for the initial chemical reaction fouling of a heated tube is critically examined in the light of the experimental data for which it was developed. A regression analysis of the model with respect to that data shows that the reference point upon which the two adjustable parameters of the model were originally based was well chosen, albeit fortuitously. (author). 3 refs., 2 tabs., 2 figs

  20. Correlation-regression model for physico-chemical quality of ...

    African Journals Online (AJOL)

    abusaad

    areas, suggesting that groundwater quality in urban areas is closely related with land use ... the ground water, with correlation and regression model is also presented. ...... WHO (World Health Organization) (1985). Health hazards from nitrates.

  1. Greenland Fracture Zone-East Greenland Ridge(s) revisited: Indications of a C22-change in plate motion?

    DEFF Research Database (Denmark)

    Døssing, Arne; Funck, T.

    2012-01-01

    a reinterpretation of the Greenland Fracture Zone -East Greenland Ridge based on new and existing geophysical data. Evidence is shown for two overstepping ridge segments (Segments A and B) of which Segment A corresponds to the already known East Greenland Ridge while Segment B was not detected previously......Changes in the lithospheric stress field, causing axial rift migration and reorientation of the transform, are generally proposed as an explanation for anomalously old crust and/or major aseismic valleys in oceanic ridge-transform-ridge settings. Similarly, transform migration of the Greenland...... Fracture Zone and separation of the 200-km-long, fracture-zone-parallel continental East Greenland Ridge from the Eurasia plate is thought to be related to a major change in relative plate motions between Greenland and Eurasia during the earliest Oligocene (Chron 13 time). This study presents...

  2. Multiple linear regression and regression with time series error models in forecasting PM10 concentrations in Peninsular Malaysia.

    Science.gov (United States)

    Ng, Kar Yong; Awang, Norhashidah

    2018-01-06

    Frequent haze occurrences in Malaysia have made the management of PM 10 (particulate matter with aerodynamic less than 10 μm) pollution a critical task. This requires knowledge on factors associating with PM 10 variation and good forecast of PM 10 concentrations. Hence, this paper demonstrates the prediction of 1-day-ahead daily average PM 10 concentrations based on predictor variables including meteorological parameters and gaseous pollutants. Three different models were built. They were multiple linear regression (MLR) model with lagged predictor variables (MLR1), MLR model with lagged predictor variables and PM 10 concentrations (MLR2) and regression with time series error (RTSE) model. The findings revealed that humidity, temperature, wind speed, wind direction, carbon monoxide and ozone were the main factors explaining the PM 10 variation in Peninsular Malaysia. Comparison among the three models showed that MLR2 model was on a same level with RTSE model in terms of forecasting accuracy, while MLR1 model was the worst.

  3. Multivariate Frequency-Severity Regression Models in Insurance

    Directory of Open Access Journals (Sweden)

    Edward W. Frees

    2016-02-01

    Full Text Available In insurance and related industries including healthcare, it is common to have several outcome measures that the analyst wishes to understand using explanatory variables. For example, in automobile insurance, an accident may result in payments for damage to one’s own vehicle, damage to another party’s vehicle, or personal injury. It is also common to be interested in the frequency of accidents in addition to the severity of the claim amounts. This paper synthesizes and extends the literature on multivariate frequency-severity regression modeling with a focus on insurance industry applications. Regression models for understanding the distribution of each outcome continue to be developed yet there now exists a solid body of literature for the marginal outcomes. This paper contributes to this body of literature by focusing on the use of a copula for modeling the dependence among these outcomes; a major advantage of this tool is that it preserves the body of work established for marginal models. We illustrate this approach using data from the Wisconsin Local Government Property Insurance Fund. This fund offers insurance protection for (i property; (ii motor vehicle; and (iii contractors’ equipment claims. In addition to several claim types and frequency-severity components, outcomes can be further categorized by time and space, requiring complex dependency modeling. We find significant dependencies for these data; specifically, we find that dependencies among lines are stronger than the dependencies between the frequency and average severity within each line.

  4. Quantile Regression Methods

    DEFF Research Database (Denmark)

    Fitzenberger, Bernd; Wilke, Ralf Andreas

    2015-01-01

    if the mean regression model does not. We provide a short informal introduction into the principle of quantile regression which includes an illustrative application from empirical labor market research. This is followed by briefly sketching the underlying statistical model for linear quantile regression based......Quantile regression is emerging as a popular statistical approach, which complements the estimation of conditional mean models. While the latter only focuses on one aspect of the conditional distribution of the dependent variable, the mean, quantile regression provides more detailed insights...... by modeling conditional quantiles. Quantile regression can therefore detect whether the partial effect of a regressor on the conditional quantiles is the same for all quantiles or differs across quantiles. Quantile regression can provide evidence for a statistical relationship between two variables even...

  5. A linked land-sea modeling framework to inform ridge-to-reef management in high oceanic islands.

    Directory of Open Access Journals (Sweden)

    Jade M S Delevaux

    Full Text Available Declining natural resources have led to a cultural renaissance across the Pacific that seeks to revive customary ridge-to-reef management approaches to protect freshwater and restore abundant coral reef fisheries. Effective ridge-to-reef management requires improved understanding of land-sea linkages and decision-support tools to simultaneously evaluate the effects of terrestrial and marine drivers on coral reefs, mediated by anthropogenic activities. Although a few applications have linked the effects of land cover to coral reefs, these are too coarse in resolution to inform watershed-scale management for Pacific Islands. To address this gap, we developed a novel linked land-sea modeling framework based on local data, which coupled groundwater and coral reef models at fine spatial resolution, to determine the effects of terrestrial drivers (groundwater and nutrients, mediated by human activities (land cover/use, and marine drivers (waves, geography, and habitat on coral reefs. We applied this framework in two 'ridge-to-reef' systems (Hā'ena and Ka'ūpūlehu subject to different natural disturbance regimes, located in the Hawaiian Archipelago. Our results indicated that coral reefs in Ka'ūpūlehu are coral-dominated with many grazers and scrapers due to low rainfall and wave power. While coral reefs in Hā'ena are dominated by crustose coralline algae with many grazers and less scrapers due to high rainfall and wave power. In general, Ka'ūpūlehu is more vulnerable to land-based nutrients and coral bleaching than Hā'ena due to high coral cover and limited dilution and mixing from low rainfall and wave power. However, the shallow and wave sheltered back-reef areas of Hā'ena, which support high coral cover and act as nursery habitat for fishes, are also vulnerable to land-based nutrients and coral bleaching. Anthropogenic sources of nutrients located upstream from these vulnerable areas are relevant locations for nutrient mitigation, such as

  6. Application of random regression models to the genetic evaluation ...

    African Journals Online (AJOL)

    The model included fixed regression on AM (range from 30 to 138 mo) and the effect of herd-measurement date concatenation. Random parts of the model were RRM coefficients for additive and permanent environmental effects, while residual effects were modelled to account for heterogeneity of variance by AY. Estimates ...

  7. Multitask Quantile Regression under the Transnormal Model.

    Science.gov (United States)

    Fan, Jianqing; Xue, Lingzhou; Zou, Hui

    2016-01-01

    We consider estimating multi-task quantile regression under the transnormal model, with focus on high-dimensional setting. We derive a surprisingly simple closed-form solution through rank-based covariance regularization. In particular, we propose the rank-based ℓ 1 penalization with positive definite constraints for estimating sparse covariance matrices, and the rank-based banded Cholesky decomposition regularization for estimating banded precision matrices. By taking advantage of alternating direction method of multipliers, nearest correlation matrix projection is introduced that inherits sampling properties of the unprojected one. Our work combines strengths of quantile regression and rank-based covariance regularization to simultaneously deal with nonlinearity and nonnormality for high-dimensional regression. Furthermore, the proposed method strikes a good balance between robustness and efficiency, achieves the "oracle"-like convergence rate, and provides the provable prediction interval under the high-dimensional setting. The finite-sample performance of the proposed method is also examined. The performance of our proposed rank-based method is demonstrated in a real application to analyze the protein mass spectroscopy data.

  8. Moving Low-Carbon Transportation in Xinjiang: Evidence from STIRPAT and Rigid Regression Models

    Directory of Open Access Journals (Sweden)

    Jiefang Dong

    2016-12-01

    Full Text Available With the rapid economic development of the Xinjiang Uygur Autonomous Region, the area’s transport sector has witnessed significant growth, which in turn has led to a large increase in carbon dioxide emissions. As such, calculating of the carbon footprint of Xinjiang’s transportation sector and probing the driving factors of carbon dioxide emissions are of great significance to the region’s energy conservation and environmental protection. This paper provides an account of the growth in the carbon emissions of Xinjiang’s transportation sector during the period from 1989 to 2012. We also analyze the transportation sector’s trends and historical evolution. Combined with the STIRPAT (Stochastic Impacts by Regression on Population, Affluence and Technology model and ridge regression, this study further quantitatively analyzes the factors that influence the carbon emissions of Xinjiang’s transportation sector. The results indicate the following: (1 the total carbon emissions and per capita carbon emissions of Xinjiang’s transportation sector both continued to rise rapidly during this period; their average annual growth rates were 10.8% and 9.1%, respectively; (2 the carbon emissions of the transportation sector come mainly from the consumption of diesel and gasoline, which accounted for an average of 36.2% and 2.6% of carbon emissions, respectively; in addition, the overall carbon emission intensity of the transportation sector showed an “S”-pattern trend within the study period; (3 population density plays a dominant role in increasing carbon dioxide emissions. Population is then followed by per capita GDP and, finally, energy intensity. Cargo turnover has a more significant potential impact on and role in emission reduction than do private vehicles. This is because road freight is the primary form of transportation used across Xinjiang, and this form of transportation has low energy efficiency. These findings have important

  9. Approximating prediction uncertainty for random forest regression models

    Science.gov (United States)

    John W. Coulston; Christine E. Blinn; Valerie A. Thomas; Randolph H. Wynne

    2016-01-01

    Machine learning approaches such as random forest have increased for the spatial modeling and mapping of continuous variables. Random forest is a non-parametric ensemble approach, and unlike traditional regression approaches there is no direct quantification of prediction error. Understanding prediction uncertainty is important when using model-based continuous maps as...

  10. Alveolar ridge augmentation by osteoinduction in rats

    DEFF Research Database (Denmark)

    Pinholt, E M; Bang, G; Haanaes, H R

    1990-01-01

    The purpose of this study was to evaluate bone substitutes for alveolar ridge augmentation by osteoinduction. Allogenic, demineralized, and lyophilized dentin and bone was tested for osteoinductive properties in order to establish an experimental model for further studies. Implantations were perf...

  11. Global survey of lunar wrinkle ridge formation times

    Science.gov (United States)

    Yue, Z.; Michael, G. G.; Di, K.; Liu, J.

    2017-11-01

    Wrinkle ridges are a common feature of the lunar maria and record subsequent contraction of mare infill. Constraining the timing of wrinkle ridge formation from crater counts is challenging because they have limited areal extent and it is difficult to determine whether superposed craters post-date ridge formation or have alternatively been uplifted by the deformation. Some wrinkle ridges do allow determination to be made. This is possible where a ridge shows a sufficiently steep boundary or scarp that can be identified as deforming an intersecting crater or the crater obliterates the relief of the ridge. Such boundaries constitute only a small fraction of lunar wrinkle ridge structures yet they are sufficiently numerous to enable us to obtain statistically significant crater counts over systems of structurally related wrinkle ridges. We carried out a global mapping of mare wrinkle ridges, identifying appropriate boundaries for crater identification, and mapping superposed craters. Selected groups of ridges were analyzed using the buffered crater counting method. We found that, except for the ridges in mare Tranquilitatis, the ridge groups formed with average ages between 3.5 and 3.1 Ga ago, or 100-650 Ma after the oldest observable erupted basalts where they are located. We interpret these results to suggest that local stresses from loading by basalt fill are the principal agent responsible for the formation of lunar wrinkle ridges, as others have proposed. We find a markedly longer interval before wrinkle ridge formation in Tranquilitatis which likely indicates a different mechanism of stress accumulation at this site.

  12. Hot subduction: Magmatism along the Hunter Ridge, SW Pacific

    International Nuclear Information System (INIS)

    Crawford, A.J.; Verbeeten, A.; Danyushevsky, L.V.; Sigurdsson, I.A.; Maillet, P.; Monzier, M.

    1997-01-01

    The Hunter 'fracture zone' is generally regarded as a transform plate boundary linking the oppositely dipping Tongan and Vanuatu subduction systems. Dredging along the Hunter Ridge and sampling of its northernmost extent, exposed as the island of Kadavu in Fiji, has yielded a diversity of magmatic suites, including arc tholeiites and high-Ca boninites, high-Mg lavas with some affinities to boninites and some affinities to adakites, and true adakitic lavas associated with remarkable low-Fe, high-Na basalts with 8-16 ppm Nb (herein high-Nb basalts). Lavas which show clear evidence of slab melt involvement in their petrogenesis occur at either end of the Hunter Ridge, whereas the arc tholeiites and high-Ca boninites appear to be restricted to the south central part of the ridge. Mineralogical and whole rock geochemical data for each of these suites are summarized, and a tectono-magmatic model for their genesis and distribution is suggested. Trace element features and radiogenic isotope data for the Hunter Ridge lavas indicate compositions analogue to Pacific MORB-like mantle

  13. Profile-driven regression for modeling and runtime optimization of mobile networks

    DEFF Research Database (Denmark)

    McClary, Dan; Syrotiuk, Violet; Kulahci, Murat

    2010-01-01

    Computer networks often display nonlinear behavior when examined over a wide range of operating conditions. There are few strategies available for modeling such behavior and optimizing such systems as they run. Profile-driven regression is developed and applied to modeling and runtime optimization...... of throughput in a mobile ad hoc network, a self-organizing collection of mobile wireless nodes without any fixed infrastructure. The intermediate models generated in profile-driven regression are used to fit an overall model of throughput, and are also used to optimize controllable factors at runtime. Unlike...

  14. Best management practices plan for the Chestnut Ridge-Filled Coal Ash Pond at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1996-05-01

    The Chestnut Ridge Filled Coal Ash Pond (FCAP) Project has been established to satisfy Comprehensive Environmental Response, Compensation, and Liability Act (CERCLA) requirements for the Chestnut Ridge Operable Unit 2. FCAP is on Chestnut Ridge, approximately 0.5 miles south of the Y-12 Plant. A 62-foot high earthen dam across Upper McCoy Branch was constructed in 1955 to create a pond to serve as a settling basin for fly and bottom ashes generated by burning coal at the Y-12 Steam Plant. Ash from the steam was mixed with water to form a slurry and then pumped to the crest of Chestnut Ridge and released through a large pipe to flow across the Sluice Channel area and into the pond. The ash slurry eventually overtopped the dam and flowed along Upper McCoy Branch to Rogers Quarry. The purpose of this document is to provide a site-specific Best Management Practices (BMP) Plan for construction associated with environmental restoration activities at the FCAP Site

  15. Magnetic Anomalies over the Mid-Atlantic Ridge near 27{degrees}N.

    Science.gov (United States)

    Phillips, J D

    1967-08-25

    Ten magnetic profiles across the mid-Atlantic ridge near 27 degrees N show trends that are parallel to the ridge axis and symmetrical about the ridge axis. The configuration of magnetic bodies that could account for the pattern supports the Vine and Matthews hypothesis for the origin of magnetic anomalies over oceanic ridges. A polarity-reversal time scale inferred from models for sea-floor spreading in the Pacific-Antarctic ridge and radiometrically dated reversals of the geomagnetic field indicates a spreading rate of 1.25 centimeters per year during the last 6 million years and a rate of 1.65 centimeters per year between 6 and 10 million years ago. A similar analysis of more limited data over the mid-Atlantic ridge near 22 degrees N also indicates a change in the spreading rate. Here a rate of 1.4 centimeters per year appears to have been in effect during the last 5 million years; between 5 and 9 million years ago, an increased rate of 1.7 centimeters per year is indicated. The time of occurrence and relative magnitude of these changes in the spreading rate, about 5 to 6 million years ago and 18 to 27 percent, respectively, accords with the spreading rate change implied for the Juan de Fuca ridge in the northeast Pacific.

  16. Ridge interaction features of the Line Islands

    Science.gov (United States)

    Konter, J. G.; Koppers, A. A. P.; Storm, L. P.

    2016-12-01

    The sections of Pacific absolute plate motion history that precede the Hawaii-Emperor and Louisville chains are based on three chains: the Line Islands-Mid-Pacific Mountains, the Hess Rise-Shatsky Rise, and the Marshall Islands-Wake Islands (Rurutu hotspot). Although it has been clear that the Line Islands do not define a simple age progression (e.g. Schlanger et al., 1984), the apparent similarity to the Emperor Seamount geographic trend has been used to extend the overall Hawaii-Emperor track further into the past. However, we show here that plate tectonic reconstructions suggest that the Mid-Pacific Mountains (MPMs) and Line Islands (LIs) were erupted near a mid-ocean ridge, and thus these structures do not reflect absolute plate motion. Moverover, the morphology and geochemistry of the volcanoes show similarities with Pukapuka Ridge (e.g. Davis et al., 2002) and the Rano Rahi seamounts, presumed to have a shallow origin. Modern 40Ar/39Ar ages show that the LIs erupted at various times along the entire volcanic chain. The oldest structures formed within 10 Ma of plate formation. Given the short distance to the ridge system, large aseismic volcanic ridges, such as Necker Ridge and Horizon Guyot may simply reflect a connection between MPMs and the ridge, similar to the Pukapuka Ridge. The Line Islands to the south (including Karin Ridge) define short subchains of elongated seamounts that are widespread, resembling the Rano Rahi seamount field. During this time, the plate moved nearly parallel to the ridge system. The change from few large ridges to many subchains may reflect a change in absolute plate motion, similar to the Rano Rahi field. Here, significant MPMs volcanism is no longer connected to the ridge along plate motion. Similar to Pukapuka vs. Rano Rahi, the difference in direction between plate motion and the closest ridge determines whether larger ridges or smaller seamount subchains are formed. The difference between the largest structures (MPMs and LIs

  17. Accounting for measurement error in log regression models with applications to accelerated testing.

    Science.gov (United States)

    Richardson, Robert; Tolley, H Dennis; Evenson, William E; Lunt, Barry M

    2018-01-01

    In regression settings, parameter estimates will be biased when the explanatory variables are measured with error. This bias can significantly affect modeling goals. In particular, accelerated lifetime testing involves an extrapolation of the fitted model, and a small amount of bias in parameter estimates may result in a significant increase in the bias of the extrapolated predictions. Additionally, bias may arise when the stochastic component of a log regression model is assumed to be multiplicative when the actual underlying stochastic component is additive. To account for these possible sources of bias, a log regression model with measurement error and additive error is approximated by a weighted regression model which can be estimated using Iteratively Re-weighted Least Squares. Using the reduced Eyring equation in an accelerated testing setting, the model is compared to previously accepted approaches to modeling accelerated testing data with both simulations and real data.

  18. Accounting for measurement error in log regression models with applications to accelerated testing.

    Directory of Open Access Journals (Sweden)

    Robert Richardson

    Full Text Available In regression settings, parameter estimates will be biased when the explanatory variables are measured with error. This bias can significantly affect modeling goals. In particular, accelerated lifetime testing involves an extrapolation of the fitted model, and a small amount of bias in parameter estimates may result in a significant increase in the bias of the extrapolated predictions. Additionally, bias may arise when the stochastic component of a log regression model is assumed to be multiplicative when the actual underlying stochastic component is additive. To account for these possible sources of bias, a log regression model with measurement error and additive error is approximated by a weighted regression model which can be estimated using Iteratively Re-weighted Least Squares. Using the reduced Eyring equation in an accelerated testing setting, the model is compared to previously accepted approaches to modeling accelerated testing data with both simulations and real data.

  19. Reconstructing paleo-discharge from geometries of fluvial sinuous ridges on Earth and Mars

    Science.gov (United States)

    Hayden, A.; Lamb, M. P.; Mohrig, D. C.; Williams, R. M. E.; Myrow, P.; Ewing, R. C.; Cardenas, B. T.; Findlay, C. P., III

    2017-12-01

    Sinuous, branching networks of topographic ridges resembling river networks are common across Mars, and show promise for quantifying ancient martian surface hydrology. There are two leading formation mechanisms for ridges with a fluvial origin. Inverted channels are ridges that represent casts (e.g., due to lava fill) of relict river channel topography, whereas exhumed channel deposits are eroded remnants of a more extensive fluvial deposit, such as a channel belt. The inverted channel model is often assumed on Mars; however, we currently lack the ability to distinguish these ridge formation mechanisms, motivating the need for Earth-analog study. To address this issue, we studied the extensive networks of sinuous ridges in the Ebro basin of northeast Spain. The Ebro ridges stand 3-15 meters above the surrounding plains and are capped by a cliff-forming sandstone unit 3-10 meters thick and 20-50 meters in breadth. The caprock sandstone bodies contain bar-scale cross stratification, point-bar deposits, levee deposits, and lenses of mudstone, indicating that these are channel-belt deposits, rather than casts of channels formed from lateral channel migration, avulsion and reoccupation. In plan view, ridges form segments branching outward to the north resembling a distributary network; however, crosscutting relationships indicate that ridges cross at different stratigraphic levels. Thus, the apparent network in planview reflects non-uniform exhumation of channel-belt deposits from multiple stratigraphic positions, rather than an inverted coeval river network. As compared to the inverted channel model, exhumed fluvial deposits indicate persistent fluvial activity over geologic timescales, indicating the potential for long-lived surface water on ancient Mars.

  20. OPLS statistical model versus linear regression to assess sonographic predictors of stroke prognosis.

    Science.gov (United States)

    Vajargah, Kianoush Fathi; Sadeghi-Bazargani, Homayoun; Mehdizadeh-Esfanjani, Robab; Savadi-Oskouei, Daryoush; Farhoudi, Mehdi

    2012-01-01

    The objective of the present study was to assess the comparable applicability of orthogonal projections to latent structures (OPLS) statistical model vs traditional linear regression in order to investigate the role of trans cranial doppler (TCD) sonography in predicting ischemic stroke prognosis. The study was conducted on 116 ischemic stroke patients admitted to a specialty neurology ward. The Unified Neurological Stroke Scale was used once for clinical evaluation on the first week of admission and again six months later. All data was primarily analyzed using simple linear regression and later considered for multivariate analysis using PLS/OPLS models through the SIMCA P+12 statistical software package. The linear regression analysis results used for the identification of TCD predictors of stroke prognosis were confirmed through the OPLS modeling technique. Moreover, in comparison to linear regression, the OPLS model appeared to have higher sensitivity in detecting the predictors of ischemic stroke prognosis and detected several more predictors. Applying the OPLS model made it possible to use both single TCD measures/indicators and arbitrarily dichotomized measures of TCD single vessel involvement as well as the overall TCD result. In conclusion, the authors recommend PLS/OPLS methods as complementary rather than alternative to the available classical regression models such as linear regression.

  1. A Technique of Fuzzy C-Mean in Multiple Linear Regression Model toward Paddy Yield

    Science.gov (United States)

    Syazwan Wahab, Nur; Saifullah Rusiman, Mohd; Mohamad, Mahathir; Amira Azmi, Nur; Che Him, Norziha; Ghazali Kamardan, M.; Ali, Maselan

    2018-04-01

    In this paper, we propose a hybrid model which is a combination of multiple linear regression model and fuzzy c-means method. This research involved a relationship between 20 variates of the top soil that are analyzed prior to planting of paddy yields at standard fertilizer rates. Data used were from the multi-location trials for rice carried out by MARDI at major paddy granary in Peninsular Malaysia during the period from 2009 to 2012. Missing observations were estimated using mean estimation techniques. The data were analyzed using multiple linear regression model and a combination of multiple linear regression model and fuzzy c-means method. Analysis of normality and multicollinearity indicate that the data is normally scattered without multicollinearity among independent variables. Analysis of fuzzy c-means cluster the yield of paddy into two clusters before the multiple linear regression model can be used. The comparison between two method indicate that the hybrid of multiple linear regression model and fuzzy c-means method outperform the multiple linear regression model with lower value of mean square error.

  2. Sandy berm and beach-ridge formation in relation to extreme sea-levels

    DEFF Research Database (Denmark)

    Bendixen, Mette; Clemmensen, Lars B; Kroon, Aart

    2013-01-01

    The formation of berms and their transformation into beach ridges in a micro-tidal environment is coupled to wave run-up and overtopping during extreme sea levels. A straight-forward comparison between extreme sea levels due to storm-surges and active berm levels is impossible in the semi...... prograding spit on the south-eastern Baltic shores of Zealand, Denmark. The modern, sandy beach at this location consists of a beachface with a shallow incipient berm, a mature berm, and a dune-covered beach ridge. It borders a beach-ridge plain to the west, where more than 20 N–S oriented beach ridges...... and swales are present. Measured water-level data from 1991 to 2012 and topographical observations, carried out during fair weather period and during a storm event, provided the basis for a conceptual model exhibiting berm formation and transformation into the local beach-ridge system. The character...

  3. Radiogenic isotopes in enriched mid-ocean ridge basalts from Explorer Ridge, northeast Pacific Ocean

    Science.gov (United States)

    Cousens, Brian; Weis, Dominique; Constantin, Marc; Scott, Steve

    2017-09-01

    Extreme gradients in topography related to variations in magma supply are observed on the Southern Explorer Ridge (SER), part of the northern Juan de Fuca ridge system. We report radiogenic isotope (Pb, Sr, Nd, Hf) and geochemical data for twenty-four basalt whole-rock and glass samples collected from the length of the SER and from Explorer Deep, a rift to the north of the SER. Lavas from the SER form a north-south geochemical gradient, dominated by E-MORB at the northern axial high, and range from T-MORB to N-MORB towards the southern deepest part of the ridge. Linear relationships between incompatible element ratios and isotopic ratios in MORB along the ridge are consistent with mixing of magmas beneath the ridge to generate the geographic gradient from E- to N-MORB. The E-MORB have high Sr and Pb, and low Nd and Hf isotopic ratios, typical of enriched mantle that includes a FOZO or HIMU isotopic component. The West Valley and Endeavour segments of the northern Juan de Fuca ridge also include this isotopic component, but the proportion of the FOZO or HIMU component is more extreme in the SER basalts. The FOZO or HIMU component may be garnet-bearing peridotite, or a garnet pyroxenite embedded in peridotite. Recycled garnet pyroxenite better explains the very shallow SER axial high, high Nb/La and La/Sm, and the ;enriched; isotopic compositions.

  4. Direct modeling of regression effects for transition probabilities in the progressive illness-death model

    DEFF Research Database (Denmark)

    Azarang, Leyla; Scheike, Thomas; de Uña-Álvarez, Jacobo

    2017-01-01

    In this work, we present direct regression analysis for the transition probabilities in the possibly non-Markov progressive illness–death model. The method is based on binomial regression, where the response is the indicator of the occupancy for the given state along time. Randomly weighted score...

  5. A holistic model for the role of the axial melt lens at fast-spreading mid-ocean ridges

    Science.gov (United States)

    MacLeod, C. J.; Loocke, M. P.; Lissenberg, J. C. J.

    2016-12-01

    Axial melt lenses (AML) are melt or crystal mush1 bodies located at the dyke-gabbro transition beneath intermediate- and fast-spreading mid-ocean ridges (MORs)2,3. Although it is generally thought that AMLs play a major role in the storage and differentiation of mid-ocean ridge basalts (MORB)1, the melt compositions within the AML and its role in the accretion of the lower crust are heavily debated4-6. Here we present the first comprehensive study of the AML horizon at a fast-spreading MOR (Hess Deep, equatorial Pacific Ocean). We show that plagioclase and pyroxene within the AML are much too evolved to be in equilibrium with MORB, with mean An (54.85) and Mg# (65.01) consistent with derivation from basaltic andesite to andesite melts (Mg# 43-26). We propose that, in between decadal eruptions, the AML is predominantly crystal mush and is fed by small volumes of evolved interstitial melts. Short-lived, focused injection of primitive melt leads to mixing of primitive melts with the extant highly fractionated melt, and triggers eruptions. This model reconciles the paradoxical compositional mismatch between the volcanic and plutonic records with the geophysical characteristics of the AML, the short residence times of Pacific MORB phenocrysts, and the incompatible trace element over-enrichments in MORB. 1Marjanović, M. et al., 2015. Distribution of melt along the East Pacific Rise from 9°30' to 10°N from an amplitude variation with angle of incidence (AVA) technique. Geophys. J. Int. 203. 2Detrick, R. S. et al., 1987. Multi-channel seismic imaging of a crustal magma chamber along the EPR. Nature 326. 3Sinton, J. M. & Detrick, R. S., 1992. Mid-ocean ridge magma chambers. J. Geophys. Res. 97. 4Coogan, L. A., Thompson, G. & MacLeod, C. J., 2002. A textural and geochemical investigation of high level gabbros from the Oman ophiolite: implications for the role of the axial magma chamber at fast-spreading ridges. Lithos 63. 5Pan, Y. & Batiza, R., 2002. Mid-ocean ridge magma

  6. SPSS macros to compare any two fitted values from a regression model.

    Science.gov (United States)

    Weaver, Bruce; Dubois, Sacha

    2012-12-01

    In regression models with first-order terms only, the coefficient for a given variable is typically interpreted as the change in the fitted value of Y for a one-unit increase in that variable, with all other variables held constant. Therefore, each regression coefficient represents the difference between two fitted values of Y. But the coefficients represent only a fraction of the possible fitted value comparisons that might be of interest to researchers. For many fitted value comparisons that are not captured by any of the regression coefficients, common statistical software packages do not provide the standard errors needed to compute confidence intervals or carry out statistical tests-particularly in more complex models that include interactions, polynomial terms, or regression splines. We describe two SPSS macros that implement a matrix algebra method for comparing any two fitted values from a regression model. The !OLScomp and !MLEcomp macros are for use with models fitted via ordinary least squares and maximum likelihood estimation, respectively. The output from the macros includes the standard error of the difference between the two fitted values, a 95% confidence interval for the difference, and a corresponding statistical test with its p-value.

  7. Parameter estimation and statistical test of geographically weighted bivariate Poisson inverse Gaussian regression models

    Science.gov (United States)

    Amalia, Junita; Purhadi, Otok, Bambang Widjanarko

    2017-11-01

    Poisson distribution is a discrete distribution with count data as the random variables and it has one parameter defines both mean and variance. Poisson regression assumes mean and variance should be same (equidispersion). Nonetheless, some case of the count data unsatisfied this assumption because variance exceeds mean (over-dispersion). The ignorance of over-dispersion causes underestimates in standard error. Furthermore, it causes incorrect decision in the statistical test. Previously, paired count data has a correlation and it has bivariate Poisson distribution. If there is over-dispersion, modeling paired count data is not sufficient with simple bivariate Poisson regression. Bivariate Poisson Inverse Gaussian Regression (BPIGR) model is mix Poisson regression for modeling paired count data within over-dispersion. BPIGR model produces a global model for all locations. In another hand, each location has different geographic conditions, social, cultural and economic so that Geographically Weighted Regression (GWR) is needed. The weighting function of each location in GWR generates a different local model. Geographically Weighted Bivariate Poisson Inverse Gaussian Regression (GWBPIGR) model is used to solve over-dispersion and to generate local models. Parameter estimation of GWBPIGR model obtained by Maximum Likelihood Estimation (MLE) method. Meanwhile, hypothesis testing of GWBPIGR model acquired by Maximum Likelihood Ratio Test (MLRT) method.

  8. Can We Use Regression Modeling to Quantify Mean Annual Streamflow at a Global-Scale?

    Science.gov (United States)

    Barbarossa, V.; Huijbregts, M. A. J.; Hendriks, J. A.; Beusen, A.; Clavreul, J.; King, H.; Schipper, A.

    2016-12-01

    Quantifying mean annual flow of rivers (MAF) at ungauged sites is essential for a number of applications, including assessments of global water supply, ecosystem integrity and water footprints. MAF can be quantified with spatially explicit process-based models, which might be overly time-consuming and data-intensive for this purpose, or with empirical regression models that predict MAF based on climate and catchment characteristics. Yet, regression models have mostly been developed at a regional scale and the extent to which they can be extrapolated to other regions is not known. In this study, we developed a global-scale regression model for MAF using observations of discharge and catchment characteristics from 1,885 catchments worldwide, ranging from 2 to 106 km2 in size. In addition, we compared the performance of the regression model with the predictive ability of the spatially explicit global hydrological model PCR-GLOBWB [van Beek et al., 2011] by comparing results from both models to independent measurements. We obtained a regression model explaining 89% of the variance in MAF based on catchment area, mean annual precipitation and air temperature, average slope and elevation. The regression model performed better than PCR-GLOBWB for the prediction of MAF, as root-mean-square error values were lower (0.29 - 0.38 compared to 0.49 - 0.57) and the modified index of agreement was higher (0.80 - 0.83 compared to 0.72 - 0.75). Our regression model can be applied globally at any point of the river network, provided that the input parameters are within the range of values employed in the calibration of the model. The performance is reduced for water scarce regions and further research should focus on improving such an aspect for regression-based global hydrological models.

  9. Modeling Governance KB with CATPCA to Overcome Multicollinearity in the Logistic Regression

    Science.gov (United States)

    Khikmah, L.; Wijayanto, H.; Syafitri, U. D.

    2017-04-01

    The problem often encounters in logistic regression modeling are multicollinearity problems. Data that have multicollinearity between explanatory variables with the result in the estimation of parameters to be bias. Besides, the multicollinearity will result in error in the classification. In general, to overcome multicollinearity in regression used stepwise regression. They are also another method to overcome multicollinearity which involves all variable for prediction. That is Principal Component Analysis (PCA). However, classical PCA in only for numeric data. Its data are categorical, one method to solve the problems is Categorical Principal Component Analysis (CATPCA). Data were used in this research were a part of data Demographic and Population Survey Indonesia (IDHS) 2012. This research focuses on the characteristic of women of using the contraceptive methods. Classification results evaluated using Area Under Curve (AUC) values. The higher the AUC value, the better. Based on AUC values, the classification of the contraceptive method using stepwise method (58.66%) is better than the logistic regression model (57.39%) and CATPCA (57.39%). Evaluation of the results of logistic regression using sensitivity, shows the opposite where CATPCA method (99.79%) is better than logistic regression method (92.43%) and stepwise (92.05%). Therefore in this study focuses on major class classification (using a contraceptive method), then the selected model is CATPCA because it can raise the level of the major class model accuracy.

  10. Time series modeling by a regression approach based on a latent process.

    Science.gov (United States)

    Chamroukhi, Faicel; Samé, Allou; Govaert, Gérard; Aknin, Patrice

    2009-01-01

    Time series are used in many domains including finance, engineering, economics and bioinformatics generally to represent the change of a measurement over time. Modeling techniques may then be used to give a synthetic representation of such data. A new approach for time series modeling is proposed in this paper. It consists of a regression model incorporating a discrete hidden logistic process allowing for activating smoothly or abruptly different polynomial regression models. The model parameters are estimated by the maximum likelihood method performed by a dedicated Expectation Maximization (EM) algorithm. The M step of the EM algorithm uses a multi-class Iterative Reweighted Least-Squares (IRLS) algorithm to estimate the hidden process parameters. To evaluate the proposed approach, an experimental study on simulated data and real world data was performed using two alternative approaches: a heteroskedastic piecewise regression model using a global optimization algorithm based on dynamic programming, and a Hidden Markov Regression Model whose parameters are estimated by the Baum-Welch algorithm. Finally, in the context of the remote monitoring of components of the French railway infrastructure, and more particularly the switch mechanism, the proposed approach has been applied to modeling and classifying time series representing the condition measurements acquired during switch operations.

  11. Regression to Causality : Regression-style presentation influences causal attribution

    DEFF Research Database (Denmark)

    Bordacconi, Mats Joe; Larsen, Martin Vinæs

    2014-01-01

    of equivalent results presented as either regression models or as a test of two sample means. Our experiment shows that the subjects who were presented with results as estimates from a regression model were more inclined to interpret these results causally. Our experiment implies that scholars using regression...... models – one of the primary vehicles for analyzing statistical results in political science – encourage causal interpretation. Specifically, we demonstrate that presenting observational results in a regression model, rather than as a simple comparison of means, makes causal interpretation of the results...... more likely. Our experiment drew on a sample of 235 university students from three different social science degree programs (political science, sociology and economics), all of whom had received substantial training in statistics. The subjects were asked to compare and evaluate the validity...

  12. Adjusting for Confounding in Early Postlaunch Settings: Going Beyond Logistic Regression Models.

    Science.gov (United States)

    Schmidt, Amand F; Klungel, Olaf H; Groenwold, Rolf H H

    2016-01-01

    Postlaunch data on medical treatments can be analyzed to explore adverse events or relative effectiveness in real-life settings. These analyses are often complicated by the number of potential confounders and the possibility of model misspecification. We conducted a simulation study to compare the performance of logistic regression, propensity score, disease risk score, and stabilized inverse probability weighting methods to adjust for confounding. Model misspecification was induced in the independent derivation dataset. We evaluated performance using relative bias confidence interval coverage of the true effect, among other metrics. At low events per coefficient (1.0 and 0.5), the logistic regression estimates had a large relative bias (greater than -100%). Bias of the disease risk score estimates was at most 13.48% and 18.83%. For the propensity score model, this was 8.74% and >100%, respectively. At events per coefficient of 1.0 and 0.5, inverse probability weighting frequently failed or reduced to a crude regression, resulting in biases of -8.49% and 24.55%. Coverage of logistic regression estimates became less than the nominal level at events per coefficient ≤5. For the disease risk score, inverse probability weighting, and propensity score, coverage became less than nominal at events per coefficient ≤2.5, ≤1.0, and ≤1.0, respectively. Bias of misspecified disease risk score models was 16.55%. In settings with low events/exposed subjects per coefficient, disease risk score methods can be useful alternatives to logistic regression models, especially when propensity score models cannot be used. Despite better performance of disease risk score methods than logistic regression and propensity score models in small events per coefficient settings, bias, and coverage still deviated from nominal.

  13. Regression Model to Predict Global Solar Irradiance in Malaysia

    Directory of Open Access Journals (Sweden)

    Hairuniza Ahmed Kutty

    2015-01-01

    Full Text Available A novel regression model is developed to estimate the monthly global solar irradiance in Malaysia. The model is developed based on different available meteorological parameters, including temperature, cloud cover, rain precipitate, relative humidity, wind speed, pressure, and gust speed, by implementing regression analysis. This paper reports on the details of the analysis of the effect of each prediction parameter to identify the parameters that are relevant to estimating global solar irradiance. In addition, the proposed model is compared in terms of the root mean square error (RMSE, mean bias error (MBE, and the coefficient of determination (R2 with other models available from literature studies. Seven models based on single parameters (PM1 to PM7 and five multiple-parameter models (PM7 to PM12 are proposed. The new models perform well, with RMSE ranging from 0.429% to 1.774%, R2 ranging from 0.942 to 0.992, and MBE ranging from −0.1571% to 0.6025%. In general, cloud cover significantly affects the estimation of global solar irradiance. However, cloud cover in Malaysia lacks sufficient influence when included into multiple-parameter models although it performs fairly well in single-parameter prediction models.

  14. Polynomial regression analysis and significance test of the regression function

    International Nuclear Information System (INIS)

    Gao Zhengming; Zhao Juan; He Shengping

    2012-01-01

    In order to analyze the decay heating power of a certain radioactive isotope per kilogram with polynomial regression method, the paper firstly demonstrated the broad usage of polynomial function and deduced its parameters with ordinary least squares estimate. Then significance test method of polynomial regression function is derived considering the similarity between the polynomial regression model and the multivariable linear regression model. Finally, polynomial regression analysis and significance test of the polynomial function are done to the decay heating power of the iso tope per kilogram in accord with the authors' real work. (authors)

  15. Effects of sea level rise on the formation and drowning of shoreface-connected sand ridges, a model study

    NARCIS (Netherlands)

    Nnafie, A.|info:eu-repo/dai/nl/37551127X; de Swart, Huib|info:eu-repo/dai/nl/073449725; Calvete, D.|info:eu-repo/dai/nl/304846317; Garnier, R.

    2014-01-01

    Shoreface-connected sand ridges occur on many storm-dominated inner shelves. These rhythmic features have an along-shelf spacing of 2-10. km, a height of 1-12. m, they evolve on timescales of centuries and they migrate several meters per year. An idealized model is used to study the impact of sea

  16. Waveform modeling of the seismic response of a mid-ocean ridge axial melt sill

    Science.gov (United States)

    Xu, Min; Stephen, R. A.; Canales, J. Pablo

    2017-12-01

    Seismic reflections from axial magma lens (AML) are commonly observed along many mid-ocean ridges, and are thought to arise from the negative impedance contrast between a solid, high-speed lid and the underlying low-speed, molten or partially molten (mush) sill. The polarity of the AML reflection ( P AML P) at vertical incidence and the amplitude vs offset (AVO) behavior of the AML reflections (e.g., P AML P and S-converted P AML S waves) are often used as a diagnostic tool for the nature of the low-speed sill. Time-domain finite difference calculations for two-dimensional laterally homogeneous models show some scenarios make the interpretation of melt content from partial-offset stacks of P- and S-waves difficult. Laterally heterogeneous model calculations indicate diffractions from the edges of the finite-width AML reducing the amplitude of the AML reflections. Rough seafloor and/or a rough AML surface can also greatly reduce the amplitude of peg-leg multiples because of scattering and destructive interference. Mid-crustal seismic reflection events are observed in the three-dimensional multi-channel seismic dataset acquired over the RIDGE-2000 Integrated Study Site at East Pacific Rise (EPR, cruise MGL0812). Modeling indicates that the mid-crustal seismic reflection reflections are unlikely to arise from peg-leg multiples of the AML reflections, P-to- S converted phases, or scattering due to rough topography, but could probably arise from deeper multiple magma sills. Our results support the identification of Marjanović et al. (Nat Geosci 7(11):825-829, 2014) that a multi-level complex of melt lenses is present beneath the axis of the EPR.

  17. Logistic regression for risk factor modelling in stuttering research.

    Science.gov (United States)

    Reed, Phil; Wu, Yaqionq

    2013-06-01

    To outline the uses of logistic regression and other statistical methods for risk factor analysis in the context of research on stuttering. The principles underlying the application of a logistic regression are illustrated, and the types of questions to which such a technique has been applied in the stuttering field are outlined. The assumptions and limitations of the technique are discussed with respect to existing stuttering research, and with respect to formulating appropriate research strategies to accommodate these considerations. Finally, some alternatives to the approach are briefly discussed. The way the statistical procedures are employed are demonstrated with some hypothetical data. Research into several practical issues concerning stuttering could benefit if risk factor modelling were used. Important examples are early diagnosis, prognosis (whether a child will recover or persist) and assessment of treatment outcome. After reading this article you will: (a) Summarize the situations in which logistic regression can be applied to a range of issues about stuttering; (b) Follow the steps in performing a logistic regression analysis; (c) Describe the assumptions of the logistic regression technique and the precautions that need to be checked when it is employed; (d) Be able to summarize its advantages over other techniques like estimation of group differences and simple regression. Copyright © 2012 Elsevier Inc. All rights reserved.

  18. Efficient estimation of an additive quantile regression model

    NARCIS (Netherlands)

    Cheng, Y.; de Gooijer, J.G.; Zerom, D.

    2011-01-01

    In this paper, two non-parametric estimators are proposed for estimating the components of an additive quantile regression model. The first estimator is a computationally convenient approach which can be viewed as a more viable alternative to existing kernel-based approaches. The second estimator

  19. Advanced statistics: linear regression, part I: simple linear regression.

    Science.gov (United States)

    Marill, Keith A

    2004-01-01

    Simple linear regression is a mathematical technique used to model the relationship between a single independent predictor variable and a single dependent outcome variable. In this, the first of a two-part series exploring concepts in linear regression analysis, the four fundamental assumptions and the mechanics of simple linear regression are reviewed. The most common technique used to derive the regression line, the method of least squares, is described. The reader will be acquainted with other important concepts in simple linear regression, including: variable transformations, dummy variables, relationship to inference testing, and leverage. Simplified clinical examples with small datasets and graphic models are used to illustrate the points. This will provide a foundation for the second article in this series: a discussion of multiple linear regression, in which there are multiple predictor variables.

  20. Advanced statistics: linear regression, part II: multiple linear regression.

    Science.gov (United States)

    Marill, Keith A

    2004-01-01

    The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.

  1. Dynamics of the Seychelles-Chagos Thermocline Ridge

    Science.gov (United States)

    Bulusu, S.

    2016-02-01

    multilinear regression model (MRM). This study will help to improve monsoon modeling and forecasting, two areas that remain highly inaccurate after decades of research work.

  2. Fluvial Channel Networks as Analogs for the Ridge-Forming Unit, Sinus Meridiani, Mars

    Science.gov (United States)

    Wilkinson, M. J.; du Bois, J. B.

    2010-01-01

    Fluvial models have been generally discounted as analogs for the younger layered rock units of Sinus Meridiani. A fluvial model based on the large fluvial fan provides a possibly close analog for various features of the sinuous ridges of the etched, ridge-forming unit (RFU) in particular. The close spacing of the RFU ridges, their apparently chaotic orientations, and their organization in dense networks all appear unlike classical stream channel patterns. However, drainage patterns on large fluvial fans low-angle, fluvial aggradational features, 100s of km long, documented worldwide by us provide parallels. Some large fan characteristics resemble those of classical floodplains, but many differences have been demonstrated. One major distinction relevant to the RFU is that channel landscapes of large fans can dominate large areas (1.2 million km2 in one S. American study area). We compare channel morphologies on large fans in the southern Sahara Desert with ridge patterns in Sinus Meridiani (fig 1). Stream channels are the dominant landform on large terrestrial fans: they may equate to the ubiquitous, sinuous, elongated ridges of the RFU that cover areas region wide. Networks of convergent/divergent and crossing channels may equate to similar features in the ridge networks. Downslope divergence is absent in channels of terrestrial upland erosional landscapes (fig. 1, left), whereas it is common to both large fans (fig. 1, center) and RFU ridge patterns (fig 1, right downslope defined as the regional NW slope of Sinus Meridiani). RFU ridge orientation, judged from those areas apparently devoid of impact crater control, is broadly parallel with the regional slope (arrow, fig. 1, right), as is mean orientation of major channels on large fans (arrow, fig. 1, center). High densities per unit area characterize fan channels and martian ridges reaching an order of magnitude higher than those in uplands just upstream of the terrestrial study areas fig. 1. In concert with

  3. Crime Modeling using Spatial Regression Approach

    Science.gov (United States)

    Saleh Ahmar, Ansari; Adiatma; Kasim Aidid, M.

    2018-01-01

    Act of criminality in Indonesia increased both variety and quantity every year. As murder, rape, assault, vandalism, theft, fraud, fencing, and other cases that make people feel unsafe. Risk of society exposed to crime is the number of reported cases in the police institution. The higher of the number of reporter to the police institution then the number of crime in the region is increasing. In this research, modeling criminality in South Sulawesi, Indonesia with the dependent variable used is the society exposed to the risk of crime. Modelling done by area approach is the using Spatial Autoregressive (SAR) and Spatial Error Model (SEM) methods. The independent variable used is the population density, the number of poor population, GDP per capita, unemployment and the human development index (HDI). Based on the analysis using spatial regression can be shown that there are no dependencies spatial both lag or errors in South Sulawesi.

  4. Determining factors influencing survival of breast cancer by fuzzy logistic regression model.

    Science.gov (United States)

    Nikbakht, Roya; Bahrampour, Abbas

    2017-01-01

    Fuzzy logistic regression model can be used for determining influential factors of disease. This study explores the important factors of actual predictive survival factors of breast cancer's patients. We used breast cancer data which collected by cancer registry of Kerman University of Medical Sciences during the period of 2000-2007. The variables such as morphology, grade, age, and treatments (surgery, radiotherapy, and chemotherapy) were applied in the fuzzy logistic regression model. Performance of model was determined in terms of mean degree of membership (MDM). The study results showed that almost 41% of patients were in neoplasm and malignant group and more than two-third of them were still alive after 5-year follow-up. Based on the fuzzy logistic model, the most important factors influencing survival were chemotherapy, morphology, and radiotherapy, respectively. Furthermore, the MDM criteria show that the fuzzy logistic regression have a good fit on the data (MDM = 0.86). Fuzzy logistic regression model showed that chemotherapy is more important than radiotherapy in survival of patients with breast cancer. In addition, another ability of this model is calculating possibilistic odds of survival in cancer patients. The results of this study can be applied in clinical research. Furthermore, there are few studies which applied the fuzzy logistic models. Furthermore, we recommend using this model in various research areas.

  5. Efficient estimation of an additive quantile regression model

    NARCIS (Netherlands)

    Cheng, Y.; de Gooijer, J.G.; Zerom, D.

    2009-01-01

    In this paper two kernel-based nonparametric estimators are proposed for estimating the components of an additive quantile regression model. The first estimator is a computationally convenient approach which can be viewed as a viable alternative to the method of De Gooijer and Zerom (2003). By

  6. Efficient estimation of an additive quantile regression model

    NARCIS (Netherlands)

    Cheng, Y.; de Gooijer, J.G.; Zerom, D.

    2010-01-01

    In this paper two kernel-based nonparametric estimators are proposed for estimating the components of an additive quantile regression model. The first estimator is a computationally convenient approach which can be viewed as a viable alternative to the method of De Gooijer and Zerom (2003). By

  7. Boosted beta regression.

    Directory of Open Access Journals (Sweden)

    Matthias Schmid

    Full Text Available Regression analysis with a bounded outcome is a common problem in applied statistics. Typical examples include regression models for percentage outcomes and the analysis of ratings that are measured on a bounded scale. In this paper, we consider beta regression, which is a generalization of logit models to situations where the response is continuous on the interval (0,1. Consequently, beta regression is a convenient tool for analyzing percentage responses. The classical approach to fit a beta regression model is to use maximum likelihood estimation with subsequent AIC-based variable selection. As an alternative to this established - yet unstable - approach, we propose a new estimation technique called boosted beta regression. With boosted beta regression estimation and variable selection can be carried out simultaneously in a highly efficient way. Additionally, both the mean and the variance of a percentage response can be modeled using flexible nonlinear covariate effects. As a consequence, the new method accounts for common problems such as overdispersion and non-binomial variance structures.

  8. Using the classical linear regression model in analysis of the dependences of conveyor belt life

    Directory of Open Access Journals (Sweden)

    Miriam Andrejiová

    2013-12-01

    Full Text Available The paper deals with the classical linear regression model of the dependence of conveyor belt life on some selected parameters: thickness of paint layer, width and length of the belt, conveyor speed and quantity of transported material. The first part of the article is about regression model design, point and interval estimation of parameters, verification of statistical significance of the model, and about the parameters of the proposed regression model. The second part of the article deals with identification of influential and extreme values that can have an impact on estimation of regression model parameters. The third part focuses on assumptions of the classical regression model, i.e. on verification of independence assumptions, normality and homoscedasticity of residuals.

  9. Modeling the frequency of opposing left-turn conflicts at signalized intersections using generalized linear regression models.

    Science.gov (United States)

    Zhang, Xin; Liu, Pan; Chen, Yuguang; Bai, Lu; Wang, Wei

    2014-01-01

    The primary objective of this study was to identify whether the frequency of traffic conflicts at signalized intersections can be modeled. The opposing left-turn conflicts were selected for the development of conflict predictive models. Using data collected at 30 approaches at 20 signalized intersections, the underlying distributions of the conflicts under different traffic conditions were examined. Different conflict-predictive models were developed to relate the frequency of opposing left-turn conflicts to various explanatory variables. The models considered include a linear regression model, a negative binomial model, and separate models developed for four traffic scenarios. The prediction performance of different models was compared. The frequency of traffic conflicts follows a negative binominal distribution. The linear regression model is not appropriate for the conflict frequency data. In addition, drivers behaved differently under different traffic conditions. Accordingly, the effects of conflicting traffic volumes on conflict frequency vary across different traffic conditions. The occurrences of traffic conflicts at signalized intersections can be modeled using generalized linear regression models. The use of conflict predictive models has potential to expand the uses of surrogate safety measures in safety estimation and evaluation.

  10. Use of empirical likelihood to calibrate auxiliary information in partly linear monotone regression models.

    Science.gov (United States)

    Chen, Baojiang; Qin, Jing

    2014-05-10

    In statistical analysis, a regression model is needed if one is interested in finding the relationship between a response variable and covariates. When the response depends on the covariate, then it may also depend on the function of this covariate. If one has no knowledge of this functional form but expect for monotonic increasing or decreasing, then the isotonic regression model is preferable. Estimation of parameters for isotonic regression models is based on the pool-adjacent-violators algorithm (PAVA), where the monotonicity constraints are built in. With missing data, people often employ the augmented estimating method to improve estimation efficiency by incorporating auxiliary information through a working regression model. However, under the framework of the isotonic regression model, the PAVA does not work as the monotonicity constraints are violated. In this paper, we develop an empirical likelihood-based method for isotonic regression model to incorporate the auxiliary information. Because the monotonicity constraints still hold, the PAVA can be used for parameter estimation. Simulation studies demonstrate that the proposed method can yield more efficient estimates, and in some situations, the efficiency improvement is substantial. We apply this method to a dementia study. Copyright © 2013 John Wiley & Sons, Ltd.

  11. Predictive market segmentation model: An application of logistic regression model and CHAID procedure

    Directory of Open Access Journals (Sweden)

    Soldić-Aleksić Jasna

    2009-01-01

    Full Text Available Market segmentation presents one of the key concepts of the modern marketing. The main goal of market segmentation is focused on creating groups (segments of customers that have similar characteristics, needs, wishes and/or similar behavior regarding the purchase of concrete product/service. Companies can create specific marketing plan for each of these segments and therefore gain short or long term competitive advantage on the market. Depending on the concrete marketing goal, different segmentation schemes and techniques may be applied. This paper presents a predictive market segmentation model based on the application of logistic regression model and CHAID analysis. The logistic regression model was used for the purpose of variables selection (from the initial pool of eleven variables which are statistically significant for explaining the dependent variable. Selected variables were afterwards included in the CHAID procedure that generated the predictive market segmentation model. The model results are presented on the concrete empirical example in the following form: summary model results, CHAID tree, Gain chart, Index chart, risk and classification tables.

  12. Analysis of Multivariate Experimental Data Using A Simplified Regression Model Search Algorithm

    Science.gov (United States)

    Ulbrich, Norbert Manfred

    2013-01-01

    A new regression model search algorithm was developed in 2011 that may be used to analyze both general multivariate experimental data sets and wind tunnel strain-gage balance calibration data. The new algorithm is a simplified version of a more complex search algorithm that was originally developed at the NASA Ames Balance Calibration Laboratory. The new algorithm has the advantage that it needs only about one tenth of the original algorithm's CPU time for the completion of a search. In addition, extensive testing showed that the prediction accuracy of math models obtained from the simplified algorithm is similar to the prediction accuracy of math models obtained from the original algorithm. The simplified algorithm, however, cannot guarantee that search constraints related to a set of statistical quality requirements are always satisfied in the optimized regression models. Therefore, the simplified search algorithm is not intended to replace the original search algorithm. Instead, it may be used to generate an alternate optimized regression model of experimental data whenever the application of the original search algorithm either fails or requires too much CPU time. Data from a machine calibration of NASA's MK40 force balance is used to illustrate the application of the new regression model search algorithm.

  13. Evaluation of accuracy of linear regression models in predicting urban stormwater discharge characteristics.

    Science.gov (United States)

    Madarang, Krish J; Kang, Joo-Hyon

    2014-06-01

    Stormwater runoff has been identified as a source of pollution for the environment, especially for receiving waters. In order to quantify and manage the impacts of stormwater runoff on the environment, predictive models and mathematical models have been developed. Predictive tools such as regression models have been widely used to predict stormwater discharge characteristics. Storm event characteristics, such as antecedent dry days (ADD), have been related to response variables, such as pollutant loads and concentrations. However it has been a controversial issue among many studies to consider ADD as an important variable in predicting stormwater discharge characteristics. In this study, we examined the accuracy of general linear regression models in predicting discharge characteristics of roadway runoff. A total of 17 storm events were monitored in two highway segments, located in Gwangju, Korea. Data from the monitoring were used to calibrate United States Environmental Protection Agency's Storm Water Management Model (SWMM). The calibrated SWMM was simulated for 55 storm events, and the results of total suspended solid (TSS) discharge loads and event mean concentrations (EMC) were extracted. From these data, linear regression models were developed. R(2) and p-values of the regression of ADD for both TSS loads and EMCs were investigated. Results showed that pollutant loads were better predicted than pollutant EMC in the multiple regression models. Regression may not provide the true effect of site-specific characteristics, due to uncertainty in the data. Copyright © 2014 The Research Centre for Eco-Environmental Sciences, Chinese Academy of Sciences. Published by Elsevier B.V. All rights reserved.

  14. Improving sub-pixel imperviousness change prediction by ensembling heterogeneous non-linear regression models

    Science.gov (United States)

    Drzewiecki, Wojciech

    2016-12-01

    In this work nine non-linear regression models were compared for sub-pixel impervious surface area mapping from Landsat images. The comparison was done in three study areas both for accuracy of imperviousness coverage evaluation in individual points in time and accuracy of imperviousness change assessment. The performance of individual machine learning algorithms (Cubist, Random Forest, stochastic gradient boosting of regression trees, k-nearest neighbors regression, random k-nearest neighbors regression, Multivariate Adaptive Regression Splines, averaged neural networks, and support vector machines with polynomial and radial kernels) was also compared with the performance of heterogeneous model ensembles constructed from the best models trained using particular techniques. The results proved that in case of sub-pixel evaluation the most accurate prediction of change may not necessarily be based on the most accurate individual assessments. When single methods are considered, based on obtained results Cubist algorithm may be advised for Landsat based mapping of imperviousness for single dates. However, Random Forest may be endorsed when the most reliable evaluation of imperviousness change is the primary goal. It gave lower accuracies for individual assessments, but better prediction of change due to more correlated errors of individual predictions. Heterogeneous model ensembles performed for individual time points assessments at least as well as the best individual models. In case of imperviousness change assessment the ensembles always outperformed single model approaches. It means that it is possible to improve the accuracy of sub-pixel imperviousness change assessment using ensembles of heterogeneous non-linear regression models.

  15. The beach ridges of India: A review

    Digital Repository Service at National Institute of Oceanography (India)

    Kunte, P.D.; Wagle, B.G.

    , and is presented in a consolidated form. Beach ridges of the east and west coast of India are grouped in thirteen-beach ridge complexes based on their association. Review indicates that the beach ridges of India are not older than the Holocene age...

  16. Regularized multivariate regression models with skew-t error distributions

    KAUST Repository

    Chen, Lianfu; Pourahmadi, Mohsen; Maadooliat, Mehdi

    2014-01-01

    We consider regularization of the parameters in multivariate linear regression models with the errors having a multivariate skew-t distribution. An iterative penalized likelihood procedure is proposed for constructing sparse estimators of both

  17. Evaluation of a multiple linear regression model and SARIMA model in forecasting heat demand for district heating system

    International Nuclear Information System (INIS)

    Fang, Tingting; Lahdelma, Risto

    2016-01-01

    Highlights: • Social factor is considered for the linear regression models besides weather file. • Simultaneously optimize all the coefficients for linear regression models. • SARIMA combined with linear regression is used to forecast the heat demand. • The accuracy for both linear regression and time series models are evaluated. - Abstract: Forecasting heat demand is necessary for production and operation planning of district heating (DH) systems. In this study we first propose a simple regression model where the hourly outdoor temperature and wind speed forecast the heat demand. Weekly rhythm of heat consumption as a social component is added to the model to significantly improve the accuracy. The other type of model is the seasonal autoregressive integrated moving average (SARIMA) model with exogenous variables as a combination to take weather factors, and the historical heat consumption data as depending variables. One outstanding advantage of the model is that it peruses the high accuracy for both long-term and short-term forecast by considering both exogenous factors and time series. The forecasting performance of both linear regression models and time series model are evaluated based on real-life heat demand data for the city of Espoo in Finland by out-of-sample tests for the last 20 full weeks of the year. The results indicate that the proposed linear regression model (T168h) using 168-h demand pattern with midweek holidays classified as Saturdays or Sundays gives the highest accuracy and strong robustness among all the tested models based on the tested forecasting horizon and corresponding data. Considering the parsimony of the input, the ease of use and the high accuracy, the proposed T168h model is the best in practice. The heat demand forecasting model can also be developed for individual buildings if automated meter reading customer measurements are available. This would allow forecasting the heat demand based on more accurate heat consumption

  18. Remedial Investigation Report on Chestnut Ridge Operable Unit 2 (Filled Coal Ash Pond/Upper McCoy Branch) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee. Volume 1. Main Text

    International Nuclear Information System (INIS)

    1994-08-01

    This document is a report on the remedial investigation (RI) of Chestnut Ridge Operable Unit (OU) 2 at the Oak Ridge Y-12 Plant. Chestnut Ridge OU 2 consists of Upper McCoy Branch (UMB), the Filled Coal Ash Pond (FCAP), and the area surrounding the Sluice Channel formerly associated with coal ash disposal in the FCAP. Chestnut Ridge OU 2 is located within the U.S. Department of Energy's (DOE's) Oak Ridge Reservation in Anderson County, Tennessee, approximately 24 miles west of Knoxville. The pond is an 8.5-acre area on the southern slope of Chestnut Ridge, 0.5 mile south of the main Y-12 Plant and geographically separated from the Y-12 Plant by Chestnut Ridge. The elevation of the FCAP is ∼ 950 ft above mean sea level (msl), and it is relatively flat and largely vegetated. Two small ponds are usually present at the northeast and northwest comers of the FCAP. The Sluice Channel Area extends ∼1000 ft from the northern margin of the FCAP to the crest of Chestnut Ridge, which has an elevation of ∼1100 ft above msl. The Sluice Channel Area is largely vegetated also. McCoy Branch runs from the top of Chestnut Ridge across the FCAP into Rogers Quarry and out of the quarry where it runs a short distance into Milton Hill Lake at McCoy Embayment, termed UMB. The portion south of Rogers Quarry, within Chestnut Ridge OU 4, is termed Lower McCoy Branch. The DOE Oak Ridge Y-12 Plant disposed of coal ash from its steam plant operations as a slurry that was discharged into an ash retention impoundment; this impoundment is the FCAP. The FCAP was built in 1955 to serve as a settling basin after coal ash slurried over Chestnut Ridge from the Y-12 Plant. The FCAP was constructed by building an earthen dam across the northern tributary of McCoy Branch. The dam was designed to hold 20 years of Y-12 steam plant ash. By July 1967, ash had filled up the impoundment storage behind the dam to within 4 ft of the top

  19. Remedial Investigation Report on Chestnut Ridge Operable Unit 2 (Filled Coal Ash Pond/Upper McCoy Branch) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee. Volume 1. Main Text

    Energy Technology Data Exchange (ETDEWEB)

    1994-08-01

    This document is a report on the remedial investigation (RI) of Chestnut Ridge Operable Unit (OU) 2 at the Oak Ridge Y-12 Plant. Chestnut Ridge OU 2 consists of Upper McCoy Branch (UMB), the Filled Coal Ash Pond (FCAP), and the area surrounding the Sluice Channel formerly associated with coal ash disposal in the FCAP. Chestnut Ridge OU 2 is located within the U.S. Department of Energy`s (DOE`s) Oak Ridge Reservation in Anderson County, Tennessee, approximately 24 miles west of Knoxville. The pond is an 8.5-acre area on the southern slope of Chestnut Ridge, 0.5 mile south of the main Y-12 Plant and geographically separated from the Y-12 Plant by Chestnut Ridge. The elevation of the FCAP is {approximately} 950 ft above mean sea level (msl), and it is relatively flat and largely vegetated. Two small ponds are usually present at the northeast and northwest comers of the FCAP. The Sluice Channel Area extends {approximately}1000 ft from the northern margin of the FCAP to the crest of Chestnut Ridge, which has an elevation of {approximately}1100 ft above msl. The Sluice Channel Area is largely vegetated also. McCoy Branch runs from the top of Chestnut Ridge across the FCAP into Rogers Quarry and out of the quarry where it runs a short distance into Milton Hill Lake at McCoy Embayment, termed UMB. The portion south of Rogers Quarry, within Chestnut Ridge OU 4, is termed Lower McCoy Branch. The DOE Oak Ridge Y-12 Plant disposed of coal ash from its steam plant operations as a slurry that was discharged into an ash retention impoundment; this impoundment is the FCAP. The FCAP was built in 1955 to serve as a settling basin after coal ash slurried over Chestnut Ridge from the Y-12 Plant. The FCAP was constructed by building an earthen dam across the northern tributary of McCoy Branch. The dam was designed to hold 20 years of Y-12 steam plant ash. By July 1967, ash had filled up the impoundment storage behind the dam to within 4 ft of the top.

  20. Preliminary Analysis of the Knipovich Ridge Segmentation - Influence of Focused Magmatism and Ridge Obliquity on an Ultraslow Spreading System

    Science.gov (United States)

    Okino, K.; Curewitz, D.; Asada, M.; Tamaki, K.

    2002-12-01

    Bathymetry, gravity and deep-tow sonar image data are used to define the segmentation of a 400 km long portion of the ultraslow-spreading Knipovich Ridge in the Norwegian-Greenland Sea, Northeast Atlantic Ocean. Discrete volcanic centers marked by large volcanic constructions and accompanying short wavelength mantle Bouguer anomaly (MBA) lows generally resemble those of the Gakkel Ridge and the easternmost Southwest Indian Ridge (SWIR). These magmatically robust segment centers are regularly spaced about 85-100 km apart along the ridge, and are characterized by accumulated hummocky terrain, high relief, off-axis seamount chains and significant MBA lows. We suggest that these eruptive centers correspond to areas of enhanced magma flux, and that their spacing reflects the geometry of underlying mantle upwelling cells. The large-scale thermal structure of the mantle primarily controls discrete and focused magmatism, and the relatively wide spacing of these segments may reflect cool mantle beneath the ridge. Segment centers along the southern Knipovich Ridge are characterized by lower relief and smaller MBA anomalies than along the northern section of the ridge. This suggests that ridge obliquity is a secondary control on ridge construction on the Knipovich Ridge, as the obliquity changes from 35° to 49° from north to south, respectively, while spreading rate and axial depth remain approximately constant. The increased obliquity may contribute to decreased effective spreading rates, lower upwelling magma velocity and melt formation, and limited horizontal dike propagation near the surface. We also identify small, magmatically weaker segments with low relief, little or no MBA anomaly, and no off axis expression. We suggest that these segments are either fed by lateral melt migration from adjacent magmatically stronger segments or represent smaller, discrete mantle upwelling centers with short-lived melt supply.

  1. Preliminary analysis of the Knipovich Ridge segmentation: influence of focused magmatism and ridge obliquity on an ultraslow spreading system

    Science.gov (United States)

    Okino, Kyoko; Curewitz, Daniel; Asada, Miho; Tamaki, Kensaku; Vogt, Peter; Crane, Kathleen

    2002-09-01

    Bathymetry, gravity and deep-tow sonar image data are used to define the segmentation of a 400 km long portion of the ultraslow-spreading Knipovich Ridge in the Norwegian-Greenland Sea, Northeast Atlantic Ocean. Discrete volcanic centers marked by large volcanic constructions and accompanying short wavelength mantle Bouguer anomaly (MBA) lows generally resemble those of the Gakkel Ridge and the easternmost Southwest Indian Ridge. These magmatically robust segment centers are regularly spaced about 85-100 km apart along the ridge, and are characterized by accumulated hummocky terrain, high relief, off-axis seamount chains and significant MBA lows. We suggest that these eruptive centers correspond to areas of enhanced magma flux, and that their spacing reflects the geometry of underlying mantle upwelling cells. The large-scale thermal structure of the mantle primarily controls discrete and focused magmatism, and the relatively wide spacing of these segments may reflect cool mantle beneath the ridge. Segment centers along the southern Knipovich Ridge are characterized by lower relief and smaller MBA anomalies than along the northern section of the ridge. This suggests that ridge obliquity is a secondary control on ridge construction on the Knipovich Ridge, as the obliquity changes from 35° to 49° from north to south, respectively, while spreading rate and axial depth remain approximately constant. The increased obliquity may contribute to decreased effective spreading rates, lower upwelling magma velocity and melt formation, and limited horizontal dike propagation near the surface. We also identify small, magmatically weaker segments with low relief, little or no MBA anomaly, and no off-axis expression. We suggest that these segments are either fed by lateral melt migration from adjacent magmatically stronger segments or represent smaller, discrete mantle upwelling centers with short-lived melt supply.

  2. Comprehensive integrated planning: A process for the Oak Ridge Reservation, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1998-05-01

    The Oak Ridge Comprehensive Integrated Plan is intended to assist the US Department of Energy (DOE) and contractor personnel in implementing a comprehensive integrated planning process consistent with DOE Order 430.1, Life Cycle Asset Management and Oak Ridge Operations Order 430. DOE contractors are charged with developing and producing the Comprehensive Integrated Plan, which serves as a summary document, providing information from other planning efforts regarding vision statements, missions, contextual conditions, resources and facilities, decision processes, and stakeholder involvement. The Comprehensive Integrated Plan is a planning reference that identifies primary issues regarding major changes in land and facility use and serves all programs and functions on-site as well as the Oak Ridge Operations Office and DOE Headquarters. The Oak Ridge Reservation is a valuable national resource and is managed on the basis of the principles of ecosystem management and sustainable development and how mission, economic, ecological, social, and cultural factors are used to guide land- and facility-use decisions. The long-term goals of the comprehensive integrated planning process, in priority order, are to support DOE critical missions and to stimulate the economy while maintaining a quality environment

  3. Comprehensive integrated planning: A process for the Oak Ridge Reservation, Oak Ridge, Tennessee

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1998-05-01

    The Oak Ridge Comprehensive Integrated Plan is intended to assist the US Department of Energy (DOE) and contractor personnel in implementing a comprehensive integrated planning process consistent with DOE Order 430.1, Life Cycle Asset Management and Oak Ridge Operations Order 430. DOE contractors are charged with developing and producing the Comprehensive Integrated Plan, which serves as a summary document, providing information from other planning efforts regarding vision statements, missions, contextual conditions, resources and facilities, decision processes, and stakeholder involvement. The Comprehensive Integrated Plan is a planning reference that identifies primary issues regarding major changes in land and facility use and serves all programs and functions on-site as well as the Oak Ridge Operations Office and DOE Headquarters. The Oak Ridge Reservation is a valuable national resource and is managed on the basis of the principles of ecosystem management and sustainable development and how mission, economic, ecological, social, and cultural factors are used to guide land- and facility-use decisions. The long-term goals of the comprehensive integrated planning process, in priority order, are to support DOE critical missions and to stimulate the economy while maintaining a quality environment.

  4. Validation of regression models for nitrate concentrations in the upper groundwater in sandy soils

    International Nuclear Information System (INIS)

    Sonneveld, M.P.W.; Brus, D.J.; Roelsma, J.

    2010-01-01

    For Dutch sandy regions, linear regression models have been developed that predict nitrate concentrations in the upper groundwater on the basis of residual nitrate contents in the soil in autumn. The objective of our study was to validate these regression models for one particular sandy region dominated by dairy farming. No data from this area were used for calibrating the regression models. The model was validated by additional probability sampling. This sample was used to estimate errors in 1) the predicted areal fractions where the EU standard of 50 mg l -1 is exceeded for farms with low N surpluses (ALT) and farms with higher N surpluses (REF); 2) predicted cumulative frequency distributions of nitrate concentration for both groups of farms. Both the errors in the predicted areal fractions as well as the errors in the predicted cumulative frequency distributions indicate that the regression models are invalid for the sandy soils of this study area. - This study indicates that linear regression models that predict nitrate concentrations in the upper groundwater using residual soil N contents should be applied with care.

  5. Bivariate least squares linear regression: Towards a unified analytic formalism. I. Functional models

    Science.gov (United States)

    Caimmi, R.

    2011-08-01

    Concerning bivariate least squares linear regression, the classical approach pursued for functional models in earlier attempts ( York, 1966, 1969) is reviewed using a new formalism in terms of deviation (matrix) traces which, for unweighted data, reduce to usual quantities leaving aside an unessential (but dimensional) multiplicative factor. Within the framework of classical error models, the dependent variable relates to the independent variable according to the usual additive model. The classes of linear models considered are regression lines in the general case of correlated errors in X and in Y for weighted data, and in the opposite limiting situations of (i) uncorrelated errors in X and in Y, and (ii) completely correlated errors in X and in Y. The special case of (C) generalized orthogonal regression is considered in detail together with well known subcases, namely: (Y) errors in X negligible (ideally null) with respect to errors in Y; (X) errors in Y negligible (ideally null) with respect to errors in X; (O) genuine orthogonal regression; (R) reduced major-axis regression. In the limit of unweighted data, the results determined for functional models are compared with their counterparts related to extreme structural models i.e. the instrumental scatter is negligible (ideally null) with respect to the intrinsic scatter ( Isobe et al., 1990; Feigelson and Babu, 1992). While regression line slope and intercept estimators for functional and structural models necessarily coincide, the contrary holds for related variance estimators even if the residuals obey a Gaussian distribution, with the exception of Y models. An example of astronomical application is considered, concerning the [O/H]-[Fe/H] empirical relations deduced from five samples related to different stars and/or different methods of oxygen abundance determination. For selected samples and assigned methods, different regression models yield consistent results within the errors (∓ σ) for both

  6. Modeling and prediction of flotation performance using support vector regression

    Directory of Open Access Journals (Sweden)

    Despotović Vladimir

    2017-01-01

    Full Text Available Continuous efforts have been made in recent year to improve the process of paper recycling, as it is of critical importance for saving the wood, water and energy resources. Flotation deinking is considered to be one of the key methods for separation of ink particles from the cellulose fibres. Attempts to model the flotation deinking process have often resulted in complex models that are difficult to implement and use. In this paper a model for prediction of flotation performance based on Support Vector Regression (SVR, is presented. Representative data samples were created in laboratory, under a variety of practical control variables for the flotation deinking process, including different reagents, pH values and flotation residence time. Predictive model was created that was trained on these data samples, and the flotation performance was assessed showing that Support Vector Regression is a promising method even when dataset used for training the model is limited.

  7. Effective porosity and density of carbonate rocks (Maynardville Limestone and Copper Ridge Dolomite) within Bear Creek Valley on the Oak Ridge Reservation based on modern petrophysical techniques

    International Nuclear Information System (INIS)

    Dorsch, J.

    1997-02-01

    The purpose of this study is to provide quantitative data on effective porosity of carbonate rock from the Maynardville Limestone and Copper Ridge Dolomite within Bear Creek Valley based on modern petrophysical techniques. The data will be useful for groundwater-flow and contaminant-flow modeling in the vicinity of the Y-12 Plant on the Oak Ridge Reservation (ORR). Furthermore, the data provides needed information on the amount of interconnected pore space potentially available for operation of matrix diffusion as a transport process within the fractured carbonate rock. A second aspect of this study is to compare effective porosity data based on modern petrophysical techniques to effective porosity data determined earlier by Goldstrand et al. (1995) with a different technique. An added bonus of the study is quantitative data on the bulk density and grain density of dolostone and limestone of the Maynardville Limestone and Copper Ridge Dolomite which might find use for geophysical modeling on the ORR

  8. Forecasting daily meteorological time series using ARIMA and regression models

    Science.gov (United States)

    Murat, Małgorzata; Malinowska, Iwona; Gos, Magdalena; Krzyszczak, Jaromir

    2018-04-01

    The daily air temperature and precipitation time series recorded between January 1, 1980 and December 31, 2010 in four European sites (Jokioinen, Dikopshof, Lleida and Lublin) from different climatic zones were modeled and forecasted. In our forecasting we used the methods of the Box-Jenkins and Holt- Winters seasonal auto regressive integrated moving-average, the autoregressive integrated moving-average with external regressors in the form of Fourier terms and the time series regression, including trend and seasonality components methodology with R software. It was demonstrated that obtained models are able to capture the dynamics of the time series data and to produce sensible forecasts.

  9. Replica analysis of overfitting in regression models for time-to-event data

    Science.gov (United States)

    Coolen, A. C. C.; Barrett, J. E.; Paga, P.; Perez-Vicente, C. J.

    2017-09-01

    Overfitting, which happens when the number of parameters in a model is too large compared to the number of data points available for determining these parameters, is a serious and growing problem in survival analysis. While modern medicine presents us with data of unprecedented dimensionality, these data cannot yet be used effectively for clinical outcome prediction. Standard error measures in maximum likelihood regression, such as p-values and z-scores, are blind to overfitting, and even for Cox’s proportional hazards model (the main tool of medical statisticians), one finds in literature only rules of thumb on the number of samples required to avoid overfitting. In this paper we present a mathematical theory of overfitting in regression models for time-to-event data, which aims to increase our quantitative understanding of the problem and provide practical tools with which to correct regression outcomes for the impact of overfitting. It is based on the replica method, a statistical mechanical technique for the analysis of heterogeneous many-variable systems that has been used successfully for several decades in physics, biology, and computer science, but not yet in medical statistics. We develop the theory initially for arbitrary regression models for time-to-event data, and verify its predictions in detail for the popular Cox model.

  10. Cross-validation pitfalls when selecting and assessing regression and classification models.

    Science.gov (United States)

    Krstajic, Damjan; Buturovic, Ljubomir J; Leahy, David E; Thomas, Simon

    2014-03-29

    We address the problem of selecting and assessing classification and regression models using cross-validation. Current state-of-the-art methods can yield models with high variance, rendering them unsuitable for a number of practical applications including QSAR. In this paper we describe and evaluate best practices which improve reliability and increase confidence in selected models. A key operational component of the proposed methods is cloud computing which enables routine use of previously infeasible approaches. We describe in detail an algorithm for repeated grid-search V-fold cross-validation for parameter tuning in classification and regression, and we define a repeated nested cross-validation algorithm for model assessment. As regards variable selection and parameter tuning we define two algorithms (repeated grid-search cross-validation and double cross-validation), and provide arguments for using the repeated grid-search in the general case. We show results of our algorithms on seven QSAR datasets. The variation of the prediction performance, which is the result of choosing different splits of the dataset in V-fold cross-validation, needs to be taken into account when selecting and assessing classification and regression models. We demonstrate the importance of repeating cross-validation when selecting an optimal model, as well as the importance of repeating nested cross-validation when assessing a prediction error.

  11. Geospatializing The Klang Gate Quartz Ridge in Malaysia: A Technological Perspective

    Science.gov (United States)

    Azahari Razak, Khamarrul; Mohamad, Zakaria; Zaki Ibrahim, Mohd; Azad Rosle, Qalam; Hattanajmie Abd Wahab, Mohd; Abu Bakar, Rabieahtul; Mohd Akib, Wan Abdul Aziz Wan

    2015-04-01

    Establishment of inventories on geological heritage, or geoheritage resources is a step forward for a comprehensive geoheritage management leading to a better conservation at national and global levels. Compiling and updating inventory of geoheritage is a tedious process and even so in a tropical environment. Malaysia has a tremendous list of geodiversity and generating its national database is a multi-institutional effort and worthwhile investment. However, producing accurate and reliable characteristics of such landform and spectacular geological features remained elusive. The advanced and modern mapping techniques have revolutionized the mapping, monitoring and modelling of the earth surface processes and landforms. Yet the methods for quantification of geodiversity physical features are not fully utilized in Malaysia for a better understanding its processes and activity. This study provides a better insight into the use of advanced active remote sensing technology for characterizing the forested Quartz Ridge in Malaysia. We have developed the novel method and tested in the Klang Gates Quartz Ridge, Selangor. The granitic country rock made up by quartz mineral is known as the longest quartz ridge in Malaysia and characterized by rugged topography, steep slopes, densely vegetated terrain and also rich-biodiversity area. This study presents an integrated field methodological framework and processing scheme by taking into account the climatic, topographic, geologic, and anthropogenic challenges in an equatorial region. Advanced terrestrial laser scanning system was used to accurately capture, map and model the ridge carried out within a relatively stringent time period. The high frequency Global Navigation Satellite System and modern Total Station coupled with the optical satellite and radar imageries and also advanced spatial analysis were fully utilized in the field campaign and data assessment performed during the recent monsoon season. As a result, the mapping

  12. A LATENT CLASS POISSON REGRESSION-MODEL FOR HETEROGENEOUS COUNT DATA

    NARCIS (Netherlands)

    WEDEL, M; DESARBO, WS; BULT, [No Value; RAMASWAMY, [No Value

    1993-01-01

    In this paper an approach is developed that accommodates heterogeneity in Poisson regression models for count data. The model developed assumes that heterogeneity arises from a distribution of both the intercept and the coefficients of the explanatory variables. We assume that the mixing

  13. Predicting Error Bars for QSAR Models

    International Nuclear Information System (INIS)

    Schroeter, Timon; Schwaighofer, Anton; Mika, Sebastian; Ter Laak, Antonius; Suelzle, Detlev; Ganzer, Ursula; Heinrich, Nikolaus; Mueller, Klaus-Robert

    2007-01-01

    Unfavorable physicochemical properties often cause drug failures. It is therefore important to take lipophilicity and water solubility into account early on in lead discovery. This study presents log D 7 models built using Gaussian Process regression, Support Vector Machines, decision trees and ridge regression algorithms based on 14556 drug discovery compounds of Bayer Schering Pharma. A blind test was conducted using 7013 new measurements from the last months. We also present independent evaluations using public data. Apart from accuracy, we discuss the quality of error bars that can be computed by Gaussian Process models, and ensemble and distance based techniques for the other modelling approaches

  14. Oak Ridge Leadership Computing Facility (OLCF)

    Data.gov (United States)

    Federal Laboratory Consortium — The Oak Ridge Leadership Computing Facility (OLCF) was established at Oak Ridge National Laboratory in 2004 with the mission of standing up a supercomputer 100 times...

  15. Continuous validation of ASTEC containment models and regression testing

    International Nuclear Information System (INIS)

    Nowack, Holger; Reinke, Nils; Sonnenkalb, Martin

    2014-01-01

    The focus of the ASTEC (Accident Source Term Evaluation Code) development at GRS is primarily on the containment module CPA (Containment Part of ASTEC), whose modelling is to a large extent based on the GRS containment code COCOSYS (COntainment COde SYStem). Validation is usually understood as the approval of the modelling capabilities by calculations of appropriate experiments done by external users different from the code developers. During the development process of ASTEC CPA, bugs and unintended side effects may occur, which leads to changes in the results of the initially conducted validation. Due to the involvement of a considerable number of developers in the coding of ASTEC modules, validation of the code alone, even if executed repeatedly, is not sufficient. Therefore, a regression testing procedure has been implemented in order to ensure that the initially obtained validation results are still valid with succeeding code versions. Within the regression testing procedure, calculations of experiments and plant sequences are performed with the same input deck but applying two different code versions. For every test-case the up-to-date code version is compared to the preceding one on the basis of physical parameters deemed to be characteristic for the test-case under consideration. In the case of post-calculations of experiments also a comparison to experimental data is carried out. Three validation cases from the regression testing procedure are presented within this paper. The very good post-calculation of the HDR E11.1 experiment shows the high quality modelling of thermal-hydraulics in ASTEC CPA. Aerosol behaviour is validated on the BMC VANAM M3 experiment, and the results show also a very good agreement with experimental data. Finally, iodine behaviour is checked in the validation test-case of the THAI IOD-11 experiment. Within this test-case, the comparison of the ASTEC versions V2.0r1 and V2.0r2 shows how an error was detected by the regression testing

  16. Linear Multivariable Regression Models for Prediction of Eddy Dissipation Rate from Available Meteorological Data

    Science.gov (United States)

    MCKissick, Burnell T. (Technical Monitor); Plassman, Gerald E.; Mall, Gerald H.; Quagliano, John R.

    2005-01-01

    Linear multivariable regression models for predicting day and night Eddy Dissipation Rate (EDR) from available meteorological data sources are defined and validated. Model definition is based on a combination of 1997-2000 Dallas/Fort Worth (DFW) data sources, EDR from Aircraft Vortex Spacing System (AVOSS) deployment data, and regression variables primarily from corresponding Automated Surface Observation System (ASOS) data. Model validation is accomplished through EDR predictions on a similar combination of 1994-1995 Memphis (MEM) AVOSS and ASOS data. Model forms include an intercept plus a single term of fixed optimal power for each of these regression variables; 30-minute forward averaged mean and variance of near-surface wind speed and temperature, variance of wind direction, and a discrete cloud cover metric. Distinct day and night models, regressing on EDR and the natural log of EDR respectively, yield best performance and avoid model discontinuity over day/night data boundaries.

  17. US Department of Energy Oak Ridge Operations Environmental Management Public Involvement Plan for the Oak Ridge Reservation

    International Nuclear Information System (INIS)

    1996-03-01

    This document was prepared in accordance with CERCLA requirements for writing community relations plans. It includes information on how the DOE Oak Ridge Operations Office prepares and executes Environmental Management Community relations activities. It is divided into three sections: the public involvement plan, public involvement in Oak Ridge, and public involvement in 1995. Four appendices are also included: environmental management in Oak Ridge; community and regional overview; key laws, agreements, and policy; and principal contacts

  18. Prediction of Mind-Wandering with Electroencephalogram and Non-linear Regression Modeling.

    Science.gov (United States)

    Kawashima, Issaku; Kumano, Hiroaki

    2017-01-01

    Mind-wandering (MW), task-unrelated thought, has been examined by researchers in an increasing number of articles using models to predict whether subjects are in MW, using numerous physiological variables. However, these models are not applicable in general situations. Moreover, they output only binary classification. The current study suggests that the combination of electroencephalogram (EEG) variables and non-linear regression modeling can be a good indicator of MW intensity. We recorded EEGs of 50 subjects during the performance of a Sustained Attention to Response Task, including a thought sampling probe that inquired the focus of attention. We calculated the power and coherence value and prepared 35 patterns of variable combinations and applied Support Vector machine Regression (SVR) to them. Finally, we chose four SVR models: two of them non-linear models and the others linear models; two of the four models are composed of a limited number of electrodes to satisfy model usefulness. Examination using the held-out data indicated that all models had robust predictive precision and provided significantly better estimations than a linear regression model using single electrode EEG variables. Furthermore, in limited electrode condition, non-linear SVR model showed significantly better precision than linear SVR model. The method proposed in this study helps investigations into MW in various little-examined situations. Further, by measuring MW with a high temporal resolution EEG, unclear aspects of MW, such as time series variation, are expected to be revealed. Furthermore, our suggestion that a few electrodes can also predict MW contributes to the development of neuro-feedback studies.

  19. Prediction of Mind-Wandering with Electroencephalogram and Non-linear Regression Modeling

    Directory of Open Access Journals (Sweden)

    Issaku Kawashima

    2017-07-01

    Full Text Available Mind-wandering (MW, task-unrelated thought, has been examined by researchers in an increasing number of articles using models to predict whether subjects are in MW, using numerous physiological variables. However, these models are not applicable in general situations. Moreover, they output only binary classification. The current study suggests that the combination of electroencephalogram (EEG variables and non-linear regression modeling can be a good indicator of MW intensity. We recorded EEGs of 50 subjects during the performance of a Sustained Attention to Response Task, including a thought sampling probe that inquired the focus of attention. We calculated the power and coherence value and prepared 35 patterns of variable combinations and applied Support Vector machine Regression (SVR to them. Finally, we chose four SVR models: two of them non-linear models and the others linear models; two of the four models are composed of a limited number of electrodes to satisfy model usefulness. Examination using the held-out data indicated that all models had robust predictive precision and provided significantly better estimations than a linear regression model using single electrode EEG variables. Furthermore, in limited electrode condition, non-linear SVR model showed significantly better precision than linear SVR model. The method proposed in this study helps investigations into MW in various little-examined situations. Further, by measuring MW with a high temporal resolution EEG, unclear aspects of MW, such as time series variation, are expected to be revealed. Furthermore, our suggestion that a few electrodes can also predict MW contributes to the development of neuro-feedback studies.

  20. Reconstruction of missing daily streamflow data using dynamic regression models

    Science.gov (United States)

    Tencaliec, Patricia; Favre, Anne-Catherine; Prieur, Clémentine; Mathevet, Thibault

    2015-12-01

    River discharge is one of the most important quantities in hydrology. It provides fundamental records for water resources management and climate change monitoring. Even very short data-gaps in this information can cause extremely different analysis outputs. Therefore, reconstructing missing data of incomplete data sets is an important step regarding the performance of the environmental models, engineering, and research applications, thus it presents a great challenge. The objective of this paper is to introduce an effective technique for reconstructing missing daily discharge data when one has access to only daily streamflow data. The proposed procedure uses a combination of regression and autoregressive integrated moving average models (ARIMA) called dynamic regression model. This model uses the linear relationship between neighbor and correlated stations and then adjusts the residual term by fitting an ARIMA structure. Application of the model to eight daily streamflow data for the Durance river watershed showed that the model yields reliable estimates for the missing data in the time series. Simulation studies were also conducted to evaluate the performance of the procedure.

  1. InRidge program: Preliminary results from the first cruise

    Digital Repository Service at National Institute of Oceanography (India)

    Mukhopadhyay, R.; Murthy, K.S.R.; Iyer, S.D.; Rao, M.M.M.; Banerjee, R.; Subrahmanyam, A.S.; Shirodkar, P.V.; Ghose, I.

    The first cruise under India's own Ridge research initiative, InRidge collected new data on bathymetry, free-air gravity and magnetic anomalies across the ridge axis between the Vema and Zhivago transform faults in the Central Indian Ridge...

  2. Detection of Outliers in Regression Model for Medical Data

    Directory of Open Access Journals (Sweden)

    Stephen Raj S

    2017-07-01

    Full Text Available In regression analysis, an outlier is an observation for which the residual is large in magnitude compared to other observations in the data set. The detection of outliers and influential points is an important step of the regression analysis. Outlier detection methods have been used to detect and remove anomalous values from data. In this paper, we detect the presence of outliers in simple linear regression models for medical data set. Chatterjee and Hadi mentioned that the ordinary residuals are not appropriate for diagnostic purposes; a transformed version of them is preferable. First, we investigate the presence of outliers based on existing procedures of residuals and standardized residuals. Next, we have used the new approach of standardized scores for detecting outliers without the use of predicted values. The performance of the new approach was verified with the real-life data.

  3. Modelos ridge em planejamentos de misturas: uma aplicação na extração da polpa de pequi Ridge models in mixture planning: an application in the extraction of pequi pulp

    Directory of Open Access Journals (Sweden)

    Tania Miranda Nepomucena

    2013-01-01

    Full Text Available Mixture Models can be used in experimental situations involving areas related to food science and chemistry. Some problems of a statistical nature can be found, such as effects of multicollinearity that result in uncertainty in the optimization of a dependent variable. This study proposes the application of the ridge model adapted for mixture planning considering the Kronecker (K-model and Scheffe (S-Model methods applied to response surfaces. The method determined the proportions of hexane, acetone and alcohol proportions that resulted in the maximum response of percentage of extracted pequi (Caryocar brasiliense pulp oil.

  4. Estimasi Model Seemingly Unrelated Regression (SUR dengan Metode Generalized Least Square (GLS

    Directory of Open Access Journals (Sweden)

    Ade Widyaningsih

    2015-04-01

    Full Text Available Regression analysis is a statistical tool that is used to determine the relationship between two or more quantitative variables so that one variable can be predicted from the other variables. A method that can used to obtain a good estimation in the regression analysis is ordinary least squares method. The least squares method is used to estimate the parameters of one or more regression but relationships among the errors in the response of other estimators are not allowed. One way to overcome this problem is Seemingly Unrelated Regression model (SUR in which parameters are estimated using Generalized Least Square (GLS. In this study, the author applies SUR model using GLS method on world gasoline demand data. The author obtains that SUR using GLS is better than OLS because SUR produce smaller errors than the OLS.

  5. Estimasi Model Seemingly Unrelated Regression (SUR dengan Metode Generalized Least Square (GLS

    Directory of Open Access Journals (Sweden)

    Ade Widyaningsih

    2014-06-01

    Full Text Available Regression analysis is a statistical tool that is used to determine the relationship between two or more quantitative variables so that one variable can be predicted from the other variables. A method that can used to obtain a good estimation in the regression analysis is ordinary least squares method. The least squares method is used to estimate the parameters of one or more regression but relationships among the errors in the response of other estimators are not allowed. One way to overcome this problem is Seemingly Unrelated Regression model (SUR in which parameters are estimated using Generalized Least Square (GLS. In this study, the author applies SUR model using GLS method on world gasoline demand data. The author obtains that SUR using GLS is better than OLS because SUR produce smaller errors than the OLS.

  6. Quality assurance project plan for the Chestnut Ridge Fly Ash Pond Stabilization Project at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1996-07-01

    The Chestnut Ridge Fly Ash Pond Stabilization (CRFAPS) Project will stabilize a 19-m-high (62-ft-high) earthen embankment across Upper McCoy Branch situated along the southern slope of Chestnut Ridge. This task will be accomplished by raising the crest of the embankment, reinforcing the face of the embankment, removing trees from the face and top of the embankment, and repairing the emergency spillway. The primary responsibilities of the team members are: Lockheed Martin Energy Systems, Inc., (Energy Systems) will be responsible for project integration, technical support, Title 3 field support, environmental oversight, and quality assurance (QA) oversight of the project; Foster Wheeler Environmental Corporation (FWENC) will be responsible for design and home office Title 3 support; MK-Ferguson of Oak Ridge Company (MK-F) will be responsible for health and safety, construction, and procurement of construction materials. Each of the team members has a QA program approved by the US Department of Energy (DOE) Oak Ridge Operations. This project-specific QA project plan (QAPP), which is applicable to all project activities, identifies and integrates the specific QA requirements from the participant's QA programs that are necessary for this project

  7. On pseudo-values for regression analysis in competing risks models

    DEFF Research Database (Denmark)

    Graw, F; Gerds, Thomas Alexander; Schumacher, M

    2009-01-01

    For regression on state and transition probabilities in multi-state models Andersen et al. (Biometrika 90:15-27, 2003) propose a technique based on jackknife pseudo-values. In this article we analyze the pseudo-values suggested for competing risks models and prove some conjectures regarding their...

  8. Modeling and prediction of Turkey's electricity consumption using Support Vector Regression

    International Nuclear Information System (INIS)

    Kavaklioglu, Kadir

    2011-01-01

    Support Vector Regression (SVR) methodology is used to model and predict Turkey's electricity consumption. Among various SVR formalisms, ε-SVR method was used since the training pattern set was relatively small. Electricity consumption is modeled as a function of socio-economic indicators such as population, Gross National Product, imports and exports. In order to facilitate future predictions of electricity consumption, a separate SVR model was created for each of the input variables using their current and past values; and these models were combined to yield consumption prediction values. A grid search for the model parameters was performed to find the best ε-SVR model for each variable based on Root Mean Square Error. Electricity consumption of Turkey is predicted until 2026 using data from 1975 to 2006. The results show that electricity consumption can be modeled using Support Vector Regression and the models can be used to predict future electricity consumption. (author)

  9. Cluster regression model and level fluctuation features of Van Lake, Turkey

    Directory of Open Access Journals (Sweden)

    Z. Şen

    1999-02-01

    Full Text Available Lake water levels change under the influences of natural and/or anthropogenic environmental conditions. Among these influences are the climate change, greenhouse effects and ozone layer depletions which are reflected in the hydrological cycle features over the lake drainage basins. Lake levels are among the most significant hydrological variables that are influenced by different atmospheric and environmental conditions. Consequently, lake level time series in many parts of the world include nonstationarity components such as shifts in the mean value, apparent or hidden periodicities. On the other hand, many lake level modeling techniques have a stationarity assumption. The main purpose of this work is to develop a cluster regression model for dealing with nonstationarity especially in the form of shifting means. The basis of this model is the combination of transition probability and classical regression technique. Both parts of the model are applied to monthly level fluctuations of Lake Van in eastern Turkey. It is observed that the cluster regression procedure does preserve the statistical properties and the transitional probabilities that are indistinguishable from the original data.Key words. Hydrology (hydrologic budget; stochastic processes · Meteorology and atmospheric dynamics (ocean-atmosphere interactions

  10. Cluster regression model and level fluctuation features of Van Lake, Turkey

    Directory of Open Access Journals (Sweden)

    Z. Şen

    Full Text Available Lake water levels change under the influences of natural and/or anthropogenic environmental conditions. Among these influences are the climate change, greenhouse effects and ozone layer depletions which are reflected in the hydrological cycle features over the lake drainage basins. Lake levels are among the most significant hydrological variables that are influenced by different atmospheric and environmental conditions. Consequently, lake level time series in many parts of the world include nonstationarity components such as shifts in the mean value, apparent or hidden periodicities. On the other hand, many lake level modeling techniques have a stationarity assumption. The main purpose of this work is to develop a cluster regression model for dealing with nonstationarity especially in the form of shifting means. The basis of this model is the combination of transition probability and classical regression technique. Both parts of the model are applied to monthly level fluctuations of Lake Van in eastern Turkey. It is observed that the cluster regression procedure does preserve the statistical properties and the transitional probabilities that are indistinguishable from the original data.

    Key words. Hydrology (hydrologic budget; stochastic processes · Meteorology and atmospheric dynamics (ocean-atmosphere interactions

  11. Department of Energy Air Emissions Annual Report Oak Ridge Reservation, Oak Ridge, Tennessee 40 Code of Federal Regulations (CFR) 61, Subpart H Calendar Year 2016

    Energy Technology Data Exchange (ETDEWEB)

    Martin, Richard [Oak Ridge Y-12 Plant (Y-12), Oak Ridge, TN (United States)

    2017-06-30

    As defined in the preamble of the final rule, the entire DOE facility on the Oak Ridge Reservation (ORR) must meet the 10 mrem/yr ED standard.1 In other words, the combined ED from all radiological air emission sources from Y-12 National Security Complex (Y-12 Complex), Oak Ridge National Laboratory (ORNL), East Tennessee Technology Park (ETTP), Oak Ridge Institute for Science and Education (ORISE) and any other DOE operation on the reservation must meet the 10 mrem/yr standard. Compliance with the standard is demonstrated through emission sampling, monitoring, calculations and radiation dose modeling in accordance with approved EPA methodologies and procedures. DOE estimates the ED to many individuals or receptor points in the vicinity of ORR, but it is the dose to the maximally exposed individual (MEI) that determines compliance with the standard.

  12. Improved model of the retardance in citric acid coated ferrofluids using stepwise regression

    Science.gov (United States)

    Lin, J. F.; Qiu, X. R.

    2017-06-01

    Citric acid (CA) coated Fe3O4 ferrofluids (FFs) have been conducted for biomedical application. The magneto-optical retardance of CA coated FFs was measured by a Stokes polarimeter. Optimization and multiple regression of retardance in FFs were executed by Taguchi method and Microsoft Excel previously, and the F value of regression model was large enough. However, the model executed by Excel was not systematic. Instead we adopted the stepwise regression to model the retardance of CA coated FFs. From the results of stepwise regression by MATLAB, the developed model had highly predictable ability owing to F of 2.55897e+7 and correlation coefficient of one. The average absolute error of predicted retardances to measured retardances was just 0.0044%. Using the genetic algorithm (GA) in MATLAB, the optimized parametric combination was determined as [4.709 0.12 39.998 70.006] corresponding to the pH of suspension, molar ratio of CA to Fe3O4, CA volume, and coating temperature. The maximum retardance was found as 31.712°, close to that obtained by evolutionary solver in Excel and a relative error of -0.013%. Above all, the stepwise regression method was successfully used to model the retardance of CA coated FFs, and the maximum global retardance was determined by the use of GA.

  13. Production, pathways and budgets of melts in mid-ocean ridges: An enthalpy based thermo-mechanical model

    Science.gov (United States)

    Mandal, Nibir; Sarkar, Shamik; Baruah, Amiya; Dutta, Urmi

    2018-04-01

    Using an enthalpy based thermo-mechanical model we provide a theoretical evaluation of melt production beneath mid-ocean ridges (MORs), and demonstrate how the melts subsequently develop their pathways to sustain the major ridge processes. Our model employs a Darcy idealization of the two-phase (solid-melt) system, accounting enthalpy (ΔH) as a function of temperature dependent liquid fraction (ϕ). Random thermal perturbations imposed in this model set in local convection that drive melts to flow through porosity controlled pathways with a typical mushroom-like 3D structure. We present across- and along-MOR axis model profiles to show the mode of occurrence of melt-rich zones within mushy regions, connected to deeper sources by single or multiple feeders. The upwelling of melts experiences two synchronous processes: 1) solidification-accretion, and 2) eruption, retaining a large melt fraction in the framework of mantle dynamics. Using a bifurcation analysis we determine the threshold condition for melt eruption, and estimate the potential volumes of eruptible melts (∼3.7 × 106 m3/yr) and sub-crustal solidified masses (∼1-8.8 × 106 m3/yr) on an axis length of 500 km. The solidification process far dominates over the eruption process in the initial phase, but declines rapidly on a time scale (t) of 1 Myr. Consequently, the eruption rate takes over the solidification rate, but attains nearly a steady value as t > 1.5 Myr. We finally present a melt budget, where a maximum of ∼5% of the total upwelling melt volume is available for eruption, whereas ∼19% for deeper level solidification; the rest continue to participate in the sub-crustal processes.

  14. Accounting for spatial effects in land use regression for urban air pollution modeling.

    Science.gov (United States)

    Bertazzon, Stefania; Johnson, Markey; Eccles, Kristin; Kaplan, Gilaad G

    2015-01-01

    In order to accurately assess air pollution risks, health studies require spatially resolved pollution concentrations. Land-use regression (LUR) models estimate ambient concentrations at a fine spatial scale. However, spatial effects such as spatial non-stationarity and spatial autocorrelation can reduce the accuracy of LUR estimates by increasing regression errors and uncertainty; and statistical methods for resolving these effects--e.g., spatially autoregressive (SAR) and geographically weighted regression (GWR) models--may be difficult to apply simultaneously. We used an alternate approach to address spatial non-stationarity and spatial autocorrelation in LUR models for nitrogen dioxide. Traditional models were re-specified to include a variable capturing wind speed and direction, and re-fit as GWR models. Mean R(2) values for the resulting GWR-wind models (summer: 0.86, winter: 0.73) showed a 10-20% improvement over traditional LUR models. GWR-wind models effectively addressed both spatial effects and produced meaningful predictive models. These results suggest a useful method for improving spatially explicit models. Copyright © 2015 The Authors. Published by Elsevier Ltd.. All rights reserved.

  15. Regression analysis understanding and building business and economic models using Excel

    CERN Document Server

    Wilson, J Holton

    2012-01-01

    The technique of regression analysis is used so often in business and economics today that an understanding of its use is necessary for almost everyone engaged in the field. This book will teach you the essential elements of building and understanding regression models in a business/economic context in an intuitive manner. The authors take a non-theoretical treatment that is accessible even if you have a limited statistical background. It is specifically designed to teach the correct use of regression, while advising you of its limitations and teaching about common pitfalls. This book describe

  16. The prediction of intelligence in preschool children using alternative models to regression.

    Science.gov (United States)

    Finch, W Holmes; Chang, Mei; Davis, Andrew S; Holden, Jocelyn E; Rothlisberg, Barbara A; McIntosh, David E

    2011-12-01

    Statistical prediction of an outcome variable using multiple independent variables is a common practice in the social and behavioral sciences. For example, neuropsychologists are sometimes called upon to provide predictions of preinjury cognitive functioning for individuals who have suffered a traumatic brain injury. Typically, these predictions are made using standard multiple linear regression models with several demographic variables (e.g., gender, ethnicity, education level) as predictors. Prior research has shown conflicting evidence regarding the ability of such models to provide accurate predictions of outcome variables such as full-scale intelligence (FSIQ) test scores. The present study had two goals: (1) to demonstrate the utility of a set of alternative prediction methods that have been applied extensively in the natural sciences and business but have not been frequently explored in the social sciences and (2) to develop models that can be used to predict premorbid cognitive functioning in preschool children. Predictions of Stanford-Binet 5 FSIQ scores for preschool-aged children is used to compare the performance of a multiple regression model with several of these alternative methods. Results demonstrate that classification and regression trees provided more accurate predictions of FSIQ scores than does the more traditional regression approach. Implications of these results are discussed.

  17. A primer for biomedical scientists on how to execute model II linear regression analysis.

    Science.gov (United States)

    Ludbrook, John

    2012-04-01

    1. There are two very different ways of executing linear regression analysis. One is Model I, when the x-values are fixed by the experimenter. The other is Model II, in which the x-values are free to vary and are subject to error. 2. I have received numerous complaints from biomedical scientists that they have great difficulty in executing Model II linear regression analysis. This may explain the results of a Google Scholar search, which showed that the authors of articles in journals of physiology, pharmacology and biochemistry rarely use Model II regression analysis. 3. I repeat my previous arguments in favour of using least products linear regression analysis for Model II regressions. I review three methods for executing ordinary least products (OLP) and weighted least products (WLP) regression analysis: (i) scientific calculator and/or computer spreadsheet; (ii) specific purpose computer programs; and (iii) general purpose computer programs. 4. Using a scientific calculator and/or computer spreadsheet, it is easy to obtain correct values for OLP slope and intercept, but the corresponding 95% confidence intervals (CI) are inaccurate. 5. Using specific purpose computer programs, the freeware computer program smatr gives the correct OLP regression coefficients and obtains 95% CI by bootstrapping. In addition, smatr can be used to compare the slopes of OLP lines. 6. When using general purpose computer programs, I recommend the commercial programs systat and Statistica for those who regularly undertake linear regression analysis and I give step-by-step instructions in the Supplementary Information as to how to use loss functions. © 2011 The Author. Clinical and Experimental Pharmacology and Physiology. © 2011 Blackwell Publishing Asia Pty Ltd.

  18. Applied logistic regression

    CERN Document Server

    Hosmer, David W; Sturdivant, Rodney X

    2013-01-01

     A new edition of the definitive guide to logistic regression modeling for health science and other applications This thoroughly expanded Third Edition provides an easily accessible introduction to the logistic regression (LR) model and highlights the power of this model by examining the relationship between a dichotomous outcome and a set of covariables. Applied Logistic Regression, Third Edition emphasizes applications in the health sciences and handpicks topics that best suit the use of modern statistical software. The book provides readers with state-of-

  19. Evaluation of Logistic Regression and Multivariate Adaptive Regression Spline Models for Groundwater Potential Mapping Using R and GIS

    Directory of Open Access Journals (Sweden)

    Soyoung Park

    2017-07-01

    Full Text Available This study mapped and analyzed groundwater potential using two different models, logistic regression (LR and multivariate adaptive regression splines (MARS, and compared the results. A spatial database was constructed for groundwater well data and groundwater influence factors. Groundwater well data with a high potential yield of ≥70 m3/d were extracted, and 859 locations (70% were used for model training, whereas the other 365 locations (30% were used for model validation. We analyzed 16 groundwater influence factors including altitude, slope degree, slope aspect, plan curvature, profile curvature, topographic wetness index, stream power index, sediment transport index, distance from drainage, drainage density, lithology, distance from fault, fault density, distance from lineament, lineament density, and land cover. Groundwater potential maps (GPMs were constructed using LR and MARS models and tested using a receiver operating characteristics curve. Based on this analysis, the area under the curve (AUC for the success rate curve of GPMs created using the MARS and LR models was 0.867 and 0.838, and the AUC for the prediction rate curve was 0.836 and 0.801, respectively. This implies that the MARS model is useful and effective for groundwater potential analysis in the study area.

  20. Sample size calculation to externally validate scoring systems based on logistic regression models.

    Directory of Open Access Journals (Sweden)

    Antonio Palazón-Bru

    Full Text Available A sample size containing at least 100 events and 100 non-events has been suggested to validate a predictive model, regardless of the model being validated and that certain factors can influence calibration of the predictive model (discrimination, parameterization and incidence. Scoring systems based on binary logistic regression models are a specific type of predictive model.The aim of this study was to develop an algorithm to determine the sample size for validating a scoring system based on a binary logistic regression model and to apply it to a case study.The algorithm was based on bootstrap samples in which the area under the ROC curve, the observed event probabilities through smooth curves, and a measure to determine the lack of calibration (estimated calibration index were calculated. To illustrate its use for interested researchers, the algorithm was applied to a scoring system, based on a binary logistic regression model, to determine mortality in intensive care units.In the case study provided, the algorithm obtained a sample size with 69 events, which is lower than the value suggested in the literature.An algorithm is provided for finding the appropriate sample size to validate scoring systems based on binary logistic regression models. This could be applied to determine the sample size in other similar cases.

  1. Detection of Cutting Tool Wear using Statistical Analysis and Regression Model

    Science.gov (United States)

    Ghani, Jaharah A.; Rizal, Muhammad; Nuawi, Mohd Zaki; Haron, Che Hassan Che; Ramli, Rizauddin

    2010-10-01

    This study presents a new method for detecting the cutting tool wear based on the measured cutting force signals. A statistical-based method called Integrated Kurtosis-based Algorithm for Z-Filter technique, called I-kaz was used for developing a regression model and 3D graphic presentation of I-kaz 3D coefficient during machining process. The machining tests were carried out using a CNC turning machine Colchester Master Tornado T4 in dry cutting condition. A Kistler 9255B dynamometer was used to measure the cutting force signals, which were transmitted, analyzed, and displayed in the DasyLab software. Various force signals from machining operation were analyzed, and each has its own I-kaz 3D coefficient. This coefficient was examined and its relationship with flank wear lands (VB) was determined. A regression model was developed due to this relationship, and results of the regression model shows that the I-kaz 3D coefficient value decreases as tool wear increases. The result then is used for real time tool wear monitoring.

  2. Tubular Initial Conditions and Ridge Formation

    Directory of Open Access Journals (Sweden)

    M. S. Borysova

    2013-01-01

    Full Text Available The 2D azimuth and rapidity structure of the two-particle correlations in relativistic A+A collisions is altered significantly by the presence of sharp inhomogeneities in superdense matter formed in such processes. The causality constraints enforce one to associate the long-range longitudinal correlations observed in a narrow angular interval, the so-called (soft ridge, with peculiarities of the initial conditions of collision process. This study's objective is to analyze whether multiform initial tubular structures, undergoing the subsequent hydrodynamic evolution and gradual decoupling, can form the soft ridges. Motivated by the flux-tube scenarios, the initial energy density distribution contains the different numbers of high density tube-like boost-invariant inclusions that form a bumpy structure in the transverse plane. The influence of various structures of such initial conditions in the most central A+A events on the collective evolution of matter, resulting spectra, angular particle correlations and vn-coefficients is studied in the framework of the hydrokinetic model (HKM.

  3. An aerial radiological survey of the Oak Ridge Reservation and surrounding area, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    Maurer, R.J.

    1989-09-01

    An aerial radiological survey of the Oak Ridge Reservation (ORR) and surrounding area in Oak Ridge, Tennessee, was conducted from September 12--29, 1989. The purpose of the survey was to measure and document the site's terrestrial radiological environment for use in effective environmental management and emergency response planning. The aerial survey was flown at an altitude of 91 meters (300 feet) along a series of parallel lines 152 meters (500 feet) apart. The survey encompassed an area of 440 square kilometers (170 square miles) as defined by the Tennessee Valley Authority Map S-16A of the entire Oak Ridge Reservation and adjacent area. The results of the aerial survey are reported as inferred exposure rates at 1 meter above ground level (AGL) in the form of a radiation contour map. Typical background exposure rates were found to vary from 5 to 14 microroentgens per hour (μR/h). The man-made radionuclides, cobalt-60, cesium-137, and protactinium-234m (a radioisotope indicative of depleted uranium), were detected at several facilities on the site. In support of the aerial survey, ground-based exposure rate and soil sample measurements were obtained at several locations within the survey boundary. In addition to the large scale aerial survey, two special flyovers were requested by the Department of Energy. The first request was to conduct a survey of a 1-mile x 2-mile area in south Knoxville, Tennessee. The area had been used previously to store contaminated scrap metals from operations at the Oak Ridge site. The second request was to fly several passes over a 5-mile length of railroad tracks leading from the Oak Ridge Y-12 Plant, north through the city of Oak Ridge. The railroad tracks had been previously used in the transport of cesium-137

  4. Strain distribution and model for formation of eastern Umtanum Ridge anticline, south-central Washington

    Energy Technology Data Exchange (ETDEWEB)

    Price, E.H.

    1979-10-01

    Umtanum Ridge in south-central Washington is the topographic expression of a complex anticline within the Yakima Fold system in the Miocene Columbia River Basalt Group. The Yakima Fold system, which is partly contained within the Hanford Site, is an example of a layered basalt sequence folded near the surface of the earth. The Pasco Basin stratigraphic nomenclature is used in this repot. Rockwelll Hanford Operations, under contract to the US Department of Energy, is investigating the feasibility of therminal high-level nuclear waste storage in mined repositories in basalt beneath the Hanford Site. Because thereis essentially no basalt within the Site that has not been involved in some folding, any potential location for a repository will be either on the limbs or near the hinge zone of a Yakima Fold structure. Umtanum Ridge is the best exposed Yakima Fold structure in the vicinity of the Site for studying the nature and three-dimensional style of deformation of a multilayered basalt sequence. The structural geometry, distribution of strain within the Umtanum structure and deformational mechanisms of the Umtanum Ridge are discussed.

  5. Strain distribution and model for formation of eastern Umtanum Ridge anticline, south-central Washington

    International Nuclear Information System (INIS)

    Price, E.H.

    1979-10-01

    Umtanum Ridge in south-central Washington is the topographic expression of a complex anticline within the Yakima Fold system in the Miocene Columbia River Basalt Group. The Yakima Fold system, which is partly contained within the Hanford Site, is an example of a layered basalt sequence folded near the surface of the earth. The Pasco Basin stratigraphic nomenclature is used in this repot. Rockwelll Hanford Operations, under contract to the US Department of Energy, is investigating the feasibility of therminal high-level nuclear waste storage in mined repositories in basalt beneath the Hanford Site. Because thereis essentially no basalt within the Site that has not been involved in some folding, any potential location for a repository will be either on the limbs or near the hinge zone of a Yakima Fold structure. Umtanum Ridge is the best exposed Yakima Fold structure in the vicinity of the Site for studying the nature and three-dimensional style of deformation of a multilayered basalt sequence. The structural geometry, distribution of strain within the Umtanum structure and deformational mechanisms of the Umtanum Ridge are discussed

  6. Using the Logistic Regression model in supporting decisions of establishing marketing strategies

    Directory of Open Access Journals (Sweden)

    Cristinel CONSTANTIN

    2015-12-01

    Full Text Available This paper is about an instrumental research regarding the using of Logistic Regression model for data analysis in marketing research. The decision makers inside different organisation need relevant information to support their decisions regarding the marketing strategies. The data provided by marketing research could be computed in various ways but the multivariate data analysis models can enhance the utility of the information. Among these models we can find the Logistic Regression model, which is used for dichotomous variables. Our research is based on explanation the utility of this model and interpretation of the resulted information in order to help practitioners and researchers to use it in their future investigations

  7. Vector regression introduced

    Directory of Open Access Journals (Sweden)

    Mok Tik

    2014-06-01

    Full Text Available This study formulates regression of vector data that will enable statistical analysis of various geodetic phenomena such as, polar motion, ocean currents, typhoon/hurricane tracking, crustal deformations, and precursory earthquake signals. The observed vector variable of an event (dependent vector variable is expressed as a function of a number of hypothesized phenomena realized also as vector variables (independent vector variables and/or scalar variables that are likely to impact the dependent vector variable. The proposed representation has the unique property of solving the coefficients of independent vector variables (explanatory variables also as vectors, hence it supersedes multivariate multiple regression models, in which the unknown coefficients are scalar quantities. For the solution, complex numbers are used to rep- resent vector information, and the method of least squares is deployed to estimate the vector model parameters after transforming the complex vector regression model into a real vector regression model through isomorphism. Various operational statistics for testing the predictive significance of the estimated vector parameter coefficients are also derived. A simple numerical example demonstrates the use of the proposed vector regression analysis in modeling typhoon paths.

  8. Multiple regression models for energy use in air-conditioned office buildings in different climates

    International Nuclear Information System (INIS)

    Lam, Joseph C.; Wan, Kevin K.W.; Liu Dalong; Tsang, C.L.

    2010-01-01

    An attempt was made to develop multiple regression models for office buildings in the five major climates in China - severe cold, cold, hot summer and cold winter, mild, and hot summer and warm winter. A total of 12 key building design variables were identified through parametric and sensitivity analysis, and considered as inputs in the regression models. The coefficient of determination R 2 varies from 0.89 in Harbin to 0.97 in Kunming, indicating that 89-97% of the variations in annual building energy use can be explained by the changes in the 12 parameters. A pseudo-random number generator based on three simple multiplicative congruential generators was employed to generate random designs for evaluation of the regression models. The difference between regression-predicted and DOE-simulated annual building energy use are largely within 10%. It is envisaged that the regression models developed can be used to estimate the likely energy savings/penalty during the initial design stage when different building schemes and design concepts are being considered.

  9. CICAAR - Convolutive ICA with an Auto-Regressive Inverse Model

    DEFF Research Database (Denmark)

    Dyrholm, Mads; Hansen, Lars Kai

    2004-01-01

    We invoke an auto-regressive IIR inverse model for convolutive ICA and derive expressions for the likelihood and its gradient. We argue that optimization will give a stable inverse. When there are more sensors than sources the mixing model parameters are estimated in a second step by least square...... estimation. We demonstrate the method on synthetic data and finally separate speech and music in a real room recording....

  10. Oak Ridge Reservation environmental report for 1991

    International Nuclear Information System (INIS)

    Mucke, P.C.

    1992-10-01

    The Oak Ridge Reservation Environmental Report for 1991 is the 21st in a series that began in 1971. The report documents the annual results of a comprehensive program to estimate the impact of the US Department of Energy (DOE) Oak Ridge operations upon human health and the environment. The report is organized into ten sections that address various aspects of effluent monitoring, environmental surveillance, dose assessment, waste management, and quality assurance. A compliance summary gives a synopsis of the status of each facility relative to applicable state and federal regulations. Data are included for the following: Oak Ridge Y-12 Plant; Oak Ridge National Laboratory (ORNL); and Oak Ridge K-25 Site. Effluent monitoring and environmental surveillance programs are intended to serve as effective indicators of contaminant releases and ambient contaminant concentrations that have the potential to result in adverse impacts to human health and the environment

  11. Two levels ARIMAX and regression models for forecasting time series data with calendar variation effects

    Science.gov (United States)

    Suhartono, Lee, Muhammad Hisyam; Prastyo, Dedy Dwi

    2015-12-01

    The aim of this research is to develop a calendar variation model for forecasting retail sales data with the Eid ul-Fitr effect. The proposed model is based on two methods, namely two levels ARIMAX and regression methods. Two levels ARIMAX and regression models are built by using ARIMAX for the first level and regression for the second level. Monthly men's jeans and women's trousers sales in a retail company for the period January 2002 to September 2009 are used as case study. In general, two levels of calendar variation model yields two models, namely the first model to reconstruct the sales pattern that already occurred, and the second model to forecast the effect of increasing sales due to Eid ul-Fitr that affected sales at the same and the previous months. The results show that the proposed two level calendar variation model based on ARIMAX and regression methods yields better forecast compared to the seasonal ARIMA model and Neural Networks.

  12. Formation of ridges in a stable lithosphere in mantle convection models with a viscoplastic rheology.

    Science.gov (United States)

    Rozel, A; Golabek, G J; Näf, R; Tackley, P J

    2015-06-28

    Numerical simulations of mantle convection with a viscoplastic rheology usually display mobile, episodic or stagnant lid regimes. In this study, we report a new convective regime in which a ridge can form without destabilizing the surrounding lithosphere or forming subduction zones. Using simulations in 2-D spherical annulus geometry, we show that a depth-dependent yield stress is sufficient to reach this ridge only regime. This regime occurs when the friction coefficient is close to the critical value between mobile lid and stagnant lid regimes. Maps of convective regime as a function of the parameters friction coefficients and depth dependence of viscosity are provided for both basal heating and mixed heating situations. The ridge only regime appears for both pure basal heating and mixed heating mode. For basal heating, this regime can occur for all vertical viscosity contrasts, while for mixed heating, a highly viscous deep mantle is required.

  13. Testing and Modeling Fuel Regression Rate in a Miniature Hybrid Burner

    Directory of Open Access Journals (Sweden)

    Luciano Fanton

    2012-01-01

    Full Text Available Ballistic characterization of an extended group of innovative HTPB-based solid fuel formulations for hybrid rocket propulsion was performed in a lab-scale burner. An optical time-resolved technique was used to assess the quasisteady regression history of single perforation, cylindrical samples. The effects of metalized additives and radiant heat transfer on the regression rate of such formulations were assessed. Under the investigated operating conditions and based on phenomenological models from the literature, analyses of the collected experimental data show an appreciable influence of the radiant heat flux from burnt gases and soot for both unloaded and loaded fuel formulations. Pure HTPB regression rate data are satisfactorily reproduced, while the impressive initial regression rates of metalized formulations require further assessment.

  14. An enhanced structure tensor method for sea ice ridge detection from GF-3 SAR imagery

    Science.gov (United States)

    Zhu, T.; Li, F.; Zhang, Y.; Zhang, S.; Spreen, G.; Dierking, W.; Heygster, G.

    2017-12-01

    In SAR imagery, ridges or leads are shown as the curvilinear features. The proposed ridge detection method is facilitated by their curvilinear shapes. The bright curvilinear features are recognized as the ridges while the dark curvilinear features are classified as the leads. In dual-polarization HH or HV channel of C-band SAR imagery, the bright curvilinear feature may be false alarm because the frost flowers of young leads may show as bright pixels associated with changes in the surface salinity under calm surface conditions. Wind roughened leads also trigger the backscatter increasing that can be misclassified as ridges [1]. Thus the width limitation is considered in this proposed structure tensor method [2], since only shape feature based method is not enough for detecting ridges. The ridge detection algorithm is based on the hypothesis that the bright pixels are ridges with curvilinear shapes and the ridge width is less 30 meters. Benefited from GF-3 with high spatial resolution of 3 meters, we provide an enhanced structure tensor method for detecting the significant ridge. The preprocessing procedures including the calibration and incidence angle normalization are also investigated. The bright pixels will have strong response to the bandpass filtering. The ridge training samples are delineated from the SAR imagery in the Log-Gabor filters to construct structure tensor. From the tensor, the dominant orientation of the pixel representing the ridge is determined by the dominant eigenvector. For the post-processing of structure tensor, the elongated kernel is desired to enhance the ridge curvilinear shape. Since ridge presents along a certain direction, the ratio of the dominant eigenvector will be used to measure the intensity of local anisotropy. The convolution filter has been utilized in the constructed structure tensor is used to model spatial contextual information. Ridge detection results from GF-3 show the proposed method performs better compared to the

  15. Two-step variable selection in quantile regression models

    Directory of Open Access Journals (Sweden)

    FAN Yali

    2015-06-01

    Full Text Available We propose a two-step variable selection procedure for high dimensional quantile regressions, in which the dimension of the covariates, pn is much larger than the sample size n. In the first step, we perform ℓ1 penalty, and we demonstrate that the first step penalized estimator with the LASSO penalty can reduce the model from an ultra-high dimensional to a model whose size has the same order as that of the true model, and the selected model can cover the true model. The second step excludes the remained irrelevant covariates by applying the adaptive LASSO penalty to the reduced model obtained from the first step. Under some regularity conditions, we show that our procedure enjoys the model selection consistency. We conduct a simulation study and a real data analysis to evaluate the finite sample performance of the proposed approach.

  16. Regression Models for Predicting Force Coefficients of Aerofoils

    Directory of Open Access Journals (Sweden)

    Mohammed ABDUL AKBAR

    2015-09-01

    Full Text Available Renewable sources of energy are attractive and advantageous in a lot of different ways. Among the renewable energy sources, wind energy is the fastest growing type. Among wind energy converters, Vertical axis wind turbines (VAWTs have received renewed interest in the past decade due to some of the advantages they possess over their horizontal axis counterparts. VAWTs have evolved into complex 3-D shapes. A key component in predicting the output of VAWTs through analytical studies is obtaining the values of lift and drag coefficients which is a function of shape of the aerofoil, ‘angle of attack’ of wind and Reynolds’s number of flow. Sandia National Laboratories have carried out extensive experiments on aerofoils for the Reynolds number in the range of those experienced by VAWTs. The volume of experimental data thus obtained is huge. The current paper discusses three Regression analysis models developed wherein lift and drag coefficients can be found out using simple formula without having to deal with the bulk of the data. Drag coefficients and Lift coefficients were being successfully estimated by regression models with R2 values as high as 0.98.

  17. The Relationship between Economic Growth and Money Laundering – a Linear Regression Model

    Directory of Open Access Journals (Sweden)

    Daniel Rece

    2009-09-01

    Full Text Available This study provides an overview of the relationship between economic growth and money laundering modeled by a least squares function. The report analyzes statistically data collected from USA, Russia, Romania and other eleven European countries, rendering a linear regression model. The study illustrates that 23.7% of the total variance in the regressand (level of money laundering is “explained” by the linear regression model. In our opinion, this model will provide critical auxiliary judgment and decision support for anti-money laundering service systems.

  18. Regression analysis of informative current status data with the additive hazards model.

    Science.gov (United States)

    Zhao, Shishun; Hu, Tao; Ma, Ling; Wang, Peijie; Sun, Jianguo

    2015-04-01

    This paper discusses regression analysis of current status failure time data arising from the additive hazards model in the presence of informative censoring. Many methods have been developed for regression analysis of current status data under various regression models if the censoring is noninformative, and also there exists a large literature on parametric analysis of informative current status data in the context of tumorgenicity experiments. In this paper, a semiparametric maximum likelihood estimation procedure is presented and in the method, the copula model is employed to describe the relationship between the failure time of interest and the censoring time. Furthermore, I-splines are used to approximate the nonparametric functions involved and the asymptotic consistency and normality of the proposed estimators are established. A simulation study is conducted and indicates that the proposed approach works well for practical situations. An illustrative example is also provided.

  19. Poisson regression approach for modeling fatal injury rates amongst Malaysian workers

    International Nuclear Information System (INIS)

    Kamarulzaman Ibrahim; Heng Khai Theng

    2005-01-01

    Many safety studies are based on the analysis carried out on injury surveillance data. The injury surveillance data gathered for the analysis include information on number of employees at risk of injury in each of several strata where the strata are defined in terms of a series of important predictor variables. Further insight into the relationship between fatal injury rates and predictor variables may be obtained by the poisson regression approach. Poisson regression is widely used in analyzing count data. In this study, poisson regression is used to model the relationship between fatal injury rates and predictor variables which are year (1995-2002), gender, recording system and industry type. Data for the analysis were obtained from PERKESO and Jabatan Perangkaan Malaysia. It is found that the assumption that the data follow poisson distribution has been violated. After correction for the problem of over dispersion, the predictor variables that are found to be significant in the model are gender, system of recording, industry type, two interaction effects (interaction between recording system and industry type and between year and industry type). Introduction Regression analysis is one of the most popular

  20. Accounting for Zero Inflation of Mussel Parasite Counts Using Discrete Regression Models

    Directory of Open Access Journals (Sweden)

    Emel Çankaya

    2017-06-01

    Full Text Available In many ecological applications, the absences of species are inevitable due to either detection faults in samples or uninhabitable conditions for their existence, resulting in high number of zero counts or abundance. Usual practice for modelling such data is regression modelling of log(abundance+1 and it is well know that resulting model is inadequate for prediction purposes. New discrete models accounting for zero abundances, namely zero-inflated regression (ZIP and ZINB, Hurdle-Poisson (HP and Hurdle-Negative Binomial (HNB amongst others are widely preferred to the classical regression models. Due to the fact that mussels are one of the economically most important aquatic products of Turkey, the purpose of this study is therefore to examine the performances of these four models in determination of the significant biotic and abiotic factors on the occurrences of Nematopsis legeri parasite harming the existence of Mediterranean mussels (Mytilus galloprovincialis L.. The data collected from the three coastal regions of Sinop city in Turkey showed more than 50% of parasite counts on the average are zero-valued and model comparisons were based on information criterion. The results showed that the probability of the occurrence of this parasite is here best formulated by ZINB or HNB models and influential factors of models were found to be correspondent with ecological differences of the regions.

  1. Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis.

    Science.gov (United States)

    Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

    2015-01-01

    Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.

  2. Transpiration of glasshouse rose crops: evaluation of regression models

    NARCIS (Netherlands)

    Baas, R.; Rijssel, van E.

    2006-01-01

    Regression models of transpiration (T) based on global radiation inside the greenhouse (G), with or without energy input from heating pipes (Eh) and/or vapor pressure deficit (VPD) were parameterized. Therefore, data on T, G, temperatures from air, canopy and heating pipes, and VPD from both a

  3. Convergent Time-Varying Regression Models for Data Streams: Tracking Concept Drift by the Recursive Parzen-Based Generalized Regression Neural Networks.

    Science.gov (United States)

    Duda, Piotr; Jaworski, Maciej; Rutkowski, Leszek

    2018-03-01

    One of the greatest challenges in data mining is related to processing and analysis of massive data streams. Contrary to traditional static data mining problems, data streams require that each element is processed only once, the amount of allocated memory is constant and the models incorporate changes of investigated streams. A vast majority of available methods have been developed for data stream classification and only a few of them attempted to solve regression problems, using various heuristic approaches. In this paper, we develop mathematically justified regression models working in a time-varying environment. More specifically, we study incremental versions of generalized regression neural networks, called IGRNNs, and we prove their tracking properties - weak (in probability) and strong (with probability one) convergence assuming various concept drift scenarios. First, we present the IGRNNs, based on the Parzen kernels, for modeling stationary systems under nonstationary noise. Next, we extend our approach to modeling time-varying systems under nonstationary noise. We present several types of concept drifts to be handled by our approach in such a way that weak and strong convergence holds under certain conditions. Finally, in the series of simulations, we compare our method with commonly used heuristic approaches, based on forgetting mechanism or sliding windows, to deal with concept drift. Finally, we apply our concept in a real life scenario solving the problem of currency exchange rates prediction.

  4. Predicting Antitumor Activity of Peptides by Consensus of Regression Models Trained on a Small Data Sample

    Directory of Open Access Journals (Sweden)

    Ivanka Jerić

    2011-11-01

    Full Text Available Predicting antitumor activity of compounds using regression models trained on a small number of compounds with measured biological activity is an ill-posed inverse problem. Yet, it occurs very often within the academic community. To counteract, up to some extent, overfitting problems caused by a small training data, we propose to use consensus of six regression models for prediction of biological activity of virtual library of compounds. The QSAR descriptors of 22 compounds related to the opioid growth factor (OGF, Tyr-Gly-Gly-Phe-Met with known antitumor activity were used to train regression models: the feed-forward artificial neural network, the k-nearest neighbor, sparseness constrained linear regression, the linear and nonlinear (with polynomial and Gaussian kernel support vector machine. Regression models were applied on a virtual library of 429 compounds that resulted in six lists with candidate compounds ranked by predicted antitumor activity. The highly ranked candidate compounds were synthesized, characterized and tested for an antiproliferative activity. Some of prepared peptides showed more pronounced activity compared with the native OGF; however, they were less active than highly ranked compounds selected previously by the radial basis function support vector machine (RBF SVM regression model. The ill-posedness of the related inverse problem causes unstable behavior of trained regression models on test data. These results point to high complexity of prediction based on the regression models trained on a small data sample.

  5. Resolving issues at the Department of Energy/Oak Ridge Operations Facilities

    International Nuclear Information System (INIS)

    Row, T.H.; Adams, W.D.

    1988-01-01

    Waste management, like many other issues, has experienced major milestones. In 1971, the Calvert Cliff's decision resulted in an entirely different approach to the consideration of environmental impact analysis in reactor siting. The accidents at Three Mile Island and Chernobyl have had profound effects on nuclear power plant design. The high-level waste repository program has had many similar experiences that have modified the course of events. The management of radioactive, hazardous chemical and mixed waste in all of the facilities of the Oak Ridge Operations (ORO) Office of the Department of Energy (DOE) took on an entirely different meaning in 1984. On April 13, 1984, Federal Judge Robert Taylor said that DOE should proceed 'with all deliberate speed' to bring the Y-12 plant into compliance with the Resource Conservation and Recovery Act and the Clean Water Act. This decision resulted from a suit brought by the Legal Environmental Assistance Foundation (LEAF) and grew out of a continuing revelation of mercury spills and other problems related to the Oak Ridge plants of DOE. In this same time frame, other events occurred in Oak Ridge that would set the stage for major changes, to provide the supporting environment that allowed a very different and successful approach to resolving waste management issues at the DOE/ORO Facilities. This is the origin of the Oak Ridge Model which was recently adopted as the DOE Model. The concept is to assure that all stakeholders in waste management decisions have the opportunity to be participants from the first step. A discussion of many of the elements that have contributed to the success of the Model follows

  6. Magmatic tectonic effects of high thermal regime at the site of active ridge subduction: the Chile Triple Junction model

    Science.gov (United States)

    Lagabrielle, Yves; Guivel, Christèle; Maury, René C.; Bourgois, Jacques; Fourcade, Serge; Martin, Hervé

    2000-11-01

    High thermal gradients are expected to be found at sites of subduction of very young oceanic lithosphere and more particularly at ridge-trench-trench (RTT) triple junctions, where active oceanic spreading ridges enter a subduction zone. Active tectonics, associated with the emplacement of two main types of volcanic products, (1) MORB-type magmas, and (2) calc-alkaline acidic magmas in the forearc, also characterize these plate junction domains. In this context, MORB-type magmas are generally thought to derive from the buried active spreading center subducted at shallow depths, whereas the origin of calc-alkaline acidic magmas is more problematic. One of the best constrained examples of ridge-trench interaction is the Chile Triple Junction (CTJ) located southwest of the South American plate at 46°12'S, where the active Chile spreading center enters the subduction zone. In this area, there is a clear correlation between the emplacement of magmatic products and the migration of the triple junction along the active margin. The CTJ lava population is bimodal, with mafic to intermediate lavas (48-56% SiO 2) and acidic lavas ranging from dacites to rhyolites (66-73% SiO 2). Previous models have shown that partial melting of oceanic crust plus 10-20% of sediments, leaving an amphibole- and plagioclase-rich residue, is the only process that may account for the genesis of acidic magmas. Due to special plate geometry in the CTJ area, a given section of the margin may be successively affected by the passage of several ridge segments. We emphasize that repeated passages will lead to the development of very high thermal gradients allowing melting of rocks of oceanic origin at temperatures of 800-900°C and low pressures, corresponding to depths of 10-20 km depth only. In addition, the structure of the CTJ forearc domain is dominated by horizontal displacements and tilting of crustal blocks along a network of strike-slip faults. The occurrence of such a deformed domain implies

  7. Modelling of hydrothermal fluid circulation in a heterogeneous medium: Application to the Rainbow Vent site (Mid-Atlantic-Ridge, 36°14N)

    Science.gov (United States)

    Perez, F.; Mügler, C.; Jean-Baptiste, P.; Charlou, J. L.

    2012-04-01

    Hydrothermal activity at the axis of mid-ocean ridges is a key driver for energy and matter transfer from the interior of the Earth to the ocean floor. At mid-ocean ridges, seawater penetrates through the permeable young crust, warms at depth and exchanges chemicals with the surrounding rocks. This hot fluid focuses and flows upwards, then is expelled from the crust at hydrothermal vent sites in the form of black or white smokers completed by diffusive emissions. We developed a new numerical tool in the Cast3M software framework to model such hydrothermal circulations. Thermodynamic properties of one-phase pure water were calculated from the IAPWS formulation. This new numerical tool was validated on several test cases of convection in closed-top and open-top boxes. Simulations of hydrothermal circulation in a homogeneous-permeability porous medium also gave results in good agreement with already published simulations. We used this new numerical tool to construct a geometric and physical model configuration of the Rainbow Vent site at 36°14'N on the Mid-Atlantic Ridge. In this presentation, several configurations will be discussed, showing that high temperatures and high mass fluxes measured at the Rainbow site cannot be modelled with hydrothermal circulation in a homogeneous-permeability porous medium. We will show that these high values require the presence of a fault or a preferential pathway right below the venting site. We will propose and discuss a 2-D one-path model that allows us to simulate both high temperatures and high mass fluxes. This modelling of the hydrothermal circulation at the Rainbow site constitutes a first but necessary step to understand the origin of high concentrations of hydrogen issued from this ultramafic-hosted vent field.

  8. Oak Ridge Reservation environmental report for 1989

    International Nuclear Information System (INIS)

    Jacobs, V.A.; Wilson, A.R.

    1990-10-01

    This two-volume report, the Oak Ridge Reservation Environmental Report for 1989, is the nineteenth in an annual series that began in 1971. It reports the results of a comprehensive, year-round program to monitor the impact of operations at the three major US Department of Energy (DOE) production and research installations in Oak Ridge on the immediate areas' and surrounding region's groundwater and surface waters, soil, air quality, vegetation and wildlife, and through these multiple and varied pathways, the resident human population. Information is presented for the environmental monitoring Quality Assurance (QA) Program, audits and reviews, waste management activities, land special environmental studies. Data are included for the Oak Ridge Y-12 Plant, Oak Ridge National Laboratory (ORNL), and Oak Ridge Gaseous Diffusion Plant (ORGDP). Volume 1 presents narratives, summaries, and conclusions based on environmental monitoring at the three DOE installations and in the surrounding environs during calendar year (CY) 1989. Volume 1 is intended to be a ''stand-alone'' report about the Oak Ridge Reservation (ORR) for the reader who does not want an in-depth review of 1989 data. Volume 2 presents the detailed data from which these conclusions have been drawn and should be used in conjunction with Volume 1

  9. Oak Ridge Reservation environmental report for 1989

    Energy Technology Data Exchange (ETDEWEB)

    Jacobs, V.A.; Wilson, A.R. (eds.)

    1990-10-01

    This two-volume report, the Oak Ridge Reservation Environmental Report for 1989, is the nineteenth in an annual series that began in 1971. It reports the results of a comprehensive, year-round program to monitor the impact of operations at the three major US Department of Energy (DOE) production and research installations in Oak Ridge on the immediate areas' and surrounding region's groundwater and surface waters, soil, air quality, vegetation and wildlife, and through these multiple and varied pathways, the resident human population. Information is presented for the environmental monitoring Quality Assurance (QA) Program, audits and reviews, waste management activities, land special environmental studies. Data are included for the Oak Ridge Y-12 Plant, Oak Ridge National Laboratory (ORNL), and Oak Ridge Gaseous Diffusion Plant (ORGDP). Volume 1 presents narratives, summaries, and conclusions based on environmental monitoring at the three DOE installations and in the surrounding environs during calendar year (CY) 1989. Volume 1 is intended to be a stand-alone'' report about the Oak Ridge Reservation (ORR) for the reader who does not want an in-depth review of 1989 data. Volume 2 presents the detailed data from which these conclusions have been drawn and should be used in conjunction with Volume 1.

  10. Photovoltaic Array Condition Monitoring Based on Online Regression of Performance Model

    DEFF Research Database (Denmark)

    Spataru, Sergiu; Sera, Dezso; Kerekes, Tamas

    2013-01-01

    regression modeling, from PV array production, plane-of-array irradiance, and module temperature measurements, acquired during an initial learning phase of the system. After the model has been parameterized automatically, the condition monitoring system enters the normal operation phase, where...

  11. Extended cox regression model: The choice of timefunction

    Science.gov (United States)

    Isik, Hatice; Tutkun, Nihal Ata; Karasoy, Durdu

    2017-07-01

    Cox regression model (CRM), which takes into account the effect of censored observations, is one the most applicative and usedmodels in survival analysis to evaluate the effects of covariates. Proportional hazard (PH), requires a constant hazard ratio over time, is the assumptionofCRM. Using extended CRM provides the test of including a time dependent covariate to assess the PH assumption or an alternative model in case of nonproportional hazards. In this study, the different types of real data sets are used to choose the time function and the differences between time functions are analyzed and discussed.

  12. Alveolar ridge rehabilitation to increase full denture retention and stability

    Directory of Open Access Journals (Sweden)

    Mefina Kuntjoro

    2010-12-01

    Full Text Available Background: Atrophic mandibular alveolar ridge generally complicates prostetic restoration expecially full denture. Low residual alveolar ridge and basal seat can cause unstable denture, permanent ulcer, pain, neuralgia, and mastication difficulty. Pre-proshetic surgery is needed to improve denture retention and stability. Augmentation is a major surgery to increase vertical height of the atrophic mandible while vestibuloplasty is aimed to increase the denture bearing area. Purpose: The augmentation and vestibuloplasty was aimed to provide stability and retentive denture atrophic mandibular alveolar ridge. Case: A 65 years old woman patient complained about uncomfortable denture. Clinical evaluate showed flat ridge in the anterior mandible, flabby tissue and candidiasis, while residual ridge height was classified into class IV. Case management: Augmentation using autograph was conducted as the mandible vertical height is less than 15 mm. Autograph was used to achieve better bone quantity and quality. Separated alveolar ridge was conducted from left to right canine region and was elevated 0.5 mm from the previous position to get new ridge in the anterior region. The separated alveolar ridge was fixated by using T-plate and ligature wire. Three months after augmentation fixation appliances was removed vestibuloplasty was performed to increase denture bearing area that can make a stable and retentive denture. Conclusion: Augmentation and vestibuloplasty can improve flat ridge to become prominent.Latar belakang: Ridge mandibula yang atrofi pada umumnya mempersulit pembuatan restorasi prostetik terutama gigi tiruan lengkap (GTL. Residual alveolar ridge dan basal seat yang rendah menyebabkan gigi tiruan menjadi tidak stabil, menimbulkan ulser permanen, nyeri, neuralgia, dan kesulitan mengunyah. Tujuan: Augmentasi dan vestibuloplasti pada ridge mandibula yang atrofi dilakukan untuk menciptakan gigi tiruan yang stabil dan retentive. Kasus: Pasien wanita

  13. INVESTIGATION OF E-MAIL TRAFFIC BY USING ZERO-INFLATED REGRESSION MODELS

    Directory of Open Access Journals (Sweden)

    Yılmaz KAYA

    2012-06-01

    Full Text Available Based on count data obtained with a value of zero may be greater than anticipated. These types of data sets should be used to analyze by regression methods taking into account zero values. Zero- Inflated Poisson (ZIP, Zero-Inflated negative binomial (ZINB, Poisson Hurdle (PH, negative binomial Hurdle (NBH are more common approaches in modeling more zero value possessing dependent variables than expected. In the present study, the e-mail traffic of Yüzüncü Yıl University in 2009 spring semester was investigated. ZIP and ZINB, PH and NBH regression methods were applied on the data set because more zeros counting (78.9% were found in data set than expected. ZINB and NBH regression considered zero dispersion and overdispersion were found to be more accurate results due to overdispersion and zero dispersion in sending e-mail. ZINB is determined to be best model accordingto Vuong statistics and information criteria.

  14. Focused information criterion and model averaging based on weighted composite quantile regression

    KAUST Repository

    Xu, Ganggang; Wang, Suojin; Huang, Jianhua Z.

    2013-01-01

    We study the focused information criterion and frequentist model averaging and their application to post-model-selection inference for weighted composite quantile regression (WCQR) in the context of the additive partial linear models. With the non

  15. Ridge and Furrow Fields

    DEFF Research Database (Denmark)

    Møller, Per Grau

    2016-01-01

    Ridge and furrow is a specific way of ploughing which makes fields of systematic ridges and furrows like a rubbing washboard. They are part of an overall openfield system, but the focus in this paper is on the functionality of the fields. There are many indications that agro-technological reasons...... systems and the establishment of basic structures like villages (with churches) and townships and states (in northern Europe). The fields can be considered as a resilient structure lasting for 800 years, along with the same basic physical structures in society....

  16. Model-based bootstrapping when correcting for measurement error with application to logistic regression.

    Science.gov (United States)

    Buonaccorsi, John P; Romeo, Giovanni; Thoresen, Magne

    2018-03-01

    When fitting regression models, measurement error in any of the predictors typically leads to biased coefficients and incorrect inferences. A plethora of methods have been proposed to correct for this. Obtaining standard errors and confidence intervals using the corrected estimators can be challenging and, in addition, there is concern about remaining bias in the corrected estimators. The bootstrap, which is one option to address these problems, has received limited attention in this context. It has usually been employed by simply resampling observations, which, while suitable in some situations, is not always formally justified. In addition, the simple bootstrap does not allow for estimating bias in non-linear models, including logistic regression. Model-based bootstrapping, which can potentially estimate bias in addition to being robust to the original sampling or whether the measurement error variance is constant or not, has received limited attention. However, it faces challenges that are not present in handling regression models with no measurement error. This article develops new methods for model-based bootstrapping when correcting for measurement error in logistic regression with replicate measures. The methodology is illustrated using two examples, and a series of simulations are carried out to assess and compare the simple and model-based bootstrap methods, as well as other standard methods. While not always perfect, the model-based approaches offer some distinct improvements over the other methods. © 2017, The International Biometric Society.

  17. Seawater Circulation and Thermal Sink at OCEAN Ridges - FIELD Evidence in Oman Ophiolite

    Science.gov (United States)

    Nicolas, A. A.; Boudier, F. I.; Cathles, L. M.; Buck, W. R.; Celerier, B. P.

    2014-12-01

    Exceptionally, the lowermost gabbros in the Oman ophiolite are black and totally fresh, except for minute traces of impregnation by seawater fluids at very high temperature (~1000°C). These black gabbros sharply contrast with normal, whitish gabbros altered down to Low-T~500-350°C. These hydrous alterations are ascribed to an unconventional model of seawater circulation and cooling of the permanent magma chambers of fast spreading ocean ridges. In this model, gabbros issued from the magma chamber cross a ~100 m thick thermal boundary layer (TBL) before reaching a narrow, Low-T high permeability channel where the heated return seawater is flowing towards black smokers and the local gabbros are altered. Uprising mantle diapirs in Oman diverge at ~5 km on each side of the palaeo-ridge axis and feed an overlying magma chamber that closes at this distance from axis. Preservation of black gabbros along the Moho implies that the loop of seawater alteration locally does not reach Moho beyond this ~5km distance (otherwise black gabbros would be altered in whitish gabbros). This defines an internal "thermal sink" within ~5 km to the ridge axis. There, the sink is efficiently cooled by the active hydrothermal convection that is ridge transverse. This has been documented near the Galapagos ridge by marine geophysical data, within the same distance. Beyond this critical distance, the cooling system becomes dominantly conductive and ridge-parallel. The TBL and attached return flow channels must be rising into the overcooled, accreted crust. Beyond the thermal sink, the 500°C isotherm rebounds into the crust. It is only after ~ 1My of crustal drift that this isotherm penetrates into the uppermost mantle in a sustained fashion, developing serpentinites at the expense of peridotites.

  18. Ordinal regression models to describe tourist satisfaction with Sintra's world heritage

    Science.gov (United States)

    Mouriño, Helena

    2013-10-01

    In Tourism Research, ordinal regression models are becoming a very powerful tool in modelling the relationship between an ordinal response variable and a set of explanatory variables. In August and September 2010, we conducted a pioneering Tourist Survey in Sintra, Portugal. The data were obtained by face-to-face interviews at the entrances of the Palaces and Parks of Sintra. The work developed in this paper focus on two main points: tourists' perception of the entrance fees; overall level of satisfaction with this heritage site. For attaining these goals, ordinal regression models were developed. We concluded that tourist's nationality was the only significant variable to describe the perception of the admission fees. Also, Sintra's image among tourists depends not only on their nationality, but also on previous knowledge about Sintra's World Heritage status.

  19. Combination of supervised and semi-supervised regression models for improved unbiased estimation

    DEFF Research Database (Denmark)

    Arenas-Garía, Jeronimo; Moriana-Varo, Carlos; Larsen, Jan

    2010-01-01

    In this paper we investigate the steady-state performance of semisupervised regression models adjusted using a modified RLS-like algorithm, identifying the situations where the new algorithm is expected to outperform standard RLS. By using an adaptive combination of the supervised and semisupervi......In this paper we investigate the steady-state performance of semisupervised regression models adjusted using a modified RLS-like algorithm, identifying the situations where the new algorithm is expected to outperform standard RLS. By using an adaptive combination of the supervised...

  20. Random regression models for daily feed intake in Danish Duroc pigs

    DEFF Research Database (Denmark)

    Strathe, Anders Bjerring; Mark, Thomas; Jensen, Just

    The objective of this study was to develop random regression models and estimate covariance functions for daily feed intake (DFI) in Danish Duroc pigs. A total of 476201 DFI records were available on 6542 Duroc boars between 70 to 160 days of age. The data originated from the National test station......-year-season, permanent, and animal genetic effects. The functional form was based on Legendre polynomials. A total of 64 models for random regressions were initially ranked by BIC to identify the approximate order for the Legendre polynomials using AI-REML. The parsimonious model included Legendre polynomials of 2nd...... order for genetic and permanent environmental curves and a heterogeneous residual variance, allowing the daily residual variance to change along the age trajectory due to scale effects. The parameters of the model were estimated in a Bayesian framework, using the RJMC module of the DMU package, where...

  1. Longitudinal beta regression models for analyzing health-related quality of life scores over time

    Directory of Open Access Journals (Sweden)

    Hunger Matthias

    2012-09-01

    Full Text Available Abstract Background Health-related quality of life (HRQL has become an increasingly important outcome parameter in clinical trials and epidemiological research. HRQL scores are typically bounded at both ends of the scale and often highly skewed. Several regression techniques have been proposed to model such data in cross-sectional studies, however, methods applicable in longitudinal research are less well researched. This study examined the use of beta regression models for analyzing longitudinal HRQL data using two empirical examples with distributional features typically encountered in practice. Methods We used SF-6D utility data from a German older age cohort study and stroke-specific HRQL data from a randomized controlled trial. We described the conceptual differences between mixed and marginal beta regression models and compared both models to the commonly used linear mixed model in terms of overall fit and predictive accuracy. Results At any measurement time, the beta distribution fitted the SF-6D utility data and stroke-specific HRQL data better than the normal distribution. The mixed beta model showed better likelihood-based fit statistics than the linear mixed model and respected the boundedness of the outcome variable. However, it tended to underestimate the true mean at the upper part of the distribution. Adjusted group means from marginal beta model and linear mixed model were nearly identical but differences could be observed with respect to standard errors. Conclusions Understanding the conceptual differences between mixed and marginal beta regression models is important for their proper use in the analysis of longitudinal HRQL data. Beta regression fits the typical distribution of HRQL data better than linear mixed models, however, if focus is on estimating group mean scores rather than making individual predictions, the two methods might not differ substantially.

  2. Serpentinization of abyssal peridotites from the MARK area, Mid-Atlantic Ridge: Sulfur geochemistry and reaction modeling

    Science.gov (United States)

    Alt, J.C.; Shanks, Wayne C.

    2003-01-01

    The opaque mineralogy and the contents and isotope compositions of sulfur in serpentinized peridotites from the MARK (Mid-Atlantic Ridge, Kane Fracture Zone) area were examined to understand the conditions of serpentinization and evaluate this process as a sink for seawater sulfur. The serpentinites contain a sulfur-rich secondary mineral assemblage and have high sulfur contents (up to 1 wt.%) and elevated ??34Ssulfide (3.7 to 12.7???). Geochemical reaction modeling indicates that seawater-peridotite interaction at 300 to 400??C alone cannot account for both the high sulfur contents and high ??34Ssulfide. These require a multistage reaction with leaching of sulfide from subjacent gabbro during higher temperature (???400??C) reactions with seawater and subsequent deposition of sulfide during serpentinization of peridotite at ???300??C. Serpentinization produces highly reducing conditions and significant amounts of H2 and results in the partial reduction of seawater carbonate to methane. The latter is documented by formation of carbonate veins enriched in 13C (up to 4.5???) at temperatures above 250??C. Although different processes produce variable sulfur isotope effects in other oceanic serpentinites, sulfur is consistently added to abyssal peridotites during serpentinization. Data for serpentinites drilled and dredged from oceanic crust and from ophiolites indicate that oceanic peridotites are a sink for up to 0.4 to 6.0 ?? 1012 g seawater S yr-1. This is comparable to sulfur exchange that occurs in hydrothermal systems in mafic oceanic crust at midocean ridges and on ridge flanks and amounts to 2 to 30% of the riverine sulfate source and sedimentary sulfide sink in the oceans. The high concentrations and modified isotope compositions of sulfur in serpentinites could be important for mantle metasomatism during subduction of crust generated at slow spreading rates. ?? 2003 Elsevier Science Ltd.

  3. A Gompertz regression model for fern spores germination

    Directory of Open Access Journals (Sweden)

    Gabriel y Galán, Jose María

    2015-06-01

    Full Text Available Germination is one of the most important biological processes for both seed and spore plants, also for fungi. At present, mathematical models of germination have been developed in fungi, bryophytes and several plant species. However, ferns are the only group whose germination has never been modelled. In this work we develop a regression model of the germination of fern spores. We have found that for Blechnum serrulatum, Blechnum yungense, Cheilanthes pilosa, Niphidium macbridei and Polypodium feuillei species the Gompertz growth model describe satisfactorily cumulative germination. An important result is that regression parameters are independent of fern species and the model is not affected by intraspecific variation. Our results show that the Gompertz curve represents a general germination model for all the non-green spore leptosporangiate ferns, including in the paper a discussion about the physiological and ecological meaning of the model.La germinación es uno de los procesos biológicos más relevantes tanto para las plantas con esporas, como para las plantas con semillas y los hongos. Hasta el momento, se han desarrollado modelos de germinación para hongos, briofitos y diversas especies de espermatófitos. Los helechos son el único grupo de plantas cuya germinación nunca ha sido modelizada. En este trabajo se desarrolla un modelo de regresión para explicar la germinación de las esporas de helechos. Observamos que para las especies Blechnum serrulatum, Blechnum yungense, Cheilanthes pilosa, Niphidium macbridei y Polypodium feuillei el modelo de crecimiento de Gompertz describe satisfactoriamente la germinación acumulativa. Un importante resultado es que los parámetros de la regresión son independientes de la especie y que el modelo no está afectado por variación intraespecífica. Por lo tanto, los resultados del trabajo muestran que la curva de Gompertz puede representar un modelo general para todos los helechos leptosporangiados

  4. Generic global regression models for growth prediction of Salmonella in ground pork and pork cuts

    DEFF Research Database (Denmark)

    Buschhardt, Tasja; Hansen, Tina Beck; Bahl, Martin Iain

    2017-01-01

    Introduction and Objectives Models for the prediction of bacterial growth in fresh pork are primarily developed using two-step regression (i.e. primary models followed by secondary models). These models are also generally based on experiments in liquids or ground meat and neglect surface growth....... It has been shown that one-step global regressions can result in more accurate models and that bacterial growth on intact surfaces can substantially differ from growth in liquid culture. Material and Methods We used a global-regression approach to develop predictive models for the growth of Salmonella....... One part of obtained logtransformed cell counts was used for model development and another for model validation. The Ratkowsky square root model and the relative lag time (RLT) model were integrated into the logistic model with delay. Fitted parameter estimates were compared to investigate the effect...

  5. Application of multilinear regression analysis in modeling of soil ...

    African Journals Online (AJOL)

    The application of Multi-Linear Regression Analysis (MLRA) model for predicting soil properties in Calabar South offers a technical guide and solution in foundation designs problems in the area. Forty-five soil samples were collected from fifteen different boreholes at a different depth and 270 tests were carried out for CBR, ...

  6. Glacial modulation of mid-ocean ridge magmatism and anomalous Pacific Antarctic Ridge volcanism during Termination II

    Science.gov (United States)

    Asimow, P. D.; Lewis, M.; Lund, D. C.; Seeley, E.; McCart, S.; Mudahy, A.

    2017-12-01

    Glacially-driven sea level rise and fall may modulate submarine volcanism by superposing pressure changes on the tectonic decompression that causes melt production in the mantle below mid-ocean ridges. A number of recent studies have considered whether this effect is recorded in the periodicity of ridge flank bathymetry (Tolstoy, 2015; Crowley et al., 2015) but interpretation of the bathymetric data remains controversial (Goff, 2016; Olive et al., 2016). We have pursued an independent approach using hydrothermal metals in well-dated near-ridge sediment cores. Along the full length of the East Pacific Rise, in areas of the ocean with widely variable biologic productivity, there are large and consistent rises in Fe, Mn, and As concentrations during the last two glacial terminations. We interpret these cores as records of excess hydrothermal flux due to delayed delivery to the axis of excess melt generated by the preceding falls in sea level. Here we discuss the potentially related discovery, in a core near the Pacific Antarctic Ridge (PAR), of a 10 cm thick layer of basaltic ash shards up to 250 mm in size, coincident with the penultimate deglaciation (Termination II). Although the site was 8 km off-axis at the time, the glasses have major element, volatile, and trace element composition consistent with more evolved members of the axial MORB suite from the nearby ridge axis. Their morphologies are typical of pyroclastic deposits created by explosive submarine volcanism (Clague et al., 2009). We propose that a period of low magmatic flux following a sea-level rise caused cooling of crustal magmatic systems, more advanced fractionation in the axial magma chamber, and increases in viscosity and volatile concentration. We hypothesize subsequent arrival of high magmatic flux during Termination II then reactivated the system and triggered an unusually vigorous series of explosive eruptions along this segment of the PAR. Ash layers recording large eruptions such as this one

  7. Electromagnetic survey of the K1070A burial ground at the Oak Ridge K-25 Site, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    Nyquist, J.E.; Emery, M.S.

    1993-01-01

    The K1070A burial ground, located at the K-25 Site on the Oak Ridge Reservation, received chemical and radioactive wastes from the late 1940s until 1975. Analysis of water samples collected from nearby monitoring wells indicates that contamination is migrating offsite. In November 1991, Oak Ridge National Laboratory (ORNL) personnel collected high-resolution electrical terrain conductivity data at the K1070A burial ground. A Model EM31 terrain conductivity meter manufactured by Geonics Limited was used in conjunction with the ORNL-developed Ultrasonic Ranging and Data System (USRADS) to perform the survey. The purposeof the survey was to provide Environmental Restoration (ER) staff with a detailed map of the spatial variation of the apparent electrical conductivity of the shallow subsurface (upper 3 m) to assist them in siting future monitoring wells closer to the waste area without drilling into the buried waste

  8. EMD-regression for modelling multi-scale relationships, and application to weather-related cardiovascular mortality

    Science.gov (United States)

    Masselot, Pierre; Chebana, Fateh; Bélanger, Diane; St-Hilaire, André; Abdous, Belkacem; Gosselin, Pierre; Ouarda, Taha B. M. J.

    2018-01-01

    In a number of environmental studies, relationships between natural processes are often assessed through regression analyses, using time series data. Such data are often multi-scale and non-stationary, leading to a poor accuracy of the resulting regression models and therefore to results with moderate reliability. To deal with this issue, the present paper introduces the EMD-regression methodology consisting in applying the empirical mode decomposition (EMD) algorithm on data series and then using the resulting components in regression models. The proposed methodology presents a number of advantages. First, it accounts of the issues of non-stationarity associated to the data series. Second, this approach acts as a scan for the relationship between a response variable and the predictors at different time scales, providing new insights about this relationship. To illustrate the proposed methodology it is applied to study the relationship between weather and cardiovascular mortality in Montreal, Canada. The results shed new knowledge concerning the studied relationship. For instance, they show that the humidity can cause excess mortality at the monthly time scale, which is a scale not visible in classical models. A comparison is also conducted with state of the art methods which are the generalized additive models and distributed lag models, both widely used in weather-related health studies. The comparison shows that EMD-regression achieves better prediction performances and provides more details than classical models concerning the relationship.

  9. Structure and isostatic compensation of the Comorin Ridge, north central Indian Ocean

    Digital Repository Service at National Institute of Oceanography (India)

    Sreejith, K.M.; Krishna, K.S.; Bansal, A.R.

    of the ridge. Additionally, ETOPO-5 bathymetric data (3-D image) are used to study the mor- phology of the ridge. Sediment isopachs of this region published by Kahle et al. (1981) are also considered for 2-D forward modelling of gravity anomalies to constrain... admittance. In this method, bathymetry and gravity data are divided into overlapping segments and each of them is detrended, windowed using Hanning window and discrete Fourier transformed. Two more additional parameters, coherence and phase of admittance...

  10. The Application of Classical and Neural Regression Models for the Valuation of Residential Real Estate

    Directory of Open Access Journals (Sweden)

    Mach Łukasz

    2017-06-01

    Full Text Available The research process aimed at building regression models, which helps to valuate residential real estate, is presented in the following article. Two widely used computational tools i.e. the classical multiple regression and regression models of artificial neural networks were used in order to build models. An attempt to define the utilitarian usefulness of the above-mentioned tools and comparative analysis of them is the aim of the conducted research. Data used for conducting analyses refers to the secondary transactional residential real estate market.

  11. A review of a priori regression models for warfarin maintenance dose prediction.

    Directory of Open Access Journals (Sweden)

    Ben Francis

    Full Text Available A number of a priori warfarin dosing algorithms, derived using linear regression methods, have been proposed. Although these dosing algorithms may have been validated using patients derived from the same centre, rarely have they been validated using a patient cohort recruited from another centre. In order to undertake external validation, two cohorts were utilised. One cohort formed by patients from a prospective trial and the second formed by patients in the control arm of the EU-PACT trial. Of these, 641 patients were identified as having attained stable dosing and formed the dataset used for validation. Predicted maintenance doses from six criterion fulfilling regression models were then compared to individual patient stable warfarin dose. Predictive ability was assessed with reference to several statistics including the R-square and mean absolute error. The six regression models explained different amounts of variability in the stable maintenance warfarin dose requirements of the patients in the two validation cohorts; adjusted R-squared values ranged from 24.2% to 68.6%. An overview of the summary statistics demonstrated that no one dosing algorithm could be considered optimal. The larger validation cohort from the prospective trial produced more consistent statistics across the six dosing algorithms. The study found that all the regression models performed worse in the validation cohort when compared to the derivation cohort. Further, there was little difference between regression models that contained pharmacogenetic coefficients and algorithms containing just non-pharmacogenetic coefficients. The inconsistency of results between the validation cohorts suggests that unaccounted population specific factors cause variability in dosing algorithm performance. Better methods for dosing that take into account inter- and intra-individual variability, at the initiation and maintenance phases of warfarin treatment, are needed.

  12. A review of a priori regression models for warfarin maintenance dose prediction.

    Science.gov (United States)

    Francis, Ben; Lane, Steven; Pirmohamed, Munir; Jorgensen, Andrea

    2014-01-01

    A number of a priori warfarin dosing algorithms, derived using linear regression methods, have been proposed. Although these dosing algorithms may have been validated using patients derived from the same centre, rarely have they been validated using a patient cohort recruited from another centre. In order to undertake external validation, two cohorts were utilised. One cohort formed by patients from a prospective trial and the second formed by patients in the control arm of the EU-PACT trial. Of these, 641 patients were identified as having attained stable dosing and formed the dataset used for validation. Predicted maintenance doses from six criterion fulfilling regression models were then compared to individual patient stable warfarin dose. Predictive ability was assessed with reference to several statistics including the R-square and mean absolute error. The six regression models explained different amounts of variability in the stable maintenance warfarin dose requirements of the patients in the two validation cohorts; adjusted R-squared values ranged from 24.2% to 68.6%. An overview of the summary statistics demonstrated that no one dosing algorithm could be considered optimal. The larger validation cohort from the prospective trial produced more consistent statistics across the six dosing algorithms. The study found that all the regression models performed worse in the validation cohort when compared to the derivation cohort. Further, there was little difference between regression models that contained pharmacogenetic coefficients and algorithms containing just non-pharmacogenetic coefficients. The inconsistency of results between the validation cohorts suggests that unaccounted population specific factors cause variability in dosing algorithm performance. Better methods for dosing that take into account inter- and intra-individual variability, at the initiation and maintenance phases of warfarin treatment, are needed.

  13. Modeling of chemical exergy of agricultural biomass using improved general regression neural network

    International Nuclear Information System (INIS)

    Huang, Y.W.; Chen, M.Q.; Li, Y.; Guo, J.

    2016-01-01

    A comprehensive evaluation for energy potential contained in agricultural biomass was a vital step for energy utilization of agricultural biomass. The chemical exergy of typical agricultural biomass was evaluated based on the second law of thermodynamics. The chemical exergy was significantly influenced by C and O elements rather than H element. The standard entropy of the samples also was examined based on their element compositions. Two predicted models of the chemical exergy were developed, which referred to a general regression neural network model based upon the element composition, and a linear model based upon the high heat value. An auto-refinement algorithm was firstly developed to improve the performance of regression neural network model. The developed general regression neural network model with K-fold cross-validation had a better ability for predicting the chemical exergy than the linear model, which had lower predicted errors (±1.5%). - Highlights: • Chemical exergies of agricultural biomass were evaluated based upon fifty samples. • Values for the standard entropy of agricultural biomass samples were calculated. • A linear relationship between chemical exergy and HHV of samples was detected. • An improved GRNN prediction model for the chemical exergy of biomass was developed.

  14. Pulmonary lobe segmentation based on ridge surface sampling and shape model fitting

    Energy Technology Data Exchange (ETDEWEB)

    Ross, James C., E-mail: jross@bwh.harvard.edu [Channing Laboratory, Brigham and Women' s Hospital, Boston, Massachusetts 02215 (United States); Surgical Planning Lab, Brigham and Women' s Hospital, Boston, Massachusetts 02215 (United States); Laboratory of Mathematics in Imaging, Brigham and Women' s Hospital, Boston, Massachusetts 02126 (United States); Kindlmann, Gordon L. [Computer Science Department and Computation Institute, University of Chicago, Chicago, Illinois 60637 (United States); Okajima, Yuka; Hatabu, Hiroto [Department of Radiology, Brigham and Women' s Hospital, Boston, Massachusetts 02215 (United States); Díaz, Alejandro A. [Pulmonary and Critical Care Division, Brigham and Women' s Hospital and Harvard Medical School, Boston, Massachusetts 02215 and Department of Pulmonary Diseases, Pontificia Universidad Católica de Chile, Santiago (Chile); Silverman, Edwin K. [Channing Laboratory, Brigham and Women' s Hospital, Boston, Massachusetts 02215 and Pulmonary and Critical Care Division, Brigham and Women' s Hospital and Harvard Medical School, Boston, Massachusetts 02215 (United States); Washko, George R. [Pulmonary and Critical Care Division, Brigham and Women' s Hospital and Harvard Medical School, Boston, Massachusetts 02215 (United States); Dy, Jennifer [ECE Department, Northeastern University, Boston, Massachusetts 02115 (United States); Estépar, Raúl San José [Department of Radiology, Brigham and Women' s Hospital, Boston, Massachusetts 02215 (United States); Surgical Planning Lab, Brigham and Women' s Hospital, Boston, Massachusetts 02215 (United States); Laboratory of Mathematics in Imaging, Brigham and Women' s Hospital, Boston, Massachusetts 02126 (United States)

    2013-12-15

    Purpose: Performing lobe-based quantitative analysis of the lung in computed tomography (CT) scans can assist in efforts to better characterize complex diseases such as chronic obstructive pulmonary disease (COPD). While airways and vessels can help to indicate the location of lobe boundaries, segmentations of these structures are not always available, so methods to define the lobes in the absence of these structures are desirable. Methods: The authors present a fully automatic lung lobe segmentation algorithm that is effective in volumetric inspiratory and expiratory computed tomography (CT) datasets. The authors rely on ridge surface image features indicating fissure locations and a novel approach to modeling shape variation in the surfaces defining the lobe boundaries. The authors employ a particle system that efficiently samples ridge surfaces in the image domain and provides a set of candidate fissure locations based on the Hessian matrix. Following this, lobe boundary shape models generated from principal component analysis (PCA) are fit to the particles data to discriminate between fissure and nonfissure candidates. The resulting set of particle points are used to fit thin plate spline (TPS) interpolating surfaces to form the final boundaries between the lung lobes. Results: The authors tested algorithm performance on 50 inspiratory and 50 expiratory CT scans taken from the COPDGene study. Results indicate that the authors' algorithm performs comparably to pulmonologist-generated lung lobe segmentations and can produce good results in cases with accessory fissures, incomplete fissures, advanced emphysema, and low dose acquisition protocols. Dice scores indicate that only 29 out of 500 (5.85%) lobes showed Dice scores lower than 0.9. Two different approaches for evaluating lobe boundary surface discrepancies were applied and indicate that algorithm boundary identification is most accurate in the vicinity of fissures detectable on CT. Conclusions: The

  15. Pulmonary lobe segmentation based on ridge surface sampling and shape model fitting

    International Nuclear Information System (INIS)

    Ross, James C.; Kindlmann, Gordon L.; Okajima, Yuka; Hatabu, Hiroto; Díaz, Alejandro A.; Silverman, Edwin K.; Washko, George R.; Dy, Jennifer; Estépar, Raúl San José

    2013-01-01

    Purpose: Performing lobe-based quantitative analysis of the lung in computed tomography (CT) scans can assist in efforts to better characterize complex diseases such as chronic obstructive pulmonary disease (COPD). While airways and vessels can help to indicate the location of lobe boundaries, segmentations of these structures are not always available, so methods to define the lobes in the absence of these structures are desirable. Methods: The authors present a fully automatic lung lobe segmentation algorithm that is effective in volumetric inspiratory and expiratory computed tomography (CT) datasets. The authors rely on ridge surface image features indicating fissure locations and a novel approach to modeling shape variation in the surfaces defining the lobe boundaries. The authors employ a particle system that efficiently samples ridge surfaces in the image domain and provides a set of candidate fissure locations based on the Hessian matrix. Following this, lobe boundary shape models generated from principal component analysis (PCA) are fit to the particles data to discriminate between fissure and nonfissure candidates. The resulting set of particle points are used to fit thin plate spline (TPS) interpolating surfaces to form the final boundaries between the lung lobes. Results: The authors tested algorithm performance on 50 inspiratory and 50 expiratory CT scans taken from the COPDGene study. Results indicate that the authors' algorithm performs comparably to pulmonologist-generated lung lobe segmentations and can produce good results in cases with accessory fissures, incomplete fissures, advanced emphysema, and low dose acquisition protocols. Dice scores indicate that only 29 out of 500 (5.85%) lobes showed Dice scores lower than 0.9. Two different approaches for evaluating lobe boundary surface discrepancies were applied and indicate that algorithm boundary identification is most accurate in the vicinity of fissures detectable on CT. Conclusions: The proposed

  16. Binary Logistic Regression Versus Boosted Regression Trees in Assessing Landslide Susceptibility for Multiple-Occurring Regional Landslide Events: Application to the 2009 Storm Event in Messina (Sicily, southern Italy).

    Science.gov (United States)

    Lombardo, L.; Cama, M.; Maerker, M.; Parisi, L.; Rotigliano, E.

    2014-12-01

    This study aims at comparing the performances of Binary Logistic Regression (BLR) and Boosted Regression Trees (BRT) methods in assessing landslide susceptibility for multiple-occurrence regional landslide events within the Mediterranean region. A test area was selected in the north-eastern sector of Sicily (southern Italy), corresponding to the catchments of the Briga and the Giampilieri streams both stretching for few kilometres from the Peloritan ridge (eastern Sicily, Italy) to the Ionian sea. This area was struck on the 1st October 2009 by an extreme climatic event resulting in thousands of rapid shallow landslides, mainly of debris flows and debris avalanches types involving the weathered layer of a low to high grade metamorphic bedrock. Exploiting the same set of predictors and the 2009 landslide archive, BLR- and BRT-based susceptibility models were obtained for the two catchments separately, adopting a random partition (RP) technique for validation; besides, the models trained in one of the two catchments (Briga) were tested in predicting the landslide distribution in the other (Giampilieri), adopting a spatial partition (SP) based validation procedure. All the validation procedures were based on multi-folds tests so to evaluate and compare the reliability of the fitting, the prediction skill, the coherence in the predictor selection and the precision of the susceptibility estimates. All the obtained models for the two methods produced very high predictive performances, with a general congruence between BLR and BRT in the predictor importance. In particular, the research highlighted that BRT-models reached a higher prediction performance with respect to BLR-models, for RP based modelling, whilst for the SP-based models the difference in predictive skills between the two methods dropped drastically, converging to an analogous excellent performance. However, when looking at the precision of the probability estimates, BLR demonstrated to produce more robust

  17. New robust statistical procedures for the polytomous logistic regression models.

    Science.gov (United States)

    Castilla, Elena; Ghosh, Abhik; Martin, Nirian; Pardo, Leandro

    2018-05-17

    This article derives a new family of estimators, namely the minimum density power divergence estimators, as a robust generalization of the maximum likelihood estimator for the polytomous logistic regression model. Based on these estimators, a family of Wald-type test statistics for linear hypotheses is introduced. Robustness properties of both the proposed estimators and the test statistics are theoretically studied through the classical influence function analysis. Appropriate real life examples are presented to justify the requirement of suitable robust statistical procedures in place of the likelihood based inference for the polytomous logistic regression model. The validity of the theoretical results established in the article are further confirmed empirically through suitable simulation studies. Finally, an approach for the data-driven selection of the robustness tuning parameter is proposed with empirical justifications. © 2018, The International Biometric Society.

  18. Electricity demand loads modeling using AutoRegressive Moving Average (ARMA) models

    Energy Technology Data Exchange (ETDEWEB)

    Pappas, S.S. [Department of Information and Communication Systems Engineering, University of the Aegean, Karlovassi, 83 200 Samos (Greece); Ekonomou, L.; Chatzarakis, G.E. [Department of Electrical Engineering Educators, ASPETE - School of Pedagogical and Technological Education, N. Heraklion, 141 21 Athens (Greece); Karamousantas, D.C. [Technological Educational Institute of Kalamata, Antikalamos, 24100 Kalamata (Greece); Katsikas, S.K. [Department of Technology Education and Digital Systems, University of Piraeus, 150 Androutsou Srt., 18 532 Piraeus (Greece); Liatsis, P. [Division of Electrical Electronic and Information Engineering, School of Engineering and Mathematical Sciences, Information and Biomedical Engineering Centre, City University, Northampton Square, London EC1V 0HB (United Kingdom)

    2008-09-15

    This study addresses the problem of modeling the electricity demand loads in Greece. The provided actual load data is deseasonilized and an AutoRegressive Moving Average (ARMA) model is fitted on the data off-line, using the Akaike Corrected Information Criterion (AICC). The developed model fits the data in a successful manner. Difficulties occur when the provided data includes noise or errors and also when an on-line/adaptive modeling is required. In both cases and under the assumption that the provided data can be represented by an ARMA model, simultaneous order and parameter estimation of ARMA models under the presence of noise are performed. The produced results indicate that the proposed method, which is based on the multi-model partitioning theory, tackles successfully the studied problem. For validation purposes the produced results are compared with three other established order selection criteria, namely AICC, Akaike's Information Criterion (AIC) and Schwarz's Bayesian Information Criterion (BIC). The developed model could be useful in the studies that concern electricity consumption and electricity prices forecasts. (author)

  19. Aerodynamic roughness length related to non-aggregated tillage ridges

    Directory of Open Access Journals (Sweden)

    M. Kardous

    2005-11-01

    Full Text Available Wind erosion in agricultural soils is dependent, in part, on the aerodynamic roughness length (z0 produced by tillage ridges. Although previous studies have related z0 to ridge characteristics (ridge height (RH and spacing (RS, these relationships have not been tested for tillage ridges observed in the North African agricultural fields. In these regions, due to climate and soil conditions, small plowing tools are largely used. Most of these tools produce non-aggregated and closely-spaced small ridges. Thus, experiments were conducted in a 7-m long wind tunnel to measure z0 for 11 ridge types covering the range of geometric characteristics frequently observed in south Tunisia. Experimental results suggest that RH2/RS is the first order parameter controlling z0. A strong relationship between z0 and RH2/RS is proposed for a wide range of ridge characteristics.

  20. Analysis of extreme drinking in patients with alcohol dependence using Pareto regression.

    Science.gov (United States)

    Das, Sourish; Harel, Ofer; Dey, Dipak K; Covault, Jonathan; Kranzler, Henry R

    2010-05-20

    We developed a novel Pareto regression model with an unknown shape parameter to analyze extreme drinking in patients with Alcohol Dependence (AD). We used the generalized linear model (GLM) framework and the log-link to include the covariate information through the scale parameter of the generalized Pareto distribution. We proposed a Bayesian method based on Ridge prior and Zellner's g-prior for the regression coefficients. Simulation study indicated that the proposed Bayesian method performs better than the existing likelihood-based inference for the Pareto regression.We examined two issues of importance in the study of AD. First, we tested whether a single nucleotide polymorphism within GABRA2 gene, which encodes a subunit of the GABA(A) receptor, and that has been associated with AD, influences 'extreme' alcohol intake and second, the efficacy of three psychotherapies for alcoholism in treating extreme drinking behavior. We found an association between extreme drinking behavior and GABRA2. We also found that, at baseline, men with a high-risk GABRA2 allele had a significantly higher probability of extreme drinking than men with no high-risk allele. However, men with a high-risk allele responded to the therapy better than those with two copies of the low-risk allele. Women with high-risk alleles also responded to the therapy better than those with two copies of the low-risk allele, while women who received the cognitive behavioral therapy had better outcomes than those receiving either of the other two therapies. Among men, motivational enhancement therapy was the best for the treatment of the extreme drinking behavior. Copyright 2010 John Wiley & Sons, Ltd.

  1. Ridge Regression: A tool to forecast wheat area and production

    Directory of Open Access Journals (Sweden)

    Nasir Jamal

    2007-07-01

    Full Text Available This research study is designed to develop forecasting models for acreage and production of wheat crop for Chakwal district of Rawalpindi region keeping in view the assumptions of OLS estimation. The forecasting models are developed on the basis of 15 years data from 1984-85 to 1998-99 then wheat area and production for next five years from 1999-2000 to 2003-04 is forecasted through the models and compared with the actual figures. After evaluating the accuracy of the models, final models are developed on the basis of 20 years data for the period 1984-85 to 2003-04. These linear models can be used to forecast wheat area and production of next five years. The Urea fertilizer, DAP fertilizer and manures plays a significant role to enhance the production of wheat crop. Number of ploughs in the wheat fields is significant factor to increase the production of wheat crop. Good rains in the month of October and November significantly contributes to increase the production of wheat crop and mean maximum temperature in the month of March is a significant factor to reduce the production of wheat crop.

  2. Analysis of the influence of quantile regression model on mainland tourists' service satisfaction performance.

    Science.gov (United States)

    Wang, Wen-Cheng; Cho, Wen-Chien; Chen, Yin-Jen

    2014-01-01

    It is estimated that mainland Chinese tourists travelling to Taiwan can bring annual revenues of 400 billion NTD to the Taiwan economy. Thus, how the Taiwanese Government formulates relevant measures to satisfy both sides is the focus of most concern. Taiwan must improve the facilities and service quality of its tourism industry so as to attract more mainland tourists. This paper conducted a questionnaire survey of mainland tourists and used grey relational analysis in grey mathematics to analyze the satisfaction performance of all satisfaction question items. The first eight satisfaction items were used as independent variables, and the overall satisfaction performance was used as a dependent variable for quantile regression model analysis to discuss the relationship between the dependent variable under different quantiles and independent variables. Finally, this study further discussed the predictive accuracy of the least mean regression model and each quantile regression model, as a reference for research personnel. The analysis results showed that other variables could also affect the overall satisfaction performance of mainland tourists, in addition to occupation and age. The overall predictive accuracy of quantile regression model Q0.25 was higher than that of the other three models.

  3. Analysis of the Influence of Quantile Regression Model on Mainland Tourists' Service Satisfaction Performance

    Science.gov (United States)

    Wang, Wen-Cheng; Cho, Wen-Chien; Chen, Yin-Jen

    2014-01-01

    It is estimated that mainland Chinese tourists travelling to Taiwan can bring annual revenues of 400 billion NTD to the Taiwan economy. Thus, how the Taiwanese Government formulates relevant measures to satisfy both sides is the focus of most concern. Taiwan must improve the facilities and service quality of its tourism industry so as to attract more mainland tourists. This paper conducted a questionnaire survey of mainland tourists and used grey relational analysis in grey mathematics to analyze the satisfaction performance of all satisfaction question items. The first eight satisfaction items were used as independent variables, and the overall satisfaction performance was used as a dependent variable for quantile regression model analysis to discuss the relationship between the dependent variable under different quantiles and independent variables. Finally, this study further discussed the predictive accuracy of the least mean regression model and each quantile regression model, as a reference for research personnel. The analysis results showed that other variables could also affect the overall satisfaction performance of mainland tourists, in addition to occupation and age. The overall predictive accuracy of quantile regression model Q0.25 was higher than that of the other three models. PMID:24574916

  4. Analysis of the Influence of Quantile Regression Model on Mainland Tourists’ Service Satisfaction Performance

    Directory of Open Access Journals (Sweden)

    Wen-Cheng Wang

    2014-01-01

    Full Text Available It is estimated that mainland Chinese tourists travelling to Taiwan can bring annual revenues of 400 billion NTD to the Taiwan economy. Thus, how the Taiwanese Government formulates relevant measures to satisfy both sides is the focus of most concern. Taiwan must improve the facilities and service quality of its tourism industry so as to attract more mainland tourists. This paper conducted a questionnaire survey of mainland tourists and used grey relational analysis in grey mathematics to analyze the satisfaction performance of all satisfaction question items. The first eight satisfaction items were used as independent variables, and the overall satisfaction performance was used as a dependent variable for quantile regression model analysis to discuss the relationship between the dependent variable under different quantiles and independent variables. Finally, this study further discussed the predictive accuracy of the least mean regression model and each quantile regression model, as a reference for research personnel. The analysis results showed that other variables could also affect the overall satisfaction performance of mainland tourists, in addition to occupation and age. The overall predictive accuracy of quantile regression model Q0.25 was higher than that of the other three models.

  5. A binary logistic regression model with complex sampling design of ...

    African Journals Online (AJOL)

    2017-09-03

    Sep 3, 2017 ... Bi-variable and multi-variable binary logistic regression model with complex sampling design was fitted. .... Data was entered into STATA-12 and analyzed using. SPSS-21. .... lack of access/too far or costs too much. 35. 1.2.

  6. truncSP: An R Package for Estimation of Semi-Parametric Truncated Linear Regression Models

    Directory of Open Access Journals (Sweden)

    Maria Karlsson

    2014-05-01

    Full Text Available Problems with truncated data occur in many areas, complicating estimation and inference. Regarding linear regression models, the ordinary least squares estimator is inconsistent and biased for these types of data and is therefore unsuitable for use. Alternative estimators, designed for the estimation of truncated regression models, have been developed. This paper presents the R package truncSP. The package contains functions for the estimation of semi-parametric truncated linear regression models using three different estimators: the symmetrically trimmed least squares, quadratic mode, and left truncated estimators, all of which have been shown to have good asymptotic and ?nite sample properties. The package also provides functions for the analysis of the estimated models. Data from the environmental sciences are used to illustrate the functions in the package.

  7. Calendar year 1995 groundwater quality report for the Chestnut Ridge Hydrogeological Regime, Oak Ridge Y-12 Plant, Oak Ridge, Tennessee. 1995 Groundwater quality data and calculated rate of contaminant migration

    International Nuclear Information System (INIS)

    1996-02-01

    This annual groundwater quality report (GWQR) contains groundwater quality data obtained during the 1995 calendar year (CY) at several hazardous and nonhazardous waste management facilities associated with the U.S. Department of Energy (DOE) Y-12 Plant located on the DOE Oak Ridge Reservation (ORR) southeast of Oak Ridge, Tennessee. These sites are located south of the Y-12 Plant in the Chestnut Ridge Hydrogeologic Regime (Chestnut Ridge Regime), which is one of three regimes defined for the purposes of groundwater quality monitoring at the Y-12 Plant. The Environmental Management Department of the Y-12 Plant Health, Safety, Environment, and Accountability (HSEA) Organization manages the groundwater monitoring activities in each regime as part of the Y-12 Plant Groundwater Protection Program (GWPP). The U.S. Environmental Protection Agency (EPA) identification number for the Y-12 Plant is TN

  8. Applied linear regression

    CERN Document Server

    Weisberg, Sanford

    2013-01-01

    Praise for the Third Edition ""...this is an excellent book which could easily be used as a course text...""-International Statistical Institute The Fourth Edition of Applied Linear Regression provides a thorough update of the basic theory and methodology of linear regression modeling. Demonstrating the practical applications of linear regression analysis techniques, the Fourth Edition uses interesting, real-world exercises and examples. Stressing central concepts such as model building, understanding parameters, assessing fit and reliability, and drawing conclusions, the new edition illus

  9. A Case Report of Ridge Augmentation using Onlay Interpositional Graft: An Approach to Improve Prosthetic Prognosis of a Deficit Ridge

    Directory of Open Access Journals (Sweden)

    Devanand Shetty

    2014-01-01

    Full Text Available Background: Periodontal therapy has developed beyond the scope of the treatment of periodontal pathoses. Periodontal plastic surgery consists of the reconstructive procedures designed to enhance the both function and esthetics. Deficient ridges pose a severe problem to the restorative dentist in restoring the natural form, function and esthetics of the prosthesis replacing the natural dentition. Depending upon the severity, location of these defects and the prosthetic option chosen, hard and soft tissue ridge augmentation or non-surgical approach or a combination may help to address them. The present clinical report describes a soft tissue ridge augmentation of a localized ridge defect in maxillary aesthetic region using onlay interpositional graft followed by fixed partial denture.

  10. Geophysical Investigation of Upper Mantle Anomalies of the Australian-Antarctic Ridge

    Science.gov (United States)

    Park, S. H.; Choi, H.; Kim, S. S.; Lin, J.

    2017-12-01

    Australian-Antarctic Ridge (AAR) is situated between the Pacific-Antarctic Ridge (PAR) and Southeast Indian Ridge (SEIR), extending eastward from the Australian-Antarctic Discordance (AAD). Much of the AAR has been remained uncharted until 2011 because of its remoteness and harsh weather conditions. Since 2011, four multidisciplinary expeditions initiated by the Korea Polar Research Institute (KOPRI) have surveyed the little-explored eastern ends of the AAR and investigated the tectonics, geochemistry, and hydrothermal activity of this intermediate spreading system. Recent isotope studies using the new basalt samples from the AAR have led to the new hypothesis of the Southern Ocean mantle domain (SOM), which may have originated from the super-plume activity associated with the Gondwana break-up. In this study, we characterize the geophysics of the Southern Ocean mantle using the newly acquired shipboard bathymetry and available geophysical datasets. First, we computed residual mantle Bouguer gravity anomalies (RMBA), gravity-derived crustal thickness, and residual topography along the AAR in order to obtain a geological proxy for regional variations in magma supply. The results of these analyses revealed that the southern flank of the AAR is associated with shallower seafloor, more negative RMBA, thicker crust, and/or less dense mantle in comparison to the conjugate northern flank. Furthermore, this north-south asymmetry becomes more prominent toward the central ridge segments of the AAR. Interestingly, the along-axis depths of the entire AAR are significantly shallower than the neighboring ridge systems and the global ridges of intermediate spreading rates. Such shallow depths are also correlated with regional negative geoid anomalies. Furthermore, recent mantle tomography models consistently showed that the upper mantle (< 250 km) below the AAR has low S-wave velocities, suggesting that it may be hotter than the nearby ridges. Such regional-scale anomalies of the

  11. Effects of Cocos Ridge Collision on the Western Caribbean: Is there a Panama Block?

    Science.gov (United States)

    Kobayashi, D.; La Femina, P. C.; Geirsson, H.; Chichaco, E.; Abrego M, A. A.; Fisher, D. M.; Camacho, E. I.

    2011-12-01

    It has been recognized that the subduction and collision of the Cocos Ridge, a 2 km high aseismic ridge standing on >20 km thick oceanic crust of the Cocos plate, drives upper plate deformation in southern Central America. Recent studies of Global Positioning System (GPS) derived horizontal velocities relative to the Caribbean Plate showed a radial pattern centered on the Cocos Ridge axis where Cocos-Caribbean convergence is orthogonal, and margin-parallel velocities to the northwest. Models of the full three-dimensional GPS velocity field and earthquake slip vectors demonstrate low mechanical coupling along the Middle America subduction zone in Nicaragua and El Salvador, and a broad zone of high coupling beneath the Osa Peninsula, where the Cocos Ridge intersects the margin. These results suggest that Cocos Ridge collision may be the main driver for trench-parallel motion of the fore arc to the northwest and for uplift and shortening of the outer fore arc in southern Central America, whereby thickened and hence buoyant Cocos Ridge crust acts as an indenter causing the tectonic escape of the fore arc. These studies, however, were not able to constrain well the pattern of surface deformation east-southeast of the ridge axis due to a lack of GPS stations, and Cocos Ridge collision may be responsible for the kinematics and deformation of the proposed Panama block. Recent reinforcement of the GPS network in southeastern Costa Rica and Panama has increased the spatial and temporal resolution of the network and made it possible to further investigate surface deformation of southern Central America and the Panama block. We present a new regional surface velocity field for Central America from geodetic GPS data collected at 11 recently-installed and 178 existing episodic, semi-continuous, and continuous GPS sites in Nicaragua, Costa Rica, and Panama. We investigate the effects of Cocos Ridge collision on the Panama block through kinematic block modeling. Published

  12. Testing for constant nonparametric effects in general semiparametric regression models with interactions

    KAUST Repository

    Wei, Jiawei; Carroll, Raymond J.; Maity, Arnab

    2011-01-01

    We consider the problem of testing for a constant nonparametric effect in a general semi-parametric regression model when there is the potential for interaction between the parametrically and nonparametrically modeled variables. The work

  13. ENVIRONMENTAL BASELINE SURVEY REPORT FOR WEST BLACK OAK RIDGE, EAST BLACK OAK RIDGE, MCKINNEY RIDGE, WEST PINE RIDGE, AND PARCEL 21D IN THE VICINITY OF THE EAST TENNESSEE TECHNOLOGY PARK, OAK RIDGE, TENNESSEE

    Energy Technology Data Exchange (ETDEWEB)

    David A. King

    2012-11-29

    This environmental baseline survey (EBS) report documents the baseline environmental conditions of five land parcels located near the U.S. Department of Energy’s (DOE’s) East Tennessee Technology Park (ETTP), including West Black Oak Ridge, East Black Oak Ridge, McKinney Ridge, West Pine Ridge, and Parcel 21d. The goal is to obtain all media no-further-investigation (NFI) determinations for the subject parcels considering existing soils. To augment the existing soils-only NFI determinations, samples of groundwater, surface water, soil, and sediment were collected to support all media NFI decisions. The only updates presented here are those that were made after the original issuance of the NFI documents. In the subject parcel where the soils NFI determination was not completed for approval (Parcel 21d), the full process has been performed to address the soils as well. Preparation of this report included the detailed search of federal government records, title documents, aerial photos that may reflect prior uses, and visual inspections of the property and adjacent properties. Interviews with current employees involved in, or familiar with, operations on the real property were also conducted to identify any areas on the property where hazardous substances and petroleum products, or their derivatives, and acutely hazardous wastes may have been released or disposed. In addition, a search was made of reasonably obtainable federal, state, and local government records of each adjacent facility where there has been a release of any hazardous substance or any petroleum product or their derivatives, including aviation fuel and motor oil, and which is likely to cause or contribute to a release of any hazardous substance or any petroleum product or its derivatives, including aviation fuel or motor oil, on the real property. A radiological survey and soil/sediment sampling was conducted to assess baseline conditions of Parcel 21d that were not addressed by the soils-only NFI

  14. Building interpretable predictive models for pediatric hospital readmission using Tree-Lasso logistic regression.

    Science.gov (United States)

    Jovanovic, Milos; Radovanovic, Sandro; Vukicevic, Milan; Van Poucke, Sven; Delibasic, Boris

    2016-09-01

    Quantification and early identification of unplanned readmission risk have the potential to improve the quality of care during hospitalization and after discharge. However, high dimensionality, sparsity, and class imbalance of electronic health data and the complexity of risk quantification, challenge the development of accurate predictive models. Predictive models require a certain level of interpretability in order to be applicable in real settings and create actionable insights. This paper aims to develop accurate and interpretable predictive models for readmission in a general pediatric patient population, by integrating a data-driven model (sparse logistic regression) and domain knowledge based on the international classification of diseases 9th-revision clinical modification (ICD-9-CM) hierarchy of diseases. Additionally, we propose a way to quantify the interpretability of a model and inspect the stability of alternative solutions. The analysis was conducted on >66,000 pediatric hospital discharge records from California, State Inpatient Databases, Healthcare Cost and Utilization Project between 2009 and 2011. We incorporated domain knowledge based on the ICD-9-CM hierarchy in a data driven, Tree-Lasso regularized logistic regression model, providing the framework for model interpretation. This approach was compared with traditional Lasso logistic regression resulting in models that are easier to interpret by fewer high-level diagnoses, with comparable prediction accuracy. The results revealed that the use of a Tree-Lasso model was as competitive in terms of accuracy (measured by area under the receiver operating characteristic curve-AUC) as the traditional Lasso logistic regression, but integration with the ICD-9-CM hierarchy of diseases provided more interpretable models in terms of high-level diagnoses. Additionally, interpretations of models are in accordance with existing medical understanding of pediatric readmission. Best performing models have

  15. Multiple regression analysis in modelling of carbon dioxide emissions by energy consumption use in Malaysia

    Science.gov (United States)

    Keat, Sim Chong; Chun, Beh Boon; San, Lim Hwee; Jafri, Mohd Zubir Mat

    2015-04-01

    Climate change due to carbon dioxide (CO2) emissions is one of the most complex challenges threatening our planet. This issue considered as a great and international concern that primary attributed from different fossil fuels. In this paper, regression model is used for analyzing the causal relationship among CO2 emissions based on the energy consumption in Malaysia using time series data for the period of 1980-2010. The equations were developed using regression model based on the eight major sources that contribute to the CO2 emissions such as non energy, Liquefied Petroleum Gas (LPG), diesel, kerosene, refinery gas, Aviation Turbine Fuel (ATF) and Aviation Gasoline (AV Gas), fuel oil and motor petrol. The related data partly used for predict the regression model (1980-2000) and partly used for validate the regression model (2001-2010). The results of the prediction model with the measured data showed a high correlation coefficient (R2=0.9544), indicating the model's accuracy and efficiency. These results are accurate and can be used in early warning of the population to comply with air quality standards.

  16. Better Autologistic Regression

    Directory of Open Access Journals (Sweden)

    Mark A. Wolters

    2017-11-01

    Full Text Available Autologistic regression is an important probability model for dichotomous random variables observed along with covariate information. It has been used in various fields for analyzing binary data possessing spatial or network structure. The model can be viewed as an extension of the autologistic model (also known as the Ising model, quadratic exponential binary distribution, or Boltzmann machine to include covariates. It can also be viewed as an extension of logistic regression to handle responses that are not independent. Not all authors use exactly the same form of the autologistic regression model. Variations of the model differ in two respects. First, the variable coding—the two numbers used to represent the two possible states of the variables—might differ. Common coding choices are (zero, one and (minus one, plus one. Second, the model might appear in either of two algebraic forms: a standard form, or a recently proposed centered form. Little attention has been paid to the effect of these differences, and the literature shows ambiguity about their importance. It is shown here that changes to either coding or centering in fact produce distinct, non-nested probability models. Theoretical results, numerical studies, and analysis of an ecological data set all show that the differences among the models can be large and practically significant. Understanding the nature of the differences and making appropriate modeling choices can lead to significantly improved autologistic regression analyses. The results strongly suggest that the standard model with plus/minus coding, which we call the symmetric autologistic model, is the most natural choice among the autologistic variants.

  17. Alternative Methods of Regression

    CERN Document Server

    Birkes, David

    2011-01-01

    Of related interest. Nonlinear Regression Analysis and its Applications Douglas M. Bates and Donald G. Watts ".an extraordinary presentation of concepts and methods concerning the use and analysis of nonlinear regression models.highly recommend[ed].for anyone needing to use and/or understand issues concerning the analysis of nonlinear regression models." --Technometrics This book provides a balance between theory and practice supported by extensive displays of instructive geometrical constructs. Numerous in-depth case studies illustrate the use of nonlinear regression analysis--with all data s

  18. Time-adaptive quantile regression

    DEFF Research Database (Denmark)

    Møller, Jan Kloppenborg; Nielsen, Henrik Aalborg; Madsen, Henrik

    2008-01-01

    and an updating procedure are combined into a new algorithm for time-adaptive quantile regression, which generates new solutions on the basis of the old solution, leading to savings in computation time. The suggested algorithm is tested against a static quantile regression model on a data set with wind power......An algorithm for time-adaptive quantile regression is presented. The algorithm is based on the simplex algorithm, and the linear optimization formulation of the quantile regression problem is given. The observations have been split to allow a direct use of the simplex algorithm. The simplex method...... production, where the models combine splines and quantile regression. The comparison indicates superior performance for the time-adaptive quantile regression in all the performance parameters considered....

  19. How plume-ridge interaction shapes the crustal thickness pattern of the Réunion hotspot track

    Science.gov (United States)

    Bredow, Eva; Steinberger, Bernhard; Gassmöller, Rene; Dannberg, Juliane

    2017-08-01

    The Réunion mantle plume has shaped a large area of the Earth's surface over the past 65 million years: from the Deccan Traps in India along the hotspot track comprising the island chains of the Laccadives, Maldives, and Chagos Bank on the Indian plate and the Mascarene Plateau on the African plate up to the currently active volcanism at La Réunion Island. This study addresses the question how the Réunion plume, especially in interaction with the Central Indian Ridge, created the complex crustal thickness pattern of the hotspot track. For this purpose, the mantle convection code ASPECT was used to design three-dimensional numerical models, which consider the specific location of the plume underneath moving plates and surrounded by large-scale mantle flow. The results show the crustal thickness pattern produced by the plume, which altogether agrees well with topographic maps. Especially two features are consistently reproduced by the models: the distinctive gap in the hotspot track between the Maldives and Chagos is created by the combination of the ridge geometry and plume-ridge interaction; and the Rodrigues Ridge, a narrow crustal structure which connects the hotspot track and the Central Indian Ridge, appears as the surface expression of a long-distance sublithospheric flow channel. This study therefore provides further insight how small-scale surface features are generated by the complex interplay between mantle and lithospheric processes.

  20. Construction of risk prediction model of type 2 diabetes mellitus based on logistic regression

    Directory of Open Access Journals (Sweden)

    Li Jian

    2017-01-01

    Full Text Available Objective: to construct multi factor prediction model for the individual risk of T2DM, and to explore new ideas for early warning, prevention and personalized health services for T2DM. Methods: using logistic regression techniques to screen the risk factors for T2DM and construct the risk prediction model of T2DM. Results: Male’s risk prediction model logistic regression equation: logit(P=BMI × 0.735+ vegetables × (−0.671 + age × 0.838+ diastolic pressure × 0.296+ physical activity× (−2.287 + sleep ×(−0.009 +smoking ×0.214; Female’s risk prediction model logistic regression equation: logit(P=BMI ×1.979+ vegetables× (−0.292 + age × 1.355+ diastolic pressure× 0.522+ physical activity × (−2.287 + sleep × (−0.010.The area under the ROC curve of male was 0.83, the sensitivity was 0.72, the specificity was 0.86, the area under the ROC curve of female was 0.84, the sensitivity was 0.75, the specificity was 0.90. Conclusion: This study model data is from a compared study of nested case, the risk prediction model has been established by using the more mature logistic regression techniques, and the model is higher predictive sensitivity, specificity and stability.

  1. THE REGRESSION MODEL OF IRAN LIBRARIES ORGANIZATIONAL CLIMATE

    OpenAIRE

    Jahani, Mohammad Ali; Yaminfirooz, Mousa; Siamian, Hasan

    2015-01-01

    Background: The purpose of this study was to drawing a regression model of organizational climate of central libraries of Iran?s universities. Methods: This study is an applied research. The statistical population of this study consisted of 96 employees of the central libraries of Iran?s public universities selected among the 117 universities affiliated to the Ministry of Health by Stratified Sampling method (510 people). Climate Qual localized questionnaire was used as research tools. For pr...

  2. Harmonic regression of Landsat time series for modeling attributes from national forest inventory data

    Science.gov (United States)

    Wilson, Barry T.; Knight, Joseph F.; McRoberts, Ronald E.

    2018-03-01

    Imagery from the Landsat Program has been used frequently as a source of auxiliary data for modeling land cover, as well as a variety of attributes associated with tree cover. With ready access to all scenes in the archive since 2008 due to the USGS Landsat Data Policy, new approaches to deriving such auxiliary data from dense Landsat time series are required. Several methods have previously been developed for use with finer temporal resolution imagery (e.g. AVHRR and MODIS), including image compositing and harmonic regression using Fourier series. The manuscript presents a study, using Minnesota, USA during the years 2009-2013 as the study area and timeframe. The study examined the relative predictive power of land cover models, in particular those related to tree cover, using predictor variables based solely on composite imagery versus those using estimated harmonic regression coefficients. The study used two common non-parametric modeling approaches (i.e. k-nearest neighbors and random forests) for fitting classification and regression models of multiple attributes measured on USFS Forest Inventory and Analysis plots using all available Landsat imagery for the study area and timeframe. The estimated Fourier coefficients developed by harmonic regression of tasseled cap transformation time series data were shown to be correlated with land cover, including tree cover. Regression models using estimated Fourier coefficients as predictor variables showed a two- to threefold increase in explained variance for a small set of continuous response variables, relative to comparable models using monthly image composites. Similarly, the overall accuracies of classification models using the estimated Fourier coefficients were approximately 10-20 percentage points higher than the models using the image composites, with corresponding individual class accuracies between six and 45 percentage points higher.

  3. An appraisal of convergence failures in the application of logistic regression model in published manuscripts.

    Science.gov (United States)

    Yusuf, O B; Bamgboye, E A; Afolabi, R F; Shodimu, M A

    2014-09-01

    Logistic regression model is widely used in health research for description and predictive purposes. Unfortunately, most researchers are sometimes not aware that the underlying principles of the techniques have failed when the algorithm for maximum likelihood does not converge. Young researchers particularly postgraduate students may not know why separation problem whether quasi or complete occurs, how to identify it and how to fix it. This study was designed to critically evaluate convergence issues in articles that employed logistic regression analysis published in an African Journal of Medicine and medical sciences between 2004 and 2013. Problems of quasi or complete separation were described and were illustrated with the National Demographic and Health Survey dataset. A critical evaluation of articles that employed logistic regression was conducted. A total of 581 articles was reviewed, of which 40 (6.9%) used binary logistic regression. Twenty-four (60.0%) stated the use of logistic regression model in the methodology while none of the articles assessed model fit. Only 3 (12.5%) properly described the procedures. Of the 40 that used the logistic regression model, the problem of convergence occurred in 6 (15.0%) of the articles. Logistic regression tends to be poorly reported in studies published between 2004 and 2013. Our findings showed that the procedure may not be well understood by researchers since very few described the process in their reports and may be totally unaware of the problem of convergence or how to deal with it.

  4. Modeling of the Monthly Rainfall-Runoff Process Through Regressions

    Directory of Open Access Journals (Sweden)

    Campos-Aranda Daniel Francisco

    2014-10-01

    Full Text Available To solve the problems associated with the assessment of water resources of a river, the modeling of the rainfall-runoff process (RRP allows the deduction of runoff missing data and to extend its record, since generally the information available on precipitation is larger. It also enables the estimation of inputs to reservoirs, when their building led to the suppression of the gauging station. The simplest mathematical model that can be set for the RRP is the linear regression or curve on a monthly basis. Such a model is described in detail and is calibrated with the simultaneous record of monthly rainfall and runoff in Ballesmi hydrometric station, which covers 35 years. Since the runoff of this station has an important contribution from the spring discharge, the record is corrected first by removing that contribution. In order to do this a procedure was developed based either on the monthly average regional runoff coefficients or on nearby and similar watershed; in this case the Tancuilín gauging station was used. Both stations belong to the Partial Hydrologic Region No. 26 (Lower Rio Panuco and are located within the state of San Luis Potosi, México. The study performed indicates that the monthly regression model, due to its conceptual approach, faithfully reproduces monthly average runoff volumes and achieves an excellent approximation in relation to the dispersion, proved by calculation of the means and standard deviations.

  5. Genetic evaluation of European quails by random regression models

    Directory of Open Access Journals (Sweden)

    Flaviana Miranda Gonçalves

    2012-09-01

    Full Text Available The objective of this study was to compare different random regression models, defined from different classes of heterogeneity of variance combined with different Legendre polynomial orders for the estimate of (covariance of quails. The data came from 28,076 observations of 4,507 female meat quails of the LF1 lineage. Quail body weights were determined at birth and 1, 14, 21, 28, 35 and 42 days of age. Six different classes of residual variance were fitted to Legendre polynomial functions (orders ranging from 2 to 6 to determine which model had the best fit to describe the (covariance structures as a function of time. According to the evaluated criteria (AIC, BIC and LRT, the model with six classes of residual variances and of sixth-order Legendre polynomial was the best fit. The estimated additive genetic variance increased from birth to 28 days of age, and dropped slightly from 35 to 42 days. The heritability estimates decreased along the growth curve and changed from 0.51 (1 day to 0.16 (42 days. Animal genetic and permanent environmental correlation estimates between weights and age classes were always high and positive, except for birth weight. The sixth order Legendre polynomial, along with the residual variance divided into six classes was the best fit for the growth rate curve of meat quails; therefore, they should be considered for breeding evaluation processes by random regression models.

  6. Significance tests to determine the direction of effects in linear regression models.

    Science.gov (United States)

    Wiedermann, Wolfgang; Hagmann, Michael; von Eye, Alexander

    2015-02-01

    Previous studies have discussed asymmetric interpretations of the Pearson correlation coefficient and have shown that higher moments can be used to decide on the direction of dependence in the bivariate linear regression setting. The current study extends this approach by illustrating that the third moment of regression residuals may also be used to derive conclusions concerning the direction of effects. Assuming non-normally distributed variables, it is shown that the distribution of residuals of the correctly specified regression model (e.g., Y is regressed on X) is more symmetric than the distribution of residuals of the competing model (i.e., X is regressed on Y). Based on this result, 4 one-sample tests are discussed which can be used to decide which variable is more likely to be the response and which one is more likely to be the explanatory variable. A fifth significance test is proposed based on the differences of skewness estimates, which leads to a more direct test of a hypothesis that is compatible with direction of dependence. A Monte Carlo simulation study was performed to examine the behaviour of the procedures under various degrees of associations, sample sizes, and distributional properties of the underlying population. An empirical example is given which illustrates the application of the tests in practice. © 2014 The British Psychological Society.

  7. Weighted functional linear regression models for gene-based association analysis.

    Science.gov (United States)

    Belonogova, Nadezhda M; Svishcheva, Gulnara R; Wilson, James F; Campbell, Harry; Axenovich, Tatiana I

    2018-01-01

    Functional linear regression models are effectively used in gene-based association analysis of complex traits. These models combine information about individual genetic variants, taking into account their positions and reducing the influence of noise and/or observation errors. To increase the power of methods, where several differently informative components are combined, weights are introduced to give the advantage to more informative components. Allele-specific weights have been introduced to collapsing and kernel-based approaches to gene-based association analysis. Here we have for the first time introduced weights to functional linear regression models adapted for both independent and family samples. Using data simulated on the basis of GAW17 genotypes and weights defined by allele frequencies via the beta distribution, we demonstrated that type I errors correspond to declared values and that increasing the weights of causal variants allows the power of functional linear models to be increased. We applied the new method to real data on blood pressure from the ORCADES sample. Five of the six known genes with P models. Moreover, we found an association between diastolic blood pressure and the VMP1 gene (P = 8.18×10-6), when we used a weighted functional model. For this gene, the unweighted functional and weighted kernel-based models had P = 0.004 and 0.006, respectively. The new method has been implemented in the program package FREGAT, which is freely available at https://cran.r-project.org/web/packages/FREGAT/index.html.

  8. Metallogenesis along the Indian Ocean Ridge System

    Digital Repository Service at National Institute of Oceanography (India)

    Banerjee, R.; Ray, Dwijesh

    including India. Among these studies majority were concentrated around the Central Indian Ridge and the Southwest Indian Ridge areas, while a few observations were made around the rest of the areas in the IORS. The findings of these studies are discussed...

  9. Improving the Prediction of Total Surgical Procedure Time Using Linear Regression Modeling

    Directory of Open Access Journals (Sweden)

    Eric R. Edelman

    2017-06-01

    Full Text Available For efficient utilization of operating rooms (ORs, accurate schedules of assigned block time and sequences of patient cases need to be made. The quality of these planning tools is dependent on the accurate prediction of total procedure time (TPT per case. In this paper, we attempt to improve the accuracy of TPT predictions by using linear regression models based on estimated surgeon-controlled time (eSCT and other variables relevant to TPT. We extracted data from a Dutch benchmarking database of all surgeries performed in six academic hospitals in The Netherlands from 2012 till 2016. The final dataset consisted of 79,983 records, describing 199,772 h of total OR time. Potential predictors of TPT that were included in the subsequent analysis were eSCT, patient age, type of operation, American Society of Anesthesiologists (ASA physical status classification, and type of anesthesia used. First, we computed the predicted TPT based on a previously described fixed ratio model for each record, multiplying eSCT by 1.33. This number is based on the research performed by van Veen-Berkx et al., which showed that 33% of SCT is generally a good approximation of anesthesia-controlled time (ACT. We then systematically tested all possible linear regression models to predict TPT using eSCT in combination with the other available independent variables. In addition, all regression models were again tested without eSCT as a predictor to predict ACT separately (which leads to TPT by adding SCT. TPT was most accurately predicted using a linear regression model based on the independent variables eSCT, type of operation, ASA classification, and type of anesthesia. This model performed significantly better than the fixed ratio model and the method of predicting ACT separately. Making use of these more accurate predictions in planning and sequencing algorithms may enable an increase in utilization of ORs, leading to significant financial and productivity related

  10. Improving the Prediction of Total Surgical Procedure Time Using Linear Regression Modeling.

    Science.gov (United States)

    Edelman, Eric R; van Kuijk, Sander M J; Hamaekers, Ankie E W; de Korte, Marcel J M; van Merode, Godefridus G; Buhre, Wolfgang F F A

    2017-01-01

    For efficient utilization of operating rooms (ORs), accurate schedules of assigned block time and sequences of patient cases need to be made. The quality of these planning tools is dependent on the accurate prediction of total procedure time (TPT) per case. In this paper, we attempt to improve the accuracy of TPT predictions by using linear regression models based on estimated surgeon-controlled time (eSCT) and other variables relevant to TPT. We extracted data from a Dutch benchmarking database of all surgeries performed in six academic hospitals in The Netherlands from 2012 till 2016. The final dataset consisted of 79,983 records, describing 199,772 h of total OR time. Potential predictors of TPT that were included in the subsequent analysis were eSCT, patient age, type of operation, American Society of Anesthesiologists (ASA) physical status classification, and type of anesthesia used. First, we computed the predicted TPT based on a previously described fixed ratio model for each record, multiplying eSCT by 1.33. This number is based on the research performed by van Veen-Berkx et al., which showed that 33% of SCT is generally a good approximation of anesthesia-controlled time (ACT). We then systematically tested all possible linear regression models to predict TPT using eSCT in combination with the other available independent variables. In addition, all regression models were again tested without eSCT as a predictor to predict ACT separately (which leads to TPT by adding SCT). TPT was most accurately predicted using a linear regression model based on the independent variables eSCT, type of operation, ASA classification, and type of anesthesia. This model performed significantly better than the fixed ratio model and the method of predicting ACT separately. Making use of these more accurate predictions in planning and sequencing algorithms may enable an increase in utilization of ORs, leading to significant financial and productivity related benefits.

  11. Multi-model ensemble estimation of volume transport through the straits of the East/Japan Sea

    Science.gov (United States)

    Han, Sooyeon; Hirose, Naoki; Usui, Norihisa; Miyazawa, Yasumasa

    2016-01-01

    The volume transports measured at the Korea/Tsushima, Tsugaru, and Soya/La Perouse Straits remain quantitatively inconsistent. However, data assimilation models at least provide a self-consistent budget despite subtle differences among the models. This study examined the seasonal variation of the volume transport using the multiple linear regression and ridge regression of multi-model ensemble (MME) methods to estimate more accurately transport at these straits by using four different data assimilation models. The MME outperformed all of the single models by reducing uncertainties, especially the multicollinearity problem with the ridge regression. However, the regression constants turned out to be inconsistent with each other if the MME was applied separately for each strait. The MME for a connected system was thus performed to find common constants for these straits. The estimation of this MME was found to be similar to the MME result of sea level difference (SLD). The estimated mean transport (2.43 Sv) was smaller than the measurement data at the Korea/Tsushima Strait, but the calibrated transport of the Tsugaru Strait (1.63 Sv) was larger than the observed data. The MME results of transport and SLD also suggested that the standard deviation (STD) of the Korea/Tsushima Strait is larger than the STD of the observation, whereas the estimated results were almost identical to that observed for the Tsugaru and Soya/La Perouse Straits. The similarity between MME results enhances the reliability of the present MME estimation.

  12. Linearity and Misspecification Tests for Vector Smooth Transition Regression Models

    DEFF Research Database (Denmark)

    Teräsvirta, Timo; Yang, Yukai

    The purpose of the paper is to derive Lagrange multiplier and Lagrange multiplier type specification and misspecification tests for vector smooth transition regression models. We report results from simulation studies in which the size and power properties of the proposed asymptotic tests in small...

  13. Ridge Waveguide Structures in Magnesium-Doped Lithium Niobate

    Science.gov (United States)

    Himmer, Phillip; Battle, Philip; Suckow, William; Switzer, Greg

    2011-01-01

    This work proposes to establish the feasibility of fabricating isolated ridge waveguides in 5% MgO:LN. Ridge waveguides in MgO:LN will significantly improve power handling and conversion efficiency, increase photonic component integration, and be well suited to spacebased applications. The key innovation in this effort is to combine recently available large, high-photorefractive-damage-threshold, z-cut 5% MgO:LN with novel ridge fabrication techniques to achieve high-optical power, low-cost, high-volume manufacturing of frequency conversion structures. The proposed ridge waveguide structure should maintain the characteristics of the periodically poled bulk substrate, allowing for the efficient frequency conversion typical of waveguides and the high optical damage threshold and long lifetimes typical of the 5% doped bulk substrate. The low cost and large area of 5% MgO:LN wafers, and the improved performance of the proposed ridge waveguide structure, will enhance existing measurement capabilities as well as reduce the resources required to achieve high-performance specifications. The purpose of the ridge waveguides in MgO:LN is to provide platform technology that will improve optical power handling and conversion efficiency compared to existing waveguide technology. The proposed ridge waveguide is produced using standard microfabrication techniques. The approach is enabled by recent advances in inductively coupled plasma etchers and chemical mechanical planarization techniques. In conjunction with wafer bonding, this fabrication methodology can be used to create arbitrarily shaped waveguides allowing complex optical circuits to be engineered in nonlinear optical materials such as magnesium doped lithium niobate. Researchers here have identified NLO (nonlinear optical) ridge waveguide structures as having suitable value to be the leading frequency conversion structures. Its value is based on having the low-cost fabrication necessary to satisfy the challenging pricing

  14. Anatomy of a shoreface sand ridge revisted using foraminifera: False Cape Shoals, Virginia/North Carolina inner shelf

    Science.gov (United States)

    Robinson, Marci M.; McBride, Randolph A.

    2008-01-01

    Certain details regarding the origin and evolution of shelf sand ridges remain elusive. Knowledge of their internal stratigraphy and microfossil distribution is necessary to define the origin and to determine the processes that modify sand ridges. Fourteen vibracores from False Cape Shoal A, a well-developed shoreface-attached sand ridge on the Virginia/North Carolina inner continental shelf, were examined to document the internal stratigraphy and benthic foraminiferal assemblages, as well as to reconstruct the depositional environments recorded in down-core sediments. Seven sedimentary and foraminiferal facies correspond to the following stratigraphic units: fossiliferous silt, barren sand, clay to sandy clay, laminated and bioturbated sand, poorly sorted massive sand, fine clean sand, and poorly sorted clay to gravel. The units represent a Pleistocene estuary and shoreface, a Holocene estuary, ebb tidal delta, modern shelf, modern shoreface, and swale fill, respectively. The succession of depositional environments reflects a Pleistocene sea-level highstand and subsequent regression followed by the Holocene transgression in which barrier island/spit systems formed along the Virginia/North Carolina inner shelf not, vert, ~5.2 ka and migrated landward and an ebb tidal delta that was deposited, reworked, and covered by shelf sand.

  15. Regularized multivariate regression models with skew-t error distributions

    KAUST Repository

    Chen, Lianfu

    2014-06-01

    We consider regularization of the parameters in multivariate linear regression models with the errors having a multivariate skew-t distribution. An iterative penalized likelihood procedure is proposed for constructing sparse estimators of both the regression coefficient and inverse scale matrices simultaneously. The sparsity is introduced through penalizing the negative log-likelihood by adding L1-penalties on the entries of the two matrices. Taking advantage of the hierarchical representation of skew-t distributions, and using the expectation conditional maximization (ECM) algorithm, we reduce the problem to penalized normal likelihood and develop a procedure to minimize the ensuing objective function. Using a simulation study the performance of the method is assessed, and the methodology is illustrated using a real data set with a 24-dimensional response vector. © 2014 Elsevier B.V.

  16. Quantile regression theory and applications

    CERN Document Server

    Davino, Cristina; Vistocco, Domenico

    2013-01-01

    A guide to the implementation and interpretation of Quantile Regression models This book explores the theory and numerous applications of quantile regression, offering empirical data analysis as well as the software tools to implement the methods. The main focus of this book is to provide the reader with a comprehensivedescription of the main issues concerning quantile regression; these include basic modeling, geometrical interpretation, estimation and inference for quantile regression, as well as issues on validity of the model, diagnostic tools. Each methodological aspect is explored and

  17. Regression models in the determination of the absorbed dose with extrapolation chamber for ophthalmological applicators

    International Nuclear Information System (INIS)

    Alvarez R, J.T.; Morales P, R.

    1992-06-01

    The absorbed dose for equivalent soft tissue is determined,it is imparted by ophthalmologic applicators, ( 90 Sr/ 90 Y, 1850 MBq) using an extrapolation chamber of variable electrodes; when estimating the slope of the extrapolation curve using a simple lineal regression model is observed that the dose values are underestimated from 17.7 percent up to a 20.4 percent in relation to the estimate of this dose by means of a regression model polynomial two grade, at the same time are observed an improvement in the standard error for the quadratic model until in 50%. Finally the global uncertainty of the dose is presented, taking into account the reproducibility of the experimental arrangement. As conclusion it can infers that in experimental arrangements where the source is to contact with the extrapolation chamber, it was recommended to substitute the lineal regression model by the quadratic regression model, in the determination of the slope of the extrapolation curve, for more exact and accurate measurements of the absorbed dose. (Author)

  18. Segmentation and morphology of the Central Indian Ridge between 3°S and 11°S, Indian Ocean

    Digital Repository Service at National Institute of Oceanography (India)

    KameshRaju, K.A.; Samudrala, K.; Drolia, R.K.; Amarnath, D.; Ramachandran, R.; Mudholkar, A.V.

    that are separated by well defined transform faults and non-transform discontinuities. Magnetic model studies qualify the ridge as a slow spreading ridge with average full spreading rates varying from 26 to 38 mm/yr. The disposition of the magnetic anomalies suggests...

  19. ON THE EFFECTS OF THE PRESENCE AND METHODS OF THE ELIMINATION HETEROSCEDASTICITY AND AUTOCORRELATION IN THE REGRESSION MODEL

    Directory of Open Access Journals (Sweden)

    Nina L. Timofeeva

    2014-01-01

    Full Text Available The article presents the methodological and technical bases for the creation of regression models that adequately reflect reality. The focus is on methods of removing residual autocorrelation in models. Algorithms eliminating heteroscedasticity and autocorrelation of the regression model residuals: reweighted least squares method, the method of Cochran-Orkutta are given. A model of "pure" regression is build, as well as to compare the effect on the dependent variable of the different explanatory variables when the latter are expressed in different units, a standardized form of the regression equation. The scheme of abatement techniques of heteroskedasticity and autocorrelation for the creation of regression models specific to the social and cultural sphere is developed.

  20. A generalized multivariate regression model for modelling ocean wave heights

    Science.gov (United States)

    Wang, X. L.; Feng, Y.; Swail, V. R.

    2012-04-01

    In this study, a generalized multivariate linear regression model is developed to represent the relationship between 6-hourly ocean significant wave heights (Hs) and the corresponding 6-hourly mean sea level pressure (MSLP) fields. The model is calibrated using the ERA-Interim reanalysis of Hs and MSLP fields for 1981-2000, and is validated using the ERA-Interim reanalysis for 2001-2010 and ERA40 reanalysis of Hs and MSLP for 1958-2001. The performance of the fitted model is evaluated in terms of Pierce skill score, frequency bias index, and correlation skill score. Being not normally distributed, wave heights are subjected to a data adaptive Box-Cox transformation before being used in the model fitting. Also, since 6-hourly data are being modelled, lag-1 autocorrelation must be and is accounted for. The models with and without Box-Cox transformation, and with and without accounting for autocorrelation, are inter-compared in terms of their prediction skills. The fitted MSLP-Hs relationship is then used to reconstruct historical wave height climate from the 6-hourly MSLP fields taken from the Twentieth Century Reanalysis (20CR, Compo et al. 2011), and to project possible future wave height climates using CMIP5 model simulations of MSLP fields. The reconstructed and projected wave heights, both seasonal means and maxima, are subject to a trend analysis that allows for non-linear (polynomial) trends.

  1. Design assessment for the Bethel Valley FFA Upgrades at Oak Ridge National Laboratory, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1995-09-01

    This report describes the proposed upgrades to Building 3025 and the Evaporator Area at Oak Ridge National Laboratory. Design assessments, specifications and drawings are provided. Building 3025 is a general purpose research facility utilized by the Materials and Ceramics Division to conduct research on irradiated materials. The Evaporator Area, building 2531, serves as the collection point for all low-level liquid wastes generated at the Oak Ridge National Laboratory

  2. Atmospheric dispersion modeling of hazardous air pollutant emissions from USDOE Oak Ridge Reservation Facilities. Volume 1, Independent Assessment conducted from April 1994 to December 1994

    International Nuclear Information System (INIS)

    Humphreys, M.P.

    1995-01-01

    Title 3 of the 1990 Clean Air Act (CAA) Amendments addresses the emissions of 189 hazardous air pollutants (HAPs) and mandates that EPA develop technology-based [Maximum Achievable Control Technology (MACT)] standards for the control of these pollutants from approximately 174 source categories. After implementation of technology-based standards, EPA is required to further evaluate ''residual risk'' from HAP emissions, and, if required, develop more stringent standards to protect human health and the environment with an ''adequate margin of safety''. Recognizing that EPA will be issuing risk-based regulations after MACT standards have been implemented, the US Department of Energy (DOE), Oak Ridge Operations Office (ORO) has conducted an evaluation of ambient air impacts of HAP emissions from its installations located on the Oak Ridge Reservation (ORR) near Oak Ridge, Tennessee. This report provides results of atmospheric dispersion modeling conducted to determine ambient air impacts of HAP emissions from facilities located on the ORR

  3. Modeling of radionuclide and heavy metal sorption around low and high pH waste disposal sites at Oak Ridge, Tennessee: Classification review package

    International Nuclear Information System (INIS)

    Saunders, J.A.

    1994-10-01

    Modeling of mineral precipitation and metal sorption reactions using MINTEQA2 and the iron oxyhydroxide diffuse-layer model has provided insights into geochemical processes governing contaminant migration from low-level radioactive waste disposal sites at the US Department of Energy's Oak Ridge National Laboratory and Y-12 Plant at Oak Ridge, Tennessee. Both acidic and basic nuclear-fuel reprocessing wastes, locally mixed with decontamination solvents, were disposed of in unlined trenches and lagoons. Model results show that as wastes move toward neutral pH due to reactions with surrounding soils and saprolite, mineral precipitation and sorption can limit the solubility of heavy metals and radionuclides. However, observed contaminant levels in monitoring wells indicate that at least locally, wastes are moving in faults and fractures and are not retarded by sorption reactions along such flow paths. Model results also support previous studies that have indicated organic complexing agents used in decontamination procedures can enhance radionuclide and heavy metal solubility when mixed with nuclear fuel reprocessing wastes. However, complex interactions between metal-organic complexes and mineral surfaces and natural organic matter, biodegradation, and fracture flow complicate the interpretation of contaminant mobility

  4. The use of logistic regression in modelling the distributions of bird ...

    African Journals Online (AJOL)

    The method of logistic regression was used to model the observed geographical distribution patterns of bird species in Swaziland in relation to a set of environmental variables. Reporting rates derived from bird atlas data are used as an index of population densities. This is justified in part by the success of the modelling ...

  5. Modeling Tetanus Neonatorum case using the regression of negative binomial and zero-inflated negative binomial

    Science.gov (United States)

    Amaliana, Luthfatul; Sa'adah, Umu; Wayan Surya Wardhani, Ni

    2017-12-01

    Tetanus Neonatorum is an infectious disease that can be prevented by immunization. The number of Tetanus Neonatorum cases in East Java Province is the highest in Indonesia until 2015. Tetanus Neonatorum data contain over dispersion and big enough proportion of zero-inflation. Negative Binomial (NB) regression is an alternative method when over dispersion happens in Poisson regression. However, the data containing over dispersion and zero-inflation are more appropriately analyzed by using Zero-Inflated Negative Binomial (ZINB) regression. The purpose of this study are: (1) to model Tetanus Neonatorum cases in East Java Province with 71.05 percent proportion of zero-inflation by using NB and ZINB regression, (2) to obtain the best model. The result of this study indicates that ZINB is better than NB regression with smaller AIC.

  6. Modeling Information Content Via Dirichlet-Multinomial Regression Analysis.

    Science.gov (United States)

    Ferrari, Alberto

    2017-01-01

    Shannon entropy is being increasingly used in biomedical research as an index of complexity and information content in sequences of symbols, e.g. languages, amino acid sequences, DNA methylation patterns and animal vocalizations. Yet, distributional properties of information entropy as a random variable have seldom been the object of study, leading to researchers mainly using linear models or simulation-based analytical approach to assess differences in information content, when entropy is measured repeatedly in different experimental conditions. Here a method to perform inference on entropy in such conditions is proposed. Building on results coming from studies in the field of Bayesian entropy estimation, a symmetric Dirichlet-multinomial regression model, able to deal efficiently with the issue of mean entropy estimation, is formulated. Through a simulation study the model is shown to outperform linear modeling in a vast range of scenarios and to have promising statistical properties. As a practical example, the method is applied to a data set coming from a real experiment on animal communication.

  7. Geomorphological investigation of multiphase glacitectonic composite ridge systems in Svalbard

    Science.gov (United States)

    Lovell, Harold; Benn, Douglas I.; Lukas, Sven; Spagnolo, Matteo; Cook, Simon J.; Swift, Darrel A.; Clark, Chris D.; Yde, Jacob C.; Watts, Tom

    2018-01-01

    Some surge-type glaciers on the High-Arctic archipelago of Svalbard have large glacitectonic composite ridge systems at their terrestrial margins. These have formed by rapid glacier advance into proglacial sediments during the active surge phase, creating multicrested moraine complexes. Such complexes can be formed during single surge advances or multiple surges to successively less-extensive positions. The few existing studies of composite ridge systems have largely relied on detailed information on internal structure and sedimentology to reconstruct their formation and links to surge processes. However, natural exposures of internal structure are commonly unavailable, and the creation of artificial exposures is often problematic in fragile Arctic environments. To compensate for these issues, we investigate the potential for reconstructing composite ridge system formation based on geomorphological evidence alone, focusing on clear morphostratigraphic relationships between ridges within the moraine complex and relict meltwater channels/outwash fans. Based on mapping at the margins of Finsterwalderbreen (in Van Keulenfjorden) and Grønfjordbreen (in Grønfjorden), we show that relict meltwater channels that breach outer parts of the composite ridge systems are in most cases truncated upstream within the ridge complex by an inner pushed ridge or ridges at their ice-proximal extents. Our interpretation of this relationship is that the entire composite ridge system is unlikely to have formed during the same glacier advance but is instead the product of multiple advances to successively less-extensive positions, whereby younger ridges are emplaced on the ice-proximal side of older ridges. This indicates that the Finsterwalderbreen composite ridge system has been formed by multiple separate advances, consistent with the cyclicity of surges. Being able to identify the frequency and magnitude of former surges is important as it provides insight into the past behaviour of

  8. Oak Ridge reservation land-use plan

    Energy Technology Data Exchange (ETDEWEB)

    Bibb, W. R.; Hardin, T. H.; Hawkins, C. C.; Johnson, W. A.; Peitzsch, F. C.; Scott, T. H.; Theisen, M. R.; Tuck, S. C.

    1980-03-01

    This study establishes a basis for long-range land-use planning to accommodate both present and projected DOE program requirements in Oak Ridge. In addition to technological requirements, this land-use plan incorporates in-depth ecological concepts that recognize multiple uses of land as a viable option. Neither environmental research nor technological operations need to be mutually exclusive in all instances. Unique biological areas, as well as rare and endangered species, need to be protected, and human and environmental health and safety must be maintained. The plan is based on the concept that the primary use of DOE land resources must be to implement the overall DOE mission in Oak Ridge. This document, along with the base map and overlay maps, provides a reasonably detailed description of the DOE Oak Ridge land resources and of the current and potential uses of the land. A description of the land characteristics, including geomorphology, agricultural productivity and soils, water courses, vegetation, and terrestrial and aquatic animal habitats, is presented to serve as a resource document. Essentially all DOE land in the Oak Ridge area is being fully used for ongoing DOE programs or has been set aside as protected areas.

  9. Predicting 30-day Hospital Readmission with Publicly Available Administrative Database. A Conditional Logistic Regression Modeling Approach.

    Science.gov (United States)

    Zhu, K; Lou, Z; Zhou, J; Ballester, N; Kong, N; Parikh, P

    2015-01-01

    This article is part of the Focus Theme of Methods of Information in Medicine on "Big Data and Analytics in Healthcare". Hospital readmissions raise healthcare costs and cause significant distress to providers and patients. It is, therefore, of great interest to healthcare organizations to predict what patients are at risk to be readmitted to their hospitals. However, current logistic regression based risk prediction models have limited prediction power when applied to hospital administrative data. Meanwhile, although decision trees and random forests have been applied, they tend to be too complex to understand among the hospital practitioners. Explore the use of conditional logistic regression to increase the prediction accuracy. We analyzed an HCUP statewide inpatient discharge record dataset, which includes patient demographics, clinical and care utilization data from California. We extracted records of heart failure Medicare beneficiaries who had inpatient experience during an 11-month period. We corrected the data imbalance issue with under-sampling. In our study, we first applied standard logistic regression and decision tree to obtain influential variables and derive practically meaning decision rules. We then stratified the original data set accordingly and applied logistic regression on each data stratum. We further explored the effect of interacting variables in the logistic regression modeling. We conducted cross validation to assess the overall prediction performance of conditional logistic regression (CLR) and compared it with standard classification models. The developed CLR models outperformed several standard classification models (e.g., straightforward logistic regression, stepwise logistic regression, random forest, support vector machine). For example, the best CLR model improved the classification accuracy by nearly 20% over the straightforward logistic regression model. Furthermore, the developed CLR models tend to achieve better sensitivity of

  10. Calendar year 1993 groundwater quality report for the Chestnut Ridge Hydrogeologic Regime, Y-12 Plant, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1994-02-01

    This annual groundwater report contains groundwater quality data obtained during the 1993 calendar year (CY) at several hazardous and non-hazardous waste-management facilities associated with the US Department of Energy (DOE) Y-12 Plant located on the DOE Oak Ridge Reservation (ORR) southeast of Oak Ridge, Tennessee. These sites are located south of the Y-12 Plant in the Chestnut Ridge Hydrogeologic Regime (Chestnut Ridge Regime), which is one of three regimes defined for the purposes of groundwater quality monitoring at the Y-12 Plant. The Environmental Management Department of the Y-12 Plant Health, Safety, Environment, and Accountability Organization manages the groundwater monitoring activities in each regime as part of the Y-12 Plant Groundwater Protection Program (GWPP). The annual groundwater report for the Chestnut Ridge Regime is completed in two-parts; Part 1 (this report) containing the groundwater quality data and Part 2 containing a detailed evaluation of the data. The primary purpose of this report is to serve as a reference for the groundwater quality data obtained each year under the lead of the Y-12 Plant GWPP. However, because it contains information needed to comply with Resource Conservation and Recovery Act (RCRA) interim status assessment monitoring and reporting requirements, this report is submitted to the Tennessee Department of Health and Environment (TDEC) by the RCRA reporting deadline

  11. Notes on power of normality tests of error terms in regression models

    International Nuclear Information System (INIS)

    Střelec, Luboš

    2015-01-01

    Normality is one of the basic assumptions in applying statistical procedures. For example in linear regression most of the inferential procedures are based on the assumption of normality, i.e. the disturbance vector is assumed to be normally distributed. Failure to assess non-normality of the error terms may lead to incorrect results of usual statistical inference techniques such as t-test or F-test. Thus, error terms should be normally distributed in order to allow us to make exact inferences. As a consequence, normally distributed stochastic errors are necessary in order to make a not misleading inferences which explains a necessity and importance of robust tests of normality. Therefore, the aim of this contribution is to discuss normality testing of error terms in regression models. In this contribution, we introduce the general RT class of robust tests for normality, and present and discuss the trade-off between power and robustness of selected classical and robust normality tests of error terms in regression models

  12. Notes on power of normality tests of error terms in regression models

    Energy Technology Data Exchange (ETDEWEB)

    Střelec, Luboš [Department of Statistics and Operation Analysis, Faculty of Business and Economics, Mendel University in Brno, Zemědělská 1, Brno, 61300 (Czech Republic)

    2015-03-10

    Normality is one of the basic assumptions in applying statistical procedures. For example in linear regression most of the inferential procedures are based on the assumption of normality, i.e. the disturbance vector is assumed to be normally distributed. Failure to assess non-normality of the error terms may lead to incorrect results of usual statistical inference techniques such as t-test or F-test. Thus, error terms should be normally distributed in order to allow us to make exact inferences. As a consequence, normally distributed stochastic errors are necessary in order to make a not misleading inferences which explains a necessity and importance of robust tests of normality. Therefore, the aim of this contribution is to discuss normality testing of error terms in regression models. In this contribution, we introduce the general RT class of robust tests for normality, and present and discuss the trade-off between power and robustness of selected classical and robust normality tests of error terms in regression models.

  13. Surveillance Plan for environmental monitoring in Waste Area Grouping 6 at Oak Ridge National Laboratory, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1993-12-01

    This Surveillance Plan has been developed as part of the Environmental Monitoring Plan for Waste Area Grouping 6 at Oak Ridge National Laboratory, Oak Ridge, Tennessee. Environmental monitoring will be conducted in two phases: the baseline monitoring phase and the routine annual monitoring phase. The baseline monitoring phase will be conducted to establish the baseline contaminant release conditions at the Waste Area Grouping (WAG), to confirm the site-related chemicals of concern (COC), and to gather data to confirm the site hydrologic model The baseline monitoring phase is expected to begin in 1994 and continue for 12--18 months. The routine annual monitoring phase will consist of continued sampling and analyses of COC to determine off-WAG contaminant flux, to identify trends in releases, and to confirm the COC The routine annual monitoring phase will continue for ∼4 years after completion of the baseline monitoring phase. This Surveillance Plan presents the technical and quality assurance surveillance activities for the various WAG 6 environmental monitoring and data evaluation plans and implementing procedures

  14. Online Statistical Modeling (Regression Analysis) for Independent Responses

    Science.gov (United States)

    Made Tirta, I.; Anggraeni, Dian; Pandutama, Martinus

    2017-06-01

    Regression analysis (statistical analmodelling) are among statistical methods which are frequently needed in analyzing quantitative data, especially to model relationship between response and explanatory variables. Nowadays, statistical models have been developed into various directions to model various type and complex relationship of data. Rich varieties of advanced and recent statistical modelling are mostly available on open source software (one of them is R). However, these advanced statistical modelling, are not very friendly to novice R users, since they are based on programming script or command line interface. Our research aims to developed web interface (based on R and shiny), so that most recent and advanced statistical modelling are readily available, accessible and applicable on web. We have previously made interface in the form of e-tutorial for several modern and advanced statistical modelling on R especially for independent responses (including linear models/LM, generalized linier models/GLM, generalized additive model/GAM and generalized additive model for location scale and shape/GAMLSS). In this research we unified them in the form of data analysis, including model using Computer Intensive Statistics (Bootstrap and Markov Chain Monte Carlo/ MCMC). All are readily accessible on our online Virtual Statistics Laboratory. The web (interface) make the statistical modeling becomes easier to apply and easier to compare them in order to find the most appropriate model for the data.

  15. The Transmuted Geometric-Weibull distribution: Properties, Characterizations and Regression Models

    Directory of Open Access Journals (Sweden)

    Zohdy M Nofal

    2017-06-01

    Full Text Available We propose a new lifetime model called the transmuted geometric-Weibull distribution. Some of its structural properties including ordinary and incomplete moments, quantile and generating functions, probability weighted moments, Rényi and q-entropies and order statistics are derived. The maximum likelihood method is discussed to estimate the model parameters by means of Monte Carlo simulation study. A new location-scale regression model is introduced based on the proposed distribution. The new distribution is applied to two real data sets to illustrate its flexibility. Empirical results indicate that proposed distribution can be alternative model to other lifetime models available in the literature for modeling real data in many areas.

  16. Credit Scoring Problem Based on Regression Analysis

    OpenAIRE

    Khassawneh, Bashar Suhil Jad Allah

    2014-01-01

    ABSTRACT: This thesis provides an explanatory introduction to the regression models of data mining and contains basic definitions of key terms in the linear, multiple and logistic regression models. Meanwhile, the aim of this study is to illustrate fitting models for the credit scoring problem using simple linear, multiple linear and logistic regression models and also to analyze the found model functions by statistical tools. Keywords: Data mining, linear regression, logistic regression....

  17. Conditional Monte Carlo randomization tests for regression models.

    Science.gov (United States)

    Parhat, Parwen; Rosenberger, William F; Diao, Guoqing

    2014-08-15

    We discuss the computation of randomization tests for clinical trials of two treatments when the primary outcome is based on a regression model. We begin by revisiting the seminal paper of Gail, Tan, and Piantadosi (1988), and then describe a method based on Monte Carlo generation of randomization sequences. The tests based on this Monte Carlo procedure are design based, in that they incorporate the particular randomization procedure used. We discuss permuted block designs, complete randomization, and biased coin designs. We also use a new technique by Plamadeala and Rosenberger (2012) for simple computation of conditional randomization tests. Like Gail, Tan, and Piantadosi, we focus on residuals from generalized linear models and martingale residuals from survival models. Such techniques do not apply to longitudinal data analysis, and we introduce a method for computation of randomization tests based on the predicted rate of change from a generalized linear mixed model when outcomes are longitudinal. We show, by simulation, that these randomization tests preserve the size and power well under model misspecification. Copyright © 2014 John Wiley & Sons, Ltd.

  18. Oak Ridge low-level waste disposal facility designs

    International Nuclear Information System (INIS)

    Van Hoesen, S.D.; Jones, L.S.

    1991-01-01

    The strategic planning process that culuminates in the identification, selection, construction, and ultimate operation of treatment, storage, and disposal facilities for all types of low-level waste (LLW) generated on the Oak Ridge Reservation (ORR) was conducted under the Low-Level Waste Disposal Development and Demonstration (LLWDDD) Program. This program considered management of various concentrations of short half-life radionuclides generated principally at Oak Ridge National Laboratory (ORNL) and long half-life radionuclides (principally uranium) generated at the Oak Ridge Y-12 Plant and the Oak Ridge K-25 Plant. The LLWDDD Program is still ongoing and involves four phases: (1) alternative identification and evaluation, (2) technology demonstration, (3) limited operational implementation, and (4) full operational implementation. This document provides a discussion of these phases

  19. Use of multiple linear regression and logistic regression models to investigate changes in birthweight for term singleton infants in Scotland.

    Science.gov (United States)

    Bonellie, Sandra R

    2012-10-01

    To illustrate the use of regression and logistic regression models to investigate changes over time in size of babies particularly in relation to social deprivation, age of the mother and smoking. Mean birthweight has been found to be increasing in many countries in recent years, but there are still a group of babies who are born with low birthweights. Population-based retrospective cohort study. Multiple linear regression and logistic regression models are used to analyse data on term 'singleton births' from Scottish hospitals between 1994-2003. Mothers who smoke are shown to give birth to lighter babies on average, a difference of approximately 0.57 Standard deviations lower (95% confidence interval. 0.55-0.58) when adjusted for sex and parity. These mothers are also more likely to have babies that are low birthweight (odds ratio 3.46, 95% confidence interval 3.30-3.63) compared with non-smokers. Low birthweight is 30% more likely where the mother lives in the most deprived areas compared with the least deprived, (odds ratio 1.30, 95% confidence interval 1.21-1.40). Smoking during pregnancy is shown to have a detrimental effect on the size of infants at birth. This effect explains some, though not all, of the observed socioeconomic birthweight. It also explains much of the observed birthweight differences by the age of the mother.   Identifying mothers at greater risk of having a low birthweight baby as important implications for the care and advice this group receives. © 2012 Blackwell Publishing Ltd.

  20. Scalable Bayesian nonparametric regression via a Plackett-Luce model for conditional ranks

    Science.gov (United States)

    Gray-Davies, Tristan; Holmes, Chris C.; Caron, François

    2018-01-01

    We present a novel Bayesian nonparametric regression model for covariates X and continuous response variable Y ∈ ℝ. The model is parametrized in terms of marginal distributions for Y and X and a regression function which tunes the stochastic ordering of the conditional distributions F (y|x). By adopting an approximate composite likelihood approach, we show that the resulting posterior inference can be decoupled for the separate components of the model. This procedure can scale to very large datasets and allows for the use of standard, existing, software from Bayesian nonparametric density estimation and Plackett-Luce ranking estimation to be applied. As an illustration, we show an application of our approach to a US Census dataset, with over 1,300,000 data points and more than 100 covariates. PMID:29623150

  1. Field Sampling and Analysis Plan for the Remedial Investigation of Waste Area Grouping 2 at Oak Ridge National Laboratory, Oak Ridge, Tennessee

    International Nuclear Information System (INIS)

    1992-10-01

    This report provides responses to US Environmental Protection Agency Region IV EPA-M and Tennessee Department of Environment and Conservation Oversite Division (TDEC-O) comments on report ORNL/ER-58, Field Sampling and Analysis Plan for the Remedial Investigation of Waste Area Grouping 2 at Oak Ridge National Laboratory, Oak Ridge, Tennessee. Waste Area Grouping (WAG) 2 consists of the White Oak Creek (WOC) drainage system downgradient of the major ORNL WAGs in the WOC watershed. A strategy for the remedial investigation (RI) of WAG2 was developed in report ES/ER-14 ampersand Dl, Remedial Investigation Plan for Waste Area Grouping 2 at Oak Ridge National Laboratory, Oak Ridge, Tennessee. This strategy takes full advantage of WAG2's role as an integrator of contaminant releases from the ORNL WAGs in the WOC watershed, and takes full advantage of WAG2's role as a conduit for contaminants from the ORNL site to the Clinch River. The strategy calls for a multimedia environmental monitoring and characterization program to be conducted in WAG2 while upgradient contaminant sources are being remediated. This monitoring and characterization program will (1) identify and quantify contaminant fluxes, (2) identify pathways of greatest concern for human health and environmental risk, (3) improve conceptual models of contaminant movement, (4) support the evaluation of remedial alternatives, (5) support efforts to prioritize sites for remediation, (6) document the reduction in contaminant fluxes following remediation, and (7) support the eventual remediation of WAG2. Following this strategy, WAG2 has been termed an ''integrator WAG,'' and efforts in WAG2 over the short term are directed toward supporting efforts to remediate the contaminant ''source WAGS'' at ORNL

  2. Dacite petrogenesis on mid-ocean ridges: Evidence for oceanic crustal melting and assimilation

    Science.gov (United States)

    Wanless, V.D.; Perfit, M.R.; Ridley, W.I.; Klein, E.

    2010-01-01

    Whereas the majority of eruptions at oceanic spreading centers produce lavas with relatively homogeneous mid-ocean ridge basalt (MORB) compositions, the formation of tholeiitic andesites and dacites at mid-ocean ridges (MORs) is a petrological enigma. Eruptions of MOR high-silica lavas are typically associated with ridge discontinuities and have produced regionally significant volumes of lava. Andesites and dacites have been observed and sampled at several locations along the global MOR system; these include propagating ridge tips at ridge-transform intersections on the Juan de Fuca Ridge and eastern Gal??pagos spreading center, and at the 9??N overlapping spreading center on the East Pacific Rise. Despite the formation of these lavas at various ridges, MOR dacites show remarkably similar major element trends and incompatible trace element enrichments, suggesting that similar processes are controlling their chemistry. Although most geochemical variability in MOR basalts is consistent with low-pressure fractional crystallization of various mantle-derived parental melts, our geochemical data for MOR dacitic glasses suggest that contamination from a seawater-altered component is important in their petrogenesis. MOR dacites are characterized by elevated U, Th, Zr, and Hf, low Nb and Ta concentrations relative to rare earth elements (REE), and Al2O3, K2O, and Cl concentrations that are higher than expected from low-pressure fractional crystallization alone. Petrological modeling of MOR dacites suggests that partial melting and assimilation are both integral to their petrogenesis. Extensive fractional crystallization of a MORB parent combined with partial melting and assimilation of amphibole-bearing altered crust produces a magma with a geochemical signature similar to a MOR dacite. This supports the hypothesis that crustal assimilation is an important process in the formation of highly evolved MOR lavas and may be significant in the generation of evolved MORB in

  3. Recursive Algorithm For Linear Regression

    Science.gov (United States)

    Varanasi, S. V.

    1988-01-01

    Order of model determined easily. Linear-regression algorithhm includes recursive equations for coefficients of model of increased order. Algorithm eliminates duplicative calculations, facilitates search for minimum order of linear-regression model fitting set of data satisfactory.

  4. An adaptive two-stage analog/regression model for probabilistic prediction of small-scale precipitation in France

    Science.gov (United States)

    Chardon, Jérémy; Hingray, Benoit; Favre, Anne-Catherine

    2018-01-01

    Statistical downscaling models (SDMs) are often used to produce local weather scenarios from large-scale atmospheric information. SDMs include transfer functions which are based on a statistical link identified from observations between local weather and a set of large-scale predictors. As physical processes driving surface weather vary in time, the most relevant predictors and the regression link are likely to vary in time too. This is well known for precipitation for instance and the link is thus often estimated after some seasonal stratification of the data. In this study, we present a two-stage analog/regression model where the regression link is estimated from atmospheric analogs of the current prediction day. Atmospheric analogs are identified from fields of geopotential heights at 1000 and 500 hPa. For the regression stage, two generalized linear models are further used to model the probability of precipitation occurrence and the distribution of non-zero precipitation amounts, respectively. The two-stage model is evaluated for the probabilistic prediction of small-scale precipitation over France. It noticeably improves the skill of the prediction for both precipitation occurrence and amount. As the analog days vary from one prediction day to another, the atmospheric predictors selected in the regression stage and the value of the corresponding regression coefficients can vary from one prediction day to another. The model allows thus for a day-to-day adaptive and tailored downscaling. It can also reveal specific predictors for peculiar and non-frequent weather configurations.

  5. Exploration of the Pine Ridge Uranium Deposits, Powder River Basin, Wyoming

    International Nuclear Information System (INIS)

    Doelger, Mark J.; Sundell, Kent A.

    2014-01-01

    Summary of Exploration in Pine Ridge District: • Use of outcrop mapping integrated with oil and gas subsurface data and available well logs resulted in a geologic model for this previously unexplored area. • Proprietary drilling by Stakeholder over the past two years has confirmed the geologic model of large mineralized alteration cells in staked fluvial sandstone sequences. • The target-rich area of potential extends over nine contiguous townships where Stakeholder has leased over 70,000 acres. • Adjacent mature in-situ projects provide strong analogs and demonstrate amenability for the ore bodies at shallow, intermediate, and deep depths. • These project attributes, with discoveries by Stakeholder are expected to result in future yellow cake production with partner or successor to Stakeholder, and warrants naming this the Pine Ridge District. • Potential resource is an estimated 66 to 72 million pounds

  6. Lateral ridge split and immediate implant placement in moderately resorbed alveolar ridges: How much is the added width?

    Directory of Open Access Journals (Sweden)

    Amin Rahpeyma

    2013-01-01

    Full Text Available Background: Lateral ridge split technique is a way to solve the problem of the width in narrow ridges with adequate height. Simultaneous insertion of dental implants will considerably reduce the edentulism time. Materials and Methods: Twenty-five patients who were managed with ridge splitting technique were enrolled. Thirty-eight locations in both jaws with near equal distribution in quadrants received 82 dental fixtures. Beta Tricalcium phosphate (Cerasorb® was used as biomaterial to fill the intercortical space. Submerged implants were used and 3 months later healing caps were placed. Direct bone measurements before and after split were done with a Collis. Patients were clinically re-evaluated at least 6 months after implant loading. All the data were analyzed by Statistical Package for Social Sciences (SPSS software version 11.5 (SPSS Inc, Chicago Illinois, USA. Frequency of edentulous spaces and pre/post operative bone width was analyzed. Paired t-test was used for statistical analysis. Difference was considered significant if P value was less than 0.05. Results: Mean value for presplit width was 3.2 ± 0.34 mm while post-split mean width was 5.57 ± 0.49 mm. Mean gain in crest ridge after ridge splitting was 2 ± 0.3 mm. Statistical analysis showed significant differences in width before and after operation ((P < 0.05. All implants (n = 82 survived and were in full function at follow up (at least 6 months after implant loading. Conclusion: Ridge splitting technique in both jaws showed the predictable outcomes, if appropriate cases selected and special attention paid to details; then the waiting time between surgery and beginning of prosthodontic treatment can be reduced to 3 month.

  7. Logistic regression for dichotomized counts.

    Science.gov (United States)

    Preisser, John S; Das, Kalyan; Benecha, Habtamu; Stamm, John W

    2016-12-01

    Sometimes there is interest in a dichotomized outcome indicating whether a count variable is positive or zero. Under this scenario, the application of ordinary logistic regression may result in efficiency loss, which is quantifiable under an assumed model for the counts. In such situations, a shared-parameter hurdle model is investigated for more efficient estimation of regression parameters relating to overall effects of covariates on the dichotomous outcome, while handling count data with many zeroes. One model part provides a logistic regression containing marginal log odds ratio effects of primary interest, while an ancillary model part describes the mean count of a Poisson or negative binomial process in terms of nuisance regression parameters. Asymptotic efficiency of the logistic model parameter estimators of the two-part models is evaluated with respect to ordinary logistic regression. Simulations are used to assess the properties of the models with respect to power and Type I error, the latter investigated under both misspecified and correctly specified models. The methods are applied to data from a randomized clinical trial of three toothpaste formulations to prevent incident dental caries in a large population of Scottish schoolchildren. © The Author(s) 2014.

  8. Logistic regression models for polymorphic and antagonistic pleiotropic gene action on human aging and longevity

    DEFF Research Database (Denmark)

    Tan, Qihua; Bathum, L; Christiansen, L

    2003-01-01

    In this paper, we apply logistic regression models to measure genetic association with human survival for highly polymorphic and pleiotropic genes. By modelling genotype frequency as a function of age, we introduce a logistic regression model with polytomous responses to handle the polymorphic...... situation. Genotype and allele-based parameterization can be used to investigate the modes of gene action and to reduce the number of parameters, so that the power is increased while the amount of multiple testing minimized. A binomial logistic regression model with fractional polynomials is used to capture...... the age-dependent or antagonistic pleiotropic effects. The models are applied to HFE genotype data to assess the effects on human longevity by different alleles and to detect if an age-dependent effect exists. Application has shown that these methods can serve as useful tools in searching for important...

  9. Modeling daily soil temperature over diverse climate conditions in Iran—a comparison of multiple linear regression and support vector regression techniques

    Science.gov (United States)

    Delbari, Masoomeh; Sharifazari, Salman; Mohammadi, Ehsan

    2018-02-01

    The knowledge of soil temperature at different depths is important for agricultural industry and for understanding climate change. The aim of this study is to evaluate the performance of a support vector regression (SVR)-based model in estimating daily soil temperature at 10, 30 and 100 cm depth at different climate conditions over Iran. The obtained results were compared to those obtained from a more classical multiple linear regression (MLR) model. The correlation sensitivity for the input combinations and periodicity effect were also investigated. Climatic data used as inputs to the models were minimum and maximum air temperature, solar radiation, relative humidity, dew point, and the atmospheric pressure (reduced to see level), collected from five synoptic stations Kerman, Ahvaz, Tabriz, Saghez, and Rasht located respectively in the hyper-arid, arid, semi-arid, Mediterranean, and hyper-humid climate conditions. According to the results, the performance of both MLR and SVR models was quite well at surface layer, i.e., 10-cm depth. However, SVR performed better than MLR in estimating soil temperature at deeper layers especially 100 cm depth. Moreover, both models performed better in humid climate condition than arid and hyper-arid areas. Further, adding a periodicity component into the modeling process considerably improved the models' performance especially in the case of SVR.

  10. Remedial investigation work plan for Chestnut Ridge Operable Unit 4 (Rogers Quarry/Lower McCoy Branch) at the Oak Ridge Y-12 Plant, Oak Ridge, Tennessee

    Energy Technology Data Exchange (ETDEWEB)

    1993-09-01

    The Oak Ridge Y-12 Plant includes - 800 acres near the northeast comer of the reservation and adjacent to the city of Oak Ridge (Fig. 1-1). The plant is a manufacturing and developmental engineering facility that produced components for various nuclear weapons systems and provides engineering support to other Energy Systems facilities. More than 200 contaminated sites have been identified at the Y-12 Plant that resulted from past waste management practices. Many of the sites have operable units (OUs) based on priority and on investigative and remediation requirements. This Remedial Investigation RI work plan specifically addresses Chestnut Ridge OU 4. Chestnut Ridge OU 4 consists of Rogers Quarry and Lower McCoy Branch (MCB). Rogers Quarry, which is also known as Old Rogers Quarry or Bethel Valley Quarry was used for quarrying from the late 1940s or early 1950s until about 1960. Since that time, the quarry has been used for disposal of coal ash and materials from Y-12 production operations, including classified materials. Disposal of coal ash ended in July 1993. An RI is being conducted at this site in response to CERCLA regulations. The overall objectives of the RI are to collect data necessary to evaluate the nature and extent of contaminants of concern, support an Ecological Risk Assessment and a Human Health Risk Assessment, support the evaluation of remedial alternatives, and ultimately develop a Record of Decision for the site. The purpose of this work plan is to outline RI activities necessary to define the nature and extent of suspected contaminants at Chestnut Ridge OU 4. Potential migration pathways also will be investigated. Data collected during the RI will be used to evaluate the risk posed to human health and the environment by OU 4.

  11. Ajuste de modelos de platô de resposta via regressão isotônica Response plateau models fitting via isotonic regression

    Directory of Open Access Journals (Sweden)

    Renata Pires Gonçalves

    2012-02-01

    . The experiments of type dosage x response are very common in the determination of levels of nutrients in optimal food balance and include the use of regression models to achieve this objective. Nevertheless, the regression analysis routine, generally, uses a priori information about a possible relationship between the response variable. The isotonic regression is a method of estimation by least squares that generates estimates which preserves data ordering. In the theory of isotonic regression this information is essential and it is expected to increase fitting efficiency. The objective of this work was to use an isotonic regression methodology, as an alternative way of analyzing data of Zn deposition in tibia of male birds of Hubbard lineage. We considered the models of plateau response of polynomial quadratic and linear exponential forms. In addition to these models, we also proposed the fitting of a logarithmic model to the data and the efficiency of the methodology was evaluated by Monte Carlo simulations, considering different scenarios for the parametric values. The isotonization of the data yielded an improvement in all the fitting quality parameters evaluated. Among the models used, the logarithmic presented estimates of the parameters more consistent with the values reported in literature.

  12. Non-linear calibration models for near infrared spectroscopy

    DEFF Research Database (Denmark)

    Ni, Wangdong; Nørgaard, Lars; Mørup, Morten

    2014-01-01

    by ridge regression (RR). The performance of the different methods is demonstrated by their practical applications using three real-life near infrared (NIR) data sets. Different aspects of the various approaches including computational time, model interpretability, potential over-fitting using the non-linear...... models on linear problems, robustness to small or medium sample sets, and robustness to pre-processing, are discussed. The results suggest that GPR and BANN are powerful and promising methods for handling linear as well as nonlinear systems, even when the data sets are moderately small. The LS......-SVM), relevance vector machines (RVM), Gaussian process regression (GPR), artificial neural network (ANN), and Bayesian ANN (BANN). In this comparison, partial least squares (PLS) regression is used as a linear benchmark, while the relationship of the methods is considered in terms of traditional calibration...

  13. Agricultural drought prediction using climate indices based on Support Vector Regression in Xiangjiang River basin.

    Science.gov (United States)

    Tian, Ye; Xu, Yue-Ping; Wang, Guoqing

    2018-05-01

    Drought can have a substantial impact on the ecosystem and agriculture of the affected region and does harm to local economy. This study aims to analyze the relation between soil moisture and drought and predict agricultural drought in Xiangjiang River basin. The agriculture droughts are presented with the Precipitation-Evapotranspiration Index (SPEI). The Support Vector Regression (SVR) model incorporating climate indices is developed to predict the agricultural droughts. Analysis of climate forcing including El Niño Southern Oscillation and western Pacific subtropical high (WPSH) are carried out to select climate indices. The results show that SPEI of six months time scales (SPEI-6) represents the soil moisture better than that of three and one month time scale on drought duration, severity and peaks. The key factor that influences the agriculture drought is the Ridge Point of WPSH, which mainly controls regional temperature. The SVR model incorporating climate indices, especially ridge point of WPSH, could improve the prediction accuracy compared to that solely using drought index by 4.4% in training and 5.1% in testing measured by Nash Sutcliffe efficiency coefficient (NSE) for three month lead time. The improvement is more significant for the prediction with one month lead (15.8% in training and 27.0% in testing) than that with three months lead time. However, it needs to be cautious in selection of the input parameters, since adding redundant information could have a counter effect in attaining a better prediction. Copyright © 2017 Elsevier B.V. All rights reserved.

  14. Adjusting for overdispersion in piecewise exponential regression models to estimate excess mortality rate in population-based research.

    Science.gov (United States)

    Luque-Fernandez, Miguel Angel; Belot, Aurélien; Quaresma, Manuela; Maringe, Camille; Coleman, Michel P; Rachet, Bernard

    2016-10-01

    In population-based cancer research, piecewise exponential regression models are used to derive adjusted estimates of excess mortality due to cancer using the Poisson generalized linear modelling framework. However, the assumption that the conditional mean and variance of the rate parameter given the set of covariates x i are equal is strong and may fail to account for overdispersion given the variability of the rate parameter (the variance exceeds the mean). Using an empirical example, we aimed to describe simple methods to test and correct for overdispersion. We used a regression-based score test for overdispersion under the relative survival framework and proposed different approaches to correct for overdispersion including a quasi-likelihood, robust standard errors estimation, negative binomial regression and flexible piecewise modelling. All piecewise exponential regression models showed the presence of significant inherent overdispersion (p-value regression modelling, with either a quasi-likelihood or robust standard errors, was the best approach as it deals with both, overdispersion due to model misspecification and true or inherent overdispersion.

  15. Endogenous glucose production from infancy to adulthood: a non-linear regression model

    NARCIS (Netherlands)

    Huidekoper, Hidde H.; Ackermans, Mariëtte T.; Ruiter, An F. C.; Sauerwein, Hans P.; Wijburg, Frits A.

    2014-01-01

    To construct a regression model for endogenous glucose production (EGP) as a function of age, and compare this with glucose supplementation using commonly used dextrose-based saline solutions at fluid maintenance rate in children. A model was constructed based on EGP data, as quantified by

  16. Modeling animal-vehicle collisions using diagonal inflated bivariate Poisson regression.

    Science.gov (United States)

    Lao, Yunteng; Wu, Yao-Jan; Corey, Jonathan; Wang, Yinhai

    2011-01-01

    Two types of animal-vehicle collision (AVC) data are commonly adopted for AVC-related risk analysis research: reported AVC data and carcass removal data. One issue with these two data sets is that they were found to have significant discrepancies by previous studies. In order to model these two types of data together and provide a better understanding of highway AVCs, this study adopts a diagonal inflated bivariate Poisson regression method, an inflated version of bivariate Poisson regression model, to fit the reported AVC and carcass removal data sets collected in Washington State during 2002-2006. The diagonal inflated bivariate Poisson model not only can model paired data with correlation, but also handle under- or over-dispersed data sets as well. Compared with three other types of models, double Poisson, bivariate Poisson, and zero-inflated double Poisson, the diagonal inflated bivariate Poisson model demonstrates its capability of fitting two data sets with remarkable overlapping portions resulting from the same stochastic process. Therefore, the diagonal inflated bivariate Poisson model provides researchers a new approach to investigating AVCs from a different perspective involving the three distribution parameters (λ(1), λ(2) and λ(3)). The modeling results show the impacts of traffic elements, geometric design and geographic characteristics on the occurrences of both reported AVC and carcass removal data. It is found that the increase of some associated factors, such as speed limit, annual average daily traffic, and shoulder width, will increase the numbers of reported AVCs and carcass removals. Conversely, the presence of some geometric factors, such as rolling and mountainous terrain, will decrease the number of reported AVCs. Published by Elsevier Ltd.

  17. Geographically weighted negative binomial regression applied to zonal level safety performance models.

    Science.gov (United States)

    Gomes, Marcos José Timbó Lima; Cunto, Flávio; da Silva, Alan Ricardo

    2017-09-01

    Generalized Linear Models (GLM) with negative binomial distribution for errors, have been widely used to estimate safety at the level of transportation planning. The limited ability of this technique to take spatial effects into account can be overcome through the use of local models from spatial regression techniques, such as Geographically Weighted Poisson Regression (GWPR). Although GWPR is a system that deals with spatial dependency and heterogeneity and has already been used in some road safety studies at the planning level, it fails to account for the possible overdispersion that can be found in the observations on road-traffic crashes. Two approaches were adopted for the Geographically Weighted Negative Binomial Regression (GWNBR) model to allow discrete data to be modeled in a non-stationary form and to take note of the overdispersion of the data: the first examines the constant overdispersion for all the traffic zones and the second includes the variable for each spatial unit. This research conducts a comparative analysis between non-spatial global crash prediction models and spatial local GWPR and GWNBR at the level of traffic zones in Fortaleza/Brazil. A geographic database of 126 traffic zones was compiled from the available data on exposure, network characteristics, socioeconomic factors and land use. The models were calibrated by using the frequency of injury crashes as a dependent variable and the results showed that GWPR and GWNBR achieved a better performance than GLM for the average residuals and likelihood as well as reducing the spatial autocorrelation of the residuals, and the GWNBR model was more able to capture the spatial heterogeneity of the crash frequency. Copyright © 2017 Elsevier Ltd. All rights reserved.

  18. Linear regression

    CERN Document Server

    Olive, David J

    2017-01-01

    This text covers both multiple linear regression and some experimental design models. The text uses the response plot to visualize the model and to detect outliers, does not assume that the error distribution has a known parametric distribution, develops prediction intervals that work when the error distribution is unknown, suggests bootstrap hypothesis tests that may be useful for inference after variable selection, and develops prediction regions and large sample theory for the multivariate linear regression model that has m response variables. A relationship between multivariate prediction regions and confidence regions provides a simple way to bootstrap confidence regions. These confidence regions often provide a practical method for testing hypotheses. There is also a chapter on generalized linear models and generalized additive models. There are many R functions to produce response and residual plots, to simulate prediction intervals and hypothesis tests, to detect outliers, and to choose response trans...

  19. Regression-based model of skin diffuse reflectance for skin color analysis

    Science.gov (United States)

    Tsumura, Norimichi; Kawazoe, Daisuke; Nakaguchi, Toshiya; Ojima, Nobutoshi; Miyake, Yoichi

    2008-11-01

    A simple regression-based model of skin diffuse reflectance is developed based on reflectance samples calculated by Monte Carlo simulation of light transport in a two-layered skin model. This reflectance model includes the values of spectral reflectance in the visible spectra for Japanese women. The modified Lambert Beer law holds in the proposed model with a modified mean free path length in non-linear density space. The averaged RMS and maximum errors of the proposed model were 1.1 and 3.1%, respectively, in the above range.

  20. Orientationally ordered ridge structures of aluminum films on hydrogen terminated silicon

    DEFF Research Database (Denmark)

    Quaade, Ulrich; Pantleon, Karen

    2006-01-01

    Films of aluminum deposited onto Si(100) substrates show a surface structure of parallel ridges. On films deposited on oxidized silicon substrates the direction of the ridges is arbitrary, but on films deposited on hydrogen-terminated Si(100) the ridges are oriented parallel to the < 110 > direct......Films of aluminum deposited onto Si(100) substrates show a surface structure of parallel ridges. On films deposited on oxidized silicon substrates the direction of the ridges is arbitrary, but on films deposited on hydrogen-terminated Si(100) the ridges are oriented parallel to the ... > directions on the silicon substrate. The ridge structure appears when the film thickness is above 500 nm, and increasing the film thickness makes the structure more distinct. Anodic oxidation enhances the structure even further. X-ray diffraction indicates that grains in the film have mostly (110) facets...

  1. Development of an empirical model of turbine efficiency using the Taylor expansion and regression analysis

    International Nuclear Information System (INIS)

    Fang, Xiande; Xu, Yu

    2011-01-01

    The empirical model of turbine efficiency is necessary for the control- and/or diagnosis-oriented simulation and useful for the simulation and analysis of dynamic performances of the turbine equipment and systems, such as air cycle refrigeration systems, power plants, turbine engines, and turbochargers. Existing empirical models of turbine efficiency are insufficient because there is no suitable form available for air cycle refrigeration turbines. This work performs a critical review of empirical models (called mean value models in some literature) of turbine efficiency and develops an empirical model in the desired form for air cycle refrigeration, the dominant cooling approach in aircraft environmental control systems. The Taylor series and regression analysis are used to build the model, with the Taylor series being used to expand functions with the polytropic exponent and the regression analysis to finalize the model. The measured data of a turbocharger turbine and two air cycle refrigeration turbines are used for the regression analysis. The proposed model is compact and able to present the turbine efficiency map. Its predictions agree with the measured data very well, with the corrected coefficient of determination R c 2 ≥ 0.96 and the mean absolute percentage deviation = 1.19% for the three turbines. -- Highlights: → Performed a critical review of empirical models of turbine efficiency. → Developed an empirical model in the desired form for air cycle refrigeration, using the Taylor expansion and regression analysis. → Verified the method for developing the empirical model. → Verified the model.

  2. Tectonics of ridge-transform intersections at the Kane fracture zone

    Science.gov (United States)

    Karson, J. A.; Dick, H. J. B.

    1983-03-01

    The Kane Transform offsets spreading-center segments of the Mid-Atlantic Ridge by about 150 km at 24° N latitude. In terms of its first-order morphological, geological, and geophysical characteristics it appears to be typical of long-offset (>100 km), slow-slipping (2 cm yr-1) ridge-ridge transform faults. High-resolution geological observations were made from deep-towed ANGUS photographs and the manned submersible ALVIN at the ridge-transform intersections and indicate similar relationships in these two regions. These data indicate that over a distance of about 20 km as the spreading axes approach the fracture zone, the two flanks of each ridge axis behave in very different ways. Along the flanks that intersect the active transform zone the rift valley floor deepens and the surface expression of volcanism becomes increasingly narrow and eventually absent at the intersection where only a sediment-covered ‘nodal basin’ exists. The adjacent median valley walls have structural trends that are oblique to both the ridge and the transform and have as much as 4 km of relief. These are tectonically active regions that have only a thin (young volcanics passes laterally into median valley walls with a simple block-faulted character where only volcanic rocks have been found. Along strike toward the fracture zone, the youngest volcanics form linear constructional volcanic ridges that transect the entire width of the fracture zone valley. These volcanics are continuous with the older-looking, slightly faulted volcanic terrain that floors the non-transform fracture zone valleys. These observations document the asymmetric nature of seafloor spreading near ridge-transform intersections. An important implication is that the crust and lithosphere across different portions of the fracture zone will have different geological characteristics. Across the active transform zone two lithosphere plate edges formed at ridge-transform corners are faulted against one another. In the non

  3. Linear regression in astronomy. II

    Science.gov (United States)

    Feigelson, Eric D.; Babu, Gutti J.

    1992-01-01

    A wide variety of least-squares linear regression procedures used in observational astronomy, particularly investigations of the cosmic distance scale, are presented and discussed. The classes of linear models considered are (1) unweighted regression lines, with bootstrap and jackknife resampling; (2) regression solutions when measurement error, in one or both variables, dominates the scatter; (3) methods to apply a calibration line to new data; (4) truncated regression models, which apply to flux-limited data sets; and (5) censored regression models, which apply when nondetections are present. For the calibration problem we develop two new procedures: a formula for the intercept offset between two parallel data sets, which propagates slope errors from one regression to the other; and a generalization of the Working-Hotelling confidence bands to nonstandard least-squares lines. They can provide improved error analysis for Faber-Jackson, Tully-Fisher, and similar cosmic distance scale relations.

  4. Bayesian Bandwidth Selection for a Nonparametric Regression Model with Mixed Types of Regressors

    Directory of Open Access Journals (Sweden)

    Xibin Zhang

    2016-04-01

    Full Text Available This paper develops a sampling algorithm for bandwidth estimation in a nonparametric regression model with continuous and discrete regressors under an unknown error density. The error density is approximated by the kernel density estimator of the unobserved errors, while the regression function is estimated using the Nadaraya-Watson estimator admitting continuous and discrete regressors. We derive an approximate likelihood and posterior for bandwidth parameters, followed by a sampling algorithm. Simulation results show that the proposed approach typically leads to better accuracy of the resulting estimates than cross-validation, particularly for smaller sample sizes. This bandwidth estimation approach is applied to nonparametric regression model of the Australian All Ordinaries returns and the kernel density estimation of gross domestic product (GDP growth rates among the organisation for economic co-operation and development (OECD and non-OECD countries.

  5. Multiresponse semiparametric regression for modelling the effect of regional socio-economic variables on the use of information technology

    Science.gov (United States)

    Wibowo, Wahyu; Wene, Chatrien; Budiantara, I. Nyoman; Permatasari, Erma Oktania

    2017-03-01

    Multiresponse semiparametric regression is simultaneous equation regression model and fusion of parametric and nonparametric model. The regression model comprise several models and each model has two components, parametric and nonparametric. The used model has linear function as parametric and polynomial truncated spline as nonparametric component. The model can handle both linearity and nonlinearity relationship between response and the sets of predictor variables. The aim of this paper is to demonstrate the application of the regression model for modeling of effect of regional socio-economic on use of information technology. More specific, the response variables are percentage of households has access to internet and percentage of households has personal computer. Then, predictor variables are percentage of literacy people, percentage of electrification and percentage of economic growth. Based on identification of the relationship between response and predictor variable, economic growth is treated as nonparametric predictor and the others are parametric predictors. The result shows that the multiresponse semiparametric regression can be applied well as indicate by the high coefficient determination, 90 percent.

  6. Multi-model ensemble combinations of the water budget in the East/Japan Sea

    Science.gov (United States)

    HAN, S.; Hirose, N.; Usui, N.; Miyazawa, Y.

    2016-02-01

    The water balance of East/Japan Sea is determined mainly by inflow and outflow through the Korea/Tsushima, Tsugaru and Soya/La Perouse Straits. However, the volume transports measured at three straits remain quantitatively unbalanced. This study examined the seasonal variation of the volume transport using the multiple linear regression and ridge regression of multi-model ensemble (MME) methods to estimate physically consistent circulation in East/Japan Sea by using four different data assimilation models. The MME outperformed all of the single models by reducing uncertainties, especially the multicollinearity problem with the ridge regression. However, the regression constants turned out to be inconsistent with each other if the MME was applied separately for each strait. The MME for a connected system was thus performed to find common constants for these straits. The estimation of this MME was found to be similar to the MME result of sea level difference (SLD). The estimated mean transport (2.42 Sv) was smaller than the measurement data at the Korea/Tsushima Strait, but the calibrated transport of the Tsugaru Strait (1.63 Sv) was larger than the observed data. The MME results of transport and SLD also suggested that the standard deviation (STD) of the Korea/Tsushima Strait is larger than the STD of the observation, whereas the estimated results were almost identical to that observed for the Tsugaru and Soya/La Perouse Straits. The similarity between MME results enhances the reliability of the present MME estimation.

  7. A hydrologic regression sediment-yield model for two ungaged watershed outlet stations in Africa

    International Nuclear Information System (INIS)

    Moussa, O.M.; Smith, S.E.; Shrestha, R.L.

    1991-01-01

    A hydrologic regression sediment-yield model was established to determine the relationship between water discharge and suspended sediment discharge at the Blue Nile and the Atbara River outlet stations during the flood season. The model consisted of two main submodels: (1) a suspended sediment discharge model, which was used to determine suspended sediment discharge for each basin outlet; and (2) a sediment rating model, which related water discharge and suspended sediment discharge for each outlet station. Due to the absence of suspended sediment concentration measurements at or near the outlet stations, a minimum norm solution, which is based on the minimization of the unknowns rather than the residuals, was used to determine the suspended sediment discharges at the stations. In addition, the sediment rating submodel was regressed by using an observation equations procedure. Verification analyses on the model were carried out and the mean percentage errors were found to be +12.59 and -12.39, respectively, for the Blue Nile and Atbara. The hydrologic regression model was found to be most sensitive to the relative weight matrix, moderately sensitive to the mean water discharge ratio, and slightly sensitive to the concentration variation along the River Nile's course

  8. Understanding poisson regression.

    Science.gov (United States)

    Hayat, Matthew J; Higgins, Melinda

    2014-04-01

    Nurse investigators often collect study data in the form of counts. Traditional methods of data analysis have historically approached analysis of count data either as if the count data were continuous and normally distributed or with dichotomization of the counts into the categories of occurred or did not occur. These outdated methods for analyzing count data have been replaced with more appropriate statistical methods that make use of the Poisson probability distribution, which is useful for analyzing count data. The purpose of this article is to provide an overview of the Poisson distribution and its use in Poisson regression. Assumption violations for the standard Poisson regression model are addressed with alternative approaches, including addition of an overdispersion parameter or negative binomial regression. An illustrative example is presented with an application from the ENSPIRE study, and regression modeling of comorbidity data is included for illustrative purposes. Copyright 2014, SLACK Incorporated.

  9. Internal doses in Oak Ridge. The Internet beams

    International Nuclear Information System (INIS)

    Passchier, W.F.

    1997-01-01

    A brief overview is given of the information, presented by the Radiation Internal Dose Information Center (RIDIC) of the Oak Ridge Associated Universities in Oak Ridge, TN, USA, via Internet (www.orau.gov/ehsd/ridic.htm)

  10. Optical dating of dune ridges on Rømø

    DEFF Research Database (Denmark)

    Madsen, Anni Tindahl; Murray, A. S.; Andersen, Thorbjørn Joest

    2007-01-01

    The application of optically stimulated luminescence (OSL) to the dating of recent aeolian sand ridges on Rømø, an island off the southwest coast of Denmark, is tested. These sand ridges began to form approximately 300 years ago, and estimates of the ages are available from historical records....... Samples for OSL dating were taken ~0.5 m below the crests of four different dune ridges; at least five samples were recovered from each ridge to test the internal consistency of the ages. Additional samples were recovered from the low lying areas in the swales and from the scattered dune formations......-defined building phases separated by inactive periods and the first major ridge formed ~235 years ago. This study demonstrates that optical dating can be successfully applied to these young aeolian sand deposits, and we conclude that OSL dating is a powerful chronological tool in studies of coastal change....

  11. An adaptive two-stage analog/regression model for probabilistic prediction of small-scale precipitation in France

    Directory of Open Access Journals (Sweden)

    J. Chardon

    2018-01-01

    Full Text Available Statistical downscaling models (SDMs are often used to produce local weather scenarios from large-scale atmospheric information. SDMs include transfer functions which are based on a statistical link identified from observations between local weather and a set of large-scale predictors. As physical processes driving surface weather vary in time, the most relevant predictors and the regression link are likely to vary in time too. This is well known for precipitation for instance and the link is thus often estimated after some seasonal stratification of the data. In this study, we present a two-stage analog/regression model where the regression link is estimated from atmospheric analogs of the current prediction day. Atmospheric analogs are identified from fields of geopotential heights at 1000 and 500 hPa. For the regression stage, two generalized linear models are further used to model the probability of precipitation occurrence and the distribution of non-zero precipitation amounts, respectively. The two-stage model is evaluated for the probabilistic prediction of small-scale precipitation over France. It noticeably improves the skill of the prediction for both precipitation occurrence and amount. As the analog days vary from one prediction day to another, the atmospheric predictors selected in the regression stage and the value of the corresponding regression coefficients can vary from one prediction day to another. The model allows thus for a day-to-day adaptive and tailored downscaling. It can also reveal specific predictors for peculiar and non-frequent weather configurations.

  12. A test of inflated zeros for Poisson regression models.

    Science.gov (United States)

    He, Hua; Zhang, Hui; Ye, Peng; Tang, Wan

    2017-01-01

    Excessive zeros are common in practice and may cause overdispersion and invalidate inference when fitting Poisson regression models. There is a large body of literature on zero-inflated Poisson models. However, methods for testing whether there are excessive zeros are less well developed. The Vuong test comparing a Poisson and a zero-inflated Poisson model is commonly applied in practice. However, the type I error of the test often deviates seriously from the nominal level, rendering serious doubts on the validity of the test in such applications. In this paper, we develop a new approach for testing inflated zeros under the Poisson model. Unlike the Vuong test for inflated zeros, our method does not require a zero-inflated Poisson model to perform the test. Simulation studies show that when compared with the Vuong test our approach not only better at controlling type I error rate, but also yield more power.

  13. Long-term seismicity of the Reykjanes Ridge (North Atlantic) recorded by a regional hydrophone array

    Science.gov (United States)

    Goslin, Jean; Lourenço, Nuno; Dziak, Robert P.; Bohnenstiehl, DelWayne R.; Haxel, Joe; Luis, Joaquim

    2005-08-01

    The seismicity of the northern Mid-Atlantic Ridge was recorded by two hydrophone networks moored in the sound fixing and ranging (SOFAR) channel, on the flanks of the Mid-Atlantic Ridge, north and south of the Azores. During its period of operation (05/2002-09/2003), the northern `SIRENA' network, deployed between latitudes 40° 20'N and 50° 30'N, recorded acoustic signals generated by 809 earthquakes on the hotspot-influenced Reykjanes Ridge. This activity was distributed between five spatio-temporal event clusters, each initiated by a moderate-to-large magnitude (4.0-5.6 M) earthquake. The rate of earthquake occurrence within the initial portion of the largest sequence (which began on 2002 October 6) is described adequately by a modified Omori law aftershock model. Although this is consistent with triggering by tectonic processes, none of the Reykjanes Ridge sequences are dominated by a single large-magnitude earthquake, and they appear to be of relatively short duration (0.35-4.5 d) when compared to previously described mid-ocean ridge aftershock sequences. The occurrence of several near-equal magnitude events distributed throughout each sequence is inconsistent with the simple relaxation of mainshock-induced stresses and may reflect the involvement of magmatic or fluid processes along this deep (>2000 m) section of the Reykjanes Ridge.

  14. Ridge augmentation with soft tissue procedures in aesthetic dentistry: pre- and postoperative volume measurements with a new kind of moire technique

    Science.gov (United States)

    Studer, Stephan P.; Mueller, Ernst; Bucher, Alfred

    1993-09-01

    The aim of this paper was to measure the volume differences of operated alveolar ridge defects before and until 3 months post-surgically. Ten patients with ten localized alveolar ridge defects were operated on. Five alveolar ridge defects were corrected by using the full thickness onlay graft technique and the other five defects were operated by the subepithelial connective tissue graft technique. A strict standardized operation protocol was followed and all alveolar ridge defects were operated on by the same dental surgeon. Before as well as 1, 2, and 3 months after surgery the corrected defect was photographed and an impression was made by using an A-silicon material to produce a gypsum-cast model. The form of all these cast models was then measured using the moire technique. The three months result of ten cases shows that the form of the operated alveolar ridge defects, which were corrected by the subepithelial connective tissue graft technique are more stable compared to those which were operated on by the full thickness onlay graft technique. Localized alveolar ridge defects using the latter method does not show a form stability after 3 months post-surgically.

  15. Comparison of logistic regression and neural models in predicting the outcome of biopsy in breast cancer from MRI findings

    International Nuclear Information System (INIS)

    Abdolmaleki, P.; Yarmohammadi, M.; Gity, M.

    2004-01-01

    Background: We designed an algorithmic model based on regression analysis and a non-algorithmic model based on the Artificial Neural Network. Materials and methods: The ability of these models was compared together in clinical application to differentiate malignant from benign breast tumors in a study group of 161 patient's records. Each patient's record consisted of 6 subjective features extracted from MRI appearance. These findings were enclosed as features extracted for an Artificial Neural Network as well as a logistic regression model to predict biopsy outcome. After both models had been trained perfectly on samples (n=100), the validation samples (n=61) were presented to the trained network as well as the established logistic regression models. Finally, the diagnostic performance of models were compared to the that of the radiologist in terms of sensitivity, specificity and accuracy, using receiver operating characteristic curve analysis. Results: The average out put of the Artificial Neural Network yielded a perfect sensitivity (98%) and high accuracy (90%) similar to that one of an expert radiologist (96% and 92%) while specificity was smaller than that (67%) verses 80%). The output of the logistic regression model using significant features showed improvement in specificity from 60% for the logistic regression model using all features to 93% for the reduced logistic regression model, keeping the accuracy around 90%. Conclusion: Results show that Artificial Neural Network and logistic regression model prove the relationship between extracted morphological features and biopsy results. Using statistically significant variables reduced logistic regression model outperformed of Artificial Neural Network with remarkable specificity while keeping high sensitivity is achieved

  16. Order Selection for General Expression of Nonlinear Autoregressive Model Based on Multivariate Stepwise Regression

    Science.gov (United States)

    Shi, Jinfei; Zhu, Songqing; Chen, Ruwen

    2017-12-01

    An order selection method based on multiple stepwise regressions is proposed for General Expression of Nonlinear Autoregressive model which converts the model order problem into the variable selection of multiple linear regression equation. The partial autocorrelation function is adopted to define the linear term in GNAR model. The result is set as the initial model, and then the nonlinear terms are introduced gradually. Statistics are chosen to study the improvements of both the new introduced and originally existed variables for the model characteristics, which are adopted to determine the model variables to retain or eliminate. So the optimal model is obtained through data fitting effect measurement or significance test. The simulation and classic time-series data experiment results show that the method proposed is simple, reliable and can be applied to practical engineering.

  17. Modelling of binary logistic regression for obesity among secondary students in a rural area of Kedah

    Science.gov (United States)

    Kamaruddin, Ainur Amira; Ali, Zalila; Noor, Norlida Mohd.; Baharum, Adam; Ahmad, Wan Muhamad Amir W.

    2014-07-01

    Logistic regression analysis examines the influence of various factors on a dichotomous outcome by estimating the probability of the event's occurrence. Logistic regression, also called a logit model, is a statistical procedure used to model dichotomous outcomes. In the logit model the log odds of the dichotomous outcome is modeled as a linear combination of the predictor variables. The log odds ratio in logistic regression provides a description of the probabilistic relationship of the variables and the outcome. In conducting logistic regression, selection procedures are used in selecting important predictor variables, diagnostics are used to check that assumptions are valid which include independence of errors, linearity in the logit for continuous variables, absence of multicollinearity, and lack of strongly influential outliers and a test statistic is calculated to determine the aptness of the model. This study used the binary logistic regression model to investigate overweight and obesity among rural secondary school students on the basis of their demographics profile, medical history, diet and lifestyle. The results indicate that overweight and obesity of students are influenced by obesity in family and the interaction between a student's ethnicity and routine meals intake. The odds of a student being overweight and obese are higher for a student having a family history of obesity and for a non-Malay student who frequently takes routine meals as compared to a Malay student.

  18. Capacitance Regression Modelling Analysis on Latex from Selected Rubber Tree Clones

    International Nuclear Information System (INIS)

    Rosli, A D; Baharudin, R; Hashim, H; Khairuzzaman, N A; Mohd Sampian, A F; Abdullah, N E; Kamaru'zzaman, M; Sulaiman, M S

    2015-01-01

    This paper investigates the capacitance regression modelling performance of latex for various rubber tree clones, namely clone 2002, 2008, 2014 and 3001. Conventionally, the rubber tree clones identification are based on observation towards tree features such as shape of leaf, trunk, branching habit and pattern of seeds texture. The former method requires expert persons and very time-consuming. Currently, there is no sensing device based on electrical properties that can be employed to measure different clones from latex samples. Hence, with a hypothesis that the dielectric constant of each clone varies, this paper discusses the development of a capacitance sensor via Capacitance Comparison Bridge (known as capacitance sensor) to measure an output voltage of different latex samples. The proposed sensor is initially tested with 30ml of latex sample prior to gradually addition of dilution water. The output voltage and capacitance obtained from the test are recorded and analyzed using Simple Linear Regression (SLR) model. This work outcome infers that latex clone of 2002 has produced the highest and reliable linear regression line with determination coefficient of 91.24%. In addition, the study also found that the capacitive elements in latex samples deteriorate if it is diluted with higher volume of water. (paper)

  19. Equatorial segment of the mid-atlantic ridge

    Energy Technology Data Exchange (ETDEWEB)

    NONE

    1997-12-31

    The Equatorial Segment of the Mid-Atlantic Ridge is a part of this mid-oceanic ridge limited by a cluster of fracture zones - Cape Verde, Marathon, Mercury, Vema, Doldrums, Vernadsky and Sierra Leone - in the North, and a similar cluster of fracture zones - St Paul, Romanche and Chain - in the South. During recent decades, following the publication of the 5. edition of the General Bathymetric Chart of the Oceans (GEBCO), there has been a great deal of geological-geophysical research and mapping of the World Ocean. The results have led to the development of a number of theories concerning the essential heterogeneity of the structure of the ocean floor and, in particular, the heterogeneity of the structure and segmentation of mid-oceanic ridges. Research on the nature of such segmentation is of great importance for an understanding of the processes of development of such ridges and oceanic basins as a whole. Chapter 20 is dedicated to the study of the atlantic ocean mantle by using (Th.U)Th, (Th/U)pb and K/Ti systematics 380 refs.

  20. Equatorial segment of the mid-atlantic ridge

    International Nuclear Information System (INIS)

    1996-01-01

    The Equatorial Segment of the Mid-Atlantic Ridge is a part of this mid-oceanic ridge limited by a cluster of fracture zones - Cape Verde, Marathon, Mercury, Vema, Doldrums, Vernadsky and Sierra Leone - in the North, and a similar cluster of fracture zones - St Paul, Romanche and Chain - in the South. During recent decades, following the publication of the 5. edition of the General Bathymetric Chart of the Oceans (GEBCO), there has been a great deal of geological-geophysical research and mapping of the World Ocean. The results have led to the development of a number of theories concerning the essential heterogeneity of the structure of the ocean floor and, in particular, the heterogeneity of the structure and segmentation of mid-oceanic ridges. Research on the nature of such segmentation is of great importance for an understanding of the processes of development of such ridges and oceanic basins as a whole. Chapter 20 is dedicated to the study of the atlantic ocean mantle by using (Th.U)Th, (Th/U)pb and K/Ti systematics