WorldWideScience

Sample records for regression analyses yielded

  1. Correlation, Regression and Path Analyses of Seed Yield Components in Crambe abyssinica, a Promising Industrial Oil Crop

    OpenAIRE

    Huang, Banglian; Yang, Yiming; Luo, Tingting; Wu, S.; Du, Xuezhu; Cai, Detian; Loo, van, E.N.; Huang Bangquan

    2013-01-01

    In the present study correlation, regression and path analyses were carried out to decide correlations among the agro- nomic traits and their contributions to seed yield per plant in Crambe abyssinica. Partial correlation analysis indicated that plant height (X1) was significantly correlated with branching height and the number of first branches (P <0.01); Branching height (X2) was significantly correlated with pod number of primary inflorescence (P <0.01) and number of secondary branch...

  2. [Milk yield and environmental factors: Multiple regression analysis of the association between milk yield and udder health, fertility data and replacement rate].

    Science.gov (United States)

    Fölsche, C; Staufenbiel, R

    2014-01-01

    The relationship between milk yield and both fertility and general animal health in dairy herds is discussed from opposing viewpoints. The hypothesis (1) that raising the herd milk yield would decrease fertility results, the number of milk cells as an indicator for udder health and the replacement rate as a global indicator for animal health as well as increasing the occurrence of specific diseases as a herd problem was compared to the opposing hypotheses that there is no relationship (2) or that there is a differentiated and changing relationship (3). A total of 743 herd examinations, considered independent, were performed in 489 herds between 1995 and 2010. The milk yield, fertility rate, milk cell count, replacement rate, categorized herd problems and management information were recorded. The relationship between the milk yield and both the fertility data and animal health was evaluated using simple and multiple regression analyses. The period between calving and the first service displayed no significant relationship to the herd milk yield. Simple regression analysis showed that the period between calving and gestation, the calving interval and the insemination number were significantly positively associated with the herd milk yield. This positive correlation was lost in multiple regression analysis. The milk cell count and replacement rate using both the simple and multiple regression analyses displayed a significant negative relationship to the milk yield. The alternative hypothesis (3) was confirmed. A higher milk yield has no negative influence on the milk cell count and the replacement rate in terms of the udder and general health. When parameterizing the fertility, the herd milk yield should be considered. Extending the resting time may increase the milk yield while preventing a decline in the insemination index.

  3. Principal component regression for crop yield estimation

    CERN Document Server

    Suryanarayana, T M V

    2016-01-01

    This book highlights the estimation of crop yield in Central Gujarat, especially with regard to the development of Multiple Regression Models and Principal Component Regression (PCR) models using climatological parameters as independent variables and crop yield as a dependent variable. It subsequently compares the multiple linear regression (MLR) and PCR results, and discusses the significance of PCR for crop yield estimation. In this context, the book also covers Principal Component Analysis (PCA), a statistical procedure used to reduce a number of correlated variables into a smaller number of uncorrelated variables called principal components (PC). This book will be helpful to the students and researchers, starting their works on climate and agriculture, mainly focussing on estimation models. The flow of chapters takes the readers in a smooth path, in understanding climate and weather and impact of climate change, and gradually proceeds towards downscaling techniques and then finally towards development of ...

  4. Geoelectrical parameter-based multivariate regression borehole yield model for predicting aquifer yield in managing groundwater resource sustainability

    Directory of Open Access Journals (Sweden)

    Kehinde Anthony Mogaji

    2016-07-01

    Full Text Available This study developed a GIS-based multivariate regression (MVR yield rate prediction model of groundwater resource sustainability in the hard-rock geology terrain of southwestern Nigeria. This model can economically manage the aquifer yield rate potential predictions that are often overlooked in groundwater resources development. The proposed model relates the borehole yield rate inventory of the area to geoelectrically derived parameters. Three sets of borehole yield rate conditioning geoelectrically derived parameters—aquifer unit resistivity (ρ, aquifer unit thickness (D and coefficient of anisotropy (λ—were determined from the acquired and interpreted geophysical data. The extracted borehole yield rate values and the geoelectrically derived parameter values were regressed to develop the MVR relationship model by applying linear regression and GIS techniques. The sensitivity analysis results of the MVR model evaluated at P ⩽ 0.05 for the predictors ρ, D and λ provided values of 2.68 × 10−05, 2 × 10−02 and 2.09 × 10−06, respectively. The accuracy and predictive power tests conducted on the MVR model using the Theil inequality coefficient measurement approach, coupled with the sensitivity analysis results, confirmed the model yield rate estimation and prediction capability. The MVR borehole yield prediction model estimates were processed in a GIS environment to model an aquifer yield potential prediction map of the area. The information on the prediction map can serve as a scientific basis for predicting aquifer yield potential rates relevant in groundwater resources sustainability management. The developed MVR borehole yield rate prediction mode provides a good alternative to other methods used for this purpose.

  5. Statistical power analyses using G*Power 3.1: tests for correlation and regression analyses.

    Science.gov (United States)

    Faul, Franz; Erdfelder, Edgar; Buchner, Axel; Lang, Albert-Georg

    2009-11-01

    G*Power is a free power analysis program for a variety of statistical tests. We present extensions and improvements of the version introduced by Faul, Erdfelder, Lang, and Buchner (2007) in the domain of correlation and regression analyses. In the new version, we have added procedures to analyze the power of tests based on (1) single-sample tetrachoric correlations, (2) comparisons of dependent correlations, (3) bivariate linear regression, (4) multiple linear regression based on the random predictor model, (5) logistic regression, and (6) Poisson regression. We describe these new features and provide a brief introduction to their scope and handling.

  6. A hydrologic regression sediment-yield model for two ungaged watershed outlet stations in Africa

    International Nuclear Information System (INIS)

    Moussa, O.M.; Smith, S.E.; Shrestha, R.L.

    1991-01-01

    A hydrologic regression sediment-yield model was established to determine the relationship between water discharge and suspended sediment discharge at the Blue Nile and the Atbara River outlet stations during the flood season. The model consisted of two main submodels: (1) a suspended sediment discharge model, which was used to determine suspended sediment discharge for each basin outlet; and (2) a sediment rating model, which related water discharge and suspended sediment discharge for each outlet station. Due to the absence of suspended sediment concentration measurements at or near the outlet stations, a minimum norm solution, which is based on the minimization of the unknowns rather than the residuals, was used to determine the suspended sediment discharges at the stations. In addition, the sediment rating submodel was regressed by using an observation equations procedure. Verification analyses on the model were carried out and the mean percentage errors were found to be +12.59 and -12.39, respectively, for the Blue Nile and Atbara. The hydrologic regression model was found to be most sensitive to the relative weight matrix, moderately sensitive to the mean water discharge ratio, and slightly sensitive to the concentration variation along the River Nile's course

  7. Screening for ketosis using multiple logistic regression based on milk yield and composition.

    Science.gov (United States)

    Kayano, Mitsunori; Kataoka, Tomoko

    2015-11-01

    Multiple logistic regression was applied to milk yield and composition data for 632 records of healthy cows and 61 records of ketotic cows in Hokkaido, Japan. The purpose was to diagnose ketosis based on milk yield and composition, simultaneously. The cows were divided into two groups: (1) multiparous, including 314 healthy cows and 45 ketotic cows and (2) primiparous, including 318 healthy cows and 16 ketotic cows, since nutritional status, milk yield and composition are affected by parity. Multiple logistic regression was applied to these groups separately. For multiparous cows, milk yield (kg/day/cow) and protein-to-fat (P/F) ratio in milk were significant factors (Pketosis. For primiparous cows, lactose content (%), solid not fat (SNF) content (%) and milk urea nitrogen (MUN) content (mg/dl) were significantly associated with ketosis (Pketosis, provided the sensitivity, specificity and AUC values of (1) 0.711, 0.726 and 0.781; and (2) 0.678, 0.767 and 0.738, respectively.

  8. A Technique of Fuzzy C-Mean in Multiple Linear Regression Model toward Paddy Yield

    Science.gov (United States)

    Syazwan Wahab, Nur; Saifullah Rusiman, Mohd; Mohamad, Mahathir; Amira Azmi, Nur; Che Him, Norziha; Ghazali Kamardan, M.; Ali, Maselan

    2018-04-01

    In this paper, we propose a hybrid model which is a combination of multiple linear regression model and fuzzy c-means method. This research involved a relationship between 20 variates of the top soil that are analyzed prior to planting of paddy yields at standard fertilizer rates. Data used were from the multi-location trials for rice carried out by MARDI at major paddy granary in Peninsular Malaysia during the period from 2009 to 2012. Missing observations were estimated using mean estimation techniques. The data were analyzed using multiple linear regression model and a combination of multiple linear regression model and fuzzy c-means method. Analysis of normality and multicollinearity indicate that the data is normally scattered without multicollinearity among independent variables. Analysis of fuzzy c-means cluster the yield of paddy into two clusters before the multiple linear regression model can be used. The comparison between two method indicate that the hybrid of multiple linear regression model and fuzzy c-means method outperform the multiple linear regression model with lower value of mean square error.

  9. Climate Impacts on Chinese Corn Yields: A Fractional Polynomial Regression Model

    NARCIS (Netherlands)

    Kooten, van G.C.; Sun, Baojing

    2012-01-01

    In this study, we examine the effect of climate on corn yields in northern China using data from ten districts in Inner Mongolia and two in Shaanxi province. A regression model with a flexible functional form is specified, with explanatory variables that include seasonal growing degree days,

  10. Comparison of Regression Techniques to Predict Response of Oilseed Rape Yield to Variation in Climatic Conditions in Denmark

    DEFF Research Database (Denmark)

    Sharif, Behzad; Makowski, David; Plauborg, Finn

    2017-01-01

    Statistical regression models represent alternatives to process-based dynamic models for predicting the response of crop yields to variation in climatic conditions. Regression models can be used to quantify the effect of change in temperature and precipitation on yields. However, it is difficult ...

  11. Relationship between rice yield and climate variables in southwest Nigeria using multiple linear regression and support vector machine analysis

    Science.gov (United States)

    Oguntunde, Philip G.; Lischeid, Gunnar; Dietrich, Ottfried

    2018-03-01

    This study examines the variations of climate variables and rice yield and quantifies the relationships among them using multiple linear regression, principal component analysis, and support vector machine (SVM) analysis in southwest Nigeria. The climate and yield data used was for a period of 36 years between 1980 and 2015. Similar to the observed decrease ( P 1 and explained 83.1% of the total variance of predictor variables. The SVM regression function using the scores of the first principal component explained about 75% of the variance in rice yield data and linear regression about 64%. SVM regression between annual solar radiation values and yield explained 67% of the variance. Only the first component of the principal component analysis (PCA) exhibited a clear long-term trend and sometimes short-term variance similar to that of rice yield. Short-term fluctuations of the scores of the PC1 are closely coupled to those of rice yield during the 1986-1993 and the 2006-2013 periods thereby revealing the inter-annual sensitivity of rice production to climate variability. Solar radiation stands out as the climate variable of highest influence on rice yield, and the influence was especially strong during monsoon and post-monsoon periods, which correspond to the vegetative, booting, flowering, and grain filling stages in the study area. The outcome is expected to provide more in-depth regional-specific climate-rice linkage for screening of better cultivars that can positively respond to future climate fluctuations as well as providing information that may help optimized planting dates for improved radiation use efficiency in the study area.

  12. USE OF THE SIMPLE LINEAR REGRESSION MODEL IN MACRO-ECONOMICAL ANALYSES

    Directory of Open Access Journals (Sweden)

    Constantin ANGHELACHE

    2011-10-01

    Full Text Available The article presents the fundamental aspects of the linear regression, as a toolbox which can be used in macroeconomic analyses. The article describes the estimation of the parameters, the statistical tests used, the homoscesasticity and heteroskedasticity. The use of econometrics instrument in macroeconomics is an important factor that guarantees the quality of the models, analyses, results and possible interpretation that can be drawn at this level.

  13. Relationship between rice yield and climate variables in southwest Nigeria using multiple linear regression and support vector machine analysis.

    Science.gov (United States)

    Oguntunde, Philip G; Lischeid, Gunnar; Dietrich, Ottfried

    2018-03-01

    This study examines the variations of climate variables and rice yield and quantifies the relationships among them using multiple linear regression, principal component analysis, and support vector machine (SVM) analysis in southwest Nigeria. The climate and yield data used was for a period of 36 years between 1980 and 2015. Similar to the observed decrease (P  1 and explained 83.1% of the total variance of predictor variables. The SVM regression function using the scores of the first principal component explained about 75% of the variance in rice yield data and linear regression about 64%. SVM regression between annual solar radiation values and yield explained 67% of the variance. Only the first component of the principal component analysis (PCA) exhibited a clear long-term trend and sometimes short-term variance similar to that of rice yield. Short-term fluctuations of the scores of the PC1 are closely coupled to those of rice yield during the 1986-1993 and the 2006-2013 periods thereby revealing the inter-annual sensitivity of rice production to climate variability. Solar radiation stands out as the climate variable of highest influence on rice yield, and the influence was especially strong during monsoon and post-monsoon periods, which correspond to the vegetative, booting, flowering, and grain filling stages in the study area. The outcome is expected to provide more in-depth regional-specific climate-rice linkage for screening of better cultivars that can positively respond to future climate fluctuations as well as providing information that may help optimized planting dates for improved radiation use efficiency in the study area.

  14. Genetic Parameters for Body condition score, Body weigth, Milk yield and Fertility estimated using random regression models

    NARCIS (Netherlands)

    Berry, D.P.; Buckley, F.; Dillon, P.; Evans, R.D.; Rath, M.; Veerkamp, R.F.

    2003-01-01

    Genetic (co)variances between body condition score (BCS), body weight (BW), milk yield, and fertility were estimated using a random regression animal model extended to multivariate analysis. The data analyzed included 81,313 BCS observations, 91,937 BW observations, and 100,458 milk test-day yields

  15. Regression Association Analysis of Yield-Related Traits with RAPD Molecular Markers in Pistachio (Pistacia vera L.

    Directory of Open Access Journals (Sweden)

    Saeid Mirzaei

    2017-10-01

    Full Text Available Introduction: The pistachio (Pistacia vera, a member of the cashew family, is a small tree originating from Central Asia and the Middle East. The tree produces seeds that are widely consumed as food. Pistacia vera often is confused with other species in the genus Pistacia that are also known as pistachio. These other species can be distinguished by their geographic distributions and their seeds which are much smaller and have a soft shell. Continual advances in crop improvement through plant breeding are driven by the available genetic diversity. Therefore, the recognition and measurement of such diversity is crucial to breeding programs. In the past 20 years, the major effort in plant breeding has changed from quantitative to molecular genetics with emphasis on quantitative trait loci (QTL identification and marker assisted selection (MAS. The germplasm-regression-combined association studies not only allow mapping of genes/QTLs with higher level of confidence, but also allow detection of genes/QTLs, which will otherwise escape detection in linkage-based QTL studies based on the planned populations. The development of the marker-based technology offers a fast, reliable, and easy way to perform multiple regression analysis and comprise an alternative approach to breeding in diverse species of plants. The availability of many makers and morphological traits can help to regression analysis between these markers and morphological traits. Materials and Methods: In this study, 20 genotypes of Pistachio were studied and yield related traits were measured. Young well-expanded leaves were collected for DNA extraction and total genomic DNA was extracted. Genotyping was performed using 15 RAPD primers and PCR amplification products were visualized by gel electrophoresis. The reproducible RAPD fragments were scored on the basis of present (1 or absent (0 bands and a binary matrix constructed using each molecular marker. Association analysis between

  16. Genetic variability, partial regression, Co-heritability studies and their implication in selection of high yielding potato gen

    International Nuclear Information System (INIS)

    Iqbal, Z.M.; Khan, S.A.

    2003-01-01

    Partial regression coefficient, genotypic and phenotypic variabilities, heritability co-heritability and genetic advance were studied in 15 Potato varieties of exotic and local origin. Both genotypic and phenotypic coefficients of variations were high for scab and rhizoctonia incidence percentage. Significant partial regression coefficient for emergence percentage indicated its relative importance in tuber yield. High heritability (broadsense) estimates coupled with high genetic advance for plant height, number of stems per plant and scab percentage revealed substantial contribution of additive genetic variance in the expression of these traits. Hence, the selection based on these characters could play a significant role in their improvement the dominance and epistatic variance was more important for character expression of yield ha/sup -1/, emergence and rhizoctonia percentage. This phenomenon is mainly due to the accumulative effects of low heritability and low to moderate genetic advance. The high co-heritability coupled with negative genotypic and phenotypic covariance revealed that selection of varieties having low scab and rhizoctonia percentage resulted in more potato yield. (author)

  17. Plateletpheresis efficiency and mathematical correction of software-derived platelet yield prediction: A linear regression and ROC modeling approach.

    Science.gov (United States)

    Jaime-Pérez, José Carlos; Jiménez-Castillo, Raúl Alberto; Vázquez-Hernández, Karina Elizabeth; Salazar-Riojas, Rosario; Méndez-Ramírez, Nereida; Gómez-Almaguer, David

    2017-10-01

    Advances in automated cell separators have improved the efficiency of plateletpheresis and the possibility of obtaining double products (DP). We assessed cell processor accuracy of predicted platelet (PLT) yields with the goal of a better prediction of DP collections. This retrospective proof-of-concept study included 302 plateletpheresis procedures performed on a Trima Accel v6.0 at the apheresis unit of a hematology department. Donor variables, software predicted yield and actual PLT yield were statistically evaluated. Software prediction was optimized by linear regression analysis and its optimal cut-off to obtain a DP assessed by receiver operating characteristic curve (ROC) modeling. Three hundred and two plateletpheresis procedures were performed; in 271 (89.7%) occasions, donors were men and in 31 (10.3%) women. Pre-donation PLT count had the best direct correlation with actual PLT yield (r = 0.486. P Simple correction derived from linear regression analysis accurately corrected this underestimation and ROC analysis identified a precise cut-off to reliably predict a DP. © 2016 Wiley Periodicals, Inc.

  18. Genetic parameters for body condition score, body weight, milk yield, and fertility estimated using random regression models.

    Science.gov (United States)

    Berry, D P; Buckley, F; Dillon, P; Evans, R D; Rath, M; Veerkamp, R F

    2003-11-01

    Genetic (co)variances between body condition score (BCS), body weight (BW), milk yield, and fertility were estimated using a random regression animal model extended to multivariate analysis. The data analyzed included 81,313 BCS observations, 91,937 BW observations, and 100,458 milk test-day yields from 8725 multiparous Holstein-Friesian cows. A cubic random regression was sufficient to model the changing genetic variances for BCS, BW, and milk across different days in milk. The genetic correlations between BCS and fertility changed little over the lactation; genetic correlations between BCS and interval to first service and between BCS and pregnancy rate to first service varied from -0.47 to -0.31, and from 0.15 to 0.38, respectively. This suggests that maximum genetic gain in fertility from indirect selection on BCS should be based on measurements taken in midlactation when the genetic variance for BCS is largest. Selection for increased BW resulted in shorter intervals to first service, but more services and poorer pregnancy rates; genetic correlations between BW and pregnancy rate to first service varied from -0.52 to -0.45. Genetic selection for higher lactation milk yield alone through selection on increased milk yield in early lactation is likely to have a more deleterious effect on genetic merit for fertility than selection on higher milk yield in late lactation.

  19. Applications of MIDAS regression in analysing trends in water quality

    Science.gov (United States)

    Penev, Spiridon; Leonte, Daniela; Lazarov, Zdravetz; Mann, Rob A.

    2014-04-01

    We discuss novel statistical methods in analysing trends in water quality. Such analysis uses complex data sets of different classes of variables, including water quality, hydrological and meteorological. We analyse the effect of rainfall and flow on trends in water quality utilising a flexible model called Mixed Data Sampling (MIDAS). This model arises because of the mixed frequency in the data collection. Typically, water quality variables are sampled fortnightly, whereas the rain data is sampled daily. The advantage of using MIDAS regression is in the flexible and parsimonious modelling of the influence of the rain and flow on trends in water quality variables. We discuss the model and its implementation on a data set from the Shoalhaven Supply System and Catchments in the state of New South Wales, Australia. Information criteria indicate that MIDAS modelling improves upon simplistic approaches that do not utilise the mixed data sampling nature of the data.

  20. Climate-based statistical regression models for crop yield forecasting of coffee in humid tropical Kerala, India

    Science.gov (United States)

    Jayakumar, M.; Rajavel, M.; Surendran, U.

    2016-12-01

    A study on the variability of coffee yield of both Coffea arabica and Coffea canephora as influenced by climate parameters (rainfall (RF), maximum temperature (Tmax), minimum temperature (Tmin), and mean relative humidity (RH)) was undertaken at Regional Coffee Research Station, Chundale, Wayanad, Kerala State, India. The result on the coffee yield data of 30 years (1980 to 2009) revealed that the yield of coffee is fluctuating with the variations in climatic parameters. Among the species, productivity was higher for C. canephora coffee than C. arabica in most of the years. Maximum yield of C. canephora (2040 kg ha-1) was recorded in 2003-2004 and there was declining trend of yield noticed in the recent years. Similarly, the maximum yield of C. arabica (1745 kg ha-1) was recorded in 1988-1989 and decreased yield was noticed in the subsequent years till 1997-1998 due to year to year variability in climate. The highest correlation coefficient was found between the yield of C. arabica coffee and maximum temperature during January (0.7) and between C. arabica coffee yield and RH during July (0.4). Yield of C. canephora coffee had highest correlation with maximum temperature, RH and rainfall during February. Statistical regression model between selected climatic parameters and yield of C. arabica and C. canephora coffee was developed to forecast the yield of coffee in Wayanad district in Kerala. The model was validated for years 2010, 2011, and 2012 with the coffee yield data obtained during the years and the prediction was found to be good.

  1. Regression models of ultimate methane yields of fruits and vegetable solid wastes, sorghum and napiergrass on chemical composition

    Energy Technology Data Exchange (ETDEWEB)

    Gunaseelan, V.N. [PSG College of Arts and Science, Coimbatore (India). Department of Zoology

    2007-04-15

    Several fractions of fruits and vegetable solid wastes (FVSW), sorghum and napiergrass were analyzed for total solids (TS), volatile solids (VS), total organic carbon, total kjeldahl nitrogen, total soluble carbohydrate, extractable protein, acid-detergent fiber (ADF), lignin, cellulose and ash contents. Their ultimate methane yields (B{sub o}) were determined using the biochemical methane potential (BMP) assay. A series of simple and multiple regression models relating the B{sub o} to the various substrate constituents were generated and evaluated using computer statistical software, Statistical Package for Social Sciences (SPSS). The results of simple regression analyses revealed that, only weak relationship existed between the individual components such as carbohydrate, protein, ADF, lignin and cellulose versus B{sub o}. A regression of B{sub o} versus combination of two variables as a single independent variable such as carbohydrate/ADF and carbohydrate + protein/ADF also showed that the relationship is not strong. Thus it does not appear possible to relate the B{sub o} of FVSW, sorghum and napiergrass with single compositional characteristics. The results of multiple regression analyses showed promise and the relationship appeared to be good. When ADF and lignin/ADF were used as independent variables, the percentage of variation accounted for by the model is low for FVSW (r{sup 2}=0.665) and sorghum and napiergrass (r{sup 2}=0.746). Addition of nitrogen, ash and total soluble carbohydrate data to the model had a significantly higher effect on prediction of B{sub o} of these wastes with the r{sup 2} values ranging from 0.9 to 0.99. More than 90% of variation in B{sub o} of FVSW could be accounted for by the models when the variables carbohydrate, lignin, lignin/ADF, nitrogen and ash (r{sup 2}=0.904), carbohydrate, ADF, lignin/ADF, nitrogen and ash (r{sup 2}=0.90) and carbohydrate/ADF, lignin/ADF, lignin and ash (r{sup 2}=0.901) were used. All the models have

  2. Estimation of Genetic Parameters for First Lactation Monthly Test-day Milk Yields using Random Regression Test Day Model in Karan Fries Cattle

    Directory of Open Access Journals (Sweden)

    Ajay Singh

    2016-06-01

    Full Text Available A single trait linear mixed random regression test-day model was applied for the first time for analyzing the first lactation monthly test-day milk yield records in Karan Fries cattle. The test-day milk yield data was modeled using a random regression model (RRM considering different order of Legendre polynomial for the additive genetic effect (4th order and the permanent environmental effect (5th order. Data pertaining to 1,583 lactation records spread over a period of 30 years were recorded and analyzed in the study. The variance component, heritability and genetic correlations among test-day milk yields were estimated using RRM. RRM heritability estimates of test-day milk yield varied from 0.11 to 0.22 in different test-day records. The estimates of genetic correlations between different test-day milk yields ranged 0.01 (test-day 1 [TD-1] and TD-11 to 0.99 (TD-4 and TD-5. The magnitudes of genetic correlations between test-day milk yields decreased as the interval between test-days increased and adjacent test-day had higher correlations. Additive genetic and permanent environment variances were higher for test-day milk yields at both ends of lactation. The residual variance was observed to be lower than the permanent environment variance for all the test-day milk yields.

  3. Importance of growth characteristics for yield of barley in different growing systems: will growth characteristics describe yield diffently in different growing systems?

    DEFF Research Database (Denmark)

    Kristensen, Kristian; Ericson, Lars

    2008-01-01

    The interest in organic grown cereals has increased the need for variety tests under organic growing systems and/or the knowledge on whether growth characteristics describe yield differently under conventional and organic conditions. This paper is a contribution to that question by examining...... the relationships between some important growth characteristics in barley trials in both systems in Northern Sweden and in Denmark. Mixed model analyses were used for regressions of growth characteristics (or transformations of those) on yield (and log-transformed yield), allowing the slope to depend on the growing...... system. The analyses showed that diseases seemed to have a less negative effect on yield in the organic growing system than in the conventional growing system if pesticides were not applied. For other characteristics the effect depended on the country. This was the case for grain characteristics where...

  4. Estimation of genotype X environment interactions, in a grassbased system, for milk yield, body condition score,and body weight using random regression models

    NARCIS (Netherlands)

    Berry, D.P.; Buckley, F.; Dillon, P.; Evans, R.D.; Rath, M.; Veerkamp, R.F.

    2003-01-01

    (Co)variance components for milk yield, body condition score (BCS), body weight (BW), BCS change and BW change over different herd-year mean milk yields (HMY) and nutritional environments (concentrate feeding level, grazing severity and silage quality) were estimated using a random regression model.

  5. Genetic correlations among body condition score, yield, and fertility in first-parity cows estimated by random regression models.

    Science.gov (United States)

    Veerkamp, R F; Koenen, E P; De Jong, G

    2001-10-01

    Twenty type classifiers scored body condition (BCS) of 91,738 first-parity cows from 601 sires and 5518 maternal grandsires. Fertility data during first lactation were extracted for 177,220 cows, of which 67,278 also had a BCS observation, and first-lactation 305-d milk, fat, and protein yields were added for 180,631 cows. Heritabilities and genetic correlations were estimated using a sire-maternal grandsire model. Heritability of BCS was 0.38. Heritabilities for fertility traits were low (0.01 to 0.07), but genetic standard deviations were substantial, 9 d for days to first service and calving interval, 0.25 for number of services, and 5% for first-service conception. Phenotypic correlations between fertility and yield or BCS were small (-0.15 to 0.20). Genetic correlations between yield and all fertility traits were unfavorable (0.37 to 0.74). Genetic correlations with BCS were between -0.4 and -0.6 for calving interval and days to first service. Random regression analysis (RR) showed that correlations changed with days in milk for BCS. Little agreement was found between variances and correlations from RR, and analysis including a single month (mo 1 to 10) of data for BCS, especially during early and late lactation. However, this was due to excluding data from the conventional analysis, rather than due to the polynomials used. RR and a conventional five-traits model where BCS in mo 1, 4, 7, and 10 was treated as a separate traits (plus yield or fertility) gave similar results. Thus a parsimonious random regression model gave more realistic estimates for the (co)variances than a series of bivariate analysis on subsets of the data for BCS. A higher genetic merit for yield has unfavorable effects on fertility, but the genetic correlation suggests that BCS (at some stages of lactation) might help to alleviate the unfavorable effect of selection for higher yield on fertility.

  6. Genetic correlations among body condition score, yield and fertility in multiparous cows using random regression models

    OpenAIRE

    Bastin, Catherine; Gillon, Alain; Massart, Xavier; Bertozzi, Carlo; Vanderick, Sylvie; Gengler, Nicolas

    2010-01-01

    Genetic correlations between body condition score (BCS) in lactation 1 to 3 and four economically important traits (days open, 305-days milk, fat, and protein yields recorded in the first 3 lactations) were estimated on about 12,500 Walloon Holstein cows using 4-trait random regression models. Results indicated moderate favorable genetic correlations between BCS and days open (from -0.46 to -0.62) and suggested the use of BCS for indirect selection on fertility. However, unfavorable genetic c...

  7. Multicollinearity in Regression Analyses Conducted in Epidemiologic Studies.

    Science.gov (United States)

    Vatcheva, Kristina P; Lee, MinJae; McCormick, Joseph B; Rahbar, Mohammad H

    2016-04-01

    The adverse impact of ignoring multicollinearity on findings and data interpretation in regression analysis is very well documented in the statistical literature. The failure to identify and report multicollinearity could result in misleading interpretations of the results. A review of epidemiological literature in PubMed from January 2004 to December 2013, illustrated the need for a greater attention to identifying and minimizing the effect of multicollinearity in analysis of data from epidemiologic studies. We used simulated datasets and real life data from the Cameron County Hispanic Cohort to demonstrate the adverse effects of multicollinearity in the regression analysis and encourage researchers to consider the diagnostic for multicollinearity as one of the steps in regression analysis.

  8. How to deal with continuous and dichotomic outcomes in epidemiological research: linear and logistic regression analyses

    NARCIS (Netherlands)

    Tripepi, Giovanni; Jager, Kitty J.; Stel, Vianda S.; Dekker, Friedo W.; Zoccali, Carmine

    2011-01-01

    Because of some limitations of stratification methods, epidemiologists frequently use multiple linear and logistic regression analyses to address specific epidemiological questions. If the dependent variable is a continuous one (for example, systolic pressure and serum creatinine), the researcher

  9. A Mixed Application of Geographically Weighted Regression and Unsupervised Classification for Analyzing Latex Yield Variability in Yunnan, China

    Directory of Open Access Journals (Sweden)

    Oh Seok Kim

    2017-05-01

    Full Text Available This paper introduces a mixed method approach for analyzing the determinants of natural latex yields and the associated spatial variations and identifying the most suitable regions for producing latex. Geographically Weighted Regressions (GWR and Iterative Self-Organizing Data Analysis Technique (ISODATA are jointly applied to the georeferenced data points collected from the rubber plantations in Xishuangbanna (in Yunnan province, south China and other remotely-sensed spatial data. According to the GWR models, Age of rubber tree, Percent of clay in soil, Elevation, Solar radiation, Population, Distance from road, Distance from stream, Precipitation, and Mean temperature turn out statistically significant, indicating that these are the major determinants shaping latex yields at the prefecture level. However, the signs and magnitudes of the parameter estimates at the aggregate level are different from those at the lower spatial level, and the differences are due to diverse reasons. The ISODATA classifies the landscape into three categories: high, medium, and low potential yields. The map reveals that Mengla County has the majority of land with high potential yield, while Jinghong City and Menghai County show lower potential yield. In short, the mixed method can offer a means of providing greater insights in the prediction of agricultural production.

  10. Multiple linear regression and artificial neural networks for delta-endotoxin and protease yields modelling of Bacillus thuringiensis.

    Science.gov (United States)

    Ennouri, Karim; Ben Ayed, Rayda; Triki, Mohamed Ali; Ottaviani, Ennio; Mazzarello, Maura; Hertelli, Fathi; Zouari, Nabil

    2017-07-01

    The aim of the present work was to develop a model that supplies accurate predictions of the yields of delta-endotoxins and proteases produced by B. thuringiensis var. kurstaki HD-1. Using available medium ingredients as variables, a mathematical method, based on Plackett-Burman design (PB), was employed to analyze and compare data generated by the Bootstrap method and processed by multiple linear regressions (MLR) and artificial neural networks (ANN) including multilayer perceptron (MLP) and radial basis function (RBF) models. The predictive ability of these models was evaluated by comparison of output data through the determination of coefficient (R 2 ) and mean square error (MSE) values. The results demonstrate that the prediction of the yields of delta-endotoxin and protease was more accurate by ANN technique (87 and 89% for delta-endotoxin and protease determination coefficients, respectively) when compared with MLR method (73.1 and 77.2% for delta-endotoxin and protease determination coefficients, respectively), suggesting that the proposed ANNs, especially MLP, is a suitable new approach for determining yields of bacterial products that allow us to make more appropriate predictions in a shorter time and with less engineering effort.

  11. Measuring yield performance of upland cotton varieties using adaptability, stability and principal component analyses

    International Nuclear Information System (INIS)

    Baloch, M.J.

    2003-01-01

    Nine upland cotton varieties/strains were tested over 36 environments in Pakistan so as to determine their stability in yield performance. The regression coefficient (b) was used as a measure of adaptability, whereas parameters such as coefficient of determination (r2) and sum of squared deviations from regression (s/sup 2/d) were used as measure of stability. Although the regression coefficients (b) of all varieties did not deviate significantly from the unit slope, the varieties CRIS-5A. BII-89, DNH-40 and Rehmani gave b value closer to unity implying their better adaptation. Lower s/sub 2/d and higher r/sub 2/ of CRIS- 121 and DNH-40 suggest that both of these are fairly stable. The results indicate that, generally, adaptability and stability parameters are independent of each in as much as not all of the parameters simultaneously favoured one variety over the other excepting the variety DNH-40, which was stable based on majority of the parameters. Principal component analysis revealed that the first two components (latent roots) account for about 91.4% of the total variation. The latent vectors of first principal component (PCA1) were smaller and positive which also suggest that most of the varieties were quite adaptive to all of the test environments. (author)

  12. Heterogeneous global crop yield response to biochar: a meta-regression analysis

    International Nuclear Information System (INIS)

    Crane-Droesch, Andrew; Torn, Margaret S; Abiven, Samuel; Jeffery, Simon

    2013-01-01

    Biochar may contribute to climate change mitigation at negative cost by sequestering photosynthetically fixed carbon in soil while increasing crop yields. The magnitude of biochar’s potential in this regard will depend on crop yield benefits, which have not been well-characterized across different soils and biochars. Using data from 84 studies, we employ meta-analytical, missing data, and semiparametric statistical methods to explain heterogeneity in crop yield responses across different soils, biochars, and agricultural management factors, and then estimate potential changes in yield across different soil environments globally. We find that soil cation exchange capacity and organic carbon were strong predictors of yield response, with low cation exchange and low carbon associated with positive response. We also find that yield response increases over time since initial application, compared to non-biochar controls. High reported soil clay content and low soil pH were weaker predictors of higher yield response. No biochar parameters in our dataset—biochar pH, percentage carbon content, or temperature of pyrolysis—were significant predictors of yield impacts. Projecting our fitted model onto a global soil database, we find the largest potential increases in areas with highly weathered soils, such as those characterizing much of the humid tropics. Richer soils characterizing much of the world’s important agricultural areas appear to be less likely to benefit from biochar. (letter)

  13. Using Ridge Regression Models to Estimate Grain Yield from Field Spectral Data in Bread Wheat (Triticum Aestivum L. Grown under Three Water Regimes

    Directory of Open Access Journals (Sweden)

    Javier Hernandez

    2015-02-01

    Full Text Available Plant breeding based on grain yield (GY is an expensive and time-consuming method, so new indirect estimation techniques to evaluate the performance of crops represent an alternative method to improve grain yield. The present study evaluated the ability of canopy reflectance spectroscopy at the range from 350 to 2500 nm to predict GY in a large panel (368 genotypes of wheat (Triticum aestivum L. through multivariate ridge regression models. Plants were treated under three water regimes in the Mediterranean conditions of central Chile: severe water stress (SWS, rain fed, mild water stress (MWS; one irrigation event around booting and full irrigation (FI with mean GYs of 1655, 4739, and 7967 kg∙ha−1, respectively. Models developed from reflectance data during anthesis and grain filling under all water regimes explained between 77% and 91% of the GY variability, with the highest values in SWS condition. When individual models were used to predict yield in the rest of the trials assessed, models fitted during anthesis under MWS performed best. Combined models using data from different water regimes and each phenological stage were used to predict grain yield, and the coefficients of determination (R2 increased to 89.9% and 92.0% for anthesis and grain filling, respectively. The model generated during anthesis in MWS was the best at predicting yields when it was applied to other conditions. Comparisons against conventional reflectance indices were made, showing lower predictive abilities. It was concluded that a Ridge Regression Model using a data set based on spectral reflectance at anthesis or grain filling represents an effective method to predict grain yield in genotypes under different water regimes.

  14. The number of subjects per variable required in linear regression analyses.

    Science.gov (United States)

    Austin, Peter C; Steyerberg, Ewout W

    2015-06-01

    To determine the number of independent variables that can be included in a linear regression model. We used a series of Monte Carlo simulations to examine the impact of the number of subjects per variable (SPV) on the accuracy of estimated regression coefficients and standard errors, on the empirical coverage of estimated confidence intervals, and on the accuracy of the estimated R(2) of the fitted model. A minimum of approximately two SPV tended to result in estimation of regression coefficients with relative bias of less than 10%. Furthermore, with this minimum number of SPV, the standard errors of the regression coefficients were accurately estimated and estimated confidence intervals had approximately the advertised coverage rates. A much higher number of SPV were necessary to minimize bias in estimating the model R(2), although adjusted R(2) estimates behaved well. The bias in estimating the model R(2) statistic was inversely proportional to the magnitude of the proportion of variation explained by the population regression model. Linear regression models require only two SPV for adequate estimation of regression coefficients, standard errors, and confidence intervals. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  15. Statistical and regression analyses of detected extrasolar systems

    Czech Academy of Sciences Publication Activity Database

    Pintr, Pavel; Peřinová, V.; Lukš, A.; Pathak, A.

    2013-01-01

    Roč. 75, č. 1 (2013), s. 37-45 ISSN 0032-0633 Institutional support: RVO:61389021 Keywords : Exoplanets * Kepler candidates * Regression analysis Subject RIV: BN - Astronomy, Celestial Mechanics, Astrophysics Impact factor: 1.630, year: 2013 http://www.sciencedirect.com/science/article/pii/S0032063312003066

  16. Reducing Inter-Laboratory Differences between Semen Analyses Using Z Score and Regression Transformations

    Directory of Open Access Journals (Sweden)

    Esther Leushuis

    2016-12-01

    Full Text Available Background: Standardization of the semen analysis may improve reproducibility. We assessed variability between laboratories in semen analyses and evaluated whether a transformation using Z scores and regression statistics was able to reduce this variability. Materials and Methods: We performed a retrospective cohort study. We calculated between-laboratory coefficients of variation (CVB for sperm concentration and for morphology. Subsequently, we standardized the semen analysis results by calculating laboratory specific Z scores, and by using regression. We used analysis of variance for four semen parameters to assess systematic differences between laboratories before and after the transformations, both in the circulation samples and in the samples obtained in the prospective cohort study in the Netherlands between January 2002 and February 2004. Results: The mean CVB was 7% for sperm concentration (range 3 to 13% and 32% for sperm morphology (range 18 to 51%. The differences between the laboratories were statistically significant for all semen parameters (all P<0.001. Standardization using Z scores did not reduce the differences in semen analysis results between the laboratories (all P<0.001. Conclusion: There exists large between-laboratory variability for sperm morphology and small, but statistically significant, between-laboratory variation for sperm concentration. Standardization using Z scores does not eliminate between-laboratory variability.

  17. Partial Least Squares Regression for Determining the Control Factors for Runoff and Suspended Sediment Yield during Rainfall Events

    Directory of Open Access Journals (Sweden)

    Nufang Fang

    2015-07-01

    Full Text Available Multivariate statistics are commonly used to identify the factors that control the dynamics of runoff or sediment yields during hydrological processes. However, one issue with the use of conventional statistical methods to address relationships between variables and runoff or sediment yield is multicollinearity. The main objectives of this study were to apply a method for effectively identifying runoff and sediment control factors during hydrological processes and apply that method to a case study. The method combines the clustering approach and partial least squares regression (PLSR models. The case study was conducted in a mountainous watershed in the Three Gorges Area. A total of 29 flood events in three hydrological years in areas with different land uses were obtained. In total, fourteen related variables were separated from hydrographs using the classical hydrograph separation method. Twenty-nine rainfall events were classified into two rainfall regimes (heavy Rainfall Regime I and moderate Rainfall Regime II based on rainfall characteristics and K-means clustering. Four separate PLSR models were constructed to identify the main variables that control runoff and sediment yield for the two rainfall regimes. For Rainfall Regime I, the dominant first-order factors affecting the changes in sediment yield in our study were all of the four rainfall-related variables, flood peak discharge, maximum flood suspended sediment concentration, runoff, and the percentages of forest and farmland. For Rainfall Regime II, antecedent condition-related variables have more effects on both runoff and sediment yield than in Rainfall Regime I. The results suggest that the different control factors of the two rainfall regimes are determined by the rainfall characteristics and thus different runoff mechanisms.

  18. Consequences of kriging and land use regression for PM2.5 predictions in epidemiologic analyses: insights into spatial variability using high-resolution satellite data.

    Science.gov (United States)

    Alexeeff, Stacey E; Schwartz, Joel; Kloog, Itai; Chudnovsky, Alexandra; Koutrakis, Petros; Coull, Brent A

    2015-01-01

    Many epidemiological studies use predicted air pollution exposures as surrogates for true air pollution levels. These predicted exposures contain exposure measurement error, yet simulation studies have typically found negligible bias in resulting health effect estimates. However, previous studies typically assumed a statistical spatial model for air pollution exposure, which may be oversimplified. We address this shortcoming by assuming a realistic, complex exposure surface derived from fine-scale (1 km × 1 km) remote-sensing satellite data. Using simulation, we evaluate the accuracy of epidemiological health effect estimates in linear and logistic regression when using spatial air pollution predictions from kriging and land use regression models. We examined chronic (long-term) and acute (short-term) exposure to air pollution. Results varied substantially across different scenarios. Exposure models with low out-of-sample R(2) yielded severe biases in the health effect estimates of some models, ranging from 60% upward bias to 70% downward bias. One land use regression exposure model with >0.9 out-of-sample R(2) yielded upward biases up to 13% for acute health effect estimates. Almost all models drastically underestimated the SEs. Land use regression models performed better in chronic effect simulations. These results can help researchers when interpreting health effect estimates in these types of studies.

  19. Multitrait, Random Regression, or Simple Repeatability Model in High-Throughput Phenotyping Data Improve Genomic Prediction for Wheat Grain Yield.

    Science.gov (United States)

    Sun, Jin; Rutkoski, Jessica E; Poland, Jesse A; Crossa, José; Jannink, Jean-Luc; Sorrells, Mark E

    2017-07-01

    High-throughput phenotyping (HTP) platforms can be used to measure traits that are genetically correlated with wheat ( L.) grain yield across time. Incorporating such secondary traits in the multivariate pedigree and genomic prediction models would be desirable to improve indirect selection for grain yield. In this study, we evaluated three statistical models, simple repeatability (SR), multitrait (MT), and random regression (RR), for the longitudinal data of secondary traits and compared the impact of the proposed models for secondary traits on their predictive abilities for grain yield. Grain yield and secondary traits, canopy temperature (CT) and normalized difference vegetation index (NDVI), were collected in five diverse environments for 557 wheat lines with available pedigree and genomic information. A two-stage analysis was applied for pedigree and genomic selection (GS). First, secondary traits were fitted by SR, MT, or RR models, separately, within each environment. Then, best linear unbiased predictions (BLUPs) of secondary traits from the above models were used in the multivariate prediction models to compare predictive abilities for grain yield. Predictive ability was substantially improved by 70%, on average, from multivariate pedigree and genomic models when including secondary traits in both training and test populations. Additionally, (i) predictive abilities slightly varied for MT, RR, or SR models in this data set, (ii) results indicated that including BLUPs of secondary traits from the MT model was the best in severe drought, and (iii) the RR model was slightly better than SR and MT models under drought environment. Copyright © 2017 Crop Science Society of America.

  20. PREDICTION MODELS OF GRAIN YIELD AND CHARACTERIZATION

    Directory of Open Access Journals (Sweden)

    Narciso Ysac Avila Serrano

    2009-06-01

    Full Text Available With the objective to characterize the grain yield of five cowpea cultivars and to find linear regression models to predict it, a study was developed in La Paz, Baja California Sur, Mexico. A complete randomized blocks design was used. Simple and multivariate analyses of variance were carried out using the canonical variables to characterize the cultivars. The variables cluster per plant, pods per plant, pods per cluster, seeds weight per plant, seeds hectoliter weight, 100-seed weight, seeds length, seeds wide, seeds thickness, pods length, pods wide, pods weight, seeds per pods, and seeds weight per pods, showed significant differences (P≤ 0.05 among cultivars. Paceño and IT90K-277-2 cultivars showed the higher seeds weight per plant. The linear regression models showed correlation coefficients ≥0.92. In these models, the seeds weight per plant, pods per cluster, pods per plant, cluster per plant and pods length showed significant correlations (P≤ 0.05. In conclusion, the results showed that grain yield differ among cultivars and for its estimation, the prediction models showed determination coefficients highly dependable.

  1. Correlation and regression analyses of genetic effects for different types of cells in mammals under radiation and chemical treatment

    International Nuclear Information System (INIS)

    Slutskaya, N.G.; Mosseh, I.B.

    2006-01-01

    Data about genetic mutations under radiation and chemical treatment for different types of cells have been analyzed with correlation and regression analyses. Linear correlation between different genetic effects in sex cells and somatic cells have found. The results may be extrapolated on sex cells of human and mammals. (authors)

  2. The number of subjects per variable required in linear regression analyses

    NARCIS (Netherlands)

    P.C. Austin (Peter); E.W. Steyerberg (Ewout)

    2015-01-01

    textabstractObjectives To determine the number of independent variables that can be included in a linear regression model. Study Design and Setting We used a series of Monte Carlo simulations to examine the impact of the number of subjects per variable (SPV) on the accuracy of estimated regression

  3. Multicollinearity in Regression Analyses Conducted in Epidemiologic Studies

    OpenAIRE

    Vatcheva, Kristina P.; Lee, MinJae; McCormick, Joseph B.; Rahbar, Mohammad H.

    2016-01-01

    The adverse impact of ignoring multicollinearity on findings and data interpretation in regression analysis is very well documented in the statistical literature. The failure to identify and report multicollinearity could result in misleading interpretations of the results. A review of epidemiological literature in PubMed from January 2004 to December 2013, illustrated the need for a greater attention to identifying and minimizing the effect of multicollinearity in analysis of data from epide...

  4. Histograms showing variations in oil yield, water yield, and specific gravity of oil from Fischer assay analyses of oil-shale drill cores and cuttings from the Piceance Basin, northwestern Colorado

    Science.gov (United States)

    Dietrich, John D.; Brownfield, Michael E.; Johnson, Ronald C.; Mercier, Tracey J.

    2014-01-01

    Recent studies indicate that the Piceance Basin in northwestern Colorado contains over 1.5 trillion barrels of oil in place, making the basin the largest known oil-shale deposit in the world. Previously published histograms display oil-yield variations with depth and widely correlate rich and lean oil-shale beds and zones throughout the basin. Histograms in this report display oil-yield data plotted alongside either water-yield or oil specific-gravity data. Fischer assay analyses of core and cutting samples collected from exploration drill holes penetrating the Eocene Green River Formation in the Piceance Basin can aid in determining the origins of those deposits, as well as estimating the amount of organic matter, halite, nahcolite, and water-bearing minerals. This report focuses only on the oil yield plotted against water yield and oil specific gravity.

  5. Analyses of non-fatal accidents in an opencast mine by logistic regression model - a case study.

    Science.gov (United States)

    Onder, Seyhan; Mutlu, Mert

    2017-09-01

    Accidents cause major damage for both workers and enterprises in the mining industry. To reduce the number of occupational accidents, these incidents should be properly registered and carefully analysed. This study efficiently examines the Aegean Lignite Enterprise (ELI) of Turkish Coal Enterprises (TKI) in Soma between 2006 and 2011, and opencast coal mine occupational accident records were used for statistical analyses. A total of 231 occupational accidents were analysed for this study. The accident records were categorized into seven groups: area, reason, occupation, part of body, age, shift hour and lost days. The SPSS package program was used in this study for logistic regression analyses, which predicted the probability of accidents resulting in greater or less than 3 lost workdays for non-fatal injuries. Social facilities-area of surface installations, workshops and opencast mining areas are the areas with the highest probability for accidents with greater than 3 lost workdays for non-fatal injuries, while the reasons with the highest probability for these types of accidents are transporting and manual handling. Additionally, the model was tested for such reported accidents that occurred in 2012 for the ELI in Soma and estimated the probability of exposure to accidents with lost workdays correctly by 70%.

  6. Logistic regression applied to natural hazards: rare event logistic regression with replications

    Science.gov (United States)

    Guns, M.; Vanacker, V.

    2012-06-01

    Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.

  7. Advanced statistics: linear regression, part II: multiple linear regression.

    Science.gov (United States)

    Marill, Keith A

    2004-01-01

    The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.

  8. Logistic regression applied to natural hazards: rare event logistic regression with replications

    Directory of Open Access Journals (Sweden)

    M. Guns

    2012-06-01

    Full Text Available Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.

  9. Phenotypic Correlation Between Yield and Yield components of Read wheat (Triticum Aestivum L) in Drought Simulated Conditions in Kenya

    International Nuclear Information System (INIS)

    Kimurto, P.K.

    2002-01-01

    Establishing the presence and magnitude of x watering regimes interaction and stability of yield under drought simulated conditions would allow plant breeders select the drought tolerant wheat genotypes based on their performance at different rainfall patterns in different locations, not on overall mean yield. Development of drought tolerant wheat varieties in Kenya in an easier, cheaper and more efficient way is required most of it's land area is marginal. Four moisture stress regimes which simulated terminal, early, mid and late drought were created under rain shelter by supplying 70, 82, 94, 106 mm of moisture up to seedling stage, tillering, anthesis and grain filling, respectively. control had 118 mm of moisture applied at all stages. Four test genotypes R748, R830, R831 and R833 were tested together with one check variety, Duma. Yields for each genotype in two seasons were analysed using ANOVA and genotype x watering regimes assessed. Yield stability was also analysed using regression analysis. The result showed that genotype x watering regimes interaction was highly significant, suggesting that genotypes responded differently to increases water levels in each season. This indicated that selecting of drought tolerant genotypes for marginal areas under rain shelter should be based on those rainfall regimes. Yield stability across watering regimes varied among genotypes with Duma and R830 being the most stable cultivars, indicating that they only do well in low water levels. Genotypes R748 and R831 were the most unstable among all the test cultivars. R748 was the most responsive to increasing levels, indicating that it can be grown in low and high rainfall areas. The study showed that selection of stable drought tolerant cultivars using mobile rain shelters is possible

  10. Study on the Method of Grass Yield Model in the Source Region of Three Rivers with Multivariate Data

    International Nuclear Information System (INIS)

    You, Haoyan; Luo, Chengfeng; Liu, Zhengjun; Wang, Jiao

    2014-01-01

    This paper uses remote sensing and GIS technology to analyse the Source Region of Three Rivers (SRTR) to establish a grass yield estimation model during 2010 with remote sensing data, meteorological data, grassland type data and ground measured data. Analysis of the correlation between ground measured data, vegetation index based HJ-1A/B satellite data, meteorological data and grassland type data were used to establish the grass yield model. The grass yield model was studied by several statistical methods, such as multiple linear regression and Geographically Weighted Regression (GWR). The model's precision was validated. Finally, the best model to estimate the grass yield of Maduo County in SRTR was contrasted with the TM degraded grassland interpretation image of Maduo County from 2009. The result shows that: (1) Comparing with the multiple linear regression model, the GWR model gave a much better fitting result with the quality of fit increasing significantly from less than 0.3 to more than 0.8; (2) The most sensitive factors affecting the grass yield in SRTR were precipitation from May to August and drought index from May to August. From calculation of the five vegetation indices, MSAVI fitted the best; (3) The Maduo County grass yield estimated by the optimal model was consistent with the TM degraded grassland interpretation image, the spatial distribution of grass yield in Maduo County for 2010 showed a ''high south and low north'' pattern

  11. Path and ridge regression analysis of seed yield and seed yield components of Russian wildrye (Psathyrostachys juncea Nevski) under field conditions

    DEFF Research Database (Denmark)

    Wang, Quanzhen; Zhang, Tiejun; Cui, Jian

    2011-01-01

    The correlations among seed yield components, and their direct and indirect effects on the seed yield (Z) of Russina wildrye (Psathyrostachys juncea Nevski) were investigated. The seed yield components: fertile tillers m-2 (Y1), spikelets per fertile tillers (Y2), florets per spikelet- (Y3), seed...

  12. Regression analysis with categorized regression calibrated exposure: some interesting findings

    Directory of Open Access Journals (Sweden)

    Hjartåker Anette

    2006-07-01

    Full Text Available Abstract Background Regression calibration as a method for handling measurement error is becoming increasingly well-known and used in epidemiologic research. However, the standard version of the method is not appropriate for exposure analyzed on a categorical (e.g. quintile scale, an approach commonly used in epidemiologic studies. A tempting solution could then be to use the predicted continuous exposure obtained through the regression calibration method and treat it as an approximation to the true exposure, that is, include the categorized calibrated exposure in the main regression analysis. Methods We use semi-analytical calculations and simulations to evaluate the performance of the proposed approach compared to the naive approach of not correcting for measurement error, in situations where analyses are performed on quintile scale and when incorporating the original scale into the categorical variables, respectively. We also present analyses of real data, containing measures of folate intake and depression, from the Norwegian Women and Cancer study (NOWAC. Results In cases where extra information is available through replicated measurements and not validation data, regression calibration does not maintain important qualities of the true exposure distribution, thus estimates of variance and percentiles can be severely biased. We show that the outlined approach maintains much, in some cases all, of the misclassification found in the observed exposure. For that reason, regression analysis with the corrected variable included on a categorical scale is still biased. In some cases the corrected estimates are analytically equal to those obtained by the naive approach. Regression calibration is however vastly superior to the naive method when applying the medians of each category in the analysis. Conclusion Regression calibration in its most well-known form is not appropriate for measurement error correction when the exposure is analyzed on a

  13. Trochanteric entry femoral nails yield better femoral version and lower revision rates-A large cohort multivariate regression analysis.

    Science.gov (United States)

    Yoon, Richard S; Gage, Mark J; Galos, David K; Donegan, Derek J; Liporace, Frank A

    2017-06-01

    Intramedullary nailing (IMN) has become the standard of care for the treatment of most femoral shaft fractures. Different IMN options include trochanteric and piriformis entry as well as retrograde nails, which may result in varying degrees of femoral rotation. The objective of this study was to analyze postoperative femoral version between three types of nails and to delineate any significant differences in femoral version (DFV) and revision rates. Over a 10-year period, 417 patients underwent IMN of a diaphyseal femur fracture (AO/OTA 32A-C). Of these patients, 316 met inclusion criteria and obtained postoperative computed tomography (CT) scanograms to calculate femoral version and were thus included in the study. In this study, our main outcome measure was the difference in femoral version (DFV) between the uninjured limb and the injured limb. The effect of the following variables on DFV and revision rates were determined via univariate, multivariate, and ordinal regression analyses: gender, age, BMI, ethnicity, mechanism of injury, operative side, open fracture, and table type/position. Statistical significance was set at pregression analysis revealed that a lower BMI was significantly associated with a lower DFV (p=0.006). Controlling for possible covariables, multivariate analysis yielded a significantly lower DFV for trochanteric entry nails than piriformis or retrograde nails (7.9±6.10 vs. 9.5±7.4 vs. 9.4±7.8°, pregression analysis. However, this is not to state that the other nail types exhibited abnormal DFV. Translation to the clinical impact of a few degrees of DFV is also unknown. Future studies to more in-depth study the intricacies of femoral version may lead to improved technology in addition to potentially improved clinical outcomes. Copyright © 2017 Elsevier Ltd. All rights reserved.

  14. modelling relationship between rainfall variability and yields

    African Journals Online (AJOL)

    , S. and ... factors to rice yield. Adebayo and Adebayo (1997) developed double log multiple regression model to predict rice yield in Adamawa State, Nigeria. The general form of .... the second are the crop yield/values for millet and sorghum ...

  15. Genetic analyses of partial egg production in Japanese quail using multi-trait random regression models.

    Science.gov (United States)

    Karami, K; Zerehdaran, S; Barzanooni, B; Lotfi, E

    2017-12-01

    1. The aim of the present study was to estimate genetic parameters for average egg weight (EW) and egg number (EN) at different ages in Japanese quail using multi-trait random regression (MTRR) models. 2. A total of 8534 records from 900 quail, hatched between 2014 and 2015, were used in the study. Average weekly egg weights and egg numbers were measured from second until sixth week of egg production. 3. Nine random regression models were compared to identify the best order of the Legendre polynomials (LP). The most optimal model was identified by the Bayesian Information Criterion. A model with second order of LP for fixed effects, second order of LP for additive genetic effects and third order of LP for permanent environmental effects (MTRR23) was found to be the best. 4. According to the MTRR23 model, direct heritability for EW increased from 0.26 in the second week to 0.53 in the sixth week of egg production, whereas the ratio of permanent environment to phenotypic variance decreased from 0.48 to 0.1. Direct heritability for EN was low, whereas the ratio of permanent environment to phenotypic variance decreased from 0.57 to 0.15 during the production period. 5. For each trait, estimated genetic correlations among weeks of egg production were high (from 0.85 to 0.98). Genetic correlations between EW and EN were low and negative for the first two weeks, but they were low and positive for the rest of the egg production period. 6. In conclusion, random regression models can be used effectively for analysing egg production traits in Japanese quail. Response to selection for increased egg weight would be higher at older ages because of its higher heritability and such a breeding program would have no negative genetic impact on egg production.

  16. Rice yield estimation based on weather conditions and on technological level of production systems in Brazil

    Directory of Open Access Journals (Sweden)

    José Eduardo Boffino de Almeida Monteiro

    2013-02-01

    Full Text Available The objective of this work was to evaluate an estimation system for rice yield in Brazil, based on simple agrometeorological models and on the technological level of production systems. This estimation system incorporates the conceptual basis proposed by Doorenbos & Kassam for potential and attainable yields with empirical adjusts for maximum yield and crop sensitivity to water deficit, considering five categories of rice yield. Rice yield was estimated from 2000/2001 to 2007/2008, and compared to IBGE yield data. Regression analyses between model estimates and data from IBGE surveys resulted in significant coefficients of determination, with less dispersion in the South than in the North and Northeast regions of the country. Index of model efficiency (E1' ranged from 0.01 in the lower yield classes to 0.45 in higher ones, and mean absolute error ranged from 58 to 250 kg ha‑1, respectively.

  17. A comparison of Cox and logistic regression for use in genome-wide association studies of cohort and case-cohort design.

    Science.gov (United States)

    Staley, James R; Jones, Edmund; Kaptoge, Stephen; Butterworth, Adam S; Sweeting, Michael J; Wood, Angela M; Howson, Joanna M M

    2017-06-01

    Logistic regression is often used instead of Cox regression to analyse genome-wide association studies (GWAS) of single-nucleotide polymorphisms (SNPs) and disease outcomes with cohort and case-cohort designs, as it is less computationally expensive. Although Cox and logistic regression models have been compared previously in cohort studies, this work does not completely cover the GWAS setting nor extend to the case-cohort study design. Here, we evaluated Cox and logistic regression applied to cohort and case-cohort genetic association studies using simulated data and genetic data from the EPIC-CVD study. In the cohort setting, there was a modest improvement in power to detect SNP-disease associations using Cox regression compared with logistic regression, which increased as the disease incidence increased. In contrast, logistic regression had more power than (Prentice weighted) Cox regression in the case-cohort setting. Logistic regression yielded inflated effect estimates (assuming the hazard ratio is the underlying measure of association) for both study designs, especially for SNPs with greater effect on disease. Given logistic regression is substantially more computationally efficient than Cox regression in both settings, we propose a two-step approach to GWAS in cohort and case-cohort studies. First to analyse all SNPs with logistic regression to identify associated variants below a pre-defined P-value threshold, and second to fit Cox regression (appropriately weighted in case-cohort studies) to those identified SNPs to ensure accurate estimation of association with disease.

  18. Derivation of ozone flux-yield relationships for lettuce: A key horticultural crop

    Energy Technology Data Exchange (ETDEWEB)

    Goumenaki, Eleni [Environmental and Molecular Plant Physiology, Institute for Research on the Environment and Sustainability, School of Biology and Psychology, Division of Biology, Devonshire Building, Newcastle University, Newcastle Upon Tyne NE1 7RU (United Kingdom); School of Agricultural Technology, Technological Education Institute of Crete, P.O. Box 1939, 71004 Heraklion (Greece); Fernandez, Ignacio Gonzalez [Environmental and Molecular Plant Physiology, Institute for Research on the Environment and Sustainability, School of Biology and Psychology, Division of Biology, Devonshire Building, Newcastle University, Newcastle Upon Tyne NE1 7RU (United Kingdom); CIEMAT, Ecotoxicology of Air Pollution, Avda. Complutense 22, 28040 Madrid (Spain); Papanikolaou, Antigoni [School of Agricultural Technology, Technological Education Institute of Crete, P.O. Box 1939, 71004 Heraklion (Greece); Papadopoulou, Despoina [School of Agricultural Technology, Technological Education Institute of Crete, P.O. Box 1939, 71004 Heraklion (Greece); Askianakis, Christos [School of Agricultural Technology, Technological Education Institute of Crete, P.O. Box 1939, 71004 Heraklion (Greece); Kouvarakis, George [Environmental and Chemical Processes Laboratory, Department of Chemistry, University of Crete, P.O. Box 1470, 71409 Heraklion (Greece); Barnes, Jeremy [Environmental and Molecular Plant Physiology, Institute for Research on the Environment and Sustainability, School of Biology and Psychology, Division of Biology, Devonshire Building, Newcastle University, Newcastle Upon Tyne NE1 7RU (United Kingdom)]. E-mail: j.d.barnes@ncl.ac.uk

    2007-04-15

    Ozone flux-response relationships were derived for lettuce, employing a multiplicative approach to model the manner in which stomatal conductance is influenced by key environmental variables, using a dataset collected during field experimentation in Crete and yield-response relationships derived from parallel open-top chamber experiments. Regional agronomic practices were adopted throughout. Computed versus measured data revealed that the derived model explained 51% (P < 0.001) of the observed variation in stomatal conductance. Concentration-based indices were compared with flux-based indices. Analyses revealed a significant relationship between accumulated stomatal ozone flux and yield employing flux threshold cut-offs up to 4 nmol m{sup -2} s{sup -1}. Regressions employing very low or zero flux thresholds resulted in the strongest yield-flux relationships (explaining {approx}80% (P < 0.05) of the variation in the dataset). - Establishment of ozone flux-yield relationships for a commercially-important horticultural crop grown widely in the Mediterranean.

  19. Regression and regression analysis time series prediction modeling on climate data of quetta, pakistan

    International Nuclear Information System (INIS)

    Jafri, Y.Z.; Kamal, L.

    2007-01-01

    Various statistical techniques was used on five-year data from 1998-2002 of average humidity, rainfall, maximum and minimum temperatures, respectively. The relationships to regression analysis time series (RATS) were developed for determining the overall trend of these climate parameters on the basis of which forecast models can be corrected and modified. We computed the coefficient of determination as a measure of goodness of fit, to our polynomial regression analysis time series (PRATS). The correlation to multiple linear regression (MLR) and multiple linear regression analysis time series (MLRATS) were also developed for deciphering the interdependence of weather parameters. Spearman's rand correlation and Goldfeld-Quandt test were used to check the uniformity or non-uniformity of variances in our fit to polynomial regression (PR). The Breusch-Pagan test was applied to MLR and MLRATS, respectively which yielded homoscedasticity. We also employed Bartlett's test for homogeneity of variances on a five-year data of rainfall and humidity, respectively which showed that the variances in rainfall data were not homogenous while in case of humidity, were homogenous. Our results on regression and regression analysis time series show the best fit to prediction modeling on climatic data of Quetta, Pakistan. (author)

  20. Appropriate assessment of neighborhood effects on individual health: integrating random and fixed effects in multilevel logistic regression

    DEFF Research Database (Denmark)

    Larsen, Klaus; Merlo, Juan

    2005-01-01

    The logistic regression model is frequently used in epidemiologic studies, yielding odds ratio or relative risk interpretations. Inspired by the theory of linear normal models, the logistic regression model has been extended to allow for correlated responses by introducing random effects. However......, the model does not inherit the interpretational features of the normal model. In this paper, the authors argue that the existing measures are unsatisfactory (and some of them are even improper) when quantifying results from multilevel logistic regression analyses. The authors suggest a measure...... of heterogeneity, the median odds ratio, that quantifies cluster heterogeneity and facilitates a direct comparison between covariate effects and the magnitude of heterogeneity in terms of well-known odds ratios. Quantifying cluster-level covariates in a meaningful way is a challenge in multilevel logistic...

  1. Assessment of competition and yield advantage in addition series of barley variety mixtures

    Directory of Open Access Journals (Sweden)

    Kari Jokinen

    1991-09-01

    Full Text Available In an addition series experiment the competition between three barley varieties (Agneta, Arra and Porno and the yield performance of mixtures were evaluated. Also two levels of nitrogen fertilization (50 and 100 kgN/ha were applied. Two approaches (the replacement series and the linear regression equation were used to analyse the competitive relationship based on grain yields in two-component mixtures. In three component mixtures the replacement series approach was applied. Both methods showed a similar dominance order of the varieties with Arra always being dominant and Agneta subordinate. The relationship between varieties was independent of the number of varieties in the mixture. Increase in available nitrogen strengthened the competitiveness of Arra especially in the dense, two-variety mixtures. Some mixtures over yielded but the differences were not statistically significant. The yield advantage based on relative yield total or on the ratio of actual and expected yield was greatest when the density and nitrogen fertilization were low and especially when one component in the mixture was a rather low yielding variety (Agneta. The land equivalent ratios (LER (the reference pure culture yield was the maximum yield of each variety were close to one, suggesting that under optimal growing conditions the yield advantage of barley varietal mixtures is marginal.

  2. Effects of Hydrological Parameters on Palm Oil Fresh Fruit Bunch Yield)

    Science.gov (United States)

    Nda, M.; Adnan, M. S.; Suhadak, M. A.; Zakaria, M. S.; Lopa, R. T.

    2018-04-01

    Climate change effects and variability have been studied by many researchers in diverse geophysical fields. Malaysia produces large volume of palm oil, the effects of climate change on hydrological parameters (rainfall and precipitation) could have adverse effects on palm oil fresh fruit bunch (FFB) production with implications at both local and international market. It is important to understand the effects of climate change on crop yield to adopt new cultivation techniques and guaranteeing food security globally. Based on this background, the paper’s objective is to investigate the effects of rainfall and temperature pattern on crop yield (FFB) within five years period (2013 - 2017) at Batu Pahat District. The Man - Kendall rank technique (trend test) and statistical analyses (correlation and regression) were applied to the dataset used for the study. The results reveal that there are variabilities in rainfall and temperature from one month to the other and the statistical analysis reveals that the hydrological parameters have an insignificant effect on crop yield.

  3. The spatial prediction of landslide susceptibility applying artificial neural network and logistic regression models: A case study of Inje, Korea

    Science.gov (United States)

    Saro, Lee; Woo, Jeon Seong; Kwan-Young, Oh; Moung-Jin, Lee

    2016-02-01

    The aim of this study is to predict landslide susceptibility caused using the spatial analysis by the application of a statistical methodology based on the GIS. Logistic regression models along with artificial neutral network were applied and validated to analyze landslide susceptibility in Inje, Korea. Landslide occurrence area in the study were identified based on interpretations of optical remote sensing data (Aerial photographs) followed by field surveys. A spatial database considering forest, geophysical, soil and topographic data, was built on the study area using the Geographical Information System (GIS). These factors were analysed using artificial neural network (ANN) and logistic regression models to generate a landslide susceptibility map. The study validates the landslide susceptibility map by comparing them with landslide occurrence areas. The locations of landslide occurrence were divided randomly into a training set (50%) and a test set (50%). A training set analyse the landslide susceptibility map using the artificial network along with logistic regression models, and a test set was retained to validate the prediction map. The validation results revealed that the artificial neural network model (with an accuracy of 80.10%) was better at predicting landslides than the logistic regression model (with an accuracy of 77.05%). Of the weights used in the artificial neural network model, `slope' yielded the highest weight value (1.330), and `aspect' yielded the lowest value (1.000). This research applied two statistical analysis methods in a GIS and compared their results. Based on the findings, we were able to derive a more effective method for analyzing landslide susceptibility.

  4. The spatial prediction of landslide susceptibility applying artificial neural network and logistic regression models: A case study of Inje, Korea

    Directory of Open Access Journals (Sweden)

    Saro Lee

    2016-02-01

    Full Text Available The aim of this study is to predict landslide susceptibility caused using the spatial analysis by the application of a statistical methodology based on the GIS. Logistic regression models along with artificial neutral network were applied and validated to analyze landslide susceptibility in Inje, Korea. Landslide occurrence area in the study were identified based on interpretations of optical remote sensing data (Aerial photographs followed by field surveys. A spatial database considering forest, geophysical, soil and topographic data, was built on the study area using the Geographical Information System (GIS. These factors were analysed using artificial neural network (ANN and logistic regression models to generate a landslide susceptibility map. The study validates the landslide susceptibility map by comparing them with landslide occurrence areas. The locations of landslide occurrence were divided randomly into a training set (50% and a test set (50%. A training set analyse the landslide susceptibility map using the artificial network along with logistic regression models, and a test set was retained to validate the prediction map. The validation results revealed that the artificial neural network model (with an accuracy of 80.10% was better at predicting landslides than the logistic regression model (with an accuracy of 77.05%. Of the weights used in the artificial neural network model, ‘slope’ yielded the highest weight value (1.330, and ‘aspect’ yielded the lowest value (1.000. This research applied two statistical analysis methods in a GIS and compared their results. Based on the findings, we were able to derive a more effective method for analyzing landslide susceptibility.

  5. Association between response rates and survival outcomes in patients with newly diagnosed multiple myeloma. A systematic review and meta-regression analysis.

    Science.gov (United States)

    Mainou, Maria; Madenidou, Anastasia-Vasiliki; Liakos, Aris; Paschos, Paschalis; Karagiannis, Thomas; Bekiari, Eleni; Vlachaki, Efthymia; Wang, Zhen; Murad, Mohammad Hassan; Kumar, Shaji; Tsapas, Apostolos

    2017-06-01

    We performed a systematic review and meta-regression analysis of randomized control trials to investigate the association between response to initial treatment and survival outcomes in patients with newly diagnosed multiple myeloma (MM). Response outcomes included complete response (CR) and the combined outcome of CR or very good partial response (VGPR), while survival outcomes were overall survival (OS) and progression-free survival (PFS). We used random-effect meta-regression models and conducted sensitivity analyses based on definition of CR and study quality. Seventy-two trials were included in the systematic review, 63 of which contributed data in meta-regression analyses. There was no association between OS and CR in patients without autologous stem cell transplant (ASCT) (regression coefficient: .02, 95% confidence interval [CI] -0.06, 0.10), in patients undergoing ASCT (-.11, 95% CI -0.44, 0.22) and in trials comparing ASCT with non-ASCT patients (.04, 95% CI -0.29, 0.38). Similarly, OS did not correlate with the combined metric of CR or VGPR, and no association was evident between response outcomes and PFS. Sensitivity analyses yielded similar results. This meta-regression analysis suggests that there is no association between conventional response outcomes and survival in patients with newly diagnosed MM. © 2017 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  6. Genetic Analysis of Milk Yield Using Random Regression Test Day Model in Tehran Province Holstein Dairy Cow

    Directory of Open Access Journals (Sweden)

    A. Seyeddokht

    2012-09-01

    Full Text Available In this research a random regression test day model was used to estimate heritability values and calculation genetic correlations between test day milk records. a total of 140357 monthly test day milk records belonging to 28292 first lactation Holstein cattle(trice time a day milking distributed in 165 herd and calved from 2001 to 2010 belonging to the herds of Tehran province were used. The fixed effects of herd-year-month of calving as contemporary group and age at calving and Holstein gene percentage as covariate were fitted. Orthogonal legendre polynomial with a 4th-order was implemented to take account of genetic and environmental aspects of milk production over the course of lactation. RRM using Legendre polynomials as base functions appears to be the most adequate to describe the covariance structure of the data. The results showed that the average of heritability for the second half of lactation period was higher than that of the first half. The heritability value for the first month was lowest (0.117 and for the eighth month of the lactation was highest (0.230 compared to the other months of lactation. Because of genetic variation was increased gradually, and residual variance was high in the first months of lactation, heritabilities were different over the course of lactation. The RRMs with a higher number of parameters were more useful to describe the genetic variation of test-day milk yield throughout the lactation. In this research estimation of genetic parameters, and calculation genetic correlations were implemented by random regression test day model, therefore using this method is the exact way to take account of parameters rather than the other ways.

  7. Influence of Previous Crop on Durum Wheat Yield and Yield Stability in a Long-term Experiment

    Directory of Open Access Journals (Sweden)

    Anna Maria Stellacci

    2011-02-01

    Full Text Available Long-term experiments are leading indicators of sustainability and serve as an early warning system to detect problems that may compromise future productivity. So the stability of yield is an important parameter to be considered when judging the value of a cropping system relative to others. In a long-term rotation experiment set up in 1972 the influence of different crop sequences on the yields and on yield stability of durum wheat (Triticum durum Desf. was studied. The complete field experiment is a split-split plot in a randomized complete block design with two replications; the whole experiment considers three crop sequences: 1 three-year crop rotation: sugar-beet, wheat + catch crop, wheat; 2 one-year crop rotation: wheat + catch crop; 3 wheat continuous crop; the split treatments are two different crop residue managements; the split-split plot treatments are 18 different fertilization formulas. Each phase of every crop rotation occurred every year. In this paper only one crop residue management and only one fertilization treatment have been analized. Wheat crops in different rotations are coded as follows: F1: wheat after sugar-beet in three-year crop rotation; F2: wheat after wheat in three-year crop rotation; Fc+i: wheat in wheat + catch crop rotation; Fc: continuous wheat. The following two variables were analysed: grain yield and hectolitre weight. Repeated measures analyses of variance and stability analyses have been perfomed for the two variables. The stability analysis was conducted using: three variance methods, namely the coefficient of variability of Francis and Kannenberg, the ecovalence index of Wricke and the stability variance index of Shukla; the regression method of Eberhart and Russell; a method, proposed by Piepho, that computes the probability of one system outperforming another system. It has turned out that each of the stability methods used has enriched of information the simple variance analysis. The Piepho

  8. Analyses of Developmental Rate Isomorphy in Ectotherms: Introducing the Dirichlet Regression.

    Directory of Open Access Journals (Sweden)

    David S Boukal

    Full Text Available Temperature drives development in insects and other ectotherms because their metabolic rate and growth depends directly on thermal conditions. However, relative durations of successive ontogenetic stages often remain nearly constant across a substantial range of temperatures. This pattern, termed 'developmental rate isomorphy' (DRI in insects, appears to be widespread and reported departures from DRI are generally very small. We show that these conclusions may be due to the caveats hidden in the statistical methods currently used to study DRI. Because the DRI concept is inherently based on proportional data, we propose that Dirichlet regression applied to individual-level data is an appropriate statistical method to critically assess DRI. As a case study we analyze data on five aquatic and four terrestrial insect species. We find that results obtained by Dirichlet regression are consistent with DRI violation in at least eight of the studied species, although standard analysis detects significant departure from DRI in only four of them. Moreover, the departures from DRI detected by Dirichlet regression are consistently much larger than previously reported. The proposed framework can also be used to infer whether observed departures from DRI reflect life history adaptations to size- or stage-dependent effects of varying temperature. Our results indicate that the concept of DRI in insects and other ectotherms should be critically re-evaluated and put in a wider context, including the concept of 'equiproportional development' developed for copepods.

  9. Accuracy of genomic selection for alfalfa biomass yield in different reference populations.

    Science.gov (United States)

    Annicchiarico, Paolo; Nazzicari, Nelson; Li, Xuehui; Wei, Yanling; Pecetti, Luciano; Brummer, E Charles

    2015-12-01

    Genomic selection based on genotyping-by-sequencing (GBS) data could accelerate alfalfa yield gains, if it displayed moderate ability to predict parent breeding values. Its interest would be enhanced by predicting ability also for germplasm/reference populations other than those for which it was defined. Predicting accuracy may be influenced by statistical models, SNP calling procedures and missing data imputation strategies. Landrace and variety material from two genetically-contrasting reference populations, i.e., 124 elite genotypes adapted to the Po Valley (sub-continental climate; PV population) and 154 genotypes adapted to Mediterranean-climate environments (Me population), were genotyped by GBS and phenotyped in separate environments for dry matter yield of their dense-planted half-sib progenies. Both populations showed no sub-population genetic structure. Predictive accuracy was higher by joint rather than separate SNP calling for the two data sets, and using random forest imputation of missing data. Highest accuracy was obtained using Support Vector Regression (SVR) for PV, and Ridge Regression BLUP and SVR for Me germplasm. Bayesian methods (Bayes A, Bayes B and Bayesian Lasso) tended to be less accurate. Random Forest Regression was the least accurate model. Accuracy attained about 0.35 for Me in the range of 0.30-0.50 missing data, and 0.32 for PV at 0.50 missing data, using at least 10,000 SNP markers. Cross-population predictions based on a smaller subset of common SNPs implied a relative loss of accuracy of about 25% for Me and 30% for PV. Genome-wide association analyses based on large subsets of M. truncatula-aligned markers revealed many SNPs with modest association with yield, and some genome areas hosting putative QTLs. A comparison of genomic vs. conventional selection for parent breeding value assuming 1-year vs. 5-year selection cycles, respectively, indicated over three-fold greater predicted yield gain per unit time for genomic selection

  10. Predictors of course in obsessive-compulsive disorder: logistic regression versus Cox regression for recurrent events.

    Science.gov (United States)

    Kempe, P T; van Oppen, P; de Haan, E; Twisk, J W R; Sluis, A; Smit, J H; van Dyck, R; van Balkom, A J L M

    2007-09-01

    Two methods for predicting remissions in obsessive-compulsive disorder (OCD) treatment are evaluated. Y-BOCS measurements of 88 patients with a primary OCD (DSM-III-R) diagnosis were performed over a 16-week treatment period, and during three follow-ups. Remission at any measurement was defined as a Y-BOCS score lower than thirteen combined with a reduction of seven points when compared with baseline. Logistic regression models were compared with a Cox regression for recurrent events model. Logistic regression yielded different models at different evaluation times. The recurrent events model remained stable when fewer measurements were used. Higher baseline levels of neuroticism and more severe OCD symptoms were associated with a lower chance of remission, early age of onset and more depressive symptoms with a higher chance. Choice of outcome time affects logistic regression prediction models. Recurrent events analysis uses all information on remissions and relapses. Short- and long-term predictors for OCD remission show overlap.

  11. Comparison of Classical Linear Regression and Orthogonal Regression According to the Sum of Squares Perpendicular Distances

    OpenAIRE

    KELEŞ, Taliha; ALTUN, Murat

    2016-01-01

    Regression analysis is a statistical technique for investigating and modeling the relationship between variables. The purpose of this study was the trivial presentation of the equation for orthogonal regression (OR) and the comparison of classical linear regression (CLR) and OR techniques with respect to the sum of squared perpendicular distances. For that purpose, the analyses were shown by an example. It was found that the sum of squared perpendicular distances of OR is smaller. Thus, it wa...

  12. Improved Dietary Guidelines for Vitamin D: Application of Individual Participant Data (IPD)-Level Meta-Regression Analyses

    Science.gov (United States)

    Cashman, Kevin D.; Ritz, Christian; Kiely, Mairead

    2017-01-01

    Dietary Reference Values (DRVs) for vitamin D have a key role in the prevention of vitamin D deficiency. However, despite adopting similar risk assessment protocols, estimates from authoritative agencies over the last 6 years have been diverse. This may have arisen from diverse approaches to data analysis. Modelling strategies for pooling of individual subject data from cognate vitamin D randomized controlled trials (RCTs) are likely to provide the most appropriate DRV estimates. Thus, the objective of the present work was to undertake the first-ever individual participant data (IPD)-level meta-regression, which is increasingly recognized as best practice, from seven winter-based RCTs (with 882 participants ranging in age from 4 to 90 years) of the vitamin D intake–serum 25-hydroxyvitamin D (25(OH)D) dose-response. Our IPD-derived estimates of vitamin D intakes required to maintain 97.5% of 25(OH)D concentrations >25, 30, and 50 nmol/L across the population are 10, 13, and 26 µg/day, respectively. In contrast, standard meta-regression analyses with aggregate data (as used by several agencies in recent years) from the same RCTs estimated that a vitamin D intake requirement of 14 µg/day would maintain 97.5% of 25(OH)D >50 nmol/L. These first IPD-derived estimates offer improved dietary recommendations for vitamin D because the underpinning modeling captures the between-person variability in response of serum 25(OH)D to vitamin D intake. PMID:28481259

  13. Improved Dietary Guidelines for Vitamin D: Application of Individual Participant Data (IPD-Level Meta-Regression Analyses

    Directory of Open Access Journals (Sweden)

    Kevin D. Cashman

    2017-05-01

    Full Text Available Dietary Reference Values (DRVs for vitamin D have a key role in the prevention of vitamin D deficiency. However, despite adopting similar risk assessment protocols, estimates from authoritative agencies over the last 6 years have been diverse. This may have arisen from diverse approaches to data analysis. Modelling strategies for pooling of individual subject data from cognate vitamin D randomized controlled trials (RCTs are likely to provide the most appropriate DRV estimates. Thus, the objective of the present work was to undertake the first-ever individual participant data (IPD-level meta-regression, which is increasingly recognized as best practice, from seven winter-based RCTs (with 882 participants ranging in age from 4 to 90 years of the vitamin D intake–serum 25-hydroxyvitamin D (25(OHD dose-response. Our IPD-derived estimates of vitamin D intakes required to maintain 97.5% of 25(OHD concentrations >25, 30, and 50 nmol/L across the population are 10, 13, and 26 µg/day, respectively. In contrast, standard meta-regression analyses with aggregate data (as used by several agencies in recent years from the same RCTs estimated that a vitamin D intake requirement of 14 µg/day would maintain 97.5% of 25(OHD >50 nmol/L. These first IPD-derived estimates offer improved dietary recommendations for vitamin D because the underpinning modeling captures the between-person variability in response of serum 25(OHD to vitamin D intake.

  14. Application of Molecular Topology for the Prediction of Reaction Yields and Anti-Inflammatory Activity of Heterocyclic Amidine Derivatives

    Directory of Open Access Journals (Sweden)

    Ramón García-Domenech

    2011-02-01

    Full Text Available Topological-mathematical models based on multiple linear regression analyses have been built to predict the reaction yields and the anti-inflammatory activity of a set of heterocylic amidine derivatives, synthesized under environmental friendly conditions, using microwave irradiation. Two models with three variables each were selected. The models were validated by cross-validation and randomization tests. The final outcome demonstrates a good agreement between the predicted and experimental results, confirming the robustness of the method. These models also enabled the screening of virtual libraries for new amidine derivatives predicted to show higher values of reaction yields and anti-inflammatory activity.

  15. Vectors, a tool in statistical regression theory

    NARCIS (Netherlands)

    Corsten, L.C.A.

    1958-01-01

    Using linear algebra this thesis developed linear regression analysis including analysis of variance, covariance analysis, special experimental designs, linear and fertility adjustments, analysis of experiments at different places and times. The determination of the orthogonal projection, yielding

  16. Post-processing through linear regression

    Science.gov (United States)

    van Schaeybroeck, B.; Vannitsem, S.

    2011-03-01

    Various post-processing techniques are compared for both deterministic and ensemble forecasts, all based on linear regression between forecast data and observations. In order to evaluate the quality of the regression methods, three criteria are proposed, related to the effective correction of forecast error, the optimal variability of the corrected forecast and multicollinearity. The regression schemes under consideration include the ordinary least-square (OLS) method, a new time-dependent Tikhonov regularization (TDTR) method, the total least-square method, a new geometric-mean regression (GM), a recently introduced error-in-variables (EVMOS) method and, finally, a "best member" OLS method. The advantages and drawbacks of each method are clarified. These techniques are applied in the context of the 63 Lorenz system, whose model version is affected by both initial condition and model errors. For short forecast lead times, the number and choice of predictors plays an important role. Contrarily to the other techniques, GM degrades when the number of predictors increases. At intermediate lead times, linear regression is unable to provide corrections to the forecast and can sometimes degrade the performance (GM and the best member OLS with noise). At long lead times the regression schemes (EVMOS, TDTR) which yield the correct variability and the largest correlation between ensemble error and spread, should be preferred.

  17. The yield of colorectal cancer among fast track patients with normocytic and microcytic anaemia.

    Science.gov (United States)

    Panagiotopoulou, I G; Fitzrol, D; Parker, R A; Kuzhively, J; Luscombe, N; Wells, A D; Menon, M; Bajwa, F M; Watson, M A

    2014-05-01

    We receive fast track referrals on the basis of iron deficiency anaemia (IDA) for patients with normocytic anaemia or for patients with no iron studies. This study examined the yield of colorectal cancer (CRC) among fast track patients to ascertain whether awaiting confirmation of IDA is necessary prior to performing bowel investigations. A review was undertaken of 321 and 930 consecutive fast track referrals from Centre A and Centre B respectively. Contingency tables were analysed using Fisher's exact test. Logistic regression analyses were performed to investigate significant predictors of CRC. Overall, 229 patients were included from Centre A and 689 from Centre B. The odds ratio for microcytic anaemia versus normocytic anaemia in the outcome of CRC was 1.3 (95% confidence interval [CI]: 0.5-3.9) for Centre A and 1.6 (95% CI: 0.8-3.3) for Centre B. In a logistic regression analysis (Centre B only), no significant difference in CRC rates was seen between microcytic and normocytic anaemia (adjusted odds ratio: 1.9, 95% CI: 0.9-3.9). There was no statistically significant difference in the yield of CRC between microcytic and normocytic anaemia (p=0.515, Fisher's exact test) in patients with anaemia only and no colorectal symptoms. Finally, CRC cases were seen in both microcytic and normocytic groups with or without low ferritin. There is no significant difference in the yield of CRC between fast track patients with microcytic and normocytic anaemia. This study provides insufficient evidence to support awaiting confirmation of IDA in fast track patients with normocytic anaemia prior to requesting bowel investigations.

  18. Analysing inequalities in Germany a structured additive distributional regression approach

    CERN Document Server

    Silbersdorff, Alexander

    2017-01-01

    This book seeks new perspectives on the growing inequalities that our societies face, putting forward Structured Additive Distributional Regression as a means of statistical analysis that circumvents the common problem of analytical reduction to simple point estimators. This new approach allows the observed discrepancy between the individuals’ realities and the abstract representation of those realities to be explicitly taken into consideration using the arithmetic mean alone. In turn, the method is applied to the question of economic inequality in Germany.

  19. Determination of benzo(apyrene content in PM10 using regression methods

    Directory of Open Access Journals (Sweden)

    Jacek Gębicki

    2015-12-01

    Full Text Available The paper presents an attempt of application of multidimensional linear regression to estimation of an empirical model describing the factors influencing on B(aP content in suspended dust PM10 in Olsztyn and Elbląg city regions between 2010 and 2013. During this period annual average concentration of B(aP in PM10 exceeded the admissible level 1.5-3 times. Conducted investigations confirm that the reasons of B(aP concentration increase are low-efficiency individual home heat stations or low-temperature heat sources, which are responsible for so-called low emission during heating period. Dependences between the following quantities were analysed: concentration of PM10 dust in air, air temperature, wind velocity, air humidity. A measure of model fitting to actual B(aP concentration in PM10 was the coefficient of determination of the model. Application of multidimensional linear regression yielded the equations characterized by high values of the coefficient of determination of the model, especially during heating season. This parameter ranged from 0.54 to 0.80 during the analyzed period.

  20. Assessing spatiotemporal variation of drought and its impact on maize yield in Northeast China

    Science.gov (United States)

    Guo, Enliang; Liu, Xingpeng; Zhang, Jiquan; Wang, Yongfang; Wang, Cailin; Wang, Rui; Li, Danjun

    2017-10-01

    In the context of global climate change, drought has become an important factor that affects the maize yield in China. To analyse the impact of drought on maize yield loss in Northeast China in current and future climate scenarios, the Composite Meteorological Drought Index (CI) is introduced to reconstruct the following drought indicators: drought accumulative days (DAD), drought accumulative intensity (DAI), and consecutive drought days (CDD). These three drought indicators are used to describe the three-dimensional characteristics of drought in this study. Sen's slope method and three-dimensional copula functions are adopted to analyse the variety of drought features, and Ensemble Empirical Mode Decomposition (EEMD) is used to analyse the variations in maize yield. A temporal assessment of the standardized yield residuals series (SYRS) of maize from 1961 to 2014 is conducted. A panel regression model is applied to demonstrate the drought impact on maize yield at various growth stages under the RCP4.5 scenario. The results show that the drought risk level for midwest Jilin Province, western Liaoning, and eastern Heilongjiang increase with global warming in the current scenario. The shorter three-dimensional joint return periods, 44-80 yr, were mainly located in western Jilin Province, Liaodong Peninsula, and northwestern Liaoning. Eastern Heilongjiang has a slightly longer joint return period of 80-100 yr. The SYRS shows a strong statistical correlation with drought indicator variations; drought-prone regions exhibit strong positive correlations. In comparison, excess precipitation regions show strong negative correlations with drought indicators in most growth stages. Drought indicators have a relatively strong association with SYRS at the milky-mature maize growth stage, and the occurrence of drought during this period primarily determines the maize yield changes in the future. Maize yield changes are -2.04%, -2.65% and -1.57% for Liaoning, Jilin, and

  1. Post-processing through linear regression

    Directory of Open Access Journals (Sweden)

    B. Van Schaeybroeck

    2011-03-01

    Full Text Available Various post-processing techniques are compared for both deterministic and ensemble forecasts, all based on linear regression between forecast data and observations. In order to evaluate the quality of the regression methods, three criteria are proposed, related to the effective correction of forecast error, the optimal variability of the corrected forecast and multicollinearity. The regression schemes under consideration include the ordinary least-square (OLS method, a new time-dependent Tikhonov regularization (TDTR method, the total least-square method, a new geometric-mean regression (GM, a recently introduced error-in-variables (EVMOS method and, finally, a "best member" OLS method. The advantages and drawbacks of each method are clarified.

    These techniques are applied in the context of the 63 Lorenz system, whose model version is affected by both initial condition and model errors. For short forecast lead times, the number and choice of predictors plays an important role. Contrarily to the other techniques, GM degrades when the number of predictors increases. At intermediate lead times, linear regression is unable to provide corrections to the forecast and can sometimes degrade the performance (GM and the best member OLS with noise. At long lead times the regression schemes (EVMOS, TDTR which yield the correct variability and the largest correlation between ensemble error and spread, should be preferred.

  2. A Simulation Investigation of Principal Component Regression.

    Science.gov (United States)

    Allen, David E.

    Regression analysis is one of the more common analytic tools used by researchers. However, multicollinearity between the predictor variables can cause problems in using the results of regression analyses. Problems associated with multicollinearity include entanglement of relative influences of variables due to reduced precision of estimation,…

  3. Dicentric yields induced in rabbit blood lymphocytes after exposure in vitro to X-rays

    International Nuclear Information System (INIS)

    Inoue, Yoshinori

    1995-06-01

    For the purpose of biological dosimetry, it is essential to establish the relationship between dicentric yields and absorbed doses. The present experiment was carried out to obtain data for rabbit lymphocytes as a reference for this relationship. As data at low dose level are scanty, rabbit lymphocytes were exposed to various doses, especially below 0.5 Gy, of 150 kVp X-rays and analysed at their first mitotic division for dicentric yields. The yields at high dose level were compared with data reported by other authors. The linear-quadratic equation, which is generally accepted, for the dose-response relationship was obtained by the iteratively reweighted least squares method. However, as the present experiment result showed that the dose-response relationship at low dose-levels was likely to be linear, a dose-response line was calculated by the linear regression analysis. As the result of the chi-square tests, it was found that the dicentric yield was better fitted to the linear model at low doses below 0.5 Gy than the linear quadratic model. (author)

  4. Check-all-that-apply data analysed by Partial Least Squares regression

    DEFF Research Database (Denmark)

    Rinnan, Åsmund; Giacalone, Davide; Frøst, Michael Bom

    2015-01-01

    are analysed by multivariate techniques. CATA data can be analysed both by setting the CATA as the X and the Y. The former is the PLS-Discriminant Analysis (PLS-DA) version, while the latter is the ANOVA-PLS (A-PLS) version. We investigated the difference between these two approaches, concluding...

  5. Diagnosis of cranial hemangioma: Comparison between logistic regression analysis and neuronal network

    International Nuclear Information System (INIS)

    Arana, E.; Marti-Bonmati, L.; Bautista, D.; Paredes, R.

    1998-01-01

    To study the utility of logistic regression and the neuronal network in the diagnosis of cranial hemangiomas. Fifteen patients presenting hemangiomas were selected form a total of 167 patients with cranial lesions. All were evaluated by plain radiography and computed tomography (CT). Nineteen variables in their medical records were reviewed. Logistic regression and neuronal network models were constructed and validated by the jackknife (leave-one-out) approach. The yields of the two models were compared by means of ROC curves, using the area under the curve as parameter. Seven men and 8 women presented hemangiomas. The mean age of these patients was 38.4 (15.4 years (mea ± standard deviation). Logistic regression identified as significant variables the shape, soft tissue mass and periosteal reaction. The neuronal network lent more importance to the existence of ossified matrix, ruptured cortical vein and the mixed calcified-blastic (trabeculated) pattern. The neuronal network showed a greater yield than logistic regression (Az, 0.9409) (0.004 versus 0.7211± 0.075; p<0.001). The neuronal network discloses hidden interactions among the variables, providing a higher yield in the characterization of cranial hemangiomas and constituting a medical diagnostic acid. (Author)29 refs

  6. Influence of yield surface curvature on the macroscopic yielding and ductile failure of isotropic porous plastic materials

    Science.gov (United States)

    Dæhli, Lars Edvard Bryhni; Morin, David; Børvik, Tore; Hopperstad, Odd Sture

    2017-10-01

    Numerical unit cell models of an approximative representative volume element for a porous ductile solid are utilized to investigate differences in the mechanical response between a quadratic and a non-quadratic matrix yield surface. A Hershey equivalent stress measure with two distinct values of the yield surface exponent is employed as the matrix description. Results from the unit cell calculations are further used to calibrate a heuristic extension of the Gurson model which incorporates effects of the third deviatoric stress invariant. An assessment of the porous plasticity model reveals its ability to describe the unit cell response to some extent, however underestimating the effect of the Lode parameter for the lower triaxiality ratios imposed in this study when compared to unit cell simulations. Ductile failure predictions by means of finite element simulations using a unit cell model that resembles an imperfection band are then conducted to examine how the non-quadratic matrix yield surface influences the failure strain as compared to the quadratic matrix yield surface. Further, strain localization predictions based on bifurcation analyses and imperfection band analyses are undertaken using the calibrated porous plasticity model. These simulations are then compared to the unit cell calculations in order to elucidate the differences between the various modelling strategies. The current study reveals that strain localization analyses using an imperfection band model and a spatially discretized unit cell are in reasonable agreement, while the bifurcation analyses predict higher strain levels at localization. Imperfection band analyses are finally used to calculate failure loci for the quadratic and the non-quadratic matrix yield surface under a wide range of loading conditions. The underlying matrix yield surface is demonstrated to have a pronounced influence on the onset of strain localization.

  7. The Use of Nonparametric Kernel Regression Methods in Econometric Production Analysis

    DEFF Research Database (Denmark)

    Czekaj, Tomasz Gerard

    and nonparametric estimations of production functions in order to evaluate the optimal firm size. The second paper discusses the use of parametric and nonparametric regression methods to estimate panel data regression models. The third paper analyses production risk, price uncertainty, and farmers' risk preferences...... within a nonparametric panel data regression framework. The fourth paper analyses the technical efficiency of dairy farms with environmental output using nonparametric kernel regression in a semiparametric stochastic frontier analysis. The results provided in this PhD thesis show that nonparametric......This PhD thesis addresses one of the fundamental problems in applied econometric analysis, namely the econometric estimation of regression functions. The conventional approach to regression analysis is the parametric approach, which requires the researcher to specify the form of the regression...

  8. Detecting overdispersion in count data: A zero-inflated Poisson regression analysis

    Science.gov (United States)

    Afiqah Muhamad Jamil, Siti; Asrul Affendi Abdullah, M.; Kek, Sie Long; Nor, Maria Elena; Mohamed, Maryati; Ismail, Norradihah

    2017-09-01

    This study focusing on analysing count data of butterflies communities in Jasin, Melaka. In analysing count dependent variable, the Poisson regression model has been known as a benchmark model for regression analysis. Continuing from the previous literature that used Poisson regression analysis, this study comprising the used of zero-inflated Poisson (ZIP) regression analysis to gain acute precision on analysing the count data of butterfly communities in Jasin, Melaka. On the other hands, Poisson regression should be abandoned in the favour of count data models, which are capable of taking into account the extra zeros explicitly. By far, one of the most popular models include ZIP regression model. The data of butterfly communities which had been called as the number of subjects in this study had been taken in Jasin, Melaka and consisted of 131 number of subjects visits Jasin, Melaka. Since the researchers are considering the number of subjects, this data set consists of five families of butterfly and represent the five variables involve in the analysis which are the types of subjects. Besides, the analysis of ZIP used the SAS procedure of overdispersion in analysing zeros value and the main purpose of continuing the previous study is to compare which models would be better than when exists zero values for the observation of the count data. The analysis used AIC, BIC and Voung test of 5% level significance in order to achieve the objectives. The finding indicates that there is a presence of over-dispersion in analysing zero value. The ZIP regression model is better than Poisson regression model when zero values exist.

  9. Long Term Evaluation of Yield Stability Trend for Cereal Crops in Iran

    Directory of Open Access Journals (Sweden)

    mehdi nassiri mahalati

    2016-05-01

    Full Text Available During the last few decades cereals yield have increased drastically at the national level however, information about yield stability and its resistance to annual environmental variability are scare. In this study long term stability of grin yield of wheat, barley, rice, corn and overall cereals in Iran were evaluated during a 40-year period (1971-2011. Stability analysis was conducted using two different methods. In the first method the residuals of regression between crop yield and time (years were calculated as stability index. For this different segmented regression models including linear, bi-linear and tri-linear were fitted to yield trend data and the best model for each crop was selected based on statistical measures. Absolute residuals (the difference between actual and predicted yields for each year as well as relative residuals (absolute residuals as percent of predicted yield were estimated. In the second method yield stability was estimated from the slope of the regression line between average annual yield of all cereals (environmental index and the yield of each crop in the same year. Results indicted that in wheat and barley absolute and relative residuals were increased during the study period leading to reduction of stability despite considerable yield increment. However, for rice and corn residuals followed a decreasing trend and therefore yield stability of these crops was increased during the last 40 years. The same result was obtained with the environmental index but in this method reduction of yield stability in barley was lower than wheat. Based on the results, yield and yield stability of cereals crops in Iran increased during the last 40 years. However, the percentage increase in stability is lower than that of yield. Application of nitrogen fertilizers was led to reduction in stability. Yield stability of wheat, barley, rice, corn and overall cereals was improved with increasing their cultivated area.

  10. Alpins and thibos vectorial astigmatism analyses: proposal of a linear regression model between methods

    Directory of Open Access Journals (Sweden)

    Giuliano de Oliveira Freitas

    2013-10-01

    Full Text Available PURPOSE: To determine linear regression models between Alpins descriptive indices and Thibos astigmatic power vectors (APV, assessing the validity and strength of such correlations. METHODS: This case series prospectively assessed 62 eyes of 31 consecutive cataract patients with preoperative corneal astigmatism between 0.75 and 2.50 diopters in both eyes. Patients were randomly assorted among two phacoemulsification groups: one assigned to receive AcrySof®Toric intraocular lens (IOL in both eyes and another assigned to have AcrySof Natural IOL associated with limbal relaxing incisions, also in both eyes. All patients were reevaluated postoperatively at 6 months, when refractive astigmatism analysis was performed using both Alpins and Thibos methods. The ratio between Thibos postoperative APV and preoperative APV (APVratio and its linear regression to Alpins percentage of success of astigmatic surgery, percentage of astigmatism corrected and percentage of astigmatism reduction at the intended axis were assessed. RESULTS: Significant negative correlation between the ratio of post- and preoperative Thibos APVratio and Alpins percentage of success (%Success was found (Spearman's ρ=-0.93; linear regression is given by the following equation: %Success = (-APVratio + 1.00x100. CONCLUSION: The linear regression we found between APVratio and %Success permits a validated mathematical inference concerning the overall success of astigmatic surgery.

  11. Hydrogeological Characteristics of Groundwater Yield in Shallow ...

    African Journals Online (AJOL)

    Hydrogeological Characteristics of Groundwater Yield in Shallow Wells of the ... of Water Resources and Lower Niger River Basin Development Authority in Ilorin. ... moment correlation, multiple and stepwise multiple regression analysis.

  12. Linear regression metamodeling as a tool to summarize and present simulation model results.

    Science.gov (United States)

    Jalal, Hawre; Dowd, Bryan; Sainfort, François; Kuntz, Karen M

    2013-10-01

    Modelers lack a tool to systematically and clearly present complex model results, including those from sensitivity analyses. The objective was to propose linear regression metamodeling as a tool to increase transparency of decision analytic models and better communicate their results. We used a simplified cancer cure model to demonstrate our approach. The model computed the lifetime cost and benefit of 3 treatment options for cancer patients. We simulated 10,000 cohorts in a probabilistic sensitivity analysis (PSA) and regressed the model outcomes on the standardized input parameter values in a set of regression analyses. We used the regression coefficients to describe measures of sensitivity analyses, including threshold and parameter sensitivity analyses. We also compared the results of the PSA to deterministic full-factorial and one-factor-at-a-time designs. The regression intercept represented the estimated base-case outcome, and the other coefficients described the relative parameter uncertainty in the model. We defined simple relationships that compute the average and incremental net benefit of each intervention. Metamodeling produced outputs similar to traditional deterministic 1-way or 2-way sensitivity analyses but was more reliable since it used all parameter values. Linear regression metamodeling is a simple, yet powerful, tool that can assist modelers in communicating model characteristics and sensitivity analyses.

  13. Lowland rice yield estimates based on air temperature and solar radiation

    International Nuclear Information System (INIS)

    Pedro Júnior, M.J.; Sentelhas, P.C.; Moraes, A.V.C.; Villela, O.V.

    1995-01-01

    Two regression equations were developed to estimate lowland rice yield as a function of air temperature and incoming solar radiation, during the crop yield production period in Pindamonhangaba, SP, Brazil. The following rice cultivars were used: IAC-242, IAC-100, IAC-101 and IAC-102. The value of optimum air temperature obtained was 25.0°C and of optimum global solar radiation was 475 cal.cm -2 , day -1 . The best agrometeorological model was the one that related least deviation of air temperature and solar radiation in relation to the optimum value obtained through a multiple linear regression. The yield values estimated by the model showed good fit to actual yields of lowland rice (less than 10%). (author) [pt

  14. Estimation Methods for Non-Homogeneous Regression - Minimum CRPS vs Maximum Likelihood

    Science.gov (United States)

    Gebetsberger, Manuel; Messner, Jakob W.; Mayr, Georg J.; Zeileis, Achim

    2017-04-01

    Non-homogeneous regression models are widely used to statistically post-process numerical weather prediction models. Such regression models correct for errors in mean and variance and are capable to forecast a full probability distribution. In order to estimate the corresponding regression coefficients, CRPS minimization is performed in many meteorological post-processing studies since the last decade. In contrast to maximum likelihood estimation, CRPS minimization is claimed to yield more calibrated forecasts. Theoretically, both scoring rules used as an optimization score should be able to locate a similar and unknown optimum. Discrepancies might result from a wrong distributional assumption of the observed quantity. To address this theoretical concept, this study compares maximum likelihood and minimum CRPS estimation for different distributional assumptions. First, a synthetic case study shows that, for an appropriate distributional assumption, both estimation methods yield to similar regression coefficients. The log-likelihood estimator is slightly more efficient. A real world case study for surface temperature forecasts at different sites in Europe confirms these results but shows that surface temperature does not always follow the classical assumption of a Gaussian distribution. KEYWORDS: ensemble post-processing, maximum likelihood estimation, CRPS minimization, probabilistic temperature forecasting, distributional regression models

  15. Random regression models to account for the effect of genotype by environment interaction due to heat stress on the milk yield of Holstein cows under tropical conditions.

    Science.gov (United States)

    Santana, Mário L; Bignardi, Annaiza Braga; Pereira, Rodrigo Junqueira; Menéndez-Buxadera, Alberto; El Faro, Lenira

    2016-02-01

    The present study had the following objectives: to compare random regression models (RRM) considering the time-dependent (days in milk, DIM) and/or temperature × humidity-dependent (THI) covariate for genetic evaluation; to identify the effect of genotype by environment interaction (G×E) due to heat stress on milk yield; and to quantify the loss of milk yield due to heat stress across lactation of cows under tropical conditions. A total of 937,771 test-day records from 3603 first lactations of Brazilian Holstein cows obtained between 2007 and 2013 were analyzed. An important reduction in milk yield due to heat stress was observed for THI values above 66 (-0.23 kg/day/THI). Three phases of milk yield loss were identified during lactation, the most damaging one at the end of lactation (-0.27 kg/day/THI). Using the most complex RRM, the additive genetic variance could be altered simultaneously as a function of both DIM and THI values. This model could be recommended for the genetic evaluation taking into account the effect of G×E. The response to selection in the comfort zone (THI ≤ 66) is expected to be higher than that obtained in the heat stress zone (THI > 66) of the animals. The genetic correlations between milk yield in the comfort and heat stress zones were less than unity at opposite extremes of the environmental gradient. Thus, the best animals for milk yield in the comfort zone are not necessarily the best in the zone of heat stress and, therefore, G×E due to heat stress should not be neglected in the genetic evaluation.

  16. Assessing the suitability of summary data for two-sample Mendelian randomization analyses using MR-Egger regression: the role of the I2 statistic.

    Science.gov (United States)

    Bowden, Jack; Del Greco M, Fabiola; Minelli, Cosetta; Davey Smith, George; Sheehan, Nuala A; Thompson, John R

    2016-12-01

    : MR-Egger regression has recently been proposed as a method for Mendelian randomization (MR) analyses incorporating summary data estimates of causal effect from multiple individual variants, which is robust to invalid instruments. It can be used to test for directional pleiotropy and provides an estimate of the causal effect adjusted for its presence. MR-Egger regression provides a useful additional sensitivity analysis to the standard inverse variance weighted (IVW) approach that assumes all variants are valid instruments. Both methods use weights that consider the single nucleotide polymorphism (SNP)-exposure associations to be known, rather than estimated. We call this the `NO Measurement Error' (NOME) assumption. Causal effect estimates from the IVW approach exhibit weak instrument bias whenever the genetic variants utilized violate the NOME assumption, which can be reliably measured using the F-statistic. The effect of NOME violation on MR-Egger regression has yet to be studied. An adaptation of the I2 statistic from the field of meta-analysis is proposed to quantify the strength of NOME violation for MR-Egger. It lies between 0 and 1, and indicates the expected relative bias (or dilution) of the MR-Egger causal estimate in the two-sample MR context. We call it IGX2 . The method of simulation extrapolation is also explored to counteract the dilution. Their joint utility is evaluated using simulated data and applied to a real MR example. In simulated two-sample MR analyses we show that, when a causal effect exists, the MR-Egger estimate of causal effect is biased towards the null when NOME is violated, and the stronger the violation (as indicated by lower values of IGX2 ), the stronger the dilution. When additionally all genetic variants are valid instruments, the type I error rate of the MR-Egger test for pleiotropy is inflated and the causal effect underestimated. Simulation extrapolation is shown to substantially mitigate these adverse effects. We

  17. The benefits of using quantile regression for analysing the effect of weeds on organic winter wheat

    NARCIS (Netherlands)

    Casagrande, M.; Makowski, D.; Jeuffroy, M.H.; Valantin-Morison, M.; David, C.

    2010-01-01

    P>In organic farming, weeds are one of the threats that limit crop yield. An early prediction of weed effect on yield loss and the size of late weed populations could help farmers and advisors to improve weed management. Numerous studies predicting the effect of weeds on yield have already been

  18. Yield Stability of Sorghum Hybrids and Parental Lines | Kenga ...

    African Journals Online (AJOL)

    Seventy-five sorghum hybrids and twenty parental lines were evaluated for two consecutive years at two locations. Our objective was to compare relative stability of grain yields among hybrids and parental lines. Mean grain yields and stability analysis of variance, which included linear regression coefficient (bi) and ...

  19. The more total cognitive load is reduced by cues, the better retention and transfer of multimedia learning: A meta-analysis and two meta-regression analyses.

    Science.gov (United States)

    Xie, Heping; Wang, Fuxing; Hao, Yanbin; Chen, Jiaxue; An, Jing; Wang, Yuxin; Liu, Huashan

    2017-01-01

    Cueing facilitates retention and transfer of multimedia learning. From the perspective of cognitive load theory (CLT), cueing has a positive effect on learning outcomes because of the reduction in total cognitive load and avoidance of cognitive overload. However, this has not been systematically evaluated. Moreover, what remains ambiguous is the direct relationship between the cue-related cognitive load and learning outcomes. A meta-analysis and two subsequent meta-regression analyses were conducted to explore these issues. Subjective total cognitive load (SCL) and scores on a retention test and transfer test were selected as dependent variables. Through a systematic literature search, 32 eligible articles encompassing 3,597 participants were included in the SCL-related meta-analysis. Among them, 25 articles containing 2,910 participants were included in the retention-related meta-analysis and the following retention-related meta-regression, while there were 29 articles containing 3,204 participants included in the transfer-related meta-analysis and the transfer-related meta-regression. The meta-analysis revealed a statistically significant cueing effect on subjective ratings of cognitive load (d = -0.11, 95% CI = [-0.19, -0.02], p < 0.05), retention performance (d = 0.27, 95% CI = [0.08, 0.46], p < 0.01), and transfer performance (d = 0.34, 95% CI = [0.12, 0.56], p < 0.01). The subsequent meta-regression analyses showed that dSCL for cueing significantly predicted dretention for cueing (β = -0.70, 95% CI = [-1.02, -0.38], p < 0.001), as well as dtransfer for cueing (β = -0.60, 95% CI = [-0.92, -0.28], p < 0.001). Thus in line with CLT, adding cues in multimedia materials can indeed reduce SCL and promote learning outcomes, and the more SCL is reduced by cues, the better retention and transfer of multimedia learning.

  20. Predicting Word Reading Ability: A Quantile Regression Study

    Science.gov (United States)

    McIlraith, Autumn L.

    2018-01-01

    Predictors of early word reading are well established. However, it is unclear if these predictors hold for readers across a range of word reading abilities. This study used quantile regression to investigate predictive relationships at different points in the distribution of word reading. Quantile regression analyses used preschool and…

  1. Structural vascular disease in Africans: performance of ethnic-specific waist circumference cut points using logistic regression and neural network analyses: the SABPA study

    OpenAIRE

    Botha, J.; De Ridder, J.H.; Potgieter, J.C.; Steyn, H.S.; Malan, L.

    2013-01-01

    A recently proposed model for waist circumference cut points (RPWC), driven by increased blood pressure, was demonstrated in an African population. We therefore aimed to validate the RPWC by comparing the RPWC and the Joint Statement Consensus (JSC) models via Logistic Regression (LR) and Neural Networks (NN) analyses. Urban African gender groups (N=171) were stratified according to the JSC and RPWC cut point models. Ultrasound carotid intima media thickness (CIMT), blood pressure (BP) and fa...

  2. Predictive ability of machine learning methods for massive crop yield prediction

    Directory of Open Access Journals (Sweden)

    Alberto Gonzalez-Sanchez

    2014-04-01

    Full Text Available An important issue for agricultural planning purposes is the accurate yield estimation for the numerous crops involved in the planning. Machine learning (ML is an essential approach for achieving practical and effective solutions for this problem. Many comparisons of ML methods for yield prediction have been made, seeking for the most accurate technique. Generally, the number of evaluated crops and techniques is too low and does not provide enough information for agricultural planning purposes. This paper compares the predictive accuracy of ML and linear regression techniques for crop yield prediction in ten crop datasets. Multiple linear regression, M5-Prime regression trees, perceptron multilayer neural networks, support vector regression and k-nearest neighbor methods were ranked. Four accuracy metrics were used to validate the models: the root mean square error (RMS, root relative square error (RRSE, normalized mean absolute error (MAE, and correlation factor (R. Real data of an irrigation zone of Mexico were used for building the models. Models were tested with samples of two consecutive years. The results show that M5-Prime and k-nearest neighbor techniques obtain the lowest average RMSE errors (5.14 and 4.91, the lowest RRSE errors (79.46% and 79.78%, the lowest average MAE errors (18.12% and 19.42%, and the highest average correlation factors (0.41 and 0.42. Since M5-Prime achieves the largest number of crop yield models with the lowest errors, it is a very suitable tool for massive crop yield prediction in agricultural planning.

  3. Bisphenol-A exposures and behavioural aberrations: median and linear spline and meta-regression analyses of 12 toxicity studies in rodents.

    Science.gov (United States)

    Peluso, Marco E M; Munnia, Armelle; Ceppi, Marcello

    2014-11-05

    Exposures to bisphenol-A, a weak estrogenic chemical, largely used for the production of plastic containers, can affect the rodent behaviour. Thus, we examined the relationships between bisphenol-A and the anxiety-like behaviour, spatial skills, and aggressiveness, in 12 toxicity studies of rodent offspring from females orally exposed to bisphenol-A, while pregnant and/or lactating, by median and linear splines analyses. Subsequently, the meta-regression analysis was applied to quantify the behavioural changes. U-shaped, inverted U-shaped and J-shaped dose-response curves were found to describe the relationships between bisphenol-A with the behavioural outcomes. The occurrence of anxiogenic-like effects and spatial skill changes displayed U-shaped and inverted U-shaped curves, respectively, providing examples of effects that are observed at low-doses. Conversely, a J-dose-response relationship was observed for aggressiveness. When the proportion of rodents expressing certain traits or the time that they employed to manifest an attitude was analysed, the meta-regression indicated that a borderline significant increment of anxiogenic-like effects was present at low-doses regardless of sexes (β)=-0.8%, 95% C.I. -1.7/0.1, P=0.076, at ≤120 μg bisphenol-A. Whereas, only bisphenol-A-males exhibited a significant inhibition of spatial skills (β)=0.7%, 95% C.I. 0.2/1.2, P=0.004, at ≤100 μg/day. A significant increment of aggressiveness was observed in both the sexes (β)=67.9,C.I. 3.4, 172.5, P=0.038, at >4.0 μg. Then, bisphenol-A treatments significantly abrogated spatial learning and ability in males (Pbisphenol-A, e.g. ≤120 μg/day, were associated to behavioural aberrations in offspring. Copyright © 2014. Published by Elsevier Ireland Ltd.

  4. Improving Crop Yield and Nutrient Use Efficiency via Biofertilization-A Global Meta-analysis.

    Science.gov (United States)

    Schütz, Lukas; Gattinger, Andreas; Meier, Matthias; Müller, Adrian; Boller, Thomas; Mäder, Paul; Mathimaran, Natarajan

    2017-01-01

    The application of microbial inoculants (biofertilizers) is a promising technology for future sustainable farming systems in view of rapidly decreasing phosphorus stocks and the need to more efficiently use available nitrogen (N). Various microbial taxa are currently used as biofertilizers, based on their capacity to access nutrients from fertilizers and soil stocks, to fix atmospheric nitrogen, to improve water uptake or to act as biocontrol agents. Despite the existence of a considerable knowledge on effects of specific taxa of biofertilizers, a comprehensive quantitative assessment of the performance of biofertilizers with different traits such as phosphorus solubilization and N fixation applied to various crops at a global scale is missing. We conducted a meta-analysis to quantify benefits of biofertilizers in terms of yield increase, nitrogen and phosphorus use efficiency, based on 171 peer reviewed publications that met eligibility criteria. Major findings are: (i) the superiority of biofertilizer performance in dry climates over other climatic regions (yield response: dry climate +20.0 ± 1.7%, tropical climate +14.9 ± 1.2%, oceanic climate +10.0 ± 3.7%, continental climate +8.5 ± 2.4%); (ii) meta-regression analyses revealed that yield response due to biofertilizer application was generally small at low soil P levels; efficacy increased along higher soil P levels in the order arbuscular mycorrhizal fungi (AMF), P solubilizers, and N fixers; (iii) meta-regressions showed that the success of inoculation with AMF was greater at low organic matter content and at neutral pH. Our comprehensive analysis provides a basis and guidance for proper choice and application of biofertilizers.

  5. Assessment of genotype x environment interaction on yield and ...

    African Journals Online (AJOL)

    Days to heading, plant height, number of spikes per square meter, number of kernels per spike, spike weight, 1000 kernel weight and grain yield of the genotypes were evaluated in each location. The regression coefficient (bi) of Finlay and Wilkinson (1963) and mean square of deviation from regression (S2d) of Eberhart ...

  6. Genotype x environment interaction for grain yield of wheat genotypes tested under water stress conditions

    International Nuclear Information System (INIS)

    Sail, M.A.; Dahot, M.U.; Mangrio, S.M.; Memon, S.

    2007-01-01

    Effect of water stress on grain yield in different wheat genotypes was studied under field conditions at various locations. Grain yield is a complex polygenic trait influenced by genotype, environment and genotype x environment (GxE) interaction. To understand the stability among genotypes for grain yield, twenty-one wheat genotypes developed Through hybridization and radiation-induced mutations at Nuclear Institute of Agriculture (NIA) TandoJam were evaluated with four local check varieties (Sarsabz, Thori, Margalla-99 and Chakwal-86) in multi-environmental trails (MET/sub s/). The experiments were conducted over 5 different water stress environments in Sindh. Data on grain yield were recorded from each site and statistically analyzed. Combined analysis of variance for all the environments indicated that the genotype, environment and genotype x environment (GxE) interaction were highly significant (P greater then 0.01) for grain yield. Genotypes differed in their response to various locations. The overall highest site mean yield (4031 kg/ha) recorded at Moro and the lowest (2326 kg/ha) at Thatta. Six genotypes produced significantly (P=0.01) the highest grain yield overall the environments. Stability analysis was applied to estimate stability parameters viz., regression coefficient (b), standard error of regression coefficient and variance due to deviation from regression (S/sub 2/d) genotypes 10/8, BWS-78 produced the highest mean yield over all the environments with low regression coefficient (b=0.68, 0.67 and 0.63 respectively and higher S/sup 2/ d value, showing specific adaptation to poor (un favorable) environments. Genotype 8/7 produced overall higher grain yield (3647 kg/ha) and ranked as third high yielding genotype had regression value close to unity (b=0.9) and low S/sup d/ value, indicating more stability and wide adaptation over the all environments. The knowledge of the presence and magnitude of genotype x environment (GE) interaction is important to

  7. Analyses of moisture deficit grain yield loss in drought tolerant maize ...

    African Journals Online (AJOL)

    Development of drought tolerant maize cultivars is prerequisite to achieving stable grain yield in drought–prone ecologies of Nigeria's Guinea savanna. However, success has been limited mainly due to lack of maize genotypes that show clear differences in response to well defined moisture deficit condition. Two sets of ...

  8. Feature Selection for Wheat Yield Prediction

    Science.gov (United States)

    Ruß, Georg; Kruse, Rudolf

    Carrying out effective and sustainable agriculture has become an important issue in recent years. Agricultural production has to keep up with an everincreasing population by taking advantage of a field’s heterogeneity. Nowadays, modern technology such as the global positioning system (GPS) and a multitude of developed sensors enable farmers to better measure their fields’ heterogeneities. For this small-scale, precise treatment the term precision agriculture has been coined. However, the large amounts of data that are (literally) harvested during the growing season have to be analysed. In particular, the farmer is interested in knowing whether a newly developed heterogeneity sensor is potentially advantageous or not. Since the sensor data are readily available, this issue should be seen from an artificial intelligence perspective. There it can be treated as a feature selection problem. The additional task of yield prediction can be treated as a multi-dimensional regression problem. This article aims to present an approach towards solving these two practically important problems using artificial intelligence and data mining ideas and methodologies.

  9. Simulation-based production planning for engineer-to-order systems with random yield

    NARCIS (Netherlands)

    Akcay, Alp; Martagan, Tugce

    2018-01-01

    We consider an engineer-to-order production system with unknown yield. We model the yield as a random variable which represents the percentage output obtained from one unit of production quantity. We develop a beta-regression model in which the mean value of the yield depends on the unique

  10. Using a time-series statistical framework to quantify trends and abrupt change in US corn, soybean, and wheat yields from 1970-2016

    Science.gov (United States)

    Zhang, J.; Ives, A. R.; Turner, M. G.; Kucharik, C. J.

    2017-12-01

    Previous studies have identified global agricultural regions where "stagnation" of long-term crop yield increases has occurred. These studies have used a variety of simple statistical methods that often ignore important aspects of time series regression modeling. These methods can lead to differing and contradictory results, which creates uncertainty regarding food security given rapid global population growth. Here, we present a new statistical framework incorporating time series-based algorithms into standard regression models to quantify spatiotemporal yield trends of US maize, soybean, and winter wheat from 1970-2016. Our primary goal was to quantify spatial differences in yield trends for these three crops using USDA county level data. This information was used to identify regions experiencing the largest changes in the rate of yield increases over time, and to determine whether abrupt shifts in the rate of yield increases have occurred. Although crop yields continue to increase in most maize-, soybean-, and winter wheat-growing areas, yield increases have stagnated in some key agricultural regions during the most recent 15 to 16 years: some maize-growing areas, except for the northern Great Plains, have shown a significant trend towards smaller annual yield increases for maize; soybean has maintained an consistent long-term yield gains in the Northern Great Plains, the Midwest, and southeast US, but has experienced a shift to smaller annual increases in other regions; winter wheat maintained a moderate annual increase in eastern South Dakota and eastern US locations, but showed a decline in the magnitude of annual increases across the central Great Plains and western US regions. Our results suggest that there were abrupt shifts in the rate of annual yield increases in a variety of US regions among the three crops. The framework presented here can be broadly applied to additional yield trend analyses for different crops and regions of the Earth.

  11. Uso de modelos de regressão aleatória para descrever a variação genética da produção de leite na raça Holandesa Random regressions models to describe the genetic variation of milk yield in Holstein breed

    Directory of Open Access Journals (Sweden)

    Cláudio Vieira de Araújo

    2006-06-01

    Full Text Available Registros de produção de leite de 68.523 controles leiteiros de 8.536 vacas da raça Holandesa, com parições nos anos de 1996 a 2001, foram utilizados na comparação entre modelos de regressão aleatória para estimação de componentes de variância. Os registros de controle leiteiro foram analisados como características múltiplas, considerando cada controle uma característica distinta. Os mesmos registros de controle leiteiro foram analisados como dados longitudinais, por meio de modelos de regressão aleatória, que diferiram entre si pela função utilizada para descrever a trajetória da curva de lactação dos animais. As funções utilizadas foram a exponencial de Wilmink, a função de Ali e Schaeffer e os polinômios de Legendre de segundo e quarto graus. A comparação entre modelos foi realizada com base nos seguintes critérios: estimativas de componentes de variância, obtidas no modelo multicaractístico e por regressão aleatória; valores da variância residual; e valores do logaritmo da função de verossimilhança. As estimativas de herdabilidade obtidas por meio dos modelos de características múltiplas variaram de 0,110 a 0,244. Para os modelos de regressão aleatória, esses valores oscilaram de 0,127 a 0,301, observando-se as maiores estimativas nos modelos com maior número de parâmetros. Verificou-se que os modelos de regressão aleatória que utilizaram os polinômios de Legendre descreveram melhor a variação genética da produção de leite.Data comprising 68,523 test day milk yield of 8,536 cows of the Holstein breed, calving from 1996 to 2001, were used to compare random regression models, for estimating variance components. Test day records (TD were analyzed as multiple traits, considering each TD as a different trait. The test day records were analyzed as longitudinal traits by different random regression models regarding the function used to describe the trajectory of the lactation curve of the animals

  12. Classification and regression tree (CART) analyses of genomic signatures reveal sets of tetramers that discriminate temperature optima of archaea and bacteria

    Science.gov (United States)

    Dyer, Betsey D.; Kahn, Michael J.; LeBlanc, Mark D.

    2008-01-01

    Classification and regression tree (CART) analysis was applied to genome-wide tetranucleotide frequencies (genomic signatures) of 195 archaea and bacteria. Although genomic signatures have typically been used to classify evolutionary divergence, in this study, convergent evolution was the focus. Temperature optima for most of the organisms examined could be distinguished by CART analyses of tetranucleotide frequencies. This suggests that pervasive (nonlinear) qualities of genomes may reflect certain environmental conditions (such as temperature) in which those genomes evolved. The predominant use of GAGA and AGGA as the discriminating tetramers in CART models suggests that purine-loading and codon biases of thermophiles may explain some of the results. PMID:19054742

  13. Path analyses of yield and some agronomic and quality traits of ...

    African Journals Online (AJOL)

    ONOS

    2010-08-09

    Aug 9, 2010 ... This research was conducted to determine characters effecting grain yield in fifty bread wheat (Triticum aestivum L.) cultivars and advanced lines by using simple correlation coefficient and path analysis under 2 locations (high rainfall and low rainfall; 745 and 506 mm, respectively). A total of 50 genotypes,.

  14. Remote sensing and GIS-based landslide hazard analysis and cross-validation using multivariate logistic regression model on three test areas in Malaysia

    Science.gov (United States)

    Pradhan, Biswajeet

    2010-05-01

    This paper presents the results of the cross-validation of a multivariate logistic regression model using remote sensing data and GIS for landslide hazard analysis on the Penang, Cameron, and Selangor areas in Malaysia. Landslide locations in the study areas were identified by interpreting aerial photographs and satellite images, supported by field surveys. SPOT 5 and Landsat TM satellite imagery were used to map landcover and vegetation index, respectively. Maps of topography, soil type, lineaments and land cover were constructed from the spatial datasets. Ten factors which influence landslide occurrence, i.e., slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, soil type, landcover, rainfall precipitation, and normalized difference vegetation index (ndvi), were extracted from the spatial database and the logistic regression coefficient of each factor was computed. Then the landslide hazard was analysed using the multivariate logistic regression coefficients derived not only from the data for the respective area but also using the logistic regression coefficients calculated from each of the other two areas (nine hazard maps in all) as a cross-validation of the model. For verification of the model, the results of the analyses were then compared with the field-verified landslide locations. Among the three cases of the application of logistic regression coefficient in the same study area, the case of Selangor based on the Selangor logistic regression coefficients showed the highest accuracy (94%), where as Penang based on the Penang coefficients showed the lowest accuracy (86%). Similarly, among the six cases from the cross application of logistic regression coefficient in other two areas, the case of Selangor based on logistic coefficient of Cameron showed highest (90%) prediction accuracy where as the case of Penang based on the Selangor logistic regression coefficients showed the lowest accuracy (79%). Qualitatively, the cross

  15. Alumina Yield in the Bayer Process

    Science.gov (United States)

    Den Hond, R.

    The alumina industry has historically been able to reduce alumina production costs, by increasing the liquor alumina yield. To know the potential for further yield increases, the phase diagram of the ternary system Na2O-Al2O -H2O at various temperature levels was analysed. It was found that the maximum theorical precipitation alumina yield is 160 g/l, while that for digestion was calculated to be 675 g/l.

  16. Testing Mediation Using Multiple Regression and Structural Equation Modeling Analyses in Secondary Data

    Science.gov (United States)

    Li, Spencer D.

    2011-01-01

    Mediation analysis in child and adolescent development research is possible using large secondary data sets. This article provides an overview of two statistical methods commonly used to test mediated effects in secondary analysis: multiple regression and structural equation modeling (SEM). Two empirical studies are presented to illustrate the…

  17. Improving validation methods for molecular diagnostics: application of Bland-Altman, Deming and simple linear regression analyses in assay comparison and evaluation for next-generation sequencing.

    Science.gov (United States)

    Misyura, Maksym; Sukhai, Mahadeo A; Kulasignam, Vathany; Zhang, Tong; Kamel-Reid, Suzanne; Stockley, Tracy L

    2018-02-01

    A standard approach in test evaluation is to compare results of the assay in validation to results from previously validated methods. For quantitative molecular diagnostic assays, comparison of test values is often performed using simple linear regression and the coefficient of determination (R 2 ), using R 2 as the primary metric of assay agreement. However, the use of R 2 alone does not adequately quantify constant or proportional errors required for optimal test evaluation. More extensive statistical approaches, such as Bland-Altman and expanded interpretation of linear regression methods, can be used to more thoroughly compare data from quantitative molecular assays. We present the application of Bland-Altman and linear regression statistical methods to evaluate quantitative outputs from next-generation sequencing assays (NGS). NGS-derived data sets from assay validation experiments were used to demonstrate the utility of the statistical methods. Both Bland-Altman and linear regression were able to detect the presence and magnitude of constant and proportional error in quantitative values of NGS data. Deming linear regression was used in the context of assay comparison studies, while simple linear regression was used to analyse serial dilution data. Bland-Altman statistical approach was also adapted to quantify assay accuracy, including constant and proportional errors, and precision where theoretical and empirical values were known. The complementary application of the statistical methods described in this manuscript enables more extensive evaluation of performance characteristics of quantitative molecular assays, prior to implementation in the clinical molecular laboratory. © Article author(s) (or their employer(s) unless otherwise stated in the text of the article) 2018. All rights reserved. No commercial use is permitted unless otherwise expressly granted.

  18. Food Yields and Nutrient Analyses of the Three Sisters: A Haudenosaunee Cropping System

    Directory of Open Access Journals (Sweden)

    Jane Mt.Pleasant

    2016-11-01

    Full Text Available Scholars have studied The Three Sisters, a traditional cropping system of the Haudenosaunee (Iroquois, from multiple perspectives. However, there is no research examining food yields, defined as the quantities of energy and protein produced per unit land area, from the cropping system within Iroquoia. This article compares food yields and other nutrient contributions from the Three Sisters, comprised of interplanted maize, bean and pumpkin, with monocultures of these same crops. The Three Sisters yields more energy (12.25 x 106 kcal/ha and more protein (349 kg/ha than any of the crop monocultures or mixtures of monocultures planted to the same area. The Three Sisters supplies 13.42 people/ha/yr. with energy and 15.86 people/ha/yr. with protein. Nutrient contents of the crops are further enhanced by nixtamalization, a traditional processing technique where maize is cooked in a high alkaline solution. This process increases calcium, protein quality, and niacin in maize.

  19. Image superresolution using support vector regression.

    Science.gov (United States)

    Ni, Karl S; Nguyen, Truong Q

    2007-06-01

    A thorough investigation of the application of support vector regression (SVR) to the superresolution problem is conducted through various frameworks. Prior to the study, the SVR problem is enhanced by finding the optimal kernel. This is done by formulating the kernel learning problem in SVR form as a convex optimization problem, specifically a semi-definite programming (SDP) problem. An additional constraint is added to reduce the SDP to a quadratically constrained quadratic programming (QCQP) problem. After this optimization, investigation of the relevancy of SVR to superresolution proceeds with the possibility of using a single and general support vector regression for all image content, and the results are impressive for small training sets. This idea is improved upon by observing structural properties in the discrete cosine transform (DCT) domain to aid in learning the regression. Further improvement involves a combination of classification and SVR-based techniques, extending works in resolution synthesis. This method, termed kernel resolution synthesis, uses specific regressors for isolated image content to describe the domain through a partitioned look of the vector space, thereby yielding good results.

  20. Augmenting Data with Published Results in Bayesian Linear Regression

    Science.gov (United States)

    de Leeuw, Christiaan; Klugkist, Irene

    2012-01-01

    In most research, linear regression analyses are performed without taking into account published results (i.e., reported summary statistics) of similar previous studies. Although the prior density in Bayesian linear regression could accommodate such prior knowledge, formal models for doing so are absent from the literature. The goal of this…

  1. Remotely sensed rice yield prediction using multi-temporal NDVI data derived from NOAA's-AVHRR.

    Science.gov (United States)

    Huang, Jingfeng; Wang, Xiuzhen; Li, Xinxing; Tian, Hanqin; Pan, Zhuokun

    2013-01-01

    Grain-yield prediction using remotely sensed data have been intensively studied in wheat and maize, but such information is limited in rice, barley, oats and soybeans. The present study proposes a new framework for rice-yield prediction, which eliminates the influence of the technology development, fertilizer application, and management improvement and can be used for the development and implementation of provincial rice-yield predictions. The technique requires the collection of remotely sensed data over an adequate time frame and a corresponding record of the region's crop yields. Longer normalized-difference-vegetation-index (NDVI) time series are preferable to shorter ones for the purposes of rice-yield prediction because the well-contrasted seasons in a longer time series provide the opportunity to build regression models with a wide application range. A regression analysis of the yield versus the year indicated an annual gain in the rice yield of 50 to 128 kg ha(-1). Stepwise regression models for the remotely sensed rice-yield predictions have been developed for five typical rice-growing provinces in China. The prediction models for the remotely sensed rice yield indicated that the influences of the NDVIs on the rice yield were always positive. The association between the predicted and observed rice yields was highly significant without obvious outliers from 1982 to 2004. Independent validation found that the overall relative error is approximately 5.82%, and a majority of the relative errors were less than 5% in 2005 and 2006, depending on the study area. The proposed models can be used in an operational context to predict rice yields at the provincial level in China. The methodologies described in the present paper can be applied to any crop for which a sufficient time series of NDVI data and the corresponding historical yield information are available, as long as the historical yield increases significantly.

  2. Wheat yield dynamics: a structural econometric analysis.

    Science.gov (United States)

    Sahin, Afsin; Akdi, Yilmaz; Arslan, Fahrettin

    2007-10-15

    In this study we initially have tried to explore the wheat situation in Turkey, which has a small-open economy and in the member countries of European Union (EU). We have observed that increasing the wheat yield is fundamental to obtain comparative advantage among countries by depressing domestic prices. Also the changing structure of supporting schemes in Turkey makes it necessary to increase its wheat yield level. For this purpose, we have used available data to determine the dynamics of wheat yield by Ordinary Least Square Regression methods. In order to find out whether there is a linear relationship among these series we have checked each series whether they are integrated at the same order or not. Consequently, we have pointed out that fertilizer usage and precipitation level are substantial inputs for producing high wheat yield. Furthermore, in respect for our model, fertilizer usage affects wheat yield more than precipitation level.

  3. Improving Crop Yield and Nutrient Use Efficiency via Biofertilization—A Global Meta-analysis

    Science.gov (United States)

    Schütz, Lukas; Gattinger, Andreas; Meier, Matthias; Müller, Adrian; Boller, Thomas; Mäder, Paul; Mathimaran, Natarajan

    2018-01-01

    The application of microbial inoculants (biofertilizers) is a promising technology for future sustainable farming systems in view of rapidly decreasing phosphorus stocks and the need to more efficiently use available nitrogen (N). Various microbial taxa are currently used as biofertilizers, based on their capacity to access nutrients from fertilizers and soil stocks, to fix atmospheric nitrogen, to improve water uptake or to act as biocontrol agents. Despite the existence of a considerable knowledge on effects of specific taxa of biofertilizers, a comprehensive quantitative assessment of the performance of biofertilizers with different traits such as phosphorus solubilization and N fixation applied to various crops at a global scale is missing. We conducted a meta-analysis to quantify benefits of biofertilizers in terms of yield increase, nitrogen and phosphorus use efficiency, based on 171 peer reviewed publications that met eligibility criteria. Major findings are: (i) the superiority of biofertilizer performance in dry climates over other climatic regions (yield response: dry climate +20.0 ± 1.7%, tropical climate +14.9 ± 1.2%, oceanic climate +10.0 ± 3.7%, continental climate +8.5 ± 2.4%); (ii) meta-regression analyses revealed that yield response due to biofertilizer application was generally small at low soil P levels; efficacy increased along higher soil P levels in the order arbuscular mycorrhizal fungi (AMF), P solubilizers, and N fixers; (iii) meta-regressions showed that the success of inoculation with AMF was greater at low organic matter content and at neutral pH. Our comprehensive analysis provides a basis and guidance for proper choice and application of biofertilizers. PMID:29375594

  4. Improving Crop Yield and Nutrient Use Efficiency via Biofertilization—A Global Meta-analysis

    Directory of Open Access Journals (Sweden)

    Lukas Schütz

    2018-01-01

    Full Text Available The application of microbial inoculants (biofertilizers is a promising technology for future sustainable farming systems in view of rapidly decreasing phosphorus stocks and the need to more efficiently use available nitrogen (N. Various microbial taxa are currently used as biofertilizers, based on their capacity to access nutrients from fertilizers and soil stocks, to fix atmospheric nitrogen, to improve water uptake or to act as biocontrol agents. Despite the existence of a considerable knowledge on effects of specific taxa of biofertilizers, a comprehensive quantitative assessment of the performance of biofertilizers with different traits such as phosphorus solubilization and N fixation applied to various crops at a global scale is missing. We conducted a meta-analysis to quantify benefits of biofertilizers in terms of yield increase, nitrogen and phosphorus use efficiency, based on 171 peer reviewed publications that met eligibility criteria. Major findings are: (i the superiority of biofertilizer performance in dry climates over other climatic regions (yield response: dry climate +20.0 ± 1.7%, tropical climate +14.9 ± 1.2%, oceanic climate +10.0 ± 3.7%, continental climate +8.5 ± 2.4%; (ii meta-regression analyses revealed that yield response due to biofertilizer application was generally small at low soil P levels; efficacy increased along higher soil P levels in the order arbuscular mycorrhizal fungi (AMF, P solubilizers, and N fixers; (iii meta-regressions showed that the success of inoculation with AMF was greater at low organic matter content and at neutral pH. Our comprehensive analysis provides a basis and guidance for proper choice and application of biofertilizers.

  5. Better Autologistic Regression

    Directory of Open Access Journals (Sweden)

    Mark A. Wolters

    2017-11-01

    Full Text Available Autologistic regression is an important probability model for dichotomous random variables observed along with covariate information. It has been used in various fields for analyzing binary data possessing spatial or network structure. The model can be viewed as an extension of the autologistic model (also known as the Ising model, quadratic exponential binary distribution, or Boltzmann machine to include covariates. It can also be viewed as an extension of logistic regression to handle responses that are not independent. Not all authors use exactly the same form of the autologistic regression model. Variations of the model differ in two respects. First, the variable coding—the two numbers used to represent the two possible states of the variables—might differ. Common coding choices are (zero, one and (minus one, plus one. Second, the model might appear in either of two algebraic forms: a standard form, or a recently proposed centered form. Little attention has been paid to the effect of these differences, and the literature shows ambiguity about their importance. It is shown here that changes to either coding or centering in fact produce distinct, non-nested probability models. Theoretical results, numerical studies, and analysis of an ecological data set all show that the differences among the models can be large and practically significant. Understanding the nature of the differences and making appropriate modeling choices can lead to significantly improved autologistic regression analyses. The results strongly suggest that the standard model with plus/minus coding, which we call the symmetric autologistic model, is the most natural choice among the autologistic variants.

  6. Grain filling parameters and yield components in wheat

    OpenAIRE

    Brdar Milka; Kobiljski Borislav; Balalić-Kraljević Marija

    2006-01-01

    Grain yield of wheat (Triticum aestivum L.) is influenced by number of grains per unit area and grain weight, which is result of grain filling duration and rate. The aim of the study was to investigate the relationships between grain filling parameters in 4 wheat genotypes of different earliness and yield components. Nonlinear regression estimated and observed parameters were analyzed. Rang of estimated parameters corresponds to rang of observed parameters. Stepwise MANOVA indicated that the ...

  7. Studies on the Effects of Climatic Factors on Dryland Wheat Grain Yield in Maragheh Region

    Directory of Open Access Journals (Sweden)

    V. Feiziasl

    2011-01-01

    Full Text Available Abstract In order to study the effects of climate variables on rainfed wheat grain yield, climate data and wheat yield for 10 years (1995-2005 collected from Dryland Agricultural Research Institute (DARI in Maragheh as the main station in cold and semi-cold areas. Collected data were analyzed by correlation coefficient, simple regression, stepwise regression and path analysis. The results showed that relationships between grain yield with average relative humidity and total rainfall of growing season was positive and significant at 5% and 1% probabilities, respectively. However, evaluation between grain yield with sunny hours and class A pan evaporation was negative and significant (p

  8. The Norwegian Healthier Goats program--modeling lactation curves using a multilevel cubic spline regression model.

    Science.gov (United States)

    Nagel-Alne, G E; Krontveit, R; Bohlin, J; Valle, P S; Skjerve, E; Sølverød, L S

    2014-07-01

    In 2001, the Norwegian Goat Health Service initiated the Healthier Goats program (HG), with the aim of eradicating caprine arthritis encephalitis, caseous lymphadenitis, and Johne's disease (caprine paratuberculosis) in Norwegian goat herds. The aim of the present study was to explore how control and eradication of the above-mentioned diseases by enrolling in HG affected milk yield by comparison with herds not enrolled in HG. Lactation curves were modeled using a multilevel cubic spline regression model where farm, goat, and lactation were included as random effect parameters. The data material contained 135,446 registrations of daily milk yield from 28,829 lactations in 43 herds. The multilevel cubic spline regression model was applied to 4 categories of data: enrolled early, control early, enrolled late, and control late. For enrolled herds, the early and late notations refer to the situation before and after enrolling in HG; for nonenrolled herds (controls), they refer to development over time, independent of HG. Total milk yield increased in the enrolled herds after eradication: the total milk yields in the fourth lactation were 634.2 and 873.3 kg in enrolled early and enrolled late herds, respectively, and 613.2 and 701.4 kg in the control early and control late herds, respectively. Day of peak yield differed between enrolled and control herds. The day of peak yield came on d 6 of lactation for the control early category for parities 2, 3, and 4, indicating an inability of the goats to further increase their milk yield from the initial level. For enrolled herds, on the other hand, peak yield came between d 49 and 56, indicating a gradual increase in milk yield after kidding. Our results indicate that enrollment in the HG disease eradication program improved the milk yield of dairy goats considerably, and that the multilevel cubic spline regression was a suitable model for exploring effects of disease control and eradication on milk yield. Copyright © 2014

  9. Comparing parametric and nonparametric regression methods for panel data

    DEFF Research Database (Denmark)

    Czekaj, Tomasz Gerard; Henningsen, Arne

    We investigate and compare the suitability of parametric and non-parametric stochastic regression methods for analysing production technologies and the optimal firm size. Our theoretical analysis shows that the most commonly used functional forms in empirical production analysis, Cobb......-Douglas and Translog, are unsuitable for analysing the optimal firm size. We show that the Translog functional form implies an implausible linear relationship between the (logarithmic) firm size and the elasticity of scale, where the slope is artificially related to the substitutability between the inputs....... The practical applicability of the parametric and non-parametric regression methods is scrutinised and compared by an empirical example: we analyse the production technology and investigate the optimal size of Polish crop farms based on a firm-level balanced panel data set. A nonparametric specification test...

  10. Accelerated failure time regression for backward recurrence times and current durations

    DEFF Research Database (Denmark)

    Keiding, N; Fine, J P; Hansen, O H

    2011-01-01

    Backward recurrence times in stationary renewal processes and current durations in dynamic populations observed at a cross-section may yield estimates of underlying interarrival times or survival distributions under suitable stationarity assumptions. Regression models have been proposed for these......Backward recurrence times in stationary renewal processes and current durations in dynamic populations observed at a cross-section may yield estimates of underlying interarrival times or survival distributions under suitable stationarity assumptions. Regression models have been proposed...... for these situations, but accelerated failure time models have the particularly attractive feature that they are preserved when going from the backward recurrence times to the underlying survival distribution of interest. This simple fact has recently been noticed in a sociological context and is here illustrated...... by a study of current duration of time to pregnancy...

  11. Differentiating regressed melanoma from regressed lichenoid keratosis.

    Science.gov (United States)

    Chan, Aegean H; Shulman, Kenneth J; Lee, Bonnie A

    2017-04-01

    Distinguishing regressed lichen planus-like keratosis (LPLK) from regressed melanoma can be difficult on histopathologic examination, potentially resulting in mismanagement of patients. We aimed to identify histopathologic features by which regressed melanoma can be differentiated from regressed LPLK. Twenty actively inflamed LPLK, 12 LPLK with regression and 15 melanomas with regression were compared and evaluated by hematoxylin and eosin staining as well as Melan-A, microphthalmia transcription factor (MiTF) and cytokeratin (AE1/AE3) immunostaining. (1) A total of 40% of regressed melanomas showed complete or near complete loss of melanocytes within the epidermis with Melan-A and MiTF immunostaining, while 8% of regressed LPLK exhibited this finding. (2) Necrotic keratinocytes were seen in the epidermis in 33% regressed melanomas as opposed to all of the regressed LPLK. (3) A dense infiltrate of melanophages in the papillary dermis was seen in 40% of regressed melanomas, a feature not seen in regressed LPLK. In summary, our findings suggest that a complete or near complete loss of melanocytes within the epidermis strongly favors a regressed melanoma over a regressed LPLK. In addition, necrotic epidermal keratinocytes and the presence of a dense band-like distribution of dermal melanophages can be helpful in differentiating these lesions. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  12. N-terminal pro-B-type natriuretic peptide measurement is useful in predicting left ventricular hypertrophy regression after aortic valve replacement in patients with severe aortic stenosis.

    Science.gov (United States)

    Lee, Mirae; Choi, Jin-Oh; Park, Sung-Ji; Kim, Eun Young; Park, PyoWon; Oh, Jae K; Jeon, Eun-Seok

    2015-01-01

    The predictive factors for early left ventricular hypertrophy (LVH) regression after aortic valve replacement (AVR) have not been fully elucidated. This study was conducted to investigate which preoperative parameters predict early LVH regression after AVR. 87 consecutive patients who underwent AVR due to isolated severe aortic stenosis (AS) were analysed. Patients with ejection fraction regression of LVH at the midterm follow-up was determined. In multivariate analysis, including preoperative echocardiographic parameters, only E/e' ratio was associated with midterm LVH regression (OR 1.11, 95% CI 1.01 to 1.22; p=0.035). When preoperative NT-proBNP was added to the analysis, logNT-proBNP was found to be the single significant predictor of midterm LVH regression (OR 2.00, 95% CI 1.08 to 3.71; p=0.028). By receiver operating characteristic curve analysis, a cut-off value of 440 pg/mL for NT-proBNP yielded a sensitivity of 72% and a specificity of 77% for the prediction of LVH regression after AVR. Preoperative NT-proBNP was an independent predictor for early LVH regression after AVR in patients with isolated severe AS.

  13. ESTIMATION OF PEA GRAIN YIELD STABILITY (Pisum sativum L.

    Directory of Open Access Journals (Sweden)

    Tihomir Čupić

    2003-06-01

    Full Text Available The paper aimed to determine yield and estimate pea grain yield stability of newly-created lines JSG-1 (cultivar in recognition process as well as compare with foreign origin cultivars in agroecological area of east Slavonia. The trial was set up by a randomized block design on the experimental field of Agricultural Institute Osijek in four replicates in the five-year period (1998 – 2002. Six (five foreign and one inland cultivars were included by the trial: Eiffil, Erbi, JP-5, JSG-1 (in a recognition process, Torsz and Baccara. Stability parameters were calculated by the grouping method after Francis and Kannenberg (1978 and by the model of individual stability estimation after Eberhart and Russel method (1966. According to Francis and Kannenberg, cultivars Eiffil, Erbi, JSG-1 and Baccara belonged to group I known for high yield and low trait varying coefficient, thus, represent stabile yield cultivars. According to regression coefficient and regression deviation variance the most stabile cultivar appeared to be cultivar JSG-1 (bi =1.06 and S2 di=0.010 and the lowest one was Torsz (bi =0.67 and S2 di =0.160. Cultivar Baccara (bi = 1.22 and S2 di =0.034 was comprised by the group of unstabile and adaptible for high-yielding environments.

  14. Multiple correlation analyses of metabolic and endocrine profiles with fertility in primiparous and multiparous cows.

    Science.gov (United States)

    Wathes, D C; Bourne, N; Cheng, Z; Mann, G E; Taylor, V J; Coffey, M P

    2007-03-01

    Results from 4 studies were combined (representing a total of 500 lactations) to investigate the relationships between metabolic parameters and fertility in dairy cows. Information was collected on blood metabolic traits and body condition score at 1 to 2 wk prepartum and at 2, 4, and 7 wk postpartum. Fertility traits were days to commencement of luteal activity, days to first service, days to conception, and failure to conceive. Primiparous and multiparous cows were considered separately. Initial linear regression analyses were used to determine relationships among fertility, metabolic, and endocrine traits at each time point. All metabolic and endocrine traits significantly related to fertility were included in stepwise multiple regression analyses alone (model 1), including peak milk yield and interval to commencement of luteal activity (model 2), and with the further addition of dietary group (model 3). In multiparous cows, extended calving to conception intervals were associated prepartum with greater concentrations of leptin and lesser concentrations of nonesterified fatty acids and urea, and postpartum with reduced insulin-like growth factor-I at 2 wk, greater urea at 7 wk, and greater peak milk yield. In primiparous cows, extended calving to conception intervals were associated with more body condition and more urea prepartum, elevated urea postpartum, and more body condition loss by 7 wk. In conclusion, some metabolic measurements were associated with poorer fertility outcomes. Relationships between fertility and metabolic and endocrine traits varied both according to the lactation number of the cow and with the time relative to calving.

  15. Relation of watershed setting and stream nutrient yields at selected sites in central and eastern North Carolina, 1997-2008

    Science.gov (United States)

    Harden, Stephen L.; Cuffney, Thomas F.; Terziotti, Silvia; Kolb, Katharine R.

    2013-01-01

    than 10 percent point-source flow contributions to streamflow had higher yields relative to undeveloped watersheds (having less than 10 and 15 percent developed and agricultural land uses, respectively) and watersheds with relatively low agricultural land use (between 15 and 30 percent). The statistical tests further indicated that the median annual yields for total P were statistically higher for watersheds with high agricultural land use (greater than 30 percent) compared to the undeveloped watersheds and watersheds with low agricultural land use. The total P yields also were higher for watersheds with low urban land use (between 10 and 30 percent developed land) compared to the undeveloped watersheds. The study data indicate that grouping and examining stream nutrient yields based on the land-use classifications used in this report can be useful for characterizing relations between watershed settings and nutrient yields in streams located throughout central and eastern North Carolina. Compiled study data also were analyzed with four regression tree models as a means of determining which watershed environmental variables or combination of variables result in basins that are likely to have high or low nutrient yields. The regression tree analyses indicated that some of the environmental variables examined in this study were useful for predicting yields of nitrate, total N, and total P. When the median annual nutrient yields for all 48 sites were evaluated as a group (Model 1), annual point-source flow yields had the greatest influence on nitrate and total N yields observed in streams, and annual streamflow yields had the greatest influence on yields of total P. The Model 1 results indicated that watersheds with higher annual point-source flow yields had higher annual yields of nitrate and total N, and watersheds with higher annual streamflow yields had higher annual yields of total P. When sites with high point-source flows (greater than 10 percent of total streamflow

  16. Covariate Imbalance and Adjustment for Logistic Regression Analysis of Clinical Trial Data

    Science.gov (United States)

    Ciolino, Jody D.; Martin, Reneé H.; Zhao, Wenle; Jauch, Edward C.; Hill, Michael D.; Palesch, Yuko Y.

    2014-01-01

    In logistic regression analysis for binary clinical trial data, adjusted treatment effect estimates are often not equivalent to unadjusted estimates in the presence of influential covariates. This paper uses simulation to quantify the benefit of covariate adjustment in logistic regression. However, International Conference on Harmonization guidelines suggest that covariate adjustment be pre-specified. Unplanned adjusted analyses should be considered secondary. Results suggest that that if adjustment is not possible or unplanned in a logistic setting, balance in continuous covariates can alleviate some (but never all) of the shortcomings of unadjusted analyses. The case of log binomial regression is also explored. PMID:24138438

  17. Variable selection and model choice in geoadditive regression models.

    Science.gov (United States)

    Kneib, Thomas; Hothorn, Torsten; Tutz, Gerhard

    2009-06-01

    Model choice and variable selection are issues of major concern in practical regression analyses, arising in many biometric applications such as habitat suitability analyses, where the aim is to identify the influence of potentially many environmental conditions on certain species. We describe regression models for breeding bird communities that facilitate both model choice and variable selection, by a boosting algorithm that works within a class of geoadditive regression models comprising spatial effects, nonparametric effects of continuous covariates, interaction surfaces, and varying coefficients. The major modeling components are penalized splines and their bivariate tensor product extensions. All smooth model terms are represented as the sum of a parametric component and a smooth component with one degree of freedom to obtain a fair comparison between the model terms. A generic representation of the geoadditive model allows us to devise a general boosting algorithm that automatically performs model choice and variable selection.

  18. Lawrence Livermore National Laboratory seismic yield determination for the NPE

    Energy Technology Data Exchange (ETDEWEB)

    Rohrer, R. [Lawrence Livermore National Lab., CA (United States)

    1994-12-31

    The Lawrence Livermore National Laboratory recorded seismic signals from the Non-Proliferation experiment at the Nevada Test Site on September 22, 1993, at seismic stations near Mina, Nevada; Kanab Utah; Landers, California; and Elko, Nevada. Yields were calculated from these recorded seismic amplitudes at the stations using statistical amplitude- yield regression curves from earlier nuclear experiments performed near the Non-Proliferation experiment. The weighted seismic yield average using these amplitudes is 1.9 kt with a standard deviation of 19%. The calibrating experiments were nuclear, so this yield is equivalent to a 1.9-kt nuclear experiment.

  19. Multiple regression and beyond an introduction to multiple regression and structural equation modeling

    CERN Document Server

    Keith, Timothy Z

    2014-01-01

    Multiple Regression and Beyond offers a conceptually oriented introduction to multiple regression (MR) analysis and structural equation modeling (SEM), along with analyses that flow naturally from those methods. By focusing on the concepts and purposes of MR and related methods, rather than the derivation and calculation of formulae, this book introduces material to students more clearly, and in a less threatening way. In addition to illuminating content necessary for coursework, the accessibility of this approach means students are more likely to be able to conduct research using MR or SEM--and more likely to use the methods wisely. Covers both MR and SEM, while explaining their relevance to one another Also includes path analysis, confirmatory factor analysis, and latent growth modeling Figures and tables throughout provide examples and illustrate key concepts and techniques For additional resources, please visit: http://tzkeith.com/.

  20. PARAMETRIC AND NON PARAMETRIC (MARS: MULTIVARIATE ADDITIVE REGRESSION SPLINES) LOGISTIC REGRESSIONS FOR PREDICTION OF A DICHOTOMOUS RESPONSE VARIABLE WITH AN EXAMPLE FOR PRESENCE/ABSENCE OF AMPHIBIANS

    Science.gov (United States)

    The purpose of this report is to provide a reference manual that could be used by investigators for making informed use of logistic regression using two methods (standard logistic regression and MARS). The details for analyses of relationships between a dependent binary response ...

  1. Longitudinal changes in telomere length and associated genetic parameters in dairy cattle analysed using random regression models.

    Directory of Open Access Journals (Sweden)

    Luise A Seeker

    Full Text Available Telomeres cap the ends of linear chromosomes and shorten with age in many organisms. In humans short telomeres have been linked to morbidity and mortality. With the accumulation of longitudinal datasets the focus shifts from investigating telomere length (TL to exploring TL change within individuals over time. Some studies indicate that the speed of telomere attrition is predictive of future disease. The objectives of the present study were to 1 characterize the change in bovine relative leukocyte TL (RLTL across the lifetime in Holstein Friesian dairy cattle, 2 estimate genetic parameters of RLTL over time and 3 investigate the association of differences in individual RLTL profiles with productive lifespan. RLTL measurements were analysed using Legendre polynomials in a random regression model to describe TL profiles and genetic variance over age. The analyses were based on 1,328 repeated RLTL measurements of 308 female Holstein Friesian dairy cattle. A quadratic Legendre polynomial was fitted to the fixed effect of age in months and to the random effect of the animal identity. Changes in RLTL, heritability and within-trait genetic correlation along the age trajectory were calculated and illustrated. At a population level, the relationship between RLTL and age was described by a positive quadratic function. Individuals varied significantly regarding the direction and amount of RLTL change over life. The heritability of RLTL ranged from 0.36 to 0.47 (SE = 0.05-0.08 and remained statistically unchanged over time. The genetic correlation of RLTL at birth with measurements later in life decreased with the time interval between samplings from near unity to 0.69, indicating that TL later in life might be regulated by different genes than TL early in life. Even though animals differed in their RLTL profiles significantly, those differences were not correlated with productive lifespan (p = 0.954.

  2. Soybean yield modeling using bootstrap methods for small samples

    Energy Technology Data Exchange (ETDEWEB)

    Dalposso, G.A.; Uribe-Opazo, M.A.; Johann, J.A.

    2016-11-01

    One of the problems that occur when working with regression models is regarding the sample size; once the statistical methods used in inferential analyzes are asymptotic if the sample is small the analysis may be compromised because the estimates will be biased. An alternative is to use the bootstrap methodology, which in its non-parametric version does not need to guess or know the probability distribution that generated the original sample. In this work we used a set of soybean yield data and physical and chemical soil properties formed with fewer samples to determine a multiple linear regression model. Bootstrap methods were used for variable selection, identification of influential points and for determination of confidence intervals of the model parameters. The results showed that the bootstrap methods enabled us to select the physical and chemical soil properties, which were significant in the construction of the soybean yield regression model, construct the confidence intervals of the parameters and identify the points that had great influence on the estimated parameters. (Author)

  3. Correcting for multivariate measurement error by regression calibration in meta-analyses of epidemiological studies.

    NARCIS (Netherlands)

    Kromhout, D.

    2009-01-01

    Within-person variability in measured values of multiple risk factors can bias their associations with disease. The multivariate regression calibration (RC) approach can correct for such measurement error and has been applied to studies in which true values or independent repeat measurements of the

  4. Two levels ARIMAX and regression models for forecasting time series data with calendar variation effects

    Science.gov (United States)

    Suhartono, Lee, Muhammad Hisyam; Prastyo, Dedy Dwi

    2015-12-01

    The aim of this research is to develop a calendar variation model for forecasting retail sales data with the Eid ul-Fitr effect. The proposed model is based on two methods, namely two levels ARIMAX and regression methods. Two levels ARIMAX and regression models are built by using ARIMAX for the first level and regression for the second level. Monthly men's jeans and women's trousers sales in a retail company for the period January 2002 to September 2009 are used as case study. In general, two levels of calendar variation model yields two models, namely the first model to reconstruct the sales pattern that already occurred, and the second model to forecast the effect of increasing sales due to Eid ul-Fitr that affected sales at the same and the previous months. The results show that the proposed two level calendar variation model based on ARIMAX and regression methods yields better forecast compared to the seasonal ARIMA model and Neural Networks.

  5. Quasi-experimental evidence on tobacco tax regressivity.

    Science.gov (United States)

    Koch, Steven F

    2018-01-01

    Tobacco taxes are known to reduce tobacco consumption and to be regressive, such that tobacco control policy may have the perverse effect of further harming the poor. However, if tobacco consumption falls faster amongst the poor than the rich, tobacco control policy can actually be progressive. We take advantage of persistent and committed tobacco control activities in South Africa to examine the household tobacco expenditure burden. For the analysis, we make use of two South African Income and Expenditure Surveys (2005/06 and 2010/11) that span a series of such tax increases and have been matched across the years, yielding 7806 matched pairs of tobacco consuming households and 4909 matched pairs of cigarette consuming households. By matching households across the surveys, we are able to examine both the regressivity of the household tobacco burden, and any change in that regressivity, and since tobacco taxes have been a consistent component of tobacco prices, our results also relate to the regressivity of tobacco taxes. Like previous research into cigarette and tobacco expenditures, we find that the tobacco burden is regressive; thus, so are tobacco taxes. However, we find that over the five-year period considered, the tobacco burden has decreased, and, most importantly, falls less heavily on the poor. Thus, the tobacco burden and the tobacco tax is less regressive in 2010/11 than in 2005/06. Thus, increased tobacco taxes can, in at least some circumstances, reduce the financial burden that tobacco places on households. Copyright © 2017 Elsevier Ltd. All rights reserved.

  6. Measurement Error in Education and Growth Regressions

    NARCIS (Netherlands)

    Portela, M.; Teulings, C.N.; Alessie, R.

    The perpetual inventory method used for the construction of education data per country leads to systematic measurement error. This paper analyses the effect of this measurement error on GDP regressions. There is a systematic difference in the education level between census data and observations

  7. Measurement error in education and growth regressions

    NARCIS (Netherlands)

    Portela, Miguel; Teulings, Coen; Alessie, R.

    2004-01-01

    The perpetual inventory method used for the construction of education data per country leads to systematic measurement error. This paper analyses the effect of this measurement error on GDP regressions. There is a systematic difference in the education level between census data and observations

  8. Salience Assignment for Multiple-Instance Data and Its Application to Crop Yield Prediction

    Science.gov (United States)

    Wagstaff, Kiri L.; Lane, Terran

    2010-01-01

    An algorithm was developed to generate crop yield predictions from orbital remote sensing observations, by analyzing thousands of pixels per county and the associated historical crop yield data for those counties. The algorithm determines which pixels contain which crop. Since each known yield value is associated with thousands of individual pixels, this is a multiple instance learning problem. Because individual crop growth is related to the resulting yield, this relationship has been leveraged to identify pixels that are individually related to corn, wheat, cotton, and soybean yield. Those that have the strongest relationship to a given crop s yield values are most likely to contain fields with that crop. Remote sensing time series data (a new observation every 8 days) was examined for each pixel, which contains information for that pixel s growth curve, peak greenness, and other relevant features. An alternating-projection (AP) technique was used to first estimate the "salience" of each pixel, with respect to the given target (crop yield), and then those estimates were used to build a regression model that relates input data (remote sensing observations) to the target. This is achieved by constructing an exemplar for each crop in each county that is a weighted average of all the pixels within the county; the pixels are weighted according to the salience values. The new regression model estimate then informs the next estimate of the salience values. By iterating between these two steps, the algorithm converges to a stable estimate of both the salience of each pixel and the regression model. The salience values indicate which pixels are most relevant to each crop under consideration.

  9. Diagnostic yield of lumbosacral magnetic resonance imaging requested by paediatric urology consultations.

    Science.gov (United States)

    Fernández-Ibieta, M; Rojas Ticona, J; Villamil, V; Guirao Piñera, M J; López García, A; Zambudio Carmona, G

    2017-11-01

    In the historical series, the diagnostic yield of lumbosacral magnetic resonance imaging to rule out occult spinal dysraphism (or occult myelodysplasia), requested by paediatric urology, ranged from 2% to 15%. The aim of this study was to define our cost-effectiveness in children with urinary symptoms and to define endpoints that increase the possibility of finding occult spinal dysraphism. A screening was conducted on patients with urinary dysfunction for whom an magnetic resonance imaging was requested by the paediatric urology clinic, for persistent symptoms after treatment, voiding dysfunction or other clinical or urodynamic findings. We analysed clinical (UTI, daytime leaks, enuresis, voiding dysfunction, urgency, renal ultrasonography, lumbosacral radiography, history of acute urine retention, skin stigma and myalgia) and urodynamic endpoints (hyperactivity or areflexia, voiding dysfunction, interrupted pattern, accommodation value and maximum flow). A univariate analysis was conducted with SPSS 20.0. We analysed 21 patients during the period 2011-2015. The median age was 6 years (3-10). Three patients (14.3%) had occult spinal dysraphism: one spinal lipoma, one filum lipomatosus and one caudal regression syndrome with channel stenosis. The endpoints with statistically significant differences were the myalgias and the history of acute urine retention (66.7% vs. 5.6%, P=.04; OR= 34; 95%CI: 1.5-781 for both endpoints). The diagnostic yield of magnetic resonance imaging requested for children with urinary dysfunctions without skin stigma or neuro-orthopaedic abnormalities is low, although nonnegligible. In this group, the patients with a history of acute urine retention and muscle pain (pain, «cramps») can experience a greater diagnostic yield or positive predictive value. Copyright © 2017 AEU. Publicado por Elsevier España, S.L.U. All rights reserved.

  10. Estimation of the yield of poplars in plantations of fast-growing species within current results

    Directory of Open Access Journals (Sweden)

    Martin Fajman

    2009-01-01

    Full Text Available Current results are presented of allometric yield estimates of the poplar short rotation coppice. According to a literature review it is obvious that yield estimates, based on measurable quantities of a growing stand, depend not only on the selected tree specie or its clone, but also on the site location. The Jap-105 poplar clone (P. nigra x P. maximowiczii allometric relations were analyzed by regression methods aimed at the creation of the yield estimation methodology at a testing site in Domanínek. Altogether, the twelve polynomial dependences of particular measured quantities approved the high empirical data conformity with the tested regression model (correlation index from 0.9033 to 0.9967. Within the forward stepwise regression, factors were selected, which explain best examined estimates of the total biomass DM; i.e. d.b.h. and stem height. Furthermore, the KESTEMONT’s (1971 mo­del was verified with a satisfying conformity as well. Approving presented yield estimation methods, the presented models will be checked in a large-scale field trial.

  11. Extended mitogenomic phylogenetic analyses yield new insight into crocodylian evolution and their survival of the Cretaceous-Tertiary boundary.

    Science.gov (United States)

    Roos, Jonas; Aggarwal, Ramesh K; Janke, Axel

    2007-11-01

    The mitochondrial genomes of the dwarf crocodile, Osteolaemus tetraspis, and two species of dwarf caimans, the smooth-fronted caiman, Paleosuchus trigonatus, and Cuvier's dwarf caiman, Paleosuchus palpebrosus, were sequenced and included in a mitogenomic phylogenetic study. The phylogenetic analyses, which included a total of ten crocodylian species, yielded strong support to a basal split between Crocodylidae and Alligatoridae. Osteolaemus fell within the Crocodylidae as the sister group to Crocodylus. Gavialis and Tomistoma, which joined on a common branch, constituted a sister group to Crocodylus/Osteolaemus. This suggests that extant crocodylians are organized in two families: Alligatoridae and Crocodylidae. Within the Alligatoridae there was a basal split between Alligator and a branch that contained Paleosuchus and Caiman. The analyses also provided molecular estimates of various divergences applying recently established crocodylian and outgroup fossil calibration points. Molecular estimates based on amino acid data placed the divergence between Crocodylidae and Alligatoridae at 97-103 million years ago and that between Alligator and Caiman/Paleosuchus at 65-72 million years ago. Other crocodilian divergences were placed after the Cretaceous-Tertiary boundary. Thus, according to the molecular estimates, three extant crocodylian lineages have their roots in the Cretaceous. Considering the crocodylian diversification in the Cretaceous the molecular datings suggest that the extinction of the dinosaurs was also to some extent paralleled in the crocodylian evolution. However, for whatever reason, some crocodylian lineages survived into the Tertiary.

  12. Satellite-based assessment of grassland yields

    Science.gov (United States)

    Grant, K.; Siegmund, R.; Wagner, M.; Hartmann, S.

    2015-04-01

    Cutting date and frequency are important parameters determining grassland yields in addition to the effects of weather, soil conditions, plant composition and fertilisation. Because accurate and area-wide data of grassland yields are currently not available, cutting frequency can be used to estimate yields. In this project, a method to detect cutting dates via surface changes in radar images is developed. The combination of this method with a grassland yield model will result in more reliable and regional-wide numbers of grassland yields. For the test-phase of the monitoring project, a study area situated southeast of Munich, Germany, was chosen due to its high density of managed grassland. For determining grassland cutting robust amplitude change detection techniques are used evaluating radar amplitude or backscatter statistics before and after the cutting event. CosmoSkyMed and Sentinel-1A data were analysed. All detected cuts were verified according to in-situ measurements recorded in a GIS database. Although the SAR systems had various acquisition geometries, the amount of detected grassland cut was quite similar. Of 154 tested grassland plots, covering in total 436 ha, 116 and 111 cuts were detected using CosmoSkyMed and Sentinel-1A radar data, respectively. Further improvement of radar data processes as well as additional analyses with higher sample number and wider land surface coverage will follow for optimisation of the method and for validation and generalisation of the results of this feasibility study. The automation of this method will than allow for an area-wide and cost efficient cutting date detection service improving grassland yield models.

  13. Genotype X Environment Interaction for Yield in Field Pea Pisum ...

    African Journals Online (AJOL)

    user

    analysis of variance with individual stability regression co- efficient ... environmental score derived from a principal component ... Grain yield analysis was carried .... Analysis of variance for Additive Main effects and Multiple Interaction (AMMI).

  14. Modelling fourier regression for time series data- a case study: modelling inflation in foods sector in Indonesia

    Science.gov (United States)

    Prahutama, Alan; Suparti; Wahyu Utami, Tiani

    2018-03-01

    Regression analysis is an analysis to model the relationship between response variables and predictor variables. The parametric approach to the regression model is very strict with the assumption, but nonparametric regression model isn’t need assumption of model. Time series data is the data of a variable that is observed based on a certain time, so if the time series data wanted to be modeled by regression, then we should determined the response and predictor variables first. Determination of the response variable in time series is variable in t-th (yt), while the predictor variable is a significant lag. In nonparametric regression modeling, one developing approach is to use the Fourier series approach. One of the advantages of nonparametric regression approach using Fourier series is able to overcome data having trigonometric distribution. In modeling using Fourier series needs parameter of K. To determine the number of K can be used Generalized Cross Validation method. In inflation modeling for the transportation sector, communication and financial services using Fourier series yields an optimal K of 120 parameters with R-square 99%. Whereas if it was modeled by multiple linear regression yield R-square 90%.

  15. Panel data specifications in nonparametric kernel regression

    DEFF Research Database (Denmark)

    Czekaj, Tomasz Gerard; Henningsen, Arne

    parametric panel data estimators to analyse the production technology of Polish crop farms. The results of our nonparametric kernel regressions generally differ from the estimates of the parametric models but they only slightly depend on the choice of the kernel functions. Based on economic reasoning, we...

  16. Clinical and pathologic factors affecting lymph node yields in colorectal cancer.

    Directory of Open Access Journals (Sweden)

    Ta-Wen Hsu

    Full Text Available OBJECTIVE: Lymph node yield is recommended as a benchmark of quality care in colorectal cancer. The objective of this study was to evaluate the impact of various factors upon lymph node yield and to identify independent factors associated with lymph node harvest. MATERIALS AND METHODS: The records of 162 patients with Stage I to Stage III colorectal cancers seen in one institution were reviewed. These patients underwent radical surgery as definitive therapy; high-risk patients then received adjuvant treatment. Pathologic and demographic data were recorded and analyzed. The subgroup analysis of lymph node yields was determined using a t-test and analysis of variants. Linear regression model and multivariable analysis were used to perform potential confounding and predicting variables. RESULTS: Five variables had significant association with lymph node yield after adjustment for other factors in a multiple linear regression model. These variables were: tumor size, surgical method, specimen length, and individual surgeon and pathologist. The model with these five significant variables interpreted 44.4% of the variation. CONCLUSIONS: Patients, tumor characteristics and surgical variables all influence the number of lymph nodes retrieved. Physicians are the main gatekeepers. Adequate training and optimized guidelines could greatly improve the quality of lymph node yields.

  17. The relative importance of imaging markers for the prediction of Alzheimer's disease dementia in mild cognitive impairment — Beyond classical regression

    Directory of Open Access Journals (Sweden)

    Stefan J. Teipel

    2015-01-01

    Penalized regression yielded more parsimonious models than unpenalized stepwise regression for the integration of multiregional and multimodal imaging information. The advantage of penalized regression was particularly strong with a high number of collinear predictors.

  18. Statistical methods in regression and calibration analysis of chromosome aberration data

    International Nuclear Information System (INIS)

    Merkle, W.

    1983-01-01

    The method of iteratively reweighted least squares for the regression analysis of Poisson distributed chromosome aberration data is reviewed in the context of other fit procedures used in the cytogenetic literature. As an application of the resulting regression curves methods for calculating confidence intervals on dose from aberration yield are described and compared, and, for the linear quadratic model a confidence interval is given. Emphasis is placed on the rational interpretation and the limitations of various methods from a statistical point of view. (orig./MG)

  19. Retro-regression--another important multivariate regression improvement.

    Science.gov (United States)

    Randić, M

    2001-01-01

    We review the serious problem associated with instabilities of the coefficients of regression equations, referred to as the MRA (multivariate regression analysis) "nightmare of the first kind". This is manifested when in a stepwise regression a descriptor is included or excluded from a regression. The consequence is an unpredictable change of the coefficients of the descriptors that remain in the regression equation. We follow with consideration of an even more serious problem, referred to as the MRA "nightmare of the second kind", arising when optimal descriptors are selected from a large pool of descriptors. This process typically causes at different steps of the stepwise regression a replacement of several previously used descriptors by new ones. We describe a procedure that resolves these difficulties. The approach is illustrated on boiling points of nonanes which are considered (1) by using an ordered connectivity basis; (2) by using an ordering resulting from application of greedy algorithm; and (3) by using an ordering derived from an exhaustive search for optimal descriptors. A novel variant of multiple regression analysis, called retro-regression (RR), is outlined showing how it resolves the ambiguities associated with both "nightmares" of the first and the second kind of MRA.

  20. Modified Regression Correlation Coefficient for Poisson Regression Model

    Science.gov (United States)

    Kaengthong, Nattacha; Domthong, Uthumporn

    2017-09-01

    This study gives attention to indicators in predictive power of the Generalized Linear Model (GLM) which are widely used; however, often having some restrictions. We are interested in regression correlation coefficient for a Poisson regression model. This is a measure of predictive power, and defined by the relationship between the dependent variable (Y) and the expected value of the dependent variable given the independent variables [E(Y|X)] for the Poisson regression model. The dependent variable is distributed as Poisson. The purpose of this research was modifying regression correlation coefficient for Poisson regression model. We also compare the proposed modified regression correlation coefficient with the traditional regression correlation coefficient in the case of two or more independent variables, and having multicollinearity in independent variables. The result shows that the proposed regression correlation coefficient is better than the traditional regression correlation coefficient based on Bias and the Root Mean Square Error (RMSE).

  1. Historical effects of temperature and precipitation on California crop yields

    Energy Technology Data Exchange (ETDEWEB)

    Lobell, D.B. [Energy and Environment Directorate, Lawrence Livermore National Laboratory, Livermore, CA 94550 (United States); Cahill, K.N. [Interdisciplinary Graduate Program in Environment and Resources, Stanford University, Stanford, CA 94305 (United States); Field, C.B. [Department of Global Ecology, Carnegie Institution, Stanford, CA 94305 (United States)

    2007-03-15

    For the 1980-2003 period, we analyzed the relationship between crop yield and three climatic variables (minimum temperature, maximum temperature, and precipitation) for 12 major Californian crops: wine grapes, lettuce, almonds, strawberries, table grapes, hay, oranges, cotton, tomatoes, walnuts, avocados, and pistachios. The months and climatic variables of greatest importance to each crop were used to develop regressions relating yield to climatic conditions. For most crops, fairly simple equations using only 2-3 variables explained more than two-thirds of observed yield variance. The types of variables and months identified suggest that relatively poorly understood processes such as crop infection, pollination, and dormancy may be important mechanisms by which climate influences crop yield. Recent climatic trends have had mixed effects on crop yields, with orange and walnut yields aided, avocado yields hurt, and most crops little affected by recent climatic trends. Yield-climate relationships can provide a foundation for forecasting crop production within a year and for projecting the impact of future climate changes.

  2. Similar estimates of temperature impacts on global wheat yield by three independent methods

    DEFF Research Database (Denmark)

    Liu, Bing; Asseng, Senthold; Müller, Christoph

    2016-01-01

    The potential impact of global temperature change on global crop yield has recently been assessed with different methods. Here we show that grid-based and point-based simulations and statistical regressions (from historic records), without deliberate adaptation or CO2 fertilization effects, produ......-method ensemble, it was possible to quantify ‘method uncertainty’ in addition to model uncertainty. This significantly improves confidence in estimates of climate impacts on global food security.......The potential impact of global temperature change on global crop yield has recently been assessed with different methods. Here we show that grid-based and point-based simulations and statistical regressions (from historic records), without deliberate adaptation or CO2 fertilization effects, produce...... similar estimates of temperature impact on wheat yields at global and national scales. With a 1 °C global temperature increase, global wheat yield is projected to decline between 4.1% and 6.4%. Projected relative temperature impacts from different methods were similar for major wheat-producing countries...

  3. Similar Estimates of Temperature Impacts on Global Wheat Yield by Three Independent Methods

    Science.gov (United States)

    Liu, Bing; Asseng, Senthold; Muller, Christoph; Ewart, Frank; Elliott, Joshua; Lobell, David B.; Martre, Pierre; Ruane, Alex C.; Wallach, Daniel; Jones, James W.; hide

    2016-01-01

    The potential impact of global temperature change on global crop yield has recently been assessed with different methods. Here we show that grid-based and point-based simulations and statistical regressions (from historic records), without deliberate adaptation or CO2 fertilization effects, produce similar estimates of temperature impact on wheat yields at global and national scales. With a 1 C global temperature increase, global wheat yield is projected to decline between 4.1% and 6.4%. Projected relative temperature impacts from different methods were similar for major wheat-producing countries China, India, USA and France, but less so for Russia. Point-based and grid-based simulations, and to some extent the statistical regressions, were consistent in projecting that warmer regions are likely to suffer more yield loss with increasing temperature than cooler regions. By forming a multi-method ensemble, it was possible to quantify 'method uncertainty' in addition to model uncertainty. This significantly improves confidence in estimates of climate impacts on global food security.

  4. Similar estimates of temperature impacts on global wheat yield by three independent methods

    Science.gov (United States)

    Liu, Bing; Asseng, Senthold; Müller, Christoph; Ewert, Frank; Elliott, Joshua; Lobell, David B.; Martre, Pierre; Ruane, Alex C.; Wallach, Daniel; Jones, James W.; Rosenzweig, Cynthia; Aggarwal, Pramod K.; Alderman, Phillip D.; Anothai, Jakarat; Basso, Bruno; Biernath, Christian; Cammarano, Davide; Challinor, Andy; Deryng, Delphine; Sanctis, Giacomo De; Doltra, Jordi; Fereres, Elias; Folberth, Christian; Garcia-Vila, Margarita; Gayler, Sebastian; Hoogenboom, Gerrit; Hunt, Leslie A.; Izaurralde, Roberto C.; Jabloun, Mohamed; Jones, Curtis D.; Kersebaum, Kurt C.; Kimball, Bruce A.; Koehler, Ann-Kristin; Kumar, Soora Naresh; Nendel, Claas; O'Leary, Garry J.; Olesen, Jørgen E.; Ottman, Michael J.; Palosuo, Taru; Prasad, P. V. Vara; Priesack, Eckart; Pugh, Thomas A. M.; Reynolds, Matthew; Rezaei, Ehsan E.; Rötter, Reimund P.; Schmid, Erwin; Semenov, Mikhail A.; Shcherbak, Iurii; Stehfest, Elke; Stöckle, Claudio O.; Stratonovitch, Pierre; Streck, Thilo; Supit, Iwan; Tao, Fulu; Thorburn, Peter; Waha, Katharina; Wall, Gerard W.; Wang, Enli; White, Jeffrey W.; Wolf, Joost; Zhao, Zhigan; Zhu, Yan

    2016-12-01

    The potential impact of global temperature change on global crop yield has recently been assessed with different methods. Here we show that grid-based and point-based simulations and statistical regressions (from historic records), without deliberate adaptation or CO2 fertilization effects, produce similar estimates of temperature impact on wheat yields at global and national scales. With a 1 °C global temperature increase, global wheat yield is projected to decline between 4.1% and 6.4%. Projected relative temperature impacts from different methods were similar for major wheat-producing countries China, India, USA and France, but less so for Russia. Point-based and grid-based simulations, and to some extent the statistical regressions, were consistent in projecting that warmer regions are likely to suffer more yield loss with increasing temperature than cooler regions. By forming a multi-method ensemble, it was possible to quantify `method uncertainty’ in addition to model uncertainty. This significantly improves confidence in estimates of climate impacts on global food security.

  5. Determining the Most Important Soil Properties Affecting the Yield of Saffron in the Ghayenat Area

    Directory of Open Access Journals (Sweden)

    amir ranjbar

    2016-02-01

    Full Text Available Introduction: Saffron is one of the most important economic plants in the Khorasan province. Awareness of soil quality in agricultural lands is essential for the best management of lands and for obtaining maximum economic benefit. In general, plant growth is a function of environmental factors especially chemical and physical properties of soil (20. It has been demonstrated that there was a positive and high correlation between soil organic matter and saffron yield. Increasing the yield of saffron due to organic matter is probably due to soil nutrient, especially phosphorous and nitrogen and also improvement of soil physical quality (6, 28, 29. The yield of saffron in soils with high nitrogen as a result of vegetative growth is high (8. Shahandeh (6 found that most of the variation of saffron yield depends on soil properties. Due to the economic importance of saffron and the role of soil properties on saffron yield, this research was conducted to find the relationship between saffron yield and some soil physical and chemical properties, and to determine the contribution of soil properties that have the greatest impact on saffron yield in the Ghayenat area. Materials and Methods: This research was performed in 30 saffron fields (30 soil samples of the Ghayenat area (longitude 59° 10΄ 10.37˝ - 59° 11΄ 38.41˝ and latitude 33° 43΄ 35.08˝ - 33΄ 44΄ 02.78˝, which is located in the Khrasan province of Iran. In this research, 21 soil properties were regarded as the total data set (TDS. Then the principal component analysis (PCA was used to determine the most important soil properties affecting saffron yield as a minimum data set (MDS and the stepwise regression to estimate saffron yield. To estimate the yield of saffron in stepwise regression method, saffron yield was considered as a dependent variable and soil physical and chemical properties were considered to be independent variables. Results and Discussion: According to the PCA method

  6. SPECIFICS OF THE APPLICATIONS OF MULTIPLE REGRESSION MODEL IN THE ANALYSES OF THE EFFECTS OF GLOBAL FINANCIAL CRISES

    Directory of Open Access Journals (Sweden)

    Željko V. Račić

    2010-12-01

    Full Text Available This paper aims to present the specifics of the application of multiple linear regression model. The economic (financial crisis is analyzed in terms of gross domestic product which is in a function of the foreign trade balance (on one hand and the credit cards, i.e. indebtedness of the population on this basis (on the other hand, in the USA (from 1999. to 2008. We used the extended application model which shows how the analyst should run the whole development process of regression model. This process began with simple statistical features and the application of regression procedures, and ended with residual analysis, intended for the study of compatibility of data and model settings. This paper also analyzes the values of some standard statistics used in the selection of appropriate regression model. Testing of the model is carried out with the use of the Statistics PASW 17 program.

  7. Determination of Selection Index of Cocoa (Theobroma Cacao L.) Yield Traits Using Regression Methods

    OpenAIRE

    Setyawan, Bayu; Taryono; Mitrowihardjo, Suyadi

    2016-01-01

    The increasing chocolate consumption has not been followed by growing production of dry cocoa beans. In order to support the increase in cocoa production, planting materials with high yield are needed. The objective of this research was to determine the components of cocoa traits affecting weight of dry cocoa beans, and set a selection index for superior cocoa trees. The experiment material were four cocoa hybrid populations of which their family ancestry were unknown, and were planted on Sam...

  8. INFLUENCE OF WEATHER CONDITIONS ON GRAIN YIELD, OIL CONTENT AND OIL YIELD OF NEW OS SUNFLOWER HYBRIDS

    Directory of Open Access Journals (Sweden)

    Anto Mijić

    2017-01-01

    Full Text Available With the purpose of determining the influence of weather conditions on the yield components of sunflower, the results of three-year field trials are analysed in the paper. In the trials sown in Osijek in 2013, 2014 and 2015, there were 15 sunflower hybrids: two foreign hybrids and 13 hybrid combinations of the Agricultural Institute Osijek. In the period before sowing (January – March, the highest amount of precipitation was in 2013 (213.1 mm, then in 2015 (167.9 mm, and the lowest in 2014 (109.5 mm. In the growing period (April – September, the highest amount of precipitation (487.3 mm was in 2014, 475.7 mm in 2013, and in 2015 it was the lowest (251.6 mm. In 2013, during the growing period, the mean monthly air temperature was 19.1°C, in 2015 19.9°C, and in 2014 18.6°C. Of these years, statistically significant at the P=0.05, the highest value of the analysed traits was recorded in 2013: grain yield of 6.47 t ha-1, oil content 51.69% and oil yield 3.05 t ha-1. Grain yield, oil content and oil yield were lower in 2015, and the lowest in 2014. Matej, a newly recognized sunflower hybrid of the Agricultural Institute Osijek had the highest values of grain and oil yield (6.95 and 3.39 t ha-1, and by its oil content of 53.44%, it was in the third place. For high grain and oil yields of sunflower, in addition to the optimal air temperature, the amount and distribution of precipitation before and also during the growing season are very important.

  9. Modeling maximum daily temperature using a varying coefficient regression model

    Science.gov (United States)

    Han Li; Xinwei Deng; Dong-Yum Kim; Eric P. Smith

    2014-01-01

    Relationships between stream water and air temperatures are often modeled using linear or nonlinear regression methods. Despite a strong relationship between water and air temperatures and a variety of models that are effective for data summarized on a weekly basis, such models did not yield consistently good predictions for summaries such as daily maximum temperature...

  10. Targeting: Logistic Regression, Special Cases and Extensions

    Directory of Open Access Journals (Sweden)

    Helmut Schaeben

    2014-12-01

    Full Text Available Logistic regression is a classical linear model for logit-transformed conditional probabilities of a binary target variable. It recovers the true conditional probabilities if the joint distribution of predictors and the target is of log-linear form. Weights-of-evidence is an ordinary logistic regression with parameters equal to the differences of the weights of evidence if all predictor variables are discrete and conditionally independent given the target variable. The hypothesis of conditional independence can be tested in terms of log-linear models. If the assumption of conditional independence is violated, the application of weights-of-evidence does not only corrupt the predicted conditional probabilities, but also their rank transform. Logistic regression models, including the interaction terms, can account for the lack of conditional independence, appropriate interaction terms compensate exactly for violations of conditional independence. Multilayer artificial neural nets may be seen as nested regression-like models, with some sigmoidal activation function. Most often, the logistic function is used as the activation function. If the net topology, i.e., its control, is sufficiently versatile to mimic interaction terms, artificial neural nets are able to account for violations of conditional independence and yield very similar results. Weights-of-evidence cannot reasonably include interaction terms; subsequent modifications of the weights, as often suggested, cannot emulate the effect of interaction terms.

  11. MODELING POLLINATION FACTORS THAT INFLUENCE ALFALFA SEED YIELD IN NORTH-CENTRAL NEVADA

    OpenAIRE

    BREAZEALE, Don; FERNANDEZ, George; NARAYANAN, Rangesan

    2008-01-01

    The relative importance of both environmental and management factors on alfalfa seed yield was investigated on North–Central Nevada farms. Multiple linear regression models using 2002-2003 data revealed that cumulative tripped fl owers increased seed yield in both years. Field location does not appear to make a difference in the observed variation in tripped fl ower production. The results suggest that seed yield can be increased by (a) by placing bee shelters closer and (b) cultural practice...

  12. Independent contrasts and PGLS regression estimators are equivalent.

    Science.gov (United States)

    Blomberg, Simon P; Lefevre, James G; Wells, Jessie A; Waterhouse, Mary

    2012-05-01

    We prove that the slope parameter of the ordinary least squares regression of phylogenetically independent contrasts (PICs) conducted through the origin is identical to the slope parameter of the method of generalized least squares (GLSs) regression under a Brownian motion model of evolution. This equivalence has several implications: 1. Understanding the structure of the linear model for GLS regression provides insight into when and why phylogeny is important in comparative studies. 2. The limitations of the PIC regression analysis are the same as the limitations of the GLS model. In particular, phylogenetic covariance applies only to the response variable in the regression and the explanatory variable should be regarded as fixed. Calculation of PICs for explanatory variables should be treated as a mathematical idiosyncrasy of the PIC regression algorithm. 3. Since the GLS estimator is the best linear unbiased estimator (BLUE), the slope parameter estimated using PICs is also BLUE. 4. If the slope is estimated using different branch lengths for the explanatory and response variables in the PIC algorithm, the estimator is no longer the BLUE, so this is not recommended. Finally, we discuss whether or not and how to accommodate phylogenetic covariance in regression analyses, particularly in relation to the problem of phylogenetic uncertainty. This discussion is from both frequentist and Bayesian perspectives.

  13. Face compression yield strength of the copper-Inconel composite specimen

    International Nuclear Information System (INIS)

    Horie, T.

    1987-05-01

    A new equation for the face compression yield strength of copper-Inconel composite material has been derived. Elastic-plastic finite element analyses were also made for composite specimens with various aspect ratios to examine the edge effect of the specimen. According to the results of both the new equation and the analyses, the face compression yield strength of the composite should be decreased by about 25% from the value obtained with Becker's equation

  14. Logistic regression and multiple classification analyses to explore risk factors of under-5 mortality in bangladesh

    International Nuclear Information System (INIS)

    Bhowmik, K.R.; Islam, S.

    2016-01-01

    Logistic regression (LR) analysis is the most common statistical methodology to find out the determinants of childhood mortality. However, the significant predictors cannot be ranked according to their influence on the response variable. Multiple classification (MC) analysis can be applied to identify the significant predictors with a priority index which helps to rank the predictors. The main objective of the study is to find the socio-demographic determinants of childhood mortality at neonatal, post-neonatal, and post-infant period by fitting LR model as well as to rank those through MC analysis. The study is conducted using the data of Bangladesh Demographic and Health Survey 2007 where birth and death information of children were collected from their mothers. Three dichotomous response variables are constructed from children age at death to fit the LR and MC models. Socio-economic and demographic variables significantly associated with the response variables separately are considered in LR and MC analyses. Both the LR and MC models identified the same significant predictors for specific childhood mortality. For both the neonatal and child mortality, biological factors of children, regional settings, and parents socio-economic status are found as 1st, 2nd, and 3rd significant groups of predictors respectively. Mother education and household environment are detected as major significant predictors of post-neonatal mortality. This study shows that MC analysis with or without LR analysis can be applied to detect determinants with rank which help the policy makers taking initiatives on a priority basis. (author)

  15. Clonal stability of latex yield in eleven clones of Hevea brasiliensis Muell. Arg.

    Directory of Open Access Journals (Sweden)

    K.O. Omokhafe

    2003-01-01

    Full Text Available Eleven Hevea brasiliensis clones were evaluated for clonal stability of latex yield. A randomized complete block design was used with four replicates, two locations, seven years and three periods per year. Stability analysis was based on clone x year and clone x year x location interactions. Five stability parameters viz environmental variance, shukla's stability variance, regression of clonal latex yield on environmental index, variance due to regression and variance due to deviation from regression were applied. There was significant clone x environment effect at the two levels of interaction. Among the eleven clones, C 162 was outstanding for clonal stability and it can serve as donor parent for stability alleles. Three clones (C 76, C 150 and C 154 were also stable. The four stable clones (C 76, C 150, C 154 and C 162 are suitable for broad-spectrum recommendation for latex yield. Five clones (C 83, C 143, C 163, C 202 and RRIM 600 will require environment-specific recommendation because of their unstable phenotype. The stability feature of two clones (C 145 and C 159 was not clear and this will be investigated in subsequent studies.

  16. Yields of historical exploration programs

    International Nuclear Information System (INIS)

    Huslende, T.

    1995-01-01

    The paper relates to an method of evaluation developed for analysing the yield of historical exploration programs by computerized simulation. The most important elements show in coarse features how the results can be used in the different analyses. The evaluation is to be executed annually for the comparison and sorting of data from different offshore sites. Topics are exploration evaluation study, evaluation process, handling of exploration costs, discovered reserves, development projects, cash flow analysis, analysis of results, finding cost, international comparison. 1 ref., 11 figs

  17. Cubic-spline interpolation to estimate effects of inbreeding on milk yield in first lactation Holstein cows

    Directory of Open Access Journals (Sweden)

    Makram J. Geha

    2011-01-01

    Full Text Available Milk yield records (305d, 2X, actual milk yield of 123,639 registered first lactation Holstein cows were used to compare linear regression (y = β0 + β1X + e ,quadratic regression, (y = β0 + β1X + β2X2 + e cubic regression (y = β0 + β1X + β2X2 + β3X3 + e and fixed factor models, with cubic-spline interpolation models, for estimating the effects of inbreeding on milk yield. Ten animal models, all with herd-year-season of calving as fixed effect, were compared using the Akaike corrected-Information Criterion (AICc. The cubic-spline interpolation model with seven knots had the lowest AICc, whereas for all those labeled as "traditional", AICc was higher than the best model. Results from fitting inbreeding using a cubic-spline with seven knots were compared to results from fitting inbreeding as a linear covariate or as a fixed factor with seven levels. Estimates of inbreeding effects were not significantly different between the cubic-spline model and the fixed factor model, but were significantly different from the linear regression model. Milk yield decreased significantly at inbreeding levels greater than 9%. Variance component estimates were similar for the three models. Ranking of the top 100 sires with daughter records remained unaffected by the model used.

  18. Estimating the impact of mineral aerosols on crop yields in food insecure regions using statistical crop models

    Science.gov (United States)

    Hoffman, A.; Forest, C. E.; Kemanian, A.

    2016-12-01

    A significant number of food-insecure nations exist in regions of the world where dust plays a large role in the climate system. While the impacts of common climate variables (e.g. temperature, precipitation, ozone, and carbon dioxide) on crop yields are relatively well understood, the impact of mineral aerosols on yields have not yet been thoroughly investigated. This research aims to develop the data and tools to progress our understanding of mineral aerosol impacts on crop yields. Suspended dust affects crop yields by altering the amount and type of radiation reaching the plant, modifying local temperature and precipitation. While dust events (i.e. dust storms) affect crop yields by depleting the soil of nutrients or by defoliation via particle abrasion. The impact of dust on yields is modeled statistically because we are uncertain which impacts will dominate the response on national and regional scales considered in this study. Multiple linear regression is used in a number of large-scale statistical crop modeling studies to estimate yield responses to various climate variables. In alignment with previous work, we develop linear crop models, but build upon this simple method of regression with machine-learning techniques (e.g. random forests) to identify important statistical predictors and isolate how dust affects yields on the scales of interest. To perform this analysis, we develop a crop-climate dataset for maize, soybean, groundnut, sorghum, rice, and wheat for the regions of West Africa, East Africa, South Africa, and the Sahel. Random forest regression models consistently model historic crop yields better than the linear models. In several instances, the random forest models accurately capture the temperature and precipitation threshold behavior in crops. Additionally, improving agricultural technology has caused a well-documented positive trend that dominates time series of global and regional yields. This trend is often removed before regression with

  19. Prediction, Regression and Critical Realism

    DEFF Research Database (Denmark)

    Næss, Petter

    2004-01-01

    This paper considers the possibility of prediction in land use planning, and the use of statistical research methods in analyses of relationships between urban form and travel behaviour. Influential writers within the tradition of critical realism reject the possibility of predicting social...... phenomena. This position is fundamentally problematic to public planning. Without at least some ability to predict the likely consequences of different proposals, the justification for public sector intervention into market mechanisms will be frail. Statistical methods like regression analyses are commonly...... seen as necessary in order to identify aggregate level effects of policy measures, but are questioned by many advocates of critical realist ontology. Using research into the relationship between urban structure and travel as an example, the paper discusses relevant research methods and the kinds...

  20. Dual Regression

    OpenAIRE

    Spady, Richard; Stouli, Sami

    2012-01-01

    We propose dual regression as an alternative to the quantile regression process for the global estimation of conditional distribution functions under minimal assumptions. Dual regression provides all the interpretational power of the quantile regression process while avoiding the need for repairing the intersecting conditional quantile surfaces that quantile regression often produces in practice. Our approach introduces a mathematical programming characterization of conditional distribution f...

  1. Intermediate and advanced topics in multilevel logistic regression analysis.

    Science.gov (United States)

    Austin, Peter C; Merlo, Juan

    2017-09-10

    Multilevel data occur frequently in health services, population and public health, and epidemiologic research. In such research, binary outcomes are common. Multilevel logistic regression models allow one to account for the clustering of subjects within clusters of higher-level units when estimating the effect of subject and cluster characteristics on subject outcomes. A search of the PubMed database demonstrated that the use of multilevel or hierarchical regression models is increasing rapidly. However, our impression is that many analysts simply use multilevel regression models to account for the nuisance of within-cluster homogeneity that is induced by clustering. In this article, we describe a suite of analyses that can complement the fitting of multilevel logistic regression models. These ancillary analyses permit analysts to estimate the marginal or population-average effect of covariates measured at the subject and cluster level, in contrast to the within-cluster or cluster-specific effects arising from the original multilevel logistic regression model. We describe the interval odds ratio and the proportion of opposed odds ratios, which are summary measures of effect for cluster-level covariates. We describe the variance partition coefficient and the median odds ratio which are measures of components of variance and heterogeneity in outcomes. These measures allow one to quantify the magnitude of the general contextual effect. We describe an R 2 measure that allows analysts to quantify the proportion of variation explained by different multilevel logistic regression models. We illustrate the application and interpretation of these measures by analyzing mortality in patients hospitalized with a diagnosis of acute myocardial infarction. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd. © 2017 The Authors. Statistics in Medicine published by John Wiley & Sons Ltd.

  2. Predictions of biochar production and torrefaction performance from sugarcane bagasse using interpolation and regression analysis.

    Science.gov (United States)

    Chen, Wei-Hsin; Hsu, Hung-Jen; Kumar, Gopalakrishnan; Budzianowski, Wojciech M; Ong, Hwai Chyuan

    2017-12-01

    This study focuses on the biochar formation and torrefaction performance of sugarcane bagasse, and they are predicted using the bilinear interpolation (BLI), inverse distance weighting (IDW) interpolation, and regression analysis. It is found that the biomass torrefied at 275°C for 60min or at 300°C for 30min or longer is appropriate to produce biochar as alternative fuel to coal with low carbon footprint, but the energy yield from the torrefaction at 300°C is too low. From the biochar yield, enhancement factor of HHV, and energy yield, the results suggest that the three methods are all feasible for predicting the performance, especially for the enhancement factor. The power parameter of unity in the IDW method provides the best predictions and the error is below 5%. The second order in regression analysis gives a more reasonable approach than the first order, and is recommended for the predictions. Copyright © 2017 Elsevier Ltd. All rights reserved.

  3. Application of Artificial Neural Networks in Canola Crop Yield Prediction

    Directory of Open Access Journals (Sweden)

    S. J. Sajadi

    2014-02-01

    Full Text Available Crop yield prediction has an important role in agricultural policies such as specification of the crop price. Crop yield prediction researches have been based on regression analysis. In this research canola yield was predicted using Artificial Neural Networks (ANN using 11 crop year climate data (1998-2009 in Gonbad-e-Kavoos region of Golestan province. ANN inputs were mean weekly rainfall, mean weekly temperature, mean weekly relative humidity and mean weekly sun shine hours and ANN output was canola yield (kg/ha. Multi-Layer Perceptron networks (MLP with Levenberg-Marquardt backpropagation learning algorithm was used for crop yield prediction and Root Mean Square Error (RMSE and square of the Correlation Coefficient (R2 criterions were used to evaluate the performance of the ANN. The obtained results show that the 13-20-1 network has the lowest RMSE equal to 101.235 and maximum value of R2 equal to 0.997 and is suitable for predicting canola yield with climate factors.

  4. Regression Analyses on the Butterfly Ballot Effect: A Statistical Perspective of the US 2000 Election

    Science.gov (United States)

    Wu, Dane W.

    2002-01-01

    The year 2000 US presidential election between Al Gore and George Bush has been the most intriguing and controversial one in American history. The state of Florida was the trigger for the controversy, mainly, due to the use of the misleading "butterfly ballot". Using prediction (or confidence) intervals for least squares regression lines…

  5. Using NDVI and guided sampling to develop yield prediction maps of processing tomato crop

    Energy Technology Data Exchange (ETDEWEB)

    Fortes, A.; Henar Prieto, M. del; García-Martín, A.; Córdoba, A.; Martínez, L.; Campillo, C.

    2015-07-01

    The use of yield prediction maps is an important tool for the delineation of within-field management zones. Vegetation indices based on crop reflectance are of potential use in the attainment of this objective. There are different types of vegetation indices based on crop reflectance, the most commonly used of which is the NDVI (normalized difference vegetation index). NDVI values are reported to have good correlation with several vegetation parameters including the ability to predict yield. The field research was conducted in two commercial farms of processing tomato crop, Cantillana and Enviciados. An NDVI prediction map developed through ordinary kriging technique was used for guided sampling of processing tomato yield. Yield was studied and related with NDVI, and finally a prediction map of crop yield for the entire plot was generated using two geostatistical methodologies (ordinary and regression kriging). Finally, a comparison was made between the yield obtained at validation points and the yield values according to the prediction maps. The most precise yield maps were obtained with the regression kriging methodology with RRMSE values of 14% and 17% in Cantillana and Enviciados, respectively, using the NDVI as predictor. The coefficient of correlation between NDVI and yield was correlated in the point samples taken in the two locations, with values of 0.71 and 0.67 in Cantillana and Enviciados, respectively. The results suggest that the use of a massive sampling parameter such as NDVI is a good indicator of the distribution of within-field yield variation. (Author)

  6. Predictors of success of external cephalic version and cephalic presentation at birth among 1253 women with non-cephalic presentation using logistic regression and classification tree analyses.

    Science.gov (United States)

    Hutton, Eileen K; Simioni, Julia C; Thabane, Lehana

    2017-08-01

    Among women with a fetus with a non-cephalic presentation, external cephalic version (ECV) has been shown to reduce the rate of breech presentation at birth and cesarean birth. Compared with ECV at term, beginning ECV prior to 37 weeks' gestation decreases the number of infants in a non-cephalic presentation at birth. The purpose of this secondary analysis was to investigate factors associated with a successful ECV procedure and to present this in a clinically useful format. Data were collected as part of the Early ECV Pilot and Early ECV2 Trials, which randomized 1776 women with a fetus in breech presentation to either early ECV (34-36 weeks' gestation) or delayed ECV (at or after 37 weeks). The outcome of interest was successful ECV, defined as the fetus being in a cephalic presentation immediately following the procedure, as well as at the time of birth. The importance of several factors in predicting successful ECV was investigated using two statistical methods: logistic regression and classification and regression tree (CART) analyses. Among nulliparas, non-engagement of the presenting part and an easily palpable fetal head were independently associated with success. Among multiparas, non-engagement of the presenting part, gestation less than 37 weeks and an easily palpable fetal head were found to be independent predictors of success. These findings were consistent with results of the CART analyses. Regardless of parity, descent of the presenting part was the most discriminating factor in predicting successful ECV and cephalic presentation at birth. © 2017 Nordic Federation of Societies of Obstetrics and Gynecology.

  7. Quantitative Prediction of Coalbed Gas Content Based on Seismic Multiple-Attribute Analyses

    Directory of Open Access Journals (Sweden)

    Renfang Pan

    2015-09-01

    Full Text Available Accurate prediction of gas planar distribution is crucial to selection and development of new CBM exploration areas. Based on seismic attributes, well logging and testing data we found that seismic absorption attenuation, after eliminating the effects of burial depth, shows an evident correlation with CBM gas content; (positive structure curvature has a negative correlation with gas content; and density has a negative correlation with gas content. It is feasible to use the hydrocarbon index (P*G and pseudo-Poisson ratio attributes for detection of gas enrichment zones. Based on seismic multiple-attribute analyses, a multiple linear regression equation was established between the seismic attributes and gas content at the drilling wells. Application of this equation to the seismic attributes at locations other than the drilling wells yielded a quantitative prediction of planar gas distribution. Prediction calculations were performed for two different models, one using pre-stack inversion and the other one disregarding pre-stack inversion. A comparison of the results indicates that both models predicted a similar trend for gas content distribution, except that the model using pre-stack inversion yielded a prediction result with considerably higher precision than the other model.

  8. Cigarette nicotine yields and nicotine intake among Japanese male workers

    OpenAIRE

    Ueda, K; Kawachi, I; Nakamura, M; Nogami, H; Shirokawa, N; Masui, S; Okayama, A; Oshima, A

    2002-01-01

    Objectives: To analyse brand nicotine yield including "ultra low" brands (that is, cigarettes yielding ≤ 0.1 mg of nicotine by Federal Trade Commission (FTC) methods) in relation to nicotine intake (urinary nicotine, cotinine and trans-3'-hydroxycotinine) among 246 Japanese male smokers.

  9. Assessment of adaptability and stability of grain yield in bread wheat genotypes under different sowing times in Punjab

    International Nuclear Information System (INIS)

    Anwar, J.; Hussain, M.; Ali, M.A.; Subhani, G.M.; Munir, M.

    2011-01-01

    Twenty advanced lines/genotypes of wheat including two check varieties were sown under two different sowing times through out the Punjab province at 18 different locations with diverse environments to study their stability and adaptability. Normal sowing was done in second week of November 2007 while the delayed sowing was completed during second week of December 2007 during crop season 2007-08. The pooled analysis of variance showed significant differences among environments and genotypes for grain yield demonstrating the presence of considerable variations (p<0.01) among genotypes as well as diversity of growing environments at various locations for both normal and late sown wheat crops. The highest average grain yield was obtained at Jalandar Seed Farm, Arifwala and Pak. German Farm, Multan for normal and delayed sown crops, respectively. Most of the locations emerged as high yielding in normal sowing compared to late sown crop. Dendrograms of 18 locations based on the average yield of 20 wheat genotypes grown under normal and late sown crop revealed two main clusters. Under both normal and late sowing, none of the varieties exceeded the check Seher-2006, however, the check was followed by the advanced lines V-04022 and V-05066 for normal sown crop and Shafaq-2006, V-05066 and V-04022 under delayed sowing. All the genotypes revealed decline in grain yield for late sown wheat crop. The analysis of stability based on mean grain yield, regression coefficient and deviation from regression advocated that the cultivars V-05066 and V-03BT007 were most stable and adapted to diverse environmental conditions of Punjab. These cultivars revealed unit regression and non-significant deviations from regression. The check variety Seher-2006 produced maximum yield for both sowing times that suggested its consistent and stable performance across the environments. (author)

  10. Invited review: A commentary on predictive cheese yield formulas.

    Science.gov (United States)

    Emmons, D B; Modler, H W

    2010-12-01

    Predictive cheese yield formulas have evolved from one based only on casein and fat in 1895. Refinements have included moisture and salt in cheese and whey solids as separate factors, paracasein instead of casein, and exclusion of whey solids from moisture associated with cheese protein. The General, Barbano, and Van Slyke formulas were tested critically using yield and composition of milk, whey, and cheese from 22 vats of Cheddar cheese. The General formula is based on the sum of cheese components: fat, protein, moisture, salt, whey solids free of fat and protein, as well as milk salts associated with paracasein. The testing yielded unexpected revelations. It was startling that the sum of components in cheese was SofC) in cheese. The apparent low estimation of SofC led to the idea of adjusting upwards, for each vat, the 5 measured components in the formula by the observed SofC, as a fraction. The mean of the adjusted predicted yields as percentages of actual yields was 99.99%. The adjusted forms of the General, Barbano, and Van Slyke formulas gave predicted yields equal to the actual yields. It was apparent that unadjusted yield formulas did not accurately predict yield; however, unadjusted PY%AY can be useful as a control tool for analyses of cheese and milk. It was unexpected that total milk protein in the adjusted General formula gave the same predicted yields as casein and paracasein, indicating that casein or paracasein may not always be necessary for successful yield prediction. The use of constants for recovery of fat and protein in the adjusted General formula gave adjusted predicted yields equal to actual yields, indicating that analyses of cheese for protein and fat may not always be necessary for yield prediction. Composition of cheese was estimated using a predictive formula; actual yield was needed for estimation of composition. Adjusted formulas are recommended for estimating target yields and cheese yield efficiency. Constants for solute exclusion

  11. Correlation maps to assess soybean yield from EVI data in Paraná State, Brazil

    Directory of Open Access Journals (Sweden)

    Gleyce Kelly Dantas Araújo Figueiredo

    Full Text Available ABSTRACT Vegetation indices are widely used to monitor crop development and generally used as input data in models to forecast yield. The first step of this study consisted of using monthly Maximum Value Composites to create correlation maps using Enhanced Vegetation Index (EVI from Moderate Resolution Imaging Spectroradiometer (MODIS sensor mounted on Terra satellite and historical yield during the soybean crop cycle in Paraná State, Brazil, from 2000/2001 to 2010/2011. We compared the ability of forecasting crop yield based on correlation maps and crop specific masks. We ran a preliminary regression model to test its ability on yield estimation for four municipalities during the soybean growing season. A regression model was developed for both methodologies to forecast soybean crop yield using leave-one-out cross validation. The Root Mean Squared Error (RMSE values in the implementation of the model ranged from 0.037 t ha−1 to 0.19 t ha−1 using correlation maps, while for crop specific masks, it varied from 0.21 t ha−1 to 0.35 t ha−1. The model was able to explain 96 % to 98 % of the variance in estimated yield from correlation maps, while it was able to explain only 2 % to 67 % for crop specific mask approach. The results showed that the correlation maps could be used to predict crop yield more effectively than crop specific masks. In addition, this method can provide an indication of soybean yield prior to harvesting.

  12. Modelling of seed yield and its components in tall fescue (Festuca ...

    African Journals Online (AJOL)

    AJL

    2011-10-03

    Oct 3, 2011 ... Ridge regression analysis was used to derive a steady algorithmic .... included three replicates [3 × 6 = 18 plots (treatments), stochastic ..... The parameter estimates of the five seed yield components of a total of 327 samples.

  13. Quantum algorithm for linear regression

    Science.gov (United States)

    Wang, Guoming

    2017-07-01

    We present a quantum algorithm for fitting a linear regression model to a given data set using the least-squares approach. Differently from previous algorithms which yield a quantum state encoding the optimal parameters, our algorithm outputs these numbers in the classical form. So by running it once, one completely determines the fitted model and then can use it to make predictions on new data at little cost. Moreover, our algorithm works in the standard oracle model, and can handle data sets with nonsparse design matrices. It runs in time poly( log2(N ) ,d ,κ ,1 /ɛ ) , where N is the size of the data set, d is the number of adjustable parameters, κ is the condition number of the design matrix, and ɛ is the desired precision in the output. We also show that the polynomial dependence on d and κ is necessary. Thus, our algorithm cannot be significantly improved. Furthermore, we also give a quantum algorithm that estimates the quality of the least-squares fit (without computing its parameters explicitly). This algorithm runs faster than the one for finding this fit, and can be used to check whether the given data set qualifies for linear regression in the first place.

  14. On Rigorous Drought Assessment Using Daily Time Scale: Non-Stationary Frequency Analyses, Revisited Concepts, and a New Method to Yield Non-Parametric Indices

    Directory of Open Access Journals (Sweden)

    Charles Onyutha

    2017-10-01

    Full Text Available Some of the problems in drought assessments are that: analyses tend to focus on coarse temporal scales, many of the methods yield skewed indices, a few terminologies are ambiguously used, and analyses comprise an implicit assumption that the observations come from a stationary process. To solve these problems, this paper introduces non-stationary frequency analyses of quantiles. How to use non-parametric rescaling to obtain robust indices that are not (or minimally skewed is also introduced. To avoid ambiguity, some concepts on, e.g., incidence, extremity, etc., were revisited through shift from monthly to daily time scale. Demonstrations on the introduced methods were made using daily flow and precipitation insufficiency (precipitation minus potential evapotranspiration from the Blue Nile basin in Africa. Results show that, when a significant trend exists in extreme events, stationarity-based quantiles can be far different from those when non-stationarity is considered. The introduced non-parametric indices were found to closely agree with the well-known standardized precipitation evapotranspiration indices in many aspects but skewness. Apart from revisiting some concepts, the advantages of the use of fine instead of coarse time scales in drought assessment were given. The links for obtaining freely downloadable tools on how to implement the introduced methods were provided.

  15. Geodesic least squares regression for scaling studies in magnetic confinement fusion

    International Nuclear Information System (INIS)

    Verdoolaege, Geert

    2015-01-01

    In regression analyses for deriving scaling laws that occur in various scientific disciplines, usually standard regression methods have been applied, of which ordinary least squares (OLS) is the most popular. However, concerns have been raised with respect to several assumptions underlying OLS in its application to scaling laws. We here discuss a new regression method that is robust in the presence of significant uncertainty on both the data and the regression model. The method, which we call geodesic least squares regression (GLS), is based on minimization of the Rao geodesic distance on a probabilistic manifold. We demonstrate the superiority of the method using synthetic data and we present an application to the scaling law for the power threshold for the transition to the high confinement regime in magnetic confinement fusion devices

  16. Seed yield and quality of okra as influenced by sowing dates

    African Journals Online (AJOL)

    Administrator

    2011-06-20

    Jun 20, 2011 ... and quality seed of okra over four consecutive years having seven sowing times at fortnight interval ..... Figures in parentheses are angular transformed values. ... Table 6. Step-wise regression model on seed yield of okra with ...

  17. Controls and forecasts of nitrate yields in forested watersheds: A view over mainland Portugal.

    Science.gov (United States)

    Pacheco, F A L; Santos, R M B; Sanches Fernandes, L F; Pereira, M G; Cortes, R M V

    2015-12-15

    A study on nitrate yields was conducted in forested watersheds of mainland Portugal. The prime goal was to rank parameters in descending order of their contribution to the export of nitrate towards streams and lakes. To attain the goal, variables like soil loss, rainfall intensity, topography, soil type, forest composition and environmental disturbances such as hardwood harvesting or wildfires were organized in a conceptual yield model. Because some parameters were potentially collinear, a robust multivariate statistical technique was selected to execute the conceptual model and perform the aforementioned ranking, namely Partial Least Squares (PLS) regression. This technique was tested with a sample of 60 forested watersheds (>70% of forest occupation), being subject to a double-validation process to ensure prediction capability. According to final regression coefficients, soil erosion seems to regulate nitrate distribution across the basins, because soil loss and type, rainfall intensity and topography explained around 60% of nitrate yield variance. The major importance of erosion is followed by a moderate role of biochemical processes such as nitrification or nutrient uptake, which accounted for approximately 15% of nitrate yield variance. In this case, deciduous forests and scrubland seem to behave as net sinks of nitrate while coniferous and mixed forests seem to act dually, as net sources or sinks. The least important parameters are the environmental disturbances, explaining no more than 5% of nitrate yield variance. The results of PLS regression were coupled in a scenario analysis with measures designed to protect soil from erosion and surface water from eutrophication. These interventions are to be implemented until 2045, according to regional plans of forest management. Considering the key role of erosion in explaining nitrate dynamics across the catchments, it was not surprising to verify that soil protection measures may reduce nitrate yields by some 35

  18. A Comparison of Advanced Regression Algorithms for Quantifying Urban Land Cover

    Directory of Open Access Journals (Sweden)

    Akpona Okujeni

    2014-07-01

    Full Text Available Quantitative methods for mapping sub-pixel land cover fractions are gaining increasing attention, particularly with regard to upcoming hyperspectral satellite missions. We evaluated five advanced regression algorithms combined with synthetically mixed training data for quantifying urban land cover from HyMap data at 3.6 and 9 m spatial resolution. Methods included support vector regression (SVR, kernel ridge regression (KRR, artificial neural networks (NN, random forest regression (RFR and partial least squares regression (PLSR. Our experiments demonstrate that both kernel methods SVR and KRR yield high accuracies for mapping complex urban surface types, i.e., rooftops, pavements, grass- and tree-covered areas. SVR and KRR models proved to be stable with regard to the spatial and spectral differences between both images and effectively utilized the higher complexity of the synthetic training mixtures for improving estimates for coarser resolution data. Observed deficiencies mainly relate to known problems arising from spectral similarities or shadowing. The remaining regressors either revealed erratic (NN or limited (RFR and PLSR performances when comprehensively mapping urban land cover. Our findings suggest that the combination of kernel-based regression methods, such as SVR and KRR, with synthetically mixed training data is well suited for quantifying urban land cover from imaging spectrometer data at multiple scales.

  19. Easy methods for extracting individual regression slopes: Comparing SPSS, R, and Excel

    Directory of Open Access Journals (Sweden)

    Roland Pfister

    2013-10-01

    Full Text Available Three different methods for extracting coefficientsof linear regression analyses are presented. The focus is on automatic and easy-to-use approaches for common statistical packages: SPSS, R, and MS Excel / LibreOffice Calc. Hands-on examples are included for each analysis, followed by a brief description of how a subsequent regression coefficient analysis is performed.

  20. The analysis of nonstationary time series using regression, correlation and cointegration

    DEFF Research Database (Denmark)

    Johansen, Søren

    2012-01-01

    There are simple well-known conditions for the validity of regression and correlation as statistical tools. We analyse by examples the effect of nonstationarity on inference using these methods and compare them to model based inference using the cointegrated vector autoregressive model. Finally we...... analyse some monthly data from US on interest rates as an illustration of the methods...

  1. Regression: A Bibliography.

    Science.gov (United States)

    Pedrini, D. T.; Pedrini, Bonnie C.

    Regression, another mechanism studied by Sigmund Freud, has had much research, e.g., hypnotic regression, frustration regression, schizophrenic regression, and infra-human-animal regression (often directly related to fixation). Many investigators worked with hypnotic age regression, which has a long history, going back to Russian reflexologists.…

  2. Quantitative Genetic Analysis for Yield and Yield Components in Boro Rice (Oryza sativa L.

    Directory of Open Access Journals (Sweden)

    Supriyo CHAKRABORTY

    2010-03-01

    Full Text Available Twenty-nine genotypes of boro rice (Oryza sativa L. were grown in a randomized block design with three replications in plots of 4m x 1m with a crop geometry of 20 cm x 20 cm between November-April, in Regional Agricultural Research Station, Nagaon, India. Quantitative data were collected on five randomly selected plants of each genotype per replication for yield/plant, and six other yield components, namely plant height, panicles/plant, panicle length, effective grains/panicle, 100 grain weight and harvest index. Mean values of the characters for each genotype were used for analysis of variance and covariance to obtain information on genotypic and phenotypic correlation along with coheritability between two characters. Path analyses were carried out to estimate the direct and indirect effects of boro rice�s yield components. The objective of the study was to identify the characters that mostly influence the yield for increasing boro rice productivity through breeding program. Correlation analysis revealed significant positive genotypic correlation of yield/plant with plant height (0.21, panicles/plant (0.53, panicle length (0.53, effective grains/panicle (0.57 and harvest index (0.86. Path analysis based on genotypic correlation coefficients elucidated high positive direct effect of harvest index (0.8631, panicle length (0.2560 and 100 grain weight (0.1632 on yield/plant with a residual effect of 0.33. Plant height and panicles/plant recorded high positive indirect effect on yield/plant via harvest index whereas effective grains/panicle on yield/plant via harvest index and panicle length. Results of the present study suggested that five component characters, namely harvest index, effective grains/plant, panicle length, panicles/plant and plant height influenced the yield of boro rice. A genotype with higher magnitude of these component characters could be either selected from the existing genotypes or evolved by breeding program for genetic

  3. Relation between academic yield and stress in medical students

    Directory of Open Access Journals (Sweden)

    Sandra Patricia González Peña

    2006-12-01

    Full Text Available Objective. To study risk factors that where found as influence in the academic yield (stress, alcohol, friendships, depression and family relations in the students of the Medicine Faculty of the Universidad de Manizales. Materials and methods: Descriptive study integrated by random selected sample, who were attending of II to XI semester of the Medicine faculty. An anonymous survey was conduced about sociodemographic, cultural, academic and motivational characteristics,including stress, depresión, family disfunction and substance abuse. We correlated all variables with academic yield using chi square test, Pearson`s coefficient and lineal regression. Results: 212 students of ages between 17 and 31 years where analyzed, in which the majority where from another city. Some of the factors were detected which affect the academic yield of the students as it is stress, depression, the family function and friendships among others. Conclusions: A significant relation between academic yield and stress was found. In turn, stress variable was influenced by depression, alcohol and family relation.

  4. Tutorial on Using Regression Models with Count Outcomes Using R

    Directory of Open Access Journals (Sweden)

    A. Alexander Beaujean

    2016-02-01

    Full Text Available Education researchers often study count variables, such as times a student reached a goal, discipline referrals, and absences. Most researchers that study these variables use typical regression methods (i.e., ordinary least-squares either with or without transforming the count variables. In either case, using typical regression for count data can produce parameter estimates that are biased, thus diminishing any inferences made from such data. As count-variable regression models are seldom taught in training programs, we present a tutorial to help educational researchers use such methods in their own research. We demonstrate analyzing and interpreting count data using Poisson, negative binomial, zero-inflated Poisson, and zero-inflated negative binomial regression models. The count regression methods are introduced through an example using the number of times students skipped class. The data for this example are freely available and the R syntax used run the example analyses are included in the Appendix.

  5. Use of probabilistic weights to enhance linear regression myoelectric control.

    Science.gov (United States)

    Smith, Lauren H; Kuiken, Todd A; Hargrove, Levi J

    2015-12-01

    Clinically available prostheses for transradial amputees do not allow simultaneous myoelectric control of degrees of freedom (DOFs). Linear regression methods can provide simultaneous myoelectric control, but frequently also result in difficulty with isolating individual DOFs when desired. This study evaluated the potential of using probabilistic estimates of categories of gross prosthesis movement, which are commonly used in classification-based myoelectric control, to enhance linear regression myoelectric control. Gaussian models were fit to electromyogram (EMG) feature distributions for three movement classes at each DOF (no movement, or movement in either direction) and used to weight the output of linear regression models by the probability that the user intended the movement. Eight able-bodied and two transradial amputee subjects worked in a virtual Fitts' law task to evaluate differences in controllability between linear regression and probability-weighted regression for an intramuscular EMG-based three-DOF wrist and hand system. Real-time and offline analyses in able-bodied subjects demonstrated that probability weighting improved performance during single-DOF tasks (p linear regression control. Use of probability weights can improve the ability to isolate individual during linear regression myoelectric control, while maintaining the ability to simultaneously control multiple DOFs.

  6. Advanced statistics: linear regression, part I: simple linear regression.

    Science.gov (United States)

    Marill, Keith A

    2004-01-01

    Simple linear regression is a mathematical technique used to model the relationship between a single independent predictor variable and a single dependent outcome variable. In this, the first of a two-part series exploring concepts in linear regression analysis, the four fundamental assumptions and the mechanics of simple linear regression are reviewed. The most common technique used to derive the regression line, the method of least squares, is described. The reader will be acquainted with other important concepts in simple linear regression, including: variable transformations, dummy variables, relationship to inference testing, and leverage. Simplified clinical examples with small datasets and graphic models are used to illustrate the points. This will provide a foundation for the second article in this series: a discussion of multiple linear regression, in which there are multiple predictor variables.

  7. Building optimal regression tree by ant colony system-genetic algorithm: Application to modeling of melting points

    Energy Technology Data Exchange (ETDEWEB)

    Hemmateenejad, Bahram, E-mail: hemmatb@sums.ac.ir [Department of Chemistry, Shiraz University, Shiraz (Iran, Islamic Republic of); Medicinal and Natural Products Chemistry Research Center, Shiraz University of Medical Sciences, Shiraz (Iran, Islamic Republic of); Shamsipur, Mojtaba [Department of Chemistry, Razi University, Kermanshah (Iran, Islamic Republic of); Zare-Shahabadi, Vali [Young Researchers Club, Mahshahr Branch, Islamic Azad University, Mahshahr (Iran, Islamic Republic of); Akhond, Morteza [Department of Chemistry, Shiraz University, Shiraz (Iran, Islamic Republic of)

    2011-10-17

    Highlights: {yields} Ant colony systems help to build optimum classification and regression trees. {yields} Using of genetic algorithm operators in ant colony systems resulted in more appropriate models. {yields} Variable selection in each terminal node of the tree gives promising results. {yields} CART-ACS-GA could model the melting point of organic materials with prediction errors lower than previous models. - Abstract: The classification and regression trees (CART) possess the advantage of being able to handle large data sets and yield readily interpretable models. A conventional method of building a regression tree is recursive partitioning, which results in a good but not optimal tree. Ant colony system (ACS), which is a meta-heuristic algorithm and derived from the observation of real ants, can be used to overcome this problem. The purpose of this study was to explore the use of CART and its combination with ACS for modeling of melting points of a large variety of chemical compounds. Genetic algorithm (GA) operators (e.g., cross averring and mutation operators) were combined with ACS algorithm to select the best solution model. In addition, at each terminal node of the resulted tree, variable selection was done by ACS-GA algorithm to build an appropriate partial least squares (PLS) model. To test the ability of the resulted tree, a set of approximately 4173 structures and their melting points were used (3000 compounds as training set and 1173 as validation set). Further, an external test set containing of 277 drugs was used to validate the prediction ability of the tree. Comparison of the results obtained from both trees showed that the tree constructed by ACS-GA algorithm performs better than that produced by recursive partitioning procedure.

  8. The Analysis of Nonstationary Time Series Using Regression, Correlation and Cointegration

    Directory of Open Access Journals (Sweden)

    Søren Johansen

    2012-06-01

    Full Text Available There are simple well-known conditions for the validity of regression and correlation as statistical tools. We analyse by examples the effect of nonstationarity on inference using these methods and compare them to model based inference using the cointegrated vector autoregressive model. Finally we analyse some monthly data from US on interest rates as an illustration of the methods.

  9. Tools to support interpreting multiple regression in the face of multicollinearity.

    Science.gov (United States)

    Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K

    2012-01-01

    While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.

  10. Testing and Modeling Fuel Regression Rate in a Miniature Hybrid Burner

    Directory of Open Access Journals (Sweden)

    Luciano Fanton

    2012-01-01

    Full Text Available Ballistic characterization of an extended group of innovative HTPB-based solid fuel formulations for hybrid rocket propulsion was performed in a lab-scale burner. An optical time-resolved technique was used to assess the quasisteady regression history of single perforation, cylindrical samples. The effects of metalized additives and radiant heat transfer on the regression rate of such formulations were assessed. Under the investigated operating conditions and based on phenomenological models from the literature, analyses of the collected experimental data show an appreciable influence of the radiant heat flux from burnt gases and soot for both unloaded and loaded fuel formulations. Pure HTPB regression rate data are satisfactorily reproduced, while the impressive initial regression rates of metalized formulations require further assessment.

  11. Blackleg (Leptosphaeria maculans) Severity and Yield Loss in Canola in Alberta, Canada

    Science.gov (United States)

    Hwang, Sheau-Fang; Strelkov, Stephen E.; Peng, Gary; Ahmed, Hafiz; Zhou, Qixing; Turnbull, George

    2016-01-01

    Blackleg, caused by Leptosphaeria maculans, is an important disease of oilseed rape (Brassica napus L.) in Canada and throughout the world. Severe epidemics of blackleg can result in significant yield losses. Understanding disease-yield relationships is a prerequisite for measuring the agronomic efficacy and economic benefits of control methods. Field experiments were conducted in 2013, 2014, and 2015 to determine the relationship between blackleg disease severity and yield in a susceptible cultivar and in moderately resistant to resistant canola hybrids. Disease severity was lower, and seed yield was 120%–128% greater, in the moderately resistant to resistant hybrids compared with the susceptible cultivar. Regression analysis showed that pod number and seed yield declined linearly as blackleg severity increased. Seed yield per plant decreased by 1.8 g for each unit increase in disease severity, corresponding to a decline in yield of 17.2% for each unit increase in disease severity. Pyraclostrobin fungicide reduced disease severity in all site-years and increased yield. These results show that the reduction of blackleg in canola crops substantially improves yields. PMID:27447676

  12. Blackleg (Leptosphaeria maculans Severity and Yield Loss in Canola in Alberta, Canada

    Directory of Open Access Journals (Sweden)

    Sheau-Fang Hwang

    2016-07-01

    Full Text Available Blackleg, caused by Leptosphaeria maculans, is an important disease of oilseed rape (Brassica napus L. in Canada and throughout the world. Severe epidemics of blackleg can result in significant yield losses. Understanding disease-yield relationships is a prerequisite for measuring the agronomic efficacy and economic benefits of control methods. Field experiments were conducted in 2013, 2014, and 2015 to determine the relationship between blackleg disease severity and yield in a susceptible cultivar and in moderately resistant to resistant canola hybrids. Disease severity was lower, and seed yield was 120%–128% greater, in the moderately resistant to resistant hybrids compared with the susceptible cultivar. Regression analysis showed that pod number and seed yield declined linearly as blackleg severity increased. Seed yield per plant decreased by 1.8 g for each unit increase in disease severity, corresponding to a decline in yield of 17.2% for each unit increase in disease severity. Pyraclostrobin fungicide reduced disease severity in all site-years and increased yield. These results show that the reduction of blackleg in canola crops substantially improves yields.

  13. Differential item functioning (DIF) analyses of health-related quality of life instruments using logistic regression

    DEFF Research Database (Denmark)

    Scott, Neil W; Fayers, Peter M; Aaronson, Neil K

    2010-01-01

    Differential item functioning (DIF) methods can be used to determine whether different subgroups respond differently to particular items within a health-related quality of life (HRQoL) subscale, after allowing for overall subgroup differences in that scale. This article reviews issues that arise ...... when testing for DIF in HRQoL instruments. We focus on logistic regression methods, which are often used because of their efficiency, simplicity and ease of application....

  14. YIELD STABILITY OF NEW HYBRID RICE ACROSS LOCATIONS

    Directory of Open Access Journals (Sweden)

    Satoto

    2016-02-01

    Full Text Available The adaptation of hybrid rice varieties mostly are in specific location and season, but there are some of the varieties have a wide adaptation then adopted by the farmer in the large area. Replicated yield trials were conducted to study the stability of hybrid rice yield and identify the best location to optimize their yield per ha. The trials were conducted in three location such as Sukamandi, Salatiga and Malang during two seasons in 2011. Data across location and season were analazed by using AMMI and Eberhart Russel methods. The AMMI analysis showed that the IR79156A/PK88 was adaptable to favorable environments but unstable. This hybrid is always performing well and produce the higher yield compare to check variety. Some of other hybrids were good only in specific location, i.e. IR62829A/BP2280-1E-12-22 and IR58029A/BP2280-1E-12-22. Those hybrids produced higher yield in Salatiga and Malang, respectively. Seem to AMMI analysis, the result of Eberhart and Russell method also showed that IR79156A/PK81 was the best hybrid with regression slope (b around 1 with the yield average higher than average of all hybrids. It indicated that this hybrid has a wide adaptation and probably can be cultivated in the wider ecosystem.

  15. Measure of the e+e-{yields}bb Cross Section at the LEP Energies; Medida de la seccion eficaz e''+e''-{yields}bb a las Energias de LEP

    Energy Technology Data Exchange (ETDEWEB)

    Arce Dubois, P

    1992-07-01

    In the present work I analyse the data collected during 1990 by the L3 detector, situated in the electron-positron collider LEP. After selecting the events e''+e''-{yields} bb through their semileptonic decays into muons, I calculate the cross section for the process e''+e''- {yields} bb at different energy points around the mass of the vectorial boson Z, and I measure some parameters of the Standard Model, namely, the Br(b{yields}{mu} ),{gamma}{sub z}n-{yields}bb/{gamma}{sub z}n{yields}had and {gamma}{sub z}n{yields}bb{gamma}{sub z}n{yields}e''+e''-. (Author) 26 refs.

  16. The impact of global signal regression on resting state correlations: are anti-correlated networks introduced?

    Science.gov (United States)

    Murphy, Kevin; Birn, Rasmus M; Handwerker, Daniel A; Jones, Tyler B; Bandettini, Peter A

    2009-02-01

    Low-frequency fluctuations in fMRI signal have been used to map several consistent resting state networks in the brain. Using the posterior cingulate cortex as a seed region, functional connectivity analyses have found not only positive correlations in the default mode network but negative correlations in another resting state network related to attentional processes. The interpretation is that the human brain is intrinsically organized into dynamic, anti-correlated functional networks. Global variations of the BOLD signal are often considered nuisance effects and are commonly removed using a general linear model (GLM) technique. This global signal regression method has been shown to introduce negative activation measures in standard fMRI analyses. The topic of this paper is whether such a correction technique could be the cause of anti-correlated resting state networks in functional connectivity analyses. Here we show that, after global signal regression, correlation values to a seed voxel must sum to a negative value. Simulations also show that small phase differences between regions can lead to spurious negative correlation values. A combination breath holding and visual task demonstrates that the relative phase of global and local signals can affect connectivity measures and that, experimentally, global signal regression leads to bell-shaped correlation value distributions, centred on zero. Finally, analyses of negatively correlated networks in resting state data show that global signal regression is most likely the cause of anti-correlations. These results call into question the interpretation of negatively correlated regions in the brain when using global signal regression as an initial processing step.

  17. Trends in nicotine yield in smoke and its relationship with design characteristics among popular US cigarette brands, 1997-2005.

    Science.gov (United States)

    Connolly, Gregory N; Alpert, Hillel R; Wayne, Geoffrey Ferris; Koh, Howard

    2007-10-01

    To determine whether nicotine yields in the smoke of cigarettes would show an overall increase over time or an increasing trend limited to any particular market category (eg, full flavour vs light, medium (mild) or ultralight; mentholated vs non-mentholated), manufacturer, or brand family or brand style, and whether nicotine yields in smoke would be associated with measurable trends in cigarette design. Machine-based measures of nicotine yield in smoke and measures of cigarette design features related to nicotine delivery (ventilation, nicotine content in the tobacco rod and number of puffs), as well as market category descriptors, were obtained from annual reports filed with the Massachusetts Department of Public by tobacco manufacturers for 1997-2005. Trends in nicotine yield and its relationship with design features and market parameters were analysed with multilevel mixed-effects regression modelling procedures. A statistically significant trend was confirmed in increased nicotine yield, of 0.019 (1.1%) mg/cig/year over the period 1997-2005 and 0.029 (1.6%) mg/cig/year over the period 1998-2005. The increasing trend was observed in all major market categories (mentholated vs non-mentholated, and full flavour vs light, medium (mild) or ultralight). Nicotine yield in smoke was positively associated with nicotine concentration in the tobacco and number of puffs per cigarette, both of which showed increasing trends during the study period. This study confirms increased machine-measured levels of nicotine, the addictive agent in cigarettes, in smoke, to be a result of increased nicotine in the tobacco rod and other design modifications.

  18. Approximate analyses of inelastic effects in pipework

    International Nuclear Information System (INIS)

    Jobson, D.A.

    1983-01-01

    This presentation shows figures concerned with analyses of inelastic effects in pipework as follows: comparison of experimental and calculated simplified analyses results for free end rotation and for circumferential strain; interrupted stress relaxation; regenerated relaxation caused by reversed yield; buckling of straight pipe under combined bending and torsion; results of fatigues test of pipe bend

  19. An Original Stepwise Multilevel Logistic Regression Analysis of Discriminatory Accuracy

    DEFF Research Database (Denmark)

    Merlo, Juan; Wagner, Philippe; Ghith, Nermin

    2016-01-01

    BACKGROUND AND AIM: Many multilevel logistic regression analyses of "neighbourhood and health" focus on interpreting measures of associations (e.g., odds ratio, OR). In contrast, multilevel analysis of variance is rarely considered. We propose an original stepwise analytical approach that disting...

  20. Estimates of nitrate loads and yields from groundwater to streams in the Chesapeake Bay watershed based on land use and geology

    Science.gov (United States)

    Terziotti, Silvia; Capel, Paul D.; Tesoriero, Anthony J.; Hopple, Jessica A.; Kronholm, Scott C.

    2018-03-07

    The water quality of the Chesapeake Bay may be adversely affected by dissolved nitrate carried in groundwater discharge to streams. To estimate the concentrations, loads, and yields of nitrate from groundwater to streams for the Chesapeake Bay watershed, a regression model was developed based on measured nitrate concentrations from 156 small streams with watersheds less than 500 square miles (mi2 ) at baseflow. The regression model has three predictive variables: geologic unit, percent developed land, and percent agricultural land. Comparisons of estimated and actual values within geologic units were closely matched. The coefficient of determination (R2 ) for the model was 0.6906. The model was used to calculate baseflow nitrate concentrations at over 83,000 National Hydrography Dataset Plus Version 2 catchments and aggregated to 1,966 total 12-digit hydrologic units in the Chesapeake Bay watershed. The modeled output geospatial data layers provided estimated annual loads and yields of nitrate from groundwater into streams. The spatial distribution of annual nitrate yields from groundwater estimated by this method was compared to the total watershed yields of all sources estimated from a Chesapeake Bay SPAtially Referenced Regressions On Watershed attributes (SPARROW) water-quality model. The comparison showed similar spatial patterns. The regression model for groundwater contribution had similar but lower yields, suggesting that groundwater is an important source of nitrogen for streams in the Chesapeake Bay watershed.

  1. Particle debonding using different yield criteria

    DEFF Research Database (Denmark)

    Legarth, Brian Nyvang; Kuroda, Mitsutoshi

    2004-01-01

    Effects of plastic anisotropy in relation to debonding of rigid inclusions embedded in an elastic-viscoplastic metal are studied. Full finite strain analyses are carried out for plane cells assuming plane stress or plane strain. The overall stress strain response is calculated, when the cell......-matrix debonding. Keeping all material parameters fixed, the material response of the plane strain cell is considerably affected, due to debonding at a much reduced overall plastic strain compared to the corresponding plane stress cell. (C) 2004 Elsevier SAS. All rights reserved....... extent and shape of the particular yield function considered. The required overall straining of the cell for debonding initiation is related to the extent of the yield surfaces, since a high yield stress promotes debonding. Additionally, the maximum overall stress level for the cell is lower for the Hill...

  2. Ordinary least square regression, orthogonal regression, geometric mean regression and their applications in aerosol science

    International Nuclear Information System (INIS)

    Leng Ling; Zhang Tianyi; Kleinman, Lawrence; Zhu Wei

    2007-01-01

    Regression analysis, especially the ordinary least squares method which assumes that errors are confined to the dependent variable, has seen a fair share of its applications in aerosol science. The ordinary least squares approach, however, could be problematic due to the fact that atmospheric data often does not lend itself to calling one variable independent and the other dependent. Errors often exist for both measurements. In this work, we examine two regression approaches available to accommodate this situation. They are orthogonal regression and geometric mean regression. Comparisons are made theoretically as well as numerically through an aerosol study examining whether the ratio of organic aerosol to CO would change with age

  3. Random regression models to estimate genetic parameters for milk production of Guzerat cows using orthogonal Legendre polynomials

    Directory of Open Access Journals (Sweden)

    Maria Gabriela Campolina Diniz Peixoto

    2014-05-01

    Full Text Available The objective of this work was to compare random regression models for the estimation of genetic parameters for Guzerat milk production, using orthogonal Legendre polynomials. Records (20,524 of test-day milk yield (TDMY from 2,816 first-lactation Guzerat cows were used. TDMY grouped into 10-monthly classes were analyzed for additive genetic effect and for environmental and residual permanent effects (random effects, whereas the contemporary group, calving age (linear and quadratic effects and mean lactation curve were analized as fixed effects. Trajectories for the additive genetic and permanent environmental effects were modeled by means of a covariance function employing orthogonal Legendre polynomials ranging from the second to the fifth order. Residual variances were considered in one, four, six, or ten variance classes. The best model had six residual variance classes. The heritability estimates for the TDMY records varied from 0.19 to 0.32. The random regression model that used a second-order Legendre polynomial for the additive genetic effect, and a fifth-order polynomial for the permanent environmental effect is adequate for comparison by the main employed criteria. The model with a second-order Legendre polynomial for the additive genetic effect, and that with a fourth-order for the permanent environmental effect could also be employed in these analyses.

  4. Interpreting Multiple Linear Regression: A Guidebook of Variable Importance

    Science.gov (United States)

    Nathans, Laura L.; Oswald, Frederick L.; Nimon, Kim

    2012-01-01

    Multiple regression (MR) analyses are commonly employed in social science fields. It is also common for interpretation of results to typically reflect overreliance on beta weights, often resulting in very limited interpretations of variable importance. It appears that few researchers employ other methods to obtain a fuller understanding of what…

  5. Estimating the exceedance probability of rain rate by logistic regression

    Science.gov (United States)

    Chiu, Long S.; Kedem, Benjamin

    1990-01-01

    Recent studies have shown that the fraction of an area with rain intensity above a fixed threshold is highly correlated with the area-averaged rain rate. To estimate the fractional rainy area, a logistic regression model, which estimates the conditional probability that rain rate over an area exceeds a fixed threshold given the values of related covariates, is developed. The problem of dependency in the data in the estimation procedure is bypassed by the method of partial likelihood. Analyses of simulated scanning multichannel microwave radiometer and observed electrically scanning microwave radiometer data during the Global Atlantic Tropical Experiment period show that the use of logistic regression in pixel classification is superior to multiple regression in predicting whether rain rate at each pixel exceeds a given threshold, even in the presence of noisy data. The potential of the logistic regression technique in satellite rain rate estimation is discussed.

  6. Exploratory regression analysis: a tool for selecting models and determining predictor importance.

    Science.gov (United States)

    Braun, Michael T; Oswald, Frederick L

    2011-06-01

    Linear regression analysis is one of the most important tools in a researcher's toolbox for creating and testing predictive models. Although linear regression analysis indicates how strongly a set of predictor variables, taken together, will predict a relevant criterion (i.e., the multiple R), the analysis cannot indicate which predictors are the most important. Although there is no definitive or unambiguous method for establishing predictor variable importance, there are several accepted methods. This article reviews those methods for establishing predictor importance and provides a program (in Excel) for implementing them (available for direct download at http://dl.dropbox.com/u/2480715/ERA.xlsm?dl=1) . The program investigates all 2(p) - 1 submodels and produces several indices of predictor importance. This exploratory approach to linear regression, similar to other exploratory data analysis techniques, has the potential to yield both theoretical and practical benefits.

  7. Linear regression and the normality assumption.

    Science.gov (United States)

    Schmidt, Amand F; Finan, Chris

    2017-12-16

    Researchers often perform arbitrary outcome transformations to fulfill the normality assumption of a linear regression model. This commentary explains and illustrates that in large data settings, such transformations are often unnecessary, and worse may bias model estimates. Linear regression assumptions are illustrated using simulated data and an empirical example on the relation between time since type 2 diabetes diagnosis and glycated hemoglobin levels. Simulation results were evaluated on coverage; i.e., the number of times the 95% confidence interval included the true slope coefficient. Although outcome transformations bias point estimates, violations of the normality assumption in linear regression analyses do not. The normality assumption is necessary to unbiasedly estimate standard errors, and hence confidence intervals and P-values. However, in large sample sizes (e.g., where the number of observations per variable is >10) violations of this normality assumption often do not noticeably impact results. Contrary to this, assumptions on, the parametric model, absence of extreme observations, homoscedasticity, and independency of the errors, remain influential even in large sample size settings. Given that modern healthcare research typically includes thousands of subjects focusing on the normality assumption is often unnecessary, does not guarantee valid results, and worse may bias estimates due to the practice of outcome transformations. Copyright © 2017 Elsevier Inc. All rights reserved.

  8. Comparison between linear and non-parametric regression models for genome-enabled prediction in wheat.

    Science.gov (United States)

    Pérez-Rodríguez, Paulino; Gianola, Daniel; González-Camacho, Juan Manuel; Crossa, José; Manès, Yann; Dreisigacker, Susanne

    2012-12-01

    In genome-enabled prediction, parametric, semi-parametric, and non-parametric regression models have been used. This study assessed the predictive ability of linear and non-linear models using dense molecular markers. The linear models were linear on marker effects and included the Bayesian LASSO, Bayesian ridge regression, Bayes A, and Bayes B. The non-linear models (this refers to non-linearity on markers) were reproducing kernel Hilbert space (RKHS) regression, Bayesian regularized neural networks (BRNN), and radial basis function neural networks (RBFNN). These statistical models were compared using 306 elite wheat lines from CIMMYT genotyped with 1717 diversity array technology (DArT) markers and two traits, days to heading (DTH) and grain yield (GY), measured in each of 12 environments. It was found that the three non-linear models had better overall prediction accuracy than the linear regression specification. Results showed a consistent superiority of RKHS and RBFNN over the Bayesian LASSO, Bayesian ridge regression, Bayes A, and Bayes B models.

  9. Polynomial regression analysis and significance test of the regression function

    International Nuclear Information System (INIS)

    Gao Zhengming; Zhao Juan; He Shengping

    2012-01-01

    In order to analyze the decay heating power of a certain radioactive isotope per kilogram with polynomial regression method, the paper firstly demonstrated the broad usage of polynomial function and deduced its parameters with ordinary least squares estimate. Then significance test method of polynomial regression function is derived considering the similarity between the polynomial regression model and the multivariable linear regression model. Finally, polynomial regression analysis and significance test of the polynomial function are done to the decay heating power of the iso tope per kilogram in accord with the authors' real work. (authors)

  10. Spatial Regression and Prediction of Water Quality in a Watershed with Complex Pollution Sources.

    Science.gov (United States)

    Yang, Xiaoying; Liu, Qun; Luo, Xingzhang; Zheng, Zheng

    2017-08-16

    Fast economic development, burgeoning population growth, and rapid urbanization have led to complex pollution sources contributing to water quality deterioration simultaneously in many developing countries including China. This paper explored the use of spatial regression to evaluate the impacts of watershed characteristics on ambient total nitrogen (TN) concentration in a heavily polluted watershed and make predictions across the region. Regression results have confirmed the substantial impact on TN concentration by a variety of point and non-point pollution sources. In addition, spatial regression has yielded better performance than ordinary regression in predicting TN concentrations. Due to its best performance in cross-validation, the river distance based spatial regression model was used to predict TN concentrations across the watershed. The prediction results have revealed a distinct pattern in the spatial distribution of TN concentrations and identified three critical sub-regions in priority for reducing TN loads. Our study results have indicated that spatial regression could potentially serve as an effective tool to facilitate water pollution control in watersheds under diverse physical and socio-economical conditions.

  11. Climate Change Impact on Rainfall: How will Threaten Wheat Yield?

    Science.gov (United States)

    Tafoughalti, K.; El Faleh, E. M.; Moujahid, Y.; Ouargaga, F.

    2018-05-01

    Climate change has a significant impact on the environmental condition of the agricultural region. Meknes has an agrarian economy and wheat production is of paramount importance. As most arable area are under rainfed system, Meknes is one of the sensitive regions to rainfall variability and consequently to climate change. Therefore, the use of changes in rainfall is vital for detecting the influence of climate system on agricultural productivity. This article identifies rainfall temporal variability and its impact on wheat yields. We used monthly rainfall records for three decades and wheat yields records of fifteen years. Rainfall variability is assessed utilizing the precipitation concentration index and the variation coefficient. The association between wheat yields and cumulative rainfall amounts of different scales was calculated based on a regression model. The analysis shown moderate seasonal and irregular annual rainfall distribution. Yields fluctuated from 210 to 4500 Kg/ha with 52% of coefficient of variation. The correlation results shows that wheat yields are strongly correlated with rainfall of the period January to March. This investigation concluded that climate change is altering wheat yield and it is crucial to adept the necessary adaptation to challenge the risk.

  12. Derivation of ozone flux-yield relationships for lettuce: A key horticultural crop

    International Nuclear Information System (INIS)

    Goumenaki, Eleni; Fernandez, Ignacio Gonzalez; Papanikolaou, Antigoni; Papadopoulou, Despoina; Askianakis, Christos; Kouvarakis, George; Barnes, Jeremy

    2007-01-01

    Ozone flux-response relationships were derived for lettuce, employing a multiplicative approach to model the manner in which stomatal conductance is influenced by key environmental variables, using a dataset collected during field experimentation in Crete and yield-response relationships derived from parallel open-top chamber experiments. Regional agronomic practices were adopted throughout. Computed versus measured data revealed that the derived model explained 51% (P -2 s -1 . Regressions employing very low or zero flux thresholds resulted in the strongest yield-flux relationships (explaining ∼80% (P < 0.05) of the variation in the dataset). - Establishment of ozone flux-yield relationships for a commercially-important horticultural crop grown widely in the Mediterranean

  13. The best of both worlds: Phylogenetic eigenvector regression and mapping

    Directory of Open Access Journals (Sweden)

    José Alexandre Felizola Diniz Filho

    2015-09-01

    Full Text Available Eigenfunction analyses have been widely used to model patterns of autocorrelation in time, space and phylogeny. In a phylogenetic context, Diniz-Filho et al. (1998 proposed what they called Phylogenetic Eigenvector Regression (PVR, in which pairwise phylogenetic distances among species are submitted to a Principal Coordinate Analysis, and eigenvectors are then used as explanatory variables in regression, correlation or ANOVAs. More recently, a new approach called Phylogenetic Eigenvector Mapping (PEM was proposed, with the main advantage of explicitly incorporating a model-based warping in phylogenetic distance in which an Ornstein-Uhlenbeck (O-U process is fitted to data before eigenvector extraction. Here we compared PVR and PEM in respect to estimated phylogenetic signal, correlated evolution under alternative evolutionary models and phylogenetic imputation, using simulated data. Despite similarity between the two approaches, PEM has a slightly higher prediction ability and is more general than the original PVR. Even so, in a conceptual sense, PEM may provide a technique in the best of both worlds, combining the flexibility of data-driven and empirical eigenfunction analyses and the sounding insights provided by evolutionary models well known in comparative analyses.

  14. Reduced Rank Regression

    DEFF Research Database (Denmark)

    Johansen, Søren

    2008-01-01

    The reduced rank regression model is a multivariate regression model with a coefficient matrix with reduced rank. The reduced rank regression algorithm is an estimation procedure, which estimates the reduced rank regression model. It is related to canonical correlations and involves calculating...

  15. Genetic correlations between body condition score, yield and fertility in Holstein heifers estimated by random regression models

    NARCIS (Netherlands)

    Veerkamp, R.F.; Koenen, E.P.C.; Jong, de G.

    2001-01-01

    Twenty type classifiers scored body condition (BCS) of 91,738 first-parity cows from 601 sires and 5518 maternal grandsires. Fertility data during first lactation were extracted for 177,220 cows, of which 67,278 also had a BCS observation, and first-lactation 305-d milk, fat, and protein yields were

  16. Gene and transcript abundances of bacterial type III secretion systems from the rumen microbiome are correlated with methane yield in sheep.

    Science.gov (United States)

    Kamke, Janine; Soni, Priya; Li, Yang; Ganesh, Siva; Kelly, William J; Leahy, Sinead C; Shi, Weibing; Froula, Jeff; Rubin, Edward M; Attwood, Graeme T

    2017-08-08

    Ruminants are important contributors to global methane emissions via microbial fermentation in their reticulo-rumens. This study is part of a larger program, characterising the rumen microbiomes of sheep which vary naturally in methane yield (g CH 4 /kg DM/day) and aims to define differences in microbial communities, and in gene and transcript abundances that can explain the animal methane phenotype. Rumen microbiome metagenomic and metatranscriptomic data were analysed by Gene Set Enrichment, sparse partial least squares regression and the Wilcoxon Rank Sum test to estimate correlations between specific KEGG bacterial pathways/genes and high methane yield in sheep. KEGG genes enriched in high methane yield sheep were reassembled from raw reads and existing contigs and analysed by MEGAN to predict their phylogenetic origin. Protein coding sequences from Succinivibrio dextrinosolvens strains were analysed using Effective DB to predict bacterial type III secreted proteins. The effect of S. dextrinosolvens strain H5 growth on methane formation by rumen methanogens was explored using co-cultures. Detailed analysis of the rumen microbiomes of high methane yield sheep shows that gene and transcript abundances of bacterial type III secretion system genes are positively correlated with methane yield in sheep. Most of the bacterial type III secretion system genes could not be assigned to a particular bacterial group, but several genes were affiliated with the genus Succinivibrio, and searches of bacterial genome sequences found that strains of S. dextrinosolvens were part of a small group of rumen bacteria that encode this type of secretion system. In co-culture experiments, S. dextrinosolvens strain H5 showed a growth-enhancing effect on a methanogen belonging to the order Methanomassiliicoccales, and inhibition of a representative of the Methanobrevibacter gottschalkii clade. This is the first report of bacterial type III secretion system genes being associated with high

  17. Prediction of radiation levels in residences: A methodological comparison of CART [Classification and Regression Tree Analysis] and conventional regression

    International Nuclear Information System (INIS)

    Janssen, I.; Stebbings, J.H.

    1990-01-01

    In environmental epidemiology, trace and toxic substance concentrations frequently have very highly skewed distributions ranging over one or more orders of magnitude, and prediction by conventional regression is often poor. Classification and Regression Tree Analysis (CART) is an alternative in such contexts. To compare the techniques, two Pennsylvania data sets and three independent variables are used: house radon progeny (RnD) and gamma levels as predicted by construction characteristics in 1330 houses; and ∼200 house radon (Rn) measurements as predicted by topographic parameters. CART may identify structural variables of interest not identified by conventional regression, and vice versa, but in general the regression models are similar. CART has major advantages in dealing with other common characteristics of environmental data sets, such as missing values, continuous variables requiring transformations, and large sets of potential independent variables. CART is most useful in the identification and screening of independent variables, greatly reducing the need for cross-tabulations and nested breakdown analyses. There is no need to discard cases with missing values for the independent variables because surrogate variables are intrinsic to CART. The tree-structured approach is also independent of the scale on which the independent variables are measured, so that transformations are unnecessary. CART identifies important interactions as well as main effects. The major advantages of CART appear to be in exploring data. Once the important variables are identified, conventional regressions seem to lead to results similar but more interpretable by most audiences. 12 refs., 8 figs., 10 tabs

  18. Logistic Regression Analysis of Operational Errors and Routine Operations Using Sector Characteristics

    National Research Council Canada - National Science Library

    Pfleiderer, Elaine M; Scroggins, Cheryl L; Manning, Carol A

    2009-01-01

    Two separate logistic regression analyses were conducted for low- and high-altitude sectors to determine whether a set of dynamic sector characteristics variables could reliably discriminate between operational error (OE...

  19. Quantile Regression Methods

    DEFF Research Database (Denmark)

    Fitzenberger, Bernd; Wilke, Ralf Andreas

    2015-01-01

    if the mean regression model does not. We provide a short informal introduction into the principle of quantile regression which includes an illustrative application from empirical labor market research. This is followed by briefly sketching the underlying statistical model for linear quantile regression based......Quantile regression is emerging as a popular statistical approach, which complements the estimation of conditional mean models. While the latter only focuses on one aspect of the conditional distribution of the dependent variable, the mean, quantile regression provides more detailed insights...... by modeling conditional quantiles. Quantile regression can therefore detect whether the partial effect of a regressor on the conditional quantiles is the same for all quantiles or differs across quantiles. Quantile regression can provide evidence for a statistical relationship between two variables even...

  20. A Monte Carlo simulation study comparing linear regression, beta regression, variable-dispersion beta regression and fractional logit regression at recovering average difference measures in a two sample design.

    Science.gov (United States)

    Meaney, Christopher; Moineddin, Rahim

    2014-01-24

    In biomedical research, response variables are often encountered which have bounded support on the open unit interval--(0,1). Traditionally, researchers have attempted to estimate covariate effects on these types of response data using linear regression. Alternative modelling strategies may include: beta regression, variable-dispersion beta regression, and fractional logit regression models. This study employs a Monte Carlo simulation design to compare the statistical properties of the linear regression model to that of the more novel beta regression, variable-dispersion beta regression, and fractional logit regression models. In the Monte Carlo experiment we assume a simple two sample design. We assume observations are realizations of independent draws from their respective probability models. The randomly simulated draws from the various probability models are chosen to emulate average proportion/percentage/rate differences of pre-specified magnitudes. Following simulation of the experimental data we estimate average proportion/percentage/rate differences. We compare the estimators in terms of bias, variance, type-1 error and power. Estimates of Monte Carlo error associated with these quantities are provided. If response data are beta distributed with constant dispersion parameters across the two samples, then all models are unbiased and have reasonable type-1 error rates and power profiles. If the response data in the two samples have different dispersion parameters, then the simple beta regression model is biased. When the sample size is small (N0 = N1 = 25) linear regression has superior type-1 error rates compared to the other models. Small sample type-1 error rates can be improved in beta regression models using bias correction/reduction methods. In the power experiments, variable-dispersion beta regression and fractional logit regression models have slightly elevated power compared to linear regression models. Similar results were observed if the

  1. Estimation of 305 Day Milk Yield from Cumulative Monthly and Bimonthly Test Day Records in Indonesian Holstein Cattle

    Science.gov (United States)

    Rahayu, A. P.; Hartatik, T.; Purnomoadi, A.; Kurnianto, E.

    2018-02-01

    The aims of this study were to estimate 305 day first lactation milk yield of Indonesian Holstein cattle from cumulative monthly and bimonthly test day records and to analyze its accuracy.The first lactation records of 258 dairy cows from 2006 to 2014 consisted of 2571 monthly (MTDY) and 1281 bimonthly test day yield (BTDY) records were used. Milk yields were estimated by regression method. Correlation coefficients between actual and estimated milk yield by cumulative MTDY were 0.70, 0.78, 0.83, 0.86, 0.89, 0.92, 0.94 and 0.96 for 2-9 months, respectively, meanwhile by cumulative BTDY were 0.69, 0.81, 0.87 and 0.92 for 2, 4, 6 and 8 months, respectively. The accuracy of fitting regression models (R2) increased with the increasing in the number of cumulative test day used. The used of 5 cumulative MTDY was considered sufficient for estimating 305 day first lactation milk yield with 80.6% accuracy and 7% error percentage of estimation. The estimated milk yield from MTDY was more accurate than BTDY by 1.1 to 2% less error percentage in the same time.

  2. Linear Multivariable Regression Models for Prediction of Eddy Dissipation Rate from Available Meteorological Data

    Science.gov (United States)

    MCKissick, Burnell T. (Technical Monitor); Plassman, Gerald E.; Mall, Gerald H.; Quagliano, John R.

    2005-01-01

    Linear multivariable regression models for predicting day and night Eddy Dissipation Rate (EDR) from available meteorological data sources are defined and validated. Model definition is based on a combination of 1997-2000 Dallas/Fort Worth (DFW) data sources, EDR from Aircraft Vortex Spacing System (AVOSS) deployment data, and regression variables primarily from corresponding Automated Surface Observation System (ASOS) data. Model validation is accomplished through EDR predictions on a similar combination of 1994-1995 Memphis (MEM) AVOSS and ASOS data. Model forms include an intercept plus a single term of fixed optimal power for each of these regression variables; 30-minute forward averaged mean and variance of near-surface wind speed and temperature, variance of wind direction, and a discrete cloud cover metric. Distinct day and night models, regressing on EDR and the natural log of EDR respectively, yield best performance and avoid model discontinuity over day/night data boundaries.

  3. Analysis of meteorological variations on wheat yield and its estimation using remotely sensed data. A case study of selected districts of Punjab Province, Pakistan (2001-14

    Directory of Open Access Journals (Sweden)

    Rafia Mumtaz

    2017-10-01

    Full Text Available Land management for crop production is an essential human activity that supports life on Earth. The main challenge to be faced by the agriculture sector in coming years is to feed the rapidly growing population while maintaining the key resources such as soil fertility, efficient land use, and water. Climate change is also a critical factor that impacts agricultural production. Among others, a major effect of climate change is the potential alterations in the growth cycle of crops which would likely lead to a decline in the agricultural output. Due to the increasing demand for proper agricultural management, this study explores the effects of meteorological variation on wheat yield in Chakwal and Faisalabad districts of Punjab, Pakistan and used normalised difference vegetation index (NDVI as a predictor for yield estimates. For NDVI data (2001-14, the NDVI product of Moderate Resolution Imaging spectrometer (MODIS 16-day composites data has been used. The crop area mapping has been realised by classifying the satellite data into different land use/land covers using iterative self-organising (ISO data clustering. The land cover for the wheat crop was mapped using a crop calendar. The relation of crop yield with NDVI and the impact of meteorological parameters on wheat growth and its yield has been analysed at various development stages. A strong correlation of rainfall and temperature was found with NDVI data, which determined NDVI as a strong predictor of yield estimation. The wheat yield estimates were obtained by linearly regressing the reported crop yield against the time series of MODIS NDVI profiles. The wheat NDVI profiles have shown a parabolic pattern across the growing season, therefore parabolic least square fit (LSF has been applied prior to linear regression. The coefficients of determination (R2 between the reported and estimated yield was found to be 0.88 and 0.73, respectively, for Chakwal and Faisalabad. This indicates that the

  4. Regression Phalanxes

    OpenAIRE

    Zhang, Hongyang; Welch, William J.; Zamar, Ruben H.

    2017-01-01

    Tomal et al. (2015) introduced the notion of "phalanxes" in the context of rare-class detection in two-class classification problems. A phalanx is a subset of features that work well for classification tasks. In this paper, we propose a different class of phalanxes for application in regression settings. We define a "Regression Phalanx" - a subset of features that work well together for prediction. We propose a novel algorithm which automatically chooses Regression Phalanxes from high-dimensi...

  5. Analyses of polycyclic aromatic hydrocarbon (PAH) and chiral-PAH analogues-methyl-β-cyclodextrin guest-host inclusion complexes by fluorescence spectrophotometry and multivariate regression analysis.

    Science.gov (United States)

    Greene, LaVana; Elzey, Brianda; Franklin, Mariah; Fakayode, Sayo O

    2017-03-05

    The negative health impact of polycyclic aromatic hydrocarbons (PAHs) and differences in pharmacological activity of enantiomers of chiral molecules in humans highlights the need for analysis of PAHs and their chiral analogue molecules in humans. Herein, the first use of cyclodextrin guest-host inclusion complexation, fluorescence spectrophotometry, and chemometric approach to PAH (anthracene) and chiral-PAH analogue derivatives (1-(9-anthryl)-2,2,2-triflouroethanol (TFE)) analyses are reported. The binding constants (K b ), stoichiometry (n), and thermodynamic properties (Gibbs free energy (ΔG), enthalpy (ΔH), and entropy (ΔS)) of anthracene and enantiomers of TFE-methyl-β-cyclodextrin (Me-β-CD) guest-host complexes were also determined. Chemometric partial-least-square (PLS) regression analysis of emission spectra data of Me-β-CD-guest-host inclusion complexes was used for the determination of anthracene and TFE enantiomer concentrations in Me-β-CD-guest-host inclusion complex samples. The values of calculated K b and negative ΔG suggest the thermodynamic favorability of anthracene-Me-β-CD and enantiomeric of TFE-Me-β-CD inclusion complexation reactions. However, anthracene-Me-β-CD and enantiomer TFE-Me-β-CD inclusion complexations showed notable differences in the binding affinity behaviors and thermodynamic properties. The PLS regression analysis resulted in square-correlation-coefficients of 0.997530 or better and a low LOD of 3.81×10 -7 M for anthracene and 3.48×10 -8 M for TFE enantiomers at physiological conditions. Most importantly, PLS regression accurately determined the anthracene and TFE enantiomer concentrations with an average low error of 2.31% for anthracene, 4.44% for R-TFE and 3.60% for S-TFE. The results of the study are highly significant because of its high sensitivity and accuracy for analysis of PAH and chiral PAH analogue derivatives without the need of an expensive chiral column, enantiomeric resolution, or use of a polarized

  6. Genotypic Correlation and Path Analysis of Some Traits related to Oil Yield and Grain Yield in Canola (Brassica napus L. under Non-stress and Water Deficit Stress Conditions

    Directory of Open Access Journals (Sweden)

    A Ismaili

    2017-03-01

    between seed yield and different characters were subjected to path coefficient analysis separately for partitioning these values into direct and indirect effects. Step-wise regression technique was used to determine the best model, which accounted for variation exist in plant seed and oil yield as dependent variables in separate analysis. Direct and indirect effects of traits entered to regression model were determined by using path coefficient analysis. Results and Discussion Results of this study showed significant differences among all genotypes performances, and also stress condition caused a significant decrease in performance of all studied traits. The highest seed yield obtained from Geronimo and Dante (with 3668 and 3505 kg.ha-1, respectively under non stress condition, and the highest seed yield obtained from Zarfam and Dante (with 2948 and 2860 kg ha-1, respectively under drought stress condition. Genotype Licord produced the highest oil content, which was significantly higher than that produced by other genotypes in either regime. Genotypic and phenotypic correlation coefficients were estimated between all traits and using stepwise regression, best model was introduced for two conditions. Under Non-stress condition, the average of genetic correlations between grain yield and silique length was high and positive (0.92**, suggesting that the selection of prolific plants resulted in a gain of selection for yield. Under water deficit stress condition, a negative average of genetic correlations (-0.28 was observed for grain yield and days to maturity. Path analysis based on the genotypic correlation under non-stress conditions between grain yield and other traits showed that number of pods per plant and pod length had direct effects on grain yield, while under drought conditions, pod length and plant height had important direct effects. Results of path analysis for oil yield under non-stress and stress conditions showed that grain yield had the most direct effect on

  7. Wheat Yield Forecasting for Punjab Province from Vegetation Index Time Series and Historic Crop Statistics

    Directory of Open Access Journals (Sweden)

    Jan Dempewolf

    2014-10-01

    Full Text Available Policy makers, government planners and agricultural market participants in Pakistan require accurate and timely information about wheat yield and production. Punjab Province is by far the most important wheat producing region in the country. The manual collection of field data and data processing for crop forecasting by the provincial government requires significant amounts of time before official reports can be released. Several studies have shown that wheat yield can be effectively forecast using satellite remote sensing data. In this study, we developed a methodology for estimating wheat yield and area for Punjab Province from freely available Landsat and MODIS satellite imagery approximately six weeks before harvest. Wheat yield was derived by regressing reported yield values against time series of four different peak-season MODIS-derived vegetation indices. We also tested deriving wheat area from the same MODIS time series using a regression-tree approach. Among the four evaluated indices, WDRVI provided more consistent and accurate yield forecasts compared to NDVI, EVI2 and saturation-adjusted normalized difference vegetation index (SANDVI. The lowest RMSE values at the district level for forecast versus reported yield were found when using six or more years of training data. Forecast yield for the 2007/2008 to 2012/2013 growing seasons were within 0.2% and 11.5% of final reported values. Absolute deviations of wheat area and production forecasts from reported values were slightly greater compared to using the previous year's or the three- or six-year moving average values, implying that 250-m MODIS data does not provide sufficient spatial resolution for providing improved wheat area and production forecasts.

  8. Epidemiology and impact of Fasciola hepatica exposure in high-yielding dairy herds.

    Science.gov (United States)

    Howell, Alison; Baylis, Matthew; Smith, Rob; Pinchbeck, Gina; Williams, Diana

    2015-09-01

    The liver fluke Fasciola hepatica is a trematode parasite with a worldwide distribution and is the cause of important production losses in the dairy industry. The aim of this observational study was to assess the prevalence of exposure to F. hepatica in a group of high yielding dairy herds, to determine the risk factors and investigate their associations with production and fertility parameters. Bulk milk tank samples from 606 herds that supply a single retailer with liquid milk were tested with an antibody ELISA for F. hepatica. Multivariable linear regression was used to investigate the effect of farm management and environmental risk factors on F. hepatica exposure. Higher rainfall, grazing boggy pasture, presence of beef cattle on farm, access to a stream or pond and smaller herd size were associated with an increased risk of exposure. Univariable regression was used to look for associations between fluke exposure and production-related variables including milk yield, composition, somatic cell count and calving index. Although causation cannot be assumed, a significant (phepatica exposure and estimated milk yield at the herd level, representing a 15% decrease in yield for an increase in F. hepatica exposure from the 25th to the 75th percentile. This remained significant when fertility, farm management and environmental factors were controlled for. No associations were found between F. hepatica exposure and any of the other production, disease or fertility variables. Copyright © 2015 The Authors. Published by Elsevier B.V. All rights reserved.

  9. A method for fitting regression splines with varying polynomial order in the linear mixed model.

    Science.gov (United States)

    Edwards, Lloyd J; Stewart, Paul W; MacDougall, James E; Helms, Ronald W

    2006-02-15

    The linear mixed model has become a widely used tool for longitudinal analysis of continuous variables. The use of regression splines in these models offers the analyst additional flexibility in the formulation of descriptive analyses, exploratory analyses and hypothesis-driven confirmatory analyses. We propose a method for fitting piecewise polynomial regression splines with varying polynomial order in the fixed effects and/or random effects of the linear mixed model. The polynomial segments are explicitly constrained by side conditions for continuity and some smoothness at the points where they join. By using a reparameterization of this explicitly constrained linear mixed model, an implicitly constrained linear mixed model is constructed that simplifies implementation of fixed-knot regression splines. The proposed approach is relatively simple, handles splines in one variable or multiple variables, and can be easily programmed using existing commercial software such as SAS or S-plus. The method is illustrated using two examples: an analysis of longitudinal viral load data from a study of subjects with acute HIV-1 infection and an analysis of 24-hour ambulatory blood pressure profiles.

  10. Use of empirical likelihood to calibrate auxiliary information in partly linear monotone regression models.

    Science.gov (United States)

    Chen, Baojiang; Qin, Jing

    2014-05-10

    In statistical analysis, a regression model is needed if one is interested in finding the relationship between a response variable and covariates. When the response depends on the covariate, then it may also depend on the function of this covariate. If one has no knowledge of this functional form but expect for monotonic increasing or decreasing, then the isotonic regression model is preferable. Estimation of parameters for isotonic regression models is based on the pool-adjacent-violators algorithm (PAVA), where the monotonicity constraints are built in. With missing data, people often employ the augmented estimating method to improve estimation efficiency by incorporating auxiliary information through a working regression model. However, under the framework of the isotonic regression model, the PAVA does not work as the monotonicity constraints are violated. In this paper, we develop an empirical likelihood-based method for isotonic regression model to incorporate the auxiliary information. Because the monotonicity constraints still hold, the PAVA can be used for parameter estimation. Simulation studies demonstrate that the proposed method can yield more efficient estimates, and in some situations, the efficiency improvement is substantial. We apply this method to a dementia study. Copyright © 2013 John Wiley & Sons, Ltd.

  11. Genotype-environment interaction and phenotypic stability for girth growth and rubber yield of Hevea clones in São Paulo State, Brazil

    Directory of Open Access Journals (Sweden)

    Gonçalves Paulo de Souza

    2003-01-01

    Full Text Available The best-yielding, best vigour and most stable Hevea clones are identified by growing clones in different environments. However, research on the stability in Hevea brasiliensis (Willd. Adr. ex Juss. Muell.-Arg. is scarce. The objectives of this work were to assess genotype-environment interaction and determine stable genotypes. Stability analysis were performed on results for girth growth and rubber yield of seven clones from five comparative trials conducted over 10 years (girth growth and four years (rubber yield in São Paulo State, Brazil. Stability was estimated using the Eberhart and Russell (1966 method. Year by location and location variability were the dominant sources of interactions. The stability analysis identified GT 1 and IAN 873 as the most stable clones for girth growth and rubber yield respectively since their regression coefficients were almost the unity (b = 1 and they had one of the lowest deviations from regressions (S2di. Their coefficient of determination (R² was as high as 89.5% and 89.8% confirming their stability. In contrast, clones such as PB 235, PR 261, and RRIM 701 for girth growth and clones such as GT 1 for rubber yield with regression coefficients greater than one were regarded as sensitive to environment changes.

  12. Evaluating a satellite-based seasonal evapotranspiration product and identifying its relationship with other satellite-derived products and crop yield: A case study for Ethiopia

    Science.gov (United States)

    Tadesse, Tsegaye; Senay, Gabriel B.; Berhan, Getachew; Regassa, Teshome; Beyene, Shimelis

    2015-08-01

    Satellite-derived evapotranspiration anomalies and normalized difference vegetation index (NDVI) products from Moderate Resolution Imaging Spectroradiometer (MODIS) data are currently used for African agricultural drought monitoring and food security status assessment. In this study, a process to evaluate satellite-derived evapotranspiration (ETa) products with a geospatial statistical exploratory technique that uses NDVI, satellite-derived rainfall estimate (RFE), and crop yield data has been developed. The main goal of this study was to evaluate the ETa using the NDVI and RFE, and identify a relationship between the ETa and Ethiopia's cereal crop (i.e., teff, sorghum, corn/maize, barley, and wheat) yields during the main rainy season. Since crop production is one of the main factors affecting food security, the evaluation of remote sensing-based seasonal ETa was done to identify the appropriateness of this tool as a proxy for monitoring vegetation condition in drought vulnerable and food insecure areas to support decision makers. The results of this study showed that the comparison between seasonal ETa and RFE produced strong correlation (R2 > 0.99) for all 41 crop growing zones in Ethiopia. The results of the spatial regression analyses of seasonal ETa and NDVI using Ordinary Least Squares and Geographically Weighted Regression showed relatively weak yearly spatial relationships (R2 products have a good predictive potential for these 31 identified zones in Ethiopia. Decision makers may potentially use ETa products for monitoring cereal crop yields and early warning of food insecurity during drought years for these identified zones.

  13. Nitrogen utilization and biomass yield in trickle bed air biofilters.

    Science.gov (United States)

    Kim, Daekeun; Sorial, George A

    2010-10-15

    Nitrogen utilization and subsequent biomass yield were investigated in four independent lab-scale trickle bed air biofilters (TBABs) fed with different VOCs substrate. The VOCs considered were two aromatic (toluene, styrene) and two oxygenated (methyl ethyl ketone (MEK), methyl isobutyl ketone (MIBK)). Long-term observations of TBABs performances show that more nitrogen was required to sustain high VOC removal, but the one fed with a high loading of VOC utilized much more nitrogen for sustaining biomass yield. The ratio N(consumption)/N(growth) was an effective indicator in evaluating nitrogen utilization in the system. Substrate VOC availability in the system was significant in determining nitrogen utilization and biomass yield. VOC substrate availability in the TBAB system was effectively identified by using maximum practical concentrations in the biofilm. Biomass yield coefficient, which was driven from the regression analysis between CO(2) production rate and substrate consumption rate, was effective in evaluating the TBAB performance with respect to nitrogen utilization and VOC removal. Biomass yield coefficients (g biomass/g substrate, dry weight basis) were observed to be 0.668, 0.642, 0.737, and 0.939 for toluene, styrene, MEK, and MIBK, respectively. 2010 Elsevier B.V. All rights reserved.

  14. Determining Balıkesir’s Energy Potential Using a Regression Analysis Computer Program

    Directory of Open Access Journals (Sweden)

    Bedri Yüksel

    2014-01-01

    Full Text Available Solar power and wind energy are used concurrently during specific periods, while at other times only the more efficient is used, and hybrid systems make this possible. When establishing a hybrid system, the extent to which these two energy sources support each other needs to be taken into account. This paper is a study of the effects of wind speed, insolation levels, and the meteorological parameters of temperature and humidity on the energy potential in Balıkesir, in the Marmara region of Turkey. The relationship between the parameters was studied using a multiple linear regression method. Using a designed-for-purpose computer program, two different regression equations were derived, with wind speed being the dependent variable in the first and insolation levels in the second. The regression equations yielded accurate results. The computer program allowed for the rapid calculation of different acceptance rates. The results of the statistical analysis proved the reliability of the equations. An estimate of identified meteorological parameters and unknown parameters could be produced with a specified precision by using the regression analysis method. The regression equations also worked for the evaluation of energy potential.

  15. The study of logistic regression of risk factor on the death cause of uranium miners

    International Nuclear Information System (INIS)

    Wen Jinai; Yuan Liyun; Jiang Ruyi

    1999-01-01

    Logistic regression model has widely been used in the field of medicine. The computer software on this model is popular, but it is worth to discuss how to use this model correctly. Using SPSS (Statistical Package for the Social Science) software, unconditional logistic regression method was adopted to carry out multi-factor analyses on the cause of total death, cancer death and lung cancer death of uranium miners. The data is from radioepidemiological database of one uranium mine. The result show that attained age is a risk factor in the logistic regression analyses of total death, cancer death and lung cancer death. In the logistic regression analysis of cancer death, there is a negative correlation between the age of exposure and cancer death. This shows that the younger the age at exposure, the bigger the risk of cancer death. In the logistic regression analysis of lung cancer death, there is a positive correlation between the cumulated exposure and lung cancer death, this show that cumulated exposure is a most important risk factor of lung cancer death on uranium miners. It has been documented by many foreign reports that the lung cancer death rate is higher in uranium miners

  16. Boosted beta regression.

    Directory of Open Access Journals (Sweden)

    Matthias Schmid

    Full Text Available Regression analysis with a bounded outcome is a common problem in applied statistics. Typical examples include regression models for percentage outcomes and the analysis of ratings that are measured on a bounded scale. In this paper, we consider beta regression, which is a generalization of logit models to situations where the response is continuous on the interval (0,1. Consequently, beta regression is a convenient tool for analyzing percentage responses. The classical approach to fit a beta regression model is to use maximum likelihood estimation with subsequent AIC-based variable selection. As an alternative to this established - yet unstable - approach, we propose a new estimation technique called boosted beta regression. With boosted beta regression estimation and variable selection can be carried out simultaneously in a highly efficient way. Additionally, both the mean and the variance of a percentage response can be modeled using flexible nonlinear covariate effects. As a consequence, the new method accounts for common problems such as overdispersion and non-binomial variance structures.

  17. Similar estimates of temperature impacts on global wheat yield by three independent methods

    NARCIS (Netherlands)

    Liu, Bing; Asseng, Senthold; Müller, Christoph; Ewert, Frank; Elliott, Joshua; Lobell, David B.; Martre, Pierre; Ruane, Alex C.; Wallach, Daniel; Jones, James W.; Supit, Iwan; Wolf, Joost

    2016-01-01

    The potential impact of global temperature change on global crop yield has recently been assessed with different methods. Here we show that grid-based and point-based simulations and statistical regressions (from historic records), without deliberate adaptation or CO 2 fertilization effects,

  18. The Forecasting Power of the Yield Curve, a Supervised Factor Model Approach

    DEFF Research Database (Denmark)

    Boldrini, Lorenzo; Hillebrand, Eric Tobias

    loadings have the Nelson and Siegel (1987) structure and we consider one forecast target at a time. We compare the forecasting performance of our specification to benchmark models such as principal components regression, partial least squares, and ARMA(p,q) processes. We use the yield curve data from G...

  19. Regression to Causality : Regression-style presentation influences causal attribution

    DEFF Research Database (Denmark)

    Bordacconi, Mats Joe; Larsen, Martin Vinæs

    2014-01-01

    of equivalent results presented as either regression models or as a test of two sample means. Our experiment shows that the subjects who were presented with results as estimates from a regression model were more inclined to interpret these results causally. Our experiment implies that scholars using regression...... models – one of the primary vehicles for analyzing statistical results in political science – encourage causal interpretation. Specifically, we demonstrate that presenting observational results in a regression model, rather than as a simple comparison of means, makes causal interpretation of the results...... more likely. Our experiment drew on a sample of 235 university students from three different social science degree programs (political science, sociology and economics), all of whom had received substantial training in statistics. The subjects were asked to compare and evaluate the validity...

  20. Evaluation of Logistic Regression and Multivariate Adaptive Regression Spline Models for Groundwater Potential Mapping Using R and GIS

    Directory of Open Access Journals (Sweden)

    Soyoung Park

    2017-07-01

    Full Text Available This study mapped and analyzed groundwater potential using two different models, logistic regression (LR and multivariate adaptive regression splines (MARS, and compared the results. A spatial database was constructed for groundwater well data and groundwater influence factors. Groundwater well data with a high potential yield of ≥70 m3/d were extracted, and 859 locations (70% were used for model training, whereas the other 365 locations (30% were used for model validation. We analyzed 16 groundwater influence factors including altitude, slope degree, slope aspect, plan curvature, profile curvature, topographic wetness index, stream power index, sediment transport index, distance from drainage, drainage density, lithology, distance from fault, fault density, distance from lineament, lineament density, and land cover. Groundwater potential maps (GPMs were constructed using LR and MARS models and tested using a receiver operating characteristics curve. Based on this analysis, the area under the curve (AUC for the success rate curve of GPMs created using the MARS and LR models was 0.867 and 0.838, and the AUC for the prediction rate curve was 0.836 and 0.801, respectively. This implies that the MARS model is useful and effective for groundwater potential analysis in the study area.

  1. Correcting for multivariate measurement error by regression calibration in meta-analyses of epidemiological studies

    DEFF Research Database (Denmark)

    Tybjærg-Hansen, Anne

    2009-01-01

    Within-person variability in measured values of multiple risk factors can bias their associations with disease. The multivariate regression calibration (RC) approach can correct for such measurement error and has been applied to studies in which true values or independent repeat measurements...... of the risk factors are observed on a subsample. We extend the multivariate RC techniques to a meta-analysis framework where multiple studies provide independent repeat measurements and information on disease outcome. We consider the cases where some or all studies have repeat measurements, and compare study......-specific, averaged and empirical Bayes estimates of RC parameters. Additionally, we allow for binary covariates (e.g. smoking status) and for uncertainty and time trends in the measurement error corrections. Our methods are illustrated using a subset of individual participant data from prospective long-term studies...

  2. Trends in nicotine yield in smoke and its relationship with design characteristics among popular US cigarette brands, 1997–2005

    Science.gov (United States)

    Connolly, Gregory N; Alpert, Hillel R; Wayne, Geoffrey Ferris; Koh, Howard

    2007-01-01

    Objectives To determine whether nicotine yields in the smoke of cigarettes would show an overall increase over time or an increasing trend limited to any particular market category (eg, full flavour vs light, medium (mild) or ultralight; mentholated vs non‐mentholated), manufacturer, or brand family or brand style, and whether nicotine yields in smoke would be associated with measurable trends in cigarette design. Methods Machine‐based measures of nicotine yield in smoke and measures of cigarette design features related to nicotine delivery (ventilation, nicotine content in the tobacco rod and number of puffs), as well as market category descriptors, were obtained from annual reports filed with the Massachusetts Department of Public by tobacco manufacturers for 1997–2005. Trends in nicotine yield and its relationship with design features and market parameters were analysed with multilevel mixed‐effects regression modelling procedures. Results A statistically significant trend was confirmed in increased nicotine yield, of 0.019 (1.1%) mg/cig/year over the period 1997–2005 and 0.029 (1.6%) mg/cig/year over the period 1998–2005. The increasing trend was observed in all major market categories (mentholated vs non‐mentholated, and full flavour vs light, medium (mild) or ultralight). Nicotine yield in smoke was positively associated with nicotine concentration in the tobacco and number of puffs per cigarette, both of which showed increasing trends during the study period. Conclusions This study confirms increased machine‐measured levels of nicotine, the addictive agent in cigarettes, in smoke, to be a result of increased nicotine in the tobacco rod and other design modifications. PMID:17897974

  3. Use of vegetation health data for estimation of aus rice yield in bangladesh.

    Science.gov (United States)

    Rahman, Atiqur; Roytman, Leonid; Krakauer, Nir Y; Nizamuddin, Mohammad; Goldberg, Mitch

    2009-01-01

    Rice is a vital staple crop for Bangladesh and surrounding countries, with interannual variation in yields depending on climatic conditions. We compared Bangladesh yield of aus rice, one of the main varieties grown, from official agricultural statistics with Vegetation Health (VH) Indices [Vegetation Condition Index (VCI), Temperature Condition Index (TCI) and Vegetation Health Index (VHI)] computed from Advanced Very High Resolution Radiometer (AVHRR) data covering a period of 15 years (1991-2005). A strong correlation was found between aus rice yield and VCI and VHI during the critical period of aus rice development that occurs during March-April (weeks 8-13 of the year), several months in advance of the rice harvest. Stepwise principal component regression (PCR) was used to construct a model to predict yield as a function of critical-period VHI. The model reduced the yield prediction error variance by 62% compared with a prediction of average yield for each year. Remote sensing is a valuable tool for estimating rice yields well in advance of harvest and at a low cost.

  4. Use of Vegetation Health Data for Estimation of Aus Rice Yield in Bangladesh

    Directory of Open Access Journals (Sweden)

    Mohammad Nizamuddin

    2009-04-01

    Full Text Available Rice is a vital staple crop for Bangladesh and surrounding countries, with interannual variation in yields depending on climatic conditions. We compared Bangladesh yield of aus rice, one of the main varieties grown, from official agricultural statistics with Vegetation Health (VH Indices [Vegetation Condition Index (VCI, Temperature Condition Index (TCI and Vegetation Health Index (VHI] computed from Advanced Very High Resolution Radiometer (AVHRR data covering a period of 15 years (1991–2005. A strong correlation was found between aus rice yield and VCI and VHI during the critical period of aus rice development that occurs during March-April (weeks 8–13 of the year, several months in advance of the rice harvest. Stepwise principal component regression (PCR was used to construct a model to predict yield as a function of critical-period VHI. The model reduced the yield prediction error variance by 62% compared with a prediction of average yield for each year. Remote sensing is a valuable tool for estimating rice yields well in advance of harvest and at a low cost.

  5. PATH COEFFICIENT ANALYSIS OF SEVERAL COMPONENTS OIL YIELD IN SUNFLOWER (HELIANTHUS ANNUUS L.

    Directory of Open Access Journals (Sweden)

    A. MIjić

    2006-06-01

    Full Text Available The objective of investigation was to analyse oil yield components and their relations by simple coefficient correlations as well as direct and indirect effects to oil yield by path analysis. Twenty-four sunflower hybrids were included in the investigation and their seven traits (plant height, head diameter, 1000 seed weight, hec- tolitar mass, grain yield, oil content and oil yield. Very strong positive correlation was estimated between grain yield and oil yield, strong positive correlation between hectolitar mass and oil yield, and middle corre- lation among oil yield and: 1000 seed weight, plaint height and oil content. There was no correlation between grain yields and oil content. Grain yield showed the strongest effect to oil yield. Oil content had lower effect to oil yield. Other traits showed no significant effect to oil yield, and their effect to oil yield was covered by indirect effect of grain yield.

  6. Information fusion via constrained principal component regression for robust quantification with incomplete calibrations

    International Nuclear Information System (INIS)

    Vogt, Frank

    2013-01-01

    Graphical abstract: Analysis Task: Determine the albumin (= protein) concentration in microalgae cells as a function of the cells’ nutrient availability. Left Panel: The predicted albumin concentrations as obtained by conventional principal component regression features low reproducibility and are partially higher than the concentrations of algae in which albumin is contained. Right Panel: Augmenting an incomplete PCR calibration with additional expert information derives reasonable albumin concentrations which now reveal a significant dependency on the algae's nutrient situation. -- Highlights: •Make quantitative analyses of compounds embedded in largely unknown chemical matrices robust. •Improved concentration prediction with originally insufficient calibration models. •Chemometric approach for incorporating expertise from other fields and/or researchers. •Ensure chemical, biological, or medicinal meaningfulness of quantitative analyses. -- Abstract: Incomplete calibrations are encountered in many applications and hamper chemometric data analyses. Such situations arise when target analytes are embedded in a chemically complex matrix from which calibration concentrations cannot be determined with reasonable efforts. In other cases, the samples’ chemical composition may fluctuate in an unpredictable way and thus cannot be comprehensively covered by calibration samples. The reason for calibration model to fail is the regression principle itself which seeks to explain measured data optimally in terms of the (potentially incomplete) calibration model but does not consider chemical meaningfulness. This study presents a novel chemometric approach which is based on experimentally feasible calibrations, i.e. concentration series of the target analytes outside the chemical matrix (‘ex situ calibration’). The inherent lack-of-information is then compensated by incorporating additional knowledge in form of regression constraints. Any outside knowledge can be

  7. The use of near-infrared scanning for the prediction of pulp yield and ...

    African Journals Online (AJOL)

    Calibration models to predict pulp yield, cellulose and lignin content were developed by applying chemometrics and partial least squares regression. Validation and determination of prediction accuracy of the models were performed using independent data. The prediction of cellulose and lignin were acceptable with ...

  8. Paradox of spontaneous cancer regression: implications for fluctuational radiothermy and radiotherapy

    International Nuclear Information System (INIS)

    Roy, Prasun K.; Dutta Majumder, D.; Biswas, Jaydip

    1999-01-01

    Spontaneous regression of malignant tumours without treatment is a most enigmatic phenomenon with immense therapeutic potentialities. We analyse such cases to find that the commonest cause is a preceding episode of high fever-induced thermal fluctuation which produce fluctuation of biochemical and immunological parameters. Using Prigogine-Glansdorff thermodynamic stability formalism and biocybernetic principles, we develop the theoretical foundation of tumour regression induced by thermal, radiational or oxygenational fluctuations. For regression, a preliminary threshold condition of fluctuations is derived, namely σ > 2.83. We present some striking confirmation of such fluctuation-induced regression of various therapy-resistant masses as Ewing tumour, neurogranuloma and Lewis lung carcinoma by utilising σ > 2.83. Our biothermodynamic stability model of malignancy appears to illuminate the marked increase of aggressiveness of mammalian malignancy which occurred around 250 million years ago when homeothermic warm-blooded pre-mammals evolved. Using experimental data, we propose a novel approach of multi-modal hyper-fluctuation therapy involving modulation of radiotherapeutic hyper-fractionation, temperature, radiothermy and immune-status. (author)

  9. Effect of wheat gluten proteins on bioethanol yield from grain

    Energy Technology Data Exchange (ETDEWEB)

    Buresova, Iva [Agrotest Fyto, Ltd., Havlickova 2787/121, 767 01 Kromeriz (Czech Republic); Hrivna, Ludek [Mendel University in Brno, Zemedelska 1, 613 00 Brno (Czech Republic)

    2011-04-15

    Bioethanol can be used as motor fuel and/or as a gasoline enhancer. A high yield feedstock for bioethanol production is cereal grain. Cereal grains containing less gluten proteins (glutenin and gliadin), but high starch, are favoured by distillers because they increase the bioethanol conversion. The direct effect of wheat gluten proteins on bioethanol yield was studied on triticale grain. Examined triticale Presto 1R.1D{sub 5+10}-2 and Presto Valdy were developed by introducing selected segments of wheat chromosome 1D into triticale chromosome 1R. Even if the samples analysed in this study do not afford to make definitive assumptions, it can be noticed that in analysed cases the presence of gliadin had more significant effect on investigated parameters than the presence of glutenin. Despite the presence of glutenin subunits did not significantly decrease the investigated parameters - specific weight, Hagberg falling number and starch content in grain met the requirements for grain for bioethanol production - protein content was higher than is optimal. The fermentation experiments demonstrated good bioethanol yields but depression in grain yields caused by the presence of wheat gliadin and glutenin decreased the energy balance of Presto Valdy and Presto 1R.1D{sub 5+10}-2. (author)

  10. A Comparison of Regression Techniques for Estimation of Above-Ground Winter Wheat Biomass Using Near-Surface Spectroscopy

    Directory of Open Access Journals (Sweden)

    Jibo Yue

    2018-01-01

    Full Text Available Above-ground biomass (AGB provides a vital link between solar energy consumption and yield, so its correct estimation is crucial to accurately monitor crop growth and predict yield. In this work, we estimate AGB by using 54 vegetation indexes (e.g., Normalized Difference Vegetation Index, Soil-Adjusted Vegetation Index and eight statistical regression techniques: artificial neural network (ANN, multivariable linear regression (MLR, decision-tree regression (DT, boosted binary regression tree (BBRT, partial least squares regression (PLSR, random forest regression (RF, support vector machine regression (SVM, and principal component regression (PCR, which are used to analyze hyperspectral data acquired by using a field spectrophotometer. The vegetation indexes (VIs determined from the spectra were first used to train regression techniques for modeling and validation to select the best VI input, and then summed with white Gaussian noise to study how remote sensing errors affect the regression techniques. Next, the VIs were divided into groups of different sizes by using various sampling methods for modeling and validation to test the stability of the techniques. Finally, the AGB was estimated by using a leave-one-out cross validation with these powerful techniques. The results of the study demonstrate that, of the eight techniques investigated, PLSR and MLR perform best in terms of stability and are most suitable when high-accuracy and stable estimates are required from relatively few samples. In addition, RF is extremely robust against noise and is best suited to deal with repeated observations involving remote-sensing data (i.e., data affected by atmosphere, clouds, observation times, and/or sensor noise. Finally, the leave-one-out cross-validation method indicates that PLSR provides the highest accuracy (R2 = 0.89, RMSE = 1.20 t/ha, MAE = 0.90 t/ha, NRMSE = 0.07, CV (RMSE = 0.18; thus, PLSR is best suited for works requiring high

  11. Wheat yield vulnerability: relation to rainfall and suggestions for adaptation

    Directory of Open Access Journals (Sweden)

    Khalid Tafoughalti

    2018-04-01

    Full Text Available Wheat production is of paramount importance in the region of Meknes, which is mainly produced under rainfed conditions. It is the dominant cereal, the greater proportion being the soft type. During the past few decades, rainfall flaws have caused a number of cases of droughts. These flaws have seriously affecting wheat production. The main objective of this study is the assessment of rainfall variability at monthly, seasonal and annual scales and to determine their impact on wheat yields. To reduce this impact we suggested some mechanisms of adaptation. We used monthly rainfall records for three decades and wheat yields records of fifteen years. Rainfall variability is assessed utilizing the precipitation concentration index and the variation coefficient. The association between wheat yields and cumulative rainfall amounts of different scales was calculated based on a regression model to evaluate the impact of rainfall on wheat yields. Data analysis shown moderate seasonal and irregular annual rainfall distribution. Yields fluctuated from 210 to 4500 Kg/ha with 52% of coefficient of variation. The correlation results shows that soft wheat and hard wheat are strongly correlated with the period of January to March than with the whole growing-season. While they are adversely correlated with the mid-spring. This investigation concluded that synchronizing appropriate adaptation with the period of January to March was crucial to achieving success yield of wheat.

  12. Yield models for Eucalyptus globulus fuelwood plantations in Ethiopia

    Energy Technology Data Exchange (ETDEWEB)

    Pukkala, T.; Pohjonen, V. (Joensuu Univ. (FI). Faculty of Forestry)

    1990-01-01

    Based on 53 tree analyses and 105 sample plots of Eucalyptus globulus, models for volume and biomass at single tree and stand levels were developed. The possible growing sites were divided into four site classes. In seedling stands, the site class I corresponds to yield class 44 m{sup 3} ha{sup -1} year{sup -1}, in coppice stands to yield class 46 m{sup 3} ha{sup -1} year{sup -1}. The site class IV corresponds in seedling stand to yield class 9 m{sup 3} ha{sup -1} year{sup -1}, in coppice stands to yield class 13 m{sup 3} ha{sup -1} year{sup -1}. The maximum mean annual increment was reached in seedling stands at the age of 18-19 years, in coppice stands at the age of 14 years. (author).

  13. Physics constrained nonlinear regression models for time series

    International Nuclear Information System (INIS)

    Majda, Andrew J; Harlim, John

    2013-01-01

    A central issue in contemporary science is the development of data driven statistical nonlinear dynamical models for time series of partial observations of nature or a complex physical model. It has been established recently that ad hoc quadratic multi-level regression (MLR) models can have finite-time blow up of statistical solutions and/or pathological behaviour of their invariant measure. Here a new class of physics constrained multi-level quadratic regression models are introduced, analysed and applied to build reduced stochastic models from data of nonlinear systems. These models have the advantages of incorporating memory effects in time as well as the nonlinear noise from energy conserving nonlinear interactions. The mathematical guidelines for the performance and behaviour of these physics constrained MLR models as well as filtering algorithms for their implementation are developed here. Data driven applications of these new multi-level nonlinear regression models are developed for test models involving a nonlinear oscillator with memory effects and the difficult test case of the truncated Burgers–Hopf model. These new physics constrained quadratic MLR models are proposed here as process models for Bayesian estimation through Markov chain Monte Carlo algorithms of low frequency behaviour in complex physical data. (paper)

  14. Temporal Synchronization Analysis for Improving Regression Modeling of Fecal Indicator Bacteria Levels

    Science.gov (United States)

    Multiple linear regression models are often used to predict levels of fecal indicator bacteria (FIB) in recreational swimming waters based on independent variables (IVs) such as meteorologic, hydrodynamic, and water-quality measures. The IVs used for these analyses are traditiona...

  15. Regressão múltipla stepwise e hierárquica em Psicologia Organizacional: aplicações, problemas e soluções Stepwise and hierarchical multiple regression in organizational psychology: Applications, problemas and solutions

    Directory of Open Access Journals (Sweden)

    Gardênia Abbad

    2002-01-01

    Full Text Available Este artigo discute algumas aplicações das técnicas de análise de regressão múltipla stepwise e hierárquica, as quais são muito utilizadas em pesquisas da área de Psicologia Organizacional. São discutidas algumas estratégias de identificação e de solução de problemas relativos à ocorrência de erros do Tipo I e II e aos fenômenos de supressão, complementaridade e redundância nas equações de regressão múltipla. São apresentados alguns exemplos de pesquisas nas quais esses padrões de associação entre variáveis estiveram presentes e descritas as estratégias utilizadas pelos pesquisadores para interpretá-los. São discutidas as aplicações dessas análises no estudo de interação entre variáveis e na realização de testes para avaliação da linearidade do relacionamento entre variáveis. Finalmente, são apresentadas sugestões para lidar com as limitações das análises de regressão múltipla (stepwise e hierárquica.This article discusses applications of stepwise and hierarchical multiple regression analyses to research in organizational psychology. Strategies for identifying type I and II errors, and solutions to potential problems that may arise from such errors are proposed. In addition, phenomena such as suppression, complementarity, and redundancy are reviewed. The article presents examples of research where these phenomena occurred, and the manner in which they were explained by researchers. Some applications of multiple regression analyses to studies involving between-variable interactions are presented, along with tests used to analyze the presence of linearity among variables. Finally, some suggestions are provided for dealing with limitations implicit in multiple regression analyses (stepwise and hierarchical.

  16. Measure of the e+e-{yields}bb Cross Section at the LEP Energies; Medida de la seccion eficaz e''+e''-{yields}bb a las Energias de LEP

    Energy Technology Data Exchange (ETDEWEB)

    Arce Dubois, P.

    1992-07-01

    In the present work I analyse the data collected during 1990 by the L3 detector, situated in the electron-positron collider LEP. After selecting the events e''+e''-{yields} bb through their semileptonic decays into muons, I calculate the cross section for the process e''+e''- {yields} bb at different energy points around the mass of the vectorial boson Z, and I measure some parameters of the Standard Model, namely, the Br(b{yields}{mu} ),{gamma}{sub z}n-{yields}bb/{gamma}{sub z}n{yields}had and {gamma}{sub z}n{yields}bb{gamma}{sub z}n{yields}e''+e''-. (Author) 26 refs.

  17. Relationship between compatibilizer and yield strength of PLA/PP Blend

    Science.gov (United States)

    Jariyakulsith, Pattanun; Puajindanetr, Somchai

    2018-01-01

    The aim of this research is to study the relationship between compatibilizer and yield strength of polylactic acid (PLA) and polypropylene (PP) blend. The PLA is blended with PP (PLA/PP) at the ratios of 70/30, 50/50 and 30/70. In addition, (1) polypropylene grafted maleic anhydride (PP-g-MAH) as a compatibilizer at 0.3 and 0.7 part per hundred of PLA/PP resin (phr) and (2) dicumyl peroxide (DCP) being an initiator at 0.03 and 0.07 phr are added in each composition. Yield strength is characterized to study the interaction between compatibilizer, initiator and yield strength by using experimental design of multilevel full factorial. The results show that (1) the yield strength of PLA/PP blend are increased after addition of compatibilizer. Because the adding of PP-g-MAH and DCP resulted in improving compatibility between PLA and PP. (2) there are interaction between PP-g-MAH and DCP that have affected the final properties of PLA/PP blend. The highest yield strength of 27.68 MPa is provided at the ratio of 70/30 blend by using the 0.3 phr of PP-g-MAH and 0.03 phr of DCP. Linear regression model is fitted and follow the assumptions of normal distribution.

  18. Use of probabilistic weights to enhance linear regression myoelectric control

    Science.gov (United States)

    Smith, Lauren H.; Kuiken, Todd A.; Hargrove, Levi J.

    2015-12-01

    Objective. Clinically available prostheses for transradial amputees do not allow simultaneous myoelectric control of degrees of freedom (DOFs). Linear regression methods can provide simultaneous myoelectric control, but frequently also result in difficulty with isolating individual DOFs when desired. This study evaluated the potential of using probabilistic estimates of categories of gross prosthesis movement, which are commonly used in classification-based myoelectric control, to enhance linear regression myoelectric control. Approach. Gaussian models were fit to electromyogram (EMG) feature distributions for three movement classes at each DOF (no movement, or movement in either direction) and used to weight the output of linear regression models by the probability that the user intended the movement. Eight able-bodied and two transradial amputee subjects worked in a virtual Fitts’ law task to evaluate differences in controllability between linear regression and probability-weighted regression for an intramuscular EMG-based three-DOF wrist and hand system. Main results. Real-time and offline analyses in able-bodied subjects demonstrated that probability weighting improved performance during single-DOF tasks (p < 0.05) by preventing extraneous movement at additional DOFs. Similar results were seen in experiments with two transradial amputees. Though goodness-of-fit evaluations suggested that the EMG feature distributions showed some deviations from the Gaussian, equal-covariance assumptions used in this experiment, the assumptions were sufficiently met to provide improved performance compared to linear regression control. Significance. Use of probability weights can improve the ability to isolate individual during linear regression myoelectric control, while maintaining the ability to simultaneously control multiple DOFs.

  19. Evaluation of Linear Regression Simultaneous Myoelectric Control Using Intramuscular EMG.

    Science.gov (United States)

    Smith, Lauren H; Kuiken, Todd A; Hargrove, Levi J

    2016-04-01

    The objective of this study was to evaluate the ability of linear regression models to decode patterns of muscle coactivation from intramuscular electromyogram (EMG) and provide simultaneous myoelectric control of a virtual 3-DOF wrist/hand system. Performance was compared to the simultaneous control of conventional myoelectric prosthesis methods using intramuscular EMG (parallel dual-site control)-an approach that requires users to independently modulate individual muscles in the residual limb, which can be challenging for amputees. Linear regression control was evaluated in eight able-bodied subjects during a virtual Fitts' law task and was compared to performance of eight subjects using parallel dual-site control. An offline analysis also evaluated how different types of training data affected prediction accuracy of linear regression control. The two control systems demonstrated similar overall performance; however, the linear regression method demonstrated improved performance for targets requiring use of all three DOFs, whereas parallel dual-site control demonstrated improved performance for targets that required use of only one DOF. Subjects using linear regression control could more easily activate multiple DOFs simultaneously, but often experienced unintended movements when trying to isolate individual DOFs. Offline analyses also suggested that the method used to train linear regression systems may influence controllability. Linear regression myoelectric control using intramuscular EMG provided an alternative to parallel dual-site control for 3-DOF simultaneous control at the wrist and hand. The two methods demonstrated different strengths in controllability, highlighting the tradeoff between providing simultaneous control and the ability to isolate individual DOFs when desired.

  20. Comparative analyses of B{yields}K{sub 2}{sup *}l{sup +}l{sup -} in the standard model and new physics scenarios

    Energy Technology Data Exchange (ETDEWEB)

    Li, Run-Hui [Institute of High Energy Physics, Beijing (China); Yonsei Univ., Seoul (Korea, Republic of). Dept. of Physics and IPAP; Lue, Cai-Dian [Institute of High Energy Physics, Beijing (China); Wang, Wei [Deutsches Elektronen-Synchrotron (DESY), Hamburg (Germany)

    2010-12-15

    We analyze the B {yields} K{sub 2}{sup *}({yields} K{pi})l{sup +}l{sup -} (with l=e,{mu},{tau}) decay in the standard model and two new physics scenarios: vector-like quark model and family non-universal Z{sup '} model. We derive the differential angular distributions of the quasi-four-body decay, using the recently calculated form factors in the perturbative QCD approach. Branching ratios, polarizations, forward-backward asymmetries and transversity amplitudes are predicted, from which we find a promising prospective to observe this channel on the future experiment. We also update the constraints on effective Wilson coefficients and/or free parameters in these two new physics scenarios by making use of the experimental data of B{yields}K{sup *}l{sup +}l{sup -} and b{yields}sl{sup +}l{sup -}. Their impact on B{yields}K{sub 2}{sup *}l{sup +}l{sup -} is subsequently explored and in particular the zero-crossing point for the forward-backward asymmetry in these new physics scenarios can sizably deviate from the SM scenario. In addition we also generalize the analysis to a similar mode B{sub s}{yields}f{sup '}{sub 2}(1525)({yields}K{sup +}K{sup -})l{sup +}l{sup -}. (orig.)

  1. Evaluation of quantitative relationships between saffron yield and nutrition (on farm trial

    Directory of Open Access Journals (Sweden)

    mohamad ali behdani

    2009-06-01

    Full Text Available In order to relate production of saffron and utilization of nutrients, a study was conducted in 2001 and 2002. Four selected locations for this study were Birjand, Gonabad, Qaen and Torbat-Haydariah, which are the main saffron production centers in Iran. This study was performed in 160 saffron farms, aged between 1 and 5 years. Manure, nitrogen and phosphorous fertilizers showed a positive linear relation with yield and length of flowering, while nitrogen and phosphorous showed a negative linear relation with start of flowering period. Yield of saffron showed a significant and positive correlation with the amount of applied manure and the saffron farms with age 4-5 year had highest yield. Our results showed that manure was the most effective factor in production of saffron. The beneficial effects of manure could be due to slow release of nutrients and enhancing soil physical properties. Stepwise regression analysis of yield and fertilizer application showed that 67 percent of yield variations was attributed to manure and phosphorous application.

  2. Can trial sequential monitoring boundaries reduce spurious inferences from meta-analyses?

    DEFF Research Database (Denmark)

    Thorlund, Kristian; Devereaux, P J; Wetterslev, Jørn

    2008-01-01

    BACKGROUND: Results from apparently conclusive meta-analyses may be false. A limited number of events from a few small trials and the associated random error may be under-recognized sources of spurious findings. The information size (IS, i.e. number of participants) required for a reliable......-analyses after each included trial and evaluated their results using a conventional statistical criterion (alpha = 0.05) and two-sided Lan-DeMets monitoring boundaries. We examined the proportion of false positive results and important inaccuracies in estimates of treatment effects that resulted from the two...... approaches. RESULTS: Using the random-effects model and final data, 12 of the meta-analyses yielded P > alpha = 0.05, and 21 yielded P alpha = 0.05. The monitoring boundaries eliminated all false positives. Important inaccuracies in estimates were observed in 6 out of 21 meta-analyses using the conventional...

  3. Using synthetic data to evaluate multiple regression and principal component analyses for statistical modeling of daily building energy consumption

    Energy Technology Data Exchange (ETDEWEB)

    Reddy, T.A. (Energy Systems Lab., Texas A and M Univ., College Station, TX (United States)); Claridge, D.E. (Energy Systems Lab., Texas A and M Univ., College Station, TX (United States))

    1994-01-01

    Multiple regression modeling of monitored building energy use data is often faulted as a reliable means of predicting energy use on the grounds that multicollinearity between the regressor variables can lead both to improper interpretation of the relative importance of the various physical regressor parameters and to a model with unstable regressor coefficients. Principal component analysis (PCA) has the potential to overcome such drawbacks. While a few case studies have already attempted to apply this technique to building energy data, the objectives of this study were to make a broader evaluation of PCA and multiple regression analysis (MRA) and to establish guidelines under which one approach is preferable to the other. Four geographic locations in the US with different climatic conditions were selected and synthetic data sequence representative of daily energy use in large institutional buildings were generated in each location using a linear model with outdoor temperature, outdoor specific humidity and solar radiation as the three regression variables. MRA and PCA approaches were then applied to these data sets and their relative performances were compared. Conditions under which PCA seems to perform better than MRA were identified and preliminary recommendations on the use of either modeling approach formulated. (orig.)

  4. The Application of Classical and Neural Regression Models for the Valuation of Residential Real Estate

    Directory of Open Access Journals (Sweden)

    Mach Łukasz

    2017-06-01

    Full Text Available The research process aimed at building regression models, which helps to valuate residential real estate, is presented in the following article. Two widely used computational tools i.e. the classical multiple regression and regression models of artificial neural networks were used in order to build models. An attempt to define the utilitarian usefulness of the above-mentioned tools and comparative analysis of them is the aim of the conducted research. Data used for conducting analyses refers to the secondary transactional residential real estate market.

  5. Maximum Entropy Discrimination Poisson Regression for Software Reliability Modeling.

    Science.gov (United States)

    Chatzis, Sotirios P; Andreou, Andreas S

    2015-11-01

    Reliably predicting software defects is one of the most significant tasks in software engineering. Two of the major components of modern software reliability modeling approaches are: 1) extraction of salient features for software system representation, based on appropriately designed software metrics and 2) development of intricate regression models for count data, to allow effective software reliability data modeling and prediction. Surprisingly, research in the latter frontier of count data regression modeling has been rather limited. More specifically, a lack of simple and efficient algorithms for posterior computation has made the Bayesian approaches appear unattractive, and thus underdeveloped in the context of software reliability modeling. In this paper, we try to address these issues by introducing a novel Bayesian regression model for count data, based on the concept of max-margin data modeling, effected in the context of a fully Bayesian model treatment with simple and efficient posterior distribution updates. Our novel approach yields a more discriminative learning technique, making more effective use of our training data during model inference. In addition, it allows of better handling uncertainty in the modeled data, which can be a significant problem when the training data are limited. We derive elegant inference algorithms for our model under the mean-field paradigm and exhibit its effectiveness using the publicly available benchmark data sets.

  6. Fusion neutron yield and flux calculation on HT-7 superconducting tokamak

    International Nuclear Information System (INIS)

    Fu Yanzhang; Zhu Yubao; Chen Juequan

    2006-01-01

    Neutron yield and flux have been numerically estimated on HT-7 tokamak. The total fusion neutron yield and neutron flux distribution on different positions and azimuth angles of the device are presented. Analyses on the errors induced by ion temperature and density distribution factors are given in detail. The results of the calculations provide a useful database for neutron diagnostics and neutron radiation protection. (authors)

  7. Tax System in Poland – Progressive or Regressive?

    Directory of Open Access Journals (Sweden)

    Jacek Tomkiewicz

    2016-03-01

    Full Text Available Purpose: To analyse the impact of the Polish fiscal regime on the general revenue of the country, and specifically to establish whether the cumulative tax burden borne by Polish households is progressive or regressive.Methodology: On the basis of Eurostat and OECD data, the author has analysed fiscal regimes in EU Member States and in OECD countries. The tax burden of households within different income groups has also been examined pursuant to applicable fiscal laws and data pertaining to the revenue and expenditure of households published by the Central Statistical Office (CSO.Conclusions: The fiscal regime in Poland is regressive; that is, the relative fiscal burden decreases as the taxpayer’s income increases.Research Implications: The article contributes to the on-going discussion on social cohesion, in particular with respect to economic policy instruments aimed at the redistribution of income within the economy.Originality: The author presents an analysis of data pertaining to fiscal policies in EU Member States and OECD countries and assesses the impact of the legal environment (fiscal regime and social security system in Poland on income distribution within the economy. The impact of the total tax burden (direct and indirect taxes, social security contributions on the economic situation of households from different income groups has been calculated using an original formula.

  8. Climatically driven yield variability of major crops in Khakassia (South Siberia)

    Science.gov (United States)

    Babushkina, Elena A.; Belokopytova, Liliana V.; Zhirnova, Dina F.; Shah, Santosh K.; Kostyakova, Tatiana V.

    2017-12-01

    We investigated the variability of yield of the three main crop cultures in the Khakassia Republic: spring wheat, spring barley, and oats. In terms of yield values, variability characteristics, and climatic response, the agricultural territory of Khakassia can be divided into three zones: (1) the Northern Zone, where crops yield has a high positive response to the amount of precipitation, May-July, and a moderately negative one to the temperatures of the same period; (2) the Central Zone, where crops yield depends mainly on temperatures; and (3) the Southern Zone, where climate has the least expressed impact on yield. The dominant pattern in the crops yield is caused by water stress during periods of high temperatures and low moisture supply with heat stress as additional reason. Differences between zones are due to combinations of temperature latitudinal gradient, precipitation altitudinal gradient, and the presence of a well-developed hydrological network and the irrigational system as moisture sources in the Central Zone. More detailed analysis shows differences in the climatic sensitivity of crops during phases of their vegetative growth and grain development and, to a lesser extent, during harvesting period. Multifactor linear regression models were constructed to estimate climate- and autocorrelation-induced variability of the crops yield. These models allowed prediction of the possibility of yield decreasing by at least 2-11% in the next decade due to increasing of the regional summer temperatures.

  9. Modelling crop yield in Iberia under drought conditions

    Science.gov (United States)

    Ribeiro, Andreia; Páscoa, Patrícia; Russo, Ana; Gouveia, Célia

    2017-04-01

    The improved assessment of the cereal yield and crop loss under drought conditions are essential to meet the increasing economy demands. The growing frequency and severity of the extreme drought conditions in the Iberian Peninsula (IP) has been likely responsible for negative impacts on agriculture, namely on crop yield losses. Therefore, a continuous monitoring of vegetation activity and a reliable estimation of drought impacts is crucial to contribute for the agricultural drought management and development of suitable information tools. This works aims to assess the influence of drought conditions in agricultural yields over the IP, considering cereal yields from mainly rainfed agriculture for the provinces with higher productivity. The main target is to develop a strategy to model drought risk on agriculture for wheat yield at a province level. In order to achieve this goal a combined assessment was made using a drought indicator (Standardized Precipitation Evapotranspiration Index, SPEI) to evaluate drought conditions together with a widely used vegetation index (Normalized Difference Vegetation Index, NDVI) to monitor vegetation activity. A correlation analysis between detrended wheat yield and SPEI was performed in order to assess the vegetation response to each time scale of drought occurrence and also identify the moment of the vegetative cycle when the crop yields are more vulnerable to drought conditions. The time scales and months of SPEI, together with the months of NDVI, better related with wheat yield were chosen to perform a multivariate regression analysis to simulate crop yield. Model results are satisfactory and highlighted the usefulness of such analysis in the framework of developing a drought risk model for crop yields. In terms of an operational point of view, the results aim to contribute to an improved understanding of crop yield management under dry conditions, particularly adding substantial information on the advantages of combining

  10. The non-condition logistic regression analysis of the reason of hypothyroidism after hyperthyroidism with 131I treatment

    International Nuclear Information System (INIS)

    Dang Yaping; Hu Guoying; Meng Xianwen

    1994-01-01

    There are many opinions on the reason of hypothyroidism after hyperthyroidism with 131 I treatment. In this respect, there are a few scientific analyses and reports. The non-condition logistic regression solved this problem successfully. It has a higher scientific value and confidence in the risk factor analysis. 748 follow-up patients' data were analysed by the non-condition logistic regression. The results shown that the half-life and 131 I dose were the main causes of the incidence of hypothyroidism. The degree of confidence is 92.4%

  11. Spatial regression analysis on 32 years of total column ozone data

    NARCIS (Netherlands)

    Knibbe, J.S.; van der A, J.R.; de Laat, A.T.J.

    2014-01-01

    Multiple-regression analyses have been performed on 32 years of total ozone column data that was spatially gridded with a 1 × 1.5° resolution. The total ozone data consist of the MSR (Multi Sensor Reanalysis; 1979-2008) and 2 years of assimilated SCIAMACHY (SCanning Imaging Absorption spectroMeter

  12. Crop yield estimation in 2014 for Vojvodina using methods of remote sensing

    Directory of Open Access Journals (Sweden)

    Jovanović Dušan

    2014-01-01

    Full Text Available Monitoring phenology of crops and yield estimate based on vegetation indices as well as other parameters such as temperature or amount of rainfall were largely reported in literature. In this research, MODIS Normalized Difference Vegetation Index (NDVI was used as an indicator of specific crop condition; the other parameter was Land Surface Temperature (LST which can indicate the amount of crop moisture. Trial years were 2011, 2012, and 2013. For those years sowing structure was acquired from agricultural organizations Nova Budućnost from Žarkovac and Sava Kovačević from Vrbas, both in Serbia. Also, satellite images with high and medium resolution for these areas and years were available. Multiple linear regression was used for crop yield estimate for Vojvodina Province, Serbia where the NDVI and LST were independent variables and the average yield for specific crop was the dependent variable. The results of crop yield estimate two months before harvest are presented (excluding wheat.

  13. Government bond yields and foreign ownership of debt

    NARCIS (Netherlands)

    Broos, Menno; de Haan, Jakob

    2012-01-01

    We analyse the government bond yield spread vis-a-vis Germany for 10 countries in the euro area for the period 1991 to 2009. Our results suggest a positive relationship between the marginal impact of government debt on the spread and foreign ownership of the government debt of the country concerned

  14. Economic Analyses of Ware Yam Production in Orlu Agricultural ...

    African Journals Online (AJOL)

    Economic Analyses of Ware Yam Production in Orlu Agricultural Zone of Imo State. ... International Journal of Agriculture and Rural Development ... statistics, gross margin analysis, marginal analysis and multiple regression analysis. Results ...

  15. Brazilian Soybean Yields and Yield Gaps Vary with Farm Size

    Science.gov (United States)

    Jeffries, G. R.; Cohn, A.; Griffin, T. S.; Bragança, A.

    2017-12-01

    Understanding the farm size-specific characteristics of crop yields and yield gaps may help to improve yields by enabling better targeting of technical assistance and agricultural development programs. Linking remote sensing-based yield estimates with property boundaries provides a novel view of the relationship between farm size and yield structure (yield magnitude, gaps, and stability over time). A growing literature documents variations in yield gaps, but largely ignores the role of farm size as a factor shaping yield structure. Research on the inverse farm size-productivity relationship (IR) theory - that small farms are more productive than large ones all else equal - has documented that yield magnitude may vary by farm size, but has not considered other yield structure characteristics. We examined farm size - yield structure relationships for soybeans in Brazil for years 2001-2015. Using out-of-sample soybean yield predictions from a statistical model, we documented 1) gaps between the 95th percentile of attained yields and mean yields within counties and individual fields, and 2) yield stability defined as the standard deviation of time-detrended yields at given locations. We found a direct relationship between soy yields and farm size at the national level, while the strength and the sign of the relationship varied by region. Soybean yield gaps were found to be inversely related to farm size metrics, even when yields were only compared to farms of similar size. The relationship between farm size and yield stability was nonlinear, with mid-sized farms having the most stable yields. The work suggests that farm size is an important factor in understanding yield structure and that opportunities for improving soy yields in Brazil are greatest among smaller farms.

  16. Quantile regression for the statistical analysis of immunological data with many non-detects.

    Science.gov (United States)

    Eilers, Paul H C; Röder, Esther; Savelkoul, Huub F J; van Wijk, Roy Gerth

    2012-07-07

    Immunological parameters are hard to measure. A well-known problem is the occurrence of values below the detection limit, the non-detects. Non-detects are a nuisance, because classical statistical analyses, like ANOVA and regression, cannot be applied. The more advanced statistical techniques currently available for the analysis of datasets with non-detects can only be used if a small percentage of the data are non-detects. Quantile regression, a generalization of percentiles to regression models, models the median or higher percentiles and tolerates very high numbers of non-detects. We present a non-technical introduction and illustrate it with an implementation to real data from a clinical trial. We show that by using quantile regression, groups can be compared and that meaningful linear trends can be computed, even if more than half of the data consists of non-detects. Quantile regression is a valuable addition to the statistical methods that can be used for the analysis of immunological datasets with non-detects.

  17. Gender Gaps in Mathematics, Science and Reading Achievements in Muslim Countries: A Quantile Regression Approach

    Science.gov (United States)

    Shafiq, M. Najeeb

    2013-01-01

    Using quantile regression analyses, this study examines gender gaps in mathematics, science, and reading in Azerbaijan, Indonesia, Jordan, the Kyrgyz Republic, Qatar, Tunisia, and Turkey among 15-year-old students. The analyses show that girls in Azerbaijan achieve as well as boys in mathematics and science and overachieve in reading. In Jordan,…

  18. Does the inclusion of grey literature influence estimates of intervention effectiveness reported in meta-analyses?

    Science.gov (United States)

    McAuley, L; Pham, B; Tugwell, P; Moher, D

    2000-10-07

    The inclusion of only a subset of all available evidence in a meta-analysis may introduce biases and threaten its validity; this is particularly likely if the subset of included studies differ from those not included, which may be the case for published and grey literature (unpublished studies, with limited distribution). We set out to examine whether exclusion of grey literature, compared with its inclusion in meta-analysis, provides different estimates of the effectiveness of interventions assessed in randomised trials. From a random sample of 135 meta-analyses, we identified and retrieved 33 publications that included both grey and published primary studies. The 33 publications contributed 41 separate meta-analyses from several disease areas. General characteristics of the meta-analyses and associated studies and outcome data at the trial level were collected. We explored the effects of the inclusion of grey literature on the quantitative results using logistic-regression analyses. 33% of the meta-analyses were found to include some form of grey literature. The grey literature, when included, accounts for between 4.5% and 75% of the studies in a meta-analysis. On average, published work, compared with grey literature, yielded significantly larger estimates of the intervention effect by 15% (ratio of odds ratios=1.15 [95% CI 1.04-1.28]). Excluding abstracts from the analysis further compounded the exaggeration (1.33 [1.10-1.60]). The exclusion of grey literature from meta-analyses can lead to exaggerated estimates of intervention effectiveness. In general, meta-analysts should attempt to identify, retrieve, and include all reports, grey and published, that meet predefined inclusion criteria.

  19. Estimating milk yield and value losses from increased somatic cell count on US dairy farms.

    Science.gov (United States)

    Hadrich, J C; Wolf, C A; Lombard, J; Dolak, T M

    2018-04-01

    Milk loss due to increased somatic cell counts (SCC) results in economic losses for dairy producers. This research uses 10 mo of consecutive dairy herd improvement data from 2013 and 2014 to estimate milk yield loss using SCC as a proxy for clinical and subclinical mastitis. A fixed effects regression was used to examine factors that affected milk yield while controlling for herd-level management. Breed, milking frequency, days in milk, seasonality, SCC, cumulative months with SCC greater than 100,000 cells/mL, lactation, and herd size were variables included in the regression analysis. The cumulative months with SCC above a threshold was included as a proxy for chronic mastitis. Milk yield loss increased as the number of test days with SCC ≥100,000 cells/mL increased. Results from the regression were used to estimate a monetary value of milk loss related to SCC as a function of cow and operation related explanatory variables for a representative dairy cow. The largest losses occurred from increased cumulative test days with a SCC ≥100,000 cells/mL, with daily losses of $1.20/cow per day in the first month to $2.06/cow per day in mo 10. Results demonstrate the importance of including the duration of months above a threshold SCC when estimating milk yield losses. Cows with chronic mastitis, measured by increased consecutive test days with SCC ≥100,000 cells/mL, resulted in higher milk losses than cows with a new infection. This provides farm managers with a method to evaluate the trade-off between treatment and culling decisions as it relates to mastitis control and early detection. Copyright © 2018 American Dairy Science Association. Published by Elsevier Inc. All rights reserved.

  20. Effect of forest on sediment yield in North China

    Directory of Open Access Journals (Sweden)

    Yu Xinxiao Prof.

    2013-06-01

    Full Text Available Forest-sediment relationship is a hot and important issue in Ecohydrology studies. China has implemented many large-scale reforestation programmes in the last decades to address the growing soil erosion and desertification. In this study, we made statistical and graphic analyses on the long-term hydrological data of the 39 watersheds in the rocky mountain area of the North China, and then we were able to analyze the effect of forest on sediment yield. Our results show that the effect is weak in the lees-precipitation regions (when MAP 500 mm, the impact of forest on reducing sediment yield is different with the varied forest coverage (f, the relationship between the sediment yield and forest coverage show a quadratic polynomial.

  1. Area under the curve predictions of dalbavancin, a new lipoglycopeptide agent, using the end of intravenous infusion concentration data point by regression analyses such as linear, log-linear and power models.

    Science.gov (United States)

    Bhamidipati, Ravi Kanth; Syed, Muzeeb; Mullangi, Ramesh; Srinivas, Nuggehally

    2018-02-01

    1. Dalbavancin, a lipoglycopeptide, is approved for treating gram-positive bacterial infections. Area under plasma concentration versus time curve (AUC inf ) of dalbavancin is a key parameter and AUC inf /MIC ratio is a critical pharmacodynamic marker. 2. Using end of intravenous infusion concentration (i.e. C max ) C max versus AUC inf relationship for dalbavancin was established by regression analyses (i.e. linear, log-log, log-linear and power models) using 21 pairs of subject data. 3. The predictions of the AUC inf were performed using published C max data by application of regression equations. The quotient of observed/predicted values rendered fold difference. The mean absolute error (MAE)/root mean square error (RMSE) and correlation coefficient (r) were used in the assessment. 4. MAE and RMSE values for the various models were comparable. The C max versus AUC inf exhibited excellent correlation (r > 0.9488). The internal data evaluation showed narrow confinement (0.84-1.14-fold difference) with a RMSE models predicted AUC inf with a RMSE of 3.02-27.46% with fold difference largely contained within 0.64-1.48. 5. Regardless of the regression models, a single time point strategy of using C max (i.e. end of 30-min infusion) is amenable as a prospective tool for predicting AUC inf of dalbavancin in patients.

  2. Estimation of Constituent Concentrations, Loads, and Yields in Streams of Johnson County, Northeast Kansas, Using Continuous Water-Quality Monitoring and Regression Models, October 2002 through December 2006

    Science.gov (United States)

    Rasmussen, Teresa J.; Lee, Casey J.; Ziegler, Andrew C.

    2008-01-01

    Johnson County is one of the most rapidly developing counties in Kansas. Population growth and expanding urban land use affect the quality of county streams, which are important for human and environmental health, water supply, recreation, and aesthetic value. This report describes estimates of streamflow and constituent concentrations, loads, and yields in relation to watershed characteristics in five Johnson County streams using continuous in-stream sensor measurements. Specific conductance, pH, water temperature, turbidity, and dissolved oxygen were monitored in five watersheds from October 2002 through December 2006. These continuous data were used in conjunction with discrete water samples to develop regression models for continuously estimating concentrations of other constituents. Continuous regression-based concentrations were estimated for suspended sediment, total suspended solids, dissolved solids and selected major ions, nutrients (nitrogen and phosphorus species), and fecal-indicator bacteria. Continuous daily, monthly, seasonal, and annual loads were calculated from concentration estimates and streamflow. The data are used to describe differences in concentrations, loads, and yields and to explain these differences relative to watershed characteristics. Water quality at the five monitoring sites varied according to hydrologic conditions; contributing drainage area; land use (including degree of urbanization); relative contributions from point and nonpoint constituent sources; and human activity within each watershed. Dissolved oxygen (DO) concentrations were less than the Kansas aquatic-life-support criterion of 5.0 mg/L less than 10 percent of the time at all sites except Indian Creek, which had DO concentrations less than the criterion about 15 percent of the time. Concentrations of suspended sediment, chloride (winter only), indicator bacteria, and pesticides were substantially larger during periods of increased streamflow. Suspended

  3. Logistic Regression in the Identification of Hazards in Construction

    Science.gov (United States)

    Drozd, Wojciech

    2017-10-01

    The construction site and its elements create circumstances that are conducive to the formation of risks to safety during the execution of works. Analysis indicates the critical importance of these factors in the set of characteristics that describe the causes of accidents in the construction industry. This article attempts to analyse the characteristics related to the construction site, in order to indicate their importance in defining the circumstances of accidents at work. The study includes sites inspected in 2014 - 2016 by the employees of the District Labour Inspectorate in Krakow (Poland). The analysed set of detailed (disaggregated) data includes both quantitative and qualitative characteristics. The substantive task focused on classification modelling in the identification of hazards in construction and identifying those of the analysed characteristics that are important in an accident. In terms of methodology, resource data analysis using statistical classifiers, in the form of logistic regression, was the method used.

  4. Stability Parameters for Grain Yield and its Component Traits in Maize Hybrids of Different FAO Maturity Groups

    Directory of Open Access Journals (Sweden)

    Dragan Djurovic

    2014-12-01

    Full Text Available An objective evaluation of maize hybrids in intensive cropping systems requires identification not only of yield components and other agronomically important traits but also of stability parameters. Grain yield and its components were assessed in 11 maize hybrids with different lengths of growing season (FAO 300-700 maturity groups using analysis of variance and regression analysis at three different locations in Western Serbia. The test hybrids and locations showed significant differences in grain yield, grain moisture content at maturity, 1,000-kernel weight and ear length. A significant interaction was observed between all traits and the environment. The hybrids with higher mean values of the traits, regardless of maturity group, generally exhibited sensitivity i.e. adaptation to more favourable environmental conditions as compared to those having lower mean values. Regression coefficient (bi values for grain yield mostly suggested no significant differences relative to the mean. The medium-season hybrid gave high yields and less favourable values of stability parameters at most locations and in most years, as compared to mediumlate hybrids. As compared to medium-early hybrids, medium-late hybrids (FAO 600 and 700 mostly exhibited unfavourable values of stability parameters i.e. a specific response and better adaptation to favourable environmental conditions, and gave higher average yields. Apart from producing lower average yields, FAO 300 and 400 hybrids showed higher yield stability as compared to the other hybrids tested. Medium-late hybrids had higher yields and showed a better response to favourable environmental conditions compared to early-maturing hybrids. Therefore, they can be recommended for intensive cultural practices and low-stress environments. Due to their more favourable stability parameter values, medium-early hybrids can be recommended for low-intensity cultural practices and stressful environments.

  5. A Fast Gradient Method for Nonnegative Sparse Regression With Self-Dictionary

    Science.gov (United States)

    Gillis, Nicolas; Luce, Robert

    2018-01-01

    A nonnegative matrix factorization (NMF) can be computed efficiently under the separability assumption, which asserts that all the columns of the given input data matrix belong to the cone generated by a (small) subset of them. The provably most robust methods to identify these conic basis columns are based on nonnegative sparse regression and self dictionaries, and require the solution of large-scale convex optimization problems. In this paper we study a particular nonnegative sparse regression model with self dictionary. As opposed to previously proposed models, this model yields a smooth optimization problem where the sparsity is enforced through linear constraints. We show that the Euclidean projection on the polyhedron defined by these constraints can be computed efficiently, and propose a fast gradient method to solve our model. We compare our algorithm with several state-of-the-art methods on synthetic data sets and real-world hyperspectral images.

  6. Use of multiple linear regression and logistic regression models to investigate changes in birthweight for term singleton infants in Scotland.

    Science.gov (United States)

    Bonellie, Sandra R

    2012-10-01

    To illustrate the use of regression and logistic regression models to investigate changes over time in size of babies particularly in relation to social deprivation, age of the mother and smoking. Mean birthweight has been found to be increasing in many countries in recent years, but there are still a group of babies who are born with low birthweights. Population-based retrospective cohort study. Multiple linear regression and logistic regression models are used to analyse data on term 'singleton births' from Scottish hospitals between 1994-2003. Mothers who smoke are shown to give birth to lighter babies on average, a difference of approximately 0.57 Standard deviations lower (95% confidence interval. 0.55-0.58) when adjusted for sex and parity. These mothers are also more likely to have babies that are low birthweight (odds ratio 3.46, 95% confidence interval 3.30-3.63) compared with non-smokers. Low birthweight is 30% more likely where the mother lives in the most deprived areas compared with the least deprived, (odds ratio 1.30, 95% confidence interval 1.21-1.40). Smoking during pregnancy is shown to have a detrimental effect on the size of infants at birth. This effect explains some, though not all, of the observed socioeconomic birthweight. It also explains much of the observed birthweight differences by the age of the mother.   Identifying mothers at greater risk of having a low birthweight baby as important implications for the care and advice this group receives. © 2012 Blackwell Publishing Ltd.

  7. Time-adaptive quantile regression

    DEFF Research Database (Denmark)

    Møller, Jan Kloppenborg; Nielsen, Henrik Aalborg; Madsen, Henrik

    2008-01-01

    and an updating procedure are combined into a new algorithm for time-adaptive quantile regression, which generates new solutions on the basis of the old solution, leading to savings in computation time. The suggested algorithm is tested against a static quantile regression model on a data set with wind power......An algorithm for time-adaptive quantile regression is presented. The algorithm is based on the simplex algorithm, and the linear optimization formulation of the quantile regression problem is given. The observations have been split to allow a direct use of the simplex algorithm. The simplex method...... production, where the models combine splines and quantile regression. The comparison indicates superior performance for the time-adaptive quantile regression in all the performance parameters considered....

  8. Use of abrupt strain path change for determining subsequent yield surface

    DEFF Research Database (Denmark)

    Kuroda, Mitsutoshi; Tvergaard, Viggo

    1999-01-01

    is prescribed, which makes the stress point move quickly along the yield surface. It is assumed that a closed-loop testing machine is used for the experiment, so that the strain path can be prescribed according to strain gauge measurements. Relative to the standard method of determining yield surface shapes...... by probing in many different stress directions from the elastic region, using some chosen plastic strain offset, the main advantage of the proposed method is that elastic unloading is not needed prior to tracing the yield surface. The method is illustrated here by a few analyses, first for the simplest how...

  9. The predictive power of dividend yields for future inflation: Money illusion or rational causes?

    DEFF Research Database (Denmark)

    Engsted, Tom; Pedersen, Thomas Quistgaard

    slope coefficients that increase numerically with the horizon in regressions of future inflation onto the dividend yield, in accordance with the data. A purely rational version of the model with no money illusion, but with a link from expected inflation to real consumption growth, also generates...

  10. Study of pp{yields}pp{eta} reaction at threshold; Etude de la reaction pp{yields}pp{eta} au seuil

    Energy Technology Data Exchange (ETDEWEB)

    Taleb, A

    1994-11-01

    The {eta} production has been studied through the pp {yields} pp{eta} reaction at threshold. Data were taken at the Synchrotron of the ``Laboratoire National Saturne``. The detection in coincidence of the two protons scattered near 0 deg and analysed with the magnetic spectrometer SPES3 allows the reconstruction of missing mass spectra for the {eta} signature. A simulation program which takes into account all the experimental set up characteristics has been realized and tested through the pp {yields} d{pi}{sup +} reaction detected simultaneously with pp {yields} pp{eta}. The generated proton momentum spectra for pp {yields} pp{eta} show a pronounced {eta} mass dependence. This characteristic, connected to the kinematical properties of pp {yields} pp{eta} at threshold, is used to extract the mass of the meson {eta}. The obtained value, m{sub {eta}} = 547.65 {+-} 0.18 MeV, is in good agreement with measurement done recently through the pd {yields} {sup H}e{eta} reaction. The total cross section {sigma}{sub t} of pp {yields} pp{eta} measured at 1260, 1265 and 1300 MeV presents a strong energy dependence. This cross section increases less with energy than the phase-space. The influence of p-p and {eta}-p final state interactions in our measurements is studied. Our results are compared with theoretical predictions and assess the dominant character of the baryonic resonance N{sup *}(1535) in the {eta} mechanism production at threshold. These experimental results give an energy dependence which is not well reproduced by the theoretical predictions. This discrepancy could be an incorrect description of the {eta}-p interaction in the models. (author). 48 refs., 60 figs., 15 tabs.

  11. Mitogenomic analyses from ancient DNA

    DEFF Research Database (Denmark)

    Paijmans, Johanna L. A.; Gilbert, Tom; Hofreiter, Michael

    2013-01-01

    The analysis of ancient DNA is playing an increasingly important role in conservation genetic, phylogenetic and population genetic analyses, as it allows incorporating extinct species into DNA sequence trees and adds time depth to population genetics studies. For many years, these types of DNA...... analyses (whether using modern or ancient DNA) were largely restricted to the analysis of short fragments of the mitochondrial genome. However, due to many technological advances during the past decade, a growing number of studies have explored the power of complete mitochondrial genome sequences...... yielded major progress with regard to both the phylogenetic positions of extinct species, as well as resolving population genetics questions in both extinct and extant species....

  12. CHANGES IN CLIMATIC CHARACTERISTICS AND CROP YIELD IN KWARA STATE (NIGERIA

    Directory of Open Access Journals (Sweden)

    O. Oriola

    2017-01-01

    Full Text Available This paper assessed the vagaries of climatic elements on crop yield in Kwara State with a view to predicting the future climatic suitability level for selected crops in the state. Descriptive and infrential statistics analytical methods were used to examine the pattern of climatic elements for a period of 30 years. Analysis of variance was used to examine the variations in crop yield and also to determine whether or not significant differences in the harvests of the period under investigation. Correlation analysis was used to determine the relationship between climatic elements and crop yield while multiple regression analysis was used to determine the contribution of each climatic elements to crop yield. Time series analysis was used to project crop yield from 2014 to 2025. GAEZ model was adopted to determine the climatic suitability for the selected crops over time 1960 - 2050 and ArcGIS 10.3 software was used to produce the crop suitability maps. The result revealed that cassava, yam, maize and cowpea would be less suitable for production with the rate at which the climate is changing. The result also revealed that the climatic suitability level for cassava, yam, maize and cowpea would reduce drastically with time. The prediction shows severe impacts of changes in the selected climatic elements on both overall climatic suitability and crop the selected crops yield for by 2050.

  13. Predicting seed yield in perennial ryegrass using repeated canopy reflectance measurements and PLSR

    DEFF Research Database (Denmark)

    Gislum, René; Deleuran, Lise Christina; Boelt, Birte

    2009-01-01

    with first year seed crops using three sowing rates and three spring nitrogen (N) application rates. PLSR models were developed for each year and showed correlation coefficients of 0.71, 0.76, and 0.92, respectively. Regression coefficients showed in these experiments that the optimum time for canopy...... reflectance measurements was from approximately 600 cumulative growing degree-days (CGDD) to approximately 900 CGDD. This is the period just before and at heading of the seed crop. Furthermore, regression coefficients showed that information about N and water is important. The results support the development......Repeated canopy reflectance measurements together with partial least-squares regression (PLSR) were used to predict seed yield in perennial ryegrass (Lolium perenne L.). The measurements were performed during the spring and summer growing seasons of 2001 to 2003 in three field experiments...

  14. Mortality risk prediction in burn injury: Comparison of logistic regression with machine learning approaches.

    Science.gov (United States)

    Stylianou, Neophytos; Akbarov, Artur; Kontopantelis, Evangelos; Buchan, Iain; Dunn, Ken W

    2015-08-01

    Predicting mortality from burn injury has traditionally employed logistic regression models. Alternative machine learning methods have been introduced in some areas of clinical prediction as the necessary software and computational facilities have become accessible. Here we compare logistic regression and machine learning predictions of mortality from burn. An established logistic mortality model was compared to machine learning methods (artificial neural network, support vector machine, random forests and naïve Bayes) using a population-based (England & Wales) case-cohort registry. Predictive evaluation used: area under the receiver operating characteristic curve; sensitivity; specificity; positive predictive value and Youden's index. All methods had comparable discriminatory abilities, similar sensitivities, specificities and positive predictive values. Although some machine learning methods performed marginally better than logistic regression the differences were seldom statistically significant and clinically insubstantial. Random forests were marginally better for high positive predictive value and reasonable sensitivity. Neural networks yielded slightly better prediction overall. Logistic regression gives an optimal mix of performance and interpretability. The established logistic regression model of burn mortality performs well against more complex alternatives. Clinical prediction with a small set of strong, stable, independent predictors is unlikely to gain much from machine learning outside specialist research contexts. Copyright © 2015 Elsevier Ltd and ISBI. All rights reserved.

  15. Econometric analysis of realised covariation: high frequency covariance, regression and correlation in financial economics

    OpenAIRE

    Ole E. Barndorff-Nielsen; Neil Shephard

    2002-01-01

    This paper analyses multivariate high frequency financial data using realised covariation. We provide a new asymptotic distribution theory for standard methods such as regression, correlation analysis and covariance. It will be based on a fixed interval of time (e.g. a day or week), allowing the number of high frequency returns during this period to go to infinity. Our analysis allows us to study how high frequency correlations, regressions and covariances change through time. In particular w...

  16. Tomato Yield and Water Use Efficiency - Coupling Effects between Growth Stage Specific Soil Water Deficits

    DEFF Research Database (Denmark)

    Chen, Si; Zhenjiang, Zhou; Andersen, Mathias Neumann

    2015-01-01

    To investigate the sensitivity of tomato yield and water use efficiency (WUE) to soil water content at different growth stages, the central composite rotatable design (CCRD) was employed in a five-factor-five-level pot experiment under regulated deficit irrigation. Two regression models concerning...... the effects of stage-specific soil water content on tomato yield and WUE were established. The results showed that the lowest available soil water (ASW) content (around 28%) during vegetative growth stage (here denoted θ1) resulted in high yield and WUE. Moderate (around 69% ASW) during blooming and fruit...... effects of ASW in two growth stages were between θ2 and θ5, θ3. In both cases a moderate θ2 was a precondition for maximum yield response to increasing θ5 and θ3. Sensitivity analysis revealed that yield was most sensitive to soil water content at fruit maturity (θ5). Numerical inspection...

  17. Genetic Variation and Association Analysis of the SSR Markers Linked to the Major Drought-Yield QTLs of Rice.

    Science.gov (United States)

    Tabkhkar, Narjes; Rabiei, Babak; Samizadeh Lahiji, Habibollah; Hosseini Chaleshtori, Maryam

    2018-02-24

    Drought is one of the major abiotic stresses, which hampers the production of rice worldwide. Informative molecular markers are valuable tools for improving the drought tolerance in various varieties of rice. The present study was conducted to evaluate the informative simple sequence repeat (SSR) markers in a diverse set of rice genotypes. The genetic diversity analyses of the 83 studied rice genotypes were performed using 34 SSR markers closely linked to the major quantitative trait loci (QTLs) of grain yield under drought stress (qDTYs). In general, our results indicated high levels of polymorphism. In addition, we screened these rice genotypes at the reproductive stage under both drought stress and nonstressful conditions. The results of the regression analysis demonstrated a significant relationship between 11 SSR marker alleles and the plant paddy weight under stressful conditions. Under the nonstressful conditions, 16 SSR marker alleles showed a significant correlation with the plant paddy weight. Finally, four markers (RM279, RM231, RM166, and RM231) demonstrated a significant association with the plant paddy weight under both stressful and nonstressful conditions. These informative-associated alleles may be useful for improving the crop yield under both drought stress and nonstressful conditions in breeding programs.

  18. Regression analysis by example

    CERN Document Server

    Chatterjee, Samprit

    2012-01-01

    Praise for the Fourth Edition: ""This book is . . . an excellent source of examples for regression analysis. It has been and still is readily readable and understandable."" -Journal of the American Statistical Association Regression analysis is a conceptually simple method for investigating relationships among variables. Carrying out a successful application of regression analysis, however, requires a balance of theoretical results, empirical rules, and subjective judgment. Regression Analysis by Example, Fifth Edition has been expanded

  19. In-Season Yield Prediction of Cabbage with a Hand-Held Active Canopy Sensor.

    Science.gov (United States)

    Ji, Rongting; Min, Ju; Wang, Yuan; Cheng, Hu; Zhang, Hailin; Shi, Weiming

    2017-10-08

    Efficient and precise yield prediction is critical to optimize cabbage yields and guide fertilizer application. A two-year field experiment was conducted to establish a yield prediction model for cabbage by using the Greenseeker hand-held optical sensor. Two cabbage cultivars (Jianbao and Pingbao) were used and Jianbao cultivar was grown for 2 consecutive seasons but Pingbao was only grown in the second season. Four chemical nitrogen application rates were implemented: 0, 80, 140, and 200 kg·N·ha -1 . Normalized difference vegetation index (NDVI) was collected 20, 50, 70, 80, 90, 100, 110, 120, 130, and 140 days after transplanting (DAT). Pearson correlation analysis and regression analysis were performed to identify the relationship between the NDVI measurements and harvested yields of cabbage. NDVI measurements obtained at 110 DAT were significantly correlated to yield and explained 87-89% and 75-82% of the cabbage yield variation of Jianbao cultivar over the two-year experiment and 77-81% of the yield variability of Pingbao cultivar. Adjusting the yield prediction models with CGDD (cumulative growing degree days) could make remarkable improvement to the accuracy of the prediction model and increase the determination coefficient to 0.82, while the modification with DFP (days from transplanting when GDD > 0) values did not. The integrated exponential yield prediction equation was better than linear or quadratic functions and could accurately make in-season estimation of cabbage yields with different cultivars between years.

  20. Retrieving relevant factors with exploratory SEM and principal-covariate regression: A comparison.

    Science.gov (United States)

    Vervloet, Marlies; Van den Noortgate, Wim; Ceulemans, Eva

    2018-02-12

    Behavioral researchers often linearly regress a criterion on multiple predictors, aiming to gain insight into the relations between the criterion and predictors. Obtaining this insight from the ordinary least squares (OLS) regression solution may be troublesome, because OLS regression weights show only the effect of a predictor on top of the effects of other predictors. Moreover, when the number of predictors grows larger, it becomes likely that the predictors will be highly collinear, which makes the regression weights' estimates unstable (i.e., the "bouncing beta" problem). Among other procedures, dimension-reduction-based methods have been proposed for dealing with these problems. These methods yield insight into the data by reducing the predictors to a smaller number of summarizing variables and regressing the criterion on these summarizing variables. Two promising methods are principal-covariate regression (PCovR) and exploratory structural equation modeling (ESEM). Both simultaneously optimize reduction and prediction, but they are based on different frameworks. The resulting solutions have not yet been compared; it is thus unclear what the strengths and weaknesses are of both methods. In this article, we focus on the extents to which PCovR and ESEM are able to extract the factors that truly underlie the predictor scores and can predict a single criterion. The results of two simulation studies showed that for a typical behavioral dataset, ESEM (using the BIC for model selection) in this regard is successful more often than PCovR. Yet, in 93% of the datasets PCovR performed equally well, and in the case of 48 predictors, 100 observations, and large differences in the strengths of the factors, PCovR even outperformed ESEM.

  1. Applied logistic regression

    CERN Document Server

    Hosmer, David W; Sturdivant, Rodney X

    2013-01-01

     A new edition of the definitive guide to logistic regression modeling for health science and other applications This thoroughly expanded Third Edition provides an easily accessible introduction to the logistic regression (LR) model and highlights the power of this model by examining the relationship between a dichotomous outcome and a set of covariables. Applied Logistic Regression, Third Edition emphasizes applications in the health sciences and handpicks topics that best suit the use of modern statistical software. The book provides readers with state-of-

  2. Privacy-Preserving Distributed Linear Regression on High-Dimensional Data

    Directory of Open Access Journals (Sweden)

    Gascón Adrià

    2017-10-01

    Full Text Available We propose privacy-preserving protocols for computing linear regression models, in the setting where the training dataset is vertically distributed among several parties. Our main contribution is a hybrid multi-party computation protocol that combines Yao’s garbled circuits with tailored protocols for computing inner products. Like many machine learning tasks, building a linear regression model involves solving a system of linear equations. We conduct a comprehensive evaluation and comparison of different techniques for securely performing this task, including a new Conjugate Gradient Descent (CGD algorithm. This algorithm is suitable for secure computation because it uses an efficient fixed-point representation of real numbers while maintaining accuracy and convergence rates comparable to what can be obtained with a classical solution using floating point numbers. Our technique improves on Nikolaenko et al.’s method for privacy-preserving ridge regression (S&P 2013, and can be used as a building block in other analyses. We implement a complete system and demonstrate that our approach is highly scalable, solving data analysis problems with one million records and one hundred features in less than one hour of total running time.

  3. Normalization Ridge Regression in Practice I: Comparisons Between Ordinary Least Squares, Ridge Regression and Normalization Ridge Regression.

    Science.gov (United States)

    Bulcock, J. W.

    The problem of model estimation when the data are collinear was examined. Though the ridge regression (RR) outperforms ordinary least squares (OLS) regression in the presence of acute multicollinearity, it is not a problem free technique for reducing the variance of the estimates. It is a stochastic procedure when it should be nonstochastic and it…

  4. Quantile regression for the statistical analysis of immunological data with many non-detects

    NARCIS (Netherlands)

    Eilers, P.H.C.; Roder, E.; Savelkoul, H.F.J.; Wijk, van R.G.

    2012-01-01

    Background Immunological parameters are hard to measure. A well-known problem is the occurrence of values below the detection limit, the non-detects. Non-detects are a nuisance, because classical statistical analyses, like ANOVA and regression, cannot be applied. The more advanced statistical

  5. Quantile regression for the statistical analysis of immunological data with many non-detects

    NARCIS (Netherlands)

    P.H.C. Eilers (Paul); E. Röder (Esther); H.F.J. Savelkoul (Huub); R. Gerth van Wijk (Roy)

    2012-01-01

    textabstractBackground: Immunological parameters are hard to measure. A well-known problem is the occurrence of values below the detection limit, the non-detects. Non-detects are a nuisance, because classical statistical analyses, like ANOVA and regression, cannot be applied. The more advanced

  6. Estimating yield gaps at the cropping system level.

    Science.gov (United States)

    Guilpart, Nicolas; Grassini, Patricio; Sadras, Victor O; Timsina, Jagadish; Cassman, Kenneth G

    2017-05-01

    Yield gap analyses of individual crops have been used to estimate opportunities for increasing crop production at local to global scales, thus providing information crucial to food security. However, increases in crop production can also be achieved by improving cropping system yield through modification of spatial and temporal arrangement of individual crops. In this paper we define the cropping system yield potential as the output from the combination of crops that gives the highest energy yield per unit of land and time, and the cropping system yield gap as the difference between actual energy yield of an existing cropping system and the cropping system yield potential. Then, we provide a framework to identify alternative cropping systems which can be evaluated against the current ones. A proof-of-concept is provided with irrigated rice-maize systems at four locations in Bangladesh that represent a range of climatic conditions in that country. The proposed framework identified (i) realistic alternative cropping systems at each location, and (ii) two locations where expected improvements in crop production from changes in cropping intensity (number of crops per year) were 43% to 64% higher than from improving the management of individual crops within the current cropping systems. The proposed framework provides a tool to help assess food production capacity of new systems ( e.g. with increased cropping intensity) arising from climate change, and assess resource requirements (water and N) and associated environmental footprint per unit of land and production of these new systems. By expanding yield gap analysis from individual crops to the cropping system level and applying it to new systems, this framework could also be helpful to bridge the gap between yield gap analysis and cropping/farming system design.

  7. Patterns of medicinal plant use: an examination of the Ecuadorian Shuar medicinal flora using contingency table and binomial analyses.

    Science.gov (United States)

    Bennett, Bradley C; Husby, Chad E

    2008-03-28

    Botanical pharmacopoeias are non-random subsets of floras, with some taxonomic groups over- or under-represented. Moerman [Moerman, D.E., 1979. Symbols and selectivity: a statistical analysis of Native American medical ethnobotany, Journal of Ethnopharmacology 1, 111-119] introduced linear regression/residual analysis to examine these patterns. However, regression, the commonly-employed analysis, suffers from several statistical flaws. We use contingency table and binomial analyses to examine patterns of Shuar medicinal plant use (from Amazonian Ecuador). We first analyzed the Shuar data using Moerman's approach, modified to better meet requirements of linear regression analysis. Second, we assessed the exact randomization contingency table test for goodness of fit. Third, we developed a binomial model to test for non-random selection of plants in individual families. Modified regression models (which accommodated assumptions of linear regression) reduced R(2) to from 0.59 to 0.38, but did not eliminate all problems associated with regression analyses. Contingency table analyses revealed that the entire flora departs from the null model of equal proportions of medicinal plants in all families. In the binomial analysis, only 10 angiosperm families (of 115) differed significantly from the null model. These 10 families are largely responsible for patterns seen at higher taxonomic levels. Contingency table and binomial analyses offer an easy and statistically valid alternative to the regression approach.

  8. Vector regression introduced

    Directory of Open Access Journals (Sweden)

    Mok Tik

    2014-06-01

    Full Text Available This study formulates regression of vector data that will enable statistical analysis of various geodetic phenomena such as, polar motion, ocean currents, typhoon/hurricane tracking, crustal deformations, and precursory earthquake signals. The observed vector variable of an event (dependent vector variable is expressed as a function of a number of hypothesized phenomena realized also as vector variables (independent vector variables and/or scalar variables that are likely to impact the dependent vector variable. The proposed representation has the unique property of solving the coefficients of independent vector variables (explanatory variables also as vectors, hence it supersedes multivariate multiple regression models, in which the unknown coefficients are scalar quantities. For the solution, complex numbers are used to rep- resent vector information, and the method of least squares is deployed to estimate the vector model parameters after transforming the complex vector regression model into a real vector regression model through isomorphism. Various operational statistics for testing the predictive significance of the estimated vector parameter coefficients are also derived. A simple numerical example demonstrates the use of the proposed vector regression analysis in modeling typhoon paths.

  9. Evaluating a satellite-based seasonal evapotranspiration product and identifying its relationship with other satellite-derived products and crop yield: A case study for Ethiopia

    Science.gov (United States)

    Tadesse, Tsegaye; Senay, Gabriel B.; Berhan, Getachew; Regassa, Teshome; Beyene, Shimelis

    2015-01-01

    Satellite-derived evapotranspiration anomalies and normalized difference vegetation index (NDVI) products from Moderate Resolution Imaging Spectroradiometer (MODIS) data are currently used for African agricultural drought monitoring and food security status assessment. In this study, a process to evaluate satellite-derived evapotranspiration (ETa) products with a geospatial statistical exploratory technique that uses NDVI, satellite-derived rainfall estimate (RFE), and crop yield data has been developed. The main goal of this study was to evaluate the ETa using the NDVI and RFE, and identify a relationship between the ETa and Ethiopia’s cereal crop (i.e., teff, sorghum, corn/maize, barley, and wheat) yields during the main rainy season. Since crop production is one of the main factors affecting food security, the evaluation of remote sensing-based seasonal ETa was done to identify the appropriateness of this tool as a proxy for monitoring vegetation condition in drought vulnerable and food insecure areas to support decision makers. The results of this study showed that the comparison between seasonal ETa and RFE produced strong correlation (R2 > 0.99) for all 41 crop growing zones in Ethiopia. The results of the spatial regression analyses of seasonal ETa and NDVI using Ordinary Least Squares and Geographically Weighted Regression showed relatively weak yearly spatial relationships (R2 < 0.7) for all cropping zones. However, for each individual crop zones, the correlation between NDVI and ETa ranged between 0.3 and 0.84 for about 44% of the cropping zones. Similarly, for each individual crop zones, the correlation (R2) between the seasonal ETa anomaly and de-trended cereal crop yield was between 0.4 and 0.82 for 76% (31 out of 41) of the crop growing zones. The preliminary results indicated that the ETa products have a good predictive potential for these 31 identified zones in Ethiopia. Decision makers may potentially use ETa products for monitoring cereal

  10. Modeling and prediction of Turkey's electricity consumption using Support Vector Regression

    International Nuclear Information System (INIS)

    Kavaklioglu, Kadir

    2011-01-01

    Support Vector Regression (SVR) methodology is used to model and predict Turkey's electricity consumption. Among various SVR formalisms, ε-SVR method was used since the training pattern set was relatively small. Electricity consumption is modeled as a function of socio-economic indicators such as population, Gross National Product, imports and exports. In order to facilitate future predictions of electricity consumption, a separate SVR model was created for each of the input variables using their current and past values; and these models were combined to yield consumption prediction values. A grid search for the model parameters was performed to find the best ε-SVR model for each variable based on Root Mean Square Error. Electricity consumption of Turkey is predicted until 2026 using data from 1975 to 2006. The results show that electricity consumption can be modeled using Support Vector Regression and the models can be used to predict future electricity consumption. (author)

  11. A note on the use of multiple linear regression in molecular ecology.

    Science.gov (United States)

    Frasier, Timothy R

    2016-03-01

    Multiple linear regression analyses (also often referred to as generalized linear models--GLMs, or generalized linear mixed models--GLMMs) are widely used in the analysis of data in molecular ecology, often to assess the relative effects of genetic characteristics on individual fitness or traits, or how environmental characteristics influence patterns of genetic differentiation. However, the coefficients resulting from multiple regression analyses are sometimes misinterpreted, which can lead to incorrect interpretations and conclusions within individual studies, and can propagate to wider-spread errors in the general understanding of a topic. The primary issue revolves around the interpretation of coefficients for independent variables when interaction terms are also included in the analyses. In this scenario, the coefficients associated with each independent variable are often interpreted as the independent effect of each predictor variable on the predicted variable. However, this interpretation is incorrect. The correct interpretation is that these coefficients represent the effect of each predictor variable on the predicted variable when all other predictor variables are zero. This difference may sound subtle, but the ramifications cannot be overstated. Here, my goals are to raise awareness of this issue, to demonstrate and emphasize the problems that can result and to provide alternative approaches for obtaining the desired information. © 2015 John Wiley & Sons Ltd.

  12. Applied linear regression

    CERN Document Server

    Weisberg, Sanford

    2013-01-01

    Praise for the Third Edition ""...this is an excellent book which could easily be used as a course text...""-International Statistical Institute The Fourth Edition of Applied Linear Regression provides a thorough update of the basic theory and methodology of linear regression modeling. Demonstrating the practical applications of linear regression analysis techniques, the Fourth Edition uses interesting, real-world exercises and examples. Stressing central concepts such as model building, understanding parameters, assessing fit and reliability, and drawing conclusions, the new edition illus

  13. Regression Models and Fuzzy Logic Prediction of TBM Penetration Rate

    Directory of Open Access Journals (Sweden)

    Minh Vu Trieu

    2017-03-01

    Full Text Available This paper presents statistical analyses of rock engineering properties and the measured penetration rate of tunnel boring machine (TBM based on the data of an actual project. The aim of this study is to analyze the influence of rock engineering properties including uniaxial compressive strength (UCS, Brazilian tensile strength (BTS, rock brittleness index (BI, the distance between planes of weakness (DPW, and the alpha angle (Alpha between the tunnel axis and the planes of weakness on the TBM rate of penetration (ROP. Four (4 statistical regression models (two linear and two nonlinear are built to predict the ROP of TBM. Finally a fuzzy logic model is developed as an alternative method and compared to the four statistical regression models. Results show that the fuzzy logic model provides better estimations and can be applied to predict the TBM performance. The R-squared value (R2 of the fuzzy logic model scores the highest value of 0.714 over the second runner-up of 0.667 from the multiple variables nonlinear regression model.

  14. Regression Models and Fuzzy Logic Prediction of TBM Penetration Rate

    Science.gov (United States)

    Minh, Vu Trieu; Katushin, Dmitri; Antonov, Maksim; Veinthal, Renno

    2017-03-01

    This paper presents statistical analyses of rock engineering properties and the measured penetration rate of tunnel boring machine (TBM) based on the data of an actual project. The aim of this study is to analyze the influence of rock engineering properties including uniaxial compressive strength (UCS), Brazilian tensile strength (BTS), rock brittleness index (BI), the distance between planes of weakness (DPW), and the alpha angle (Alpha) between the tunnel axis and the planes of weakness on the TBM rate of penetration (ROP). Four (4) statistical regression models (two linear and two nonlinear) are built to predict the ROP of TBM. Finally a fuzzy logic model is developed as an alternative method and compared to the four statistical regression models. Results show that the fuzzy logic model provides better estimations and can be applied to predict the TBM performance. The R-squared value (R2) of the fuzzy logic model scores the highest value of 0.714 over the second runner-up of 0.667 from the multiple variables nonlinear regression model.

  15. Reconstruction of missing daily streamflow data using dynamic regression models

    Science.gov (United States)

    Tencaliec, Patricia; Favre, Anne-Catherine; Prieur, Clémentine; Mathevet, Thibault

    2015-12-01

    River discharge is one of the most important quantities in hydrology. It provides fundamental records for water resources management and climate change monitoring. Even very short data-gaps in this information can cause extremely different analysis outputs. Therefore, reconstructing missing data of incomplete data sets is an important step regarding the performance of the environmental models, engineering, and research applications, thus it presents a great challenge. The objective of this paper is to introduce an effective technique for reconstructing missing daily discharge data when one has access to only daily streamflow data. The proposed procedure uses a combination of regression and autoregressive integrated moving average models (ARIMA) called dynamic regression model. This model uses the linear relationship between neighbor and correlated stations and then adjusts the residual term by fitting an ARIMA structure. Application of the model to eight daily streamflow data for the Durance river watershed showed that the model yields reliable estimates for the missing data in the time series. Simulation studies were also conducted to evaluate the performance of the procedure.

  16. High yield CTMP fibres as a possibility of the more efficient yield of wood raw material

    Directory of Open Access Journals (Sweden)

    Klašnja Bojana A.

    2004-01-01

    Full Text Available The evident shortage of wood as a raw material has become a limiting factor in the pulp and paper industry which is the greatest consumer of wood in Europe. The situation in our country is similar. During the few past years, the production of poplar and willow pulpwood was 220.000 m3 per year, which is insufficient for the planned increase in the production of sulphate pulp (175.000 tons till 2005. This paper deals with the aspects of the more efficient yield of raw material, based on the significantly higher yield of CTMP fibres, as well as with the significance of the lower adverse effect on the environment. It also analyses the conditions of production and the quality of the obtained fibres, as a possible substitute for chemical pulp and secondary fibres in papers of different quality. The main reasons for the production and use of CTMP fibres in our country are reported.

  17. Examining growth, yield and bean quality of Ethiopian coffee trees

    NARCIS (Netherlands)

    Bote, Adugna

    2016-01-01

    Coffee (Coffeaarabica L.)bean production and quality are determined by a diversity of interacting factors (e.g. shade, nitrogen, crop traits). Bean yield increases with increase in radiation, but adequate fertilizer suppliesare needed to sustain the productivity. This thesis analysed

  18. Cross-validation pitfalls when selecting and assessing regression and classification models.

    Science.gov (United States)

    Krstajic, Damjan; Buturovic, Ljubomir J; Leahy, David E; Thomas, Simon

    2014-03-29

    We address the problem of selecting and assessing classification and regression models using cross-validation. Current state-of-the-art methods can yield models with high variance, rendering them unsuitable for a number of practical applications including QSAR. In this paper we describe and evaluate best practices which improve reliability and increase confidence in selected models. A key operational component of the proposed methods is cloud computing which enables routine use of previously infeasible approaches. We describe in detail an algorithm for repeated grid-search V-fold cross-validation for parameter tuning in classification and regression, and we define a repeated nested cross-validation algorithm for model assessment. As regards variable selection and parameter tuning we define two algorithms (repeated grid-search cross-validation and double cross-validation), and provide arguments for using the repeated grid-search in the general case. We show results of our algorithms on seven QSAR datasets. The variation of the prediction performance, which is the result of choosing different splits of the dataset in V-fold cross-validation, needs to be taken into account when selecting and assessing classification and regression models. We demonstrate the importance of repeating cross-validation when selecting an optimal model, as well as the importance of repeating nested cross-validation when assessing a prediction error.

  19. SOCR Analyses: Implementation and Demonstration of a New Graphical Statistics Educational Toolkit

    Directory of Open Access Journals (Sweden)

    Annie Chu

    2009-04-01

    Full Text Available The web-based, Java-written SOCR (Statistical Online Computational Resource toolshave been utilized in many undergraduate and graduate level statistics courses for sevenyears now (Dinov 2006; Dinov et al. 2008b. It has been proven that these resourcescan successfully improve students' learning (Dinov et al. 2008b. Being rst publishedonline in 2005, SOCR Analyses is a somewhat new component and it concentrate on datamodeling for both parametric and non-parametric data analyses with graphical modeldiagnostics. One of the main purposes of SOCR Analyses is to facilitate statistical learn-ing for high school and undergraduate students. As we have already implemented SOCRDistributions and Experiments, SOCR Analyses and Charts fulll the rest of a standardstatistics curricula. Currently, there are four core components of SOCR Analyses. Linearmodels included in SOCR Analyses are simple linear regression, multiple linear regression,one-way and two-way ANOVA. Tests for sample comparisons include t-test in the para-metric category. Some examples of SOCR Analyses' in the non-parametric category areWilcoxon rank sum test, Kruskal-Wallis test, Friedman's test, Kolmogorov-Smirno testand Fligner-Killeen test. Hypothesis testing models include contingency table, Friedman'stest and Fisher's exact test. The last component of Analyses is a utility for computingsample sizes for normal distribution. In this article, we present the design framework,computational implementation and the utilization of SOCR Analyses.

  20. Understanding poisson regression.

    Science.gov (United States)

    Hayat, Matthew J; Higgins, Melinda

    2014-04-01

    Nurse investigators often collect study data in the form of counts. Traditional methods of data analysis have historically approached analysis of count data either as if the count data were continuous and normally distributed or with dichotomization of the counts into the categories of occurred or did not occur. These outdated methods for analyzing count data have been replaced with more appropriate statistical methods that make use of the Poisson probability distribution, which is useful for analyzing count data. The purpose of this article is to provide an overview of the Poisson distribution and its use in Poisson regression. Assumption violations for the standard Poisson regression model are addressed with alternative approaches, including addition of an overdispersion parameter or negative binomial regression. An illustrative example is presented with an application from the ENSPIRE study, and regression modeling of comorbidity data is included for illustrative purposes. Copyright 2014, SLACK Incorporated.

  1. Alternative Methods of Regression

    CERN Document Server

    Birkes, David

    2011-01-01

    Of related interest. Nonlinear Regression Analysis and its Applications Douglas M. Bates and Donald G. Watts ".an extraordinary presentation of concepts and methods concerning the use and analysis of nonlinear regression models.highly recommend[ed].for anyone needing to use and/or understand issues concerning the analysis of nonlinear regression models." --Technometrics This book provides a balance between theory and practice supported by extensive displays of instructive geometrical constructs. Numerous in-depth case studies illustrate the use of nonlinear regression analysis--with all data s

  2. Time series regression model for infectious disease and weather.

    Science.gov (United States)

    Imai, Chisato; Armstrong, Ben; Chalabi, Zaid; Mangtani, Punam; Hashizume, Masahiro

    2015-10-01

    Time series regression has been developed and long used to evaluate the short-term associations of air pollution and weather with mortality or morbidity of non-infectious diseases. The application of the regression approaches from this tradition to infectious diseases, however, is less well explored and raises some new issues. We discuss and present potential solutions for five issues often arising in such analyses: changes in immune population, strong autocorrelations, a wide range of plausible lag structures and association patterns, seasonality adjustments, and large overdispersion. The potential approaches are illustrated with datasets of cholera cases and rainfall from Bangladesh and influenza and temperature in Tokyo. Though this article focuses on the application of the traditional time series regression to infectious diseases and weather factors, we also briefly introduce alternative approaches, including mathematical modeling, wavelet analysis, and autoregressive integrated moving average (ARIMA) models. Modifications proposed to standard time series regression practice include using sums of past cases as proxies for the immune population, and using the logarithm of lagged disease counts to control autocorrelation due to true contagion, both of which are motivated from "susceptible-infectious-recovered" (SIR) models. The complexity of lag structures and association patterns can often be informed by biological mechanisms and explored by using distributed lag non-linear models. For overdispersed models, alternative distribution models such as quasi-Poisson and negative binomial should be considered. Time series regression can be used to investigate dependence of infectious diseases on weather, but may need modifying to allow for features specific to this context. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  3. Establishing a Mathematical Equations and Improving the Production of L-tert-Leucine by Uniform Design and Regression Analysis.

    Science.gov (United States)

    Jiang, Wei; Xu, Chao-Zhen; Jiang, Si-Zhi; Zhang, Tang-Duo; Wang, Shi-Zhen; Fang, Bai-Shan

    2017-04-01

    L-tert-Leucine (L-Tle) and its derivatives are extensively used as crucial building blocks for chiral auxiliaries, pharmaceutically active ingredients, and ligands. Combining with formate dehydrogenase (FDH) for regenerating the expensive coenzyme NADH, leucine dehydrogenase (LeuDH) is continually used for synthesizing L-Tle from α-keto acid. A multilevel factorial experimental design was executed for research of this system. In this work, an efficient optimization method for improving the productivity of L-Tle was developed. And the mathematical model between different fermentation conditions and L-Tle yield was also determined in the form of the equation by using uniform design and regression analysis. The multivariate regression equation was conveniently implemented in water, with a space time yield of 505.9 g L -1  day -1 and an enantiomeric excess value of >99 %. These results demonstrated that this method might become an ideal protocol for industrial production of chiral compounds and unnatural amino acids such as chiral drug intermediates.

  4. Building a new predictor for multiple linear regression technique-based corrective maintenance turnaround time.

    Science.gov (United States)

    Cruz, Antonio M; Barr, Cameron; Puñales-Pozo, Elsa

    2008-01-01

    This research's main goals were to build a predictor for a turnaround time (TAT) indicator for estimating its values and use a numerical clustering technique for finding possible causes of undesirable TAT values. The following stages were used: domain understanding, data characterisation and sample reduction and insight characterisation. Building the TAT indicator multiple linear regression predictor and clustering techniques were used for improving corrective maintenance task efficiency in a clinical engineering department (CED). The indicator being studied was turnaround time (TAT). Multiple linear regression was used for building a predictive TAT value model. The variables contributing to such model were clinical engineering department response time (CE(rt), 0.415 positive coefficient), stock service response time (Stock(rt), 0.734 positive coefficient), priority level (0.21 positive coefficient) and service time (0.06 positive coefficient). The regression process showed heavy reliance on Stock(rt), CE(rt) and priority, in that order. Clustering techniques revealed the main causes of high TAT values. This examination has provided a means for analysing current technical service quality and effectiveness. In doing so, it has demonstrated a process for identifying areas and methods of improvement and a model against which to analyse these methods' effectiveness.

  5. A classical regression framework for mediation analysis: fitting one model to estimate mediation effects.

    Science.gov (United States)

    Saunders, Christina T; Blume, Jeffrey D

    2017-10-26

    Mediation analysis explores the degree to which an exposure's effect on an outcome is diverted through a mediating variable. We describe a classical regression framework for conducting mediation analyses in which estimates of causal mediation effects and their variance are obtained from the fit of a single regression model. The vector of changes in exposure pathway coefficients, which we named the essential mediation components (EMCs), is used to estimate standard causal mediation effects. Because these effects are often simple functions of the EMCs, an analytical expression for their model-based variance follows directly. Given this formula, it is instructive to revisit the performance of routinely used variance approximations (e.g., delta method and resampling methods). Requiring the fit of only one model reduces the computation time required for complex mediation analyses and permits the use of a rich suite of regression tools that are not easily implemented on a system of three equations, as would be required in the Baron-Kenny framework. Using data from the BRAIN-ICU study, we provide examples to illustrate the advantages of this framework and compare it with the existing approaches. © The Author 2017. Published by Oxford University Press.

  6. Effective behaviour change techniques for physical activity and healthy eating in overweight and obese adults; systematic review and meta-regression analyses.

    Science.gov (United States)

    Samdal, Gro Beate; Eide, Geir Egil; Barth, Tom; Williams, Geoffrey; Meland, Eivind

    2017-03-28

    This systematic review aims to explain the heterogeneity in results of interventions to promote physical activity and healthy eating for overweight and obese adults, by exploring the differential effects of behaviour change techniques (BCTs) and other intervention characteristics. The inclusion criteria specified RCTs with ≥ 12 weeks' duration, from January 2007 to October 2014, for adults (mean age ≥ 40 years, mean BMI ≥ 30). Primary outcomes were measures of healthy diet or physical activity. Two reviewers rated study quality, coded the BCTs, and collected outcome results at short (≤6 months) and long term (≥12 months). Meta-analyses and meta-regressions were used to estimate effect sizes (ES), heterogeneity indices (I 2 ) and regression coefficients. We included 48 studies containing a total of 82 outcome reports. The 32 long term reports had an overall ES = 0.24 with 95% confidence interval (CI): 0.15 to 0.33 and I 2  = 59.4%. The 50 short term reports had an ES = 0.37 with 95% CI: 0.26 to 0.48, and I 2  = 71.3%. The number of BCTs unique to the intervention group, and the BCTs goal setting and self-monitoring of behaviour predicted the effect at short and long term. The total number of BCTs in both intervention arms and using the BCTs goal setting of outcome, feedback on outcome of behaviour, implementing graded tasks, and adding objects to the environment, e.g. using a step counter, significantly predicted the effect at long term. Setting a goal for change; and the presence of reporting bias independently explained 58.8% of inter-study variation at short term. Autonomy supportive and person-centred methods as in Motivational Interviewing, the BCTs goal setting of behaviour, and receiving feedback on the outcome of behaviour, explained all of the between study variations in effects at long term. There are similarities, but also differences in effective BCTs promoting change in healthy eating and physical activity and

  7. Statistical modelling for precision agriculture: A case study in optimal environmental schedules for Agaricus Bisporus production via variable domain functional regression

    Science.gov (United States)

    Panayi, Efstathios; Kyriakides, George

    2017-01-01

    Quantifying the effects of environmental factors over the duration of the growing process on Agaricus Bisporus (button mushroom) yields has been difficult, as common functional data analysis approaches require fixed length functional data. The data available from commercial growers, however, is of variable duration, due to commercial considerations. We employ a recently proposed regression technique termed Variable-Domain Functional Regression in order to be able to accommodate these irregular-length datasets. In this way, we are able to quantify the contribution of covariates such as temperature, humidity and water spraying volumes across the growing process, and for different lengths of growing processes. Our results indicate that optimal oxygen and temperature levels vary across the growing cycle and we propose environmental schedules for these covariates to optimise overall yields. PMID:28961254

  8. Investigation of Barely Grain Yield Improvement during the Last Half Century across Golestan Province

    Directory of Open Access Journals (Sweden)

    M Hajipoor

    2017-03-01

    Full Text Available Introduction Barely (Hordeum vulgare is the fourth most important cereal after wheat, corn and rice. Regarding the role of breeding to increase barley grain yield a large number of studies have been done in different countries. However, a few studies have been performed across Golestan Province, in Iran. Therefore, this study was conducted to know what barley traits have changed with grain yield during recent years. How these traits will further improve the barley grain yield in the future breeding programs? Materials and Methods In order to study barely grain yield improvement during the last half century across Golestan Province, this expriment carried out at randomized complete block desing with 4 replications in Gonbad kavous university research field in 2013-2014. Treatments were included nine barley cultivars: Sahra, Dasht, Torkaman, Gorgan4, Nimruz, Mahoor, Khoram, Reyhan and Yousef. We analysed the results using ANOVA in the statistical software package SAS. Step by step regression analysis and pathway analysis was done to evaluate the relative proportion of different traits on yield and direct and indirect impacts of yield components on grain yield, respectively. Results and Discussion The results showed that the values of the studied parameters were significantly different in different cultivars. Results illustrated that the hieghest and the lowest grain weight were related to Mahoor (37.33 mg and Torkaman (22.66 mg, respectively. Due to the high number of rows per spike in barely cultivars, grains are closer together and there are less space for growth and phothosynthetic material storage. In addition, total assimilation was not enough to fill the grain of cultivars which have the more grain numbers per spike and it caused low grain weight. Although thousand grain weight is among the main grain yield components with high heritability, it influenced by other components such as the number of spikes and its length. The highest and the

  9. Introduction to regression graphics

    CERN Document Server

    Cook, R Dennis

    2009-01-01

    Covers the use of dynamic and interactive computer graphics in linear regression analysis, focusing on analytical graphics. Features new techniques like plot rotation. The authors have composed their own regression code, using Xlisp-Stat language called R-code, which is a nearly complete system for linear regression analysis and can be utilized as the main computer program in a linear regression course. The accompanying disks, for both Macintosh and Windows computers, contain the R-code and Xlisp-Stat. An Instructor's Manual presenting detailed solutions to all the problems in the book is ava

  10. Automatic yield-line analysis of slabs using discontinuity layout optimization.

    Science.gov (United States)

    Gilbert, Matthew; He, Linwei; Smith, Colin C; Le, Canh V

    2014-08-08

    The yield-line method of analysis is a long established and extremely effective means of estimating the maximum load sustainable by a slab or plate. However, although numerous attempts to automate the process of directly identifying the critical pattern of yield-lines have been made over the past few decades, to date none has proved capable of reliably analysing slabs of arbitrary geometry. Here, it is demonstrated that the discontinuity layout optimization (DLO) procedure can successfully be applied to such problems. The procedure involves discretization of the problem using nodes inter-connected by potential yield-line discontinuities, with the critical layout of these then identified using linear programming. The procedure is applied to various benchmark problems, demonstrating that highly accurate solutions can be obtained, and showing that DLO provides a truly systematic means of directly and reliably automatically identifying yield-line patterns. Finally, since the critical yield-line patterns for many problems are found to be quite complex in form, a means of automatically simplifying these is presented.

  11. Multicollinearity in spatial genetics: separating the wheat from the chaff using commonality analyses.

    Science.gov (United States)

    Prunier, J G; Colyn, M; Legendre, X; Nimon, K F; Flamand, M C

    2015-01-01

    Direct gradient analyses in spatial genetics provide unique opportunities to describe the inherent complexity of genetic variation in wildlife species and are the object of many methodological developments. However, multicollinearity among explanatory variables is a systemic issue in multivariate regression analyses and is likely to cause serious difficulties in properly interpreting results of direct gradient analyses, with the risk of erroneous conclusions, misdirected research and inefficient or counterproductive conservation measures. Using simulated data sets along with linear and logistic regressions on distance matrices, we illustrate how commonality analysis (CA), a detailed variance-partitioning procedure that was recently introduced in the field of ecology, can be used to deal with nonindependence among spatial predictors. By decomposing model fit indices into unique and common (or shared) variance components, CA allows identifying the location and magnitude of multicollinearity, revealing spurious correlations and thus thoroughly improving the interpretation of multivariate regressions. Despite a few inherent limitations, especially in the case of resistance model optimization, this review highlights the great potential of CA to account for complex multicollinearity patterns in spatial genetics and identifies future applications and lines of research. We strongly urge spatial geneticists to systematically investigate commonalities when performing direct gradient analyses. © 2014 John Wiley & Sons Ltd.

  12. Optimizing rice yields while minimizing yield-scaled global warming potential.

    Science.gov (United States)

    Pittelkow, Cameron M; Adviento-Borbe, Maria A; van Kessel, Chris; Hill, James E; Linquist, Bruce A

    2014-05-01

    To meet growing global food demand with limited land and reduced environmental impact, agricultural greenhouse gas (GHG) emissions are increasingly evaluated with respect to crop productivity, i.e., on a yield-scaled as opposed to area basis. Here, we compiled available field data on CH4 and N2 O emissions from rice production systems to test the hypothesis that in response to fertilizer nitrogen (N) addition, yield-scaled global warming potential (GWP) will be minimized at N rates that maximize yields. Within each study, yield N surplus was calculated to estimate deficit or excess N application rates with respect to the optimal N rate (defined as the N rate at which maximum yield was achieved). Relationships between yield N surplus and GHG emissions were assessed using linear and nonlinear mixed-effects models. Results indicate that yields increased in response to increasing N surplus when moving from deficit to optimal N rates. At N rates contributing to a yield N surplus, N2 O and yield-scaled N2 O emissions increased exponentially. In contrast, CH4 emissions were not impacted by N inputs. Accordingly, yield-scaled CH4 emissions decreased with N addition. Overall, yield-scaled GWP was minimized at optimal N rates, decreasing by 21% compared to treatments without N addition. These results are unique compared to aerobic cropping systems in which N2 O emissions are the primary contributor to GWP, meaning yield-scaled GWP may not necessarily decrease for aerobic crops when yields are optimized by N fertilizer addition. Balancing gains in agricultural productivity with climate change concerns, this work supports the concept that high rice yields can be achieved with minimal yield-scaled GWP through optimal N application rates. Moreover, additional improvements in N use efficiency may further reduce yield-scaled GWP, thereby strengthening the economic and environmental sustainability of rice systems. © 2013 John Wiley & Sons Ltd.

  13. Regression analysis of growth responses to water depth in three wetland plant species

    DEFF Research Database (Denmark)

    Sorrell, Brian K; Tanner, Chris C; Brix, Hans

    2012-01-01

    depths from 0 – 0.5 m. Morphological and growth responses to depth were followed for 54 days before harvest, and then analysed by repeated measures analysis of covariance, and non-linear and quantile regression analysis (QRA), to compare flooding tolerances. Principal results Growth responses to depth...

  14. Time-trend of melanoma screening practice by primary care physicians: A meta-regression analysis

    OpenAIRE

    Valachis, Antonis; Mauri, Davide; Karampoiki, Vassiliki; Polyzos, Nikolaos P; Cortinovis, Ivan; Koukourakis, Georgios; Zacharias, Georgios; Xilomenos, Apostolos; Tsappi, Maria; Casazza, Giovanni

    2009-01-01

    Objective To assess whether the proportion of primary care physicians implementing full body skin examination (FBSE) to screen for melanoma changed over time. Methods Meta-regression analyses of available data. Data Sources: MEDLINE, ISI, Cochrane Central Register of Controlled Trials. Results Fifteen studies surveying 10,336 physicians were included in the analyses. Overall, 15%?82% of them reported to perform FBSE to screen for melanoma. The proportion of physicians using FBSE screening ten...

  15. Statistical evaluation of fuel yield and morphological variates for some promising energy plantation tree species in western Rajasthan

    Energy Technology Data Exchange (ETDEWEB)

    Kalla, J.C.

    1977-01-01

    Stepwise regression analysis suggested that tree height and collar diameter were, in general, the morphological parameters that most reliably predicted fuel yield in Acacia nilotica, A. tortilis, Albizzia lebbek, Azadirachta indica and Prosopis juliflora.

  16. Modelling spatial and temporal variations of annual suspended sediment yields from small agricultural catchments.

    Science.gov (United States)

    Rymszewicz, A; Bruen, M; O'Sullivan, J J; Turner, J N; Lawler, D M; Harrington, J R; Conroy, E; Kelly-Quinn, M

    2018-04-01

    Estimates of sediment yield are important for ecological and geomorphological assessment of fluvial systems and for assessment of soil erosion within a catchment. Many regulatory frameworks, such as the Convention for the Protection of the Marine Environment of the North-East Atlantic, derived from the Oslo and Paris Commissions (OSPAR) require reporting of annual sediment fluxes. While they may be measured in large rivers, sediment flux is rarely measured in smaller rivers. Measurements of sediment transport at a national scale can be also challenging and therefore, sediment yield models are often utilised by water resource managers for the predictions of sediment yields in the ungauged catchments. Regression based models, calibrated to field measurements, can offer an advantage over complex and computational models due to their simplicity, easy access to input data and due to the additional insights into factors controlling sediment export in the study sites. While traditionally calibrated to long-term average values of sediment yields such predictions cannot represent temporal variations. This study addresses this issue in a novel way by taking account of the variation from year to year in hydrological variables in the developed models (using annual mean runoff, annual mean flow, flows exceeded in five percentage of the time (Q5) and seasonal rainfall estimated separately for each year of observations). Other parameters included in the models represent spatial differences influenced by factors such as soil properties (% poorly drained soils and % peaty soils), land-use (% pasture or % arable lands), channel slope (S1085) and drainage network properties (drainage density). Catchment descriptors together with year-specific hydrological variables can explain both spatial differences and inter-annual variability of suspended sediment yields. The methodology is demonstrated by deriving equations from Irish data-sets (compiled in this study) with the best model

  17. Effects of diurnal temperature range and drought on wheat yield in Spain

    Science.gov (United States)

    Hernandez-Barrera, S.; Rodriguez-Puebla, C.; Challinor, A. J.

    2017-07-01

    This study aims to provide new insight on the wheat yield historical response to climate processes throughout Spain by using statistical methods. Our data includes observed wheat yield, pseudo-observations E-OBS for the period 1979 to 2014, and outputs of general circulation models in phase 5 of the Coupled Models Inter-comparison Project (CMIP5) for the period 1901 to 2099. In investigating the relationship between climate and wheat variability, we have applied the approach known as the partial least-square regression, which captures the relevant climate drivers accounting for variations in wheat yield. We found that drought occurring in autumn and spring and the diurnal range of temperature experienced during the winter are major processes to characterize the wheat yield variability in Spain. These observable climate processes are used for an empirical model that is utilized in assessing the wheat yield trends in Spain under different climate conditions. To isolate the trend within the wheat time series, we implemented the adaptive approach known as Ensemble Empirical Mode Decomposition. Wheat yields in the twenty-first century are experiencing a downward trend that we claim is a consequence of widespread drought over the Iberian Peninsula and an increase in the diurnal range of temperature. These results are important to inform about the wheat vulnerability in this region to coming changes and to develop adaptation strategies.

  18. Time series regression-based pairs trading in the Korean equities market

    Science.gov (United States)

    Kim, Saejoon; Heo, Jun

    2017-07-01

    Pairs trading is an instance of statistical arbitrage that relies on heavy quantitative data analysis to profit by capitalising low-risk trading opportunities provided by anomalies of related assets. A key element in pairs trading is the rule by which open and close trading triggers are defined. This paper investigates the use of time series regression to define the rule which has previously been identified with fixed threshold-based approaches. Empirical results indicate that our approach may yield significantly increased excess returns compared to ones obtained by previous approaches on large capitalisation stocks in the Korean equities market.

  19. A computational approach to compare regression modelling strategies in prediction research.

    Science.gov (United States)

    Pajouheshnia, Romin; Pestman, Wiebe R; Teerenstra, Steven; Groenwold, Rolf H H

    2016-08-25

    It is often unclear which approach to fit, assess and adjust a model will yield the most accurate prediction model. We present an extension of an approach for comparing modelling strategies in linear regression to the setting of logistic regression and demonstrate its application in clinical prediction research. A framework for comparing logistic regression modelling strategies by their likelihoods was formulated using a wrapper approach. Five different strategies for modelling, including simple shrinkage methods, were compared in four empirical data sets to illustrate the concept of a priori strategy comparison. Simulations were performed in both randomly generated data and empirical data to investigate the influence of data characteristics on strategy performance. We applied the comparison framework in a case study setting. Optimal strategies were selected based on the results of a priori comparisons in a clinical data set and the performance of models built according to each strategy was assessed using the Brier score and calibration plots. The performance of modelling strategies was highly dependent on the characteristics of the development data in both linear and logistic regression settings. A priori comparisons in four empirical data sets found that no strategy consistently outperformed the others. The percentage of times that a model adjustment strategy outperformed a logistic model ranged from 3.9 to 94.9 %, depending on the strategy and data set. However, in our case study setting the a priori selection of optimal methods did not result in detectable improvement in model performance when assessed in an external data set. The performance of prediction modelling strategies is a data-dependent process and can be highly variable between data sets within the same clinical domain. A priori strategy comparison can be used to determine an optimal logistic regression modelling strategy for a given data set before selecting a final modelling approach.

  20. Di-lepton yield from the decay of excited 28Si states

    International Nuclear Information System (INIS)

    Bacelar, J.C.; Buda, A.; Balanda, A.; Krasznahorkay, A.; Ploeg, H. van der; Sujkowski, Z.; Woude, A. van der

    1994-01-01

    The first dilepton yield measurements from excited nuclear states obtained with a new Positron-Electron Pair Spectroscopic Instrument (PEPSI) are reported. Nuclear states in 28 Si, with an initial excitation energy E * =50 MeV, were populated via the isospin T=0 reaction 4 He+ 24 Mg and the mixed-isospin 3 He+ 25 Mg reaction. In both reactions the dilepton (e + e - ) and photon decay yields were measured concurrently. An excess of counts in the e + e - spectrum, over the converted photon yield, is observed in the energy region above 15 MeV. An analyses is discussed whereby the observed excess counts are assumed to represent the isoscalar E0 strength in excited nuclear states. (orig.)

  1. Multivariate differential analyses of adolescents' experiences of aggression in families

    Directory of Open Access Journals (Sweden)

    Chris Myburgh

    2011-01-01

    Full Text Available Aggression is part of South African society and has implications for the mental health of persons living in South Africa. If parents are aggressive adolescents are also likely to be aggressive and that will impact negatively on their mental health. In this article the nature and extent of adolescents' experiences of aggression and aggressive behaviour in the family are investigated. A deductive explorative quantitative approach was followed. Aggression is reasoned to be dependent on aspects such as self-concept, moral reasoning, communication, frustration tolerance and family relationships. To analyse the data from questionnaires of 101 families (95 adolescents, 95 mothers and 91 fathers Cronbach Alpha, various consecutive first and second order factor analyses, correlations, multiple regression, MANOVA, ANOVA and Scheffè/ Dunnett tests were used. It was found that aggression correlated negatively with the independent variables; and the correlations between adolescents and their parents were significant. Regression analyses indicated that different predictors predicted aggression. Furthermore, differences between adolescents and their parents indicated that the experienced levels of aggression between adolescents and their parents were small. Implications for education are given.

  2. Effect of Integrated Nutrient Management on Yield and Yield ...

    African Journals Online (AJOL)

    Declining soil fertility is one of the major problems causing yield reduction of barley ... (VC) with inorganic NP on growth, yield and yield components of food barley. ... The experiments were laid out in a randomized complete block design with ...

  3. Preference learning with evolutionary Multivariate Adaptive Regression Spline model

    DEFF Research Database (Denmark)

    Abou-Zleikha, Mohamed; Shaker, Noor; Christensen, Mads Græsbøll

    2015-01-01

    This paper introduces a novel approach for pairwise preference learning through combining an evolutionary method with Multivariate Adaptive Regression Spline (MARS). Collecting users' feedback through pairwise preferences is recommended over other ranking approaches as this method is more appealing...... for function approximation as well as being relatively easy to interpret. MARS models are evolved based on their efficiency in learning pairwise data. The method is tested on two datasets that collectively provide pairwise preference data of five cognitive states expressed by users. The method is analysed...

  4. A random regression model in analysis of litter size in pigs | Lukovi& ...

    African Journals Online (AJOL)

    Dispersion parameters for number of piglets born alive (NBA) were estimated using a random regression model (RRM). Two data sets of litter records from the Nemščak farm in Slovenia were used for analyses. The first dataset (DS1) included records from the first to the sixth parity. The second dataset (DS2) was extended ...

  5. Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis.

    Science.gov (United States)

    Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

    2015-01-01

    Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.

  6. Spatially-explicit modeling of multi-scale drivers of aboveground forest biomass and water yield in watersheds of the Southeastern United States.

    Science.gov (United States)

    Ajaz Ahmed, Mukhtar Ahmed; Abd-Elrahman, Amr; Escobedo, Francisco J; Cropper, Wendell P; Martin, Timothy A; Timilsina, Nilesh

    2017-09-01

    Understanding ecosystem processes and the influence of regional scale drivers can provide useful information for managing forest ecosystems. Examining more local scale drivers of forest biomass and water yield can also provide insights for identifying and better understanding the effects of climate change and management on forests. We used diverse multi-scale datasets, functional models and Geographically Weighted Regression (GWR) to model ecosystem processes at the watershed scale and to interpret the influence of ecological drivers across the Southeastern United States (SE US). Aboveground forest biomass (AGB) was determined from available geospatial datasets and water yield was estimated using the Water Supply and Stress Index (WaSSI) model at the watershed level. Our geostatistical model examined the spatial variation in these relationships between ecosystem processes, climate, biophysical, and forest management variables at the watershed level across the SE US. Ecological and management drivers at the watershed level were analyzed locally to identify whether drivers contribute positively or negatively to aboveground forest biomass and water yield ecosystem processes and thus identifying potential synergies and tradeoffs across the SE US region. Although AGB and water yield drivers varied geographically across the study area, they were generally significantly influenced by climate (rainfall and temperature), land-cover factor1 (Water and barren), land-cover factor2 (wetland and forest), organic matter content high, rock depth, available water content, stand age, elevation, and LAI drivers. These drivers were positively or negatively associated with biomass or water yield which significantly contributes to ecosystem interactions or tradeoff/synergies. Our study introduced a spatially-explicit modelling framework to analyze the effect of ecosystem drivers on forest ecosystem structure, function and provision of services. This integrated model approach facilitates

  7. Statistical modelling of grapevine yield in the Port Wine region under present and future climate conditions

    Science.gov (United States)

    Santos, João A.; Malheiro, Aureliano C.; Karremann, Melanie K.; Pinto, Joaquim G.

    2011-03-01

    The impact of projected climate change on wine production was analysed for the Demarcated Region of Douro, Portugal. A statistical grapevine yield model (GYM) was developed using climate parameters as predictors. Statistically significant correlations were identified between annual yield and monthly mean temperatures and monthly precipitation totals during the growing cycle. These atmospheric factors control grapevine yield in the region, with the GYM explaining 50.4% of the total variance in the yield time series in recent decades. Anomalously high March rainfall (during budburst, shoot and inflorescence development) favours yield, as well as anomalously high temperatures and low precipitation amounts in May and June (May: flowering and June: berry development). The GYM was applied to a regional climate model output, which was shown to realistically reproduce the GYM predictors. Finally, using ensemble simulations under the A1B emission scenario, projections for GYM-derived yield in the Douro Region, and for the whole of the twenty-first century, were analysed. A slight upward trend in yield is projected to occur until about 2050, followed by a steep and continuous increase until the end of the twenty-first century, when yield is projected to be about 800 kg/ha above current values. While this estimate is based on meteorological parameters alone, changes due to elevated CO2 may further enhance this effect. In spite of the associated uncertainties, it can be stated that projected climate change may significantly benefit wine yield in the Douro Valley.

  8. Yield trends and yield gap analysis of major crops in the world

    OpenAIRE

    Hengsdijk, H.; Langeveld, J.W.A.

    2009-01-01

    This study aims to quantify the gap between current and potential yields of major crops in the world, and the production constraints that contribute to this yield gap. Using an expert-based evaluation of yield gaps and the literature, global and regional yields and yield trends of major crops are quantified, yield gaps evaluated by crop experts, current yield progress by breeding estimated, and different yield projections compared. Results show decreasing yield growth for wheat and rice, but ...

  9. The characteristics of high-yield genotype of early-mature mutant lines in barley

    International Nuclear Information System (INIS)

    Chen Xiulan; Han Yuepeng; He Zhentian; Yang Hefeng

    2000-01-01

    The correlation and genetic parameters of eight agronomic traits of 36 early mature mutant lines induced from barley Sunong 9052 were studied by stepwise regression and path analysis. The results showed that: (1) the growing period of early mutants was shortened 2-13 days from that of their parent and the trait of yield had a great mutation range; (2) the number of grain per panicle significantly correlated with the days from sowing to heading; (3) according to direct path coefficients, the main characters related with individual plant-yield were in order of productive panicle per plant > 1000-grain-weight > number of grain per panicle > fertility, the high-yield genotype had more productive panicle and higher 10000-grain-weight, and to increase the yield in the breeding of early mature mutation was to select the lines with more tillers and productive panicles, higher 1000-grain-weight and lower number of grain per panicle; (4) the higher broad-sense heritability and genetic variation coefficient were found in 1000-grain-weight and the days from sowing to heading

  10. Equity yields

    NARCIS (Netherlands)

    Vrugt, E.; van Binsbergen, J.H.; Koijen, R.S.J.; Hueskes, W.

    2013-01-01

    We study a new data set of dividend futures with maturities up to ten years across three world regions: the US, Europe, and Japan. We use these asset prices to construct equity yields, analogous to bond yields. We decompose the equity yields to obtain a term structure of expected dividend growth

  11. Yield Strength Testing in Human Cadaver Nasal Septal Cartilage and L-Strut Constructs.

    Science.gov (United States)

    Liu, Yuan F; Messinger, Kelton; Inman, Jared C

    2017-01-01

    To our knowledge, yield strength testing in human nasal septal cartilage has not been reported to date. An understanding of the basic mechanics of the nasal septum may help surgeons decide how much of an L-strut to preserve and how much grafting is needed. To determine the factors correlated with yield strength of the cartilaginous nasal septum and to explore the association between L-strut width and thickness in determining yield strength. In an anatomy laboratory, yield strength of rectangular pieces of fresh cadaver nasal septal cartilage was measured, and regression was performed to identify the factors correlated with yield strength. To measure yield strength in L-shaped models, 4 bonded paper L-struts models were constructed for every possible combination of the width and thickness, for a total of 240 models. Mathematical modeling using the resultant data with trend lines and surface fitting was performed to quantify the associations among L-strut width, thickness, and yield strength. The study dates were November 1, 2015, to April 1, 2016. The factors correlated with nasal cartilage yield strength and the associations among L-strut width, thickness, and yield strength in L-shaped models. Among 95 cartilage pieces from 12 human cadavers (mean [SD] age, 67.7 [12.6] years) and 240 constructed L-strut models, L-strut thickness was the only factor correlated with nasal septal cartilage yield strength (coefficient for thickness, 5.54; 95% CI, 4.08-7.00; P cadaver nasal septal cartilage, L-strut thickness was significantly associated with yield strength. In a bonded paper L-strut model, L-strut thickness had a more important role in determining yield strength than L-strut width. Surgeons should consider the thickness of potential L-struts when determining the amount of cartilaginous septum to harvest and graft. NA.

  12. Principal Covariates Clusterwise Regression (PCCR): Accounting for Multicollinearity and Population Heterogeneity in Hierarchically Organized Data.

    Science.gov (United States)

    Wilderjans, Tom Frans; Vande Gaer, Eva; Kiers, Henk A L; Van Mechelen, Iven; Ceulemans, Eva

    2017-03-01

    In the behavioral sciences, many research questions pertain to a regression problem in that one wants to predict a criterion on the basis of a number of predictors. Although in many cases, ordinary least squares regression will suffice, sometimes the prediction problem is more challenging, for three reasons: first, multiple highly collinear predictors can be available, making it difficult to grasp their mutual relations as well as their relations to the criterion. In that case, it may be very useful to reduce the predictors to a few summary variables, on which one regresses the criterion and which at the same time yields insight into the predictor structure. Second, the population under study may consist of a few unknown subgroups that are characterized by different regression models. Third, the obtained data are often hierarchically structured, with for instance, observations being nested into persons or participants within groups or countries. Although some methods have been developed that partially meet these challenges (i.e., principal covariates regression (PCovR), clusterwise regression (CR), and structural equation models), none of these methods adequately deals with all of them simultaneously. To fill this gap, we propose the principal covariates clusterwise regression (PCCR) method, which combines the key idea's behind PCovR (de Jong & Kiers in Chemom Intell Lab Syst 14(1-3):155-164, 1992) and CR (Späth in Computing 22(4):367-373, 1979). The PCCR method is validated by means of a simulation study and by applying it to cross-cultural data regarding satisfaction with life.

  13. A Remote Sensing-Derived Corn Yield Assessment Model

    Science.gov (United States)

    Shrestha, Ranjay Man

    be further associated with the actual yield. Utilizing satellite remote sensing products, such as daily NDVI derived from Moderate Resolution Imaging Spectroradiometer (MODIS) at 250 m pixel size, the crop yield estimation can be performed at a very fine spatial resolution. Therefore, this study examined the potential of these daily NDVI products within agricultural studies and crop yield assessments. In this study, a regression-based approach was proposed to estimate the annual corn yield through changes in MODIS daily NDVI time series. The relationship between daily NDVI and corn yield was well defined and established, and as changes in corn phenology and yield were directly reflected by the changes in NDVI within the growing season, these two entities were combined to develop a relational model. The model was trained using 15 years (2000-2014) of historical NDVI and county-level corn yield data for four major corn producing states: Kansas, Nebraska, Iowa, and Indiana, representing four climatic regions as South, West North Central, East North Central, and Central, respectively, within the U.S. Corn Belt area. The model's goodness of fit was well defined with a high coefficient of determination (R2>0.81). Similarly, using 2015 yield data for validation, 92% of average accuracy signified the performance of the model in estimating corn yield at county level. Besides providing the county-level corn yield estimations, the derived model was also accurate enough to estimate the yield at finer spatial resolution (field level). The model's assessment accuracy was evaluated using the randomly selected field level corn yield within the study area for 2014, 2015, and 2016. A total of over 120 plot level corn yield were used for validation, and the overall average accuracy was 87%, which statistically justified the model's capability to estimate plot-level corn yield. Additionally, the proposed model was applied to the impact estimation by examining the changes in corn yield

  14. Yield surfaces for perforated plates with square arrays of holes

    International Nuclear Information System (INIS)

    Bhattacharya, A.; Venkat Raj, V.

    2004-01-01

    A symmetric model of a perforated plate containing a 3x3 array of circular holes, arranged in a square pattern, was chosen and elastoplastic finite element analyses were carried out to determine the limit stresses for both pitch and diagonal directions of loading, for different values of biaxiality ratios. Plane stress conditions were assumed and the Tresca and von Mises yield criteria were employed to obtain two different sets of results. Yield surfaces were constructed and 'general cut-out factors' were determined for four different ligament efficiencies. The FEM results obtained by the authors using the Tresca and von Mises yield criteria were compared with the corresponding results of [J. Pressure Vessel Technol. Trans. ASME (1975) 146-154] and [J. Pressure Vessel Technol. Trans. ASME (1997) 122-126], respectively. The results based on the Tresca yield criterion obtained by the present authors and those in [J. Pressure Vessel Technol. Trans. ASME (1975) 146-154] were found to agree well with each other. In the case of the von Mises yield criterion, the agreement with [J. Pressure Vessel Technol. Trans. ASME 122-126] is generally good. The estimates based on the Tresca yield criterion are seen to result in lower values of limit stresses and cut-out factors as compared to those based on the von Mises yield criterion. The difference is attributed to the yield criterion and the flow rule chosen for the analysis. The shape as well as the size of the yield surface was found to depend on the ligament efficiency

  15. Application of digital-image-correlation techniques in analysing ...

    Indian Academy of Sciences (India)

    Basis theory of strain analysis using the digital image correlation method .... Type 304N Stainless Steel (Modulus of Elasticity = 193 MPa, Tensile Yield .... also proves the accuracy of the qualitative analyses by using the DIC ... We thank the National Science Council of Taiwan for supporting this research through grant. No.

  16. Application of support vector regression (SVR) for stream flow prediction on the Amazon basin

    CSIR Research Space (South Africa)

    Du Toit, Melise

    2016-10-01

    Full Text Available regression technique is used in this study to analyse historical stream flow occurrences and predict stream flow values for the Amazon basin. Up to twelve month predictions are made and the coefficient of determination and root-mean-square error are used...

  17. Economics and yields of energy plantations: Status and potential

    International Nuclear Information System (INIS)

    Kenney, W.A.; Gambles, R.L.; Zsuffa, L.

    1992-01-01

    A study was carried out to: determine the factors affecting the cost of energy conversion feedstocks in short rotation intensive culture plantations of trees; determine the factors influencing biomass yield; identify interrelationships between the previous two objectives; present estimates of potential biomass yields and associated economics; and to identify gaps in the knowledge of the economics and yields of biomass production and their interrelationships. Reported costs for most aspects had a wide range. Currently, yields of 10-15 dry Mg/hectare/y are readily achievable. Using the cost and yield data, and assuming a biomass price of $40/dry Mg, a series of cash flow analyses were performed. For the low cost inputs, all scenarios were marginally profitable. For the high cost inputs, none of the scenarios were profitable. A current scenario, using figures for contract farming, was not profitable, however this system would break even with a yield of 23.3 dry Mg/hectare/y, within the range of some production clones. A future scenario using farm labour with increased productivity, product values, and machinery efficiencies yielded a profit-making situation. The addition of incentives increased profitability. There is great potential for the production of woody biomass in Canada as a feedstock for energy and other products. Continued and more intensive breeding and selection to develop high yielding stress tolerant clones, cost efficient harvesting systems, continued research into optimization of planting density, rotation length and cultural techniques, and characterization of promising clones with respect to nutrient-use efficiency, site requirements and pest/disease resistance are important areas for further work. 81 refs., 3 figs., 13 tabs

  18. Grain Yield, Its Components, Genetic Diversity and Heritability in Chickpea (Cicer arietinum L.

    Directory of Open Access Journals (Sweden)

    M. Kakaei

    2015-09-01

    Full Text Available The current research was carried out to investigate grain yield and components and their genetic diversity and heritability of some important agronomic traits, in 19 chickpea genotypes, based on a randomized complete block design with 3 replications in Research Field of Bu-Ali Sina University, Hamadan, Iran in 2011-2012 growing seasons. The ANOVA results showed that, there were highly significant differences (p < 0.01 among genotypes for the SPAD number, number of sub-branch per plant, pod number per plant, 100-kernel weight, grain yield, biological yield, and harvest index. The mean comparisons results indicated that the genotypes 14, 12, 4 and 19 (with 234.7, 240, 250.3 and 259.4 kilogram of grain yield per ha, respectively and the genotypes 18, 8, 15, and 6 (with 151.01, 167.6, 167.8 and 189 kilogram of grain yield per ha, respectively had the maximum and minimum economic yield, respectively. According to phonotypical correlation results, there were positive and significant (p < 0.01 correlations between grain yield and pod number per plant (0.623**, plant height (0.432**, harvest index (0.425** and biomass (0.349**. Step-wise regression indicated that the pod number per plant, harvest index, biomass, number of sub-branch per plant, and plant height were the most effective traits on economic yield and they explained 84.68 percent of the variation in economic yield. Furthermore, harvest index and seed number per plant had the maximum and minimum heritability, respectively, indicating that they could be hired as sources of variation for improving the grain yield and selecting superior genotypes.

  19. Linear regression in astronomy. I

    Science.gov (United States)

    Isobe, Takashi; Feigelson, Eric D.; Akritas, Michael G.; Babu, Gutti Jogesh

    1990-01-01

    Five methods for obtaining linear regression fits to bivariate data with unknown or insignificant measurement errors are discussed: ordinary least-squares (OLS) regression of Y on X, OLS regression of X on Y, the bisector of the two OLS lines, orthogonal regression, and 'reduced major-axis' regression. These methods have been used by various researchers in observational astronomy, most importantly in cosmic distance scale applications. Formulas for calculating the slope and intercept coefficients and their uncertainties are given for all the methods, including a new general form of the OLS variance estimates. The accuracy of the formulas was confirmed using numerical simulations. The applicability of the procedures is discussed with respect to their mathematical properties, the nature of the astronomical data under consideration, and the scientific purpose of the regression. It is found that, for problems needing symmetrical treatment of the variables, the OLS bisector performs significantly better than orthogonal or reduced major-axis regression.

  20. Logic regression and its extensions.

    Science.gov (United States)

    Schwender, Holger; Ruczinski, Ingo

    2010-01-01

    Logic regression is an adaptive classification and regression procedure, initially developed to reveal interacting single nucleotide polymorphisms (SNPs) in genetic association studies. In general, this approach can be used in any setting with binary predictors, when the interaction of these covariates is of primary interest. Logic regression searches for Boolean (logic) combinations of binary variables that best explain the variability in the outcome variable, and thus, reveals variables and interactions that are associated with the response and/or have predictive capabilities. The logic expressions are embedded in a generalized linear regression framework, and thus, logic regression can handle a variety of outcome types, such as binary responses in case-control studies, numeric responses, and time-to-event data. In this chapter, we provide an introduction to the logic regression methodology, list some applications in public health and medicine, and summarize some of the direct extensions and modifications of logic regression that have been proposed in the literature. Copyright © 2010 Elsevier Inc. All rights reserved.

  1. Practical Aspects of Log-ratio Coordinate Representations in Regression with Compositional Response

    Directory of Open Access Journals (Sweden)

    Fišerová Eva

    2016-10-01

    Full Text Available Regression analysis with compositional response, observations carrying relative information, is an appropriate tool for statistical modelling in many scientific areas (e.g. medicine, geochemistry, geology, economics. Even though this technique has been recently intensively studied, there are still some practical aspects that deserve to be further analysed. Here we discuss the issue related to the coordinate representation of compositional data. It is shown that linear relation between particular orthonormal coordinates and centred log-ratio coordinates can be utilized to simplify the computation concerning regression parameters estimation and hypothesis testing. To enhance interpretation of regression parameters, the orthogonal coordinates and their relation with orthonormal and centred log-ratio coordinates are presented. Further we discuss the quality of prediction in different coordinate system. It is shown that the mean squared error (MSE for orthonormal coordinates is less or equal to the MSE for log-transformed data. Finally, an illustrative real-world example from geology is presented.

  2. Closing Yield Gaps: How Sustainable Can We Be?

    Directory of Open Access Journals (Sweden)

    Prajal Pradhan

    Full Text Available Global food production needs to be increased by 60-110% between 2005 and 2050 to meet growing food and feed demand. Intensification and/or expansion of agriculture are the two main options available to meet the growing crop demands. Land conversion to expand cultivated land increases GHG emissions and impacts biodiversity and ecosystem services. Closing yield gaps to attain potential yields may be a viable option to increase the global crop production. Traditional methods of agricultural intensification often have negative externalities. Therefore, there is a need to explore location-specific methods of sustainable agricultural intensification. We identified regions where the achievement of potential crop calorie production on currently cultivated land will meet the present and future food demand based on scenario analyses considering population growth and changes in dietary habits. By closing yield gaps in the current irrigated and rain-fed cultivated land, about 24% and 80% more crop calories can respectively be produced compared to 2000. Most countries will reach food self-sufficiency or improve their current food self-sufficiency levels if potential crop production levels are achieved. As a novel approach, we defined specific input and agricultural management strategies required to achieve the potential production by overcoming biophysical and socioeconomic constraints causing yield gaps. The management strategies include: fertilizers, pesticides, advanced soil management, land improvement, management strategies coping with weather induced yield variability, and improving market accessibility. Finally, we estimated the required fertilizers (N, P2O5, and K2O to attain the potential yields. Globally, N-fertilizer application needs to increase by 45-73%, P2O5-fertilizer by 22-46%, and K2O-fertilizer by 2-3 times compared to the year 2010 to attain potential crop production. The sustainability of such agricultural intensification largely depends

  3. Distillation time effect on lavender essential oil yield and composition.

    Science.gov (United States)

    Zheljazkov, Valtcho D; Cantrell, Charles L; Astatkie, Tess; Jeliazkova, Ekaterina

    2013-01-01

    Lavender (Lavandula angustifolia Mill.) is one of the most widely grown essential oil crops in the world. Commercial extraction of lavender oil is done using steam distillation. The objective of this study was to evaluate the effect of the length of the distillation time (DT) on lavender essential oil yield and composition when extracted from dried flowers. Therefore, the following distillation times (DT) were tested in this experiment: 1.5 min, 3 min, 3.75 min, 7.5 min, 15 min, 30 min, 60 min, 90 min, 120 min, 150 min, 180 min, and 240 min. The essential oil yield (range 0.5-6.8%) reached a maximum at 60 min DT. The concentrations of cineole (range 6.4-35%) and fenchol (range 1.7-2.9%) were highest at the 1.5 min DT and decreased with increasing length of the DT. The concentration of camphor (range 6.6-9.2%) reached a maximum at 7.5-15 min DT, while the concentration of linalool acetate (range 15-38%) reached a maximum at 30 min DT. Results suggest that lavender essential oil yield may not increase after 60 min DT. The change in essential oil yield, and the concentrations of cineole, fenchol and linalool acetate as DT changes were modeled very well by the asymptotic nonlinear regression model. DT may be used to modify the chemical profile of lavender oil and to obtain oils with differential chemical profiles from the same lavender flowers. DT must be taken into consideration when citing or comparing reports on lavender essential oil yield and composition.

  4. Tumor regression patterns in retinoblastoma

    International Nuclear Information System (INIS)

    Zafar, S.N.; Siddique, S.N.; Zaheer, N.

    2016-01-01

    To observe the types of tumor regression after treatment, and identify the common pattern of regression in our patients. Study Design: Descriptive study. Place and Duration of Study: Department of Pediatric Ophthalmology and Strabismus, Al-Shifa Trust Eye Hospital, Rawalpindi, Pakistan, from October 2011 to October 2014. Methodology: Children with unilateral and bilateral retinoblastoma were included in the study. Patients were referred to Pakistan Institute of Medical Sciences, Islamabad, for chemotherapy. After every cycle of chemotherapy, dilated funds examination under anesthesia was performed to record response of the treatment. Regression patterns were recorded on RetCam II. Results: Seventy-four tumors were included in the study. Out of 74 tumors, 3 were ICRB group A tumors, 43 were ICRB group B tumors, 14 tumors belonged to ICRB group C, and remaining 14 were ICRB group D tumors. Type IV regression was seen in 39.1% (n=29) tumors, type II in 29.7% (n=22), type III in 25.6% (n=19), and type I in 5.4% (n=4). All group A tumors (100%) showed type IV regression. Seventeen (39.5%) group B tumors showed type IV regression. In group C, 5 tumors (35.7%) showed type II regression and 5 tumors (35.7%) showed type IV regression. In group D, 6 tumors (42.9%) regressed to type II non-calcified remnants. Conclusion: The response and success of the focal and systemic treatment, as judged by the appearance of different patterns of tumor regression, varies with the ICRB grouping of the tumor. (author)

  5. Ranking contributing areas of salt and selenium in the Lower Gunnison River Basin, Colorado, using multiple linear regression models

    Science.gov (United States)

    Linard, Joshua I.

    2013-01-01

    Mitigating the effects of salt and selenium on water quality in the Grand Valley and lower Gunnison River Basin in western Colorado is a major concern for land managers. Previous modeling indicated means to improve the models by including more detailed geospatial data and a more rigorous method for developing the models. After evaluating all possible combinations of geospatial variables, four multiple linear regression models resulted that could estimate irrigation-season salt yield, nonirrigation-season salt yield, irrigation-season selenium yield, and nonirrigation-season selenium yield. The adjusted r-squared and the residual standard error (in units of log-transformed yield) of the models were, respectively, 0.87 and 2.03 for the irrigation-season salt model, 0.90 and 1.25 for the nonirrigation-season salt model, 0.85 and 2.94 for the irrigation-season selenium model, and 0.93 and 1.75 for the nonirrigation-season selenium model. The four models were used to estimate yields and loads from contributing areas corresponding to 12-digit hydrologic unit codes in the lower Gunnison River Basin study area. Each of the 175 contributing areas was ranked according to its estimated mean seasonal yield of salt and selenium.

  6. Analysis of Palm Oil Production, Export, and Government Consumption to Gross Domestic Product of Five Districts in West Kalimantan by Panel Regression

    Science.gov (United States)

    Sulistianingsih, E.; Kiftiah, M.; Rosadi, D.; Wahyuni, H.

    2017-04-01

    Gross Domestic Product (GDP) is an indicator of economic growth in a region. GDP is a panel data, which consists of cross-section and time series data. Meanwhile, panel regression is a tool which can be utilised to analyse panel data. There are three models in panel regression, namely Common Effect Model (CEM), Fixed Effect Model (FEM) and Random Effect Model (REM). The models will be chosen based on results of Chow Test, Hausman Test and Lagrange Multiplier Test. This research analyses palm oil about production, export, and government consumption to five district GDP are in West Kalimantan, namely Sanggau, Sintang, Sambas, Ketapang and Bengkayang by panel regression. Based on the results of analyses, it concluded that REM, which adjusted-determination-coefficient is 0,823, is the best model in this case. Also, according to the result, only Export and Government Consumption that influence GDP of the districts.

  7. Combining Alphas via Bounded Regression

    Directory of Open Access Journals (Sweden)

    Zura Kakushadze

    2015-11-01

    Full Text Available We give an explicit algorithm and source code for combining alpha streams via bounded regression. In practical applications, typically, there is insufficient history to compute a sample covariance matrix (SCM for a large number of alphas. To compute alpha allocation weights, one then resorts to (weighted regression over SCM principal components. Regression often produces alpha weights with insufficient diversification and/or skewed distribution against, e.g., turnover. This can be rectified by imposing bounds on alpha weights within the regression procedure. Bounded regression can also be applied to stock and other asset portfolio construction. We discuss illustrative examples.

  8. Distributed specific sediment yield estimations in Japan attributed to extreme-rainfall-induced slope failures under a changing climate

    Directory of Open Access Journals (Sweden)

    K. Ono

    2011-01-01

    Full Text Available The objective of this study was to estimate the potential sediment yield distribution in Japan attributed to extreme-rainfall-induced slope failures in the future. For this purpose, a regression relationship between the slope failure probability and the subsequent sediment yield was developed by using sediment yield observations from 59 dams throughout Japan. The slope failure probability accounts for the effects of topography (as relief energy, geology and hydro-climate variations (hydraulic gradient changes due to extreme rainfall variations and determines the potential slope failure occurrence with a 1-km resolution. The applicability of the developed relationship was then validated by comparing the simulated and observed sediment yields in another 43 dams. To incorporate the effects of a changing climate, extreme rainfall variations were estimated by using two climate change scenarios (the MRI-RCM20 Ver.2 model A2 scenario and the MIROC A1B scenario for the future and by accounting for the slope failure probability through the effect of extreme rainfall on the hydraulic gradient. Finally, the developed slope failure hazard-sediment yield relationship was employed to estimate the potential sediment yield distribution under a changing climate in Japan.

    Time series analyses of annual sediment yields covering 15–20 years in 59 dams reveal that extreme sedimentation events have a high probability of occurring on average every 5–7 years. Therefore, the extreme-rainfall-induced slope failure probability with a five-year return period has a statistically robust relationship with specific sediment yield observations (with r2 = 0.65. The verification demonstrated that the model is effective for use in simulating specific sediment yields with r2 = 0.74. The results of the GCM scenarios suggest that the sediment yield issue will be critical in Japan in the future. When the spatially averaged sediment

  9. UAV based mapping of variation in grassland yield for forage production in Arctic environments

    Science.gov (United States)

    Davids, C.; Karlsen, S. R.; Jørgensen, M.; Ancin Murguzur, F. J.

    2017-12-01

    Grassland cultivation for animal feed is the key agricultural activity in northern Norway. Even though the growing season has increased by at least a week in the last 30 years, grassland yields appear to have declined, probably due to more challenging winter conditions and changing agronomy practices. The ability for local and regional crop productivity forecasting would assist farmers with management decisions and would provide local and national authorities with a better overview over productivity and potential problems due to e.g. winter damage. Remote sensing technology has long been used to estimate and map the variability of various biophysical parameters, but calibration is important. In order to establish the relationship between spectral reflectance and grass yield in northern European environments we combine Sentinel-2 time series, UAV-based multispectral measurements, and ground-based spectroradiometry, with biomass analyses and observations of species composition. In this presentation we will focus on the results from the UAV data acquisition. We used a multirotor UAV with different sensors (a multispectral Rikola camera, and NDVI and RGB cameras) to image a number of cultivated grasslands of different age and productivity in northern Norway in June/July 2016 and 2017. Following UAV data acquisition, 10 to 20 in situ measurements were made per field using a FieldSpec3 (350-2500 nm). In addition, samples were taken to determine biomass and grass species composition. The imaging and sampling was done immediately prior to harvesting. The Rikola camera, when used as a stand-alone camera mounted on a UAV, can collect 15 bands with a spectral width of 10-15 nm in the range between 500-890 nm. In the initial analysis of the 2016 data we investigated how well different vegetation indices correlated with biomass and showed that vegetation indices that include red edge bands perform better than widely used indices such as NDVI. We will extend the analysis with

  10. riskRegression

    DEFF Research Database (Denmark)

    Ozenne, Brice; Sørensen, Anne Lyngholm; Scheike, Thomas

    2017-01-01

    In the presence of competing risks a prediction of the time-dynamic absolute risk of an event can be based on cause-specific Cox regression models for the event and the competing risks (Benichou and Gail, 1990). We present computationally fast and memory optimized C++ functions with an R interface...... for predicting the covariate specific absolute risks, their confidence intervals, and their confidence bands based on right censored time to event data. We provide explicit formulas for our implementation of the estimator of the (stratified) baseline hazard function in the presence of tied event times. As a by...... functionals. The software presented here is implemented in the riskRegression package....

  11. Econometric analysis of realized covariation: high frequency based covariance, regression, and correlation in financial economics

    DEFF Research Database (Denmark)

    Barndorff-Nielsen, Ole Eiler; Shephard, N.

    2004-01-01

    This paper analyses multivariate high frequency financial data using realized covariation. We provide a new asymptotic distribution theory for standard methods such as regression, correlation analysis, and covariance. It will be based on a fixed interval of time (e.g., a day or week), allowing...... the number of high frequency returns during this period to go to infinity. Our analysis allows us to study how high frequency correlations, regressions, and covariances change through time. In particular we provide confidence intervals for each of these quantities....

  12. Adaptabilidade e estabilidade via regressão não paramétrica em genótipos de café Adaptability and stability based on nonparametric regression in coffee genotypes

    Directory of Open Access Journals (Sweden)

    Moysés Nascimento

    2010-01-01

    Full Text Available O objetivo deste trabalho foi avaliar uma metodologia de análise de adaptabilidade e estabilidade fenotípica de genótipos de café baseada em regressão não paramétrica. A técnica utilizada difere das demais, pois reduz a influência na estimação do parâmetro de adaptabilidade de algum ponto extremo, ocasionado pela presença de genótipos com respostas demasiadamente diferenciadas a determinado ambiente. Foram utilizados dados provenientes de um experimento sobre produtividade média de grãos de 40 genótipos de café (Coffea canephora, com delineamento em blocos ao acaso, com seis repetições. Os genótipos foram avaliados em cinco anos (1996, 1998, 1999, 2000 e 2001, em dois locais (Sooretama e Marilândia, ES no total de dez ambientes. A metodologia proposta demonstrou ser adequada e eficiente, pois extingue os efeitos impróprios induzidos pela presença de pontos extremos e evita a recomendação incorreta de genótipos quanto à adaptabilidade.The objective of this work was to evaluate a methodology of phenotypic adaptability and stability analyses of coffee genotypes based on nonparametric regression. The technique used differs from other techniques because it reduces the influence of extreme points resulting from the presence of genotypes whose answers to a certain environment are too different on the estimation of the adaptability parameter. Data from an experiment studying the average yield of 40 coffee (Coffea canephora genotypes in a randomized block design with six replicates were used to evaluate the method. The genotypes were evaluated along five years (1996, 1998, 1999, 2000 and 2001 in two locations (Sooretama and Marilândia, ES, Brazil, in a total of ten environments. The methodology proposed proved adequate and efficient, since it eliminates the disproportionate effects induced by the presence of extreme points and avoids misleading recommendations of genotypes in terms of adaptability.

  13. Identifying the most promising genotypes in lentil for cultivation in a wide range of environments of Pakistan using various yield stability measures

    International Nuclear Information System (INIS)

    Ali, A.; Zahid, M.A.

    2012-01-01

    The present study was aimed to identify the most promising high yielding lentil genotype for a wide range of environments of Pakistan using 8 stability measures. The experiment consisted of 12 lentil genotypes grown at 11 locations falling in different agro-ecological zones of Pakistan for 2 years during 2006/07 and 2007/08 under national uniform yield testing. The General Linear Model (GLM) of MINITAB (version 15) was used for two-way analysis of variance for lentil yield data to examine the total variation into genotypes, environments and genotype x environment interaction. The percent variation of 2 major contributors, environment and GxE interaction, was permissible to perform stability analysis to evaluate stable genotypes across the environments. The genotype x environment interaction means were used for eight stability measures (genotype mean, genotype variance, coefficient of variation, ecovalence, interaction variance, regression slope, deviation mean square, coefficient of determination). The stability measures depicted that the genotype NARC-06-1 with high mean yield (1140 kg/ha -1/), regression slope (1.09) close to unity and less statistics of remaining stability measures except high value of R/sup 2/ for yield proved to be the best within the pool of studied genotypes. The results clearly suggest that the genotype NARC-06-1 may prove to be a widely adapted high yielding stable variety for a broad spectrum of environments of Pakistan. (author)

  14. Fission yield calculation using toy model based on Monte Carlo simulation

    International Nuclear Information System (INIS)

    Jubaidah; Kurniadi, Rizal

    2015-01-01

    Toy model is a new approximation in predicting fission yield distribution. Toy model assumes nucleus as an elastic toy consist of marbles. The number of marbles represents the number of nucleons, A. This toy nucleus is able to imitate the real nucleus properties. In this research, the toy nucleons are only influenced by central force. A heavy toy nucleus induced by a toy nucleon will be split into two fragments. These two fission fragments are called fission yield. In this research, energy entanglement is neglected. Fission process in toy model is illustrated by two Gaussian curves intersecting each other. There are five Gaussian parameters used in this research. They are scission point of the two curves (R c ), mean of left curve (μ L ) and mean of right curve (μ R ), deviation of left curve (σ L ) and deviation of right curve (σ R ). The fission yields distribution is analyses based on Monte Carlo simulation. The result shows that variation in σ or µ can significanly move the average frequency of asymmetry fission yields. This also varies the range of fission yields distribution probability. In addition, variation in iteration coefficient only change the frequency of fission yields. Monte Carlo simulation for fission yield calculation using toy model successfully indicates the same tendency with experiment results, where average of light fission yield is in the range of 90yield is in about 135

  15. Fission yield calculation using toy model based on Monte Carlo simulation

    Energy Technology Data Exchange (ETDEWEB)

    Jubaidah, E-mail: jubaidah@student.itb.ac.id [Nuclear Physics and Biophysics Division, Department of Physics, Bandung Institute of Technology. Jl. Ganesa No. 10 Bandung – West Java, Indonesia 40132 (Indonesia); Physics Department, Faculty of Mathematics and Natural Science – State University of Medan. Jl. Willem Iskandar Pasar V Medan Estate – North Sumatera, Indonesia 20221 (Indonesia); Kurniadi, Rizal, E-mail: rijalk@fi.itb.ac.id [Nuclear Physics and Biophysics Division, Department of Physics, Bandung Institute of Technology. Jl. Ganesa No. 10 Bandung – West Java, Indonesia 40132 (Indonesia)

    2015-09-30

    Toy model is a new approximation in predicting fission yield distribution. Toy model assumes nucleus as an elastic toy consist of marbles. The number of marbles represents the number of nucleons, A. This toy nucleus is able to imitate the real nucleus properties. In this research, the toy nucleons are only influenced by central force. A heavy toy nucleus induced by a toy nucleon will be split into two fragments. These two fission fragments are called fission yield. In this research, energy entanglement is neglected. Fission process in toy model is illustrated by two Gaussian curves intersecting each other. There are five Gaussian parameters used in this research. They are scission point of the two curves (R{sub c}), mean of left curve (μ{sub L}) and mean of right curve (μ{sub R}), deviation of left curve (σ{sub L}) and deviation of right curve (σ{sub R}). The fission yields distribution is analyses based on Monte Carlo simulation. The result shows that variation in σ or µ can significanly move the average frequency of asymmetry fission yields. This also varies the range of fission yields distribution probability. In addition, variation in iteration coefficient only change the frequency of fission yields. Monte Carlo simulation for fission yield calculation using toy model successfully indicates the same tendency with experiment results, where average of light fission yield is in the range of 90yield is in about 135

  16. Regression in autistic spectrum disorders.

    Science.gov (United States)

    Stefanatos, Gerry A

    2008-12-01

    A significant proportion of children diagnosed with Autistic Spectrum Disorder experience a developmental regression characterized by a loss of previously-acquired skills. This may involve a loss of speech or social responsitivity, but often entails both. This paper critically reviews the phenomena of regression in autistic spectrum disorders, highlighting the characteristics of regression, age of onset, temporal course, and long-term outcome. Important considerations for diagnosis are discussed and multiple etiological factors currently hypothesized to underlie the phenomenon are reviewed. It is argued that regressive autistic spectrum disorders can be conceptualized on a spectrum with other regressive disorders that may share common pathophysiological features. The implications of this viewpoint are discussed.

  17. Mathematical and statistical analysis of the effect of boron on yield parameters of wheat

    Energy Technology Data Exchange (ETDEWEB)

    Rawashdeh, Hamzeh [Water Management and Environment Research Department, National Center for Agricultural Research and Extension, P.O. Box 639, Baqa 19381 (Jordan); Sala, Florin [Soil Science and Plant Nutrition, Faculty of Agriculture, Banat University of Agricultural Sciences and Veterinary Medicine “Regele Mihai I al României” from Timişoara, Timişoara, 300645 (Romania); Boldea, Marius [Mathematics and Statistics, Faculty of Agriculture, Banat University of Agricultural Sciences and Veterinary Medicine “Regele Mihai I al României” from Timisoara, Timişoara, 300645 (Romania)

    2015-03-10

    The main objective of this research is to investigate the effect of foliar applications of boron at different growth stages on yield and yield parameters of wheat. The contribution of boron in achieving yield parameters is described by second degree polynomial equations, with high statistical confidence (p<0.01; F theoretical < F calculated, according to ANOVA test, for Alfa = 0.05). Regression analysis, based on R{sup 2} values obtained, made it possible to evaluate the particular contribution of boron to the realization of yield parameters. This was lower for spike length (R{sup 2} = 0.812), thousand seeds weight (R{sup 2} = 0.850) and higher in the case of the number of spikelets (R{sup 2} = 0.936) and the number of seeds on a spike (R{sup 2} = 0.960). These results confirm that boron plays an important part in achieving the number of seeds on a spike in the case of wheat, as the contribution of this element to the process of flower fertilization is well-known. In regards to productivity elements, the contribution of macroelements to yield quantity is clear, the contribution of B alone being R{sup 2} = 0.868.

  18. Item Response Theory Modeling and Categorical Regression Analyses of the Five-Factor Model Rating Form: A Study on Italian Community-Dwelling Adolescent Participants and Adult Participants.

    Science.gov (United States)

    Fossati, Andrea; Widiger, Thomas A; Borroni, Serena; Maffei, Cesare; Somma, Antonella

    2017-06-01

    To extend the evidence on the reliability and construct validity of the Five-Factor Model Rating Form (FFMRF) in its self-report version, two independent samples of Italian participants, which were composed of 510 adolescent high school students and 457 community-dwelling adults, respectively, were administered the FFMRF in its Italian translation. Adolescent participants were also administered the Italian translation of the Borderline Personality Features Scale for Children-11 (BPFSC-11), whereas adult participants were administered the Italian translation of the Triarchic Psychopathy Measure (TriPM). Cronbach α values were consistent with previous findings; in both samples, average interitem r values indicated acceptable internal consistency for all FFMRF scales. A multidimensional graded item response theory model indicated that the majority of FFMRF items had adequate discrimination parameters; information indices supported the reliability of the FFMRF scales. Both categorical (i.e., item-level) and scale-level regression analyses suggested that the FFMRF scores may predict a nonnegligible amount of variance in the BPFSC-11 total score in adolescent participants, and in the TriPM scale scores in adult participants.

  19. Understanding logistic regression analysis

    OpenAIRE

    Sperandei, Sandro

    2014-01-01

    Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using ex...

  20. Evidence for a climate signal in trends of global crop yield variability over the past 50 years

    International Nuclear Information System (INIS)

    Osborne, T M; Wheeler, T R

    2013-01-01

    Low variability of crop production from year to year is desirable for many reasons, including reduced income risk and stability of supplies. Therefore, it is important to understand the nature of yield variability, whether it is changing through time, and how it varies between crops and regions. Previous studies have shown that national crop yield variability has changed in the past, with the direction and magnitude dependent on crop type and location. Whilst such studies acknowledge the importance of climate variability in determining yield variability, it has been assumed that its magnitude and its effect on crop production have not changed through time and, hence, that changes to yield variability have been due to non-climatic factors. We address this assumption by jointly examining yield and climate variability for three major crops (rice, wheat and maize) over the past 50 years. National yield time series and growing season temperature and precipitation were de-trended and related using multiple linear regression. Yield variability changed significantly in half of the crop–country combinations examined. For several crop–country combinations, changes in yield variability were related to changes in climate variability. (letter)

  1. Mid-Season High-Resolution Satellite Imagery for Forecasting Site-Specific Corn Yield

    Directory of Open Access Journals (Sweden)

    Nahuel R. Peralta

    2016-10-01

    Full Text Available A timely and accurate crop yield forecast is crucial to make better decisions on crop management, marketing, and storage by assessing ahead and implementing based on expected crop performance. The objective of this study was to investigate the potential of high-resolution satellite imagery data collected at mid-growing season for identification of within-field variability and to forecast corn yield at different sites within a field. A test was conducted on yield monitor data and RapidEye satellite imagery obtained for 22 cornfields located in five different counties (Clay, Dickinson, Rice, Saline, and Washington of Kansas (total of 457 ha. Three basic tests were conducted on the data: (1 spatial dependence on each of the yield and vegetation indices (VIs using Moran’s I test; (2 model selection for the relationship between imagery data and actual yield using ordinary least square regression (OLS and spatial econometric (SPL models; and (3 model validation for yield forecasting purposes. Spatial autocorrelation analysis (Moran’s I test for both yield and VIs (red edge NDVI = NDVIre, normalized difference vegetation index = NDVIr, SRre = red-edge simple ratio, near infrared = NIR and green-NDVI = NDVIG was tested positive and statistically significant for most of the fields (p < 0.05, except for one. Inclusion of spatial adjustment to model improved the model fit on most fields as compared to OLS models, with the spatial adjustment coefficient significant for half of the fields studied. When selected models were used for prediction to validate dataset, a striking similarity (RMSE = 0.02 was obtained between predicted and observed yield within a field. Yield maps could assist implementing more effective site-specific management tools and could be utilized as a proxy of yield monitor data. In summary, high-resolution satellite imagery data can be reasonably used to forecast yield via utilization of models that include spatial adjustment to

  2. Nonparametric regression using the concept of minimum energy

    International Nuclear Information System (INIS)

    Williams, Mike

    2011-01-01

    It has recently been shown that an unbinned distance-based statistic, the energy, can be used to construct an extremely powerful nonparametric multivariate two sample goodness-of-fit test. An extension to this method that makes it possible to perform nonparametric regression using multiple multivariate data sets is presented in this paper. The technique, which is based on the concept of minimizing the energy of the system, permits determination of parameters of interest without the need for parametric expressions of the parent distributions of the data sets. The application and performance of this new method is discussed in the context of some simple example analyses.

  3. Linear regression in astronomy. II

    Science.gov (United States)

    Feigelson, Eric D.; Babu, Gutti J.

    1992-01-01

    A wide variety of least-squares linear regression procedures used in observational astronomy, particularly investigations of the cosmic distance scale, are presented and discussed. The classes of linear models considered are (1) unweighted regression lines, with bootstrap and jackknife resampling; (2) regression solutions when measurement error, in one or both variables, dominates the scatter; (3) methods to apply a calibration line to new data; (4) truncated regression models, which apply to flux-limited data sets; and (5) censored regression models, which apply when nondetections are present. For the calibration problem we develop two new procedures: a formula for the intercept offset between two parallel data sets, which propagates slope errors from one regression to the other; and a generalization of the Working-Hotelling confidence bands to nonstandard least-squares lines. They can provide improved error analysis for Faber-Jackson, Tully-Fisher, and similar cosmic distance scale relations.

  4. A Matlab program for stepwise regression

    Directory of Open Access Journals (Sweden)

    Yanhong Qi

    2016-03-01

    Full Text Available The stepwise linear regression is a multi-variable regression for identifying statistically significant variables in the linear regression equation. In present study, we presented the Matlab program of stepwise regression.

  5. IMPROVING CORRELATION FUNCTION FITTING WITH RIDGE REGRESSION: APPLICATION TO CROSS-CORRELATION RECONSTRUCTION

    International Nuclear Information System (INIS)

    Matthews, Daniel J.; Newman, Jeffrey A.

    2012-01-01

    Cross-correlation techniques provide a promising avenue for calibrating photometric redshifts and determining redshift distributions using spectroscopy which is systematically incomplete (e.g., current deep spectroscopic surveys fail to obtain secure redshifts for 30%-50% or more of the galaxies targeted). In this paper, we improve on the redshift distribution reconstruction methods from our previous work by incorporating full covariance information into our correlation function fits. Correlation function measurements are strongly covariant between angular or spatial bins, and accounting for this in fitting can yield substantial reduction in errors. However, frequently the covariance matrices used in these calculations are determined from a relatively small set (dozens rather than hundreds) of subsamples or mock catalogs, resulting in noisy covariance matrices whose inversion is ill-conditioned and numerically unstable. We present here a method of conditioning the covariance matrix known as ridge regression which results in a more well behaved inversion than other techniques common in large-scale structure studies. We demonstrate that ridge regression significantly improves the determination of correlation function parameters. We then apply these improved techniques to the problem of reconstructing redshift distributions. By incorporating full covariance information, applying ridge regression, and changing the weighting of fields in obtaining average correlation functions, we obtain reductions in the mean redshift distribution reconstruction error of as much as ∼40% compared to previous methods. We provide a description of POWERFIT, an IDL code for performing power-law fits to correlation functions with ridge regression conditioning that we are making publicly available.

  6. Fission yields data generation and benchmarks of decay heat estimation of a nuclear fuel

    Science.gov (United States)

    Gil, Choong-Sup; Kim, Do Heon; Yoo, Jae Kwon; Lee, Jounghwa

    2017-09-01

    Fission yields data with the ENDF-6 format of 235U, 239Pu, and several actinides dependent on incident neutron energies have been generated using the GEF code. In addition, fission yields data libraries of ORIGEN-S, -ARP modules in the SCALE code, have been generated with the new data. The decay heats by ORIGEN-S using the new fission yields data have been calculated and compared with the measured data for validation in this study. The fission yields data ORIGEN-S libraries based on ENDF/B-VII.1, JEFF-3.1.1, and JENDL/FPY-2011 have also been generated, and decay heats were calculated using the ORIGEN-S libraries for analyses and comparisons.

  7. Yield, yield components and dry matter digestibility of alfalfa experimental populations

    Directory of Open Access Journals (Sweden)

    Katić Slobodan

    2010-01-01

    Full Text Available Alfalfa is the most important forage crop grown in the temperate regions. It is cultivated for production of vegetative aerial mass used fresh or as hay, and recently as haylage and silage. In many centres worldwide, efforts are made to breed and create new alfalfa cultivars with both higher yields and of higher nutritional value. The aim of this paper was to determine yield and digestibility of 12 experimental populations of alfalfa, and to compare their results to the yields of well-known domestic alfalfa commercial cultivars. The results show significant differences in yield of green forage and dry matter among alfalfa populations, as well as in yield components, height, proportion of leaves in yield and growth rate (tab. 1, 2 and 3. Differences between in vitro digestible dry matter (% and yields of in vitro digestible dry matter (t ha-1 were also significant (tab. 5 and 6. Yield and quality of experimental populations were at the same level or higher than of control cultivars. Synthetic SINUSA exceeded the control cutivars (NS Mediana ZMS V and Banat VS in yield and quality of dry matter. .

  8. [Hyperspectral Estimation of Apple Tree Canopy LAI Based on SVM and RF Regression].

    Science.gov (United States)

    Han, Zhao-ying; Zhu, Xi-cun; Fang, Xian-yi; Wang, Zhuo-yuan; Wang, Ling; Zhao, Geng-Xing; Jiang, Yuan-mao

    2016-03-01

    Leaf area index (LAI) is the dynamic index of crop population size. Hyperspectral technology can be used to estimate apple canopy LAI rapidly and nondestructively. It can be provide a reference for monitoring the tree growing and yield estimation. The Red Fuji apple trees of full bearing fruit are the researching objects. Ninety apple trees canopies spectral reflectance and LAI values were measured by the ASD Fieldspec3 spectrometer and LAI-2200 in thirty orchards in constant two years in Qixia research area of Shandong Province. The optimal vegetation indices were selected by the method of correlation analysis of the original spectral reflectance and vegetation indices. The models of predicting the LAI were built with the multivariate regression analysis method of support vector machine (SVM) and random forest (RF). The new vegetation indices, GNDVI527, ND-VI676, RVI682, FD-NVI656 and GRVI517 and the previous two main vegetation indices, NDVI670 and NDVI705, are in accordance with LAI. In the RF regression model, the calibration set decision coefficient C-R2 of 0.920 and validation set decision coefficient V-R2 of 0.889 are higher than the SVM regression model by 0.045 and 0.033 respectively. The root mean square error of calibration set C-RMSE of 0.249, the root mean square error validation set V-RMSE of 0.236 are lower than that of the SVM regression model by 0.054 and 0.058 respectively. Relative analysis of calibrating error C-RPD and relative analysis of validation set V-RPD reached 3.363 and 2.520, 0.598 and 0.262, respectively, which were higher than the SVM regression model. The measured and predicted the scatterplot trend line slope of the calibration set and validation set C-S and V-S are close to 1. The estimation result of RF regression model is better than that of the SVM. RF regression model can be used to estimate the LAI of red Fuji apple trees in full fruit period.

  9. Slope Controls Grain Yield and Climatic Yield in Mountainous Yunnan province, China

    Science.gov (United States)

    Duan, X.; Rong, L.; Gu, Z.; Feng, D.

    2017-12-01

    Mountainous regions are increasingly vulnerable to food insecurity because of limited arable land, growing population pressure, and climate change. Development of sustainable mountain agriculture will require an increased understanding of the effects of environmental factors on grain and climatic yields. The objective of this study was to explore the relationships between actual grain yield, climatic yield, and environmental factors in a mountainous region in China. We collected data on the average grain yield per unit area in 119 counties in Yunnan province from 1985 to 2012, and chose 17 environmental factors for the same period. Our results showed that actual grain yield ranged from 1.43 to 6.92 t·ha-1, and the climatic yield ranged from -0.15 to -0.01 t·ha-1. Lower climatic yield but higher grain yield was generally found in central areas and at lower slopes and elevations in the western and southwestern counties of Yunnan province. Higher climatic yield but lower grain yield were found in northwestern parts of Yunnan province on steep slopes. Annual precipation and temperature had a weak influence on the climatic yield. Slope explained 44.62 and 26.29% of the variation in grain yield and climatic yield. The effects of topography on grain and climatic yields were greater than climatic factors. Slope was the most important environmental variable for the variability in climatic and grain yields in the mountainous Yunnan province due to the highly heterogeneous topographic conditions. Conversion of slopes to terraces in areas with higher climatic yields is an effective way to maintain grain production in response to climate variability. Additionally, soil amendments and soil and water conservation measures should be considered to maintain soil fertility and aid in sustainable development in central areas, and in counties at lower slopes and elevations in western and southwestern Yunnan province.

  10. Influence of reaction conditions and type of alcohol on biodiesel yields and process economics of supercritical transesterification

    International Nuclear Information System (INIS)

    Micic, Radoslav D.; Tomić, Milan D.; Kiss, Ferenc E.; Nikolić-Djorić, Emilija B.; Simikić, Mirko Ð.

    2014-01-01

    Highlights: • Transesterification in supercritical methanol, ethanol and 1-propanol investigated. • Effect of alcohol, reaction temperature, pressure and time on yields analyzed. • Temperature has the highest impact on yield, followed by time and pressure. • Direct material and energy costs for each of the production alternatives estimated. • Lowest costs are achieved at highest yields even at very low oil prices. - Abstract: Experiments with transesterification of rapeseed oil in supercritical alcohols (methanol, ethanol and 1-propanol) were carried out in a batch reactor at various reaction temperatures (250–350 °C), working pressure (8–12 MPa), reaction time, and constant 42:1 alcohol to oil molar ratio. Influence of different alcohols and reaction conditions on biodiesel yield was investigated using linear multiple regression models. Temperature had the highest impact on yields, followed by reaction time and pressure. With increased molecular weight of alcohols, relative importance of temperature for explanation of yields decreased and relative importance of time and pressure increased. Economic assessment has revealed that transesterification in supercritical methanol has the lowest direct material and energy costs. Yield has crucial impact on process economics. Direct costs decrease with increase in biodiesel yields. Even at very low prices of oil feedstock the lowest cost is achieved at the highest yield

  11. Quantile regression theory and applications

    CERN Document Server

    Davino, Cristina; Vistocco, Domenico

    2013-01-01

    A guide to the implementation and interpretation of Quantile Regression models This book explores the theory and numerous applications of quantile regression, offering empirical data analysis as well as the software tools to implement the methods. The main focus of this book is to provide the reader with a comprehensivedescription of the main issues concerning quantile regression; these include basic modeling, geometrical interpretation, estimation and inference for quantile regression, as well as issues on validity of the model, diagnostic tools. Each methodological aspect is explored and

  12. Fungible weights in logistic regression.

    Science.gov (United States)

    Jones, Jeff A; Waller, Niels G

    2016-06-01

    In this article we develop methods for assessing parameter sensitivity in logistic regression models. To set the stage for this work, we first review Waller's (2008) equations for computing fungible weights in linear regression. Next, we describe 2 methods for computing fungible weights in logistic regression. To demonstrate the utility of these methods, we compute fungible logistic regression weights using data from the Centers for Disease Control and Prevention's (2010) Youth Risk Behavior Surveillance Survey, and we illustrate how these alternate weights can be used to evaluate parameter sensitivity. To make our work accessible to the research community, we provide R code (R Core Team, 2015) that will generate both kinds of fungible logistic regression weights. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  13. Yield trends and yield gap analysis of major crops in the world

    NARCIS (Netherlands)

    Hengsdijk, H.; Langeveld, J.W.A.

    2009-01-01

    This study aims to quantify the gap between current and potential yields of major crops in the world, and the production constraints that contribute to this yield gap. Using an expert-based evaluation of yield gaps and the literature, global and regional yields and yield trends of major crops are

  14. Application of logistic regression for landslide susceptibility zoning of Cekmece Area, Istanbul, Turkey

    Science.gov (United States)

    Duman, T. Y.; Can, T.; Gokceoglu, C.; Nefeslioglu, H. A.; Sonmez, H.

    2006-11-01

    As a result of industrialization, throughout the world, cities have been growing rapidly for the last century. One typical example of these growing cities is Istanbul, the population of which is over 10 million. Due to rapid urbanization, new areas suitable for settlement and engineering structures are necessary. The Cekmece area located west of the Istanbul metropolitan area is studied, because the landslide activity is extensive in this area. The purpose of this study is to develop a model that can be used to characterize landslide susceptibility in map form using logistic regression analysis of an extensive landslide database. A database of landslide activity was constructed using both aerial-photography and field studies. About 19.2% of the selected study area is covered by deep-seated landslides. The landslides that occur in the area are primarily located in sandstones with interbedded permeable and impermeable layers such as claystone, siltstone and mudstone. About 31.95% of the total landslide area is located at this unit. To apply logistic regression analyses, a data matrix including 37 variables was constructed. The variables used in the forwards stepwise analyses are different measures of slope, aspect, elevation, stream power index (SPI), plan curvature, profile curvature, geology, geomorphology and relative permeability of lithological units. A total of 25 variables were identified as exerting strong influence on landslide occurrence, and included by the logistic regression equation. Wald statistics values indicate that lithology, SPI and slope are more important than the other parameters in the equation. Beta coefficients of the 25 variables included the logistic regression equation provide a model for landslide susceptibility in the Cekmece area. This model is used to generate a landslide susceptibility map that correctly classified 83.8% of the landslide-prone areas.

  15. [Predicting the impact of climate change in the next 40 years on the yield of maize in China].

    Science.gov (United States)

    Ma, Yu-ping; Sun, Lin-li; E, You-hao; Wu, Wei

    2015-01-01

    Climate change will significantly affect agricultural production in China. The combination of the integral regression model and the latest climate projection may well assess the impact of future climate change on crop yield. In this paper, the correlation model of maize yield and meteorological factors was firstly established for different provinces in China by using the integral regression method, then the impact of climate change in the next 40 years on China's maize production was evaluated combined the latest climate prediction with the reason be ing analyzed. The results showed that if the current speeds of maize variety improvement and science and technology development were constant, maize yield in China would be mainly in an increasing trend of reduction with time in the next 40 years in a range generally within 5%. Under A2 climate change scenario, the region with the most reduction of maize yield would be the Northeast except during 2021-2030, and the reduction would be generally in the range of 2.3%-4.2%. Maize yield reduction would be also high in the Northwest, Southwest and middle and lower reaches of Yangtze River after 2031. Under B2 scenario, the reduction of 5.3% in the Northeast in 2031-2040 would be the greatest across all regions. Other regions with considerable maize yield reduction would be mainly in the Northwest and the Southwest. Reduction in maize yield in North China would be small, generally within 2%, under any scenarios, and that in South China would be almost unchanged. The reduction of maize yield in most regions would be greater under A2 scenario than under B2 scenario except for the period of 2021-2030. The effect of the ten day precipitation on maize yield in northern China would be almost positive. However, the effect of ten day average temperature on yield of maize in all regions would be generally negative. The main reason of maize yield reduction was temperature increase in most provinces but precipitation decrease in a few

  16. A systematic review and meta-regression of mobile-bearing versus fixed-bearing total knee replacement in 41 studies.

    Science.gov (United States)

    van der Voort, P; Pijls, B G; Nouta, K A; Valstar, E R; Jacobs, W C H; Nelissen, R G H H

    2013-09-01

    Mobile-bearing (MB) total knee replacement (TKR) was introduced to reduce the risk of aseptic loosening and wear of polyethylene inserts. However, no consistent clinical advantages of mobile- over fixed-bearing (FB) TKR have been found. In this study we evaluated whether mobile bearings have an advantage over fixed bearings with regard to revision rates and clinical outcome scores. Furthermore, we determined which modifying variables affected the outcome. A systematic search of the literature was conducted to collect clinical trials comparing MB and FB in primary TKR. The primary outcomes were revision rates for any reason, aseptic loosening and wear. Secondary outcomes included range of movement, Knee Society score (KSS), Oxford knee score (OKS), Short-Form 12 (SF-12) score and radiological parameters. Meta-regression techniques were used to explore factors modifying the observed effect. Our search yielded 1827 publications, of which 41 studies met our inclusion criteria, comprising over 6000 TKRs. Meta-analyses showed no clinically relevant differences in terms of revision rates, clinical outcome scores or patient-reported outcome measures between MB and FB TKRs. It appears that theoretical assumptions of superiority of MB over FB TKR are not borne out in clinical practice.

  17. Evaluation of logistic regression models and effect of covariates for case-control study in RNA-Seq analysis.

    Science.gov (United States)

    Choi, Seung Hoan; Labadorf, Adam T; Myers, Richard H; Lunetta, Kathryn L; Dupuis, Josée; DeStefano, Anita L

    2017-02-06

    Next generation sequencing provides a count of RNA molecules in the form of short reads, yielding discrete, often highly non-normally distributed gene expression measurements. Although Negative Binomial (NB) regression has been generally accepted in the analysis of RNA sequencing (RNA-Seq) data, its appropriateness has not been exhaustively evaluated. We explore logistic regression as an alternative method for RNA-Seq studies designed to compare cases and controls, where disease status is modeled as a function of RNA-Seq reads using simulated and Huntington disease data. We evaluate the effect of adjusting for covariates that have an unknown relationship with gene expression. Finally, we incorporate the data adaptive method in order to compare false positive rates. When the sample size is small or the expression levels of a gene are highly dispersed, the NB regression shows inflated Type-I error rates but the Classical logistic and Bayes logistic (BL) regressions are conservative. Firth's logistic (FL) regression performs well or is slightly conservative. Large sample size and low dispersion generally make Type-I error rates of all methods close to nominal alpha levels of 0.05 and 0.01. However, Type-I error rates are controlled after applying the data adaptive method. The NB, BL, and FL regressions gain increased power with large sample size, large log2 fold-change, and low dispersion. The FL regression has comparable power to NB regression. We conclude that implementing the data adaptive method appropriately controls Type-I error rates in RNA-Seq analysis. Firth's logistic regression provides a concise statistical inference process and reduces spurious associations from inaccurately estimated dispersion parameters in the negative binomial framework.

  18. Spatial analysis of the dairy yield using a conditional autoregressive model / Análise espacial da produção leiteira usando um modelo autoregressivo condicional

    Directory of Open Access Journals (Sweden)

    João Domingos Scalon

    2010-07-01

    Full Text Available The dairy yield is one of the most important activities for the Brazilian economy and the use of statistical models may improve the decision making in this productive sector. The aim of this paper was to compare the performance of both the traditional linear regression model and the spatial regression model called conditional autoregressive (CAR to explain how some covariates may contribute for the dairy yield. This work used a database on dairy yield supplied by the Brazilian Institute of Geography and Statistics (IBGE and another database on geographical information of the state of Minas Gerais provided by the Integrated Program of Technological Use of Geographical Information (GEOMINAS. The results showed the superiority of the CAR model over the traditional linear regression model to explain the dairy yield. The CAR model allowed the identification of two different spatial clusters of counties related to the dairy yield in the state of Minas Gerais. The first cluster represents the region where one observes the biggest levels of dairy yield. It is formed by the counties of the Triângulo Mineiro. The second cluster is formed by the northern counties of the state that present the lesser levels of dairy yield. A produção de leite é uma das atividades mais importantes para a economia brasileira e o uso de modelos estatísticos pode auxiliar a tomada de decisão neste setor produtivo. O objetivo deste artigo foi comparar o desempenho do modelo de regressão linear tradicional e do modelo de regressão espacial, denominado de autoregressivo condicional (CAR, para explicar como algumas variáveis preditoras contribuem para a quantidade de leite produzido. Este trabalho usou uma base de dados sobre a produção de leite fornecida pelo Instituto Brasileiro de Geografia e Estatística (IBGE e outra base de dados sobre informações geográficas do estado de Minas Gerais, fornecida pelo Programa Integrado de Uso da Tecnologia de Geoprocessamento

  19. Relationship Between Seed Yield And Some of Fruit Traits in Iranian Squash (Cucurbita pepo L. Accissions

    Directory of Open Access Journals (Sweden)

    R. Barzegar

    2016-02-01

    Full Text Available In order to evaluation of squash (Cucurbita pepo seed yield per fruit and its relations with other characteristics of fruit include: length, diameter, length: diameter ratio (fruit shape, flesh thickness, thousand seed weight and fruit weight, an experiment was conducted using 24 accessions of squash as a randomized complete-block design with three replications. Morphological traits were evaluated according to UPOV descriptor and UPGMA clustering algorithm clustered the accessions in 4 groups (predominantly on the basis of fruit shape. Correlation, regression and path analysis were done for mentioned characteristics in 4 type-fruit groups. There was negative correlation between seed yield of individual fruit and its length and fruit length: diameter ratio. But fruit weight, fruit diameter, and thousand seeds weight had positive correlation with seed yield. Seed weight: fruit weight ratio had negative relationship with fruit weight. Therefore small size fruit is more suitable for seed yield per area. Path analysis was showed fruit weight had the most positive direct effect on seed yield per fruit in all groups.

  20. An introduction to using Bayesian linear regression with clinical data.

    Science.gov (United States)

    Baldwin, Scott A; Larson, Michael J

    2017-11-01

    Statistical training psychology focuses on frequentist methods. Bayesian methods are an alternative to standard frequentist methods. This article provides researchers with an introduction to fundamental ideas in Bayesian modeling. We use data from an electroencephalogram (EEG) and anxiety study to illustrate Bayesian models. Specifically, the models examine the relationship between error-related negativity (ERN), a particular event-related potential, and trait anxiety. Methodological topics covered include: how to set up a regression model in a Bayesian framework, specifying priors, examining convergence of the model, visualizing and interpreting posterior distributions, interval estimates, expected and predicted values, and model comparison tools. We also discuss situations where Bayesian methods can outperform frequentist methods as well has how to specify more complicated regression models. Finally, we conclude with recommendations about reporting guidelines for those using Bayesian methods in their own research. We provide data and R code for replicating our analyses. Copyright © 2017 Elsevier Ltd. All rights reserved.

  1. Update of neutron dose yields as a function of energy for protons and deuterons incident on beryllium targets

    International Nuclear Information System (INIS)

    Ten Haken, R.K.; Awschalom, M.; Rosenberg, I.

    1982-11-01

    Neutron absorbed dose yields (absorbed dose rates per unit incident current on targets at a given SAD or SSD) increase with incident charged particle energy for both protons and deuterons. Analyses of neutron dose yield versus incident particle energy have been performed for both deuterons and protons. It is the purpose of this report to update those analyses by pooling all of the more recent published results and to reanalyze the trend of yield, Y, versus incident energy, E, which in the past has been described by an expression of the form Y = aE/sup b/, where a and b are empirical constants. From the reanalyzed trend it is concluded that for a given size cyclotron (E/sub p/ = 2E/sub d/), the dose yields using protons are higher than those using deuterons up to a proton energy E/sub p/ of 64 MeV

  2. Principal component regression analysis with SPSS.

    Science.gov (United States)

    Liu, R X; Kuang, J; Gong, Q; Hou, X L

    2003-06-01

    The paper introduces all indices of multicollinearity diagnoses, the basic principle of principal component regression and determination of 'best' equation method. The paper uses an example to describe how to do principal component regression analysis with SPSS 10.0: including all calculating processes of the principal component regression and all operations of linear regression, factor analysis, descriptives, compute variable and bivariate correlations procedures in SPSS 10.0. The principal component regression analysis can be used to overcome disturbance of the multicollinearity. The simplified, speeded up and accurate statistical effect is reached through the principal component regression analysis with SPSS.

  3. Variation on biomass yield and morphological traits of energy grasses from the genus Miscanthus during the first years of crop establishment

    Energy Technology Data Exchange (ETDEWEB)

    Jezowski, S.; Glowacka, K.; Kaczmarek, Z. [Institute of Plant Genetics, Polish Academy of Sciences, ul. Strzeszynska 34, 60-479 Poznan (Poland)

    2011-02-15

    This study presents the results of investigations of variation, genotype x year interactions and genotype x year x location interactions for the yield and morphological traits of several selected clones of energy grasses of the genus Miscanthus. The analyses were performed on the best clones of selected hybrid plants, which were obtained within the species M. sinensis or are the result of interspecific hybridization of M. sinensis and M. sacchariflorus. Analyses were conducted on the basis of three-year field trials at two locations. The young plants produced from in vitro cultures were planted at a density of one plant per m{sup 2}. The early stages of plant development, from planting until peak yield in the third year of cultivation, were analysed. Statistical analyses performed on the yield and morphological traits as well as changes in these characteristics over the successive years of the study showed considerable genotypic variation for traits under study. Moreover, significant genotype x year interactions as well as genotype x year x location interactions were observed in terms of yield and morphological traits. Based on the collective results of the study, we suggest that apart from M. x giganteus particularly hybrids of M. sinensis x M. sacchariflorus, should be taken into consideration in genetic and breeding studies on the improvement of yield from energy grasses of the genus Miscanthus. (author)

  4. On yield gaps and yield gains in intercropping

    NARCIS (Netherlands)

    Gou, Fang; Yin, Wen; Hong, Yu; Werf, van der Wopke; Chai, Qiang; Heerink, Nico; Ittersum, van Martin K.

    2017-01-01

    Wheat-maize relay intercropping has been widely used by farmers in northwest China, and based on field experiments agronomists report it has a higher productivity than sole crops. However, the yields from farmers’ fields have not been investigated yet. Yield gap analysis provides a framework to

  5. Length bias correction in gene ontology enrichment analysis using logistic regression.

    Science.gov (United States)

    Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H

    2012-01-01

    When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.

  6. Soil moisture estimation using multi linear regression with terraSAR-X data

    Directory of Open Access Journals (Sweden)

    G. García

    2016-06-01

    Full Text Available The first five centimeters of soil form an interface where the main heat fluxes exchanges between the land surface and the atmosphere occur. Besides ground measurements, remote sensing has proven to be an excellent tool for the monitoring of spatial and temporal distributed data of the most relevant Earth surface parameters including soil’s parameters. Indeed, active microwave sensors (Synthetic Aperture Radar - SAR offer the opportunity to monitor soil moisture (HS at global, regional and local scales by monitoring involved processes. Several inversion algorithms, that derive geophysical information as HS from SAR data, were developed. Many of them use electromagnetic models for simulating the backscattering coefficient and are based on statistical techniques, such as neural networks, inversion methods and regression models. Recent studies have shown that simple multiple regression techniques yield satisfactory results. The involved geophysical variables in these methodologies are descriptive of the soil structure, microwave characteristics and land use. Therefore, in this paper we aim at developing a multiple linear regression model to estimate HS on flat agricultural regions using TerraSAR-X satellite data and data from a ground weather station. The results show that the backscatter, the precipitation and the relative humidity are the explanatory variables of HS. The results obtained presented a RMSE of 5.4 and a R2  of about 0.6

  7. Physiological quality of soybean seeds under different yield environments and plant density

    Directory of Open Access Journals (Sweden)

    Felipe A. Baron

    Full Text Available ABSTRACT Yield potential of agricultural fields associated with plant spatial arrangement could determine the physiological quality of soybean (Glycine max L. seeds. Thus, this study aimed to evaluate the physiological quality of soybean seeds from different yield environments and plant densities. Experiments were carried out in Boa Vista das Missões-RS, Brazil, during the 2014/2015 growing season. Yield environments were delineated by overlapping yield maps from the 2008, 2009/2010 and 2011/2012 growing seasons. The experimental design was a randomized complete block in a 2 x 5 factorial arrangement with two yield environments (low and high and five plant densities, with four replicates. Two varieties were tested: Brasmax Ativa RR (10, 15, 20, 25 and 30 plants m-1 and Nidera 5909 RR (5, 10, 15, 20 and 25 plants m-1. After harvested, the seeds were analysed as following: first count index, germination, abnormal seedlings, dead seeds, electrical conductivity, accelerate aging test, root length, hypocotyl length and seedling length. The spatial variability of seed vigor in the production field could be reduced by adjusting plant density, but the adjustment should consider the variety. Harvest according to yield environment is a strategy to separate lots of seeds with higher vigor, originated from high-yield environments.

  8. The intermediate endpoint effect in logistic and probit regression

    Science.gov (United States)

    MacKinnon, DP; Lockwood, CM; Brown, CH; Wang, W; Hoffman, JM

    2010-01-01

    Background An intermediate endpoint is hypothesized to be in the middle of the causal sequence relating an independent variable to a dependent variable. The intermediate variable is also called a surrogate or mediating variable and the corresponding effect is called the mediated, surrogate endpoint, or intermediate endpoint effect. Clinical studies are often designed to change an intermediate or surrogate endpoint and through this intermediate change influence the ultimate endpoint. In many intermediate endpoint clinical studies the dependent variable is binary, and logistic or probit regression is used. Purpose The purpose of this study is to describe a limitation of a widely used approach to assessing intermediate endpoint effects and to propose an alternative method, based on products of coefficients, that yields more accurate results. Methods The intermediate endpoint model for a binary outcome is described for a true binary outcome and for a dichotomization of a latent continuous outcome. Plots of true values and a simulation study are used to evaluate the different methods. Results Distorted estimates of the intermediate endpoint effect and incorrect conclusions can result from the application of widely used methods to assess the intermediate endpoint effect. The same problem occurs for the proportion of an effect explained by an intermediate endpoint, which has been suggested as a useful measure for identifying intermediate endpoints. A solution to this problem is given based on the relationship between latent variable modeling and logistic or probit regression. Limitations More complicated intermediate variable models are not addressed in the study, although the methods described in the article can be extended to these more complicated models. Conclusions Researchers are encouraged to use an intermediate endpoint method based on the product of regression coefficients. A common method based on difference in coefficient methods can lead to distorted

  9. The impact of mineral fertilization and atmospheric precipitation on yield of field crops on family farms

    Directory of Open Access Journals (Sweden)

    Munćan Mihajlo

    2016-01-01

    Full Text Available The field crop production, as the most important branch of plant production of the Republic of Serbia, in the period 2002-2011, was carried out on an average of over 2.7 million hectares, 82.7% of which took place on the individual farms/family holdings. Hence, the subject of research in this paper covers yields of major field crops realized on family farms in the region of Vojvodina in the period 1972-2011. The main objective of the research is to study the interdependence of utilization of mineral fertilizers and atmospheric precipitation during the vegetation period and realized yields of major field crops on family farms in the observed period. The regression analysis was applied in order to verify dependencies and determine the form of dependence of achieved yields from examined variables. The results showed that the main limiting factors for obtaining high and stable yields of field crops is inadequate use of fertilizers and the lack of precipitation during the vegetation period.

  10. Quantifying trade-offs between future yield levels, food availability and forest and woodland conservation in Benin.

    Science.gov (United States)

    Duku, Confidence; Zwart, Sander J; van Bussel, Lenny G J; Hein, Lars

    2018-01-01

    Meeting the dual objectives of food security and ecosystem protection is a major challenge in sub-Saharan Africa (SSA). To this end agricultural intensification is considered desirable, yet, there remain uncertainties regarding the impact of climate change on opportunities for agricultural intensification and the adequacy of intensification options given the rapid population growth. We quantify trade-offs between levels of yield gap closure, food availability and forest and woodland conservation under different scenarios. Each scenario is made up of a combination of variants of four parameters i.e. (1) climate change based on Representative Concentration Pathways (RCPs); (2) population growth based on Shared Socioeconomic Pathways (SSPs); (3) cropland expansion with varying degrees of deforestation; and (4) different degrees of yield gap closure. We carry out these analyses for three major food crops, i.e. maize, cassava and yam, in Benin. Our analyses show that in most of the scenarios, the required levels of yield gap closures required to maintain the current levels of food availability can be achieved by 2050 by maintaining the average rate of yield increases recorded over the past two and half decades in addition to the current cropping intensity. However, yields will have to increase at a faster rate than has been recorded over the past two and half decades in order to achieve the required levels of yield gap closures by 2100. Our analyses also show that without the stated levels of yield gap closure, the areas under maize, cassava and yam cultivation will have to increase by 95%, 102% and 250% respectively in order to maintain the current levels of per capita food availability. Our study shows that food security outcomes and forest and woodland conservation goals in Benin and likely the larger SSA region are inextricably linked together and require holistic management strategies that considers trade-offs and co-benefits. Copyright © 2017 Elsevier B.V. All

  11. A Quantile Regression Approach to Estimating the Distribution of Anesthetic Procedure Time during Induction.

    Directory of Open Access Journals (Sweden)

    Hsin-Lun Wu

    Full Text Available Although procedure time analyses are important for operating room management, it is not easy to extract useful information from clinical procedure time data. A novel approach was proposed to analyze procedure time during anesthetic induction. A two-step regression analysis was performed to explore influential factors of anesthetic induction time (AIT. Linear regression with stepwise model selection was used to select significant correlates of AIT and then quantile regression was employed to illustrate the dynamic relationships between AIT and selected variables at distinct quantiles. A total of 1,060 patients were analyzed. The first and second-year residents (R1-R2 required longer AIT than the third and fourth-year residents and attending anesthesiologists (p = 0.006. Factors prolonging AIT included American Society of Anesthesiologist physical status ≧ III, arterial, central venous and epidural catheterization, and use of bronchoscopy. Presence of surgeon before induction would decrease AIT (p < 0.001. Types of surgery also had significant influence on AIT. Quantile regression satisfactorily estimated extra time needed to complete induction for each influential factor at distinct quantiles. Our analysis on AIT demonstrated the benefit of quantile regression analysis to provide more comprehensive view of the relationships between procedure time and related factors. This novel two-step regression approach has potential applications to procedure time analysis in operating room management.

  12. Direct and regression methods do not give different estimates of digestible and metabolizable energy of wheat for pigs.

    Science.gov (United States)

    Bolarinwa, O A; Adeola, O

    2012-12-01

    Digestible and metabolizable energy contents of feed ingredients for pigs can be determined by direct or indirect methods. There are situations when only the indirect approach is suitable and the regression method is a robust indirect approach. This study was conducted to compare the direct and regression methods for determining the energy value of wheat for pigs. Twenty-four barrows with an average initial BW of 31 kg were assigned to 4 diets in a randomized complete block design. The 4 diets consisted of 969 g wheat/kg plus minerals and vitamins (sole wheat) for the direct method, corn (Zea mays)-soybean (Glycine max) meal reference diet (RD), RD + 300 g wheat/kg, and RD + 600 g wheat/kg. The 3 corn-soybean meal diets were used for the regression method and wheat replaced the energy-yielding ingredients, corn and soybean meal, so that the same ratio of corn and soybean meal across the experimental diets was maintained. The wheat used was analyzed to contain 883 g DM, 15.2 g N, and 3.94 Mcal GE/kg. Each diet was fed to 6 barrows in individual metabolism crates for a 5-d acclimation followed by a 5-d total but separate collection of feces and urine. The DE and ME for the sole wheat diet were 3.83 and 3.77 Mcal/kg DM, respectively. Because the sole wheat diet contained 969 g wheat/kg, these translate to 3.95 Mcal DE/kg DM and 3.89 Mcal ME/kg DM. The RD used for the regression approach yielded 4.00 Mcal DE and 3.91 Mcal ME/kg DM diet. Increasing levels of wheat in the RD linearly reduced (P direct method (3.95 and 3.89 Mcal/kg DM) did not differ (0.78 < P < 0.89) from those obtained using the regression method (3.96 and 3.88 Mcal/kg DM).

  13. Logistic regression models

    CERN Document Server

    Hilbe, Joseph M

    2009-01-01

    This book really does cover everything you ever wanted to know about logistic regression … with updates available on the author's website. Hilbe, a former national athletics champion, philosopher, and expert in astronomy, is a master at explaining statistical concepts and methods. Readers familiar with his other expository work will know what to expect-great clarity.The book provides considerable detail about all facets of logistic regression. No step of an argument is omitted so that the book will meet the needs of the reader who likes to see everything spelt out, while a person familiar with some of the topics has the option to skip "obvious" sections. The material has been thoroughly road-tested through classroom and web-based teaching. … The focus is on helping the reader to learn and understand logistic regression. The audience is not just students meeting the topic for the first time, but also experienced users. I believe the book really does meet the author's goal … .-Annette J. Dobson, Biometric...

  14. Institutions and deforestation in the Brazilian amazon: a geographic regression discontinuity analysis

    OpenAIRE

    Bogetvedt, Ingvild Engen; Hauge, Mari Johnsrud

    2017-01-01

    This study explores the impact of institutional quality at the municipal level on deforestation in the Legal Amazon. We add to this insufficiently understood topic by implementing a geographic regression discontinuity design. By taking advantage of high-resolution spatial data on deforestation combined with an objective measure of corruption used as a proxy for institutional quality, we analyse 138 Brazilian municipalities in the period of 2002-2004. Our empirical findings show...

  15. Closing Yield Gaps: How Sustainable Can We Be?

    Science.gov (United States)

    Pradhan, Prajal; Fischer, Günther; van Velthuizen, Harrij; Reusser, Dominik E.; Kropp, Juergen P.

    2015-01-01

    Global food production needs to be increased by 60–110% between 2005 and 2050 to meet growing food and feed demand. Intensification and/or expansion of agriculture are the two main options available to meet the growing crop demands. Land conversion to expand cultivated land increases GHG emissions and impacts biodiversity and ecosystem services. Closing yield gaps to attain potential yields may be a viable option to increase the global crop production. Traditional methods of agricultural intensification often have negative externalities. Therefore, there is a need to explore location-specific methods of sustainable agricultural intensification. We identified regions where the achievement of potential crop calorie production on currently cultivated land will meet the present and future food demand based on scenario analyses considering population growth and changes in dietary habits. By closing yield gaps in the current irrigated and rain-fed cultivated land, about 24% and 80% more crop calories can respectively be produced compared to 2000. Most countries will reach food self-sufficiency or improve their current food self-sufficiency levels if potential crop production levels are achieved. As a novel approach, we defined specific input and agricultural management strategies required to achieve the potential production by overcoming biophysical and socioeconomic constraints causing yield gaps. The management strategies include: fertilizers, pesticides, advanced soil management, land improvement, management strategies coping with weather induced yield variability, and improving market accessibility. Finally, we estimated the required fertilizers (N, P2O5, and K2O) to attain the potential yields. Globally, N-fertilizer application needs to increase by 45–73%, P2O5-fertilizer by 22–46%, and K2O-fertilizer by 2–3 times compared to the year 2010 to attain potential crop production. The sustainability of such agricultural intensification largely depends on the way

  16. Farmers Extension Program Effects on Yield Gap in North China Plain

    Science.gov (United States)

    Sum, N.; Zhao, Y.

    2015-12-01

    Improving crop yield of the lowest yielding smallholder farmers in developing countries is essential to both food security of the country and the farmers' livelihood. Although wheat and maize production in most developed countries have reached 80% or greater of yield potential determined by simulated models, yield gap remains high in the developing world. One of these cases is the yield gap of maize in the North China Plain (NCP), where the average farmer's yield is 41% of his or her potential yield. This large yield gap indicates opportunity to raise yields substantially by improving agronomy, especially in nutrition management, irrigation facility, and mechanization issues such as technical services. Farmers' agronomic knowledge is essential to yield performance. In order to propagate such knowledge to farmers, agricultural extension programs, especially in-the-field guidance with training programs at targeted demonstration fields, have become prevalent in China. Although traditional analyses of the effects of the extension program are done through surveys, they are limited to only one to two years and to a small area. However, the spatial analysis tool Google Earth Engine (GEE) and its extensive satellite imagery data allow for unprecedented spatial temporal analysis of yield variation. We used GEE to analyze maize yield in Quzhou county in the North China Plain from 2007 to 2013. We based our analysis on the distance from a demonstration farm plot, the source of the farmers' agronomic knowledge. Our hypothesis was that the farther the farmers' fields were from the demonstration plot, the less access they would have to the knowledge, and the less increase in yield over time. Testing this hypothesis using GEE helps us determine the effectiveness of the demonstration plot in disseminating optimal agronomic practices in addition to evaluating yield performance of the demonstration field itself. Furthermore, we can easily extend this methodology to analyze the whole

  17. A Bayesian goodness of fit test and semiparametric generalization of logistic regression with measurement data.

    Science.gov (United States)

    Schörgendorfer, Angela; Branscum, Adam J; Hanson, Timothy E

    2013-06-01

    Logistic regression is a popular tool for risk analysis in medical and population health science. With continuous response data, it is common to create a dichotomous outcome for logistic regression analysis by specifying a threshold for positivity. Fitting a linear regression to the nondichotomized response variable assuming a logistic sampling model for the data has been empirically shown to yield more efficient estimates of odds ratios than ordinary logistic regression of the dichotomized endpoint. We illustrate that risk inference is not robust to departures from the parametric logistic distribution. Moreover, the model assumption of proportional odds is generally not satisfied when the condition of a logistic distribution for the data is violated, leading to biased inference from a parametric logistic analysis. We develop novel Bayesian semiparametric methodology for testing goodness of fit of parametric logistic regression with continuous measurement data. The testing procedures hold for any cutoff threshold and our approach simultaneously provides the ability to perform semiparametric risk estimation. Bayes factors are calculated using the Savage-Dickey ratio for testing the null hypothesis of logistic regression versus a semiparametric generalization. We propose a fully Bayesian and a computationally efficient empirical Bayesian approach to testing, and we present methods for semiparametric estimation of risks, relative risks, and odds ratios when parametric logistic regression fails. Theoretical results establish the consistency of the empirical Bayes test. Results from simulated data show that the proposed approach provides accurate inference irrespective of whether parametric assumptions hold or not. Evaluation of risk factors for obesity shows that different inferences are derived from an analysis of a real data set when deviations from a logistic distribution are permissible in a flexible semiparametric framework. © 2013, The International Biometric

  18. Interpret with caution: multicollinearity in multiple regression of cognitive data.

    Science.gov (United States)

    Morrison, Catriona M

    2003-08-01

    Shibihara and Kondo in 2002 reported a reanalysis of the 1997 Kanji picture-naming data of Yamazaki, Ellis, Morrison, and Lambon-Ralph in which independent variables were highly correlated. Their addition of the variable visual familiarity altered the previously reported pattern of results, indicating that visual familiarity, but not age of acquisition, was important in predicting Kanji naming speed. The present paper argues that caution should be taken when drawing conclusions from multiple regression analyses in which the independent variables are so highly correlated, as such multicollinearity can lead to unreliable output.

  19. Evaluation of Hail Simulated Damage on Marketable Tuber Yield of Potato Agria Cultivar in Ardabil Region

    Directory of Open Access Journals (Sweden)

    D. Hassanpanah

    2012-07-01

    Full Text Available This study was conducted at Ardabil Agriculture and Natural Resources Research Station during the year of 2010. A factorial experiment based on randomized complete block design with four replications and two factors were used to evaluate the effect of simulated hail damage to foliage at different growth stages of potato Agria cultivar on marketable tuber yield. The first factor consisted of six levels of foliar damage (0, 20, 40, 60, 80 and 100 percent and the second factor of five levels of plant growth stages (2, 5, 8, 11 and 15 weeks after the growing. Analysis of variance showed that there were significant differences among plants for levels and times of hail damage and their interactions in terms of marketable tuber yield. Percentage of marketable yield reduction at early stages of vegetative growth (2 weeks after growing was minimal. Occurrence of hail damage at the tuberization and bulking stages (5, 8 and 11 weeks after growing severely reduced marketable tuber yield. While, its damage at late growing stages of (14 weeks after growing on tuber yield was not appreciable. Times of hail damage on marketable tuber yield reduction was calculated through the regression. Relative reduction of marketable tuber yield at the early stages of vegetative growth, due to hail damage, against non-marketable tuber yield was higher than of bulking stage.

  20. Yield gap determinants for wheat production in major irrigated cropping zones of punjab, pakistan

    International Nuclear Information System (INIS)

    Hussain, A.; Aujla, K.M.; Badar, N.

    2014-01-01

    Yield gap is useful measurement for crop productivity and the extent to which crop productivity falls below some potential level. The study was carried out to analyze the yield gap and determinants of wheat production in the Punjab province of Pakistan. It is based on cross sectional data from 210 farmers for the crop year 2009-10. Results suggest that farm level wheat yields are less than the potential yield level by 33.0%, 43.0% and 50.6% in the mixed-cropping, cotton-wheat and rice-wheat zones of the province, respectively. Ordinary least square regression analysis of wheat production by assuming Cobb-Douglas specification reveals that the number of irrigations, usage of farm yard manure and fertilizers contribute positively and significantly to wheat crop production. Coefficients of dummy variables for cropping zones indicate that farmers in the mixed cropping zone are obtaining better yield of the wheat crop as compared to their counterparts in other selected cropping zones. These results suggested that farmers can increase wheat productivity by increasing the use of factor inputs; however, poverty may be a constraint on realizing these gains. Thus, wheat production can be increased in the country by helping resource poor farmers through suitable support mechanisms. (author)

  1. Understanding logistic regression analysis.

    Science.gov (United States)

    Sperandei, Sandro

    2014-01-01

    Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using examples to make it as simple as possible. After definition of the technique, the basic interpretation of the results is highlighted and then some special issues are discussed.

  2. Satellite-based studies of maize yield spatial variations and their causes in China

    Science.gov (United States)

    Zhao, Y.

    2013-12-01

    Maize production in China has been expanding significantly in the past two decades, but yield has become relatively stagnant in the past few years, and needs to be improved to meet increasing demand. Multiple studies found that the gap between potential and actual yield of maize is as large as 40% to 60% of yield potential. Although a few major causes of yield gap have been qualitatively identified with surveys, there has not been spatial analysis aimed at quantifying relative importance of specific biophysical and socio-economic causes, information which would be useful for targeting interventions. This study analyzes the causes of yield variation at field and village level in Quzhou county of North China Plain (NCP). We combine remote sensing and crop modeling to estimate yields in 2009-2012, and identify fields that are consistently high or low yielding. To establish the relationship between yield and potential factors, we gather data on those factors through a household survey. We select targeted survey fields such that not only both extremes of yield distribution but also all soil texture categories in the county is covered. Our survey assesses management and biophysical factors as well as social factors such as farmers' access to agronomic knowledge, which is approximated by distance to the closest demonstration plot or 'Science and technology backyard'. Our survey covers 10 townships, 53 villages and 180 fields. Three to ten farmers are surveyed depending on the amount of variation present among sub pixels of each field. According to survey results, we extract the amount of variation within as well as between villages and or soil type. The higher within village or within field variation, the higher importance of management factors. Factors such as soil type and access to knowledge are more represented by between village variation. Through regression and analysis of variance, we gain more quantitative and thorough understanding of causes to yield variation at

  3. Ajuste de modelos de platô de resposta via regressão isotônica Response plateau models fitting via isotonic regression

    Directory of Open Access Journals (Sweden)

    Renata Pires Gonçalves

    2012-02-01

    . The experiments of type dosage x response are very common in the determination of levels of nutrients in optimal food balance and include the use of regression models to achieve this objective. Nevertheless, the regression analysis routine, generally, uses a priori information about a possible relationship between the response variable. The isotonic regression is a method of estimation by least squares that generates estimates which preserves data ordering. In the theory of isotonic regression this information is essential and it is expected to increase fitting efficiency. The objective of this work was to use an isotonic regression methodology, as an alternative way of analyzing data of Zn deposition in tibia of male birds of Hubbard lineage. We considered the models of plateau response of polynomial quadratic and linear exponential forms. In addition to these models, we also proposed the fitting of a logarithmic model to the data and the efficiency of the methodology was evaluated by Monte Carlo simulations, considering different scenarios for the parametric values. The isotonization of the data yielded an improvement in all the fitting quality parameters evaluated. Among the models used, the logarithmic presented estimates of the parameters more consistent with the values reported in literature.

  4. Minimax Regression Quantiles

    DEFF Research Database (Denmark)

    Bache, Stefan Holst

    A new and alternative quantile regression estimator is developed and it is shown that the estimator is root n-consistent and asymptotically normal. The estimator is based on a minimax ‘deviance function’ and has asymptotically equivalent properties to the usual quantile regression estimator. It is......, however, a different and therefore new estimator. It allows for both linear- and nonlinear model specifications. A simple algorithm for computing the estimates is proposed. It seems to work quite well in practice but whether it has theoretical justification is still an open question....

  5. Regression with Sparse Approximations of Data

    DEFF Research Database (Denmark)

    Noorzad, Pardis; Sturm, Bob L.

    2012-01-01

    We propose sparse approximation weighted regression (SPARROW), a method for local estimation of the regression function that uses sparse approximation with a dictionary of measurements. SPARROW estimates the regression function at a point with a linear combination of a few regressands selected...... by a sparse approximation of the point in terms of the regressors. We show SPARROW can be considered a variant of \\(k\\)-nearest neighbors regression (\\(k\\)-NNR), and more generally, local polynomial kernel regression. Unlike \\(k\\)-NNR, however, SPARROW can adapt the number of regressors to use based...

  6. Evaluating the Performance of Polynomial Regression Method with Different Parameters during Color Characterization

    Directory of Open Access Journals (Sweden)

    Bangyong Sun

    2014-01-01

    Full Text Available The polynomial regression method is employed to calculate the relationship of device color space and CIE color space for color characterization, and the performance of different expressions with specific parameters is evaluated. Firstly, the polynomial equation for color conversion is established and the computation of polynomial coefficients is analysed. And then different forms of polynomial equations are used to calculate the RGB and CMYK’s CIE color values, while the corresponding color errors are compared. At last, an optimal polynomial expression is obtained by analysing several related parameters during color conversion, including polynomial numbers, the degree of polynomial terms, the selection of CIE visual spaces, and the linearization.

  7. Water-Yield Relations of Drip Irrigated Watermelon in Temperate Climatic Conditions

    Directory of Open Access Journals (Sweden)

    Pejić Borivoj

    2016-08-01

    Full Text Available The objective of the study, conducted in Vojvodina a northern part of the Serbia Republic, was to analyse the effect of drip irrigation on yield, evapotranspiration and water productivity of watermelon (Cirullus lanatus Thunb. grown with plasticulture. Irrigation was scheduled on the basis of water balance method. Daily evapotranspiration was computed using the reference evapotranspiration and crop coefficient. The yield of watermelon in irrigation conditions (37,28 t/ha was significantly higher compared to non irrigated (9,98 t/ha. Water used on evapotranspiration in irrigation conditions was 398 mm and 117 mm on non irrigated variant. The crop yield response factor of 1,04 for the whole growing season reveals that relative yield decrease was nearly equal to the rate of evapotranspiration deficit. The values of irrigation water use efficiency and evapotranspiration water use efficiency were 9,93 kg/m3 and 10,29 kg/m3 respectively. The determined results could be used as a good platform for watermelon growers in the region, in terms of improvement of the optimum utilization of irrigation water.

  8. A simple approach to power and sample size calculations in logistic regression and Cox regression models.

    Science.gov (United States)

    Vaeth, Michael; Skovlund, Eva

    2004-06-15

    For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.

  9. A study of the irradiation of sawdust culture medium and the yield of fresh lentinus

    International Nuclear Information System (INIS)

    Xu Meiyu; Wu Jinshui

    1989-01-01

    60 Co γ-rays at the doses of 1.0-1.8 Mrad were used to irradiate sawdust material for sterilization, It was found that 60 Co γ-rays could promote the breakdown of cellulose into soluble sugar, and thus promoted the growth of hyphae and increased the yield of L.edode. Determination of regressive analysis and correlation coefficients indicated that there was a high positive correlation between soluble sugar and the growth of hyphae(r = 0.9546**) , which can be expressed as Y = 3.12 + 1.30X; A high positive correlation between soluble sugar and the yield of L.edode(r = 0.9935**) can be expressed as Y = 31.95 + 86.69X; and a high positive correlation between the hyphae growing rate and the yield of L.edode(r = 0.9531**) can be expressed as Y = -145.34 + 60.92X

  10. Buckwheat yield and its quality as affected by laser biostimulation of its seeds

    International Nuclear Information System (INIS)

    Koper, R.; Mikos-Bielak, M.

    2003-01-01

    The influence of various doses of laser radiation applied to buckwheat seed bio stimulation on the yield, and changes of chemical composition was analysed. A 12-25 percent increase of yield was observed in bio stimulated plants. The most positive effects were achieved after seed triple radiation using a laser of 30 mW power for 0.1 s. Bio stimulation caused a slight increase of protein, fat and fiber content a large increase of soluble and reducing sugars and a decrease of starch level

  11. VARIABILITY OF YIELD AND YIELD COMPONENTS IN “EGUSI ...

    African Journals Online (AJOL)

    journal

    Estimate of expected genetic advance in seed yield plant-1 ranged between. 25.90-48.40%. ..... values in fruit and seed yield characters have been reported in culinary melon, ... and Khund, A. 2004. Extent of heterosis and heritability in some.

  12. Development of Super-Ensemble techniques for ocean analyses: the Mediterranean Sea case

    Science.gov (United States)

    Pistoia, Jenny; Pinardi, Nadia; Oddo, Paolo; Collins, Matthew; Korres, Gerasimos; Drillet, Yann

    2017-04-01

    Short-term ocean analyses for Sea Surface Temperature SST in the Mediterranean Sea can be improved by a statistical post-processing technique, called super-ensemble. This technique consists in a multi-linear regression algorithm applied to a Multi-Physics Multi-Model Super-Ensemble (MMSE) dataset, a collection of different operational forecasting analyses together with ad-hoc simulations produced by modifying selected numerical model parameterizations. A new linear regression algorithm based on Empirical Orthogonal Function filtering techniques is capable to prevent overfitting problems, even if best performances are achieved when we add correlation to the super-ensemble structure using a simple spatial filter applied after the linear regression. Our outcomes show that super-ensemble performances depend on the selection of an unbiased operator and the length of the learning period, but the quality of the generating MMSE dataset has the largest impact on the MMSE analysis Root Mean Square Error (RMSE) evaluated with respect to observed satellite SST. Lower RMSE analysis estimates result from the following choices: 15 days training period, an overconfident MMSE dataset (a subset with the higher quality ensemble members), and the least square algorithm being filtered a posteriori.

  13. A spline-based regression parameter set for creating customized DARTEL MRI brain templates from infancy to old age

    Directory of Open Access Journals (Sweden)

    Marko Wilke

    2018-02-01

    Full Text Available This dataset contains the regression parameters derived by analyzing segmented brain MRI images (gray matter and white matter from a large population of healthy subjects, using a multivariate adaptive regression splines approach. A total of 1919 MRI datasets ranging in age from 1–75 years from four publicly available datasets (NIH, C-MIND, fCONN, and IXI were segmented using the CAT12 segmentation framework, writing out gray matter and white matter images normalized using an affine-only spatial normalization approach. These images were then subjected to a six-step DARTEL procedure, employing an iterative non-linear registration approach and yielding increasingly crisp intermediate images. The resulting six datasets per tissue class were then analyzed using multivariate adaptive regression splines, using the CerebroMatic toolbox. This approach allows for flexibly modelling smoothly varying trajectories while taking into account demographic (age, gender as well as technical (field strength, data quality predictors. The resulting regression parameters described here can be used to generate matched DARTEL or SHOOT templates for a given population under study, from infancy to old age. The dataset and the algorithm used to generate it are publicly available at https://irc.cchmc.org/software/cerebromatic.php. Keywords: MRI template creation, Multivariate adaptive regression splines, DARTEL, Structural MRI

  14. [Effects of Chemical Fertilizers and Organic Fertilizer on Yield of Ligusticum chuanxiong Rhizome].

    Science.gov (United States)

    Liang, Qin; Chen, Xing-fu; Li, Yan; Zhang, Jun; Meng, Jie; Peng, Shi-ming

    2015-10-01

    To study the effects of different N, P, K and organic fertilizer (OF) on yield of Ligusticum chuanxiong rhizome, in order to provide the theoretical foundation for the establishment of standardization cultivation techniques. The field plot experiments used Ligusticum chuanxiong rhizome which planted in Pengshan as material, and were studied by the four factors and five levels with quadratic regression rotation-orthogonal combination design. According to the data obtained, a function model which could predict the fertilization and yield of Ligusticum chuanxiong rhizome accurately was established. The model analysis showed that the yields of Ligusticum chuanxiong rhizome were significantly influenced by the N, P, K and OF applications. Among these factors, the order of increase rates by the fertilizers was K > OF > N > P; The effect of interaction between N and K, N and OF, K and OF on the yield of Ligusticum chuanxiong rhizome were significantly different. High levels of N and P, N and organic fertilizer, K and organic fertilizer were conducive to improve the yield of Ligusticum chuanxiong rhizome. The results showed that the optimal fertilizer application rates of N was 148.20 - 172.28 kg/hm2, P was 511.92 - 599.40 kg/hm2, K was 249.70 - 282.37 kg/hm2, and OF was 940.00 - 1 104.00 kg/hm2. N, P, K and OF obviously affect the yield of Ligusticum chuanxiong rhizome. K and OF can significantly increase the yield of Ligusticum chuanxiong rhizome. Thus it is suggested that properly high mount of K and OF and appropriate increasing N are two favorable factors for cultivating Ligusticum chuanxiong.

  15. Fission product yields

    International Nuclear Information System (INIS)

    Valenta, V.; Hep, J.

    1978-01-01

    Data are summed up necessary for determining the yields of individual fission products from different fissionable nuclides. Fractional independent yields, cumulative and isobaric yields are presented here for the thermal fission of 235 U, 239 Pu, 241 Pu and for fast fission (approximately 1 MeV) of 235 U, 238 U, 239 Pu, 241 Pu; these values are included into the 5th version of the YIELDS library, supplementing the BIBFP library. A comparison is made of experimental data and possible improvements of calculational methods are suggested. (author)

  16. Single-electron multiplication statistics as a combination of Poissonian pulse height distributions using constraint regression methods

    International Nuclear Information System (INIS)

    Ballini, J.-P.; Cazes, P.; Turpin, P.-Y.

    1976-01-01

    Analysing the histogram of anode pulse amplitudes allows a discussion of the hypothesis that has been proposed to account for the statistical processes of secondary multiplication in a photomultiplier. In an earlier work, good agreement was obtained between experimental and reconstructed spectra, assuming a first dynode distribution including two Poisson distributions of distinct mean values. This first approximation led to a search for a method which could give the weights of several Poisson distributions of distinct mean values. Three methods have been briefly exposed: classical linear regression, constraint regression (d'Esopo's method), and regression on variables subject to error. The use of these methods gives an approach of the frequency function which represents the dispersion of the punctual mean gain around the whole first dynode mean gain value. Comparison between this function and the one employed in Polya distribution allows the statement that the latter is inadequate to describe the statistical process of secondary multiplication. Numerous spectra obtained with two kinds of photomultiplier working under different physical conditions have been analysed. Then two points are discussed: - Does the frequency function represent the dynode structure and the interdynode collection process. - Is the model (the multiplication process of all dynodes but the first one, is Poissonian) valid whatever the photomultiplier and the utilization conditions. (Auth.)

  17. Evaluation of weather-based rice yield models in India

    Science.gov (United States)

    Sudharsan, D.; Adinarayana, J.; Reddy, D. Raji; Sreenivas, G.; Ninomiya, S.; Hirafuji, M.; Kiura, T.; Tanaka, K.; Desai, U. B.; Merchant, S. N.

    2013-01-01

    The objective of this study was to compare two different rice simulation models—standalone (Decision Support System for Agrotechnology Transfer [DSSAT]) and web based (SImulation Model for RIce-Weather relations [SIMRIW])—with agrometeorological data and agronomic parameters for estimation of rice crop production in southern semi-arid tropics of India. Studies were carried out on the BPT5204 rice variety to evaluate two crop simulation models. Long-term experiments were conducted in a research farm of Acharya N G Ranga Agricultural University (ANGRAU), Hyderabad, India. Initially, the results were obtained using 4 years (1994-1997) of data with weather parameters from a local weather station to evaluate DSSAT simulated results with observed values. Linear regression models used for the purpose showed a close relationship between DSSAT and observed yield. Subsequently, yield comparisons were also carried out with SIMRIW and DSSAT, and validated with actual observed values. Realizing the correlation coefficient values of SIMRIW simulation values in acceptable limits, further rice experiments in monsoon (Kharif) and post-monsoon (Rabi) agricultural seasons (2009, 2010 and 2011) were carried out with a location-specific distributed sensor network system. These proximal systems help to simulate dry weight, leaf area index and potential yield by the Java based SIMRIW on a daily/weekly/monthly/seasonal basis. These dynamic parameters are useful to the farming community for necessary decision making in a ubiquitous manner. However, SIMRIW requires fine tuning for better results/decision making.

  18. Using MODIS Data to Predict Regional Corn Yields

    Directory of Open Access Journals (Sweden)

    Ho-Young Ban

    2016-12-01

    Full Text Available A simple approach was developed to predict corn yields using the MoDerate Resolution Imaging Spectroradiometer (MODIS data product from two geographically separate major corn crop production regions: Illinois, USA and Heilongjiang, China. The MOD09A1 data, which are eight-day interval surface reflectance data, were obtained from day of the year (DOY 89 to 337 to calculate the leaf area index (LAI. The sum of the LAI from early in the season to a given date in the season (end of DOY (EOD was well fitted to a logistic function and represented seasonal changes in leaf area duration (LAD. A simple phenology model was derived to estimate the dates of emergence and maturity using the LAD-logistic function parameters b1 and b2, which represented the rate of increase in LAI and the date of maximum LAI, respectively. The phenology model predicted emergence and maturity dates fairly well, with root mean square error (RMSE values of 6.3 and 4.9 days for the validation dataset, respectively. Two simple linear regression models (YP and YF were established using LAD as the variable to predict corn yield. The yield model YP used LAD from predicted emergence to maturity, and the yield model YF used LAD for a predetermined period from DOY 89 to a particular EOD. When state/province corn yields for the validation dataset were predicted at DOY 321, near completion of the corn harvest, the YP model, including the predicted phenology, performed much better than the YF model, with RMSE values of 0.68 t/ha and 0.66 t/ha for Illinois and Heilongjiang, respectively. The YP model showed similar or better performance, even for the much earlier pre-harvest yield prediction at DOY 257. In addition, the model performance showed no difference between the two study regions with very different climates and cultivation methods, including cultivar and irrigation management. These results suggested that the approach described in this paper has potential for application to

  19. Regression modeling methods, theory, and computation with SAS

    CERN Document Server

    Panik, Michael

    2009-01-01

    Regression Modeling: Methods, Theory, and Computation with SAS provides an introduction to a diverse assortment of regression techniques using SAS to solve a wide variety of regression problems. The author fully documents the SAS programs and thoroughly explains the output produced by the programs.The text presents the popular ordinary least squares (OLS) approach before introducing many alternative regression methods. It covers nonparametric regression, logistic regression (including Poisson regression), Bayesian regression, robust regression, fuzzy regression, random coefficients regression,

  20. Semiparametric regression during 2003–2007

    KAUST Repository

    Ruppert, David; Wand, M.P.; Carroll, Raymond J.

    2009-01-01

    Semiparametric regression is a fusion between parametric regression and nonparametric regression that integrates low-rank penalized splines, mixed model and hierarchical Bayesian methodology – thus allowing more streamlined handling of longitudinal and spatial correlation. We review progress in the field over the five-year period between 2003 and 2007. We find semiparametric regression to be a vibrant field with substantial involvement and activity, continual enhancement and widespread application.

  1. Unbalanced Regressions and the Predictive Equation

    DEFF Research Database (Denmark)

    Osterrieder, Daniela; Ventosa-Santaulària, Daniel; Vera-Valdés, J. Eduardo

    Predictive return regressions with persistent regressors are typically plagued by (asymptotically) biased/inconsistent estimates of the slope, non-standard or potentially even spurious statistical inference, and regression unbalancedness. We alleviate the problem of unbalancedness in the theoreti......Predictive return regressions with persistent regressors are typically plagued by (asymptotically) biased/inconsistent estimates of the slope, non-standard or potentially even spurious statistical inference, and regression unbalancedness. We alleviate the problem of unbalancedness...

  2. Interrelationships of somatic cell count, mastitis, and milk yield in a low somatic cell count herd.

    Science.gov (United States)

    Deluyker, H A; Gay, J M; Weaver, L D

    1993-11-01

    In a high yielding low SCC herd, changes in milk yield associated with SCC and occurrence of clinical mastitis and differences in SCC with parity, clinical mastitis, and DIM were investigated. Milk yield data were obtained at every milking, and SCC was measured once every 48 h in 117 cows during the first 119 d postpartum. Effects of SCC and clinical mastitis on cumulative milk yield in the first 119 d postpartum were evaluated with least squares linear regression. Repeated measures ANOVA was used to detect changes in SCC. The SCC was highest at lactation onset, and cows with clinical mastitis had significantly higher SCC. During the 10 d prior to onset of clinical mastitis, SCC was higher in affected cows than in matched unaffected controls and surged just prior to diagnosis. During the 10-d period following a mastitis treatment, SCC differences between treated and control cows remained significant but became smaller with time and returned to the premastitis differences. Occurrence of clinical mastitis was associated with 5% milk yield loss. Cows with mean SCC > 245,000 cells/ml over the 119 d showed 6.2% yield loss compared with cows with SCC 245,000 cells/ml) because a greater percentage of cows (26%) had clinical mastitis than elevated SCC (12.5%).

  3. Comparison of multinomial logistic regression and logistic regression: which is more efficient in allocating land use?

    Science.gov (United States)

    Lin, Yingzhi; Deng, Xiangzheng; Li, Xing; Ma, Enjun

    2014-12-01

    Spatially explicit simulation of land use change is the basis for estimating the effects of land use and cover change on energy fluxes, ecology and the environment. At the pixel level, logistic regression is one of the most common approaches used in spatially explicit land use allocation models to determine the relationship between land use and its causal factors in driving land use change, and thereby to evaluate land use suitability. However, these models have a drawback in that they do not determine/allocate land use based on the direct relationship between land use change and its driving factors. Consequently, a multinomial logistic regression method was introduced to address this flaw, and thereby, judge the suitability of a type of land use in any given pixel in a case study area of the Jiangxi Province, China. A comparison of the two regression methods indicated that the proportion of correctly allocated pixels using multinomial logistic regression was 92.98%, which was 8.47% higher than that obtained using logistic regression. Paired t-test results also showed that pixels were more clearly distinguished by multinomial logistic regression than by logistic regression. In conclusion, multinomial logistic regression is a more efficient and accurate method for the spatial allocation of land use changes. The application of this method in future land use change studies may improve the accuracy of predicting the effects of land use and cover change on energy fluxes, ecology, and environment.

  4. Gastrointestinal Spatiotemporal mRNA Expression of Ghrelin vs Growth Hormone Receptor and New Growth Yield Machine Learning Model Based on Perturbation Theory.

    Science.gov (United States)

    Ran, Tao; Liu, Yong; Li, Hengzhi; Tang, Shaoxun; He, Zhixiong; Munteanu, Cristian R; González-Díaz, Humberto; Tan, Zhiliang; Zhou, Chuanshe

    2016-07-27

    The management of ruminant growth yield has economic importance. The current work presents a study of the spatiotemporal dynamic expression of Ghrelin and GHR at mRNA levels throughout the gastrointestinal tract (GIT) of kid goats under housing and grazing systems. The experiments show that the feeding system and age affected the expression of either Ghrelin or GHR with different mechanisms. Furthermore, the experimental data are used to build new Machine Learning models based on the Perturbation Theory, which can predict the effects of perturbations of Ghrelin and GHR mRNA expression on the growth yield. The models consider eight longitudinal GIT segments (rumen, abomasum, duodenum, jejunum, ileum, cecum, colon and rectum), seven time points (0, 7, 14, 28, 42, 56 and 70 d) and two feeding systems (Supplemental and Grazing feeding) as perturbations from the expected values of the growth yield. The best regression model was obtained using Random Forest, with the coefficient of determination R(2) of 0.781 for the test subset. The current results indicate that the non-linear regression model can accurately predict the growth yield and the key nodes during gastrointestinal development, which is helpful to optimize the feeding management strategies in ruminant production system.

  5. Adjusting for Confounding in Early Postlaunch Settings: Going Beyond Logistic Regression Models.

    Science.gov (United States)

    Schmidt, Amand F; Klungel, Olaf H; Groenwold, Rolf H H

    2016-01-01

    Postlaunch data on medical treatments can be analyzed to explore adverse events or relative effectiveness in real-life settings. These analyses are often complicated by the number of potential confounders and the possibility of model misspecification. We conducted a simulation study to compare the performance of logistic regression, propensity score, disease risk score, and stabilized inverse probability weighting methods to adjust for confounding. Model misspecification was induced in the independent derivation dataset. We evaluated performance using relative bias confidence interval coverage of the true effect, among other metrics. At low events per coefficient (1.0 and 0.5), the logistic regression estimates had a large relative bias (greater than -100%). Bias of the disease risk score estimates was at most 13.48% and 18.83%. For the propensity score model, this was 8.74% and >100%, respectively. At events per coefficient of 1.0 and 0.5, inverse probability weighting frequently failed or reduced to a crude regression, resulting in biases of -8.49% and 24.55%. Coverage of logistic regression estimates became less than the nominal level at events per coefficient ≤5. For the disease risk score, inverse probability weighting, and propensity score, coverage became less than nominal at events per coefficient ≤2.5, ≤1.0, and ≤1.0, respectively. Bias of misspecified disease risk score models was 16.55%. In settings with low events/exposed subjects per coefficient, disease risk score methods can be useful alternatives to logistic regression models, especially when propensity score models cannot be used. Despite better performance of disease risk score methods than logistic regression and propensity score models in small events per coefficient settings, bias, and coverage still deviated from nominal.

  6. Interpretation of commonly used statistical regression models.

    Science.gov (United States)

    Kasza, Jessica; Wolfe, Rory

    2014-01-01

    A review of some regression models commonly used in respiratory health applications is provided in this article. Simple linear regression, multiple linear regression, logistic regression and ordinal logistic regression are considered. The focus of this article is on the interpretation of the regression coefficients of each model, which are illustrated through the application of these models to a respiratory health research study. © 2013 The Authors. Respirology © 2013 Asian Pacific Society of Respirology.

  7. A test of inflated zeros for Poisson regression models.

    Science.gov (United States)

    He, Hua; Zhang, Hui; Ye, Peng; Tang, Wan

    2017-01-01

    Excessive zeros are common in practice and may cause overdispersion and invalidate inference when fitting Poisson regression models. There is a large body of literature on zero-inflated Poisson models. However, methods for testing whether there are excessive zeros are less well developed. The Vuong test comparing a Poisson and a zero-inflated Poisson model is commonly applied in practice. However, the type I error of the test often deviates seriously from the nominal level, rendering serious doubts on the validity of the test in such applications. In this paper, we develop a new approach for testing inflated zeros under the Poisson model. Unlike the Vuong test for inflated zeros, our method does not require a zero-inflated Poisson model to perform the test. Simulation studies show that when compared with the Vuong test our approach not only better at controlling type I error rate, but also yield more power.

  8. Linear regression

    CERN Document Server

    Olive, David J

    2017-01-01

    This text covers both multiple linear regression and some experimental design models. The text uses the response plot to visualize the model and to detect outliers, does not assume that the error distribution has a known parametric distribution, develops prediction intervals that work when the error distribution is unknown, suggests bootstrap hypothesis tests that may be useful for inference after variable selection, and develops prediction regions and large sample theory for the multivariate linear regression model that has m response variables. A relationship between multivariate prediction regions and confidence regions provides a simple way to bootstrap confidence regions. These confidence regions often provide a practical method for testing hypotheses. There is also a chapter on generalized linear models and generalized additive models. There are many R functions to produce response and residual plots, to simulate prediction intervals and hypothesis tests, to detect outliers, and to choose response trans...

  9. Regression modeling of ground-water flow

    Science.gov (United States)

    Cooley, R.L.; Naff, R.L.

    1985-01-01

    Nonlinear multiple regression methods are developed to model and analyze groundwater flow systems. Complete descriptions of regression methodology as applied to groundwater flow models allow scientists and engineers engaged in flow modeling to apply the methods to a wide range of problems. Organization of the text proceeds from an introduction that discusses the general topic of groundwater flow modeling, to a review of basic statistics necessary to properly apply regression techniques, and then to the main topic: exposition and use of linear and nonlinear regression to model groundwater flow. Statistical procedures are given to analyze and use the regression models. A number of exercises and answers are included to exercise the student on nearly all the methods that are presented for modeling and statistical analysis. Three computer programs implement the more complex methods. These three are a general two-dimensional, steady-state regression model for flow in an anisotropic, heterogeneous porous medium, a program to calculate a measure of model nonlinearity with respect to the regression parameters, and a program to analyze model errors in computed dependent variables such as hydraulic head. (USGS)

  10. yield and yield componemts of extra early maize (zea mays l.)

    African Journals Online (AJOL)

    SHARIFAI

    maize crop and improve the soil structures and chemical nutrients of the soil. The significant interaction between intra-row spacing and poultry manure on cob diameter, 100 grain weight and grain yield showed the importance of poultry manure on yield and yield components of maize crop. Poultry manure increases both ...

  11. An adapted yield criterion for the evolution of subsequent yield surfaces

    Science.gov (United States)

    Küsters, N.; Brosius, A.

    2017-09-01

    In numerical analysis of sheet metal forming processes, the anisotropic material behaviour is often modelled with isotropic work hardening and an average Lankford coefficient. In contrast, experimental observations show an evolution of the Lankford coefficients, which can be associated with a yield surface change due to kinematic and distortional hardening. Commonly, extensive efforts are carried out to describe these phenomena. In this paper an isotropic material model based on the Yld2000-2d criterion is adapted with an evolving yield exponent in order to change the yield surface shape. The yield exponent is linked to the accumulative plastic strain. This change has the effect of a rotating yield surface normal. As the normal is directly related to the Lankford coefficient, the change can be used to model the evolution of the Lankford coefficient during yielding. The paper will focus on the numerical implementation of the adapted material model for the FE-code LS-Dyna, mpi-version R7.1.2-d. A recently introduced identification scheme [1] is used to obtain the parameters for the evolving yield surface and will be briefly described for the proposed model. The suitability for numerical analysis will be discussed for deep drawing processes in general. Efforts for material characterization and modelling will be compared to other common yield surface descriptions. Besides experimental efforts and achieved accuracy, the potential of flexibility in material models and the risk of ambiguity during identification are of major interest in this paper.

  12. Dissolved-solids sources, loads, yields, and concentrations in streams of the conterminous United States

    Science.gov (United States)

    Anning, David W.; Flynn, Marilyn E.

    2014-01-01

    Recent studies have shown that excessive dissolved-solids concentrations in water can have adverse effects on the environment and on agricultural, domestic, municipal, and industrial water users. Such effects motivated the U.S. Geological Survey’s National Water Quality Assessment Program to develop a SPAtially-Referenced Regression on Watershed Attributes (SPARROW) model that has improved the understanding of sources, loads, yields, and concentrations of dissolved solids in streams of the conterminous United States.

  13. Extending the linear model with R generalized linear, mixed effects and nonparametric regression models

    CERN Document Server

    Faraway, Julian J

    2005-01-01

    Linear models are central to the practice of statistics and form the foundation of a vast range of statistical methodologies. Julian J. Faraway''s critically acclaimed Linear Models with R examined regression and analysis of variance, demonstrated the different methods available, and showed in which situations each one applies. Following in those footsteps, Extending the Linear Model with R surveys the techniques that grow from the regression model, presenting three extensions to that framework: generalized linear models (GLMs), mixed effect models, and nonparametric regression models. The author''s treatment is thoroughly modern and covers topics that include GLM diagnostics, generalized linear mixed models, trees, and even the use of neural networks in statistics. To demonstrate the interplay of theory and practice, throughout the book the author weaves the use of the R software environment to analyze the data of real examples, providing all of the R commands necessary to reproduce the analyses. All of the ...

  14. Comparison of Four Weighting Methods in Fuzzy-based Land Suitability to Predict Wheat Yield

    Directory of Open Access Journals (Sweden)

    Fatemeh Rahmati

    2017-06-01

    climatic conditions like mean, maximum and minimum air temperatures during growing period as well as edaphologic properties like EC, pH, ESP, percent of clay, silt, sand, gravel, gypsum and CaCO3 content. Climatic data collected from the Shahrekord synoptic station were used to assess climatic land suitability for wheat. Qualitative land suitability evaluation was carried out using the fuzzy approach. Potential yield was calculated using the method proposed by FAO. Using MATLAB software, qualitative and quantitative land evaluation were classified based on fuzzy logic approach. In fuzzy method, climatic factors are used to achieve climatic index. Clay and sand percent were applied to calculate soil texture. To determine the membership degrees,bell membership functions were used. Parameters of function shapes were transformed to equations with variable coefficients and the best coefficients were eventually chosen based on the model determination coefficient. In evaluation method based on fuzzy logic, the weights are used for land characteristics. In fuzzy logic method, weights were calculated by four methods. These methods consist of neural network using 1 neuron and 4 neurons, multivariate and Partial Least Squares (PLS regressions. Comparison of the coefficient of determination results of multivariate regression and RMSE is carried out between observed and predicted yield. Weight calculations were conducted by using MINITAB software to PLS and multivariate regression. Also, Neurosolution 5 was used for weight calculation based on neural network. Results and Discussion: The calculated weights were differed by using the four applied methods. In all methods, the maximum weight was related to gravel, and minimum weight was related to clay. The results of land index and predicted yield calculation were different in some points (3, 6, 7, 13, 14, 19, and 21 for four methods. The coefficient of determination of calculated weights were 0.595, 0.56, 0.6 and 0.56 for neural network

  15. Inheritance of grain yield and its correlation with yield components in ...

    African Journals Online (AJOL)

    SAM

    2014-03-19

    Mar 19, 2014 ... 7 × 7 incomplete diallel cross of seven wheat parents during the crop season of 2009 to 2010. Mean square of general ... Genetic background and yield traits of the seven parents. Parent. Pedigree. Released year ..... Correlation and path analysis for yield and yield contributing characters in wheat (Triticum ...

  16. A comparison of random forest regression and multiple linear regression for prediction in neuroscience.

    Science.gov (United States)

    Smith, Paul F; Ganesh, Siva; Liu, Ping

    2013-10-30

    Regression is a common statistical tool for prediction in neuroscience. However, linear regression is by far the most common form of regression used, with regression trees receiving comparatively little attention. In this study, the results of conventional multiple linear regression (MLR) were compared with those of random forest regression (RFR), in the prediction of the concentrations of 9 neurochemicals in the vestibular nucleus complex and cerebellum that are part of the l-arginine biochemical pathway (agmatine, putrescine, spermidine, spermine, l-arginine, l-ornithine, l-citrulline, glutamate and γ-aminobutyric acid (GABA)). The R(2) values for the MLRs were higher than the proportion of variance explained values for the RFRs: 6/9 of them were ≥ 0.70 compared to 4/9 for RFRs. Even the variables that had the lowest R(2) values for the MLRs, e.g. ornithine (0.50) and glutamate (0.61), had much lower proportion of variance explained values for the RFRs (0.27 and 0.49, respectively). The RSE values for the MLRs were lower than those for the RFRs in all but two cases. In general, MLRs seemed to be superior to the RFRs in terms of predictive value and error. In the case of this data set, MLR appeared to be superior to RFR in terms of its explanatory value and error. This result suggests that MLR may have advantages over RFR for prediction in neuroscience with this kind of data set, but that RFR can still have good predictive value in some cases. Copyright © 2013 Elsevier B.V. All rights reserved.

  17. Logistic regression applied to natural hazards: rare event logistic regression with replications

    OpenAIRE

    Guns, M.; Vanacker, Veerle

    2012-01-01

    Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logisti...

  18. Multilevel covariance regression with correlated random effects in the mean and variance structure.

    Science.gov (United States)

    Quintero, Adrian; Lesaffre, Emmanuel

    2017-09-01

    Multivariate regression methods generally assume a constant covariance matrix for the observations. In case a heteroscedastic model is needed, the parametric and nonparametric covariance regression approaches can be restrictive in the literature. We propose a multilevel regression model for the mean and covariance structure, including random intercepts in both components and allowing for correlation between them. The implied conditional covariance function can be different across clusters as a result of the random effect in the variance structure. In addition, allowing for correlation between the random intercepts in the mean and covariance makes the model convenient for skewedly distributed responses. Furthermore, it permits us to analyse directly the relation between the mean response level and the variability in each cluster. Parameter estimation is carried out via Gibbs sampling. We compare the performance of our model to other covariance modelling approaches in a simulation study. Finally, the proposed model is applied to the RN4CAST dataset to identify the variables that impact burnout of nurses in Belgium. © 2017 WILEY-VCH Verlag GmbH & Co. KGaA, Weinheim.

  19. Bivariate least squares linear regression: Towards a unified analytic formalism. I. Functional models

    Science.gov (United States)

    Caimmi, R.

    2011-08-01

    Concerning bivariate least squares linear regression, the classical approach pursued for functional models in earlier attempts ( York, 1966, 1969) is reviewed using a new formalism in terms of deviation (matrix) traces which, for unweighted data, reduce to usual quantities leaving aside an unessential (but dimensional) multiplicative factor. Within the framework of classical error models, the dependent variable relates to the independent variable according to the usual additive model. The classes of linear models considered are regression lines in the general case of correlated errors in X and in Y for weighted data, and in the opposite limiting situations of (i) uncorrelated errors in X and in Y, and (ii) completely correlated errors in X and in Y. The special case of (C) generalized orthogonal regression is considered in detail together with well known subcases, namely: (Y) errors in X negligible (ideally null) with respect to errors in Y; (X) errors in Y negligible (ideally null) with respect to errors in X; (O) genuine orthogonal regression; (R) reduced major-axis regression. In the limit of unweighted data, the results determined for functional models are compared with their counterparts related to extreme structural models i.e. the instrumental scatter is negligible (ideally null) with respect to the intrinsic scatter ( Isobe et al., 1990; Feigelson and Babu, 1992). While regression line slope and intercept estimators for functional and structural models necessarily coincide, the contrary holds for related variance estimators even if the residuals obey a Gaussian distribution, with the exception of Y models. An example of astronomical application is considered, concerning the [O/H]-[Fe/H] empirical relations deduced from five samples related to different stars and/or different methods of oxygen abundance determination. For selected samples and assigned methods, different regression models yield consistent results within the errors (∓ σ) for both

  20. Variability in soybean yield in Brazil stemming from the interaction of heterogeneous management and climate variability

    Science.gov (United States)

    Cohn, A.; Bragança, A.; Jeffries, G. R.

    2017-12-01

    An increasing share of global agricultural production can be found in the humid tropics. Therefore, an improved understanding of the mechanisms governing variability in the output of tropical agricultural systems is of increasing importance for food security including through climate change adaptation. Yet, the long window over which many tropical crops can be sown, the diversity of crop varieties and management practices combine to challenge inference into climate risk to cropping output in analyses of tropical crop-climate sensitivity employing administrative data. In this paper, we leverage a newly developed spatially explicit dataset of soybean yields in Brazil to combat this problem. The dataset was built by training a model of remotely-sensed vegetation index data and land cover classification data using a rich in situ dataset of soybean yield and management variables collected over the period 2006 to 2016. The dataset contains soybean yields by plant date, cropping frequency, and maturity group for each 5km grid cell in Brazil. We model variation in these yields using an approach enabling the estimation of the influence of management factors on the sensitivity of soybean yields to variability in: cumulative solar radiation, extreme degree days, growing degree days, flooding rain in the harvest period, and dry spells in the rainy season. We find strong variation in climate sensitivity by management class. Planting date and maturity group each explained a great deal more variation in yield sensitivity than did cropping frequency. Brazil collects comparatively fine spatial resolution yield data. But, our attempt to replicate our results using administrative soy yield data revealed substantially lesser crop-climate sensitivity; suggesting that previous analyses employing administrative data may have underestimated climate risk to tropical soy production.

  1. A spline-based regression parameter set for creating customized DARTEL MRI brain templates from infancy to old age.

    Science.gov (United States)

    Wilke, Marko

    2018-02-01

    This dataset contains the regression parameters derived by analyzing segmented brain MRI images (gray matter and white matter) from a large population of healthy subjects, using a multivariate adaptive regression splines approach. A total of 1919 MRI datasets ranging in age from 1-75 years from four publicly available datasets (NIH, C-MIND, fCONN, and IXI) were segmented using the CAT12 segmentation framework, writing out gray matter and white matter images normalized using an affine-only spatial normalization approach. These images were then subjected to a six-step DARTEL procedure, employing an iterative non-linear registration approach and yielding increasingly crisp intermediate images. The resulting six datasets per tissue class were then analyzed using multivariate adaptive regression splines, using the CerebroMatic toolbox. This approach allows for flexibly modelling smoothly varying trajectories while taking into account demographic (age, gender) as well as technical (field strength, data quality) predictors. The resulting regression parameters described here can be used to generate matched DARTEL or SHOOT templates for a given population under study, from infancy to old age. The dataset and the algorithm used to generate it are publicly available at https://irc.cchmc.org/software/cerebromatic.php.

  2. Effects of cutting frequency on alfalfa yield and yield components in ...

    African Journals Online (AJOL)

    Effects of cutting frequency on alfalfa yield and yield components in Songnen Plain, Northeast China. J Chen, F Tang, R Zhu, C Gao, G Di, Y Zhang. Abstract. The productivity and quality of alfalfa (Medicago sativa L.) is strongly influenced by cutting frequency (F). To clarify that the yield and quality of alfalfa if affected by F, ...

  3. A Seemingly Unrelated Poisson Regression Model

    OpenAIRE

    King, Gary

    1989-01-01

    This article introduces a new estimator for the analysis of two contemporaneously correlated endogenous event count variables. This seemingly unrelated Poisson regression model (SUPREME) estimator combines the efficiencies created by single equation Poisson regression model estimators and insights from "seemingly unrelated" linear regression models.

  4. Systematic review of treatment modalities for gingival depigmentation: a random-effects poisson regression analysis.

    Science.gov (United States)

    Lin, Yi Hung; Tu, Yu Kang; Lu, Chun Tai; Chung, Wen Chen; Huang, Chiung Fang; Huang, Mao Suan; Lu, Hsein Kun

    2014-01-01

    Repigmentation variably occurs with different treatment methods in patients with gingival pigmentation. A systemic review was conducted of various treatment modalities for eliminating melanin pigmentation of the gingiva, comprising bur abrasion, scalpel surgery, cryosurgery, electrosurgery, gingival grafts, and laser techniques, to compare the recurrence rates (Rrs) of these treatment procedures. Electronic databases, including PubMed, Web of Science, Google, and Medline were comprehensively searched, and manual searches were conducted for studies published from January 1951 to June 2013. After applying inclusion and exclusion criteria, the final list of articles was reviewed in depth to achieve the objectives of this review. A Poisson regression was used to analyze the outcome of depigmentation using the various treatment methods. The systematic review was based on case reports mainly. In total, 61 eligible publications met the defined criteria. The various therapeutic procedures showed variable clinical results with a wide range of Rrs. A random-effects Poisson regression showed that cryosurgery (Rr = 0.32%), electrosurgery (Rr = 0.74%), and laser depigmentation (Rr = 1.16%) yielded superior result, whereas bur abrasion yielded the highest Rr (8.89%). Within the limit of the sampling level, the present evidence-based results show that cryosurgery exhibits the optimal predictability for depigmentation of the gingiva among all procedures examined, followed by electrosurgery and laser techniques. It is possible to treat melanin pigmentation of the gingiva with various methods and prevent repigmentation. Among those treatment modalities, cryosurgery, electrosurgery, and laser surgery appear to be the best choices for treating gingival pigmentation. © 2014 Wiley Periodicals, Inc.

  5. Multimodel Inference for the Prediction of Disease Symptoms and Yield Loss of Potato in a Two-Year Crop Rotation Experiment

    NARCIS (Netherlands)

    Berg, van den W.; Vos, J.; Grasman, J.

    2012-01-01

    The second order Akaike information criterion was used for the assessment of 139 regression models for three responses of potato test crops: (a) incidence of Spongospora subterranea on the harvested tubers, (b) percentage of haulms infected with Verticillium dahliae, and (c) tuber yield. Six

  6. Comparison of logistic regression and neural models in predicting the outcome of biopsy in breast cancer from MRI findings

    International Nuclear Information System (INIS)

    Abdolmaleki, P.; Yarmohammadi, M.; Gity, M.

    2004-01-01

    Background: We designed an algorithmic model based on regression analysis and a non-algorithmic model based on the Artificial Neural Network. Materials and methods: The ability of these models was compared together in clinical application to differentiate malignant from benign breast tumors in a study group of 161 patient's records. Each patient's record consisted of 6 subjective features extracted from MRI appearance. These findings were enclosed as features extracted for an Artificial Neural Network as well as a logistic regression model to predict biopsy outcome. After both models had been trained perfectly on samples (n=100), the validation samples (n=61) were presented to the trained network as well as the established logistic regression models. Finally, the diagnostic performance of models were compared to the that of the radiologist in terms of sensitivity, specificity and accuracy, using receiver operating characteristic curve analysis. Results: The average out put of the Artificial Neural Network yielded a perfect sensitivity (98%) and high accuracy (90%) similar to that one of an expert radiologist (96% and 92%) while specificity was smaller than that (67%) verses 80%). The output of the logistic regression model using significant features showed improvement in specificity from 60% for the logistic regression model using all features to 93% for the reduced logistic regression model, keeping the accuracy around 90%. Conclusion: Results show that Artificial Neural Network and logistic regression model prove the relationship between extracted morphological features and biopsy results. Using statistically significant variables reduced logistic regression model outperformed of Artificial Neural Network with remarkable specificity while keeping high sensitivity is achieved

  7. Recursive Algorithm For Linear Regression

    Science.gov (United States)

    Varanasi, S. V.

    1988-01-01

    Order of model determined easily. Linear-regression algorithhm includes recursive equations for coefficients of model of increased order. Algorithm eliminates duplicative calculations, facilitates search for minimum order of linear-regression model fitting set of data satisfactory.

  8. Applied regression analysis a research tool

    CERN Document Server

    Pantula, Sastry; Dickey, David

    1998-01-01

    Least squares estimation, when used appropriately, is a powerful research tool. A deeper understanding of the regression concepts is essential for achieving optimal benefits from a least squares analysis. This book builds on the fundamentals of statistical methods and provides appropriate concepts that will allow a scientist to use least squares as an effective research tool. Applied Regression Analysis is aimed at the scientist who wishes to gain a working knowledge of regression analysis. The basic purpose of this book is to develop an understanding of least squares and related statistical methods without becoming excessively mathematical. It is the outgrowth of more than 30 years of consulting experience with scientists and many years of teaching an applied regression course to graduate students. Applied Regression Analysis serves as an excellent text for a service course on regression for non-statisticians and as a reference for researchers. It also provides a bridge between a two-semester introduction to...

  9. Low dose irradiation influence on yield and quality of fruit juice

    International Nuclear Information System (INIS)

    Mitchell, G.E.; Isaacs, A.R.; Williams, D.J.; McLauchlan, R.L.; Nottingham, S.M.; Hammerton, K.

    1991-01-01

    Granny Smith apples, Valencia oranges, and Pearlette grapes grown in Queensland, Australia were irradiated at 0, 75, 300 and 600 Gy. Following irradiation, juice was extracted and analysed for quality and compositional changes. Irradiation treatment significantly (p0.05) decreased yield of apple juice (by 6.3% w/w at 600 Gy) and grape juice (by 4.8% w/w at 600 Gy) but did not significantly (p0.05) affect yield of orange juice (OJ). Acceptability significantly (p0.05) decreased in OJ after 600 Gy treatment. Other changes in quality and composition were minimal. Juice from irradiated apples and oranges stored at 5 degrees C for 8 wk showed similar results

  10. Forecasting olive crop yields based on long-term aero biological data series and bio climatic conditions for the southern Iberian Peninsula

    Energy Technology Data Exchange (ETDEWEB)

    Aguilera, F.; Ruiz-Valenzuela, L.

    2014-06-01

    In the present study, bio-meteorological models for predicting olive-crop production in the southern Iberian Peninsula were developed. These covered a 16-year period: 1994-2009. The forecasting models were constructed using the partial least-squares regression method, taking the annual olive yield as the dependent variable, and both aero biological and meteorological parameters as the independent variables. Two regression models were built for the prediction of crop production prior to the final harvest at two different times of the year: July and November. The percentage variance explained by the models was between 83% and 93%. Through these forecasting models, the main factors that influence olive-crop yield were identified. Pollen index and accumulated precipitation, especially as rain recorded during the pre-flowering months, were the most important parameters for providing an explanation of fluctuations in fruit production. The temperature recorded during the two months preceding budburst was another important variable, which showed positive effects on the final yield. The July model that provides accurate predictions of fruit production eight months prior to the final harvest is proposed as an optimal model to forecast fruit produced by olive trees in western Mediterranean areas. (Author)

  11. A high throughput DNA extraction method with high yield and quality

    Directory of Open Access Journals (Sweden)

    Xin Zhanguo

    2012-07-01

    Full Text Available Abstract Background Preparation of large quantity and high quality genomic DNA from a large number of plant samples is a major bottleneck for most genetic and genomic analyses, such as, genetic mapping, TILLING (Targeting Induced Local Lesion IN Genome, and next-generation sequencing directly from sheared genomic DNA. A variety of DNA preparation methods and commercial kits are available. However, they are either low throughput, low yield, or costly. Here, we describe a method for high throughput genomic DNA isolation from sorghum [Sorghum bicolor (L. Moench] leaves and dry seeds with high yield, high quality, and affordable cost. Results We developed a high throughput DNA isolation method by combining a high yield CTAB extraction method with an improved cleanup procedure based on MagAttract kit. The method yielded large quantity and high quality DNA from both lyophilized sorghum leaves and dry seeds. The DNA yield was improved by nearly 30 fold with 4 times less consumption of MagAttract beads. The method can also be used in other plant species, including cotton leaves and pine needles. Conclusion A high throughput system for DNA extraction from sorghum leaves and seeds was developed and validated. The main advantages of the method are low cost, high yield, high quality, and high throughput. One person can process two 96-well plates in a working day at a cost of $0.10 per sample of magnetic beads plus other consumables that other methods will also need.

  12. Rice yield estimation based on weather conditions and on technological level of production systems in Brazil Estimativa da produtividade de arroz baseada em condições meteorológicas e no nível tecnológico dos sistemas de produção no Brasil

    Directory of Open Access Journals (Sweden)

    José Eduardo Boffino de Almeida Monteiro

    2013-02-01

    Full Text Available The objective of this work was to evaluate an estimation system for rice yield in Brazil, based on simple agrometeorological models and on the technological level of production systems. This estimation system incorporates the conceptual basis proposed by Doorenbos & Kassam for potential and attainable yields with empirical adjusts for maximum yield and crop sensitivity to water deficit, considering five categories of rice yield. Rice yield was estimated from 2000/2001 to 2007/2008, and compared to IBGE yield data. Regression analyses between model estimates and data from IBGE surveys resulted in significant coefficients of determination, with less dispersion in the South than in the North and Northeast regions of the country. Index of model efficiency (E1' ranged from 0.01 in the lower yield classes to 0.45 in higher ones, and mean absolute error ranged from 58 to 250 kg ha‑1, respectively.O objetivo deste trabalho foi avaliar um sistema de estimativa de produtividade da cultura do arroz no Brasil, com base em modelos agrometeorológicos simples e no nível tecnológico dos sistemas produtivos. Este sistema de estimativa incorpora as bases conceituais propostas por Doorenbos & Kassam para produtividades potenciais e alcançáveis com ajustes empíricos para produtividade máxima e sensibilidade da cultura à deficiência hídrica, considerando-se cinco classes de produtividade de arroz. A produtividade de arroz foi estimada de 2000/2001 a 2007/2008 e comparada aos dados do IBGE. Análises de regressão entre as estimativas do modelo e dados do IBGE resultaram em coeficientes de determinação significativos, com menor dispersão na região Sul do que nas regiões Norte e Nordeste do país. O índice de eficiência do modelo (E1' variou de 0,01, nas classes de menor produtividade, a 0,45 nas classes de maior produtividade, e o erro médio absoluto variou de 58 a 250 kg ha‑1, respectivamente.

  13. A Casting Yield Optimization Case Study: Forging Ram

    DEFF Research Database (Denmark)

    Kotas, Petr; Tutum, Cem Celal; Hattel, Jesper Henri

    2010-01-01

    This work summarizes the findings of multi-objective optimization of a gravity sand-cast steel part for which an increase of the casting yield via riser optimization was considered. This was accomplished by coupling a casting simulation software package with an optimization module. The benefits...... of this approach, recently adopted in foundry industry world wide and based on fully automated computer optimization, were demonstrated. First, analyses of filling and solidification of the original casting design were conducted in the standard simulation environment to determine potential flaws and inadequacies...

  14. Standards for Standardized Logistic Regression Coefficients

    Science.gov (United States)

    Menard, Scott

    2011-01-01

    Standardized coefficients in logistic regression analysis have the same utility as standardized coefficients in linear regression analysis. Although there has been no consensus on the best way to construct standardized logistic regression coefficients, there is now sufficient evidence to suggest a single best approach to the construction of a…

  15. [Application of negative binomial regression and modified Poisson regression in the research of risk factors for injury frequency].

    Science.gov (United States)

    Cao, Qingqing; Wu, Zhenqiang; Sun, Ying; Wang, Tiezhu; Han, Tengwei; Gu, Chaomei; Sun, Yehuan

    2011-11-01

    To Eexplore the application of negative binomial regression and modified Poisson regression analysis in analyzing the influential factors for injury frequency and the risk factors leading to the increase of injury frequency. 2917 primary and secondary school students were selected from Hefei by cluster random sampling method and surveyed by questionnaire. The data on the count event-based injuries used to fitted modified Poisson regression and negative binomial regression model. The risk factors incurring the increase of unintentional injury frequency for juvenile students was explored, so as to probe the efficiency of these two models in studying the influential factors for injury frequency. The Poisson model existed over-dispersion (P Poisson regression and negative binomial regression model, was fitted better. respectively. Both showed that male gender, younger age, father working outside of the hometown, the level of the guardian being above junior high school and smoking might be the results of higher injury frequencies. On a tendency of clustered frequency data on injury event, both the modified Poisson regression analysis and negative binomial regression analysis can be used. However, based on our data, the modified Poisson regression fitted better and this model could give a more accurate interpretation of relevant factors affecting the frequency of injury.

  16. Logistic regression for dichotomized counts.

    Science.gov (United States)

    Preisser, John S; Das, Kalyan; Benecha, Habtamu; Stamm, John W

    2016-12-01

    Sometimes there is interest in a dichotomized outcome indicating whether a count variable is positive or zero. Under this scenario, the application of ordinary logistic regression may result in efficiency loss, which is quantifiable under an assumed model for the counts. In such situations, a shared-parameter hurdle model is investigated for more efficient estimation of regression parameters relating to overall effects of covariates on the dichotomous outcome, while handling count data with many zeroes. One model part provides a logistic regression containing marginal log odds ratio effects of primary interest, while an ancillary model part describes the mean count of a Poisson or negative binomial process in terms of nuisance regression parameters. Asymptotic efficiency of the logistic model parameter estimators of the two-part models is evaluated with respect to ordinary logistic regression. Simulations are used to assess the properties of the models with respect to power and Type I error, the latter investigated under both misspecified and correctly specified models. The methods are applied to data from a randomized clinical trial of three toothpaste formulations to prevent incident dental caries in a large population of Scottish schoolchildren. © The Author(s) 2014.

  17. Quantifying Shapes: Mathematical Techniques for Analysing Visual Representations of Sound and Music

    Directory of Open Access Journals (Sweden)

    Genevieve L. Noyce

    2013-12-01

    Full Text Available Research on auditory-visual correspondences has a long tradition but innovative experimental paradigms and analytic tools are sparse. In this study, we explore different ways of analysing real-time visual representations of sound and music drawn by both musically-trained and untrained individuals. To that end, participants' drawing responses captured by an electronic graphics tablet were analysed using various regression, clustering, and classification techniques. Results revealed that a Gaussian process (GP regression model with a linear plus squared-exponential covariance function was able to model the data sufficiently, whereas a simpler GP was not a good fit. Spectral clustering analysis was the best of a variety of clustering techniques, though no strong groupings are apparent in these data. This was confirmed by variational Bayes analysis, which only fitted one Gaussian over the dataset. Slight trends in the optimised hyperparameters between musically-trained and untrained individuals allowed for the building of a successful GP classifier that differentiated between these two groups. In conclusion, this set of techniques provides useful mathematical tools for analysing real-time visualisations of sound and can be applied to similar datasets as well.

  18. Bayesian ARTMAP for regression.

    Science.gov (United States)

    Sasu, L M; Andonie, R

    2013-10-01

    Bayesian ARTMAP (BA) is a recently introduced neural architecture which uses a combination of Fuzzy ARTMAP competitive learning and Bayesian learning. Training is generally performed online, in a single-epoch. During training, BA creates input data clusters as Gaussian categories, and also infers the conditional probabilities between input patterns and categories, and between categories and classes. During prediction, BA uses Bayesian posterior probability estimation. So far, BA was used only for classification. The goal of this paper is to analyze the efficiency of BA for regression problems. Our contributions are: (i) we generalize the BA algorithm using the clustering functionality of both ART modules, and name it BA for Regression (BAR); (ii) we prove that BAR is a universal approximator with the best approximation property. In other words, BAR approximates arbitrarily well any continuous function (universal approximation) and, for every given continuous function, there is one in the set of BAR approximators situated at minimum distance (best approximation); (iii) we experimentally compare the online trained BAR with several neural models, on the following standard regression benchmarks: CPU Computer Hardware, Boston Housing, Wisconsin Breast Cancer, and Communities and Crime. Our results show that BAR is an appropriate tool for regression tasks, both for theoretical and practical reasons. Copyright © 2013 Elsevier Ltd. All rights reserved.

  19. Mechanisms of neuroblastoma regression

    Science.gov (United States)

    Brodeur, Garrett M.; Bagatell, Rochelle

    2014-01-01

    Recent genomic and biological studies of neuroblastoma have shed light on the dramatic heterogeneity in the clinical behaviour of this disease, which spans from spontaneous regression or differentiation in some patients, to relentless disease progression in others, despite intensive multimodality therapy. This evidence also suggests several possible mechanisms to explain the phenomena of spontaneous regression in neuroblastomas, including neurotrophin deprivation, humoral or cellular immunity, loss of telomerase activity and alterations in epigenetic regulation. A better understanding of the mechanisms of spontaneous regression might help to identify optimal therapeutic approaches for patients with these tumours. Currently, the most druggable mechanism is the delayed activation of developmentally programmed cell death regulated by the tropomyosin receptor kinase A pathway. Indeed, targeted therapy aimed at inhibiting neurotrophin receptors might be used in lieu of conventional chemotherapy or radiation in infants with biologically favourable tumours that require treatment. Alternative approaches consist of breaking immune tolerance to tumour antigens or activating neurotrophin receptor pathways to induce neuronal differentiation. These approaches are likely to be most effective against biologically favourable tumours, but they might also provide insights into treatment of biologically unfavourable tumours. We describe the different mechanisms of spontaneous neuroblastoma regression and the consequent therapeutic approaches. PMID:25331179

  20. Using the Ridge Regression Procedures to Estimate the Multiple Linear Regression Coefficients

    Science.gov (United States)

    Gorgees, HazimMansoor; Mahdi, FatimahAssim

    2018-05-01

    This article concerns with comparing the performance of different types of ordinary ridge regression estimators that have been already proposed to estimate the regression parameters when the near exact linear relationships among the explanatory variables is presented. For this situations we employ the data obtained from tagi gas filling company during the period (2008-2010). The main result we reached is that the method based on the condition number performs better than other methods since it has smaller mean square error (MSE) than the other stated methods.