WorldWideScience

Sample records for regression identified factors

  1. Using exploratory regression to identify optimal driving factors for cellular automaton modeling of land use change.

    Science.gov (United States)

    Feng, Yongjiu; Tong, Xiaohua

    2017-09-22

    Defining transition rules is an important issue in cellular automaton (CA)-based land use modeling because these models incorporate highly correlated driving factors. Multicollinearity among correlated driving factors may produce negative effects that must be eliminated from the modeling. Using exploratory regression under pre-defined criteria, we identified all possible combinations of factors from the candidate factors affecting land use change. Three combinations that incorporate five driving factors meeting pre-defined criteria were assessed. With the selected combinations of factors, three logistic regression-based CA models were built to simulate dynamic land use change in Shanghai, China, from 2000 to 2015. For comparative purposes, a CA model with all candidate factors was also applied to simulate the land use change. Simulations using three CA models with multicollinearity eliminated performed better (with accuracy improvements about 3.6%) than the model incorporating all candidate factors. Our results showed that not all candidate factors are necessary for accurate CA modeling and the simulations were not sensitive to changes in statistically non-significant driving factors. We conclude that exploratory regression is an effective method to search for the optimal combinations of driving factors, leading to better land use change models that are devoid of multicollinearity. We suggest identification of dominant factors and elimination of multicollinearity before building land change models, making it possible to simulate more realistic outcomes.

  2. A regression tree for identifying combinations of fall risk factors associated to recurrent falling: a cross-sectional elderly population-based study.

    Science.gov (United States)

    Kabeshova, A; Annweiler, C; Fantino, B; Philip, T; Gromov, V A; Launay, C P; Beauchet, O

    2014-06-01

    Regression tree (RT) analyses are particularly adapted to explore the risk of recurrent falling according to various combinations of fall risk factors compared to logistic regression models. The aims of this study were (1) to determine which combinations of fall risk factors were associated with the occurrence of recurrent falls in older community-dwellers, and (2) to compare the efficacy of RT and multiple logistic regression model for the identification of recurrent falls. A total of 1,760 community-dwelling volunteers (mean age ± standard deviation, 71.0 ± 5.1 years; 49.4 % female) were recruited prospectively in this cross-sectional study. Age, gender, polypharmacy, use of psychoactive drugs, fear of falling (FOF), cognitive disorders and sad mood were recorded. In addition, the history of falls within the past year was recorded using a standardized questionnaire. Among 1,760 participants, 19.7 % (n = 346) were recurrent fallers. The RT identified 14 nodes groups and 8 end nodes with FOF as the first major split. Among participants with FOF, those who had sad mood and polypharmacy formed the end node with the greatest OR for recurrent falls (OR = 6.06 with p falls (OR = 0.25 with p factors for recurrent falls, the combination most associated with recurrent falls involving FOF, sad mood and polypharmacy. The FOF emerged as the risk factor strongly associated with recurrent falls. In addition, RT and multiple logistic regression were not sensitive enough to identify the majority of recurrent fallers but appeared efficient in detecting individuals not at risk of recurrent falls.

  3. Identifying Environmental and Social Factors Predisposing to Pathological Gambling Combining Standard Logistic Regression and Logic Learning Machine.

    Science.gov (United States)

    Parodi, Stefano; Dosi, Corrado; Zambon, Antonella; Ferrari, Enrico; Muselli, Marco

    2017-12-01

    Identifying potential risk factors for problem gambling (PG) is of primary importance for planning preventive and therapeutic interventions. We illustrate a new approach based on the combination of standard logistic regression and an innovative method of supervised data mining (Logic Learning Machine or LLM). Data were taken from a pilot cross-sectional study to identify subjects with PG behaviour, assessed by two internationally validated scales (SOGS and Lie/Bet). Information was obtained from 251 gamblers recruited in six betting establishments. Data on socio-demographic characteristics, lifestyle and cognitive-related factors, and type, place and frequency of preferred gambling were obtained by a self-administered questionnaire. The following variables associated with PG were identified: instant gratification games, alcohol abuse, cognitive distortion, illegal behaviours and having started gambling with a relative or a friend. Furthermore, the combination of LLM and LR indicated the presence of two different types of PG, namely: (a) daily gamblers, more prone to illegal behaviour, with poor money management skills and who started gambling at an early age, and (b) non-daily gamblers, characterised by superstitious beliefs and a higher preference for immediate reward games. Finally, instant gratification games were strongly associated with the number of games usually played. Studies on gamblers habitually frequently betting shops are rare. The finding of different types of PG by habitual gamblers deserves further analysis in larger studies. Advanced data mining algorithms, like LLM, are powerful tools and potentially useful in identifying risk factors for PG.

  4. APPLICATION OF MULTIPLE LOGISTIC REGRESSION, BAYESIAN LOGISTIC AND CLASSIFICATION TREE TO IDENTIFY THE SIGNIFICANT FACTORS INFLUENCING CRASH SEVERITY

    Directory of Open Access Journals (Sweden)

    MILAD TAZIK

    2017-11-01

    Full Text Available Identifying cases in which road crashes result in fatality or injury of drivers may help improve their safety. In this study, datasets of crashes happened in TehranQom freeway, Iran, were examined by three models (multiple logistic regression, Bayesian logistic and classification tree to analyse the contribution of several variables to fatal accidents. For multiple logistic regression and Bayesian logistic models, the odds ratio was calculated for each variable. The model which best suited the identification of accident severity was determined based on AIC and DIC criteria. Based on the results of these two models, rollover crashes (OR = 14.58, %95 CI: 6.8-28.6, not using of seat belt (OR = 5.79, %95 CI: 3.1-9.9, exceeding speed limits (OR = 4.02, %95 CI: 1.8-7.9 and being female (OR = 2.91, %95 CI: 1.1-6.1 were the most important factors in fatalities of drivers. In addition, the results of the classification tree model have verified the findings of the other models.

  5. Identifying the critical success factors in the coverage of low vision services using the classification analysis and regression tree methodology.

    Science.gov (United States)

    Chiang, Peggy Pei-Chia; Xie, Jing; Keeffe, Jill Elizabeth

    2011-04-25

    To identify the critical success factors (CSF) associated with coverage of low vision services. Data were collected from a survey distributed to Vision 2020 contacts, government, and non-government organizations (NGOs) in 195 countries. The Classification and Regression Tree Analysis (CART) was used to identify the critical success factors of low vision service coverage. Independent variables were sourced from the survey: policies, epidemiology, provision of services, equipment and infrastructure, barriers to services, human resources, and monitoring and evaluation. Socioeconomic and demographic independent variables: health expenditure, population statistics, development status, and human resources in general, were sourced from the World Health Organization (WHO), World Bank, and the United Nations (UN). The findings identified that having >50% of children obtaining devices when prescribed (χ(2) = 44; P 3 rehabilitation workers per 10 million of population (χ(2) = 4.50; P = 0.034), higher percentage of population urbanized (χ(2) = 14.54; P = 0.002), a level of private investment (χ(2) = 14.55; P = 0.015), and being fully funded by government (χ(2) = 6.02; P = 0.014), are critical success factors associated with coverage of low vision services. This study identified the most important predictors for countries with better low vision coverage. The CART is a useful and suitable methodology in survey research and is a novel way to simplify a complex global public health issue in eye care.

  6. Spontaneous regression of retinopathy of prematurity:incidence and predictive factors

    Directory of Open Access Journals (Sweden)

    Rui-Hong Ju

    2013-08-01

    Full Text Available AIM:To evaluate the incidence of spontaneous regression of changes in the retina and vitreous in active stage of retinopathy of prematurity(ROP and identify the possible relative factors during the regression.METHODS: This was a retrospective, hospital-based study. The study consisted of 39 premature infants with mild ROP showed spontaneous regression (Group A and 17 with severe ROP who had been treated before naturally involuting (Group B from August 2008 through May 2011. Data on gender, single or multiple pregnancy, gestational age, birth weight, weight gain from birth to the sixth week of life, use of oxygen in mechanical ventilation, total duration of oxygen inhalation, surfactant given or not, need for and times of blood transfusion, 1,5,10-min Apgar score, presence of bacterial or fungal or combined infection, hyaline membrane disease (HMD, patent ductus arteriosus (PDA, duration of stay in the neonatal intensive care unit (NICU and duration of ROP were recorded.RESULTS: The incidence of spontaneous regression of ROP with stage 1 was 86.7%, and with stage 2, stage 3 was 57.1%, 5.9%, respectively. With changes in zone Ⅲ regression was detected 100%, in zoneⅡ 46.2% and in zoneⅠ 0%. The mean duration of ROP in spontaneous regression group was 5.65±3.14 weeks, lower than that of the treated ROP group (7.34±4.33 weeks, but this difference was not statistically significant (P=0.201. GA, 1min Apgar score, 5min Apgar score, duration of NICU stay, postnatal age of initial screening and oxygen therapy longer than 10 days were significant predictive factors for the spontaneous regression of ROP (P<0.05. Retinal hemorrhage was the only independent predictive factor the spontaneous regression of ROP (OR 0.030, 95%CI 0.001-0.775, P=0.035.CONCLUSION:This study showed most stage 1 and 2 ROP and changes in zone Ⅲ can spontaneously regression in the end. Retinal hemorrhage is weakly inversely associated with the spontaneous regression.

  7. Identifying Interacting Genetic Variations by Fish-Swarm Logic Regression

    Science.gov (United States)

    Yang, Aiyuan; Yan, Chunxia; Zhu, Feng; Zhao, Zhongmeng; Cao, Zhi

    2013-01-01

    Understanding associations between genotypes and complex traits is a fundamental problem in human genetics. A major open problem in mapping phenotypes is that of identifying a set of interacting genetic variants, which might contribute to complex traits. Logic regression (LR) is a powerful multivariant association tool. Several LR-based approaches have been successfully applied to different datasets. However, these approaches are not adequate with regard to accuracy and efficiency. In this paper, we propose a new LR-based approach, called fish-swarm logic regression (FSLR), which improves the logic regression process by incorporating swarm optimization. In our approach, a school of fish agents are conducted in parallel. Each fish agent holds a regression model, while the school searches for better models through various preset behaviors. A swarm algorithm improves the accuracy and the efficiency by speeding up the convergence and preventing it from dropping into local optimums. We apply our approach on a real screening dataset and a series of simulation scenarios. Compared to three existing LR-based approaches, our approach outperforms them by having lower type I and type II error rates, being able to identify more preset causal sites, and performing at faster speeds. PMID:23984382

  8. Identifying Interacting Genetic Variations by Fish-Swarm Logic Regression

    Directory of Open Access Journals (Sweden)

    Xuanping Zhang

    2013-01-01

    Full Text Available Understanding associations between genotypes and complex traits is a fundamental problem in human genetics. A major open problem in mapping phenotypes is that of identifying a set of interacting genetic variants, which might contribute to complex traits. Logic regression (LR is a powerful multivariant association tool. Several LR-based approaches have been successfully applied to different datasets. However, these approaches are not adequate with regard to accuracy and efficiency. In this paper, we propose a new LR-based approach, called fish-swarm logic regression (FSLR, which improves the logic regression process by incorporating swarm optimization. In our approach, a school of fish agents are conducted in parallel. Each fish agent holds a regression model, while the school searches for better models through various preset behaviors. A swarm algorithm improves the accuracy and the efficiency by speeding up the convergence and preventing it from dropping into local optimums. We apply our approach on a real screening dataset and a series of simulation scenarios. Compared to three existing LR-based approaches, our approach outperforms them by having lower type I and type II error rates, being able to identify more preset causal sites, and performing at faster speeds.

  9. Identifying the Safety Factors over Traffic Signs in State Roads using a Panel Quantile Regression Approach.

    Science.gov (United States)

    Šarić, Željko; Xu, Xuecai; Duan, Li; Babić, Darko

    2018-06-20

    This study intended to investigate the interactions between accident rate and traffic signs in state roads located in Croatia, and accommodate the heterogeneity attributed to unobserved factors. The data from 130 state roads between 2012 and 2016 were collected from Traffic Accident Database System maintained by the Republic of Croatia Ministry of the Interior. To address the heterogeneity, a panel quantile regression model was proposed, in which quantile regression model offers a more complete view and a highly comprehensive analysis of the relationship between accident rate and traffic signs, while the panel data model accommodates the heterogeneity attributed to unobserved factors. Results revealed that (1) low visibility of material damage (MD) and death or injured (DI) increased the accident rate; (2) the number of mandatory signs and the number of warning signs were more likely to reduce the accident rate; (3)average speed limit and the number of invalid traffic signs per km exhibited a high accident rate. To our knowledge, it's the first attempt to analyze the interactions between accident consequences and traffic signs by employing a panel quantile regression model; by involving the visibility, the present study demonstrates that the low visibility causes a relatively higher risk of MD and DI; It is noteworthy that average speed limit corresponds with accident rate positively; The number of mandatory signs and the number of warning signs are more likely to reduce the accident rate; The number of invalid traffic signs per km are significant for accident rate, thus regular maintenance should be kept for a safer roadway environment.

  10. Assessing risk factors for periodontitis using regression

    Science.gov (United States)

    Lobo Pereira, J. A.; Ferreira, Maria Cristina; Oliveira, Teresa

    2013-10-01

    Multivariate statistical analysis is indispensable to assess the associations and interactions between different factors and the risk of periodontitis. Among others, regression analysis is a statistical technique widely used in healthcare to investigate and model the relationship between variables. In our work we study the impact of socio-demographic, medical and behavioral factors on periodontal health. Using regression, linear and logistic models, we can assess the relevance, as risk factors for periodontitis disease, of the following independent variables (IVs): Age, Gender, Diabetic Status, Education, Smoking status and Plaque Index. The multiple linear regression analysis model was built to evaluate the influence of IVs on mean Attachment Loss (AL). Thus, the regression coefficients along with respective p-values will be obtained as well as the respective p-values from the significance tests. The classification of a case (individual) adopted in the logistic model was the extent of the destruction of periodontal tissues defined by an Attachment Loss greater than or equal to 4 mm in 25% (AL≥4mm/≥25%) of sites surveyed. The association measures include the Odds Ratios together with the correspondent 95% confidence intervals.

  11. Risk Factors of Falls in Community-Dwelling Older Adults: Logistic Regression Tree Analysis

    Science.gov (United States)

    Yamashita, Takashi; Noe, Douglas A.; Bailer, A. John

    2012-01-01

    Purpose of the Study: A novel logistic regression tree-based method was applied to identify fall risk factors and possible interaction effects of those risk factors. Design and Methods: A nationally representative sample of American older adults aged 65 years and older (N = 9,592) in the Health and Retirement Study 2004 and 2006 modules was used.…

  12. [Application of negative binomial regression and modified Poisson regression in the research of risk factors for injury frequency].

    Science.gov (United States)

    Cao, Qingqing; Wu, Zhenqiang; Sun, Ying; Wang, Tiezhu; Han, Tengwei; Gu, Chaomei; Sun, Yehuan

    2011-11-01

    To Eexplore the application of negative binomial regression and modified Poisson regression analysis in analyzing the influential factors for injury frequency and the risk factors leading to the increase of injury frequency. 2917 primary and secondary school students were selected from Hefei by cluster random sampling method and surveyed by questionnaire. The data on the count event-based injuries used to fitted modified Poisson regression and negative binomial regression model. The risk factors incurring the increase of unintentional injury frequency for juvenile students was explored, so as to probe the efficiency of these two models in studying the influential factors for injury frequency. The Poisson model existed over-dispersion (P Poisson regression and negative binomial regression model, was fitted better. respectively. Both showed that male gender, younger age, father working outside of the hometown, the level of the guardian being above junior high school and smoking might be the results of higher injury frequencies. On a tendency of clustered frequency data on injury event, both the modified Poisson regression analysis and negative binomial regression analysis can be used. However, based on our data, the modified Poisson regression fitted better and this model could give a more accurate interpretation of relevant factors affecting the frequency of injury.

  13. Identifying the Prognosis Factors in Death after Liver Transplantation via Adaptive LASSO in Iran

    Directory of Open Access Journals (Sweden)

    Hadi Raeisi Shahraki

    2016-01-01

    Full Text Available Despite the widespread use of liver transplantation as a routine therapy in liver diseases, the effective factors on its outcomes are still controversial. This study attempted to identify the most effective factors on death after liver transplantation. For this purpose, modified least absolute shrinkage and selection operator (LASSO, called Adaptive LASSO, was utilized. One of the best advantages of this method is considering high number of factors. Therefore, in a historical cohort study from 2008 to 2013, the clinical findings of 680 patients undergoing liver transplant surgery were considered. Ridge and Adaptive LASSO regression methods were then implemented to identify the most effective factors on death. To compare the performance of these two models, receiver operating characteristic (ROC curve was used. According to the results, 12 factors in Ridge regression and 9 ones in Adaptive LASSO regression were significant. The area under the ROC curve (AUC of Adaptive LASSO was equal to 89% (95% CI: 86%–91%, which was significantly greater than Ridge regression (64%, 95% CI: 61%–68% (p<0.001. As a conclusion, the significant factors and the performance criteria revealed the superiority of Adaptive LASSO method as a penalized model versus traditional regression model in the present study.

  14. An application in identifying high-risk populations in alternative tobacco product use utilizing logistic regression and CART: a heuristic comparison.

    Science.gov (United States)

    Lei, Yang; Nollen, Nikki; Ahluwahlia, Jasjit S; Yu, Qing; Mayo, Matthew S

    2015-04-09

    Other forms of tobacco use are increasing in prevalence, yet most tobacco control efforts are aimed at cigarettes. In light of this, it is important to identify individuals who are using both cigarettes and alternative tobacco products (ATPs). Most previous studies have used regression models. We conducted a traditional logistic regression model and a classification and regression tree (CART) model to illustrate and discuss the added advantages of using CART in the setting of identifying high-risk subgroups of ATP users among cigarettes smokers. The data were collected from an online cross-sectional survey administered by Survey Sampling International between July 5, 2012 and August 15, 2012. Eligible participants self-identified as current smokers, African American, White, or Latino (of any race), were English-speaking, and were at least 25 years old. The study sample included 2,376 participants and was divided into independent training and validation samples for a hold out validation. Logistic regression and CART models were used to examine the important predictors of cigarettes + ATP users. The logistic regression model identified nine important factors: gender, age, race, nicotine dependence, buying cigarettes or borrowing, whether the price of cigarettes influences the brand purchased, whether the participants set limits on cigarettes per day, alcohol use scores, and discrimination frequencies. The C-index of the logistic regression model was 0.74, indicating good discriminatory capability. The model performed well in the validation cohort also with good discrimination (c-index = 0.73) and excellent calibration (R-square = 0.96 in the calibration regression). The parsimonious CART model identified gender, age, alcohol use score, race, and discrimination frequencies to be the most important factors. It also revealed interesting partial interactions. The c-index is 0.70 for the training sample and 0.69 for the validation sample. The misclassification

  15. Logistic Regression and Path Analysis Method to Analyze Factors influencing Students’ Achievement

    Science.gov (United States)

    Noeryanti, N.; Suryowati, K.; Setyawan, Y.; Aulia, R. R.

    2018-04-01

    Students' academic achievement cannot be separated from the influence of two factors namely internal and external factors. The first factors of the student (internal factors) consist of intelligence (X1), health (X2), interest (X3), and motivation of students (X4). The external factors consist of family environment (X5), school environment (X6), and society environment (X7). The objects of this research are eighth grade students of the school year 2016/2017 at SMPN 1 Jiwan Madiun sampled by using simple random sampling. Primary data are obtained by distributing questionnaires. The method used in this study is binary logistic regression analysis that aims to identify internal and external factors that affect student’s achievement and how the trends of them. Path Analysis was used to determine the factors that influence directly, indirectly or totally on student’s achievement. Based on the results of binary logistic regression, variables that affect student’s achievement are interest and motivation. And based on the results obtained by path analysis, factors that have a direct impact on student’s achievement are students’ interest (59%) and students’ motivation (27%). While the factors that have indirect influences on students’ achievement, are family environment (97%) and school environment (37).

  16. Identifying the Factors That Influence Change in SEBD Using Logistic Regression Analysis

    Science.gov (United States)

    Camilleri, Liberato; Cefai, Carmel

    2013-01-01

    Multiple linear regression and ANOVA models are widely used in applications since they provide effective statistical tools for assessing the relationship between a continuous dependent variable and several predictors. However these models rely heavily on linearity and normality assumptions and they do not accommodate categorical dependent…

  17. Risk factors for pedicled flap necrosis in hand soft tissue reconstruction: a multivariate logistic regression analysis.

    Science.gov (United States)

    Gong, Xu; Cui, Jianli; Jiang, Ziping; Lu, Laijin; Li, Xiucun

    2018-03-01

    Few clinical retrospective studies have reported the risk factors of pedicled flap necrosis in hand soft tissue reconstruction. The aim of this study was to identify non-technical risk factors associated with pedicled flap perioperative necrosis in hand soft tissue reconstruction via a multivariate logistic regression analysis. For patients with hand soft tissue reconstruction, we carefully reviewed hospital records and identified 163 patients who met the inclusion criteria. The characteristics of these patients, flap transfer procedures and postoperative complications were recorded. Eleven predictors were identified. The correlations between pedicled flap necrosis and risk factors were analysed using a logistic regression model. Of 163 skin flaps, 125 flaps survived completely without any complications. The pedicled flap necrosis rate in hands was 11.04%, which included partial flap necrosis (7.36%) and total flap necrosis (3.68%). Soft tissue defects in fingers were noted in 68.10% of all cases. The logistic regression analysis indicated that the soft tissue defect site (P = 0.046, odds ratio (OR) = 0.079, confidence interval (CI) (0.006, 0.959)), flap size (P = 0.020, OR = 1.024, CI (1.004, 1.045)) and postoperative wound infection (P < 0.001, OR = 17.407, CI (3.821, 79.303)) were statistically significant risk factors for pedicled flap necrosis of the hand. Soft tissue defect site, flap size and postoperative wound infection were risk factors associated with pedicled flap necrosis in hand soft tissue defect reconstruction. © 2017 Royal Australasian College of Surgeons.

  18. Regression Analysis to Identify Factors Associated with Urinary Iodine Concentration at the Sub-National Level in India, Ghana, and Senegal

    Science.gov (United States)

    Knowles, Jacky; Kupka, Roland; Dumble, Sam; Garrett, Greg S.; Pandav, Chandrakant S.; Yadav, Kapil; Touré, Ndeye Khady; Foriwa Amoaful, Esi; Gorstein, Jonathan

    2018-01-01

    Single and multiple variable regression analyses were conducted using data from stratified, cluster sample design, iodine surveys in India, Ghana, and Senegal to identify factors associated with urinary iodine concentration (UIC) among women of reproductive age (WRA) at the national and sub-national level. Subjects were survey household respondents, typically WRA. For all three countries, UIC was significantly different (p regression analysis, UIC was significantly associated with strata and household salt iodine category in India and Ghana (p < 0.001). Estimated UIC was 1.6 (95% confidence intervals (CI) 1.3, 2.0) times higher (India) and 1.4 (95% CI 1.2, 1.6) times higher (Ghana) among WRA from households using adequately iodised salt than among WRA from households using non-iodised salt. Other significant associations with UIC were found in India, with having heard of iodine deficiency (1.2 times higher; CI 1.1, 1.3; p < 0.001) and having improved dietary diversity (1.1 times higher, CI 1.0, 1.2; p = 0.015); and in Ghana, with the level of tomato paste consumption the previous week (p = 0.029) (UIC for highest consumption level was 1.2 times lowest level; CI 1.1, 1.4). No significant associations were found in Senegal. Sub-national data on iodine status are required to assess equity of access to optimal iodine intake and to develop strategic responses as needed. PMID:29690505

  19. Logistic regression models of factors influencing the location of bioenergy and biofuels plants

    Science.gov (United States)

    T.M. Young; R.L. Zaretzki; J.H. Perdue; F.M. Guess; X. Liu

    2011-01-01

    Logistic regression models were developed to identify significant factors that influence the location of existing wood-using bioenergy/biofuels plants and traditional wood-using facilities. Logistic models provided quantitative insight for variables influencing the location of woody biomass-using facilities. Availability of "thinnings to a basal area of 31.7m2/ha...

  20. Regression Analysis to Identify Factors Associated with Urinary Iodine Concentration at the Sub-National Level in India, Ghana, and Senegal

    Directory of Open Access Journals (Sweden)

    Jacky Knowles

    2018-04-01

    Full Text Available Single and multiple variable regression analyses were conducted using data from stratified, cluster sample design, iodine surveys in India, Ghana, and Senegal to identify factors associated with urinary iodine concentration (UIC among women of reproductive age (WRA at the national and sub-national level. Subjects were survey household respondents, typically WRA. For all three countries, UIC was significantly different (p < 0.05 by household salt iodine category. Other significant differences were by strata and by household vulnerability to poverty in India and Ghana. In multiple variable regression analysis, UIC was significantly associated with strata and household salt iodine category in India and Ghana (p < 0.001. Estimated UIC was 1.6 (95% confidence intervals (CI 1.3, 2.0 times higher (India and 1.4 (95% CI 1.2, 1.6 times higher (Ghana among WRA from households using adequately iodised salt than among WRA from households using non-iodised salt. Other significant associations with UIC were found in India, with having heard of iodine deficiency (1.2 times higher; CI 1.1, 1.3; p < 0.001 and having improved dietary diversity (1.1 times higher, CI 1.0, 1.2; p = 0.015; and in Ghana, with the level of tomato paste consumption the previous week (p = 0.029 (UIC for highest consumption level was 1.2 times lowest level; CI 1.1, 1.4. No significant associations were found in Senegal. Sub-national data on iodine status are required to assess equity of access to optimal iodine intake and to develop strategic responses as needed.

  1. Methods for identifying SNP interactions: a review on variations of Logic Regression, Random Forest and Bayesian logistic regression.

    Science.gov (United States)

    Chen, Carla Chia-Ming; Schwender, Holger; Keith, Jonathan; Nunkesser, Robin; Mengersen, Kerrie; Macrossan, Paula

    2011-01-01

    Due to advancements in computational ability, enhanced technology and a reduction in the price of genotyping, more data are being generated for understanding genetic associations with diseases and disorders. However, with the availability of large data sets comes the inherent challenges of new methods of statistical analysis and modeling. Considering a complex phenotype may be the effect of a combination of multiple loci, various statistical methods have been developed for identifying genetic epistasis effects. Among these methods, logic regression (LR) is an intriguing approach incorporating tree-like structures. Various methods have built on the original LR to improve different aspects of the model. In this study, we review four variations of LR, namely Logic Feature Selection, Monte Carlo Logic Regression, Genetic Programming for Association Studies, and Modified Logic Regression-Gene Expression Programming, and investigate the performance of each method using simulated and real genotype data. We contrast these with another tree-like approach, namely Random Forests, and a Bayesian logistic regression with stochastic search variable selection.

  2. Logistic regression for risk factor modelling in stuttering research.

    Science.gov (United States)

    Reed, Phil; Wu, Yaqionq

    2013-06-01

    To outline the uses of logistic regression and other statistical methods for risk factor analysis in the context of research on stuttering. The principles underlying the application of a logistic regression are illustrated, and the types of questions to which such a technique has been applied in the stuttering field are outlined. The assumptions and limitations of the technique are discussed with respect to existing stuttering research, and with respect to formulating appropriate research strategies to accommodate these considerations. Finally, some alternatives to the approach are briefly discussed. The way the statistical procedures are employed are demonstrated with some hypothetical data. Research into several practical issues concerning stuttering could benefit if risk factor modelling were used. Important examples are early diagnosis, prognosis (whether a child will recover or persist) and assessment of treatment outcome. After reading this article you will: (a) Summarize the situations in which logistic regression can be applied to a range of issues about stuttering; (b) Follow the steps in performing a logistic regression analysis; (c) Describe the assumptions of the logistic regression technique and the precautions that need to be checked when it is employed; (d) Be able to summarize its advantages over other techniques like estimation of group differences and simple regression. Copyright © 2012 Elsevier Inc. All rights reserved.

  3. Differentiating regressed melanoma from regressed lichenoid keratosis.

    Science.gov (United States)

    Chan, Aegean H; Shulman, Kenneth J; Lee, Bonnie A

    2017-04-01

    Distinguishing regressed lichen planus-like keratosis (LPLK) from regressed melanoma can be difficult on histopathologic examination, potentially resulting in mismanagement of patients. We aimed to identify histopathologic features by which regressed melanoma can be differentiated from regressed LPLK. Twenty actively inflamed LPLK, 12 LPLK with regression and 15 melanomas with regression were compared and evaluated by hematoxylin and eosin staining as well as Melan-A, microphthalmia transcription factor (MiTF) and cytokeratin (AE1/AE3) immunostaining. (1) A total of 40% of regressed melanomas showed complete or near complete loss of melanocytes within the epidermis with Melan-A and MiTF immunostaining, while 8% of regressed LPLK exhibited this finding. (2) Necrotic keratinocytes were seen in the epidermis in 33% regressed melanomas as opposed to all of the regressed LPLK. (3) A dense infiltrate of melanophages in the papillary dermis was seen in 40% of regressed melanomas, a feature not seen in regressed LPLK. In summary, our findings suggest that a complete or near complete loss of melanocytes within the epidermis strongly favors a regressed melanoma over a regressed LPLK. In addition, necrotic epidermal keratinocytes and the presence of a dense band-like distribution of dermal melanophages can be helpful in differentiating these lesions. © 2016 John Wiley & Sons A/S. Published by John Wiley & Sons Ltd.

  4. Identifying perinatal risk factors for infant maltreatment: an ecological approach

    Directory of Open Access Journals (Sweden)

    Hallisey Elaine J

    2006-12-01

    Full Text Available Abstract Background Child maltreatment and its consequences are a persistent problem throughout the world. Public health workers, human services officials, and others are interested in new and efficient ways to determine which geographic areas to target for intervention programs and resources. To improve assessment efforts, selected perinatal factors were examined, both individually and in various combinations, to determine if they are associated with increased risk of infant maltreatment. State of Georgia birth records and abuse and neglect data were analyzed using an area-based, ecological approach with the census tract as a surrogate for the community. Cartographic visualization suggested some correlation exists between risk factors and child maltreatment, so bivariate and multivariate regression were performed. The presence of spatial autocorrelation precluded the use of traditional ordinary least squares regression, therefore a spatial regression model coupled with maximum likelihood estimation was employed. Results Results indicate that all individual factors or their combinations are significantly associated with increased risk of infant maltreatment. The set of perinatal risk factors that best predicts infant maltreatment rates are: mother smoked during pregnancy, families with three or more siblings, maternal age less than 20 years, births to unmarried mothers, Medicaid beneficiaries, and inadequate prenatal care. Conclusion This model enables public health to take a proactive stance, to reasonably predict areas where poor outcomes are likely to occur, and to therefore more efficiently allocate resources. U.S. states that routinely collect the variables the National Center for Health Statistics (NCHS defines for birth certificates can easily identify areas that are at high risk for infant maltreatment. The authors recommend that agencies charged with reducing child maltreatment target communities that demonstrate the perinatal risks

  5. Antibiotic Resistances in Livestock: A Comparative Approach to Identify an Appropriate Regression Model for Count Data

    Directory of Open Access Journals (Sweden)

    Anke Hüls

    2017-05-01

    Full Text Available Antimicrobial resistance in livestock is a matter of general concern. To develop hygiene measures and methods for resistance prevention and control, epidemiological studies on a population level are needed to detect factors associated with antimicrobial resistance in livestock holdings. In general, regression models are used to describe these relationships between environmental factors and resistance outcome. Besides the study design, the correlation structures of the different outcomes of antibiotic resistance and structural zero measurements on the resistance outcome as well as on the exposure side are challenges for the epidemiological model building process. The use of appropriate regression models that acknowledge these complexities is essential to assure valid epidemiological interpretations. The aims of this paper are (i to explain the model building process comparing several competing models for count data (negative binomial model, quasi-Poisson model, zero-inflated model, and hurdle model and (ii to compare these models using data from a cross-sectional study on antibiotic resistance in animal husbandry. These goals are essential to evaluate which model is most suitable to identify potential prevention measures. The dataset used as an example in our analyses was generated initially to study the prevalence and associated factors for the appearance of cefotaxime-resistant Escherichia coli in 48 German fattening pig farms. For each farm, the outcome was the count of samples with resistant bacteria. There was almost no overdispersion and only moderate evidence of excess zeros in the data. Our analyses show that it is essential to evaluate regression models in studies analyzing the relationship between environmental factors and antibiotic resistances in livestock. After model comparison based on evaluation of model predictions, Akaike information criterion, and Pearson residuals, here the hurdle model was judged to be the most appropriate

  6. Identifying predictors of physics item difficulty: A linear regression approach

    Science.gov (United States)

    Mesic, Vanes; Muratovic, Hasnija

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinary difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal physics knowledge

  7. Identifying predictors of physics item difficulty: A linear regression approach

    Directory of Open Access Journals (Sweden)

    Hasnija Muratovic

    2011-06-01

    Full Text Available Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinary difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal

  8. Novel Application of Statistical Methods to Identify New Urinary Incontinence Risk Factors

    Directory of Open Access Journals (Sweden)

    Theophilus O. Ogunyemi

    2012-01-01

    Full Text Available Longitudinal data for studying urinary incontinence (UI risk factors are rare. Data from one study, the hallmark Medical, Epidemiological, and Social Aspects of Aging (MESA, have been analyzed in the past; however, repeated measures analyses that are crucial for analyzing longitudinal data have not been applied. We tested a novel application of statistical methods to identify UI risk factors in older women. MESA data were collected at baseline and yearly from a sample of 1955 men and women in the community. Only women responding to the 762 baseline and 559 follow-up questions at one year in each respective survey were examined. To test their utility in mining large data sets, and as a preliminary step to creating a predictive index for developing UI, logistic regression, generalized estimating equations (GEEs, and proportional hazard regression (PHREG methods were used on the existing MESA data. The GEE and PHREG combination identified 15 significant risk factors associated with developing UI out of which six of them, namely, urinary frequency, urgency, any urine loss, urine loss after emptying, subject’s anticipation, and doctor’s proactivity, are found most highly significant by both methods. These six factors are potential candidates for constructing a future UI predictive index.

  9. Extralobar pulmonary sequestration in neonates: The natural course and predictive factors associated with spontaneous regression

    Energy Technology Data Exchange (ETDEWEB)

    Yoon, Hee Mang; Jung, Ah Young; Cho, Young Ah; Yoon, Chong Hyun; Lee, Jin Seong [Asan Medical Center Children' s Hospital, University of Ulsan College of Medicine, Department of Radiology and Research Institute of Radiology, Songpa-gu, Seoul (Korea, Republic of); Kim, Ellen Ai-Rhan [University of Ulsan College of Medicine, Division of Neonatology, Asan Medical Center Children' s Hospital, Seoul (Korea, Republic of); Chung, Sung-Hoon [Kyung Hee University School of Medicine, Department of Pediatrics, Seoul (Korea, Republic of); Kim, Seon-Ok [Asan Medical Center, Department of Clinical Epidemiology and Biostatistics, Seoul (Korea, Republic of)

    2017-06-15

    To describe the natural course of extralobar pulmonary sequestration (EPS) and identify factors associated with spontaneous regression of EPS. We retrospectively searched for patients diagnosed with EPS on initial contrast CT scan within 1 month after birth and had a follow-up CT scan without treatment. Spontaneous regression of EPS was assessed by percentage decrease in volume (PDV) and percentage decrease in sum of the diameter of systemic feeding arteries (PDD) by comparing initial and follow-up CT scans. Clinical and CT features were analysed to determine factors associated with PDV and PDD rates. Fifty-one neonates were included. The cumulative proportions of patients reaching PDV > 50 % and PDD > 50 % were 93.0 % and 73.3 % at 4 years, respectively. Tissue attenuation was significantly associated with PDV rate (B = -21.78, P <.001). The tissue attenuation (B = -22.62, P =.001) and diameter of the largest systemic feeding arteries (B = -48.31, P =.011) were significant factors associated with PDD rate. The volume and diameter of systemic feeding arteries of EPS spontaneously decreased within 4 years without treatment. EPSs showing a low tissue attenuation and small diameter of the largest systemic feeding arteries on initial contrast-enhanced CT scans were likely to regress spontaneously. (orig.)

  10. Multiplication factor versus regression analysis in stature estimation from hand and foot dimensions.

    Science.gov (United States)

    Krishan, Kewal; Kanchan, Tanuj; Sharma, Abhilasha

    2012-05-01

    Estimation of stature is an important parameter in identification of human remains in forensic examinations. The present study is aimed to compare the reliability and accuracy of stature estimation and to demonstrate the variability in estimated stature and actual stature using multiplication factor and regression analysis methods. The study is based on a sample of 246 subjects (123 males and 123 females) from North India aged between 17 and 20 years. Four anthropometric measurements; hand length, hand breadth, foot length and foot breadth taken on the left side in each subject were included in the study. Stature was measured using standard anthropometric techniques. Multiplication factors were calculated and linear regression models were derived for estimation of stature from hand and foot dimensions. Derived multiplication factors and regression formula were applied to the hand and foot measurements in the study sample. The estimated stature from the multiplication factors and regression analysis was compared with the actual stature to find the error in estimated stature. The results indicate that the range of error in estimation of stature from regression analysis method is less than that of multiplication factor method thus, confirming that the regression analysis method is better than multiplication factor analysis in stature estimation. Copyright © 2012 Elsevier Ltd and Faculty of Forensic and Legal Medicine. All rights reserved.

  11. A retrospective analysis to identify the factors affecting infection in patients undergoing chemotherapy.

    Science.gov (United States)

    Park, Ji Hyun; Kim, Hyeon-Young; Lee, Hanna; Yun, Eun Kyoung

    2015-12-01

    This study compares the performance of the logistic regression and decision tree analysis methods for assessing the risk factors for infection in cancer patients undergoing chemotherapy. The subjects were 732 cancer patients who were receiving chemotherapy at K university hospital in Seoul, Korea. The data were collected between March 2011 and February 2013 and were processed for descriptive analysis, logistic regression and decision tree analysis using the IBM SPSS Statistics 19 and Modeler 15.1 programs. The most common risk factors for infection in cancer patients receiving chemotherapy were identified as alkylating agents, vinca alkaloid and underlying diabetes mellitus. The logistic regression explained 66.7% of the variation in the data in terms of sensitivity and 88.9% in terms of specificity. The decision tree analysis accounted for 55.0% of the variation in the data in terms of sensitivity and 89.0% in terms of specificity. As for the overall classification accuracy, the logistic regression explained 88.0% and the decision tree analysis explained 87.2%. The logistic regression analysis showed a higher degree of sensitivity and classification accuracy. Therefore, logistic regression analysis is concluded to be the more effective and useful method for establishing an infection prediction model for patients undergoing chemotherapy. Copyright © 2015 Elsevier Ltd. All rights reserved.

  12. What Makes Sports Fans Interactive? Identifying Factors Affecting Chat Interactions in Online Sports Viewing.

    Science.gov (United States)

    Ko, Minsam; Yeo, Jaeryong; Lee, Juyeong; Lee, Uichin; Jang, Young Jae

    2016-01-01

    Sports fans are able to watch games from many locations using TV services while interacting with other fans online. In this paper, we identify the factors that affect sports viewers' online interactions. Using a large-scale dataset of more than 25 million chat messages from a popular social TV site for baseball, we extract various game-related factors, and investigate the relationships between these factors and fans' interactions using a series of multiple regression analyses. As a result, we identify several factors that are significantly related to viewer interactions. In addition, we determine that the influence of these factors varies according to the user group; i.e., active vs. less active users, and loyal vs. non-loyal users.

  13. Identifying Domain-General and Domain-Specific Predictors of Low Mathematics Performance: A Classification and Regression Tree Analysis

    Directory of Open Access Journals (Sweden)

    David J. Purpura

    2017-12-01

    Full Text Available Many children struggle to successfully acquire early mathematics skills. Theoretical and empirical evidence has pointed to deficits in domain-specific skills (e.g., non-symbolic mathematics skills or domain-general skills (e.g., executive functioning and language as underlying low mathematical performance. In the current study, we assessed a sample of 113 three- to five-year old preschool children on a battery of domain-specific and domain-general factors in the fall and spring of their preschool year to identify Time 1 (fall factors associated with low performance in mathematics knowledge at Time 2 (spring. We used the exploratory approach of classification and regression tree analyses, a strategy that uses step-wise partitioning to create subgroups from a larger sample using multiple predictors, to identify the factors that were the strongest classifiers of low performance for younger and older preschool children. Results indicated that the most consistent classifier of low mathematics performance at Time 2 was children’s Time 1 mathematical language skills. Further, other distinct classifiers of low performance emerged for younger and older children. These findings suggest that risk classification for low mathematics performance may differ depending on children’s age.

  14. What Makes Sports Fans Interactive? Identifying Factors Affecting Chat Interactions in Online Sports Viewing.

    Directory of Open Access Journals (Sweden)

    Minsam Ko

    Full Text Available Sports fans are able to watch games from many locations using TV services while interacting with other fans online. In this paper, we identify the factors that affect sports viewers' online interactions. Using a large-scale dataset of more than 25 million chat messages from a popular social TV site for baseball, we extract various game-related factors, and investigate the relationships between these factors and fans' interactions using a series of multiple regression analyses. As a result, we identify several factors that are significantly related to viewer interactions. In addition, we determine that the influence of these factors varies according to the user group; i.e., active vs. less active users, and loyal vs. non-loyal users.

  15. What Makes Sports Fans Interactive? Identifying Factors Affecting Chat Interactions in Online Sports Viewing

    Science.gov (United States)

    Yeo, Jaeryong; Lee, Juyeong

    2016-01-01

    Sports fans are able to watch games from many locations using TV services while interacting with other fans online. In this paper, we identify the factors that affect sports viewers’ online interactions. Using a large-scale dataset of more than 25 million chat messages from a popular social TV site for baseball, we extract various game-related factors, and investigate the relationships between these factors and fans’ interactions using a series of multiple regression analyses. As a result, we identify several factors that are significantly related to viewer interactions. In addition, we determine that the influence of these factors varies according to the user group; i.e., active vs. less active users, and loyal vs. non-loyal users. PMID:26849568

  16. Two-factor logistic regression in pediatric liver transplantation

    Science.gov (United States)

    Uzunova, Yordanka; Prodanova, Krasimira; Spasov, Lyubomir

    2017-12-01

    Using a two-factor logistic regression analysis an estimate is derived for the probability of absence of infections in the early postoperative period after pediatric liver transplantation. The influence of both the bilirubin level and the international normalized ratio of prothrombin time of blood coagulation at the 5th postoperative day is studied.

  17. Influential factors of red-light running at signalized intersection and prediction using a rare events logistic regression model.

    Science.gov (United States)

    Ren, Yilong; Wang, Yunpeng; Wu, Xinkai; Yu, Guizhen; Ding, Chuan

    2016-10-01

    Red light running (RLR) has become a major safety concern at signalized intersection. To prevent RLR related crashes, it is critical to identify the factors that significantly impact the drivers' behaviors of RLR, and to predict potential RLR in real time. In this research, 9-month's RLR events extracted from high-resolution traffic data collected by loop detectors from three signalized intersections were applied to identify the factors that significantly affect RLR behaviors. The data analysis indicated that occupancy time, time gap, used yellow time, time left to yellow start, whether the preceding vehicle runs through the intersection during yellow, and whether there is a vehicle passing through the intersection on the adjacent lane were significantly factors for RLR behaviors. Furthermore, due to the rare events nature of RLR, a modified rare events logistic regression model was developed for RLR prediction. The rare events logistic regression method has been applied in many fields for rare events studies and shows impressive performance, but so far none of previous research has applied this method to study RLR. The results showed that the rare events logistic regression model performed significantly better than the standard logistic regression model. More importantly, the proposed RLR prediction method is purely based on loop detector data collected from a single advance loop detector located 400 feet away from stop-bar. This brings great potential for future field applications of the proposed method since loops have been widely implemented in many intersections and can collect data in real time. This research is expected to contribute to the improvement of intersection safety significantly. Copyright © 2016 Elsevier Ltd. All rights reserved.

  18. Sparse multivariate factor analysis regression models and its applications to integrative genomics analysis.

    Science.gov (United States)

    Zhou, Yan; Wang, Pei; Wang, Xianlong; Zhu, Ji; Song, Peter X-K

    2017-01-01

    The multivariate regression model is a useful tool to explore complex associations between two kinds of molecular markers, which enables the understanding of the biological pathways underlying disease etiology. For a set of correlated response variables, accounting for such dependency can increase statistical power. Motivated by integrative genomic data analyses, we propose a new methodology-sparse multivariate factor analysis regression model (smFARM), in which correlations of response variables are assumed to follow a factor analysis model with latent factors. This proposed method not only allows us to address the challenge that the number of association parameters is larger than the sample size, but also to adjust for unobserved genetic and/or nongenetic factors that potentially conceal the underlying response-predictor associations. The proposed smFARM is implemented by the EM algorithm and the blockwise coordinate descent algorithm. The proposed methodology is evaluated and compared to the existing methods through extensive simulation studies. Our results show that accounting for latent factors through the proposed smFARM can improve sensitivity of signal detection and accuracy of sparse association map estimation. We illustrate smFARM by two integrative genomics analysis examples, a breast cancer dataset, and an ovarian cancer dataset, to assess the relationship between DNA copy numbers and gene expression arrays to understand genetic regulatory patterns relevant to the disease. We identify two trans-hub regions: one in cytoband 17q12 whose amplification influences the RNA expression levels of important breast cancer genes, and the other in cytoband 9q21.32-33, which is associated with chemoresistance in ovarian cancer. © 2016 WILEY PERIODICALS, INC.

  19. Determining factors influencing survival of breast cancer by fuzzy logistic regression model.

    Science.gov (United States)

    Nikbakht, Roya; Bahrampour, Abbas

    2017-01-01

    Fuzzy logistic regression model can be used for determining influential factors of disease. This study explores the important factors of actual predictive survival factors of breast cancer's patients. We used breast cancer data which collected by cancer registry of Kerman University of Medical Sciences during the period of 2000-2007. The variables such as morphology, grade, age, and treatments (surgery, radiotherapy, and chemotherapy) were applied in the fuzzy logistic regression model. Performance of model was determined in terms of mean degree of membership (MDM). The study results showed that almost 41% of patients were in neoplasm and malignant group and more than two-third of them were still alive after 5-year follow-up. Based on the fuzzy logistic model, the most important factors influencing survival were chemotherapy, morphology, and radiotherapy, respectively. Furthermore, the MDM criteria show that the fuzzy logistic regression have a good fit on the data (MDM = 0.86). Fuzzy logistic regression model showed that chemotherapy is more important than radiotherapy in survival of patients with breast cancer. In addition, another ability of this model is calculating possibilistic odds of survival in cancer patients. The results of this study can be applied in clinical research. Furthermore, there are few studies which applied the fuzzy logistic models. Furthermore, we recommend using this model in various research areas.

  20. Identifying risk factors associated with smear positivity of pulmonary tuberculosis in Kazakhstan.

    Directory of Open Access Journals (Sweden)

    Sabrina Hermosilla

    Full Text Available Sputum smear-positive tuberculosis (TB patients have a high risk of transmission and are of great epidemiological and infection control significance. Little is known about the smear-positive populations in high TB burden regions, such as Kazakhstan. The objective of this study is to characterize the smear-positive population in Kazakhstan and identify associated modifiable risk factors.Data on incident TB cases' (identified between April 2012 and March 2014 socio-demographic, risk behavior, and comorbidity characteristics were collected in four regions of Kazakhstan through structured survey and medical record review. We used multivariable logistic regression to determine factors associated with smear positivity.Of the total sample, 193 (34.3% of the 562 study participants tested smear-positive. In the final adjusted multivariable logistic regression model, sex (adjusted odds ratio (aOR = 2.0, 95% CI:1.3-3.1, p < 0.01, incarceration (aOR = 3.6, 95% CI:1.2-11.1, p = 0.03, alcohol dependence (aOR = 2.6, 95% CI:1.2-5.7, p = 0.02, diabetes (aOR = 5.0, 95% CI:2.4-10.7, p < 0.01, and physician access (aOR = 2.7, 95% CI:1.3-5.5p < 0.01 were associated with smear-positivity.Incarceration, alcohol dependence, diabetes, and physician access are associated with smear positivity among incident TB cases in Kazakhstan. To stem the TB epidemic, screening, treatment and prevention policies should address these factors.

  1. Modelling long-term fire occurrence factors in Spain by accounting for local variations with geographically weighted regression

    Science.gov (United States)

    Martínez-Fernández, J.; Chuvieco, E.; Koutsias, N.

    2013-02-01

    Humans are responsible for most forest fires in Europe, but anthropogenic factors behind these events are still poorly understood. We tried to identify the driving factors of human-caused fire occurrence in Spain by applying two different statistical approaches. Firstly, assuming stationary processes for the whole country, we created models based on multiple linear regression and binary logistic regression to find factors associated with fire density and fire presence, respectively. Secondly, we used geographically weighted regression (GWR) to better understand and explore the local and regional variations of those factors behind human-caused fire occurrence. The number of human-caused fires occurring within a 25-yr period (1983-2007) was computed for each of the 7638 Spanish mainland municipalities, creating a binary variable (fire/no fire) to develop logistic models, and a continuous variable (fire density) to build standard linear regression models. A total of 383 657 fires were registered in the study dataset. The binary logistic model, which estimates the probability of having/not having a fire, successfully classified 76.4% of the total observations, while the ordinary least squares (OLS) regression model explained 53% of the variation of the fire density patterns (adjusted R2 = 0.53). Both approaches confirmed, in addition to forest and climatic variables, the importance of variables related with agrarian activities, land abandonment, rural population exodus and developmental processes as underlying factors of fire occurrence. For the GWR approach, the explanatory power of the GW linear model for fire density using an adaptive bandwidth increased from 53% to 67%, while for the GW logistic model the correctly classified observations improved only slightly, from 76.4% to 78.4%, but significantly according to the corrected Akaike Information Criterion (AICc), from 3451.19 to 3321.19. The results from GWR indicated a significant spatial variation in the local

  2. Multiple Linear Regression Analysis of Factors Affecting Real Property Price Index From Case Study Research In Istanbul/Turkey

    Science.gov (United States)

    Denli, H. H.; Koc, Z.

    2015-12-01

    Estimation of real properties depending on standards is difficult to apply in time and location. Regression analysis construct mathematical models which describe or explain relationships that may exist between variables. The problem of identifying price differences of properties to obtain a price index can be converted into a regression problem, and standard techniques of regression analysis can be used to estimate the index. Considering regression analysis for real estate valuation, which are presented in real marketing process with its current characteristics and quantifiers, the method will help us to find the effective factors or variables in the formation of the value. In this study, prices of housing for sale in Zeytinburnu, a district in Istanbul, are associated with its characteristics to find a price index, based on information received from a real estate web page. The associated variables used for the analysis are age, size in m2, number of floors having the house, floor number of the estate and number of rooms. The price of the estate represents the dependent variable, whereas the rest are independent variables. Prices from 60 real estates have been used for the analysis. Same price valued locations have been found and plotted on the map and equivalence curves have been drawn identifying the same valued zones as lines.

  3. Efficient logistic regression designs under an imperfect population identifier.

    Science.gov (United States)

    Albert, Paul S; Liu, Aiyi; Nansel, Tonja

    2014-03-01

    Motivated by actual study designs, this article considers efficient logistic regression designs where the population is identified with a binary test that is subject to diagnostic error. We consider the case where the imperfect test is obtained on all participants, while the gold standard test is measured on a small chosen subsample. Under maximum-likelihood estimation, we evaluate the optimal design in terms of sample selection as well as verification. We show that there may be substantial efficiency gains by choosing a small percentage of individuals who test negative on the imperfect test for inclusion in the sample (e.g., verifying 90% test-positive cases). We also show that a two-stage design may be a good practical alternative to a fixed design in some situations. Under optimal and nearly optimal designs, we compare maximum-likelihood and semi-parametric efficient estimators under correct and misspecified models with simulations. The methodology is illustrated with an analysis from a diabetes behavioral intervention trial. © 2013, The International Biometric Society.

  4. Factor Analysis of Therapist-Identified Treatment Targets in Community-Based Children's Mental Health.

    Science.gov (United States)

    Love, Allison R; Okado, Izumi; Orimoto, Trina E; Mueller, Charles W

    2018-01-01

    The present study used exploratory and confirmatory factor analyses to identify underlying latent factors affecting variation in community therapists' endorsement of treatment targets. As part of a statewide practice management program, therapist completed monthly reports of treatment targets (up to 10 per month) for a sample of youth (n = 790) receiving intensive in-home therapy. Nearly 75 % of youth were diagnosed with multiple co-occurring disorders. Five factors emerged: Disinhibition, Societal Rules Evasion, Social Engagement Deficits, Emotional Distress, and Management of Biodevelopmental Outcomes. Using logistic regression, primary diagnosis predicted therapist selection of Disinhibition and Emotional Distress targets. Client age predicted endorsement of Societal Rules Evasion targets. Practice-to-research implications are discussed.

  5. On the null distribution of Bayes factors in linear regression

    Science.gov (United States)

    We show that under the null, the 2 log (Bayes factor) is asymptotically distributed as a weighted sum of chi-squared random variables with a shifted mean. This claim holds for Bayesian multi-linear regression with a family of conjugate priors, namely, the normal-inverse-gamma prior, the g-prior, and...

  6. Regression Analysis to Identify Factors Associated with Household Salt Iodine Content at the Sub-National Level in Bangladesh, India, Ghana and Senegal

    Science.gov (United States)

    Knowles, Jacky; Kupka, Roland; Dumble, Sam; Garrett, Greg S.; Pandav, Chandrakant S.; Yadav, Kapil; Nahar, Baitun; Touré, Ndeye Khady; Amoaful, Esi Foriwa; Gorstein, Jonathan

    2018-01-01

    Regression analyses of data from stratified, cluster sample, household iodine surveys in Bangladesh, India, Ghana and Senegal were conducted to identify factors associated with household access to adequately iodised salt. For all countries, in single variable analyses, household salt iodine was significantly different (p < 0.05) between strata (geographic areas with representative data, defined by survey design), and significantly higher (p < 0.05) among households: with better living standard scores, where the respondent knew about iodised salt and/or looked for iodised salt at purchase, using salt bought in a sealed package, or using refined grain salt. Other country-level associations were also found. Multiple variable analyses showed a significant association between salt iodine and strata (p < 0.001) in India, Ghana and Senegal and that salt grain type was significantly associated with estimated iodine content in all countries (p < 0.001). Salt iodine relative to the reference (coarse salt) ranged from 1.3 (95% CI 1.2, 1.5) times higher for fine salt in Senegal to 3.6 (95% CI 2.6, 4.9) times higher for washed and 6.5 (95% CI 4.9, 8.8) times higher for refined salt in India. Sub-national data are required to monitor equity of access to adequately iodised salt. Improving household access to refined iodised salt in sealed packaging, would improve iodine intake from household salt in all four countries in this analysis, particularly in areas where there is significant small-scale salt production. PMID:29671774

  7. Retrieving relevant factors with exploratory SEM and principal-covariate regression: A comparison.

    Science.gov (United States)

    Vervloet, Marlies; Van den Noortgate, Wim; Ceulemans, Eva

    2018-02-12

    Behavioral researchers often linearly regress a criterion on multiple predictors, aiming to gain insight into the relations between the criterion and predictors. Obtaining this insight from the ordinary least squares (OLS) regression solution may be troublesome, because OLS regression weights show only the effect of a predictor on top of the effects of other predictors. Moreover, when the number of predictors grows larger, it becomes likely that the predictors will be highly collinear, which makes the regression weights' estimates unstable (i.e., the "bouncing beta" problem). Among other procedures, dimension-reduction-based methods have been proposed for dealing with these problems. These methods yield insight into the data by reducing the predictors to a smaller number of summarizing variables and regressing the criterion on these summarizing variables. Two promising methods are principal-covariate regression (PCovR) and exploratory structural equation modeling (ESEM). Both simultaneously optimize reduction and prediction, but they are based on different frameworks. The resulting solutions have not yet been compared; it is thus unclear what the strengths and weaknesses are of both methods. In this article, we focus on the extents to which PCovR and ESEM are able to extract the factors that truly underlie the predictor scores and can predict a single criterion. The results of two simulation studies showed that for a typical behavioral dataset, ESEM (using the BIC for model selection) in this regard is successful more often than PCovR. Yet, in 93% of the datasets PCovR performed equally well, and in the case of 48 predictors, 100 observations, and large differences in the strengths of the factors, PCovR even outperformed ESEM.

  8. The study of logistic regression of risk factor on the death cause of uranium miners

    International Nuclear Information System (INIS)

    Wen Jinai; Yuan Liyun; Jiang Ruyi

    1999-01-01

    Logistic regression model has widely been used in the field of medicine. The computer software on this model is popular, but it is worth to discuss how to use this model correctly. Using SPSS (Statistical Package for the Social Science) software, unconditional logistic regression method was adopted to carry out multi-factor analyses on the cause of total death, cancer death and lung cancer death of uranium miners. The data is from radioepidemiological database of one uranium mine. The result show that attained age is a risk factor in the logistic regression analyses of total death, cancer death and lung cancer death. In the logistic regression analysis of cancer death, there is a negative correlation between the age of exposure and cancer death. This shows that the younger the age at exposure, the bigger the risk of cancer death. In the logistic regression analysis of lung cancer death, there is a positive correlation between the cumulated exposure and lung cancer death, this show that cumulated exposure is a most important risk factor of lung cancer death on uranium miners. It has been documented by many foreign reports that the lung cancer death rate is higher in uranium miners

  9. DETERMINATION OF FACTORS AFFECTING LENGTH OF STAY WITH MULTINOMIAL LOGISTIC REGRESSION IN TURKEY

    Directory of Open Access Journals (Sweden)

    Öğr. Gör. Rukiye NUMAN TEKİN

    2016-08-01

    Full Text Available Length of stay (LOS has important implications in various aspects of health services, can vary according to a wide range of factors. It is noticed that LOS has been neglected mostly in both theoratical studies and practice of health care management in Turkey. The main purpose of this study is to identify factors related to LOS in Turkey. A retrospective analysis of 2.255.836 patients hospitalized to private, university, foundation university and other (municipality, association and foreigners/minority hospitals hospitals which have an agreement with Social Security Institution (SSI in Turkey, from January 1, 2010, until the December 31, 2010, was examined. Patient’s data were taken from MEDULA (National Electronic Invoice System and SPSS 18.0 was used to perform statistical analysis. In this study t-test, one way anova and multinomial logistic regression are used to determine variables that may affect to LOS. The average LOS of patients was 3,93 days (SD = 5,882. LOS showed a statistically significant difference according to all independent variables used in the study (age, gender, disease class, type of hospitalization, presence of comorbidity, type and number of surgery, season of hospitalization, hospital ownership/bed capacity/ geographical region/residential area/type of service. According to the results of the multinomial lojistic regression analysis, LOS was negatively affected in terms of gender, presence of comorbidity, geographical region of hospital and was positively affected in terms of age, season of hospitalization, hospital bed capacity/ ownership/type of service/residential area.

  10. Regression Analysis to Identify Factors Associated with Household Salt Iodine Content at the Sub-National Level in Bangladesh, India, Ghana and Senegal

    Directory of Open Access Journals (Sweden)

    Jacky Knowles

    2018-04-01

    Full Text Available Regression analyses of data from stratified, cluster sample, household iodine surveys in Bangladesh, India, Ghana and Senegal were conducted to identify factors associated with household access to adequately iodised salt. For all countries, in single variable analyses, household salt iodine was significantly different (p < 0.05 between strata (geographic areas with representative data, defined by survey design, and significantly higher (p < 0.05 among households: with better living standard scores, where the respondent knew about iodised salt and/or looked for iodised salt at purchase, using salt bought in a sealed package, or using refined grain salt. Other country-level associations were also found. Multiple variable analyses showed a significant association between salt iodine and strata (p < 0.001 in India, Ghana and Senegal and that salt grain type was significantly associated with estimated iodine content in all countries (p < 0.001. Salt iodine relative to the reference (coarse salt ranged from 1.3 (95% CI 1.2, 1.5 times higher for fine salt in Senegal to 3.6 (95% CI 2.6, 4.9 times higher for washed and 6.5 (95% CI 4.9, 8.8 times higher for refined salt in India. Sub-national data are required to monitor equity of access to adequately iodised salt. Improving household access to refined iodised salt in sealed packaging, would improve iodine intake from household salt in all four countries in this analysis, particularly in areas where there is significant small-scale salt production.

  11. A new multiple regression model to identify multi-family houses with a high prevalence of sick building symptoms "SBS", within the healthy sustainable house study in Stockholm (3H).

    Science.gov (United States)

    Engvall, Karin; Hult, M; Corner, R; Lampa, E; Norbäck, D; Emenius, G

    2010-01-01

    The aim was to develop a new model to identify residential buildings with higher frequencies of "SBS" than expected, "risk buildings". In 2005, 481 multi-family buildings with 10,506 dwellings in Stockholm were studied by a new stratified random sampling. A standardised self-administered questionnaire was used to assess "SBS", atopy and personal factors. The response rate was 73%. Statistical analysis was performed by multiple logistic regressions. Dwellers owning their building reported less "SBS" than those renting. There was a strong relationship between socio-economic factors and ownership. The regression model, ended up with high explanatory values for age, gender, atopy and ownership. Applying our model, 9% of all residential buildings in Stockholm were classified as "risk buildings" with the highest proportion in houses built 1961-1975 (26%) and lowest in houses built 1985-1990 (4%). To identify "risk buildings", it is necessary to adjust for ownership and population characteristics.

  12. Key factors contributing to accident severity rate in construction industry in Iran: a regression modelling approach.

    Science.gov (United States)

    Soltanzadeh, Ahmad; Mohammadfam, Iraj; Moghimbeigi, Abbas; Ghiasvand, Reza

    2016-03-01

    Construction industry involves the highest risk of occupational accidents and bodily injuries, which range from mild to very severe. The aim of this cross-sectional study was to identify the factors associated with accident severity rate (ASR) in the largest Iranian construction companies based on data about 500 occupational accidents recorded from 2009 to 2013. We also gathered data on safety and health risk management and training systems. Data were analysed using Pearson's chi-squared coefficient and multiple regression analysis. Median ASR (and the interquartile range) was 107.50 (57.24- 381.25). Fourteen of the 24 studied factors stood out as most affecting construction accident severity (p<0.05). These findings can be applied in the design and implementation of a comprehensive safety and health risk management system to reduce ASR.

  13. Generalized reduced rank latent factor regression for high dimensional tensor fields, and neuroimaging-genetic applications.

    Science.gov (United States)

    Tao, Chenyang; Nichols, Thomas E; Hua, Xue; Ching, Christopher R K; Rolls, Edmund T; Thompson, Paul M; Feng, Jianfeng

    2017-01-01

    We propose a generalized reduced rank latent factor regression model (GRRLF) for the analysis of tensor field responses and high dimensional covariates. The model is motivated by the need from imaging-genetic studies to identify genetic variants that are associated with brain imaging phenotypes, often in the form of high dimensional tensor fields. GRRLF identifies from the structure in the data the effective dimensionality of the data, and then jointly performs dimension reduction of the covariates, dynamic identification of latent factors, and nonparametric estimation of both covariate and latent response fields. After accounting for the latent and covariate effects, GRLLF performs a nonparametric test on the remaining factor of interest. GRRLF provides a better factorization of the signals compared with common solutions, and is less susceptible to overfitting because it exploits the effective dimensionality. The generality and the flexibility of GRRLF also allow various statistical models to be handled in a unified framework and solutions can be efficiently computed. Within the field of neuroimaging, it improves the sensitivity for weak signals and is a promising alternative to existing approaches. The operation of the framework is demonstrated with both synthetic datasets and a real-world neuroimaging example in which the effects of a set of genes on the structure of the brain at the voxel level were measured, and the results compared favorably with those from existing approaches. Copyright © 2016. Published by Elsevier Inc.

  14. Schistosoma mansoni reinfection: Analysis of risk factors by classification and regression tree (CART modeling.

    Directory of Open Access Journals (Sweden)

    Andréa Gazzinelli

    Full Text Available Praziquantel (PZQ is an effective chemotherapy for schistosomiasis mansoni and a mainstay for its control and potential elimination. However, it does not prevent against reinfection, which can occur rapidly in areas with active transmission. A guide to ranking the risk factors for Schistosoma mansoni reinfection would greatly contribute to prioritizing resources and focusing prevention and control measures to prevent rapid reinfection. The objective of the current study was to explore the relationship among the socioeconomic, demographic, and epidemiological factors that can influence reinfection by S. mansoni one year after successful treatment with PZQ in school-aged children in Northeastern Minas Gerais state Brazil. Parasitological, socioeconomic, demographic, and water contact information were surveyed in 506 S. mansoni-infected individuals, aged 6 to 15 years, resident in these endemic areas. Eligible individuals were treated with PZQ until they were determined to be negative by the absence of S. mansoni eggs in the feces on two consecutive days of Kato-Katz fecal thick smear. These individuals were surveyed again 12 months from the date of successful treatment with PZQ. A classification and regression tree modeling (CART was then used to explore the relationship between socioeconomic, demographic, and epidemiological variables and their reinfection status. The most important risk factor identified for S. mansoni reinfection was their "heavy" infection at baseline. Additional analyses, excluding heavy infection status, showed that lower socioeconomic status and a lower level of education of the household head were also most important risk factors for S. mansoni reinfection. Our results provide an important contribution toward the control and possible elimination of schistosomiasis by identifying three major risk factors that can be used for targeted treatment and monitoring of reinfection. We suggest that control measures that target

  15. Use of clinical risk factors to identify postmenopausal women with vertebral fractures.

    Science.gov (United States)

    Tobias, J H; Hutchinson, A P; Hunt, L P; McCloskey, E V; Stone, M D; Martin, J C; Thompson, P W; Palferman, T G; Bhalla, A K

    2007-01-01

    Previous studies have been unable to identify risk factors for prevalent vertebral fractures (VF), which are suitable for use in selection strategies intended to target high-risk sub-groups for diagnostic assessment. However, these studies generally consisted of large epidemiology surveys based on questionnaires and were only able to evaluate a limited number of risk factors. Here, we investigated whether a stronger relationship exists with prevalent VF when conventional risk factors are combined with additional information obtained from detailed one-to-one assessment. Women aged 65-75 registered at four geographically distinct GP practices were invited to participate (n=1,518), of whom 540 attended for assessment as follows: a questionnaire asking about risk factors for osteoporosis such as height loss compared to age 25 and history of non-vertebral fracture (NVF), the get-up-and-go test, Margolis back pain score, measurement of wall-tragus and rib-pelvis distances, and BMD as measured by the distal forearm BMD. A lateral thoraco-lumbar spine X-ray was obtained, which was subsequently scored for the presence of significant vertebral deformities. Of the 509 subjects who underwent spinal radiographs, 37 (7.3%) were found to have one or more VF. Following logistic regression analysis, the four most predictive clinical risk factors for prevalent VF were: height loss (P=0.006), past NVF (P=0.004), history of back pain (P=0.075) and age (P=0.05). BMD was also significantly associated with prevalent VF (P=0.002), but its inclusion did not affect associations with other variables. Factors elicited from detailed one-to-one assessment were not related to the risk of one or more prevalent VFs. The area under ROC curves derived from these regressions, which suggested that models for prevalent VF had modest predictive accuracy, were as follows: 0.68 (BMD), 0.74 (four clinical risk factors above) and 0.78 (clinical risk factors + BMD). Analyses were repeated in relation to the

  16. Fuzzy Regression Prediction and Application Based on Multi-Dimensional Factors of Freight Volume

    Science.gov (United States)

    Xiao, Mengting; Li, Cheng

    2018-01-01

    Based on the reality of the development of air cargo, the multi-dimensional fuzzy regression method is used to determine the influencing factors, and the three most important influencing factors of GDP, total fixed assets investment and regular flight route mileage are determined. The system’s viewpoints and analogy methods, the use of fuzzy numbers and multiple regression methods to predict the civil aviation cargo volume. In comparison with the 13th Five-Year Plan for China’s Civil Aviation Development (2016-2020), it is proved that this method can effectively improve the accuracy of forecasting and reduce the risk of forecasting. It is proved that this model predicts civil aviation freight volume of the feasibility, has a high practical significance and practical operation.

  17. Anterior cruciate ligament injury: Identifying information sources and risk factor awareness among the general population.

    Directory of Open Access Journals (Sweden)

    Yasuharu Nagano

    Full Text Available Raising awareness on a disorder is important for its prevention and for promoting public health. However, for sports injuries like the anterior cruciate ligament (ACL injury no studies have investigated the awareness on risk factors for injury and possible preventative measures in the general population. The sources of information among the population are also unclear. The purpose of the present study was to identify these aspects of public awareness about the ACL injury.A questionnaire was randomly distributed among the general population registered with a web based questionnaire supplier, to recruit 900 participants who were aware about the ACL injury. The questionnaire consisted of two parts: Question 1 asked them about their sources of information regarding the ACL injury; Question 2 asked them about the risk factors for ACL injury. Multivariate logistic regression was used to determine the information sources that provide a good understanding of the risk factors.The leading source of information for ACL injury was television (57.0%. However, the results of logistic regression analysis revealed that television was not an effective medium to create awareness about the risk factors, among the general population. Instead "Lecture by a coach", "Classroom session on Health", and "Newspaper" were significantly more effective in creating a good awareness of the risk factors (p < 0.001.

  18. Regression-Based Norms for a Bi-factor Model for Scoring the Brief Test of Adult Cognition by Telephone (BTACT).

    Science.gov (United States)

    Gurnani, Ashita S; John, Samantha E; Gavett, Brandon E

    2015-05-01

    The current study developed regression-based normative adjustments for a bi-factor model of the The Brief Test of Adult Cognition by Telephone (BTACT). Archival data from the Midlife Development in the United States-II Cognitive Project were used to develop eight separate linear regression models that predicted bi-factor BTACT scores, accounting for age, education, gender, and occupation-alone and in various combinations. All regression models provided statistically significant fit to the data. A three-predictor regression model fit best and accounted for 32.8% of the variance in the global bi-factor BTACT score. The fit of the regression models was not improved by gender. Eight different regression models are presented to allow the user flexibility in applying demographic corrections to the bi-factor BTACT scores. Occupation corrections, while not widely used, may provide useful demographic adjustments for adult populations or for those individuals who have attained an occupational status not commensurate with expected educational attainment. © The Author 2015. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

  19. Identifying the important factors in simulation models with many factors

    NARCIS (Netherlands)

    Bettonvil, B.; Kleijnen, J.P.C.

    1994-01-01

    Simulation models may have many parameters and input variables (together called factors), while only a few factors are really important (parsimony principle). For such models this paper presents an effective and efficient screening technique to identify and estimate those important factors. The

  20. Regression: The Apple Does Not Fall Far From the Tree.

    Science.gov (United States)

    Vetter, Thomas R; Schober, Patrick

    2018-05-15

    Researchers and clinicians are frequently interested in either: (1) assessing whether there is a relationship or association between 2 or more variables and quantifying this association; or (2) determining whether 1 or more variables can predict another variable. The strength of such an association is mainly described by the correlation. However, regression analysis and regression models can be used not only to identify whether there is a significant relationship or association between variables but also to generate estimations of such a predictive relationship between variables. This basic statistical tutorial discusses the fundamental concepts and techniques related to the most common types of regression analysis and modeling, including simple linear regression, multiple regression, logistic regression, ordinal regression, and Poisson regression, as well as the common yet often underrecognized phenomenon of regression toward the mean. The various types of regression analysis are powerful statistical techniques, which when appropriately applied, can allow for the valid interpretation of complex, multifactorial data. Regression analysis and models can assess whether there is a relationship or association between 2 or more observed variables and estimate the strength of this association, as well as determine whether 1 or more variables can predict another variable. Regression is thus being applied more commonly in anesthesia, perioperative, critical care, and pain research. However, it is crucial to note that regression can identify plausible risk factors; it does not prove causation (a definitive cause and effect relationship). The results of a regression analysis instead identify independent (predictor) variable(s) associated with the dependent (outcome) variable. As with other statistical methods, applying regression requires that certain assumptions be met, which can be tested with specific diagnostics.

  1. Evaluating risk factors for endemic human Salmonella Enteritidis infections with different phage types in Ontario, Canada using multinomial logistic regression and a case-case study approach

    Directory of Open Access Journals (Sweden)

    Varga Csaba

    2012-10-01

    Full Text Available Abstract Background Identifying risk factors for Salmonella Enteritidis (SE infections in Ontario will assist public health authorities to design effective control and prevention programs to reduce the burden of SE infections. Our research objective was to identify risk factors for acquiring SE infections with various phage types (PT in Ontario, Canada. We hypothesized that certain PTs (e.g., PT8 and PT13a have specific risk factors for infection. Methods Our study included endemic SE cases with various PTs whose isolates were submitted to the Public Health Laboratory-Toronto from January 20th to August 12th, 2011. Cases were interviewed using a standardized questionnaire that included questions pertaining to demographics, travel history, clinical symptoms, contact with animals, and food exposures. A multinomial logistic regression method using the Generalized Linear Latent and Mixed Model procedure and a case-case study design were used to identify risk factors for acquiring SE infections with various PTs in Ontario, Canada. In the multinomial logistic regression model, the outcome variable had three categories representing human infections caused by SE PT8, PT13a, and all other SE PTs (i.e., non-PT8/non-PT13a as a referent category to which the other two categories were compared. Results In the multivariable model, SE PT8 was positively associated with contact with dogs (OR=2.17, 95% CI 1.01-4.68 and negatively associated with pepper consumption (OR=0.35, 95% CI 0.13-0.94, after adjusting for age categories and gender, and using exposure periods and health regions as random effects to account for clustering. Conclusions Our study findings offer interesting hypotheses about the role of phage type-specific risk factors. Multinomial logistic regression analysis and the case-case study approach are novel methodologies to evaluate associations among SE infections with different PTs and various risk factors.

  2. Exploring factors associated with traumatic dental injuries in preschool children: a Poisson regression analysis.

    Science.gov (United States)

    Feldens, Carlos Alberto; Kramer, Paulo Floriani; Ferreira, Simone Helena; Spiguel, Mônica Hermann; Marquezan, Marcela

    2010-04-01

    This cross-sectional study aimed to investigate the factors associated with dental trauma in preschool children using Poisson regression analysis with robust variance. The study population comprised 888 children aged 3- to 5-year-old attending public nurseries in Canoas, southern Brazil. Questionnaires assessing information related to the independent variables (age, gender, race, mother's educational level and family income) were completed by the parents. Clinical examinations were carried out by five trained examiners in order to assess traumatic dental injuries (TDI) according to Andreasen's classification. One of the five examiners was calibrated to assess orthodontic characteristics (open bite and overjet). Multivariable Poisson regression analysis with robust variance was used to determine the factors associated with dental trauma as well as the strengths of association. Traditional logistic regression was also performed in order to compare the estimates obtained by both methods of statistical analysis. 36.4% (323/888) of the children suffered dental trauma and there was no difference in prevalence rates from 3 to 5 years of age. Poisson regression analysis showed that the probability of the outcome was almost 30% higher for children whose mothers had more than 8 years of education (Prevalence Ratio = 1.28; 95% CI = 1.03-1.60) and 63% higher for children with an overjet greater than 2 mm (Prevalence Ratio = 1.63; 95% CI = 1.31-2.03). Odds ratios clearly overestimated the size of the effect when compared with prevalence ratios. These findings indicate the need for preventive orientation regarding TDI, in order to educate parents and caregivers about supervising infants, particularly those with increased overjet and whose mothers have a higher level of education. Poisson regression with robust variance represents a better alternative than logistic regression to estimate the risk of dental trauma in preschool children.

  3. [Socioeconomic factors conditioning obesity in adults. Evidence based on quantile regression and panel data].

    Science.gov (United States)

    Temporelli, Karina L; Viego, Valentina N

    2016-08-01

    Objective To measure the effect of socioeconomic variables on the prevalence of obesity. Factors such as income level, urbanization, incorporation of women into the labor market and access to unhealthy foods are considered in this paper. Method Econometric estimates of the proportion of obese men and women by country were calculated using models based on panel data and quantile regressions, with data from 192 countries for the period 2002-2005.Levels of per capita income, urbanization, income/big mac ratio price and labor indicators for female population were considered as explanatory variables. Results Factors that have influence over obesity in adults differ between men and women; accessibility to fast food is related to male obesity, while the employment mode causes higher rates in women. The underlying socioeconomic factors for obesity are also different depending on the magnitude of this problem in each country; in countries with low prevalence, a greater level of income favor the transition to obesogenic habits, while a higher income level mitigates the problem in those countries with high rates of obesity. Discussion Identifying the socio-economic causes of the significant increase in the prevalence of obesity is essential for the implementation of effective strategies for prevention, since this condition not only affects the quality of life of those who suffer from it but also puts pressure on health systems due to the treatment costs of associated diseases.

  4. Comparison of Linear and Non-linear Regression Analysis to Determine Pulmonary Pressure in Hyperthyroidism.

    Science.gov (United States)

    Scarneciu, Camelia C; Sangeorzan, Livia; Rus, Horatiu; Scarneciu, Vlad D; Varciu, Mihai S; Andreescu, Oana; Scarneciu, Ioan

    2017-01-01

    This study aimed at assessing the incidence of pulmonary hypertension (PH) at newly diagnosed hyperthyroid patients and at finding a simple model showing the complex functional relation between pulmonary hypertension in hyperthyroidism and the factors causing it. The 53 hyperthyroid patients (H-group) were evaluated mainly by using an echocardiographical method and compared with 35 euthyroid (E-group) and 25 healthy people (C-group). In order to identify the factors causing pulmonary hypertension the statistical method of comparing the values of arithmetical means is used. The functional relation between the two random variables (PAPs and each of the factors determining it within our research study) can be expressed by linear or non-linear function. By applying the linear regression method described by a first-degree equation the line of regression (linear model) has been determined; by applying the non-linear regression method described by a second degree equation, a parabola-type curve of regression (non-linear or polynomial model) has been determined. We made the comparison and the validation of these two models by calculating the determination coefficient (criterion 1), the comparison of residuals (criterion 2), application of AIC criterion (criterion 3) and use of F-test (criterion 4). From the H-group, 47% have pulmonary hypertension completely reversible when obtaining euthyroidism. The factors causing pulmonary hypertension were identified: previously known- level of free thyroxin, pulmonary vascular resistance, cardiac output; new factors identified in this study- pretreatment period, age, systolic blood pressure. According to the four criteria and to the clinical judgment, we consider that the polynomial model (graphically parabola- type) is better than the linear one. The better model showing the functional relation between the pulmonary hypertension in hyperthyroidism and the factors identified in this study is given by a polynomial equation of second

  5. Predictive factors in patients eligible for pegfilgrastim prophylaxis focusing on RDI using ordered logistic regression analysis.

    Science.gov (United States)

    Kanbayashi, Yuko; Ishikawa, Takeshi; Kanazawa, Motohiro; Nakajima, Yuki; Kawano, Rumi; Tabuchi, Yusuke; Yoshioka, Tomoko; Ihara, Norihiko; Hosokawa, Toyoshi; Takayama, Koichi; Shikata, Keisuke; Taguchi, Tetsuya

    2018-03-16

    Although pegfilgrastim prophylaxis is expected to maintain the relative dose intensity (RDI) of chemotherapy and improve safety, information is limited. However, the optimal selection of patients eligible for pegfilgrastim prophylaxis is an important issue from a medical economics viewpoint. Therefore, this retrospective study identified factors that could predict these eligible patients to maintain the RDI. The participants included 166 cancer patients undergoing pegfilgrastim prophylaxis combined with chemotherapy in our outpatient chemotherapy center between March 2015 and April 2017. Variables were extracted from clinical records for regression analysis of factors related to maintenance of the RDI. RDI was classified into four categories: 100% = 0, 85% or predictive factors in patients eligible for pegfilgrastim prophylaxis to maintain the RDI. Threshold measures were examined using a receiver operating characteristic (ROC) analysis curve. Age [odds ratio (OR) 1.07, 95% confidence interval (CI) 1.04-1.11; P maintenance. ROC curve analysis of the group that failed to maintain the RDI indicated that the threshold for age was 70 years and above, with a sensitivity of 60.0% and specificity of 80.2% (area under the curve: 0.74). In conclusion, younger age, anemia (less), and administration of pegfilgrastim 24-72 h after chemotherapy were significant factors for RDI maintenance.

  6. Spatial Bayesian latent factor regression modeling of coordinate-based meta-analysis data.

    Science.gov (United States)

    Montagna, Silvia; Wager, Tor; Barrett, Lisa Feldman; Johnson, Timothy D; Nichols, Thomas E

    2018-03-01

    Now over 20 years old, functional MRI (fMRI) has a large and growing literature that is best synthesised with meta-analytic tools. As most authors do not share image data, only the peak activation coordinates (foci) reported in the article are available for Coordinate-Based Meta-Analysis (CBMA). Neuroimaging meta-analysis is used to (i) identify areas of consistent activation; and (ii) build a predictive model of task type or cognitive process for new studies (reverse inference). To simultaneously address these aims, we propose a Bayesian point process hierarchical model for CBMA. We model the foci from each study as a doubly stochastic Poisson process, where the study-specific log intensity function is characterized as a linear combination of a high-dimensional basis set. A sparse representation of the intensities is guaranteed through latent factor modeling of the basis coefficients. Within our framework, it is also possible to account for the effect of study-level covariates (meta-regression), significantly expanding the capabilities of the current neuroimaging meta-analysis methods available. We apply our methodology to synthetic data and neuroimaging meta-analysis datasets. © 2017, The International Biometric Society.

  7. Spatial Bayesian Latent Factor Regression Modeling of Coordinate-based Meta-analysis Data

    Science.gov (United States)

    Montagna, Silvia; Wager, Tor; Barrett, Lisa Feldman; Johnson, Timothy D.; Nichols, Thomas E.

    2017-01-01

    Summary Now over 20 years old, functional MRI (fMRI) has a large and growing literature that is best synthesised with meta-analytic tools. As most authors do not share image data, only the peak activation coordinates (foci) reported in the paper are available for Coordinate-Based Meta-Analysis (CBMA). Neuroimaging meta-analysis is used to 1) identify areas of consistent activation; and 2) build a predictive model of task type or cognitive process for new studies (reverse inference). To simultaneously address these aims, we propose a Bayesian point process hierarchical model for CBMA. We model the foci from each study as a doubly stochastic Poisson process, where the study-specific log intensity function is characterised as a linear combination of a high-dimensional basis set. A sparse representation of the intensities is guaranteed through latent factor modeling of the basis coefficients. Within our framework, it is also possible to account for the effect of study-level covariates (meta-regression), significantly expanding the capabilities of the current neuroimaging meta-analysis methods available. We apply our methodology to synthetic data and neuroimaging meta-analysis datasets. PMID:28498564

  8. Support vector regression model for the estimation of γ-ray buildup factors for multi-layer shields

    International Nuclear Information System (INIS)

    Trontl, Kresimir; Smuc, Tomislav; Pevec, Dubravko

    2007-01-01

    The accuracy of the point-kernel method, which is a widely used practical tool for γ-ray shielding calculations, strongly depends on the quality and accuracy of buildup factors used in the calculations. Although, buildup factors for single-layer shields comprised of a single material are well known, calculation of buildup factors for stratified shields, each layer comprised of different material or a combination of materials, represent a complex physical problem. Recently, a new compact mathematical model for multi-layer shield buildup factor representation has been suggested for embedding into point-kernel codes thus replacing traditionally generated complex mathematical expressions. The new regression model is based on support vector machines learning technique, which is an extension of Statistical Learning Theory. The paper gives complete description of the novel methodology with results pertaining to realistic engineering multi-layer shielding geometries. The results based on support vector regression machine learning confirm that this approach provides a framework for general, accurate and computationally acceptable multi-layer buildup factor model

  9. Evaluation of an inpatient fall risk screening tool to identify the most critical fall risk factors in inpatients.

    Science.gov (United States)

    Hou, Wen-Hsuan; Kang, Chun-Mei; Ho, Mu-Hsing; Kuo, Jessie Ming-Chuan; Chen, Hsiao-Lien; Chang, Wen-Yin

    2017-03-01

    To evaluate the accuracy of the inpatient fall risk screening tool and to identify the most critical fall risk factors in inpatients. Variations exist in several screening tools applied in acute care hospitals for examining risk factors for falls and identifying high-risk inpatients. Secondary data analysis. A subset of inpatient data for the period from June 2011-June 2014 was extracted from the nursing information system and adverse event reporting system of an 818-bed teaching medical centre in Taipei. Data were analysed using descriptive statistics, receiver operating characteristic curve analysis and logistic regression analysis. During the study period, 205 fallers and 37,232 nonfallers were identified. The results revealed that the inpatient fall risk screening tool (cut-off point of ≥3) had a low sensitivity level (60%), satisfactory specificity (87%), a positive predictive value of 2·0% and a negative predictive value of 99%. The receiver operating characteristic curve analysis revealed an area under the curve of 0·805 (sensitivity, 71·8%; specificity, 78%). To increase the sensitivity values, the Youden index suggests at least 1·5 points to be the most suitable cut-off point for the inpatient fall risk screening tool. Multivariate logistic regression analysis revealed a considerably increased fall risk in patients with impaired balance and impaired elimination. The fall risk factor was also significantly associated with days of hospital stay and with admission to surgical wards. The findings can raise awareness about the two most critical risk factors for falls among future clinical nurses and other healthcare professionals and thus facilitate the development of fall prevention interventions. This study highlights the needs for redefining the cut-off points of the inpatient fall risk screening tool to effectively identify inpatients at a high risk of falls. Furthermore, inpatients with impaired balance and impaired elimination should be closely

  10. Identifying the bleeding trauma patient: predictive factors for massive transfusion in an Australasian trauma population.

    Science.gov (United States)

    Hsu, Jeremy Ming; Hitos, Kerry; Fletcher, John P

    2013-09-01

    Military and civilian data would suggest that hemostatic resuscitation results in improved outcomes for exsanguinating patients. However, identification of those patients who are at risk of significant hemorrhage is not clearly defined. We attempted to identify factors that would predict the need for massive transfusion (MT) in an Australasian trauma population, by comparing those trauma patients who did receive massive transfusion with those who did not. Between 1985 and 2010, 1,686 trauma patients receiving at least 1 U of packed red blood cells were identified from our prospectively maintained trauma registry. Demographic, physiologic, laboratory, injury, and outcome variables were reviewed. Univariate analysis determined significant factors between those who received MT and those who did not. A predictive multivariate logistic regression model with backward conditional stepwise elimination was used for MT risk. Statistical analysis was performed using SPSS PASW. MT patients had a higher pulse rate, lower Glasgow Coma Scale (GCS) score, lower systolic blood pressure, lower hemoglobin level, higher Injury Severity Score (ISS), higher international normalized ratio (INR), and longer stay. Initial logistic regression identified base deficit (BD), INR, and hemoperitoneum at laparotomy as independent predictive variables. After assigning cutoff points of BD being greater than 5 and an INR of 1.5 or greater, a further model was created. A BD greater than 5 and either INR of 1.5 or greater or hemoperitoneum was associated with 51 times increase in MT risk (odds ratio, 51.6; 95% confidence interval, 24.9-95.8). The area under the receiver operating characteristic curve for the model was 0.859. From this study, a combination of BD, INR, and hemoperitoneum has demonstrated good predictability for MT. This tool may assist in the determination of those patients who might benefit from hemostatic resuscitation. Prognostic study, level III.

  11. Partial Least Squares Regression for Determining the Control Factors for Runoff and Suspended Sediment Yield during Rainfall Events

    Directory of Open Access Journals (Sweden)

    Nufang Fang

    2015-07-01

    Full Text Available Multivariate statistics are commonly used to identify the factors that control the dynamics of runoff or sediment yields during hydrological processes. However, one issue with the use of conventional statistical methods to address relationships between variables and runoff or sediment yield is multicollinearity. The main objectives of this study were to apply a method for effectively identifying runoff and sediment control factors during hydrological processes and apply that method to a case study. The method combines the clustering approach and partial least squares regression (PLSR models. The case study was conducted in a mountainous watershed in the Three Gorges Area. A total of 29 flood events in three hydrological years in areas with different land uses were obtained. In total, fourteen related variables were separated from hydrographs using the classical hydrograph separation method. Twenty-nine rainfall events were classified into two rainfall regimes (heavy Rainfall Regime I and moderate Rainfall Regime II based on rainfall characteristics and K-means clustering. Four separate PLSR models were constructed to identify the main variables that control runoff and sediment yield for the two rainfall regimes. For Rainfall Regime I, the dominant first-order factors affecting the changes in sediment yield in our study were all of the four rainfall-related variables, flood peak discharge, maximum flood suspended sediment concentration, runoff, and the percentages of forest and farmland. For Rainfall Regime II, antecedent condition-related variables have more effects on both runoff and sediment yield than in Rainfall Regime I. The results suggest that the different control factors of the two rainfall regimes are determined by the rainfall characteristics and thus different runoff mechanisms.

  12. A Matlab program for stepwise regression

    Directory of Open Access Journals (Sweden)

    Yanhong Qi

    2016-03-01

    Full Text Available The stepwise linear regression is a multi-variable regression for identifying statistically significant variables in the linear regression equation. In present study, we presented the Matlab program of stepwise regression.

  13. Identifying at-risk profiles and protective factors for problem gambling: A longitudinal study across adolescence and early adulthood.

    Science.gov (United States)

    Allami, Youssef; Vitaro, Frank; Brendgen, Mara; Carbonneau, René; Tremblay, Richard E

    2018-05-01

    Past studies have identified various risk and protective factors for problem gambling (PG). However, no study has examined the interplay between these factors using a combination of person-centered and variable-centered approaches embedded within a longitudinal design. The present study aimed to (a) identify distinct profiles in early adolescence based on a set of risk factors commonly associated with PG (impulsivity, depression, anxiety, drug-alcohol use, aggressiveness, and antisociality), (b) explore the difference in reported gambling problems between these profiles during midadolescence and early adulthood, and (c) identify family- and peer-related variables that could operate as protective or compensatory factors in this context. Two samples were used: (a) a population sample (N = 1,033) living in low socioeconomic-status neighborhoods and (b) a population sample (N = 3,017) representative of students attending Quebec schools. Latent profile analyses were conducted to identify at-risk profiles based on individual risk factors measured at age 12 years. Negative binomial regression models were estimated to compare profiles in terms of their reported gambling problems at ages 16 and 23. Finally, family- and peer-related variables measured at age 14 were included to test their protective or compensatory role with respect to the link between at-risk profiles and gambling problems. Four profiles were identified: well-adjusted, internalizing, externalizing, and comorbid. Compared to the well-adjusted profile, the externalizing and comorbid profiles reported more gambling problems at ages 16 and 23, but the internalizing profile did not differ significantly. Various protective and compensatory factors emerged for each profile at both time points. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  14. Identifying factors affecting optimal management of agricultural water

    Directory of Open Access Journals (Sweden)

    Masoud Samian

    2015-01-01

    In addition to quantitative methodology such as descriptive statistics and factor analysis a qualitative methodology was employed for dynamic simulation among variables through Vensim software. In this study, the factor analysis technique was used through the Kaiser-Meyer-Olkin (KMO and Bartlett tests. From the results, four key elements were identified as factors affecting the optimal management of agricultural water in Hamedan area. These factors were institutional and legal factors, technical and knowledge factors, economic factors and social factors.

  15. Prediction of tissue-specific cis-regulatory modules using Bayesian networks and regression trees

    Directory of Open Access Journals (Sweden)

    Chen Xiaoyu

    2007-12-01

    Full Text Available Abstract Background In vertebrates, a large part of gene transcriptional regulation is operated by cis-regulatory modules. These modules are believed to be regulating much of the tissue-specificity of gene expression. Results We develop a Bayesian network approach for identifying cis-regulatory modules likely to regulate tissue-specific expression. The network integrates predicted transcription factor binding site information, transcription factor expression data, and target gene expression data. At its core is a regression tree modeling the effect of combinations of transcription factors bound to a module. A new unsupervised EM-like algorithm is developed to learn the parameters of the network, including the regression tree structure. Conclusion Our approach is shown to accurately identify known human liver and erythroid-specific modules. When applied to the prediction of tissue-specific modules in 10 different tissues, the network predicts a number of important transcription factor combinations whose concerted binding is associated to specific expression.

  16. Identifying Factors for Worker Motivation in Zambia's Rural Health Facilities.

    Science.gov (United States)

    Cross, Samuel S; Baernholdt, Dr Marianne

    2017-01-01

    Within Zambia there is a shortage of health workers in rural areas. This study aims to identify motivating factors for retaining rural health workers. Sixty rural health workers completed surveys and 46 were interviewed. They rated the importance of six motivating factors and discussed these and other factors in interviews. An interview was conducted with a Government Human Resources Manager (HR Manager) to elicit contextual information. All six factors were identified as being very important motivators, as were two additional factors. Additional career training was identified by many as the most important factor. Comparison of results and the HR Manager interview revealed that workers lacked knowledge about opportunities and that the HR manager was aware of barriers to career development. The Zambian government might better motivate and retain rural health workers by offering them any combination of identified factors, and by addressing the barriers to career development.

  17. Identifying motivational factors within a multinational company

    Directory of Open Access Journals (Sweden)

    Daniela Bradutanu

    2011-08-01

    Full Text Available The aim of the study is to identify the main motivational factors within a multinational company. The first objective is to identify work functions, formulated on Abraham Maslow’s pyramid, following the identification of the key characteristics that motivate an employee at the work place and last, but not least, the type of motivation that employees focus, intrinsic or extrinsic. The research method targeted a questionnaire based survey, including various company employees and an interview with the manager. The results confirmed that in Romania, employees put great emphasis on extrinsic motivation, a certain income and job security being primary. These results have implications for managers that in order to effectively motivate staff, first, must know their needs and expectations. To identify the main needs and motivational factors we had as a starting point Maslow's pyramid.

  18. Logistic regression analysis of risk factors for postoperative recurrence of spinal tumors and analysis of prognostic factors.

    Science.gov (United States)

    Zhang, Shanyong; Yang, Lili; Peng, Chuangang; Wu, Minfei

    2018-02-01

    The aim of the present study was to investigate the risk factors for postoperative recurrence of spinal tumors by logistic regression analysis and analysis of prognostic factors. In total, 77 male and 48 female patients with spinal tumor were selected in our hospital from January, 2010 to December, 2015 and divided into the benign (n=76) and malignant groups (n=49). All the patients underwent microsurgical resection of spinal tumors and were reviewed regularly 3 months after operation. The McCormick grading system was used to evaluate the postoperative spinal cord function. Data were subjected to statistical analysis. Of the 125 cases, 63 cases showed improvement after operation, 50 cases were stable, and deterioration was found in 12 cases. The improvement rate of patients with cervical spine tumor, which reached 56.3%, was the highest. Fifty-two cases of sensory disturbance, 34 cases of pain, 30 cases of inability to exercise, 26 cases of ataxia, and 12 cases of sphincter disorders were found after operation. Seventy-two cases (57.6%) underwent total resection, 18 cases (14.4%) received subtotal resection, 23 cases (18.4%) received partial resection, and 12 cases (9.6%) were only treated with biopsy/decompression. Postoperative recurrence was found in 57 cases (45.6%). The mean recurrence time of patients in the malignant group was 27.49±6.09 months, and the mean recurrence time of patients in the benign group was 40.62±4.34. The results were significantly different (Pregression analysis of total resection-related factors showed that total resection should be the preferred treatment for patients with benign tumors, thoracic and lumbosacral tumors, and lower McCormick grade, as well as patients without syringomyelia and intramedullary tumors. Logistic regression analysis of recurrence-related factors revealed that the recurrence rate was relatively higher in patients with malignant, cervical, thoracic and lumbosacral, intramedullary tumors, and higher Mc

  19. Logistic regression applied to natural hazards: rare event logistic regression with replications

    Science.gov (United States)

    Guns, M.; Vanacker, V.

    2012-06-01

    Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.

  20. Risk factors for violence in psychosis: systematic review and meta-regression analysis of 110 studies.

    Directory of Open Access Journals (Sweden)

    Katrina Witt

    Full Text Available Previous reviews on risk and protective factors for violence in psychosis have produced contrasting findings. There is therefore a need to clarify the direction and strength of association of risk and protective factors for violent outcomes in individuals with psychosis.We conducted a systematic review and meta-analysis using 6 electronic databases (CINAHL, EBSCO, EMBASE, Global Health, PsycINFO, PUBMED and Google Scholar. Studies were identified that reported factors associated with violence in adults diagnosed, using DSM or ICD criteria, with schizophrenia and other psychoses. We considered non-English language studies and dissertations. Risk and protective factors were meta-analysed if reported in three or more primary studies. Meta-regression examined sources of heterogeneity. A novel meta-epidemiological approach was used to group similar risk factors into one of 10 domains. Sub-group analyses were then used to investigate whether risk domains differed for studies reporting severe violence (rather than aggression or hostility and studies based in inpatient (rather than outpatient settings.There were 110 eligible studies reporting on 45,533 individuals, 8,439 (18.5% of whom were violent. A total of 39,995 (87.8% were diagnosed with schizophrenia, 209 (0.4% were diagnosed with bipolar disorder, and 5,329 (11.8% were diagnosed with other psychoses. Dynamic (or modifiable risk factors included hostile behaviour, recent drug misuse, non-adherence with psychological therapies (p values<0.001, higher poor impulse control scores, recent substance misuse, recent alcohol misuse (p values<0.01, and non-adherence with medication (p value <0.05. We also examined a number of static factors, the strongest of which were criminal history factors. When restricting outcomes to severe violence, these associations did not change materially. In studies investigating inpatient violence, associations differed in strength but not direction.Certain dynamic risk

  1. Logistic regression applied to natural hazards: rare event logistic regression with replications

    Directory of Open Access Journals (Sweden)

    M. Guns

    2012-06-01

    Full Text Available Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logistic regression with replications, combines the strength of probabilistic and statistical methods, and allows overcoming some of the limitations of previous developments through robust variable selection. This technique was here developed for the analyses of landslide controlling factors, but the concept is widely applicable for statistical analyses of natural hazards.

  2. Pathways-driven sparse regression identifies pathways and genes associated with high-density lipoprotein cholesterol in two Asian cohorts.

    Directory of Open Access Journals (Sweden)

    Matt Silver

    2013-11-01

    Full Text Available Standard approaches to data analysis in genome-wide association studies (GWAS ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK

  3. Pathways-Driven Sparse Regression Identifies Pathways and Genes Associated with High-Density Lipoprotein Cholesterol in Two Asian Cohorts

    Science.gov (United States)

    Silver, Matt; Chen, Peng; Li, Ruoying; Cheng, Ching-Yu; Wong, Tien-Yin; Tai, E-Shyong; Teo, Yik-Ying; Montana, Giovanni

    2013-01-01

    Standard approaches to data analysis in genome-wide association studies (GWAS) ignore any potential functional relationships between gene variants. In contrast gene pathways analysis uses prior information on functional structure within the genome to identify pathways associated with a trait of interest. In a second step, important single nucleotide polymorphisms (SNPs) or genes may be identified within associated pathways. The pathways approach is motivated by the fact that genes do not act alone, but instead have effects that are likely to be mediated through their interaction in gene pathways. Where this is the case, pathways approaches may reveal aspects of a trait's genetic architecture that would otherwise be missed when considering SNPs in isolation. Most pathways methods begin by testing SNPs one at a time, and so fail to capitalise on the potential advantages inherent in a multi-SNP, joint modelling approach. Here, we describe a dual-level, sparse regression model for the simultaneous identification of pathways and genes associated with a quantitative trait. Our method takes account of various factors specific to the joint modelling of pathways with genome-wide data, including widespread correlation between genetic predictors, and the fact that variants may overlap multiple pathways. We use a resampling strategy that exploits finite sample variability to provide robust rankings for pathways and genes. We test our method through simulation, and use it to perform pathways-driven gene selection in a search for pathways and genes associated with variation in serum high-density lipoprotein cholesterol levels in two separate GWAS cohorts of Asian adults. By comparing results from both cohorts we identify a number of candidate pathways including those associated with cardiomyopathy, and T cell receptor and PPAR signalling. Highlighted genes include those associated with the L-type calcium channel, adenylate cyclase, integrin, laminin, MAPK signalling and immune

  4. Regression and kriging analysis for grid power factor estimation

    Directory of Open Access Journals (Sweden)

    Rajesh Guntaka

    2014-12-01

    Full Text Available The measurement of power factor (PF in electrical utility grids is a mainstay of load balancing and is also a critical element of transmission and distribution efficiency. The measurement of PF dates back to the earliest periods of electrical power distribution to public grids. In the wide-area distribution grid, measurement of current waveforms is trivial and may be accomplished at any point in the grid using a current tap transformer. However, voltage measurement requires reference to ground and so is more problematic and measurements are normally constrained to points that have ready and easy access to a ground source. We present two mathematical analysis methods based on kriging and linear least square estimation (LLSE (regression to derive PF at nodes with unknown voltages that are within a perimeter of sample nodes with ground reference across a selected power grid. Our results indicate an error average of 1.884% that is within acceptable tolerances for PF measurements that are used in load balancing tasks.

  5. Pediatric malignant hyperthermia: risk factors, morbidity, and mortality identified from the Nationwide Inpatient Sample and Kids' Inpatient Database.

    Science.gov (United States)

    Salazar, Jose H; Yang, Jingyan; Shen, Liang; Abdullah, Fizan; Kim, Tae W

    2014-12-01

    Malignant Hyperthermia (MH) is a potentially fatal metabolic disorder. Due to its rarity, limited evidence exists about risk factors, morbidity, and mortality especially in children. Using the Nationwide Inpatient Sample and the Kid's Inpatient Database (KID), admissions with the ICD-9 code for MH (995.86) were extracted for patients 0-17 years of age. Demographic characteristics were analyzed. Logistic regression was performed to identify patient and hospital characteristics associated with mortality. A subset of patients with a surgical ICD-9 code in the KID was studied to calculate the prevalence of MH in the dataset. A total of 310 pediatric admissions were seen in 13 nonoverlapping years of data. Patients had a mortality of 2.9%. Male sex was predominant (64.8%), and 40.5% of the admissions were treated at centers not identified as children's hospitals. The most common associated diagnosis was rhabdomyolysis, which was present in 26 cases. Regression with the outcome of mortality did not yield significant differences between demographic factors, age, sex race, or hospital type, pediatric vs nonpediatric. Within a surgical subset of 530,449 admissions, MH was coded in 55, giving a rate of 1.04 cases per 10,000 cases. This study is the first to combine two large databases to study MH in the pediatric population. The analysis provides an insight into the risk factors, comorbidities, mortality, and prevalence of MH in the United States population. Until more methodologically rigorous, large-scale studies are done, the use of databases will continue to be the optimal method to study rare diseases. © 2014 John Wiley & Sons Ltd.

  6. Driven Factors Analysis of China’s Irrigation Water Use Efficiency by Stepwise Regression and Principal Component Analysis

    Directory of Open Access Journals (Sweden)

    Renfu Jia

    2016-01-01

    Full Text Available This paper introduces an integrated approach to find out the major factors influencing efficiency of irrigation water use in China. It combines multiple stepwise regression (MSR and principal component analysis (PCA to obtain more realistic results. In real world case studies, classical linear regression model often involves too many explanatory variables and the linear correlation issue among variables cannot be eliminated. Linearly correlated variables will cause the invalidity of the factor analysis results. To overcome this issue and reduce the number of the variables, PCA technique has been used combining with MSR. As such, the irrigation water use status in China was analyzed to find out the five major factors that have significant impacts on irrigation water use efficiency. To illustrate the performance of the proposed approach, the calculation based on real data was conducted and the results were shown in this paper.

  7. Risk factors of regression and undercorrection in photorefractive keratectomy:a case-control study

    Directory of Open Access Journals (Sweden)

    Seyed-Farzad Mohammadi

    2015-10-01

    Full Text Available AIM:To determine risk factors of regression and undercorrection following photorefractive keratectomy (PRK in myopia or myopic astigmatism.METHODS: A case-control study was designed in which eyes with an indication for re-treatment (RT were defined as cases; primary criteria for RT indication, as assessed at least 9mo postoperatively, included an uncorrected distance visual acuity (UDVA of 20/30 or worse and a stable refraction for more than 3mo. Additional considerations included optical quality symptoms and significant higher order aberrations (HOAs. Controls were chosen from the same cohort of operated eyes which had complete post-operative follow up data beyond 9mo and did not need RT. The cohort included patients who had undergone PRK by the Tissue-Saving (TS ablation profile of Technolas 217z100 excimer laser (Bausch & Lomb, Rochester, NY, USA. Mitomycin C had been used in all of the primary procedures.RESULTS:We had 70 case eyes and 158 control eyes, and they were comparable in terms of age, sex and follow-up time (P values:0.58, 1.00 and 0.89, respectively. Pre-operative spherical equivalent of more than -5.00 diopter (D, intended optical zone (OZ diameter of less than 6.00 mm and ocular fixation instability during laser ablation were associated with RT indications (all P values <0.001. These factors maintained their significance in the multiple logistic regression model with odd ratios of 6.12, 6.71 and 7.89, respectively.CONCLUSION:Higher refractive correction (>-5.00 D, smaller OZ (<6.00 mm and unstable fixation during laser ablation of PRK for myopia and myopic astigmatism were found to be strong predictors of undercorrection and regression.

  8. Bias in logistic regression due to imperfect diagnostic test results and practical correction approaches.

    Science.gov (United States)

    Valle, Denis; Lima, Joanna M Tucker; Millar, Justin; Amratia, Punam; Haque, Ubydul

    2015-11-04

    Logistic regression is a statistical model widely used in cross-sectional and cohort studies to identify and quantify the effects of potential disease risk factors. However, the impact of imperfect tests on adjusted odds ratios (and thus on the identification of risk factors) is under-appreciated. The purpose of this article is to draw attention to the problem associated with modelling imperfect diagnostic tests, and propose simple Bayesian models to adequately address this issue. A systematic literature review was conducted to determine the proportion of malaria studies that appropriately accounted for false-negatives/false-positives in a logistic regression setting. Inference from the standard logistic regression was also compared with that from three proposed Bayesian models using simulations and malaria data from the western Brazilian Amazon. A systematic literature review suggests that malaria epidemiologists are largely unaware of the problem of using logistic regression to model imperfect diagnostic test results. Simulation results reveal that statistical inference can be substantially improved when using the proposed Bayesian models versus the standard logistic regression. Finally, analysis of original malaria data with one of the proposed Bayesian models reveals that microscopy sensitivity is strongly influenced by how long people have lived in the study region, and an important risk factor (i.e., participation in forest extractivism) is identified that would have been missed by standard logistic regression. Given the numerous diagnostic methods employed by malaria researchers and the ubiquitous use of logistic regression to model the results of these diagnostic tests, this paper provides critical guidelines to improve data analysis practice in the presence of misclassification error. Easy-to-use code that can be readily adapted to WinBUGS is provided, enabling straightforward implementation of the proposed Bayesian models.

  9. Predictors of postoperative outcomes of cubital tunnel syndrome treatments using multiple logistic regression analysis.

    Science.gov (United States)

    Suzuki, Taku; Iwamoto, Takuji; Shizu, Kanae; Suzuki, Katsuji; Yamada, Harumoto; Sato, Kazuki

    2017-05-01

    This retrospective study was designed to investigate prognostic factors for postoperative outcomes for cubital tunnel syndrome (CubTS) using multiple logistic regression analysis with a large number of patients. Eighty-three patients with CubTS who underwent surgeries were enrolled. The following potential prognostic factors for disease severity were selected according to previous reports: sex, age, type of surgery, disease duration, body mass index, cervical lesion, presence of diabetes mellitus, Workers' Compensation status, preoperative severity, and preoperative electrodiagnostic testing. Postoperative severity of disease was assessed 2 years after surgery by Messina's criteria which is an outcome measure specifically for CubTS. Bivariate analysis was performed to select candidate prognostic factors for multiple linear regression analyses. Multiple logistic regression analysis was conducted to identify the association between postoperative severity and selected prognostic factors. Both bivariate and multiple linear regression analysis revealed only preoperative severity as an independent risk factor for poor prognosis, while other factors did not show any significant association. Although conflicting results exist regarding prognosis of CubTS, this study supports evidence from previous studies and concludes early surgical intervention portends the most favorable prognosis. Copyright © 2017 The Japanese Orthopaedic Association. Published by Elsevier B.V. All rights reserved.

  10. Análise de fatores e regressão bissegmentada em estudos de estratificação ambiental e adaptabilidade em milho Factor analysis and bissegmented regression for studies about environmental stratification and maize adaptability

    Directory of Open Access Journals (Sweden)

    Deoclécio Domingos Garbuglio

    2007-02-01

    Full Text Available O objetivo deste trabalho foi verificar possíveis divergências entre os resultados obtidos nas avaliações da adaptabilidade de 27 genótipos de milho (Zea mays L., e na estratificação de 22 ambientes no Estado do Paraná, por meio de técnicas baseadas na análise de fatores e regressão bissegmentada. As estratificações ambientais foram feitas por meio do método tradicional e por análise de fatores, aliada ao porcentual da porção simples da interação GxA (PS%. As análises de adaptabilidade foram realizadas por meio de regressão bissegmentada e análise de fatores. Pela análise de regressão bissegmentada, os genótipos estudados apresentaram alta performance produtiva; no entanto, não foi constatado o genótipo considerado como ideal. A adaptabilidade dos genótipos, analisada por meio de plotagens gráficas, apresentou respostas diferenciadas quando comparada à regressão bissegmentada. A análise de fatores mostrou-se eficiente nos processos de estratificação ambiental e adaptabilidade dos genótipos de milho.The objective of this work was to verify possible divergences among results obtained on adaptability evaluations of 27 maize genotypes (Zea mays L., and on stratification of 22 environments on Paraná State, Brazil, through techniques of factor analysis and bissegmented regression. The environmental stratifications were made through the traditional methodology and by factor analysis, allied to the percentage of the simple portion of GxE interaction (PS%. Adaptability analyses were carried out through bissegmented regression and factor analysis. By the analysis of bissegmented regression, studied genotypes had presented high productive performance; however, it was not evidenced the genotype considered as ideal. The adaptability of the genotypes, analyzed through graphs, presented different answers when compared to bissegmented regression. Factor analysis was efficient in the processes of environment stratification and

  11. [Analysis of dietary pattern and diabetes mellitus influencing factors identified by classification tree model in adults of Fujian].

    Science.gov (United States)

    Yu, F L; Ye, Y; Yan, Y S

    2017-05-10

    Objective: To find out the dietary patterns and explore the relationship between environmental factors (especially dietary patterns) and diabetes mellitus in the adults of Fujian. Methods: Multi-stage sampling method were used to survey residents aged ≥18 years by questionnaire, physical examination and laboratory detection in 10 disease surveillance points in Fujian. Factor analysis was used to identify the dietary patterns, while logistic regression model was applied to analyze relationship between dietary patterns and diabetes mellitus, and classification tree model was adopted to identify the influencing factors for diabetes mellitus. Results: There were four dietary patterns in the population, including meat, plant, high-quality protein, and fried food and beverages patterns. The result of logistic analysis showed that plant pattern, which has higher factor loading of fresh fruit-vegetables and cereal-tubers, was a protective factor for non-diabetes mellitus. The risk of diabetes mellitus in the population at T2 and T3 levels of factor score were 0.727 (95 %CI: 0.561-0.943) times and 0.736 (95 %CI : 0.573-0.944) times higher, respectively, than those whose factor score was in lowest quartile. Thirteen influencing factors and eleven group at high-risk for diabetes mellitus were identified by classification tree model. The influencing factors were dyslipidemia, age, family history of diabetes, hypertension, physical activity, career, sex, sedentary time, abdominal adiposity, BMI, marital status, sleep time and high-quality protein pattern. Conclusion: There is a close association between dietary patterns and diabetes mellitus. It is necessary to promote healthy and reasonable diet, strengthen the monitoring and control of blood lipids, blood pressure and body weight, and have good lifestyle for the prevention and control of diabetes mellitus.

  12. Identifying the factors underlying discontinuation of triptans.

    Science.gov (United States)

    Wells, Rebecca E; Markowitz, Shira Y; Baron, Eric P; Hentz, Joseph G; Kalidas, Kavita; Mathew, Paul G; Halker, Rashmi; Dodick, David W; Schwedt, Todd J

    2014-02-01

    To identify factors associated with triptan discontinuation among migraine patients. It is unclear why many migraine patients who are prescribed triptans discontinue this treatment. This study investigated correlates of triptan discontinuation with a focus on potentially modifiable factors to improve compliance. This multicenter cross-sectional survey (n = 276) was performed at US tertiary care headache clinics. Headache fellows who were members of the American Headache Society Headache Fellows Research Consortium recruited episodic and chronic migraine patients who were current triptan users (use within prior 3 months and for ≥1 year) or past triptan users (no use within 6 months; prior use within 2 years). Univariate analyses were first completed to compare current triptan users to past users for: migraine characteristics, other migraine treatments, triptan education, triptan efficacy, triptan side effects, type of prescribing provider, Migraine Disability Assessment (MIDAS) scores and Beck Depression Inventory (BDI) scores. Then, a multivariable logistic regression model was selected from all possible combinations of predictor variables to determine the factors that best correlated with triptan discontinuation. Compared with those still using triptans (n = 207), those who had discontinued use (n = 69) had higher rates of medication overuse (30 vs. 18%, P = .04) and were more likely to have ever used opioids for migraine treatment (57 vs. 38%, P = .006) as well as higher MIDAS (mean 63 vs. 37, P = .001) and BDI scores (mean 10.4 vs. 7.4, P = .009). Compared with discontinued users, current triptan users were more likely to have had their triptan prescribed by a specialist (neurologist, headache specialist, or pain specialist) (74 vs. 54%, P = .002) and were more likely to report headache resolution (53 vs. 14%, P  24 (2.6, [1.5, 4.6]), BDI >4 (2.5, [1.4, 4.5]), and a history of ever using opioids for migraine therapy (2.2, [1

  13. Identifying Predictive Factors for Incident Reports in Patients Receiving Radiation Therapy

    Energy Technology Data Exchange (ETDEWEB)

    Elnahal, Shereef M., E-mail: selnaha1@jhmi.edu [Department of Radiation Oncology and Molecular Radiation Sciences, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, Maryland (United States); Blackford, Amanda [Department of Oncology Biostatistics, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, Maryland (United States); Smith, Koren; Souranis, Annette N.; Briner, Valerie; McNutt, Todd R.; DeWeese, Theodore L.; Wright, Jean L.; Terezakis, Stephanie A. [Department of Radiation Oncology and Molecular Radiation Sciences, Sidney Kimmel Comprehensive Cancer Center, Johns Hopkins University School of Medicine, Baltimore, Maryland (United States)

    2016-04-01

    Purpose: To describe radiation therapy cases during which voluntary incident reporting occurred; and identify patient- or treatment-specific factors that place patients at higher risk for incidents. Methods and Materials: We used our institution's incident learning system to build a database of patients with incident reports filed between January 2011 and December 2013. Patient- and treatment-specific data were reviewed for all patients with reported incidents, which were classified by step in the process and root cause. A control group of patients without events was generated for comparison. Summary statistics, likelihood ratios, and mixed-effect logistic regression models were used for group comparisons. Results: The incident and control groups comprised 794 and 499 patients, respectively. Common root causes included documentation errors (26.5%), communication (22.5%), technical treatment planning (37.5%), and technical treatment delivery (13.5%). Incidents were more frequently reported in minors (age <18 years) than in adult patients (37.7% vs 0.4%, P<.001). Patients with head and neck (16% vs 8%, P<.001) and breast (20% vs 15%, P=.03) primaries more frequently had incidents, whereas brain (18% vs 24%, P=.008) primaries were less frequent. Larger tumors (17% vs 10% had T4 lesions, P=.02), and cases on protocol (9% vs 5%, P=.005) or with intensity modulated radiation therapy/image guided intensity modulated radiation therapy (52% vs 43%, P=.001) were more likely to have incidents. Conclusions: We found several treatment- and patient-specific variables associated with incidents. These factors should be considered by treatment teams at the time of peer review to identify patients at higher risk. Larger datasets are required to recommend changes in care process standards, to minimize safety risks.

  14. Identifying Predictive Factors for Incident Reports in Patients Receiving Radiation Therapy

    International Nuclear Information System (INIS)

    Elnahal, Shereef M.; Blackford, Amanda; Smith, Koren; Souranis, Annette N.; Briner, Valerie; McNutt, Todd R.; DeWeese, Theodore L.; Wright, Jean L.; Terezakis, Stephanie A.

    2016-01-01

    Purpose: To describe radiation therapy cases during which voluntary incident reporting occurred; and identify patient- or treatment-specific factors that place patients at higher risk for incidents. Methods and Materials: We used our institution's incident learning system to build a database of patients with incident reports filed between January 2011 and December 2013. Patient- and treatment-specific data were reviewed for all patients with reported incidents, which were classified by step in the process and root cause. A control group of patients without events was generated for comparison. Summary statistics, likelihood ratios, and mixed-effect logistic regression models were used for group comparisons. Results: The incident and control groups comprised 794 and 499 patients, respectively. Common root causes included documentation errors (26.5%), communication (22.5%), technical treatment planning (37.5%), and technical treatment delivery (13.5%). Incidents were more frequently reported in minors (age <18 years) than in adult patients (37.7% vs 0.4%, P<.001). Patients with head and neck (16% vs 8%, P<.001) and breast (20% vs 15%, P=.03) primaries more frequently had incidents, whereas brain (18% vs 24%, P=.008) primaries were less frequent. Larger tumors (17% vs 10% had T4 lesions, P=.02), and cases on protocol (9% vs 5%, P=.005) or with intensity modulated radiation therapy/image guided intensity modulated radiation therapy (52% vs 43%, P=.001) were more likely to have incidents. Conclusions: We found several treatment- and patient-specific variables associated with incidents. These factors should be considered by treatment teams at the time of peer review to identify patients at higher risk. Larger datasets are required to recommend changes in care process standards, to minimize safety risks.

  15. A simple approach to power and sample size calculations in logistic regression and Cox regression models.

    Science.gov (United States)

    Vaeth, Michael; Skovlund, Eva

    2004-06-15

    For a given regression problem it is possible to identify a suitably defined equivalent two-sample problem such that the power or sample size obtained for the two-sample problem also applies to the regression problem. For a standard linear regression model the equivalent two-sample problem is easily identified, but for generalized linear models and for Cox regression models the situation is more complicated. An approximately equivalent two-sample problem may, however, also be identified here. In particular, we show that for logistic regression and Cox regression models the equivalent two-sample problem is obtained by selecting two equally sized samples for which the parameters differ by a value equal to the slope times twice the standard deviation of the independent variable and further requiring that the overall expected number of events is unchanged. In a simulation study we examine the validity of this approach to power calculations in logistic regression and Cox regression models. Several different covariate distributions are considered for selected values of the overall response probability and a range of alternatives. For the Cox regression model we consider both constant and non-constant hazard rates. The results show that in general the approach is remarkably accurate even in relatively small samples. Some discrepancies are, however, found in small samples with few events and a highly skewed covariate distribution. Comparison with results based on alternative methods for logistic regression models with a single continuous covariate indicates that the proposed method is at least as good as its competitors. The method is easy to implement and therefore provides a simple way to extend the range of problems that can be covered by the usual formulas for power and sample size determination. Copyright 2004 John Wiley & Sons, Ltd.

  16. Advanced statistics: linear regression, part II: multiple linear regression.

    Science.gov (United States)

    Marill, Keith A

    2004-01-01

    The applications of simple linear regression in medical research are limited, because in most situations, there are multiple relevant predictor variables. Univariate statistical techniques such as simple linear regression use a single predictor variable, and they often may be mathematically correct but clinically misleading. Multiple linear regression is a mathematical technique used to model the relationship between multiple independent predictor variables and a single dependent outcome variable. It is used in medical research to model observational data, as well as in diagnostic and therapeutic studies in which the outcome is dependent on more than one factor. Although the technique generally is limited to data that can be expressed with a linear function, it benefits from a well-developed mathematical framework that yields unique solutions and exact confidence intervals for regression coefficients. Building on Part I of this series, this article acquaints the reader with some of the important concepts in multiple regression analysis. These include multicollinearity, interaction effects, and an expansion of the discussion of inference testing, leverage, and variable transformations to multivariate models. Examples from the first article in this series are expanded on using a primarily graphic, rather than mathematical, approach. The importance of the relationships among the predictor variables and the dependence of the multivariate model coefficients on the choice of these variables are stressed. Finally, concepts in regression model building are discussed.

  17. Predicting Factors of INSURE Failure in Low Birth Weight Neonates with RDS; A Logistic Regression Model

    Directory of Open Access Journals (Sweden)

    Bita Najafian

    2015-02-01

    Full Text Available Background:Respiratory Distress syndrome is the most common respiratory disease in premature neonate and the most important cause of death among them. We aimed to investigate factors to predict successful or failure of INSURE method as a therapeutic method of RDS.Methods:In a cohort study,45 neonates with diagnosed RDS and birth weight lower than 1500g were included and they underwent INSURE followed by NCPAP(Nasal Continuous Positive Airway Pressure. The patients were divided into failure or successful groups and factors which can predict success of INSURE were investigated by logistic regression in SPSS 16th version.Results:29 and16 neonates were observed in successful and failure groups, respectively. Birth weight was the only variable with significant difference between two groups (P=0.002. Finally logistic regression test showed that birth weight is only predicting factor for success (P: 0.001, EXP[β]: 0.009, CI [95%]: 1.003-0.014 and mortality (P: 0.029, EXP[β]: 0.993, CI [95%]: 0.987-0.999 of neonates treated with INSURE method.Conclusion:Predicting factors which affect on success rate of INSURE can be useful for treating and reducing charge of neonate with RDS and the birth weight is one of the effective factor on INSURE Success in this study.

  18. Predicting Factors of INSURE Failure in Low Birth Weight Neonates with RDS; A Logistic Regression Model

    Directory of Open Access Journals (Sweden)

    Bita Najafian

    2015-02-01

    Full Text Available Background:Respiratory Distress syndrome is the most common respiratory disease in premature neonate and the most important cause of death among them. We aimed to investigate factors to predict successful or failure of INSURE method as a therapeutic method of RDS. Methods:In a cohort study,45 neonates with diagnosed RDS and birth weight lower than 1500g were included and they underwent INSURE followed by NCPAP(Nasal Continuous Positive Airway Pressure. The patients were divided into failure or successful groups and factors which can predict success of INSURE were investigated by logistic regression in SPSS 16th version. Results:29 and16 neonates were observed in successful and failure groups, respectively. Birth weight was the only variable with significant difference between two groups (P=0.002. Finally logistic regression test showed that birth weight is only predicting factor for success (P: 0.001, EXP[β]: 0.009, CI [95%]: 1.003-0.014 and mortality (P: 0.029, EXP[β]: 0.993, CI [95%]: 0.987-0.999 of neonates treated with INSURE method. Conclusion:Predicting factors which affect on success rate of INSURE can be useful for treating and reducing charge of neonate with RDS and the birth weight is one of the effective factor on INSURE Success in this study.

  19. The Research of Regression Method for Forecasting Monthly Electricity Sales Considering Coupled Multi-factor

    Science.gov (United States)

    Wang, Jiangbo; Liu, Junhui; Li, Tiantian; Yin, Shuo; He, Xinhui

    2018-01-01

    The monthly electricity sales forecasting is a basic work to ensure the safety of the power system. This paper presented a monthly electricity sales forecasting method which comprehensively considers the coupled multi-factors of temperature, economic growth, electric power replacement and business expansion. The mathematical model is constructed by using regression method. The simulation results show that the proposed method is accurate and effective.

  20. Children exposed to intimate partner violence: Identifying differential effects of family environment on children's trauma and psychopathology symptoms through regression mixture models.

    Science.gov (United States)

    McDonald, Shelby Elaine; Shin, Sunny; Corona, Rosalie; Maternick, Anna; Graham-Bermann, Sandra A; Ascione, Frank R; Herbert Williams, James

    2016-08-01

    The majority of analytic approaches aimed at understanding the influence of environmental context on children's socioemotional adjustment assume comparable effects of contextual risk and protective factors for all children. Using self-reported data from 289 maternal caregiver-child dyads, we examined the degree to which there are differential effects of severity of intimate partner violence (IPV) exposure, yearly household income, and number of children in the family on posttraumatic stress symptoms (PTS) and psychopathology symptoms (i.e., internalizing and externalizing problems) among school-age children between the ages of 7-12 years. A regression mixture model identified three latent classes that were primarily distinguished by differential effects of IPV exposure severity on PTS and psychopathology symptoms: (1) asymptomatic with low sensitivity to environmental factors (66% of children), (2) maladjusted with moderate sensitivity (24%), and (3) highly maladjusted with high sensitivity (10%). Children with mothers who had higher levels of education were more likely to be in the maladjusted with moderate sensitivity group than the asymptomatic with low sensitivity group. Latino children were less likely to be in both maladjusted groups compared to the asymptomatic group. Overall, the findings suggest differential effects of family environmental factors on PTS and psychopathology symptoms among children exposed to IPV. Implications for research and practice are discussed. Copyright © 2016 Elsevier Ltd. All rights reserved.

  1. Transcriptional profiling provides insights into metronomic cyclophosphamide-activated, innate immune-dependent regression of brain tumor xenografts

    International Nuclear Information System (INIS)

    Doloff, Joshua C; Waxman, David J

    2015-01-01

    Cyclophosphamide treatment on a six-day repeating metronomic schedule induces a dramatic, innate immune cell-dependent regression of implanted gliomas. However, little is known about the underlying mechanisms whereby metronomic cyclophosphamide induces innate immune cell mobilization and recruitment, or about the role of DNA damage and cell stress response pathways in eliciting the immune responses linked to tumor regression. Untreated and metronomic cyclophosphamide-treated human U251 glioblastoma xenografts were analyzed on human microarrays at two treatment time points to identify responsive tumor cell-specific factors and their upstream regulators. Mouse microarray analysis across two glioma models (human U251, rat 9L) was used to identify host factors and gene networks that contribute to the observed immune and tumor regression responses. Metronomic cyclophosphamide increased expression of tumor cell-derived DNA damage, cell stress, and cell death genes, which may facilitate innate immune activation. Increased expression of many host (mouse) immune networks was also seen in both tumor models, including complement components, toll-like receptors, interferons, and cytolysis pathways. Key upstream regulators activated by metronomic cyclophosphamide include members of the interferon, toll-like receptor, inflammatory response, and PPAR signaling pathways, whose activation may contribute to anti-tumor immunity. Many upstream regulators inhibited by metronomic cyclophosphamide, including hypoxia-inducible factors and MAP kinases, have glioma-promoting activity; their inhibition may contribute to the therapeutic effectiveness of the six-day repeating metronomic cyclophosphamide schedule. Large numbers of responsive cytokines, chemokines and immune regulatory genes linked to innate immune cell recruitment and tumor regression were identified, as were several immunosuppressive factors that may contribute to the observed escape of some tumors from metronomic CPA

  2. Identifying Risk Factors for Drug Use in an Iranian Treatment Sample: A Prediction Approach Using Decision Trees.

    Science.gov (United States)

    Amirabadizadeh, Alireza; Nezami, Hossein; Vaughn, Michael G; Nakhaee, Samaneh; Mehrpour, Omid

    2018-05-12

    Substance abuse exacts considerable social and health care burdens throughout the world. The aim of this study was to create a prediction model to better identify risk factors for drug use. A prospective cross-sectional study was conducted in South Khorasan Province, Iran. Of the total of 678 eligible subjects, 70% (n: 474) were randomly selected to provide a training set for constructing decision tree and multiple logistic regression (MLR) models. The remaining 30% (n: 204) were employed in a holdout sample to test the performance of the decision tree and MLR models. Predictive performance of different models was analyzed by the receiver operating characteristic (ROC) curve using the testing set. Independent variables were selected from demographic characteristics and history of drug use. For the decision tree model, the sensitivity and specificity for identifying people at risk for drug abuse were 66% and 75%, respectively, while the MLR model was somewhat less effective at 60% and 73%. Key independent variables in the analyses included first substance experience, age at first drug use, age, place of residence, history of cigarette use, and occupational and marital status. While study findings are exploratory and lack generalizability they do suggest that the decision tree model holds promise as an effective classification approach for identifying risk factors for drug use. Convergent with prior research in Western contexts is that age of drug use initiation was a critical factor predicting a substance use disorder.

  3. Trait Mindfulness as a Limiting Factor for Residual Depressive Symptoms: An Explorative Study Using Quantile Regression

    Science.gov (United States)

    Radford, Sholto; Eames, Catrin; Brennan, Kate; Lambert, Gwladys; Crane, Catherine; Williams, J. Mark G.; Duggan, Danielle S.; Barnhofer, Thorsten

    2014-01-01

    Mindfulness has been suggested to be an important protective factor for emotional health. However, this effect might vary with regard to context. This study applied a novel statistical approach, quantile regression, in order to investigate the relation between trait mindfulness and residual depressive symptoms in individuals with a history of recurrent depression, while taking into account symptom severity and number of episodes as contextual factors. Rather than fitting to a single indicator of central tendency, quantile regression allows exploration of relations across the entire range of the response variable. Analysis of self-report data from 274 participants with a history of three or more previous episodes of depression showed that relatively higher levels of mindfulness were associated with relatively lower levels of residual depressive symptoms. This relationship was most pronounced near the upper end of the response distribution and moderated by the number of previous episodes of depression at the higher quantiles. The findings suggest that with lower levels of mindfulness, residual symptoms are less constrained and more likely to be influenced by other factors. Further, the limiting effect of mindfulness on residual symptoms is most salient in those with higher numbers of episodes. PMID:24988072

  4. Logistic regression analysis of factors associated with avascular necrosis of the femoral head following femoral neck fractures in middle-aged and elderly patients.

    Science.gov (United States)

    Ai, Zi-Sheng; Gao, You-Shui; Sun, Yuan; Liu, Yue; Zhang, Chang-Qing; Jiang, Cheng-Hua

    2013-03-01

    Risk factors for femoral neck fracture-induced avascular necrosis of the femoral head have not been elucidated clearly in middle-aged and elderly patients. Moreover, the high incidence of screw removal in China and its effect on the fate of the involved femoral head require statistical methods to reflect their intrinsic relationship. Ninety-nine patients older than 45 years with femoral neck fracture were treated by internal fixation between May 1999 and April 2004. Descriptive analysis, interaction analysis between associated factors, single factor logistic regression, multivariate logistic regression, and detailed interaction analysis were employed to explore potential relationships among associated factors. Avascular necrosis of the femoral head was found in 15 cases (15.2 %). Age × the status of implants (removal vs. maintenance) and gender × the timing of reduction were interactive according to two-factor interactive analysis. Age, the displacement of fractures, the quality of reduction, and the status of implants were found to be significant factors in single factor logistic regression analysis. Age, age × the status of implants, and the quality of reduction were found to be significant factors in multivariate logistic regression analysis. In fine interaction analysis after multivariate logistic regression analysis, implant removal was the most important risk factor for avascular necrosis in 56-to-85-year-old patients, with a risk ratio of 26.00 (95 % CI = 3.076-219.747). The middle-aged and elderly have less incidence of avascular necrosis of the femoral head following femoral neck fractures treated by cannulated screws. The removal of cannulated screws can induce a significantly high incidence of avascular necrosis of the femoral head in elderly patients, while a high-quality reduction is helpful to reduce avascular necrosis.

  5. Regression in autistic spectrum disorders.

    Science.gov (United States)

    Stefanatos, Gerry A

    2008-12-01

    A significant proportion of children diagnosed with Autistic Spectrum Disorder experience a developmental regression characterized by a loss of previously-acquired skills. This may involve a loss of speech or social responsitivity, but often entails both. This paper critically reviews the phenomena of regression in autistic spectrum disorders, highlighting the characteristics of regression, age of onset, temporal course, and long-term outcome. Important considerations for diagnosis are discussed and multiple etiological factors currently hypothesized to underlie the phenomenon are reviewed. It is argued that regressive autistic spectrum disorders can be conceptualized on a spectrum with other regressive disorders that may share common pathophysiological features. The implications of this viewpoint are discussed.

  6. Identifying factors which enhance capacity to engage in clinical education among podiatry practitioners: an action research project.

    Science.gov (United States)

    Abey, Sally; Lea, Susan; Callaghan, Lynne; Shaw, Steve; Cotton, Debbie

    2015-01-01

    Health profession students develop practical skills whilst integrating theory with practice in a real world environment as an important component of their training. Research in the area of practice placements has identified challenges and barriers to the delivery of effective placement learning. However, there has been little research in podiatry and the question of which factors impact upon clinical educators' capacity to engage with the role remains an under-researched area. This paper presents the second phase of an action research project designed to determine the factors that impact upon clinical educators' capacity to engage with the mentorship role. An online survey was developed and podiatry clinical educators recruited through National Health Service (NHS) Trusts. The survey included socio-demographic items, and questions relating to the factors identified as possible variables influencing clinical educator capacity; the latter was assessed using the 'Clinical Educator Capacity to Engage' scale (CECE). Descriptive statistics were used to explore demographic data whilst the relationship between the CECE and socio-demographic factors were examined using inferential statistics in relation to academic profile, career profile and organisation of the placement. The survey response rate was 42 % (n = 66). Multiple linear regression identified four independent variables which explain a significant proportion of the variability of the dependent variable, 'capacity to engage with clinical education', with an adjusted R2 of 0.428. The four variables were: protected mentorship time, clinical educator relationship with university, sign-off responsibility, and volunteer status. The identification of factors that impact upon clinical educators' capacity to engage in mentoring of students has relevance for strategic planning and policy-making with the emphasis upon capacity-building at an individual level, so that the key attitudes and characteristics that are linked

  7. Using Structured Additive Regression Models to Estimate Risk Factors of Malaria: Analysis of 2010 Malawi Malaria Indicator Survey Data

    Science.gov (United States)

    Chirombo, James; Lowe, Rachel; Kazembe, Lawrence

    2014-01-01

    Background After years of implementing Roll Back Malaria (RBM) interventions, the changing landscape of malaria in terms of risk factors and spatial pattern has not been fully investigated. This paper uses the 2010 malaria indicator survey data to investigate if known malaria risk factors remain relevant after many years of interventions. Methods We adopted a structured additive logistic regression model that allowed for spatial correlation, to more realistically estimate malaria risk factors. Our model included child and household level covariates, as well as climatic and environmental factors. Continuous variables were modelled by assuming second order random walk priors, while spatial correlation was specified as a Markov random field prior, with fixed effects assigned diffuse priors. Inference was fully Bayesian resulting in an under five malaria risk map for Malawi. Results Malaria risk increased with increasing age of the child. With respect to socio-economic factors, the greater the household wealth, the lower the malaria prevalence. A general decline in malaria risk was observed as altitude increased. Minimum temperatures and average total rainfall in the three months preceding the survey did not show a strong association with disease risk. Conclusions The structured additive regression model offered a flexible extension to standard regression models by enabling simultaneous modelling of possible nonlinear effects of continuous covariates, spatial correlation and heterogeneity, while estimating usual fixed effects of categorical and continuous observed variables. Our results confirmed that malaria epidemiology is a complex interaction of biotic and abiotic factors, both at the individual, household and community level and that risk factors are still relevant many years after extensive implementation of RBM activities. PMID:24991915

  8. Neutron Buildup Factors Calculation for Support Vector Regression Application in Shielding Analysis

    International Nuclear Information System (INIS)

    Duckic, P.; Matijevic, M.; Grgic, D.

    2016-01-01

    In this paper initial set of data for neutron buildup factors determination using Support Vector Regression (SVR) method is prepared. The performance of SVR technique strongly depends on the quality of information used for model training. Thus it is very important to provide representable data to the SVR. SVR is a supervised type of learning so it demands data in the input/output form. In the case of neutron buildup factors estimation, the input parameters are the incident neutron energy, shielding thickness and shielding material and the output parameter is the neutron buildup factor value. So far the initial sets of data for different shielding configurations have been obtained using SCALE4.4 sequence SAS3. However, this results were obtained using group constants, thus the incident neutron energy was determined as the average value for each energy group. Obtained this way, the data provided to the SVR are fewer and therefore insufficient. More valuable information is obtained using SCALE6.2beta5 sequence MAVRIC which can perform calculations for the explicit incident neutron energy, which leads to greater maneuvering possibilities when active learning measures are employed, and consequently improves the quality of the developed SVR model.(author).

  9. Deriving amplification factors from simple site parameters using generalized regression neural networks: implications for relevant site proxies

    Science.gov (United States)

    Boudghene Stambouli, Ahmed; Zendagui, Djawad; Bard, Pierre-Yves; Derras, Boumédiène

    2017-07-01

    Most modern seismic codes account for site effects using an amplification factor (AF) that modifies the rock acceleration response spectra in relation to a "site condition proxy," i.e., a parameter related to the velocity profile at the site under consideration. Therefore, for practical purposes, it is interesting to identify the site parameters that best control the frequency-dependent shape of the AF. The goal of the present study is to provide a quantitative assessment of the performance of various site condition proxies to predict the main AF features, including the often used short- and mid-period amplification factors, Fa and Fv, proposed by Borcherdt (in Earthq Spectra 10:617-653, 1994). In this context, the linear, viscoelastic responses of a set of 858 actual soil columns from Japan, the USA, and Europe are computed for a set of 14 real accelerograms with varying frequency contents. The correlation between the corresponding site-specific average amplification factors and several site proxies (considered alone or as multiple combinations) is analyzed using the generalized regression neural network (GRNN). The performance of each site proxy combination is assessed through the variance reduction with respect to the initial amplification factor variability of the 858 profiles. Both the whole period range and specific short- and mid-period ranges associated with the Borcherdt factors Fa and Fv are considered. The actual amplification factor of an arbitrary soil profile is found to be satisfactorily approximated with a limited number of site proxies (4-6). As the usual code practice implies a lower number of site proxies (generally one, sometimes two), a sensitivity analysis is conducted to identify the "best performing" site parameters. The best one is the overall velocity contrast between underlying bedrock and minimum velocity in the soil column. Because these are the most difficult and expensive parameters to measure, especially for thick deposits, other

  10. Principal component regression analysis with SPSS.

    Science.gov (United States)

    Liu, R X; Kuang, J; Gong, Q; Hou, X L

    2003-06-01

    The paper introduces all indices of multicollinearity diagnoses, the basic principle of principal component regression and determination of 'best' equation method. The paper uses an example to describe how to do principal component regression analysis with SPSS 10.0: including all calculating processes of the principal component regression and all operations of linear regression, factor analysis, descriptives, compute variable and bivariate correlations procedures in SPSS 10.0. The principal component regression analysis can be used to overcome disturbance of the multicollinearity. The simplified, speeded up and accurate statistical effect is reached through the principal component regression analysis with SPSS.

  11. Identification of cotton properties to improve yarn count quality by using regression analysis

    International Nuclear Information System (INIS)

    Amin, M.; Ullah, M.; Akbar, A.

    2014-01-01

    Identification of raw material characteristics towards yarn count variation was studied by using statistical techniques. Regression analysis is used to meet the objective. Stepwise regression is used for mode) selection, and coefficient of determination and mean squared error (MSE) criteria are used to identify the contributing factors of cotton properties for yam count. Statistical assumptions of normality, autocorrelation and multicollinearity are evaluated by using probability plot, Durbin Watson test, variance inflation factor (VIF), and then model fitting is carried out. It is found that, invisible (INV), nepness (Nep), grayness (RD), cotton trash (TR) and uniformity index (VI) are the main contributing cotton properties for yarn count variation. The results are also verified by Pareto chart. (author)

  12. Determining the Relationship between U.S. County-Level Adult Obesity Rate and Multiple Risk Factors by PLS Regression and SVM Modeling Approaches

    Directory of Open Access Journals (Sweden)

    Chau-Kuang Chen

    2015-02-01

    Full Text Available Data from the Center for Disease Control (CDC has shown that the obesity rate doubled among adults within the past two decades. This upsurge was the result of changes in human behavior and environment. Partial least squares (PLS regression and support vector machine (SVM models were conducted to determine the relationship between U.S. county-level adult obesity rate and multiple risk factors. The outcome variable was the adult obesity rate. The 23 risk factors were categorized into four domains of the social ecological model including biological/behavioral factor, socioeconomic status, food environment, and physical environment. Of the 23 risk factors related to adult obesity, the top eight significant risk factors with high normalized importance were identified including physical inactivity, natural amenity, percent of households receiving SNAP benefits, and percent of all restaurants being fast food. The study results were consistent with those in the literature. The study showed that adult obesity rate was influenced by biological/behavioral factor, socioeconomic status, food environment, and physical environment embedded in the social ecological theory. By analyzing multiple risk factors of obesity in the communities, may lead to the proposal of more comprehensive and integrated policies and intervention programs to solve the population-based problem.

  13. Use of multilevel logistic regression to identify the causes of differential item functioning.

    Science.gov (United States)

    Balluerka, Nekane; Gorostiaga, Arantxa; Gómez-Benito, Juana; Hidalgo, María Dolores

    2010-11-01

    Given that a key function of tests is to serve as evaluation instruments and for decision making in the fields of psychology and education, the possibility that some of their items may show differential behaviour is a major concern for psychometricians. In recent decades, important progress has been made as regards the efficacy of techniques designed to detect this differential item functioning (DIF). However, the findings are scant when it comes to explaining its causes. The present study addresses this problem from the perspective of multilevel analysis. Starting from a case study in the area of transcultural comparisons, multilevel logistic regression is used: 1) to identify the item characteristics associated with the presence of DIF; 2) to estimate the proportion of variation in the DIF coefficients that is explained by these characteristics; and 3) to evaluate alternative explanations of the DIF by comparing the explanatory power or fit of different sequential models. The comparison of these models confirmed one of the two alternatives (familiarity with the stimulus) and rejected the other (the topic area) as being a cause of differential functioning with respect to the compared groups.

  14. Logistic regression applied to natural hazards: rare event logistic regression with replications

    OpenAIRE

    Guns, M.; Vanacker, Veerle

    2012-01-01

    Statistical analysis of natural hazards needs particular attention, as most of these phenomena are rare events. This study shows that the ordinary rare event logistic regression, as it is now commonly used in geomorphologic studies, does not always lead to a robust detection of controlling factors, as the results can be strongly sample-dependent. In this paper, we introduce some concepts of Monte Carlo simulations in rare event logistic regression. This technique, so-called rare event logisti...

  15. Comparison of multinomial logistic regression and logistic regression: which is more efficient in allocating land use?

    Science.gov (United States)

    Lin, Yingzhi; Deng, Xiangzheng; Li, Xing; Ma, Enjun

    2014-12-01

    Spatially explicit simulation of land use change is the basis for estimating the effects of land use and cover change on energy fluxes, ecology and the environment. At the pixel level, logistic regression is one of the most common approaches used in spatially explicit land use allocation models to determine the relationship between land use and its causal factors in driving land use change, and thereby to evaluate land use suitability. However, these models have a drawback in that they do not determine/allocate land use based on the direct relationship between land use change and its driving factors. Consequently, a multinomial logistic regression method was introduced to address this flaw, and thereby, judge the suitability of a type of land use in any given pixel in a case study area of the Jiangxi Province, China. A comparison of the two regression methods indicated that the proportion of correctly allocated pixels using multinomial logistic regression was 92.98%, which was 8.47% higher than that obtained using logistic regression. Paired t-test results also showed that pixels were more clearly distinguished by multinomial logistic regression than by logistic regression. In conclusion, multinomial logistic regression is a more efficient and accurate method for the spatial allocation of land use changes. The application of this method in future land use change studies may improve the accuracy of predicting the effects of land use and cover change on energy fluxes, ecology, and environment.

  16. Identifiable risk factors in hepatitis b and c

    International Nuclear Information System (INIS)

    Rehman, F.U.; Pervez, A.; Rafiq, A.

    2011-01-01

    Background: Both hepatitis B and C are common infections affecting masses and are leading causes of Chronic Liver Disease in Pakistan as well as worldwide. In majority of cases both viral diseases spread by factors that are preventable. The present study is conducted to determine the identifiable risk factors in patients admitted with Chronic Hepatitis B and C. Methods: An observational study was carried out for a period of 6 months. All age groups and both sexes were included. The patients were interviewed and the identifiable risk factors were looked for. The standard methods for detection of Hepatitis B and C were used. Results: One-hundred and ten patients were studied from January to July 2009. Sixty-five patients had Hepatitis C, 35 had Hepatitis B, and 10 had both Hepatitis B and C. Ninety-three patients had a history of injections and transfusions etc., and 38 had surgical scars. Tattoos were present in 42 patients and nose and/or ear piercing marks were present in 28 patients. The number of risk factors increased in co-infection. Conclusion: There is a role of unhygienic health delivery practices, lack of awareness and resources for standard screening protocol for spread of Hepatitis B and C. (author)

  17. Total-Factor Energy Efficiency (TFEE Evaluation on Thermal Power Industry with DEA, Malmquist and Multiple Regression Techniques

    Directory of Open Access Journals (Sweden)

    Jin-Peng Liu

    2017-07-01

    Full Text Available Under the background of a new round of power market reform, realizing the goals of energy saving and emission reduction, reducing the coal consumption and ensuring the sustainable development are the key issues for thermal power industry. With the biggest economy and energy consumption scales in the world, China should promote the energy efficiency of thermal power industry to solve these problems. Therefore, from multiple perspectives, the factors influential to the energy efficiency of thermal power industry were identified. Based on the economic, social and environmental factors, a combination model with Data Envelopment Analysis (DEA and Malmquist index was constructed to evaluate the total-factor energy efficiency (TFEE in thermal power industry. With the empirical studies from national and provincial levels, the TFEE index can be factorized into the technical efficiency index (TECH, the technical progress index (TPCH, the pure efficiency index (PECH and the scale efficiency index (SECH. The analysis showed that the TFEE was mainly determined by TECH and PECH. Meanwhile, by panel data regression model, unit coal consumption, talents and government supervision were selected as important indexes to have positive effects on TFEE in thermal power industry. In addition, the negative indexes, such as energy price and installed capacity, were also analyzed to control their undesired effects. Finally, considering the analysis results, measures for improving energy efficiency of thermal power industry were discussed widely, such as strengthening technology research and design (R&D, enforcing pollutant and emission reduction, distributing capital and labor rationally and improving the government supervision. Relative study results and suggestions can provide references for Chinese government and enterprises to enhance the energy efficiency level.

  18. Lung cancer and risk factors: how to identify phenotypic markers?

    International Nuclear Information System (INIS)

    Clement-Duchene, Christelle

    2009-01-01

    Lung cancer is the leading cause of death in the world. Most lung cancer are diagnosed at an advanced stage (IIIB and IV), with a poor prognosis. The main risk factors are well known like active smoking, and occupational exposure (asbestos), but 10 a 20% occur in never smokers. In this population, various studies have been conducted in order to identify possible risk factors, and although many have been identified, none seem to explain more than a small percentage of the cases. According to the histological types, adenocarcinoma is now the more frequent type, and its association with the main risk factors (tobacco exposure, asbestos exposure) is still studied. The tumoral location is associated with the exposure to the risk factors. Finally, the survival seems to be different between gender, and between smokers, and never smokers. All these characteristics are perhaps associated with different pathways of carcinogenesis. In this context, we have analyzed a cohort of 1493 patients with lung cancer in order to identify phenotypic markers, and to understand the mechanisms of the lung carcinogenesis. (author) [fr

  19. Identifying patients with therapy-resistant depression by using factor analysis

    DEFF Research Database (Denmark)

    Andreasson, K; Liest, V; Lunde, M

    2010-01-01

    with transcranial pulsed electromagnetic fields (T-PEMF)], in which the relative effect as percentage of improvement during the treatment period was analysed. RESULTS: We identified 2 major factors, the first of which was a general factor. The second was a dual factor consisting of a depression subscale comprising...

  20. Determination of osteoporosis risk factors using a multiple logistic regression model in postmenopausal Turkish women.

    Science.gov (United States)

    Akkus, Zeki; Camdeviren, Handan; Celik, Fatma; Gur, Ali; Nas, Kemal

    2005-09-01

    To determine the risk factors of osteoporosis using a multiple binary logistic regression method and to assess the risk variables for osteoporosis, which is a major and growing health problem in many countries. We presented a case-control study, consisting of 126 postmenopausal healthy women as control group and 225 postmenopausal osteoporotic women as the case group. The study was carried out in the Department of Physical Medicine and Rehabilitation, Dicle University, Diyarbakir, Turkey between 1999-2002. The data from the 351 participants were collected using a standard questionnaire that contains 43 variables. A multiple logistic regression model was then used to evaluate the data and to find the best regression model. We classified 80.1% (281/351) of the participants using the regression model. Furthermore, the specificity value of the model was 67% (84/126) of the control group while the sensitivity value was 88% (197/225) of the case group. We found the distribution of residual values standardized for final model to be exponential using the Kolmogorow-Smirnow test (p=0.193). The receiver operating characteristic curve was found successful to predict patients with risk for osteoporosis. This study suggests that low levels of dietary calcium intake, physical activity, education, and longer duration of menopause are independent predictors of the risk of low bone density in our population. Adequate dietary calcium intake in combination with maintaining a daily physical activity, increasing educational level, decreasing birth rate, and duration of breast-feeding may contribute to healthy bones and play a role in practical prevention of osteoporosis in Southeast Anatolia. In addition, the findings of the present study indicate that the use of multivariate statistical method as a multiple logistic regression in osteoporosis, which maybe influenced by many variables, is better than univariate statistical evaluation.

  1. Lagged kernel machine regression for identifying time windows of susceptibility to exposures of complex mixtures.

    Science.gov (United States)

    Liu, Shelley H; Bobb, Jennifer F; Lee, Kyu Ha; Gennings, Chris; Claus Henn, Birgit; Bellinger, David; Austin, Christine; Schnaas, Lourdes; Tellez-Rojo, Martha M; Hu, Howard; Wright, Robert O; Arora, Manish; Coull, Brent A

    2018-07-01

    The impact of neurotoxic chemical mixtures on children's health is a critical public health concern. It is well known that during early life, toxic exposures may impact cognitive function during critical time intervals of increased vulnerability, known as windows of susceptibility. Knowledge on time windows of susceptibility can help inform treatment and prevention strategies, as chemical mixtures may affect a developmental process that is operating at a specific life phase. There are several statistical challenges in estimating the health effects of time-varying exposures to multi-pollutant mixtures, such as: multi-collinearity among the exposures both within time points and across time points, and complex exposure-response relationships. To address these concerns, we develop a flexible statistical method, called lagged kernel machine regression (LKMR). LKMR identifies critical exposure windows of chemical mixtures, and accounts for complex non-linear and non-additive effects of the mixture at any given exposure window. Specifically, LKMR estimates how the effects of a mixture of exposures change with the exposure time window using a Bayesian formulation of a grouped, fused lasso penalty within a kernel machine regression (KMR) framework. A simulation study demonstrates the performance of LKMR under realistic exposure-response scenarios, and demonstrates large gains over approaches that consider each time window separately, particularly when serial correlation among the time-varying exposures is high. Furthermore, LKMR demonstrates gains over another approach that inputs all time-specific chemical concentrations together into a single KMR. We apply LKMR to estimate associations between neurodevelopment and metal mixtures in Early Life Exposures in Mexico and Neurotoxicology, a prospective cohort study of child health in Mexico City.

  2. Regression Trees Identify Relevant Interactions: Can This Improve the Predictive Performance of Risk Adjustment?

    Science.gov (United States)

    Buchner, Florian; Wasem, Jürgen; Schillo, Sonja

    2017-01-01

    Risk equalization formulas have been refined since their introduction about two decades ago. Because of the complexity and the abundance of possible interactions between the variables used, hardly any interactions are considered. A regression tree is used to systematically search for interactions, a methodologically new approach in risk equalization. Analyses are based on a data set of nearly 2.9 million individuals from a major German social health insurer. A two-step approach is applied: In the first step a regression tree is built on the basis of the learning data set. Terminal nodes characterized by more than one morbidity-group-split represent interaction effects of different morbidity groups. In the second step the 'traditional' weighted least squares regression equation is expanded by adding interaction terms for all interactions detected by the tree, and regression coefficients are recalculated. The resulting risk adjustment formula shows an improvement in the adjusted R 2 from 25.43% to 25.81% on the evaluation data set. Predictive ratios are calculated for subgroups affected by the interactions. The R 2 improvement detected is only marginal. According to the sample level performance measures used, not involving a considerable number of morbidity interactions forms no relevant loss in accuracy. Copyright © 2015 John Wiley & Sons, Ltd. Copyright © 2015 John Wiley & Sons, Ltd.

  3. Risk factors identified for owner-reported feline obesity at around one year of age: Dry diet and indoor lifestyle.

    Science.gov (United States)

    Rowe, Elizabeth; Browne, William; Casey, Rachel; Gruffydd-Jones, Tim; Murray, Jane

    2015-10-01

    Obesity is considered the second most common health problem in pet cats in developed countries. Previous studies investigating risk factors for feline obesity have been cross-sectional, where reverse causality cannot be ruled out. This study is the first to use prospective data from a large scale longitudinal study of pet cats ('Bristol Cats') to identify early-life risk factors for feline overweight/obesity at around one year of age. Data analysed were collected via three owner-completed questionnaires (for cats aged 2-4 months, 6.5-7 months and 12.5-13 months) completed between May 2010 and August 2013. Owner-reported body condition scores (BCS) of cats at age 12.5-13 months, using the 5-point system, were categorised into a dichotomous variable: overweight/obese (BCS 4-5) and not overweight (BCS 1-3) and used as the dependent variable. Cat breed, neuter status, outdoor access, type of diet, frequency of wet and dry food fed and frequency of treats fed were analysed as potential risk factors. Of the 966 cats for which data were available, 7.0% were reported by their owners to be overweight/obese at 12.5-13 months of age. Descriptive data on type of diet fed at different cat ages suggest that a dry diet is the most popular choice for UK domestic cats. Significant potential explanatory variables from univariable logistic regression models were included in multivariable logistic regression models built using stepwise forward-selection. To account for potential hierarchical clustering of data due to multi-cat households these were extended to two-level random intercept models. Models were compared using Wald test p- values. Clustering had no impact on the analysis. The final multivariable logistic regression model identified two risk factors that were independently associated with an increased risk of feline obesity developing at 12.5-13 months of age: restricted or no outdoor access and feeding dry food as the only or major (>50%) type of food in the diet at age 12

  4. Identifying influential factors of business process performance using dependency analysis

    Science.gov (United States)

    Wetzstein, Branimir; Leitner, Philipp; Rosenberg, Florian; Dustdar, Schahram; Leymann, Frank

    2011-02-01

    We present a comprehensive framework for identifying influential factors of business process performance. In particular, our approach combines monitoring of process events and Quality of Service (QoS) measurements with dependency analysis to effectively identify influential factors. The framework uses data mining techniques to construct tree structures to represent dependencies of a key performance indicator (KPI) on process and QoS metrics. These dependency trees allow business analysts to determine how process KPIs depend on lower-level process metrics and QoS characteristics of the IT infrastructure. The structure of the dependencies enables a drill-down analysis of single factors of influence to gain a deeper knowledge why certain KPI targets are not met.

  5. 儿童情绪障碍相关因素的LOGISTIC回归分析%Logistic Regression Analysis of Related Risk Factors of Emotional Disorders in Children

    Institute of Scientific and Technical Information of China (English)

    高鸿云; 冯金英; 徐俊冕; 郑士俊

    2001-01-01

    Objective: To identify the related psychosocial risk factors of emotional disorders in children. Methods:To use case-control approach in which. Diagnosis was made by clinical interview according to ICD-10 criteria. Eighty eight cases and controls separately filled out general condition inventory. The results were put into Logistic regression model for analysis. Results: The children with timid personality, without kindergarten education, or with parents who were administrative or technical personnel, were apt to have emotional disorders. The children who were usually counseled by their mothers had less emotional disorders than those were beaten. Conclusion: The emotional disorders were the results of multiple factors. Prevention of children's emotional disorders should be focused on the children's personality and family education.

  6. A study of the dengue epidemic and meteorological factors in Guangzhou, China, by using a zero-inflated Poisson regression model.

    Science.gov (United States)

    Wang, Chenggang; Jiang, Baofa; Fan, Jingchun; Wang, Furong; Liu, Qiyong

    2014-01-01

    The aim of this study is to develop a model that correctly identifies and quantifies the relationship between dengue and meteorological factors in Guangzhou, China. By cross-correlation analysis, meteorological variables and their lag effects were determined. According to the epidemic characteristics of dengue in Guangzhou, those statistically significant variables were modeled by a zero-inflated Poisson regression model. The number of dengue cases and minimum temperature at 1-month lag, along with average relative humidity at 0- to 1-month lag were all positively correlated with the prevalence of dengue fever, whereas wind velocity and temperature in the same month along with rainfall at 2 months' lag showed negative association with dengue incidence. Minimum temperature at 1-month lag and wind velocity in the same month had a greater impact on the dengue epidemic than other variables in Guangzhou.

  7. Identifying influential factors on integrated marketing planning using information technology

    Directory of Open Access Journals (Sweden)

    Karim Hamdi

    2014-07-01

    Full Text Available This paper presents an empirical investigation to identify important factors influencing integrated marketing planning using information technology. The proposed study designs a questionnaire for measuring integrated marketing planning, which consists of three categories of structural factors, behavioral factors and background factors. There are 40 questions associated with the proposed study in Likert scale. Cronbach alphas have been calculated for structural factors, behavioral factors and background factors as 0.89, 0.86 and 0.83, respectively. Using some statistical test, the study has confirmed the effects of three factors on integrated marketing. In addition, the implementation of Freedman test has revealed that structural factors were the most important factor followed by background factors and behavioral factors.

  8. Risk Factors for Chronic Subdural Hematoma Recurrence Identified Using Quantitative Computed Tomography Analysis of Hematoma Volume and Density.

    Science.gov (United States)

    Stavrinou, Pantelis; Katsigiannis, Sotirios; Lee, Jong Hun; Hamisch, Christina; Krischek, Boris; Mpotsaris, Anastasios; Timmer, Marco; Goldbrunner, Roland

    2017-03-01

    Chronic subdural hematoma (CSDH), a common condition in elderly patients, presents a therapeutic challenge with recurrence rates of 33%. We aimed to identify specific prognostic factors for recurrence using quantitative analysis of hematoma volume and density. We retrospectively reviewed radiographic and clinical data of 227 CSDHs in 195 consecutive patients who underwent evacuation of the hematoma through a single burr hole, 2 burr holes, or a mini-craniotomy. To examine the relationship between hematoma recurrence and various clinical, radiologic, and surgical factors, we used quantitative image-based analysis to measure the hematoma and trapped air volumes and the hematoma densities. Recurrence of CSDH occurred in 35 patients (17.9%). Multivariate logistic regression analysis revealed that the percentage of hematoma drained and postoperative CSDH density were independent risk factors for recurrence. All 3 evacuation methods were equally effective in draining the hematoma (71.7% vs. 73.7% vs. 71.9%) without observable differences in postoperative air volume captured in the subdural space. Quantitative image analysis provided evidence that percentage of hematoma drained and postoperative CSDH density are independent prognostic factors for subdural hematoma recurrence. Copyright © 2016 Elsevier Inc. All rights reserved.

  9. Identifying work ability promoting factors for home care aides and assistant nurses

    Directory of Open Access Journals (Sweden)

    Larsson Agneta

    2012-01-01

    Full Text Available Abstract Background In workplace health promotion, all potential resources needs to be taken into consideration, not only factors relating to the absence of injury and the physical health of the workers, but also psychological aspects. A dynamic balance between the resources of the individual employees and the demands of work is an important prerequisite. In the home care services, there is a noticeable trend towards increased psychosocial strain on employees at work. There are a high frequency of work-related musculoskeletal disorders and injuries, and a low prevalence of sustainable work ability. The aim of this research was to identify factors promoting work ability and self-efficacy in care aides and assistant nurses within home care services. Methods This study is based on cross-sectional data collected in a municipality in northern Sweden. Care aides (n = 58 and assistant nurses (n = 79 replied to a self-administered questionnaire (response rate 46%. Hierarchical multiple regression analyses were performed to assess the influence of several independent variables on self-efficacy (model 1 and work ability (model 2 for care aides and assistant nurses separately. Results Perceptions of personal safety, self-efficacy and musculoskeletal wellbeing contributed to work ability for assistant nurses (R2adj of 0.36, p 2adj of 0.29, p = 0.001. Self-efficacy was associated with the safety climate and the physical demands of the job in both professions (R2adj of 0.24, p = 0.003 for care aides, and also by sex and age for the assistant nurses (R2adj of 0.31, p Conclusions The intermediate factors contributed differently to work ability in the two professions. Self-efficacy, personal safety and musculoskeletal wellbeing were important for the assistant nurses, while the work ability of the care aides was associated with the safety climate, but also with the non-changeable factors age and seniority. All these factors are important to acknowledge in

  10. Regression Levels of Selected Affective Factors on Science Achievement: A Structural Equation Model with TIMSS 2011 Data

    Science.gov (United States)

    Akilli, Mustafa

    2015-01-01

    The aim of this study is to demonstrate the science success regression levels of chosen emotional features of 8th grade students using Structural Equation Model. The study was conducted by the analysis of students' questionnaires and science success in TIMSS 2011 data using SEM. Initially, the factors that are thought to have an effect on science…

  11. Identifying factors affecting about outsourcing in paraclinical services

    African Journals Online (AJOL)

    Objective: Outsourcing refers to the transfer of services or functions to an outsider supplier, which controls them through a contract or cooperative. The main problem of senior managers in health organizations is determining the services which should be outsourced. The present study seeks to identify the factors that affect ...

  12. Logistic Regression Modeling of Diminishing Manufacturing Sources for Integrated Circuits

    National Research Council Canada - National Science Library

    Gravier, Michael

    1999-01-01

    .... The research identified logistic regression as a powerful tool for analysis of DMSMS and further developed twenty models attempting to identify the "best" way to model and predict DMSMS using logistic regression...

  13. Logistic regression and multiple classification analyses to explore risk factors of under-5 mortality in bangladesh

    International Nuclear Information System (INIS)

    Bhowmik, K.R.; Islam, S.

    2016-01-01

    Logistic regression (LR) analysis is the most common statistical methodology to find out the determinants of childhood mortality. However, the significant predictors cannot be ranked according to their influence on the response variable. Multiple classification (MC) analysis can be applied to identify the significant predictors with a priority index which helps to rank the predictors. The main objective of the study is to find the socio-demographic determinants of childhood mortality at neonatal, post-neonatal, and post-infant period by fitting LR model as well as to rank those through MC analysis. The study is conducted using the data of Bangladesh Demographic and Health Survey 2007 where birth and death information of children were collected from their mothers. Three dichotomous response variables are constructed from children age at death to fit the LR and MC models. Socio-economic and demographic variables significantly associated with the response variables separately are considered in LR and MC analyses. Both the LR and MC models identified the same significant predictors for specific childhood mortality. For both the neonatal and child mortality, biological factors of children, regional settings, and parents socio-economic status are found as 1st, 2nd, and 3rd significant groups of predictors respectively. Mother education and household environment are detected as major significant predictors of post-neonatal mortality. This study shows that MC analysis with or without LR analysis can be applied to detect determinants with rank which help the policy makers taking initiatives on a priority basis. (author)

  14. Identifying Motivation of the Local Governments to Improve the Sustainability Transparency

    Directory of Open Access Journals (Sweden)

    Andrés NAVARRO GALERA

    2015-06-01

    Full Text Available This paper examines the sustainability tran-sparency of governments, i.e., the disclosure of information on the sustainability of their ac-tions. To do so, we identify contributory factors to the online disclosure of environmental, social, economic and general information by local gov-ernments in Nordic countries. Linear regression analysis was used to identify factors inf uencing the online dissemination of government infor-mation on sustainability; a factor analysis, as a precursor to linear regression, led us to reduce 14 explanatory variables to four factors: f nancial risk, demography, professional qualif cations and local government resources. The results obtained show that local f -nancial priorities have a greater impact on the sustainability-related content of governmental websites than does concern for the needs of the population. Furthermore, an organization’s dis-closure of its f nancial risks, together with greater awareness of stakeholders’ demands, could pro-mote transparency in the f eld of environmental, social and economic sustainability, while local demographic characteristics could foster the publication of information on environmental sus-tainability.

  15. Identifying Dietary Patterns Associated with Mild Cognitive Impairment in Older Korean Adults Using Reduced Rank Regression

    Directory of Open Access Journals (Sweden)

    Dayeon Shin

    2018-01-01

    Full Text Available Diet plays a crucial role in cognitive function. Few studies have examined the relationship between dietary patterns and cognitive functions of older adults in the Korean population. This study aimed to identify the effect of dietary patterns on the risk of mild cognitive impairment. A total of 239 participants, including 88 men and 151 women, aged 65 years and older were selected from health centers in the district of Seoul, Gyeonggi province, and Incheon, in Korea. Dietary patterns were determined using Reduced Rank Regression (RRR methods with responses regarding vitamin B6, vitamin C, and iron intakes, based on both a one-day 24-h recall and a food frequency questionnaire. Cognitive function was assessed using the Korean-Mini Mental State Examination (K-MMSE. Multivariable logistic regression models were used to estimate the association between dietary pattern score and the risk of mild cognitive impairment. A total of 20 (8% out of the 239 participants had mild cognitive impairment. Three dietary patterns were identified: seafood and vegetables, high meat, and bread, ham, and alcohol. Among the three dietary patterns, the older adult population who adhered to the seafood and vegetables pattern, characterized by high intake of seafood, vegetables, fruits, bread, snacks, soy products, beans, chicken, pork, ham, egg, and milk had a decreased risk of mild cognitive impairment compared to those who did not (adjusted odds ratios 0.06, 95% confidence interval 0.01–0.72 after controlling for gender, supplementation, education, history of dementia, physical activity, body mass index (BMI, and duration of sleep. The other two dietary patterns were not significantly associated with the risk of mild cognitive impairment. In conclusion, high consumption of fruits, vegetables, seafood, and protein foods was significantly associated with reduced mild cognitive impairment in older Korean adults. These results can contribute to the establishment of

  16. Local bilinear multiple-output quantile/depth regression

    Czech Academy of Sciences Publication Activity Database

    Hallin, M.; Lu, Z.; Paindaveine, D.; Šiman, Miroslav

    2015-01-01

    Roč. 21, č. 3 (2015), s. 1435-1466 ISSN 1350-7265 R&D Projects: GA MŠk(CZ) 1M06047 Institutional support: RVO:67985556 Keywords : conditional depth * growth chart * halfspace depth * local bilinear regression * multivariate quantile * quantile regression * regression depth Subject RIV: BA - General Mathematics Impact factor: 1.372, year: 2015 http://library.utia.cas.cz/separaty/2015/SI/siman-0446857.pdf

  17. Risk factors for exclusive breastfeeding lasting less than two months-Identifying women in need of targeted breastfeeding support.

    Directory of Open Access Journals (Sweden)

    Karin Cato

    Full Text Available Breastfeeding rates in Sweden are declining, and it is important to identify women at risk for early cessation of exclusive breastfeeding.The aim of this study was to investigate factors associated with exclusive breastfeeding lasting less than two months postpartum.A population-based longitudinal study was conducted at Uppsala University Hospital, Sweden. Six hundred and seventy-nine women were included in this sub-study. Questionnaires were sent at five days, six weeks and six months postpartum, including questions on breastfeeding initiation and duration as well as several other background variables. The main outcome measure was exclusive breastfeeding lasting less than two months postpartum. Multivariable logistic regression analysis was used in order to calculate adjusted Odds Ratios (AOR and 95% Confidence Intervals (95% CI.Seventy-seven percent of the women reported exclusive breastfeeding at two months postpartum. The following variables in the multivariate regression analysis were independently associated with exclusive breastfeeding lasting less than two months postpartum: being a first time mother (AOR 2.15, 95% CI 1.32-3.49, reporting emotional distress during pregnancy (AOR 2.21, 95% CI 1.35-3.62 and giving birth by cesarean section (AOR 2.63, 95% CI 1.34-5.17.Factors associated with shorter exclusive breastfeeding duration were determined. Identification of women experiencing emotional distress during pregnancy, as well as scrutiny of caregiving routines on cesarean section need to be addressed, in order to give individual targeted breastfeeding support and promote longer breastfeeding duration.

  18. Identifying risk factors for first-episode neck pain: A systematic review.

    Science.gov (United States)

    Kim, Rebecca; Wiest, Colin; Clark, Kelly; Cook, Chad; Horn, Maggie

    2018-02-01

    Neck pain affects 15.1% of the United States' general population every 3 months, and ranks fourth in global disability. Because of the tendency for neck pain to become a chronic issue, it is important to identify risk factors that could encourage prevention and early diagnosis. The purpose of this systematic review was to identify risk factors for a first episode of neck pain. Three databases were searched with key words such as "neck pain" and "first incidence." Risk factors from the resulting articles were reported as either a physical or psychosocial risk factor and ranked by the strength of their odds/risk/hazard ratio: empowering leadership, high perceived social climate, leisure physical activity, and cervical extensor endurance. Most risk factors found for neck pain were related to psychosocial characteristics, rather than physical characteristics. A number of these risk factors were mediating factors, suggesting that a prevention-based program may be useful in modifying the existence of the risk factors before the occurrence of neck pain. Copyright © 2017 Elsevier Ltd. All rights reserved.

  19. Depression in Intimate Partner Violence Victims in Slovenia: A Crippling Pattern of Factors Identified in Family Practice Attendees

    Directory of Open Access Journals (Sweden)

    Nena Kopčavar Guček

    2018-01-01

    Full Text Available This multi-centre cross-sectional study explored associations between prevalence of depression and exposure to intimate partner violence (IPV at any time in patients’ adult life in 471 participants of a previous IPV study. In 2016, 174 interviews were performed, using the Short Form Domestic Violence Exposure Questionnaire, the Zung Scale and questions about behavioural patterns of exposure to IPV. Family doctors reviewed patients’ medical charts for period from 2012 to 2016, using the Domestic Violence Exposure Medical Chart Check List, for conditions which persisted for at least three years. Depression was found to be associated with any exposure to IPV in adult life and was more likely to affect women. In multivariable logistic regression modelling, factors associated with self-rated depression were identified (p < 0.05. Exposure to emotional and physical violence was identified as a risk factor in the first model, explaining 23% of the variance. The second model explained 66% of the variance; past divorce, dysfunctional family relationships and a history of incapacity to work increased the likelihood of depression in patients. Family doctors should consider IPV exposure when detecting depression, since lifetime IPV exposure was found to be 40.4% and 36.9% of depressed revealed it.

  20. Depression in Intimate Partner Violence Victims in Slovenia: A Crippling Pattern of Factors Identified in Family Practice Attendees.

    Science.gov (United States)

    Guček, Nena Kopčavar; Selič, Polona

    2018-01-26

    This multi-centre cross-sectional study explored associations between prevalence of depression and exposure to intimate partner violence (IPV) at any time in patients' adult life in 471 participants of a previous IPV study. In 2016, 174 interviews were performed, using the Short Form Domestic Violence Exposure Questionnaire, the Zung Scale and questions about behavioural patterns of exposure to IPV. Family doctors reviewed patients' medical charts for period from 2012 to 2016, using the Domestic Violence Exposure Medical Chart Check List, for conditions which persisted for at least three years. Depression was found to be associated with any exposure to IPV in adult life and was more likely to affect women. In multivariable logistic regression modelling, factors associated with self-rated depression were identified (p < 0.05). Exposure to emotional and physical violence was identified as a risk factor in the first model, explaining 23% of the variance. The second model explained 66% of the variance; past divorce, dysfunctional family relationships and a history of incapacity to work increased the likelihood of depression in patients. Family doctors should consider IPV exposure when detecting depression, since lifetime IPV exposure was found to be 40.4% and 36.9% of depressed revealed it.

  1. Prediction of radiation levels in residences: A methodological comparison of CART [Classification and Regression Tree Analysis] and conventional regression

    International Nuclear Information System (INIS)

    Janssen, I.; Stebbings, J.H.

    1990-01-01

    In environmental epidemiology, trace and toxic substance concentrations frequently have very highly skewed distributions ranging over one or more orders of magnitude, and prediction by conventional regression is often poor. Classification and Regression Tree Analysis (CART) is an alternative in such contexts. To compare the techniques, two Pennsylvania data sets and three independent variables are used: house radon progeny (RnD) and gamma levels as predicted by construction characteristics in 1330 houses; and ∼200 house radon (Rn) measurements as predicted by topographic parameters. CART may identify structural variables of interest not identified by conventional regression, and vice versa, but in general the regression models are similar. CART has major advantages in dealing with other common characteristics of environmental data sets, such as missing values, continuous variables requiring transformations, and large sets of potential independent variables. CART is most useful in the identification and screening of independent variables, greatly reducing the need for cross-tabulations and nested breakdown analyses. There is no need to discard cases with missing values for the independent variables because surrogate variables are intrinsic to CART. The tree-structured approach is also independent of the scale on which the independent variables are measured, so that transformations are unnecessary. CART identifies important interactions as well as main effects. The major advantages of CART appear to be in exploring data. Once the important variables are identified, conventional regressions seem to lead to results similar but more interpretable by most audiences. 12 refs., 8 figs., 10 tabs

  2. Sensitivity of Microstructural Factors Influencing the Impact Toughness of Hypoeutectoid Steels with Ferrite-Pearlite Structure using Multiple Regression Analysis

    International Nuclear Information System (INIS)

    Lee, Seung-Yong; Lee, Sang-In; Hwang, Byoung-chul

    2016-01-01

    In this study, the effect of microstructural factors on the impact toughness of hypoeutectoid steels with ferrite-pearlite structure was quantitatively investigated using multiple regression analysis. Microstructural analysis results showed that the pearlite fraction increased with increasing austenitizing temperature and decreasing transformation temperature which substantially decreased the pearlite interlamellar spacing and cementite thickness depending on carbon content. The impact toughness of hypoeutectoid steels usually increased as interlamellar spacing or cementite thickness decreased, although the impact toughness was largely associated with pearlite fraction. Based on these results, multiple regression analysis was performed to understand the individual effect of pearlite fraction, interlamellar spacing, and cementite thickness on the impact toughness. The regression analysis results revealed that pearlite fraction significantly affected impact toughness at room temperature, while cementite thickness did at low temperature.

  3. Meta-regression analysis of commensal and pathogenic Escherichia coli survival in soil and water.

    Science.gov (United States)

    Franz, Eelco; Schijven, Jack; de Roda Husman, Ana Maria; Blaak, Hetty

    2014-06-17

    The extent to which pathogenic and commensal E. coli (respectively PEC and CEC) can survive, and which factors predominantly determine the rate of decline, are crucial issues from a public health point of view. The goal of this study was to provide a quantitative summary of the variability in E. coli survival in soil and water over a broad range of individual studies and to identify the most important sources of variability. To that end, a meta-regression analysis on available literature data was conducted. The considerable variation in reported decline rates indicated that the persistence of E. coli is not easily predictable. The meta-analysis demonstrated that for soil and water, the type of experiment (laboratory or field), the matrix subtype (type of water and soil), and temperature were the main factors included in the regression analysis. A higher average decline rate in soil of PEC compared with CEC was observed. The regression models explained at best 57% of the variation in decline rate in soil and 41% of the variation in decline rate in water. This indicates that additional factors, not included in the current meta-regression analysis, are of importance but rarely reported. More complete reporting of experimental conditions may allow future inference on the global effects of these variables on the decline rate of E. coli.

  4. Logistic Regression Analysis on Factors Affecting Adoption of Rice-Fish Farming in North Iran

    Directory of Open Access Journals (Sweden)

    Seyyed Ali NOORHOSSEINI-NIYAKI

    2012-06-01

    Full Text Available We evaluated the factors influencing the adoption of rice-fish farming in the Tavalesh region near the Caspian Sea in northern Iran. We conducted a survey with open-ended questions. Data were collected from 184 respondents (61 adopters and 123 non-adopters randomly sampled from selected villages and analyzed using logistic regression and multi-response analysis. Family size, number of contacts with an extension agent, participation in extension-education activities, membership in social institutions and the presence of farm workers were the most important socio-economic factors for the adoption of rice-fish farming system. In addition, economic problems were the most common issue reported by adopters. Other issues such as lack of access to appropriate fish food, losses of fish, lack of access to high quality fish fingerlings and dehydration and poor water quality were also important to a number of farmers.

  5. Pre-Analytical Parameters Affecting Vascular Endothelial Growth Factor Measurement in Plasma: Identifying Confounders.

    Science.gov (United States)

    Walz, Johanna M; Boehringer, Daniel; Deissler, Heidrun L; Faerber, Lothar; Goepfert, Jens C; Heiduschka, Peter; Kleeberger, Susannah M; Klettner, Alexa; Krohne, Tim U; Schneiderhan-Marra, Nicole; Ziemssen, Focke; Stahl, Andreas

    2016-01-01

    Vascular endothelial growth factor-A (VEGF-A) is intensively investigated in various medical fields. However, comparing VEGF-A measurements is difficult because sample acquisition and pre-analytic procedures differ between studies. We therefore investigated which variables act as confounders of VEGF-A measurements. Following a standardized protocol, blood was taken at three clinical sites from six healthy participants (one male and one female participant at each center) twice one week apart. The following pre-analytical parameters were varied in order to analyze their impact on VEGF-A measurements: analyzing center, anticoagulant (EDTA vs. PECT / CTAD), cannula (butterfly vs. neonatal), type of centrifuge (swing-out vs. fixed-angle), time before and after centrifugation, filling level (completely filled vs. half-filled tubes) and analyzing method (ELISA vs. multiplex bead array). Additionally, intrapersonal variations over time and sex differences were explored. Statistical analysis was performed using a linear regression model. The following parameters were identified as statistically significant independent confounders of VEGF-A measurements: analyzing center, anticoagulant, centrifuge, analyzing method and sex of the proband. The following parameters were no significant confounders in our data set: intrapersonal variation over one week, cannula, time before and after centrifugation and filling level of collection tubes. VEGF-A measurement results can be affected significantly by the identified pre-analytical parameters. We recommend the use of CTAD anticoagulant, a standardized type of centrifuge and one central laboratory using the same analyzing method for all samples.

  6. Pre-Analytical Parameters Affecting Vascular Endothelial Growth Factor Measurement in Plasma: Identifying Confounders.

    Directory of Open Access Journals (Sweden)

    Johanna M Walz

    Full Text Available Vascular endothelial growth factor-A (VEGF-A is intensively investigated in various medical fields. However, comparing VEGF-A measurements is difficult because sample acquisition and pre-analytic procedures differ between studies. We therefore investigated which variables act as confounders of VEGF-A measurements.Following a standardized protocol, blood was taken at three clinical sites from six healthy participants (one male and one female participant at each center twice one week apart. The following pre-analytical parameters were varied in order to analyze their impact on VEGF-A measurements: analyzing center, anticoagulant (EDTA vs. PECT / CTAD, cannula (butterfly vs. neonatal, type of centrifuge (swing-out vs. fixed-angle, time before and after centrifugation, filling level (completely filled vs. half-filled tubes and analyzing method (ELISA vs. multiplex bead array. Additionally, intrapersonal variations over time and sex differences were explored. Statistical analysis was performed using a linear regression model.The following parameters were identified as statistically significant independent confounders of VEGF-A measurements: analyzing center, anticoagulant, centrifuge, analyzing method and sex of the proband. The following parameters were no significant confounders in our data set: intrapersonal variation over one week, cannula, time before and after centrifugation and filling level of collection tubes.VEGF-A measurement results can be affected significantly by the identified pre-analytical parameters. We recommend the use of CTAD anticoagulant, a standardized type of centrifuge and one central laboratory using the same analyzing method for all samples.

  7. Logistic regression analysis of the risk factors of anastomotic fistula after radical resection of esophageal‐cardiac cancer

    Science.gov (United States)

    Huang, Jinxi; Wang, Chenghu; Yuan, Weiwei; Zhang, Zhandong; Chen, Beibei; Zhang, Xiefu

    2017-01-01

    Background This study was conducted to investigate the risk factors of anastomotic fistula after the radical resection of esophageal‐cardiac cancer. Methods Five hundred and forty‐four esophageal‐cardiac cancer patients who underwent surgery and had complete clinical data were included in the study. Fifty patients diagnosed with postoperative anastomotic fistula were considered the case group and the remaining 494 subjects who did not develop postoperative anastomotic fistula were considered the control. The potential risk factors for anastomotic fistula, such as age, gender, diabetes history, smoking history, were collected and compared between the groups. Statistically significant variables were substituted into logistic regression to further evaluate the independent risk factors for postoperative anastomotic fistulas in esophageal‐cardiac cancer. Results The incidence of anastomotic fistulas was 9.2% (50/544). Logistic regression analysis revealed that female gender (P < 0.05), laparoscopic surgery (P < 0.05), decreased postoperative albumin (P < 0.05), and postoperative renal dysfunction (P < 0.05) were independent risk factors for anastomotic fistulas in patients who received surgery for esophageal‐cardiac cancer. Of the 50 anastomotic fistulas, 16 cases were small fistulas, which were only discovered by conventional imaging examination and not presenting clinical symptoms. All of the anastomotic fistulas occurred within seven days after surgery. Five of the patients with anastomotic fistulas underwent a second surgery and three died. Conclusion Female patients with esophageal‐cardiac cancer treated with endoscopic surgery and suffering from postoperative hypoproteinemia and renal dysfunction were susceptible to postoperative anastomotic fistula. PMID:28940985

  8. Using association rule mining to identify risk factors for early childhood caries.

    Science.gov (United States)

    Ivančević, Vladimir; Tušek, Ivan; Tušek, Jasmina; Knežević, Marko; Elheshk, Salaheddin; Luković, Ivan

    2015-11-01

    Early childhood caries (ECC) is a potentially severe disease affecting children all over the world. The available findings are mostly based on a logistic regression model, but data mining, in particular association rule mining, could be used to extract more information from the same data set. ECC data was collected in a cross-sectional analytical study of the 10% sample of preschool children in the South Bačka area (Vojvodina, Serbia). Association rules were extracted from the data by association rule mining. Risk factors were extracted from the highly ranked association rules. Discovered dominant risk factors include male gender, frequent breastfeeding (with other risk factors), high birth order, language, and low body weight at birth. Low health awareness of parents was significantly associated to ECC only in male children. The discovered risk factors are mostly confirmed by the literature, which corroborates the value of the methods. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  9. Management of Industrial Performance Indicators: Regression Analysis and Simulation

    Directory of Open Access Journals (Sweden)

    Walter Roberto Hernandez Vergara

    2017-11-01

    Full Text Available Stochastic methods can be used in problem solving and explanation of natural phenomena through the application of statistical procedures. The article aims to associate the regression analysis and systems simulation, in order to facilitate the practical understanding of data analysis. The algorithms were developed in Microsoft Office Excel software, using statistical techniques such as regression theory, ANOVA and Cholesky Factorization, which made it possible to create models of single and multiple systems with up to five independent variables. For the analysis of these models, the Monte Carlo simulation and analysis of industrial performance indicators were used, resulting in numerical indices that aim to improve the goals’ management for compliance indicators, by identifying systems’ instability, correlation and anomalies. The analytical models presented in the survey indicated satisfactory results with numerous possibilities for industrial and academic applications, as well as the potential for deployment in new analytical techniques.

  10. The mechanical properties of high speed GTAW weld and factors of nonlinear multiple regression model under external transverse magnetic field

    Science.gov (United States)

    Lu, Lin; Chang, Yunlong; Li, Yingmin; He, Youyou

    2013-05-01

    A transverse magnetic field was introduced to the arc plasma in the process of welding stainless steel tubes by high-speed Tungsten Inert Gas Arc Welding (TIG for short) without filler wire. The influence of external magnetic field on welding quality was investigated. 9 sets of parameters were designed by the means of orthogonal experiment. The welding joint tensile strength and form factor of weld were regarded as the main standards of welding quality. A binary quadratic nonlinear regression equation was established with the conditions of magnetic induction and flow rate of Ar gas. The residual standard deviation was calculated to adjust the accuracy of regression model. The results showed that, the regression model was correct and effective in calculating the tensile strength and aspect ratio of weld. Two 3D regression models were designed respectively, and then the impact law of magnetic induction on welding quality was researched.

  11. Identifying factors affecting destination choice of medical tourists: a ...

    African Journals Online (AJOL)

    medical tourism”, has emerged as a new source of competitive advantage all over the world. The present study seeks to identify the factors that affect destination choice of medical tourists. Methods: We systematically searched relevant databases ...

  12. Multicollinearity and Regression Analysis

    Science.gov (United States)

    Daoud, Jamal I.

    2017-12-01

    In regression analysis it is obvious to have a correlation between the response and predictor(s), but having correlation among predictors is something undesired. The number of predictors included in the regression model depends on many factors among which, historical data, experience, etc. At the end selection of most important predictors is something objective due to the researcher. Multicollinearity is a phenomena when two or more predictors are correlated, if this happens, the standard error of the coefficients will increase [8]. Increased standard errors means that the coefficients for some or all independent variables may be found to be significantly different from In other words, by overinflating the standard errors, multicollinearity makes some variables statistically insignificant when they should be significant. In this paper we focus on the multicollinearity, reasons and consequences on the reliability of the regression model.

  13. Computing multiple-output regression quantile regions

    Czech Academy of Sciences Publication Activity Database

    Paindaveine, D.; Šiman, Miroslav

    2012-01-01

    Roč. 56, č. 4 (2012), s. 840-853 ISSN 0167-9473 R&D Projects: GA MŠk(CZ) 1M06047 Institutional research plan: CEZ:AV0Z10750506 Keywords : halfspace depth * multiple-output regression * parametric linear programming * quantile regression Subject RIV: BA - General Mathematics Impact factor: 1.304, year: 2012 http://library.utia.cas.cz/separaty/2012/SI/siman-0376413.pdf

  14. Tumor regression patterns in retinoblastoma

    International Nuclear Information System (INIS)

    Zafar, S.N.; Siddique, S.N.; Zaheer, N.

    2016-01-01

    To observe the types of tumor regression after treatment, and identify the common pattern of regression in our patients. Study Design: Descriptive study. Place and Duration of Study: Department of Pediatric Ophthalmology and Strabismus, Al-Shifa Trust Eye Hospital, Rawalpindi, Pakistan, from October 2011 to October 2014. Methodology: Children with unilateral and bilateral retinoblastoma were included in the study. Patients were referred to Pakistan Institute of Medical Sciences, Islamabad, for chemotherapy. After every cycle of chemotherapy, dilated funds examination under anesthesia was performed to record response of the treatment. Regression patterns were recorded on RetCam II. Results: Seventy-four tumors were included in the study. Out of 74 tumors, 3 were ICRB group A tumors, 43 were ICRB group B tumors, 14 tumors belonged to ICRB group C, and remaining 14 were ICRB group D tumors. Type IV regression was seen in 39.1% (n=29) tumors, type II in 29.7% (n=22), type III in 25.6% (n=19), and type I in 5.4% (n=4). All group A tumors (100%) showed type IV regression. Seventeen (39.5%) group B tumors showed type IV regression. In group C, 5 tumors (35.7%) showed type II regression and 5 tumors (35.7%) showed type IV regression. In group D, 6 tumors (42.9%) regressed to type II non-calcified remnants. Conclusion: The response and success of the focal and systemic treatment, as judged by the appearance of different patterns of tumor regression, varies with the ICRB grouping of the tumor. (author)

  15. Repeated Results Analysis for Middleware Regression Benchmarking

    Czech Academy of Sciences Publication Activity Database

    Bulej, Lubomír; Kalibera, T.; Tůma, P.

    2005-01-01

    Roč. 60, - (2005), s. 345-358 ISSN 0166-5316 R&D Projects: GA ČR GA102/03/0672 Institutional research plan: CEZ:AV0Z10300504 Keywords : middleware benchmarking * regression benchmarking * regression testing Subject RIV: JD - Computer Applications, Robotics Impact factor: 0.756, year: 2005

  16. Identifying factors affecting the safety of mid-block bicycle lanes considering mixed 2-wheeled traffic flow.

    Science.gov (United States)

    Bai, Lu; Chan, Ching-Yao; Liu, Pan; Xu, Chengcheng

    2017-10-03

    Electric bikes (e-bikes) have been one of the fastest growing trip modes in Southeast Asia over the past 2 decades. The increasing popularity of e-bikes raised some safety concerns regarding urban transport systems. The primary objective of this study was to identify whether and how the generalized linear regression model (GLM) could be used to relate cyclists' safety with various contributing factors when riding in a mid-block bike lane. The types of 2-wheeled vehicles in the study included bicycle-style electric bicycles (BSEBs), scooter-style electric bicycles (SSEBs), and regular bicycles (RBs). Traffic conflict technology was applied as a surrogate measure to evaluate the safety of 2-wheeled vehicles. The safety performance model was developed by adopting a generalized linear regression model for relating the frequency of rear-end conflicts between e-bikes and regular bikes to the operating speeds of BSEBs, SSEBs, and RBs in mid-block bike lanes. The frequency of rear-end conflicts between e-bikes and bikes increased with an increase in the operating speeds of e-bikes and the volume of e-bikes and bikes and decreased with an increase in the width of bike lanes. The large speed difference between e-bikes and bikes increased the frequency of rear-end conflicts between e-bikes and bikes in mid-block bike lanes. A 1% increase in the average operating speed of e-bikes would increase the expected number of rear-end conflicts between e-bikes and bikes by 1.48%. A 1% increase in the speed difference between e-bikes and bikes would increase the expected number of rear-end conflicts between e-bikes/bikes by 0.16%. The conflict frequency in mid-block bike lanes can be modeled using generalized linear regression models. The factors that significantly affected the frequency of rear-end conflicts included the operating speeds of e-bikes, the speed difference between e-bikes and regular bikes, the volume of e-bikes, the volume of bikes, and the width of bike lanes. The

  17. Alternative regression models to assess increase in childhood BMI.

    Science.gov (United States)

    Beyerlein, Andreas; Fahrmeir, Ludwig; Mansmann, Ulrich; Toschke, André M

    2008-09-08

    Body mass index (BMI) data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations. Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs), quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS). We analyzed data of 4967 children participating in the school entry health examination in Bavaria, Germany, from 2001 to 2002. TV watching, meal frequency, breastfeeding, smoking in pregnancy, maternal obesity, parental social class and weight gain in the first 2 years of life were considered as risk factors for obesity. GAMLSS showed a much better fit regarding the estimation of risk factors effects on transformed and untransformed BMI data than common GLMs with respect to the generalized Akaike information criterion. In comparison with GAMLSS, quantile regression allowed for additional interpretation of prespecified distribution quantiles, such as quantiles referring to overweight or obesity. The variables TV watching, maternal BMI and weight gain in the first 2 years were directly, and meal frequency was inversely significantly associated with body composition in any model type examined. In contrast, smoking in pregnancy was not directly, and breastfeeding and parental social class were not inversely significantly associated with body composition in GLM models, but in GAMLSS and partly in quantile regression models. Risk factor specific BMI percentile curves could be estimated from GAMLSS and quantile regression models. GAMLSS and quantile regression seem to be more appropriate than common GLMs for risk factor modeling of BMI data.

  18. A logistic regression analysis of factors related to the treatment compliance of infertile patients with polycystic ovary syndrome.

    Science.gov (United States)

    Li, Saijiao; He, Aiyan; Yang, Jing; Yin, TaiLang; Xu, Wangming

    2011-01-01

    To investigate factors that can affect compliance with treatment of polycystic ovary syndrome (PCOS) in infertile patients and to provide a basis for clinical treatment, specialist consultation and health education. Patient compliance was assessed via a questionnaire based on the Morisky-Green test and the treatment principles of PCOS. Then interviews were conducted with 99 infertile patients diagnosed with PCOS at Renmin Hospital of Wuhan University in China, from March to September 2009. Finally, these data were analyzed using logistic regression analysis. Logistic regression analysis revealed that a total of 23 (25.6%) of the participants showed good compliance. Factors that significantly (p < 0.05) affected compliance with treatment were the patient's body mass index, convenience of medical treatment and concerns about adverse drug reactions. Patients who are obese, experience inconvenient medical treatment or are concerned about adverse drug reactions are more likely to exhibit noncompliance. Treatment education and intervention aimed at these patients should be strengthened in the clinic to improve treatment compliance. Further research is needed to better elucidate the compliance behavior of patients with PCOS.

  19. Stepwise versus Hierarchical Regression: Pros and Cons

    Science.gov (United States)

    Lewis, Mitzi

    2007-01-01

    Multiple regression is commonly used in social and behavioral data analysis. In multiple regression contexts, researchers are very often interested in determining the "best" predictors in the analysis. This focus may stem from a need to identify those predictors that are supportive of theory. Alternatively, the researcher may simply be interested…

  20. Predicting attention-deficit/hyperactivity disorder severity from psychosocial stress and stress-response genes : A random forest regression approach

    NARCIS (Netherlands)

    Van Der Meer, D.; Hoekstra, P. J.; Van Donkelaar, M.; Bralten, J.; Oosterlaan, J.; Heslenfeld, D.; Faraone, S. V.; Franke, B.; Buitelaar, J. K.; Hartman, C. A.

    2017-01-01

    Identifying genetic variants contributing to attention-deficit/hyperactivity disorder (ADHD) is complicated by the involvement of numerous common genetic variants with small effects, interacting with each other as well as with environmental factors, such as stress exposure. Random forest regression

  1. Predicting attention-deficit/hyperactivity disorder severity from psychosocial stress and stress-response genes : a random forest regression approach

    NARCIS (Netherlands)

    van der Meer, D.; Hoekstra, P. J.; van Donkelaar, Marjolein M. J.; Bralten, Janita; Oosterlaan, J; Heslenfeld, Dirk J.; Faraone, S. V.; Franke, B.; Buitelaar, J. K.; Hartman, C. A.

    2017-01-01

    Identifying genetic variants contributing to attention-deficit/hyperactivity disorder (ADHD) is complicated by the involvement of numerous common genetic variants with small effects, interacting with each other as well as with environmental factors, such as stress exposure. Random forest regression

  2. Logistic regression analysis of prognostic factors in 106 acute-on-chronic liver failure patients with hepatic encephalopathy

    Directory of Open Access Journals (Sweden)

    CUI Yanping

    2014-10-01

    Full Text Available ObjectiveTo analyze the prognostic factors in acute-on-chronic liver failure (ACLF patients with hepatic encephalopathy (HE and to explore the risk factors for prognosis. MethodsA retrospective analysis was performed on 106 ACLF patients with HE who were hospitalized in our hospital from January 2010 to July 2013. The patients were divided into improved group and deteriorated group. The univariate indicators including age, sex, laboratory indicators [total bilirubin (TBil, albumin (Alb, alanine aminotransferase (ALT, aspartate amino-transferase (AST, and prothrombin time activity (PTA], the stage of HE, complications [persistent hyponatremia, digestive tract bleeding, hepatorenal syndrome (HRS, ascites, infection, and spontaneous bacterial peritonitis (SBP], and plasma exchange were analyzed by chi-square test or t-test. Indicators with statistical significance were subsequently analyzed by binary logistic regression. ResultsUnivariate analysis showed that ALT (P=0.009, PTA (P=0.043, the stage of HE (P=0.000, and HRS (P=0.003 were significantly different between the two groups, whereas differences in age, sex, TBil, Alb, AST, persistent hyponatremia, digestive tract bleeding, ascites, infection, SBP, and plasma exchange were not statistically significant (P>0.05. Binary logistic regression demonstrated that PTA (b=-0097, P=0.025, OR=0.908, HRS (b=2.279, P=0.007, OR=9.764, and the stage of HE (b=1873, P=0.000, OR=6.510 were prognostic factors in ACLF patients with HE. ConclusionThe stage of HE, HRS, and PTA are independent influential factors for the prognosis in ACLF patients with HE. Reduced PTA, advanced HE stage, and the presence of HRS indicate worse prognosis.

  3. Identifying individual changes in performance with composite quality indicators while accounting for regression to the mean.

    Science.gov (United States)

    Gajewski, Byron J; Dunton, Nancy

    2013-04-01

    Almost a decade ago Morton and Torgerson indicated that perceived medical benefits could be due to "regression to the mean." Despite this caution, the regression to the mean "effects on the identification of changes in institutional performance do not seem to have been considered previously in any depth" (Jones and Spiegelhalter). As a response, Jones and Spiegelhalter provide a methodology to adjust for regression to the mean when modeling recent changes in institutional performance for one-variable quality indicators. Therefore, in our view, Jones and Spiegelhalter provide a breakthrough methodology for performance measures. At the same time, in the interests of parsimony, it is useful to aggregate individual quality indicators into a composite score. Our question is, can we develop and demonstrate a methodology that extends the "regression to the mean" literature to allow for composite quality indicators? Using a latent variable modeling approach, we extend the methodology to the composite indicator case. We demonstrate the approach on 4 indicators collected by the National Database of Nursing Quality Indicators. A simulation study further demonstrates its "proof of concept."

  4. Estimation of a Reactor Core Power Peaking Factor Using Support Vector Regression and Uncertainty Analysis

    International Nuclear Information System (INIS)

    Bae, In Ho; Naa, Man Gyun; Lee, Yoon Joon; Park, Goon Cherl

    2009-01-01

    The monitoring of detailed 3-dimensional (3D) reactor core power distribution is a prerequisite in the operation of nuclear power reactors to ensure that various safety limits imposed on the LPD and DNBR, are not violated during nuclear power reactor operation. The LPD and DNBR should be calculated in order to perform the two major functions of the core protection calculator system (CPCS) and the core operation limit supervisory system (COLSS). The LPD at the hottest part of a hot fuel rod, which is related to the power peaking factor (PPF, F q ), is more important than the LPD at any other position in a reactor core. The LPD needs to be estimated accurately to prevent nuclear fuel rods from melting. In this study, support vector regression (SVR) and uncertainty analysis have been applied to estimation of reactor core power peaking factor

  5. SU-E-J-212: Identifying Bones From MRI: A Dictionary Learnign and Sparse Regression Approach

    International Nuclear Information System (INIS)

    Ruan, D; Yang, Y; Cao, M; Hu, P; Low, D

    2014-01-01

    Purpose: To develop an efficient and robust scheme to identify bony anatomy based on MRI-only simulation images. Methods: MRI offers important soft tissue contrast and functional information, yet its lack of correlation to electron-density has placed it as an auxiliary modality to CT in radiotherapy simulation and adaptation. An effective scheme to identify bony anatomy is an important first step towards MR-only simulation/treatment paradigm and would satisfy most practical purposes. We utilize a UTE acquisition sequence to achieve visibility of the bone. By contrast to manual + bulk or registration-to identify bones, we propose a novel learning-based approach for improved robustness to MR artefacts and environmental changes. Specifically, local information is encoded with MR image patch, and the corresponding label is extracted (during training) from simulation CT aligned to the UTE. Within each class (bone vs. nonbone), an overcomplete dictionary is learned so that typical patches within the proper class can be represented as a sparse combination of the dictionary entries. For testing, an acquired UTE-MRI is divided to patches using a sliding scheme, where each patch is sparsely regressed against both bone and nonbone dictionaries, and subsequently claimed to be associated with the class with the smaller residual. Results: The proposed method has been applied to the pilot site of brain imaging and it has showed general good performance, with dice similarity coefficient of greater than 0.9 in a crossvalidation study using 4 datasets. Importantly, it is robust towards consistent foreign objects (e.g., headset) and the artefacts relates to Gibbs and field heterogeneity. Conclusion: A learning perspective has been developed for inferring bone structures based on UTE MRI. The imaging setting is subject to minimal motion effects and the post-processing is efficient. The improved efficiency and robustness enables a first translation to MR-only routine. The scheme

  6. SU-E-J-212: Identifying Bones From MRI: A Dictionary Learnign and Sparse Regression Approach

    Energy Technology Data Exchange (ETDEWEB)

    Ruan, D; Yang, Y; Cao, M; Hu, P; Low, D [UCLA, Los Angeles, CA (United States)

    2014-06-01

    Purpose: To develop an efficient and robust scheme to identify bony anatomy based on MRI-only simulation images. Methods: MRI offers important soft tissue contrast and functional information, yet its lack of correlation to electron-density has placed it as an auxiliary modality to CT in radiotherapy simulation and adaptation. An effective scheme to identify bony anatomy is an important first step towards MR-only simulation/treatment paradigm and would satisfy most practical purposes. We utilize a UTE acquisition sequence to achieve visibility of the bone. By contrast to manual + bulk or registration-to identify bones, we propose a novel learning-based approach for improved robustness to MR artefacts and environmental changes. Specifically, local information is encoded with MR image patch, and the corresponding label is extracted (during training) from simulation CT aligned to the UTE. Within each class (bone vs. nonbone), an overcomplete dictionary is learned so that typical patches within the proper class can be represented as a sparse combination of the dictionary entries. For testing, an acquired UTE-MRI is divided to patches using a sliding scheme, where each patch is sparsely regressed against both bone and nonbone dictionaries, and subsequently claimed to be associated with the class with the smaller residual. Results: The proposed method has been applied to the pilot site of brain imaging and it has showed general good performance, with dice similarity coefficient of greater than 0.9 in a crossvalidation study using 4 datasets. Importantly, it is robust towards consistent foreign objects (e.g., headset) and the artefacts relates to Gibbs and field heterogeneity. Conclusion: A learning perspective has been developed for inferring bone structures based on UTE MRI. The imaging setting is subject to minimal motion effects and the post-processing is efficient. The improved efficiency and robustness enables a first translation to MR-only routine. The scheme

  7. A gentle introduction to quantile regression for ecologists

    Science.gov (United States)

    Cade, B.S.; Noon, B.R.

    2003-01-01

    Quantile regression is a way to estimate the conditional quantiles of a response variable distribution in the linear model that provides a more complete view of possible causal relationships between variables in ecological processes. Typically, all the factors that affect ecological processes are not measured and included in the statistical models used to investigate relationships between variables associated with those processes. As a consequence, there may be a weak or no predictive relationship between the mean of the response variable (y) distribution and the measured predictive factors (X). Yet there may be stronger, useful predictive relationships with other parts of the response variable distribution. This primer relates quantile regression estimates to prediction intervals in parametric error distribution regression models (eg least squares), and discusses the ordering characteristics, interval nature, sampling variation, weighting, and interpretation of the estimates for homogeneous and heterogeneous regression models.

  8. Alternative regression models to assess increase in childhood BMI

    Directory of Open Access Journals (Sweden)

    Mansmann Ulrich

    2008-09-01

    Full Text Available Abstract Background Body mass index (BMI data usually have skewed distributions, for which common statistical modeling approaches such as simple linear or logistic regression have limitations. Methods Different regression approaches to predict childhood BMI by goodness-of-fit measures and means of interpretation were compared including generalized linear models (GLMs, quantile regression and Generalized Additive Models for Location, Scale and Shape (GAMLSS. We analyzed data of 4967 children participating in the school entry health examination in Bavaria, Germany, from 2001 to 2002. TV watching, meal frequency, breastfeeding, smoking in pregnancy, maternal obesity, parental social class and weight gain in the first 2 years of life were considered as risk factors for obesity. Results GAMLSS showed a much better fit regarding the estimation of risk factors effects on transformed and untransformed BMI data than common GLMs with respect to the generalized Akaike information criterion. In comparison with GAMLSS, quantile regression allowed for additional interpretation of prespecified distribution quantiles, such as quantiles referring to overweight or obesity. The variables TV watching, maternal BMI and weight gain in the first 2 years were directly, and meal frequency was inversely significantly associated with body composition in any model type examined. In contrast, smoking in pregnancy was not directly, and breastfeeding and parental social class were not inversely significantly associated with body composition in GLM models, but in GAMLSS and partly in quantile regression models. Risk factor specific BMI percentile curves could be estimated from GAMLSS and quantile regression models. Conclusion GAMLSS and quantile regression seem to be more appropriate than common GLMs for risk factor modeling of BMI data.

  9. Multicollinearity in Regression Analyses Conducted in Epidemiologic Studies.

    Science.gov (United States)

    Vatcheva, Kristina P; Lee, MinJae; McCormick, Joseph B; Rahbar, Mohammad H

    2016-04-01

    The adverse impact of ignoring multicollinearity on findings and data interpretation in regression analysis is very well documented in the statistical literature. The failure to identify and report multicollinearity could result in misleading interpretations of the results. A review of epidemiological literature in PubMed from January 2004 to December 2013, illustrated the need for a greater attention to identifying and minimizing the effect of multicollinearity in analysis of data from epidemiologic studies. We used simulated datasets and real life data from the Cameron County Hispanic Cohort to demonstrate the adverse effects of multicollinearity in the regression analysis and encourage researchers to consider the diagnostic for multicollinearity as one of the steps in regression analysis.

  10. Assessing vulnerability to drought: identifying underlying factors across Europe

    Science.gov (United States)

    Urquijo, Julia; Gonzalez Tánago, Itziar; Ballesteros, Mario; De Stefano, Lucia

    2015-04-01

    Drought is considered one of the most severe and damaging natural hazards in terms of people and sectors affected and associated losses. Drought is a normal and recurrent climatic phenomenon that occurs worldwide, although its spatial and temporal characteristics vary significantly among climates. In the case of Europe, in the last thirty years, the region has suffered several drought events that have caused estimated economic damages over a €100 billion and have affected almost 20% of its territory and population. In recent years, there has been a growing awareness among experts and authorities of the need to shift from a reactive crisis approach to a drought risk management approach, as well as of the importance of designing and implementing policies, strategies and plans at country and river basin levels to deal with drought. The identification of whom and what is vulnerable to drought is a central aspect of drought risk mitigation and planning and several authors agree that societal vulnerability often determines drought risk more than the actual precipitation shortfalls. The final aim of a drought vulnerability assessment is to identify the underlying sources of drought impact, in order to develop policy options that help to enhance coping capacity and therefore to prevent drought impact. This study identifies and maps factors underlying vulnerability to drought across Europe. The identification of factors influencing vulnerability starts from the analysis of past drought impacts in four European socioeconomic sectors. This analysis, along with an extensive literature review, led to the selection of vulnerability factors that are both relevant and adequate for the European context. Adopting the IPCC model, vulnerability factors were grouped to describe exposure, sensitivity and adaptive capacity. The aggregation of these components has resulted in the mapping of vulnerability to drought across Europe at NUTS02 level. Final results have been compared with

  11. A multiple linear regression analysis of factors affecting the simulated Basic Life Support (BLS) performance with Automated External Defibrillator (AED) in Flemish lifeguards.

    Science.gov (United States)

    Iserbyt, Peter; Schouppe, Gilles; Charlier, Nathalie

    2015-04-01

    Research investigating lifeguards' performance of Basic Life Support (BLS) with Automated External Defibrillator (AED) is limited. Assessing simulated BLS/AED performance in Flemish lifeguards and identifying factors affecting this performance. Six hundred and sixteen (217 female and 399 male) certified Flemish lifeguards (aged 16-71 years) performed BLS with an AED on a Laerdal ResusciAnne manikin simulating an adult victim of drowning. Stepwise multiple linear regression analysis was conducted with BLS/AED performance as outcome variable and demographic data as explanatory variables. Mean BLS/AED performance for all lifeguards was 66.5%. Compression rate and depth adhered closely to ERC 2010 guidelines. Ventilation volume and flow rate exceeded the guidelines. A significant regression model, F(6, 415)=25.61, p<.001, ES=.38, explained 27% of the variance in BLS performance (R2=.27). Significant predictors were age (beta=-.31, p<.001), years of certification (beta=-.41, p<.001), time on duty per year (beta=-.25, p<.001), practising BLS skills (beta=.11, p=.011), and being a professional lifeguard (beta=-.13, p=.029). 71% of lifeguards reported not practising BLS/AED. Being young, recently certified, few days of employment per year, practising BLS skills and not being a professional lifeguard are factors associated with higher BLS/AED performance. Measures should be taken to prevent BLS/AED performances from decaying with age and longer certification. Refresher courses could include a formal skills test and lifeguards should be encouraged to practise their BLS/AED skills. Copyright © 2015 Elsevier Ireland Ltd. All rights reserved.

  12. Identify the Important Decision Factors of Online Shopping Adoption in Indonesia

    Directory of Open Access Journals (Sweden)

    Lailatul HIJRAH

    2017-12-01

    Full Text Available The objective of this study is to identify factors encouraging a consumer to engage in online shopping activities. The expected contribution of this study is for online entrepreneurs, in order to develop the most suitable business strategy, so that it will be clearly identified and sorted out which factors are the most important and the main motivation of Indonesian consumers to shop via online by using responses from respondents who usually shop online and offline in 3 cities in Indonesia, Jakarta, Surabaya and Samarinda. The research instruments were developed by conducting FGDs on relevant groups, either academics, online shopping activists, suppliers and courier businessmen in Jakarta, Surabaya and Samarinda Cities in effort to extract any information that encourages consumers to online shopping. After conducting FGD, the researcher produced 48 items proposed for factor analysis and after extracted to form eleven constructs, some items were removed because they had less loading factors. The eleven constructs or dimensions are trust, risk, consumer factors, website factors, price, service quality, convenience, subjective norm, product guarantee, variety of products and lifestyle. The implications of this study provide valuable insights about consumer decisions to online shopping or not online shopping.

  13. Changes of platelet GMP-140 in diabetic nephropathy and its multi-factor regression analysis

    International Nuclear Information System (INIS)

    Wang Zizheng; Du Tongxin; Wang Shukui

    2001-01-01

    The relation of platelet GMP-140 and its related factors with diabetic nephropathy was studied. 144 patients of diabetic mellitus without nephropathy (group without DN, mean suffering duration of 25.5 +- 18.6 months); 80 with diabetic nephropathy (group DN, mean suffering duration of 58.7 +- 31.6 months) and 50 normal controls were chosen in the research. Platelet GMP-140, plasma α 1 -MG, β 2 -MG, and 24 hour urine albumin (ALB), IgG, α 1 -MG, β 2 -MG were detected by RIA, while HBA 1 C via chromatographic separation and FBG, PBG, Ch, TG, HDL, FG via biochemical methods. All the data had been processed with software on computer with t-test and linear regression, and multi-factor analysis were done also. The levels of platelet GMP-140, FG, DBP, TG, HBA 1 C and PBG in group DN were significantly higher than those of group without DN and normal control (P 0.05), while they were higher than those of normal controls. Multi-factor analysis of platelet GMP-140 with TG, DBP and HBA 1 C were performed in 80 patients with DN (P 1 C are the independent factors enhancing the activation of platelets. The disturbance of lipid metabolism in type II diabetic mellitus may also enhance the activation of platelets. Elevation of blood pressure may accelerate the initiation and deterioration of DN in which change of platelet GMP-140 is an independent factor. Elevation of HBA 1 C and blood glucose are related closely to the diabetic nephropathy

  14. Using multiobjective tradeoff sets and Multivariate Regression Trees to identify critical and robust decisions for long term water utility planning

    Science.gov (United States)

    Smith, R.; Kasprzyk, J. R.; Balaji, R.

    2017-12-01

    In light of deeply uncertain factors like future climate change and population shifts, responsible resource management will require new types of information and strategies. For water utilities, this entails potential expansion and efficient management of water supply infrastructure systems for changes in overall supply; changes in frequency and severity of climate extremes such as droughts and floods; and variable demands, all while accounting for conflicting long and short term performance objectives. Multiobjective Evolutionary Algorithms (MOEAs) are emerging decision support tools that have been used by researchers and, more recently, water utilities to efficiently generate and evaluate thousands of planning portfolios. The tradeoffs between conflicting objectives are explored in an automated way to produce (often large) suites of portfolios that strike different balances of performance. Once generated, the sets of optimized portfolios are used to support relatively subjective assertions of priorities and human reasoning, leading to adoption of a plan. These large tradeoff sets contain information about complex relationships between decisions and between groups of decisions and performance that, until now, has not been quantitatively described. We present a novel use of Multivariate Regression Trees (MRTs) to analyze tradeoff sets to reveal these relationships and critical decisions. Additionally, when MRTs are applied to tradeoff sets developed for different realizations of an uncertain future, they can identify decisions that are robust across a wide range of conditions and produce fundamental insights about the system being optimized.

  15. Animal models of maternal high fat diet exposure and effects on metabolism in offspring: a meta-regression analysis.

    Science.gov (United States)

    Ribaroff, G A; Wastnedge, E; Drake, A J; Sharpe, R M; Chambers, T J G

    2017-06-01

    Animal models of maternal high fat diet (HFD) demonstrate perturbed offspring metabolism although the effects differ markedly between models. We assessed studies investigating metabolic parameters in the offspring of HFD fed mothers to identify factors explaining these inter-study differences. A total of 171 papers were identified, which provided data from 6047 offspring. Data were extracted regarding body weight, adiposity, glucose homeostasis and lipidaemia. Information regarding the macronutrient content of diet, species, time point of exposure and gestational weight gain were collected and utilized in meta-regression models to explore predictive factors. Publication bias was assessed using Egger's regression test. Maternal HFD exposure did not affect offspring birthweight but increased weaning weight, final bodyweight, adiposity, triglyceridaemia, cholesterolaemia and insulinaemia in both female and male offspring. Hyperglycaemia was found in female offspring only. Meta-regression analysis identified lactational HFD exposure as a key moderator. The fat content of the diet did not correlate with any outcomes. There was evidence of significant publication bias for all outcomes except birthweight. Maternal HFD exposure was associated with perturbed metabolism in offspring but between studies was not accounted for by dietary constituents, species, strain or maternal gestational weight gain. Specific weaknesses in experimental design predispose many of the results to bias. © 2017 The Authors. Obesity Reviews published by John Wiley & Sons Ltd on behalf of World Obesity Federation.

  16. An investigation of the speeding-related crash designation through crash narrative reviews sampled via logistic regression.

    Science.gov (United States)

    Fitzpatrick, Cole D; Rakasi, Saritha; Knodler, Michael A

    2017-01-01

    Speed is one of the most important factors in traffic safety as higher speeds are linked to increased crash risk and higher injury severities. Nearly a third of fatal crashes in the United States are designated as "speeding-related", which is defined as either "the driver behavior of exceeding the posted speed limit or driving too fast for conditions." While many studies have utilized the speeding-related designation in safety analyses, no studies have examined the underlying accuracy of this designation. Herein, we investigate the speeding-related crash designation through the development of a series of logistic regression models that were derived from the established speeding-related crash typologies and validated using a blind review, by multiple researchers, of 604 crash narratives. The developed logistic regression model accurately identified crashes which were not originally designated as speeding-related but had crash narratives that suggested speeding as a causative factor. Only 53.4% of crashes designated as speeding-related contained narratives which described speeding as a causative factor. Further investigation of these crashes revealed that the driver contributing code (DCC) of "driving too fast for conditions" was being used in three separate situations. Additionally, this DCC was also incorrectly used when "exceeding the posted speed limit" would likely have been a more appropriate designation. Finally, it was determined that the responding officer only utilized one DCC in 82% of crashes not designated as speeding-related but contained a narrative indicating speed as a contributing causal factor. The use of logistic regression models based upon speeding-related crash typologies offers a promising method by which all possible speeding-related crashes could be identified. Published by Elsevier Ltd.

  17. Identifying Nonprovider Factors Affecting Pediatric Emergency Medicine Provider Efficiency.

    Science.gov (United States)

    Saleh, Fareed; Breslin, Kristen; Mullan, Paul C; Tillett, Zachary; Chamberlain, James M

    2017-10-31

    The aim of this study was to create a multivariable model of standardized relative value units per hour by adjusting for nonprovider factors that influence efficiency. We obtained productivity data based on billing records measured in emergency relative value units for (1) both evaluation and management of visits and (2) procedures for 16 pediatric emergency medicine providers with more than 750 hours worked per year. Eligible shifts were in an urban, academic pediatric emergency department (ED) with 2 sites: a tertiary care main campus and a satellite community site. We used multivariable linear regression to adjust for the impact of shift and pediatric ED characteristics on individual-provider efficiency and then removed variables from the model with minimal effect on productivity. There were 2998 eligible shifts for the 16 providers during a 3-year period. The resulting model included 4 variables when looking at both ED sites combined. These variables include the following: (1) number of procedures billed by provider, (2) season of the year, (3) shift start time, and (4) day of week. Results were improved when we separately modeled each ED location. A 3-variable model using procedures billed by provider, shift start time, and season explained 23% of the variation in provider efficiency at the academic ED site. A 3-variable model using procedures billed by provider, patient arrivals per hour, and shift start time explained 45% of the variation in provider efficiency at the satellite ED site. Several nonprovider factors affect provider efficiency. These factors should be considered when designing productivity-based incentives.

  18. Retro-regression--another important multivariate regression improvement.

    Science.gov (United States)

    Randić, M

    2001-01-01

    We review the serious problem associated with instabilities of the coefficients of regression equations, referred to as the MRA (multivariate regression analysis) "nightmare of the first kind". This is manifested when in a stepwise regression a descriptor is included or excluded from a regression. The consequence is an unpredictable change of the coefficients of the descriptors that remain in the regression equation. We follow with consideration of an even more serious problem, referred to as the MRA "nightmare of the second kind", arising when optimal descriptors are selected from a large pool of descriptors. This process typically causes at different steps of the stepwise regression a replacement of several previously used descriptors by new ones. We describe a procedure that resolves these difficulties. The approach is illustrated on boiling points of nonanes which are considered (1) by using an ordered connectivity basis; (2) by using an ordering resulting from application of greedy algorithm; and (3) by using an ordering derived from an exhaustive search for optimal descriptors. A novel variant of multiple regression analysis, called retro-regression (RR), is outlined showing how it resolves the ambiguities associated with both "nightmares" of the first and the second kind of MRA.

  19. Modified Regression Correlation Coefficient for Poisson Regression Model

    Science.gov (United States)

    Kaengthong, Nattacha; Domthong, Uthumporn

    2017-09-01

    This study gives attention to indicators in predictive power of the Generalized Linear Model (GLM) which are widely used; however, often having some restrictions. We are interested in regression correlation coefficient for a Poisson regression model. This is a measure of predictive power, and defined by the relationship between the dependent variable (Y) and the expected value of the dependent variable given the independent variables [E(Y|X)] for the Poisson regression model. The dependent variable is distributed as Poisson. The purpose of this research was modifying regression correlation coefficient for Poisson regression model. We also compare the proposed modified regression correlation coefficient with the traditional regression correlation coefficient in the case of two or more independent variables, and having multicollinearity in independent variables. The result shows that the proposed regression correlation coefficient is better than the traditional regression correlation coefficient based on Bias and the Root Mean Square Error (RMSE).

  20. Mechanisms of neuroblastoma regression

    Science.gov (United States)

    Brodeur, Garrett M.; Bagatell, Rochelle

    2014-01-01

    Recent genomic and biological studies of neuroblastoma have shed light on the dramatic heterogeneity in the clinical behaviour of this disease, which spans from spontaneous regression or differentiation in some patients, to relentless disease progression in others, despite intensive multimodality therapy. This evidence also suggests several possible mechanisms to explain the phenomena of spontaneous regression in neuroblastomas, including neurotrophin deprivation, humoral or cellular immunity, loss of telomerase activity and alterations in epigenetic regulation. A better understanding of the mechanisms of spontaneous regression might help to identify optimal therapeutic approaches for patients with these tumours. Currently, the most druggable mechanism is the delayed activation of developmentally programmed cell death regulated by the tropomyosin receptor kinase A pathway. Indeed, targeted therapy aimed at inhibiting neurotrophin receptors might be used in lieu of conventional chemotherapy or radiation in infants with biologically favourable tumours that require treatment. Alternative approaches consist of breaking immune tolerance to tumour antigens or activating neurotrophin receptor pathways to induce neuronal differentiation. These approaches are likely to be most effective against biologically favourable tumours, but they might also provide insights into treatment of biologically unfavourable tumours. We describe the different mechanisms of spontaneous neuroblastoma regression and the consequent therapeutic approaches. PMID:25331179

  1. On directional multiple-output quantile regression

    Czech Academy of Sciences Publication Activity Database

    Paindaveine, D.; Šiman, Miroslav

    2011-01-01

    Roč. 102, č. 2 (2011), s. 193-212 ISSN 0047-259X R&D Projects: GA MŠk(CZ) 1M06047 Grant - others:Commision EC(BE) Fonds National de la Recherche Scientifique Institutional research plan: CEZ:AV0Z10750506 Keywords : multivariate quantile * quantile regression * multiple-output regression * halfspace depth * portfolio optimization * value-at risk Subject RIV: BA - General Mathematics Impact factor: 0.879, year: 2011 http://library.utia.cas.cz/separaty/2011/SI/siman-0364128.pdf

  2. On weighted and locally polynomial directional quantile regression

    Czech Academy of Sciences Publication Activity Database

    Boček, Pavel; Šiman, Miroslav

    2017-01-01

    Roč. 32, č. 3 (2017), s. 929-946 ISSN 0943-4062 R&D Projects: GA ČR GA14-07234S Institutional support: RVO:67985556 Keywords : Quantile regression * Nonparametric regression * Nonparametric regression Subject RIV: IN - Informatics, Computer Science OBOR OECD: Computer sciences, information science, bioinformathics (hardware development to be 2.2, social aspect to be 5.8) Impact factor: 0.434, year: 2016 http://library.utia.cas.cz/separaty/2017/SI/bocek-0458380.pdf

  3. Healthcare Expenditures Associated with Depression Among Individuals with Osteoarthritis: Post-Regression Linear Decomposition Approach.

    Science.gov (United States)

    Agarwal, Parul; Sambamoorthi, Usha

    2015-12-01

    Depression is common among individuals with osteoarthritis and leads to increased healthcare burden. The objective of this study was to examine excess total healthcare expenditures associated with depression among individuals with osteoarthritis in the US. Adults with self-reported osteoarthritis (n = 1881) were identified using data from the 2010 Medical Expenditure Panel Survey (MEPS). Among those with osteoarthritis, chi-square tests and ordinary least square regressions (OLS) were used to examine differences in healthcare expenditures between those with and without depression. Post-regression linear decomposition technique was used to estimate the relative contribution of different constructs of the Anderson's behavioral model, i.e., predisposing, enabling, need, personal healthcare practices, and external environment factors, to the excess expenditures associated with depression among individuals with osteoarthritis. All analysis accounted for the complex survey design of MEPS. Depression coexisted among 20.6 % of adults with osteoarthritis. The average total healthcare expenditures were $13,684 among adults with depression compared to $9284 among those without depression. Multivariable OLS regression revealed that adults with depression had 38.8 % higher healthcare expenditures (p regression linear decomposition analysis indicated that 50 % of differences in expenditures among adults with and without depression can be explained by differences in need factors. Among individuals with coexisting osteoarthritis and depression, excess healthcare expenditures associated with depression were mainly due to comorbid anxiety, chronic conditions and poor health status. These expenditures may potentially be reduced by providing timely intervention for need factors or by providing care under a collaborative care model.

  4. Identifying risk factors that contribute to acute mountain sickness ...

    African Journals Online (AJOL)

    This study is a questionnaire-based study conducted in London and at Everest Base Camp, in which 116 lowlanders were invited to participate and fill in a questionnaire to identify potential risk factors in their history that may have contributed to development of or protection against AMS. Results. A total of 89 lowlanders ...

  5. Investigating the Effect of Complexity Factors in Stoichiometry Problems Using Logistic Regression and Eye Tracking

    Science.gov (United States)

    Tang, Hui; Kirk, John; Pienta, Norbert J.

    2014-01-01

    This paper includes two experiments, one investigating complexity factors in stoichiometry word problems, and the other identifying students' problem-solving protocols by using eye-tracking technology. The word problems used in this study had five different complexity factors, which were randomly assigned by a Web-based tool that we developed. The…

  6. Identifying and assessing the factors affecting skill gap in digital marketing in communication industry companies

    Directory of Open Access Journals (Sweden)

    Fereshteh Ghotbifar

    2017-03-01

    Full Text Available As far as new communication channels are concerned, there have been extensive developments in communications and marketing in digital era. Today, therefore, companies try to take advantage of digital marketing channels to provide suitable services to customers to improve their satisfaction level. However, this study aimed to identify and assess factors affecting skill gap in digital marketing. This was descriptive correlation study. The population consisted of experts in communications industry to identify most important skill gaps in digital marketing and factors affecting them; also, managers and specialists of these companies were investigated to determine the role of identified factors in reducing skills gaps. Using localized questionnaire and interviewing with ten experts who were selected by Delphi snowball method, the skill gaps in marketing and factors affecting them were identified. Also, a researcher made questionnaire with 32 questions was distributed among 226 employees to investigate the identified factors role in reducing skills gap in digital marketing. The results showed that from four identified factors, the components including operational strategic factors and environmental factors had direct and positive impact on creating skill gap in digital marketing of studied companies. The environmental factors such as social and cultural conditions, religion, technology, and economy had more proactive impact on skills gap in digital marketing. Also, the results showed that among skills gaps in digital marketing of studied companies, the skills (Principles of Communication and (Predicting Future had the highest and lowest gaps, respectively.

  7. Risk factors for atherosclerosis - can they be used to identify the ...

    African Journals Online (AJOL)

    Risk factors are often used in preventive care programmes to identify the patient at particular risk for developing atherosclerosis. Risk factors for atherosclerosis have also been shown to be linked to the presence of the disease at a given time, a fact that may be helpful when screening for additional atherosclerotic disease in ...

  8. Poisson Mixture Regression Models for Heart Disease Prediction.

    Science.gov (United States)

    Mufudza, Chipo; Erol, Hamza

    2016-01-01

    Early heart disease control can be achieved by high disease prediction and diagnosis efficiency. This paper focuses on the use of model based clustering techniques to predict and diagnose heart disease via Poisson mixture regression models. Analysis and application of Poisson mixture regression models is here addressed under two different classes: standard and concomitant variable mixture regression models. Results show that a two-component concomitant variable Poisson mixture regression model predicts heart disease better than both the standard Poisson mixture regression model and the ordinary general linear Poisson regression model due to its low Bayesian Information Criteria value. Furthermore, a Zero Inflated Poisson Mixture Regression model turned out to be the best model for heart prediction over all models as it both clusters individuals into high or low risk category and predicts rate to heart disease componentwise given clusters available. It is deduced that heart disease prediction can be effectively done by identifying the major risks componentwise using Poisson mixture regression model.

  9. Poisson Mixture Regression Models for Heart Disease Prediction

    Science.gov (United States)

    Erol, Hamza

    2016-01-01

    Early heart disease control can be achieved by high disease prediction and diagnosis efficiency. This paper focuses on the use of model based clustering techniques to predict and diagnose heart disease via Poisson mixture regression models. Analysis and application of Poisson mixture regression models is here addressed under two different classes: standard and concomitant variable mixture regression models. Results show that a two-component concomitant variable Poisson mixture regression model predicts heart disease better than both the standard Poisson mixture regression model and the ordinary general linear Poisson regression model due to its low Bayesian Information Criteria value. Furthermore, a Zero Inflated Poisson Mixture Regression model turned out to be the best model for heart prediction over all models as it both clusters individuals into high or low risk category and predicts rate to heart disease componentwise given clusters available. It is deduced that heart disease prediction can be effectively done by identifying the major risks componentwise using Poisson mixture regression model. PMID:27999611

  10. EMPIRICAL STUDY OF DIFFERENT FACTORS EFFECTS ON ARTICLES PUBLICATION REGARDING SURVEY INTERVIEWER CHARACTERISTICS USING MULTILEVEL REGRESSION MODEL

    Directory of Open Access Journals (Sweden)

    Alina MOROŞANU

    2013-06-01

    Full Text Available The purpose of this research work is to evaluate the effects which some factors could have on articles publication regarding survey interviewer characteristics. For this, the author studied the existing literature from the various fields in which articles on survey interviewer characteristics has been published and which can be found in online articles database. The analysis was performed on 243 articles achieved by researchers in the time period 1949-2012. Using statistical software R and applying multilevel regression model, the results showed that the time period when the studied articles are made and the interaction between the number of authors and the number of pages affect the most their publication in journals with a certain level of impact factor.

  11. Predicting Success in Product Development: The Application of Principal Component Analysis to Categorical Data and Binomial Logistic Regression

    Directory of Open Access Journals (Sweden)

    Glauco H.S. Mendes

    2013-09-01

    Full Text Available Critical success factors in new product development (NPD in the Brazilian small and medium enterprises (SMEs are identified and analyzed. Critical success factors are best practices that can be used to improve NPD management and performance in a company. However, the traditional method for identifying these factors is survey methods. Subsequently, the collected data are reduced through traditional multivariate analysis. The objective of this work is to develop a logistic regression model for predicting the success or failure of the new product development. This model allows for an evaluation and prioritization of resource commitments. The results will be helpful for guiding management actions, as one way to improve NPD performance in those industries.

  12. Testing hypotheses for differences between linear regression lines

    Science.gov (United States)

    Stanley J. Zarnoch

    2009-01-01

    Five hypotheses are identified for testing differences between simple linear regression lines. The distinctions between these hypotheses are based on a priori assumptions and illustrated with full and reduced models. The contrast approach is presented as an easy and complete method for testing for overall differences between the regressions and for making pairwise...

  13. Regression Models for Repairable Systems

    Czech Academy of Sciences Publication Activity Database

    Novák, Petr

    2015-01-01

    Roč. 17, č. 4 (2015), s. 963-972 ISSN 1387-5841 Institutional support: RVO:67985556 Keywords : Reliability analysis * Repair models * Regression Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.782, year: 2015 http://library.utia.cas.cz/separaty/2015/SI/novak-0450902.pdf

  14. Identifying Key Drivers of Return Reversal with Dynamical Bayesian Factor Graph.

    Directory of Open Access Journals (Sweden)

    Shuai Zhao

    Full Text Available In the stock market, return reversal occurs when investors sell overbought stocks and buy oversold stocks, reversing the stocks' price trends. In this paper, we develop a new method to identify key drivers of return reversal by incorporating a comprehensive set of factors derived from different economic theories into one unified dynamical Bayesian factor graph. We then use the model to depict factor relationships and their dynamics, from which we make some interesting discoveries about the mechanism behind return reversals. Through extensive experiments on the US stock market, we conclude that among the various factors, the liquidity factors consistently emerge as key drivers of return reversal, which is in support of the theory of liquidity effect. Specifically, we find that stocks with high turnover rates or high Amihud illiquidity measures have a greater probability of experiencing return reversals. Apart from the consistent drivers, we find other drivers of return reversal that generally change from year to year, and they serve as important characteristics for evaluating the trends of stock returns. Besides, we also identify some seldom discussed yet enlightening inter-factor relationships, one of which shows that stocks in Finance and Insurance industry are more likely to have high Amihud illiquidity measures in comparison with those in other industries. These conclusions are robust for return reversals under different thresholds.

  15. Virtual machine consolidation enhancement using hybrid regression algorithms

    Directory of Open Access Journals (Sweden)

    Amany Abdelsamea

    2017-11-01

    Full Text Available Cloud computing data centers are growing rapidly in both number and capacity to meet the increasing demands for highly-responsive computing and massive storage. Such data centers consume enormous amounts of electrical energy resulting in high operating costs and carbon dioxide emissions. The reason for this extremely high energy consumption is not just the quantity of computing resources and the power inefficiency of hardware, but rather lies in the inefficient usage of these resources. VM consolidation involves live migration of VMs hence the capability of transferring a VM between physical servers with a close to zero down time. It is an effective way to improve the utilization of resources and increase energy efficiency in cloud data centers. VM consolidation consists of host overload/underload detection, VM selection and VM placement. Most of the current VM consolidation approaches apply either heuristic-based techniques, such as static utilization thresholds, decision-making based on statistical analysis of historical data; or simply periodic adaptation of the VM allocation. Most of those algorithms rely on CPU utilization only for host overload detection. In this paper we propose using hybrid factors to enhance VM consolidation. Specifically we developed a multiple regression algorithm that uses CPU utilization, memory utilization and bandwidth utilization for host overload detection. The proposed algorithm, Multiple Regression Host Overload Detection (MRHOD, significantly reduces energy consumption while ensuring a high level of adherence to Service Level Agreements (SLA since it gives a real indication of host utilization based on three parameters (CPU, Memory, Bandwidth utilizations instead of one parameter only (CPU utilization. Through simulations we show that our approach reduces power consumption by 6 times compared to single factor algorithms using random workload. Also using PlanetLab workload traces we show that MRHOD improves

  16. Detection of epistatic effects with logic regression and a classical linear regression model.

    Science.gov (United States)

    Malina, Magdalena; Ickstadt, Katja; Schwender, Holger; Posch, Martin; Bogdan, Małgorzata

    2014-02-01

    To locate multiple interacting quantitative trait loci (QTL) influencing a trait of interest within experimental populations, usually methods as the Cockerham's model are applied. Within this framework, interactions are understood as the part of the joined effect of several genes which cannot be explained as the sum of their additive effects. However, if a change in the phenotype (as disease) is caused by Boolean combinations of genotypes of several QTLs, this Cockerham's approach is often not capable to identify them properly. To detect such interactions more efficiently, we propose a logic regression framework. Even though with the logic regression approach a larger number of models has to be considered (requiring more stringent multiple testing correction) the efficient representation of higher order logic interactions in logic regression models leads to a significant increase of power to detect such interactions as compared to a Cockerham's approach. The increase in power is demonstrated analytically for a simple two-way interaction model and illustrated in more complex settings with simulation study and real data analysis.

  17. Use of model plant hosts to identify Pseudomonas aeruginosa virulence factors

    Science.gov (United States)

    Rahme, Laurence G.; Tan, Man-Wah; Le, Long; Wong, Sandy M.; Tompkins, Ronald G.; Calderwood, Stephen B.; Ausubel, Frederick M.

    1997-01-01

    We used plants as an in vivo pathogenesis model for the identification of virulence factors of the human opportunistic pathogen Pseudomonas aeruginosa. Nine of nine TnphoA mutant derivatives of P. aeruginosa strain UCBPP-PA14 that were identified in a plant leaf assay for less pathogenic mutants also exhibited significantly reduced pathogenicity in a burned mouse pathogenicity model, suggesting that P. aeruginosa utilizes common strategies to infect both hosts. Seven of these nine mutants contain TnphoA insertions in previously unknown genes. These results demonstrate that an alternative nonvertebrate host of a human bacterial pathogen can be used in an in vivo high throughput screen to identify novel bacterial virulence factors involved in mammalian pathogenesis. PMID:9371831

  18. Gene expression meta-analysis identifies metastatic pathways and transcription factors in breast cancer

    International Nuclear Information System (INIS)

    Thomassen, Mads; Tan, Qihua; Kruse, Torben A

    2008-01-01

    Metastasis is believed to progress in several steps including different pathways but the determination and understanding of these mechanisms is still fragmentary. Microarray analysis of gene expression patterns in breast tumors has been used to predict outcome in recent studies. Besides classification of outcome, these global expression patterns may reflect biological mechanisms involved in metastasis of breast cancer. Our purpose has been to investigate pathways and transcription factors involved in metastasis by use of gene expression data sets. We have analyzed 8 publicly available gene expression data sets. A global approach, 'gene set enrichment analysis' as well as an approach focusing on a subset of significantly differently regulated genes, GenMAPP, has been applied to rank pathway gene sets according to differential regulation in metastasizing tumors compared to non-metastasizing tumors. Meta-analysis has been used to determine overrepresentation of pathways and transcription factors targets, concordant deregulated in metastasizing breast tumors, in several data sets. The major findings are up-regulation of cell cycle pathways and a metabolic shift towards glucose metabolism reflected in several pathways in metastasizing tumors. Growth factor pathways seem to play dual roles; EGF and PDGF pathways are decreased, while VEGF and sex-hormone pathways are increased in tumors that metastasize. Furthermore, migration, proteasome, immune system, angiogenesis, DNA repair and several signal transduction pathways are associated to metastasis. Finally several transcription factors e.g. E2F, NFY, and YY1 are identified as being involved in metastasis. By pathway meta-analysis many biological mechanisms beyond major characteristics such as proliferation are identified. Transcription factor analysis identifies a number of key factors that support central pathways. Several previously proposed treatment targets are identified and several new pathways that may

  19. A Fast Gradient Method for Nonnegative Sparse Regression With Self-Dictionary

    Science.gov (United States)

    Gillis, Nicolas; Luce, Robert

    2018-01-01

    A nonnegative matrix factorization (NMF) can be computed efficiently under the separability assumption, which asserts that all the columns of the given input data matrix belong to the cone generated by a (small) subset of them. The provably most robust methods to identify these conic basis columns are based on nonnegative sparse regression and self dictionaries, and require the solution of large-scale convex optimization problems. In this paper we study a particular nonnegative sparse regression model with self dictionary. As opposed to previously proposed models, this model yields a smooth optimization problem where the sparsity is enforced through linear constraints. We show that the Euclidean projection on the polyhedron defined by these constraints can be computed efficiently, and propose a fast gradient method to solve our model. We compare our algorithm with several state-of-the-art methods on synthetic data sets and real-world hyperspectral images.

  20. Applying Intelligent Algorithms to Automate the Identification of Error Factors.

    Science.gov (United States)

    Jin, Haizhe; Qu, Qingxing; Munechika, Masahiko; Sano, Masataka; Kajihara, Chisato; Duffy, Vincent G; Chen, Han

    2018-05-03

    Medical errors are the manifestation of the defects occurring in medical processes. Extracting and identifying defects as medical error factors from these processes are an effective approach to prevent medical errors. However, it is a difficult and time-consuming task and requires an analyst with a professional medical background. The issues of identifying a method to extract medical error factors and reduce the extraction difficulty need to be resolved. In this research, a systematic methodology to extract and identify error factors in the medical administration process was proposed. The design of the error report, extraction of the error factors, and identification of the error factors were analyzed. Based on 624 medical error cases across four medical institutes in both Japan and China, 19 error-related items and their levels were extracted. After which, they were closely related to 12 error factors. The relational model between the error-related items and error factors was established based on a genetic algorithm (GA)-back-propagation neural network (BPNN) model. Additionally, compared to GA-BPNN, BPNN, partial least squares regression and support vector regression, GA-BPNN exhibited a higher overall prediction accuracy, being able to promptly identify the error factors from the error-related items. The combination of "error-related items, their different levels, and the GA-BPNN model" was proposed as an error-factor identification technology, which could automatically identify medical error factors.

  1. Identifying factors influencing contraceptive use in Bangladesh: evidence from BDHS 2014 data.

    Science.gov (United States)

    Hossain, M B; Khan, M H R; Ababneh, F; Shaw, J E H

    2018-01-30

    Birth control is the conscious control of the birth rate by methods which temporarily prevent conception by interfering with the normal process of ovulation, fertilization, and implantation. High contraceptive prevalence rate is always expected for controlling births for those countries that are experiencing high population growth rate. The factors that influence contraceptive prevalence are also important to know for policy implication purposes in Bangladesh. This study aims to explore the socio-economic, demographic and others key factors that influence the use of contraception in Bangladesh. The contraception data are extracted from the 2014 Bangladesh Demographic and Health Survey (BDHS) data which were collected by using a two stage stratified random sampling technique that is a source of nested variability. The nested sources of variability must be incorporated in the model using random effects in order to model the actual parameter effects on contraceptive prevalence. A mixed effect logistic regression model has been implemented for the binary contraceptive data, where parameters are estimated through generalized estimating equation by assuming exchangeable correlation structure to explore and identify the factors that truly affect the use of contraception in Bangladesh. The prevalence of contraception use by currently married 15-49 years aged women or their husbands is 62.4%. Our study finds that administrative division, place of residence, religion, number of household members, woman's age, occupation, body mass index, breastfeeding practice, husband's education, wish for children, living status with wife, sexual activity in past year, women amenorrheic status, abstaining status, number of children born in last five years and total children ever died were significantly associated with contraception use in Bangladesh. The odds of women experiencing the outcome of interest are not independent due to the nested structure of the data. As a result, a mixed

  2. Genome-wide association study identifies HLA 8.1 ancestral haplotype alleles as major genetic risk factors for myositis phenotypes.

    Science.gov (United States)

    Miller, F W; Chen, W; O'Hanlon, T P; Cooper, R G; Vencovsky, J; Rider, L G; Danko, K; Wedderburn, L R; Lundberg, I E; Pachman, L M; Reed, A M; Ytterberg, S R; Padyukov, L; Selva-O'Callaghan, A; Radstake, T R; Isenberg, D A; Chinoy, H; Ollier, W E R; Scheet, P; Peng, B; Lee, A; Byun, J; Lamb, J A; Gregersen, P K; Amos, C I

    2015-10-01

    Autoimmune muscle diseases (myositis) comprise a group of complex phenotypes influenced by genetic and environmental factors. To identify genetic risk factors in patients of European ancestry, we conducted a genome-wide association study (GWAS) of the major myositis phenotypes in a total of 1710 cases, which included 705 adult dermatomyositis, 473 juvenile dermatomyositis, 532 polymyositis and 202 adult dermatomyositis, juvenile dermatomyositis or polymyositis patients with anti-histidyl-tRNA synthetase (anti-Jo-1) autoantibodies, and compared them with 4724 controls. Single-nucleotide polymorphisms showing strong associations (Pmyositis phenotypes together, as well as for the four clinical and autoantibody phenotypes studied separately. Imputation and regression analyses found that alleles comprising the human leukocyte antigen (HLA) 8.1 ancestral haplotype (AH8.1) defined essentially all the genetic risk in the phenotypes studied. Although the HLA DRB1*03:01 allele showed slightly stronger associations with adult and juvenile dermatomyositis, and HLA B*08:01 with polymyositis and anti-Jo-1 autoantibody-positive myositis, multiple alleles of AH8.1 were required for the full risk effects. Our findings establish that alleles of the AH8.1 comprise the primary genetic risk factors associated with the major myositis phenotypes in geographically diverse Caucasian populations.

  3. Introduction to the use of regression models in epidemiology.

    Science.gov (United States)

    Bender, Ralf

    2009-01-01

    Regression modeling is one of the most important statistical techniques used in analytical epidemiology. By means of regression models the effect of one or several explanatory variables (e.g., exposures, subject characteristics, risk factors) on a response variable such as mortality or cancer can be investigated. From multiple regression models, adjusted effect estimates can be obtained that take the effect of potential confounders into account. Regression methods can be applied in all epidemiologic study designs so that they represent a universal tool for data analysis in epidemiology. Different kinds of regression models have been developed in dependence on the measurement scale of the response variable and the study design. The most important methods are linear regression for continuous outcomes, logistic regression for binary outcomes, Cox regression for time-to-event data, and Poisson regression for frequencies and rates. This chapter provides a nontechnical introduction to these regression models with illustrating examples from cancer research.

  4. Length bias correction in gene ontology enrichment analysis using logistic regression.

    Science.gov (United States)

    Mi, Gu; Di, Yanming; Emerson, Sarah; Cumbie, Jason S; Chang, Jeff H

    2012-01-01

    When assessing differential gene expression from RNA sequencing data, commonly used statistical tests tend to have greater power to detect differential expression of genes encoding longer transcripts. This phenomenon, called "length bias", will influence subsequent analyses such as Gene Ontology enrichment analysis. In the presence of length bias, Gene Ontology categories that include longer genes are more likely to be identified as enriched. These categories, however, are not necessarily biologically more relevant. We show that one can effectively adjust for length bias in Gene Ontology analysis by including transcript length as a covariate in a logistic regression model. The logistic regression model makes the statistical issue underlying length bias more transparent: transcript length becomes a confounding factor when it correlates with both the Gene Ontology membership and the significance of the differential expression test. The inclusion of the transcript length as a covariate allows one to investigate the direct correlation between the Gene Ontology membership and the significance of testing differential expression, conditional on the transcript length. We present both real and simulated data examples to show that the logistic regression approach is simple, effective, and flexible.

  5. Dual Regression

    OpenAIRE

    Spady, Richard; Stouli, Sami

    2012-01-01

    We propose dual regression as an alternative to the quantile regression process for the global estimation of conditional distribution functions under minimal assumptions. Dual regression provides all the interpretational power of the quantile regression process while avoiding the need for repairing the intersecting conditional quantile surfaces that quantile regression often produces in practice. Our approach introduces a mathematical programming characterization of conditional distribution f...

  6. Elliptical multiple-output quantile regression and convex optimization

    Czech Academy of Sciences Publication Activity Database

    Hallin, M.; Šiman, Miroslav

    2016-01-01

    Roč. 109, č. 1 (2016), s. 232-237 ISSN 0167-7152 R&D Projects: GA ČR GA14-07234S Institutional support: RVO:67985556 Keywords : quantile regression * elliptical quantile * multivariate quantile * multiple-output regression Subject RIV: BA - General Mathematics Impact factor: 0.540, year: 2016 http://library.utia.cas.cz/separaty/2016/SI/siman-0458243.pdf

  7. Approximate median regression for complex survey data with skewed response.

    Science.gov (United States)

    Fraser, Raphael André; Lipsitz, Stuart R; Sinha, Debajyoti; Fitzmaurice, Garrett M; Pan, Yi

    2016-12-01

    The ready availability of public-use data from various large national complex surveys has immense potential for the assessment of population characteristics using regression models. Complex surveys can be used to identify risk factors for important diseases such as cancer. Existing statistical methods based on estimating equations and/or utilizing resampling methods are often not valid with survey data due to complex survey design features. That is, stratification, multistage sampling, and weighting. In this article, we accommodate these design features in the analysis of highly skewed response variables arising from large complex surveys. Specifically, we propose a double-transform-both-sides (DTBS)'based estimating equations approach to estimate the median regression parameters of the highly skewed response; the DTBS approach applies the same Box-Cox type transformation twice to both the outcome and regression function. The usual sandwich variance estimate can be used in our approach, whereas a resampling approach would be needed for a pseudo-likelihood based on minimizing absolute deviations (MAD). Furthermore, the approach is relatively robust to the true underlying distribution, and has much smaller mean square error than a MAD approach. The method is motivated by an analysis of laboratory data on urinary iodine (UI) concentration from the National Health and Nutrition Examination Survey. © 2016, The International Biometric Society.

  8. Sensitized mutagenesis screen in Factor V Leiden mice identifies thrombosis suppressor loci.

    Science.gov (United States)

    Westrick, Randal J; Tomberg, Kärt; Siebert, Amy E; Zhu, Guojing; Winn, Mary E; Dobies, Sarah L; Manning, Sara L; Brake, Marisa A; Cleuren, Audrey C; Hobbs, Linzi M; Mishack, Lena M; Johnston, Alexander J; Kotnik, Emilee; Siemieniak, David R; Xu, Jishu; Li, Jun Z; Saunders, Thomas L; Ginsburg, David

    2017-09-05

    Factor V Leiden ( F5 L ) is a common genetic risk factor for venous thromboembolism in humans. We conducted a sensitized N -ethyl- N -nitrosourea (ENU) mutagenesis screen for dominant thrombosuppressor genes based on perinatal lethal thrombosis in mice homozygous for F5 L ( F5 L/L ) and haploinsufficient for tissue factor pathway inhibitor ( Tfpi +/- ). F8 deficiency enhanced the survival of F5 L/L Tfpi +/- mice, demonstrating that F5 L/L Tfpi +/- lethality is genetically suppressible. ENU-mutagenized F5 L/L males and F5 L/+ Tfpi +/- females were crossed to generate 6,729 progeny, with 98 F5 L/L Tfpi +/- offspring surviving until weaning. Sixteen lines, referred to as "modifier of Factor 5 Leiden ( MF5L1-16 )," exhibited transmission of a putative thrombosuppressor to subsequent generations. Linkage analysis in MF5L6 identified a chromosome 3 locus containing the tissue factor gene ( F3 ). Although no ENU-induced F3 mutation was identified, haploinsufficiency for F3 ( F3 +/- ) suppressed F5 L/L Tfpi +/- lethality. Whole-exome sequencing in MF5L12 identified an Actr2 gene point mutation (p.R258G) as the sole candidate. Inheritance of this variant is associated with suppression of F5 L/L Tfpi +/- lethality ( P = 1.7 × 10 -6 ), suggesting that Actr2 p.R258G is thrombosuppressive. CRISPR/Cas9 experiments to generate an independent Actr2 knockin/knockout demonstrated that Actr2 haploinsufficiency is lethal, supporting a hypomorphic or gain-of-function mechanism of action for Actr2 p.R258G Our findings identify F8 and the Tfpi/F3 axis as key regulators in determining thrombosis balance in the setting of F5 L and also suggest a role for Actr2 in this process.

  9. Multi-step polynomial regression method to model and forecast malaria incidence.

    Directory of Open Access Journals (Sweden)

    Chandrajit Chatterjee

    Full Text Available Malaria is one of the most severe problems faced by the world even today. Understanding the causative factors such as age, sex, social factors, environmental variability etc. as well as underlying transmission dynamics of the disease is important for epidemiological research on malaria and its eradication. Thus, development of suitable modeling approach and methodology, based on the available data on the incidence of the disease and other related factors is of utmost importance. In this study, we developed a simple non-linear regression methodology in modeling and forecasting malaria incidence in Chennai city, India, and predicted future disease incidence with high confidence level. We considered three types of data to develop the regression methodology: a longer time series data of Slide Positivity Rates (SPR of malaria; a smaller time series data (deaths due to Plasmodium vivax of one year; and spatial data (zonal distribution of P. vivax deaths for the city along with the climatic factors, population and previous incidence of the disease. We performed variable selection by simple correlation study, identification of the initial relationship between variables through non-linear curve fitting and used multi-step methods for induction of variables in the non-linear regression analysis along with applied Gauss-Markov models, and ANOVA for testing the prediction, validity and constructing the confidence intervals. The results execute the applicability of our method for different types of data, the autoregressive nature of forecasting, and show high prediction power for both SPR and P. vivax deaths, where the one-lag SPR values plays an influential role and proves useful for better prediction. Different climatic factors are identified as playing crucial role on shaping the disease curve. Further, disease incidence at zonal level and the effect of causative factors on different zonal clusters indicate the pattern of malaria prevalence in the city

  10. Identifying Factors Reinforcing Robotization: Interactive Forces of Employment, Working Hour and Wage

    Directory of Open Access Journals (Sweden)

    Joonmo Cho

    2018-02-01

    Full Text Available Unlike previous studies on robotization approaching the future based on the cutting-edge technologies and adopting a framework where robotization is considered as an exogenous variable, this study considers that robotization occurs endogenously and uses it as a dependent variable for an objective examination of the effect of robotization on the labor market. To this end, a robotization indicator is created based on the actual number of industrial robots currently deployed in workplaces, and a multiple regression analysis is performed using the robotization indicator and labor variables such as employment, working hours, and wage. The results using the multiple regression considering the triangular relationship of employment–working-hours–wages show that job destruction due to robotization is not too remarkable yet that use. Our results show the complementary relation between employment and robotization, but the substituting relation between working hour and robotization. The results also demonstrate the effects of union, the size of the company and the proportion of production workers and simple labor workers etc. These findings indicate that the degree of robotization may vary with many factors of the labor market. Limitations of this study and implications for future research are also discussed.

  11. Marital status integration and suicide: A meta-analysis and meta-regression.

    Science.gov (United States)

    Kyung-Sook, Woo; SangSoo, Shin; Sangjin, Shin; Young-Jeon, Shin

    2018-01-01

    Marital status is an index of the phenomenon of social integration within social structures and has long been identified as an important predictor suicide. However, previous meta-analyses have focused only on a particular marital status, or not sufficiently explored moderators. A meta-analysis of observational studies was conducted to explore the relationships between marital status and suicide and to understand the important moderating factors in this association. Electronic databases were searched to identify studies conducted between January 1, 2000 and June 30, 2016. We performed a meta-analysis, subgroup analysis, and meta-regression of 170 suicide risk estimates from 36 publications. Using random effects model with adjustment for covariates, the study found that the suicide risk for non-married versus married was OR = 1.92 (95% CI: 1.75-2.12). The suicide risk was higher for non-married individuals aged analysis by gender, non-married men exhibited a greater risk of suicide than their married counterparts in all sub-analyses, but women aged 65 years or older showed no significant association between marital status and suicide. The suicide risk in divorced individuals was higher than for non-married individuals in both men and women. The meta-regression showed that gender, age, and sample size affected between-study variation. The results of the study indicated that non-married individuals have an aggregate higher suicide risk than married ones. In addition, gender and age were confirmed as important moderating factors in the relationship between marital status and suicide. Copyright © 2017 Elsevier Ltd. All rights reserved.

  12. Identifying the customer satisfaction factors in furniture market

    Directory of Open Access Journals (Sweden)

    Majid Azizi

    2017-05-01

    Full Text Available Purpose – the purpose of this research is to identify the influential factors on customer satisfaction in the Iranian furniture market in order to get acquainted with the fundamental items for planning future sales programs with the purposes of extolling competitive advantages. Design/methodology/approach – A commixture of 6 items and 31 factors were educed from interviewing with 20 experts in furniture designing and manufacturing industry. The collected data from customer need indexes in previous research were also used. Findings – results showed that such factors as economic factors weighting 0.32, product specifications weighting 0.21 and credibility weighting 0.19 were the most important indexes and price weighting 0.195, fame weighting 0.131, quality, durability and resistance weighting 0.116, paying conditions weighting 0.095, designing and decorating in virtual softwares before ordering weighting 0.074, updatedness weighting 0.064 and interaction approach with the weight of 0.42 were the most considerable influential sub-indexes on the satisfaction of the Iranian furniture market customers. Research limitations/implications – by the enhancement of competition throughout the world markets and the inevitable presence of Iran in it, the market activists’ concentration should shift towards paying comprehensive attention to desires and needs of furniture market customers. Practical implications – some important issues on planning suitable manufacturing and marketing programs in furniture market are introduce so that the activists be aware of considering the growing knowledge and awareness of end-users which increases the pressure on the manufacturer side. There are also some solutions in terms of internal and external organizational factors with regard to the complex nature of competitive environment in furniture market. Originality/value – the paper provides an examination of effective factors on customer satisfaction with a

  13. Identifying important motivational factors for professionals in Greek hospitals

    Science.gov (United States)

    Kontodimopoulos, Nick; Paleologou, Victoria; Niakas, Dimitris

    2009-01-01

    Background The purpose of this study was to identify important motivational factors according to the views of health-care professionals in Greek hospitals and particularly to determine if these might differ in the public and private sectors. Methods A previously developed -and validated- instrument addressing four work-related motivators (job attributes, remuneration, co-workers and achievements) was used. Three categories of health care professionals, doctors (N = 354), nurses (N = 581) and office workers (N = 418), working in public and private hospitals, participated and motivation was compared across socio-demographic and occupational variables. Results The range of reported motivational factors was mixed and Maslow's conclusions that lower level motivational factors must be met before ascending to the next level were not confirmed. The highest ranked motivator for the entire sample, and by professional subgroup, was achievements (P motivators were similar, and only one significant difference was observed, namely between doctors and nurses in respect to co-workers (P motivated by all factors significantly more than their public-hospital counterparts. Conclusion The results are in agreement with the literature which focuses attention to management approaches employing both monetary and non-monetary incentives to motivate health care workers. This study showed that intrinsic factors are particularly important and should become a target for effective employee motivation. PMID:19754968

  14. Application of geographically-weighted regression analysis to assess risk factors for malaria hotspots in Keur Soce health and demographic surveillance site.

    Science.gov (United States)

    Ndiath, Mansour M; Cisse, Badara; Ndiaye, Jean Louis; Gomis, Jules F; Bathiery, Ousmane; Dia, Anta Tal; Gaye, Oumar; Faye, Babacar

    2015-11-18

    In Senegal, considerable efforts have been made to reduce malaria morbidity and mortality during the last decade. This resulted in a marked decrease of malaria cases. With the decline of malaria cases, transmission has become sparse in most Senegalese health districts. This study investigated malaria hotspots in Keur Soce sites by using geographically-weighted regression. Because of the occurrence of hotspots, spatial modelling of malaria cases could have a considerable effect in disease surveillance. This study explored and analysed the spatial relationships between malaria occurrence and socio-economic and environmental factors in small communities in Keur Soce, Senegal, using 6 months passive surveillance. Geographically-weighted regression was used to explore the spatial variability of relationships between malaria incidence or persistence and the selected socio-economic, and human predictors. A model comparison of between ordinary least square and geographically-weighted regression was also explored. Vector dataset (spatial) of the study area by village levels and statistical data (non-spatial) on malaria confirmed cases, socio-economic status (bed net use), population data (size of the household) and environmental factors (temperature, rain fall) were used in this exploratory analysis. ArcMap 10.2 and Stata 11 were used to perform malaria hotspots analysis. From Jun to December, a total of 408 confirmed malaria cases were notified. The explanatory variables-household size, housing materials, sleeping rooms, sheep and distance to breeding site returned significant t values of -0.25, 2.3, 4.39, 1.25 and 2.36, respectively. The OLS global model revealed that it explained about 70 % (adjusted R(2) = 0.70) of the variation in malaria occurrence with AIC = 756.23. The geographically-weighted regression of malaria hotspots resulted in coefficient intercept ranging from 1.89 to 6.22 with a median of 3.5. Large positive values are distributed mainly in the southeast

  15. Estimating Loess Plateau Average Annual Precipitation with Multiple Linear Regression Kriging and Geographically Weighted Regression Kriging

    Directory of Open Access Journals (Sweden)

    Qiutong Jin

    2016-06-01

    Full Text Available Estimating the spatial distribution of precipitation is an important and challenging task in hydrology, climatology, ecology, and environmental science. In order to generate a highly accurate distribution map of average annual precipitation for the Loess Plateau in China, multiple linear regression Kriging (MLRK and geographically weighted regression Kriging (GWRK methods were employed using precipitation data from the period 1980–2010 from 435 meteorological stations. The predictors in regression Kriging were selected by stepwise regression analysis from many auxiliary environmental factors, such as elevation (DEM, normalized difference vegetation index (NDVI, solar radiation, slope, and aspect. All predictor distribution maps had a 500 m spatial resolution. Validation precipitation data from 130 hydrometeorological stations were used to assess the prediction accuracies of the MLRK and GWRK approaches. Results showed that both prediction maps with a 500 m spatial resolution interpolated by MLRK and GWRK had a high accuracy and captured detailed spatial distribution data; however, MLRK produced a lower prediction error and a higher variance explanation than GWRK, although the differences were small, in contrast to conclusions from similar studies.

  16. The Promise of Virtual Teams: Identifying Key Factors in Effectiveness and Failure

    Science.gov (United States)

    Horwitz, Frank M.; Bravington, Desmond; Silvis, Ulrik

    2006-01-01

    Purpose: The aim of the investigation is to identify enabling and disenabling factors in the development and operation of virtual teams; to evaluate the importance of factors such as team development, cross-cultural variables, leadership, communication and social cohesion as contributors to virtual team effectiveness. Design/methodology/approach:…

  17. Identifying Trajectories of Borderline Personality Features in Adolescence: Antecedent and Interactive Risk Factors.

    Science.gov (United States)

    Haltigan, John D; Vaillancourt, Tracy

    2016-03-01

    To examine trajectories of adolescent borderline personality (BP) features in a normative-risk cohort (n = 566) of Canadian children assessed at ages 13, 14, 15, and 16 and childhood predictors of trajectory group membership assessed at ages 8, 10, 11, and 12. Data were drawn from the McMaster Teen Study, an on-going study examining relations among bullying, mental health, and academic achievement. Participants and their parents completed a battery of mental health and peer relations questionnaires at each wave of the study. Academic competence was assessed at age 8 (Grade 3). Latent class growth analysis, analysis of variance, and logistic regression were used to analyze the data. Three distinct BP features trajectory groups were identified: elevated or rising, intermediate or stable, and low or stable. Parent- and child-reported mental health symptoms, peer relations risk factors, and intra-individual risk factors were significant predictors of elevated or rising and intermediate or stable trajectory groups. Child-reported attention-deficit hyperactivity disorder (ADHD) and somatization symptoms uniquely predicted elevated or rising trajectory group membership, whereas parent-reported anxiety and child-reported ADHD symptoms uniquely predicted intermediate or stable trajectory group membership. Child-reported somatization symptoms was the only predictor to differentiate the intermediate or stable and elevated or rising trajectory groups (OR 1.15, 95% CI 1.04 to 1.28). Associations between child-reported reactive temperament and elevated BP features trajectory group membership were 10.23 times higher among children who were bullied, supporting a diathesis-stress pathway in the development of BP features for these youth. Findings demonstrate the heterogeneous course of BP features in early adolescence and shed light on the potential prodromal course of later borderline personality disorder. © The Author(s) 2015.

  18. Predictive factors on the efficacy and risk/intensity of tooth sensitivity of dental bleaching: A multi regression and logistic analysis.

    Science.gov (United States)

    Rezende, Márcia; Loguercio, Alessandro D; Kossatz, Stella; Reis, Alessandra

    2016-02-01

    The aim of this study was to identify predictor factors associated with the whitening outcome and risk and intensity of bleaching-induced tooth sensitivity from pooled data of 11 clinical trials of dental bleaching performed by the same research group. The individual patient data of several published and ongoing studies about dental bleaching was collected and retrospectively analyzed. At the patient-level, independent variables (bleaching techniques [at-home and in-office protocols], sex, age and baseline tooth color in shade guide unit [SGU]) as well as dependent variables (color change in shade guide units (ΔSGU), color change in the CIEL*a*b* system (ΔE), risk and intensity of TS in a visual analog scale) were collected. Multivariable linear regression and multivariable logistic regression models were carried out using backward elimination whenever the p-values were higher than 0.05. A significant relationship between baseline color and age on color change estimates was detected (pwhitening degree of 0.07 for the final ΔSGU and 0.69 for the ΔE. The bleaching technique was shown to be a significant predictor of ΔSGU (prisk of TS for at-home bleaching was 51% (95% CI 41.4-60.6) and for the in-office 62.9% (95% CI 56.9-67.3). Younger patients with darker teeth reach a higher degree of whitening. Patient with darker teeth and submitted to at-home bleaching presents lower risk and intensity of TS. The baseline color of the teeth and the patient's age is directly related to the effectiveness of dental bleaching and TS. Copyright © 2015 Elsevier Ltd. All rights reserved.

  19. Which sociodemographic factors are important on smoking behaviour of high school students? The contribution of classification and regression tree methodology in a broad epidemiological survey.

    Science.gov (United States)

    Ozge, C; Toros, F; Bayramkaya, E; Camdeviren, H; Sasmaz, T

    2006-08-01

    The purpose of this study is to evaluate the most important sociodemographic factors on smoking status of high school students using a broad randomised epidemiological survey. Using in-class, self administered questionnaire about their sociodemographic variables and smoking behaviour, a representative sample of total 3304 students of preparatory, 9th, 10th, and 11th grades, from 22 randomly selected schools of Mersin, were evaluated and discriminative factors have been determined using appropriate statistics. In addition to binary logistic regression analysis, the study evaluated combined effects of these factors using classification and regression tree methodology, as a new statistical method. The data showed that 38% of the students reported lifetime smoking and 16.9% of them reported current smoking with a male predominancy and increasing prevalence by age. Second hand smoking was reported at a 74.3% frequency with father predominance (56.6%). The significantly important factors that affect current smoking in these age groups were increased by household size, late birth rank, certain school types, low academic performance, increased second hand smoking, and stress (especially reported as separation from a close friend or because of violence at home). Classification and regression tree methodology showed the importance of some neglected sociodemographic factors with a good classification capacity. It was concluded that, as closely related with sociocultural factors, smoking was a common problem in this young population, generating important academic and social burden in youth life and with increasing data about this behaviour and using new statistical methods, effective coping strategies could be composed.

  20. BENCHMARKING - PRACTICAL TOOLS IDENTIFY KEY SUCCESS FACTORS

    Directory of Open Access Journals (Sweden)

    Olga Ju. Malinina

    2016-01-01

    Full Text Available The article gives a practical example of the application of benchmarking techniques. The object of study selected fashion store Company «HLB & M Hennes & Mauritz», located in the shopping center «Gallery», Krasnodar. Hennes & Mauritz. The purpose of this article is to identify the best ways to develop a fashionable brand clothing store Hennes & Mauritz on the basis of benchmarking techniques. On the basis of conducted market research is a comparative analysis of the data from different perspectives. The result of the author’s study is a generalization of the ndings, the development of the key success factors that will allow to plan a successful trading activities in the future, based on the best experience of competitors.

  1. Identifying key hospital service quality factors in online health communities.

    Science.gov (United States)

    Jung, Yuchul; Hur, Cinyoung; Jung, Dain; Kim, Minki

    2015-04-07

    The volume of health-related user-created content, especially hospital-related questions and answers in online health communities, has rapidly increased. Patients and caregivers participate in online community activities to share their experiences, exchange information, and ask about recommended or discredited hospitals. However, there is little research on how to identify hospital service quality automatically from the online communities. In the past, in-depth analysis of hospitals has used random sampling surveys. However, such surveys are becoming impractical owing to the rapidly increasing volume of online data and the diverse analysis requirements of related stakeholders. As a solution for utilizing large-scale health-related information, we propose a novel approach to identify hospital service quality factors and overtime trends automatically from online health communities, especially hospital-related questions and answers. We defined social media-based key quality factors for hospitals. In addition, we developed text mining techniques to detect such factors that frequently occur in online health communities. After detecting these factors that represent qualitative aspects of hospitals, we applied a sentiment analysis to recognize the types of recommendations in messages posted within online health communities. Korea's two biggest online portals were used to test the effectiveness of detection of social media-based key quality factors for hospitals. To evaluate the proposed text mining techniques, we performed manual evaluations on the extraction and classification results, such as hospital name, service quality factors, and recommendation types using a random sample of messages (ie, 5.44% (9450/173,748) of the total messages). Service quality factor detection and hospital name extraction achieved average F1 scores of 91% and 78%, respectively. In terms of recommendation classification, performance (ie, precision) is 78% on average. Extraction and

  2. Nonparametric Mixture of Regression Models.

    Science.gov (United States)

    Huang, Mian; Li, Runze; Wang, Shaoli

    2013-07-01

    Motivated by an analysis of US house price index data, we propose nonparametric finite mixture of regression models. We study the identifiability issue of the proposed models, and develop an estimation procedure by employing kernel regression. We further systematically study the sampling properties of the proposed estimators, and establish their asymptotic normality. A modified EM algorithm is proposed to carry out the estimation procedure. We show that our algorithm preserves the ascent property of the EM algorithm in an asymptotic sense. Monte Carlo simulations are conducted to examine the finite sample performance of the proposed estimation procedure. An empirical analysis of the US house price index data is illustrated for the proposed methodology.

  3. Assessment of the expected construction company’s net profit using neural network and multiple regression models

    Directory of Open Access Journals (Sweden)

    H.H. Mohamad

    2013-09-01

    This research aims to develop a mathematical model for assessing the expected net profit of any construction company. To achieve the research objective, four steps were performed. First, the main factors affecting firms’ net profit were identified. Second, pertinent data regarding the net profit factors were collected. Third, two different net profit models were developed using the Multiple Regression (MR and the Neural Network (NN techniques. The validity of the proposed models was also investigated. Finally, the results of both MR and NN models were compared to investigate the predictive capabilities of the two models.

  4. Factors associated with trait anger level of juvenile offenders in Hubei province: A binary logistic regression analysis.

    Science.gov (United States)

    Tang, Li-Na; Ye, Xiao-Zhou; Yan, Qiu-Ge; Chang, Hong-Juan; Ma, Yu-Qiao; Liu, De-Bin; Li, Zhi-Gen; Yu, Yi-Zhen

    2017-02-01

    The risk factors of high trait anger of juvenile offenders were explored through questionnaire study in a youth correctional facility of Hubei province, China. A total of 1090 juvenile offenders in Hubei province were investigated by self-compiled social-demographic questionnaire, Childhood Trauma Questionnaire (CTQ), and State-Trait Anger Expression Inventory-II (STAXI-II). The risk factors were analyzed by chi-square tests, correlation analysis, and binary logistic regression analysis with SPSS 19.0. A total of 1082 copies of valid questionnaires were collected. High trait anger group (n=316) was defined as those who scored in the upper 27th percentile of STAXI-II trait anger scale (TAS), and the rest were defined as low trait anger group (n=766). The risk factors associated with high level of trait anger included: childhood emotional abuse, childhood sexual abuse, step family, frequent drug abuse, and frequent internet using (P0.05). It was suggested that traumatic experience in childhood and unhealthy life style may significantly increase the level of trait anger in adulthood. The risk factors of high trait anger and their effects should be taken into consideration seriously.

  5. Identifying and ranking the factors affecting entrepreneurial marketing to facilitate exports

    Directory of Open Access Journals (Sweden)

    Mehdi Habibzadeh

    2016-04-01

    Full Text Available Small and medium enterprises (SMEs are believed the most important components of today’s businesses and they can boost the growth of economy. This paper presents an empirical investigation to identify and rank important factors influencing on entrepreneurial marketing to facilitate exports of SMEs. The study designs a questionnaire in Likert scale and distributes it among 387 randomly selected entrepreneurs who act as managers of some SMEs in city of Tehran, Iran. Cronbach alpha is calculated as 0.873, which is well above the acceptable level. Using principle component analysis, the study has determined four factors including competitive intelligence, competitive advantage, external factors and internal factors to facilitate the export of SMEs.

  6. Identifying Human Factors Issues in Aircraft Maintenance Operations

    Science.gov (United States)

    Veinott, Elizabeth S.; Kanki, Barbara G.; Shafto, Michael G. (Technical Monitor)

    1995-01-01

    Maintenance operations incidents submitted to the Aviation Safety Reporting System (ASRS) between 1986-1992 were systematically analyzed in order to identify issues relevant to human factors and crew coordination. This exploratory analysis involved 95 ASRS reports which represented a wide range of maintenance incidents. The reports were coded and analyzed according to the type of error (e.g, wrong part, procedural error, non-procedural error), contributing factors (e.g., individual, within-team, cross-team, procedure, tools), result of the error (e.g., aircraft damage or not) as well as the operational impact (e.g., aircraft flown to destination, air return, delay at gate). The main findings indicate that procedural errors were most common (48.4%) and that individual and team actions contributed to the errors in more than 50% of the cases. As for operational results, most errors were either corrected after landing at the destination (51.6%) or required the flight crew to stop enroute (29.5%). Interactions among these variables are also discussed. This analysis is a first step toward developing a taxonomy of crew coordination problems in maintenance. By understanding what variables are important and how they are interrelated, we may develop intervention strategies that are better tailored to the human factor issues involved.

  7. Human chorionic gonadotrophin regression rate as a predictive factor of postmolar gestational trophoblastic neoplasm in high-risk hydatidiform mole: a case-control study.

    Science.gov (United States)

    Kim, Bo Wook; Cho, Hanbyoul; Kim, Hyunki; Nam, Eun Ji; Kim, Sang Wun; Kim, Sunghoon; Kim, Young Tae; Kim, Jae-Hoon

    2012-01-01

    The aim of this study was early prediction of postmolar gestational trophoblastic neoplasm (GTN) after evacuation of high-risk mole, by comparison of human chorionic gonadotrophin (hCG) regression rates. Fifty patients with a high-risk mole initially and spontaneously regressing after molar evacuation were selected from January 1, 1996 to May 31, 2010 (spontaneous regression group). Fifty patients with a high-risk mole initially and progressing to postmolar GTN after molar evacuation were selected (postmolar GTN group). hCG regression rates represented as hCG/initial hCG were compared between the two groups. The sensitivity and specificity of these rates for prediction of postmolar GTN were assessed using receiver operating characteristic curves. Multivariate analyses of associations between risk factors and postmolar GTN progression were performed. The mean regression rate of hCG between the two groups was compared. hCG regression rates represented as hCG/initial hCG (%) were 0.36% in the spontaneous regression group and 1.45% in the postmolar GTN group in the second week (p=0.003). Prediction of postmolar GTN by hCG regression rate revealed a sensitivity of 48.0% and specificity of 89.5% with a cut-off value of 0.716% and area under the curve (AUC) of 0.759 in the 2nd week (pfactor for postmolar GTN. Crown Copyright © 2011. Published by Elsevier Ireland Ltd. All rights reserved.

  8. Effective factors contraceptive use by logistic regression model in Tehran, 1996

    Directory of Open Access Journals (Sweden)

    Ramezani F

    1999-07-01

    Full Text Available Despite unwillingness to fertility, about 30% of couples do not use any kind of contraception and this will lead to unwanted pregnancy. In this clinical trial study, 4177 subjects who had at least one alive child, and delivered in one of the 12 university hospitals in Tehran were recruited. This study was conducted in 1996. The questionnaire included some questions about contraceptive use, their attitudes about unwantedness or wantedness of their current pregnancies. Data were analysed using a Logistic Regrassion Model. Results showed that 20.3% of those who had no fertility intention, did not use any kind of contraception methods, 41.1% of the subjects who were using a contraception method before pregnancy, had got pregnant unwantedly. Based on Logistic Regression Model; age, education, previous familiarity of women with contraception methods and husband's education were the most significant factors in contraceptive use. Subjects who were 20 years old and less or 35 years old and more and illeterate subjects were at higher risk for unuse of contraception methods. This risk was not related to the gender of their children that suggests a positive change in their perspectives towards sex and the number of children. It is suggested that health politicians choose an appropriate model to enhance the literacy, education and counseling for the correct usage of contraceptives and prevention of unwanted pregnancy.

  9. Impacts of land use and population density on seasonal surface water quality using a modified geographically weighted regression.

    Science.gov (United States)

    Chen, Qiang; Mei, Kun; Dahlgren, Randy A; Wang, Ting; Gong, Jian; Zhang, Minghua

    2016-12-01

    As an important regulator of pollutants in overland flow and interflow, land use has become an essential research component for determining the relationships between surface water quality and pollution sources. This study investigated the use of ordinary least squares (OLS) and geographically weighted regression (GWR) models to identify the impact of land use and population density on surface water quality in the Wen-Rui Tang River watershed of eastern China. A manual variable excluding-selecting method was explored to resolve multicollinearity issues. Standard regression coefficient analysis coupled with cluster analysis was introduced to determine which variable had the greatest influence on water quality. Results showed that: (1) Impact of land use on water quality varied with spatial and seasonal scales. Both positive and negative effects for certain land-use indicators were found in different subcatchments. (2) Urban land was the dominant factor influencing N, P and chemical oxygen demand (COD) in highly urbanized regions, but the relationship was weak as the pollutants were mainly from point sources. Agricultural land was the primary factor influencing N and P in suburban and rural areas; the relationship was strong as the pollutants were mainly from agricultural surface runoff. Subcatchments located in suburban areas were identified with urban land as the primary influencing factor during the wet season while agricultural land was identified as a more prevalent influencing factor during the dry season. (3) Adjusted R 2 values in OLS models using the manual variable excluding-selecting method averaged 14.3% higher than using stepwise multiple linear regressions. However, the corresponding GWR models had adjusted R 2 ~59.2% higher than the optimal OLS models, confirming that GWR models demonstrated better prediction accuracy. Based on our findings, water resource protection policies should consider site-specific land-use conditions within each watershed to

  10. Real estate value prediction using multivariate regression models

    Science.gov (United States)

    Manjula, R.; Jain, Shubham; Srivastava, Sharad; Rajiv Kher, Pranav

    2017-11-01

    The real estate market is one of the most competitive in terms of pricing and the same tends to vary significantly based on a lot of factors, hence it becomes one of the prime fields to apply the concepts of machine learning to optimize and predict the prices with high accuracy. Therefore in this paper, we present various important features to use while predicting housing prices with good accuracy. We have described regression models, using various features to have lower Residual Sum of Squares error. While using features in a regression model some feature engineering is required for better prediction. Often a set of features (multiple regressions) or polynomial regression (applying a various set of powers in the features) is used for making better model fit. For these models are expected to be susceptible towards over fitting ridge regression is used to reduce it. This paper thus directs to the best application of regression models in addition to other techniques to optimize the result.

  11. Logistic regression modelling: procedures and pitfalls in developing and interpreting prediction models

    Directory of Open Access Journals (Sweden)

    Nataša Šarlija

    2017-01-01

    Full Text Available This study sheds light on the most common issues related to applying logistic regression in prediction models for company growth. The purpose of the paper is 1 to provide a detailed demonstration of the steps in developing a growth prediction model based on logistic regression analysis, 2 to discuss common pitfalls and methodological errors in developing a model, and 3 to provide solutions and possible ways of overcoming these issues. Special attention is devoted to the question of satisfying logistic regression assumptions, selecting and defining dependent and independent variables, using classification tables and ROC curves, for reporting model strength, interpreting odds ratios as effect measures and evaluating performance of the prediction model. Development of a logistic regression model in this paper focuses on a prediction model of company growth. The analysis is based on predominantly financial data from a sample of 1471 small and medium-sized Croatian companies active between 2009 and 2014. The financial data is presented in the form of financial ratios divided into nine main groups depicting following areas of business: liquidity, leverage, activity, profitability, research and development, investing and export. The growth prediction model indicates aspects of a business critical for achieving high growth. In that respect, the contribution of this paper is twofold. First, methodological, in terms of pointing out pitfalls and potential solutions in logistic regression modelling, and secondly, theoretical, in terms of identifying factors responsible for high growth of small and medium-sized companies.

  12. Risk Factors for Brachial Plexus Birth Injury

    Science.gov (United States)

    Louden, Emily; Marcotte, Michael; Mehlman, Charles; Lippert, William; Huang, Bin; Paulson, Andrea

    2018-01-01

    Over the course of decades, the incidence of brachial plexus birth injury (BPBI) has increased despite advances in healthcare which would seem to assist in decreasing the rate. The aim of this study is to identify previously unknown risk factors for BPBI and the risk factors with potential to guide preventative measures. A case control study of 52 mothers who had delivered a child with a BPBI injury and 132 mothers who had delivered without BPBI injury was conducted. Univariate, multivariable and logistic regressions identified risk factors and their combinations. The odds of BPBI were 2.5 times higher when oxytocin was used and 3.7 times higher when tachysystole occurred. The odds of BPBI injury are increased when tachysystole and oxytocin occur during the mother’s labor. Logistic regression identified a higher risk for BPBI when more than three of the following variables (>30 lbs gained during the pregnancy, stage 2 labor >61.5 min, mother’s age >26.4 years, tachysystole, or fetal malpresentation) were present in any combination. PMID:29596309

  13. Risk Factors for Brachial Plexus Birth Injury

    Directory of Open Access Journals (Sweden)

    Emily Louden

    2018-03-01

    Full Text Available Over the course of decades, the incidence of brachial plexus birth injury (BPBI has increased despite advances in healthcare which would seem to assist in decreasing the rate. The aim of this study is to identify previously unknown risk factors for BPBI and the risk factors with potential to guide preventative measures. A case control study of 52 mothers who had delivered a child with a BPBI injury and 132 mothers who had delivered without BPBI injury was conducted. Univariate, multivariable and logistic regressions identified risk factors and their combinations. The odds of BPBI were 2.5 times higher when oxytocin was used and 3.7 times higher when tachysystole occurred. The odds of BPBI injury are increased when tachysystole and oxytocin occur during the mother’s labor. Logistic regression identified a higher risk for BPBI when more than three of the following variables (>30 lbs gained during the pregnancy, stage 2 labor >61.5 min, mother’s age >26.4 years, tachysystole, or fetal malpresentation were present in any combination.

  14. Tools to support interpreting multiple regression in the face of multicollinearity.

    Science.gov (United States)

    Kraha, Amanda; Turner, Heather; Nimon, Kim; Zientek, Linda Reichwein; Henson, Robin K

    2012-01-01

    While multicollinearity may increase the difficulty of interpreting multiple regression (MR) results, it should not cause undue problems for the knowledgeable researcher. In the current paper, we argue that rather than using one technique to investigate regression results, researchers should consider multiple indices to understand the contributions that predictors make not only to a regression model, but to each other as well. Some of the techniques to interpret MR effects include, but are not limited to, correlation coefficients, beta weights, structure coefficients, all possible subsets regression, commonality coefficients, dominance weights, and relative importance weights. This article will review a set of techniques to interpret MR effects, identify the elements of the data on which the methods focus, and identify statistical software to support such analyses.

  15. Multiple regression and beyond an introduction to multiple regression and structural equation modeling

    CERN Document Server

    Keith, Timothy Z

    2014-01-01

    Multiple Regression and Beyond offers a conceptually oriented introduction to multiple regression (MR) analysis and structural equation modeling (SEM), along with analyses that flow naturally from those methods. By focusing on the concepts and purposes of MR and related methods, rather than the derivation and calculation of formulae, this book introduces material to students more clearly, and in a less threatening way. In addition to illuminating content necessary for coursework, the accessibility of this approach means students are more likely to be able to conduct research using MR or SEM--and more likely to use the methods wisely. Covers both MR and SEM, while explaining their relevance to one another Also includes path analysis, confirmatory factor analysis, and latent growth modeling Figures and tables throughout provide examples and illustrate key concepts and techniques For additional resources, please visit: http://tzkeith.com/.

  16. Robust median estimator in logisitc regression

    Czech Academy of Sciences Publication Activity Database

    Hobza, T.; Pardo, L.; Vajda, Igor

    2008-01-01

    Roč. 138, č. 12 (2008), s. 3822-3840 ISSN 0378-3758 R&D Projects: GA MŠk 1M0572 Grant - others:Instituto Nacional de Estadistica (ES) MPO FI - IM3/136; GA MŠk(CZ) MTM 2006-06872 Institutional research plan: CEZ:AV0Z10750506 Keywords : Logistic regression * Median * Robustness * Consistency and asymptotic normality * Morgenthaler * Bianco and Yohai * Croux and Hasellbroeck Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.679, year: 2008 http://library.utia.cas.cz/separaty/2008/SI/vajda-robust%20median%20estimator%20in%20logistic%20regression.pdf

  17. Regression: A Bibliography.

    Science.gov (United States)

    Pedrini, D. T.; Pedrini, Bonnie C.

    Regression, another mechanism studied by Sigmund Freud, has had much research, e.g., hypnotic regression, frustration regression, schizophrenic regression, and infra-human-animal regression (often directly related to fixation). Many investigators worked with hypnotic age regression, which has a long history, going back to Russian reflexologists.…

  18. Sexual harassment: identifying risk factors.

    Science.gov (United States)

    O'Hare, E A; O'Donohue, W

    1998-12-01

    A new model of the etiology of sexual harassment, the four-factor model, is presented and compared with several models of sexual harassment including the biological model, the organizational model, the sociocultural model, and the sex role spillover model. A number of risk factors associated with sexually harassing behavior are examined within the framework of the four-factor model of sexual harassment. These include characteristics of the work environment (e.g., sexist attitudes among co-workers, unprofessional work environment, skewed sex ratios in the workplace, knowledge of grievance procedures for sexual harassment incidents) as well as personal characteristics of the subject (e.g., physical attractiveness, job status, sex-role). Subjects were 266 university female faculty, staff, and students who completed the Sexual Experience Questionnaire to assess the experience of sexual harassment and a questionnaire designed to assess the risk factors stated above. Results indicated that the four-factor model is a better predictor of sexual harassment than the alternative models. The risk factors most strongly associated with sexual harassment were an unprofessional environment in the workplace, sexist atmosphere, and lack of knowledge about the organization's formal grievance procedures.

  19. Risk of Recurrence in Operated Parasagittal Meningiomas: A Logistic Binary Regression Model.

    Science.gov (United States)

    Escribano Mesa, José Alberto; Alonso Morillejo, Enrique; Parrón Carreño, Tesifón; Huete Allut, Antonio; Narro Donate, José María; Méndez Román, Paddy; Contreras Jiménez, Ascensión; Pedrero García, Francisco; Masegosa González, José

    2018-02-01

    Parasagittal meningiomas arise from the arachnoid cells of the angle formed between the superior sagittal sinus (SSS) and the brain convexity. In this retrospective study, we focused on factors that predict early recurrence and recurrence times. We reviewed 125 patients with parasagittal meningiomas operated from 1985 to 2014. We studied the following variables: age, sex, location, laterality, histology, surgeons, invasion of the SSS, Simpson removal grade, follow-up time, angiography, embolization, radiotherapy, recurrence and recurrence time, reoperation, neurologic deficit, degree of dependency, and patient status at the end of follow-up. Patients ranged in age from 26 to 81 years (mean 57.86 years; median 60 years). There were 44 men (35.2%) and 81 women (64.8%). There were 57 patients with neurologic deficits (45.2%). The most common presenting symptom was motor deficit. World Health Organization grade I tumors were identified in 104 patients (84.6%), and the majority were the meningothelial type. Recurrence was detected in 34 cases. Time of recurrence was 9 to 336 months (mean: 84.4 months; median: 79.5 months). Male sex was identified as an independent risk for recurrence with relative risk 2.7 (95% confidence interval 1.21-6.15), P = 0.014. Kaplan-Meier curves for recurrence had statistically significant differences depending on sex, age, histologic type, and World Health Organization histologic grade. A binary logistic regression was made with the Hosmer-Lemeshow test with P > 0.05; sex, tumor size, and histologic type were used in this model. Male sex is an independent risk factor for recurrence that, associated with other factors such tumor size and histologic type, explains 74.5% of all cases in a binary regression model. Copyright © 2017 Elsevier Inc. All rights reserved.

  20. Using reduced rank regression methods to identify dietary patterns associated with obesity: a cross-country study among European and Australian adolescents.

    Science.gov (United States)

    Huybrechts, Inge; Lioret, Sandrine; Mouratidou, Theodora; Gunter, Marc J; Manios, Yannis; Kersting, Mathilde; Gottrand, Frederic; Kafatos, Anthony; De Henauw, Stefaan; Cuenca-García, Magdalena; Widhalm, Kurt; Gonzales-Gross, Marcela; Molnar, Denes; Moreno, Luis A; McNaughton, Sarah A

    2017-01-01

    This study aims to examine repeatability of reduced rank regression (RRR) methods in calculating dietary patterns (DP) and cross-sectional associations with overweight (OW)/obesity across European and Australian samples of adolescents. Data from two cross-sectional surveys in Europe (2006/2007 Healthy Lifestyle in Europe by Nutrition in Adolescence study, including 1954 adolescents, 12-17 years) and Australia (2007 National Children's Nutrition and Physical Activity Survey, including 1498 adolescents, 12-16 years) were used. Dietary intake was measured using two non-consecutive, 24-h recalls. RRR was used to identify DP using dietary energy density, fibre density and percentage of energy intake from fat as the intermediate variables. Associations between DP scores and body mass/fat were examined using multivariable linear and logistic regression as appropriate, stratified by sex. The first DP extracted (labelled 'energy dense, high fat, low fibre') explained 47 and 31 % of the response variation in Australian and European adolescents, respectively. It was similar for European and Australian adolescents and characterised by higher consumption of biscuits/cakes, chocolate/confectionery, crisps/savoury snacks, sugar-sweetened beverages, and lower consumption of yogurt, high-fibre bread, vegetables and fresh fruit. DP scores were inversely associated with BMI z-scores in Australian adolescent boys and borderline inverse in European adolescent boys (so as with %BF). Similarly, a lower likelihood for OW in boys was observed with higher DP scores in both surveys. No such relationships were observed in adolescent girls. In conclusion, the DP identified in this cross-country study was comparable for European and Australian adolescents, demonstrating robustness of the RRR method in calculating DP among populations. However, longitudinal designs are more relevant when studying diet-obesity associations, to prevent reverse causality.

  1. Identifying the readiness of patients in implementing telemedicine in northern Louisiana for an oncology practice.

    Science.gov (United States)

    Gurupur, Varadraj; Shettian, Kruparaj; Xu, Peixin; Hines, Scott; Desselles, Mitzi; Dhawan, Manish; Wan, Thomas Th; Raffenaud, Amanda; Anderson, Lindsey

    2017-09-01

    This study identified the readiness factors that may create challenges in the use of telemedicine among patients in northern Louisiana with cancer. To identify these readiness factors, the team of investigators developed 19 survey questions that were provided to the patients or to their caregivers. The team collected responses from 147 respondents from rural and urban residential backgrounds. These responses were used to identify the individuals' readiness for utilising telemedicine through factor analysis, Cronbach's alpha reliability test, analysis of variance and ordinary least squares regression. The analysis results indicated that the favourable factor (positive readiness item) had a mean value of 3.47, whereas the unfavourable factor (negative readiness item) had a mean value of 2.76. Cronbach's alpha reliability test provided an alpha value of 0.79. Overall, our study indicated a positive attitude towards the use of telemedicine in northern Louisiana.

  2. Five cases of caudal regression with an aberrant abdominal umbilical artery: Further support for a caudal regression-sirenomelia spectrum.

    Science.gov (United States)

    Duesterhoeft, Sara M; Ernst, Linda M; Siebert, Joseph R; Kapur, Raj P

    2007-12-15

    Sirenomelia and caudal regression have sparked centuries of interest and recent debate regarding their classification and pathogenetic relationship. Specific anomalies are common to both conditions, but aside from fusion of the lower extremities, an aberrant abdominal umbilical artery ("persistent vitelline artery") has been invoked as the chief anatomic finding that distinguishes sirenomelia from caudal regression. This observation is important from a pathogenetic viewpoint, in that diversion of blood away from the caudal portion of the embryo through the abdominal umbilical artery ("vascular steal") has been proposed as the primary mechanism leading to sirenomelia. In contrast, caudal regression is hypothesized to arise from primary deficiency of caudal mesoderm. We present five cases of caudal regression that exhibit an aberrant abdominal umbilical artery similar to that typically associated with sirenomelia. Review of the literature identified four similar cases. Collectively, the series lends support for a caudal regression-sirenomelia spectrum with a common pathogenetic basis and suggests that abnormal umbilical arterial anatomy may be the consequence, rather than the cause, of deficient caudal mesoderm. (c) 2007 Wiley-Liss, Inc.

  3. A flexible fuzzy regression algorithm for forecasting oil consumption estimation

    International Nuclear Information System (INIS)

    Azadeh, A.; Khakestani, M.; Saberi, M.

    2009-01-01

    Oil consumption plays a vital role in socio-economic development of most countries. This study presents a flexible fuzzy regression algorithm for forecasting oil consumption based on standard economic indicators. The standard indicators are annual population, cost of crude oil import, gross domestic production (GDP) and annual oil production in the last period. The proposed algorithm uses analysis of variance (ANOVA) to select either fuzzy regression or conventional regression for future demand estimation. The significance of the proposed algorithm is three fold. First, it is flexible and identifies the best model based on the results of ANOVA and minimum absolute percentage error (MAPE), whereas previous studies consider the best fitted fuzzy regression model based on MAPE or other relative error results. Second, the proposed model may identify conventional regression as the best model for future oil consumption forecasting because of its dynamic structure, whereas previous studies assume that fuzzy regression always provide the best solutions and estimation. Third, it utilizes the most standard independent variables for the regression models. To show the applicability and superiority of the proposed flexible fuzzy regression algorithm the data for oil consumption in Canada, United States, Japan and Australia from 1990 to 2005 are used. The results show that the flexible algorithm provides accurate solution for oil consumption estimation problem. The algorithm may be used by policy makers to accurately foresee the behavior of oil consumption in various regions.

  4. Identifying Generalizable Image Segmentation Parameters for Urban Land Cover Mapping through Meta-Analysis and Regression Tree Modeling

    Directory of Open Access Journals (Sweden)

    Brian A. Johnson

    2018-01-01

    Full Text Available The advent of very high resolution (VHR satellite imagery and the development of Geographic Object-Based Image Analysis (GEOBIA have led to many new opportunities for fine-scale land cover mapping, especially in urban areas. Image segmentation is an important step in the GEOBIA framework, so great time/effort is often spent to ensure that computer-generated image segments closely match real-world objects of interest. In the remote sensing community, segmentation is frequently performed using the multiresolution segmentation (MRS algorithm, which is tuned through three user-defined parameters (the scale, shape/color, and compactness/smoothness parameters. The scale parameter (SP is the most important parameter and governs the average size of generated image segments. Existing automatic methods to determine suitable SPs for segmentation are scene-specific and often computationally intensive, so an approach to estimating appropriate SPs that is generalizable (i.e., not scene-specific could speed up the GEOBIA workflow considerably. In this study, we attempted to identify generalizable SPs for five common urban land cover types (buildings, vegetation, roads, bare soil, and water through meta-analysis and nonlinear regression tree (RT modeling. First, we performed a literature search of recent studies that employed GEOBIA for urban land cover mapping and extracted the MRS parameters used, the image properties (i.e., spatial and radiometric resolutions, and the land cover classes mapped. Using this data extracted from the literature, we constructed RT models for each land cover class to predict suitable SP values based on the: image spatial resolution, image radiometric resolution, shape/color parameter, and compactness/smoothness parameter. Based on a visual and quantitative analysis of results, we found that for all land cover classes except water, relatively accurate SPs could be identified using our RT modeling results. The main advantage of our

  5. Advanced statistics: linear regression, part I: simple linear regression.

    Science.gov (United States)

    Marill, Keith A

    2004-01-01

    Simple linear regression is a mathematical technique used to model the relationship between a single independent predictor variable and a single dependent outcome variable. In this, the first of a two-part series exploring concepts in linear regression analysis, the four fundamental assumptions and the mechanics of simple linear regression are reviewed. The most common technique used to derive the regression line, the method of least squares, is described. The reader will be acquainted with other important concepts in simple linear regression, including: variable transformations, dummy variables, relationship to inference testing, and leverage. Simplified clinical examples with small datasets and graphic models are used to illustrate the points. This will provide a foundation for the second article in this series: a discussion of multiple linear regression, in which there are multiple predictor variables.

  6. Boosting structured additive quantile regression for longitudinal childhood obesity data.

    Science.gov (United States)

    Fenske, Nora; Fahrmeir, Ludwig; Hothorn, Torsten; Rzehak, Peter; Höhle, Michael

    2013-07-25

    Childhood obesity and the investigation of its risk factors has become an important public health issue. Our work is based on and motivated by a German longitudinal study including 2,226 children with up to ten measurements on their body mass index (BMI) and risk factors from birth to the age of 10 years. We introduce boosting of structured additive quantile regression as a novel distribution-free approach for longitudinal quantile regression. The quantile-specific predictors of our model include conventional linear population effects, smooth nonlinear functional effects, varying-coefficient terms, and individual-specific effects, such as intercepts and slopes. Estimation is based on boosting, a computer intensive inference method for highly complex models. We propose a component-wise functional gradient descent boosting algorithm that allows for penalized estimation of the large variety of different effects, particularly leading to individual-specific effects shrunken toward zero. This concept allows us to flexibly estimate the nonlinear age curves of upper quantiles of the BMI distribution, both on population and on individual-specific level, adjusted for further risk factors and to detect age-varying effects of categorical risk factors. Our model approach can be regarded as the quantile regression analog of Gaussian additive mixed models (or structured additive mean regression models), and we compare both model classes with respect to our obesity data.

  7. Directional quantile regression in R

    Czech Academy of Sciences Publication Activity Database

    Boček, Pavel; Šiman, Miroslav

    2017-01-01

    Roč. 53, č. 3 (2017), s. 480-492 ISSN 0023-5954 R&D Projects: GA ČR GA14-07234S Institutional support: RVO:67985556 Keywords : multivariate quantile * regression quantile * halfspace depth * depth contour Subject RIV: BD - Theory of Information OBOR OECD: Applied mathematics Impact factor: 0.379, year: 2016 http://library.utia.cas.cz/separaty/2017/SI/bocek-0476587.pdf

  8. Risk Factors for Internet Gaming Disorder: Psychological Factors and Internet Gaming Characteristics.

    Science.gov (United States)

    Rho, Mi Jung; Lee, Hyeseon; Lee, Taek-Ho; Cho, Hyun; Jung, Dong Jin; Kim, Dai-Jin; Choi, In Young

    2017-12-27

    Background : Understanding the risk factors associated with Internet gaming disorder (IGD) is important to predict and diagnose the condition. The purpose of this study is to identify risk factors that predict IGD based on psychological factors and Internet gaming characteristics; Methods : Online surveys were conducted between 26 November and 26 December 2014. There were 3568 Korean Internet game users among a total of 5003 respondents. We identified 481 IGD gamers and 3087 normal Internet gamers, based on Diagnostic and Statistical Manual for Mental Disorders (DSM-5) criteria. Logistic regression analysis was applied to identify significant risk factors for IGD; Results : The following eight risk factors were found to be significantly associated with IGD: functional and dysfunctional impulsivity (odds ratio: 1.138), belief self-control (1.034), anxiety (1.086), pursuit of desired appetitive goals (1.105), money spent on gaming (1.005), weekday game time (1.081), offline community meeting attendance (2.060), and game community membership (1.393; p < 0.05 for all eight risk factors); Conclusions : These risk factors allow for the prediction and diagnosis of IGD. In the future, these risk factors could also be used to inform clinical services for IGD diagnosis and treatment.

  9. Advanced colorectal neoplasia risk stratification by penalized logistic regression.

    Science.gov (United States)

    Lin, Yunzhi; Yu, Menggang; Wang, Sijian; Chappell, Richard; Imperiale, Thomas F

    2016-08-01

    Colorectal cancer is the second leading cause of death from cancer in the United States. To facilitate the efficiency of colorectal cancer screening, there is a need to stratify risk for colorectal cancer among the 90% of US residents who are considered "average risk." In this article, we investigate such risk stratification rules for advanced colorectal neoplasia (colorectal cancer and advanced, precancerous polyps). We use a recently completed large cohort study of subjects who underwent a first screening colonoscopy. Logistic regression models have been used in the literature to estimate the risk of advanced colorectal neoplasia based on quantifiable risk factors. However, logistic regression may be prone to overfitting and instability in variable selection. Since most of the risk factors in our study have several categories, it was tempting to collapse these categories into fewer risk groups. We propose a penalized logistic regression method that automatically and simultaneously selects variables, groups categories, and estimates their coefficients by penalizing the [Formula: see text]-norm of both the coefficients and their differences. Hence, it encourages sparsity in the categories, i.e. grouping of the categories, and sparsity in the variables, i.e. variable selection. We apply the penalized logistic regression method to our data. The important variables are selected, with close categories simultaneously grouped, by penalized regression models with and without the interactions terms. The models are validated with 10-fold cross-validation. The receiver operating characteristic curves of the penalized regression models dominate the receiver operating characteristic curve of naive logistic regressions, indicating a superior discriminative performance. © The Author(s) 2013.

  10. Geographically weighted regression model on poverty indicator

    Science.gov (United States)

    Slamet, I.; Nugroho, N. F. T. A.; Muslich

    2017-12-01

    In this research, we applied geographically weighted regression (GWR) for analyzing the poverty in Central Java. We consider Gaussian Kernel as weighted function. The GWR uses the diagonal matrix resulted from calculating kernel Gaussian function as a weighted function in the regression model. The kernel weights is used to handle spatial effects on the data so that a model can be obtained for each location. The purpose of this paper is to model of poverty percentage data in Central Java province using GWR with Gaussian kernel weighted function and to determine the influencing factors in each regency/city in Central Java province. Based on the research, we obtained geographically weighted regression model with Gaussian kernel weighted function on poverty percentage data in Central Java province. We found that percentage of population working as farmers, population growth rate, percentage of households with regular sanitation, and BPJS beneficiaries are the variables that affect the percentage of poverty in Central Java province. In this research, we found the determination coefficient R2 are 68.64%. There are two categories of district which are influenced by different of significance factors.

  11. Reproductive risk factors assessment for anaemia among pregnant women in India using a multinomial logistic regression model.

    Science.gov (United States)

    Perumal, Vanamail

    2014-07-01

    To assess reproductive risk factors for anaemia among pregnant women in urban and rural areas of India. The International Institute of Population Sciences, India, carried out third National Family Health Survey in 2005-2006 to estimate a key indicator from a sample of ever-married women in the reproductive age group 15-49 years. Data on various dimensions were collected using a structured questionnaire, and anaemia was measured using a portable HemoCue instrument. Anaemia prevalence among pregnant women was compared between rural and urban areas using chi-square test and odds ratio. Multinomial logistic regression analysis was used to determine risk factors. Anaemia prevalence was assessed among 3355 pregnant women from rural areas and 1962 pregnant women from urban areas. Moderate-to-severe anaemia in rural areas (32.4%) is significantly more common than in urban areas (27.3%) with an excess risk of 30%. Gestational age specific prevalence of anaemia significantly increases in rural areas after 6 months. Pregnancy duration is a significant risk factor in both urban and rural areas. In rural areas, increasing age at marriage and mass media exposure are significant protective factors of anaemia. However, more births in the last five years, alcohol consumption and smoking habits are significant risk factors. In rural areas, various reproductive factors and lifestyle characteristics constitute significant risk factors for moderate-to-severe anaemia. Therefore, intensive health education on reproductive practices and the impact of lifestyle characteristics are warranted to reduce anaemia prevalence. © 2014 John Wiley & Sons Ltd.

  12. IDENTIFYING MOTIVATION FACTOR INVOLVEMENT OF SARAWAK MALAY WOMEN ENTREPRENEUR

    Directory of Open Access Journals (Sweden)

    Masyantie Mohamad

    2016-03-01

    Full Text Available Sarawak multilayered cake among Sarawak product signature famous among the local as well as international tourist visiting Sarawak. In fact, Sarawak Malay women entrepreneurs have become very necessary players in the entrepreneurial field specifically in this cottage industries from the early introduction of this business, they have facing various problem in this businesses. Thus, this research aims to build an understanding of motivational factor that encourage Sarawak Malay women entrepreneurial experiences especially in multilayered cake businesses. Using qualitative methods, this research aims to identify the entrepreneurial motivations factors; with regards to start-up motivation by Sarawak Malay women. The finding shows that the motivations that influence Malay women within Kuching, Sarawak areas to start and grow their business are involve self-driven and context driven that motivate them involve in multilayered cakes businesses.

  13. Modeling the number of car theft using Poisson regression

    Science.gov (United States)

    Zulkifli, Malina; Ling, Agnes Beh Yen; Kasim, Maznah Mat; Ismail, Noriszura

    2016-10-01

    Regression analysis is the most popular statistical methods used to express the relationship between the variables of response with the covariates. The aim of this paper is to evaluate the factors that influence the number of car theft using Poisson regression model. This paper will focus on the number of car thefts that occurred in districts in Peninsular Malaysia. There are two groups of factor that have been considered, namely district descriptive factors and socio and demographic factors. The result of the study showed that Bumiputera composition, Chinese composition, Other ethnic composition, foreign migration, number of residence with the age between 25 to 64, number of employed person and number of unemployed person are the most influence factors that affect the car theft cases. These information are very useful for the law enforcement department, insurance company and car owners in order to reduce and limiting the car theft cases in Peninsular Malaysia.

  14. Identifying protective and risk factors for injurious falls in patients hospitalized for acute care: a retrospective case-control study

    Directory of Open Access Journals (Sweden)

    Emmanuel Aryee

    2017-11-01

    Full Text Available Abstract Background Admitted patients who fall and injure themselves during an acute hospitalization incur increased costs, morbidity, and mortality, but little research has been conducted on identifying inpatients at high risk to injure themselves in a fall. Falls risk assessment tools have been unsuccessful due to their low positive predictive value when applied broadly to entire hospital populations. We aimed to identify variables associated with the risk of or protection against injurious fall in the inpatient setting. We also aimed to test the variables in the ABCs mnemonic (Age > 85, Bones-orthopedic conditions, anti-Coagulation and recent surgery for correlation with injurious fall. Methods We performed a retrospective case-control study at an academic tertiary care center comparing admitted patients with injurious fall to admitted patients without fall. We collected data on the demographics, medical and fall history, outcomes, and discharge disposition of injured fallers and control patients. We performed multivariate analysis of potential risk factors for injurious fall with logistic regression to calculate adjusted odds ratios. Results We identified 117 injured fallers and 320 controls. There were no differences in age, anti-coagulation use or fragility fractures between cases and controls. In multivariate analysis, recent surgery (OR 0.46, p = 0.003 was protective; joint replacement (OR 5.58, P = 0.002, psychotropic agents (OR 2.23, p = 0.001, the male sex (OR 2.08, p = 0.003 and history of fall (OR 2.08, p = 0.02 were significantly associated with injurious fall. Conclusion In this study, the variables in the ABCs parameters were among the variables not useful for identifying inpatients at risk of injuring themselves in a fall, while other non-ABCs variables demonstrated a significant association with injurious fall. Recent surgery was a protective factor, and practices around the care of surgical patients could be

  15. Regression analysis of nuclear plant capacity factors

    International Nuclear Information System (INIS)

    Stocks, K.J.; Faulkner, J.I.

    1980-07-01

    Operating data on all commercial nuclear power plants of the PWR, HWR, BWR and GCR types in the Western World are analysed statistically to determine whether the explanatory variables size, year of operation, vintage and reactor supplier are significant in accounting for the variation in capacity factor. The results are compared with a number of previous studies which analysed only United States reactors. The possibility of specification errors affecting the results is also examined. Although, in general, the variables considered are statistically significant, they explain only a small portion of the variation in the capacity factor. The equations thus obtained should certainly not be used to predict the lifetime performance of future large reactors

  16. Regression dilution bias: tools for correction methods and sample size calculation.

    Science.gov (United States)

    Berglund, Lars

    2012-08-01

    Random errors in measurement of a risk factor will introduce downward bias of an estimated association to a disease or a disease marker. This phenomenon is called regression dilution bias. A bias correction may be made with data from a validity study or a reliability study. In this article we give a non-technical description of designs of reliability studies with emphasis on selection of individuals for a repeated measurement, assumptions of measurement error models, and correction methods for the slope in a simple linear regression model where the dependent variable is a continuous variable. Also, we describe situations where correction for regression dilution bias is not appropriate. The methods are illustrated with the association between insulin sensitivity measured with the euglycaemic insulin clamp technique and fasting insulin, where measurement of the latter variable carries noticeable random error. We provide software tools for estimation of a corrected slope in a simple linear regression model assuming data for a continuous dependent variable and a continuous risk factor from a main study and an additional measurement of the risk factor in a reliability study. Also, we supply programs for estimation of the number of individuals needed in the reliability study and for choice of its design. Our conclusion is that correction for regression dilution bias is seldom applied in epidemiological studies. This may cause important effects of risk factors with large measurement errors to be neglected.

  17. Genome-wide association analysis of pain severity in dysmenorrhea identifies association at chromosome 1p13.2, near the nerve growth factor locus.

    Science.gov (United States)

    Jones, Amy V; Hockley, James R F; Hyde, Craig; Gorman, Donal; Sredic-Rhodes, Ana; Bilsland, James; McMurray, Gordon; Furlotte, Nicholas A; Hu, Youna; Hinds, David A; Cox, Peter J; Scollen, Serena

    2016-11-01

    Dysmenorrhea is a common chronic pelvic pain syndrome affecting women of childbearing potential. Family studies suggest that genetic background influences the severity of dysmenorrhea, but genetic predisposition and molecular mechanisms underlying dysmenorrhea are not understood. In this study, we conduct the first genome-wide association study to identify genetic factors associated with dysmenorrhea pain severity. A cohort of females of European descent (n = 11,891) aged 18 to 45 years rated their average dysmenorrhea pain severity. We used a linear regression model adjusting for age and body mass index, identifying one genome-wide significant (P dysmenorrhea pain were more likely to report being positive for endometriosis, polycystic ovarian syndrome, depression, and other psychiatric disorders. Our results indicate that dysmenorrhea pain severity is partly genetically determined. NGF already has an established role in chronic pain disorders, and our findings suggest that NGF may be an important mediator for gynaecological/pelvic pain in the viscera.

  18. Entrepreneurial intention modeling using hierarchical multiple regression

    Directory of Open Access Journals (Sweden)

    Marina Jeger

    2014-12-01

    Full Text Available The goal of this study is to identify the contribution of effectuation dimensions to the predictive power of the entrepreneurial intention model over and above that which can be accounted for by other predictors selected and confirmed in previous studies. As is often the case in social and behavioral studies, some variables are likely to be highly correlated with each other. Therefore, the relative amount of variance in the criterion variable explained by each of the predictors depends on several factors such as the order of variable entry and sample specifics. The results show the modest predictive power of two dimensions of effectuation prior to the introduction of the theory of planned behavior elements. The article highlights the main advantages of applying hierarchical regression in social sciences as well as in the specific context of entrepreneurial intention formation, and addresses some of the potential pitfalls that this type of analysis entails.

  19. Predictive factors of esophageal stenosis associated with tumor regression in radiation therapy for locally advanced esophageal cancer

    International Nuclear Information System (INIS)

    Atsumi, Kazushige; Shioyama, Yoshiyuki; Nakamura, Katsumasa

    2010-01-01

    The purpose of this retrospective study was to clarify the predictive factors correlated with esophageal stenosis within three months after radiation therapy for locally advanced esophageal cancer. We enrolled 47 patients with advanced esophageal cancer with T2-4 and stage II-III who were treated with definitive radiation therapy and achieving complete response of primary lesion at Kyushu University Hospital between January 1998 and December 2005. Esophagography was performed for all patients before treatment and within three months after completion of the radiation therapy, the esophageal stenotic ratio was evaluated. The stenotic ratio was used to define four levels of stenosis: stenosis level 1, stenotic ratio of 0-25%; 2, 25-50%; 3, 50-75%; 4, 75-100%. We then estimated the correlation between the esophageal stenosis level after radiation therapy and each of numerous factors. The numbers and total percentages of patients at each stenosis level were as follows: level 1: n=14 (30%); level 2: 8 (17%); level 3: 14 (30%); and level 4: 11 (23%). Esophageal stenosis in the case of full circumference involvement tended to be more severe and more frequent. Increases in wall thickness tended to be associated with increases in esophageal stenosis severity and frequency. The extent of involved circumference and wall thickness of tumor region were significantly correlated with esophageal stenosis associated with tumor regression in radiation therapy (p=0.0006, p=0.005). For predicting the possibility of esophageal stenosis with tumor regression within three months in radiation therapy, the extent of involved circumference and esophageal wall thickness of the tumor region may be useful. (author)

  20. No rationale for 1 variable per 10 events criterion for binary logistic regression analysis.

    Science.gov (United States)

    van Smeden, Maarten; de Groot, Joris A H; Moons, Karel G M; Collins, Gary S; Altman, Douglas G; Eijkemans, Marinus J C; Reitsma, Johannes B

    2016-11-24

    Ten events per variable (EPV) is a widely advocated minimal criterion for sample size considerations in logistic regression analysis. Of three previous simulation studies that examined this minimal EPV criterion only one supports the use of a minimum of 10 EPV. In this paper, we examine the reasons for substantial differences between these extensive simulation studies. The current study uses Monte Carlo simulations to evaluate small sample bias, coverage of confidence intervals and mean square error of logit coefficients. Logistic regression models fitted by maximum likelihood and a modified estimation procedure, known as Firth's correction, are compared. The results show that besides EPV, the problems associated with low EPV depend on other factors such as the total sample size. It is also demonstrated that simulation results can be dominated by even a few simulated data sets for which the prediction of the outcome by the covariates is perfect ('separation'). We reveal that different approaches for identifying and handling separation leads to substantially different simulation results. We further show that Firth's correction can be used to improve the accuracy of regression coefficients and alleviate the problems associated with separation. The current evidence supporting EPV rules for binary logistic regression is weak. Given our findings, there is an urgent need for new research to provide guidance for supporting sample size considerations for binary logistic regression analysis.

  1. Regression Analysis of Combined Gene Expression Regulation in Acute Myeloid Leukemia

    Science.gov (United States)

    Li, Yue; Liang, Minggao; Zhang, Zhaolei

    2014-01-01

    Gene expression is a combinatorial function of genetic/epigenetic factors such as copy number variation (CNV), DNA methylation (DM), transcription factors (TF) occupancy, and microRNA (miRNA) post-transcriptional regulation. At the maturity of microarray/sequencing technologies, large amounts of data measuring the genome-wide signals of those factors became available from Encyclopedia of DNA Elements (ENCODE) and The Cancer Genome Atlas (TCGA). However, there is a lack of an integrative model to take full advantage of these rich yet heterogeneous data. To this end, we developed RACER (Regression Analysis of Combined Expression Regulation), which fits the mRNA expression as response using as explanatory variables, the TF data from ENCODE, and CNV, DM, miRNA expression signals from TCGA. Briefly, RACER first infers the sample-specific regulatory activities by TFs and miRNAs, which are then used as inputs to infer specific TF/miRNA-gene interactions. Such a two-stage regression framework circumvents a common difficulty in integrating ENCODE data measured in generic cell-line with the sample-specific TCGA measurements. As a case study, we integrated Acute Myeloid Leukemia (AML) data from TCGA and the related TF binding data measured in K562 from ENCODE. As a proof-of-concept, we first verified our model formalism by 10-fold cross-validation on predicting gene expression. We next evaluated RACER on recovering known regulatory interactions, and demonstrated its superior statistical power over existing methods in detecting known miRNA/TF targets. Additionally, we developed a feature selection procedure, which identified 18 regulators, whose activities clustered consistently with cytogenetic risk groups. One of the selected regulators is miR-548p, whose inferred targets were significantly enriched for leukemia-related pathway, implicating its novel role in AML pathogenesis. Moreover, survival analysis using the inferred activities identified C-Fos as a potential AML

  2. Multiple Regression Analysis of mRNA-miRNA Associations in Colorectal Cancer Pathway

    Science.gov (United States)

    Wang, Fengfeng; Wong, S. C. Cesar; Chan, Lawrence W. C.; Cho, William C. S.; Yip, S. P.; Yung, Benjamin Y. M.

    2014-01-01

    Background. MicroRNA (miRNA) is a short and endogenous RNA molecule that regulates posttranscriptional gene expression. It is an important factor for tumorigenesis of colorectal cancer (CRC), and a potential biomarker for diagnosis, prognosis, and therapy of CRC. Our objective is to identify the related miRNAs and their associations with genes frequently involved in CRC microsatellite instability (MSI) and chromosomal instability (CIN) signaling pathways. Results. A regression model was adopted to identify the significantly associated miRNAs targeting a set of candidate genes frequently involved in colorectal cancer MSI and CIN pathways. Multiple linear regression analysis was used to construct the model and find the significant mRNA-miRNA associations. We identified three significantly associated mRNA-miRNA pairs: BCL2 was positively associated with miR-16 and SMAD4 was positively associated with miR-567 in the CRC tissue, while MSH6 was positively associated with miR-142-5p in the normal tissue. As for the whole model, BCL2 and SMAD4 models were not significant, and MSH6 model was significant. The significant associations were different in the normal and the CRC tissues. Conclusion. Our results have laid down a solid foundation in exploration of novel CRC mechanisms, and identification of miRNA roles as oncomirs or tumor suppressor mirs in CRC. PMID:24895601

  3. Ordinary least square regression, orthogonal regression, geometric mean regression and their applications in aerosol science

    International Nuclear Information System (INIS)

    Leng Ling; Zhang Tianyi; Kleinman, Lawrence; Zhu Wei

    2007-01-01

    Regression analysis, especially the ordinary least squares method which assumes that errors are confined to the dependent variable, has seen a fair share of its applications in aerosol science. The ordinary least squares approach, however, could be problematic due to the fact that atmospheric data often does not lend itself to calling one variable independent and the other dependent. Errors often exist for both measurements. In this work, we examine two regression approaches available to accommodate this situation. They are orthogonal regression and geometric mean regression. Comparisons are made theoretically as well as numerically through an aerosol study examining whether the ratio of organic aerosol to CO would change with age

  4. Refractive regression after laser in situ keratomileusis.

    Science.gov (United States)

    Yan, Mabel K; Chang, John Sm; Chan, Tommy Cy

    2018-04-26

    Uncorrected refractive errors are a leading cause of visual impairment across the world. In today's society, laser in situ keratomileusis (LASIK) has become the most commonly performed surgical procedure to correct refractive errors. However, regression of the initially achieved refractive correction has been a widely observed phenomenon following LASIK since its inception more than two decades ago. Despite technological advances in laser refractive surgery and various proposed management strategies, post-LASIK regression is still frequently observed and has significant implications for the long-term visual performance and quality of life of patients. This review explores the mechanism of refractive regression after both myopic and hyperopic LASIK, predisposing risk factors and its clinical course. In addition, current preventative strategies and therapies are also reviewed. © 2018 Royal Australian and New Zealand College of Ophthalmologists.

  5. Estimating the Influence of Accident Related Factors on Motorcycle Fatal Accidents using Logistic Regression (Case Study: Denpasar-Bali

    Directory of Open Access Journals (Sweden)

    Wedagama D.M.P.

    2010-01-01

    Full Text Available In Denpasar the capital of Bali Province, motorcycle accident contributes to about 80% of total road accidents. Out of those motorcycle accidents, 32% are fatal accidents. This study investigates the influence of accident related factors on motorcycle fatal accidents in the city of Denpasar during period 2006-2008 using a logistic regression model. The study found that the fatality of collision with pedestrians and right angle accidents were respectively about 0.44 and 0.40 times lower than collision with other vehicles and accidents due to other factors. In contrast, the odds that a motorcycle accident will be fatal due to collision with heavy and light vehicles were 1.67 times more likely than with other motorcycles. Collision with pedestrians, right angle accidents, and heavy and light vehicles were respectively accounted for 31%, 29%, and 63% of motorcycle fatal accidents.

  6. Risk factors for subclinical intramammary infection in dairy goats in two longitudinal field studies evaluated by Bayesian logistic regression

    DEFF Research Database (Denmark)

    Koop, Gerrit; Collar, Carol A.; Toft, Nils

    2013-01-01

    Identification of risk factors for subclinical intramammary infections (IMI) in dairy goats should contribute to improved udder health. Intramammary infection may be diagnosed by bacteriological culture or by somatic cell count (SCC) of a milk sample. Both bacteriological culture and SCC are impe......Identification of risk factors for subclinical intramammary infections (IMI) in dairy goats should contribute to improved udder health. Intramammary infection may be diagnosed by bacteriological culture or by somatic cell count (SCC) of a milk sample. Both bacteriological culture and SCC...... are imperfect tests, particularly lacking sensitivity, which leads to misclassification and thus to biased estimates of odds ratios in risk factor studies. The objective of this study was to evaluate risk factors for the true (latent) IMI status of major pathogens in dairy goats. We used Bayesian logistic...... regression models that accounted for imperfect measurement of IMI by both culture and SCC. Udder half milk samples were collected from 530 Dutch and 438 California dairy goats in 10 herds on 3 occasions during lactation. Udder halves were classified as positive or negative for isolation of a major pathogen...

  7. Polynomial regression analysis and significance test of the regression function

    International Nuclear Information System (INIS)

    Gao Zhengming; Zhao Juan; He Shengping

    2012-01-01

    In order to analyze the decay heating power of a certain radioactive isotope per kilogram with polynomial regression method, the paper firstly demonstrated the broad usage of polynomial function and deduced its parameters with ordinary least squares estimate. Then significance test method of polynomial regression function is derived considering the similarity between the polynomial regression model and the multivariable linear regression model. Finally, polynomial regression analysis and significance test of the polynomial function are done to the decay heating power of the iso tope per kilogram in accord with the authors' real work. (authors)

  8. Latinos in science: Identifying factors that influence the low percentage of Latino representation in the sciences

    Science.gov (United States)

    Miranda, Susan Jennifer

    A mixed methods approach was used to identify factors that influence the underrepresentation of Latinos in the domain of science. The researcher investigated the role of family influences, academic preparation, and personal motivations to determine science-related career choices by Latinos. Binary logistic regression analyses were conducted using information from Latinos gathered from the National Education Longitudinal Study of 1988 (NELS: 88) administered by the National Center for Education Statistics. For the present study, data were analyzed using participants' responses as high school seniors, college students, and post-baccalaureates. Students responded to questions on school, work, parental academic influences, personal aspirations, and self-perception. To provide more insight into the experiences of Latinos in science and support the statistical analyses, nine students majoring in science in a private, urban university located in the northeastern part of the country were interviewed. Eleven variables related to parents' academic support and students' perceptions of parental support were taken together as predictors for two separate criteria from the survey. These results identified parents' level of education and the importance of academics to parents in their teen's college choice as significant predictors in determining college major in science. When the criterion was degree in science, the significant predictor was the frequency parents contacted high school as volunteers. Student interviews supported this information, demonstrating the importance of parental support in attaining a degree in science. Academic preparation was also analyzed. Students' reasons for taking science classes in high school was a significant predictor for science major; significant predictors for science degree were the emphasis placed on objectives in math and science classes and number of courses in biology and physics. Student interviews supported this information and

  9. Identifying the necessary and sufficient number of risk factors for predicting academic failure.

    Science.gov (United States)

    Lucio, Robert; Hunt, Elizabeth; Bornovalova, Marina

    2012-03-01

    Identifying the point at which individuals become at risk for academic failure (grade point average [GPA] academic success or failure. This study focused on 12 school-related factors. Using a thorough 5-step process, we identified which unique risk factors place one at risk for academic failure. Academic engagement, academic expectations, academic self-efficacy, homework completion, school relevance, school safety, teacher relationships (positive relationship), grade retention, school mobility, and school misbehaviors (negative relationship) were uniquely related to GPA even after controlling for all relevant covariates. Next, a receiver operating characteristic curve was used to determine a cutoff point for determining how many risk factors predict academic failure (GPA academic failure, which provides a way for early identification of individuals who are at risk. Further implications of these findings are discussed. PsycINFO Database Record (c) 2012 APA, all rights reserved.

  10. Orthopedic Surgery among Patients with Rheumatoid Arthritis: A Population-based study to Identify Risk factors, Sex differences, and Time trends.

    Science.gov (United States)

    Richter, Michael; Crowson, Cynthia S; Matteson, Eric L; Makol, Ashima

    2017-12-20

    To identify risk factors for large joint (LJS) versus small joint surgery (SJS) in rheumatoid arthritis (RA) and evaluate trends in surgery rates over time. A retrospective medical record review was performed of all orthopedic surgeries following first fulfillment of 1987 ACR criteria for adult-onset RA among residents of Olmsted County, Minnesota, USA in 1980-2013. Risk factors were examined using Cox models adjusted for age, sex and calendar year of RA incidence. Trends in incidence of joint surgeries were examined using Poisson regression models. A total of 1077 patients with RA (mean age 56 years, 69% female, 66% seropositive) were followed for a median of 10.7 years during which 112 (90 women) underwent at least one SJS and 204 (141 women) underwent at least one LJS. Risk factors included advanced age, rheumatoid factor and anti-CCP antibody positivity for both SJS and LJS, and BMI≥30 kg/m 2 for LJS. Risk factors for SJS and LJS at any time during follow-up included the presence of radiographic erosions, large joint swelling, and methotrexate use. SJS rates decreased by calendar year of incidence (hazard ratio 0.53; p=0.001), with significant decline in SJS after 1995. The cumulative incidence of SJS was higher in women than men (p=0.008). In recent years, there has been a significant decline in rates of SJS but not LJS in patients with RA. The incidence of SJS is higher among women. Traditional RA risk factors are strong predictors for SJS and LJS. Increasing age and obesity are predictive of LJS. This article is protected by copyright. All rights reserved. This article is protected by copyright. All rights reserved.

  11. Analysis of γ spectra in airborne radioactivity measurements using multiple linear regressions

    International Nuclear Information System (INIS)

    Bao Min; Shi Quanlin; Zhang Jiamei

    2004-01-01

    This paper describes the net peak counts calculating of nuclide 137 Cs at 662 keV of γ spectra in airborne radioactivity measurements using multiple linear regressions. Mathematic model is founded by analyzing every factor that has contribution to Cs peak counts in spectra, and multiple linear regression function is established. Calculating process adopts stepwise regression, and the indistinctive factors are eliminated by F check. The regression results and its uncertainty are calculated using Least Square Estimation, then the Cs peak net counts and its uncertainty can be gotten. The analysis results for experimental spectrum are displayed. The influence of energy shift and energy resolution on the analyzing result is discussed. In comparison with the stripping spectra method, multiple linear regression method needn't stripping radios, and the calculating result has relation with the counts in Cs peak only, and the calculating uncertainty is reduced. (authors)

  12. A method to identify dependencies between organizational factors using statistical independence test

    International Nuclear Information System (INIS)

    Kim, Y.; Chung, C.H.; Kim, C.; Jae, M.; Jung, J.H.

    2004-01-01

    A considerable number of studies on organizational factors in nuclear power plants have been made especially in recent years, most of which have assumed organizational factors to be independent. However, since organizational factors characterize the organization in terms of safety and efficiency etc. and there would be some factors that have close relations between them. Therefore, from whatever point of view, if we want to identify the characteristics of an organization, the dependence relationships should be considered to get an accurate result. In this study the organization of a reference nuclear power plant in Korea was analyzed for the trip cases of that plant using 20 organizational factors that Jacobs and Haber had suggested: 1) coordination of work, 2) formalization, 3) organizational knowledge, 4) roles and responsibilities, 5) external communication, 6) inter-department communications, 7) intra-departmental communications, 8) organizational culture, 9) ownership, 10) safety culture, 11) time urgency, 12) centralization, 13) goal prioritization, 14) organizational learning, 15) problem identification, 16) resource allocation, 17) performance evaluation, 18) personnel selection, 19) technical knowledge, and 20) training. By utilizing the results of the analysis, a method to identify the dependence relationships between organizational factors is presented. The statistical independence test for the analysis result of the trip cases is adopted to reveal dependencies. This method is geared to the needs to utilize many kinds of data that has been obtained as the operating years of nuclear power plants increase, and more reliable dependence relations may be obtained by using these abundant data

  13. Identify and rank key factors influencing the adoption of cloud computing for a healthy Electronics

    Directory of Open Access Journals (Sweden)

    Javad Shukuhy

    2015-02-01

    Full Text Available Cloud computing as a new technology with Internet infrastructure and new approaches can be significant benefits in providing medical services electronically. Aplying this technology in E-Health requires consideration of various factors. The main objective of this study is to identify and rank the factors influencing the adoption of e-health cloud. Based on the Technology-Organization-Environment (TOE framework and Human-Organization-Technology fit (HOT-fit model, 16 sub-factors were identified in four major factors. With survey of 60 experts, academics and experts in health information technology and with the help of fuzzy analytic hierarchy process had ranked these sub-factors and factors. In the literature, considering newness this study, no internal or external study, have not alluded these number of criteria. The results show that when deciding to adopt cloud computing in E-Health, respectively, must be considered technological, human, organizational and environmental factors.

  14. An exploratory study to identify critical factors of innovation culture in organizations

    Directory of Open Access Journals (Sweden)

    Hamed Asgari

    2013-07-01

    Full Text Available During the past two decades, there has been a growing trend on knowledge-based organizations. Innovation, on the other hand, plays essential role on building competitive business units. In this paper, we present an exploratory study to identify critical factors of innovation culture in organizations. We detect important factors influencing innovation culture in construction industry based on the implementation of factor analysis. The proposed study designs a questionnaire and distributes it among 400 experts who are involved in construction industry. Cronbach alpha has been calculated as 0.779, which validates the overall questionnaire. The results of factor analysis have indicated that six factors of building cultural infrastructures, education, organizational vision, established culture, strategic culture and flexible culture are the most important items influencing innovation culture.

  15. Reduced Rank Regression

    DEFF Research Database (Denmark)

    Johansen, Søren

    2008-01-01

    The reduced rank regression model is a multivariate regression model with a coefficient matrix with reduced rank. The reduced rank regression algorithm is an estimation procedure, which estimates the reduced rank regression model. It is related to canonical correlations and involves calculating...

  16. Multiple linear regression and regression with time series error models in forecasting PM10 concentrations in Peninsular Malaysia.

    Science.gov (United States)

    Ng, Kar Yong; Awang, Norhashidah

    2018-01-06

    Frequent haze occurrences in Malaysia have made the management of PM 10 (particulate matter with aerodynamic less than 10 μm) pollution a critical task. This requires knowledge on factors associating with PM 10 variation and good forecast of PM 10 concentrations. Hence, this paper demonstrates the prediction of 1-day-ahead daily average PM 10 concentrations based on predictor variables including meteorological parameters and gaseous pollutants. Three different models were built. They were multiple linear regression (MLR) model with lagged predictor variables (MLR1), MLR model with lagged predictor variables and PM 10 concentrations (MLR2) and regression with time series error (RTSE) model. The findings revealed that humidity, temperature, wind speed, wind direction, carbon monoxide and ozone were the main factors explaining the PM 10 variation in Peninsular Malaysia. Comparison among the three models showed that MLR2 model was on a same level with RTSE model in terms of forecasting accuracy, while MLR1 model was the worst.

  17. Using beta-binomial regression for high-precision differential methylation analysis in multifactor whole-genome bisulfite sequencing experiments

    Science.gov (United States)

    2014-01-01

    Background Whole-genome bisulfite sequencing currently provides the highest-precision view of the epigenome, with quantitative information about populations of cells down to single nucleotide resolution. Several studies have demonstrated the value of this precision: meaningful features that correlate strongly with biological functions can be found associated with only a few CpG sites. Understanding the role of DNA methylation, and more broadly the role of DNA accessibility, requires that methylation differences between populations of cells are identified with extreme precision and in complex experimental designs. Results In this work we investigated the use of beta-binomial regression as a general approach for modeling whole-genome bisulfite data to identify differentially methylated sites and genomic intervals. Conclusions The regression-based analysis can handle medium- and large-scale experiments where it becomes critical to accurately model variation in methylation levels between replicates and account for influence of various experimental factors like cell types or batch effects. PMID:24962134

  18. Risk Factors for Internet Gaming Disorder: Psychological Factors and Internet Gaming Characteristics

    Directory of Open Access Journals (Sweden)

    Mi Jung Rho

    2017-12-01

    Full Text Available Background: Understanding the risk factors associated with Internet gaming disorder (IGD is important to predict and diagnose the condition. The purpose of this study is to identify risk factors that predict IGD based on psychological factors and Internet gaming characteristics; Methods: Online surveys were conducted between 26 November and 26 December 2014. There were 3568 Korean Internet game users among a total of 5003 respondents. We identified 481 IGD gamers and 3087 normal Internet gamers, based on Diagnostic and Statistical Manual for Mental Disorders (DSM-5 criteria. Logistic regression analysis was applied to identify significant risk factors for IGD; Results: The following eight risk factors were found to be significantly associated with IGD: functional and dysfunctional impulsivity (odds ratio: 1.138, belief self-control (1.034, anxiety (1.086, pursuit of desired appetitive goals (1.105, money spent on gaming (1.005, weekday game time (1.081, offline community meeting attendance (2.060, and game community membership (1.393; p < 0.05 for all eight risk factors; Conclusions: These risk factors allow for the prediction and diagnosis of IGD. In the future, these risk factors could also be used to inform clinical services for IGD diagnosis and treatment.

  19. Risk Factors for Internet Gaming Disorder: Psychological Factors and Internet Gaming Characteristics

    Science.gov (United States)

    Lee, Hyeseon; Lee, Taek-Ho; Cho, Hyun; Kim, Dai-Jin; Choi, In Young

    2017-01-01

    Background: Understanding the risk factors associated with Internet gaming disorder (IGD) is important to predict and diagnose the condition. The purpose of this study is to identify risk factors that predict IGD based on psychological factors and Internet gaming characteristics; Methods: Online surveys were conducted between 26 November and 26 December 2014. There were 3568 Korean Internet game users among a total of 5003 respondents. We identified 481 IGD gamers and 3087 normal Internet gamers, based on Diagnostic and Statistical Manual for Mental Disorders (DSM-5) criteria. Logistic regression analysis was applied to identify significant risk factors for IGD; Results: The following eight risk factors were found to be significantly associated with IGD: functional and dysfunctional impulsivity (odds ratio: 1.138), belief self-control (1.034), anxiety (1.086), pursuit of desired appetitive goals (1.105), money spent on gaming (1.005), weekday game time (1.081), offline community meeting attendance (2.060), and game community membership (1.393; p < 0.05 for all eight risk factors); Conclusions: These risk factors allow for the prediction and diagnosis of IGD. In the future, these risk factors could also be used to inform clinical services for IGD diagnosis and treatment. PMID:29280953

  20. Determinants of LSIL Regression in Women from a Colombian Cohort

    International Nuclear Information System (INIS)

    Molano, Monica; Gonzalez, Mauricio; Gamboa, Oscar; Ortiz, Natasha; Luna, Joaquin; Hernandez, Gustavo; Posso, Hector; Murillo, Raul; Munoz, Nubia

    2010-01-01

    Objective: To analyze the role of Human Papillomavirus (HPV) and other risk factors in the regression of cervical lesions in women from the Bogota Cohort. Methods: 200 HPV positive women with abnormal cytology were included for regression analysis. The time of lesion regression was modeled using methods for interval censored survival time data. Median duration of total follow-up was 9 years. Results: 80 (40%) women were diagnosed with Atypical Squamous Cells of Undetermined Significance (ASCUS) or Atypical Glandular Cells of Undetermined Significance (AGUS) while 120 (60%) were diagnosed with Low Grade Squamous Intra-epithelial Lesions (LSIL). Globally, 40% of the lesions were still present at first year of follow up, while 1.5% was still present at 5 year check-up. The multivariate model showed similar regression rates for lesions in women with ASCUS/AGUS and women with LSIL (HR= 0.82, 95% CI 0.59-1.12). Women infected with HR HPV types and those with mixed infections had lower regression rates for lesions than did women infected with LR types (HR=0.526, 95% CI 0.33-0.84, for HR types and HR=0.378, 95% CI 0.20-0.69, for mixed infections). Furthermore, women over 30 years had a higher lesion regression rate than did women under 30 years (HR1.53, 95% CI 1.03-2.27). The study showed that the median time for lesion regression was 9 months while the median time for HPV clearance was 12 months. Conclusions: In the studied population, the type of infection and the age of the women are critical factors for the regression of cervical lesions.

  1. Quantile Regression Methods

    DEFF Research Database (Denmark)

    Fitzenberger, Bernd; Wilke, Ralf Andreas

    2015-01-01

    if the mean regression model does not. We provide a short informal introduction into the principle of quantile regression which includes an illustrative application from empirical labor market research. This is followed by briefly sketching the underlying statistical model for linear quantile regression based......Quantile regression is emerging as a popular statistical approach, which complements the estimation of conditional mean models. While the latter only focuses on one aspect of the conditional distribution of the dependent variable, the mean, quantile regression provides more detailed insights...... by modeling conditional quantiles. Quantile regression can therefore detect whether the partial effect of a regressor on the conditional quantiles is the same for all quantiles or differs across quantiles. Quantile regression can provide evidence for a statistical relationship between two variables even...

  2. Combined and interactive effects of environmental and GWAS-identified risk factors in ovarian cancer

    DEFF Research Database (Denmark)

    Pearce, Celeste Leigh; Rossing, Mary Anne; Lee, Alice W

    2013-01-01

    There are several well-established environmental risk factors for ovarian cancer, and recent genome-wide association studies have also identified six variants that influence disease risk. However, the interplay between such risk factors and susceptibility loci has not been studied....

  3. Identify and Classify Critical Success Factor of Agile Software Development Methodology Using Mind Map

    OpenAIRE

    Tasneem Abd El Hameed; Mahmoud Abd EL Latif; Sherif Kholief

    2016-01-01

    Selecting the right method, right personnel and right practices, and applying them adequately, determine the success of software development. In this paper, a qualitative study is carried out among the critical factors of success from previous studies. The factors of success match with their relative principles to illustrate the most valuable factor for agile approach success, this paper also prove that the twelve principles poorly identified for few factors resulting from qualitative and qua...

  4. Factors affecting the HIV/AIDS epidemic: An ecological analysis of ...

    African Journals Online (AJOL)

    Factors affecting the HIV/AIDS epidemic: An ecological analysis of global data. ... Backward multiple linear regression analysis identified the proportion of Muslims, physicians density, and adolescent fertility rate are as the three most prominent factors linked with the national HIV epidemic. Conclusions: The findings support ...

  5. Moderation analysis using a two-level regression model.

    Science.gov (United States)

    Yuan, Ke-Hai; Cheng, Ying; Maxwell, Scott

    2014-10-01

    Moderation analysis is widely used in social and behavioral research. The most commonly used model for moderation analysis is moderated multiple regression (MMR) in which the explanatory variables of the regression model include product terms, and the model is typically estimated by least squares (LS). This paper argues for a two-level regression model in which the regression coefficients of a criterion variable on predictors are further regressed on moderator variables. An algorithm for estimating the parameters of the two-level model by normal-distribution-based maximum likelihood (NML) is developed. Formulas for the standard errors (SEs) of the parameter estimates are provided and studied. Results indicate that, when heteroscedasticity exists, NML with the two-level model gives more efficient and more accurate parameter estimates than the LS analysis of the MMR model. When error variances are homoscedastic, NML with the two-level model leads to essentially the same results as LS with the MMR model. Most importantly, the two-level regression model permits estimating the percentage of variance of each regression coefficient that is due to moderator variables. When applied to data from General Social Surveys 1991, NML with the two-level model identified a significant moderation effect of race on the regression of job prestige on years of education while LS with the MMR model did not. An R package is also developed and documented to facilitate the application of the two-level model.

  6. A Monte Carlo simulation study comparing linear regression, beta regression, variable-dispersion beta regression and fractional logit regression at recovering average difference measures in a two sample design.

    Science.gov (United States)

    Meaney, Christopher; Moineddin, Rahim

    2014-01-24

    In biomedical research, response variables are often encountered which have bounded support on the open unit interval--(0,1). Traditionally, researchers have attempted to estimate covariate effects on these types of response data using linear regression. Alternative modelling strategies may include: beta regression, variable-dispersion beta regression, and fractional logit regression models. This study employs a Monte Carlo simulation design to compare the statistical properties of the linear regression model to that of the more novel beta regression, variable-dispersion beta regression, and fractional logit regression models. In the Monte Carlo experiment we assume a simple two sample design. We assume observations are realizations of independent draws from their respective probability models. The randomly simulated draws from the various probability models are chosen to emulate average proportion/percentage/rate differences of pre-specified magnitudes. Following simulation of the experimental data we estimate average proportion/percentage/rate differences. We compare the estimators in terms of bias, variance, type-1 error and power. Estimates of Monte Carlo error associated with these quantities are provided. If response data are beta distributed with constant dispersion parameters across the two samples, then all models are unbiased and have reasonable type-1 error rates and power profiles. If the response data in the two samples have different dispersion parameters, then the simple beta regression model is biased. When the sample size is small (N0 = N1 = 25) linear regression has superior type-1 error rates compared to the other models. Small sample type-1 error rates can be improved in beta regression models using bias correction/reduction methods. In the power experiments, variable-dispersion beta regression and fractional logit regression models have slightly elevated power compared to linear regression models. Similar results were observed if the

  7. Screening for violence risk factors identifies young adults at risk for return emergency department visit for injury.

    Science.gov (United States)

    Hankin, Abigail; Wei, Stanley; Foreman, Juron; Houry, Debra

    2014-08-01

    Homicide is the second leading cause of death among youth aged 15-24. Prior cross-sectional studies, in non-healthcare settings, have reported exposure to community violence, peer behavior, and delinquency as risk factors for violent injury. However, longitudinal cohort studies have not been performed to evaluate the temporal or predictive relationship between these risk factors and emergency department (ED) visits for injuries among at-risk youth. The objective was to assess whether self-reported exposure to violence risk factors in young adults can be used to predict future ED visits for injuries over a 1-year period. This prospective cohort study was performed in the ED of a Southeastern US Level I trauma center. Eligible participants were patients aged 18-24, presenting for any chief complaint. We excluded patients if they were critically ill, incarcerated, or could not read English. Initial recruitment occurred over a 6-month period, by a research assistant in the ED for 3-5 days per week, with shifts scheduled such that they included weekends and weekdays, over the hours from 8AM-8PM. At the time of initial contact in the ED, patients were asked to complete a written questionnaire, consisting of previously validated instruments measuring the following risk factors: a) aggression, b) perceived likelihood of violence, c) recent violent behavior, d) peer behavior, e) community exposure to violence, and f) positive future outlook. At 12 months following the initial ED visit, the participants' medical records were reviewed to identify any subsequent ED visits for injury-related complaints. We analyzed data with chi-square and logistic regression analyses. Three hundred thirty-two patients were approached, of whom 300 patients consented. Participants' average age was 21.1 years, with 60.1% female, 86.0% African American. After controlling for participant gender, ethnicity, or injury complaint at time of first visit, return visits for injuries were significantly

  8. IPF-LASSO: Integrative L1-Penalized Regression with Penalty Factors for Prediction Based on Multi-Omics Data

    Directory of Open Access Journals (Sweden)

    Anne-Laure Boulesteix

    2017-01-01

    Full Text Available As modern biotechnologies advance, it has become increasingly frequent that different modalities of high-dimensional molecular data (termed “omics” data in this paper, such as gene expression, methylation, and copy number, are collected from the same patient cohort to predict the clinical outcome. While prediction based on omics data has been widely studied in the last fifteen years, little has been done in the statistical literature on the integration of multiple omics modalities to select a subset of variables for prediction, which is a critical task in personalized medicine. In this paper, we propose a simple penalized regression method to address this problem by assigning different penalty factors to different data modalities for feature selection and prediction. The penalty factors can be chosen in a fully data-driven fashion by cross-validation or by taking practical considerations into account. In simulation studies, we compare the prediction performance of our approach, called IPF-LASSO (Integrative LASSO with Penalty Factors and implemented in the R package ipflasso, with the standard LASSO and sparse group LASSO. The use of IPF-LASSO is also illustrated through applications to two real-life cancer datasets. All data and codes are available on the companion website to ensure reproducibility.

  9. Spatial and Inter-temporal Sources of Poverty, Inequality and Gender Disparities in Cameroon: a Regression-Based Decomposition Analysis

    OpenAIRE

    Boniface Ngah Epo; Francis Menjo Baye; Nadine Teme Angele Manga

    2011-01-01

    This study applies the regression-based inequality decomposition technique to explain poverty and inequality trends in Cameroon. We also identify gender related factors which explain income disparities and discrimination based on the 2001 and 2007 Cameroon household consumption surveys. The results show that education, health, employment in the formal sector, age cohorts, household size, gender, ownership of farmland and urban versus rural residence explain household economic wellbeing; dispa...

  10. Effects of measurement errors on psychometric measurements in ergonomics studies: Implications for correlations, ANOVA, linear regression, factor analysis, and linear discriminant analysis.

    Science.gov (United States)

    Liu, Yan; Salvendy, Gavriel

    2009-05-01

    This paper aims to demonstrate the effects of measurement errors on psychometric measurements in ergonomics studies. A variety of sources can cause random measurement errors in ergonomics studies and these errors can distort virtually every statistic computed and lead investigators to erroneous conclusions. The effects of measurement errors on five most widely used statistical analysis tools have been discussed and illustrated: correlation; ANOVA; linear regression; factor analysis; linear discriminant analysis. It has been shown that measurement errors can greatly attenuate correlations between variables, reduce statistical power of ANOVA, distort (overestimate, underestimate or even change the sign of) regression coefficients, underrate the explanation contributions of the most important factors in factor analysis and depreciate the significance of discriminant function and discrimination abilities of individual variables in discrimination analysis. The discussions will be restricted to subjective scales and survey methods and their reliability estimates. Other methods applied in ergonomics research, such as physical and electrophysiological measurements and chemical and biomedical analysis methods, also have issues of measurement errors, but they are beyond the scope of this paper. As there has been increasing interest in the development and testing of theories in ergonomics research, it has become very important for ergonomics researchers to understand the effects of measurement errors on their experiment results, which the authors believe is very critical to research progress in theory development and cumulative knowledge in the ergonomics field.

  11. No rationale for 1 variable per 10 events criterion for binary logistic regression analysis

    Directory of Open Access Journals (Sweden)

    Maarten van Smeden

    2016-11-01

    Full Text Available Abstract Background Ten events per variable (EPV is a widely advocated minimal criterion for sample size considerations in logistic regression analysis. Of three previous simulation studies that examined this minimal EPV criterion only one supports the use of a minimum of 10 EPV. In this paper, we examine the reasons for substantial differences between these extensive simulation studies. Methods The current study uses Monte Carlo simulations to evaluate small sample bias, coverage of confidence intervals and mean square error of logit coefficients. Logistic regression models fitted by maximum likelihood and a modified estimation procedure, known as Firth’s correction, are compared. Results The results show that besides EPV, the problems associated with low EPV depend on other factors such as the total sample size. It is also demonstrated that simulation results can be dominated by even a few simulated data sets for which the prediction of the outcome by the covariates is perfect (‘separation’. We reveal that different approaches for identifying and handling separation leads to substantially different simulation results. We further show that Firth’s correction can be used to improve the accuracy of regression coefficients and alleviate the problems associated with separation. Conclusions The current evidence supporting EPV rules for binary logistic regression is weak. Given our findings, there is an urgent need for new research to provide guidance for supporting sample size considerations for binary logistic regression analysis.

  12. Using a Delphi Method to Identify Human Factors Contributing to Nursing Errors.

    Science.gov (United States)

    Roth, Cheryl; Brewer, Melanie; Wieck, K Lynn

    2017-07-01

    The purpose of this study was to identify human factors associated with nursing errors. Using a Delphi technique, this study used feedback from a panel of nurse experts (n = 25) on an initial qualitative survey questionnaire followed by summarizing the results with feedback and confirmation. Synthesized factors regarding causes of errors were incorporated into a quantitative Likert-type scale, and the original expert panel participants were queried a second time to validate responses. The list identified 24 items as most common causes of nursing errors, including swamping and errors made by others that nurses are expected to recognize and fix. The responses provided a consensus top 10 errors list based on means with heavy workload and fatigue at the top of the list. The use of the Delphi survey established consensus and developed a platform upon which future study of nursing errors can evolve as a link to future solutions. This list of human factors in nursing errors should serve to stimulate dialogue among nurses about how to prevent errors and improve outcomes. Human and system failures have been the subject of an abundance of research, yet nursing errors continue to occur. © 2016 Wiley Periodicals, Inc.

  13. Predicting Taxi-Out Time at Congested Airports with Optimization-Based Support Vector Regression Methods

    Directory of Open Access Journals (Sweden)

    Guan Lian

    2018-01-01

    Full Text Available Accurate prediction of taxi-out time is significant precondition for improving the operationality of the departure process at an airport, as well as reducing the long taxi-out time, congestion, and excessive emission of greenhouse gases. Unfortunately, several of the traditional methods of predicting taxi-out time perform unsatisfactorily at congested airports. This paper describes and tests three of those conventional methods which include Generalized Linear Model, Softmax Regression Model, and Artificial Neural Network method and two improved Support Vector Regression (SVR approaches based on swarm intelligence algorithm optimization, which include Particle Swarm Optimization (PSO and Firefly Algorithm. In order to improve the global searching ability of Firefly Algorithm, adaptive step factor and Lévy flight are implemented simultaneously when updating the location function. Six factors are analysed, of which delay is identified as one significant factor in congested airports. Through a series of specific dynamic analyses, a case study of Beijing International Airport (PEK is tested with historical data. The performance measures show that the proposed two SVR approaches, especially the Improved Firefly Algorithm (IFA optimization-based SVR method, not only perform as the best modelling measures and accuracy rate compared with the representative forecast models, but also can achieve a better predictive performance when dealing with abnormal taxi-out time states.

  14. Regression Phalanxes

    OpenAIRE

    Zhang, Hongyang; Welch, William J.; Zamar, Ruben H.

    2017-01-01

    Tomal et al. (2015) introduced the notion of "phalanxes" in the context of rare-class detection in two-class classification problems. A phalanx is a subset of features that work well for classification tasks. In this paper, we propose a different class of phalanxes for application in regression settings. We define a "Regression Phalanx" - a subset of features that work well together for prediction. We propose a novel algorithm which automatically chooses Regression Phalanxes from high-dimensi...

  15. Identifying risk factors for PTSD in women seeking medical help after rape

    OpenAIRE

    Möller, Anna Tiihonen; Bäckström, Torbjörn; Söndergaard, Hans Peter; Helström, Lotti

    2014-01-01

    Objectives: Rape has been found to be the trauma most commonly associated with Posttraumatic Stress Disorder (PTSD) among women. It is therefore important to be able to identify those women at greatest risk of developing PTSD. The aims of the present study were to analyze the PTSD prevalence six months after sexual assaults and identify the major risk factors for developing PTSD. Methods: Participants were 317 female victims of rape who sought help at the Emergency Clinic for Raped Women at S...

  16. Lifestyle factors in U.S. residential electricity consumption

    International Nuclear Information System (INIS)

    Sanquist, Thomas F.; Orr, Heather; Shui Bin; Bittner, Alvah C.

    2012-01-01

    A multivariate statistical approach to lifestyle analysis of residential electricity consumption is described and illustrated. Factor analysis of selected variables from the 2005 U.S. Residential Energy Consumption Survey (RECS) identified five lifestyle factors reflecting social and behavioral patterns associated with air conditioning, laundry usage, personal computer usage, climate zone of residence, and TV use. These factors were also estimated for 2001 RECS data. Multiple regression analysis using the lifestyle factors yields solutions accounting for approximately 40% of the variance in electricity consumption for both years. By adding the household and market characteristics of income, local electricity price and access to natural gas, variance accounted for is increased to approximately 54%. Income contributed ∼1% unique variance to the models, indicating that lifestyle factors reflecting social and behavioral patterns better account for consumption differences than income. Geographic segmentation of factor scores shows distinct clusters of consumption and lifestyle factors, particularly in suburban locations. The implications for tailored policy and planning interventions are discussed in relation to lifestyle issues. - Highlights: ► Illustrates lifestyle analysis of residential electricity consumption. ► Lifestyle factors based on social and behavioral decisions and equipment use. ► Regression models using lifestyle factors account for 40% of consumption variance. ► Lifestyle factors are stable over time when applied to other data sets. ► Energy reduction opportunities are identified by segmentation analysis.

  17. A Quantile Regression Approach to Estimating the Distribution of Anesthetic Procedure Time during Induction.

    Directory of Open Access Journals (Sweden)

    Hsin-Lun Wu

    Full Text Available Although procedure time analyses are important for operating room management, it is not easy to extract useful information from clinical procedure time data. A novel approach was proposed to analyze procedure time during anesthetic induction. A two-step regression analysis was performed to explore influential factors of anesthetic induction time (AIT. Linear regression with stepwise model selection was used to select significant correlates of AIT and then quantile regression was employed to illustrate the dynamic relationships between AIT and selected variables at distinct quantiles. A total of 1,060 patients were analyzed. The first and second-year residents (R1-R2 required longer AIT than the third and fourth-year residents and attending anesthesiologists (p = 0.006. Factors prolonging AIT included American Society of Anesthesiologist physical status ≧ III, arterial, central venous and epidural catheterization, and use of bronchoscopy. Presence of surgeon before induction would decrease AIT (p < 0.001. Types of surgery also had significant influence on AIT. Quantile regression satisfactorily estimated extra time needed to complete induction for each influential factor at distinct quantiles. Our analysis on AIT demonstrated the benefit of quantile regression analysis to provide more comprehensive view of the relationships between procedure time and related factors. This novel two-step regression approach has potential applications to procedure time analysis in operating room management.

  18. High-throughput metabolic profiling of diverse green Coffea arabica beans identified tryptophan as a universal discrimination factor for immature beans.

    Science.gov (United States)

    Setoyama, Daiki; Iwasa, Keiko; Seta, Harumichi; Shimizu, Hiroaki; Fujimura, Yoshinori; Miura, Daisuke; Wariishi, Hiroyuki; Nagai, Chifumi; Nakahara, Koichi

    2013-01-01

    The maturity of green coffee beans is the most influential determinant of the quality and flavor of the resultant coffee beverage. However, the chemical compounds that can be used to discriminate the maturity of the beans remain uncharacterized. We herein analyzed four distinct stages of maturity (immature, semi-mature, mature and overripe) of nine different varieties of green Coffea arabica beans hand-harvested from a single experimental field in Hawaii. After developing a high-throughput experimental system for sample preparation and liquid chromatography-mass spectrometry (LC-MS) measurement, we applied metabolic profiling, integrated with chemometric techniques, to explore the relationship between the metabolome and maturity of the sample in a non-biased way. For the multivariate statistical analyses, a partial least square (PLS) regression model was successfully created, which allowed us to accurately predict the maturity of the beans based on the metabolomic information. As a result, tryptophan was identified to be the best contributor to the regression model; the relative MS intensity of tryptophan was higher in immature beans than in those after the semi-mature stages in all arabica varieties investigated, demonstrating a universal discrimination factor for diverse arabica beans. Therefore, typtophan, either alone or together with other metabolites, may be utilized for traders as an assessment standard when purchasing qualified trading green arabica bean products. Furthermore, our results suggest that the tryptophan metabolism may be tightly linked to the development of coffee cherries and/or beans.

  19. Research and analyze of physical health using multiple regression analysis

    Directory of Open Access Journals (Sweden)

    T. S. Kyi

    2014-01-01

    Full Text Available This paper represents the research which is trying to create a mathematical model of the "healthy people" using the method of regression analysis. The factors are the physical parameters of the person (such as heart rate, lung capacity, blood pressure, breath holding, weight height coefficient, flexibility of the spine, muscles of the shoulder belt, abdominal muscles, squatting, etc.., and the response variable is an indicator of physical working capacity. After performing multiple regression analysis, obtained useful multiple regression models that can predict the physical performance of boys the aged of fourteen to seventeen years. This paper represents the development of regression model for the sixteen year old boys and analyzed results.

  20. Factors Impacting Online Ratings for Otolaryngologists.

    Science.gov (United States)

    Calixto, Nathaniel E; Chiao, Whitney; Durr, Megan L; Jiang, Nancy

    2018-06-01

    To identify factors associated with online patient ratings and comments for a nationwide sample of otolaryngologists. Ratings, demographic information, and written comments were obtained for a random sample of otolaryngologists from HealthGrades.com and Vitals.com . Online Presence Score (OPS) was based on 10 criteria, including professional website and social media profiles. Regression analyses identified factors associated with increased rating. We evaluated for correlations between OPS and other attributes with star rating and used chi-square tests to evaluate content differences between positive and negative comments. On linear regression, increased OPS was associated with higher ratings on HealthGrades and Vitals; higher ratings were also associated with younger age on Vitals and less experience on HealthGrades. However, detailed correlation studies showed weak correlation between OPS and rating; age and graduation year also showed low correlation with ratings. Negative comments more likely focused on surgeon-independent factors or poor bedside manner. Though younger otolaryngologists with greater online presence tend to have higher ratings, weak correlations suggest that age and online presence have only a small impact on the content found on ratings websites. While most written comments are positive, deficiencies in bedside manner or other physician-independent factors tend to elicit negative comments.

  1. Tombusvirus-yeast interactions identify conserved cell-intrinsic viral restriction factors

    Directory of Open Access Journals (Sweden)

    Zsuzsanna eSasvari

    2014-08-01

    Full Text Available To combat viral infections, plants possess innate and adaptive immune pathways, such as RNA silencing, R gene and recessive gene-mediated resistance mechanisms. However, it is likely that additional cell-intrinsic restriction factors (CIRF are also involved in limiting plant virus replication. This review discusses novel CIRFs with antiviral functions, many of them RNA-binding proteins or affecting the RNA binding activities of viral replication proteins. The CIRFs against tombusviruses have been identified in yeast (Saccharomyces cerevisiae, which is developed as an advanced model organism. Grouping of the identified CIRFs based on their known cellular functions and subcellular localization in yeast reveals that TBSV replication is limited by a wide variety of host gene functions. Yeast proteins with the highest connectivity in the network map include the well-characterized Xrn1p 5’-3’ exoribonuclease, Act1p actin protein and Cse4p centromere protein. The protein network map also reveals an important interplay between the pro-viral Hsp70 cellular chaperone and the antiviral co-chaperones, and possibly key roles for the ribosomal or ribosome-associated factors. We discuss the antiviral functions of selected CIRFs, such as the RNA binding nucleolin, ribonucleases, WW-domain proteins, single- and multi-domain cyclophilins, TPR-domain co-chaperones and cellular ion pumps. These restriction factors frequently target the RNA-binding region in the viral replication proteins, thus interfering with the recruitment of the viral RNA for replication and the assembly of the membrane-bound viral replicase. Although many of the characterized CIRFs act directly against TBSV, we propose that the TPR-domain co-chaperones function as guardians of the cellular Hsp70 chaperone system, which is subverted efficiently by TBSV for viral replicase assembly in the absence of the TPR-domain co-chaperones.

  2. ENHANCED PREDICTION OF STUDENT DROPOUTS USING FUZZY INFERENCE SYSTEM AND LOGISTIC REGRESSION

    Directory of Open Access Journals (Sweden)

    A. Saranya

    2016-01-01

    Full Text Available Predicting college and school dropouts is a major problem in educational system and has complicated challenge due to data imbalance and multi dimensionality, which can affect the low performance of students. In this paper, we have collected different database from various colleges, among these 500 best real attributes are identified in order to identify the factor that affecting dropout students using neural based classification algorithm and different mining technique are implemented for data processing. We also propose a Dropout Prediction Algorithm (DPA using fuzzy logic and Logistic Regression based inference system because the weighted average will improve the performance of whole system. We are experimented our proposed work with all other classification systems and documented as the best outcomes. The aggregated data is given to the decision trees for better dropout prediction. The accuracy of overall system 98.6% it shows the proposed work depicts efficient prediction.

  3. Observed to expected or logistic regression to identify hospitals with high or low 30-day mortality?

    Science.gov (United States)

    Helgeland, Jon; Clench-Aas, Jocelyne; Laake, Petter; Veierød, Marit B.

    2018-01-01

    Introduction A common quality indicator for monitoring and comparing hospitals is based on death within 30 days of admission. An important use is to determine whether a hospital has higher or lower mortality than other hospitals. Thus, the ability to identify such outliers correctly is essential. Two approaches for detection are: 1) calculating the ratio of observed to expected number of deaths (OE) per hospital and 2) including all hospitals in a logistic regression (LR) comparing each hospital to a form of average over all hospitals. The aim of this study was to compare OE and LR with respect to correctly identifying 30-day mortality outliers. Modifications of the methods, i.e., variance corrected approach of OE (OE-Faris), bias corrected LR (LR-Firth), and trimmed mean variants of LR and LR-Firth were also studied. Materials and methods To study the properties of OE and LR and their variants, we performed a simulation study by generating patient data from hospitals with known outlier status (low mortality, high mortality, non-outlier). Data from simulated scenarios with varying number of hospitals, hospital volume, and mortality outlier status, were analysed by the different methods and compared by level of significance (ability to falsely claim an outlier) and power (ability to reveal an outlier). Moreover, administrative data for patients with acute myocardial infarction (AMI), stroke, and hip fracture from Norwegian hospitals for 2012–2014 were analysed. Results None of the methods achieved the nominal (test) level of significance for both low and high mortality outliers. For low mortality outliers, the levels of significance were increased four- to fivefold for OE and OE-Faris. For high mortality outliers, OE and OE-Faris, LR 25% trimmed and LR-Firth 10% and 25% trimmed maintained approximately the nominal level. The methods agreed with respect to outlier status for 94.1% of the AMI hospitals, 98.0% of the stroke, and 97.8% of the hip fracture hospitals

  4. Boosted beta regression.

    Directory of Open Access Journals (Sweden)

    Matthias Schmid

    Full Text Available Regression analysis with a bounded outcome is a common problem in applied statistics. Typical examples include regression models for percentage outcomes and the analysis of ratings that are measured on a bounded scale. In this paper, we consider beta regression, which is a generalization of logit models to situations where the response is continuous on the interval (0,1. Consequently, beta regression is a convenient tool for analyzing percentage responses. The classical approach to fit a beta regression model is to use maximum likelihood estimation with subsequent AIC-based variable selection. As an alternative to this established - yet unstable - approach, we propose a new estimation technique called boosted beta regression. With boosted beta regression estimation and variable selection can be carried out simultaneously in a highly efficient way. Additionally, both the mean and the variance of a percentage response can be modeled using flexible nonlinear covariate effects. As a consequence, the new method accounts for common problems such as overdispersion and non-binomial variance structures.

  5. Regression to Causality : Regression-style presentation influences causal attribution

    DEFF Research Database (Denmark)

    Bordacconi, Mats Joe; Larsen, Martin Vinæs

    2014-01-01

    of equivalent results presented as either regression models or as a test of two sample means. Our experiment shows that the subjects who were presented with results as estimates from a regression model were more inclined to interpret these results causally. Our experiment implies that scholars using regression...... models – one of the primary vehicles for analyzing statistical results in political science – encourage causal interpretation. Specifically, we demonstrate that presenting observational results in a regression model, rather than as a simple comparison of means, makes causal interpretation of the results...... more likely. Our experiment drew on a sample of 235 university students from three different social science degree programs (political science, sociology and economics), all of whom had received substantial training in statistics. The subjects were asked to compare and evaluate the validity...

  6. Identifying risk profiles for childhood obesity using recursive partitioning based on individual, familial, and neighborhood environment factors.

    Science.gov (United States)

    Van Hulst, Andraea; Roy-Gagnon, Marie-Hélène; Gauvin, Lise; Kestens, Yan; Henderson, Mélanie; Barnett, Tracie A

    2015-02-15

    Few studies consider how risk factors within multiple levels of influence operate synergistically to determine childhood obesity. We used recursive partitioning analysis to identify unique combinations of individual, familial, and neighborhood factors that best predict obesity in children, and tested whether these predict 2-year changes in body mass index (BMI). Data were collected in 2005-2008 and in 2008-2011 for 512 Quebec youth (8-10 years at baseline) with a history of parental obesity (QUALITY study). CDC age- and sex-specific BMI percentiles were computed and children were considered obese if their BMI was ≥95th percentile. Individual (physical activity and sugar-sweetened beverage intake), familial (household socioeconomic status and measures of parental obesity including both BMI and waist circumference), and neighborhood (disadvantage, prestige, and presence of parks, convenience stores, and fast food restaurants) factors were examined. Recursive partitioning, a method that generates a classification tree predicting obesity based on combined exposure to a series of variables, was used. Associations between resulting varying risk group membership and BMI percentile at baseline and 2-year follow up were examined using linear regression. Recursive partitioning yielded 7 subgroups with a prevalence of obesity equal to 8%, 11%, 26%, 28%, 41%, 60%, and 63%, respectively. The 2 highest risk subgroups comprised i) children not meeting physical activity guidelines, with at least one BMI-defined obese parent and 2 abdominally obese parents, living in disadvantaged neighborhoods without parks and, ii) children with these characteristics, except with access to ≥1 park and with access to ≥1 convenience store. Group membership was strongly associated with BMI at baseline, but did not systematically predict change in BMI. Findings support the notion that obesity is predicted by multiple factors in different settings and provide some indications of potentially

  7. Simulation Experiments in Practice: Statistical Design and Regression Analysis

    OpenAIRE

    Kleijnen, J.P.C.

    2007-01-01

    In practice, simulation analysts often change only one factor at a time, and use graphical analysis of the resulting Input/Output (I/O) data. The goal of this article is to change these traditional, naïve methods of design and analysis, because statistical theory proves that more information is obtained when applying Design Of Experiments (DOE) and linear regression analysis. Unfortunately, classic DOE and regression analysis assume a single simulation response that is normally and independen...

  8. Return to Work: A Cut-Off of FIM Gain with Montebello Rehabilitation Factor Score in Order to Identify Predictive Factors in Subjects with Acquired Brain Injury.

    Science.gov (United States)

    Franceschini, Marco; Massimiani, Maria Pia; Paravati, Stefano; Agosti, Maurizio

    2016-01-01

    Return to work (RTW) for people with acquired brain injury (ABI) represents a main objective of rehabilitation: this work presents a strong correlation between personal well-being and quality of life. The aim of this study is to investigate the prognostic factors that can predict RTW after ABI (traumatic or non- traumatic aetiology) in patients without disorders of consciousness (e.g. coma, vegetative or minimally conscious state) at the beginning of their admission to rehabilitation. At the end of a 6-month follow-up after discharge, data were successfully collected in 69 patients. The rehabilitation effectiveness (functional Recovery) between admission and discharge was assessed by Functional Independent Measure (FIM) gain, through the Montebello Rehabilitation Factor Score (MRFS), which was obtained as follows: (discharge FIM-admission FIM)/(Maximum possible FIM-Admission FIM) x 100. The cut-off value (criterion) deriving from MRFS, which helped identify RTW patients, resulted in .659 (sn 88.9%; sp 52.4%). Considering the Mini Mental State Examination (MMSE) and the MRFS data, the multivariable binary logistic regression analysis presented 62.96% of correct RTW classification cases, 80.95% of non-RTW leading to an overall satisfactory predictability of 73.91%. The results of the present study suggest that occupational therapy intervention could modify cut-off in patients with an MFRS close to target at the end of an in-hospital rehabilitative program thus developing their capabilities and consequently surpassing cut-off itself.

  9. Return to Work: A Cut-Off of FIM Gain with Montebello Rehabilitation Factor Score in Order to Identify Predictive Factors in Subjects with Acquired Brain Injury.

    Directory of Open Access Journals (Sweden)

    Marco Franceschini

    Full Text Available Return to work (RTW for people with acquired brain injury (ABI represents a main objective of rehabilitation: this work presents a strong correlation between personal well-being and quality of life. The aim of this study is to investigate the prognostic factors that can predict RTW after ABI (traumatic or non- traumatic aetiology in patients without disorders of consciousness (e.g. coma, vegetative or minimally conscious state at the beginning of their admission to rehabilitation. At the end of a 6-month follow-up after discharge, data were successfully collected in 69 patients. The rehabilitation effectiveness (functional Recovery between admission and discharge was assessed by Functional Independent Measure (FIM gain, through the Montebello Rehabilitation Factor Score (MRFS, which was obtained as follows: (discharge FIM-admission FIM/(Maximum possible FIM-Admission FIM x 100. The cut-off value (criterion deriving from MRFS, which helped identify RTW patients, resulted in .659 (sn 88.9%; sp 52.4%. Considering the Mini Mental State Examination (MMSE and the MRFS data, the multivariable binary logistic regression analysis presented 62.96% of correct RTW classification cases, 80.95% of non-RTW leading to an overall satisfactory predictability of 73.91%. The results of the present study suggest that occupational therapy intervention could modify cut-off in patients with an MFRS close to target at the end of an in-hospital rehabilitative program thus developing their capabilities and consequently surpassing cut-off itself.

  10. Regression Equations for Birth Weight Estimation using ...

    African Journals Online (AJOL)

    In this study, Birth Weight has been estimated from anthropometric measurements of hand and foot. Linear regression equations were formed from each of the measured variables. These simple equations can be used to estimate Birth Weight of new born babies, in order to identify those with low birth weight and referred to ...

  11. Evaluating predictive models for solar energy growth in the US states and identifying the key drivers

    Science.gov (United States)

    Chakraborty, Joheen; Banerji, Sugata

    2018-03-01

    Driven by a desire to control climate change and reduce the dependence on fossil fuels, governments around the world are increasing the adoption of renewable energy sources. However, among the US states, we observe a wide disparity in renewable penetration. In this study, we have identified and cleaned over a dozen datasets representing solar energy penetration in each US state, and the potentially relevant socioeconomic and other factors that may be driving the growth in solar. We have applied a number of predictive modeling approaches - including machine learning and regression - on these datasets over a 17-year period and evaluated the relative performance of the models. Our goals were: (1) identify the most important factors that are driving the growth in solar, (2) choose the most effective predictive modeling technique for solar growth, and (3) develop a model for predicting next year’s solar growth using this year’s data. We obtained very promising results with random forests (about 90% efficacy) and varying degrees of success with support vector machines and regression techniques (linear, polynomial, ridge). We also identified states with solar growth slower than expected and representing a potential for stronger growth in future.

  12. A review of a priori regression models for warfarin maintenance dose prediction.

    Directory of Open Access Journals (Sweden)

    Ben Francis

    Full Text Available A number of a priori warfarin dosing algorithms, derived using linear regression methods, have been proposed. Although these dosing algorithms may have been validated using patients derived from the same centre, rarely have they been validated using a patient cohort recruited from another centre. In order to undertake external validation, two cohorts were utilised. One cohort formed by patients from a prospective trial and the second formed by patients in the control arm of the EU-PACT trial. Of these, 641 patients were identified as having attained stable dosing and formed the dataset used for validation. Predicted maintenance doses from six criterion fulfilling regression models were then compared to individual patient stable warfarin dose. Predictive ability was assessed with reference to several statistics including the R-square and mean absolute error. The six regression models explained different amounts of variability in the stable maintenance warfarin dose requirements of the patients in the two validation cohorts; adjusted R-squared values ranged from 24.2% to 68.6%. An overview of the summary statistics demonstrated that no one dosing algorithm could be considered optimal. The larger validation cohort from the prospective trial produced more consistent statistics across the six dosing algorithms. The study found that all the regression models performed worse in the validation cohort when compared to the derivation cohort. Further, there was little difference between regression models that contained pharmacogenetic coefficients and algorithms containing just non-pharmacogenetic coefficients. The inconsistency of results between the validation cohorts suggests that unaccounted population specific factors cause variability in dosing algorithm performance. Better methods for dosing that take into account inter- and intra-individual variability, at the initiation and maintenance phases of warfarin treatment, are needed.

  13. A review of a priori regression models for warfarin maintenance dose prediction.

    Science.gov (United States)

    Francis, Ben; Lane, Steven; Pirmohamed, Munir; Jorgensen, Andrea

    2014-01-01

    A number of a priori warfarin dosing algorithms, derived using linear regression methods, have been proposed. Although these dosing algorithms may have been validated using patients derived from the same centre, rarely have they been validated using a patient cohort recruited from another centre. In order to undertake external validation, two cohorts were utilised. One cohort formed by patients from a prospective trial and the second formed by patients in the control arm of the EU-PACT trial. Of these, 641 patients were identified as having attained stable dosing and formed the dataset used for validation. Predicted maintenance doses from six criterion fulfilling regression models were then compared to individual patient stable warfarin dose. Predictive ability was assessed with reference to several statistics including the R-square and mean absolute error. The six regression models explained different amounts of variability in the stable maintenance warfarin dose requirements of the patients in the two validation cohorts; adjusted R-squared values ranged from 24.2% to 68.6%. An overview of the summary statistics demonstrated that no one dosing algorithm could be considered optimal. The larger validation cohort from the prospective trial produced more consistent statistics across the six dosing algorithms. The study found that all the regression models performed worse in the validation cohort when compared to the derivation cohort. Further, there was little difference between regression models that contained pharmacogenetic coefficients and algorithms containing just non-pharmacogenetic coefficients. The inconsistency of results between the validation cohorts suggests that unaccounted population specific factors cause variability in dosing algorithm performance. Better methods for dosing that take into account inter- and intra-individual variability, at the initiation and maintenance phases of warfarin treatment, are needed.

  14. Prenatal diagnosis of Caudal Regression Syndrome : a case report

    Directory of Open Access Journals (Sweden)

    Celikaslan Nurgul

    2001-12-01

    Full Text Available Abstract Background Caudal regression is a rare syndrome which has a spectrum of congenital malformations ranging from simple anal atresia to absence of sacral, lumbar and possibly lower thoracic vertebrae, to the most severe form which is known as sirenomelia. Maternal diabetes, genetic predisposition and vascular hypoperfusion have been suggested as possible causative factors. Case presentation We report a case of caudal regression syndrome diagnosed in utero at 22 weeks' of gestation. Prenatal ultrasound examination revealed a sudden interruption of the spine and "frog-like" position of lower limbs. Termination of pregnancy and autopsy findings confirmed the diagnosis. Conclusion Prenatal ultrasonographic diagnosis of caudal regression syndrome is possible at 22 weeks' of gestation by ultrasound examination.

  15. Linear regression analysis: part 14 of a series on evaluation of scientific publications.

    Science.gov (United States)

    Schneider, Astrid; Hommel, Gerhard; Blettner, Maria

    2010-11-01

    Regression analysis is an important statistical method for the analysis of medical data. It enables the identification and characterization of relationships among multiple factors. It also enables the identification of prognostically relevant risk factors and the calculation of risk scores for individual prognostication. This article is based on selected textbooks of statistics, a selective review of the literature, and our own experience. After a brief introduction of the uni- and multivariable regression models, illustrative examples are given to explain what the important considerations are before a regression analysis is performed, and how the results should be interpreted. The reader should then be able to judge whether the method has been used correctly and interpret the results appropriately. The performance and interpretation of linear regression analysis are subject to a variety of pitfalls, which are discussed here in detail. The reader is made aware of common errors of interpretation through practical examples. Both the opportunities for applying linear regression analysis and its limitations are presented.

  16. Applying psychological theories to evidence-based clinical practice: identifying factors predictive of placing preventive fissure sealants.

    Science.gov (United States)

    Bonetti, Debbie; Johnston, Marie; Clarkson, Jan E; Grimshaw, Jeremy; Pitts, Nigel B; Eccles, Martin; Steen, Nick; Thomas, Ruth; Maclennan, Graeme; Glidewell, Liz; Walker, Anne

    2010-04-08

    Psychological models are used to understand and predict behaviour in a wide range of settings, but have not been consistently applied to health professional behaviours, and the contribution of differing theories is not clear. This study explored the usefulness of a range of models to predict an evidence-based behaviour -- the placing of fissure sealants. Measures were collected by postal questionnaire from a random sample of general dental practitioners (GDPs) in Scotland. Outcomes were behavioural simulation (scenario decision-making), and behavioural intention. Predictor variables were from the Theory of Planned Behaviour (TPB), Social Cognitive Theory (SCT), Common Sense Self-regulation Model (CS-SRM), Operant Learning Theory (OLT), Implementation Intention (II), Stage Model, and knowledge (a non-theoretical construct). Multiple regression analysis was used to examine the predictive value of each theoretical model individually. Significant constructs from all theories were then entered into a 'cross theory' stepwise regression analysis to investigate their combined predictive value. Behavioural simulation - theory level variance explained was: TPB 31%; SCT 29%; II 7%; OLT 30%. Neither CS-SRM nor stage explained significant variance. In the cross theory analysis, habit (OLT), timeline acute (CS-SRM), and outcome expectancy (SCT) entered the equation, together explaining 38% of the variance. Behavioural intention - theory level variance explained was: TPB 30%; SCT 24%; OLT 58%, CS-SRM 27%. GDPs in the action stage had significantly higher intention to place fissure sealants. In the cross theory analysis, habit (OLT) and attitude (TPB) entered the equation, together explaining 68% of the variance in intention. The study provides evidence that psychological models can be useful in understanding and predicting clinical behaviour. Taking a theory-based approach enables the creation of a replicable methodology for identifying factors that may predict clinical behaviour

  17. Logistic Regression in the Identification of Hazards in Construction

    Science.gov (United States)

    Drozd, Wojciech

    2017-10-01

    The construction site and its elements create circumstances that are conducive to the formation of risks to safety during the execution of works. Analysis indicates the critical importance of these factors in the set of characteristics that describe the causes of accidents in the construction industry. This article attempts to analyse the characteristics related to the construction site, in order to indicate their importance in defining the circumstances of accidents at work. The study includes sites inspected in 2014 - 2016 by the employees of the District Labour Inspectorate in Krakow (Poland). The analysed set of detailed (disaggregated) data includes both quantitative and qualitative characteristics. The substantive task focused on classification modelling in the identification of hazards in construction and identifying those of the analysed characteristics that are important in an accident. In terms of methodology, resource data analysis using statistical classifiers, in the form of logistic regression, was the method used.

  18. Comparing the index-flood and multiple-regression methods using L-moments

    Science.gov (United States)

    Malekinezhad, H.; Nachtnebel, H. P.; Klik, A.

    In arid and semi-arid regions, the length of records is usually too short to ensure reliable quantile estimates. Comparing index-flood and multiple-regression analyses based on L-moments was the main objective of this study. Factor analysis was applied to determine main influencing variables on flood magnitude. Ward’s cluster and L-moments approaches were applied to several sites in the Namak-Lake basin in central Iran to delineate homogeneous regions based on site characteristics. Homogeneity test was done using L-moments-based measures. Several distributions were fitted to the regional flood data and index-flood and multiple-regression methods as two regional flood frequency methods were compared. The results of factor analysis showed that length of main waterway, compactness coefficient, mean annual precipitation, and mean annual temperature were the main variables affecting flood magnitude. The study area was divided into three regions based on the Ward’s method of clustering approach. The homogeneity test based on L-moments showed that all three regions were acceptably homogeneous. Five distributions were fitted to the annual peak flood data of three homogeneous regions. Using the L-moment ratios and the Z-statistic criteria, GEV distribution was identified as the most robust distribution among five candidate distributions for all the proposed sub-regions of the study area, and in general, it was concluded that the generalised extreme value distribution was the best-fit distribution for every three regions. The relative root mean square error (RRMSE) measure was applied for evaluating the performance of the index-flood and multiple-regression methods in comparison with the curve fitting (plotting position) method. In general, index-flood method gives more reliable estimations for various flood magnitudes of different recurrence intervals. Therefore, this method should be adopted as regional flood frequency method for the study area and the Namak-Lake basin

  19. Identifying environmental risk factors and mapping the risk of human West Nile virus in South Dakota.

    Science.gov (United States)

    Hess, A.; Davis, J. K.; Wimberly, M. C.

    2017-12-01

    Human West Nile virus (WNV) first arrived in the USA in 1999 and has since then spread across the country. Today, the highest incidence rates are found in the state of South Dakota. The disease occurrence depends on the complex interaction between the mosquito vector, the bird host and the dead-end human host. Understanding the spatial domain of this interaction and being able to identify disease transmission hotspots is crucial for effective disease prevention and mosquito control. In this study we use geospatial environmental information to understand what drives the spatial distribution of cases of human West Nile virus in South Dakota and to map relative infection risk across the state. To map the risk of human West Nile virus in South Dakota, we used geocoded human case data from the years 2004-2016. Satellite data from the Landsat ETM+ and MODIS for the years 2003 to 2016 were used to characterize environmental patterns. From these datasets we calculated indices, such as the normalized differenced vegetation index (NDVI) and the normalized differenced water index (NDWI). In addition, datasets such as the National Land Data Assimilation System (NLDAS), National Land Cover Dataset (NLCD), National Wetland inventory (NWI), National Elevation Dataset (NED) and Soil Survey Geographic Database (SSURGO) were utilized. Environmental variables were summarized for a buffer zone around the case and control points. We used a boosted regression tree model to identify the most important variables describing the risk of WNV infection. We generated a risk map by applying this model across the entire state. We found that the highest relative risk is present in the James River valley in northeastern South Dakota. Factors that were identified as influencing the transmission risk include inter-annual variability of vegetation cover, water availability and temperature. Land covers such as grasslands, low developed areas and wetlands were also found to be good predictors for human

  20. Multicollinearity in Regression Analyses Conducted in Epidemiologic Studies

    OpenAIRE

    Vatcheva, Kristina P.; Lee, MinJae; McCormick, Joseph B.; Rahbar, Mohammad H.

    2016-01-01

    The adverse impact of ignoring multicollinearity on findings and data interpretation in regression analysis is very well documented in the statistical literature. The failure to identify and report multicollinearity could result in misleading interpretations of the results. A review of epidemiological literature in PubMed from January 2004 to December 2013, illustrated the need for a greater attention to identifying and minimizing the effect of multicollinearity in analysis of data from epide...

  1. Patient and carer identified factors which contribute to safety incidents in primary care: a qualitative study.

    Science.gov (United States)

    Hernan, Andrea L; Giles, Sally J; Fuller, Jeffrey; Johnson, Julie K; Walker, Christine; Dunbar, James A

    2015-09-01

    Patients can have an important role in reducing harm in primary-care settings. Learning from patient experience and feedback could improve patient safety. Evidence that captures patients' views of the various contributory factors to creating safe primary care is largely absent. The aim of this study was to address this evidence gap. Four focus groups and eight semistructured interviews were conducted with 34 patients and carers from south-east Australia. Participants were asked to describe their experiences of primary care. Audio recordings were transcribed verbatim and specific factors that contribute to safety incidents were identified in the analysis using the Yorkshire Contributory Factors Framework (YCFF). Other factors emerging from the data were also ascertained and added to the analytical framework. Thirteen factors that contribute to safety incidents in primary care were ascertained. Five unique factors for the primary-care setting were discovered in conjunction with eight factors present in the YCFF from hospital settings. The five unique primary care contributing factors to safety incidents represented a range of levels within the primary-care system from local working conditions to the upstream organisational level and the external policy context. The 13 factors included communication, access, patient factors, external policy context, dignity and respect, primary-secondary interface, continuity of care, task performance, task characteristics, time in the consultation, safety culture, team factors and the physical environment. Patient and carer feedback of this type could help primary-care professionals better understand and identify potential safety concerns and make appropriate service improvements. The comprehensive range of factors identified provides the groundwork for developing tools that systematically capture the multiple contributory factors to patient safety. Published by the BMJ Publishing Group Limited. For permission to use (where not

  2. A comparison of non-homogeneous Markov regression models with application to Alzheimer’s disease progression

    Science.gov (United States)

    Hubbard, R. A.; Zhou, X.H.

    2011-01-01

    Markov regression models are useful tools for estimating the impact of risk factors on rates of transition between multiple disease states. Alzheimer’s disease (AD) is an example of a multi-state disease process in which great interest lies in identifying risk factors for transition. In this context, non-homogeneous models are required because transition rates change as subjects age. In this report we propose a non-homogeneous Markov regression model that allows for reversible and recurrent disease states, transitions among multiple states between observations, and unequally spaced observation times. We conducted simulation studies to demonstrate performance of estimators for covariate effects from this model and compare performance with alternative models when the underlying non-homogeneous process was correctly specified and under model misspecification. In simulation studies, we found that covariate effects were biased if non-homogeneity of the disease process was not accounted for. However, estimates from non-homogeneous models were robust to misspecification of the form of the non-homogeneity. We used our model to estimate risk factors for transition to mild cognitive impairment (MCI) and AD in a longitudinal study of subjects included in the National Alzheimer’s Coordinating Center’s Uniform Data Set. Using our model, we found that subjects with MCI affecting multiple cognitive domains were significantly less likely to revert to normal cognition. PMID:22419833

  3. [Determination of priority unfavorable environmental factors].

    Science.gov (United States)

    Zaikova, Z A; Burdukovskaya, A V; Belykh, A I

    In the Irkutsk region there are recorded high indices of rates of morbidity, disability, mortality rate of the working-age population and low levels of life expectancy of the population, that is confirmed by ranking position levels among the all subjects of the Russian Federation. According to all mentioned indices of health the region is inside the top ten unfavorable regions of Russia. In relation to the problem in the state of health of the adult population the estimation of the causal relationships between environmental factors and certain health indices is actual. The list of studiedfactors included health indices that characterize the harmful working conditions of the working population and basic socioeconomic indices in the region. Estimation of causal-relationship relationships was performed with the use of methods of multivariate analysis - correlation and multiple linear regression. In the selection offactors for the construction of mathematical models of multiple regression there were used methods of the analysis of variables variability, pair correlation coefficients matrix and sequential switching covariates to eliminate the problems of multicollinearity, pre-standardization of indices for the elevation of the numerical stability of regression analysis algorithm. As a result of the execution of the analysis there were constructed statistical models for the dependence in the system variables “environment - public health”, which allowed to identify the most informative regression models for the adult population health according to indices of primary disability of the population, the mortality rate and life expectancy of the working age population. According to results of the analysis there were identified priority factors affecting on the health of the adult population of the Irkutsk region. To these factors there are referred the proportion of workplaces failing to meet sanitary standards for vibration and 8 socio-economic indices of living

  4. Combination of supervised and semi-supervised regression models for improved unbiased estimation

    DEFF Research Database (Denmark)

    Arenas-Garía, Jeronimo; Moriana-Varo, Carlos; Larsen, Jan

    2010-01-01

    In this paper we investigate the steady-state performance of semisupervised regression models adjusted using a modified RLS-like algorithm, identifying the situations where the new algorithm is expected to outperform standard RLS. By using an adaptive combination of the supervised and semisupervi......In this paper we investigate the steady-state performance of semisupervised regression models adjusted using a modified RLS-like algorithm, identifying the situations where the new algorithm is expected to outperform standard RLS. By using an adaptive combination of the supervised...

  5. Statistical and regression analyses of detected extrasolar systems

    Czech Academy of Sciences Publication Activity Database

    Pintr, Pavel; Peřinová, V.; Lukš, A.; Pathak, A.

    2013-01-01

    Roč. 75, č. 1 (2013), s. 37-45 ISSN 0032-0633 Institutional support: RVO:61389021 Keywords : Exoplanets * Kepler candidates * Regression analysis Subject RIV: BN - Astronomy, Celestial Mechanics, Astrophysics Impact factor: 1.630, year: 2013 http://www.sciencedirect.com/science/article/pii/S0032063312003066

  6. Identifying risk sources of air contamination by polycyclic aromatic hydrocarbons.

    Science.gov (United States)

    Huzlik, Jiri; Bozek, Frantisek; Pawelczyk, Adam; Licbinsky, Roman; Naplavova, Magdalena; Pondelicek, Michael

    2017-09-01

    This article is directed to determining concentrations of polycyclic aromatic hydrocarbons (PAHs), which are sorbed to solid particles in the air. Pollution sources were identified on the basis of the ratio of benzo[ghi]perylene (BghiPe) to benzo[a]pyrene (BaP). Because various important information is lost by determining the simple ratio of concentrations, least squares linear regression (classic ordinary least squares regression), reduced major axis, orthogonal regression, and Kendall-Theil robust diagnostics were utilized for identification. Statistical evaluation using all aforementioned methods demonstrated different ratios of the monitored PAHs in the intervals examined during warmer and colder periods. Analogous outputs were provided by comparing gradients of the emission factors acquired from the measured concentrations of BghiPe and BaP in motor vehicle exhaust gases. Based on these outputs, it was possible plausibly to state that the influence of burning organic fuels in heating stoves is prevalent in colder periods whereas in warmer periods transport was the exclusive source because other sources of PAH emissions were not found in the examined locations. Copyright © 2017 Elsevier Ltd. All rights reserved.

  7. Using the community pharmacy to identify patients at risk of poor asthma control and factors which contribute to this poor control.

    Science.gov (United States)

    Armour, Carol L; Lemay, Kate; Saini, Bandana; Reddel, Helen K; Bosnic-Anticevich, Sinthia Z; Smith, Lorraine D; Burton, Deborah; Song, Yun Ju Christine; Alles, Marie Chehani; Stewart, Kay; Emmerton, Lynne; Krass, Ines

    2011-11-01

    Although asthma can be well controlled by appropriate medication delivered in an appropriate way at an appropriate time, there is evidence that management is often suboptimal. This results in poor asthma control, poor quality of life, and significant morbidity. The objective of this study was to describe a population recruited in community pharmacy identified by trained community pharmacists as being at risk for poor asthma outcomes and to identify factors associated with poor asthma control. It used a cross-sectional design in 96 pharmacies in metropolitan and regional New South Wales, Victoria, Queensland, and Australian Capital Territory in Australia. Community pharmacists with specialized asthma training enrolled 570 patients aged ≥18 years with doctor-diagnosed asthma who were considered at risk of poor asthma outcomes and then conducted a comprehensive asthma assessment. In this assessment, asthma control was classified using a symptom and activity tool based on self-reported frequency of symptoms during the previous month and categorized as poor, fair, or good. Asthma history was discussed, and lung function and inhaler technique were also assessed by the pharmacist. Medication use/adherence was recorded from both pharmacy records and the Brief Medication Questionnaire (BMQ). The symptom and activity tool identified that 437 (77%) recruited patients had poor asthma control. Of the 570 patients, 117 (21%) smoked, 108 (19%) had an action plan, 372 (69%) used combination of inhaled corticosteroid (ICS)/long-acting β(2)-agonist (LABA) medications, and only 17-28% (depending on device) used their inhaler device correctly. In terms of adherence, 90% had their ICS or ICS/LABA dispensed <6 times in the previous 6 months, which is inconsistent with regular use; this low adherence was confirmed from the BMQ scores. A logistic regression model showed that patients who smoked had incorrect inhaler technique or low adherence (assessed by either dispensing history or

  8. Directional quantile regression in Octave (and MATLAB)

    Czech Academy of Sciences Publication Activity Database

    Boček, Pavel; Šiman, Miroslav

    2016-01-01

    Roč. 52, č. 1 (2016), s. 28-51 ISSN 0023-5954 R&D Projects: GA ČR GA14-07234S Institutional support: RVO:67985556 Keywords : quantile regression * multivariate quantile * depth contour * Matlab Subject RIV: IN - Informatics, Computer Science Impact factor: 0.379, year: 2016 http://library.utia.cas.cz/separaty/2016/SI/bocek-0458380.pdf

  9. Risk factors for inadequate TB case finding in Rural Western Kenya: a comparison of actively and passively identified TB patients.

    Directory of Open Access Journals (Sweden)

    Anna H Van't Hoog

    Full Text Available The findings of a prevalence survey conducted in western Kenya, in a population with 14.9% HIV prevalence suggested inadequate case finding. We found a high burden of infectious and largely undiagnosed pulmonary tuberculosis (PTB, that a quarter of the prevalent cases had not yet sought care, and a low case detection rate.We aimed to identify factors associated with inadequate case finding among adults with PTB in this population by comparing characteristics of 194 PTB patients diagnosed in a health facility after self-report, i.e., through passive case detection, with 88 patients identified through active case detection during the prevalence survey. We examined associations between method of case detection and patient characteristics, including HIV-status, socio-demographic variables and disease severity in univariable and multivariable logistic regression analyses.HIV-infection was associated with faster passive case detection in univariable analysis (crude OR 3.5, 95% confidence interval (CI 2.0-5.9, but in multivariable logistic regression this was largely explained by the presence of cough, illness and clinically diagnosed smear-negative TB (adjusted OR (aOR HIV 1.8, 95% CI 0.85-3.7. Among the HIV-uninfected passive case detection was less successful in older patients aOR 0.76, 95%CI 0.60-0.97 per 10 years increase, and women (aOR 0.27, 95%CI 0.10-0.73. Reported current or past alcohol use reduced passive case detection in both groups (0.42, 95% CI 0.23-0.79. Among smear-positive patients median durations of cough were 4.0 and 6.9 months in HIV-infected and uninfected patients, respectively.HIV-uninfected patients with infectious TB who were older, female, relatively less ill, or had a cough of a shorter duration were less likely found through passive case detection. In addition to intensified case finding in HIV-infected persons, increasing the suspicion of TB among HIV-uninfected women and the elderly are needed to improve TB case

  10. Identifying Contextual Factors of Employee Satisfaction of Performance Management at a Thai State Enterprise

    Directory of Open Access Journals (Sweden)

    Molraudee Saratun

    2013-11-01

    Full Text Available Normal 0 false false false IN X-NONE X-NONE MicrosoftInternetExplorer4 Although there has been an increase in Performance Management (PM literature over the years arguing that PM perceptions are likely to be a function of PM process components and contextual factors, the actual relationship between the contextual factors and employee satisfaction of PM remains little explored.  Extending previous research, this study examines relationships between contextual factors and employees’ PM satisfaction.  Derived from the literature, these contextual factors are motivation and empowerment of employees, role conflict, role ambiguity, perceived organisational support, procedural justice and distributive justice.  Seven directional hypotheses are tested accordingly through a series of regression analyses.  This article finds that these contextual factors, with the exception of role conflict, are directly predictive of enhanced employees’ PM satisfaction at the Thai state enterprise. Keywords: Performance management, contextual factors, performance management satisfaction, public organisations, Thailand. /* Style Definitions */ table.MsoNormalTable {mso-style-name:"Table Normal"; mso-tstyle-rowband-size:0; mso-tstyle-colband-size:0; mso-style-noshow:yes; mso-style-priority:99; mso-style-qformat:yes; mso-style-parent:""; mso-padding-alt:0cm 5.4pt 0cm 5.4pt; mso-para-margin:0cm; mso-para-margin-bottom:.0001pt; mso-pagination:widow-orphan; font-size:11.0pt; font-family:"Calibri","sans-serif"; mso-ascii-font-family:Calibri; mso-ascii-theme-font:minor-latin; mso-fareast-font-family:"Times New Roman"; mso-fareast-theme-font:minor-fareast; mso-hansi-font-family:Calibri; mso-hansi-theme-font:minor-latin; mso-bidi-font-family:"Times New Roman"; mso-bidi-theme-font:minor-bidi;}

  11. Selecting the correct weighting factors for linear and quadratic calibration curves with least-squares regression algorithm in bioanalytical LC-MS/MS assays and impacts of using incorrect weighting factors on curve stability, data quality, and assay performance.

    Science.gov (United States)

    Gu, Huidong; Liu, Guowen; Wang, Jian; Aubry, Anne-Françoise; Arnold, Mark E

    2014-09-16

    A simple procedure for selecting the correct weighting factors for linear and quadratic calibration curves with least-squares regression algorithm in bioanalytical LC-MS/MS assays is reported. The correct weighting factor is determined by the relationship between the standard deviation of instrument responses (σ) and the concentrations (x). The weighting factor of 1, 1/x, or 1/x(2) should be selected if, over the entire concentration range, σ is a constant, σ(2) is proportional to x, or σ is proportional to x, respectively. For the first time, we demonstrated with detailed scientific reasoning, solid historical data, and convincing justification that 1/x(2) should always be used as the weighting factor for all bioanalytical LC-MS/MS assays. The impacts of using incorrect weighting factors on curve stability, data quality, and assay performance were thoroughly investigated. It was found that the most stable curve could be obtained when the correct weighting factor was used, whereas other curves using incorrect weighting factors were unstable. It was also found that there was a very insignificant impact on the concentrations reported with calibration curves using incorrect weighting factors as the concentrations were always reported with the passing curves which actually overlapped with or were very close to the curves using the correct weighting factor. However, the use of incorrect weighting factors did impact the assay performance significantly. Finally, the difference between the weighting factors of 1/x(2) and 1/y(2) was discussed. All of the findings can be generalized and applied into other quantitative analysis techniques using calibration curves with weighted least-squares regression algorithm.

  12. Gender Differences in Risk Factors for Single and Recurrent Falls Among the Community-Dwelling Elderly

    OpenAIRE

    Yu Mei O; Fatima El Fakiri

    2015-01-01

    The purpose of this study was to identify gender differences in risk factors of fall accidents among older people, and whether these factors differ between single and recurrent fallers. A total of 4,426 individuals aged ≥65 years from two large-scale health surveys provided data. Logistic regression analyses were used to identify risk factors and to determine the risk model for falling and recurrent falling in men and ...

  13. Factors associated with smoking cessation in Brazil

    OpenAIRE

    Cesar Augusto Oviedo Tejada; Fernanda Ewerling; Anderson Moreira Aristides dos Santos; Andréa Dâmaso Bertoldi; Ana Maria Menezes

    2013-01-01

    Tobacco has been identified as the drug with the highest addiction rate and the leading cause of avoidable deaths. The current study thus aimed to identify the determinants of smoking cessation in a Brazilian population sample based on data from the National Household Sample Survey for 2008. The study analyzed socioeconomic, residential, and health-related data as well as individual habits. Data analysis used Poisson regression. The following factors were associated with smoking cessation: ag...

  14. QRank: a novel quantile regression tool for eQTL discovery.

    Science.gov (United States)

    Song, Xiaoyu; Li, Gen; Zhou, Zhenwei; Wang, Xianling; Ionita-Laza, Iuliana; Wei, Ying

    2017-07-15

    Over the past decade, there has been a remarkable improvement in our understanding of the role of genetic variation in complex human diseases, especially via genome-wide association studies. However, the underlying molecular mechanisms are still poorly characterized, impending the development of therapeutic interventions. Identifying genetic variants that influence the expression level of a gene, i.e. expression quantitative trait loci (eQTLs), can help us understand how genetic variants influence traits at the molecular level. While most eQTL studies focus on identifying mean effects on gene expression using linear regression, evidence suggests that genetic variation can impact the entire distribution of the expression level. Motivated by the potential higher order associations, several studies investigated variance eQTLs. In this paper, we develop a Quantile Rank-score based test (QRank), which provides an easy way to identify eQTLs that are associated with the conditional quantile functions of gene expression. We have applied the proposed QRank to the Genotype-Tissue Expression project, an international tissue bank for studying the relationship between genetic variation and gene expression in human tissues, and found that the proposed QRank complements the existing methods, and identifies new eQTLs with heterogeneous effects across different quantile levels. Notably, we show that the eQTLs identified by QRank but missed by linear regression are associated with greater enrichment in genome-wide significant SNPs from the GWAS catalog, and are also more likely to be tissue specific than eQTLs identified by linear regression. An R package is available on R CRAN at https://cran.r-project.org/web/packages/QRank . xs2148@cumc.columbia.edu. Supplementary data are available at Bioinformatics online. © The Author (2017). Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com

  15. Identifying Watershed, Landscape, and Engineering Design Factors that Influence the Biotic Condition of Restored Streams

    Directory of Open Access Journals (Sweden)

    Barbara Doll

    2016-04-01

    Full Text Available Restored stream reaches at 79 sites across North Carolina were sampled for aquatic macroinvertebrates using a rapid bioassessment protocol. Morphological design parameters and geographic factors, including watershed and landscape parameters (e.g., valley slope, substrate, were also compiled for these streams. Principal component regression analyses revealed correlations between design and landscape variables with macroinvertebrate metrics. The correlations were strengthened by adding watershed variables. Ridge regression was used to find the best-fit model for predicting dominant taxa from the “pollution sensitive” orders of Ephemeroptera (mayflies, Plecoptera (stoneflies, and Trichoptera (caddisflies, or EPT taxa, resulting in coefficient weights that were most interpretable relative to site selection and design parameters. Results indicate that larger (wider streams located in the mountains and foothills where there are steeper valleys, larger substrate, and undeveloped watersheds are expected to have higher numbers of dominant EPT taxa. In addition, EPT taxa numbers are positively correlated with accessible floodplain width and negatively correlated with width-to-depth ratio and sinuosity. This study indicates that both site selection and design should be carefully considered in order to maximize the resulting biotic condition and associated potential ecological uplift of the stream.

  16. What factors facilitate regulatory competence in supervising the safety of nuclear technology?

    International Nuclear Information System (INIS)

    Mishar, Marina

    2012-01-01

    The proposed utilization of nuclear energy for electricity generation as the alternative energy source requires Atomic Energy Licensing Board (AELB), the Malaysian nuclear regulatory body taking key role in supervising the safety of the program. This study looked into factors influencing the competency of current AELB human resource as technical competency has been identified as one of the main contributors to the success of a civil nuclear power program. The four quadrant competency model developed by International Atomic Energy Agency was utilized as the required competency. A comprehensive study on 81 personnel from five states in different geographic regions of the country were carried out to investigate the impact of six factors related to competency (educational level, years of working experience, trainings attended, participation in technical committees, numbers of technical papers written and number of technical presentation presented) on four dependent measures in the areas of regulatory competency (legal basis, technical disciplines, regulatory practices and personal and interpersonal effectiveness). Multiple regression (method enter) identified factors that had significant contribution to level of competency while stepwise regression resulted in identifying predictors to enhance competencies. Results were mixed but each of the independent factors is a predictor to different competencies. This study had identified the best predictors that could significantly contribute to the enhancement of regulatory. (author)

  17. Regression analysis with categorized regression calibrated exposure: some interesting findings

    Directory of Open Access Journals (Sweden)

    Hjartåker Anette

    2006-07-01

    Full Text Available Abstract Background Regression calibration as a method for handling measurement error is becoming increasingly well-known and used in epidemiologic research. However, the standard version of the method is not appropriate for exposure analyzed on a categorical (e.g. quintile scale, an approach commonly used in epidemiologic studies. A tempting solution could then be to use the predicted continuous exposure obtained through the regression calibration method and treat it as an approximation to the true exposure, that is, include the categorized calibrated exposure in the main regression analysis. Methods We use semi-analytical calculations and simulations to evaluate the performance of the proposed approach compared to the naive approach of not correcting for measurement error, in situations where analyses are performed on quintile scale and when incorporating the original scale into the categorical variables, respectively. We also present analyses of real data, containing measures of folate intake and depression, from the Norwegian Women and Cancer study (NOWAC. Results In cases where extra information is available through replicated measurements and not validation data, regression calibration does not maintain important qualities of the true exposure distribution, thus estimates of variance and percentiles can be severely biased. We show that the outlined approach maintains much, in some cases all, of the misclassification found in the observed exposure. For that reason, regression analysis with the corrected variable included on a categorical scale is still biased. In some cases the corrected estimates are analytically equal to those obtained by the naive approach. Regression calibration is however vastly superior to the naive method when applying the medians of each category in the analysis. Conclusion Regression calibration in its most well-known form is not appropriate for measurement error correction when the exposure is analyzed on a

  18. Identifying Critical Factors Influencing the Rents of Public Rental Housing Delivery by PPPs: The Case of Nanjing

    Directory of Open Access Journals (Sweden)

    Jingfeng Yuan

    2017-02-01

    Full Text Available The occupancy rate of Public Rental Housing (PRH in China is relatively low due to the unreasonable rents. At the same time, the development of PRH using Public Private Partnerships (PPPs increases the complexity of the rents. Therefore, the critical factors influencing the rents of PRH delivery by PPPs should be identified. Based on the comprehensive literature, this article identified a conceptual model for the factors influencing the rents of PRH delivery by PPPs in China, composed of 14 factors grouped in three factor packages, and discussed the relationships among three factor packages. A survey based on Nanjing was conducted to assess the relative significance of 14 factors. According to the results, six critical factors were identified: construction costs, household income, floor area and structure, transportation, market rents in the same district and public facilities. In addition, the proposed conceptual model had a good fit. The results also supported two hypothetical relationships among three factor packages: (1 the increase of the affordability of the target tenants had a positive effect on the increase of profits of private sectors; and (2 the increase of the affordability of the target tenants had a positive effect on the increase of level of the characteristics of PRH units. For future research, six critical factors and the relationships among three factor packages can be used to determine the reasonable rents for PRH delivery by PPPs in China.

  19. Study of risk factors affecting both hypertension and obesity outcome by using multivariate multilevel logistic regression models

    Directory of Open Access Journals (Sweden)

    Sepedeh Gholizadeh

    2016-07-01

    Full Text Available Background:Obesity and hypertension are the most important non-communicable diseases thatin many studies, the prevalence and their risk factors have been performedin each geographic region univariately.Study of factors affecting both obesity and hypertension may have an important role which to be adrressed in this study. Materials &Methods:This cross-sectional study was conducted on 1000 men aged 20-70 living in Bushehr province. Blood pressure was measured three times and the average of them was considered as one of the response variables. Hypertension was defined as systolic blood pressure ≥140 (and-or diastolic blood pressure ≥90 and obesity was defined as body mass index ≥25. Data was analyzed by using multilevel, multivariate logistic regression model by MlwiNsoftware. Results:Intra class correlations in cluster level obtained 33% for high blood pressure and 37% for obesity, so two level model was fitted to data. The prevalence of obesity and hypertension obtained 43.6% (0.95%CI; 40.6-46.5, 29.4% (0.95%CI; 26.6-32.1 respectively. Age, gender, smoking, hyperlipidemia, diabetes, fruit and vegetable consumption and physical activity were the factors affecting blood pressure (p≤0.05. Age, gender, hyperlipidemia, diabetes, fruit and vegetable consumption, physical activity and place of residence are effective on obesity (p≤0.05. Conclusion: The multilevel models with considering levels distribution provide more precise estimates. As regards obesity and hypertension are the major risk factors for cardiovascular disease, by knowing the high-risk groups we can d careful planning to prevention of non-communicable diseases and promotion of society health.

  20. [Multiple linear regression and ROC curve analysis of the factors of lumbar spine bone mineral density].

    Science.gov (United States)

    Zhang, Xiaodong; Zhao, Yinxia; Hu, Shaoyong; Hao, Shuai; Yan, Jiewen; Zhang, Lingyan; Zhao, Jing; Li, Shaolin

    2015-09-01

    To investigate the correlation between the lumbar vertebra bone mineral density (BMD) and age, gender, height, weight, body mass index, waistline, hipline, bone marrow and abdomen fat, and to explore the key factor affecting the BMD. A total of 72 cases were randomly recruited. All the subjects underwent a spectroscopic examination of the third lumber vertebra with single-voxel method in 1.5T MR. Lipid fractions (FF%) were measured. Quantitative CT were also performed to get the BMD of L3 and the corresponding abdomen subcutaneous adipose tissue (SAT) and visceral adipose tissue (VAT). The statistical analysis were performed by SPSS 19.0. Multiple linear regression showed except the age and FF% showed significant difference (P0.05). The correlation of age and FF% with BMD was statistically negatively significant (r=-0.830, -0.521, P<0.05). The ROC curve analysis showed that the sensitivety and specificity of predicting osteoporosis were 81.8% and 86.9%, with a threshold of 58.5 years old. And it showed that the sensitivety and specificity of predicting osteoporosis were 90.9% and 55.7%, with a threshold of 52.8% for FF%. The lumbar vertebra BMD was significantly and negatively correlated with age and bone marrow FF%, but it was not significantly correlated with gender, height, weight, BMI, waistline, hipline, SAT and VAT. And age was the critical factor.

  1. on the determination of risk factors associated with the recovery of ...

    African Journals Online (AJOL)

    DJFLEX

    binary logistic regression models is used to identify some risk factors associated with the recovery process. Result ... Consider the stochastic process { Xn, n = 1,2, } with countable finite values. ... are unknown parameters to be estimated.

  2. Cloning and Characterizing Genes Involved in Monoterpene Induced Mammary Tumor Regression.

    Science.gov (United States)

    1996-10-01

    AD GRANT NUMBER DAMDI7-94-J-4041 TITLE: Cloning and Characterizing Genes Involved in Monoterpene Induced Mammary Tumor Regression PRINCIPAL...October 1996 Annual (1 Sep 95 - 31 Aug 96) 4. TITLE AND SUBTITLE 5. FUNDING NUMBERS Cloning and Characterizing Genes Involved in Monoterpene Induced... Monoterpene -induced/repressed genes were identified in regressing rat mammary carcinomas treated with dietary limonene using a newly developed method

  3. Identifying fallers with Parkinson's disease using home-based tests: who is at risk?

    Science.gov (United States)

    Lim, Inge; van Wegen, Erwin; Jones, Diana; Rochester, Lynn; Nieuwboer, Alice; Willems, Anne Marie; Baker, Katherine; Hetherington, Vicki; Kwakkel, Gert

    2008-12-15

    The objective of this work is to determine risk factors for falling in patients with Parkinson's disease (PD) using home-based assessments and develop a prediction model. Data on falls, balance, gait-related activities, and nonmotor symptoms were obtained from 153 PD patients (Hoehn-Yahr 2-4) in their home. Fifty-one candidate determinants for falling were independently tested using bivariate logistic regression analysis. A multivariate logistic regression model was developed to identify patients susceptible to falls. Sixty-six subjects (43%) were classified as fallers. Eighteen determinants for falling were selected. The final multivariate model showed an accuracy of 74% and included: (1) Freezing of Gait Questionnaire, (2) Timed Get Up and Go (TGUG) score, (3) disease duration, (4) item 15 of the Unified Parkinson's Disease Rating Scale. Based on disease duration, freezing symptoms, walking problems, and a prolonged TGUG duration, assessed in the home situation, it was possible to accurately identify 74% of PD patients as fallers. (c) 2008 Movement Disorder Society.

  4. Factors Influencing Employee Satisfaction in the Police Service: The Case of Slovenia

    OpenAIRE

    Tomaževič, Nina; Seljak, Janko; Aristovnik, Aleksander

    2014-01-01

    The paper has two purposes – first, to examine the dimensionality of employee satisfaction and, second, to identify the impact of the groups of factors on employee satisfaction. The measurement of satisfaction of all employees in the Slovenian Police based on the comprehensive on-line questionnaire. Factor analysis was used to formulate the facets of satisfaction. Multiple linear regression analysis was performed to identify the predictors of three facets of employee satisfaction. Three fac...

  5. Using exploratory factor analysis of FFQ data to identify dietary patterns among Yup'ik people.

    Science.gov (United States)

    Ryman, Tove K; Austin, Melissa A; Hopkins, Scarlett; Philip, Jacques; O'Brien, Diane; Thummel, Kenneth; Boyer, Bert B

    2014-03-01

    An FFQ developed by the Center for Alaska Native Health Research for studies in Yup'ik people includes market foods and subsistence foods such as moose, seal, waterfowl and salmon that may be related to disease risk. Because the FFQ contains >100 food items, we sought to characterize dietary patterns more simply for use in ongoing pharmacogenomics studies. Exploratory factor analysis was used to derive a small number of 'factors' that explain a substantial amount of the variation in the Yup'ik diet. We estimated factor scores and measured associations with demographic characteristics and biomarkers. South-west Alaska, USA. Yup'ik people (n 358) aged ≥18 years. We identified three factors that each accounted for ≥10 % of the common variance: the first characterized by 'processed foods' (e.g. salty snacks, sweetened cereals); the second by 'fruits and vegetables' (e.g. fresh citrus, potato salad); and the third by 'subsistence foods' (seal or walrus soup, non-oily fish). Participants from coastal communities had higher values for the 'subsistence' factor, whereas participants from inland communities had higher values for the 'fruits and vegetables' factor. A biomarker of marine intake, δ 15N, was correlated with the 'subsistence' factor, whereas a biomarker of corn- and sugarcane-based market food intake, δ 13C, was correlated with 'processed foods'. The exploratory factor analysis identified three factors that appeared to reflect dietary patterns among Yup'ik based on associations with participant characteristics and biomarkers. These factors will be useful for chronic disease studies in this population.

  6. Prevalence of and risk factors for retinopathy of prematurity in a ...

    African Journals Online (AJOL)

    Twenty-four previously reported risk factors for the development of ROP were identified for use in a multivariate logistic regression (MLR) analysis. Results. A total of 356 patients were included. The overall prevalence of ROP was 21.8% and that of clinically significant ROP (CSROP) 4.4%. The risk factors with a statistically ...

  7. Remote sensing and GIS-based landslide hazard analysis and cross-validation using multivariate logistic regression model on three test areas in Malaysia

    Science.gov (United States)

    Pradhan, Biswajeet

    2010-05-01

    This paper presents the results of the cross-validation of a multivariate logistic regression model using remote sensing data and GIS for landslide hazard analysis on the Penang, Cameron, and Selangor areas in Malaysia. Landslide locations in the study areas were identified by interpreting aerial photographs and satellite images, supported by field surveys. SPOT 5 and Landsat TM satellite imagery were used to map landcover and vegetation index, respectively. Maps of topography, soil type, lineaments and land cover were constructed from the spatial datasets. Ten factors which influence landslide occurrence, i.e., slope, aspect, curvature, distance from drainage, lithology, distance from lineaments, soil type, landcover, rainfall precipitation, and normalized difference vegetation index (ndvi), were extracted from the spatial database and the logistic regression coefficient of each factor was computed. Then the landslide hazard was analysed using the multivariate logistic regression coefficients derived not only from the data for the respective area but also using the logistic regression coefficients calculated from each of the other two areas (nine hazard maps in all) as a cross-validation of the model. For verification of the model, the results of the analyses were then compared with the field-verified landslide locations. Among the three cases of the application of logistic regression coefficient in the same study area, the case of Selangor based on the Selangor logistic regression coefficients showed the highest accuracy (94%), where as Penang based on the Penang coefficients showed the lowest accuracy (86%). Similarly, among the six cases from the cross application of logistic regression coefficient in other two areas, the case of Selangor based on logistic coefficient of Cameron showed highest (90%) prediction accuracy where as the case of Penang based on the Selangor logistic regression coefficients showed the lowest accuracy (79%). Qualitatively, the cross

  8. Confidence intervals for distinguishing ordinal and disordinal interactions in multiple regression.

    Science.gov (United States)

    Lee, Sunbok; Lei, Man-Kit; Brody, Gene H

    2015-06-01

    Distinguishing between ordinal and disordinal interaction in multiple regression is useful in testing many interesting theoretical hypotheses. Because the distinction is made based on the location of a crossover point of 2 simple regression lines, confidence intervals of the crossover point can be used to distinguish ordinal and disordinal interactions. This study examined 2 factors that need to be considered in constructing confidence intervals of the crossover point: (a) the assumption about the sampling distribution of the crossover point, and (b) the possibility of abnormally wide confidence intervals for the crossover point. A Monte Carlo simulation study was conducted to compare 6 different methods for constructing confidence intervals of the crossover point in terms of the coverage rate, the proportion of true values that fall to the left or right of the confidence intervals, and the average width of the confidence intervals. The methods include the reparameterization, delta, Fieller, basic bootstrap, percentile bootstrap, and bias-corrected accelerated bootstrap methods. The results of our Monte Carlo simulation study suggest that statistical inference using confidence intervals to distinguish ordinal and disordinal interaction requires sample sizes more than 500 to be able to provide sufficiently narrow confidence intervals to identify the location of the crossover point. (c) 2015 APA, all rights reserved).

  9. Identifying risk factors for PTSD in women seeking medical help after rape.

    Science.gov (United States)

    Tiihonen Möller, Anna; Bäckström, Torbjörn; Söndergaard, Hans Peter; Helström, Lotti

    2014-01-01

    Rape has been found to be the trauma most commonly associated with Posttraumatic Stress Disorder (PTSD) among women. It is therefore important to be able to identify those women at greatest risk of developing PTSD. The aims of the present study were to analyze the PTSD prevalence six months after sexual assaults and identify the major risk factors for developing PTSD. Participants were 317 female victims of rape who sought help at the Emergency Clinic for Raped Women at Stockholm South Hospital, Sweden. Baseline assessments of mental health were carried out and followed up after six months. Thirty-nine percent of the women had developed PTSD at the six month assessment, and 47% suffered from moderate or severe depression. The major risk factors for PTSD were having been sexually assaulted by more than one person, suffering from acute stress disorder (ASD) shortly after the assault, having been exposed to several acts during the assault, having been injured, having co-morbid depression, and having a history of more than two earlier traumas. Further, ASD on its own was found to be a poor predictor of PTSD because of the substantial ceiling effect after sexual assaults. Development of PTSD is common in the aftermath of sexual assaults. Increased risk of developing PTSD is caused by a combination of victim vulnerability and the extent of the dramatic nature of the current assault. By identifying those women at greatest risk of developing PTSD appropriate therapeutic resources can be directed.

  10. The impact of factors influencing destination quality on overall customer satisfaction

    Directory of Open Access Journals (Sweden)

    Ida Vajčnerová

    2013-01-01

    Full Text Available The paper deals with problems concerning tourist destination quality and the impact of individual factors on the overall satisfaction of a customer – a visitor to a destination. The methodology is based on the evaluation of individual factors that were identified on the basis of previously conducted researches and that consider all attributes of a destination which visitors are able to evaluate on the basis of their experience and consequent satisfaction. The used data were obtained by a questionnaire surveys, the evaluation is qualitative, and the ten-degree Likert scale is used. The practical part of the paper will focus on creating a multidimensional regression model for overall customer satisfaction for evaluating individual quality factors. The objective of the paper is to identify factors that influence overall customer satisfaction the most; this will be done on the basis of the statistical importance of individual regression parameters. The analysis will be conducted for the data from the regions of Lednice-Valtice Area and Znojemsko and Podyjí; subsequently these two regions will be compared.

  11. Maternal and Neonatal Birth Factors Affecting the Age of ASD Diagnosis.

    Science.gov (United States)

    Darcy-Mahoney, Ashley; Minter, Bonnie; Higgins, Melinda; Guo, Ying; Zauche, Lauren Head; Hirst, Jessica

    2016-12-01

    Early diagnosis of autism spectrum disorders (ASD) enables early intervention that improves long term functioning of children with ASD but is often delayed until age of school entry. Few studies have identified factors that affect timely diagnosis. This study addressed how maternal education, race, age, marital status as well as neonatal birth factors affect the age at which a child is diagnosed with ASD. This study involved a retrospective analysis of 664 records of children treated at one of the largest autism treatment centers in the United States from March 1, 2009 to December 30, 2010. Logistic regression and Cox proportional hazards regression were used to identify maternal and neonatal factors associated with age of diagnosis. Infant gender, maternal race, marital status, and maternal age were identified as significant factors for predicting the age of ASD diagnosis. In the Cox proportional hazards regression model, only maternal race and marital status were included. Median survival age till diagnosis of children born to married mothers was 53.4 months compared to 57.8 months and 63.7 months of children born to single and divorced or widowed mothers respectively. Median survival age till diagnosis for children of African American mothers was 53.8 months compared to 57.2 months for children of Caucasian mothers. No statistically significant difference of timing of ASD diagnosis was found for children of varying gestational age. Children born to older or married mothers and mothers of minority races were more likely to have an earlier ASD diagnosis. No statistically significant differences in timing of ASD diagnosis were found for children born at varying gestational ages. Identification of these factors has the potential to inform public health outreach aimed at promoting timely ASD diagnosis. This work could enhance clinical practice for timelier diagnoses of ASD by supporting parents and clinicians around the world in identifying risk factors beyond gender

  12. A tandem regression-outlier analysis of a ligand cellular system for key structural modifications around ligand binding.

    Science.gov (United States)

    Lin, Ying-Ting

    2013-04-30

    A tandem technique of hard equipment is often used for the chemical analysis of a single cell to first isolate and then detect the wanted identities. The first part is the separation of wanted chemicals from the bulk of a cell; the second part is the actual detection of the important identities. To identify the key structural modifications around ligand binding, the present study aims to develop a counterpart of tandem technique for cheminformatics. A statistical regression and its outliers act as a computational technique for separation. A PPARγ (peroxisome proliferator-activated receptor gamma) agonist cellular system was subjected to such an investigation. Results show that this tandem regression-outlier analysis, or the prioritization of the context equations tagged with features of the outliers, is an effective regression technique of cheminformatics to detect key structural modifications, as well as their tendency of impact to ligand binding. The key structural modifications around ligand binding are effectively extracted or characterized out of cellular reactions. This is because molecular binding is the paramount factor in such ligand cellular system and key structural modifications around ligand binding are expected to create outliers. Therefore, such outliers can be captured by this tandem regression-outlier analysis.

  13. Spatiotemporal Pattern of PM2.5 Concentrations in Mainland China and Analysis of Its Influencing Factors using Geographically Weighted Regression

    Science.gov (United States)

    Luo, Jieqiong; Du, Peijun; Samat, Alim; Xia, Junshi; Che, Meiqin; Xue, Zhaohui

    2017-01-01

    Based on annual average PM2.5 gridded dataset, this study first analyzed the spatiotemporal pattern of PM2.5 across Mainland China during 1998-2012. Then facilitated with meteorological site data, land cover data, population and Gross Domestic Product (GDP) data, etc., the contributions of latent geographic factors, including socioeconomic factors (e.g., road, agriculture, population, industry) and natural geographical factors (e.g., topography, climate, vegetation) to PM2.5 were explored through Geographically Weighted Regression (GWR) model. The results revealed that PM2.5 concentrations increased while the spatial pattern remained stable, and the proportion of areas with PM2.5 concentrations greater than 35 μg/m3 significantly increased from 23.08% to 29.89%. Moreover, road, agriculture, population and vegetation showed the most significant impacts on PM2.5. Additionally, the Moran’s I for the residuals of GWR was 0.025 (not significant at a 0.01 level), indicating that the GWR model was properly specified. The local coefficient estimates of GDP in some cities were negative, suggesting the existence of the inverted-U shaped Environmental Kuznets Curve (EKC) for PM2.5 in Mainland China. The effects of each latent factor on PM2.5 in various regions were different. Therefore, regional measures and strategies for controlling PM2.5 should be formulated in terms of the local impacts of specific factors.

  14. Time-adaptive quantile regression

    DEFF Research Database (Denmark)

    Møller, Jan Kloppenborg; Nielsen, Henrik Aalborg; Madsen, Henrik

    2008-01-01

    and an updating procedure are combined into a new algorithm for time-adaptive quantile regression, which generates new solutions on the basis of the old solution, leading to savings in computation time. The suggested algorithm is tested against a static quantile regression model on a data set with wind power......An algorithm for time-adaptive quantile regression is presented. The algorithm is based on the simplex algorithm, and the linear optimization formulation of the quantile regression problem is given. The observations have been split to allow a direct use of the simplex algorithm. The simplex method...... production, where the models combine splines and quantile regression. The comparison indicates superior performance for the time-adaptive quantile regression in all the performance parameters considered....

  15. Logistic regression analysis of psychosocial correlates associated with recovery from schizophrenia in a Chinese community.

    Science.gov (United States)

    Tse, Samson; Davidson, Larry; Chung, Ka-Fai; Yu, Chong Ho; Ng, King Lam; Tsoi, Emily

    2015-02-01

    More mental health services are adopting the recovery paradigm. This study adds to prior research by (a) using measures of stages of recovery and elements of recovery that were designed and validated in a non-Western, Chinese culture and (b) testing which demographic factors predict advanced recovery and whether placing importance on certain elements predicts advanced recovery. We examined recovery and factors associated with recovery among 75 Hong Kong adults who were diagnosed with schizophrenia and assessed to be in clinical remission. Data were collected on socio-demographic factors, recovery stages and elements associated with recovery. Logistic regression analysis was used to identify variables that could best predict stages of recovery. Receiver operating characteristic curves were used to detect the classification accuracy of the model (i.e. rates of correct classification of stages of recovery). Logistic regression results indicated that stages of recovery could be distinguished with reasonable accuracy for Stage 3 ('living with disability', classification accuracy = 75.45%) and Stage 4 ('living beyond disability', classification accuracy = 75.50%). However, there was no sufficient information to predict Combined Stages 1 and 2 ('overwhelmed by disability' and 'struggling with disability'). It was found that having a meaningful role and age were the most important differentiators of recovery stage. Preliminary findings suggest that adopting salient life roles personally is important to recovery and that this component should be incorporated into mental health services. © The Author(s) 2014.

  16. Integrative analysis of multiple diverse omics datasets by sparse group multitask regression

    Directory of Open Access Journals (Sweden)

    Dongdong eLin

    2014-10-01

    Full Text Available A variety of high throughput genome-wide assays enable the exploration of genetic risk factors underlying complex traits. Although these studies have remarkable impact on identifying susceptible biomarkers, they suffer from issues such as limited sample size and low reproducibility. Combining individual studies of different genetic levels/platforms has the promise to improve the power and consistency of biomarker identification. In this paper, we propose a novel integrative method, namely sparse group multitask regression, for integrating diverse omics datasets, platforms and populations to identify risk genes/factors of complex diseases. This method combines multitask learning with sparse group regularization, which will: 1 treat the biomarker identification in each single study as a task and then combine them by multitask learning; 2 group variables from all studies for identifying significant genes; 3 enforce sparse constraint on groups of variables to overcome the ‘small sample, but large variables’ problem. We introduce two sparse group penalties: sparse group lasso and sparse group ridge in our multitask model, and provide an effective algorithm for each model. In addition, we propose a significance test for the identification of potential risk genes. Two simulation studies are performed to evaluate the performance of our integrative method by comparing it with conventional meta-analysis method. The results show that our sparse group multitask method outperforms meta-analysis method significantly. In an application to our osteoporosis studies, 7 genes are identified as significant genes by our method and are found to have significant effects in other three independent studies for validation. The most significant gene SOD2 has been identified in our previous osteoporosis study involving the same expression dataset. Several other genes such as TREML2, HTR1E and GLO1 are shown to be novel susceptible genes for osteoporosis, as confirmed

  17. Applying spatial regression to evaluate risk factors for microbiological contamination of urban groundwater sources in Juba, South Sudan

    Science.gov (United States)

    Engström, Emma; Mörtberg, Ulla; Karlström, Anders; Mangold, Mikael

    2017-06-01

    This study developed methodology for statistically assessing groundwater contamination mechanisms. It focused on microbial water pollution in low-income regions. Risk factors for faecal contamination of groundwater-fed drinking-water sources were evaluated in a case study in Juba, South Sudan. The study was based on counts of thermotolerant coliforms in water samples from 129 sources, collected by the humanitarian aid organisation Médecins Sans Frontières in 2010. The factors included hydrogeological settings, land use and socio-economic characteristics. The results showed that the residuals of a conventional probit regression model had a significant positive spatial autocorrelation (Moran's I = 3.05, I-stat = 9.28); therefore, a spatial model was developed that had better goodness-of-fit to the observations. The most significant factor in this model ( p-value 0.005) was the distance from a water source to the nearest Tukul area, an area with informal settlements that lack sanitation services. It is thus recommended that future remediation and monitoring efforts in the city be concentrated in such low-income regions. The spatial model differed from the conventional approach: in contrast with the latter case, lowland topography was not significant at the 5% level, as the p-value was 0.074 in the spatial model and 0.040 in the traditional model. This study showed that statistical risk-factor assessments of groundwater contamination need to consider spatial interactions when the water sources are located close to each other. Future studies might further investigate the cut-off distance that reflects spatial autocorrelation. Particularly, these results advise research on urban groundwater quality.

  18. A multi-factor GIS method to identify optimal geographic locations for electric vehicle (EV) charging stations

    Science.gov (United States)

    Zhang, Yongqin; Iman, Kory

    2018-05-01

    Fuel-based transportation is one of the major contributors to poor air quality in the United States. Electric Vehicle (EV) is potentially the cleanest transportation technology to our environment. This research developed a spatial suitability model to identify optimal geographic locations for installing EV charging stations for travelling public. The model takes into account a variety of positive and negative factors to identify prime locations for installing EV charging stations in Wasatch Front, Utah, where automobile emission causes severe air pollution due to atmospheric inversion condition near the valley floor. A walkable factor grid was created to store index scores from input factor layers to determine prime locations. 27 input factors including land use, demographics, employment centers etc. were analyzed. Each factor layer was analyzed to produce a summary statistic table to determine the site suitability. Potential locations that exhibit high EV charging usage were identified and scored. A hot spot map was created to demonstrate high, moderate, and low suitability areas for installing EV charging stations. A spatially well distributed EV charging system was then developed, aiming to reduce "range anxiety" from traveling public. This spatial methodology addresses the complex problem of locating and establishing a robust EV charging station infrastructure for decision makers to build a clean transportation infrastructure, and eventually improve environment pollution.

  19. Identifying factors associated with the discharge of male State patients from Weskoppies Hospital

    Directory of Open Access Journals (Sweden)

    Riaan G. Prinsloo

    2017-12-01

    Full Text Available Background: Designated psychiatric facilities are responsible for the care, treatment and reintegration of State patients. The necessary long-term care places a considerable strain on health-care resources. Resource use should be optimised while managing the risks that patients pose to themselves and the community. Identifying unique factors associated with earlier discharge may decrease the length of stay. Factors associated with protracted inpatient care without discharge could identify patients who require early and urgent intervention. Aim: We identify socio-economic, demographic, psychiatric and charge-related factors associated with the discharge of male State patients. Methods: We reviewed the files of discharged and admitted forensic State patients at Weskoppies Psychiatric Hospital. Data were captured in an electronic recording sheet. The association between factors and the outcome measure (discharged vs. admitted was determined using chi-squared tests and Fischer’s exact tests. Results: Discharged State patients were associated with being a primary caregiver (p = 0.031 having good insight into illness (p = 0.025 or offence (p = 0.005 and having had multiple successful leaves of absences. A lack of substance abuse during admission (p = 0.027, an absence of a diagnosis of substance use disorder (p = 0.013 and the absence of verbal and physical aggression (p = 0.002 and p = 0.016 were associated with being discharged. Prolonged total length of stay (9–12 years, p = 0.031 and prolonged length of stay in open wards (6–9 years, p = 0.000 were associated with being discharged. A history of previous offences (p = 0.022, a diagnosis of substance use disorder (p = 0.023, recent substance abuse (p = 0.018 and a history of physical aggression since admission (p = 0.017 were associated with continued admission. Conclusion: Discharge of State patients is associated with an absence of substance abuse, lack of aggression

  20. A systematic review and meta-regression analysis of mivacurium for tracheal intubation

    NARCIS (Netherlands)

    Vanlinthout, L.E.H.; Mesfin, S.H.; Hens, N.; Vanacker, B.F.; Robertson, E.N.; Booij, L.H.D.J.

    2014-01-01

    We systematically reviewed factors associated with intubation conditions in randomised controlled trials of mivacurium, using random-effects meta-regression analysis. We included 29 studies of 1050 healthy participants. Four factors explained 72.9% of the variation in the probability of excellent

  1. Socioeconomic Factors Associated with Post-Mastectomy Immediate Reconstruction in a Contemporary Cohort of Breast Cancer Survivors.

    Science.gov (United States)

    Schumacher, Jessica R; Taylor, Lauren J; Tucholka, Jennifer L; Poore, Samuel; Eggen, Amanda; Steiman, Jennifer; Wilke, Lee G; Greenberg, Caprice C; Neuman, Heather B

    2017-10-01

    Post-mastectomy reconstruction is a critical component of high-quality breast cancer care. Prior studies demonstrate socioeconomic disparity in receipt of reconstruction. Our objective was to evaluate trends in receipt of immediate reconstruction and examine socioeconomic factors associated with reconstruction in a contemporary cohort. Using the National Cancer Database, we identified women rates (2004-2013) for the overall cohort and stratified by socioeconomic factors were examined using Join-point regression analysis, and annual percentage change (APC) was calculated. We then restricted our sample to a contemporary cohort (2010-2013, n = 145,577). Multivariable logistic regression identified socioeconomic factors associated with immediate reconstruction. Average adjusted predicted probabilities of receiving reconstruction were calculated. Immediate reconstruction rates increased from 27 to 48%. Although absolute rates of reconstruction for each stratification group increased, similar APCs across strata led to persistent gaps in receipt of reconstruction. On multivariable logistic regression using our contemporary cohort, race, income, education, and insurance type were all strongly associated with immediate reconstruction. Patients with the lowest predicted probability of receiving reconstruction were patients with Medicaid who lived in areas with the lowest rates of high-school graduation (Black 42.4% [95% CI 40.5-44.3], White 45.7% [95% CI 43.9-47.4]). Although reconstruction rates have increased dramatically over the past decade, lower rates persist for disadvantaged patients. Understanding how socioeconomic factors influence receipt of reconstruction, and identifying modifiable factors, are critical next steps towards identifying interventions to reduce disparities in breast cancer surgical care.

  2. Finding the Perfect Match: Factors That Influence Family Medicine Residency Selection.

    Science.gov (United States)

    Wright, Katherine M; Ryan, Elizabeth R; Gatta, John L; Anderson, Lauren; Clements, Deborah S

    2016-04-01

    Residency program selection is a significant experience for emerging physicians, yet there is limited information about how applicants narrow their list of potential programs. This study examines factors that influence residency program selection among medical students interested in family medicine at the time of application. Medical students with an expressed interest in family medicine were invited to participate in a 37-item, online survey. Students were asked to rate factors that may impact residency selection on a 6-point Likert scale in addition to three open-ended qualitative questions. Mean values were calculated for each survey item and were used to determine a rank order for selection criteria. Logistic regression analysis was performed to identify factors that predict a strong interest in urban, suburban, and rural residency programs. Logistic regression was also used to identify factors that predict a strong interest in academic health center-based residencies, community-based residencies, and community-based residencies with an academic affiliation. A total of 705 medical students from 32 states across the country completed the survey. Location, work/life balance, and program structure (curriculum, schedule) were rated the most important factors for residency selection. Logistic regression analysis was used to refine our understanding of how each factor relates to specific types of residencies. These findings have implications for how to best advise students in selecting a residency, as well as marketing residencies to the right candidates. Refining the recruitment process will ensure a better fit between applicants and potential programs. Limited recruitment resources may be better utilized by focusing on targeted dissemination strategies.

  3. THE PROGNOSIS OF RUSSIAN DEFENSE INDUSTRY DEVELOPMENT IMPLEMENTED THROUGH REGRESSION ANALYSIS

    Directory of Open Access Journals (Sweden)

    L.M. Kapustina

    2007-03-01

    Full Text Available The article illustrates the results of investigation the major internal and external factors which influence the development of the defense industry, as well as the results of regression analysis which quantitatively displays the factorial contribution in the growth rate of Russian defense industry. On the basis of calculated regression dependences the authors fulfilled the medium-term prognosis of defense industry. Optimistic and inertial versions of defense product growth rate for the period up to 2009 are based on scenario conditions in Russian economy worked out by the Ministry of economy and development. In conclusion authors point out which factors and conditions have the largest impact on successful and stable operation of Russian defense industry.

  4. Evaluation of Logistic Regression and Multivariate Adaptive Regression Spline Models for Groundwater Potential Mapping Using R and GIS

    Directory of Open Access Journals (Sweden)

    Soyoung Park

    2017-07-01

    Full Text Available This study mapped and analyzed groundwater potential using two different models, logistic regression (LR and multivariate adaptive regression splines (MARS, and compared the results. A spatial database was constructed for groundwater well data and groundwater influence factors. Groundwater well data with a high potential yield of ≥70 m3/d were extracted, and 859 locations (70% were used for model training, whereas the other 365 locations (30% were used for model validation. We analyzed 16 groundwater influence factors including altitude, slope degree, slope aspect, plan curvature, profile curvature, topographic wetness index, stream power index, sediment transport index, distance from drainage, drainage density, lithology, distance from fault, fault density, distance from lineament, lineament density, and land cover. Groundwater potential maps (GPMs were constructed using LR and MARS models and tested using a receiver operating characteristics curve. Based on this analysis, the area under the curve (AUC for the success rate curve of GPMs created using the MARS and LR models was 0.867 and 0.838, and the AUC for the prediction rate curve was 0.836 and 0.801, respectively. This implies that the MARS model is useful and effective for groundwater potential analysis in the study area.

  5. A comparative study to identify factors of caregiver burden between baby boomers and post baby boomers: a secondary analysis of a US online caregiver survey.

    Science.gov (United States)

    Kim, Heejung; Lee, Sangeun; Cheon, Jooyoung; Hong, Soyun; Chang, Mido

    2018-05-02

    Baby boomers' position in the caregiving context is shifting from caregiver to care recipient as the population ages. While the unique characteristics of baby boomer caregivers are well established in caregiving literature, there is limited information about the next caregiving group after the baby boomers. In this study, the sociodemographic and caregiving-related characteristics of the two generations are compared and specific factors contributing to caregiver burden between baby boomer and post baby boomer caregivers are identified. This cross-sectional and correlational study used secondary analysis of data from the National Alliance for Caregiving and the American Association of Retired Persons. A structured online survey was conducted in 2014 with randomly selected samples (n = 1069) in the United States focusing on sociodemographics, caregiving-related characteristics, and burden of care. Descriptive statistics, multivariate linear regression analyses, and Steiger's Z-test were used to identify group differences in multivariate factors related to caregiver burden in two generational groups. Baby boomers and post baby boomers experienced caregiver burden to a similar degree. Caregiving-related factors are more likely to increase burden of care than sociodemographics in both groups. Caregiving without choice and spending longer hours on caregiving tasks were common factors that increased the burden in both generational groups (all p values baby boomer caregivers reported additional challenges, such as unemployment during caregiving, the dual responsibility of both adult and child care, and a family relationship with the care recipient. Due to the aging population of baby boomers, post baby boomers encounter different challenges related to caregiving burden, which is often considered an additional workload in their life course. Current policy and program tailored to baby boomers should be re-designed to meet the different needs of emerging caregivers

  6. Controlling attribute effect in linear regression

    KAUST Repository

    Calders, Toon; Karim, Asim A.; Kamiran, Faisal; Ali, Wasif Mohammad; Zhang, Xiangliang

    2013-01-01

    In data mining we often have to learn from biased data, because, for instance, data comes from different batches or there was a gender or racial bias in the collection of social data. In some applications it may be necessary to explicitly control this bias in the models we learn from the data. This paper is the first to study learning linear regression models under constraints that control the biasing effect of a given attribute such as gender or batch number. We show how propensity modeling can be used for factoring out the part of the bias that can be justified by externally provided explanatory attributes. Then we analytically derive linear models that minimize squared error while controlling the bias by imposing constraints on the mean outcome or residuals of the models. Experiments with discrimination-aware crime prediction and batch effect normalization tasks show that the proposed techniques are successful in controlling attribute effects in linear regression models. © 2013 IEEE.

  7. Controlling attribute effect in linear regression

    KAUST Repository

    Calders, Toon

    2013-12-01

    In data mining we often have to learn from biased data, because, for instance, data comes from different batches or there was a gender or racial bias in the collection of social data. In some applications it may be necessary to explicitly control this bias in the models we learn from the data. This paper is the first to study learning linear regression models under constraints that control the biasing effect of a given attribute such as gender or batch number. We show how propensity modeling can be used for factoring out the part of the bias that can be justified by externally provided explanatory attributes. Then we analytically derive linear models that minimize squared error while controlling the bias by imposing constraints on the mean outcome or residuals of the models. Experiments with discrimination-aware crime prediction and batch effect normalization tasks show that the proposed techniques are successful in controlling attribute effects in linear regression models. © 2013 IEEE.

  8. Job satisfaction of nurses and identifying factors of job satisfaction in Slovenian Hospitals.

    Science.gov (United States)

    Lorber, Mateja; Skela Savič, Brigita

    2012-06-01

    To determine the level of job satisfaction of nursing professionals in Slovenian hospitals and factors influencing job satisfaction in nursing. The study included 4 hospitals selected from the hospital list comprising 26 hospitals in Slovenia. The employees of these hospitals represent 29.8% and 509 employees included in the study represent 6% of all employees in nursing in Slovenian hospitals. One structured survey questionnaire was administered to the leaders and the other to employees, both consisting 154 items evaluated on a 5 point Likert-type scale. We examined the correlation between independent variables (age, number of years of employment, behavior of leaders, personal characteristics of leaders, and managerial competencies of leaders) and the dependent variable (job satisfaction - satisfaction with the work, coworkers, management, pay, etc) by applying correlation analysis and multivariate regression analysis. In addition, factor analysis was used to establish characteristic components of the variables measured. We found a medium level of job satisfaction in both leaders (3.49±0.5) and employees (3.19±0.6), however, there was a significant difference between their estimates (t=3.237; P=lt;0.001). Job satisfaction was explained by age (Plt;0.05; β=0.091), years of employment (Plt;0.05; β=0.193), personal characteristics of leaders (Plt;0.001; β=0.158), and managerial competencies of leaders (Plt;0.000; β=0.634) in 46% of cases. The factor analysis yielded four factors explaining 64% of the total job satisfaction variance. Satisfied employees play a crucial role in an organization's success, so health care organizations must be aware of the importance of employees' job satisfaction. It is recommended to monitor employees' job satisfaction levels on an annual basis.

  9. Simulation Experiments in Practice : Statistical Design and Regression Analysis

    NARCIS (Netherlands)

    Kleijnen, J.P.C.

    2007-01-01

    In practice, simulation analysts often change only one factor at a time, and use graphical analysis of the resulting Input/Output (I/O) data. Statistical theory proves that more information is obtained when applying Design Of Experiments (DOE) and linear regression analysis. Unfortunately, classic

  10. Regression analysis by example

    CERN Document Server

    Chatterjee, Samprit

    2012-01-01

    Praise for the Fourth Edition: ""This book is . . . an excellent source of examples for regression analysis. It has been and still is readily readable and understandable."" -Journal of the American Statistical Association Regression analysis is a conceptually simple method for investigating relationships among variables. Carrying out a successful application of regression analysis, however, requires a balance of theoretical results, empirical rules, and subjective judgment. Regression Analysis by Example, Fifth Edition has been expanded

  11. A retrospective chart review to identify perinatal factors associated with food allergies.

    Science.gov (United States)

    Dowhower Karpa, Kelly; Paul, Ian M; Leckie, J Alexander; Shung, Sharon; Carkaci-Salli, Nurgul; Vrana, Kent E; Mauger, David; Fausnight, Tracy; Poger, Jennifer

    2012-10-19

    Gut flora are important immunomodulators that may be disrupted in individuals with atopic conditions. Probiotic bacteria have been suggested as therapeutic modalities to mitigate or prevent food allergic manifestations. We wished to investigate whether perinatal factors known to disrupt gut flora increase the risk of IgE-mediated food allergies. Birth records obtained from 192 healthy children and 99 children diagnosed with food allergies were reviewed retrospectively. Data pertaining to delivery method, perinatal antibiotic exposure, neonatal nursery environment, and maternal variables were recorded. Logistic regression analysis was used to assess the association between variables of interest and subsequent food allergy diagnosis. Retrospective investigation did not find perinatal antibiotics, NICU admission, or cesarean section to be associated with increased risk of food allergy diagnosis. However, associations between food allergy diagnosis and male gender (66 vs. 33; p=0.02) were apparent in this cohort. Additionally, increasing maternal age at delivery was significantly associated with food allergy diagnosis during childhood (OR, 1.05; 95% CI, 1.017 to 1.105; p=0.005). Gut flora are potent immunomodulators, but their overall contribution to immune maturation remains to be elucidated. Additional understanding of the interplay between immunologic, genetic, and environmental factors underlying food allergy development need to be clarified before probiotic therapeutic interventions can routinely be recommended for prevention or mitigation of food allergies. Such interventions may be well-suited in male infants and in infants born to older mothers.

  12. Landslide susceptibility mapping on a global scale using the method of logistic regression

    Directory of Open Access Journals (Sweden)

    L. Lin

    2017-08-01

    Full Text Available This paper proposes a statistical model for mapping global landslide susceptibility based on logistic regression. After investigating explanatory factors for landslides in the existing literature, five factors were selected for model landslide susceptibility: relative relief, extreme precipitation, lithology, ground motion and soil moisture. When building the model, 70 % of landslide and nonlandslide points were randomly selected for logistic regression, and the others were used for model validation. To evaluate the accuracy of predictive models, this paper adopts several criteria including a receiver operating characteristic (ROC curve method. Logistic regression experiments found all five factors to be significant in explaining landslide occurrence on a global scale. During the modeling process, percentage correct in confusion matrix of landslide classification was approximately 80 % and the area under the curve (AUC was nearly 0.87. During the validation process, the above statistics were about 81 % and 0.88, respectively. Such a result indicates that the model has strong robustness and stable performance. This model found that at a global scale, soil moisture can be dominant in the occurrence of landslides and topographic factor may be secondary.

  13. Applying psychological theories to evidence-based clinical practice: identifying factors predictive of placing preventive fissure sealants

    Directory of Open Access Journals (Sweden)

    Maclennan Graeme

    2010-04-01

    identifying factors that may predict clinical behaviour and so provide possible targets for knowledge translation interventions. Results suggest that more evidence-based behaviour may be achieved by influencing beliefs about the positive outcomes of placing fissure sealants and building a habit of placing them as part of patient management. However a number of conceptual and methodological challenges remain.

  14. Applied logistic regression

    CERN Document Server

    Hosmer, David W; Sturdivant, Rodney X

    2013-01-01

     A new edition of the definitive guide to logistic regression modeling for health science and other applications This thoroughly expanded Third Edition provides an easily accessible introduction to the logistic regression (LR) model and highlights the power of this model by examining the relationship between a dichotomous outcome and a set of covariables. Applied Logistic Regression, Third Edition emphasizes applications in the health sciences and handpicks topics that best suit the use of modern statistical software. The book provides readers with state-of-

  15. Identifying risk factors for PTSD in women seeking medical help after rape.

    Directory of Open Access Journals (Sweden)

    Anna Tiihonen Möller

    Full Text Available Rape has been found to be the trauma most commonly associated with Posttraumatic Stress Disorder (PTSD among women. It is therefore important to be able to identify those women at greatest risk of developing PTSD. The aims of the present study were to analyze the PTSD prevalence six months after sexual assaults and identify the major risk factors for developing PTSD.Participants were 317 female victims of rape who sought help at the Emergency Clinic for Raped Women at Stockholm South Hospital, Sweden. Baseline assessments of mental health were carried out and followed up after six months.Thirty-nine percent of the women had developed PTSD at the six month assessment, and 47% suffered from moderate or severe depression. The major risk factors for PTSD were having been sexually assaulted by more than one person, suffering from acute stress disorder (ASD shortly after the assault, having been exposed to several acts during the assault, having been injured, having co-morbid depression, and having a history of more than two earlier traumas. Further, ASD on its own was found to be a poor predictor of PTSD because of the substantial ceiling effect after sexual assaults.Development of PTSD is common in the aftermath of sexual assaults. Increased risk of developing PTSD is caused by a combination of victim vulnerability and the extent of the dramatic nature of the current assault. By identifying those women at greatest risk of developing PTSD appropriate therapeutic resources can be directed.

  16. Normalization Ridge Regression in Practice I: Comparisons Between Ordinary Least Squares, Ridge Regression and Normalization Ridge Regression.

    Science.gov (United States)

    Bulcock, J. W.

    The problem of model estimation when the data are collinear was examined. Though the ridge regression (RR) outperforms ordinary least squares (OLS) regression in the presence of acute multicollinearity, it is not a problem free technique for reducing the variance of the estimates. It is a stochastic procedure when it should be nonstochastic and it…

  17. Factors predictive of sustained virological response following 72 weeks of combination therapy for genotype 1b hepatitis C.

    Science.gov (United States)

    Chayama, Kazuaki; Hayes, C Nelson; Yoshioka, Kentaro; Moriwaki, Hisataka; Okanoue, Takashi; Sakisaka, Shotaro; Takehara, Tetsuo; Oketani, Makoto; Toyota, Joji; Izumi, Namiki; Hiasa, Yoichi; Matsumoto, Akihiro; Nomura, Hideyuki; Seike, Masataka; Ueno, Yoshiyuki; Yotsuyanagi, Hiroshi; Kumada, Hiromitsu

    2011-04-01

    Treatment of genotype 1b chronic hepatitis C virus (HCV) infection has been improved by extending peg-interferon plus ribavirin combination therapy to 72 weeks, but predictive factors are needed to identify those patients who are likely to respond to long-term therapy. We analyzed amino acid (aa) substitutions in the core protein and the interferon sensitivity determining region (ISDR) of nonstructural protein (NS) 5A in 840 genotype 1b chronic hepatitis C patients with high viral load. We used logistic regression and classification and regression tree (CART) analysis to identify predictive factors for sustained virological response (SVR) for patients undergoing 72 weeks of treatment. When patients were separately analyzed by treatment duration using multivariate logistic regression, several factors, including sex, age, viral load, and core aa70 and ISDR substitutions (P = 0.0003, P = 0.02, P = 0.01, P = 0.0001, and P = 0.0004, respectively) were significant predictive factors for SVR with 48 weeks of treatment, whereas age, previous interferon treatment history, and ISDR substitutions (P = 0.03, P = 0.01, and P = 0.02, respectively) were the only significant predictive factors with 72 weeks of treatment. Using CART analysis, a decision tree was generated that identified age, cholesterol, sex, treatment length, and aa70 and ISDR substitutions as the most important predictive factors. The CART model had a sensitivity of 69.2% and specificity of 60%, with a positive predictive value of 68.4%. Complementary statistical and data mining approaches were used to identify a subgroup of patients likely to benefit from 72 weeks of therapy.

  18. Impact of multicollinearity on small sample hydrologic regression models

    Science.gov (United States)

    Kroll, Charles N.; Song, Peter

    2013-06-01

    Often hydrologic regression models are developed with ordinary least squares (OLS) procedures. The use of OLS with highly correlated explanatory variables produces multicollinearity, which creates highly sensitive parameter estimators with inflated variances and improper model selection. It is not clear how to best address multicollinearity in hydrologic regression models. Here a Monte Carlo simulation is developed to compare four techniques to address multicollinearity: OLS, OLS with variance inflation factor screening (VIF), principal component regression (PCR), and partial least squares regression (PLS). The performance of these four techniques was observed for varying sample sizes, correlation coefficients between the explanatory variables, and model error variances consistent with hydrologic regional regression models. The negative effects of multicollinearity are magnified at smaller sample sizes, higher correlations between the variables, and larger model error variances (smaller R2). The Monte Carlo simulation indicates that if the true model is known, multicollinearity is present, and the estimation and statistical testing of regression parameters are of interest, then PCR or PLS should be employed. If the model is unknown, or if the interest is solely on model predictions, is it recommended that OLS be employed since using more complicated techniques did not produce any improvement in model performance. A leave-one-out cross-validation case study was also performed using low-streamflow data sets from the eastern United States. Results indicate that OLS with stepwise selection generally produces models across study regions with varying levels of multicollinearity that are as good as biased regression techniques such as PCR and PLS.

  19. Assessing the Credit Risk of Corporate Bonds Based on Factor Analysis and Logistic Regress Analysis Techniques: Evidence from New Energy Enterprises in China

    Directory of Open Access Journals (Sweden)

    Yuanxin Liu

    2018-05-01

    Full Text Available In recent years, new energy sources have ushered in tremendous opportunities for development. The difficulties to finance new energy enterprises (NEEs can be estimated through issuing corporate bonds. However, there are few scientific and reasonable methods to assess the credit risk of NEE bonds, which is not conducive to the healthy development of NEEs. Based on this, this paper analyzes the advantages and risks of NEEs issuing bonds and the main factors affecting the credit risk of NEE bonds, constructs a hybrid model for assessing the credit risk of NEE bonds based on factor analysis and logistic regress analysis techniques, and verifies the applicability and effectiveness of the model employing relevant data from 46 Chinese NEEs. The results show that the main factors affecting the credit risk of NEE bonds are internal factors involving the company’s profitability, solvency, operational ability, growth potential, asset structure and viability, and external factors including macroeconomic environment and energy policy support. Based on the empirical results and the exact situation of China’s NEE bonds, this article finally puts forward several targeted recommendations.

  20. Vector regression introduced

    Directory of Open Access Journals (Sweden)

    Mok Tik

    2014-06-01

    Full Text Available This study formulates regression of vector data that will enable statistical analysis of various geodetic phenomena such as, polar motion, ocean currents, typhoon/hurricane tracking, crustal deformations, and precursory earthquake signals. The observed vector variable of an event (dependent vector variable is expressed as a function of a number of hypothesized phenomena realized also as vector variables (independent vector variables and/or scalar variables that are likely to impact the dependent vector variable. The proposed representation has the unique property of solving the coefficients of independent vector variables (explanatory variables also as vectors, hence it supersedes multivariate multiple regression models, in which the unknown coefficients are scalar quantities. For the solution, complex numbers are used to rep- resent vector information, and the method of least squares is deployed to estimate the vector model parameters after transforming the complex vector regression model into a real vector regression model through isomorphism. Various operational statistics for testing the predictive significance of the estimated vector parameter coefficients are also derived. A simple numerical example demonstrates the use of the proposed vector regression analysis in modeling typhoon paths.

  1. Organisational Issues for E-Learning: Critical Success Factors as Identified by HE Practitioners

    Science.gov (United States)

    McPherson, Maggie; Nunes, Miguel Baptista

    2006-01-01

    Purpose: The purpose of this paper is to report on a research project that identified organisational critical success factors (CSFs) for e-learning implementation in higher education (HE). These CSFs can be used as a theoretical foundation upon which to base decision-making and strategic thinking about e-learning. Design/methodology/approach: The…

  2. Applied linear regression

    CERN Document Server

    Weisberg, Sanford

    2013-01-01

    Praise for the Third Edition ""...this is an excellent book which could easily be used as a course text...""-International Statistical Institute The Fourth Edition of Applied Linear Regression provides a thorough update of the basic theory and methodology of linear regression modeling. Demonstrating the practical applications of linear regression analysis techniques, the Fourth Edition uses interesting, real-world exercises and examples. Stressing central concepts such as model building, understanding parameters, assessing fit and reliability, and drawing conclusions, the new edition illus

  3. Logistic regression against a divergent Bayesian network

    Directory of Open Access Journals (Sweden)

    Noel Antonio Sánchez Trujillo

    2015-01-01

    Full Text Available This article is a discussion about two statistical tools used for prediction and causality assessment: logistic regression and Bayesian networks. Using data of a simulated example from a study assessing factors that might predict pulmonary emphysema (where fingertip pigmentation and smoking are considered; we posed the following questions. Is pigmentation a confounding, causal or predictive factor? Is there perhaps another factor, like smoking, that confounds? Is there a synergy between pigmentation and smoking? The results, in terms of prediction, are similar with the two techniques; regarding causation, differences arise. We conclude that, in decision-making, the sum of both: a statistical tool, used with common sense, and previous evidence, taking years or even centuries to develop; is better than the automatic and exclusive use of statistical resources.

  4. Prognostic factors in Acanthamoeba keratitis.

    Science.gov (United States)

    Kaiserman, Igor; Bahar, Irit; McAllum, Penny; Srinivasan, Sathish; Elbaz, Uri; Slomovic, Allan R; Rootman, David S

    2012-06-01

    To assess the prognostic factors influencing visual prognosis and length of treatment after acanthamoeba keratitis (AK). Forty-two AK eyes of 41 patients treated between 1999 and 2006 were included. A diagnosis of AK was made on the basis of culture results with a corresponding clinical presentation. We calculated the prognostic effect of the various factors on final visual acuity and the length of treatment. Multivariate regression analysis was used to adjust for the simultaneous effects of the various prognostic factors. Mean follow-up was 19.7 ± 21.0 months. Sixty-four percent of cases had > 1 identified risk factor for AK, the most common risk factor being contact lens wear (92.9% of eyes). At presentation, median best spectacle corrected visual acuity (BCVA) was 20/200 (20/30 to Hand Motion [HM]) that improved after treatment to 20/50 (20/20 to Counting Fingers [CF]). Infection acquired by swimming or related to contact lenses had significantly better final BCVA (p = 0.03 and p = 0.007, respectively). Neuritis and pseudodendrites were also associated with better final BCVA (p = 0.04 and p = 0.05, respectively). Having had an epithelial defect on presentation and having been treated with topical steroid were associated with worse final best spectacle corrected visual acuity (BSCVA) (p = 0.0006 and p = 0.04). Multivariate regression analysis found a good initial visual acuity (p = 0.002), infections related to swimming (p = 0.01), the absence of an epithelial defect (p = 0.03), having been treated with chlorhexidine (p = 0.05), and not having receive steroids (p = 0.003) to significantly forecast a good final BCVA. We identified several prognostic factors that can help clinicians evaluate the expected visual damage of the AK infection and thus tailor treatment accordingly. Copyright © 2012 Canadian Ophthalmological Society. All rights reserved.

  5. Risk factors for miscarriage from a prevention perspective

    DEFF Research Database (Denmark)

    Nilsson, Sandra Feodor; Andersen, Per Kragh; Strandberg-Larsen, K

    2014-01-01

    OBJECTIVE: To identify modifiable risk factors for miscarriage and to estimate the preventable proportion of miscarriages that could be attributed to these. DESIGN: Nationwide observational follow-up study. SETTING: Denmark. POPULATION: Ninety-one thousand four hundred and twenty seven pregnancies...... included in the Danish National Birth Cohort between 1996 and 2002. METHODS: Information on potentially modifiable risk factors before and during pregnancy was collected by means of computer-assisted telephone interviews and linkage with Danish registers, ensuring almost complete follow-up of pregnancy...... outcome. Modifiable risk factors for miscarriage were identified by multiple Cox regression analysis, which provided the background for our estimations of population attributable fractions. In all, 88 373 pregnancies had full information on all covariates and were included in this analysis. MAIN OUTCOME...

  6. Understanding poisson regression.

    Science.gov (United States)

    Hayat, Matthew J; Higgins, Melinda

    2014-04-01

    Nurse investigators often collect study data in the form of counts. Traditional methods of data analysis have historically approached analysis of count data either as if the count data were continuous and normally distributed or with dichotomization of the counts into the categories of occurred or did not occur. These outdated methods for analyzing count data have been replaced with more appropriate statistical methods that make use of the Poisson probability distribution, which is useful for analyzing count data. The purpose of this article is to provide an overview of the Poisson distribution and its use in Poisson regression. Assumption violations for the standard Poisson regression model are addressed with alternative approaches, including addition of an overdispersion parameter or negative binomial regression. An illustrative example is presented with an application from the ENSPIRE study, and regression modeling of comorbidity data is included for illustrative purposes. Copyright 2014, SLACK Incorporated.

  7. Variable selection and model choice in geoadditive regression models.

    Science.gov (United States)

    Kneib, Thomas; Hothorn, Torsten; Tutz, Gerhard

    2009-06-01

    Model choice and variable selection are issues of major concern in practical regression analyses, arising in many biometric applications such as habitat suitability analyses, where the aim is to identify the influence of potentially many environmental conditions on certain species. We describe regression models for breeding bird communities that facilitate both model choice and variable selection, by a boosting algorithm that works within a class of geoadditive regression models comprising spatial effects, nonparametric effects of continuous covariates, interaction surfaces, and varying coefficients. The major modeling components are penalized splines and their bivariate tensor product extensions. All smooth model terms are represented as the sum of a parametric component and a smooth component with one degree of freedom to obtain a fair comparison between the model terms. A generic representation of the geoadditive model allows us to devise a general boosting algorithm that automatically performs model choice and variable selection.

  8. Characteristics of effective collaborative care for treatment of depression: a systematic review and meta-regression of 74 randomised controlled trials.

    Directory of Open Access Journals (Sweden)

    Peter A Coventry

    Full Text Available Collaborative care is a complex intervention based on chronic disease management models and is effective in the management of depression. However, there is still uncertainty about which components of collaborative care are effective. We used meta-regression to identify factors in collaborative care associated with improvement in patient outcomes (depressive symptoms and the process of care (use of anti-depressant medication.Systematic review with meta-regression. The Cochrane Collaboration Depression, Anxiety and Neurosis Group trials registers were searched from inception to 9th February 2012. An update was run in the CENTRAL trials database on 29th December 2013. Inclusion criteria were: randomised controlled trials of collaborative care for adults ≥18 years with a primary diagnosis of depression or mixed anxiety and depressive disorder. Random effects meta-regression was used to estimate regression coefficients with 95% confidence intervals (CIs between study level covariates and depressive symptoms and relative risk (95% CI and anti-depressant use. The association between anti-depressant use and improvement in depression was also explored. Seventy four trials were identified (85 comparisons, across 21,345 participants. Collaborative care that included psychological interventions predicted improvement in depression (β coefficient -0.11, 95% CI -0.20 to -0.01, p = 0.03. Systematic identification of patients (relative risk 1.43, 95% CI 1.12 to 1.81, p = 0.004 and the presence of a chronic physical condition (relative risk 1.32, 95% CI 1.05 to 1.65, p = 0.02 predicted use of anti-depressant medication.Trials of collaborative care that included psychological treatment, with or without anti-depressant medication, appeared to improve depression more than those without psychological treatment. Trials that used systematic methods to identify patients with depression and also trials that included patients with a chronic physical

  9. Identifying and Ranking the Effective Factors on Successful Implementation of Social Commerce in Iran, Using AHP Fuzzy

    Directory of Open Access Journals (Sweden)

    Zahra Rahimi

    2016-07-01

    Full Text Available Social commerce has been introduced as a new approach to increase sales, number of customers and reduce marketing expenditures. This approach is a combination of business, communication between people, as well as communicative and informative technologies based on web 2.0 Its achievement originated from different factors relied on business, individuals, culture, and technology. These factors have been primarily identified on the basis of library researches and classified into six infrastructural groups including:  technical, economical and human resources, cultural, rules governing the countries, style of management, and business. Then, it identified priority of the factors by using the fuzzy analytic hierarchy process (AHP. Innovation of this research was to extract a comprehensive list of factors and to prioritize them based on specific conditions in Iran.

  10. The Effect of Extrinsic Motivational Factors Towards Iba Student Achievement

    OpenAIRE

    Pangemanan, Sifrid S.; Saerang, David Paul Elia; Rondonuwu, Mariska

    2014-01-01

    The reason students can facing the world of competition because they have a motivation. A thing that help students to get their motivation when they are not get a motivation by themself is through extrinsic motivational factors. There are two objectives of this research are to analyze the effect of extrinsic motivational factors towards student achievement and to identify the most influental factors on student achievement. The method is multiple linear regression analysis to examine the effec...

  11. Alternative Methods of Regression

    CERN Document Server

    Birkes, David

    2011-01-01

    Of related interest. Nonlinear Regression Analysis and its Applications Douglas M. Bates and Donald G. Watts ".an extraordinary presentation of concepts and methods concerning the use and analysis of nonlinear regression models.highly recommend[ed].for anyone needing to use and/or understand issues concerning the analysis of nonlinear regression models." --Technometrics This book provides a balance between theory and practice supported by extensive displays of instructive geometrical constructs. Numerous in-depth case studies illustrate the use of nonlinear regression analysis--with all data s

  12. Parental Vaccine Acceptance: A Logistic Regression Model Using Previsit Decisions.

    Science.gov (United States)

    Lee, Sara; Riley-Behringer, Maureen; Rose, Jeanmarie C; Meropol, Sharon B; Lazebnik, Rina

    2017-07-01

    This study explores how parents' intentions regarding vaccination prior to their children's visit were associated with actual vaccine acceptance. A convenience sample of parents accompanying 6-week-old to 17-year-old children completed a written survey at 2 pediatric practices. Using hierarchical logistic regression, for hospital-based participants (n = 216), vaccine refusal history ( P < .01) and vaccine decision made before the visit ( P < .05) explained 87% of vaccine refusals. In community-based participants (n = 100), vaccine refusal history ( P < .01) explained 81% of refusals. Over 1 in 5 parents changed their minds about vaccination during the visit. Thirty parents who were previous vaccine refusers accepted current vaccines, and 37 who had intended not to vaccinate choose vaccination. Twenty-nine parents without a refusal history declined vaccines, and 32 who did not intend to refuse before the visit declined vaccination. Future research should identify key factors to nudge parent decision making in favor of vaccination.

  13. Modified Regression Rate Formula of PMMA Combustion by a Single Plane Impinging Jet

    Directory of Open Access Journals (Sweden)

    Tsuneyoshi Matsuoka

    2017-01-01

    Full Text Available A modified regression rate formula for the uppermost stage of CAMUI-type hybrid rocket motor is proposed in this study. Assuming a quasi-steady, one-dimensional, an energy balance against a control volume near the fuel surface is considered. Accordingly, the regression rate formula which can calculate the local regression rate by the quenching distance between the flame and the regression surface is derived. An experimental setup which simulates the combustion phenomenon involved in the uppermost stage of a CAMUI-type hybrid rocket motor was constructed and the burning tests with various flow velocities and impinging distances were performed. A PMMA slab of 20 mm height, 60 mm width, and 20 mm thickness was chosen as a sample specimen and pure oxygen and O2/N2 mixture (50/50 vol.% were employed as the oxidizers. The time-averaged regression rate along the fuel surface was measured by a laser displacement sensor. The quenching distance during the combustion event was also identified from the observation. The comparison between the purely experimental and calculated values showed good agreement, although a large systematic error was expected due to the difficulty in accurately identifying the quenching distance.

  14. Introduction to regression graphics

    CERN Document Server

    Cook, R Dennis

    2009-01-01

    Covers the use of dynamic and interactive computer graphics in linear regression analysis, focusing on analytical graphics. Features new techniques like plot rotation. The authors have composed their own regression code, using Xlisp-Stat language called R-code, which is a nearly complete system for linear regression analysis and can be utilized as the main computer program in a linear regression course. The accompanying disks, for both Macintosh and Windows computers, contain the R-code and Xlisp-Stat. An Instructor's Manual presenting detailed solutions to all the problems in the book is ava

  15. Factors affecting on customers’ satisfaction an empirical investigation of ATM service

    OpenAIRE

    Kumbhar, Vijay

    2011-01-01

    The present empirical study focuses on identifying key factors that have influences customers satisfaction in ATM service provided by public and private sector banks. For the purpose of the study primary data were collected using schedule and collected data from March to November 2010. Results of factor analysis, correlation and regression analysis show that a cost effectiveness, easy to use and security and responsiveness in ATM service were most important factors in customer satisfaction.

  16. Personality, Driving Behavior and Mental Disorders Factors as Predictors of Road Traffic Accidents Based on Logistic Regression.

    Science.gov (United States)

    Alavi, Seyyed Salman; Mohammadi, Mohammad Reza; Souri, Hamid; Mohammadi Kalhori, Soroush; Jannatifard, Fereshteh; Sepahbodi, Ghazal

    2017-01-01

    The aim of this study was to evaluate the effect of variables such as personality traits, driving behavior and mental illness on road traffic accidents among the drivers with accidents and those without road crash. In this cohort study, 800 bus and truck drivers were recruited. Participants were selected among drivers who referred to Imam Sajjad Hospital (Tehran, Iran) during 2013-2015. The Manchester driving behavior questionnaire (MDBQ), big five personality test (NEO personality inventory) and semi-structured interview (schizophrenia and affective disorders scale) were used. After two years, we surveyed all accidents due to human factors that involved the recruited drivers. The data were analyzed using the SPSS software by performing the descriptive statistics, t-test, and multiple logistic regression analysis methods. P values less than 0.05 were considered statistically significant. In terms of controlling the effective and demographic variables, the findings revealed significant differences between the two groups of drivers that were and were not involved in road accidents. In addition, it was found that depression and anxiety could increase the odds ratio (OR) of road accidents by 2.4- and 2.7-folds, respectively (P=0.04, P=0.004). It is noteworthy to mention that neuroticism alone can increase the odds of road accidents by 1.1-fold (P=0.009), but other personality factors did not have a significant effect on the equation. The results revealed that some mental disorders affect the incidence of road collisions. Considering the importance and sensitivity of driving behavior, it is necessary to evaluate multiple psychological factors influencing drivers before and after receiving or renewing their driver's license.

  17. Personality, Driving Behavior and Mental Disorders Factors as Predictors of Road Traffic Accidents Based on Logistic Regression

    Science.gov (United States)

    Alavi, Seyyed Salman; Mohammadi, Mohammad Reza; Souri, Hamid; Mohammadi Kalhori, Soroush; Jannatifard, Fereshteh; Sepahbodi, Ghazal

    2017-01-01

    Background: The aim of this study was to evaluate the effect of variables such as personality traits, driving behavior and mental illness on road traffic accidents among the drivers with accidents and those without road crash. Methods: In this cohort study, 800 bus and truck drivers were recruited. Participants were selected among drivers who referred to Imam Sajjad Hospital (Tehran, Iran) during 2013-2015. The Manchester driving behavior questionnaire (MDBQ), big five personality test (NEO personality inventory) and semi-structured interview (schizophrenia and affective disorders scale) were used. After two years, we surveyed all accidents due to human factors that involved the recruited drivers. The data were analyzed using the SPSS software by performing the descriptive statistics, t-test, and multiple logistic regression analysis methods. P values less than 0.05 were considered statistically significant. Results: In terms of controlling the effective and demographic variables, the findings revealed significant differences between the two groups of drivers that were and were not involved in road accidents. In addition, it was found that depression and anxiety could increase the odds ratio (OR) of road accidents by 2.4- and 2.7-folds, respectively (P=0.04, P=0.004). It is noteworthy to mention that neuroticism alone can increase the odds of road accidents by 1.1-fold (P=0.009), but other personality factors did not have a significant effect on the equation. Conclusion: The results revealed that some mental disorders affect the incidence of road collisions. Considering the importance and sensitivity of driving behavior, it is necessary to evaluate multiple psychological factors influencing drivers before and after receiving or renewing their driver’s license. PMID:28293047

  18. Personality, Driving Behavior and Mental Disorders Factors as Predictors of Road Traffic Accidents Based on Logistic Regression

    Directory of Open Access Journals (Sweden)

    Seyyed Salman Alavi

    2017-01-01

    Full Text Available Background: The aim of this study was to evaluate the effect of variables such as personality traits, driving behavior and mental illness on road traffic accidents among the drivers with accidents and those without road crash. Methods: In this cohort study, 800 bus and truck drivers were recruited. Participants were selected among drivers who referred to Imam Sajjad Hospital (Tehran, Iran during 2013-2015. The Manchester driving behavior questionnaire (MDBQ, big five personality test (NEO personality inventory and semi-structured interview (SADS were used. After two years, we surveyed all accidents due to human factors that involved the recruited drivers. The data were analyzed using the SPSS software by performing the descriptive statistics, t-test, and multiple logistic regression analysis methods. P values less than 0.05 were considered statistically significant. Results: In terms of controlling the effective and demographic variables, the findings revealed significant differences between the two groups of drivers that were and were not involved in road accidents. In addition, it was found that depression and anxiety could increase the odds ratio (OR of road accidents by 2.4- and 2.7-folds, respectively (P=0.04, P=0.004. It is noteworthy to mention that neuroticism alone can increase the odds of road accidents by 1.1-fold (P=0.009, but other personality factors did not have a significant effect on the equation. Conclusion: The results revealed that some mental disorders affect the incidence of road collisions. Considering the importance and sensitivity of driving behavior, it is necessary to evaluate multiple psychological factors influencing drivers before and after receiving or renewing their driver’s license.

  19. Identifying and prioritizing the factors effective in customer satisfaction using the TOPSIS method

    Directory of Open Access Journals (Sweden)

    H Forougozar

    2014-01-01

    Full Text Available Introduction: Customer satisfaction has been suggested as one of the interesting and challenging issues of management in the new millennium. In addition, oral and dental health and the quality of the services the health centers delivered to the patients directly affect the customer satisfaction. Therefore, the present study aimed to identify, investigate, and rank the factors affecting the customer satisfaction in the department of dentistry of Shiraz Farhangiyan health center. Method: The present descriptive study was conducted on the specialists and patients of the department of dentistry of Shiraz Farhangiyan health center. The validity of the questionnaire utilized in the study was confirmed by expert professors and its reliability was approved using the Cronbach’s alpha formula. Finally, the study data were analyzed in SPSS statistical software (v. 16, using inferential statistics. Results: All the hypotheses were confirmed by the results of the statistical analyses and quality, services, and expenditures revealed to affect the customer satisfaction in the department of dentistry of Shiraz Farhangiyan health center. Moreover, these factors were ranked using the TOPSIS method and the results showed quality and expenditures as the most and the least effective factors in customer satisfaction, respectively. Conclusion: Since restoring and arranging the organization based on the customer needs is among the main priorities of designing an organization, managers are suggested to take measures for organizational reformation based on the customers’ priorities. Of course, conducting such programs is of utmost importance in health and treatment environments, leading to provision of better services and facilitation of learning, education, and research. Thus, identifying the effective factors in customer satisfaction and ranking them are highly important.

  20. Quality of life in breast cancer patients--a quantile regression analysis.

    Science.gov (United States)

    Pourhoseingholi, Mohamad Amin; Safaee, Azadeh; Moghimi-Dehkordi, Bijan; Zeighami, Bahram; Faghihzadeh, Soghrat; Tabatabaee, Hamid Reza; Pourhoseingholi, Asma

    2008-01-01

    Quality of life study has an important role in health care especially in chronic diseases, in clinical judgment and in medical resources supplying. Statistical tools like linear regression are widely used to assess the predictors of quality of life. But when the response is not normal the results are misleading. The aim of this study is to determine the predictors of quality of life in breast cancer patients, using quantile regression model and compare to linear regression. A cross-sectional study conducted on 119 breast cancer patients that admitted and treated in chemotherapy ward of Namazi hospital in Shiraz. We used QLQ-C30 questionnaire to assessment quality of life in these patients. A quantile regression was employed to assess the assocciated factors and the results were compared to linear regression. All analysis carried out using SAS. The mean score for the global health status for breast cancer patients was 64.92+/-11.42. Linear regression showed that only grade of tumor, occupational status, menopausal status, financial difficulties and dyspnea were statistically significant. In spite of linear regression, financial difficulties were not significant in quantile regression analysis and dyspnea was only significant for first quartile. Also emotion functioning and duration of disease statistically predicted the QOL score in the third quartile. The results have demonstrated that using quantile regression leads to better interpretation and richer inference about predictors of the breast cancer patient quality of life.

  1. Liver Fibrosis Regression Measured by Transient Elastography in Human Immunodeficiency Virus (HIV)-Hepatitis B Virus (HBV)-Coinfected Individuals on Long-Term HBV-Active Combination Antiretroviral Therapy.

    Science.gov (United States)

    Audsley, Jennifer; Robson, Christopher; Aitchison, Stacey; Matthews, Gail V; Iser, David; Sasadeusz, Joe; Lewin, Sharon R

    2016-01-01

    Background.  Advanced fibrosis occurs more commonly in human immunodeficiency virus (HIV)-hepatitis B virus (HBV) coinfected individuals; therefore, fibrosis monitoring is important in this population. However, transient elastography (TE) data in HIV-HBV coinfection are lacking. We aimed to assess liver fibrosis using TE in a cross-sectional study of HIV-HBV coinfected individuals receiving combination HBV-active (lamivudine and/or tenofovir/tenofovir-emtricitabine) antiretroviral therapy, identify factors associated with advanced fibrosis, and examine change in fibrosis in those with >1 TE assessment. Methods.  We assessed liver fibrosis in 70 HIV-HBV coinfected individuals on HBV-active combination antiretroviral therapy (cART). Change in fibrosis over time was examined in a subset with more than 1 TE result (n = 49). Clinical and laboratory variables at the time of the first TE were collected, and associations with advanced fibrosis (≥F3, Metavir scoring system) and fibrosis regression (of least 1 stage) were examined. Results.  The majority of the cohort (64%) had mild to moderate fibrosis at the time of the first TE, and we identified alanine transaminase, platelets, and detectable HIV ribonucleic acid as associated with advanced liver fibrosis. Alanine transaminase and platelets remained independently advanced in multivariate modeling. More than 28% of those with >1 TE subsequently showed liver fibrosis regression, and higher baseline HBV deoxyribonucleic acid was associated with regression. Prevalence of advanced fibrosis (≥F3) decreased 12.3% (32.7%-20.4%) over a median of 31 months. Conclusions.  The observed fibrosis regression in this group supports the beneficial effects of cART on liver stiffness. It would be important to study a larger group of individuals with more advanced fibrosis to more definitively assess factors associated with liver fibrosis regression.

  2. Predictors of unsuccessful outcome in cemented femoral revisions using bone impaction grafting; Cox regression analysis of 208 cases.

    Science.gov (United States)

    Te Stroet, Martijn A J; Rijnen, Wim H C; Gardeniers, Jean W M; Schreurs, B Willem; Hannink, Gerjon

    2016-09-29

    Despite improvements in the technique of femoral impaction bone grafting, reconstruction failures still can occur. Therefore, the aim of our study was to determine risk factors for the endpoint re-revision for any reason. We used prospectively collected demographic, clinical and surgical data of all 202 patients who underwent 208 femoral revisions using the X-change Femoral Revision System (Stryker-Howmedica), fresh-frozen morcellised allograft and a cemented polished Exeter stem in our department from 1991 to 2007. Univariable and multivariable Cox regression analyses were performed to identify potential factors associated with re-revision. The mean follow-up was 10.6 (5-21) years. The cumulative re-revision rate was 6.3% (13/208). After univariable selection, sex, age, body mass index (BMI), American Association of Anesthesiologists (ASA) classification, type of removed femoral component, and mesh used for reconstruction were included in multivariable regression analysis.In the multivariable analysis, BMI was the only factor that was significantly associated with the risk of re-revision after bone impaction grafting (BMI ≥30 vs. BMI <30, HR = 6.54 [95% CI 1.89-22.65]; p = 0.003). BMI was the only factor associated with the risk of re-revision for any reason. Besides BMI also other factors, such as Endoklinik score and the type of removed femoral component, can provide guidance in the process of preclinical decision making. With the knowledge obtained from this study, preoperative patient selection, informed consent, and treatment protocols can be better adjusted to the individual patient who needs to undergo a femoral revision with impaction bone grafting.

  3. Arterial hypertension and associated factors in patients submitted to myocardial revascularization

    Directory of Open Access Journals (Sweden)

    Flávia Cortez Colósimo

    2015-04-01

    Full Text Available OBJECTIVE To identify the prevalence of arterial hypertension and associated factors in patients submitted to myocardial revascularization. METHOD Cross-sectional study using the database of a hospital in São Paulo (SP, Brazil containing 3010 patients with coronary artery disease submitted to myocardial revascularization. A multiple logistic regression was performed to identify variables independently associated with hypertension (statistical significance: p1.3: (OR=1.37;CI:1.09-1.72. CONCLUSION A high prevalence of arterial hypertension and association with both non-modifiable and modifiable factors was observed.

  4. Desertification Susceptibility Mapping Using Logistic Regression Analysis in the Djelfa Area, Algeria

    Directory of Open Access Journals (Sweden)

    Farid Djeddaoui

    2017-10-01

    Full Text Available The main goal of this work was to identify the areas that are most susceptible to desertification in a part of the Algerian steppe, and to quantitatively assess the key factors that contribute to this desertification. In total, 139 desertified zones were mapped using field surveys and photo-interpretation. We selected 16 spectral and geomorphic predictive factors, which a priori play a significant role in desertification. They were mainly derived from Landsat 8 imagery and Shuttle Radar Topographic Mission digital elevation model (SRTM DEM. Some factors, such as the topographic position index (TPI and curvature, were used for the first time in this kind of study. For this purpose, we adapted the logistic regression algorithm for desertification susceptibility mapping, which has been widely used for landslide susceptibility mapping. The logistic model was evaluated using the area under the receiver operating characteristic (ROC curve. The model accuracy was 87.8%. We estimated the model uncertainties using a bootstrap method. Our analysis suggests that the predictive model is robust and stable. Our results indicate that land cover factors, including normalized difference vegetation index (NDVI and rangeland classes, play a major role in determining desertification occurrence, while geomorphological factors have a limited impact. The predictive map shows that 44.57% of the area is classified as highly to very highly susceptible to desertification. The developed approach can be used to assess desertification in areas with similar characteristics and to guide possible actions to combat desertification.

  5. Mapping geogenic radon potential by regression kriging

    Energy Technology Data Exchange (ETDEWEB)

    Pásztor, László [Institute for Soil Sciences and Agricultural Chemistry, Centre for Agricultural Research, Hungarian Academy of Sciences, Department of Environmental Informatics, Herman Ottó út 15, 1022 Budapest (Hungary); Szabó, Katalin Zsuzsanna, E-mail: sz_k_zs@yahoo.de [Department of Chemistry, Institute of Environmental Science, Szent István University, Páter Károly u. 1, Gödöllő 2100 (Hungary); Szatmári, Gábor; Laborczi, Annamária [Institute for Soil Sciences and Agricultural Chemistry, Centre for Agricultural Research, Hungarian Academy of Sciences, Department of Environmental Informatics, Herman Ottó út 15, 1022 Budapest (Hungary); Horváth, Ákos [Department of Atomic Physics, Eötvös University, Pázmány Péter sétány 1/A, 1117 Budapest (Hungary)

    2016-02-15

    Radon ({sup 222}Rn) gas is produced in the radioactive decay chain of uranium ({sup 238}U) which is an element that is naturally present in soils. Radon is transported mainly by diffusion and convection mechanisms through the soil depending mainly on the physical and meteorological parameters of the soil and can enter and accumulate in buildings. Health risks originating from indoor radon concentration can be attributed to natural factors and is characterized by geogenic radon potential (GRP). Identification of areas with high health risks require spatial modeling, that is, mapping of radon risk. In addition to geology and meteorology, physical soil properties play a significant role in the determination of GRP. In order to compile a reliable GRP map for a model area in Central-Hungary, spatial auxiliary information representing GRP forming environmental factors were taken into account to support the spatial inference of the locally measured GRP values. Since the number of measured sites was limited, efficient spatial prediction methodologies were searched for to construct a reliable map for a larger area. Regression kriging (RK) was applied for the interpolation using spatially exhaustive auxiliary data on soil, geology, topography, land use and climate. RK divides the spatial inference into two parts. Firstly, the deterministic component of the target variable is determined by a regression model. The residuals of the multiple linear regression analysis represent the spatially varying but dependent stochastic component, which are interpolated by kriging. The final map is the sum of the two component predictions. Overall accuracy of the map was tested by Leave-One-Out Cross-Validation. Furthermore the spatial reliability of the resultant map is also estimated by the calculation of the 90% prediction interval of the local prediction values. The applicability of the applied method as well as that of the map is discussed briefly. - Highlights: • A new method

  6. Mapping geogenic radon potential by regression kriging

    International Nuclear Information System (INIS)

    Pásztor, László; Szabó, Katalin Zsuzsanna; Szatmári, Gábor; Laborczi, Annamária; Horváth, Ákos

    2016-01-01

    Radon ( 222 Rn) gas is produced in the radioactive decay chain of uranium ( 238 U) which is an element that is naturally present in soils. Radon is transported mainly by diffusion and convection mechanisms through the soil depending mainly on the physical and meteorological parameters of the soil and can enter and accumulate in buildings. Health risks originating from indoor radon concentration can be attributed to natural factors and is characterized by geogenic radon potential (GRP). Identification of areas with high health risks require spatial modeling, that is, mapping of radon risk. In addition to geology and meteorology, physical soil properties play a significant role in the determination of GRP. In order to compile a reliable GRP map for a model area in Central-Hungary, spatial auxiliary information representing GRP forming environmental factors were taken into account to support the spatial inference of the locally measured GRP values. Since the number of measured sites was limited, efficient spatial prediction methodologies were searched for to construct a reliable map for a larger area. Regression kriging (RK) was applied for the interpolation using spatially exhaustive auxiliary data on soil, geology, topography, land use and climate. RK divides the spatial inference into two parts. Firstly, the deterministic component of the target variable is determined by a regression model. The residuals of the multiple linear regression analysis represent the spatially varying but dependent stochastic component, which are interpolated by kriging. The final map is the sum of the two component predictions. Overall accuracy of the map was tested by Leave-One-Out Cross-Validation. Furthermore the spatial reliability of the resultant map is also estimated by the calculation of the 90% prediction interval of the local prediction values. The applicability of the applied method as well as that of the map is discussed briefly. - Highlights: • A new method, regression

  7. Effective dose in abdominal digital radiography: Patient factor

    Energy Technology Data Exchange (ETDEWEB)

    Jang, Ji Sung; Koo, Hyun Jung; Park, Jung Hoon; Cho, Young Chul; Do, Kyung Hyun [Dept. of Radiology, and Research Institute of Radiology, Asan Medical Center, University of Ulsan College of Medicine, Seoul(Korea, Republic of); Yang, Hyung Jin [Dept. of Medical Physics, Korea University, Seoul (Korea, Republic of)

    2017-08-15

    To identify independent patient factors associated with an increased radiation dose, and to evaluate the effect of patient position on the effective dose in abdominal digital radiography. We retrospectively evaluated the effective dose for abdominal digital radiography in 222 patients. The patients were divided into two groups based on the cut-off dose value of 0.311 mSv (the upper third quartile of dose distribution): group A (n = 166) and group B (n = 56). Through logistic regression, independent factors associated with a larger effective dose were identified. The effect of patient position on the effective dose was evaluated using a paired t-test. High body mass index (BMI) (≥ 23 kg/m2), presence of ascites, and spinal metallic instrumentation were significantly associated with a larger effective dose. Multivariate logistic regression analysis revealed that high BMI [odds ratio (OR), 25.201; p < 0.001] and ascites (OR, 25.132; p < 0.001) were significantly associated with a larger effective dose. The effective dose was significantly lesser (22.6%) in the supine position than in the standing position (p < 0.001). High BMI and ascites were independent factors associated with a larger effective dose in abdominal digital radiography. Significant dose reduction in patients with these factors may be achieved by placing the patient in the supine position during abdominal digital radiography.

  8. Transcription factor expression uniquely identifies most postembryonic neuronal lineages in the Drosophila thoracic central nervous system.

    Science.gov (United States)

    Lacin, Haluk; Zhu, Yi; Wilson, Beth A; Skeath, James B

    2014-03-01

    Most neurons of the adult Drosophila ventral nerve cord arise from a burst of neurogenesis during the third larval instar stage. Most of this growth occurs in thoracic neuromeres, which contain 25 individually identifiable postembryonic neuronal lineages. Initially, each lineage consists of two hemilineages--'A' (Notch(On)) and 'B' (Notch(Off))--that exhibit distinct axonal trajectories or fates. No reliable method presently exists to identify these lineages or hemilineages unambiguously other than labor-intensive lineage-tracing methods. By combining mosaic analysis with a repressible cell marker (MARCM) analysis with gene expression studies, we constructed a gene expression map that enables the rapid, unambiguous identification of 23 of the 25 postembryonic lineages based on the expression of 15 transcription factors. Pilot genetic studies reveal that these transcription factors regulate the specification and differentiation of postembryonic neurons: for example, Nkx6 is necessary and sufficient to direct axonal pathway selection in lineage 3. The gene expression map thus provides a descriptive foundation for the genetic and molecular dissection of adult-specific neurogenesis and identifies many transcription factors that are likely to regulate the development and differentiation of discrete subsets of postembryonic neurons.

  9. Identifying risk factors for victimization among male prisoners in Taiwan.

    Science.gov (United States)

    Kuo, Shih-Ya; Cuvelier, Steven J; Huang, Yung-Shun

    2014-02-01

    This study identified risk factors for prison victimization in Taiwan with an application of Western literature and assessed the extent of its applicability in an Eastern context. The sample was drawn from four male prisons located in Northern, Central, Southern, and Eastern Taiwan; a total of 1,181 valid surveys were collected. The results generally support the major findings of the extant Western studies. Crowding, however, was not significantly associated with the risk of victimization in any of the statistical models, which might be related to the different experiences and living conditions in the free community between Taiwanese and American inmates. This study generated clear policy implications, which may reduce prison victimization and engender a greater sense of well-being in the prison environment.

  10. Modeling the potential risk factors of bovine viral diarrhea prevalence in Egypt using univariable and multivariable logistic regression analyses

    Directory of Open Access Journals (Sweden)

    Abdelfattah M. Selim

    2018-03-01

    Full Text Available Aim: The present cross-sectional study was conducted to determine the seroprevalence and potential risk factors associated with Bovine viral diarrhea virus (BVDV disease in cattle and buffaloes in Egypt, to model the potential risk factors associated with the disease using logistic regression (LR models, and to fit the best predictive model for the current data. Materials and Methods: A total of 740 blood samples were collected within November 2012-March 2013 from animals aged between 6 months and 3 years. The potential risk factors studied were species, age, sex, and herd location. All serum samples were examined with indirect ELIZA test for antibody detection. Data were analyzed with different statistical approaches such as Chi-square test, odds ratios (OR, univariable, and multivariable LR models. Results: Results revealed a non-significant association between being seropositive with BVDV and all risk factors, except for species of animal. Seroprevalence percentages were 40% and 23% for cattle and buffaloes, respectively. OR for all categories were close to one with the highest OR for cattle relative to buffaloes, which was 2.237. Likelihood ratio tests showed a significant drop of the -2LL from univariable LR to multivariable LR models. Conclusion: There was an evidence of high seroprevalence of BVDV among cattle as compared with buffaloes with the possibility of infection in different age groups of animals. In addition, multivariable LR model was proved to provide more information for association and prediction purposes relative to univariable LR models and Chi-square tests if we have more than one predictor.

  11. A Bayesian Nonparametric Causal Model for Regression Discontinuity Designs

    Science.gov (United States)

    Karabatsos, George; Walker, Stephen G.

    2013-01-01

    The regression discontinuity (RD) design (Thistlewaite & Campbell, 1960; Cook, 2008) provides a framework to identify and estimate causal effects from a non-randomized design. Each subject of a RD design is assigned to the treatment (versus assignment to a non-treatment) whenever her/his observed value of the assignment variable equals or…

  12. DIABETES MELLITUS AND ITS ROLE IN CAUDAL REGRESSION SYNDROME

    Directory of Open Access Journals (Sweden)

    Sandeep

    2016-03-01

    Full Text Available BACKGROUND Caudal regression syndrome also called as sacral agenesis or hypoplasia of the sacrum is a congenital disorder in which there is abnormal development of the lower part of the vertebral column 1 due to which there is a plethora of abnormalities such as gross motor deficiencies and other genitor-urinary malformations which in deed depends on the extent of malformations that is seen. Caudal regression syndrome is rare, with an estimated incidence of 1:7500-100,000. The aim of the study is to find the frequency of manifestations and the manifestations itself. METHODS Fifty patients who were pregnant and were diagnosed with diabetes mellitus were identified and were referred to the Department of Medicine. RESULTS In the present study the frequency of manifestations of caudal regression syndrome is 8 in 100 diagnosed patients. CONCLUSION The malformations in the babies born to diabetic mothers are high in the population of costal Karnataka and Kerala.

  13. Spontaneous regression of metastases from melanoma: review of the literature

    DEFF Research Database (Denmark)

    Kalialis, Louise Vennegaard; Drzewiecki, Krzysztof T; Klyver, Helle

    2009-01-01

    Regression of metastatic melanoma is a rare event, and review of the literature reveals a total of 76 reported cases since 1866. The proposed mechanisms include immunologic, endocrine, inflammatory and metastatic tumour nutritional factors. We conclude from this review that although the precise...

  14. [Logistic regression model of noninvasive prediction for portal hypertensive gastropathy in patients with hepatitis B associated cirrhosis].

    Science.gov (United States)

    Wang, Qingliang; Li, Xiaojie; Hu, Kunpeng; Zhao, Kun; Yang, Peisheng; Liu, Bo

    2015-05-12

    To explore the risk factors of portal hypertensive gastropathy (PHG) in patients with hepatitis B associated cirrhosis and establish a Logistic regression model of noninvasive prediction. The clinical data of 234 hospitalized patients with hepatitis B associated cirrhosis from March 2012 to March 2014 were analyzed retrospectively. The dependent variable was the occurrence of PHG while the independent variables were screened by binary Logistic analysis. Multivariate Logistic regression was used for further analysis of significant noninvasive independent variables. Logistic regression model was established and odds ratio was calculated for each factor. The accuracy, sensitivity and specificity of model were evaluated by the curve of receiver operating characteristic (ROC). According to univariate Logistic regression, the risk factors included hepatic dysfunction, albumin (ALB), bilirubin (TB), prothrombin time (PT), platelet (PLT), white blood cell (WBC), portal vein diameter, spleen index, splenic vein diameter, diameter ratio, PLT to spleen volume ratio, esophageal varices (EV) and gastric varices (GV). Multivariate analysis showed that hepatic dysfunction (X1), TB (X2), PLT (X3) and splenic vein diameter (X4) were the major occurring factors for PHG. The established regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4. The accuracy of model for PHG was 79.1% with a sensitivity of 77.2% and a specificity of 80.8%. Hepatic dysfunction, TB, PLT and splenic vein diameter are risk factors for PHG and the noninvasive predicted Logistic regression model was Logit P=-2.667+2.186X1-2.167X2+0.725X3+0.976X4.

  15. Global Prevalence of Elder Abuse: A Meta-analysis and Meta-regression.

    Science.gov (United States)

    Ho, C Sh; Wong, S Y; Chiu, M M; Ho, R Cm

    2017-06-01

    Elder abuse is increasingly recognised as a global public health and social problem. There has been limited inter-study comparison of the prevalence and risk factors for elder abuse. This study aimed to estimate the pooled and subtype prevalence of elder abuse worldwide and identify significant associated risk factors. We conducted a meta-analysis and meta-regression of 34 population-based and 17 non-population-based studies. The pooled prevalences of elder abuse were 10.0% (95% confidence interval, 5.2%-18.6%) and 34.3% (95% confidence interval, 22.9%-47.8%) in population-based studies and third party- or caregiver-reported studies, respectively. Being in a marital relationship was found to be a significant moderator using random-effects model. This meta-analysis revealed that third parties or caregivers were more likely to report abuse than older abused adults. Subgroup analyses showed that females and those resident in non-western countries were more likely to be abused. Emotional abuse was the most prevalent elder abuse subtype and financial abuse was less commonly reported by third parties or caregivers. Heterogeneity in the prevalence was due to the high proportion of married older adults in the sample. Subgroup analysis showed that cultural factors, subtypes of abuse, and gender also contributed to heterogeneity in the pooled prevalence of elder abuse.

  16. Risk factors identified for certain lymphoma subtypes

    Science.gov (United States)

    In a large international collaborative analysis of risk factors for non-Hodgkin lymphoma (NHL), scientists were able to quantify risk associated with medical history, lifestyle factors, family history of blood or lymph-borne cancers, and occupation for 11

  17. Linking demand and supply factors in identifying cultural ecosystem services of urban green infrastructures

    DEFF Research Database (Denmark)

    Hegetschweiler, K. Tessa; de Vries, Sjerp; Arnberger, Arne

    2017-01-01

    and supply factors together. The aim was to provide an overview of this highly interdisciplinary research, to describe how these linkages are being made and to identify which factors significantly influence dependent variables such as levels of use, activities or health and well-being benefits. Commonly used......Urban green infrastructure provides a number of cultural ecosystem services that are greatly appreciated by the public. In order to benefit from these services, actual contact with the respective ecosystem is often required. Furthermore, the type of services offered depend on the physical...... characteristics of the ecosystem. We conducted a review of publications dealing with demand or social factors such as user needs, preferences and values as well as spatially explicit supply or physical factors such as amount of green space, (bio)diversity, recreational infrastructure, etc. and linking demand...

  18. Sequences of Regressions Distinguish Nonmechanical from Mechanical Associations between Metabolic Factors, Body Composition, and Bone in Healthy Postmenopausal Women.

    Science.gov (United States)

    Solis-Trapala, Ivonne; Schoenmakers, Inez; Goldberg, Gail R; Prentice, Ann; Ward, Kate A

    2016-03-09

    There is increasing recognition of complex interrelations between the endocrine functions of bone and fat tissues or organs. The objective was to describe nonmechanical and mechanical links between metabolic factors, body composition, and bone with the use of graphical Markov models. Seventy postmenopausal women with a mean ± SD age of 62.3 ± 3.7 y and body mass index (in kg/m 2 ) of 24.9 ± 3.8 were recruited. Bone outcomes were peripheral quantitative computed tomography measures of the distal and diaphyseal tibia, cross-sectional area (CSA), volumetric bone mineral density (vBMD), and cortical CSA. Biomarkers of osteoblast and adipocyte function were plasma concentrations of leptin, adiponectin, osteocalcin, undercarboxylated osteocalcin (UCOC), and phylloquinone. Body composition measurements were lean and percent fat mass, which were derived with the use of a 4-compartment model. Sequences of Regressions, a subclass of graphical Markov models, were used to describe the direct (nonmechanical) and indirect (mechanical) interrelations between metabolic factors and bone by simultaneously modeling multiple bone outcomes and their relation with biomarker outcomes with lean mass, percent fat mass, and height as intermediate explanatory variables. The graphical Markov models showed both direct and indirect associations linking plasma leptin and adiponectin concentrations with CSA and vBMD. At the distal tibia, lean mass, height, and adiponectin-UCOC interaction were directly explanatory of CSA (R 2 = 0.45); at the diaphysis, lean mass, percent fat mass, leptin, osteocalcin, and age-adiponectin interaction were directly explanatory of CSA (R 2 = 0.49). The regression models exploring direct associations for vBMD were much weaker, with R 2 = 0.15 and 0.18 at the distal and diaphyseal sites, respectively. Lean mass and UCOC were associated, and the global Markov property of the graph indicated that this association was explained by osteocalcin. This study, to our

  19. Dietary patterns as identified by factor analysis and colorectal cancer among middle-aged Americans.

    Science.gov (United States)

    Flood, Andrew; Rastogi, Tanuja; Wirfält, Elisabet; Mitrou, Panagiota N; Reedy, Jill; Subar, Amy F; Kipnis, Victor; Mouw, Traci; Hollenbeck, Albert R; Leitzmann, Michael; Schatzkin, Arthur

    2008-07-01

    Although diet has long been suspected as an etiological factor for colorectal cancer, studies of single foods and nutrients have provided inconsistent results. We used factor analysis methods to study associations between dietary patterns and colorectal cancer in middle-aged Americans. Diet was assessed among 293,615 men and 198,767 women in the National Institutes of Health-AARP Diet and Health Study. Principal components factor analysis identified 3 primary dietary patterns: a fruit and vegetables, a diet foods, and a red meat and potatoes pattern. State cancer registries identified 2151 incident cases of colorectal cancer in men and 959 in women between 1995 and 2000. Men with high scores on the fruit and vegetable pattern were at decreased risk [relative risk (RR) for quintile (Q) 5 versus Q1: 0.81; 95% CI: 0.70, 0.93; P for trend = 0.004]. Both men and women had a similar risk reduction with high scores on the diet food factor: men (RR: 0.82; 95% CI: 0.72, 0.94; P for trend = 0.001) and women (RR: 0.87; 95% CI: 0.71, 1.07; P for trend = 0.06). High scores on the red meat factor were associated with increased risk: men (RR: 1.17; 95% CI: 1.02, 1.35; P for trend = 0.14) and women (RR: 1.48; 95% CI: 1.20, 1.83; P for trend = 0.0002). These results suggest that dietary patterns characterized by a low frequency of meat and potato consumption and frequent consumption of fruit and vegetables and fat-reduced foods are consistent with a decreased risk of colorectal cancer.

  20. Application of positive matrix factorization to identify potential sources of PAHs in soil of Dalian, China

    International Nuclear Information System (INIS)

    Wang Degao; Tian Fulin; Yang Meng; Liu Chenlin; Li Yifan

    2009-01-01

    Soil derived sources of polycyclic aromatic hydrocarbons (PAHs) in the region of Dalian, China were investigated using positive matrix factorization (PMF). Three factors were separated based on PMF for the statistical investigation of the datasets both in summer and winter. These factors were dominated by the pattern of single sources or groups of similar sources, showing seasonal and regional variations. The main sources of PAHs in Dalian soil in summer were the emissions from coal combustion average (46%), diesel engine (30%), and gasoline engine (24%). In winter, the main sources were the emissions from coal-fired boiler (72%), traffic average (20%), and gasoline engine (8%). These factors with strong seasonality indicated that coal combustion in winter and traffic exhaust in summer dominated the sources of PAHs in soil. These results suggested that PMF model was a proper approach to identify the sources of PAHs in soil. - PMF model is a proper approach to identify potential sources of PAHs in soil based on the PAH profiles measured in the field and those published in the literature.

  1. T cell receptor (TCR-transgenic CD8 lymphocytes rendered insensitive to transforming growth factor beta (TGFβ signaling mediate superior tumor regression in an animal model of adoptive cell therapy

    Directory of Open Access Journals (Sweden)

    Quatromoni Jon G

    2012-06-01

    Full Text Available Abstract Tumor antigen-reactive T cells must enter into an immunosuppressive tumor microenvironment, continue to produce cytokine and deliver apoptotic death signals to affect tumor regression. Many tumors produce transforming growth factor beta (TGFβ, which inhibits T cell activation, proliferation and cytotoxicity. In a murine model of adoptive cell therapy, we demonstrate that transgenic Pmel-1 CD8 T cells, rendered insensitive to TGFβ by transduction with a TGFβ dominant negative receptor II (DN, were more effective in mediating regression of established B16 melanoma. Smaller numbers of DN Pmel-1 T cells effectively mediated tumor regression and retained the ability to produce interferon-γ in the tumor microenvironment. These results support efforts to incorporate this DN receptor in clinical trials of adoptive cell therapy for cancer.

  2. Prediction of unwanted pregnancies using logistic regression, probit regression and discriminant analysis.

    Science.gov (United States)

    Ebrahimzadeh, Farzad; Hajizadeh, Ebrahim; Vahabi, Nasim; Almasian, Mohammad; Bakhteyar, Katayoon

    2015-01-01

    Unwanted pregnancy not intended by at least one of the parents has undesirable consequences for the family and the society. In the present study, three classification models were used and compared to predict unwanted pregnancies in an urban population. In this cross-sectional study, 887 pregnant mothers referring to health centers in Khorramabad, Iran, in 2012 were selected by the stratified and cluster sampling; relevant variables were measured and for prediction of unwanted pregnancy, logistic regression, discriminant analysis, and probit regression models and SPSS software version 21 were used. To compare these models, indicators such as sensitivity, specificity, the area under the ROC curve, and the percentage of correct predictions were used. The prevalence of unwanted pregnancies was 25.3%. The logistic and probit regression models indicated that parity and pregnancy spacing, contraceptive methods, household income and number of living male children were related to unwanted pregnancy. The performance of the models based on the area under the ROC curve was 0.735, 0.733, and 0.680 for logistic regression, probit regression, and linear discriminant analysis, respectively. Given the relatively high prevalence of unwanted pregnancies in Khorramabad, it seems necessary to revise family planning programs. Despite the similar accuracy of the models, if the researcher is interested in the interpretability of the results, the use of the logistic regression model is recommended.

  3. Semiparametric nonlinear quantile regression model for financial returns

    Czech Academy of Sciences Publication Activity Database

    Avdulaj, Krenar; Baruník, Jozef

    2017-01-01

    Roč. 21, č. 1 (2017), s. 81-97 ISSN 1081-1826 R&D Projects: GA ČR(CZ) GBP402/12/G097 Institutional support: RVO:67985556 Keywords : copula quantile regression * realized volatility * value-at-risk Subject RIV: AH - Economic s OBOR OECD: Applied Economic s, Econometrics Impact factor: 0.649, year: 2016 http://library.utia.cas.cz/separaty/2017/E/avdulaj-0472346.pdf

  4. On-line mixture-based alternative to logistic regression

    Czech Academy of Sciences Publication Activity Database

    Nagy, Ivan; Suzdaleva, Evgenia

    2016-01-01

    Roč. 26, č. 5 (2016), s. 417-437 ISSN 1210-0552 R&D Projects: GA ČR GA15-03564S Institutional support: RVO:67985556 Keywords : on-line modeling * on-line logistic regression * recursive mixture estimation * data dependent pointer Subject RIV: BB - Applied Statistics, Operational Research Impact factor: 0.394, year: 2016 http://library.utia.cas.cz/separaty/2016/ZS/suzdaleva-0464463.pdf

  5. Factors affecting CO_2 emissions in China’s agriculture sector: Evidence from geographically weighted regression model

    International Nuclear Information System (INIS)

    Xu, Bin; Lin, Boqiang

    2017-01-01

    China is currently the world's largest emitter of carbon dioxide. Considered as a large agricultural country, carbon emission in China’s agriculture sector keeps on growing rapidly. It is, therefore, of great importance to investigate the driving forces of carbon dioxide emissions in this sector. The traditional regression estimation can only get “average” and “global” parameter estimates; it excludes the “local” parameter estimates which vary across space in some spatial systems. Geographically weighted regression embeds the latitude and longitude of the sample data into the regression parameters, and uses the local weighted least squares method to estimate the parameters point–by–point. To reveal the nonstationary spatial effects of driving forces, geographically weighted regression model is employed in this paper. The results show that economic growth is positively correlated with emissions, with the impact in the western region being less than that in the central and eastern regions. Urbanization is positively related to emissions but produces opposite effects pattern. Energy intensity is also correlated with emissions, with a decreasing trend from the eastern region to the central and western regions. Therefore, policymakers should take full account of the spatial nonstationarity of driving forces in designing emission reduction policies. - Highlights: • We explore the driving forces of CO_2 emissions in the agriculture sector. • Urbanization is positively related to emissions but produces opposite effect pattern. • The effect of energy intensity declines from the eastern region to western region.

  6. Identifying context factors explaining physician's low performance in communication assessment: an explorative study in general practice.

    Science.gov (United States)

    Essers, Geurt; van Dulmen, Sandra; van Weel, Chris; van der Vleuten, Cees; Kramer, Anneke

    2011-12-13

    Communication is a key competence for health care professionals. Analysis of registrar and GP communication performance in daily practice, however, suggests a suboptimal application of communication skills. The influence of context factors could reveal why communication performance levels, on average, do not appear adequate. The context of daily practice may require different skills or specific ways of handling these skills, whereas communication skills are mostly treated as generic. So far no empirical analysis of the context has been made. Our aim was to identify context factors that could be related to GP communication. A purposive sample of real-life videotaped GP consultations was analyzed (N = 17). As a frame of reference we chose the MAAS-Global, a widely used assessment instrument for medical communication. By inductive reasoning, we analyzed the GP behaviour in the consultation leading to poor item scores on the MAAS-Global. In these cases we looked for the presence of an intervening context factor, and how this might explain the actual GP communication behaviour. We reached saturation after having viewed 17 consultations. We identified 19 context factors that could potentially explain the deviation from generic recommendations on communication skills. These context factors can be categorized into doctor-related, patient-related, and consultation-related factors. Several context factors seem to influence doctor-patient communication, requiring the GP to apply communication skills differently from recommendations on communication. From this study we conclude that there is a need to explicitly account for context factors in the assessment of GP (and GP registrar) communication performance. The next step is to validate our findings.

  7. Developing Predictive Models for Algal Bloom Occurrence and Identifying Factors Controlling their Occurrence in the Charlotte County and Surroundings

    Science.gov (United States)

    Karki, S.; Sultan, M.; Elkadiri, R.; Chouinard, K.

    2017-12-01

    Numerous occurrences of harmful algal blooms (Karenia Brevis) were reported from Southwest Florida along the coast of Charlotte County, Florida. We are developing data-driven (remote sensing, field, and meteorological data) models to accomplish the following: (1) identify the factors controlling bloom development, (2) forecast bloom occurrences, and (3) make recommendations for monitoring variables that are found to be most indicative of algal bloom occurrences and for identifying optimum locations for monitoring stations. To accomplish these three tasks we completed/are working on the following steps. Firstly, we developed an automatic system for downloading and processing of ocean color data acquired through MODIS Terra and MODIS Aqua products using SeaDAS ocean color processing software. Examples of extracted variables include: chlorophyll a (OC3M), chlorophyll a Generalized Inherent Optical Property (GIOP), chlorophyll a Garver-Siegel- Maritorena (GSM), sea surface temperature (SST), Secchi disk depth, euphotic depth, turbidity index, wind direction and speed, colored dissolved organic material (CDOM). Secondly we are developing a GIS database and a web-based GIS to host the generated remote sensing-based products in addition to relevant meteorological and field data. Examples of the meteorological and field inputs include: precipitation amount and rates, concentrations of nitrogen, phosphorous, fecal coliform and Dissolved Oxygen (DO). Thirdly, we are constructing and validating a multivariate regression model and an artificial neural network model to simulate past algal bloom occurrences using the compiled archival remote sensing, meteorological, and field data. The validated model will then be used to predict the timing and location of algal bloom occurrences. The developed system, upon completion, could enhance the decision making process, improve the citizen's quality of life, and strengthen the local economy.

  8. The likelihood of achieving quantified road safety targets: a binary logistic regression model for possible factors.

    Science.gov (United States)

    Sze, N N; Wong, S C; Lee, C Y

    2014-12-01

    In past several decades, many countries have set quantified road safety targets to motivate transport authorities to develop systematic road safety strategies and measures and facilitate the achievement of continuous road safety improvement. Studies have been conducted to evaluate the association between the setting of quantified road safety targets and road fatality reduction, in both the short and long run, by comparing road fatalities before and after the implementation of a quantified road safety target. However, not much work has been done to evaluate whether the quantified road safety targets are actually achieved. In this study, we used a binary logistic regression model to examine the factors - including vehicle ownership, fatality rate, and national income, in addition to level of ambition and duration of target - that contribute to a target's success. We analyzed 55 quantified road safety targets set by 29 countries from 1981 to 2009, and the results indicate that targets that are in progress and with lower level of ambitions had a higher likelihood of eventually being achieved. Moreover, possible interaction effects on the association between level of ambition and the likelihood of success are also revealed. Copyright © 2014 Elsevier Ltd. All rights reserved.

  9. Regression and regression analysis time series prediction modeling on climate data of quetta, pakistan

    International Nuclear Information System (INIS)

    Jafri, Y.Z.; Kamal, L.

    2007-01-01

    Various statistical techniques was used on five-year data from 1998-2002 of average humidity, rainfall, maximum and minimum temperatures, respectively. The relationships to regression analysis time series (RATS) were developed for determining the overall trend of these climate parameters on the basis of which forecast models can be corrected and modified. We computed the coefficient of determination as a measure of goodness of fit, to our polynomial regression analysis time series (PRATS). The correlation to multiple linear regression (MLR) and multiple linear regression analysis time series (MLRATS) were also developed for deciphering the interdependence of weather parameters. Spearman's rand correlation and Goldfeld-Quandt test were used to check the uniformity or non-uniformity of variances in our fit to polynomial regression (PR). The Breusch-Pagan test was applied to MLR and MLRATS, respectively which yielded homoscedasticity. We also employed Bartlett's test for homogeneity of variances on a five-year data of rainfall and humidity, respectively which showed that the variances in rainfall data were not homogenous while in case of humidity, were homogenous. Our results on regression and regression analysis time series show the best fit to prediction modeling on climatic data of Quetta, Pakistan. (author)

  10. Linear regression in astronomy. I

    Science.gov (United States)

    Isobe, Takashi; Feigelson, Eric D.; Akritas, Michael G.; Babu, Gutti Jogesh

    1990-01-01

    Five methods for obtaining linear regression fits to bivariate data with unknown or insignificant measurement errors are discussed: ordinary least-squares (OLS) regression of Y on X, OLS regression of X on Y, the bisector of the two OLS lines, orthogonal regression, and 'reduced major-axis' regression. These methods have been used by various researchers in observational astronomy, most importantly in cosmic distance scale applications. Formulas for calculating the slope and intercept coefficients and their uncertainties are given for all the methods, including a new general form of the OLS variance estimates. The accuracy of the formulas was confirmed using numerical simulations. The applicability of the procedures is discussed with respect to their mathematical properties, the nature of the astronomical data under consideration, and the scientific purpose of the regression. It is found that, for problems needing symmetrical treatment of the variables, the OLS bisector performs significantly better than orthogonal or reduced major-axis regression.

  11. Logic regression and its extensions.

    Science.gov (United States)

    Schwender, Holger; Ruczinski, Ingo

    2010-01-01

    Logic regression is an adaptive classification and regression procedure, initially developed to reveal interacting single nucleotide polymorphisms (SNPs) in genetic association studies. In general, this approach can be used in any setting with binary predictors, when the interaction of these covariates is of primary interest. Logic regression searches for Boolean (logic) combinations of binary variables that best explain the variability in the outcome variable, and thus, reveals variables and interactions that are associated with the response and/or have predictive capabilities. The logic expressions are embedded in a generalized linear regression framework, and thus, logic regression can handle a variety of outcome types, such as binary responses in case-control studies, numeric responses, and time-to-event data. In this chapter, we provide an introduction to the logic regression methodology, list some applications in public health and medicine, and summarize some of the direct extensions and modifications of logic regression that have been proposed in the literature. Copyright © 2010 Elsevier Inc. All rights reserved.

  12. Identifying areas at risk of low birth weight using spatial epidemiology: A small area surveillance study.

    Science.gov (United States)

    Insaf, Tabassum Z; Talbot, Thomas

    2016-07-01

    To assess the geographic distribution of Low Birth Weight (LBW) in New York State among singleton births using a spatial regression approach in order to identify priority areas for public health actions. LBW was defined as birth weight less than 2500g. Geocoded data from 562,586 birth certificates in New York State (years 2008-2012) were merged with 2010 census data at the tract level. To provide stable estimates and maintain confidentiality, data were aggregated to yield 1268 areas of analysis. LBW prevalence among singleton births was related with area-level behavioral, socioeconomic and demographic characteristics using a Poisson mixed effects spatial error regression model. Observed low birth weight showed statistically significant auto-correlation in our study area (Moran's I 0.16 p value 0.0005). After over-dispersion correction and accounting for fixed effects for selected social determinants, spatial autocorrelation was fully accounted for (Moran's I-0.007 p value 0.241). The proportion of LBW was higher in areas with larger Hispanic or Black populations and high smoking prevalence. Smoothed maps with predicted prevalence were developed to identify areas at high risk of LBW. Spatial patterns of residual variation were analyzed to identify unique risk factors. Neighborhood racial composition contributes to disparities in LBW prevalence beyond differences in behavioral and socioeconomic factors. Small-area analyses of LBW can identify areas for targeted interventions and display unique local patterns that should be accounted for in prevention strategies. Copyright © 2016 The Authors. Published by Elsevier Inc. All rights reserved.

  13. Local acceptance of wind energy: Factors of success identified in French and German case studies

    International Nuclear Information System (INIS)

    Jobert, Arthur; Laborgne, Pia; Mimler, Solveig

    2007-01-01

    The objective of this paper is to identify and analyse factors that are important for winning acceptance of wind-energy parks on the local level. The developers of wind-energy parks need to know how to manage 'social acceptance' at the different stages of planning, realisation and operation. Five case studies in France and Germany focused on factors of success in developing a wind-energy project on a given site and illuminated how policy frameworks influence local acceptance. Our hypothesis is that these factors fall into two categories: institutional conditions, such as economic incentives and regulations; and site-specific conditions (territorial factors), such as the local economy, the local geography, local actors, and the actual on-site planning process (project management)

  14. Job satisfaction of nurses and identifying factors of job satisfaction in Slovenian Hospitals

    Science.gov (United States)

    Lorber, Mateja; Skela Savič, Brigita

    2012-01-01

    Aim To determine the level of job satisfaction of nursing professionals in Slovenian hospitals and factors influencing job satisfaction in nursing. Methods The study included 4 hospitals selected from the hospital list comprising 26 hospitals in Slovenia. The employees of these hospitals represent 29.8% and 509 employees included in the study represent 6% of all employees in nursing in Slovenian hospitals. One structured survey questionnaire was administered to the leaders and the other to employees, both consisting 154 items evaluated on a 5 point Likert-type scale. We examined the correlation between independent variables (age, number of years of employment, behavior of leaders, personal characteristics of leaders, and managerial competencies of leaders) and the dependent variable (job satisfaction – satisfaction with the work, coworkers, management, pay, etc) by applying correlation analysis and multivariate regression analysis. In addition, factor analysis was used to establish characteristic components of the variables measured. Results We found a medium level of job satisfaction in both leaders (3.49 ± 0.5) and employees (3.19 ± 0.6), however, there was a significant difference between their estimates (t = 3.237; P = Job satisfaction was explained by age (P job satisfaction variance. Conclusion Satisfied employees play a crucial role in an organization’s success, so health care organizations must be aware of the importance of employees’ job satisfaction. It is recommended to monitor employees’ job satisfaction levels on an annual basis. PMID:22661140

  15. A Global Interactome Map of the Dengue Virus NS1 Identifies Virus Restriction and Dependency Host Factors

    Directory of Open Access Journals (Sweden)

    Mohamed Lamine Hafirassou

    2017-12-01

    Full Text Available Dengue virus (DENV infections cause the most prevalent mosquito-borne viral disease worldwide, for which no therapies are available. DENV encodes seven non-structural (NS proteins that co-assemble and recruit poorly characterized host factors to form the DENV replication complex essential for viral infection. Here, we provide a global proteomic analysis of the human host factors that interact with the DENV NS1 protein. Combined with a functional RNAi screen, this study reveals a comprehensive network of host cellular processes involved in DENV infection and identifies DENV host restriction and dependency factors. We highlight an important role of RACK1 and the chaperonin TRiC (CCT and oligosaccharyltransferase (OST complexes during DENV replication. We further show that the OST complex mediates NS1 and NS4B glycosylation, and pharmacological inhibition of its N-glycosylation function strongly impairs DENV infection. In conclusion, our study provides a global interactome of the DENV NS1 and identifies host factors targetable for antiviral therapies.

  16. Tracking time-varying parameters with local regression

    DEFF Research Database (Denmark)

    Joensen, Alfred Karsten; Nielsen, Henrik Aalborg; Nielsen, Torben Skov

    2000-01-01

    This paper shows that the recursive least-squares (RLS) algorithm with forgetting factor is a special case of a varying-coe\\$cient model, and a model which can easily be estimated via simple local regression. This observation allows us to formulate a new method which retains the RLS algorithm, bu......, but extends the algorithm by including polynomial approximations. Simulation results are provided, which indicates that this new method is superior to the classical RLS method, if the parameter variations are smooth....

  17. Robust Nonnegative Matrix Factorization via Joint Graph Laplacian and Discriminative Information for Identifying Differentially Expressed Genes

    Directory of Open Access Journals (Sweden)

    Ling-Yun Dai

    2017-01-01

    Full Text Available Differential expression plays an important role in cancer diagnosis and classification. In recent years, many methods have been used to identify differentially expressed genes. However, the recognition rate and reliability of gene selection still need to be improved. In this paper, a novel constrained method named robust nonnegative matrix factorization via joint graph Laplacian and discriminative information (GLD-RNMF is proposed for identifying differentially expressed genes, in which manifold learning and the discriminative label information are incorporated into the traditional nonnegative matrix factorization model to train the objective matrix. Specifically, L2,1-norm minimization is enforced on both the error function and the regularization term which is robust to outliers and noise in gene data. Furthermore, the multiplicative update rules and the details of convergence proof are shown for the new model. The experimental results on two publicly available cancer datasets demonstrate that GLD-RNMF is an effective method for identifying differentially expressed genes.

  18. Combining Alphas via Bounded Regression

    Directory of Open Access Journals (Sweden)

    Zura Kakushadze

    2015-11-01

    Full Text Available We give an explicit algorithm and source code for combining alpha streams via bounded regression. In practical applications, typically, there is insufficient history to compute a sample covariance matrix (SCM for a large number of alphas. To compute alpha allocation weights, one then resorts to (weighted regression over SCM principal components. Regression often produces alpha weights with insufficient diversification and/or skewed distribution against, e.g., turnover. This can be rectified by imposing bounds on alpha weights within the regression procedure. Bounded regression can also be applied to stock and other asset portfolio construction. We discuss illustrative examples.

  19. Prevalence of asthma symptoms and associated factors in schoolchildren from Brazilian Amazon islands.

    Science.gov (United States)

    Freitas, Marly S; Monteiro, Julius Caesar S; Camelo-Nunes, Inês C; Solé, Dirceu

    2012-08-01

    To determine the prevalence of asthma symptoms among schoolchildren living on two different Brazilian Amazon islands and to identify the risk factors related to this condition. A cross-sectional study of 400 schoolchildren (5-8 years old) using written questionnaires produced by the International Study of Asthma and Allergies in Childhood, a skin prick test (allergic sensitization), and a stool examination (for helminthic infection). Nonparametric tests were carried out, and the risk factors were identified by logistic regression. The prevalence of active asthma symptoms was markedly higher in children living on Outeiro Island (OI) than those living on Combú Island (CBI) (30.5% and 16.5%, respectively). The logistic regression identified several risk factors of asthma symptoms on CBI: parental history of asthma, night coughing in the past year, and "currently have a cat". On OI, the major risk factors were parental history of asthma, personal history of eczema, having two or more older siblings, and night coughing in the past year. The risk factors in common on both the islands were night coughing in the past year and parental history of asthma. The prevalence of asthma symptoms was higher in those with a lifestyle closer to that observed in urban areas (i.e., better sanitation and hygiene), reinforcing the protective effect of a rural environment. Different risk factors were associated with asthma symptoms in schoolchildren living on OI and on CBI. This fact may reflect the environmental individuality and particularities of each island.

  20. Genome-wide screen of Pseudomonas aeruginosa In Saccharomyces cerevisiae identifies new virulence factors

    Directory of Open Access Journals (Sweden)

    Rafat eZrieq

    2015-11-01

    Full Text Available Pseudomonas aeruginosa is a human opportunistic pathogen that causes mortality in cystic fibrosis and immunocompromised patients. While many virulence factors of this pathogen have already been identified, several remain to be discovered. In this respect we set an unprecedented genome-wide screen of a P. aeruginosa expression library based on a yeast growth phenotype. 51 candidates were selected in a three-round screening process. The robustness of the screen was validated by the selection of three well known secreted proteins including one demonstrated virulence factor, the protease LepA. Further in silico sorting of the 51 candidates highlighted three potential new Pseudomonas effector candidates (Pec. By testing the cytotoxicity of wild type P. aeruginosa vs pec mutants towards macrophages and the virulence in the Caenorhabditis elegans model, we demonstrated that the three selected Pecs are novel virulence factors of P. aeruginosa. Additional cellular localization experiments in the host revealed specific localization for Pec1 and Pec2 that could inform about their respective functions.

  1. riskRegression

    DEFF Research Database (Denmark)

    Ozenne, Brice; Sørensen, Anne Lyngholm; Scheike, Thomas

    2017-01-01

    In the presence of competing risks a prediction of the time-dynamic absolute risk of an event can be based on cause-specific Cox regression models for the event and the competing risks (Benichou and Gail, 1990). We present computationally fast and memory optimized C++ functions with an R interface...... for predicting the covariate specific absolute risks, their confidence intervals, and their confidence bands based on right censored time to event data. We provide explicit formulas for our implementation of the estimator of the (stratified) baseline hazard function in the presence of tied event times. As a by...... functionals. The software presented here is implemented in the riskRegression package....

  2. Profile-driven regression for modeling and runtime optimization of mobile networks

    DEFF Research Database (Denmark)

    McClary, Dan; Syrotiuk, Violet; Kulahci, Murat

    2010-01-01

    Computer networks often display nonlinear behavior when examined over a wide range of operating conditions. There are few strategies available for modeling such behavior and optimizing such systems as they run. Profile-driven regression is developed and applied to modeling and runtime optimization...... of throughput in a mobile ad hoc network, a self-organizing collection of mobile wireless nodes without any fixed infrastructure. The intermediate models generated in profile-driven regression are used to fit an overall model of throughput, and are also used to optimize controllable factors at runtime. Unlike...

  3. Random T-DNA mutagenesis identifies a Cu-Zn-superoxide dismutase gene as a virulence factor of Sclerotinia sclerotiorum

    Science.gov (United States)

    Agrobacterium-mediated transformation (AMT) was used to identify potential virulence factors in Sclerotinia sclerotiorum. Screening AMT transformants identified two mutants showing significantly reduced virulence. The mutants showed similar growth rate, colony morphology, and sclerotial and oxalate ...

  4. Testing job typologies and identifying at-risk subpopulations using factor mixture models.

    Science.gov (United States)

    Keller, Anita C; Igic, Ivana; Meier, Laurenz L; Semmer, Norbert K; Schaubroeck, John M; Brunner, Beatrice; Elfering, Achim

    2017-10-01

    Research in occupational health psychology has tended to focus on the effects of single job characteristics or various job characteristics combined into 1 factor. However, such a variable-centered approach does not account for the clustering of job attributes among groups of employees. We addressed this issue by using a person-centered approach to (a) investigate the occurrence of different empirical constellations of perceived job stressors and resources and (b) validate the meaningfulness of profiles by analyzing their association with employee well-being and performance. We applied factor mixture modeling to identify profiles in 4 large samples consisting of employees in Switzerland (Studies 1 and 2) and the United States (Studies 3 and 4). We identified 2 profiles that spanned the 4 samples, with 1 reflecting a combination of relatively low stressors and high resources (P1) and the other relatively high stressors and low resources (P3). The profiles differed mainly in terms of their organizational and social aspects. Employees in P1 reported significantly higher mean levels of job satisfaction, performance, and general health, and lower means in exhaustion compared with P3. Additional analyses showed differential relationships between job attributes and outcomes depending on profile membership. These findings may benefit organizational interventions as they show that perceived work stressors and resources more strongly influence satisfaction and well-being in particular profiles. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  5. Trabecular bone score as an assessment tool to identify the risk of osteoporosis in axial spondyloarthritis: a case-control study.

    Science.gov (United States)

    Kang, Kwi Young; Goo, Hye Yeon; Park, Sung-Hwan; Hong, Yeon Sik

    2018-03-01

    To compare the trabecular bone score (TBS) between patients with axial spondyloarthritis (axSpA) and matched normal controls and identify risk factors associated with a low TBS. TBS and BMD were assessed in the two groups (axSpA and control) using DXA. Osteoporosis risk factors and inflammatory markers were also assessed. Disease activity and radiographic progression in the sacroiliac joint and spine were evaluated in the axSpA group. Multivariate linear regression analysis was performed to identify risk factors associated with TBS. In the axSpA group, 248 subjects were enrolled; an equal number of age- and sex-matched subjects comprised the control group. The mean TBS was 1.43 (0.08) and 1.38 (0.12) in the control and axSpA groups, respectively (P tool to identify the risk of osteoporosis in patients with axSpA.

  6. Parameter identifiability and redundancy: theoretical considerations.

    Directory of Open Access Journals (Sweden)

    Mark P Little

    Full Text Available BACKGROUND: Models for complex biological systems may involve a large number of parameters. It may well be that some of these parameters cannot be derived from observed data via regression techniques. Such parameters are said to be unidentifiable, the remaining parameters being identifiable. Closely related to this idea is that of redundancy, that a set of parameters can be expressed in terms of some smaller set. Before data is analysed it is critical to determine which model parameters are identifiable or redundant to avoid ill-defined and poorly convergent regression. METHODOLOGY/PRINCIPAL FINDINGS: In this paper we outline general considerations on parameter identifiability, and introduce the notion of weak local identifiability and gradient weak local identifiability. These are based on local properties of the likelihood, in particular the rank of the Hessian matrix. We relate these to the notions of parameter identifiability and redundancy previously introduced by Rothenberg (Econometrica 39 (1971 577-591 and Catchpole and Morgan (Biometrika 84 (1997 187-196. Within the widely used exponential family, parameter irredundancy, local identifiability, gradient weak local identifiability and weak local identifiability are shown to be largely equivalent. We consider applications to a recently developed class of cancer models of Little and Wright (Math Biosciences 183 (2003 111-134 and Little et al. (J Theoret Biol 254 (2008 229-238 that generalize a large number of other recently used quasi-biological cancer models. CONCLUSIONS/SIGNIFICANCE: We have shown that the previously developed concepts of parameter local identifiability and redundancy are closely related to the apparently weaker properties of weak local identifiability and gradient weak local identifiability--within the widely used exponential family these concepts largely coincide.

  7. Modeling oil production based on symbolic regression

    International Nuclear Information System (INIS)

    Yang, Guangfei; Li, Xianneng; Wang, Jianliang; Lian, Lian; Ma, Tieju

    2015-01-01

    Numerous models have been proposed to forecast the future trends of oil production and almost all of them are based on some predefined assumptions with various uncertainties. In this study, we propose a novel data-driven approach that uses symbolic regression to model oil production. We validate our approach on both synthetic and real data, and the results prove that symbolic regression could effectively identify the true models beneath the oil production data and also make reliable predictions. Symbolic regression indicates that world oil production will peak in 2021, which broadly agrees with other techniques used by researchers. Our results also show that the rate of decline after the peak is almost half the rate of increase before the peak, and it takes nearly 12 years to drop 4% from the peak. These predictions are more optimistic than those in several other reports, and the smoother decline will provide the world, especially the developing countries, with more time to orchestrate mitigation plans. -- Highlights: •A data-driven approach has been shown to be effective at modeling the oil production. •The Hubbert model could be discovered automatically from data. •The peak of world oil production is predicted to appear in 2021. •The decline rate after peak is half of the increase rate before peak. •Oil production projected to decline 4% post-peak

  8. Differential Effects for Sexual Risk Behavior: An Application of Finite Mixture Regression

    OpenAIRE

    Lanza, Stephanie T.; Kugler, Kari C.; Mathur, Charu

    2011-01-01

    Understanding the multiple factors that place individuals at risk for sexual risk behavior is critical for developing effective intervention programs. Regression-based methods are commonly used to estimate the average effects of risk factors, however such results can be difficult to translate to prevention implications at the individual level. Although differential effects can be examined to some extent by including interaction terms, as risk factors and moderators are added to the model inte...

  9. Understanding logistic regression analysis

    OpenAIRE

    Sperandei, Sandro

    2014-01-01

    Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using ex...

  10. Frequent hospital admissions in Singapore: clinical risk factors and impact of socioeconomic status.

    Science.gov (United States)

    Low, Lian Leng; Tay, Wei Yi; Ng, Matthew Joo Ming; Tan, Shu Yun; Liu, Nan; Lee, Kheng Hock

    2018-01-01

    Frequent admitters to hospitals are high-cost patients who strain finite healthcare resources. However, the exact risk factors for frequent admissions, which can be used to guide risk stratification and design effective interventions locally, remain unknown. Our study aimed to identify the clinical and sociodemographic risk factors associated with frequent hospital admissions in Singapore. An observational study was conducted using retrospective 2014 data from the administrative database at Singapore General Hospital, Singapore. Variables were identified a priori and included patient demographics, comorbidities, prior healthcare utilisation, and clinical and laboratory variables during the index admission. Multivariate logistic regression analysis was used to identify independent risk factors for frequent admissions. A total of 16,306 unique patients were analysed and 1,640 (10.1%) patients were classified as frequent admitters. On multivariate logistic regression, 16 variables were independently associated with frequent hospital admissions, including age, cerebrovascular disease, history of malignancy, haemoglobin, serum creatinine, serum albumin, and number of specialist outpatient clinic visits, emergency department visits, admissions preceding index admission and medications dispensed at discharge. Patients staying in public rental housing had a 30% higher risk of being a frequent admitter after adjusting for demographics and clinical conditions. Our study, the first in our knowledge to examine the clinical risk factors for frequent admissions in Singapore, validated the use of public rental housing as a sensitive indicator of area-level socioeconomic status in Singapore. These risk factors can be used to identify high-risk patients in the hospital so that they can receive interventions that reduce readmission risk. Copyright: © Singapore Medical Association

  11. Study of depression influencing factors with zero-inflated regression models in a large-scale population survey

    OpenAIRE

    Xu, Tao; Zhu, Guangjin; Han, Shaomei

    2017-01-01

    Objectives The number of depression symptoms can be considered as count data in order to get complete and accurate analyses findings in studies of depression. This study aims to compare the goodness of fit of four count outcomes models by a large survey sample to identify the optimum model for a risk factor study of the number of depression symptoms. Methods 15 820 subjects, aged 10 to 80 years old, who were not suffering from serious chronic diseases and had not run a high fever in the past ...

  12. Use of multiple linear regression and logistic regression models to investigate changes in birthweight for term singleton infants in Scotland.

    Science.gov (United States)

    Bonellie, Sandra R

    2012-10-01

    To illustrate the use of regression and logistic regression models to investigate changes over time in size of babies particularly in relation to social deprivation, age of the mother and smoking. Mean birthweight has been found to be increasing in many countries in recent years, but there are still a group of babies who are born with low birthweights. Population-based retrospective cohort study. Multiple linear regression and logistic regression models are used to analyse data on term 'singleton births' from Scottish hospitals between 1994-2003. Mothers who smoke are shown to give birth to lighter babies on average, a difference of approximately 0.57 Standard deviations lower (95% confidence interval. 0.55-0.58) when adjusted for sex and parity. These mothers are also more likely to have babies that are low birthweight (odds ratio 3.46, 95% confidence interval 3.30-3.63) compared with non-smokers. Low birthweight is 30% more likely where the mother lives in the most deprived areas compared with the least deprived, (odds ratio 1.30, 95% confidence interval 1.21-1.40). Smoking during pregnancy is shown to have a detrimental effect on the size of infants at birth. This effect explains some, though not all, of the observed socioeconomic birthweight. It also explains much of the observed birthweight differences by the age of the mother.   Identifying mothers at greater risk of having a low birthweight baby as important implications for the care and advice this group receives. © 2012 Blackwell Publishing Ltd.

  13. Resident Workflow and Psychiatric Emergency Consultation: Identifying Factors for Quality Improvement in a Training Environment.

    Science.gov (United States)

    Blair, Thomas; Wiener, Zev; Seroussi, Ariel; Tang, Lingqi; O'Hora, Jennifer; Cheung, Erick

    2017-06-01

    Quality improvement to optimize workflow has the potential to mitigate resident burnout and enhance patient care. This study applied mixed methods to identify factors that enhance or impede workflow for residents performing emergency psychiatric consultations. The study population consisted of all psychiatry program residents (55 eligible, 42 participating) at the Semel Institute for Neuroscience and Human Behavior, University of California, Los Angeles. The authors developed a survey through iterative piloting, surveyed all residents, and then conducted a focus group. The survey included elements hypothesized to enhance or impede workflow, and measures pertaining to self-rated efficiency and stress. Distributional and bivariate analyses were performed. Survey findings were clarified in focus group discussion. This study identified several factors subjectively associated with enhanced or impeded workflow, including difficulty with documentation, the value of personal organization systems, and struggles to communicate with patients' families. Implications for resident education are discussed.

  14. Spatial vulnerability assessments by regression kriging

    Science.gov (United States)

    Pásztor, László; Laborczi, Annamária; Takács, Katalin; Szatmári, Gábor

    2016-04-01

    information representing IEW or GRP forming environmental factors were taken into account to support the spatial inference of the locally experienced IEW frequency and measured GRP values respectively. An efficient spatial prediction methodology was applied to construct reliable maps, namely regression kriging (RK) using spatially exhaustive auxiliary data on soil, geology, topography, land use and climate. RK divides the spatial inference into two parts. Firstly the deterministic component of the target variable is determined by a regression model. The residuals of the multiple linear regression analysis represent the spatially varying but dependent stochastic component, which are interpolated by kriging. The final map is the sum of the two component predictions. Application of RK also provides the possibility of inherent accuracy assessment. The resulting maps are characterized by global and local measures of its accuracy. Additionally the method enables interval estimation for spatial extension of the areas of predefined risk categories. All of these outputs provide useful contribution to spatial planning, action planning and decision making. Acknowledgement: Our work was partly supported by the Hungarian National Scientific Research Foundation (OTKA, Grant No. K105167).

  15. Linear regression in astronomy. II

    Science.gov (United States)

    Feigelson, Eric D.; Babu, Gutti J.

    1992-01-01

    A wide variety of least-squares linear regression procedures used in observational astronomy, particularly investigations of the cosmic distance scale, are presented and discussed. The classes of linear models considered are (1) unweighted regression lines, with bootstrap and jackknife resampling; (2) regression solutions when measurement error, in one or both variables, dominates the scatter; (3) methods to apply a calibration line to new data; (4) truncated regression models, which apply to flux-limited data sets; and (5) censored regression models, which apply when nondetections are present. For the calibration problem we develop two new procedures: a formula for the intercept offset between two parallel data sets, which propagates slope errors from one regression to the other; and a generalization of the Working-Hotelling confidence bands to nonstandard least-squares lines. They can provide improved error analysis for Faber-Jackson, Tully-Fisher, and similar cosmic distance scale relations.

  16. ANALYSIS OF THE FINANCIAL PERFORMANCES OF THE FIRM, BY USING THE MULTIPLE REGRESSION MODEL

    Directory of Open Access Journals (Sweden)

    Constantin Anghelache

    2011-11-01

    Full Text Available The information achieved through the use of simple linear regression are not always enough to characterize the evolution of an economic phenomenon and, furthermore, to identify its possible future evolution. To remedy these drawbacks, the special literature includes multiple regression models, in which the evolution of the dependant variable is defined depending on two or more factorial variables.

  17. Household Factors Associated with Self-Harm in Johannesburg, South African Urban-Poor Households.

    Directory of Open Access Journals (Sweden)

    Nisha Naicker

    Full Text Available Low and middle income countries bear the majority burden of self-harm, yet there is a paucity of evidence detailing risk-factors for self-harm in these populations. This study aims to identify environmental, socio-economic and demographic household-level risk factors for self-harm in five impoverished urban communities in Johannesburg, South Africa.Annual serial cross-sectional surveys were undertaken in five impoverished urban communities in Johannesburg for the Health, Environment and Development (HEAD study. Logistic regression analysis using the HEAD study data (2006-2011 was conducted to identify household-level risk factors associated with self-harm (defined as a self-reported case of a fatal or non-fatal suicide attempt within the household during the preceding year. Stepwise multivariate logistic regression analysis was employed to identify factors associated with self-harm.A total of 2 795 household interviews were conducted from 2006 to 2011. There was no significant trend in self-harm over time. Results from the final model showed that self-harm was significantly associated with households exposed to a violent crime during the past year (Adjusted Odds Ratio (AOR 5.72; 95% CI 1.64-19.97; that have a member suffering from a chronic medical condition (AOR 8.95; 95% 2.39-33.56 and households exposed to indoor smoking (AOR 4.39; CI 95% 1.14-16.47.This study provides evidence on household risk factors of self-harm in settings of urban poverty and has highlighted the potential for a more cost-effective approach to identifying those at risk of self-harm based on household level factors.

  18. Linear regression and sensitivity analysis in nuclear reactor design

    International Nuclear Information System (INIS)

    Kumar, Akansha; Tsvetkov, Pavel V.; McClarren, Ryan G.

    2015-01-01

    Highlights: • Presented a benchmark for the applicability of linear regression to complex systems. • Applied linear regression to a nuclear reactor power system. • Performed neutronics, thermal–hydraulics, and energy conversion using Brayton’s cycle for the design of a GCFBR. • Performed detailed sensitivity analysis to a set of parameters in a nuclear reactor power system. • Modeled and developed reactor design using MCNP, regression using R, and thermal–hydraulics in Java. - Abstract: The paper presents a general strategy applicable for sensitivity analysis (SA), and uncertainity quantification analysis (UA) of parameters related to a nuclear reactor design. This work also validates the use of linear regression (LR) for predictive analysis in a nuclear reactor design. The analysis helps to determine the parameters on which a LR model can be fit for predictive analysis. For those parameters, a regression surface is created based on trial data and predictions are made using this surface. A general strategy of SA to determine and identify the influential parameters those affect the operation of the reactor is mentioned. Identification of design parameters and validation of linearity assumption for the application of LR of reactor design based on a set of tests is performed. The testing methods used to determine the behavior of the parameters can be used as a general strategy for UA, and SA of nuclear reactor models, and thermal hydraulics calculations. A design of a gas cooled fast breeder reactor (GCFBR), with thermal–hydraulics, and energy transfer has been used for the demonstration of this method. MCNP6 is used to simulate the GCFBR design, and perform the necessary criticality calculations. Java is used to build and run input samples, and to extract data from the output files of MCNP6, and R is used to perform regression analysis and other multivariate variance, and analysis of the collinearity of data

  19. Detection of Differential Item Functioning with Nonlinear Regression: A Non-IRT Approach Accounting for Guessing

    Czech Academy of Sciences Publication Activity Database

    Drabinová, Adéla; Martinková, Patrícia

    2017-01-01

    Roč. 54, č. 4 (2017), s. 498-517 ISSN 0022-0655 R&D Projects: GA ČR GJ15-15856Y Institutional support: RVO:67985807 Keywords : differential item functioning * non-linear regression * logistic regression * item response theory Subject RIV: AM - Education OBOR OECD: Statistics and probability Impact factor: 0.979, year: 2016

  20. Application of range-test in multiple linear regression analysis in ...

    African Journals Online (AJOL)

    Application of range-test in multiple linear regression analysis in the presence of outliers is studied in this paper. First, the plot of the explanatory variables (i.e. Administration, Social/Commercial, Economic services and Transfer) on the dependent variable (i.e. GDP) was done to identify the statistical trend over the years.

  1. Control principles of confounders in ecological comparative studies: standardization and regressive modelss

    Directory of Open Access Journals (Sweden)

    Varaksin Anatoly

    2014-03-01

    Full Text Available The methods of the analysis of research data including the concomitant variables (confounders associated with both the response and the current factor are considered. There are two usual ways to take into account such variables: the first, at the stage of planning the experiment and the second, in analyzing the received data. Despite the equal effectiveness of these approaches, there exists strong reason to restrict the usage of regression method to accounting for confounders by ANCOVA. Authors consider the standardization by stratification as a reliable method to account for the effect of confounding factors as opposed to the widely-implemented application of logistic regression and the covariance analysis. The program for the automation of standardization procedure is proposed, it is available at the site of the Institute of Industrial Ecology.

  2. Analyzing musculoskeletal neck pain, measured as present pain and periods of pain, with three different regression models: a cohort study

    Directory of Open Access Journals (Sweden)

    Hagberg Mats

    2009-06-01

    Full Text Available Abstract Background In the literature there are discussions on the choice of outcome and the need for more longitudinal studies of musculoskeletal disorders. The general aim of this longitudinal study was to analyze musculoskeletal neck pain, in a group of young adults. Specific aims were to determine whether psychosocial factors, computer use, high work/study demands, and lifestyle are long-term or short-term factors for musculoskeletal neck pain, and whether these factors are important for developing or ongoing musculoskeletal neck pain. Methods Three regression models were used to analyze the different outcomes. Pain at present was analyzed with a marginal logistic model, for number of years with pain a Poisson regression model was used and for developing and ongoing pain a logistic model was used. Presented results are odds ratios and proportion ratios (logistic models and rate ratios (Poisson model. The material consisted of web-based questionnaires answered by 1204 Swedish university students from a prospective cohort recruited in 2002. Results Perceived stress was a risk factor for pain at present (PR = 1.6, for developing pain (PR = 1.7 and for number of years with pain (RR = 1.3. High work/study demands was associated with pain at present (PR = 1.6; and with number of years with pain when the demands negatively affect home life (RR = 1.3. Computer use pattern (number of times/week with a computer session ≥ 4 h, without break was a risk factor for developing pain (PR = 1.7, but also associated with pain at present (PR = 1.4 and number of years with pain (RR = 1.2. Among life style factors smoking (PR = 1.8 was found to be associated to pain at present. The difference between men and women in prevalence of musculoskeletal pain was confirmed in this study. It was smallest for the outcome ongoing pain (PR = 1.4 compared to pain at present (PR = 2.4 and developing pain (PR = 2.5. Conclusion By using different regression models different

  3. Using a Systematic Approach to Identifying Organizational Factors in Root Cause Analysis

    International Nuclear Information System (INIS)

    Gallogly, Kay Wilde

    2011-01-01

    This presentation set the scene for the second discussion session. In her presentation, the author observed that: - Investigators do not see the connection between the analysis tools available and the identification of HOF. Most investigators use the tools in a cursory manner and so do not derive the full benefits of the tools. Some tools are used for presentation purposes as opposed to being used for analytical purposes e.g. event and causal factors charts. In some cases, the report will indicate that specific analytical tools were used in the investigation but the analysis is not in the body of the report. - Some investigators are documenting HOF causes but do not recognize them as such. This indicates a lack of understanding of HOF. - Others investigators focus on technical issues because of their own comfort level. - The culture of the Organisation will affect the depth of the investigation and therefore the use of the analytical tools to pursue HOF issues. - The author contends that if analysis tools are applied systematically to gather factually based data, then HOF issues can be identified. The use of factual information (without judgement and subjectivity) is important to maintain the credibility of the investigation especially when HOF issues are identified. - Systematic use of tools assists in better communication of the issues to foster greater understanding and acceptance by senior management. - Barrier Analysis, Change Analysis, and TWIN (Task Demands, Work Environment, Individual Capabilities, and Human Nature) all offer the opportunity to identify HOF issues if the analyst pursues this line of investigation. It was illustrated that many elements of the TWIN Error Precursors are themselves Organisational in nature. - The TWIN model applied to the Anatomy of an Event will help to distinguish those which are Organisational issues (Latent Organisational Weaknesses, Error Precursors and Flawed Defences) and those which are human factors (Active Errors

  4. Perioperative factors predicting poor outcome in elderly patients following emergency general surgery: a multivariate regression analysis

    Science.gov (United States)

    Lees, Mackenzie C.; Merani, Shaheed; Tauh, Keerit; Khadaroo, Rachel G.

    2015-01-01

    Background Older adults (≥ 65 yr) are the fastest growing population and are presenting in increasing numbers for acute surgical care. Emergency surgery is frequently life threatening for older patients. Our objective was to identify predictors of mortality and poor outcome among elderly patients undergoing emergency general surgery. Methods We conducted a retrospective cohort study of patients aged 65–80 years undergoing emergency general surgery between 2009 and 2010 at a tertiary care centre. Demographics, comorbidities, in-hospital complications, mortality and disposition characteristics of patients were collected. Logistic regression analysis was used to identify covariate-adjusted predictors of in-hospital mortality and discharge of patients home. Results Our analysis included 257 patients with a mean age of 72 years; 52% were men. In-hospital mortality was 12%. Mortality was associated with patients who had higher American Society of Anesthesiologists (ASA) class (odds ratio [OR] 3.85, 95% confidence interval [CI] 1.43–10.33, p = 0.008) and in-hospital complications (OR 1.93, 95% CI 1.32–2.83, p = 0.001). Nearly two-thirds of patients discharged home were younger (OR 0.92, 95% CI 0.85–0.99, p = 0.036), had lower ASA class (OR 0.45, 95% CI 0.27–0.74, p = 0.002) and fewer in-hospital complications (OR 0.69, 95% CI 0.53–0.90, p = 0.007). Conclusion American Society of Anesthesiologists class and in-hospital complications are perioperative predictors of mortality and disposition in the older surgical population. Understanding the predictors of poor outcome and the importance of preventing in-hospital complications in older patients will have important clinical utility in terms of preoperative counselling, improving health care and discharging patients home. PMID:26204143

  5. Quantile regression theory and applications

    CERN Document Server

    Davino, Cristina; Vistocco, Domenico

    2013-01-01

    A guide to the implementation and interpretation of Quantile Regression models This book explores the theory and numerous applications of quantile regression, offering empirical data analysis as well as the software tools to implement the methods. The main focus of this book is to provide the reader with a comprehensivedescription of the main issues concerning quantile regression; these include basic modeling, geometrical interpretation, estimation and inference for quantile regression, as well as issues on validity of the model, diagnostic tools. Each methodological aspect is explored and

  6. Fungible weights in logistic regression.

    Science.gov (United States)

    Jones, Jeff A; Waller, Niels G

    2016-06-01

    In this article we develop methods for assessing parameter sensitivity in logistic regression models. To set the stage for this work, we first review Waller's (2008) equations for computing fungible weights in linear regression. Next, we describe 2 methods for computing fungible weights in logistic regression. To demonstrate the utility of these methods, we compute fungible logistic regression weights using data from the Centers for Disease Control and Prevention's (2010) Youth Risk Behavior Surveillance Survey, and we illustrate how these alternate weights can be used to evaluate parameter sensitivity. To make our work accessible to the research community, we provide R code (R Core Team, 2015) that will generate both kinds of fungible logistic regression weights. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  7. Penalized regression procedures for variable selection in the potential outcomes framework.

    Science.gov (United States)

    Ghosh, Debashis; Zhu, Yeying; Coffman, Donna L

    2015-05-10

    A recent topic of much interest in causal inference is model selection. In this article, we describe a framework in which to consider penalized regression approaches to variable selection for causal effects. The framework leads to a simple 'impute, then select' class of procedures that is agnostic to the type of imputation algorithm as well as penalized regression used. It also clarifies how model selection involves a multivariate regression model for causal inference problems and that these methods can be applied for identifying subgroups in which treatment effects are homogeneous. Analogies and links with the literature on machine learning methods, missing data, and imputation are drawn. A difference least absolute shrinkage and selection operator algorithm is defined, along with its multiple imputation analogs. The procedures are illustrated using a well-known right-heart catheterization dataset. Copyright © 2015 John Wiley & Sons, Ltd.

  8. Identifying Critical Factors in the Eco-Efficiency of Remanufacturing Based on the Fuzzy DEMATEL Method

    Directory of Open Access Journals (Sweden)

    Qianwang Deng

    2015-11-01

    Full Text Available Remanufacturing can bring considerable economic and environmental benefits such as cost saving, conservation of energy and resources, and reduction of emissions. With the increasing awareness of sustainable manufacturing, remanufacturing gradually becomes the research priority. Most studies concentrate on the analysis of influencing factors, or the evaluation of the economic and environmental performance in remanufacturing, while little effort has been devoted to investigating the critical factors influencing the eco-efficiency of remanufacturing. Considering the current development of the remanufacturing industry in China, this paper proposes a set of factors influencing the eco-efficiency of remanufacturing and then utilizes a fuzzy Decision Making Trial and Evaluation Laboratory (DEMATEL method to establish relation matrixes reflecting the interdependent relationships among these factors. Finally, the contributions of each factor to eco-efficiency and mutual influence values among them are obtained, and critical factors in eco-efficiency of remanufacturing are identified. The results of the present work can provide theoretical supports for the government to make appropriate policies to improve the eco-efficiency of remanufacturing.

  9. STUDY OF IDENTIFYING AND PRIORITIZING THE AFFECTING FACTORS ON BANK BRAND CUSTOMER LOYALTY

    OpenAIRE

    Zahra Aliyari; Yosef Beygzadeh

    2017-01-01

    Today, customer loyalty is the key to business success. By increased customers’ loyalty, market share and profitability level of enterprises will rise. Market perception along with planning and adopting appropriate strategies for making customers loyal and enhancing their rate of loyalty leads to long-term benefits for the enterprises. Given the importance of the issue, the goal of this study was to identify and prioritize the factors affecting loyalty to a banking brand from perspective of K...

  10. Genome-wide CRISPR/Cas9 Screen Identifies Host Factors Essential for Influenza Virus Replication

    Directory of Open Access Journals (Sweden)

    Julianna Han

    2018-04-01

    Full Text Available Summary: The emergence of influenza A viruses (IAVs from zoonotic reservoirs poses a great threat to human health. As seasonal vaccines are ineffective against zoonotic strains, and newly transmitted viruses can quickly acquire drug resistance, there remains a need for host-directed therapeutics against IAVs. Here, we performed a genome-scale CRISPR/Cas9 knockout screen in human lung epithelial cells with a human isolate of an avian H5N1 strain. Several genes involved in sialic acid biosynthesis and related glycosylation pathways were highly enriched post-H5N1 selection, including SLC35A1, a sialic acid transporter essential for IAV receptor expression and thus viral entry. Importantly, we have identified capicua (CIC as a negative regulator of cell-intrinsic immunity, as loss of CIC resulted in heightened antiviral responses and restricted replication of multiple viruses. Therefore, our study demonstrates that the CRISPR/Cas9 system can be utilized for the discovery of host factors critical for the replication of intracellular pathogens. : Using a genome-wide CRISPR/Cas9 screen, Han et al. demonstrate that the major hit, the sialic acid transporter SLC35A1, is an essential host factor for IAV entry. In addition, they identify the DNA-binding transcriptional repressor CIC as a negative regulator of cell-intrinsic immunity. Keywords: CRISPR/Cas9 screen, GeCKO, influenza virus, host factors, sialic acid pathway, SLC35A1, Capicua, CIC, cell-intrinsic immunity, H5N1

  11. Semiparametric Allelic Tests for Mapping Multiple Phenotypes: Binomial Regression and Mahalanobis Distance.

    Science.gov (United States)

    Majumdar, Arunabha; Witte, John S; Ghosh, Saurabh

    2015-12-01

    Binary phenotypes commonly arise due to multiple underlying quantitative precursors and genetic variants may impact multiple traits in a pleiotropic manner. Hence, simultaneously analyzing such correlated traits may be more powerful than analyzing individual traits. Various genotype-level methods, e.g., MultiPhen (O'Reilly et al. []), have been developed to identify genetic factors underlying a multivariate phenotype. For univariate phenotypes, the usefulness and applicability of allele-level tests have been investigated. The test of allele frequency difference among cases and controls is commonly used for mapping case-control association. However, allelic methods for multivariate association mapping have not been studied much. In this article, we explore two allelic tests of multivariate association: one using a Binomial regression model based on inverted regression of genotype on phenotype (Binomial regression-based Association of Multivariate Phenotypes [BAMP]), and the other employing the Mahalanobis distance between two sample means of the multivariate phenotype vector for two alleles at a single-nucleotide polymorphism (Distance-based Association of Multivariate Phenotypes [DAMP]). These methods can incorporate both discrete and continuous phenotypes. Some theoretical properties for BAMP are studied. Using simulations, the power of the methods for detecting multivariate association is compared with the genotype-level test MultiPhen's. The allelic tests yield marginally higher power than MultiPhen for multivariate phenotypes. For one/two binary traits under recessive mode of inheritance, allelic tests are found to be substantially more powerful. All three tests are applied to two different real data and the results offer some support for the simulation study. We propose a hybrid approach for testing multivariate association that implements MultiPhen when Hardy-Weinberg Equilibrium (HWE) is violated and BAMP otherwise, because the allelic approaches assume HWE

  12. A Two-Step Method to Identify Positive Deviant Physician Organizations of Accountable Care Organizations with Robust Performance Management Systems.

    Science.gov (United States)

    Pimperl, Alexander F; Rodriguez, Hector P; Schmittdiel, Julie A; Shortell, Stephen M

    2018-06-01

    To identify positive deviant (PD) physician organizations of Accountable Care Organizations (ACOs) with robust performance management systems (PMSYS). Third National Survey of Physician Organizations (NSPO3, n = 1,398). Organizational and external factors from NSPO3 were analyzed. Linear regression estimated the association of internal and contextual factors on PMSYS. Two cutpoints (75th/90th percentiles) identified PDs with the largest residuals and highest PMSYS scores. A total of 65 and 41 PDs were identified using 75th and 90th percentiles cutpoints, respectively. The 90th percentile more strongly differentiated PDs from non-PDs. Having a high proportion of vulnerable patients appears to constrain PMSYS development. Our PD identification method increases the likelihood that PD organizations selected for in-depth inquiry are high-performing organizations that exceed expectations. © Health Research and Educational Trust.

  13. Factors related to treatment refusal in Taiwanese cancer patients.

    Science.gov (United States)

    Chiang, Ting-Yu; Wang, Chao-Hui; Lin, Yu-Fen; Chou, Shu-Lan; Wang, Ching-Ting; Juang, Hsiao-Ting; Lin, Yung-Chang; Lin, Mei-Hsiang

    2015-01-01

    Incidence and mortality rates for cancer have increased dramatically in the recent 30 years in Taiwan. However, not all patients receive treatment. Treatment refusal might impair patient survival and life quality. In order to improve this situation, we proposed this study to evaluate factors that are related to refusal of treatment in cancer patients via a cancer case manager system. This study analysed data from a case management system during the period from 2010 to 2012 at a medical center in Northern Taiwan. We enrolled a total of 14,974 patients who were diagnosed with cancer. Using the PRECEDE Model as a framework, we conducted logistic regression analysis to identify independent variables that are significantly associated with refusal of therapy in cancer patients. A multivariate logistic regression model was also applied to estimate adjusted the odds ratios (ORs) with 95% confidence intervals (95%CI). A total of 253 patients (1.69%) refused treatment. The multivariate logistic regression result showed that the high risk factors for refusal of treatment in cancer patient included: concerns about adverse effects (prefuse treatment have poor survival. The present study provides evidence of factors that are related to refusal of therapy and might be helpful for further application and improvement of cancer care.

  14. Tumour Progression and Spontaneous Regression in the Lewis Rat Sarcoma Model

    Czech Academy of Sciences Publication Activity Database

    Kovalská, Jana; Mishra, Rajbardhan; Jebavý, L.; Makovický, P.; Janda, Jozef; Plánská, D.; Červinková, Monika; Horák, Vratislav

    2015-01-01

    Roč. 35, č. 12 (2015), s. 6539-6549 ISSN 0250-7005 R&D Projects: GA MŠk ED2.1.00/03.0124 Institutional support: RVO:67985904 Keywords : spontaneous regression * progression * sarcoma Subject RIV: FD - Oncology ; Hematology Impact factor: 1.895, year: 2015

  15. Risk factors for syphilis in women: case-control study

    Directory of Open Access Journals (Sweden)

    Vilma Costa de Macêdo

    2017-08-01

    Full Text Available ABSTRACT OBJECTIVE To determine the sociodemographic, behavioral, and health care factors related to the occurrence of syphilis in women treated at public maternity hospitals. METHODS This is a case-control study (239 cases and 322 controls with women admitted to seven maternity hospitals in the municipality of Recife, Brazil, from July 2013 to July 2014. Eligible women were recruited after the result of the VDRL (Venereal Disease Research Laboratory under any titration. The selection of cases and controls was based on the result of the serology for syphilis using ELISA (enzyme-linked immunosorbent assay. The independent variables were grouped into: sociodemographic, behavioral, clinical and obstetric history, and health care in prenatal care and maternity hospital. Information was obtained by interview, during hospitalization, with the application of a questionnaire. Odds ratios and 95% confidence intervals were estimated using logistic regression to identify the predicting factors of the variable to be explained. RESULTS The logistic regression analysis identified as determinant factors for gestational syphilis: education level of incomplete basic education or illiterate (OR = 2.02, lack of access to telephone (OR = 2.4, catholic religion (OR = 1.70 , four or more pregnancies (OR = 2.2, three or more sexual partners in the last year (OR = 3.1, use of illicit drugs before the age of 18 (OR = 3.0, and use of illicit drugs by the current partner (OR = 1.7. Only one to three prenatal appointments (OR = 3.5 and a previous history of sexually transmitted infection (OR = 9.7 were also identified as determinant factors. CONCLUSIONS Sociodemographic, behavioral, and health care factors are associated with the occurrence of syphilis in women and should be taken into account in the elaboration of universal strategies aimed at the prevention and control of syphilis, but with a focus on situations of greater vulnerability.

  16. Risk factors for syphilis in women: case-control study

    Science.gov (United States)

    de Macêdo, Vilma Costa; de Lira, Pedro Israel Cabral; de Frias, Paulo Germano; Romaguera, Luciana Maria Delgado; Caires, Silvana de Fátima Ferreira; Ximenes, Ricardo Arraes de Alencar

    2017-01-01

    ABSTRACT OBJECTIVE To determine the sociodemographic, behavioral, and health care factors related to the occurrence of syphilis in women treated at public maternity hospitals. METHODS This is a case-control study (239 cases and 322 controls) with women admitted to seven maternity hospitals in the municipality of Recife, Brazil, from July 2013 to July 2014. Eligible women were recruited after the result of the VDRL (Venereal Disease Research Laboratory) under any titration. The selection of cases and controls was based on the result of the serology for syphilis using ELISA (enzyme-linked immunosorbent assay). The independent variables were grouped into: sociodemographic, behavioral, clinical and obstetric history, and health care in prenatal care and maternity hospital. Information was obtained by interview, during hospitalization, with the application of a questionnaire. Odds ratios and 95% confidence intervals were estimated using logistic regression to identify the predicting factors of the variable to be explained. RESULTS The logistic regression analysis identified as determinant factors for gestational syphilis: education level of incomplete basic education or illiterate (OR = 2.02), lack of access to telephone (OR = 2.4), catholic religion (OR = 1.70 ), four or more pregnancies (OR = 2.2), three or more sexual partners in the last year (OR = 3.1), use of illicit drugs before the age of 18 (OR = 3.0), and use of illicit drugs by the current partner (OR = 1.7). Only one to three prenatal appointments (OR = 3.5) and a previous history of sexually transmitted infection (OR = 9.7) were also identified as determinant factors. CONCLUSIONS Sociodemographic, behavioral, and health care factors are associated with the occurrence of syphilis in women and should be taken into account in the elaboration of universal strategies aimed at the prevention and control of syphilis, but with a focus on situations of greater vulnerability. PMID:28832758

  17. Understanding the Impact of School Factors on School Counselor Burnout: A Mixed-Methods Study

    Science.gov (United States)

    Bardhoshi, Gerta; Schweinle, Amy; Duncan, Kelly

    2014-01-01

    This mixed-methods study investigated the relationship between burnout and performing noncounseling duties among a national sample of professional school counselors, while identifying school factors that could attenuate this relationship. Results of regression analyses indicate that performing noncounseling duties significantly predicted burnout…

  18. Risky Sexual Behavior among Rural Female Adolescents in Malaysia: A Limited Role of Protective Factors

    OpenAIRE

    Ahmadian, Maryam; Hamsan, Hanina H.; Abdullah, Haslinda; Samah, Asnarulkhadi Abu; Noor, Amna Md

    2014-01-01

    Purpose: This paper presents the findings of a cross-sectional survey on the risk and protective factors of premarital sexual behavior among rural female adolescents in Peninsular Malaysia. Methods: We investigated data on 770 female respondents aged 13-17 years in rural areas to identify predictive factors for premarital sexual intercourse. Data were analyzed using bivariate and multivariate regression. Specific socio-demographic factors, psychological and family domains, peer delinquency, a...

  19. Factoral analysis of the cost of preparing oil

    Energy Technology Data Exchange (ETDEWEB)

    Avdeyeva, L A; Kudoyarov, G Sh; Shmatova, M F

    1979-01-01

    Mathematical statistics methods (basically correlational and regression analysis) are used to study the factors which form the level of cost of preparing oil with consideration of the mutual influence of the factors. Selected as the claims for inclusion into a mathematical model was a group of five a priori justified factors: the water level of the oil being extracted (%); the specific expenditure of deemulsifiers; the volume of oil preparation; the quality of oil preparation (the salt content) and the level of use of the installations' capacities (%). To construct an economic and mathematical model of the cost of the technical preparation (SPP) of the oil, all the unions which make up the Ministry of the Oil Industry were divided into two comparable totalities. The first group included unions in which the oil SPP was lower than the branch average and the second, unions in which the SPP was higher than the branch wide cost. Using the coefficients of regression, special elasticity coefficients and the fluctuation indicators, the basic factors were finally identified which have the greatest influence on the formation of the oil SPP level separately for the first and second groups of unions.

  20. Nonlinear Trimodal Regression Analysis of Radiodensitometric Distributions to Quantify Sarcopenic and Sequelae Muscle Degeneration

    Science.gov (United States)

    Árnadóttir, Í.; Gíslason, M. K.; Carraro, U.

    2016-01-01

    Muscle degeneration has been consistently identified as an independent risk factor for high mortality in both aging populations and individuals suffering from neuromuscular pathology or injury. While there is much extant literature on its quantification and correlation to comorbidities, a quantitative gold standard for analyses in this regard remains undefined. Herein, we hypothesize that rigorously quantifying entire radiodensitometric distributions elicits more muscle quality information than average values reported in extant methods. This study reports the development and utility of a nonlinear trimodal regression analysis method utilized on radiodensitometric distributions of upper leg muscles from CT scans of a healthy young adult, a healthy elderly subject, and a spinal cord injury patient. The method was then employed with a THA cohort to assess pre- and postsurgical differences in their healthy and operative legs. Results from the initial representative models elicited high degrees of correlation to HU distributions, and regression parameters highlighted physiologically evident differences between subjects. Furthermore, results from the THA cohort echoed physiological justification and indicated significant improvements in muscle quality in both legs following surgery. Altogether, these results highlight the utility of novel parameters from entire HU distributions that could provide insight into the optimal quantification of muscle degeneration. PMID:28115982

  1. Nonlinear Trimodal Regression Analysis of Radiodensitometric Distributions to Quantify Sarcopenic and Sequelae Muscle Degeneration

    Directory of Open Access Journals (Sweden)

    K. J. Edmunds

    2016-01-01

    Full Text Available Muscle degeneration has been consistently identified as an independent risk factor for high mortality in both aging populations and individuals suffering from neuromuscular pathology or injury. While there is much extant literature on its quantification and correlation to comorbidities, a quantitative gold standard for analyses in this regard remains undefined. Herein, we hypothesize that rigorously quantifying entire radiodensitometric distributions elicits more muscle quality information than average values reported in extant methods. This study reports the development and utility of a nonlinear trimodal regression analysis method utilized on radiodensitometric distributions of upper leg muscles from CT scans of a healthy young adult, a healthy elderly subject, and a spinal cord injury patient. The method was then employed with a THA cohort to assess pre- and postsurgical differences in their healthy and operative legs. Results from the initial representative models elicited high degrees of correlation to HU distributions, and regression parameters highlighted physiologically evident differences between subjects. Furthermore, results from the THA cohort echoed physiological justification and indicated significant improvements in muscle quality in both legs following surgery. Altogether, these results highlight the utility of novel parameters from entire HU distributions that could provide insight into the optimal quantification of muscle degeneration.

  2. Imaging-Based Screen Identifies Laminin 411 as a Physiologically Relevant Niche Factor with Importance for i-Hep Applications

    Directory of Open Access Journals (Sweden)

    John Ong

    2018-03-01

    Full Text Available Summary: Use of hepatocytes derived from induced pluripotent stem cells (i-Heps is limited by their functional differences in comparison with primary cells. Extracellular niche factors likely play a critical role in bridging this gap. Using image-based characterization (high content analysis; HCA of freshly isolated hepatocytes from 17 human donors, we devised and validated an algorithm (Hepatocyte Likeness Index; HLI for comparing the hepatic properties of cells against a physiological gold standard. The HLI was then applied in a targeted screen of extracellular niche factors to identify substrates driving i-Heps closer to the standard. Laminin 411, the top hit, was validated in two additional induced pluripotent stem cell (iPSC lines, primary tissue, and an in vitro model of α1-antitrypsin deficiency. Cumulatively, these data provide a reference method to control and screen for i-Hep differentiation, identify Laminin 411 as a key niche protein, and underscore the importance of combining substrates, soluble factors, and HCA when developing iPSC applications. : Rashid and colleagues demonstrate the utility of a high-throughput imaging platform for identification of physiologically relevant extracellular niche factors to advance i-Heps closer to their primary tissue counterparts. The extracellular matrix (ECM protein screen identified Laminin 411 as an important niche factor facilitating i-Hep-based disease modeling in vitro. Keywords: iPS hepatocytes, extracellular niche, image-based screening, disease modeling, laminin

  3. Logistic regression models

    CERN Document Server

    Hilbe, Joseph M

    2009-01-01

    This book really does cover everything you ever wanted to know about logistic regression … with updates available on the author's website. Hilbe, a former national athletics champion, philosopher, and expert in astronomy, is a master at explaining statistical concepts and methods. Readers familiar with his other expository work will know what to expect-great clarity.The book provides considerable detail about all facets of logistic regression. No step of an argument is omitted so that the book will meet the needs of the reader who likes to see everything spelt out, while a person familiar with some of the topics has the option to skip "obvious" sections. The material has been thoroughly road-tested through classroom and web-based teaching. … The focus is on helping the reader to learn and understand logistic regression. The audience is not just students meeting the topic for the first time, but also experienced users. I believe the book really does meet the author's goal … .-Annette J. Dobson, Biometric...

  4. Multidimensional Discriminative Factors for Unprotected Sex Among Adolescents in Southern Taiwan

    Directory of Open Access Journals (Sweden)

    Cheng-Fang Yen

    2009-04-01

    Full Text Available Establishing the discriminative factors for unprotected sex among adolescents is essential for early identification of at-risk teens and for the prevention of unplanned pregnancy and sexually transmitted diseases. The aim of this study was to examine the discriminative effects of demographic, individual, family, peers, and school life factors on unprotected sex in a large-scale, representative adolescent population in Southern Taiwan. A total of 9,736 adolescent students were recruited into this study and completed the questionnaires. The multidimensional discriminative factors for unprotected sex were examined using χ2 automatic interaction detection analysis and logistic regression models. The results of the χ2 automatic interaction detection analysis revealed that having friends, using illicit drugs, being of an older age, suspension from school, and low family monitoring had discriminative effects on unprotected sex in adolescents. The logistic regression analysis further confirmed the discriminative effect of these factors. Because of the adverse effects of unprotected sex in adolescents, we suggest that parents and health professionals should pay attention to adolescents with the discriminative factors for unprotected sex identified in this study.

  5. Identifying risk factors of avian infectious diseases at household level in Poyang Lake region, China.

    Science.gov (United States)

    Jiang, Qian; Zhou, Jieting; Jiang, Zhiben; Xu, Bing

    2014-09-01

    Poultry kept in backyard farms are susceptible to acquiring and spreading infectious diseases because of free ranging and poor biosecurity measures. Since some of these diseases are zoonoses, this is also a significant health concern to breeders and their families. Backyard farms are common in rural regions of China. However, there is lack of knowledge of backyard poultry in the country. To obtain first-hand information of backyard poultry and identify risk factors of avian infectious diseases, a cross-sectional study was carried out at household level in rural regions around Poyang Lake. A door-to-door survey was conducted to collect data on husbandry practices, trading practices of backyard farmers, and surrounding environments of backyard farms. Farms were categorized into cases and controls based on their history of poultry death. Data were collected for 137 farms, and the association with occurrence of poultry death event was explored by chi-square tests. Results showed that vaccination implementation was a protective factor (odds ratio OR=0.40, 95% confidence interval CI: 0.20-0.80, p=0.01), while contact with other backyard flocks increased risk (OR=1.72, 95% CI: 0.79-3.74, p=0.16). A concept of "farm connectivity" characterized by the density of particular land-use types in the vicinity of the farm was proposed to characterize the degree of contact between poultry in one household farm and those in other household farms. It was found that housing density in a 20-m buffer zone of the farmhouse was most significantly associated with poultry death occurrence (OR=1.08, 95% CI: 1.02-1.17, p=0.03), and was in agreement with observation of villagers. Binary logistic regression was applied to evaluate the relationship between poultry death event and density of land-use types in all buffer zones. When integrated with vaccination implementation for poultry, prediction accuracy of poultry death event reached 72.0%. Results combining questionnaire survey with

  6. Identifying sources of atmospheric fine particles in Havana City using Positive Matrix Factorization technique

    International Nuclear Information System (INIS)

    Pinnera, I.; Perez, G.; Ramos, M.; Guibert, R.; Aldape, F.; Flores M, J.; Martinez, M.; Molina, E.; Fernandez, A.

    2011-01-01

    In previous study a set of samples of fine and coarse airborne particulate matter collected in a urban area of Havana City were analyzed by Particle-Induced X-ray Emission (PIXE) technique. The concentrations of 14 elements (S, Cl, K, Ca, Ti, V, Cr, Mn, Fe, Ni, Cu, Zn, Br and Pb) were consistently determined in both particle sizes. The analytical database provided by PIXE was statistically analyzed in order to determine the local pollution sources. The Positive Matrix Factorization (PMF) technique was applied to fine particle data in order to identify possible pollution sources. These sources were further verified by enrichment factor (EF) calculation. A general discussion about these results is presented in this work. (Author)

  7. Prediction, Regression and Critical Realism

    DEFF Research Database (Denmark)

    Næss, Petter

    2004-01-01

    This paper considers the possibility of prediction in land use planning, and the use of statistical research methods in analyses of relationships between urban form and travel behaviour. Influential writers within the tradition of critical realism reject the possibility of predicting social...... phenomena. This position is fundamentally problematic to public planning. Without at least some ability to predict the likely consequences of different proposals, the justification for public sector intervention into market mechanisms will be frail. Statistical methods like regression analyses are commonly...... seen as necessary in order to identify aggregate level effects of policy measures, but are questioned by many advocates of critical realist ontology. Using research into the relationship between urban structure and travel as an example, the paper discusses relevant research methods and the kinds...

  8. EPS-LASSO: Test for High-Dimensional Regression Under Extreme Phenotype Sampling of Continuous Traits.

    Science.gov (United States)

    Xu, Chao; Fang, Jian; Shen, Hui; Wang, Yu-Ping; Deng, Hong-Wen

    2018-01-25

    Extreme phenotype sampling (EPS) is a broadly-used design to identify candidate genetic factors contributing to the variation of quantitative traits. By enriching the signals in extreme phenotypic samples, EPS can boost the association power compared to random sampling. Most existing statistical methods for EPS examine the genetic factors individually, despite many quantitative traits have multiple genetic factors underlying their variation. It is desirable to model the joint effects of genetic factors, which may increase the power and identify novel quantitative trait loci under EPS. The joint analysis of genetic data in high-dimensional situations requires specialized techniques, e.g., the least absolute shrinkage and selection operator (LASSO). Although there are extensive research and application related to LASSO, the statistical inference and testing for the sparse model under EPS remain unknown. We propose a novel sparse model (EPS-LASSO) with hypothesis test for high-dimensional regression under EPS based on a decorrelated score function. The comprehensive simulation shows EPS-LASSO outperforms existing methods with stable type I error and FDR control. EPS-LASSO can provide a consistent power for both low- and high-dimensional situations compared with the other methods dealing with high-dimensional situations. The power of EPS-LASSO is close to other low-dimensional methods when the causal effect sizes are small and is superior when the effects are large. Applying EPS-LASSO to a transcriptome-wide gene expression study for obesity reveals 10 significant body mass index associated genes. Our results indicate that EPS-LASSO is an effective method for EPS data analysis, which can account for correlated predictors. The source code is available at https://github.com/xu1912/EPSLASSO. hdeng2@tulane.edu. Supplementary data are available at Bioinformatics online. © The Author (2018). Published by Oxford University Press. All rights reserved. For Permissions, please

  9. Factors predicting weight-bearing asymmetry 1month after unilateral total knee arthroplasty: a cross-sectional study.

    Science.gov (United States)

    Christiansen, Cory L; Bade, Michael J; Weitzenkamp, David A; Stevens-Lapsley, Jennifer E

    2013-03-01

    Factors predicting weight-bearing asymmetry (WBA) after unilateral total knee arthroplasty (TKA) are not known. However, identifying modifiable and non-modifiable predictors of WBA is needed to optimize rehabilitation, especially since WBA is negatively correlated to poor functional performance. The purpose of this study was to identify factors predictive of WBA during sit-stand transitions for people 1month following unilateral TKA. Fifty-nine people were tested preoperatively and 1month following unilateral TKA for WBA using average vertical ground reaction force under each foot during the Five Times Sit-to-Stand Test. Candidate variables tested in the regression analysis represented physical impairments (strength, muscle activation, pain, and motion), demographics, anthropometrics, and movement compensations. WBA, measured as the ratio of surgical/non-surgical limb vertical ground reaction force, was 0.69 (0.18) (mean (SD)) 1month after TKA. Regression analysis identified preoperative WBA (β=0.40), quadriceps strength ratio (β=0.31), and hamstrings strength ratio (β=0.19) as factors predictive of WBA 1month after TKA (R(2)=0.30). Greater amounts of WBA 1month after TKA are predicted by modifiable factors including habitual movement pattern and asymmetry in quadriceps and hamstrings strength. Copyright © 2012 Elsevier B.V. All rights reserved.

  10. Multiple Response Regression for Gaussian Mixture Models with Known Labels.

    Science.gov (United States)

    Lee, Wonyul; Du, Ying; Sun, Wei; Hayes, D Neil; Liu, Yufeng

    2012-12-01

    Multiple response regression is a useful regression technique to model multiple response variables using the same set of predictor variables. Most existing methods for multiple response regression are designed for modeling homogeneous data. In many applications, however, one may have heterogeneous data where the samples are divided into multiple groups. Our motivating example is a cancer dataset where the samples belong to multiple cancer subtypes. In this paper, we consider modeling the data coming from a mixture of several Gaussian distributions with known group labels. A naive approach is to split the data into several groups according to the labels and model each group separately. Although it is simple, this approach ignores potential common structures across different groups. We propose new penalized methods to model all groups jointly in which the common and unique structures can be identified. The proposed methods estimate the regression coefficient matrix, as well as the conditional inverse covariance matrix of response variables. Asymptotic properties of the proposed methods are explored. Through numerical examples, we demonstrate that both estimation and prediction can be improved by modeling all groups jointly using the proposed methods. An application to a glioblastoma cancer dataset reveals some interesting common and unique gene relationships across different cancer subtypes.

  11. Understanding logistic regression analysis.

    Science.gov (United States)

    Sperandei, Sandro

    2014-01-01

    Logistic regression is used to obtain odds ratio in the presence of more than one explanatory variable. The procedure is quite similar to multiple linear regression, with the exception that the response variable is binomial. The result is the impact of each variable on the odds ratio of the observed event of interest. The main advantage is to avoid confounding effects by analyzing the association of all variables together. In this article, we explain the logistic regression procedure using examples to make it as simple as possible. After definition of the technique, the basic interpretation of the results is highlighted and then some special issues are discussed.

  12. Forecasting urban water demand: A meta-regression analysis.

    Science.gov (United States)

    Sebri, Maamar

    2016-12-01

    Water managers and planners require accurate water demand forecasts over the short-, medium- and long-term for many purposes. These range from assessing water supply needs over spatial and temporal patterns to optimizing future investments and planning future allocations across competing sectors. This study surveys the empirical literature on the urban water demand forecasting using the meta-analytical approach. Specifically, using more than 600 estimates, a meta-regression analysis is conducted to identify explanations of cross-studies variation in accuracy of urban water demand forecasting. Our study finds that accuracy depends significantly on study characteristics, including demand periodicity, modeling method, forecasting horizon, model specification and sample size. The meta-regression results remain robust to different estimators employed as well as to a series of sensitivity checks performed. The importance of these findings lies in the conclusions and implications drawn out for regulators and policymakers and for academics alike. Copyright © 2016. Published by Elsevier Ltd.

  13. Time series regression model for infectious disease and weather.

    Science.gov (United States)

    Imai, Chisato; Armstrong, Ben; Chalabi, Zaid; Mangtani, Punam; Hashizume, Masahiro

    2015-10-01

    Time series regression has been developed and long used to evaluate the short-term associations of air pollution and weather with mortality or morbidity of non-infectious diseases. The application of the regression approaches from this tradition to infectious diseases, however, is less well explored and raises some new issues. We discuss and present potential solutions for five issues often arising in such analyses: changes in immune population, strong autocorrelations, a wide range of plausible lag structures and association patterns, seasonality adjustments, and large overdispersion. The potential approaches are illustrated with datasets of cholera cases and rainfall from Bangladesh and influenza and temperature in Tokyo. Though this article focuses on the application of the traditional time series regression to infectious diseases and weather factors, we also briefly introduce alternative approaches, including mathematical modeling, wavelet analysis, and autoregressive integrated moving average (ARIMA) models. Modifications proposed to standard time series regression practice include using sums of past cases as proxies for the immune population, and using the logarithm of lagged disease counts to control autocorrelation due to true contagion, both of which are motivated from "susceptible-infectious-recovered" (SIR) models. The complexity of lag structures and association patterns can often be informed by biological mechanisms and explored by using distributed lag non-linear models. For overdispersed models, alternative distribution models such as quasi-Poisson and negative binomial should be considered. Time series regression can be used to investigate dependence of infectious diseases on weather, but may need modifying to allow for features specific to this context. Copyright © 2015 The Authors. Published by Elsevier Inc. All rights reserved.

  14. Factors Affecting Adoption of Recommended Cauliflower Production Technology in Nepal

    Directory of Open Access Journals (Sweden)

    Pankaj Raj Dhital

    2016-05-01

    Full Text Available Binary logit regression model of econometrics was used to identify the factors affecting adoption of recommended agricultural technology by the commercial farmers of Nepal. A survey was carried out in 2012 in 120 households from Kavre district, Nepal. The objective was to appraise factors affecting adoption of recommended technology of cauliflower, finding out the level of adoption, identifying the constraints of cauliflower cultivation, assessing the perceived level of satisfaction and studying the relationship of certain selected variables on the adoption. Education, Occupation, contact with Personal Localite sources of information, Group membership and Experience were the most influencing factors for adoption of recommended technology. Though, other factors were not strong enough to contribute significantly but indirectly influence the adoption decision of farmers as combined effects. Occupation was found as negatively associated. Landholding size was strongly and positively influencing among the non significant factors. The level of satisfaction and the constraints of cauliflower production were also identified. Transfer of technology will be effective if and only if the client adopts the technology. For an effective transfer of technology, it is better to go for activities for the welfare of the socioeconomic factors of the farmers.

  15. The effect of postoperative medical treatment on left ventricular mass regression after aortic valve replacement.

    Science.gov (United States)

    Helder, Meghana R K; Ugur, Murat; Bavaria, Joseph E; Kshettry, Vibhu R; Groh, Mark A; Petracek, Michael R; Jones, Kent W; Suri, Rakesh M; Schaff, Hartzell V

    2015-03-01

    The study objective was to analyze factors associated with left ventricular mass regression in patients undergoing aortic valve replacement with a newer bioprosthesis, the Trifecta valve pericardial bioprosthesis (St Jude Medical Inc, St Paul, Minn). A total of 444 patients underwent aortic valve replacement with the Trifecta bioprosthesis from 2007 to 2009 at 6 US institutions. The clinical and echocardiographic data of 200 of these patients who had left ventricular hypertrophy and follow-up studies 1 year postoperatively were reviewed and compared to analyze factors affecting left ventricular mass regression. Mean (standard deviation) age of the 200 study patients was 73 (9) years, 66% were men, and 92% had pure or predominant aortic valve stenosis. Complete left ventricular mass regression was observed in 102 patients (51%) by 1 year postoperatively. In univariate analysis, male sex, implantation of larger valves, larger left ventricular end-diastolic volume, and beta-blocker or calcium-channel blocker treatment at dismissal were significantly associated with complete mass regression. In the multivariate model, odds ratios (95% confidence intervals) indicated that male sex (3.38 [1.39-8.26]) and beta-blocker or calcium-channel blocker treatment at dismissal (3.41 [1.40-8.34]) were associated with increased probability of complete left ventricular mass regression. Patients with higher preoperative systolic blood pressure were less likely to have complete left ventricular mass regression (0.98 [0.97-0.99]). Among patients with left ventricular hypertrophy, postoperative treatment with beta-blockers or calcium-channel blockers may enhance mass regression. This highlights the need for close medical follow-up after operation. Labeled valve size was not predictive of left ventricular mass regression. Copyright © 2015 The American Association for Thoracic Surgery. Published by Elsevier Inc. All rights reserved.

  16. Bayesian median regression for temporal gene expression data

    Science.gov (United States)

    Yu, Keming; Vinciotti, Veronica; Liu, Xiaohui; 't Hoen, Peter A. C.

    2007-09-01

    Most of the existing methods for the identification of biologically interesting genes in a temporal expression profiling dataset do not fully exploit the temporal ordering in the dataset and are based on normality assumptions for the gene expression. In this paper, we introduce a Bayesian median regression model to detect genes whose temporal profile is significantly different across a number of biological conditions. The regression model is defined by a polynomial function where both time and condition effects as well as interactions between the two are included. MCMC-based inference returns the posterior distribution of the polynomial coefficients. From this a simple Bayes factor test is proposed to test for significance. The estimation of the median rather than the mean, and within a Bayesian framework, increases the robustness of the method compared to a Hotelling T2-test previously suggested. This is shown on simulated data and on muscular dystrophy gene expression data.

  17. Modeling data for pancreatitis in presence of a duodenal diverticula using logistic regression

    Science.gov (United States)

    Dineva, S.; Prodanova, K.; Mlachkova, D.

    2013-12-01

    The presence of a periampullary duodenal diverticulum (PDD) is often observed during upper digestive tract barium meal studies and endoscopic retrograde cholangiopancreatography (ERCP). A few papers reported that the diverticulum had something to do with the incidence of pancreatitis. The aim of this study is to investigate if the presence of duodenal diverticula predisposes to the development of a pancreatic disease. A total 3966 patients who had undergone ERCP were studied retrospectively. They were divided into 2 groups-with and without PDD. Patients with a duodenal diverticula had a higher rate of acute pancreatitis. The duodenal diverticula is a risk factor for acute idiopathic pancreatitis. A multiple logistic regression to obtain adjusted estimate of odds and to identify if a PDD is a predictor of acute or chronic pancreatitis was performed. The software package STATISTICA 10.0 was used for analyzing the real data.

  18. Validating the Copenhagen Psychosocial Questionnaire (COPSOQ-II) Using Set-ESEM: Identifying Psychosocial Risk Factors in a Sample of School Principals.

    Science.gov (United States)

    Dicke, Theresa; Marsh, Herbert W; Riley, Philip; Parker, Philip D; Guo, Jiesi; Horwood, Marcus

    2018-01-01

    School principals world-wide report high levels of strain and attrition resulting in a shortage of qualified principals. It is thus crucial to identify psychosocial risk factors that reflect principals' occupational wellbeing. For this purpose, we used the Copenhagen Psychosocial Questionnaire (COPSOQ-II), a widely used self-report measure covering multiple psychosocial factors identified by leading occupational stress theories. We evaluated the COPSOQ-II regarding factor structure and longitudinal, discriminant, and convergent validity using latent structural equation modeling in a large sample of Australian school principals ( N = 2,049). Results reveal that confirmatory factor analysis produced marginally acceptable model fit. A novel approach we call set exploratory structural equation modeling (set-ESEM), where cross-loadings were only allowed within a priori defined sets of factors, fit well, and was more parsimonious than a full ESEM. Further multitrait-multimethod models based on the set-ESEM confirm the importance of a principal's psychosocial risk factors; Stressors and depression were related to demands and ill-being, while confidence and autonomy were related to wellbeing. We also show that working in the private sector was beneficial for showing a low psychosocial risk, while other demographics have little effects. Finally, we identify five latent risk profiles (high risk to no risk) of school principals based on all psychosocial factors. Overall the research presented here closes the theory application gap of a strong multi-dimensional measure of psychosocial risk-factors.

  19. Minimax Regression Quantiles

    DEFF Research Database (Denmark)

    Bache, Stefan Holst

    A new and alternative quantile regression estimator is developed and it is shown that the estimator is root n-consistent and asymptotically normal. The estimator is based on a minimax ‘deviance function’ and has asymptotically equivalent properties to the usual quantile regression estimator. It is......, however, a different and therefore new estimator. It allows for both linear- and nonlinear model specifications. A simple algorithm for computing the estimates is proposed. It seems to work quite well in practice but whether it has theoretical justification is still an open question....

  20. Regression with Sparse Approximations of Data

    DEFF Research Database (Denmark)

    Noorzad, Pardis; Sturm, Bob L.

    2012-01-01

    We propose sparse approximation weighted regression (SPARROW), a method for local estimation of the regression function that uses sparse approximation with a dictionary of measurements. SPARROW estimates the regression function at a point with a linear combination of a few regressands selected...... by a sparse approximation of the point in terms of the regressors. We show SPARROW can be considered a variant of \\(k\\)-nearest neighbors regression (\\(k\\)-NNR), and more generally, local polynomial kernel regression. Unlike \\(k\\)-NNR, however, SPARROW can adapt the number of regressors to use based...