WorldWideScience

Sample records for single item scales

  1. The Single-Item Math Anxiety Scale: An Alternative Way of Measuring Mathematical Anxiety

    Science.gov (United States)

    Núñez-Peña, M. Isabel; Guilera, Georgina; Suárez-Pellicioni, Macarena

    2014-01-01

    This study examined whether the Single-Item Math Anxiety Scale (SIMA), based on the item suggested by Ashcraft, provided valid and reliable scores of mathematical anxiety. A large sample of university students (n = 279) was administered the SIMA and the 25-item Shortened Math Anxiety Rating Scale (sMARS) to evaluate the relation between the scores…

  2. A psychometric comparison of three scales and a single-item measure to assess sexual satisfaction.

    Science.gov (United States)

    Mark, Kristen P; Herbenick, Debby; Fortenberry, J Dennis; Sanders, Stephanie; Reece, Michael

    2014-01-01

    This study was designed to systematically compare and contrast the psychometric properties of three scales developed to measure sexual satisfaction and a single-item measure of sexual satisfaction. The Index of Sexual Satisfaction (ISS), Global Measure of Sexual Satisfaction (GMSEX), and the New Sexual Satisfaction Scale-Short (NSSS-S) were compared to one another and to a single-item measure of sexual satisfaction. Conceptualization of the constructs, distribution of scores, internal consistency, convergent validity, test-retest reliability, and factor structure were compared between the measures. A total of 211 men and 214 women completed the scales and a measure of relationship satisfaction, with 33% (n = 139) of the sample reassessed two months later. All scales demonstrated appropriate distribution of scores and adequate internal consistency. The GMSEX, NSSS-S, and the single-item measure demonstrated convergent validity. Test-retest reliability was demonstrated by the ISS, GMSEX, and NSSS-S, but not the single-item measure. Taken together, the GMSEX received the strongest psychometric support in this sample for a unidimensional measure of sexual satisfaction and the NSSS-S received the strongest psychometric support in this sample for a bidimensional measure of sexual satisfaction.

  3. Development and validation of the Single Item Trait Empathy Scale (SITES).

    Science.gov (United States)

    Konrath, Sara; Meier, Brian P; Bushman, Brad J

    2018-04-01

    Empathy involves feeling compassion for others and imagining how they feel. In this article, we develop and validate the Single Item Trait Empathy Scale (SITES), which contains only one item that takes seconds to complete. In seven studies (N=5,724), the SITES was found to be both reliable and valid. It correlated in expected ways with a wide variety of intrapersonal outcomes. For example, it is negatively correlated with narcissism, depression, anxiety, and alexithymia. In contrast, it is positively correlated with other measures of empathy, self-esteem, subjective well-being, and agreeableness. The SITES also correlates with a wide variety of interpersonal outcomes, especially compassion for others and helping others. The SITES is recommended in situations when time or question quantity is constrained.

  4. Development and validation of the Single Item Narcissism Scale (SINS).

    Science.gov (United States)

    Konrath, Sara; Meier, Brian P; Bushman, Brad J

    2014-01-01

    The narcissistic personality is characterized by grandiosity, entitlement, and low empathy. This paper describes the development and validation of the Single Item Narcissism Scale (SINS). Although the use of longer instruments is superior in most circumstances, we recommend the SINS in some circumstances (e.g. under serious time constraints, online studies). In 11 independent studies (total N = 2,250), we demonstrate the SINS' psychometric properties. The SINS is significantly correlated with longer narcissism scales, but uncorrelated with self-esteem. It also has high test-retest reliability. We validate the SINS in a variety of samples (e.g., undergraduates, nationally representative adults), intrapersonal correlates (e.g., positive affect, depression), and interpersonal correlates (e.g., aggression, relationship quality, prosocial behavior). The SINS taps into the more fragile and less desirable components of narcissism. The SINS can be a useful tool for researchers, especially when it is important to measure narcissism with constraints preventing the use of longer measures.

  5. Development and Validation of the Single Item Narcissism Scale (SINS)

    Science.gov (United States)

    Konrath, Sara; Meier, Brian P.; Bushman, Brad J.

    2014-01-01

    Main Objectives: The narcissistic personality is characterized by grandiosity, entitlement, and low empathy. This paper describes the development and validation of the Single Item Narcissism Scale (SINS). Although the use of longer instruments is superior in most circumstances, we recommend the SINS in some circumstances (e.g. under serious time constraints, online studies). Methods: In 11 independent studies (total N = 2,250), we demonstrate the SINS' psychometric properties. Results: The SINS is significantly correlated with longer narcissism scales, but uncorrelated with self-esteem. It also has high test-retest reliability. We validate the SINS in a variety of samples (e.g., undergraduates, nationally representative adults), intrapersonal correlates (e.g., positive affect, depression), and interpersonal correlates (e.g., aggression, relationship quality, prosocial behavior). The SINS taps into the more fragile and less desirable components of narcissism. Significance: The SINS can be a useful tool for researchers, especially when it is important to measure narcissism with constraints preventing the use of longer measures. PMID: 25093508

  6. Development and validation of the Single Item Narcissism Scale (SINS).

    Directory of Open Access Journals (Sweden)

    Sara Konrath

    MAIN OBJECTIVES: The narcissistic personality is characterized by grandiosity, entitlement, and low empathy. This paper describes the development and validation of the Single Item Narcissism Scale (SINS). Although the use of longer instruments is superior in most circumstances, we recommend the SINS in some circumstances (e.g. under serious time constraints, online studies). METHODS: In 11 independent studies (total N = 2,250), we demonstrate the SINS' psychometric properties. RESULTS: The SINS is significantly correlated with longer narcissism scales, but uncorrelated with self-esteem. It also has high test-retest reliability. We validate the SINS in a variety of samples (e.g., undergraduates, nationally representative adults), intrapersonal correlates (e.g., positive affect, depression), and interpersonal correlates (e.g., aggression, relationship quality, prosocial behavior). The SINS taps into the more fragile and less desirable components of narcissism. SIGNIFICANCE: The SINS can be a useful tool for researchers, especially when it is important to measure narcissism with constraints preventing the use of longer measures.

  7. Cross-National Prevalence of Traditional Bullying, Traditional Victimization, Cyberbullying and Cyber-Victimization: Comparing Single-Item and Multiple-Item Approaches of Measurement

    Science.gov (United States)

    Yanagida, Takuya; Gradinger, Petra; Strohmeier, Dagmar; Solomontos-Kountouri, Olga; Trip, Simona; Bora, Carmen

    2016-01-01

    Many large-scale cross-national studies rely on a single-item measurement when comparing prevalence rates of traditional bullying, traditional victimization, cyberbullying, and cyber-victimization between countries. However, the reliability and validity of single-item measurement approaches are highly problematic and might be biased. Data from…

  8. Concurrent Validity and Sensitivity to Change of Direct Behavior Rating Single-Item Scales (DBR-SIS) within an Elementary Sample

    Science.gov (United States)

    Smith, Rhonda L.; Eklund, Katie; Kilgus, Stephen P.

    2018-01-01

    The purpose of this study was to evaluate the concurrent validity, sensitivity to change, and teacher acceptability of Direct Behavior Rating single-item scales (DBR-SIS), a brief progress monitoring measure designed to assess student behavioral change in response to intervention. Twenty-four elementary teacher-student dyads implemented a daily…

  9. Item reduction and psychometric validation of the Oily Skin Self Assessment Scale (OSSAS) and the Oily Skin Impact Scale (OSIS).

    Science.gov (United States)

    Arbuckle, Robert; Clark, Marci; Harness, Jane; Bonner, Nicola; Scott, Jane; Draelos, Zoe; Rizer, Ronald; Yeh, Yating; Copley-Merriman, Kati

    2009-01-01

    Developed using focus groups, the Oily Skin Self Assessment Scale (OSSAS) and Oily Skin Impact Scale (OSIS) are patient-reported outcome measures of oily facial skin. The aim of this study was to finalize the item-scale structure of the instruments and perform psychometric validation in adults with self-reported oily facial skin. The OSSAS and OSIS were administered to 202 adult subjects with oily facial skin in the United States. A subgroup of 152 subjects returned, 4 to 10 days later, for test–retest reliability evaluation. Of the 202 participants, 72.8% were female; 64.4% had self-reported nonsevere acne. Item reduction resulted in a 14-item OSSAS with Sensation (five items), Tactile (four items) and Visual (four items) domains, a single blotting item, and an overall oiliness item. The OSIS was reduced to two three-item domains assessing Annoyance and Self-Image. Confirmatory factor analysis supported the construct validity of the final item-scale structures. The OSSAS and OSIS scales had acceptable item convergent validity (item-scale correlations >0.40) and floor and ceiling effects (… skin severity (P … skin (P … skin), as assessments of self-reported oily facial skin severity and its emotional impact, respectively.

  10. A Model-Free Diagnostic for Single-Peakedness of Item Responses Using Ordered Conditional Means

    Science.gov (United States)

    Polak, Marike; De Rooij, Mark; Heiser, Willem J.

    2012-01-01

    In this article we propose a model-free diagnostic for single-peakedness (unimodality) of item responses. Presuming a unidimensional unfolding scale and a given item ordering, we approximate item response functions of all items based on ordered conditional means (OCM). The proposed OCM methodology is based on Thurstone & Chave's (1929) "criterion…

  11. Assessing the validity of single-item life satisfaction measures: results from three large samples.

    Science.gov (United States)

    Cheung, Felix; Lucas, Richard E

    2014-12-01

    The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS), a more psychometrically established measure. Two large samples from Washington (N = 13,064) and Oregon (N = 2,277) recruited by the Behavioral Risk Factor Surveillance System and a representative German sample (N = 1,312) recruited by the Germany Socio-Economic Panel were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Consistent across three samples, single-item life satisfaction measures demonstrated a substantial degree of criterion validity with the SWLS (zero-order r = 0.62-0.64; disattenuated r = 0.78-0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001-0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS was very small (average absolute difference = 0.015-0.042). Single-item life satisfaction measures performed very similarly compared to the multiple-item SWLS. Social scientists would get virtually identical answers to substantive questions regardless of which measure they use.
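    The gap between the zero-order and disattenuated correlations quoted above can be illustrated with the classical correction for attenuation, which divides the observed correlation by the square root of the product of the two measures' reliabilities. The sketch below is only illustrative: the abstract does not report the reliabilities the authors used, so the function name `disattenuate` and the reliability values are hypothetical assumptions.

```python
import math


def disattenuate(r_xy: float, rel_x: float, rel_y: float) -> float:
    """Classical correction for attenuation.

    r_xy  : observed (zero-order) correlation between two measures
    rel_x : reliability of the first measure (0 < rel_x <= 1)
    rel_y : reliability of the second measure (0 < rel_y <= 1)
    """
    if not (0 < rel_x <= 1 and 0 < rel_y <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(rel_x * rel_y)


# Hypothetical reliabilities, NOT taken from the study.
observed_r = 0.62
single_item_reliability = 0.72   # assumed value for illustration
swls_reliability = 0.88          # assumed value for illustration
print(round(disattenuate(observed_r, single_item_reliability, swls_reliability), 2))
```

    Lower assumed reliabilities produce a larger corrected correlation, which is why a single item, typically less reliable than a multi-item scale, can show a noticeably higher disattenuated value than its zero-order one.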

  12. Reliability and validity of the Spanish version of the 10-item Connor-Davidson Resilience Scale (10-item CD-RISC) in young adults

    Directory of Open Access Journals (Sweden)

    García-Campayo Javier

    2011-08-01

    Background: The 10-item Connor-Davidson Resilience Scale (10-item CD-RISC) is an instrument for measuring resilience that has shown good psychometric properties in its original version in English. The aim of this study was to evaluate the validity and reliability of the Spanish version of the 10-item CD-RISC in young adults and to verify whether it is structured in a single dimension as in the original English version. Findings: Cross-sectional observational study including 681 university students ranging in age from 18 to 30 years. The number of latent factors in the 10 items of the scale was analyzed by exploratory factor analysis. Confirmatory factor analysis was used to verify whether a single factor underlies the 10 items of the scale as in the original version in English. The convergent validity was analyzed by testing whether the mean of the scores of the mental component of SF-12 (MCS) and the quality of sleep as measured with the Pittsburgh Sleep Index (PSQI) were higher in subjects with better levels of resilience. The internal consistency of the 10-item CD-RISC was estimated using the Cronbach α test and test-retest reliability was estimated with the intraclass correlation coefficient. The Cronbach α coefficient was 0.85 and the test-retest intraclass correlation coefficient was 0.71. The mean MCS score and the level of quality of sleep in both men and women were significantly worse in subjects with lower resilience scores. Conclusions: The Spanish version of the 10-item CD-RISC showed good psychometric properties in young adults and thus can be used as a reliable and valid instrument for measuring resilience. Our study confirmed that a single factor underlies the resilience construct, as was the case of the original scale in English.
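    For readers unfamiliar with the internal-consistency statistic reported above, Cronbach's α can be computed from a respondents-by-items score matrix as k/(k-1) × (1 − Σ item variances / variance of total scores). The following is a minimal sketch; the function name `cronbach_alpha` and the toy response matrix are illustrative assumptions and are not data from the study.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a list of rows (one row per respondent, k item scores each)."""
    k = len(responses[0])
    if k < 2 or len(responses) < 2:
        raise ValueError("need at least 2 items and 2 respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in responses]) for i in range(k)]
    # Sample variance of each respondent's summed score.
    total_var = variance([sum(row) for row in responses])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Toy data (hypothetical): 5 respondents answering 3 items on a 0-5 scale.
toy = [[3, 4, 3], [1, 2, 1], [4, 4, 5], [2, 3, 2], [0, 1, 1]]
print(round(cronbach_alpha(toy), 3))
```

    The statistic needs at least two items, which is one reason single-item measures are usually evaluated with test-retest coefficients instead.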

  13. Assessing the Validity of Single-item Life Satisfaction Measures: Results from Three Large Samples

    Science.gov (United States)

    Cheung, Felix; Lucas, Richard E.

    2014-01-01

    Purpose: The present paper assessed the validity of single-item life satisfaction measures by comparing single-item measures to the Satisfaction with Life Scale (SWLS), a more psychometrically established measure. Methods: Two large samples from Washington (N=13,064) and Oregon (N=2,277) recruited by the Behavioral Risk Factor Surveillance System (BRFSS) and a representative German sample (N=1,312) recruited by the Germany Socio-Economic Panel (GSOEP) were included in the present analyses. Single-item life satisfaction measures and the SWLS were correlated with theoretically relevant variables, such as demographics, subjective health, domain satisfaction, and affect. The correlations between the two life satisfaction measures and these variables were examined to assess the construct validity of single-item life satisfaction measures. Results: Consistent across three samples, single-item life satisfaction measures demonstrated a substantial degree of criterion validity with the SWLS (zero-order r = 0.62 – 0.64; disattenuated r = 0.78 – 0.80). Patterns of statistical significance for correlations with theoretically relevant variables were the same across single-item measures and the SWLS. Single-item measures did not produce systematically different correlations compared to the SWLS (average difference = 0.001 – 0.005). The average absolute difference in the magnitudes of the correlations produced by single-item measures and the SWLS was very small (average absolute difference = 0.015 – 0.042). Conclusions: Single-item life satisfaction measures performed very similarly compared to the multiple-item SWLS. Social scientists would get virtually identical answers to substantive questions regardless of which measure they use. PMID: 24890827

  14. The validity of the Satisfaction with Life Scale in adolescents and a comparison with single-item life satisfaction measures: a preliminary study.

    Science.gov (United States)

    Jovanović, Veljko

    2016-12-01

    The validity of the life satisfaction measures commonly used among adults has rarely been examined in adolescent samples. The present research had two main goals: (1) to evaluate the structural validity of the Satisfaction with Life Scale (SWLS) among adolescents and to test measurement invariance across gender; (2) to compare the criterion and convergent validity of the SWLS and single-item life satisfaction measures among adolescents. Three samples of Serbian adolescents were recruited for the present research. Study 1 (N = 481, mean age = 17.01 years) examined the structure of the SWLS via confirmatory factor analysis (CFA) and evaluated measurement invariance of the SWLS across gender by a multi-group CFA. Study 2 (N = 283, mean age = 17.34 years) and Study 3 (N = 220, mean age = 16.73 years) compared the convergent validity of the SWLS and single-item life satisfaction measures. The results of Study 1 supported the original one-factor model of the SWLS among adolescents and provided evidence for strong measurement invariance of the SWLS across gender. The findings of Study 2 and Study 3 showed that the SWLS and single-item measures were equally valid and strongly associated (r = .734 in Study 2 and r = .668 in Study 3). No substantial differences in correlations with school success and well-being indicators were found between the SWLS and single-item measures. Our findings support the use of the SWLS among adolescents and indicate that single-item life satisfaction measures perform as well as the SWLS in adolescent samples.

  15. Item-Level Psychometrics of the Glasgow Outcome Scale: Extended Structured Interviews.

    Science.gov (United States)

    Hong, Ickpyo; Li, Chih-Ying; Velozo, Craig A

    2016-04-01

    The Glasgow Outcome Scale-Extended (GOSE) structured interview captures critical components of activities and participation, including home, shopping, work, leisure, and family/friend relationships. Eighty-nine community-dwelling adults with mild-moderate traumatic brain injury (TBI) were recruited (average = 2.7 years post injury). Nine of the 19 items were used for the psychometric analysis. Factor analysis and item-level psychometrics were investigated using the Rasch partial-credit model. Although the principal components analysis of residuals suggests that a single measurement factor dominates the measure, the instrument did not meet the factor analysis criteria. Five items met the rating scale criteria. Eight items fit the Rasch model. The instrument demonstrated low person reliability (0.63), low person strata (2.07), and a slight ceiling effect. The GOSE demonstrated limitations in precisely measuring activities/participation for individuals after TBI. Future studies should examine the impact of the low precision of the GOSE on effect size.

  16. Brief Sensation Seeking Scale: Latent structure of 8-item and 4-item versions in Peruvian adolescents.

    Science.gov (United States)

    Merino-Soto, Cesar; Salas Blas, Edwin

    2018-01-01

    This research aimed to validate two brief sensation seeking scales with Peruvian adolescents: the eight-item scale (BSSS8; Hoyle, Stephenson, Palmgreen, Lorch, & Donohew, 2002) and the four-item scale (BSSS4; Stephenson, Hoyle, Slater, & Palmgreen, 2003). Questionnaires were administered to 618 voluntary participants, with an average age of 13.6 years, from different levels of high school in state and private schools in a district in the south of Lima. The study analyzed the internal structure of both short versions using three models: a) unidimensional (M1), b) oblique or related dimensions (M2), and c) the bifactor model (M3). Results show that both instruments have a single dimension which best represents the variability of the items; a fact that can be explained both by the complexity of the concept and by the small number of items representing each factor, which is more noticeable in the BSSS4. Reliability is within levels found by previous studies: alpha: .745 for the BSSS8 and .643 for the BSSS4; omega coefficient: .747 for the BSSS8 and .651 for the BSSS4. These are considered suitable for the type of instruments studied. Based on the correlation between the two instruments, it was found that there are satisfactory levels of equivalence between the BSSS8 and BSSS4. However, it is recommended that the BSSS4 be used mainly for research and for the purpose of describing populations.

  17. Lawton IADL scale in dementia: can item response theory make it more informative?

    Science.gov (United States)

    McGrory, Sarah; Shenkin, Susan D; Austin, Elizabeth J; Starr, John M

    2014-07-01

    Impairment of functional abilities represents a crucial component of dementia diagnosis. Current functional measures rely on the traditional aggregate method of summing raw scores. While this summary score provides a quick representation of a person's ability, it disregards useful information on the item level. The aim was to use item response theory (IRT) methods to increase the interpretive power of the Lawton Instrumental Activities of Daily Living (IADL) scale by establishing a hierarchy of item 'difficulty' and 'discrimination'. This cross-sectional study applied IRT methods to the analysis of IADL outcomes. Participants were 202 members of the Scottish Dementia Research Interest Register (mean age = 76.39, range = 56-93, SD = 7.89 years) with complete itemised data available. A Mokken scale with good reliability (Molenaar Sijtsma statistic 0.79) was obtained, satisfying the IRT assumption that the items comprise a single unidimensional scale. The eight items in the scale could be placed on a hierarchy of 'difficulty' (H coefficient = 0.55), with 'Shopping' being the most 'difficult' item and 'Telephone use' being the least 'difficult' item. 'Shopping' was the most discriminatory item differentiating well between patients of different levels of ability. IRT methods are capable of providing more information about functional impairment than a summed score. 'Shopping' and 'Telephone use' were identified as items that reveal key information about a patient's level of ability, and could be useful screening questions for clinicians.

  18. Rating the methodological quality of single-subject designs and n-of-1 trials: introducing the Single-Case Experimental Design (SCED) Scale.

    Science.gov (United States)

    Tate, Robyn L; McDonald, Skye; Perdices, Michael; Togher, Leanne; Schultz, Regina; Savage, Sharon

    2008-08-01

    Rating scales that assess methodological quality of clinical trials provide a means to critically appraise the literature. Scales are currently available to rate randomised and non-randomised controlled trials, but there are none that assess single-subject designs. The Single-Case Experimental Design (SCED) Scale was developed for this purpose and evaluated for reliability. Six clinical researchers who were trained and experienced in rating methodological quality of clinical trials developed the scale and participated in reliability studies. The SCED Scale is an 11-item rating scale for single-subject designs, of which 10 items are used to assess methodological quality and use of statistical analysis. The scale was developed and refined over a 3-year period. Content validity was addressed by identifying items to reduce the main sources of bias in single-case methodology as stipulated by authorities in the field, which were empirically tested against 85 published reports. Inter-rater reliability was assessed using a random sample of 20/312 single-subject reports archived in the Psychological Database of Brain Impairment Treatment Efficacy (PsycBITE). Inter-rater reliability for the total score was excellent, both for individual raters (overall ICC = 0.84; 95% confidence interval 0.73-0.92) and for consensus ratings between pairs of raters (overall ICC = 0.88; 95% confidence interval 0.78-0.95). Item reliability was fair to excellent for consensus ratings between pairs of raters (range k = 0.48 to 1.00). The results were replicated with two independent novice raters who were trained in the use of the scale (ICC = 0.88, 95% confidence interval 0.73-0.95). The SCED Scale thus provides a brief and valid evaluation of methodological quality of single-subject designs, with the total score demonstrating excellent inter-rater reliability using both individual and consensus ratings. Items from the scale can also be used as a checklist in the design, reporting and critical

  19. Item analysis of single-peaked response data : the psychometric evaluation of bipolar measurement scales

    NARCIS (Netherlands)

    Polak, Maaike Geertruida

    2011-01-01

    The thesis explains the fundamental difference between unipolar and bipolar measurement scales for psychological characteristics. We explore the use of correspondence analysis (CA), a technique that is similar to principal component analysis and is available in SAS and SPSS, to select items that

  20. Psychometric properties of a single-item scale to assess sleep quality among individuals with fibromyalgia

    Directory of Open Access Journals (Sweden)

    Sadosky Alesia B

    2009-06-01

    Background: Sleep disturbances are a common and bothersome symptom of fibromyalgia (FM). This study reports psychometric properties of a single-item scale to assess sleep quality among individuals with FM. Methods: Analyses were based on data from two randomized, double-blind, placebo-controlled trials of pregabalin (studies 1056 and 1077). In a daily diary, patients reported the quality of their sleep on a numeric rating scale ranging from 0 ("best possible sleep") to 10 ("worst possible sleep"). Test re-test reliability of the Sleep Quality Scale was evaluated by computing intraclass correlation coefficients. Pearson correlation coefficients were computed between baseline Sleep Quality scores and baseline pain diary and Medical Outcomes Study (MOS) Sleep scores. Responsiveness to treatment was evaluated by standardized effect sizes computed as the difference between least squares mean changes in Sleep Quality scores in the pregabalin and placebo groups divided by the standard deviation of Sleep Quality scores across all patients at baseline. Results: Studies 1056 and 1077 included 748 and 745 patients, respectively. Most patients were female (study 1056: 94.4%; study 1077: 94.5%) and white (study 1056: 90.2%; study 1077: 91.0%). Mean ages were 48.8 years (study 1056) and 50.1 years (study 1077). Test re-test reliability coefficients of the Sleep Quality Scale were 0.91 and 0.90 in the 1056 and 1077 studies, respectively. Pearson correlation coefficients between baseline Sleep Quality scores and baseline pain diary scores were 0.64 (p … Conclusion: These results provide evidence of the reproducibility, convergent validity, and responsiveness to treatment of the Sleep Quality Scale and provide a foundation for its further use and evaluation in FM patients.
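    The responsiveness statistic described in the Methods of this record is a simple ratio, sketched below in Python. The function name `standardized_effect_size` and all numeric inputs are hypothetical; the abstract as extracted does not give the least squares means or the baseline standard deviation.

```python
def standardized_effect_size(ls_mean_change_treatment: float,
                             ls_mean_change_placebo: float,
                             baseline_sd_all_patients: float) -> float:
    """Difference in least squares mean change (treatment minus placebo),
    divided by the baseline standard deviation pooled across all patients."""
    if baseline_sd_all_patients <= 0:
        raise ValueError("baseline standard deviation must be positive")
    difference = ls_mean_change_treatment - ls_mean_change_placebo
    return difference / baseline_sd_all_patients


# Hypothetical inputs on a 0-10 scale where higher scores mean worse sleep,
# so a more negative change indicates greater improvement.
effect = standardized_effect_size(
    ls_mean_change_treatment=-2.0,   # assumed value
    ls_mean_change_placebo=-1.0,     # assumed value
    baseline_sd_all_patients=2.0,    # assumed value
)
print(effect)  # -0.5
```

    Because the scale is scored so that 10 is the worst possible sleep, a negative standardized effect size corresponds to a benefit of treatment over placebo.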

  1. The utility of single-item readiness screeners in middle school.

    Science.gov (United States)

    Lewis, Crystal G; Herman, Keith C; Huang, Francis L; Stormont, Melissa; Grossman, Caroline; Eddy, Colleen; Reinke, Wendy M

    2017-10-01

    This study examined the benefit of utilizing one-item academic and one-item behavior readiness teacher-rated screeners at the beginning of the school year to predict end-of-school year outcomes for middle school students. The Middle School Academic and Behavior Readiness (M-ABR) screeners were developed to provide an efficient and effective way to assess readiness in students. Participants included 889 students in 62 middle school classrooms in an urban Missouri school district. Concurrent validity with the M-ABR items and other indicators of readiness in the fall were evaluated using Pearson product-moment correlation coefficients, with the academic readiness item having medium to strong correlations with other baseline academic indicators (r=±0.56 to 0.91) and the behavior readiness item having low to strong correlations with baseline behavior items (r=±0.20 to 0.79). Next, the predictive validity of the M-ABR items was analyzed with hierarchical linear regressions using end-of-year outcomes as the dependent variable. The academic and behavior readiness items demonstrated adequate validity for all outcomes with moderate effects (β=±0.31 to 0.73 for academic outcomes and β=±0.24 to 0.59 for behavioral outcomes) after controlling for baseline demographics. Even after controlling for baseline scores, the M-ABR items predicted unique variance in almost all outcome variables. Four conditional probability indices were calculated to obtain an optimal cut score, to determine ready vs. not ready, for both single-item M-ABR scales. The cut point of "fair" yielded the most acceptable values for the indices. The odds ratios (OR) of experiencing negative outcomes given a "fair" or lower readiness rating (2 or below on the M-ABR screeners) at the beginning of the year were significant and strong for all outcomes (OR=2.29 to OR=14.46), except for internalizing problems. These findings suggest promise for using single readiness items to screen for varying negative end

  2. Polytomous latent scales for the investigation of the ordering of items

    NARCIS (Netherlands)

    Ligtvoet, R.; van der Ark, L.A.; Bergsma, W. P.; Sijtsma, K.

    2011-01-01

    We propose three latent scales within the framework of nonparametric item response theory for polytomously scored items. Latent scales are models that imply an invariant item ordering, meaning that the order of the items is the same for each measurement value on the latent scale. This ordering

  3. Single-Item Measurement of Suicidal Behaviors: Validity and Consequences of Misclassification.

    Directory of Open Access Journals (Sweden)

    Alexander J Millner

    Suicide is a leading cause of death worldwide. Although research has made strides in better defining suicidal behaviors, there has been less focus on accurate measurement. Currently, the widespread use of self-report, single-item questions to assess suicide ideation, plans and attempts may contribute to measurement problems and misclassification. We examined the validity of single-item measurement and the potential for statistical errors. Over 1,500 participants completed an online survey containing single-item questions regarding a history of suicidal behaviors, followed by questions with more precise language, multiple response options and narrative responses to examine the validity of single-item questions. We also conducted simulations to test whether common statistical tests are robust against the degree of misclassification produced by the use of single-items. We found that 11.3% of participants who endorsed a single-item suicide attempt measure engaged in behavior that would not meet the standard definition of a suicide attempt. Similarly, 8.8% of those who endorsed a single-item measure of suicide ideation endorsed thoughts that would not meet standard definitions of suicide ideation. Statistical simulations revealed that this level of misclassification substantially decreases statistical power and increases the likelihood of false conclusions from statistical tests. Providing a wider range of response options for each item reduced the misclassification rate by approximately half. Overall, the use of single-item, self-report questions to assess the presence of suicidal behaviors leads to misclassification, increasing the likelihood of statistical decision errors. Improving the measurement of suicidal behaviors is critical to increase understanding and prevention of suicide.

  4. Vegetable parenting practices scale: Item response modeling analyses

    Science.gov (United States)

    Our objective was to evaluate the psychometric properties of a vegetable parenting practices scale using multidimensional polytomous item response modeling which enables assessing item fit to latent variables and the distributional characteristics of the items in comparison to the respondents. We al...

  5. Measuring single constructs by single items: Constructing an even shorter version of the "Short Five" personality inventory.

    Directory of Open Access Journals (Sweden)

    Kenn Konstabel

    The aim of this study was to construct a short, 30-item personality questionnaire that would be, in terms of content and meaning of the scores, as comparable as possible with longer, well-established inventories such as NEO PI-R and its clones. To do this, we shortened the formerly constructed 60-item "Short Five" (S5) by half so that each subscale would be represented by a single item. We compared all possibilities of selecting 30 items (preserving balanced keying within each domain of the five-factor model) in terms of correlations with well-established scales, self-peer correlations, and clarity of meaning, and selected an optimal combination for each domain. The resulting shortened questionnaire, XS5, was compared to the original S5 using data from student samples in 6 different countries (Estonia, Finland, UK, Germany, Spain, and China), and a representative Finnish sample. The correlations between XS5 domain scales and their longer counterparts from well-established scales ranged from 0.74 to 0.84; the difference from the equivalent correlations for full version of S5 or from meta-analytic short-term dependability coefficients of NEO PI-R was not large. In terms of prediction of external criteria (emotional experience and self-reported behaviours), there were no important differences between XS5, S5, and the longer well-established scales. Controlling for acquiescence did not improve the prediction of criteria, self-peer correlations, or correlations with longer scales, but it did improve internal reliability and, in some analyses, comparability of the principal component structure. XS5 can be recommended as an economic measure of the five-factor model of personality at the level of domain scales; it has reasonable psychometric properties, fair correlations with longer well-established scales, and it can predict emotional experience and self-reported behaviours no worse than S5. When subscales are essential, we would still recommend using the

  6. Measuring single constructs by single items: Constructing an even shorter version of the “Short Five” personality inventory

    Science.gov (United States)

    Konstabel, Kenn; Lönnqvist, Jan-Erik; Leikas, Sointu; García Velázquez, Regina; Qin, Hiaying; Verkasalo, Markku; Walkowitz, Gari

    2017-01-01

    The aim of this study was to construct a short, 30-item personality questionnaire that would be, in terms of content and meaning of the scores, as comparable as possible with longer, well-established inventories such as NEO PI-R and its clones. To do this, we shortened the formerly constructed 60-item “Short Five” (S5) by half so that each subscale would be represented by a single item. We compared all possibilities of selecting 30 items (preserving balanced keying within each domain of the five-factor model) in terms of correlations with well-established scales, self-peer correlations, and clarity of meaning, and selected an optimal combination for each domain. The resulting shortened questionnaire, XS5, was compared to the original S5 using data from student samples in 6 different countries (Estonia, Finland, UK, Germany, Spain, and China), and a representative Finnish sample. The correlations between XS5 domain scales and their longer counterparts from well-established scales ranged from 0.74 to 0.84; the difference from the equivalent correlations for full version of S5 or from meta-analytic short-term dependability coefficients of NEO PI-R was not large. In terms of prediction of external criteria (emotional experience and self-reported behaviours), there were no important differences between XS5, S5, and the longer well-established scales. Controlling for acquiescence did not improve the prediction of criteria, self-peer correlations, or correlations with longer scales, but it did improve internal reliability and, in some analyses, comparability of the principal component structure. XS5 can be recommended as an economic measure of the five-factor model of personality at the level of domain scales; it has reasonable psychometric properties, fair correlations with longer well-established scales, and it can predict emotional experience and self-reported behaviours no worse than S5. When subscales are essential, we would still recommend using the full version

  7. Item Response Theory Models for Wording Effects in Mixed-Format Scales

    Science.gov (United States)

    Wang, Wen-Chung; Chen, Hui-Fang; Jin, Kuan-Yu

    2015-01-01

    Many scales contain both positively and negatively worded items. Reverse recoding of negatively worded items might not be enough for them to function as positively worded items do. In this study, we commented on the drawbacks of existing approaches to wording effect in mixed-format scales and used bi-factor item response theory (IRT) models to…

  8. A Comparison of the 27-Item and 12-Item Intolerance of Uncertainty Scales

    Science.gov (United States)

    Khawaja, Nigar G.; Yu, Lai Ngo Heidi

    2010-01-01

    The 27-item Intolerance of Uncertainty Scale (IUS) has become one of the most frequently used measures of Intolerance of Uncertainty. More recently, an abridged, 12-item version of the IUS has been developed. The current research used clinical (n = 50) and non-clinical (n = 56) samples to examine and compare the psychometric properties of both…

  9. Work ability as prognostic risk marker of disability pension : Single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; Rhenen, van W.; Groothoff, J.W.; Klink, van der J.J.L.; Twisk, J.W.R.; Heymans, M.W.

    2014-01-01

    Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP.

  10. Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, C.A.M.; van Rhenen, W.; Groothoff, J.W.; van der Klink, J.J.L.; Twisk, J.W.R.; Heymans, M.W.

    2014-01-01

    Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This

  11. Work ability as prognostic risk marker of disability pension : single-item work ability score versus multi-item work ability index

    NARCIS (Netherlands)

    Roelen, Corne A. M.; van Rhenen, Willem; Groothoff, Johan W.; van der Klink, Jac J. L.; Twisk, Jos W. R.; Heymans, Martijn W.

    Objectives Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. Methods This

  12. Mokken scale analysis : Between the Guttman scale and parametric item response theory

    NARCIS (Netherlands)

    van Schuur, Wijbrandt H.

    2003-01-01

    This article introduces a model of ordinal unidimensional measurement known as Mokken scale analysis. Mokken scaling is based on principles of Item Response Theory (IRT) that originated in the Guttman scale. I compare the Mokken model with both Classical Test Theory (reliability or factor analysis)

  13. The development of a single-item Food Choice Questionnaire

    NARCIS (Netherlands)

    Onwezen, M.C.; Reinders, M.J.; Verain, M.C.D.; Snoek, H.M.

    2019-01-01

    Based on the multi-item Food Choice Questionnaire (FCQ) originally developed by Steptoe and colleagues (1995), the current study developed a single-item FCQ that provides an acceptable balance between practical needs and psychometric concerns. Studies 1 (N = 1851) and 2 (2a (N = 3290), 2b (N =

  14. A single-item global job satisfaction measure is associated with quantitative blood immune indices in white-collar employees.

    Science.gov (United States)

    Nakata, Akinori; Irie, Masahiro; Takahashi, Masaya

    2013-01-01

    Although a single-item job satisfaction measure has been shown to be as reliable and inclusive as multiple-item scales in relation to health, studies including immunological data are few. The purpose of this study was to evaluate the validity of single-item job and family life satisfaction based on its association with immune indices. A total of 189 white-collar employees (70% men) underwent a blood draw for the measurement of natural killer (NK), total T, and B cell counts as well as plasma immunoglobulin (Ig) G concentrations and completed single-item job and family life satisfaction measures, respectively. The response options for satisfaction measures were 'dissatisfied' (coded 1) to 'satisfied' (coded 4). Spearman's partial correlations controlling for cofactors revealed that increased job satisfaction was positively associated with NK cells (rsp=0.201, p=0.007) and IgG (rsp=0.178, p=0.018), while family life satisfaction was unrelated to immune indices. Those who reported a combination of low job/low family life satisfaction had significantly lower NK and higher B cell counts than those with a high job/high family life satisfaction. Our study suggests that the single-item summary measure of job satisfaction, but not family life satisfaction, may be a valid tool to evaluate immune status in healthy white-collar employees.

  15. Improving Measurement Efficiency of the Inner EAR Scale with Item Response Theory.

    Science.gov (United States)

    Jessen, Annika; Ho, Andrew D; Corrales, C Eduardo; Yueh, Bevan; Shin, Jennifer J

    2018-02-01

    Objectives (1) To assess the 11-item Inner Effectiveness of Auditory Rehabilitation (Inner EAR) instrument with item response theory (IRT). (2) To determine whether the underlying latent ability could also be accurately represented by a subset of the items for use in high-volume clinical scenarios. (3) To determine whether the Inner EAR instrument correlates with pure tone thresholds and word recognition scores. Design IRT evaluation of prospective cohort data. Setting Tertiary care academic ambulatory otolaryngology clinic. Subjects and Methods Modern psychometric methods, including factor analysis and IRT, were used to assess unidimensionality and item properties. Regression methods were used to assess prediction of word recognition and pure tone audiometry scores. Results The Inner EAR scale is unidimensional, and items varied in their location and information. Information parameter estimates ranged from 1.63 to 4.52, with higher values indicating more useful items. The IRT model provided a basis for identifying 2 sets of items with relatively lower information parameters. Item information functions demonstrated which items added insubstantial value over and above other items and were removed in stages, creating an 8- and 3-item Inner EAR scale for more efficient assessment. The 8-item version accurately reflected the underlying construct. All versions correlated moderately with word recognition scores and pure tone averages. Conclusion The 11-, 8-, and 3-item versions of the Inner EAR scale have strong psychometric properties, and there is correlational validity evidence for the observed scores. Modern psychometric methods can help streamline care delivery by maximizing relevant information per item administered.

  16. Development and validation of 26-item dysfunctional attitude scale.

    Science.gov (United States)

    Ebrahimi, Amrollah; Samouei, Rahele; Mousavii, Sayyed Ghafour; Bornamanesh, Ali Reza

    2013-06-01

    Dysfunctional Attitude Scale is one of the most common instruments used to assess cognitive vulnerability. This study aimed to develop and validate a short form of Dysfunctional Attitude Scale appropriate for an Iranian clinical population. Participants were 160 psychiatric patients from medical centers affiliated with Isfahan Medical University, as well as 160 non-patients. Research instruments were clinical interviews based on the Diagnostic and Statistical Manual-IV-TR, Dysfunctional Attitude Scale and General Health Questionnaire (GHQ-28). Data was analyzed using multicorrelation calculations and factor analysis. Based on the results of factor analysis and item-total correlation, 14 items were judged candidates for omission. Analysis of the 26-item Dysfunctional Attitude Scale (DAS-26) revealed a Cronbach's alpha of 0.92. Evidence for the concurrent criterion validity was obtained through calculating the correlation between the Dysfunctional Attitude Scale and psychiatric diagnosis (r = 0.55), GHQ-28 (r = 0.56) and somatization, anxiety, social dysfunction, and depression subscales (0.45, 0.53, 0.48, and 0.57, respectively). Factor analysis deemed a four-factor structure the best. The factors were labeled as success-perfectionism, need for approval, need for satisfying others, and vulnerability-performance evaluation. The results showed that the Iranian version of the Dysfunctional Attitude Scale (DAS-26) bears satisfactory psychometric properties suggesting that this cognitive instrument is appropriate for use in an Iranian cultural context. Copyright © 2012 Wiley Publishing Asia Pty Ltd.

  17. Item-level factor analysis of the Self-Efficacy Scale.

    Science.gov (United States)

    Bunketorp Käll, Lina

    2014-03-01

    This study explores the internal structure of the Self-Efficacy Scale (SES) using item response analysis. The SES was previously translated into Swedish and modified to encompass all types of pain, not exclusively back pain. Data on perceived self-efficacy in 47 patients with subacute whiplash-associated disorders were derived from a previously conducted randomized-controlled trial. The item-level factor analysis was carried out using a six-step procedure. To further study the item inter-relationships and to determine the underlying structure empirically, the 20 items of the SES were also subjected to principal component analysis with varimax rotation. The analyses showed two underlying factors, named 'social activities' and 'physical activities', with seven items loading on each factor. The remaining six items of the SES appeared to measure somewhat different constructs and need to be analysed further.

  18. The 12 item Social and Economic Conservatism Scale (SECS).

    Science.gov (United States)

    Everett, Jim A C

    2013-01-01

    Recent years have seen a surge in psychological research on the relationship between political ideology (particularly conservatism) and cognition, affect, behaviour, and even biology. Despite this flurry of investigation, however, there is as yet no accepted, validated, and widely used multi-item scale of conservatism that is concise, that is modern in its conceptualisation, and that includes both social and economic conservatism subscales. In this paper the 12-Item Social and Economic Conservatism Scale (SECS) is proposed and validated to help fill this gap. The SECS is suggested to be an important and useful tool for researchers working in political psychology.

  19. An item-response theory approach to safety climate measurement: The Liberty Mutual Safety Climate Short Scales.

    Science.gov (United States)

    Huang, Yueng-Hsiang; Lee, Jin; Chen, Zhuo; Perry, MacKenna; Cheung, Janelle H; Wang, Mo

    2017-06-01

    Zohar and Luria's (2005) safety climate (SC) scale, measuring organization- and group-level SC each with 16 items, is widely used in research and practice. To improve the utility of the SC scale, we shortened the original full-length SC scales. Item response theory (IRT) analysis was conducted using a sample of 29,179 frontline workers from various industries. Based on graded response models, we shortened the original scales in two ways: (1) selecting items with above-average discriminating ability (i.e. offering more than 6.25% of the original total scale information), resulting in 8-item organization-level and 11-item group-level SC scales; and (2) selecting the most informative items that together retain at least 30% of original scale information, resulting in 4-item organization-level and 4-item group-level SC scales. All four shortened scales had acceptable reliability (≥0.89) and high correlations (≥0.95) with the original scale scores. The shortened scales will be valuable for academic research and practical survey implementation in improving occupational safety. Copyright © 2017 The Author(s). Published by Elsevier Ltd. All rights reserved.
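
    The two shortening rules in this abstract can be sketched in a few lines. The sketch assumes that each item's share of total scale information is already available from a fitted model. The shares below are hypothetical values for a 16-item scale, not the study's estimates, and the function names are illustrative. With 16 items, the average share is 1/16 = 6.25%, which is the threshold quoted in the abstract.

```python
def above_average_items(info_shares):
    """Indices of items whose share of total information exceeds the average share."""
    threshold = 1.0 / len(info_shares)  # 1/16 = 6.25% for a 16-item scale
    return [i for i, share in enumerate(info_shares) if share > threshold]


def most_informative_until(info_shares, target=0.30):
    """Add items from most to least informative until the target share is retained."""
    order = sorted(range(len(info_shares)), key=lambda i: info_shares[i], reverse=True)
    chosen, retained = [], 0.0
    for i in order:
        if retained >= target:
            break
        chosen.append(i)
        retained += info_shares[i]
    return chosen, retained


# Hypothetical information shares for 16 items (they sum to 1.0).
shares = [0.10, 0.09, 0.08, 0.08, 0.07, 0.07, 0.07, 0.065,
          0.06, 0.055, 0.05, 0.05, 0.045, 0.04, 0.04, 0.035]

rule_one = above_average_items(shares)
rule_two, retained_share = most_informative_until(shares, target=0.30)

print("rule 1 keeps items:", rule_one)
print("rule 2 keeps items:", rule_two, "retaining", round(retained_share, 3))
```

    The first rule depends only on the number of items, while the second depends on how unevenly information is spread across them, so the two rules can produce short forms of quite different lengths.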

  20. The 12 item Social and Economic Conservatism Scale (SECS).

    Directory of Open Access Journals (Sweden)

    Jim A C Everett

    Recent years have seen a surge in psychological research on the relationship between political ideology (particularly conservatism) and cognition, affect, behaviour, and even biology. Despite this flurry of investigation, however, there is as yet no accepted, validated, and widely used multi-item scale of conservatism that is concise, that is modern in its conceptualisation, and that includes both social and economic conservatism subscales. In this paper the 12-Item Social and Economic Conservatism Scale (SECS) is proposed and validated to help fill this gap. The SECS is suggested to be an important and useful tool for researchers working in political psychology.

  1. Refinement of the Brazilian Household Food Insecurity Measurement Scale: Recommendation for a 14-item EBIA

    Directory of Open Access Journals (Sweden)

    Ana Maria Segall-Corrêa

    2014-04-01

    OBJECTIVE: To review and refine Brazilian Household Food Insecurity Measurement Scale structure. METHODS: The study analyzed the impact of removing the item "adult lost weight" and one of two possibly redundant items on Brazilian Household Food Insecurity Measurement Scale psychometric behavior using the one-parameter logistic (Rasch) model. Brazilian Household Food Insecurity Measurement Scale psychometric behavior was analyzed with respect to acceptable adjustment values ranging from 0.7 to 1.3, and to severity scores of the items with theoretically expected gradients. The socioeconomic and food security indicators came from the 2004 National Household Sample Survey, which obtained complete answers to Brazilian Household Food Insecurity Measurement Scale items from 112,665 households. RESULTS: Removing the items "adult reduced amount..." followed by "adult ate less..." did not change the infit of the remaining items, except for "adult lost weight", whose infit increased from 1.21 to 1.56. The internal consistency and item severity scores did not change when "adult ate less" and one of the two redundant items were removed. CONCLUSION: Brazilian Household Food Insecurity Measurement Scale reanalysis reduced the number of scale items from 16 to 14 without changing its internal validity. Its use as a nationwide household food security measure is strongly recommended.

  2. Psychometric evaluation of the 10-item Short Opiate Withdrawal Scale-Gossop (SOWS-Gossop) in patients undergoing opioid detoxification.

    Science.gov (United States)

    Vernon, Margaret K; Reinders, Stefan; Mannix, Sally; Gullo, Kristen; Gorodetzky, Charles W; Clinch, Thomas

    2016-09-01

    The Short Opiate Withdrawal Scale (SOWS)-Gossop is a 10-item questionnaire developed to evaluate opioid withdrawal symptom severity. The scale was derived from the original 32-item Opiate Withdrawal Scale in order to reduce redundancy while providing an equally sensitive measure of opioid withdrawal symptom severity appropriate for research and clinical practice. The objective of this study was to examine the psychometric properties and provide score interpretation guidelines for the SOWS-Gossop 10-item version. Blinded, pooled data from two trials assessing the efficacy of lofexidine hydrochloride in reducing withdrawal symptoms in patients undergoing opioid detoxification were used to evaluate the quantitative psychometric properties and score interpretation of the SOWS-Gossop. Five hundred fifty-five (N=555) observations were available at baseline with numbers decreasing to n=213 at day 7. Mean (standard deviation) SOWS-Gossop scores were 10.4 (6.86) at baseline, 8.7 (6.49) on day 1, 10.5 (7.21) on day 2, and 3.1 (3.95) on day 7. Confirmatory factor analysis indicated that the SOWS-Gossop items loaded on a single factor consistent with a single total score. Intra-class correlations (95% confidence interval) were 0.78 (0.70-0.85) between baseline and day 1, 0.84 (0.79-0.89) between days 4 and 5, and 0.88 (0.83-0.91) between days 6 and 7, demonstrating good test-retest reliability. Mean SOWS-Gossop scores varied significantly (p […] opioid withdrawal and has excellent psychometric properties. The SOWS-Gossop is an appropriate, precise, and sensitive measure to evaluate the symptoms of acute opioid withdrawal in research or clinical settings. Copyright © 2016 Elsevier Ltd. All rights reserved.

  3. A proposal for a new Brazilian six-item version of the Edinburgh Postnatal Depression Scale

    Directory of Open Access Journals (Sweden)

    Maicon Rodrigues Albuquerque

    Introduction: Factor analysis of the Edinburgh Postnatal Depression Scale (EPDS) could result in a shorter and easier to handle screening tool. Therefore, the aim of this study was to check and compare the metrics of two different 6-item EPDS subscales. Methods: We administered the EPDS to a total of 3,891 women who had given birth between 1 and 3 months previously. We conducted confirmatory and exploratory factor analyses and plotted receiver-operating characteristics (ROC) curves to, respectively, determine construct validity, scale items' fit to the data, and ideal cutoff scores for the short versions. Results: A previously defined 6-item scale did not exhibit construct validity for our sample. Nevertheless, we used exploratory factor analysis to derive a new 6-item scale with very good construct validity. The area under the ROC curve of the new 6-item scale was 0.986 and the ideal cutoff score was ≥ 6. Conclusions: The new 6-item scale has adequate psychometric properties and similar ROC curve values to the 10-item version and offers a means of reducing the cost and time taken to administer the instrument.

  4. Translation Fidelity of Psychological Scales: An Item Response Theory Analysis of an Individualism-Collectivism Scale.

    Science.gov (United States)

    Bontempo, Robert

    1993-01-01

    Describes a method for assessing the quality of translations based on item response theory (IRT). Results from the IRT technique with French and Chinese versions of a scale measuring individualism-collectivism for samples of 250 U.S., 357 French, and 290 Chinese undergraduates show how several biased items are detected. (SLD)

  5. Characterizing Sources of Uncertainty in Item Response Theory Scale Scores

    Science.gov (United States)

    Yang, Ji Seung; Hansen, Mark; Cai, Li

    2012-01-01

    Traditional estimators of item response theory scale scores ignore uncertainty carried over from the item calibration process, which can lead to incorrect estimates of the standard errors of measurement (SEMs). Here, the authors review a variety of approaches that have been applied to this problem and compare them on the basis of their statistical…

  6. Examining the Effect of Reverse Worded Items on the Factor Structure of the Need for Cognition Scale.

    Directory of Open Access Journals (Sweden)

    Xijuan Zhang

    Reverse worded (RW) items are often used to reduce or eliminate acquiescence bias, but there is a rising concern about their harmful effects on the covariance structure of the scale. Therefore, results obtained via traditional covariance analyses may be distorted. This study examined the effect of the RW items on the factor structure of the abbreviated 18-item Need for Cognition (NFC) scale using confirmatory factor analysis. We modified the scale to create three revised versions, varying from no RW items to all RW items. We also manipulated the type of the RW items (polar opposite vs. negated). To each of the four scales, we fit four previously developed models. The four models included a 1-factor model, a 2-factor model distinguishing between positively worded (PW) items and RW items, and two 2-factor models, each with one substantive factor and one method factor. Results showed that the number and type of the RW items affected the factor structure of the NFC scale. Consistent with previous research findings, for the original NFC scale, which contains both PW and RW items, the 1-factor model did not have good fit. In contrast, for the revised scales that had no RW items or all RW items, the 1-factor model had reasonably good fit. In addition, for the scale with polar opposite and negated RW items, the factor model with a method factor among the polar opposite items had considerably better fit than the 1-factor model.

  7. An Analysis of the Connectedness to Nature Scale Based on Item Response Theory.

    Science.gov (United States)

    Pasca, Laura; Aragonés, Juan I; Coello, María T

    2017-01-01

    The Connectedness to Nature Scale (CNS) is used as a measure of the subjective cognitive connection between individuals and nature. However, to date, it has not been analyzed at the item level to confirm its quality. In the present study, we conduct such an analysis based on Item Response Theory. We employed data from previous studies using the Spanish-language version of the CNS, analyzing a sample of 1008 participants. The results show that seven items presented appropriate indices of discrimination and difficulty, in addition to a good fit. The remaining six have inadequate discrimination indices and do not present a good fit. A second study with 321 participants shows that the seven-item scale has adequate levels of reliability and validity. Therefore, it would be appropriate to use a reduced version of the scale after eliminating the items that display inappropriate behavior, since they may interfere with research results on connectedness to nature.

  8. Matrix Sampling of Items in Large-Scale Assessments

    Directory of Open Access Journals (Sweden)

    Ruth A. Childs

    2003-07-01

    Matrix sampling of items, that is, division of a set of items into different versions of a test form, is used by several large-scale testing programs. Like other test designs, matrixed designs have both advantages and disadvantages. For example, testing time per student is less than if each student received all the items, but the comparability of student scores may decrease. Also, curriculum coverage is maintained, but reporting of scores becomes more complex. In this paper, matrixed designs are compared with more traditional designs in nine categories of costs: development costs, materials costs, administration costs, educational costs, scoring costs, reliability costs, comparability costs, validity costs, and reporting costs. In choosing among test designs, a testing program should examine the costs in light of its mandate(s), the content of the tests, and the financial resources available, among other considerations.

  9. A New Functional Health Literacy Scale for Japanese Young Adults Based on Item Response Theory.

    Science.gov (United States)

    Tsubakita, Takashi; Kawazoe, Nobuo; Kasano, Eri

    2017-03-01

    Health literacy predicts health outcomes. Despite concerns surrounding the health of Japanese young adults, to date there has been no objective assessment of health literacy in this population. This study aimed to develop a Functional Health Literacy Scale for Young Adults (funHLS-YA) based on item response theory. Each item in the scale requires participants to choose the most relevant term from 3 choices in relation to a target item, thus assessing objective rather than perceived health literacy. The 20-item scale was administered to 1816 university students and 1751 responded. Cronbach's α coefficient was .73. Difficulty and discrimination parameters of each item were estimated, resulting in the exclusion of 1 item. Some items showed different difficulty parameters for male and female participants, reflecting that some aspects of health literacy may differ by gender. The current 19-item version of funHLS-YA can reliably assess the objective health literacy of Japanese young adults.

  10. The Body Appreciation Scale-2: item refinement and psychometric evaluation.

    Science.gov (United States)

    Tylka, Tracy L; Wood-Barcalow, Nichole L

    2015-01-01

    Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. Copyright © 2014 Elsevier Ltd. All rights reserved.

  11. Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers.

    Science.gov (United States)

    Stochl, Jan; Jones, Peter B; Croudace, Tim J

    2012-06-11

    Well-being Scale (WEMWBS) met criteria for the monotone homogeneity model but four items violated double monotonicity with respect to a single underlying dimension. Software availability and commands used to specify unidimensionality and reliability analysis and graphical displays for diagnosing monotone homogeneity and double monotonicity are discussed, with an emphasis on current implementations in freeware.

  12. Mokken scale analysis of mental health and well-being questionnaire item responses: a non-parametric IRT method in empirical research for applied health researchers

    Directory of Open Access Journals (Sweden)

    Stochl Jan

    2012-06-01

    confirmed that all 14 positively worded items of the Warwick-Edinburgh Mental Well-being Scale (WEMWBS) met criteria for the monotone homogeneity model but four items violated double monotonicity with respect to a single underlying dimension. Software availability and commands used to specify unidimensionality and reliability analysis and graphical displays for diagnosing monotone homogeneity and double monotonicity are discussed, with an emphasis on current implementations in freeware.

  13. Geriatric Anxiety Scale: item response theory analysis, differential item functioning, and creation of a ten-item short form (GAS-10).

    Science.gov (United States)

    Mueller, Anne E; Segal, Daniel L; Gavett, Brandon; Marty, Meghan A; Yochim, Brian; June, Andrea; Coolidge, Frederick L

    2015-07-01

    The Geriatric Anxiety Scale (GAS; Segal, D. L., June, A., Payne, M., Coolidge, F. L. and Yochim, B. (2010). Journal of Anxiety Disorders, 24, 709-714. doi:10.1016/j.janxdis.2010.05.002) is a self-report measure of anxiety that was designed to address unique issues associated with anxiety assessment in older adults. This study is the first to use item response theory (IRT) to examine the psychometric properties of a measure of anxiety in older adults. A large sample of older adults (n = 581; mean age = 72.32 years, SD = 7.64 years, range = 60 to 96 years; 64% women; 88% European American) completed the GAS. IRT properties were examined. The presence of differential item functioning (DIF) or measurement bias by age and sex was assessed, and a ten-item short form of the GAS (called the GAS-10) was created. All GAS items had discrimination parameters of 1.07 or greater. Items from the somatic subscale tended to have lower discrimination parameters than items on the cognitive or affective subscales. Two items were flagged for DIF, but the impact of the DIF was negligible. Women scored significantly higher than men on the GAS and its subscales. Participants in the young-old group (60 to 79 years old) scored significantly higher on the cognitive subscale than participants in the old-old group (80 years old and older). Results from the IRT analyses indicated that the GAS and GAS-10 have strong psychometric properties among older adults. We conclude by discussing implications and future research directions.

  14. Work ability as prognostic risk marker of disability pension: single-item work ability score versus multi-item work ability index.

    Science.gov (United States)

    Roelen, Corné A M; van Rhenen, Willem; Groothoff, Johan W; van der Klink, Jac J L; Twisk, Jos W R; Heymans, Martijn W

    2014-07-01

    Work ability predicts future disability pension (DP). A single-item work ability score (WAS) is emerging as a measure for work ability. This study compared single-item WAS with the multi-item work ability index (WAI) in its ability to identify workers at risk of DP. This prospective cohort study comprised 11 537 male construction workers, who completed the WAI at baseline and reported DP after a mean 2.3 years of follow-up. WAS and WAI were calibrated for DP risk predictions with the Hosmer-Lemeshow (H-L) test and their ability to discriminate between high- and low-risk construction workers was investigated with the area under the receiver operating characteristic curve (AUC). At follow-up, 336 (3%) construction workers reported DP. Both WAS [odds ratio (OR) 0.72, 95% confidence interval (95% CI) 0.66-0.78] and WAI (OR 0.57, 95% CI 0.52-0.63) scores were associated with DP at follow-up. The WAS showed miscalibration (H-L model χ²=10.60; df=3; P=0.01) and poorly discriminated between high- and low-risk construction workers (AUC 0.67, 95% CI 0.64-0.70). In contrast, calibration (H-L model χ²=8.20; df=8; P=0.41) and discrimination (AUC 0.78, 95% CI 0.75-0.80) were both adequate for the WAI. Although associated with the risk of future DP, the single-item WAS poorly identified male construction workers at risk of DP. We recommend using the multi-item WAI to screen for risk of DP in occupational health practice.
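
    The discrimination statistic reported above (AUC) can be read as the probability that a randomly chosen case receives a higher risk score than a randomly chosen non-case. The following is only a minimal sketch of that pairwise definition on made-up numbers; the function name `auc` and the toy scores are illustrative assumptions, not the authors' analysis or data.

```python
def auc(case_scores, noncase_scores):
    """Area under the ROC curve via pairwise comparison.

    Counts the share of (case, non-case) pairs in which the case has the
    higher risk score; ties count as half. Inputs are hypothetical risk
    scores where larger means higher predicted risk.
    """
    wins = 0.0
    for c in case_scores:
        for n in noncase_scores:
            if c > n:
                wins += 1.0
            elif c == n:
                wins += 0.5
    return wins / (len(case_scores) * len(noncase_scores))


# Toy example (invented values): perfect separation vs. no separation.
perfect = auc([3, 4], [1, 2])
chance = auc([1, 2], [1, 2])
```

    Under this reading, an AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation. A score on which lower values indicate higher risk would need to be reversed before being passed in.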

  15. A Polytomous Item Response Theory Analysis of Social Physique Anxiety Scale

    Science.gov (United States)

    Fletcher, Richard B.; Crocker, Peter

    2014-01-01

    The present study investigated the social physique anxiety scale's factor structure and item properties using confirmatory factor analysis and item response theory. An additional aim was to identify differences in response patterns between groups (gender). A large sample of high school students aged 11-15 years (N = 1,529) consisting of n =…

  16. The Piper Fatigue Scale-12 (PFS-12): psychometric findings and item reduction in a cohort of breast cancer survivors.

    Science.gov (United States)

    Reeve, Bryce B; Stover, Angela M; Alfano, Catherine M; Smith, Ashley Wilder; Ballard-Barbash, Rachel; Bernstein, Leslie; McTiernan, Anne; Baumgartner, Kathy B; Piper, Barbara F

    2012-11-01

    Brief, valid measures of fatigue, a prevalent and distressing cancer symptom, are needed for use in research. This study's primary aim was to create a shortened version of the revised Piper Fatigue Scale (PFS-R) based on data from a diverse cohort of breast cancer survivors. A secondary aim was to determine whether the PFS captured multiple distinct aspects of fatigue (a multidimensional model) or a single overall fatigue factor (a unidimensional model). Breast cancer survivors (n = 799; stages in situ through IIIa; ages 29-86 years) were recruited through three SEER registries (New Mexico, Western Washington, and Los Angeles, CA) as part of the Health, Eating, Activity, and Lifestyle (HEAL) study. Fatigue was measured approximately 3 years post-diagnosis using the 22-item PFS-R that has four subscales (Behavior, Affect, Sensory, and Cognition). Confirmatory factor analysis was used to compare unidimensional and multidimensional models. Six criteria were used to make item selections to shorten the PFS-R: scale's content validity, items' relationship with fatigue, content redundancy, differential item functioning by race and/or education, scale reliability, and literacy demand. Factor analyses supported the original 4-factor structure. There was also evidence from the bi-factor model for a dominant underlying fatigue factor. Six items tested positive for differential item functioning between African-American and Caucasian survivors. Four additional items either showed poor association, local dependence, or content validity concerns. After removing these 10 items, the reliability of the PFS-12 subscales ranged from 0.87 to 0.89, compared to 0.90-0.94 prior to item removal. The newly developed PFS-12 can be used to assess fatigue in African-American and Caucasian breast cancer survivors and reduces response burden without compromising reliability or validity. This is the first study to determine PFS literacy demand and to compare PFS-R responses in African

  17. Internal consistency of a five-item form of the Francis Scale of Attitude Toward Christianity among adolescent students.

    Science.gov (United States)

    Campo-Arias, Adalberto; Oviedo, Heidi Celina; Cogollo, Zuleima

    2009-04-01

    The short form of the Francis Scale of Attitude Toward Christianity (L. J. Francis, 1992) is a 7-item Likert-type scale that shows high homogeneity among adolescents. The psychometric performance of a shorter version of this scale has not been explored. The authors aimed to determine the internal consistency of a 5-item form of the Francis Scale of Attitude Toward Christianity among 405 students from a school in Cartagena, Colombia. The authors computed the Cronbach's alpha coefficient for the 5 items with a greater corrected item-total score correlation. The version without Items 2 and 7 showed internal consistency of .87. The 5-item version of the Francis Scale of Attitude Toward Christianity exhibited higher internal consistency than did the 7-item version. Future researchers should corroborate this finding.
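
    The item-reduction step described above (keep the items with the highest corrected item-total correlation, then compute Cronbach's alpha on the shortened form) can be sketched as follows. This is a minimal illustration using only the Python standard library; the function names and the toy response matrix are assumptions made for the example and are not taken from the study.

```python
from math import sqrt
from statistics import mean, variance


def pearson(x, y):
    """Pearson correlation of two equal-length lists (assumes non-zero variance)."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def cronbach_alpha(rows):
    """Cronbach's alpha; rows is a list of respondents, each a list of item scores."""
    k = len(rows[0])
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def corrected_item_total(rows, j):
    """Correlation of item j with the sum of all the other items."""
    item = [r[j] for r in rows]
    rest = [sum(r) - r[j] for r in rows]
    return pearson(item, rest)


def select_items(rows, n_keep):
    """Indices of the n_keep items with the highest corrected item-total correlation."""
    k = len(rows[0])
    ranked = sorted(range(k), key=lambda j: corrected_item_total(rows, j), reverse=True)
    return sorted(ranked[:n_keep])


# Toy data (invented): 4 respondents, 3 items; the third item tracks the others poorly.
responses = [[1, 1, 3], [2, 2, 1], [3, 3, 2], [4, 4, 4]]
kept = select_items(responses, 2)
alpha_full = cronbach_alpha(responses)
alpha_short = cronbach_alpha([[r[j] for j in kept] for r in responses])
```

    In this toy matrix the weakly related third item is dropped and alpha rises for the shortened form, which mirrors the direction of the reported finding but says nothing about its size.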

  18. The importance of rating scale design in the measurement of patient-reported outcomes using questionnaires or item banks.

    Science.gov (United States)

    Khadka, Jyoti; McAlinden, Colm; Gothwal, Vijaya K; Lamoureux, Ecosse L; Pesudovs, Konrad

    2012-06-26

    To investigate the effect of rating scale designs (question formats and response categories) on item difficulty calibrations and assess the impact that rating scale differences have on overall vision-related activity limitation (VRAL) scores. Sixteen existing patient-reported outcome instruments (PROs) suitable for cataract assessment, with different rating scales, were self-administered by patients on a cataract surgery waiting list. A total of 226 VRAL items from these PROs in their native rating scales were included in an item bank and calibrated using Rasch analysis. Fifteen item/content areas (e.g., reading newspapers) appearing in at least three different PROs were identified. Within each content area, item calibrations were compared and their range calculated. Similarly, five PROs having at least three items in common with the Visual Function (VF-14) were compared in terms of average item measures. A total of 614 patients (mean age ± SD, 74.1 ± 9.4 years) participated. Items with the same content varied in their calibration by as much as two logits; "reading the small print" had the largest range (1.99 logits) followed by "watching TV" (1.60). Compared with the VF-14 (0.00 logits), the rating scale of the Visual Disability Assessment (1.13 logits) produced the most difficult items and the Cataract Symptom Scale (0.24 logits) produced the least difficult items. The VRAL item bank was suboptimally targeted to the ability level of the participants (2.00 logits). Rating scale designs have a significant effect on item calibrations. Therefore, constructing item banks from existing items in their native formats carries risks to face validity and transmission of problems inherent in existing instruments, such as poor targeting.

  19. Development and validation of an item response theory-based Social Responsiveness Scale short form.

    Science.gov (United States)

    Sturm, Alexandra; Kuhfeld, Megan; Kasari, Connie; McCracken, James T

    2017-09-01

    Research and practice in autism spectrum disorder (ASD) rely on quantitative measures, such as the Social Responsiveness Scale (SRS), for characterization and diagnosis. Like many ASD diagnostic measures, SRS scores are influenced by factors unrelated to ASD core features. This study further interrogates the psychometric properties of the SRS using item response theory (IRT), and demonstrates a strategy to create a psychometrically sound short form by applying IRT results. Social Responsiveness Scale analyses were conducted on a large sample (N = 21,426) of youth from four ASD databases. Items were subjected to item factor analyses and evaluation of item bias by gender, age, expressive language level, behavior problems, and nonverbal IQ. Item selection based on item psychometric properties, DIF analyses, and substantive validity produced a reduced item SRS short form that was unidimensional in structure, highly reliable (α = .96), and free of gender, age, expressive language, behavior problems, and nonverbal IQ influence. The short form also showed strong relationships with established measures of autism symptom severity (ADOS, ADI-R, Vineland). Degree of association between all measures varied as a function of expressive language. Results identified specific SRS items that are more vulnerable to non-ASD-related traits. The resultant 16-item SRS short form may possess superior psychometric properties compared to the original scale and emerge as a more precise measure of ASD core symptom severity, facilitating research and practice. Future research using IRT is needed to further refine existing measures of autism symptomatology. © 2017 Association for Child and Adolescent Mental Health.

  20. Gender Invariance of the Gambling Behavior Scale for Adolescents (GBS-A): An Analysis of Differential Item Functioning Using Item Response Theory.

    Science.gov (United States)

    Donati, Maria Anna; Chiesi, Francesca; Izzo, Viola A; Primi, Caterina

    2017-01-01

    As there is a lack of evidence attesting to equivalent item functioning across genders for the most widely used instruments measuring pathological gambling in adolescence, the present study aimed to test the gender invariance of the Gambling Behavior Scale for Adolescents (GBS-A), a new measurement tool to assess the severity of Gambling Disorder (GD) in adolescents. The equivalence of the items across genders was assessed by analyzing Differential Item Functioning within an Item Response Theory framework. The GBS-A was administered to 1,723 adolescents, and the graded response model was employed. The results attested to the measurement equivalence of the GBS-A when administered to male and female adolescent gamblers. Overall, findings provided evidence that the GBS-A is an effective measurement tool of the severity of GD in male and female adolescents and that the scale was unbiased and able to reveal true gender differences. As such, the GBS-A can be profitably used in educational interventions and clinical treatments with young people.

  1. Item response theory analysis of the Lichtenberg Financial Decision Screening Scale.

    Science.gov (United States)

    Teresi, Jeanne A; Ocepek-Welikson, Katja; Lichtenberg, Peter A

    2017-01-01

    The focus of these analyses was to examine the psychometric properties of the Lichtenberg Financial Decision Screening Scale (LFDSS). The purpose of the screen was to evaluate the decisional abilities and vulnerability to exploitation of older adults. Adults aged 60 and over were interviewed by social, legal, financial, or health services professionals who underwent in-person training on the administration and scoring of the scale. Professionals provided a rating of the decision-making abilities of the older adult. The analytic sample included 213 individuals with an average age of 76.9 (SD = 10.1). The majority (57%) were female. Data were analyzed using item response theory (IRT) methodology. The results supported the unidimensionality of the item set. Several IRT models were tested. Ten ordinal and binary items evidenced a slightly higher reliability estimate (0.85) than other versions and better coverage in terms of the range of reliable measurement across the continuum of financial incapacity.

  2. Validation of the Spanish versions of the long (26 items) and short (12 items) forms of the Self-Compassion Scale (SCS).

    Science.gov (United States)

    Garcia-Campayo, Javier; Navarro-Gil, Mayte; Andrés, Eva; Montero-Marin, Jesús; López-Artal, Lorena; Demarzo, Marcelo Marcos Piva

    2014-01-10

    Self-compassion is a key psychological construct for assessing clinical outcomes in mindfulness-based interventions. The aim of this study was to validate the Spanish versions of the long (26 item) and short (12 item) forms of the Self-Compassion Scale (SCS). The translated Spanish versions of both subscales were administered to two independent samples: Sample 1 was comprised of university students (n = 268) who were recruited to validate the long form, and Sample 2 was comprised of Aragon Health Service workers (n = 271) who were recruited to validate the short form. In addition to SCS, the Mindful Attention Awareness Scale (MAAS), the State-Trait Anxiety Inventory-Trait (STAI-T), the Beck Depression Inventory (BDI) and the Perceived Stress Questionnaire (PSQ) were administered. Construct validity, internal consistency, test-retest reliability and convergent validity were tested. The Confirmatory Factor Analysis (CFA) of the long and short forms of the SCS confirmed the original six-factor model in both scales, showing goodness of fit. Cronbach's α for the 26 item SCS was 0.87 (95% CI = 0.85-0.90) and ranged between 0.72 and 0.79 for the 6 subscales. Cronbach's α for the 12-item SCS was 0.85 (95% CI = 0.81-0.88) and ranged between 0.71 and 0.77 for the 6 subscales. The long (26-item) form of the SCS showed a test-retest coefficient of 0.92 (95% CI = 0.89-0.94). The Intraclass Correlation (ICC) for the 6 subscales ranged from 0.84 to 0.93. The short (12-item) form of the SCS showed a test-retest coefficient of 0.89 (95% CI: 0.87-0.93). The ICC for the 6 subscales ranged from 0.79 to 0.91. The long and short forms of the SCS exhibited a significant negative correlation with the BDI, the STAI and the PSQ, and a significant positive correlation with the MAAS. The correlation between the total score of the long and short SCS form was r = 0.92. The Spanish versions of the long (26-item) and short (12-item) forms of the SCS are valid and

  3. Developing a Model for Optimizing Inventory of Repairable Items at Single Operating Base

    OpenAIRE

    Le, Tin

    2016-01-01

    The use of the EOQ model in inventory management is popular. However, EOQ models have many disadvantages, especially when the model is applied to manage repairable items. In order to deal with high-cost and repairable items, Craig C. Sherbrooke introduced a model in his book “Optimal Inventory Modeling of Systems: Multi-Echelon Techniques”. The research focus is to implement and develop a program to execute the single-site inventory model for repairable items. The model helps to significantl...

  4. The Australian Racism, Acceptance, and Cultural-Ethnocentrism Scale (RACES): item response theory findings.

    Science.gov (United States)

    Grigg, Kaine; Manderson, Lenore

    2016-03-17

    Racism and associated discrimination are pervasive and persistent challenges with multiple cumulative deleterious effects contributing to inequities in various health outcomes. Globally, research over the past decade has shown consistent associations between racism and negative health concerns. Such research confirms that race endures as one of the strongest predictors of poor health. Due to the lack of validated Australian measures of racist attitudes, RACES (Racism, Acceptance, and Cultural-Ethnocentrism Scale) was developed. Here, we examine RACES' psychometric properties, including the latent structure, utilising Item Response Theory (IRT). Unidimensional and Multidimensional Rating Scale Model (RSM) Rasch analyses were utilised with 296 Victorian primary school students and 182 adolescents and 220 adults from the Australian community. RACES was demonstrated to be a robust 24-item three-dimensional scale of Accepting Attitudes (12 items), Racist Attitudes (8 items), and Ethnocentric Attitudes (4 items). RSM Rasch analyses provide strong support for the instrument as a robust measure of racist attitudes in the Australian context, and for the overall factorial and construct validity of RACES across primary school children, adolescents, and adults. RACES provides a reliable and valid measure that can be utilised across the lifespan to evaluate attitudes towards all racial, ethnic, cultural, and religious groups. A core function of RACES is to assess the effectiveness of interventions to reduce community levels of racism and in turn inequities in health outcomes within Australia.

  5. The MIMIC Method with Scale Purification for Detecting Differential Item Functioning

    Science.gov (United States)

    Wang, Wen-Chung; Shih, Ching-Lin; Yang, Chih-Chien

    2009-01-01

    This study implements a scale purification procedure onto the standard MIMIC method for differential item functioning (DIF) detection and assesses its performance through a series of simulations. It is found that the MIMIC method with scale purification (denoted as M-SP) outperforms the standard MIMIC method (denoted as M-ST) in controlling…

  6. Should Global Items on Student Rating Scales Be Used for Summative Decisions?

    Science.gov (United States)

    Berk, Ronald A.

    2013-01-01

    One of the simplest indicators of teaching or course effectiveness is student ratings on one or more global items from the entire rating scale. That approach seems intuitively sound and easy to use. Global items have even been recommended by a few researchers to get a quick-read, at-a-glance summary for summative decisions about faculty. The…

  7. An Item Response Theory Analysis of the Community of Inquiry Scale

    Science.gov (United States)

    Horzum, Mehmet Baris; Uyanik, Gülden Kaya

    2015-01-01

    The aim of this study is to examine the validity and reliability of the Community of Inquiry Scale, commonly used in online learning, by means of Item Response Theory. For this purpose, Community of Inquiry Scale version 14 was administered via the internet to 1,499 students of a distance education center's online learning programs at a Turkish state university.…

  8. The Protective Behavioral Strategies for Marijuana Scale: Further examination using item response theory.

    Science.gov (United States)

    Pedersen, Eric R; Huang, Wenjing; Dvorak, Robert D; Prince, Mark A; Hummer, Justin F

    2017-08-01

    Given recent state legislation legalizing marijuana for recreational purposes and majority popular opinion favoring these laws, we developed the Protective Behavioral Strategies for Marijuana scale (PBSM) to identify strategies that may mitigate the harms related to marijuana use among those young people who choose to use the drug. In the current study, we expand on the initial exploratory study of the PBSM to further validate the measure with a large and geographically diverse sample (N = 2,117; 60% women, 30% non-White) of college students from 11 different universities across the United States. We sought to develop a psychometrically sound item bank for the PBSM and to create a short assessment form that minimizes respondent burden and time. Quantitative item analyses, including exploratory and confirmatory factor analyses with item response theory (IRT) and evaluation of differential item functioning (DIF), revealed an item bank of 36 items that was examined for unidimensionality and good content coverage, as well as a short form of 17 items that is free of bias in terms of gender (men vs. women), race (White vs. non-White), ethnicity (Hispanic vs. non-Hispanic), and recreational marijuana use legal status (state recreational marijuana was legal for 25.5% of participants). We also provide a scoring table for easy transformation from sum scores to IRT scale scores. The PBSM item bank and short form associated strongly and negatively with past month marijuana use and consequences. The measure may be useful to researchers and clinicians conducting intervention and prevention programs with young adults. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  9. Concurrent validity and sensitivity to change of Direct Behavior Rating Single-Item Scales (DBR-SIS) within an elementary sample.

    Science.gov (United States)

    Smith, Rhonda L; Eklund, Katie; Kilgus, Stephen P

    2018-03-01

    The purpose of this study was to evaluate the concurrent validity, sensitivity to change, and teacher acceptability of Direct Behavior Rating single-item scales (DBR-SIS), a brief progress monitoring measure designed to assess student behavioral change in response to intervention. Twenty-four elementary teacher-student dyads implemented a daily report card intervention to promote positive student behavior during prespecified classroom activities. During both baseline and intervention, teachers completed DBR-SIS ratings of 2 target behaviors (i.e., Academic Engagement, Disruptive Behavior) whereas research assistants collected systematic direct observation (SDO) data in relation to the same behaviors. Five change metrics (i.e., absolute change, percent of change from baseline, improvement rate difference, Tau-U, and standardized mean difference; Gresham, 2005) were calculated for both DBR-SIS and SDO data, yielding estimates of the change in student behavior in response to intervention. Mean DBR-SIS scores were predominantly moderately to highly correlated with SDO data within both baseline and intervention, demonstrating evidence of the former's concurrent validity. DBR-SIS change metrics were also significantly correlated with SDO change metrics for both Disruptive Behavior and Academic Engagement, yielding evidence of the former's sensitivity to change. In addition, teacher Usage Rating Profile-Assessment (URP-A) ratings indicated they found DBR-SIS to be acceptable and usable. Implications for practice, study limitations, and areas of future research are discussed. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  10. Single-item memory, associative memory, and the human hippocampus

    OpenAIRE

    Gold, Jeffrey J.; Hopkins, Ramona O.; Squire, Larry R.

    2006-01-01

    We tested recognition memory for items and associations in memory-impaired patients with bilateral lesions thought to be limited to the hippocampal region. In Experiment 1 (Combined memory test), participants studied words and then took a memory test in which studied words, new words, studied word pairs, and recombined word pairs were presented in a mixed order. In Experiment 2 (Separated memory test), participants studied single words and then took a memory test involving studied word and ne...

  11. 48 CFR 245.7101-3 - DD Form 1348-1, DoD Single Line Item Release/Receipt Document.

    Science.gov (United States)

    2010-10-01

    ... 48 Federal Acquisition Regulations System 3 2010-10-01 2010-10-01 false DD Form 1348-1, DoD Single Line Item Release/Receipt Document. 245.7101-3 Section 245.7101-3 Federal Acquisition Regulations... PROPERTY Plant Clearance Forms 245.7101-3 DD Form 1348-1, DoD Single Line Item Release/Receipt Document...

  12. Validation of the Single-Factor Model of the Relationship Assessment Scale among Married and Cohabiting Persons from Monterrey, Mexico

    Directory of Open Access Journals (Sweden)

    José Moral de la Rubia

    2015-07-01

    The study of intimate partner relationships is particularly important because this union is the foundation of the family. Satisfaction with the relationship can be defined as the overall attitude to the relationship and the partner. Hendrick's Relationship Assessment Scale (RAS) is an instrument commonly used to assess the construct. Previous research papers have shown that this scale has high internal consistency and a single-factor structure. Although there are validation studies of the RAS, these studies used inappropriate statistical techniques to analyze its Likert-type items and to determine the number of factors; likewise, its factor invariance across sex has not been previously tested. Therefore, this study posed the following research questions: Does the RAS have consistent and discriminating items? Basing the analysis on a polychoric correlation matrix, what is its level of internal consistency? How many factors emerge using rigorous empirical methods? Is the single-factor model invariant across sex? In order to answer these research questions, we used random route probability sampling in this instrument validation study of the RAS. The sample was extracted from the population of married couples or the ones living in consensual union in Monterrey, Mexico. There were 431 female and 376 male participants in the study. The RAS’ items were consistent and discriminative. The internal consistency of the scale was excellent in the whole sample (ordinal α = .93), as well as among female (ordinal α = .94) and male participants (ordinal α = .92). Horn's parallel analysis and Velicer's minimum average partial test suggested a one-factor solution. Moreover, the single-factor model (with one correlation between the residuals of the two negatively worded items) had a close fit to the data, and its properties of invariance across sex were very acceptable by the Unweighted Least Squares method. We conclude that the scale shows internal

  13. Structural validity of a 16-item abridged version of the Cervantes Health-Related Quality of Life scale for menopause: the Cervantes Short-Form Scale.

    Science.gov (United States)

    Coronado, Pluvio J; Borrego, Rafael Sánchez; Palacios, Santiago; Ruiz, Miguel A; Rejas, Javier

    2015-03-01

    The Cervantes Scale is a specific health-related quality of life questionnaire that was originally developed in Spanish to be used in Spain for women through and beyond menopause. It contains 31 items and is time-consuming. The aim of this study was to produce an abridged version with the same dimensional structure and with similar psychometric properties. A representative sample of 516 postmenopausal women (mean [SD] age, 57 [4.31] y) seen in outpatient gynecology clinics and extracted from an observational cross-sectional study was used. Item analysis, internal consistency reliability, item-total and item-dimension correlations, and item correlation with the 12-item Medical Outcomes Study Short Form Health Survey Version 2.0 were studied. Dimensional and full-model confirmatory factor analyses were used to check structure stability. A threefold cross-validation method was used to obtain stable estimates by means of multigroup analysis. The scale was reduced to a 16-item version, the Cervantes Short-Form Scale, containing four main dimensions (Menopause and Health, Psychological, Sexuality, and Couple Relations), with the first dimension composed of three subdimensions (Vasomotor Symptoms, Health, and Aging). Goodness-of-fit statistics were better than those of the extended version (χ²/df = 2.493; adjusted goodness-of-fit index, 0.802; parsimony comparative fit index, 0.749; root mean square error of approximation, 0.054). Internal consistency was good (Cronbach's α = 0.880). Correlations between the extended and the reduced dimensions were high and significant in all cases (P < 0.001; r values ranged from 0.90 for Sexuality to 0.969 for Vasomotor Symptoms). The Cervantes Scale can be reduced to a 16-item abridged version (Cervantes Short-Form Scale) that maintains the original dimensional structure and psychometric properties. At 51% of the original length, this version can be administered faster, making it especially suitable for routine medical practice.

  14. Item response modeling: a psychometric assessment of the children's fruit, vegetable, water, and physical activity self-efficacy scales among Chinese children.

    Science.gov (United States)

    Wang, Jing-Jing; Chen, Tzu-An; Baranowski, Tom; Lau, Patrick W C

    2017-09-16

    This study aimed to evaluate the psychometric properties of four self-efficacy scales (i.e., self-efficacy for fruit (FSE), vegetable (VSE), and water (WSE) intakes, and physical activity (PASE)) and to investigate their differences in item functioning across sex, age, and body weight status groups using item response modeling (IRM) and differential item functioning (DIF). Four self-efficacy scales were administered to 763 Hong Kong Chinese children (55.2% boys) aged 8-13 years. Classical test theory (CTT) was used to examine the reliability and factorial validity of the scales. IRM was conducted and DIF analyses were performed to assess the characteristics of item parameter estimates on the basis of children's sex, age and body weight status. All self-efficacy scales demonstrated adequate to excellent internal consistency reliability (Cronbach's α: 0.79-0.91). One FSE misfit item and one PASE misfit item were detected. Small DIF was found for all the scale items across children's age groups. Items with medium to large DIF were detected in different sex and body weight status groups, which will require modification. A Wright map revealed that items covered the range of the distribution of participants' self-efficacy for each scale except VSE. Several self-efficacy scales' items functioned differently by children's sex and body weight status. Additional research is required to modify the four self-efficacy scales to minimize these moderating influences for application.

  15. Stability of the Spanish version of the five-item Francis Scale of Attitude toward Christianity.

    Science.gov (United States)

    Miranda-Tapia, Giskar Alonso; Cogollo, Zuleima; Herazo, Edwin; Campo-Arias, Adalberto

    2010-12-01

    The aim of this study was to establish test-retest reliability of a Spanish version of the Francis Scale of Attitude toward Christianity (Campo-Arias, Oviedo, & Cogollo, 2009) among adolescent students in Cartagena, Colombia. A group of ninth grade students from two public schools in Colombia (N = 157) completed the five-item scale. Cronbach's alphas were .74 and .76 in the first and second administrations, respectively. Both Pearson's rho and intra-class correlation coefficient were .69. A Spanish translation of the 5-item scale had consistent stability over four weeks.

  16. Improving a measure of mobility-related fatigue (the mobility-tiredness scale) by establishing item intensity

    DEFF Research Database (Denmark)

    Fieo, Robert A; Mortensen, Erik L; Rantanen, Taina

    2013-01-01

    To improve the construct validity of self-reported fatigue by establishing a formal hierarchy of scale items and to determine whether such a hierarchy could be maintained across time (aged 75-80), sex, and nationality.

  17. Quantitative Analysis of Complex Multiple-Choice Items in Science Technology and Society: Item Scaling

    Directory of Open Access Journals (Sweden)

    Ángel Vázquez Alonso

    2005-05-01

    The scarce attention to assessment and evaluation in science education research has been especially harmful for Science-Technology-Society (STS) education, due to the dialectic, tentative, value-laden, and controversial nature of most STS topics. To overcome the methodological pitfalls of the STS assessment instruments used in the past, an empirically developed instrument (VOSTS, Views on Science-Technology-Society) has been suggested. Some methodological proposals, namely the multiple response models and the computing of a global attitudinal index, were suggested to improve the item implementation. The final step of these methodological proposals requires the categorization of STS statements. This paper describes the process of categorization through a scaling procedure ruled by a panel of experts, acting as judges, according to the body of knowledge from history, epistemology, and sociology of science. The statement categorization allows for the sound foundation of STS items, which is useful in educational assessment and science education research, and may also increase teachers’ self-confidence in the development of the STS curriculum for science classrooms.

  18. Problems with the factor analysis of items: Solutions based on item response theory and item parcelling

    Directory of Open Access Journals (Sweden)

    Gideon P. De Bruin

    2004-10-01

    The factor analysis of items often produces spurious results in the sense that unidimensional scales appear multidimensional. This may be ascribed to failure in meeting the assumptions of linearity and normality on which factor analysis is based. Item response theory is explicitly designed for the modelling of the non-linear relations between ordinal variables and provides a strong alternative to the factor analysis of items. Items may also be combined in parcels that are more likely to satisfy the assumptions of factor analysis than do the items. The use of the Rasch rating scale model and the factor analysis of parcels is illustrated with data obtained with the Locus of Control Inventory. The results of these analyses are compared with the results obtained through the factor analysis of items. It is shown that the Rasch rating scale model and the factoring of parcels produce superior results to the factor analysis of items. Recommendations for the analysis of scales are made.

  19. Diagnostic Value of Subjective Memory Complaints Assessed with a Single Item in Dominantly Inherited Alzheimer’s Disease: Results of the DIAN Study

    Directory of Open Access Journals (Sweden)

    Christoph Laske

    2015-01-01

    Objective. We examined the diagnostic value of subjective memory complaints (SMCs) assessed with a single item in a large cross-sectional cohort consisting of families with autosomal dominant Alzheimer’s disease (ADAD) participating in the Dominantly Inherited Alzheimer Network (DIAN). Methods. The baseline sample of 183 mutation carriers (MCs) and 117 noncarriers (NCs) was divided according to the Clinical Dementia Rating (CDR) scale into preclinical (CDR 0; MCs: n=107; NCs: n=109), early symptomatic (CDR 0.5; MCs: n=48; NCs: n=8), and dementia stage (CDR ≥ 1; MCs: n=28; NCs: n=0). These groups were subdivided by the presence or absence of SMCs. Results. At CDR 0, SMCs were present in 12.1% of MCs and 9.2% of NCs (P=0.6). At CDR 0.5, SMCs were present in 66.7% of MCs and 62.5% of NCs (P=1.0). At CDR ≥ 1, SMCs were present in 96.4% of MCs. SMCs in MCs were significantly associated with CDR, logical memory scores, Geriatric Depression Scale, education, and estimated years to onset. Conclusions. The present study shows that SMCs assessed by a single-item scale have no diagnostic value to identify preclinical ADAD in asymptomatic individuals. These results demonstrate the need for further improvement of SMC measures that should be examined in large clinical trials.

  20. Pattern analysis of total item score and item response of the Kessler Screening Scale for Psychological Distress (K6) in a nationally representative sample of US adults

    Directory of Open Access Journals (Sweden)

    Shinichiro Tomitaka

    2017-02-01

    Background. Several recent studies have shown that total scores on depressive symptom measures in a general population approximate an exponential pattern except for the lower end of the distribution. Furthermore, we confirmed that the exponential pattern is present for the individual item responses on the Center for Epidemiologic Studies Depression Scale (CES-D). To confirm the reproducibility of such findings, we investigated the total score distribution and item responses of the Kessler Screening Scale for Psychological Distress (K6) in a nationally representative study. Methods. Data were drawn from the National Survey of Midlife Development in the United States (MIDUS), which comprises four subsamples: (1) a national random digit dialing (RDD) sample, (2) oversamples from five metropolitan areas, (3) siblings of individuals from the RDD sample, and (4) a national RDD sample of twin pairs. K6 items are scored using a 5-point scale: “none of the time,” “a little of the time,” “some of the time,” “most of the time,” and “all of the time.” The patterns of total score distribution and item responses were analyzed using graphical analysis and an exponential regression model. Results. The total score distributions of the four subsamples exhibited an exponential pattern with similar rate parameters. The item responses of the K6 approximated a linear pattern from “a little of the time” to “all of the time” on log-normal scales, while the “none of the time” response was not related to this exponential pattern. Discussion. The total score distribution and item responses of the K6 showed exponential patterns, consistent with other depressive symptom scales.
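
    One simple way to check for an exponential pattern of this kind is to regress the logarithm of the frequency on the score, since an exponential decay appears as a straight line on a log scale. The sketch below is only an illustration under that assumption: the function name, the least-squares approach, and the synthetic counts are hypothetical and are not the authors' regression procedure.

```python
import math


def fit_exponential_rate(scores, counts):
    """Fit log(count) = intercept - rate * score by ordinary least squares.

    Returns (rate, intercept). Scores with a zero count are skipped because
    their logarithm is undefined.
    """
    pts = [(s, math.log(c)) for s, c in zip(scores, counts) if c > 0]
    n = len(pts)
    if n < 2:
        raise ValueError("need at least two scores with nonzero counts")
    mx = sum(s for s, _ in pts) / n
    my = sum(y for _, y in pts) / n
    sxx = sum((s - mx) ** 2 for s, _ in pts)
    sxy = sum((s - mx) * (y - my) for s, y in pts)
    slope = sxy / sxx
    return -slope, my - slope * mx


# Synthetic counts that decay exactly exponentially with rate 0.3.
scores = list(range(2, 25))
counts = [1000 * math.exp(-0.3 * s) for s in scores]
rate, intercept = fit_exponential_rate(scores, counts)
```

    Because the abstract notes that the lower end of the distribution departs from the pattern, the lowest scores would normally be left out of `scores` before fitting, as the synthetic example does by starting at 2.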

  1. The 10-item Remembered Relationship with Parents (RRP10) scale

    DEFF Research Database (Denmark)

    Denollet, Johan; Smolderen, Kim G E; van den Broek, Krista C

    2007-01-01

    Dysfunctional parenting styles are associated with poor mental and physical health. The 10-item Remembered Relationship with Parents (RRP(10)) scale retrospectively assesses Alienation (dysfunctional communication and intimacy) and Control (overprotection by parents), with an emphasis on deficiencies in empathic parenting. We examined the 2-factor structure of the RRP(10) and its relationship with adult depression.

  2. Overcoming the effects of differential skewness of test items in scale construction

    Directory of Open Access Journals (Sweden)

    Johann M. Schepers

    2004-10-01

    The principal objective of the study was to develop a procedure for overcoming the effects of differential skewness of test items in scale construction. It was shown that the degree of skewness of test items places an upper limit on the correlations between the items, regardless of the contents of the items. If the items are ordered in terms of skewness, the resulting intercorrelation matrix forms a simplex or a pseudo-simplex. Factoring such a matrix results in a multiplicity of factors, most of which are artifacts. A procedure for overcoming this problem was demonstrated with items from the Locus of Control Inventory (Schepers, 1995). The analysis was based on a sample of 1662 first-year university students.
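
    The claim that skewness caps inter-item correlations is easiest to see in the special case of two dichotomous items, where the endorsement proportion determines the skewness. For proportions p1 ≤ p2, the largest attainable phi coefficient is sqrt(p1(1 - p2) / (p2(1 - p1))). The sketch below illustrates only this textbook special case; the function name is hypothetical and the code is not the procedure developed in the study.

```python
import math


def phi_max(p1, p2):
    """Largest possible phi correlation between two 0/1 items.

    p1 and p2 are the proportions of respondents endorsing each item.
    The bound is reached when everyone who endorses the rarer item
    also endorses the more common one.
    """
    lo, hi = sorted((p1, p2))
    if not (0 < lo and hi < 1):
        raise ValueError("proportions must lie strictly between 0 and 1")
    return math.sqrt(lo * (1 - hi) / (hi * (1 - lo)))


# Equal proportions allow a perfect correlation; unequal ones do not.
same = phi_max(0.5, 0.5)
opposite_skew = phi_max(0.1, 0.9)
```

    Two items with opposite skew (10% versus 90% endorsement) can correlate at most about .11 even if they measure the same trait, which is the kind of ceiling that can produce artifactual factors.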

  3. Item response theory analysis applied to the Spanish version of the Personal Outcomes Scale.

    Science.gov (United States)

    Guàrdia-Olmos, J; Carbó-Carreté, M; Peró-Cebollero, M; Giné, C

    2017-11-01

    The study of measurements of quality of life (QoL) is one of the great challenges of modern psychology and psychometric approaches. This issue has greater importance when examining QoL in populations that were historically treated on the basis of their deficiency, and recently, the focus has shifted to what each person values and desires in their life, as in cases of people with intellectual disability (ID). Many studies of QoL scales applied in this area have attempted to improve the validity and reliability of their components by incorporating various sources of information to achieve consistency in the data obtained. The adaptation of the Personal Outcomes Scale (POS) in Spanish has shown excellent psychometric attributes, and its administration has three sources of information: self-assessment, practitioner and family. The study of possible congruence or incongruence of observed distributions of each item between sources is therefore essential to ensure a correct interpretation of the measure. The aim of this paper was to analyse the observed distribution of items and dimensions from the three Spanish POS information sources cited earlier, using item response theory. We studied a sample of 529 people with ID and their respective practitioners and family members, and in each case, we analysed items and factors using Samejima's model for polytomous ordinal scales. The results indicated an important number of items with differential effects regarding sources, and in some cases, they indicated significant differences in the distribution of items, factors and sources of information. As a result of this analysis, we must affirm that the administration of the POS, considering three sources of information, was adequate overall, but a correct interpretation of the results requires that it obtain much more information to consider, as well as some specific items in specific dimensions. The overall ratings, if these comments are considered, could result in bias. © 2017

  4. Evaluation of the Multiple Sclerosis Walking Scale-12 (MSWS-12) in a Dutch sample: Application of item response theory.

    Science.gov (United States)

    Mokkink, Lidwine Brigitta; Galindo-Garre, Francisca; Uitdehaag, Bernard Mj

    2016-12-01

    The Multiple Sclerosis Walking Scale-12 (MSWS-12) measures walking ability from the patients' perspective. We examined the quality of the MSWS-12 using an item response theory model, the graded response model (GRM). A total of 625 unique Dutch multiple sclerosis (MS) patients were included. After testing for unidimensionality, monotonicity, and absence of local dependence, a GRM was fit and item characteristics were assessed. Differential item functioning (DIF) for the variables gender, age, duration of MS, type of MS and severity of MS, reliability, total test information, and standard error of the trait level (θ) were investigated. Confirmatory factor analysis showed a unidimensional structure of the 12 items of the scale, explaining 88% of the variance. Item 2 did not fit into the GRM model. Reliability was 0.93. Items 8 and 9 (of the 11 and 12 item version respectively) showed DIF on the variable severity, based on the Expanded Disability Status Scale (EDSS). However, the EDSS is strongly related to the content of both items. Our results confirm the good quality of the MSWS-12. The trait level (θ) scores and item parameters of both the 12- and 11-item versions were highly comparable, although we do not suggest to change the content of the MSWS-12. © The Author(s), 2016.
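
    For readers unfamiliar with the graded response model, a minimal sketch of its category probabilities follows. In the usual formulation, the probability of responding in category k or higher is a logistic function of a(θ - b_k), and the probability of each category is the difference between adjacent cumulative probabilities. The function name and parameter values are hypothetical and are not estimates from this study.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Category probabilities for one item under a graded response model.

    theta      : latent trait level of the person
    a          : item discrimination (must be positive)
    thresholds : increasing list of m thresholds for an item with m + 1 categories
    """
    if a <= 0:
        raise ValueError("discrimination must be positive")
    if list(thresholds) != sorted(thresholds):
        raise ValueError("thresholds must be in increasing order")
    # Cumulative probabilities P(X >= k), padded with 1 (k = 0) and 0 (k = m + 1).
    cum = [1.0]
    cum += [1 / (1 + math.exp(-a * (theta - b))) for b in thresholds]
    cum.append(0.0)
    # Each category probability is the drop between adjacent cumulative curves.
    return [cum[k] - cum[k + 1] for k in range(len(cum) - 1)]


# A hypothetical four-category item evaluated at the average trait level.
probs = grm_category_probs(theta=0.0, a=1.0, thresholds=[-1.0, 0.0, 1.0])
```

    Plotting these probabilities over a range of θ values gives the category response curves that are typically inspected when item characteristics are assessed.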

  5. The role of attention in item-item binding in visual working memory.

    Science.gov (United States)

    Peterson, Dwight J; Naveh-Benjamin, Moshe

    2017-09-01

    An important yet unresolved question regarding visual working memory (VWM) relates to whether or not binding processes within VWM require additional attentional resources compared with processing solely the individual components comprising these bindings. Previous findings indicate that binding of surface features (e.g., colored shapes) within VWM is not demanding of resources beyond what is required for single features. However, it is possible that other types of binding, such as the binding of complex, distinct items (e.g., faces and scenes), in VWM may require additional resources. In 3 experiments, we examined VWM item-item binding performance under no load, articulatory suppression, and backward counting using a modified change detection task. Binding performance declined to a greater extent than single-item performance under higher compared with lower levels of concurrent load. The findings from each of these experiments indicate that processing item-item bindings within VWM requires a greater amount of attentional resources compared with single items. These findings also highlight an important distinction between the role of attention in item-item binding within VWM and previous studies of long-term memory (LTM) where declines in single-item and binding test performance are similar under divided attention. The current findings provide novel evidence that the specific type of binding is an important determining factor regarding whether or not VWM binding processes require attention. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  6. Robustness of two single-item self-esteem measures: cross-validation with a measure of stigma in a sample of psychiatric patients.

    Science.gov (United States)

    Bagley, Christopher

    2005-08-01

    Robins' Single-item Self-esteem Inventory was compared with a single item from the Coopersmith Self-esteem. Although a new scoring format was used, there was good evidence of cross-validation in 83 current and former psychiatric patients who completed Harvey's adapted measure of stigma felt and experienced by users of mental health services. Scores on the two single-item self-esteem measures correlated .76 (p self-esteem in users of mental health services.

  7. A hierarchy of distress and invariant item ordering in the General Health Questionnaire-12.

    Science.gov (United States)

    Doyle, F; Watson, R; Morgan, K; McBride, O

    2012-06-01

    Invariant item ordering (IIO) is defined as the extent to which items have the same ordering (in terms of item difficulty/severity - i.e. demonstrating whether items are difficult [rare] or less difficult [common]) for each respondent who completes a scale. IIO is therefore crucial for establishing a scale hierarchy that is replicable across samples, but no research has demonstrated IIO in scales of psychological distress. We aimed to determine if a hierarchy of distress with IIO exists in a large general population sample who completed a scale measuring distress. Data from 4107 participants who completed the 12-item General Health Questionnaire (GHQ-12) from the Northern Ireland Health and Social Wellbeing Survey 2005-6 were analysed. Mokken scaling was used to determine the dimensionality and hierarchy of the GHQ-12, and items were investigated for IIO. All items of the GHQ-12 formed a single, strong unidimensional scale (H=0.58). IIO was found for six of the 12 items (H-trans=0.55), and these symptoms reflected the following hierarchy: anhedonia, concentration, participation, coping, decision-making and worthlessness. The cross-sectional analysis needs replication. The GHQ-12 showed a hierarchy of distress, but IIO is only demonstrated for six of the items, and the scale could therefore be shortened. Adopting brief, hierarchical scales with IIO may be beneficial in both clinical and research contexts. Copyright © 2011 Elsevier B.V. All rights reserved.
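
    The scalability coefficient H reported here can be illustrated for the simpler case of dichotomous items, where it is the sum of observed inter-item covariances divided by the sum of the largest covariances the item proportions would allow. The sketch below makes that simplifying assumption; the function name and data are hypothetical, and it does not compute the H-trans coefficient used to assess IIO.

```python
from itertools import combinations


def loevinger_h(rows):
    """Scale-level Loevinger's H for 0/1 item scores.

    rows is a list of respondents, each a list of k dichotomous item scores.
    """
    n = len(rows)
    k = len(rows[0])
    # Proportion endorsing each item.
    p = [sum(row[j] for row in rows) / n for j in range(k)]
    observed = 0.0
    maximum = 0.0
    for i, j in combinations(range(k), 2):
        p_both = sum(row[i] * row[j] for row in rows) / n
        observed += p_both - p[i] * p[j]
        lo, hi = sorted((p[i], p[j]))
        # Largest covariance possible given the two marginal proportions.
        maximum += lo * (1 - hi)
    if maximum == 0:
        raise ValueError("H is undefined when items have no variance")
    return observed / maximum


# A perfect Guttman pattern: harder items are endorsed only with easier ones.
guttman = [[0, 0, 0], [1, 0, 0], [1, 1, 0], [1, 1, 1]]
h_perfect = loevinger_h(guttman)
```

    A perfect hierarchy gives H = 1 and statistically independent items give H = 0.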

  8. Single-item measures for depression and anxiety: Validation of the Screening Tool for Psychological Distress in an inpatient cardiology setting.

    Science.gov (United States)

    Young, Quincy-Robyn; Nguyen, Michelle; Roth, Susan; Broadberry, Ann; Mackay, Martha H

    2015-12-01

    Depression and anxiety are common among patients with cardiovascular disease (CVD) and confer significant cardiac risk, contributing to CVD morbidity and mortality. Unfortunately, due to the lack of screening tools that address the specific needs of hospitalized patients, few cardiac inpatient programs offer routine screening for these forms of psychological distress, despite recommendations to do so. The purpose of this study was to validate single-item measures for depression and anxiety among cardiac inpatients. Consecutive inpatients were recruited from the cardiology and cardiac surgery step-down units at a university-affiliated, quaternary-care hospital. Subjects completed a questionnaire that included: (a) demographics, (b) single-item-measures for depression and anxiety (from the Screening Tool for Psychological Distress (STOP-D)), and (c) Hospital Anxiety and Depression Scale (HADS). One hundred and five participants were recruited with a wide variety of cardiac diagnoses, having a mean age of 66 years, and 28% were women. Both STOP-D items were highly correlated with their corresponding validated measures and demonstrated robust receiver-operator characteristic curves. Severity scores on both items correlated well with established severity cut-off scores on the corresponding subscales of the HADS. The STOP-D is a self-administered, self-report measure using two independent items that provide severity scores for depression and anxiety. The tool performs very well compared with other previously validated measures. Requiring no additional scoring and being free, STOP-D offers a simple and valid method for identifying hospitalized cardiac patients who are experiencing psychological distress. This crucial first step triggers initiation of appropriate monitoring and intervention, thus reducing the likelihood of the adverse cardiac outcomes associated with psychological distress. © The European Society of Cardiology 2014.

  9. [Unfolding item response model using best-worst scaling].

    Science.gov (United States)

    Ikehara, Kazuya

    2015-02-01

    In attitude measurement and sensory tests, the unfolding model is typically used. In this model, response probability is formulated by the distance between the person and the stimulus. In this study, we proposed an unfolding item response model using best-worst scaling (BWU model), in which a person chooses the best and worst stimulus among repeatedly presented subsets of stimuli. We also formulated an unfolding model using best scaling (BU model), and compared the accuracy of estimates between the BU and BWU models. A simulation experiment showed that the BWU model performed much better than the BU model in terms of bias and root mean square errors of estimates. With reference to Usami (2011), the proposed models were applied to actual data to measure attitudes toward tardiness. Results indicated high similarity between stimuli estimates generated with the proposed models and those of Usami (2011).

  10. Summarizing activity limitations in children with chronic illnesses living in the community: a measurement study of scales using supplemented interRAI items

    Directory of Open Access Journals (Sweden)

    Phillips Charles D

    2012-01-01

    Background. To test the validity and reliability of scales intended to measure activity limitations faced by children with chronic illnesses living in the community. The scales were based on information provided by caregivers to service program personnel almost exclusively trained as social workers. The items used to measure activity limitations were interRAI items supplemented so that they were more applicable to activity limitations in children with chronic illnesses. In addition, these analyses may shed light on the possibility of gathering functional information that can span the life course as well as spanning different care settings. Methods. Analyses included testing the internal consistency, predictive, concurrent, discriminant and construct validity of two activity limitation scales. The scales were developed using assessment data gathered in the United States of America (USA) from over 2,700 assessments of children aged 4 to 20 receiving Medicaid Early and Periodic Screening, Diagnostic and Treatment (EPSDT) services, specifically Personal Care Services to assist children in overcoming activity limitations. The Medicaid program in the USA pays for health care services provided to children in low-income households. Data were collected in a single, large state in the southwestern USA in late 2008 and early 2009. A similar sample of children was assessed in 2010, and the analyses were replicated using this sample. Results. The two scales exhibited excellent internal consistency. Evidence on the concurrent, predictive, discriminant, and construct validity of the proposed scales was strong. Quite importantly, scale scores were not correlated with (confounded with) a child's developmental stage or age. The results for these scales and items were consistent across the two independent samples. Conclusions. Unpaid caregivers, usually parents, can provide assessors lacking either medical or nursing training with reliable and valid information

  11. Summarizing activity limitations in children with chronic illnesses living in the community: a measurement study of scales using supplemented interRAI items.

    Science.gov (United States)

    Phillips, Charles D; Patnaik, Ashweeta; Moudouni, Darcy K; Naiser, Emily; Dyer, James A; Hawes, Catherine; Fournier, Constance J; Miller, Thomas R; Elliott, Timothy R

    2012-01-23

    To test the validity and reliability of scales intended to measure activity limitations faced by children with chronic illnesses living in the community. The scales were based on information provided by caregivers to service program personnel almost exclusively trained as social workers. The items used to measure activity limitations were interRAI items supplemented so that they were more applicable to activity limitations in children with chronic illnesses. In addition, these analyses may shed light on the possibility of gathering functional information that can span the life course as well as spanning different care settings. Analyses included testing the internal consistency, predictive, concurrent, discriminant and construct validity of two activity limitation scales. The scales were developed using assessment data gathered in the United States of America (USA) from over 2,700 assessments of children aged 4 to 20 receiving Medicaid Early and Periodic Screening, Diagnostic and Treatment (EPSDT) services, specifically Personal Care Services to assist children in overcoming activity limitations. The Medicaid program in the USA pays for health care services provided to children in low-income households. Data were collected in a single, large state in the southwestern USA in late 2008 and early 2009. A similar sample of children was assessed in 2010, and the analyses were replicated using this sample. The two scales exhibited excellent internal consistency. Evidence on the concurrent, predictive, discriminant, and construct validity of the proposed scales was strong. Quite importantly, scale scores were not correlated with (confounded with) a child's developmental stage or age. The results for these scales and items were consistent across the two independent samples. Unpaid caregivers, usually parents, can provide assessors lacking either medical or nursing training with reliable and valid information on the activity limitations of children. 
One can summarize these

  12. Differential item functioning magnitude and impact measures from item response theory models.

    Science.gov (United States)

    Kleinman, Marjorie; Teresi, Jeanne A

    2016-01-01

    Measures of magnitude and impact of differential item functioning (DIF) at the item and scale level, respectively, are presented and reviewed in this paper. Most measures are based on item response theory models. Magnitude refers to item-level effect sizes, whereas impact refers to differences between groups at the scale score level. Reviewed are magnitude measures based on group differences in the expected item scores and impact measures based on differences in the expected scale scores. The similarities among these indices are demonstrated. Various software packages are described that provide magnitude and impact measures, and new software is presented that computes all of the available statistics conveniently in one program with explanations of their relationships to one another.

  13. Secondary Psychometric Examination of the Dimensional Obsessive-Compulsive Scale: Classical Testing, Item Response Theory, and Differential Item Functioning.

    Science.gov (United States)

    Thibodeau, Michel A; Leonard, Rachel C; Abramowitz, Jonathan S; Riemann, Bradley C

    2015-12-01

    The Dimensional Obsessive-Compulsive Scale (DOCS) is a promising measure of obsessive-compulsive disorder (OCD) symptoms but has received minimal psychometric attention. We evaluated the utility and reliability of DOCS scores. The study included 832 students and 300 patients with OCD. Confirmatory factor analysis supported the originally proposed four-factor structure. DOCS total and subscale scores exhibited good to excellent internal consistency in both samples (α = .82 to α = .96). Patient DOCS total scores reduced substantially during treatment (t = 16.01, d = 1.02). DOCS total scores discriminated between students and patients (sensitivity = 0.76, 1 - specificity = 0.23). The measure did not exhibit gender-based differential item functioning as tested by Mantel-Haenszel chi-square tests. Expected response options for each item were plotted as a function of item response theory and demonstrated that DOCS scores incrementally discriminate OCD symptoms ranging from low to extremely high severity. Incremental differences in DOCS scores appear to represent unbiased and reliable differences in true OCD symptom severity. © The Author(s) 2014.
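
    The discrimination figures in this record (sensitivity and 1 - specificity) depend on where the cutoff on the total score is placed. A minimal sketch of that calculation follows; the function name, scores, and cutoff are hypothetical and do not reproduce the study's data.

```python
def sensitivity_specificity(patient_scores, control_scores, cutoff):
    """Classify scores at or above the cutoff as positive.

    Returns (sensitivity, specificity): the share of patients flagged
    and the share of controls not flagged.
    """
    true_pos = sum(score >= cutoff for score in patient_scores)
    true_neg = sum(score < cutoff for score in control_scores)
    return true_pos / len(patient_scores), true_neg / len(control_scores)


# Hypothetical total scores for four patients and four students.
sens, spec = sensitivity_specificity([10, 20, 30, 40], [5, 8, 12, 25], cutoff=15)
false_positive_rate = 1 - spec
```

    Repeating the calculation over every possible cutoff and plotting sensitivity against the false positive rate gives a receiver operating characteristic curve.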

  14. Evaluation of psychometric properties and differential item functioning of 8-item Child Perceptions Questionnaires using item response theory.

    Science.gov (United States)

    Yau, David T W; Wong, May C M; Lam, K F; McGrath, Colman

    2015-08-19

    The four-factor structure of the two 8-item short forms of the Child Perceptions Questionnaire CPQ11-14 (RSF:8 and ISF:8) has been confirmed. However, the sum scores are typically reported in practice as a proxy of Oral health-related Quality of Life (OHRQoL), which implies a unidimensional structure. This study first assessed the unidimensionality of the 8-item short forms of CPQ11-14. Item response theory (IRT) was employed to offer an alternative and complementary approach of validation and to overcome the limitations of classical test theory assumptions. A random sample of 649 12-year-old school children in Hong Kong was analyzed. Unidimensionality of the scale was tested by confirmatory factor analysis (CFA), principal component analysis (PCA) and local dependency (LD) statistic. A graded response model was fitted to the data. Contribution of each item to the scale was assessed by item information function (IIF). Reliability of the scale was assessed by test information function (TIF). Differential item functioning (DIF) across gender was identified by Wald test and expected score functions. Both CPQ11-14 RSF:8 and ISF:8 did not deviate much from the unidimensionality assumption. Results from CFA indicated acceptable fit of the one-factor model. PCA indicated that the first principal component explained >30 % of the total variation with high factor loadings for both RSF:8 and ISF:8. Almost all LD statistic items suggesting little contribution of information to the scale and item removal caused little practical impact. Comparing the TIFs, RSF:8 showed slightly better information than ISF:8. In addition to oral symptoms items, the item "Concerned with what other people think" demonstrated a uniform DIF (p Items related to oral symptoms were not informative to OHRQoL and deletion of these items is suggested. The impact of DIF across gender on the overall score was minimal. CPQ11-14 RSF:8 performed slightly better than ISF:8 in measurement precision. The 6-item short forms

  15. Concurrent Validation of the Clinical Opiate Withdrawal Scale (COWS) and Single-Item Indices against the Clinical Institute Narcotic Assessment (CINA) Opioid Withdrawal Instrument

    Science.gov (United States)

    Tompkins, D. Andrew; Bigelow, George E.; Harrison, Joseph A.; Johnson, Rolley E.; Fudala, Paul J.; Strain, Eric C.

    2009-01-01

    Introduction. The Clinical Opiate Withdrawal Scale (COWS) is an 11-item clinician-administered scale assessing opioid withdrawal. Though commonly used in clinical practice, it has not been systematically validated. The present study validated the COWS in comparison to the validated Clinical Institute Narcotic Assessment (CINA) scale. Method. Opioid-dependent volunteers were enrolled in a residential trial and stabilized on morphine 30 mg given subcutaneously four times daily. Subjects then underwent double-blind, randomized challenges of intramuscularly administered placebo and naloxone (0.4 mg) on separate days, during which the COWS, CINA, and visual analog scale (VAS) assessments were concurrently obtained. Subjects completing both challenges were included (N=46). Correlations between mean peak COWS and CINA scores as well as self-report VAS questions were calculated. Results. Mean peak COWS and CINA scores of 7.6 and 24.4, respectively, occurred on average 30 minutes post-injection of naloxone. Mean COWS and CINA scores 30 minutes after placebo injection were 1.3 and 18.9, respectively. The Pearson correlation coefficient for peak COWS and CINA scores during the naloxone challenge session was 0.85 (p<0.001). Peak COWS scores also correlated well with peak VAS self-report scores of bad drug effect (r=0.57, p<0.001) and feeling sick (r=0.57, p<0.001), providing additional evidence of concurrent validity. Placebo was not associated with any significant elevation of COWS, CINA, or VAS scores, indicating discriminant validity. Cronbach’s alpha for the COWS was 0.78, indicating good internal consistency (reliability). Discussion. COWS, CINA, and certain VAS items are all valid measurement tools for acute opiate withdrawal. PMID:19647958

  16. General mixture item response models with different item response structures: Exposition with an application to Likert scales.

    Science.gov (United States)

    Tijmstra, Jesper; Bolsinova, Maria; Jeon, Minjeong

    2018-01-10

    This article proposes a general mixture item response theory (IRT) framework that allows for classes of persons to differ with respect to the type of processes underlying the item responses. Through the use of mixture models, nonnested IRT models with different structures can be estimated for different classes, and class membership can be estimated for each person in the sample. If researchers are able to provide competing measurement models, this mixture IRT framework may help them deal with some violations of measurement invariance. To illustrate this approach, we consider a two-class mixture model, where a person's responses to Likert-scale items containing a neutral middle category are either modeled using a generalized partial credit model, or through an IRTree model. In the first model, the middle category ("neither agree nor disagree") is taken to be qualitatively similar to the other categories, and is taken to provide information about the person's endorsement. In the second model, the middle category is taken to be qualitatively different and to reflect a nonresponse choice, which is modeled using an additional latent variable that captures a person's willingness to respond. The mixture model is studied using simulation studies and is applied to an empirical example.

  17. Psychometric analysis of the Generalized Anxiety Disorder scale (GAD-7) in primary care using modern item response theory.

    Science.gov (United States)

    Jordan, Pascal; Shedden-Mora, Meike C; Löwe, Bernd

    2017-01-01

    The Generalized Anxiety Disorder scale (GAD-7) is one of the most frequently used diagnostic self-report scales for screening, diagnosis and severity assessment of anxiety disorder. Its psychometric properties from the view of the Item Response Theory paradigm have rarely been investigated. We aimed to close this gap by analyzing the GAD-7 within a large sample of primary care patients with respect to its psychometric properties and its implications for scoring using Item Response Theory. Robust, nonparametric statistics were used to check unidimensionality of the GAD-7. A graded response model was fitted using a Bayesian approach. Model fit was evaluated using posterior predictive p-values, item information functions were derived, and optimal predictions of anxiety were calculated. The sample included N = 3404 primary care patients (60% female; mean age, 52.2; standard deviation 19.2). The analysis indicated no deviations of the GAD-7 scale from unidimensionality and a decent fit of a graded response model. The commonly suggested ultra-brief measure consisting of the first two items, the GAD-2, was supported by item information analysis. The first four items discriminated better than the last three items with respect to latent anxiety. The information provided by the first four items should be weighted more heavily. Moreover, estimates corresponding to low to moderate levels of anxiety show greater variability. The psychometric validity of the GAD-2 was supported by our analysis.
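
    As a rough illustration of the graded response model named in the abstract above, the sketch below computes category probabilities for a single ordered-response item. It assumes Samejima's logistic form and four response categories; the discrimination and threshold values are hypothetical and are not estimates from the study, and the Bayesian fitting step is not shown.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Graded response model: probability of each ordered response category.

    theta      -- latent trait level (here: anxiety)
    a          -- item discrimination
    thresholds -- ascending category thresholds b_1 < ... < b_m
    Returns m + 1 probabilities, one per response category.
    """
    # Cumulative curves: probability of responding in category k or higher.
    cumulative = [1.0]
    cumulative += [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    cumulative.append(0.0)
    # Each category probability is the difference of adjacent cumulative curves.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


# Hypothetical parameters for one four-category item (illustration only).
probs = grm_category_probs(theta=0.5, a=1.8, thresholds=[-0.5, 0.8, 1.9])
print([round(p, 3) for p in probs])
```

    A larger discrimination value `a` makes the category curves steeper, which is the sense in which some items "discriminate better" with respect to the latent trait.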

  18. Bifactor and Item Response Theory Analyses of Interviewer Report Scales of Cognitive Impairment in Schizophrenia

    Science.gov (United States)

    Reise, Steven P.; Ventura, Joseph; Keefe, Richard S. E.; Baade, Lyle E.; Gold, James M.; Green, Michael F.; Kern, Robert S.; Mesholam-Gately, Raquelle; Nuechterlein, Keith H.; Seidman, Larry J.; Bilder, Robert

    2011-01-01

    A psychometric analysis of 2 interview-based measures of cognitive deficits was conducted: the 21-item Clinical Global Impression of Cognition in Schizophrenia (CGI-CogS; Ventura et al., 2008), and the 20-item Schizophrenia Cognition Rating Scale (SCoRS; Keefe et al., 2006), which were administered on 2 occasions to a sample of people with…

  19. Dependability and Treatment Sensitivity of Multi-Item Direct Behavior Rating Scales for Interpersonal Peer Conflict

    Science.gov (United States)

    Daniels, Brian; Volpe, Robert J.; Briesch, Amy M.; Gadow, Kenneth D.

    2017-01-01

    Direct behavior rating (DBR) represents a feasible method for monitoring student behavior in the classroom; however, limited work to date has focused on the use of multi-item scales. The purposes of the study were to examine the (a) dependability of data obtained from a multi-item DBR designed to assess peer conflict and (b) treatment sensitivity…

  20. Creating a brief rating scale for the assessment of learning disabilities using reliability and true score estimates of the scale's items based on the Rasch model.

    Science.gov (United States)

    Sideridis, Georgios; Padeliadu, Susana

    2013-01-01

    The purpose of the present studies was to provide the means to create brief versions of instruments that can aid the diagnosis and classification of students with learning disabilities and comorbid disorders (e.g., attention-deficit/hyperactivity disorder). A sample of 1,108 students with and without a diagnosis of learning disabilities took part in study 1. Using information from modern theory methods (i.e., the Rasch model), a scale was created that included fewer than one third of the original battery items designed to assess reading skills. This best item synthesis was then evaluated for its predictive and criterion validity with a valid external reading battery (study 2). Using a sample of 232 students with and without learning disabilities, results indicated that the brief version of the scale was equally effective as the original scale in predicting reading achievement. Analysis of the content of the brief scale indicated that the best item synthesis involved items from cognition, motivation, strategy use, and advanced reading skills. It is suggested that multiple psychometric criteria be employed in evaluating the psychometric adequacy of scales used for the assessment and identification of learning disabilities and comorbid disorders.

  1. A preliminary psychometric evaluation of the eight-item cognitive load scale.

    Science.gov (United States)

    Pignatiello, Grant A; Tsivitse, Emily; Hickman, Ronald L

    2018-04-01

    The aim of this article is to report the psychometric properties of the eight-item cognitive load scale. According to cognitive load theory, the formatting and delivery of healthcare education influences the degree to which patients and/or family members can engage their working memory systems for learning. However, despite its relevance, cognitive load has not yet been evaluated among surrogate decision makers exposed to electronic decision support for healthcare decisions. To date, no psychometric analyses of instruments evaluating cognitive load have been reported within healthcare settings. A convenience sample of 62 surrogate decision makers for critically ill patients, each exposed to one of two healthcare decision support interventions, was recruited from four intensive care units at a tertiary medical center in Northeast Ohio. Participants were administered a battery of psychosocial instruments and the eight-item cognitive load scale (CLS). The CLS demonstrated a bidimensional factor structure with acceptable discriminant validity and internal consistency reliability (Cronbach's α = 0.75 and 0.89). The CLS is a psychometrically sound instrument that may be used in the evaluation of decision support among surrogate decision makers of the critically ill. The authors recommend application of the cognitive load scale in the evaluation and development of healthcare education and interventions. Copyright © 2018 Elsevier Inc. All rights reserved.

  2. Development of a mobbing short scale in the Gutenberg Health Study.

    Science.gov (United States)

    Garthus-Niegel, Susan; Nübling, Matthias; Letzel, Stephan; Hegewald, Janice; Wagner, Mandy; Wild, Philipp S; Blettner, Maria; Zwiener, Isabella; Latza, Ute; Jankowiak, Sylvia; Liebers, Falk; Seidler, Andreas

    2016-01-01

    Despite its highly detrimental potential, most standard questionnaires assessing psychosocial stress at work do not include mobbing as a risk factor. In the German standard version of COPSOQ, mobbing is assessed with a single item. In the Gutenberg Health Study, this version was used together with a newly developed short scale based on the Leymann Inventory of Psychological Terror. The purpose of the present study was to evaluate the psychometric properties of these two measures, to compare them and to test their differential impact on relevant outcome parameters. This analysis is based on a population-based sample of 1441 employees participating in the Gutenberg Health Study. Exploratory and confirmatory factor analyses and reliability analyses were used to assess the mobbing scale. To determine their predictive validities, multiple linear regression analyses with six outcome parameters and log-binomial regression models for two of the outcome aspects were run. Factor analyses of the five-item scale confirmed a one-factor solution; reliability was α = 0.65. Both the single-item and the five-item scales were associated with all six outcome scales. Effect sizes were similar for both mobbing measures. Mobbing is an important risk factor for health-related outcomes. For the purpose of psychosocial risk assessment in the workplace, both the single-item and the five-item constructs were psychometrically appropriate. Associations with outcomes were about equivalent. However, the single item has the advantage of parsimony, whereas the five-item construct depicts several distinct forms of mobbing.

  3. An item response theory analysis of the Olweus Bullying scale.

    Science.gov (United States)

    Breivik, Kyrre; Olweus, Dan

    2014-12-02

    In the present article, we used IRT (graded response) modeling for a detailed and refined study of the psychometric properties of the individual items of the Olweus Bullying scale and of the scale itself. The sample consisted of a very large number of Norwegian 4th-10th grade students (n = 48 926). The IRT analyses revealed that the scale was essentially unidimensional and had excellent reliability in the upper ranges of the latent bullying tendency trait, as intended and desired. Gender DIF effects were identified with regard to girls' use of indirect bullying by social exclusion and boys' use of physical bullying by hitting and kicking, but these effects were small and worked in opposite directions, having negligible effects at the scale level. Also, scale scores adjusted for DIF effects differed very little from non-adjusted scores. In conclusion, the empirical data were well characterized by the chosen IRT model, and the Olweus Bullying scale was considered well suited for the conduct of fair and reliable comparisons involving different gender-age groups. Aggr. Behav. 9999:XX-XX, 2014. © 2014 Wiley Periodicals, Inc.

  4. An item response theory analysis of Harter's Self-Perception Profile for children or why strong clinical scales should be distrusted.

    Science.gov (United States)

    Egberink, Iris J L; Meijer, Rob R

    2011-06-01

    The authors investigated the psychometric properties of the subscales of the Self-Perception Profile for Children with item response theory (IRT) models using a sample of 611 children. Results from a nonparametric Mokken analysis and a parametric IRT approach for boys (n = 268) and girls (n = 343) were compared. The authors found that most scales formed weak scales and that measurement precision was relatively low and only present for latent trait values indicating low self-perception. The subscales Physical Appearance and Global Self-Worth formed one strong scale. Children seem to interpret Global Self-Worth items as if they measure Physical Appearance. Furthermore, the authors found that strong Mokken scales (such as Global Self-Worth) consisted mostly of items that repeat the same item content. They conclude that researchers should be very careful in interpreting the total scores on the different Self-Perception Profile for Children scales. Finally, implications for further research are discussed.

  5. 'Do you think you suffer from depression?' Reevaluating the use of a single item question for the screening of depression in older primary care patients

    DEFF Research Database (Denmark)

    Ayalon, Liat; Goldfracht, Margalit; Bech, Per

    2010-01-01

    OBJECTIVES: The majority of older adults seek depression treatment in primary care. Despite impressive efforts to integrate depression treatment into primary care, depression often remains undetected. The overall goal of the present study was to compare a single item screening for depression...... to existing depression screening tools. METHODS: A cross sectional sample of 153 older primary care patients. Participants completed several depression-screening measures (e.g. a single depression screen, Patient Health Questionnaire-9, Major Depression Inventory, Visual Analogue Scale). Measures were......: An easy way to detect depression in older primary care patients would be asking the single question, 'do you think you suffer from depression?'...

  6. Evaluation of a single-item screening question to detect limited health literacy in peritoneal dialysis patients.

    Science.gov (United States)

    Jain, Deepika; Sheth, Heena; Bender, Filitsa H; Weisbord, Steven D; Green, Jamie A

    2014-01-01

    Studies have shown that a single-item question might be useful in identifying patients with limited health literacy. However, the utility of the approach has not been studied in patients receiving maintenance peritoneal dialysis (PD). We assessed health literacy in a cohort of 31 PD patients by administering the Rapid Estimate of Adult Literacy in Medicine (REALM) and a single-item health literacy (SHL) screening question "How confident are you filling out medical forms by yourself?" (Extremely, Quite a bit, Somewhat, A little bit, or Not at all). To determine the accuracy of the single-item question for detecting limited health literacy, we performed sensitivity and specificity analyses of the SHL and plotted the area under the receiver operating characteristic (AUROC) curve using the REALM as a reference standard. Using a cut-off of "Somewhat" or less confident, the sensitivity of the SHL for detecting limited health literacy was 80%, and the specificity was 88%. The positive likelihood ratio was 6.9. The SHL had an AUROC of 0.79 (95% confidence interval: 0.52 to 1.00). Our results show that the SHL could be effective in detecting limited health literacy in PD patients.
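
    The accuracy measures in the abstract above follow from a 2x2 table of screening result against reference standard. The sketch below shows the arithmetic. The cell counts are hypothetical: the abstract reports only the percentages and the total of 31 patients, and these counts were chosen because they reproduce the rounded figures (80%, 88%, 6.9).

```python
def screening_accuracy(tp, fn, tn, fp):
    """Sensitivity, specificity, and positive likelihood ratio from 2x2 counts."""
    sensitivity = tp / (tp + fn)   # true positives among those with the condition
    specificity = tn / (tn + fp)   # true negatives among those without it
    lr_positive = sensitivity / (1 - specificity)
    return sensitivity, specificity, lr_positive


# Hypothetical counts summing to 31 patients (not taken from the study).
sens, spec, lr_pos = screening_accuracy(tp=4, fn=1, tn=23, fp=3)
print(f"sensitivity={sens:.2f} specificity={spec:.2f} LR+={lr_pos:.1f}")
```

    Note that dividing the rounded percentages directly (0.80 / 0.12) gives about 6.7 rather than 6.9, so a likelihood ratio is best computed from the unrounded proportions.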

  7. Item Response Theory analysis of the Autonomy over Tobacco Scale (AUTOS).

    Science.gov (United States)

    Wellman, Robert J; Edelen, Maria Orlando; DiFranza, Joseph R

    2015-06-01

    The Autonomy over Tobacco Scale (AUTOS) is composed of 12 symptoms of nicotine dependence. While it has demonstrated excellent reliability and validity, several psychometric properties have yet to be investigated. We aimed to determine (1) whether items functioned differently across demographic groups, (2) the likelihood that individual symptoms would be endorsed by smokers at different levels of diminished autonomy, and (3) the degree of information provided by each item and the reliability of the full AUTOS across the range of diminished autonomy. Data for this study come from two convenience samples of American adult current smokers (n=777; 69% female; 88% white; Mage=34 years, range: 18-78), of whom 66% were daily smokers (Mcigarettes/smoking day=10.1, range: AUTOS online as part of "a research study about the experiences people have when they smoke." After p value correction, items remained invariant across sex and minority status, while two items functioned differently according to age, with minimal impact on the total AUTOS score. Discriminative power of the items was high. The greatest amount of information is provided at just under one-half SD above the mean and the least at the extremes of diminished autonomy. The AUTOS maintains acceptable reliability (>0.70) across the range of diminished autonomy within which more than 95% of smokers' scores could be anticipated to fall. The AUTOS is a versatile and psychometrically sound instrument for measuring the loss of autonomy over tobacco use. Copyright © 2015 Elsevier Ltd. All rights reserved.

  8. The Development of Marital Maturity Scale

    Directory of Open Access Journals (Sweden)

    Muhammed YILDIZ

    2017-06-01

    In this study, validity, reliability, and item analyses were carried out for the Marital Maturity Scale, which was developed to test whether individuals are ready for marriage. The scale was developed using a sample of 623 single adults. In the validity studies, exploratory and confirmatory factor analyses and criterion-related validity studies were performed. Factor analysis revealed that the scale had four dimensions, which together account for 60.91% of the total variance. The factor loadings of the items range from 0.42 to 0.86. The Inonu Marriage Attitude Scale was used in the criterion-related validity studies; the correlation between the two scales was significant (r=0.72, p=0.000). The subscales correlated significantly with the total scale. Cronbach's alpha was 0.85 for the first dimension, 0.68 for the second, 0.80 for the third, 0.91 for the fourth, and 0.90 for the total scale. The test-retest correlation was significant (r=0.70, p=0.000). In the item analysis, individuals in the lower 27% group and the upper 27% group differed significantly on all items (p=0.000). The item-total correlations ranged from 0.40 to 0.63. It was concluded that the Marital Maturity Scale is a reliable and valid instrument for measuring the marital maturity of single adults.

  9. Psychometric Properties of the 20-Item Toronto Alexithymia Scale in the Chilean Population

    Directory of Open Access Journals (Sweden)

    Mauricio González-Arias

    2018-06-01

    Alexithymia can be defined as the inability to identify and describe emotions in the self. It has been shown to be related to several psychological and pathological processes that can result in unsatisfactory interpersonal relationships and decreased social adjustment. Advances in alexithymia research require the development and validation of assessment instruments and their application to different populations. With this aim, we studied the psychometric properties of the Twenty-Item Toronto Alexithymia Scale (TAS-20) in the Chilean population using various modeling procedures (e.g., CFA, ESEM) in different structures (i.e., Correlated, Unidimensional, Hierarchical, or Wording factors). Among the 10 models tested, the four-dimensional structure offered the best fit but with item-loading problems in the last factor (Pragmatic Thinking). We suggest that the studied version of the scale needs improvement (theoretical and empirical) to ensure optimal indices of validation for the Chilean population.

  10. The Number of Response Categories and the Reverse Directional Item Problem in Likert-Type Scales: A Study with the Rasch Model

    Directory of Open Access Journals (Sweden)

    Mustafa İLHAN

    2017-09-01

    This study addressed the problems of reverse directional items and the number of response categories in Likert-type scales. The Fear of Negative Evaluation Scale (FNES) and the Oxford Happiness Questionnaire (OHQ) were used as data collection tools. The data were analyzed according to the Rasch model. The analysis found that the observed and expected test characteristic curves largely overlapped, that each of the three rating scales worked effectively, and that participants could successfully distinguish the response categories in straightforward directional items. On the other hand, there were significant differences between the observed and expected test characteristic curves in reverse directional items. It was also found that, regardless of whether the three-, five-, or seven-point rating scale was used, the participants could not distinguish the response categories of the reverse directional items on the FNES and the OHQ. Afterwards, the reverse directional items were removed from the data file, and the analysis was repeated. The results revealed that item discrimination, reliability coefficients for the person facet, separation ratios, and chi-square values calculated for the person and item facets were higher with the five-point rating scale than with the three- and seven-point scales.

  11. Do Personality Scale Items Function Differently in People with High and Low IQ?

    Science.gov (United States)

    Waiyavutti, Chakadee; Johnson, Wendy; Deary, Ian J.

    2012-01-01

    Intelligence differences might contribute to true differences in personality traits. It is also possible that intelligence might contribute to differences in understanding and interpreting personality items. Previous studies have not distinguished clearly between these possibilities. Before it can be accepted that scale score differences actually…

  12. The work ability index and single-item question: associations with sick leave, symptoms, and health--a prospective study of women on long-term sick leave.

    Science.gov (United States)

    Ahlstrom, Linda; Grimby-Ekman, Anna; Hagberg, Mats; Dellve, Lotta

    2010-09-01

    This study investigated the association between the work ability index (WAI) and the single-item question on work ability among women working in human service organizations (HSO) currently on long-term sick leave. It also examined the association between the WAI and the single-item question in relation to sick leave, symptoms, and health. Predictive values of the WAI, the changed WAI, the single-item question and the changed single-item question were investigated for degree of sick leave, symptoms, and health. This cohort study comprised 324 HSO female workers on long-term (>60 days) sick leave, with follow-ups at 6 and 12 months. Participants responded to questionnaires. Data on work ability, sick leave, health, and symptoms were analyzed with regard to associations and predictability. Spearman correlation and mixed-model analysis were performed for repeated measurements over time. The study showed a very strong association between the WAI and the single-item question among all participants. Both the WAI and the single-item question showed similar patterns of associations with sick leave, health, and symptoms. The predictive value for the degree of sick leave and health-related quality of life (HRQoL) was strong for both the WAI and the single-item question, and slightly less strong for vitality, neck pain, both self-rated general and mental health, and behavioral and current stress. This study suggests that the single-item question on work ability could be used as a simple indicator for assessing the status and progress of work ability among women on long-term sick leave.

  13. Examining the Psychometric Quality of Multiple-Choice Assessment Items using Mokken Scale Analysis.

    Science.gov (United States)

    Wind, Stefanie A

    The concept of invariant measurement is typically associated with Rasch measurement theory (Engelhard, 2013). Concerned with the appropriateness of the parametric transformation upon which the Rasch model is based, Mokken (1971) proposed a nonparametric procedure for evaluating the quality of social science measurement that is theoretically and empirically related to the Rasch model. Mokken's nonparametric procedure can be used to evaluate the quality of dichotomous and polytomous items in terms of the requirements for invariant measurement. Despite these potential benefits, the use of Mokken scaling to examine the properties of multiple-choice (MC) items in education has not yet been fully explored. A nonparametric approach to evaluating MC items is promising in that this approach facilitates the evaluation of assessments in terms of invariant measurement without imposing potentially inappropriate transformations. Using Rasch-based indices of measurement quality as a frame of reference, data from an eighth-grade physical science assessment are used to illustrate and explore Mokken-based techniques for evaluating the quality of MC items. Implications for research and practice are discussed.

  14. Gender Effect According to Item Directionality on the Perceived Stress Scale for Adults with Multiple Sclerosis

    Science.gov (United States)

    Gitchel, W. Dent; Roessler, Richard T.; Turner, Ronna C.

    2011-01-01

    Assessment is critical to rehabilitation practice and research, and self-reports are a commonly used form of assessment. This study examines a gender effect according to item wording on the "Perceived Stress Scale" for adults with multiple sclerosis. Past studies have demonstrated two-factor solutions on this scale and other scales measuring…

  15. An item response theory evaluation of the young mania rating scale and the montgomery-asberg depression rating scale in the systematic treatment enhancement program for bipolar disorder (STEP-BD).

    Science.gov (United States)

    Prisciandaro, James J; Tolliver, Bryan K

    2016-11-15

    The Young Mania Rating Scale (YMRS) and Montgomery-Asberg Depression Rating Scale (MADRS) are among the most widely used outcome measures for clinical trials of medications for Bipolar Disorder (BD). Nonetheless, very few studies have examined the measurement characteristics of the YMRS and MADRS in individuals with BD using modern psychometric methods. The present study evaluated the YMRS and MADRS in the Systematic Treatment Enhancement Program for BD (STEP-BD) study using Item Response Theory (IRT). Baseline data from 3716 STEP-BD participants were available for the present analysis. The Graded Response Model (GRM) was fit separately to YMRS and MADRS item responses. Differential item functioning (DIF) was examined by regressing a variety of clinically relevant covariates (e.g., sex, substance dependence) on all test items and on the latent symptom severity dimension, within each scale. Both scales: 1) contained several items that provided little or no psychometric information, 2) were inefficient, in that the majority of item response categories did not provide incremental psychometric information, 3) poorly measured participants outside of a narrow band of severity, 4) evidenced DIF for nearly all items, suggesting that item responses were, in part, determined by factors other than symptom severity. Limited to outpatients; DIF analysis only sensitive to certain forms of DIF. The present study provides evidence for significant measurement problems involving the YMRS and MADRS. More work is needed to refine these measures and/or develop suitable alternative measures of BD symptomatology for clinical trials research. Copyright © 2016 Elsevier B.V. All rights reserved.

  16. Single-item screening for agoraphobic symptoms : validation of a web-based audiovisual screening instrument

    NARCIS (Netherlands)

    van Ballegooijen, Wouter; Riper, Heleen; Donker, Tara; Martin Abello, Katherina; Marks, Isaac; Cuijpers, Pim

    2012-01-01

    The advent of web-based treatments for anxiety disorders creates a need for quick and valid online screening instruments, suitable for a range of social groups. This study validates a single-item multimedia screening instrument for agoraphobia, part of the Visual Screener for Common Mental Disorders

  17. Using Item Response Theory to Develop Measures of Acquisitive and Protective Self-Monitoring From the Original Self-Monitoring Scale.

    Science.gov (United States)

    Wilmot, Michael P; Kostal, Jack W; Stillwell, David; Kosinski, Michal

    2017-07-01

    For the past 40 years, the conventional univariate model of self-monitoring has reigned as the dominant interpretative paradigm in the literature. However, recent findings associated with an alternative bivariate model challenge the conventional paradigm. In this study, item response theory is used to develop measures of the bivariate model of acquisitive and protective self-monitoring using original Self-Monitoring Scale (SMS) items, and data from two large, nonstudent samples (Ns = 13,563 and 709). Results indicate that the new acquisitive (six-item) and protective (seven-item) self-monitoring scales are reliable, unbiased in terms of gender and age, and demonstrate theoretically consistent relations to measures of personality traits and cognitive ability. Additionally, by virtue of using original SMS items, previously collected responses can be reanalyzed in accordance with the alternative bivariate model. Recommendations for the reanalysis of archival SMS data, as well as directions for future research, are provided.

  18. Dimensionality of the 9-item Utrecht Work Engagement Scale (UWES-9).

    Science.gov (United States)

    de Bruin, Gideon P; Henn, Carolina M

    2013-06-01

    Despite wide-spread use, questions remain about the dimensionality of the 9-item Utrecht Work Engagement Scale (UWES-9). Theoretical underpinnings of the UWES-9 point toward a hierarchical structure with a general factor and three group or primary factors: Dedication, Vigor, and Absorption. To date, researchers have failed to model the general factor, which contributes to the lack of consensus about the dimensionality of the scale. Bi-factor analysis was used to demonstrate the presence of a very strong general factor and, in comparison, two weak group factors. The results shed additional light on the meaning of the work engagement construct. The implications for research with the UWES-9 are discussed.

  19. Psychometric properties of the Chinese version of resilience scale specific to cancer: an item response theory analysis.

    Science.gov (United States)

    Ye, Zeng Jie; Liang, Mu Zi; Zhang, Hao Wei; Li, Peng Fei; Ouyang, Xue Ren; Yu, Yuan Liang; Liu, Mei Ling; Qiu, Hong Zhong

    2018-06-01

    Classical test theory has been used to develop and validate the 25-item Resilience Scale Specific to Cancer (RS-SC) in Chinese patients with cancer. This study was designed to provide additional information about the discriminative value of the individual items tested with an item response theory analysis. A two-parameter graded response model was fitted to examine whether any of the items of the RS-SC exhibited problems with the ordering and steps of thresholds, as well as the ability of items to discriminate patients with different resilience levels using item characteristic curves. A sample of 214 Chinese patients with a cancer diagnosis was analyzed. The established three-dimension structure of the RS-SC was confirmed. Several items showed problematic thresholds or discrimination ability and require further revision. Some problematic items should be refined, and a short form of the RS-SC may be feasible in clinical settings in order to reduce the burden on patients. However, the generalizability of these findings warrants further investigation.

  20. Few items in the thyroid-related quality of life instrument ThyPRO exhibited differential item functioning.

    Science.gov (United States)

    Watt, Torquil; Groenvold, Mogens; Hegedüs, Laszlo; Bonnema, Steen Joop; Rasmussen, Åse Krogh; Feldt-Rasmussen, Ulla; Bjorner, Jakob Bue

    2014-02-01

    To evaluate the extent of differential item functioning (DIF) within the thyroid-specific quality of life patient-reported outcome measure, ThyPRO, according to sex, age, education and thyroid diagnosis. A total of 838 patients with benign thyroid diseases completed the ThyPRO questionnaire (84 five-point items, 13 scales). Uniform and nonuniform DIF were investigated using ordinal logistic regression, testing for both statistical significance and magnitude (ΔR² > 0.02). Scale level was estimated by the sum score, after purification. Twenty instances of DIF in 17 of the 84 items were found. Eight were according to diagnosis, where the goiter scale was the one most affected, possibly due to differing perceptions in patients with autoimmune thyroid diseases compared to patients with simple goiter. Eight instances of DIF according to age were found, of which 5 were in positively worded items, which younger patients were more likely to endorse; one was according to gender (women were more likely to report crying), and three were according to educational level. The vast majority of DIF had only minor influence on the scale scores (0.1-2.3 points on the 0-100 scales), but two instances of DIF corresponded to differences of 4.6 and 9.8 points, respectively. Ordinal logistic regression identified DIF in 17 of 84 items. The potential impact of this on the present scales was low, but items displaying DIF could be avoided when developing abbreviated scales, where the potential impact of DIF (due to fewer items) will be larger.

  1. Location Indices for Ordinal Polytomous Items Based on Item Response Theory. Research Report. ETS RR-15-20

    Science.gov (United States)

    Ali, Usama S.; Chang, Hua-Hua; Anderson, Carolyn J.

    2015-01-01

    Polytomous items are typically described by multiple category-related parameters; situations, however, arise in which a single index is needed to describe an item's location along a latent trait continuum. Situations in which a single index would be needed include item selection in computerized adaptive testing or test assembly. Therefore single…

  2. Face validity of the single work ability item

    DEFF Research Database (Denmark)

    Gupta, Nidhi; Jensen, Bjørn Søvsø; Søgaard, Karen

    2014-01-01

    PURPOSE: The purpose of this study was to investigate the face validity of the self-reported single item work ability with objectively measured heart rate reserve (%HRR) among blue-collar workers. METHODS: We utilized data from 127 blue-collar workers (Female = 53; Male = 74) aged 18-65 years from... with a total of 5,810 h, including 2,640 working hours. RESULTS: A significant moderate correlation between work ability and %HRR was observed among males (R = -0.33, P = 0.005), but not among females (R = 0.11, P = 0.431). In a gender-stratified multi-adjusted logistic regression analysis, males with high... %HRR were more likely to report a reduced work ability compared to males with low %HRR [OR = 4.75, 95% confidence interval (95% CI) = 1.31 to 17.25]. However, this association was not found among females (OR = 0.26, 95% CI 0.03 to 2.16), and a significant interaction between work ability, %HRR...

  3. Development and Standardization of the Diagnostic Adaptive Behavior Scale: Application of Item Response Theory to the Assessment of Adaptive Behavior

    Science.gov (United States)

    Tassé, Marc J.; Schalock, Robert L.; Thissen, David; Balboni, Giulia; Bersani, Henry, Jr.; Borthwick-Duffy, Sharon A.; Spreat, Scott; Widaman, Keith F.; Zhang, Dalun; Navas, Patricia

    2016-01-01

    The Diagnostic Adaptive Behavior Scale (DABS) was developed using item response theory (IRT) methods and was constructed to provide the most precise and valid adaptive behavior information at or near the cutoff point of making a decision regarding a diagnosis of intellectual disability. The DABS initial item pool consisted of 260 items. Using IRT…

  4. Screening for depression in advanced disease: psychometric properties, sensitivity, and specificity of two items of the Palliative Care Outcome Scale (POS).

    Science.gov (United States)

    Antunes, Bárbara; Murtagh, Fliss; Bausewein, Claudia; Harding, Richard; Higginson, Irene J

    2015-02-01

    Depression is common among patients with advanced disease but often difficult to detect. To assess the Palliative care Outcome Scale (POS) (10 items) against the Geriatric Depression Scale (GDS)-10 total score and the Hospital Anxiety and Depression Scale (HADS)-Depression subscale total score and determine if the POS has appropriate items to screen for depression among people with advanced disease. This was a secondary analysis performed on five studies. Four psychometric properties were assessed: data quality, scaling assumptions, acceptability, and internal consistency (reliability). Receiver operating characteristic (ROC) curves were used to determine the area under the curve. Sensitivity, specificity, positive and negative predictive values, false positive and negative rates, and positive and negative likelihood ratios were computed. The overall sample had 416 patients from Germany and England: 144 had cancer and 267 had nonmalignant conditions. Prevalence of depression across the sample was 17.5%. Floor and ceiling effects were rare. Cronbach's alpha coefficients for POS items 7 and 8 summed, GDS-10 and HADS-Depression items varied: 0.61 (heart failure) and 0.80 (cancer). Two items combined (Item 7-feeling depressed and Item 8-feeling good about yourself) consistently presented the highest area under the ROC curve, ranging from 0.76 (95% CI 0.60, 0.93) (Germany, lung cancer) to 0.97 (95% CI 0.91, 1.0) (heart failure), highest negative predictive value, and lowest false negative rate. For the overall sample, the cutoff 2/3 presented a negative predictive value of 89.4% (95% CI 84.7, 92.8) and false negative rate of 10.6 (95% CI 7.2, 15.3). POS items 7 and 8 summed are potentially useful to screen for depression in advanced disease populations. Copyright © 2015 American Academy of Hospice and Palliative Medicine. Published by Elsevier Inc. All rights reserved.
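
    The screening statistics listed in this abstract all derive from a 2x2 table of screen result against reference diagnosis. The sketch below shows the standard definitions. The function name and the counts in the usage line are hypothetical and are not the study's data.

```python
import math


def screening_metrics(tp, fp, fn, tn):
    """Compute common screening statistics from 2x2 table counts.

    tp, fp, fn, tn -- true positives, false positives,
                      false negatives, true negatives
    """
    sensitivity = tp / (tp + fn)
    specificity = tn / (tn + fp)
    ppv = tp / (tp + fp)  # positive predictive value
    npv = tn / (tn + fn)  # negative predictive value
    # Likelihood ratios are undefined at the extremes; report infinity there.
    lr_positive = sensitivity / (1 - specificity) if specificity < 1 else math.inf
    lr_negative = (1 - sensitivity) / specificity if specificity > 0 else math.inf
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        "ppv": ppv,
        "npv": npv,
        "lr_positive": lr_positive,
        "lr_negative": lr_negative,
    }


# Usage with made-up counts (not from the study).
metrics = screening_metrics(tp=40, fp=20, fn=10, tn=130)
```

    Predictive values depend on prevalence in the sample, whereas sensitivity, specificity and the likelihood ratios do not, which is why screening studies typically report both groups of statistics.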

  5. Chemical Transfer (Single Small-Scale) Facility

    Data.gov (United States)

    Federal Laboratory Consortium — Description/History: Chemistry laboratory. The Chemical Transfer Facility (CTF) is the only U.S. single small-scale facility, a single repository for the Army’s...

  6. A Multiple-Item Scale for Assessing E-Government Service Quality

    Science.gov (United States)

    Papadomichelaki, Xenia; Mentzas, Gregoris

    A critical element in the evolution of e-governmental services is the development of sites that better serve the citizens’ needs. To deliver superior service quality, we must first understand how citizens perceive and evaluate online citizen service. This involves defining what e-government service quality is, identifying its underlying dimensions, and determining how it can be conceptualized and measured. In this article we conceptualise an e-government service quality model (e-GovQual) and then we develop, refine, validate, confirm and test a multiple-item scale for measuring e-government service quality for public administration sites where citizens seek either information or services.

  7. Work-related stress assessed by a text message single-item stress question.

    Science.gov (United States)

    Arapovic-Johansson, B; Wåhlin, C; Kwak, L; Björklund, C; Jensen, I

    2017-12-02

    Given the prevalence of work stress-related ill-health in the Western world, it is important to find cost-effective, easy-to-use and valid measures which can be used both in research and in practice. To examine the validity and reliability of the single-item stress question (SISQ), distributed weekly by short message service (SMS) and used for measurement of work-related stress. The convergent validity was assessed through associations between the SISQ and subscales of the Job Demand-Control-Support model, the Effort-Reward Imbalance model and scales measuring depression, exhaustion and sleep. The predictive validity was assessed using SISQ data collected through SMS. The reliability was analysed by the test-retest procedure. Correlations between the SISQ and all the subscales except for job strain and esteem reward were significant, ranging from -0.186 to 0.627. The SISQ could also predict sick leave, depression and exhaustion at 12-month follow-up. The analysis on reliability revealed a satisfactory stability with a weighted kappa between 0.804 and 0.868. The SISQ, administered through SMS, can be used for the screening of stress levels in a working population. © The Author 2017. Published by Oxford University Press on behalf of the Society of Occupational Medicine. All rights reserved.

  8. Differential Item Functioning in the SF-36 Physical Functioning and Mental Health Sub-Scales: A Population-Based Investigation in the Canadian Multicentre Osteoporosis Study.

    Science.gov (United States)

    Lix, Lisa M; Wu, Xiuyun; Hopman, Wilma; Mayo, Nancy; Sajobi, Tolulope T; Liu, Juxin; Prior, Jerilynn C; Papaioannou, Alexandra; Josse, Robert G; Towheed, Tanveer E; Davison, K Shawn; Sawatzky, Richard

    2016-01-01

    Self-reported health status measures, like the Short Form 36-item Health Survey (SF-36), can provide rich information about the overall health of a population and its components, such as physical, mental, and social health. However, differential item functioning (DIF), which arises when population sub-groups with the same underlying (i.e., latent) level of health have different measured item response probabilities, may compromise the comparability of these measures. The purpose of this study was to test for DIF on the SF-36 physical functioning (PF) and mental health (MH) sub-scale items in a Canadian population-based sample. Study data were from the prospective Canadian Multicentre Osteoporosis Study (CaMos), which collected baseline data in 1996-1997. DIF was tested using a multiple indicators multiple causes (MIMIC) method. Confirmatory factor analysis defined the latent variable measurement model for the item responses and latent variable regression with demographic and health status covariates (i.e., sex, age group, body weight, self-perceived general health) produced estimates of the magnitude of DIF effects. The CaMos cohort consisted of 9423 respondents; 69.4% were female and 51.7% were less than 65 years. Eight of 10 items on the PF sub-scale and four of five items on the MH sub-scale exhibited DIF. Large DIF effects were observed on PF sub-scale items about vigorous and moderate activities, lifting and carrying groceries, walking one block, and bathing or dressing. On the MH sub-scale items, all DIF effects were small or moderate in size. SF-36 PF and MH sub-scale scores were not comparable across population sub-groups defined by demographic and health status variables due to the effects of DIF, although the magnitude of this bias was not large for most items. We recommend testing and adjusting for DIF to ensure comparability of the SF-36 in population-based investigations.

  9. Differential Item Functioning in the SF-36 Physical Functioning and Mental Health Sub-Scales: A Population-Based Investigation in the Canadian Multicentre Osteoporosis Study.

    Directory of Open Access Journals (Sweden)

    Lisa M Lix

    Self-reported health status measures, like the Short Form 36-item Health Survey (SF-36), can provide rich information about the overall health of a population and its components, such as physical, mental, and social health. However, differential item functioning (DIF), which arises when population sub-groups with the same underlying (i.e., latent) level of health have different measured item response probabilities, may compromise the comparability of these measures. The purpose of this study was to test for DIF on the SF-36 physical functioning (PF) and mental health (MH) sub-scale items in a Canadian population-based sample. Study data were from the prospective Canadian Multicentre Osteoporosis Study (CaMos), which collected baseline data in 1996-1997. DIF was tested using a multiple indicators multiple causes (MIMIC) method. Confirmatory factor analysis defined the latent variable measurement model for the item responses and latent variable regression with demographic and health status covariates (i.e., sex, age group, body weight, self-perceived general health) produced estimates of the magnitude of DIF effects. The CaMos cohort consisted of 9423 respondents; 69.4% were female and 51.7% were less than 65 years. Eight of 10 items on the PF sub-scale and four of five items on the MH sub-scale exhibited DIF. Large DIF effects were observed on PF sub-scale items about vigorous and moderate activities, lifting and carrying groceries, walking one block, and bathing or dressing. On the MH sub-scale items, all DIF effects were small or moderate in size. SF-36 PF and MH sub-scale scores were not comparable across population sub-groups defined by demographic and health status variables due to the effects of DIF, although the magnitude of this bias was not large for most items. We recommend testing and adjusting for DIF to ensure comparability of the SF-36 in population-based investigations.

  10. The e-MSWS-12: improving the multiple sclerosis walking scale using item response theory.

    Science.gov (United States)

    Engelhard, Matthew M; Schmidt, Karen M; Engel, Casey E; Brenton, J Nicholas; Patek, Stephen D; Goldman, Myla D

    2016-12-01

    The Multiple Sclerosis Walking Scale (MSWS-12) is the predominant patient-reported measure of multiple sclerosis (MS)-related walking ability, yet it had not been analyzed using item response theory (IRT), the emerging standard for patient-reported outcome (PRO) validation. This study aims to reduce MSWS-12 measurement error and facilitate computerized adaptive testing by creating an IRT model of the MSWS-12 and distributing it online. MSWS-12 responses from 284 subjects with MS were collected by mail and used to fit and compare several IRT models. Following model selection and assessment, subpopulations based on age and sex were tested for differential item functioning (DIF). Model comparison favored a one-dimensional graded response model (GRM). This model met fit criteria and explained 87% of response variance. The performance of each MSWS-12 item was characterized using category response curves (CRCs) and item information. IRT-based MSWS-12 scores correlated with traditional MSWS-12 scores (r = 0.99) and timed 25-foot walk (T25FW) speed (r = -0.70). Item 2 showed DIF based on age (χ² = 19.02, df = 5, p …). Item 11 showed DIF based on sex (χ² = 13.76, df = 5, p = 0.02). MSWS-12 measurement error depends on walking ability, but could be lowered by improving or replacing items with low information or DIF. The e-MSWS-12 includes IRT-based scoring, error checking, and an estimated T25FW derived from MSWS-12 responses. It is available at https://ms-irt.shinyapps.io/e-MSWS-12 .

  11. Validation of a 10-item care-related regret intensity scale (RIS-10) for health care professionals.

    Science.gov (United States)

    Courvoisier, Delphine S; Cullati, Stéphane; Haller, Chiara S; Schmidt, Ralph E; Haller, Guy; Agoritsas, Thomas; Perneger, Thomas V

    2013-03-01

    Regret after one of the many decisions and interventions that health care professionals make every day can have an impact on their own health and quality of life, and on their patient care practices. To validate a new care-related regret intensity scale (RIS) for health care professionals. Retrospective cross-sectional cohort study with a 1-month follow-up (test-retest) in a French-speaking University Hospital. A total of 469 nurses and physicians responded to the survey, and 175 answered the retest. RIS, self-report questions on the context of the regret-inducing event, its consequences for the patient, involvement of the health care professionals, and changes in patient care practices after the event. We measured the impact of regret intensity on health care professionals with the satisfaction with life scale, the SF-36 first question (self-reported health), and a question on self-esteem. On the basis of factor analysis and item response analysis, the initial 19-item scale was shortened to 10 items. The resulting scale (RIS-10) was unidimensional and had high internal consistency (α=0.87) and acceptable test-retest reliability (0.70). Higher regret intensity was associated with (a) more consequences for the patient; (b) lower life satisfaction and poorer self-reported health in health care professionals; and (c) changes in patient care practices. Nurses reported analyzing the event and apologizing, whereas physicians reported talking preferentially to colleagues, rather than to their supervisor, about changing practices. The RIS is a valid and reliable measure of care-related regret intensity for hospital-based physicians and nurses.

  12. VALIDITY OF THE EMOTIONAL INTELLIGENCE SCALE FOR USE IN SPORT

    Directory of Open Access Journals (Sweden)

    Andrew M. Lane

    2009-06-01

    This study investigated the factorial validity of the 33-item self-rated Emotional Intelligence Scale (EIS; Schutte et al., 1998) for use with athletes. In stage 1, content validity of the EIS was assessed by a panel of experts (n = 9). Items were evaluated in terms of whether they assessed EI related to oneself and EI focused on others. Content validity further examined items in terms of awareness, regulation, and utilization of emotions. Content validity results indicated that the items describe six factors: appraisal of own emotions, regulation of own emotions, utilization of own emotions, optimism, social skills, and appraisal of others' emotions. Results highlighted 13 items which make no direct reference to emotional experiences, and therefore it is questionable whether such items should be retained. Stage 2 tested two competing models: a single-factor model, which is the typical way researchers use the EIS, and the 5-factor model (optimism was discarded as it became a single-item scale following stage 1) identified in stage 1. Confirmatory factor analysis (CFA) results on EIS data from 1,681 athletes demonstrated unacceptable fit indices for the 33-item single-factor model and acceptable fit indices for the 6-factor model. Data were re-analyzed after removing the 13 items lacking emotional content, and CFA results indicated partial support for the single-factor model and further support for a five-factor model (optimism was discarded as a factor during item removal). Despite encouraging results for a reduced-item version of the EIS, we suggest further validation work is needed.

  13. MIMIC Methods for Assessing Differential Item Functioning in Polytomous Items

    Science.gov (United States)

    Wang, Wen-Chung; Shih, Ching-Lin

    2010-01-01

    Three multiple indicators-multiple causes (MIMIC) methods, namely, the standard MIMIC method (M-ST), the MIMIC method with scale purification (M-SP), and the MIMIC method with a pure anchor (M-PA), were developed to assess differential item functioning (DIF) in polytomous items. In a series of simulations, it appeared that all three methods…

  14. Standard Errors for National Trends in International Large-Scale Assessments in the Case of Cross-National Differential Item Functioning

    Science.gov (United States)

    Sachse, Karoline A.; Haag, Nicole

    2017-01-01

    Standard errors computed according to the operational practices of international large-scale assessment studies such as the Programme for International Student Assessment's (PISA) or the Trends in International Mathematics and Science Study (TIMSS) may be biased when cross-national differential item functioning (DIF) and item parameter drift are…

  15. Mokken scaling analysis of the Hospital Anxiety and Depression Scale in individuals with cardiovascular disease.

    Science.gov (United States)

    Cosco, Theodore D; Doyle, Frank; Watson, Roger; Ward, Mark; McGee, Hannah

    2012-01-01

    The Hospital Anxiety and Depression Scale (HADS) is a prolifically used scale of anxiety and depression. The original bidimensional anxiety-depression latent structure of the HADS has come under significant scrutiny, with previous studies revealing one-, two-, three- and four-dimensional structures. The current study examines the latent structure of the HADS using a non-parametric item response theory method. Using data conglomerated from four independent studies of cardiovascular disease employing the HADS (n=893), Mokken scaling procedure was conducted to assess the latent structure of the HADS. A single scale consisting of 12 of 14 HADS items was revealed, indicating a unidimensional latent HADS structure. The HADS was initially intended to measure mutually exclusive levels of anxiety and depression; however, the current study indicates that a single dimension of general psychological distress is captured. Copyright © 2012 Elsevier Inc. All rights reserved.

  16. Psychometric properties of the Epworth Sleepiness Scale: A factor analysis and item-response theory approach.

    Science.gov (United States)

    Pilcher, June J; Switzer, Fred S; Munc, Alec; Donnelly, Janet; Jellen, Julia C; Lamm, Claus

    2018-04-01

    The purpose of this study is to examine the psychometric properties of the Epworth Sleepiness Scale (ESS) in two languages, German and English. Students from a university in Austria (N = 292; 55 males; mean age = 18.71 ± 1.71 years; 237 females; mean age = 18.24 ± 0.88 years) and a university in the US (N = 329; 128 males; mean age = 18.71 ± 0.88 years; 201 females; mean age = 21.59 ± 2.27 years) completed the ESS. An exploratory-factor analysis was completed to examine dimensionality of the ESS. Item response theory (IRT) analyses were used to provide information about the response rates on the items on the ESS and provide differential item functioning (DIF) analyses to examine whether the items were interpreted differently between the two languages. The factor analyses suggest that the ESS measures two distinct sleepiness constructs. These constructs indicate that the ESS is probing sleepiness in settings requiring active versus passive responding. The IRT analyses found that overall, the items on the ESS perform well as a measure of sleepiness. However, Item 8 and to a lesser extent Item 6 were being interpreted differently by respondents in comparison to the other items. In addition, the DIF analyses showed that the responses between German and English were very similar indicating that there are only minor measurement differences between the two language versions of the ESS. These findings suggest that the ESS provides a reliable measure of propensity to sleepiness; however, it does convey a two-factor approach to sleepiness. Researchers and clinicians can use the German and English versions of the ESS but may wish to exclude Item 8 when calculating a total sleepiness score.

  17. Measuring Corporate Social Responsibility in Gambling Industry: Multi-Items Stakeholder Based Scales

    Directory of Open Access Journals (Sweden)

    Jian Ming Luo

    2017-11-01

    Macau gambling companies included Corporate Social Responsibility (CSR) information in their annual reports and websites as a marketing tool. Responsible Gambling (RG) had been a recurring issue in Macau’s chief executive report since 2007 and in many of the major gambling operators’ annual reports. The purpose of this study was to develop a measurement scale on CSR activities in Macau. Items on the measurement scale were based on qualitative research with data collected from employees in Macau’s gambling industry and academic literature. First and Second Order confirmatory factor analysis (CFA) were used to verify the reliability and validity of the measurement scale. The results of this study were satisfactory and were supported by empirical evidence. This study provided recommendations to gambling stakeholders, including practitioners, government officers, customers and shareholders, and implications to promote CSR practice in Macau gambling industry.

  18. High Agreement was Obtained Across Scores from Multiple Equated Scales for Social Anxiety Disorder using Item Response Theory.

    Science.gov (United States)

    Sunderland, Matthew; Batterham, Philip; Calear, Alison; Carragher, Natacha; Baillie, Andrew; Slade, Tim

    2018-04-10

    There is no standardized approach to the measurement of social anxiety. Researchers and clinicians are faced with numerous self-report scales with varying strengths, weaknesses, and psychometric properties. The lack of standardization makes it difficult to compare scores across populations that utilise different scales. Item response theory offers one solution to this problem via equating different scales using an anchor scale to set a standardized metric. This study is the first to equate several scales for social anxiety disorder. Data from two samples (n=3,175 and n=1,052), recruited from the Australian community using online advertisements, were utilised to equate a network of 11 self-report social anxiety scales via a fixed parameter item calibration method. Comparisons between actual and equated scores for most of the scales indicated a high level of agreement with mean differences <0.10 (equivalent to a mean difference of less than one point on the standardized metric). This study demonstrates that scores from multiple scales that measure social anxiety can be converted to a common scale. Re-scoring observed scores to a common scale provides opportunities to combine research from multiple studies and ultimately better assess social anxiety in treatment and research settings. Copyright © 2018. Published by Elsevier Inc.

  19. Building an Evaluation Scale using Item Response Theory.

    Science.gov (United States)

    Lalor, John P; Wu, Hao; Yu, Hong

    2016-11-01

    Evaluation of NLP methods requires testing against a previously vetted gold-standard test set and reporting standard metrics (accuracy/precision/recall/F1). The current assumption is that all items in a given test set are equal with regards to difficulty and discriminating power. We propose Item Response Theory (IRT) from psychometrics as an alternative means for gold-standard test-set generation and NLP system evaluation. IRT is able to describe characteristics of individual items - their difficulty and discriminating power - and can account for these characteristics in its estimation of human intelligence or ability for an NLP task. In this paper, we demonstrate IRT by generating a gold-standard test set for Recognizing Textual Entailment. By collecting a large number of human responses and fitting our IRT model, we show that our IRT model compares NLP systems with the performance in a human population and is able to provide more insight into system performance than standard evaluation metrics. We show that a high accuracy score does not always imply a high IRT score, which depends on the item characteristics and the response pattern.
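
    To make "difficulty" and "discriminating power" concrete, the sketch below evaluates a two-parameter logistic (2PL) item response function and its item information. This is a generic textbook form chosen for illustration and is an assumption, since the abstract does not say which IRT model the authors fitted. The function names and example values are hypothetical.

```python
import math


def p_correct_2pl(theta, a, b):
    """Probability of a correct response under a 2PL model.

    theta -- ability of the respondent (human or NLP system)
    a     -- item discrimination
    b     -- item difficulty
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def item_information_2pl(theta, a, b):
    """Fisher information the item contributes at ability theta."""
    p = p_correct_2pl(theta, a, b)
    return a * a * p * (1.0 - p)


# Usage with made-up items: an easy, weakly discriminating item
# and a hard, sharply discriminating one, both at ability 0.5.
easy = p_correct_2pl(0.5, a=0.8, b=-1.0)
hard = p_correct_2pl(0.5, a=2.0, b=1.5)
```

    Under a model of this kind, two respondents with the same raw accuracy can receive different ability estimates if they answered different items correctly, which is consistent with the abstract's remark that a high accuracy score does not always imply a high IRT score.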

  20. Item Response Theory Analyses of the Parent and Teacher Ratings of the DSM-IV ADHD Rating Scale

    Science.gov (United States)

    Gomez, Rapson

    2008-01-01

    The graded response model (GRM), which is based on item response theory (IRT), was used to evaluate the psychometric properties of the inattention and hyperactivity/impulsivity symptoms in an ADHD rating scale. To accomplish this, parents and teachers completed the DSM-IV ADHD Rating Scale (DARS; Gomez et al., "Journal of Child Psychology and…

  1. Use of non-parametric item response theory to develop a shortened version of the Positive and Negative Syndrome Scale (PANSS)

    Science.gov (United States)

    2011-01-01

    Background: Nonparametric item response theory (IRT) was used to examine (a) the performance of the 30 Positive and Negative Syndrome Scale (PANSS) items and their options (levels of severity), (b) the effectiveness of various subscales to discriminate among differences in symptom severity, and (c) the development of an abbreviated PANSS (Mini-PANSS) based on IRT and a method to link scores to the original PANSS. Methods: Baseline PANSS scores from 7,187 patients with Schizophrenia or Schizoaffective disorder who were enrolled between 1995 and 2005 in psychopharmacology trials were obtained. Option characteristic curves (OCCs) and Item Characteristic Curves (ICCs) were constructed to examine the probability of rating each of seven options within each of 30 PANSS items as a function of subscale severity, and summed-score linking was applied to items selected for the Mini-PANSS. Results: The majority of items forming the Positive and Negative subscales (i.e. 19 items) performed very well and discriminated better along symptom severity compared to the General Psychopathology subscale. Six of the seven Positive Symptom items, six of the seven Negative Symptom items, and seven out of the 16 General Psychopathology items were retained for inclusion in the Mini-PANSS. Summed score linking and linear interpolation were able to produce a translation table for comparing total subscale scores of the Mini-PANSS to total subscale scores on the original PANSS. Results show scores on the subscales of the Mini-PANSS can be linked to scores on the original PANSS subscales, with very little bias. Conclusions: The study demonstrated the utility of non-parametric IRT in examining the item properties of the PANSS and in allowing selection of items for an abbreviated PANSS scale. The comparisons between the 30-item PANSS and the Mini-PANSS revealed that the shorter version is comparable to the 30-item PANSS, but when applying IRT, the Mini-PANSS is also a good indicator of illness severity.

  2. Use of non-parametric item response theory to develop a shortened version of the Positive and Negative Syndrome Scale (PANSS).

    Science.gov (United States)

    Khan, Anzalee; Lewis, Charles; Lindenmayer, Jean-Pierre

    2011-11-16

    Nonparametric item response theory (IRT) was used to examine (a) the performance of the 30 Positive and Negative Syndrome Scale (PANSS) items and their options (levels of severity), (b) the effectiveness of various subscales to discriminate among differences in symptom severity, and (c) the development of an abbreviated PANSS (Mini-PANSS) based on IRT and a method to link scores to the original PANSS. Baseline PANSS scores from 7,187 patients with Schizophrenia or Schizoaffective disorder who were enrolled between 1995 and 2005 in psychopharmacology trials were obtained. Option characteristic curves (OCCs) and Item Characteristic Curves (ICCs) were constructed to examine the probability of rating each of seven options within each of 30 PANSS items as a function of subscale severity, and summed-score linking was applied to items selected for the Mini-PANSS. The majority of items forming the Positive and Negative subscales (i.e. 19 items) performed very well and discriminated better along symptom severity compared to the General Psychopathology subscale. Six of the seven Positive Symptom items, six of the seven Negative Symptom items, and seven out of the 16 General Psychopathology items were retained for inclusion in the Mini-PANSS. Summed score linking and linear interpolation were able to produce a translation table for comparing total subscale scores of the Mini-PANSS to total subscale scores on the original PANSS. Results show scores on the subscales of the Mini-PANSS can be linked to scores on the original PANSS subscales, with very little bias. The study demonstrated the utility of non-parametric IRT in examining the item properties of the PANSS and in allowing selection of items for an abbreviated PANSS scale. The comparisons between the 30-item PANSS and the Mini-PANSS revealed that the shorter version is comparable to the 30-item PANSS, but when applying IRT, the Mini-PANSS is also a good indicator of illness severity.

  3. Generalizability theory and item response theory

    NARCIS (Netherlands)

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a

  4. Recommendations to improve the positive and negative syndrome scale (PANSS) based on item response theory.

    Science.gov (United States)

    Levine, Stephen Z; Rabinowitz, Jonathan; Rizopoulos, Dimitris

    2011-08-15

    The adequacy of the Positive and Negative Syndrome Scale (PANSS) items in measuring symptom severity in schizophrenia was examined using Item Response Theory (IRT). Baseline PANSS assessments were analyzed from two multi-center clinical trials of antipsychotic medication in chronic schizophrenia (n=1872). Generally, the results showed that the PANSS (a) item ratings discriminated symptom severity best for the negative symptoms; (b) has an excess of "Severe" and "Extremely severe" rating options; and (c) assessments are more reliable at medium than very low or high levels of symptom severity. Analysis also showed that the detection of statistically and non-statistically significant differences in treatment were highly similar for the original and IRT-modified PANSS. In clinical trials of chronic schizophrenia, the PANSS appears to require the following modifications: fewer rating options, adjustment of 'Lack of judgment and insight', and improved severe symptom assessment. 2011 Elsevier Ltd. All rights reserved.

  5. The Blood Donor Anxiety Scale: a six-item state anxiety measure based on the Spielberger State-Trait Anxiety Inventory.

    Science.gov (United States)

    Chell, Kathleen; Waller, Daniel; Masser, Barbara

    2016-06-01

    Research demonstrates that anxiety elevates the risk of blood donors experiencing adverse events, which in turn deters the performance of repeat blood donations. Identifying donors suffering from heightened state anxiety is important to assess the impact of evidence-based interventions. This study analyzed the appropriateness of a shortened version of the state subscale of the State-Trait Anxiety Inventory (STAI) in a blood donation context. STAI-State questionnaire data were collected from two separate samples of Australian blood donors (n = 919 and n = 824 after cleaning). Responses to demographic, donation history, and adverse reaction questions were also obtained. Identification of items and analysis was performed systematically to assess and compare internal reliability and content, construct, convergent, and criterion validity of three potential short-form state anxiety scales. Of the three short-form scales tested, STAI-State six-item scale demonstrated the best metric properties with the least number of items across both sample groups. Cronbach's alpha was acceptable (α = 0.844 and α = 0.820), correlated positively with the original measure (r = 0.927 and r = 0.931) and criterion-related variables, and maintained the two-dimension factorial structure of the original measure. The six-item short version of the STAI-State subscale presented the most reliable and valid scale for use with blood donors. A validated donor anxiety tool provides a standardized assessment and record of donor anxiety to gauge the effectiveness of ongoing efforts to enhance the donation experience. © 2016 AABB.
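Several records in this listing report Cronbach's alpha as the internal-consistency estimate (for example α = 0.844 and α = 0.820 above). As a minimal sketch of how that coefficient is usually computed, assuming the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores), the snippet below works through a small example. The function name `cronbach_alpha` and the response matrix are hypothetical illustrations, not data or code from any of the studies.

```python
from statistics import pvariance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    # Transpose so that each tuple holds all responses to one item.
    items = list(zip(*rows))
    item_variance_sum = sum(pvariance(col) for col in items)
    # Variance of each respondent's summed score.
    total_variance = pvariance([sum(r) for r in rows])
    return k / (k - 1) * (1 - item_variance_sum / total_variance)


# Hypothetical responses: 4 respondents x 3 items (illustration only).
responses = [
    [1, 2, 1],
    [2, 2, 3],
    [3, 4, 3],
    [4, 4, 4],
]
alpha = cronbach_alpha(responses)
print(round(alpha, 3))
```

Population variances are used for both the items and the totals; using sample variances throughout gives the same ratio, so the choice does not change alpha.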

  6. Development of a Short Version of MSQOL-54 Using Factor Analysis and Item Response Theory.

    Directory of Open Access Journals (Sweden)

    Rosalba Rosato

    The Multiple Sclerosis Quality of Life-54 (MSQOL-54, 52 items grouped in 12 subscales plus two single items) is the most used MS-specific health-related quality of life inventory. The objective was to develop a shortened version of the MSQOL-54. MSQOL-54 dimensionality and metric properties were investigated by confirmatory factor analysis (CFA) and Rasch modelling (Partial Credit Model, PCM) on MSQOL-54s completed by 473 MS patients. Their mean age was 41 years, 65% were women, and median Expanded Disability Status Scale (EDSS) score was 2.0 (range 0-9.5). Differential item functioning (DIF) was evaluated for gender, age and EDSS. Dimensionality of the resulting short version was assessed by exploratory factor analysis (EFA) and CFA. Cognitive debriefing of the short instrument (vs. the original) was then performed on 12 MS patients. CFA of MSQOL-54 subscales showed that the data fitted the overall model well. Two subscales (Role Limitations--Physical, Role Limitations--Emotional) did not fit the PCM, and were removed; two other subscales (Health Perceptions, Social Function) did not fit the model, but were retained as single items. Sexual Satisfaction (single-item subscale) was also removed. The resulting MSQOL-29 consisted of 25 items grouped in 7 subscales, plus 4 single items. PCM fit statistics were within the acceptability range for all MSQOL-29 items except one which had significant DIF by age. EFA and CFA indicated adequate fit to the original two-factor (Physical and Mental Health Composites) hypothesis. Cognitive debriefing confirmed that MSQOL-29 was acceptable and had lost no key items. The proposed MSQOL-29 is 50% shorter than MSQOL-54, yet preserves key quality of life dimensions. Prospective validation on a large, independent MS patient sample is ongoing.

  7. Maslach Burnout Inventory and a Self-Defined, Single-Item Burnout Measure Produce Different Clinician and Staff Burnout Estimates.

    Science.gov (United States)

    Knox, Margae; Willard-Grace, Rachel; Huang, Beatrice; Grumbach, Kevin

    2018-06-04

    Clinicians and healthcare staff report high levels of burnout. Two common burnout assessments are the Maslach Burnout Inventory (MBI) and a single-item, self-defined burnout measure. Relatively little is known about how the measures compare. To identify the sensitivity, specificity, and concurrent validity of the self-defined burnout measure compared to the more established MBI measure. Cross-sectional survey (November 2016-January 2017). Four hundred forty-four primary care clinicians and 606 staff from three San Francisco area healthcare systems. The MBI measure, calculated from a high score on either the emotional exhaustion or cynicism subscale, and a single-item measure of self-defined burnout. Concurrent validity was assessed using a validated, 7-item team culture scale as reported by Willard-Grace et al. (J Am Board Fam Med 27(2):229-38, 2014) and a standard question about workplace atmosphere as reported by Rassolian et al. (JAMA Intern Med 177(7):1036-8, 2017) and Linzer et al. (Ann Intern Med 151(1):28-36, 2009). Similar to other nationally representative burnout estimates, 52% of clinicians (95% CI: 47-57%) and 46% of staff (95% CI: 42-50%) reported high MBI emotional exhaustion or high MBI cynicism. In contrast, 29% of clinicians (95% CI: 25-33%) and 31% of staff (95% CI: 28-35%) reported "definitely burning out" or more severe symptoms on the self-defined burnout measure. The self-defined measure's sensitivity to correctly identify MBI-assessed burnout was 50.4% for clinicians and 58.6% for staff; specificity was 94.7% for clinicians and 92.3% for staff. Area under the receiver operator curve was 0.82 for clinicians and 0.81 for staff. Team culture and atmosphere were significantly associated with both self-defined burnout and the MBI, confirming concurrent validity. Point estimates of burnout notably differ between the self-defined and MBI measures. Compared to the MBI, the self-defined burnout measure misses half of high-burnout clinicians and more

  8. Validity and usefulness of a single-item measure of patient-reported bother from side effects of cancer therapy.

    Science.gov (United States)

    Pearman, Timothy P; Beaumont, Jennifer L; Mroczek, Daniel; O'Connor, Mary; Cella, David

    2018-03-01

    The improving efficacy of cancer treatment has resulted in an increasing array of treatment-related symptoms and associated burdens imposed on individuals undergoing aggressive treatment of their disease. Often, clinical trials compare therapies that have different types, and severities, of adverse effects. Whether rated by clinicians or patients themselves, it can be difficult to know which side effect profile is more disruptive or bothersome to patients. A simple summary index of bother can help to adjudicate the variability in adverse effects across treatments being compared with each other. Across 4 studies, a total of 5765 patients enrolled in cooperative group studies and industry-sponsored clinical trials were the subjects of the current study. Patients were diagnosed with a range of primary cancer sites, including bladder, brain, breast, colon/rectum, head/neck, hepatobiliary, kidney, lung, ovary, pancreas, and prostate as well as leukemia and lymphoma. All patients were administered the Functional Assessment of Cancer Therapy-General version (FACT-G). The single item "I am bothered by side effects of treatment" (GP5), rated on a 5-point Likert scale, is part of the FACT-G. To determine its validity as a useful summary measure from the patient perspective, it was correlated with individual and aggregated clinician-rated adverse events and patient reports of their general ability to enjoy life. Analyses of pharmaceutical trials demonstrated that mean GP5 scores ("I am bothered by side effects of treatment") significantly differed by maximum adverse event grade (P…). Effect sizes ranged from 0.13 to 0.46. Analyses of cooperative group trials demonstrated a significant correlation between GP5 and item GF3 ("I am able to enjoy life") in the predicted direction. The single FACT-G item "I am bothered by side effects of treatment" is significantly associated with clinician-reported adverse events and with patients' ability to enjoy their lives. It has promise as an

  9. Five-Item Francis Scale of Attitude toward Christianity: Construct and Nomological Validity and Internal Consistency among Colombian College Students

    Science.gov (United States)

    Ceballos, Guillermo A.; Suescun, Jesus D.; Oviedo, Heidi C.; Herazo, Edwin; Campo-Arias, Adalberto

    2015-01-01

    The Spanish version of the five-item Francis scale of attitude toward Christianity is a refinement of the short version of the Francis scale of attitude toward Christianity. The scale is a good measurement for intrinsic religiosity. It has been applied previously among Colombian adolescent students. The internal consistency and construct and…

  10. Cross-cultural and sex differences in the Emotional Skills and Competence Questionnaire scales: Challenges of differential item functioning analyses

    Directory of Open Access Journals (Sweden)

    Bo Molander

    2009-11-01

    University students in Croatia, Slovenia, and Sweden (N = 1129) were examined by means of the Emotional Skills and Competence Questionnaire (Takšić, 1998). Results showed a significant effect for the sex factor only on the total-score scale, women scoring higher than men, but significant effects were obtained for country, as well as for sex, on the Express and Label (EL) and Perceive and Understand (PU) subscales. Sweden showed higher scores than Croatia and Slovenia on the EL scale, and Slovenia showed higher scores than Croatia and Sweden on the PU scale. In subsequent analyses of differential item functioning (DIF), comparisons were carried out for pairs of countries. The analyses revealed that a large proportion of the items in the total-score scale were potentially biased, most so for the Croatian-Swedish comparison, less for the Slovenian-Swedish comparison, and least for the Croatian-Slovenian comparison. These findings cast doubt on the validity of mean score differences in comparisons of countries. However, DIF analyses of sex differences within each country show very few DIF items, indicating that the ESCQ instrument works well within each cultural/linguistic setting. Possible explanations of the findings are discussed, and improvements for future studies are suggested.

  11. Reliability, Validity, and Predictive Utility of the 25-Item Criminogenic Cognitions Scale (CCS)

    OpenAIRE

    Tangney, June Price; Stuewig, Jeffrey; Furukawa, Emi; Kopelovich, Sarah; Meyer, Patrick; Cosby, Brandon

    2012-01-01

    Theory, research, and clinical reports suggest that moral cognitions play a role in initiating and sustaining criminal behavior. The 25 item Criminogenic Cognitions Scale (CCS) was designed to tap 5 dimensions: Notions of entitlement; Failure to Accept Responsibility; Short-Term Orientation; Insensitivity to Impact of Crime; and Negative Attitudes Toward Authority. Results from 552 jail inmates support the reliability, validity, and predictive utility of the measure. The CCS was linked to cri...

  12. Validating a shortened depression scale (10 item CES-D) among HIV-positive people in British Columbia, Canada.

    Directory of Open Access Journals (Sweden)

    Wendy Zhang

    OBJECTIVE: To establish the reliability and validity of a shortened (10-item) depression scale used among HIV-positive patients enrolled in the Drug Treatment Program in British Columbia, Canada. METHODS: The 10-item CES-D (Center for Epidemiologic Studies Depression Scale) was examined among 563 participants who initiated antiretroviral therapy (ART) between August 1, 1996 and June 30, 2002. Internal consistency of the scale was measured by Cronbach's alpha. Using the original CES-D 20 as primary criteria, comparisons were made using the Kappa statistic. Predictive accuracy of CES-D 10 was assessed by calculating sensitivity, specificity, positive predictive values and negative predictive values. Factor analysis was also performed to determine if the CES-D 10 contained the same factors of positive and negative affect found in the original development of the CES-D. RESULTS: The correlation between the original and the shortened scale is very high (Spearman correlation coefficient = 0.97; P<0.001). Internal consistency reliability coefficients of the CES-D 10 were satisfactory (Cronbach α=0.88). The CES-D 10 showed comparable accuracy to the original CES-D 20 in classifying participants with depressive symptoms (Kappa=0.82, P<0.001). Sensitivity of CES-D 10 was 91%; specificity was 92%; and positive predictive value was 92%. Factor analysis demonstrates that CES-D 10 contains the same underlying factors of positive and negative affect found in the original development of the CES-D 20. CONCLUSION: The 10-item CES-D is a comparable tool to measure depressive symptoms among HIV-positive research participants.
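The accuracy and agreement statistics named in this abstract (sensitivity, specificity, predictive values, and kappa) can all be derived from a 2x2 table that cross-classifies the short form against the reference scale. The following is a minimal sketch under that assumption; the function name `screening_stats` and the cell counts are hypothetical and are not the study's data.

```python
def screening_stats(tp, fp, fn, tn):
    """Accuracy and agreement statistics from a 2x2 table.

    tp/fn: reference-positive cases flagged / missed by the short form.
    fp/tn: reference-negative cases flagged / correctly cleared.
    """
    n = tp + fp + fn + tn
    p_observed = (tp + tn) / n
    # Chance agreement expected from the marginal totals (Cohen's kappa).
    p_expected = ((tp + fp) * (tp + fn) + (fn + tn) * (fp + tn)) / n ** 2
    return {
        "sensitivity": tp / (tp + fn),
        "specificity": tn / (tn + fp),
        "ppv": tp / (tp + fp),
        "npv": tn / (tn + fn),
        "kappa": (p_observed - p_expected) / (1 - p_expected),
    }


# Hypothetical counts chosen only to make the arithmetic easy to follow.
stats = screening_stats(tp=90, fp=10, fn=10, tn=90)
for name, value in stats.items():
    print(f"{name}: {value:.2f}")
```

Note that kappa corrects the raw agreement (here 0.90) for the agreement expected by chance (here 0.50), which is why it is lower than the proportion of matching classifications.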

  13. Item Response Theory with Covariates (IRT-C): Assessing Item Recovery and Differential Item Functioning for the Three-Parameter Logistic Model

    Science.gov (United States)

    Tay, Louis; Huang, Qiming; Vermunt, Jeroen K.

    2016-01-01

    In large-scale testing, the use of multigroup approaches is limited for assessing differential item functioning (DIF) across multiple variables as DIF is examined for each variable separately. In contrast, the item response theory with covariate (IRT-C) procedure can be used to examine DIF across multiple variables (covariates) simultaneously. To…

  14. Evaluation of the Fecal Incontinence Quality of Life Scale (FIQL) using item response theory reveals limitations and suggests revisions.

    Science.gov (United States)

    Peterson, Alexander C; Sutherland, Jason M; Liu, Guiping; Crump, R Trafford; Karimuddin, Ahmer A

    2018-06-01

    The Fecal Incontinence Quality of Life Scale (FIQL) is a commonly used patient-reported outcome measure for fecal incontinence, often used in clinical trials, yet has not been validated in English since its initial development. This study uses modern methods to thoroughly evaluate the psychometric characteristics of the FIQL and its potential for differential functioning by gender. This study analyzed prospectively collected patient-reported outcome data from a sample of patients prior to colorectal surgery. Patients were recruited from 14 general and colorectal surgeons in Vancouver Coastal Health hospitals in Vancouver, Canada. Confirmatory factor analysis was used to assess construct validity. Item response theory was used to evaluate test reliability, describe item-level characteristics, identify local item dependence, and test for differential functioning by gender. 236 patients were included for analysis, with mean age 58 and approximately half female. Factor analysis failed to identify the lifestyle, coping, depression, and embarrassment domains, suggesting lack of construct validity. Items demonstrated low difficulty, indicating that the test has the highest reliability among individuals who have low quality of life. Five items are suggested for removal or replacement. Differential test functioning was minimal. This study has identified specific improvements that can be made to each domain of the Fecal Incontinence Quality of Life Scale and to the instrument overall. Formatting, scoring, and instructions may be simplified, and items with higher difficulty developed. The lifestyle domain can be used as is. The embarrassment domain should be significantly revised before use.

  15. Using an FSDS-R Item to Screen for Sexually Related Distress: A MsFLASH Analysis

    Directory of Open Access Journals (Sweden)

    Janet S. Carpenter, PhD, RN, FAAN

    2015-03-01

    Conclusions: A single FSDS-R item may be a useful screening tool to quickly identify midlife women with sexually related distress when it is not feasible to administer the entire scale, though further validation is warranted. Carpenter JS, Reed SD, Guthrie KA, Larson JC, Newton KM, Lau RJ, Learman LA, and Shifren JL. Using an FSDS-R item to screen for sexually related distress: A MsFLASH analysis. Sex Med 2015;3:7–13.

  16. An approach for estimating item sensitivity to within-person change over time: An illustration using the Alzheimer's Disease Assessment Scale-Cognitive subscale (ADAS-Cog).

    Science.gov (United States)

    Dowling, N Maritza; Bolt, Daniel M; Deng, Sien

    2016-12-01

    When assessments are primarily used to measure change over time, it is important to evaluate items according to their sensitivity to change, specifically. Items that demonstrate good sensitivity to between-person differences at baseline may not show good sensitivity to change over time, and vice versa. In this study, we applied a longitudinal factor model of change to a widely used cognitive test designed to assess global cognitive status in dementia, and contrasted the relative sensitivity of items to change. Statistically nested models were estimated introducing distinct latent factors related to initial status differences between test-takers and within-person latent change across successive time points of measurement. Models were estimated using all available longitudinal item-level data from the Alzheimer's Disease Assessment Scale-Cognitive subscale, including participants representing the full-spectrum of disease status who were enrolled in the multisite Alzheimer's Disease Neuroimaging Initiative. Five of the 13 Alzheimer's Disease Assessment Scale-Cognitive items demonstrated noticeably higher loadings with respect to sensitivity to change. Attending to performance change on only these 5 items yielded a clearer picture of cognitive decline more consistent with theoretical expectations in comparison to the full 13-item scale. Items that show good psychometric properties in cross-sectional studies are not necessarily the best items at measuring change over time, such as cognitive decline. Applications of the methodological approach described and illustrated in this study can advance our understanding regarding the types of items that best detect fine-grained early pathological changes in cognition. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  17. The profile of selected samples of Croatian athletes based on the items of sport jealousy scale (SJS)

    Directory of Open Access Journals (Sweden)

    Sindik Joško

    2016-01-01

    The role of jealousy in sport, as a negative emotional reaction accompanied by thoughts of inadequacy when compared to others, is the issue of this article. The purpose of this study was to define the characteristic profiles of Croatian athletes, based on single items of the Sport Jealousy Scale (SJS-II), labeled by several variables: gender, type of sport, age group. A purposive sample of 73 athletes competing at Croatian championships in different sports (football, bowling, volleyball and handball) was examined with the Croatian version of the SJS-II. The three clusters obtained are similarly balanced, according to the number of cases in each cluster. Put most simply, the clusters clearly differentiate the most jealous, moderately jealous and slightly/low jealous athletes. Among the features of the athletes in each cluster, the most jealous (first) cluster contains athletes from team sports, women and older athletes. Females, bowling athletes, athletes from individual (coactive) sports and the youngest athletes are the least jealous (grouped in the third cluster).

  18. Evaluation of the Psychometric Properties of the Asian Adolescent Depression Scale and Construction of a Short Form: An Item Response Theory Analysis.

    Science.gov (United States)

    Lo, Barbara Chuen Yee; Zhao, Yue; Kwok, Alice Wai Yee; Chan, Wai; Chan, Calais Kin Yuen

    2017-07-01

    The present study applied item response theory to examine the psychometric properties of the Asian Adolescent Depression Scale and to construct a short form among 1,084 teenagers recruited from secondary schools in Hong Kong. Findings suggested that some items of the full form reflected higher levels of severity and were more discriminating than others, and the Asian Adolescent Depression Scale was useful in measuring a broad range of depressive severity in community youths. Differential item functioning emerged in several items where females reported higher depressive severity than males. In the short form construction, preliminary validation suggested that, relative to the 20-item full form, our derived short form offered significantly greater diagnostic performance and stronger discriminatory ability in differentiating depressed and nondepressed groups, and simultaneously maintained adequate measurement precision with a reduced response burden in assessing depression in the Asian adolescents. Cultural variance in depressive symptomatology and clinical implications are discussed.

  19. A simulation study provided sample size guidance for differential item functioning (DIF) studies using short scales

    DEFF Research Database (Denmark)

    Scott, Neil W.; Fayers, Peter M.; Bottomley, Andrew

    2009-01-01

    Differential item functioning (DIF) analyses are increasingly used to evaluate health-related quality of life (HRQoL) instruments, which often include relatively short subscales. Computer simulations were used to explore how various factors including scale length affect analysis of DIF by ordinal logistic regression....

  20. A scale purification procedure for evaluation of differential item functioning

    NARCIS (Netherlands)

    Khalid, Muhammad Naveed; Glas, Cornelis A.W.

    2014-01-01

    Item bias or differential item functioning (DIF) has an important impact on the fairness of psychological and educational testing. In this paper, DIF is seen as a lack of fit to an item response (IRT) model. Inferences about the presence and importance of DIF require a process of so-called test

  1. [Impact of passing items above the ceiling on the assessment results of Peabody developmental motor scales].

    Science.gov (United States)

    Zhao, Gai; Bian, Yang; Li, Ming

    2013-12-18

    To analyze the impact of passing items above the ceiling level in the gross motor subtest of the Peabody Developmental Motor Scales (PDMS-2) on its assessment results. The PDMS-2 subtests were administered to 124 children aged 1.2 to 71 months. In addition to the original scoring method, a new scoring method that includes passing items above the ceiling was developed. The standard scores and quotients of the two scoring methods were compared using the independent-samples t test. Only one child passed items above the ceiling in the stationary subtest, compared with 19 children in the locomotion subtest and 17 children in the visual-motor integration subtest. When the scores of these passing items were included in the raw scores, the total raw scores increased by 1-12 points, the standard scores by 0-1 points and the motor quotients by 0-3 points. The diagnostic classification changed in only two children. There was no significant difference between the two methods in motor quotients or standard scores in the specific subtests (P>0.05). Passing items above the ceiling of the PDMS-2 is not a rare situation. It usually occurs in the locomotion and visual-motor integration subtests. Including these passing items in the scoring system does not make a significant difference in the subtest standard scores or the developmental motor quotients (DMQ), which supports the original ceiling rule of 3 consecutive items not passed. However, including the passing items above the ceiling in the raw score will improve tracking of children's developmental trajectory and intervention effects.

  2. Item response theory analysis of Centers for Disease Control and Prevention Health-Related Quality of Life (CDC HRQOL) items in adults with arthritis.

    Science.gov (United States)

    Mielenz, Thelma J; Callahan, Leigh F; Edwards, Michael C

    2016-03-12

    Examine the feasibility of performing an item response theory (IRT) analysis on two of the Centers for Disease Control and Prevention health-related quality of life (CDC HRQOL) modules - the 4-item Healthy Days Core Module (HDCM) and the 5-item Healthy days Symptoms Module (HDSM). Previous principal components analyses confirm that the two scales both assess a mix of mental (CDC-MH) and physical health (CDC-PH). The purpose is to conduct item response theory (IRT) analysis on the CDC-MH and CDC-PH scales separately. 2182 patients with self-reported or physician-diagnosed arthritis completed a cross-sectional survey including HDCM and HDSM items. Besides global health, the other 8 items ask the number of days that some statement was true; we chose to recode the data into 8 categories based on observed clustering. The IRT assumptions were assessed using confirmatory factor analysis and the data could be modeled using a unidimensional IRT model. The graded response model was used for IRT analyses and CDC-MH and CDC-PH scales were analyzed separately in flexMIRT. The IRT parameter estimates for the five-item CDC-PH all appeared reasonable. The three-item CDC-MH did not have reasonable parameter estimates. The CDC-PH scale is amenable to IRT analysis but the existing CDC-MH scale is not. We suggest either using the 4-item Healthy Days Core Module (HDCM) and the 5-item Healthy days Symptoms Module (HDSM) as they currently stand or the CDC-PH scale alone if the primary goal is to measure physical health related HRQOL.

  3. Psychometric Properties of the Heart Disease Knowledge Scale: Evidence from Item and Confirmatory Factor Analyses.

    Science.gov (United States)

    Lim, Bee Chiu; Kueh, Yee Cheng; Arifin, Wan Nor; Ng, Kok Huan

    2016-07-01

    Heart disease knowledge is an important concept for health education, yet there is a lack of evidence on properly validated instruments used to measure levels of heart disease knowledge in the Malaysian context. A cross-sectional survey design was used to examine the psychometric properties of the adapted English version of the Heart Disease Knowledge Questionnaire (HDKQ). Using proportionate cluster sampling, 788 undergraduate students at Universiti Sains Malaysia, Malaysia, were recruited and completed the HDKQ. Item analysis and confirmatory factor analysis (CFA) were used for the psychometric evaluation. Construct validity of the measurement model was included. Most of the students were Malay (48%), female (71%), and from the field of science (51%). An acceptable range was obtained with respect to both the difficulty and discrimination indices in the item analysis results. The difficulty index ranged from 0.12 to 0.91, and a discrimination index of ≥ 0.20 was reported for the final retained 23 items. The final CFA model showed an adequate fit to the data, yielding a 23-item, one-factor model [weighted least squares mean and variance adjusted scaled chi-square difference = 1.22, degrees of freedom = 2, P-value = 0.544, the root mean square error of approximation = 0.03 (90% confidence interval = 0.03, 0.04); close-fit P-value > 0.950]. Adequate psychometric values were obtained for Malaysian undergraduate university students using the 23-item, one-factor model of the adapted HDKQ.
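The abstract reports difficulty and discrimination indices without saying how they were computed. As an illustration only, the sketch below assumes the classical definitions: difficulty as the proportion of respondents answering an item correctly, and discrimination as the difference in that proportion between the top and bottom 27% of respondents ranked by total score. The function name `item_analysis`, the 27% cut, and the response data are all assumptions, not details from the study.

```python
def item_analysis(scores, group_fraction=0.27):
    """Classical item analysis for dichotomous (0/1) item scores.

    Returns one (difficulty, discrimination) pair per item.
    """
    n = len(scores)
    k = len(scores[0])
    group_size = max(1, round(n * group_fraction))
    # Rank respondents by total score, highest first.
    ranked = sorted(scores, key=sum, reverse=True)
    upper, lower = ranked[:group_size], ranked[-group_size:]
    results = []
    for j in range(k):
        difficulty = sum(r[j] for r in scores) / n
        discrimination = (
            sum(r[j] for r in upper) - sum(r[j] for r in lower)
        ) / group_size
        results.append((difficulty, discrimination))
    return results


# Hypothetical answers: 10 respondents x 3 items (1 = correct).
answers = [
    [1, 1, 1], [1, 1, 1], [1, 1, 0], [1, 1, 0], [1, 0, 1],
    [1, 0, 0], [1, 0, 0], [0, 1, 0], [0, 0, 0], [0, 0, 0],
]
indices = item_analysis(answers)
for item_number, (p, d) in enumerate(indices, start=1):
    print(f"item {item_number}: difficulty={p:.2f}, discrimination={d:.2f}")
```

Under these definitions a higher difficulty index means an easier item, and a discrimination index near zero means high and low scorers answer the item correctly at similar rates.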

  4. Generalizability theory and item response theory

    OpenAIRE

    Glas, Cornelis A.W.; Eggen, T.J.H.M.; Veldkamp, B.P.

    2012-01-01

    Item response theory is usually applied to items with a selected-response format, such as multiple choice items, whereas generalizability theory is usually applied to constructed-response tasks assessed by raters. However, in many situations, raters may use rating scales consisting of items with a selected-response format. This chapter presents a short overview of how item response theory and generalizability theory were integrated to model such assessments. Further, the precision of the esti...

  5. Subjective caregiver burden: validity of the 10-item short version of the Burden Scale for Family Caregivers BSFC-s.

    Science.gov (United States)

    Graessel, Elmar; Berth, Hendrik; Lichte, Thomas; Grau, Hannes

    2014-02-20

    Subjective burden is a central variable describing the situation encountered by family caregivers. The 10-item short version of the Burden Scale for Family Caregivers (BSFC-short/BSFC-s) was developed to provide an economical measure of this variable. The present study examined the reliability and validity of the BSFC-s. Comprehensive data from "the IDA project" were the basis of the calculations, which included 351 dyads and examined medical data on people with dementia, interview data from their family caregivers, and health insurance data. A factor analysis was performed to explore the structure of the BSFC-s; Cronbach's alpha was used to evaluate the internal consistency of the scale. The items were analyzed to determine the item difficulty and the discriminatory power. Construct validity was tested with five hypotheses. To establish the predictive validity of the BSFC-s, predictors of institutionalization at a follow-up time of 2.5 years were analyzed (binary logistic regression). The BSFC-s score adhered to a one-factor structure. Cronbach's alpha for the complete scale was .92. A significant increase in the BSFC-s score was observed when dementia progressed, disturbing behavior occurred more frequently, care requirements increased, and when caregivers were diagnosed with depression. Caregiver burden was the second strongest predictor of institutionalization out of a total of four significant predictors. All hypotheses that referred to the construct validity were supported. The BSFC-short with its ten items is a very economical instrument for assessing the caregiver's total subjective burden in a short time frame. The BSFC-s score has predictive validity for the institutionalization of people with dementia. Therefore it is an appropriate outcome measure to evaluate caregiver interventions. The scale is available for free in 20 languages (http://www.caregiver-burden.eu). This availability facilitates the comparison of international research findings.

  6. Measurement properties of the WOMAC LK 3.1 pain scale.

    Science.gov (United States)

    Stratford, P W; Kennedy, D M; Woodhouse, L J; Spadoni, G F

    2007-03-01

    The Western Ontario and McMaster Universities Osteoarthritis Index (WOMAC) is applied extensively to patients with osteoarthritis of the hip or knee. Previous work has challenged the validity of its physical function scale; however, an extensive evaluation of its pain scale has not been reported. Our purpose was to estimate internal consistency, factorial validity, test-retest reliability, and the standard error of measurement (SEM) of the WOMAC LK 3.1 pain scale. Four hundred and seventy-four patients with osteoarthritis of the hip or knee awaiting arthroplasty were administered the WOMAC. Estimates of internal consistency (coefficient alpha), factorial validity (confirmatory factor analysis), and the SEM based on internal consistency (SEM(IC)) were obtained. Test-retest reliability [Type 2,1 intraclass correlation coefficients (ICC)] and a corresponding SEM(TRT) were estimated on a subsample of 36 patients. Our estimates were: internal consistency alpha=0.84; SEM(IC)=1.48; Type 2,1 ICC=0.77; SEM(TRT)=1.69. Confirmatory factor analysis failed to support a single factor structure of the pain scale with uncorrelated error terms. Two comparable models provided excellent fit: (1) a model with correlated error terms between the walking and stairs items, and between night and sit items (chi2=0.18, P=0.98); (2) a two factor model with walking and stairs items loading on one factor, night and sit items loading on a second factor, and the standing item loading on both factors (chi2=0.18, P=0.98). Our examination of the factorial structure of the WOMAC pain scale failed to support a single factor and internal consistency analysis yielded a coefficient less than optimal for individual patient use. An alternate strategy to summing the five-item responses when considering individual patient application would be to interpret item responses separately or to sum only those items which display homogeneity.
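
    As a hedged illustration of how a reliability coefficient relates to a standard error of measurement, the sketch below uses the common classical-test-theory formula SEM = SD * sqrt(1 - reliability). Whether the authors used exactly this formula is an assumption, and the abstract does not report the scale's standard deviation; the function names are hypothetical.

```python
import math


def sem(sd, reliability):
    """Standard error of measurement from a score SD and a reliability coefficient."""
    if not 0.0 <= reliability <= 1.0:
        raise ValueError("reliability must lie in [0, 1]")
    return sd * math.sqrt(1.0 - reliability)


def implied_sd(sem_value, reliability):
    """Score SD implied by an SEM and a reliability, under the same formula."""
    if not 0.0 <= reliability < 1.0:
        raise ValueError("reliability must lie in [0, 1)")
    return sem_value / math.sqrt(1.0 - reliability)


# Reported values from the abstract: alpha = 0.84, SEM(IC) = 1.48.
# The SD below is inferred under the assumed formula, not reported by the study.
sd_full_sample = implied_sd(1.48, 0.84)
print(round(sd_full_sample, 2))
print(round(sem(sd_full_sample, 0.84), 2))
```

    The test-retest SEM was estimated on a 36-patient subsample whose SD is not given, so it is not recomputed here.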

  7. Investigating Separate and Concurrent Approaches for Item Parameter Drift in 3PL Item Response Theory Equating

    Science.gov (United States)

    Arce-Ferrer, Alvaro J.; Bulut, Okan

    2017-01-01

    This study examines separate and concurrent approaches to combine the detection of item parameter drift (IPD) and the estimation of scale transformation coefficients in the context of the common item nonequivalent groups design with the three-parameter item response theory equating. The study uses real and synthetic data sets to compare the two…

  8. Item response theory analyses of the Delis-Kaplan Executive Function System card sorting subtest.

    Science.gov (United States)

    Spencer, Mercedes; Cho, Sun-Joo; Cutting, Laurie E

    2018-02-02

    In the current study, we examined the dimensionality of the 16-item Card Sorting subtest of the Delis-Kaplan Executive Functioning System assessment in a sample of 264 native English-speaking children between the ages of 9 and 15 years. We also tested for measurement invariance for these items across age and gender groups using item response theory (IRT). Results of the exploratory factor analysis indicated that a two-factor model that distinguished between verbal and perceptual items provided the best fit to the data. Although the items demonstrated measurement invariance across age groups, measurement invariance was violated for gender groups, with two items demonstrating differential item functioning for males and females. Multigroup analysis using all 16 items indicated that the items were more effective for individuals whose IRT scale scores were relatively high. A single-group explanatory IRT model using 14 non-differential item functioning items showed that for perceptual ability, females scored higher than males and that scores increased with age for both males and females; for verbal ability, the observed increase in scores across age differed for males and females. The implications of these findings are discussed.

  9. Validation of the JDS satisfaction scales applied to educational university environments

    Directory of Open Access Journals (Sweden)

    Martha Giraldo-O'Meara

    2014-01-01

    Purpose: The aim of this study is to review and summarize the main satisfaction scales used in publications about human resource management and educational research, in order to adapt the satisfaction scales of the Job Diagnostic Survey (JDS) to higher education and validate them with a sample of university students, and to assess the concept of satisfaction in two different ways: as a single-item measure, with a global indicator, and as a multi-item measure, analyzed as a global model composed of several scales. Design/methodology/approach: Confirmatory factor analysis with maximum likelihood estimation, using structural equation modeling, was employed to assess the model fit in 152 business management undergraduates. Findings: The satisfaction model measured as a multi-item scale presents an acceptable fit. Although some of the satisfaction scales did not present a satisfactory fit, they can be used and interpreted independently with care. Nevertheless, the single-item satisfaction scale presents a better fit and has been validated as a simpler and less costly measure of satisfaction. Originality/value: In the current process of change taking place in universities under the plan developed by the European Space of Higher Education, validated instruments such as the JDS satisfaction scales, adapted to teaching, may facilitate this process through the diagnosis and follow-up of changes in satisfaction levels in university classrooms.

  10. Development and psychometric characteristics of the SCI-QOL Bladder Management Difficulties and Bowel Management Difficulties item banks and short forms and the SCI-QOL Bladder Complications scale.

    Science.gov (United States)

    Tulsky, David S; Kisala, Pamela A; Tate, Denise G; Spungen, Ann M; Kirshblum, Steven C

    2015-05-01

    To describe the development and psychometric properties of the Spinal Cord Injury--Quality of Life (SCI-QOL) Bladder Management Difficulties and Bowel Management Difficulties item banks and Bladder Complications scale. Using a mixed-methods design, a pool of items assessing bladder and bowel-related concerns were developed using focus groups with individuals with spinal cord injury (SCI) and SCI clinicians, cognitive interviews, and item response theory (IRT) analytic approaches, including tests of model fit and differential item functioning. Thirty-eight bladder items and 52 bowel items were tested at the University of Michigan, Kessler Foundation Research Center, the Rehabilitation Institute of Chicago, the University of Washington, Craig Hospital, and the James J. Peters VA Medical Center, Bronx, NY. Seven hundred fifty-seven adults with traumatic SCI. The final item banks demonstrated unidimensionality (Bladder Management Difficulties CFI=0.965; RMSEA=0.093; Bowel Management Difficulties CFI=0.955; RMSEA=0.078) and acceptable fit to a graded response IRT model. The final calibrated Bladder Management Difficulties bank includes 15 items, and the final Bowel Management Difficulties item bank consists of 26 items. Additionally, 5 items related to urinary tract infections (UTI) did not fit with the larger Bladder Management Difficulties item bank but performed relatively well independently (CFI=0.992, RMSEA=0.050) and were thus retained as a separate scale. The SCI-QOL Bladder Management Difficulties and Bowel Management Difficulties item banks are psychometrically robust and are available as computer adaptive tests or short forms. The SCI-QOL Bladder Complications scale is a brief, fixed-length outcomes instrument for individuals with a UTI.

  11. Item Response Theory Analysis of the Psychopathic Personality Inventory-Revised.

    Science.gov (United States)

    Eichenbaum, Alexander E; Marcus, David K; French, Brian F

    2017-06-01

    This study examined item and scale functioning in the Psychopathic Personality Inventory-Revised (PPI-R) using an item response theory analysis. PPI-R protocols from 1,052 college student participants (348 male, 704 female) were analyzed. Analyses were conducted on the 131 self-report items comprising the PPI-R's eight content scales, using a graded response model. Scales collected a majority of their information about respondents possessing higher than average levels of the traits being measured. Each scale contained at least some items that evidenced limited ability to differentiate between respondents with differing levels of the trait being measured. Moreover, 80 items (61.1%) yielded significantly different responses between men and women presumably possessing similar levels of the trait being measured. Item performance was also influenced by the scoring format (directly scored vs. reverse-scored) of the items. Overall, the results suggest that the PPI-R, despite identifying psychopathic personality traits in individuals possessing high levels of those traits, may not identify these traits equally well for men and women, and scores are likely influenced by the scoring format of the individual item and scale.

  12. Problem of item overlap between the psychopathy screening device and attention deficit hyperactivity disorder, oppositional defiant disorder, and conduct disorder rating scales.

    Science.gov (United States)

    Burns, G L

    2000-12-01

    Content validity requires a clear definition of the construct of interest and the delineation of the construct from similar constructs. Content validity also requires that the items be representative of the construct as well as specific to the construct. An examination of the items on the Psychopathy Screening Device (PSD), a parent- and teacher-rating scale of childhood psychopathy, indicates significant overlap with the symptoms and associated features of attention deficit hyperactivity disorder (ADHD), oppositional defiant disorder (ODD), and conduct disorder (CD). The failure of the PSD to have unique items results in poor discriminant validity with ADHD, ODD, and CD rating scales. More careful attention to content validation guidelines is required to develop a more useful measure of childhood psychopathy.

  13. Validation of a 15-item care-related regret coping scale for health-care professionals (RCS-HCP).

    Science.gov (United States)

    Courvoisier, Delphine Sophie; Cullati, Stephane; Ouchi, Rieko; Schmidt, Ralph Eric; Haller, Guy; Chopard, Pierre; Agoritsas, Thomas; Perneger, Thomas V

    2014-01-01

    Coping with difficult care-related situations is a common challenge for health-care professionals. How these professionals deal with the regrets they may experience following one of the many decisions and interventions they must make every day can have an impact on their own health and quality of life, and also on their patient care practices. To identify professionals most in need of extra support, development and validation of a tool measuring coping style are needed. We performed a survey of physicians and nurses of a French-speaking university hospital; 469 health-care professionals responded to the survey, and 175 responded to the same survey one month later. Regret was assessed with the regret coping scale developed for this study, self-report questions on the frequency of regretted situations and the intensity of regret. Construct validity was assessed using measures of health-care professionals' quality of life (including job and life satisfaction, and self-reported health) as well as sleep problems and depression. Based on factor analysis and item response analysis, the initial 31-item scale was shortened to 15 items, which measured three types of strategies: problem-focused strategies (e.g., trying to find solutions, talking to colleagues) and two types of emotion-focused strategies, A (e.g., self-blame, rumination) and B (e.g., acceptance, emotional distance). All subscales showed high internal consistency (α >0.85). Overall, as expected, problem-focused and emotion-focused B strategies correlated with higher quality of life, fewer sleep problems and less depression, and emotion-focused A strategies showed the opposite pattern. The regret coping scale (RCS-HCP) is a valid and reliable measure of coping abilities of hospital-based health-care professionals.

  14. Item response theory analysis of the Utrecht Work Engagement Scale for Students (UWES-S) using a sample of Japanese university and college students majoring medical science, nursing, and natural science.

    Science.gov (United States)

    Tsubakita, Takashi; Shimazaki, Kazuyo; Ito, Hiroshi; Kawazoe, Nobuo

    2017-10-30

    The Utrecht Work Engagement Scale for Students has been used internationally to assess students' academic engagement, but it has not been analyzed via item response theory. The purpose of this study was to conduct an item response theory analysis of the Japanese version of the Utrecht Work Engagement Scale for Students, translated by the authors. Using a two-parameter model and Samejima's graded response model, difficulty and discrimination parameters were estimated after confirming the factor structure of the scale. The 14 items on the scale were analyzed with a sample of 3214 university and college students majoring in medical science, nursing, or natural science in Japan. The preliminary parameter estimation was conducted with the two-parameter model and indicated that three items should be removed because their parameters were outliers. Final parameter estimation was conducted using the remaining 11 items and indicated that all difficulty and discrimination parameters were acceptable. The test information curve suggested that the scale assesses higher engagement better than average engagement. The estimated parameters provide a basis for future comparative studies. The results also suggested that a 7-point Likert scale is too broad; thus, the response format should be modified to use fewer categories.
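
    For readers unfamiliar with Samejima's graded response model mentioned above, the following minimal sketch computes category probabilities for one item. The function name, the discrimination value, and the threshold values are hypothetical and are not estimates from the study; the model form used is the usual logistic one, P(X >= k | theta) = 1 / (1 + exp(-a * (theta - b_k))).

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Graded response model: P(X = k | theta) for k = 0..len(thresholds).

    a          -- item discrimination parameter
    thresholds -- increasing list of category boundary (difficulty) parameters
    """
    if list(thresholds) != sorted(thresholds):
        raise ValueError("thresholds must be in increasing order")

    def logistic(x):
        return 1.0 / (1.0 + math.exp(-x))

    # Cumulative probabilities P(X >= k), padded with 1 (k = 0) and 0 (beyond top).
    cum = [1.0] + [logistic(a * (theta - b)) for b in thresholds] + [0.0]
    # Each category probability is the difference of adjacent cumulative curves.
    return [cum[k] - cum[k + 1] for k in range(len(cum) - 1)]


# A 7-point item needs 6 thresholds; all values here are made up for illustration.
probs = grm_category_probs(theta=0.0, a=1.5, thresholds=[-2, -1, -0.5, 0.5, 1, 2])
print([round(p, 3) for p in probs])
```

    Closely spaced thresholds leave the middle categories with little probability mass at any trait level, which is one way a response format can turn out to have more categories than respondents actually distinguish.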

  15. Spanish validation of the 10-item Connor-Davidson Resilience Scale (CD-RISC 10) with non-professional caregivers.

    Science.gov (United States)

    Blanco, Vanessa; Guisande, María Adelina; Sánchez, María Teresa; Otero, Patricia; Vázquez, Fernando L

    2017-11-08

    Despite the importance of resilience in populations under stress, and the fact that the 10-item version of the Connor-Davidson Resilience Scale (CD-RISC 10) is the shortest instrument for reliable and valid evaluation of resilience, there are no data on its psychometric properties in non-professional caregivers. The aim of this study was to analyze the psychometric properties and factorial structure of the Spanish version of the CD-RISC 10 in non-professional caregivers. Independently trained assessors evaluated resilience, self-esteem, social support, emotional distress and depression in a sample of 294 caregivers (89.8% women, mean age 55.3 years). The internal consistency of the CD-RISC 10 was α = .86. A single factor was found that accounted for 44.7% of the total variance. Confirmatory factor analysis corroborated this unifactorial model. The CD-RISC 10 was significantly correlated with the self-esteem (r = .416, p […] caregivers with depression (sensitivity = 70.0%, specificity = 68.2%). The CD-RISC 10 is a reliable and valid instrument to evaluate resilience in the caregiver population.

  16. Screening instruments for a population of older adults: The 10-item Kessler Psychological Distress Scale (K10) and the 7-item Generalized Anxiety Disorder Scale (GAD-7).

    Science.gov (United States)

    Vasiliadis, Helen-Maria; Chudzinski, Veronica; Gontijo-Guerra, Samantha; Préville, Michel

    2015-07-30

    Screening tools that appropriately detect older adults' mental disorders are of great public health importance. The present study aimed to establish cutoff scores for the 10-item Kessler Psychological Distress (K10) and the 7-item Generalized Anxiety Disorder (GAD-7) scales when screening for depression and anxiety. We used data from participants (n = 1811) in the Enquête sur la Santé des Aînés-Service study. Depression and anxiety were measured using DSM-V and DSM-IV criteria. Receiver operating characteristic (ROC) curve analysis provided an area under the curve (AUC) of 0.767 and 0.833 for minor and for major depression when using K10. A cutoff of 19 was found to balance sensitivity (0.794) and specificity (0.664) for minor depression, whereas a cutoff of 23 was found to balance sensitivity (0.692) and specificity (0.811) for major depression. When screening for anxiety with GAD-7, ROC analysis yielded an AUC of 0.695; a cutoff of 5 was found to balance sensitivity (0.709) and specificity (0.568). No significant differences were found between subgroups of age and gender. Both K10 and GAD-7 were able to discriminate between cases and non-cases when screening for depression and anxiety in an older adult population of primary care service users.
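
    The trade-off between sensitivity and specificity at different cutoffs, as described above, can be sketched as follows. The function name, scores, and case labels are hypothetical and are not data from the study; a score at or above the cutoff is treated as a positive screen, which is an assumption about the scoring convention.

```python
def sensitivity_specificity(scores, is_case, cutoff):
    """Sensitivity and specificity when 'score >= cutoff' counts as screen-positive."""
    tp = sum(1 for s, c in zip(scores, is_case) if c and s >= cutoff)
    fn = sum(1 for s, c in zip(scores, is_case) if c and s < cutoff)
    tn = sum(1 for s, c in zip(scores, is_case) if not c and s < cutoff)
    fp = sum(1 for s, c in zip(scores, is_case) if not c and s >= cutoff)
    if tp + fn == 0 or tn + fp == 0:
        raise ValueError("need at least one case and one non-case")
    return tp / (tp + fn), tn / (tn + fp)


# Made-up screening scores and diagnostic status, for illustration only.
scores = [12, 15, 18, 19, 21, 22, 24, 27]
is_case = [False, False, False, True, False, True, True, True]

for cutoff in (19, 23):
    sens, spec = sensitivity_specificity(scores, is_case, cutoff)
    print(cutoff, round(sens, 3), round(spec, 3))
```

    Raising the cutoff lowers sensitivity and raises specificity, which is why a study has to pick a cutoff that balances the two for its screening purpose.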

  17. Face Validity of the Single Work Ability Item: Comparison with Objectively Measured Heart Rate Reserve over Several Days

    Science.gov (United States)

    Gupta, Nidhi; Jensen, Bjørn Søvsø; Søgaard, Karen; Carneiro, Isabella Gomes; Christiansen, Caroline Stordal; Hanisch, Christiana; Holtermann, Andreas

    2014-01-01

    Purpose: The purpose of this study was to investigate the face validity of the self-reported single item work ability with objectively measured heart rate reserve (%HRR) among blue-collar workers. Methods: We utilized data from 127 blue-collar workers (Female = 53; Male = 74) aged 18–65 years from the cross-sectional “New method for Objective Measurements of physical Activity in Daily living (NOMAD)” study. The workers reported their single item work ability and completed an aerobic capacity cycling test and objective measurements of heart rate reserve monitored with Actiheart for 3–4 days with a total of 5,810 h, including 2,640 working hours. Results: A significant moderate correlation between work ability and %HRR was observed among males (R = −0.33, P = 0.005), but not among females (R = 0.11, P = 0.431). In a gender-stratified multi-adjusted logistic regression analysis, males with high %HRR were more likely to report a reduced work ability compared to males with low %HRR [OR = 4.75, 95% confidence interval (95% CI) = 1.31 to 17.25]. However, this association was not found among females (OR = 0.26, 95% CI 0.03 to 2.16), and a significant interaction between work ability, %HRR and gender was observed (P = 0.03). Conclusions: The observed association between work ability and objectively measured %HRR over several days among male blue-collar workers supports the face validity of the single work ability item. It is a useful and valid measure of the relation between physical work demands and resources among male blue-collar workers. The contrasting association among females needs to be further investigated. PMID:24840350

  18. Face Validity of the Single Work Ability Item: Comparison with Objectively Measured Heart Rate Reserve over Several Days

    Directory of Open Access Journals (Sweden)

    Nidhi Gupta

    2014-05-01

    Purpose: The purpose of this study was to investigate the face validity of the self-reported single item work ability with objectively measured heart rate reserve (%HRR) among blue-collar workers. Methods: We utilized data from 127 blue-collar workers (Female = 53; Male = 74) aged 18–65 years from the cross-sectional “New method for Objective Measurements of physical Activity in Daily living (NOMAD)” study. The workers reported their single item work ability and completed an aerobic capacity cycling test and objective measurements of heart rate reserve monitored with Actiheart for 3–4 days with a total of 5,810 h, including 2,640 working hours. Results: A significant moderate correlation between work ability and %HRR was observed among males (R = −0.33, P = 0.005), but not among females (R = 0.11, P = 0.431). In a gender-stratified multi-adjusted logistic regression analysis, males with high %HRR were more likely to report a reduced work ability compared to males with low %HRR [OR = 4.75, 95% confidence interval (95% CI) = 1.31 to 17.25]. However, this association was not found among females (OR = 0.26, 95% CI 0.03 to 2.16), and a significant interaction between work ability, %HRR and gender was observed (P = 0.03). Conclusions: The observed association between work ability and objectively measured %HRR over several days among male blue-collar workers supports the face validity of the single work ability item. It is a useful and valid measure of the relation between physical work demands and resources among male blue-collar workers. The contrasting association among females needs to be further investigated.

  19. Psychometric validation of the Persian nine-item Internet Gaming Disorder Scale - Short Form: Does gender and hours spent online gaming affect the interpretations of item descriptions?

    Science.gov (United States)

    Wu, Tzu-Yi; Lin, Chung-Ying; Årestedt, Kristofer; Griffiths, Mark D; Broström, Anders; Pakpour, Amir H

    2017-06-01

    Background and aims The nine-item Internet Gaming Disorder Scale - Short Form (IGDS-SF9) is brief and effective to evaluate Internet Gaming Disorder (IGD) severity. Although its scores show promising psychometric properties, less is known about whether different groups of gamers interpret the items similarly. This study aimed to verify the construct validity of the Persian IGDS-SF9 and examine the scores in relation to gender and hours spent online gaming among 2,363 Iranian adolescents. Methods Confirmatory factor analysis (CFA) and Rasch analysis were used to examine the construct validity of the IGDS-SF9. The effects of gender and time spent online gaming per week were investigated by multigroup CFA and Rasch differential item functioning (DIF). Results The unidimensionality of the IGDS-SF9 was supported in both CFA and Rasch. However, Item 4 (fail to control or cease gaming activities) displayed DIF (DIF contrast = 0.55) slightly over the recommended cutoff in Rasch but was invariant in multigroup CFA across gender. Items 4 (DIF contrast = -0.67) and 9 (jeopardize or lose an important thing because of gaming activity; DIF contrast = 0.61) displayed DIF in Rasch and were non-invariant in multigroup CFA across time spent online gaming. Conclusions Given the Persian IGDS-SF9 was unidimensional, it is concluded that the instrument can be used to assess IGD severity. However, users of the instrument are cautioned concerning the comparisons of the sum scores of the IGDS-SF9 across gender and across adolescents spending different amounts of time online gaming.

  20. Serbian translation of the 20-item Toronto Alexithymia Scale: Psychometric properties and the new methodological approach in translating scales

    Directory of Open Access Journals (Sweden)

    Trajanović Nikola N.

    2013-01-01

    Introduction. Since the inception of the alexithymia construct in the 1970s, there has been a continuous effort to improve both its theoretical postulates and its clinical utility through development, standardization and validation of assessment scales. Objective. The aim of this study was to validate the Serbian translation of the 20-item Toronto Alexithymia Scale (TAS-20) and to propose a new method of translating scales that measure properties with temporal stability. Methods. The scale was expertly translated by bilingual medical professionals and a linguist, and given to a sample of bilingual participants from the general population who completed both the English and the Serbian version of the scale one week apart. Results. The findings showed that the Serbian version of the TAS-20 had good internal consistency reliability for the total scale (α=0.86), and acceptable reliability of the three factors (α=0.71-0.79). Conclusion. The analysis confirmed the validity and consistency of the Serbian translation of the scale, with observed weakness of the factorial structure consistent with studies in other languages. The results also showed that the method of utilizing a self-control bilingual subject is a useful alternative to the back-translation method, particularly in cases of linguistically and structurally sensitive scales, or in cases where a larger sample is not available. This method, dubbed ‘forth-translation’, could be used to translate psychometric scales measuring properties which have temporal stability over a period of at least several weeks.

  1. Reliability and validity of the Khmer version of the 10-item Connor-Davidson Resilience Scale (Kh-CD-RISC10) in Cambodian adolescents.

    Science.gov (United States)

    Duong, Chanmettachampavieng; Hurst, Cameron P

    2016-06-08

    Resilience has been characterized as a defensive factor against the development of mental health problems. This study adapted the Connor-Davidson Resilience Scale (Kh-CD-RISC10) for use in Khmer adolescents and subsequently investigated its psychometric properties. Using stratified random sampling, this cross-sectional study sampled Cambodian adolescents from high schools selected randomly within three provinces (Phnom Penh, Battambang and Mondulkiri)-location (rural, urban) combinations. Parallel analysis was used to identify the number of component(s), and the structure of the single factor was subsequently explored using principal axis factoring. A confirmatory factor analysis was then performed to establish the fit of the Kh-CD-RISC10 to another sample. To assess convergent validity, the factor scores of the Khmer version of the Connor-Davidson Resilience Scale were categorized into three levels, and then the general negative affectivity (GNA) and physiological hyperarousal (PH) scales (derived from the DASS 21) were compared among the three resilience groups. Of the 798 participants who responded (response rate = 82.26%), 440 (41.23%) were female and the age ranged from 14 to 24 years old (mean = 17.36, SD = 1.325). The internal consistency of the Khmer 10-item CD-RISC was also shown to be high in Cambodian adolescents (Cronbach's alpha = 0.82). Confirmatory factor analysis revealed the single factor model fit data adequately (χ² = 100.103, df = 35, p […] scale of this population, and can be used to assess the resilience comparing to the level of PTSD symptoms in general Khmer adolescent.

  2. Linking Existing Instruments to Develop an Activity of Daily Living Item Bank.

    Science.gov (United States)

    Li, Chih-Ying; Romero, Sergio; Bonilha, Heather S; Simpson, Kit N; Simpson, Annie N; Hong, Ickpyo; Velozo, Craig A

    2018-03-01

    This study examined dimensionality and item-level psychometric properties of an item bank measuring activities of daily living (ADL) across inpatient rehabilitation facilities and community living centers. The common-person equating method was used in the retrospective veterans data set. This study examined dimensionality, model fit, local independence, and monotonicity using factor analyses and fit statistics, principal component analysis (PCA), and differential item functioning (DIF) using Rasch analysis. Following the elimination of invalid data, 371 veterans who completed both the Functional Independence Measure (FIM) and minimum data set (MDS) within 6 days were retained. The FIM-MDS item bank demonstrated good internal consistency (Cronbach's α = .98) and met three rating scale diagnostic criteria and three of the four model fit statistics (comparative fit index/Tucker-Lewis index = 0.98, root mean square error of approximation = 0.14, and standardized root mean residual = 0.07). PCA of Rasch residuals showed the item bank explained 94.2% of the variance. The item bank covered the range of θ from -1.50 to 1.26 (item) and -3.57 to 4.21 (person), with person strata of 6.3. The findings indicated the ADL physical function item bank constructed from FIM and MDS measured a single latent trait with overall acceptable item-level psychometric properties, suggesting that it is an appropriate source for developing efficient test forms such as short forms and computerized adaptive tests.

  3. Evaluating item endorsement rates for the MMPI-2-RF F-r and Fp-r scales across ethnic, gender, and diagnostic groups with a forensic inpatient sample.

    Science.gov (United States)

    Glassmire, David M; Jhawar, Amandeep; Burchett, Danielle; Tarescavage, Anthony M

    2017-05-01

    The Minnesota Multiphasic Personality Inventory-2 (MMPI-2) F(p) (Infrequency-Psychopathology) scale was developed to measure overreporting in a manner that was minimally confounded by genuine psychopathology, which was a problem with using the MMPI-2 F (Infrequency) scale among patients with severe mental illness. Although revised versions of both of these scales are included on the MMPI-2-Restructured Form and used in a forensic context, no item-level research has been conducted on their sensitivity to genuine psychopathology among forensic psychiatric inpatients. Therefore, we examined the psychometric properties of the scales in a sample of 438 criminally committed forensic psychiatric inpatients who were adjudicated as not guilty by reason of insanity and had no known incentive to overreport. We found that 20 of the 21 Fp-r items (95.2%) demonstrated endorsement rates ≤ 20%, with 14 of the items (66.7%) endorsed by less than 10% of the sample. Similar findings were observed across genders and across patients with mood and psychotic disorders. The one item endorsed by more than 20% of the sample had a 23.7% overall endorsement rate and significantly different endorsement rates across ethnic groups, with the highest endorsements occurring among Hispanic/Latino (43.3% endorsement rate) patients. Endorsement rates of F-r items were generally higher than for Fp-r items. At the scale level, we also examined correlations with the Restructured Clinical Scales and found that Fp-r demonstrated lower correlations than F-r, indicating that Fp-r is less associated with a broad range of psychopathology. Finally, we found that Fp-r demonstrated slightly higher specificity values than F-r at all T score cutoffs.

  4. Overview of Classical Test Theory and Item Response Theory for Quantitative Assessment of Items in Developing Patient-Reported Outcome Measures

    Science.gov (United States)

    Cappelleri, Joseph C.; Lundy, J. Jason; Hays, Ron D.

    2014-01-01

    Introduction The U.S. Food and Drug Administration’s patient-reported outcome (PRO) guidance document defines content validity as “the extent to which the instrument measures the concept of interest” (FDA, 2009, p. 12). “Construct validity is now generally viewed as a unifying form of validity for psychological measurements, subsuming both content and criterion validity” (Strauss & Smith, 2009, p. 7). Hence both qualitative and quantitative information are essential in evaluating the validity of measures. Methods We review classical test theory and item response theory approaches to evaluating PRO measures including frequency of responses to each category of the items in a multi-item scale, the distribution of scale scores, floor and ceiling effects, the relationship between item response options and the total score, and the extent to which hypothesized “difficulty” (severity) order of items is represented by observed responses. Conclusion Classical test theory and item response theory can be useful in providing a quantitative assessment of items and scales during the content validity phase of patient-reported outcome measures. Depending on the particular type of measure and the specific circumstances, either one or both approaches should be considered to help maximize the content validity of PRO measures. PMID:24811753

  5. Subjective assessment of acute mountain sickness: investigating the relationship between the Lake Louise Self-Report, a visual analogue scale and psychological well-being scales.

    Science.gov (United States)

    Frühauf, Anika; Burtscher, Martin; Pocecco, Elena; Faulhaber, Martin; Kopp, Martin

    2016-01-01

    There is an ongoing discussion about how to assess acute mountain sickness (AMS) in real-life conditions. In addition to multi-item scales with a cutoff, such as the Lake Louise Self-Report (LLS), some authors have suggested using visual analog scales (VAS) to assess AMS. This study addressed this question using VAS items used for the Subjective Ratings of Drug Effects, including an additional single item for AMS. Furthermore, we investigated whether instruments developed to assess psychological well-being might predict AMS assessed via LLS or VAS. Thirty-two adults (19 female) with known AMS susceptibility completed questionnaires (Feeling Scale, Felt Arousal Scale, Activation Deactivation Check List, LLS, VAS) at an altitude of 3650 m above sea level. Correlation and regression analyses suggest a moderate to high relationship between the LLS score and the VAS items, including one VAS item asking about the severity of AMS, as well as psychological well-being. In conclusion, using VAS items to assess AMS can be a more precise alternative to questionnaires like the LLS for people familiar with AMS. Furthermore, researchers should be aware that psychological well-being might be an important parameter influencing the assessment of AMS.

  6. Single-item measure for assessing quality of life in children with drug-resistant epilepsy.

    Science.gov (United States)

    Conway, Lauryn; Widjaja, Elysa; Smith, Mary Lou

    2018-03-01

    The current study investigated the psychometric properties of a single-item quality of life (QOL) measure, the Global Quality of Life in Childhood Epilepsy question (G-QOLCE), in children with drug-resistant epilepsy. Data came from the Impact of Pediatric Epilepsy Surgery on Health-Related Quality of Life Study (PESQOL), a multicenter prospective cohort study (n = 118) with observations collected at baseline and at 6 months of follow-up on children aged 4-18 years. QOL was measured with the QOLCE-76 and KIDSCREEN-27. The G-QOLCE was an overall QOL question derived from the QOLCE-76. Construct validity and reliability were assessed with Spearman's correlation and intraclass correlation coefficient (ICC). Responsiveness was examined through distribution-based and anchor-based methods. The G-QOLCE showed moderate (r ≥ 0.30) to strong (r ≥ 0.50) correlations with composite scores, and most subscales of the QOLCE-76 and KIDSCREEN-27 at baseline and 6-month follow-up. The G-QOLCE had moderate test-retest reliability (ICC range: 0.49-0.72) and was able to detect clinically important change in patients' QOL (standardized response mean: 0.38; probability of change: 0.65; Guyatt's responsiveness statistics: 0.62 and 0.78). Caregiver anxiety and family functioning contributed most strongly to G-QOLCE scores over time. Results offer promising preliminary evidence regarding the validity, reliability, and responsiveness of the proposed single-item QOL measure. The G-QOLCE is a potentially useful tool that can be feasibly administered in a busy clinical setting to evaluate clinical status and impact of treatment outcomes in pediatric epilepsy.
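
The standardized response mean reported above is a distribution-based responsiveness index: the mean change score divided by the standard deviation of the change scores. The sketch below assumes paired baseline and follow-up scores; the function name and the sample data are hypothetical and are not taken from the PESQOL study.

```python
from statistics import mean, stdev


def standardized_response_mean(baseline, follow_up):
    """Mean of the paired change scores divided by their sample SD."""
    if len(baseline) != len(follow_up):
        raise ValueError("baseline and follow_up must be paired")
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


# Made-up single-item ratings for four patients, for illustration only.
baseline_scores = [1, 2, 3, 4]
follow_up_scores = [2, 4, 3, 6]
print(round(standardized_response_mean(baseline_scores, follow_up_scores), 3))
```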

  7. An abbreviated Faecal Incontinence Quality of Life Scale for Chinese-speaking population with colorectal cancer after surgery: cultural adaptation and item reduction.

    Science.gov (United States)

    Hsu, L-F; Hung, C-L; Kuo, L-J; Tsai, P-S

    2017-09-01

    No instrument is available to assess the impact of faecal incontinence (FI) on quality of life in Chinese-speaking populations. The purpose of the study was to adapt the Faecal Incontinence Quality of Life Scale (FIQL) for patients with colorectal cancer, assess the factor structure and reduce the items for brevity. A sample of 120 participants was enrolled. Internal consistency, test-retest reliability, and convergent and contrasted-groups validity were assessed. Construct validity was analysed using exploratory and confirmatory factor analyses (CFA). The internal consistency (Cronbach's α of the total scale and four subscales = 0.98 and 0.97, 0.96, 0.92, 0.82 respectively), test-retest reliability (intraclass correlation coefficients ≥.98 for all scales with p < .001) and significant correlations of all scales with selected subscales of the Medical Outcomes Study 36-Item Short-Form Health Survey and the Wexner scale suggested satisfactory reliability and validity. The severe FI group (with a Wexner score ≥9) scored significantly lower on the scale than the less severe FI group (with a Wexner score <9) did (p < .001). The CFA supported a two-factor structure and demonstrated an excellent model fit of the 15-item abbreviated version of the FIQL-Chinese. The FIQL-Chinese has satisfactory validity and reliability and the abbreviated version may be more practical and applicable. © 2016 John Wiley & Sons Ltd.
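
Cronbach's α, the internal consistency index reported above, compares the sum of the item variances with the variance of the total score. The following is a minimal sketch under the usual definition, α = k/(k−1) · (1 − Σ item variances / total-score variance); the function name and the response matrix are hypothetical and do not reproduce the FIQL data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for rows of item scores (one row per respondent)."""
    n_items = len(responses[0])
    if n_items < 2 or len(responses) < 2:
        raise ValueError("need at least two items and two respondents")
    # Transpose so that each tuple holds all responses to one item.
    item_columns = list(zip(*responses))
    item_variance_sum = sum(variance(column) for column in item_columns)
    total_variance = variance([sum(row) for row in responses])
    if total_variance == 0:
        raise ValueError("total scores have zero variance")
    return (n_items / (n_items - 1)) * (1 - item_variance_sum / total_variance)


# Made-up responses of five people to three items, for illustration only.
example_responses = [[4, 5, 4], [2, 3, 2], [5, 5, 4], [1, 2, 1], [3, 3, 3]]
print(round(cronbach_alpha(example_responses), 3))
```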

  8. An HIV/AIDS Knowledge Scale for Adolescents: Item Response Theory Analyses Based on Data from a Study in South Africa and Tanzania

    Science.gov (United States)

    Aaro, Leif E.; Breivik, Kyrre; Klepp, Knut-Inge; Kaaya, Sylvia; Onya, Hans E.; Wubs, Annegreet; Helleve, Arnfinn; Flisher, Alan J.

    2011-01-01

    A 14-item human immunodeficiency virus/acquired immunodeficiency syndrome knowledge scale was used among school students in 80 schools in 3 sites in Sub-Saharan Africa (Cape Town and Mankweng, South Africa, and Dar es Salaam, Tanzania). For each item, an incorrect or don't know response was coded as 0 and correct response as 1. Exploratory factor…

  9. Does the Assessment of Recovery Capital scale reflect a single or multiple domains?

    Science.gov (United States)

    Arndt, Stephan; Sahker, Ethan; Hedden, Suzy

    2017-01-01

    The goal of this study was to determine whether the 50-item Assessment of Recovery Capital scale represents a single general measure or whether multiple domains might be psychometrically useful for research or clinical applications. Data are from a cross-sectional de-identified existing program evaluation information data set with 1,138 clients entering substance use disorder treatment. Principal components and iterated factor analysis were used on the domain scores. Multiple group factor analysis provided a quasi-confirmatory factor analysis. The solution accounted for 75.24% of the total variance, suggesting that 10 factors provide a reasonably good fit. However, Tucker's congruence coefficients between the factor structure and defining weights (0.41-0.52) suggested a poor fit to the hypothesized 10-domain structure. Principal components of the 10-domain scores yielded one factor whose eigenvalue was greater than one (5.93), accounting for 75.8% of the common variance. A few domains had perceptible but small unique variance components suggesting that a few of the domains may warrant enrichment. Our findings suggest that there is one general factor, with a caveat. Using the 10 measures inflates the chance for Type I errors. Using one general measure avoids this issue, is simple to interpret, and could reduce the number of items. However, those seeking to maximally predict later recovery success may need to use the full instrument and all 10 domains.

  10. Software Note: Using BILOG for Fixed-Anchor Item Calibration

    Science.gov (United States)

    DeMars, Christine E.; Jurich, Daniel P.

    2012-01-01

    The nonequivalent groups anchor test (NEAT) design is often used to scale item parameters from two different test forms. A subset of items, called the anchor items or common items, are administered as part of both test forms. These items are used to adjust the item calibrations for any differences in the ability distributions of the groups taking…

  11. Measuring organizational effectiveness in information and communication technology companies using item response theory.

    Science.gov (United States)

    Trierweiller, Andréa Cristina; Peixe, Blênio César Severo; Tezza, Rafael; Pereira, Vera Lúcia Duarte do Valle; Pacheco, Waldemar; Bornia, Antonio Cezar; de Andrade, Dalton Francisco

    2012-01-01

    The aim of this paper is to measure the effectiveness of Information and Communication Technology (ICT) organizations from the manager's point of view, using Item Response Theory (IRT). There is a need to verify the effectiveness of these organizations, which are normally associated with complex, dynamic, and competitive environments. In the academic literature, there is disagreement surrounding the concept of organizational effectiveness and its measurement. A construct was elaborated based on dimensions of effectiveness to guide the construction of the questionnaire items, which were submitted to specialists for evaluation. The approach proved viable for measuring the organizational effectiveness of ICT companies from a manager's point of view using the Two-Parameter Logistic Model (2PLM) of IRT. This modeling permits us to evaluate the quality and properties of each item and places items and respondents on a single scale, which is not possible when using other similar tools.
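
For readers unfamiliar with the two-parameter logistic model named above, its item response function can be sketched as follows. This uses the standard logistic form without the optional 1.7 scaling constant; the parameter names and example values are illustrative assumptions, not estimates from the study.

```python
import math


def two_pl_probability(theta, discrimination, difficulty):
    """Probability of endorsing an item under the 2PL model.

    theta: respondent's position on the latent scale
    discrimination: item slope (a); difficulty: item location (b)
    """
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))


# Hypothetical item with a = 1.5 and b = 0.5, evaluated at three trait levels.
for theta in (-1.0, 0.5, 2.0):
    print(theta, round(two_pl_probability(theta, 1.5, 0.5), 3))
```

Because the difficulty parameter and the respondent's position share the same metric, items and respondents can be located on one scale, which is the property the abstract highlights.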

  12. A new look at the psychometrics of the parenting scale through the lens of item response theory.

    Science.gov (United States)

    Lorber, Michael F; Xu, Shu; Slep, Amy M Smith; Bulling, Lisanne; O'Leary, Susan G

    2014-01-01

    The psychometrics of the Parenting Scale's Overreactivity and Laxness subscales were evaluated using item response theory (IRT) techniques. The IRT analyses were based on 2 community samples of cohabiting parents of 3- to 8-year-old children, combined to yield a total sample size of 852 families. The results supported the utility of the Overreactivity and Laxness subscales, particularly in discriminating among parents in the mid to upper reaches of each construct. The original versions of the Overreactivity and Laxness subscales were more reliable than alternative, shorter versions identified in replicated factor analyses from previously published research and in IRT analyses in the present research. Moreover, in several cases, the original versions of these subscales, in comparison with the shortened versions, exhibited greater 6-month stabilities and correlations with child externalizing behavior and couple relationship satisfaction. Reliability was greater for the Laxness than for the Overreactivity subscale. Item performance on each subscale was highly variable. Together, the present findings are generally supportive of the psychometrics of the Parenting Scale, particularly for clinical research and practice. They also suggest areas for further development.

  13. Working memory for sequences of temporal durations reveals a volatile single-item store

    Directory of Open Access Journals (Sweden)

    Sanjay G Manohar

    2016-10-01

    Full Text Available When a sequence is held in working memory, different items are retained with differing fidelity. Here we ask whether a sequence of brief time intervals that must be remembered shows recency effects, similar to those observed in verbal and visuospatial working memory. It has been suggested that prioritising some items over others can be accounted for by a focus of attention, maintaining some items in a privileged state. We therefore also investigated whether such benefits are vulnerable to disruption by attention or expectation. Participants listened to sequences of one to five tones, of varying durations (200 ms to 2 s). Subsequently, the length of one of the tones in the sequence had to be reproduced by holding a key. The discrepancy between the reproduced and actual durations quantified the fidelity of memory for auditory durations. Recall precision decreased with the number of items that had to be remembered, and was better for the first and last items of sequences, in line with set-size and serial position effects seen in other modalities. To test whether attentional filtering demands might impair performance, an irrelevant variation in pitch was introduced in some blocks of trials. In those blocks, memory precision was worse for sequences that consisted of only one item, i.e. the smallest memory set size. Thus, when irrelevant information was present, the benefit of having only one item in memory was attenuated. Finally we examined whether expectation could interfere with memory. On half the trials, the number of items in the upcoming sequence was cued. When the number of items was known in advance, performance was paradoxically worse when the sequence consisted of only one item. Thus the benefit of having only one item to remember is stronger when it is unexpectedly the only item. Our results suggest that similar mechanisms are used to hold auditory time durations in working memory, as for visual or verbal stimuli.
Further, solitary items were

  14. An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

    Science.gov (United States)

    Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

    2013-01-01

    Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…

  15. Readability and Comprehension of the Geriatric Depression Scale and PROMIS® Physical Function Items in Older African Americans and Latinos.

    Science.gov (United States)

    Paz, Sylvia H; Jones, Loretta; Calderón, José L; Hays, Ron D

    2017-02-01

    Depression and physical function are particularly important health domains for the elderly. The Geriatric Depression Scale (GDS) and the Patient-Reported Outcomes Measurement Information System (PROMIS®) physical function item bank are two surveys commonly used to measure these domains. It is unclear if these two instruments adequately measure these aspects of health in minority elderly. The aim of this study was to estimate the readability of the GDS and PROMIS® physical function items and to assess their comprehensibility using a sample of African American and Latino elderly. Readability was estimated using the Flesch-Kincaid and Flesch Reading Ease (FRE) formulae for English versions, and a Spanish adaptation of the FRE formula for the Spanish versions. Comprehension of the GDS and PROMIS® items by minority elderly was evaluated with 30 cognitive interviews. Readability estimates of a number of items in English and Spanish of the GDS and PROMIS® physical functioning items exceed the U.S. recommended 5th-grade threshold for vulnerable populations, or were rated as 'fairly difficult', 'difficult', or 'very difficult' to read. Cognitive interviews revealed that many participants felt that more than the two (yes/no) GDS response options were needed to answer the questions. Wording of several PROMIS® items was considered confusing, and interpreting responses was problematic because they were based on using physical aids. Problems with item wording and response options of the GDS and PROMIS® physical function items may reduce reliability and validity of measurement when used with minority elderly.
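
The two English readability formulae mentioned above depend only on word, sentence, and syllable counts. The sketch below applies the standard published coefficients to counts supplied by the caller; how the study counted syllables is not stated in the abstract, so the counting step is deliberately left out, and the function names and example counts are hypothetical.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease: higher scores indicate easier text."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words, sentences, syllables):
    """Flesch-Kincaid grade level: approximate U.S. school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


# Made-up counts for a short passage: 100 words, 5 sentences, 150 syllables.
print(round(flesch_reading_ease(100, 5, 150), 2))
print(round(flesch_kincaid_grade(100, 5, 150), 2))
```

A grade-level result can be compared directly with a threshold such as the 5th-grade level cited in the abstract.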

  16. The Deaf Acculturation Scale (DAS): Development and Validation of a 58-Item Measure

    Science.gov (United States)

    Maxwell-McCaw, Deborah; Zea, Maria Cecilia

    2011-01-01

    This study involved the development and validation of the Deaf Acculturation Scale (DAS), a new measure of cultural identity for Deaf and hard-of-hearing (hh) populations. Data for this study were collected online and involved a nation-wide sample of 3,070 deaf/hh individuals. Results indicated strong internal reliabilities for all the subscales, and construct validity was established by demonstrating that the DAS could discriminate groups based on parental hearing status, school background, and use of self-labels. Construct validity was further demonstrated through factorial analyses, and findings resulted in a final 58-item measure. Directions for future research are discussed. PMID:21263041

  17. Mokken scaling of the Myocardial Infarction Dimensional Assessment Scale (MIDAS).

    Science.gov (United States)

    Thompson, David R; Watson, Roger

    2011-02-01

    The purpose of this study was to examine the hierarchical and cumulative nature of the 35 items of the Myocardial Infarction Dimensional Assessment Scale (MIDAS), a disease-specific health-related quality of life measure. Data from 668 participants who completed the MIDAS were analysed using the Mokken Scaling Procedure, which is a computer program that searches polychotomous data for hierarchical and cumulative scales on the basis of a range of diagnostic criteria. Fourteen MIDAS items were retained in a Mokken scale and these items included physical activity, insecurity, emotional reaction and dependency items but excluded items related to diet, medication or side-effects. Item difficulty, in item response theory terms, ran from physical activity items (low difficulty) to insecurity, suggesting that the most severe quality of life effect of myocardial infarction is loneliness and isolation. Items from the MIDAS form a strong and reliable Mokken scale, which provides new insight into the relationship between items in the MIDAS and the measurement of quality of life after myocardial infarction. © 2010 Blackwell Publishing Ltd.

  18. Enactment versus observation: item-specific and relational processing in goal-directed action sequences (and lists of single actions).

    Directory of Open Access Journals (Sweden)

    Janette Schult

    Full Text Available What are the memory-related consequences of learning actions (such as "apply the patch") by enactment during study, as compared to action observation? Theories converge in postulating that enactment encoding increases item-specific processing, but not the processing of relational information. Typically, in the laboratory enactment encoding is studied for lists of unrelated single actions in which one action execution has no overarching purpose or relation with other actions. In contrast, real-life actions are usually carried out with the intention to achieve such a purpose. When actions are embedded in action sequences, relational information provides efficient retrieval cues. We contrasted memory for single actions with memory for action sequences in three experiments. We found more reliance on relational processing for action sequences than single actions. To what degree can this relational information be used after enactment versus after the observation of an actor? We found indicators of superior relational processing after observation than enactment in ordered pair recall (Experiment 1A) and in emerging subjective organization of repeated recall protocols (recall runs 2-3, Experiment 2). An indicator of superior item-specific processing after enactment compared to observation was recognition (Experiment 1B, Experiment 2). Similar net recall suggests that observation can be as good a learning strategy as enactment. We discuss possible reasons why these findings only partly converge with previous research and theorizing.

  19. 'Do you think you suffer from depression?' Reevaluating the use of a single item question for the screening of depression in older primary care patients

    DEFF Research Database (Denmark)

    Ayalon, Liat; Goldfracht, Margalit; Bech, Per

    2010-01-01

    evaluated against a depression diagnosis made by the Structured Clinical Interview for DSM-IV. RESULTS: Overall, 3.9% of the sample was diagnosed with depression. The most notable finding was that the single-item question, 'do you think you suffer from depression?' had as good or better sensitivity (83......%) than all other screens. Nonetheless, its specificity of 83% suggested that it has to be followed up by a thorough diagnostic interview. Additional sensitivity analyses concerning the use of a single depression item taken directly from the depression screening measures supported this finding. CONCLUSIONS......: An easy way to detect depression in older primary care patients would be asking the single question, 'do you think you suffer from depression?'...
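
Why a specificity of 83% still calls for a follow-up interview becomes clearer when the figures quoted above are combined through Bayes' rule. The sketch below is a back-of-the-envelope illustration, not a result reported by the authors: it assumes the sensitivity and specificity are both 0.83 and the prevalence is 0.039, as the truncated abstract appears to state, and the function name is hypothetical.

```python
def positive_predictive_value(sensitivity, specificity, prevalence):
    """Share of positive screens that are true cases (Bayes' rule)."""
    true_positives = sensitivity * prevalence
    false_positives = (1 - specificity) * (1 - prevalence)
    return true_positives / (true_positives + false_positives)


# Figures as quoted in the abstract above; treat the result as illustrative.
ppv = positive_predictive_value(sensitivity=0.83, specificity=0.83, prevalence=0.039)
print(round(ppv, 3))
```

Under these assumptions only about one in six positive answers would correspond to a diagnosed case, which is consistent with the abstract's call for a diagnostic interview after a positive screen.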

  20. Asenapine effects on individual Young Mania Rating Scale items in bipolar disorder patients with acute manic or mixed episodes: a pooled analysis

    Directory of Open Access Journals (Sweden)

    Cazorla P

    2013-03-01

    Full Text Available Pilar Cazorla, Jun Zhao, Mary Mackle, Armin Szegedi; Merck, Rahway, NJ, USA. Background: An exploratory post hoc analysis was conducted to evaluate the potential differential effects over time of asenapine and olanzapine compared with placebo on the eleven individual items comprising the Young Mania Rating Scale (YMRS) in patients with manic or mixed episodes in bipolar I disorder. Methods: Data were pooled from two 3-week randomized, controlled trials in which the eleven individual items comprising the YMRS were measured over 21 days. An analysis of covariance model adjusted by baseline value was used to test for differences in changes from baseline in YMRS scores between groups. Results: Each of the eleven individual YMRS item scores was significantly reduced compared with placebo at day 21. After 2 days of treatment, asenapine and olanzapine were superior to placebo for six of the YMRS items: disruptive/aggressive behavior, content, irritability, elevated mood, sleep, and speech. Conclusion: Reduction in manic symptoms over 21 days was associated with a broad-based improvement across all symptom domains with no subset of symptoms predominating. Keywords: asenapine, Young Mania Rating Scale, bipolar disorder, YMRS, antipsychotic, olanzapine

  1. Psychometric validation of the Persian nine-item Internet Gaming Disorder Scale – Short Form: Does gender and hours spent online gaming affect the interpretations of item descriptions?

    Science.gov (United States)

    Wu, Tzu-Yi; Lin, Chung-Ying; Årestedt, Kristofer; Griffiths, Mark D.; Broström, Anders; Pakpour, Amir H.

    2017-01-01

    Background and aims The nine-item Internet Gaming Disorder Scale – Short Form (IGDS-SF9) is brief and effective to evaluate Internet Gaming Disorder (IGD) severity. Although its scores show promising psychometric properties, less is known about whether different groups of gamers interpret the items similarly. This study aimed to verify the construct validity of the Persian IGDS-SF9 and examine the scores in relation to gender and hours spent online gaming among 2,363 Iranian adolescents. Methods Confirmatory factor analysis (CFA) and Rasch analysis were used to examine the construct validity of the IGDS-SF9. The effects of gender and time spent online gaming per week were investigated by multigroup CFA and Rasch differential item functioning (DIF). Results The unidimensionality of the IGDS-SF9 was supported in both CFA and Rasch. However, Item 4 (fail to control or cease gaming activities) displayed DIF (DIF contrast = 0.55) slightly over the recommended cutoff in Rasch but was invariant in multigroup CFA across gender. Items 4 (DIF contrast = −0.67) and 9 (jeopardize or lose an important thing because of gaming activity; DIF contrast = 0.61) displayed DIF in Rasch and were non-invariant in multigroup CFA across time spent online gaming. Conclusions Given the Persian IGDS-SF9 was unidimensional, it is concluded that the instrument can be used to assess IGD severity. However, users of the instrument are cautioned concerning the comparisons of the sum scores of the IGDS-SF9 across gender and across adolescents spending different amounts of time online gaming. PMID:28571474

  2. A 67-Item Stress Resilience item bank showing high content validity was developed in a psychosomatic sample.

    Science.gov (United States)

    Obbarius, Nina; Fischer, Felix; Obbarius, Alexander; Nolte, Sandra; Liegl, Gregor; Rose, Matthias

    2018-04-10

    To develop the first item bank to measure Stress Resilience (SR) in clinical populations. Qualitative item development resulted in an initial pool of 131 items covering a broad theoretical SR concept. These items were tested in n=521 patients at a psychosomatic outpatient clinic. Exploratory and Confirmatory Factor Analysis (CFA), as well as other state-of-the-art item analyses and IRT were used for item evaluation and calibration of the final item bank. Out of the initial item pool of 131 items, we excluded 64 items (54 factor loading .3, 2 non-discriminative Item Response Curves, 4 Differential Item Functioning). The final set of 67 items indicated sufficient model fit in CFA and IRT analyses. Additionally, a 10-item short form with high measurement precision (SE≤.32 in a theta range between -1.8 and +1.5) was derived. Both the SR item bank and the SR short form were highly correlated with an existing static legacy tool (Connor-Davidson Resilience Scale). The final SR item bank and 10-item short form showed good psychometric properties. When further validated, they will be ready to be used within a framework of Computer-Adaptive Tests for a comprehensive assessment of the Stress-Construct. Copyright © 2018. Published by Elsevier Inc.

  3. What Do You Think You Are Measuring? A Mixed-Methods Procedure for Assessing the Content Validity of Test Items and Theory-Based Scaling

    Science.gov (United States)

    Koller, Ingrid; Levenson, Michael R.; Glück, Judith

    2017-01-01

    The valid measurement of latent constructs is crucial for psychological research. Here, we present a mixed-methods procedure for improving the precision of construct definitions, determining the content validity of items, evaluating the representativeness of items for the target construct, generating test items, and analyzing items on a theoretical basis. To illustrate the mixed-methods content-scaling-structure (CSS) procedure, we analyze the Adult Self-Transcendence Inventory, a self-report measure of wisdom (ASTI, Levenson et al., 2005). A content-validity analysis of the ASTI items was used as the basis of psychometric analyses using multidimensional item response models (N = 1215). We found that the new procedure produced important suggestions concerning five subdimensions of the ASTI that were not identifiable using exploratory methods. The study shows that the application of the suggested procedure leads to a deeper understanding of latent constructs. It also demonstrates the advantages of theory-based item analysis. PMID:28270777

  4. Development of a scale to measure the entrepreneurial potential using the Item Response Theory (IRT) [Desenvolvimento de uma escala para medir o potencial empreendedor utilizando a Teoria da Resposta ao Item (TRI)]

    Directory of Open Access Journals (Sweden)

    Luciano Ricardo Rath Alves

    2011-01-01

    Full Text Available Several variables are related to the development of entrepreneurial activities. An important one among them is the entrepreneurial agent. This study is one of many that contribute to the understanding of the entrepreneurial agent. In its line of thought, it upholds the idea that the entrepreneur has characteristics and personality traits that stand out from the general population and that are favorable to the success of the entrepreneurship. This study aims at developing a measurement scale for entrepreneurial potential using the Item Response Theory. The items were generated by Santos (2008) based on a theoretical model… The two-parameter logistic IRT model was used. Parameter estimates were obtained from a sample of 764 people who responded to an instrument composed of 103 items. The test information and standard error curves and the qualitative interpretation of the scale levels made it possible to determine the most appropriate range for using the instrument. The results showed that the scale is best suited to assessing individuals with low to moderately high entrepreneurial potential. It is therefore suggested that new items be added to the instrument to measure and interpret even higher levels. Item Response Theory allows new items to be calibrated to measure entrepreneurs with high entrepreneurial potential while making use of the data already obtained.

  5. Assessing the Straightforwardly-Worded Brief Fear of Negative Evaluation Scale for Differential Item Functioning Across Gender and Ethnicity.

    Science.gov (United States)

    Harpole, Jared K; Levinson, Cheri A; Woods, Carol M; Rodebaugh, Thomas L; Weeks, Justin W; Brown, Patrick J; Heimberg, Richard G; Menatti, Andrew R; Blanco, Carlos; Schneier, Franklin; Liebowitz, Michael

    2015-06-01

    The Brief Fear of Negative Evaluation Scale (BFNE; Leary, Personality and Social Psychology Bulletin, 9, 371-375, 1983) assesses fear and worry about receiving negative evaluation from others. Rodebaugh et al. (Psychological Assessment, 16, 169-181, 2004) found that the BFNE is composed of a reverse-worded factor (BFNE-R) and a straightforwardly-worded factor (BFNE-S). Further, they found the BFNE-S to have better psychometric properties and provide more information than the BFNE-R. Currently there is a lack of research regarding the measurement invariance of the BFNE-S across gender and ethnicity with respect to item thresholds. The present study uses item response theory (IRT) to test the BFNE-S for differential item functioning (DIF) related to gender and ethnicity (White, Asian, and Black). Six data sets consisting of clinical, community, and undergraduate participants were utilized (N = 2,109). The factor structure of the BFNE-S was confirmed using categorical confirmatory factor analysis, IRT model assumptions were tested, and the BFNE-S was evaluated for DIF. Item nine demonstrated significant non-uniform DIF between White and Black participants. No other items showed significant uniform or non-uniform DIF across gender or ethnicity. Results suggest the BFNE-S can be used reliably with men and women and Asian and White participants. More research is needed to understand the implications of using the BFNE-S with Black participants.

  6. Pathological mechanisms underlying single large‐scale mitochondrial DNA deletions

    Science.gov (United States)

    Rocha, Mariana C.; Rosa, Hannah S.; Grady, John P.; Blakely, Emma L.; He, Langping; Romain, Nadine; Haller, Ronald G.; Newman, Jane; McFarland, Robert; Ng, Yi Shiau; Gorman, Grainne S.; Schaefer, Andrew M.; Tuppen, Helen A.; Taylor, Robert W.

    2018-01-01

    Objective Single, large‐scale deletions in mitochondrial DNA (mtDNA) are a common cause of mitochondrial disease. This study aimed to investigate the relationship between the genetic defect and molecular phenotype to improve understanding of pathogenic mechanisms associated with single, large‐scale mtDNA deletions in skeletal muscle. Methods We investigated 23 muscle biopsies taken from adult patients (6 males/17 females with a mean age of 43 years) with characterized single, large‐scale mtDNA deletions. Mitochondrial respiratory chain deficiency in skeletal muscle biopsies was quantified by immunoreactivity levels for complex I and complex IV proteins. Single muscle fibers with varying degrees of deficiency were selected from 6 patient biopsies for determination of mtDNA deletion level and copy number by quantitative polymerase chain reaction. Results We have defined 3 “classes” of single, large‐scale deletion with distinct patterns of mitochondrial deficiency, determined by the size and location of the deletion. Single fiber analyses showed that fibers with greater respiratory chain deficiency harbored higher levels of mtDNA deletion with an increase in total mtDNA copy number. For the first time, we have demonstrated that threshold levels for complex I and complex IV deficiency differ based on deletion class. Interpretation Combining genetic and immunofluorescent assays, we conclude that thresholds for complex I and complex IV deficiency are modulated by the deletion of complex‐specific protein‐encoding genes. Furthermore, removal of mt‐tRNA genes impacts specific complexes only at high deletion levels, when complex‐specific protein‐encoding genes remain. These novel findings provide valuable insight into the pathogenic mechanisms associated with these mutations. Ann Neurol 2018;83:115–130 PMID:29283441

  7. Development of coordination system model on single-supplier multi-buyer for multi-item supply chain with probabilistic demand

    Science.gov (United States)

    Olivia, G.; Santoso, A.; Prayogo, D. N.

    2017-11-01

    Competition between supply chains is intensifying, and good coordination among supply chain members is crucial in addressing it. This paper develops a coordination model between a single supplier and multiple buyers in a supply chain. The proposed optimization model determines the optimal number of deliveries from the supplier to the buyers so as to minimize the total cost over a planning horizon. The total supply chain cost consists of transportation costs, handling costs of the supplier and buyers, and stock-out costs. In the proposed model, the supplier can supply various types of items to retailers whose item demand patterns are probabilistic. A sensitivity analysis was conducted to test the effect of changes in transportation costs, handling costs, and the supplier's production capacity. The results showed that changes in transportation cost, handling costs, and production capacity significantly influence the optimal number of deliveries of each item to the buyers.

  8. Evaluation of the Edinburgh Post Natal Depression Scale using Rasch analysis

    Science.gov (United States)

    Pallant, Julie F; Miller, Renée L; Tennant, Alan

    2006-01-01

    Background The Edinburgh Postnatal Depression Scale (EPDS) is a 10 item self-rating post-natal depression scale which has seen widespread use in epidemiological and clinical studies. Concern has been raised over the validity of the EPDS as a single summed scale, with suggestions that it measures two separate aspects, one of depressive feelings, the other of anxiety. Methods As part of a larger cross-sectional study conducted in Melbourne, Australia, a community sample (324 women, ranging in age from 18 to 44 years: mean = 32 yrs, SD = 4.6), was obtained by inviting primiparous women to participate voluntarily in this study. Data from the EPDS were fitted to the Rasch measurement model and tested for appropriate category ordering, for item bias through Differential Item Functioning (DIF) analysis, and for unidimensionality through tests of the assumption of local independence. Results Rasch analysis of the data from the ten item scale initially demonstrated a lack of fit to the model with a significant Item-Trait Interaction total chi-square (chi Square = 82.8, df = 40; p < .001). Removal of two items (items 7 and 8) resulted in a non-significant Item-Trait Interaction total chi-square with a residual mean value for items of -0.467 with a standard deviation of 0.850, showing fit to the model. No DIF existed in the final 8-item scale (EPDS-8) and all items showed fit to model expectations. Principal Components Analysis of the residuals supported the local independence assumption, and unidimensionality of the revised EPDS-8 scale. Revised cut points were identified for EPDS-8 to maintain the case identification of the original scale. Conclusion The results of this study suggest that EPDS, in its original 10 item form, is not a viable scale for the unidimensional measurement of depression. Rasch analysis suggests that a revised eight item version (EPDS-8) would provide a more psychometrically robust scale. 
    The revised cut points of 7/8 and 9/10 for the EPDS-8 show high

  9. Diagnostic accuracy of the original 30-item and shortened versions of the Geriatric Depression Scale in nursing home patients

    NARCIS (Netherlands)

    Jongenelis, K; Eisses, AMH; Gerritsen, DL; Beekman, ATF; Kluiter, H; Ribbe, MW

    2005-01-01

    Objective To determine the diagnostic accuracy of the 30-item and shortened versions of the Geriatric Depression Scale (GDS) in diagnosing depression in older nursing home patients. Method Three hundred and thirty-three older nursing home patients participated in a prospective cross-sectional study

  11. Screening for somatization and hypochondriasis in primary care and neurological in-patients: a seven-item scale for hypochondriasis and somatization.

    Science.gov (United States)

    Fink, P; Ewald, H; Jensen, J; Sørensen, L; Engberg, M; Holm, M; Munk-Jørgensen, P

    1999-03-01

    The aim of this study was to investigate the internal and external validity of the Whiteley Index as a screening instrument for somatization illness. A 14-item version of the Whiteley Index for hypochondriacal traits was given to 99 of 191 consecutive primary care patients, aged 18-65 years, and to 100 consecutive patients, aged 18-60 years, admitted for the first time to a neurological ward. The primary care sample was, in addition, interviewed by means of the SCAN (Schedules for Clinical Assessment in Neuropsychiatry) psychiatric interview. The GPs and the neurologists were asked to rate various characteristics of the patients that might indicate somatization. The internal validity of the Whiteley Index was tested by means of latent structure analysis. On this basis, a reduced seven-item scale (Whiteley-7 scale) and two subscales (i.e., an Illness Conviction and Illness Worrying scale, each with three items) were constructed. All three had a high internal validity fitting into the very restricted Rasch statistical model (p>0.05) and an acceptable transferability between most of the subpopulations investigated. In the primary care population, the Whiteley-7 and the Illness Conviction scales at cut-point 0/1 showed 1.00 and 0.87 sensitivity and 0.65 and 0.87 specificity, respectively, using as "gold standard" the fulfillment of criteria for at least one ICD-10 somatoform disorder, and 0.71 and 0.63 sensitivity and 0.62 and 0.87 specificity, respectively, as gold standard for the fulfillment of criteria for at least one DSM-IV somatoform disorder, excluding the NOS diagnostic group. The Illness Worrying subscale showed less impressive performance in this respect. The agreement between the Whiteley-7 scale including the two subscales and neurologists' rating and the GPs' rating and the somatization subscale on the SCL-90 was modest or worse. It may be concluded that the Whiteley-7 scale and the Illness Conviction subscale had acceptable psychometric profiles, and

  12. Item bias detection in the Hospital Anxiety and Depression Scale using structural equation modeling: comparison with other item bias detection methods

    NARCIS (Netherlands)

    Verdam, M.G.E.; Oort, F.J.; Sprangers, M.A.G.

    Purpose Comparison of patient-reported outcomes may be invalidated by the occurrence of item bias, also known as differential item functioning. We show two ways of using structural equation modeling (SEM) to detect item bias: (1) multigroup SEM, which enables the detection of both uniform and

  13. Validation of the JDS satisfaction scales applied to educational university environments

    OpenAIRE

    Giraldo-O'Meara, Martha; Marin-Garcia, Juan A.; Martinez-Gomez, Monica

    2014-01-01

    Purpose: The aim of this study is to review and summarize the main satisfaction scales used in publications about human Resource Management and educational research, in order to adapt the satisfaction scales of the Job Diagnostic Survey (JDS) to higher education and validate it with a sample of university students and to assess the concept of satisfaction in two different ways: as a single-item measure, with a global indicator and as a multi-item measure, analyzed as a global model and compos...

  14. Improved Approximation Algorithms for Item Pricing with Bounded Degree and Valuation

    Science.gov (United States)

    Hamane, Ryoso; Itoh, Toshiya

    When a store sells items to customers, the store wishes to decide the prices of the items to maximize its profit. If the store sells the items with low (resp. high) prices, the customers buy more (resp. less) items, which provides less profit to the store. It would be hard for the store to decide the prices of items. Assume that a store has a set V of n items and there is a set C of m customers who wish to buy those items. The goal of the store is to decide the price of each item to maximize its profit. We refer to this maximization problem as an item pricing problem. We classify the item pricing problems according to how many items the store can sell or how the customers valuate the items. If the store can sell every item i with unlimited (resp. limited) amount, we refer to this as unlimited supply (resp. limited supply). We say that the item pricing problem is single-minded if each customer j ∈ C wishes to buy a set e_j ⊆ V of items and assigns valuation w(e_j) ≥ 0. For the single-minded item pricing problems (in unlimited supply), Balcan and Blum regarded them as weighted k-hypergraphs and gave several approximation algorithms. In this paper, we focus on the (pseudo) degree of k-hypergraphs and the valuation ratio, i.e., the ratio between the smallest and the largest valuations. Then for the single-minded item pricing problems (in unlimited supply), we show improved approximation algorithms (for k-hypergraphs, general graphs, bipartite graphs, etc.) with respect to the maximum (pseudo) degree and the valuation ratio.
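    The objective in the single-minded, unlimited-supply setting described above can be illustrated with a minimal sketch. It only evaluates the profit of a given price vector; it is not one of the approximation algorithms from the paper. The function name, the item labels, and the sample data are hypothetical, and two assumptions are made that the abstract does not state: items have zero production cost (so profit equals revenue), and a customer buys when the bundle price equals the valuation.

```python
from typing import Dict, FrozenSet, List, Tuple

# A customer is a (bundle, valuation) pair: the set of items wanted and
# the maximum total price the customer is willing to pay for that set.
Customer = Tuple[FrozenSet[str], float]


def pricing_profit(prices: Dict[str, float], customers: List[Customer]) -> float:
    """Profit of a price vector for single-minded customers, unlimited supply.

    Assumptions (not from the abstract): zero production cost, and a
    customer buys when the bundle price is at most the valuation.
    """
    profit = 0.0
    for bundle, valuation in customers:
        bundle_price = sum(prices[item] for item in bundle)
        if bundle_price <= valuation:
            # Unlimited supply: every willing customer can be served.
            profit += bundle_price
    return profit


# Hypothetical instance with three items and three customers.
customers = [
    (frozenset({"a", "b"}), 5.0),
    (frozenset({"b"}), 2.0),
    (frozenset({"a", "c"}), 4.0),
]
cheap_a = pricing_profit({"a": 3.0, "b": 2.0, "c": 1.0}, customers)
costly_a = pricing_profit({"a": 4.0, "b": 2.0, "c": 1.0}, customers)
```

    In this toy instance, raising the price of item "a" from 3 to 4 loses two of the three customers, which is the trade-off between price and demand that makes the maximization problem hard.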

  15. Developing an African youth psychosocial assessment: an application of item response theory.

    Science.gov (United States)

    Betancourt, Theresa S; Yang, Frances; Bolton, Paul; Normand, Sharon-Lise

    2014-06-01

    This study aimed to refine a dimensional scale for measuring psychosocial adjustment in African youth using item response theory (IRT). A 60-item scale derived from qualitative data was administered to 667 war-affected adolescents (55% female). Exploratory factor analysis (EFA) determined the dimensionality of items based on goodness-of-fit indices. Items with loadings less than 0.4 were dropped. Confirmatory factor analysis (CFA) was used to confirm the scale's dimensionality found under the EFA. Item discrimination and difficulty were estimated using a graded response model for each subscale using weighted least squares means and variances. Predictive validity was examined through correlations between IRT scores (θ) for each subscale and ratings of functional impairment. All models were assessed using goodness-of-fit and comparative fit indices. Fisher's Information curves examined item precision at different underlying ranges of each trait. Original scale items were optimized and reconfigured into an empirically-robust 41-item scale, the African Youth Psychosocial Assessment (AYPA). Refined subscales assess internalizing and externalizing problems, prosocial attitudes/behaviors and somatic complaints without medical cause. The AYPA is a refined dimensional assessment of emotional and behavioral problems in African youth with good psychometric properties. Validation studies in other cultures are recommended. Copyright © 2014 John Wiley & Sons, Ltd.

  16. Using personality item characteristics to predict single-item reliability, retest reliability, and self-other agreement

    NARCIS (Netherlands)

    de Vries, Reinout Everhard; Realo, Anu; Allik, Jüri

    2016-01-01

    The use of reliability estimates is increasingly scrutinized as scholars become more aware that test–retest stability and self–other agreement provide a better approximation of the theoretical and practical usefulness of an instrument than its internal reliability. In this study, we investigate item

  17. An emotional functioning item bank of 24 items for computerized adaptive testing (CAT) was established

    DEFF Research Database (Denmark)

    Petersen, Morten Aa.; Gamper, Eva-Maria; Costantini, Anna

    2016-01-01

    of the widely used EORTC Quality of Life questionnaire (QLQ-C30). STUDY DESIGN AND SETTING: On the basis of literature search and evaluations by international samples of experts and cancer patients, 38 candidate items were developed. The psychometric properties of the items were evaluated in a large...... international sample of cancer patients. This included evaluations of dimensionality, item response theory (IRT) model fit, differential item functioning (DIF), and of measurement precision/statistical power. RESULTS: Responses were obtained from 1,023 cancer patients from four countries. The evaluations showed...... that 24 items could be included in a unidimensional IRT model. DIF did not seem to have any significant impact on the estimation of EF. Evaluations indicated that the CAT measure may reduce sample size requirements by up to 50% compared to the QLQ-C30 EF scale without reducing power. CONCLUSION...

  18. Psychometric properties of the 7-item game addiction scale among French and German speaking adults

    OpenAIRE

    Khazaal, Yasser; Chatton, Anne; Rothen, Stephane; Achab, Sophia; Thorens, Gabriel; Zullino, Daniele; Gmel, Gerhard

    2016-01-01

    Background The 7-item Game Addiction Scale (GAS) is used to screen for addictive game use. Both cross-linguistic validation and validation in French and German are needed in adult samples. The objective of the study is to assess the factorial structure of the French and German versions of the GAS among adults. Methods Two samples of men from French (N = 3318) and German (N = 2665) language areas of Switzerland were assessed with the GAS, the Major Depression Inventory (MDI), the Brief ...

  19. Differential Item Functioning of the Psychological Domain of the Menopause Rating Scale

    Science.gov (United States)

    Portela-Buelvas, Katherin; Oviedo, Heidi C.; Herazo, Edwin; Campo-Arias, Adalberto

    2016-01-01

    Introduction. Quality of life could be quantified with the Menopause Rating Scale (MRS), which evaluates the severity of somatic, psychological, and urogenital symptoms in menopause. However, differential item functioning (DIF) analysis has not been applied previously. Objective. To establish the DIF of the psychological domain of the MRS in Colombian women. Methods. 4,009 women aged between 40 and 59 years, who participated in the CAVIMEC (Calidad de Vida en la Menopausia y Etnias Colombianas) project, were included. Average age was 49.0 ± 5.9 years. Women were classified in mestizo, Afro-Colombian, and indigenous. The results were presented as averages and standard deviation (X ± SD). A p value <0.001 was considered statistically significant. Results. In mestizo women, the highest X ± SD were obtained in physical and mental exhaustion (PME) (0.86 ± 0.93) and the lowest ones in anxiety (0.44 ± 0.79). In Afro-Colombian women, an average score of 0.99 ± 1.07 for PME and 0.63 ± 0.88 for anxiety was gotten. Indigenous women obtained an increased average score for PME (1.33 ± 0.93). The lowest score was evidenced in depressive mood (0.50 ± 0.81), which is different from other Colombian women (p < 0.001). Conclusions. The psychological items of the MRS show differential functioning according to the ethnic group, which may induce systematic error in the measurement of the construct. PMID:27847825

  20. Differential Item Functioning of the Psychological Domain of the Menopause Rating Scale.

    Science.gov (United States)

    Monterrosa-Castro, Alvaro; Portela-Buelvas, Katherin; Oviedo, Heidi C; Herazo, Edwin; Campo-Arias, Adalberto

    2016-01-01

    Introduction. Quality of life could be quantified with the Menopause Rating Scale (MRS), which evaluates the severity of somatic, psychological, and urogenital symptoms in menopause. However, differential item functioning (DIF) analysis has not been applied previously. Objective. To establish the DIF of the psychological domain of the MRS in Colombian women. Methods. 4,009 women aged between 40 and 59 years, who participated in the CAVIMEC (Calidad de Vida en la Menopausia y Etnias Colombianas) project, were included. Average age was 49.0 ± 5.9 years. Women were classified in mestizo, Afro-Colombian, and indigenous. The results were presented as averages and standard deviation (X ± SD). A p value <0.001 was considered statistically significant. Results. In mestizo women, the highest X ± SD were obtained in physical and mental exhaustion (PME) (0.86 ± 0.93) and the lowest ones in anxiety (0.44 ± 0.79). In Afro-Colombian women, an average score of 0.99 ± 1.07 for PME and 0.63 ± 0.88 for anxiety was gotten. Indigenous women obtained an increased average score for PME (1.33 ± 0.93). The lowest score was evidenced in depressive mood (0.50 ± 0.81), which is different from other Colombian women (p < 0.001). Conclusions. The psychological items of the MRS show differential functioning according to the ethnic group, which may induce systematic error in the measurement of the construct.

  1. Using existing questionnaires in latent class analysis: should we use summary scores or single items as input? A methodological study using a cohort of patients with low back pain

    Directory of Open Access Journals (Sweden)

    Nielsen AM

    2016-04-01

    Anne Molgaard Nielsen,1 Werner Vach,2 Peter Kent,1,3 Lise Hestbaek,1,4 Alice Kongsted1,4. 1 Department of Sports Science and Clinical Biomechanics, University of Southern Denmark, Odense, Denmark; 2 Center for Medical Biometry and Medical Informatics, Medical Center, University of Freiburg, Freiburg, Germany; 3 School of Physiotherapy and Exercise Science, Curtin University, Perth, Australia; 4 Nordic Institute of Chiropractic and Clinical Biomechanics, University of Southern Denmark, Odense, Denmark. Background: Latent class analysis (LCA) is increasingly being used in health research, but optimal approaches to handling complex clinical data are unclear. One issue is that commonly used questionnaires are multidimensional, but expressed as summary scores. Using the example of low back pain (LBP), the aim of this study was to explore and descriptively compare the application of LCA when using questionnaire summary scores and when using single items to subgrouping of patients based on multidimensional data. Materials and methods: Baseline data from 928 LBP patients in an observational study were classified into four health domains (psychology, pain, activity, and participation) using the World Health Organization’s International Classification of Functioning, Disability, and Health framework. LCA was performed within each health domain using the strategies of summary-score and single-item analyses. The resulting subgroups were descriptively compared using statistical measures and clinical interpretability. Results: For each health domain, the preferred model solution ranged from five to seven subgroups for the summary-score strategy and seven to eight subgroups for the single-item strategy. There was considerable overlap between the results of the two strategies, indicating that they were reflecting the same underlying data structure. However, in three of the four health domains, the single-item strategy resulted in a more nuanced description, in terms

  2. Explaining Method Effects Associated with Negatively Worded Items in Trait and State Global and Domain-Specific Self-Esteem Scales

    Science.gov (United States)

    Tomas, Jose M.; Oliver, Amparo; Galiana, Laura; Sancho, Patricia; Lila, Marisol

    2013-01-01

    Several investigators have interpreted method effects associated with negatively worded items in a substantive way. This research extends those studies in different ways: (a) it establishes the presence of methods effects in further populations and particular scales, and (b) it examines the possible relations between a method factor associated…

  3. A comparison of three methods of assessing differential item functioning (DIF) in the Hospital Anxiety Depression Scale: ordinal logistic regression, Rasch analysis and the Mantel chi-square procedure.

    Science.gov (United States)

    Cameron, Isobel M; Scott, Neil W; Adler, Mats; Reid, Ian C

    2014-12-01

    It is important for clinical practice and research that measurement scales of well-being and quality of life exhibit only minimal differential item functioning (DIF). DIF occurs where different groups of people endorse items in a scale to different extents after being matched by the intended scale attribute. We investigate the equivalence or otherwise of common methods of assessing DIF. Three methods of measuring age- and sex-related DIF (ordinal logistic regression, Rasch analysis and Mantel χ² procedure) were applied to Hospital Anxiety Depression Scale (HADS) data pertaining to a sample of 1,068 patients consulting primary care practitioners. Three items were flagged by all three approaches as having either age- or sex-related DIF with a consistent direction of effect; a further three items identified did not meet stricter criteria for important DIF using at least one method. When applying strict criteria for significant DIF, ordinal logistic regression was slightly less sensitive. Ordinal logistic regression, Rasch analysis and contingency table methods yielded consistent results when identifying DIF in the HADS depression and HADS anxiety scales. Regardless of methods applied, investigators should use a combination of statistical significance, magnitude of the DIF effect and investigator judgement when interpreting the results.

  4. Quality of life in infants and children with atopic dermatitis: Addressing issues of differential item functioning across countries in multinational clinical trials

    Directory of Open Access Journals (Sweden)

    Tennant Alan

    2007-07-01

    all items in a scale fit both a single theoretical construct and the Rasch measurement model, it is feasible to conceive of outcome measures with a different set of items in each language.

  5. Evaluation of the Edinburgh Post Natal Depression Scale using Rasch analysis

    Directory of Open Access Journals (Sweden)

    Tennant Alan

    2006-06-01

    Background The Edinburgh Postnatal Depression Scale (EPDS) is a 10 item self-rating post-natal depression scale which has seen widespread use in epidemiological and clinical studies. Concern has been raised over the validity of the EPDS as a single summed scale, with suggestions that it measures two separate aspects, one of depressive feelings, the other of anxiety. Methods As part of a larger cross-sectional study conducted in Melbourne, Australia, a community sample (324 women, ranging in age from 18 to 44 years: mean = 32 yrs, SD = 4.6), was obtained by inviting primiparous women to participate voluntarily in this study. Data from the EPDS were fitted to the Rasch measurement model and tested for appropriate category ordering, for item bias through Differential Item Functioning (DIF) analysis, and for unidimensionality through tests of the assumption of local independence. Results Rasch analysis of the data from the ten item scale initially demonstrated a lack of fit to the model with a significant Item-Trait Interaction total chi-square (chi Square = 82.8, df = 40; p < .001). Removal of two items (items 7 and 8) resulted in a non-significant Item-Trait Interaction total chi-square with a residual mean value for items of -0.467 with a standard deviation of 0.850, showing fit to the model. No DIF existed in the final 8-item scale (EPDS-8) and all items showed fit to model expectations. Principal Components Analysis of the residuals supported the local independence assumption, and unidimensionality of the revised EPDS-8 scale. Revised cut points were identified for EPDS-8 to maintain the case identification of the original scale. Conclusion The results of this study suggest that EPDS, in its original 10 item form, is not a viable scale for the unidimensional measurement of depression. Rasch analysis suggests that a revised eight item version (EPDS-8) would provide a more psychometrically robust scale. The revised cut points of 7/8 and 9/10 for the EPDS-8 show high levels of agreement with the original case identification for the EPDS-10.

  6. Use of item response theory to develop a shortened version of the EORTC QLQ-C30 emotional functioning scale

    NARCIS (Netherlands)

    Bjorner, J. B.; Petersen, M. Aa; Groenvold, M.; Aaronson, N.; Ahlner-Elmqvist, M.; Arraras, J. I.; Brédart, A.; Fayers, P.; Jordhoy, M.; Sprangers, M.; Watson, M.; Young, T.

    2004-01-01

    Background: As part of a larger study whose objective is to develop an abbreviated version of the EORTC QLQ-C30 suitable for research in palliative care, analyses were conducted to determine the feasibility of generating a shorter version of the 4-item emotional functioning (EF) scale that could be

  7. The Chinese version of the Myocardial Infarction Dimensional Assessment Scale (MIDAS): Mokken scaling

    Directory of Open Access Journals (Sweden)

    Watson Roger

    2012-01-01

    Background Hierarchical scales are very useful in clinical practice due to their ability to discriminate precisely between individuals, and the original English version of the Myocardial Infarction Dimensional Assessment Scale has been shown to contain a hierarchy of items. The purpose of this study was to analyse a Mandarin Chinese translation of the Myocardial Infarction Dimensional Assessment Scale for a hierarchy of items according to the criteria of Mokken scaling. Data from 180 Chinese participants who completed the Chinese translation of the Myocardial Infarction Dimensional Assessment Scale were analysed using the Mokken Scaling Procedure and the 'R' statistical programme using the diagnostics available in these programmes. Correlation between Mandarin Chinese items and a Chinese translation of the Short Form (36) Health Survey was also analysed. Findings Fifteen items from the Mandarin Chinese Myocardial Infarction Dimensional Assessment Scale were retained in a strong and reliable Mokken scale; invariant item ordering was not evident and the Mokken scaled items of the Chinese Myocardial Infarction Dimensional Assessment Scale correlated with the Short Form (36) Health Survey. Conclusions Items from the Mandarin Chinese Myocardial Infarction Dimensional Assessment Scale form a Mokken scale and this offers further insight into how the items of the Myocardial Infarction Dimensional Assessment Scale relate to the measurement of health-related quality of life in people with a myocardial infarction.

  8. Why Japanese workers show low work engagement: An item response theory analysis of the Utrecht Work Engagement scale.

    Science.gov (United States)

    Shimazu, Akihito; Schaufeli, Wilmar B; Miyanaka, Daisuke; Iwata, Noboru

    2010-11-05

    With the globalization of occupational health psychology, more and more researchers are interested in applying employee well-being like work engagement (i.e., a positive, fulfilling, work-related state of mind that is characterized by vigor, dedication, and absorption) to diverse populations. Accurate measurement contributes to our further understanding and to the generalizability of the concept of work engagement across different cultures. The present study investigated the measurement accuracy of the Japanese and the original Dutch versions of the Utrecht Work Engagement Scale (9-item version, UWES-9) and the comparability of this scale between both countries. Item Response Theory (IRT) was applied to the data from Japan (N = 2,339) and the Netherlands (N = 13,406). Reliability of the scale was evaluated at various levels of the latent trait (i.e., work engagement) based on the test information function (TIF) and the standard error of measurement (SEM). The Japanese version had difficulty in differentiating respondents with extremely low work engagement, whereas the original Dutch version had difficulty in differentiating respondents with high work engagement. The measurement accuracy of both versions was not similar. Suppression of positive affect among Japanese people and self-enhancement (the general sensitivity to positive self-relevant information) among Dutch people may have caused decreased measurement accuracy. Hence, we should be cautious when interpreting low engagement scores among Japanese as well as high engagement scores among western employees.
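    The relationship between the test information function and the standard error of measurement mentioned above can be sketched in a few lines. This is an illustration only: it uses the dichotomous two-parameter logistic (2PL) model as a simplifying assumption, which is not necessarily the IRT model fitted in the study, and the item parameters and function names are hypothetical. Under the 2PL model, item information at trait level θ is a²·P(θ)·(1 − P(θ)), test information is the sum over items, and SEM(θ) = 1 / √I(θ).

```python
import math
from typing import List, Tuple

# An item is (a, b): discrimination and difficulty under the 2PL model.
Item = Tuple[float, float]


def item_information(theta: float, a: float, b: float) -> float:
    """Fisher information of one 2PL item at trait level theta."""
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)


def scale_information(theta: float, items: List[Item]) -> float:
    """Test information function: sum of the item informations."""
    return sum(item_information(theta, a, b) for a, b in items)


def standard_error(theta: float, items: List[Item]) -> float:
    """Standard error of measurement at theta: 1 / sqrt(information)."""
    return 1.0 / math.sqrt(scale_information(theta, items))


# Hypothetical three-item scale (parameters invented for illustration).
items = [(1.5, -1.0), (1.2, 0.0), (2.0, 1.0)]
sem_center = standard_error(0.0, items)
sem_low = standard_error(-4.0, items)
```

    With these invented parameters the standard error is much larger at θ = −4 than at θ = 0, which is the kind of pattern meant when a scale is said to have difficulty differentiating respondents at one extreme of the trait.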

  9. Normative data for the 12 item WHO Disability Assessment Schedule 2.0.

    Directory of Open Access Journals (Sweden)

    Gavin Andrews

    BACKGROUND: The World Health Organization Disability Assessment Schedule (WHODAS 2.0) measures disability due to health conditions including diseases, illnesses, injuries, mental or emotional problems, and problems with alcohol or drugs. METHOD: The 12 Item WHODAS 2.0 was used in the second Australian Survey of Mental Health and Well-being. We report the overall factor structure and the distribution of scores and normative data (means and SDs) for people with any physical disorder, any mental disorder and for people with neither. FINDINGS: A single second order factor justifies the use of the scale as a measure of global disability. People with mental disorders had high scores (mean 6.3, SD 7.1), people with physical disorders had lower scores (mean 4.3, SD 6.1). People with no disorder covered by the survey had low scores (mean 1.4, SD 3.6). INTERPRETATION: The provision of normative data from a population sample of adults will facilitate use of the WHODAS 2.0 12 item scale in clinical and epidemiological research.

  10. easyCBM CCSS Math Item Scaling and Test Form Revision (2012-2013): Grades 6-8. Technical Report #1313

    Science.gov (United States)

    Anderson, Daniel; Alonzo, Julie; Tindal, Gerald

    2012-01-01

    The purpose of this technical report is to document the piloting and scaling of new easyCBM mathematics test items aligned with the Common Core State Standards (CCSS) and to describe the process used to revise and supplement the 2012 research version easyCBM CCSS math tests in Grades 6-8. For all operational 2012 research version test forms (10…

  11. Evaluation of the Hospital Anxiety and Depression Scale (HADS) in screening stroke patients for symptoms: Item Response Theory (IRT) analysis.

    Science.gov (United States)

    Ayis, Salma A; Ayerbe, Luis; Ashworth, Mark; DA Wolfe, Charles

    2018-03-01

    Variations have been reported in the number of underlying constructs and choice of thresholds that determine caseness of anxiety and/or depression using the Hospital Anxiety and Depression scale (HADS). This study examined the properties of each item of HADS as perceived by stroke patients, and assessed the information these items convey about anxiety and depression from 3 months to 5 years after stroke. The study included 1443 stroke patients from the South London Stroke Register (SLSR). The dimensionality of HADS was examined using factor analysis methods, and items' properties up to 5 years after stroke were tested using Item Response Theory (IRT) methods, including graded response models (GRMs). The presence of two dimensions of HADS (anxiety and depression) for stroke patients was confirmed. Items that accurately inferred about the severity of anxiety and depression, and offered good discrimination of caseness were identified as "I can laugh and see the funny side of things" (Q4) and "I get sudden feelings of panic" (Q13), discrimination 2.44 (se = 0.26), and 3.34 (se = 0.35), respectively. Items that shared properties, hence replicate inference were: "I get a sort of frightened feeling as if something awful is about to happen" (Q3), "I get a sort of frightened feeling like butterflies in my stomach" (Q6), and "Worrying thoughts go through my mind" (Q9). Item properties were maintained over time. Approximately 20% of patients were lost to follow up. A more concise selection of items based on their properties, would provide a precise approach for screening patients and for an optimal allocation of patients into clinical trials. Copyright © 2017 Elsevier B.V. All rights reserved.

  12. Clinical utility of the MMPI-2-RF SUI items and scale in a forensic inpatient setting: Association with interview self-report and future suicidal behaviors.

    Science.gov (United States)

    Glassmire, David M; Tarescavage, Anthony M; Burchett, Danielle; Martinez, Jennifer; Gomez, Anthony

    2016-11-01

    In this study, we examined whether the 5 Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2-RF; Ben-Porath & Tellegen, 2008/2011) Suicidal/Death Ideation (SUI) items (93, 120, 164, 251, and 334) would provide incremental suicide-risk assessment information after accounting for information garnered from clinical interview questions. Among 229 forensic inpatients (146 men, 83 women) who were administered the MMPI-2-RF, 34.9% endorsed at least 1 SUI item. We found that patients who endorsed SUI items on the MMPI-2-RF concurrently denied conceptually related suicide-risk information during the clinical interview. For instance, 8% of the sample endorsed Item 93 (indicating recent suicidal ideation), yet denied current suicidal ideation upon interview. Conversely, only 2.2% of the sample endorsed current suicidal ideation during the interview, yet denied recent suicidal ideation on Item 93. The SUI scale, as well as the MMPI-2-RF Demoralization (RCd) and Low Positive Emotions (RC2) scales, correlated significantly and meaningfully with conceptually related suicide-risk information from the interview, including history of suicide attempts, history of suicidal ideation, current suicidal ideation, and months since last suicide attempt. We also found that the SUI scale added incremental variance (after accounting for information garnered from the interview and after accounting for scores on RCd and RC2) to predictions of future suicidal behavior within 1 year of testing. Relative risk ratios indicated that both SUI-item endorsement and the presence of interview-reported risk information significantly and meaningfully increased the risk of suicidal behavior in the year following testing, particularly when endorsement of suicidal ideation occurred for both methods of self-report. (PsycINFO Database Record (c) 2016 APA, all rights reserved).

  13. The construct validity of the Perceived Stress Scale

    DEFF Research Database (Denmark)

    Germund Nielsen, Marie; Ørnbøl, Eva; Vestergaard, Mogens

    2016-01-01

    Objective: Stress impacts the quality of life and is associated with increased risk of mental and physical disorders. The Perceived Stress Scale (PSS) is widely used for measuring psychological distress. Although the instrument was originally defined as a single construct, several studies based...... of 32,374 citizens who completed the PSS-10 as part of the Danish National Health Survey in 2010. We investigated the construct validity of the PSS-10 by CFA. We examined the scalability by investigating the fit of the data distribution in a unidimensional Rasch model and performing modification...... of response categories, persons and items. The scale dimensionality was additionally assessed by Mokken and Rasch analysis.  Results: The PSS-10 did not fit the Rasch model. Item four indicated the largest misfit, and items four and seven displayed disordered thresholds. Unidimensionality could...

  14. The Divergent Meanings of Life Satisfaction: Item Response Modeling of the Satisfaction with Life Scale in Greenland and Norway

    Science.gov (United States)

    Vitterso, Joar; Biswas-Diener, Robert; Diener, Ed

    2005-01-01

    Cultural differences in response to the Satisfaction With Life Scale (SWLS) items are investigated. Data were fit to a mixed Rasch model in order to identify latent classes of participants in a combined sample of Norwegians (N = 461) and Greenlanders (N = 180). Initial analyses showed no mean difference in life satisfaction between the two…

  15. Dissociating the neural correlates of intra-item and inter-item working-memory binding.

    Directory of Open Access Journals (Sweden)

    Carinne Piekema

    BACKGROUND: Integration of information streams into a unitary representation is an important task of our cognitive system. Within working memory, the medial temporal lobe (MTL) has been conceptually linked to the maintenance of bound representations. In a previous fMRI study, we have shown that the MTL is indeed more active during working-memory maintenance of spatial associations as compared to non-spatial associations or single items. There are two explanations for this result: the mere presence of the spatial component activates the MTL, or the MTL is recruited to bind associations between neurally non-overlapping representations. METHODOLOGY/PRINCIPAL FINDINGS: The current fMRI study investigates this issue further by directly comparing intrinsic intra-item binding (object/colour), extrinsic intra-item binding (object/location), and inter-item binding (object/object). The three binding conditions resulted in differential activation of brain regions. Specifically, we show that the MTL is important for establishing extrinsic intra-item associations and inter-item associations, in line with the notion that binding of information processed in different brain regions depends on the MTL. CONCLUSIONS/SIGNIFICANCE: Our findings indicate that different forms of working-memory binding rely on specific neural structures. In addition, these results extend previous reports indicating that the MTL is implicated in working-memory maintenance, challenging the classic distinction between short-term and long-term memory systems.

  16. Test–retest reliability of Antonovsky’s 13-item sense of coherence scale in patients with hand-related disorders

    DEFF Research Database (Denmark)

    Hansen, Alice Ørts; Kristensen, Hanne Kaae; Cederlund, Ragnhild

    2016-01-01

    Purpose: To report on the distribution and test-retest reliability of Antonovsky’s 13-item Sense of Coherence (SOC-13) Scale in patients with hand-related disorders (HRD). Links between the SOC-13 score and factors such as age, number of days between date of injury and start of rehabilitation, ge...... to be a powerful tool to measure the ICF component personal factors, which could have an impact on patients’ rehabilitation outcomes....

  17. Negative affect impairs associative memory but not item memory.

    OpenAIRE

    Bisby, J. A.; Burgess, N.

    2014-01-01

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine the effects of emotion on memory for items and their associations. By presenting neutral and negative items with background contexts, Experiment 1 ...

  18. Refining and validating the Social Interaction Anxiety Scale and the Social Phobia Scale.

    Science.gov (United States)

    Carleton, R Nicholas; Collimore, Kelsey C; Asmundson, Gordon J G; McCabe, Randi E; Rowa, Karen; Antony, Martin M

    2009-01-01

    The Social Interaction Anxiety Scale and Social Phobia Scale are companion measures for assessing symptoms of social anxiety and social phobia. The scales have good reliability and validity across several samples; however, exploratory and confirmatory factor analyses have yielded solutions comprising substantially different item content and factor structures. These discrepancies are likely the result of analyzing items from each scale separately or simultaneously. The current investigation sets out to assess items from those scales, both simultaneously and separately, using exploratory and confirmatory factor analyses in an effort to resolve the factor structure. Participants consisted of a clinical sample (n = 353; 54% women) and an undergraduate sample (n = 317; 75% women) who completed the Social Interaction Anxiety Scale and Social Phobia Scale, along with additional fear-related measures to assess convergent and discriminant validity. A three-factor solution with a reduced set of items was found to be most stable, irrespective of whether the items from each scale are assessed together or separately. Items from the Social Interaction Anxiety Scale represented one factor, whereas items from the Social Phobia Scale represented two other factors. Initial support for scale and factor validity, along with implications and recommendations for future research, is provided. (c) 2009 Wiley-Liss, Inc.

  19. Feed mechanism and method for feeding minute items

    Science.gov (United States)

    Stringer, Timothy Kent [Bucyrus, KS]; Yerganian, Simon Scott [Lee's Summit, MO]

    2009-10-20

    A feeding mechanism and method for feeding minute items, such as capacitors, resistors, or solder preforms. The mechanism is adapted to receive a plurality of the randomly-positioned and randomly-oriented extremely small or minute items, and to isolate, orient, and position one or more of the items in a specific repeatable pickup location wherefrom they may be removed for use by, for example, a computer-controlled automated assembly machine. The mechanism comprises a sliding shelf adapted to receive and support the items; a wiper arm adapted to achieve a single even layer of the items; and a pushing arm adapted to push the items into the pickup location. The mechanism can be adapted for providing the items with a more exact orientation, and can also be adapted for use in a liquid environment.

  20. Factoring handedness data: I. Item analysis.

    Science.gov (United States)

    Messinger, H B; Messinger, M I

    1995-12-01

    Recently in this journal Peters and Murphy challenged the validity of factor analyses done on bimodal handedness data, suggesting instead that right- and left-handers be studied separately. But bimodality may be avoidable if attention is paid to Oldfield's questionnaire format and instructions for the subjects. Two characteristics appear crucial: a two-column LEFT-RIGHT format for the body of the instrument and what we call Oldfield's Admonition: not to indicate strong preference for a handedness item, such as write, unless "... the preference is so strong that you would never try to use the other hand unless absolutely forced to...". Attaining unimodality of an item distribution would seem to overcome the objections of Peters and Murphy. In a 1984 survey in Boston we used Oldfield's ten-item questionnaire exactly as published. This produced unimodal item distributions. With reflection of the five-point item scale and a logarithmic transformation, we achieved a degree of normalization for the items. Two surveys elsewhere, based on Oldfield's 20-item list but with changes in the questionnaire format and the instructions, yielded markedly different item distributions with peaks at each extreme and sometimes in the middle as well.

  1. Comparing Two Versions of the MEOCS Using Differential Item Functioning

    National Research Council Canada - National Science Library

    Truhon, Stephen

    2003-01-01

    ...) from item response theory (IRT). DIF was found for the majority of the 40 items examined, although in many cases the DIF indicated improvements in the revised items. Implications for these scales and for the use of IRT with the MEOCS are discussed.

  2. Adult Attachment Ratings (AAR): an item response theory analysis.

    Science.gov (United States)

    Pilkonis, Paul A; Kim, Yookyung; Yu, Lan; Morse, Jennifer Q

    2014-01-01

    The Adult Attachment Ratings (AAR) include 3 scales for anxious, ambivalent attachment (excessive dependency, interpersonal ambivalence, and compulsive care-giving), 3 for avoidant attachment (rigid self-control, defensive separation, and emotional detachment), and 1 for secure attachment. The scales include items (ranging from 6-16 in their original form) scored by raters using a 3-point format (0 = absent, 1 = present, and 2 = strongly present) and summed to produce a total score. Item response theory (IRT) analyses were conducted with data from 414 participants recruited from psychiatric outpatient, medical, and community settings to identify the most informative items from each scale. The IRT results allowed us to shorten the scales to 5-item versions that are more precise and easier to rate because of their brevity. In general, the effective range of measurement for the scales was 0 to +2 SDs for each of the attachment constructs; that is, from average to high levels of attachment problems. Evidence for convergent and discriminant validity of the scales was investigated by comparing them with the Experiences of Close Relationships-Revised (ECR-R) scale and the Kobak Attachment Q-sort. The best consensus among self-reports on the ECR-R, informant ratings on the ECR-R, and expert judgments on the Q-sort and the AAR emerged for anxious, ambivalent attachment. Given the good psychometric characteristics of the scale for secure attachment, however, this measure alone might provide a simple alternative to more elaborate procedures for some measurement purposes. Conversion tables are provided for the 7 scales to facilitate transformation from raw scores to IRT-calibrated (theta) scores.

  3. Analyzing force concept inventory with item response theory

    Science.gov (United States)

    Wang, Jing; Bao, Lei

    2010-10-01

    Item response theory is a popular assessment method used in education. It rests on the assumption of a probability framework that relates students' innate ability and their performance on test questions. Item response theory transforms students' raw test scores into a scaled proficiency score, which can be used to compare results obtained with different test questions. The scaled score also addresses the issues of ceiling effects and guessing, which commonly exist in quantitative assessment. We used item response theory to analyze the force concept inventory (FCI). Our results show that item response theory can be useful for analyzing physics concept surveys such as the FCI and produces results about the individual questions and student performance that are beyond the capability of classical statistics. The theory yields detailed measurement parameters regarding the difficulty, discrimination features, and probability of correct guess for each of the FCI questions.
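
    The three kinds of item parameter named in this abstract (difficulty, discrimination, and probability of a correct guess) are the parameters of the standard three-parameter logistic (3PL) item response function. The following is a minimal sketch of that function only; the function name and the example parameter values are hypothetical illustrations and are not taken from the study.

```python
import math


def p_correct_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PL model.

    theta: respondent proficiency (latent trait)
    a: item discrimination, b: item difficulty,
    c: lower asymptote (probability of a correct guess)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item: moderate discrimination, average difficulty,
# five answer options (guessing floor assumed to be 0.2).
for theta in (-2.0, 0.0, 2.0):
    print(theta, round(p_correct_3pl(theta, a=1.0, b=0.0, c=0.2), 3))
```

    When proficiency equals the item difficulty, the probability sits halfway between the guessing floor and 1, which is why a nonzero guessing parameter raises the whole curve rather than shifting it.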

  4. Improving measurement of injection drug risk behavior using item response theory.

    Science.gov (United States)

    Janulis, Patrick

    2014-03-01

    Recent research highlights the multiple steps to preparing and injecting drugs and the resultant viral threats faced by drug users. This research suggests that more sensitive measurement of injection drug HIV risk behavior is required. In addition, growing evidence suggests there are gender differences in injection risk behavior. However, the potential for differential item functioning between genders has not been explored. To explore item response theory as an improved measurement modeling technique that provides empirically justified scaling of injection risk behavior and to examine for potential gender-based differential item functioning. Data is used from three studies in the National Institute on Drug Abuse's Criminal Justice Drug Abuse Treatment Studies. A two-parameter item response theory model was used to scale injection risk behavior and logistic regression was used to examine for differential item functioning. Item fit statistics suggest that item response theory can be used to scale injection risk behavior and these models can provide more sensitive estimates of risk behavior. Additionally, gender-based differential item functioning is present in the current data. Improved measurement of injection risk behavior using item response theory should be encouraged as these models provide increased congruence between construct measurement and the complexity of injection-related HIV risk. Suggestions are made to further improve injection risk behavior measurement. Furthermore, results suggest direct comparisons of composite scores between males and females may be misleading and future work should account for differential item functioning before comparing levels of injection risk behavior.

  5. Why Japanese workers show low work engagement: An item response theory analysis of the Utrecht Work Engagement scale

    Directory of Open Access Journals (Sweden)

    Iwata Noboru

    2010-11-01

    With the globalization of occupational health psychology, more and more researchers are interested in applying employee well-being concepts like work engagement (i.e., a positive, fulfilling, work-related state of mind that is characterized by vigor, dedication, and absorption) to diverse populations. Accurate measurement contributes to our further understanding and to the generalizability of the concept of work engagement across different cultures. The present study investigated the measurement accuracy of the Japanese and the original Dutch versions of the Utrecht Work Engagement Scale (9-item version, UWES-9) and the comparability of this scale between both countries. Item Response Theory (IRT) was applied to the data from Japan (N = 2,339) and the Netherlands (N = 13,406). Reliability of the scale was evaluated at various levels of the latent trait (i.e., work engagement) based on the test information function (TIF) and the standard error of measurement (SEM). The Japanese version had difficulty in differentiating respondents with extremely low work engagement, whereas the original Dutch version had difficulty in differentiating respondents with high work engagement. The measurement accuracy of both versions was not similar. Suppression of positive affect among Japanese people and self-enhancement (the general sensitivity to positive self-relevant information) among Dutch people may have caused decreased measurement accuracy. Hence, we should be cautious when interpreting low engagement scores among Japanese as well as high engagement scores among western employees.
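
    The relation between the test information function and the standard error of measurement can be sketched in a few lines. The abstract does not state which IRT model was fitted, so the sketch below assumes a dichotomous two-parameter logistic (2PL) model purely for illustration; the function names and the item parameters are hypothetical.

```python
import math


def p_2pl(theta, a, b):
    """Endorsement probability under the 2PL model (assumed for illustration)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def scale_information(theta, items):
    """Test information at theta: sum of a^2 * P * (1 - P) over all items.

    items: list of (a, b) pairs (discrimination, difficulty).
    """
    total = 0.0
    for a, b in items:
        p = p_2pl(theta, a, b)
        total += a * a * p * (1.0 - p)
    return total


def standard_error(theta, items):
    """Standard error of measurement at theta: 1 / sqrt(information)."""
    return 1.0 / math.sqrt(scale_information(theta, items))


# Hypothetical items whose difficulties all lie above the trait mean,
# so precision is poorer for respondents low on the trait.
demo_items = [(1.5, 0.5), (1.2, 1.0), (1.8, 1.5)]
for theta in (-3.0, 0.0, 1.0):
    print(theta, round(standard_error(theta, demo_items), 2))
```

    Because information peaks near the item difficulties, a scale whose items cluster at one end of the trait measures respondents at the other end with a larger standard error.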

  6. Advancing Accounting Research of Teaching Efficacy: Developing a Scale to Measure Student Attitudes toward Active Learning Experiences

    Science.gov (United States)

    Burney, Laurie; Zascavage, Victoria; Matherly, Michele

    2017-01-01

    Literature consistently documents a positive, direct effect of students' attitudes on learning (Lizzio, Wilson, & Simons, 2002). Hence, accounting studies describing active learning activities often report student attitudes as evidence of efficacy (e.g., Matherly & Burney, 2013), but rely on single-item instead of multi-item scales. This…

  7. Robust Scale Transformation Methods in IRT True Score Equating under Common-Item Nonequivalent Groups Design

    Science.gov (United States)

    He, Yong

    2013-01-01

    Common test items play an important role in equating multiple test forms under the common-item nonequivalent groups design. Inconsistent item parameter estimates among common items can lead to large bias in equated scores for IRT true score equating. Current methods extensively focus on detection and elimination of outlying common items, which…

  8. Item response theory analysis of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised in the Pooled Resource Open-Access ALS Clinical Trials Database.

    Science.gov (United States)

    Bacci, Elizabeth D; Staniewska, Dorota; Coyne, Karin S; Boyer, Stacey; White, Leigh Ann; Zach, Neta; Cedarbaum, Jesse M

    2016-01-01

    Our objective was to examine dimensionality and item-level performance of the Amyotrophic Lateral Sclerosis Functional Rating Scale-Revised (ALSFRS-R) across time using classical and modern test theory approaches. Confirmatory factor analysis (CFA) and Item Response Theory (IRT) analyses were conducted using data from patients with amyotrophic lateral sclerosis (ALS) in the Pooled Resources Open-Access ALS Clinical Trials (PRO-ACT) database with complete ALSFRS-R data (n = 888) at three time-points (Time 0, Time 1 (6-months), Time 2 (1-year)). Results demonstrated that in this population of 888 patients, mean age was 54.6 years, 64.4% were male, and 93.7% were Caucasian. The CFA supported a 4 individual-domain structure (bulbar, gross motor, fine motor, and respiratory domains). IRT analysis within each domain revealed misfitting items and overlapping item response category thresholds at all time-points, particularly in the gross motor and respiratory domain items. Results indicate that many of the items of the ALSFRS-R may sub-optimally distinguish among varying levels of disability assessed by each domain, particularly in patients with less severe disability. Measure performance improved across time as patient disability severity increased. In conclusion, modifications to select ALSFRS-R items may improve the instrument's specificity to disability level and sensitivity to treatment effects.

  9. Are reflective models appropriate for very short scales? Proofs of concept of formative models using the Ten-Item Personality Inventory.

    Science.gov (United States)

    Myszkowski, Nils; Storme, Martin; Tavani, Jean-Louis

    2018-04-27

    Because of their length and objective of broad content coverage, very short scales can show limited internal consistency and structural validity. We argue that it is because their objectives may be better aligned with formative investigations than with reflective measurement methods that capitalize on content overlap. As proofs of concept of formative investigations of short scales, we investigate the Ten Item Personality Inventory (TIPI). In Study 1, we administered the TIPI and the Big Five Inventory (BFI) to 938 adults, and fitted a formative Multiple Indicator Multiple Causes model, which consisted of the TIPI items forming 5 latent variables, which in turn predicted the 5 BFI scores. These results were replicated in Study 2, on a sample of 759 adults, with, this time, the Revised NEO Personality Inventory (NEO-PI-R) as the external criterion. The models fit the data adequately, and moderate to strong significant effects (.37<|β|<.69, all p<.001) of all 5 latent formative variables on their corresponding BFI and NEOPI-R scores were observed. This study presents a formative approach that we propose to be more consistent with the aims of scales with broad content and short length like the TIPI. This article is protected by copyright. All rights reserved. © 2018 Wiley Periodicals, Inc.

  10. Item response theory at subject- and group-level

    NARCIS (Netherlands)

    Tobi, Hilde

    1990-01-01

    This paper reviews the literature about item response models for the subject level and aggregated level (group level). Group-level item response models (IRMs) are used in the United States in large-scale assessment programs such as the National Assessment of Educational Progress and the California

  11. Information and processes underlying semantic and episodic memory across tasks, items, and individuals.

    Science.gov (United States)

    Cox, Gregory E; Hemmer, Pernille; Aue, William R; Criss, Amy H

    2018-04-01

    The development of memory theory has been constrained by a focus on isolated tasks rather than the processes and information that are common to situations in which memory is engaged. We present results from a study in which 453 participants took part in five different memory tasks: single-item recognition, associative recognition, cued recall, free recall, and lexical decision. Using hierarchical Bayesian techniques, we jointly analyzed the correlations between tasks within individuals (reflecting the degree to which tasks rely on shared cognitive processes) and within items (reflecting the degree to which tasks rely on the same information conveyed by the item). Among other things, we find that (a) the processes involved in lexical access and episodic memory are largely separate and rely on different kinds of information, (b) access to lexical memory is driven primarily by perceptual aspects of a word, (c) all episodic memory tasks rely to an extent on a set of shared processes which make use of semantic features to encode both single words and associations between words, and (d) recall involves additional processes likely related to contextual cuing and response production. These results provide a large-scale picture of memory across different tasks which can serve to drive the development of comprehensive theories of memory. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  12. Item response theory analysis of the life orientation test-revised: age and gender differential item functioning analyses.

    Science.gov (United States)

    Steca, Patrizia; Monzani, Dario; Greco, Andrea; Chiesi, Francesca; Primi, Caterina

    2015-06-01

    This study is aimed at testing the measurement properties of the Life Orientation Test-Revised (LOT-R) for the assessment of dispositional optimism by employing item response theory (IRT) analyses. The LOT-R was administered to a large sample of 2,862 Italian adults. First, confirmatory factor analyses demonstrated the theoretical conceptualization of the construct measured by the LOT-R as a single bipolar dimension. Subsequently, IRT analyses for polytomous, ordered response category data were applied to investigate the items' properties. The equivalence of the items across gender and age was assessed by analyzing differential item functioning. Discrimination and severity parameters indicated that all items were able to distinguish people with different levels of optimism and adequately covered the spectrum of the latent trait. Additionally, the LOT-R appears to be gender invariant and, with minor exceptions, age invariant. Results provided evidence that the LOT-R is a reliable and valid measure of dispositional optimism. © The Author(s) 2014.

  13. Perception of victims of rape and perception of gender social roles among college students in Southwest Nigeria: validation of a 5-item gender scale.

    Science.gov (United States)

    Opekitan, Afe Taiwo; Ogunsemi, Olawale; Osalusi, Bamidele; Adeleye, Olufunke; Ale, Ayotunde

    2017-08-29

    Our study focused on the perception of victims of rape and the relationship with the perception of social roles for gender among college students in southwest Nigeria using a 5-item gender social scale and a perception of victims of rape questionnaire. The study was done among 312 college students in Southwest Nigeria and explored the perception of victims of rape and gender social roles. The aim was to determine the relationship between perception of rape victims and view of gender social roles. We used a perception of rape victims questionnaire and a validated 5-item gender social roles scale to assess the views of participants. The findings revealed that females had better perception of victims of rape than males. Females also had more positive views of females' social roles involving gender. However, there was poor perception on work-related social roles and the traditional concept of headship in the varied situations described on the 5-item gender social scale. Old stereotypes of typically blaming victims of rape were not common beliefs among college students. There were no significant correlations between perception of victims of rape and perception of gender social roles among college students. Seemingly, the perception of victims of rape does not have a significant relationship with the concept of gender social roles.

  14. Scoring best-worst data in unbalanced many-item designs, with applications to crowdsourcing semantic judgments.

    Science.gov (United States)

    Hollis, Geoff

    2018-04-01

    Best-worst scaling is a judgment format in which participants are presented with a set of items and have to choose the superior and inferior items in the set. Best-worst scaling generates a large quantity of information per judgment because each judgment allows for inferences about the rank value of all unjudged items. This property of best-worst scaling makes it a promising judgment format for research in psychology and natural language processing concerned with estimating the semantic properties of tens of thousands of words. A variety of different scoring algorithms have been devised in the previous literature on best-worst scaling. However, due to problems of computational efficiency, these scoring algorithms cannot be applied efficiently to cases in which thousands of items need to be scored. New algorithms are presented here for converting responses from best-worst scaling into item scores for thousands of items (many-item scoring problems). These scoring algorithms are validated through simulation and empirical experiments, and considerations related to noise, the underlying distribution of true values, and trial design are identified that can affect the relative quality of the derived item scores. The newly introduced scoring algorithms consistently outperformed scoring algorithms used in the previous literature on scoring many-item best-worst data.
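
    For orientation, the simplest way to turn best-worst judgments into item scores is plain counting: the number of times an item was chosen as best, minus the number of times it was chosen as worst, divided by the number of times it was shown. The sketch below illustrates only that baseline; it is not one of the new algorithms this record describes, and the function name and trial data are hypothetical.

```python
from collections import Counter


def best_worst_counts(trials):
    """Counting score per item: (times best - times worst) / times shown.

    trials: iterable of (items_shown, best_item, worst_item) tuples.
    """
    best, worst, shown = Counter(), Counter(), Counter()
    for items, best_item, worst_item in trials:
        shown.update(items)
        best[best_item] += 1
        worst[worst_item] += 1
    return {item: (best[item] - worst[item]) / shown[item] for item in shown}


# Hypothetical trials with four words per trial.
demo_trials = [
    (("joy", "table", "grief", "cloud"), "joy", "grief"),
    (("joy", "table", "cloud", "war"), "joy", "war"),
]
print(best_worst_counts(demo_trials))
```

    Scores fall between -1 and 1; items never chosen as best or worst score 0.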

  15. Psychometric properties of the 7-item game addiction scale among French and German speaking adults.

    Science.gov (United States)

    Khazaal, Yasser; Chatton, Anne; Rothen, Stephane; Achab, Sophia; Thorens, Gabriel; Zullino, Daniele; Gmel, Gerhard

    2016-05-10

    The 7-item Game Addiction Scale (GAS) is used to screen for addictive game use. Both cross-linguistic validation and validation in French and German are needed in adult samples. The objective of the study is to assess the factorial structure of the French and German versions of the GAS among adults. Two samples of men from French (N = 3318) and German (N = 2665) language areas of Switzerland were assessed with the GAS, the Major Depression Inventory (MDI), the Brief Sensation Seeking Scale, and the Zuckerman-Kuhlman Personality Questionnaire (ZKPQ-50-cc). They were also assessed for cannabis and alcohol use. The internal consistency of the scale was satisfactory (Cronbach α = 0.85). A one-factor solution was found in both samples. Small and positive associations were found between GAS scores and the MDI, as well as the Neuroticism-Anxiety and Aggression-Hostility subscales of the ZKPQ-50-cc. A small negative association was found with the ZKPQ-50-cc Sociability subscale. The GAS, in its French and German versions, is appropriate for the assessment of game addiction among adults.
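
    The internal-consistency figure reported here is Cronbach's alpha, which compares the sum of the item variances with the variance of the summed score. A minimal sketch of the standard formula follows; the function name and the response data are hypothetical and unrelated to the study's samples.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a respondents-by-items table of scores.

    rows: list of respondents, each a list of k item scores.
    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    k = len(rows[0])
    item_vars = [variance([row[i] for row in rows]) for i in range(k)]
    total_var = variance([sum(row) for row in rows])
    return (k / (k - 1)) * (1.0 - sum(item_vars) / total_var)


# Hypothetical responses: 4 respondents, 3 items on a 1-5 scale.
demo_rows = [
    [1, 2, 1],
    [3, 3, 2],
    [4, 4, 5],
    [5, 4, 5],
]
print(round(cronbach_alpha(demo_rows), 2))
```

    Alpha approaches 1 when items rise and fall together across respondents, and drops toward 0 when item scores are unrelated.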

  16. A confirmative clinimetric analysis of the 36-item Family Assessment Device.

    Science.gov (United States)

    Timmerby, Nina; Cosci, Fiammetta; Watson, Maggie; Csillag, Claudio; Schmitt, Florence; Steck, Barbara; Bech, Per; Thastum, Mikael

    2018-02-07

    The Family Assessment Device (FAD) is a 60-item questionnaire widely used to evaluate self-reported family functioning. However, the factor structure as well as the number of items has been questioned. A shorter and more user-friendly version of the original FAD-scale, the 36-item FAD, has therefore previously been proposed, based on findings in a nonclinical population of adults. We aimed in this study to evaluate the brief 36-item version of the FAD in a clinical population. Data from a European multinational study, examining factors associated with levels of family functioning in adult cancer patients' families, were used. Both healthy and ill parents completed the 60-item version FAD. The psychometric analyses conducted were Principal Component Analysis and Mokken-analysis. A total of 564 participants were included. Based on the psychometric analysis we confirmed that the 36-item version of the FAD has robust psychometric properties and can be used in clinical populations. The present analysis confirmed that the 36-item version of the FAD (18 items assessing 'well-being' and 18 items assessing 'dysfunctional' family function) is a brief scale where the summed total score is a valid measure of the dimensions of family functioning. This shorter version of the FAD is, in accordance with the concept of 'measurement-based care', an easy to use scale that could be considered when the aim is to evaluate self-reported family functioning.

  17. Symptoms of anxiety in depression: assessment of item performance of the Hamilton Anxiety Rating Scale in patients with depression.

    Science.gov (United States)

    Vaccarino, Anthony L; Evans, Kenneth R; Sills, Terrence L; Kalali, Amir H

    2008-01-01

    Although diagnostically dissociable, anxiety is strongly co-morbid with depression. To examine further the clinical symptoms of anxiety in major depressive disorder (MDD), a non-parametric item response analysis on "blinded" data from four pharmaceutical company clinical trials was performed on the Hamilton Anxiety Rating Scale (HAMA) across levels of depressive severity. The severity of depressive symptoms was assessed using the 17-item Hamilton Depression Rating Scale (HAMD). HAMA and HAMD measures were supplied for each patient on each of two post-screen visits (n=1,668 observations). Option characteristic curves were generated for all 14 HAMA items to determine the probability of scoring a particular option on the HAMA in relation to the total HAMD score. Additional analyses were conducted using Pearson's product-moment correlations. Results showed that anxiety-related symptomatology generally increased as a function of overall depressive severity, though there were clear differences between individual anxiety symptoms in their relationship with depressive severity. In particular, anxious mood, tension, insomnia, difficulties in concentration and memory, and depressed mood were found to discriminate over the full range of HAMD scores, increasing continuously with increases in depressive severity. By contrast, many somatic-related symptoms, including muscular, sensory, cardiovascular, respiratory, gastro-intestinal, and genito-urinary were manifested primarily at higher levels of depression and did not discriminate well at lower HAMD scores. These results demonstrate anxiety as a core feature of depression, and the relationship between anxiety-related symptoms and depression should be considered in the assessment of depression and evaluation of treatment strategies and outcome.

  18. Item response modeling: A psychometric assessment of the children's fruit, vegetable, water, and physical activity self-efficacy scales among Chinese children

    Science.gov (United States)

    This study aimed to evaluate the psychometric properties of four self-efficacy scales (i.e., self-efficacy for fruit (FSE), vegetable (VSE), and water (WSE) intakes, and physical activity (PASE)) and to investigate their differences in item functioning across sex, age, and body weight status groups ...

  19. Psychometric properties of the brief version of the Fear of Negative Evaluation Scale in a Turkish sample.

    Science.gov (United States)

    Koydemir, Selda; Demir, Ayhan

    2007-06-01

    The purpose of the study was to report initial data on the psychometric properties of the Brief Fear of Negative Evaluation Scale. The scale was applied to a nonclinical sample of 250 (137 women, 113 men) Turkish undergraduate students selected randomly from Middle East Technical University. Their mean age was 20.4 yr. (SD= 1.9). The factor structure of the Turkish version, its criterion validity, and internal reliability coefficients were assessed. Although maximum likelihood factor analysis initially indicated that the scale had only one factor, a forced two-factor solution accounted for more variance (61%) in scale scores than a single factor. The straightforward items loaded on the first factor, and the reverse-coded items loaded on the second factor. The total score was significantly positively correlated with scores on the Revised Cheek and Buss Shyness Scale and significantly negatively correlated with scores on the Rosenberg Self-Esteem Scale. Factor 1 (straightforward items) correlated more highly with both Shyness and Self-esteem than Factor 2 (reverse-coded items). Internal consistency estimate was .94 for the Total scores, .91 for the Factor 1 (straightforward items), and .87 for the Factor 2 (reverse-coded items). No sex differences were evident for Fear of Negative Evaluation.

  20. Cross-cultural adaptation and psychometric evaluations of the Turkish version of Parkinson Fatigue Scale.

    Science.gov (United States)

    Ozturk, Erhan Arif; Kocer, Bilge Gonenli; Umay, Ebru; Cakci, Aytul

    2018-06-07

    The objectives of the present study were to translate and cross-culturally adapt the English version of the Parkinson Fatigue Scale into Turkish, to evaluate its psychometric properties, and to compare them with those of other language versions. A total of 144 patients with idiopathic Parkinson disease were included in the study. The Turkish version of the Parkinson Fatigue Scale was evaluated for data quality, scaling assumptions, acceptability, reliability, and validity. The questionnaire response rate was 100% for both test and retest. The percentage of missing data was zero for items, and all scores were computable. Floor and ceiling effects were absent. The Parkinson Fatigue Scale provides acceptable internal consistency (Cronbach's alpha was 0.974 for the first test and 0.964 for the retest, and corrected item-to-total correlations ranged from 0.715 to 0.906) and test-retest reliability (Cohen's kappa coefficients ranged from 0.632 to 0.786 for individual items, and the intraclass correlation coefficient was 0.887 for the overall Parkinson Fatigue Scale Score). An exploratory factor analysis of the items revealed a single factor explaining 71.7% of variance. The goodness-of-fit statistics for the one-factor confirmatory factor analysis were Tucker Lewis index = 0.961, comparative fit index = 0.971 and root mean square error of approximation = 0.077. The average Parkinson Fatigue Scale Score was correlated significantly with sociodemographic data, clinical characteristics and scores of rating scales. The Turkish version of the Parkinson Fatigue Scale seems to be culturally well adapted and to have good psychometric properties. The scale can be used in further studies to assess fatigue in patients with Parkinson's disease.

  1. The Consumer Motivation Scale: A detailed review of item generation, exploration, confirmation, and validation procedures

    Directory of Open Access Journals (Sweden)

    I. Barbopoulos

    2017-08-01

    This data article offers a detailed description of analyses pertaining to the development of the Consumer Motivation Scale (CMS), from item generation and the extraction of factors, to confirmation of the factor structure and validation of the emergent dimensions. The established goal structure – consisting of the sub-goals Value for Money, Quality, Safety, Stimulation, Comfort, Ethics, and Social Acceptance – is shown to be related to a variety of consumption behaviors in different contexts and for different products, and should thereby prove useful in standard marketing research, as well as in the development of tailored marketing strategies, and the segmentation of consumer groups, settings, brands, and products.

  2. The Consumer Motivation Scale: A detailed review of item generation, exploration, confirmation, and validation procedures.

    Science.gov (United States)

    Barbopoulos, I; Johansson, L-O

    2017-08-01

    This data article offers a detailed description of analyses pertaining to the development of the Consumer Motivation Scale (CMS), from item generation and the extraction of factors, to confirmation of the factor structure and validation of the emergent dimensions. The established goal structure - consisting of the sub-goals Value for Money, Quality, Safety, Stimulation, Comfort, Ethics, and Social Acceptance - is shown to be related to a variety of consumption behaviors in different contexts and for different products, and should thereby prove useful in standard marketing research, as well as in the development of tailored marketing strategies, and the segmentation of consumer groups, settings, brands, and products.

  3. Development of an assessment tool to measure students' perceptions of respiratory care education programs: Item generation, item reduction, and preliminary validation

    Directory of Open Access Journals (Sweden)

    Ghazi Alotaibi

    2013-01-01

    Objectives: Students who perceive their learning environment positively are more likely to develop effective learning strategies and adopt a deep learning approach. Currently, there is no validated instrument for measuring the educational environment of educational programs on respiratory care (RC). The aim of this study was to develop an instrument to measure students' perception of the RC educational environment. Materials and Methods: Based on the literature review and an assessment of content validity by multiple focus groups of RC educationalists, potential items of the instrument relevant to the RC educational environment construct were generated by the research group. The initial 71-item questionnaire was then field-tested on all students from the 3 RC programs in Saudi Arabia and was subjected to multi-trait scaling analysis. Cronbach's alpha was used to assess internal consistency reliabilities. Results: Two hundred and twelve students (100%) completed the survey. The initial instrument of 71 items was reduced to 65 across 5 scales. Convergent and discriminant validity assessment demonstrated that the majority of items correlated more highly with their intended scale than a competing one. Cronbach's alpha exceeded the standard criterion of >0.70 in all scales except one. There was no floor or ceiling effect for scale or overall score. Conclusions: This instrument is the first assessment tool developed to measure the RC educational environment. There was evidence of its good feasibility, validity, and reliability. This first validation of the instrument supports its use by RC students to evaluate the educational environment.

  4. Reliability, Validity, and Predictive Utility of the 25-Item Criminogenic Cognitions Scale (CCS).

    Science.gov (United States)

    Tangney, June Price; Stuewig, Jeffrey; Furukawa, Emi; Kopelovich, Sarah; Meyer, Patrick; Cosby, Brandon

    2012-10-01

    Theory, research, and clinical reports suggest that moral cognitions play a role in initiating and sustaining criminal behavior. The 25 item Criminogenic Cognitions Scale (CCS) was designed to tap 5 dimensions: Notions of entitlement; Failure to Accept Responsibility; Short-Term Orientation; Insensitivity to Impact of Crime; and Negative Attitudes Toward Authority. Results from 552 jail inmates support the reliability, validity, and predictive utility of the measure. The CCS was linked to criminal justice system involvement, self-report measures of aggression, impulsivity, and lack of empathy. Additionally, the CCS was associated with violent criminal history, antisocial personality, and clinicians' ratings of risk for future violence and psychopathy (PCL:SV). Furthermore, criminogenic thinking upon incarceration predicted subsequent official reports of inmate misconduct during incarceration. CCS scores varied somewhat by gender and race. Research and applied uses of CCS are discussed.

  5. Psychometric properties of the Bulgarian translation of noise sensitivity scale short form (NSS-SF): implementation in the field of noise control.

    Science.gov (United States)

    Dzhambov, Angel M; Dimitrova, Donka D

    2014-01-01

    The Noise Sensitivity Scale Short Form (NSS-SF), developed in English as a more practical form of the classical Weinstein NSS, has not to date been validated in other cultures, and its validity and reliability have not yet been confirmed. This study aimed to validate NSS-SF in Bulgarian and to demonstrate its applicability. The study comprised test-retest (n = 115) and a field-testing (n = 71) of the newly validated scale. Its construct validity was examined with confirmatory factor analysis, and very good model-fit was observed. Temporal stability was assessed in a test-retest (r = 0.990), convergent validity was examined with single-item susceptibility to the noise scale (r = 0.906) and discriminant validity was confirmed with single-item noise annoyance scale (r = 0.718). The lowest observed McDonald's omega across the studies was 0.923. The cross-cultural validation of NSS-SF was successful but it proved to be somewhat problematic with respect to its annoyance-based items.

  6. A comparison of Rasch item-fit and Cronbach's alpha item reduction analysis for the development of a Quality of Life scale for children and adolescents.

    Science.gov (United States)

    Erhart, M; Hagquist, C; Auquier, P; Rajmil, L; Power, M; Ravens-Sieberer, U

    2010-07-01

    This study compares item reduction analysis based on classical test theory (maximizing Cronbach's alpha - approach A), with analysis based on the Rasch Partial Credit Model item-fit (approach B), as applied to children and adolescents' health-related quality of life (HRQoL) items. The reliability and structural, cross-cultural and known-group validity of the measures were examined. Within the European KIDSCREEN project, 3019 children and adolescents (8-18 years) from seven European countries answered 19 HRQoL items of the Physical Well-being dimension of a preliminary KIDSCREEN instrument. The Cronbach's alpha and corrected item total correlation (approach A) were compared with infit mean squares and the Q-index item-fit derived according to a partial credit model (approach B). Cross-cultural differential item functioning (DIF ordinal logistic regression approach), structural validity (confirmatory factor analysis and residual correlation) and relative validity (RV) for socio-demographic and health-related factors were calculated for approaches (A) and (B). Approach (A) led to the retention of 13 items, compared with 11 items with approach (B). The item overlap was 69% for (A) and 78% for (B). The correlation coefficient of the summated ratings was 0.93. The Cronbach's alpha was similar for both versions [0.86 (A); 0.85 (B)]. Both approaches selected some items that are not strictly unidimensional and items displaying DIF. RV ratios favoured (A) with regard to socio-demographic aspects. Approach (B) was superior in RV with regard to health-related aspects. Both types of item reduction analysis should be accompanied by additional analyses. Neither of the two approaches was universally superior with regard to cultural, structural and known-group validity. However, the results support the usability of the Rasch method for developing new HRQoL measures for children and adolescents.

  7. Development of the Learner Self-Directedness in the Workplace Scale

    Directory of Open Access Journals (Sweden)

    Karina De Bruin

    2011-10-01

    Research purpose: The purpose of this study was to develop a scale to measure learner self-directedness in the workplace. Motivation for the study: Learner self-directedness appears to be an essential characteristic to keep up with the demands of the world of work. There is no brief instrument currently available to measure learner self-directedness in the workplace. Research design, approach and method: The researchers fitted the responses of 519 participants to 22 items to the Rasch rating scale model. Main findings: The researchers retained 13 of the original 22 items. The hierarchy of item locations supported the construct validity of the scale. Hierarchical factor analysis showed the presence of one higher-order factor and three residual first-order factors. The higher-order factor accounted for almost five times as much of the common variance as did the strongest residual first-order factor. The Rasch analysis and the factor analysis suggested that the 13-item Learner Self-Directedness in the Workplace Scale (LSWS) measures a single one-dimensional construct (α = 0.93). Practical/managerial implications: The instrument can help employers to understand and support employees’ self-directed learning efforts. Contribution/value-add: This research resulted in a brief instrument to measure learner self-directedness in the workplace. This instrument is unique in the South African context.

  8. Instemmingsgeneigdheid en verskillende item- en responsformate in 'n gesommeerde selfbeoordelingskaal

    Directory of Open Access Journals (Sweden)

    Nadene Hanekom

    1998-06-01

    This study examines the degree of acquiescence present when the item and response formats of a summated rating scale are varied. It is often recommended that acquiescence response bias in rating scales may be controlled by using both positively and negatively worded items. Such items are generally worded in the Likert-type format of statements. The purpose of the study was to establish whether items in question format would result in a smaller degree of acquiescence than items worded as statements. The response format was also varied (five- and seven-point options) to determine whether this would influence the reliability and degree of acquiescence in the scales. A twenty-item Locus of Control (LC) questionnaire was used, but each item was complemented by its opposite, resulting in 40 items. The subjects, divided randomly into two groups, were second-year students who had to complete four versions of the questionnaire, plus a shortened version of Bass's scale for measuring acquiescence. The LC versions were questions or statements, each combined with a five- or seven-point response format. Partial counterbalancing was introduced by testing on two separate occasions, presenting the tests to the two groups in the opposite order. The degree of acquiescence was assessed by correlating the items with their opposites, and by correlating scores on each version with scores on the acquiescence questionnaire. No major differences were found between the various item and response formats in relation to acquiescence. Summary: This investigation was carried out to determine whether the degree of acquiescence is influenced by the item and response format of a summated self-rating scale. It is often recommended that the use of both positively and negatively worded items in a questionnaire limits acquiescence. Such items are usually formulated as statements in the traditional Likert format. The purpose of the investigation was to determine whether items

  9. Resiliency Scale (RS): Scale Development, Reliability and Validity Study

    OpenAIRE

    GÜRGAN, Uğur

    2003-01-01

    The purpose of this study was to develop a new Resiliency Scale (RS) for Turkish samples. Various items from some major resiliency scales, most of them partially modified, were collected, and a pool of 228 items covering almost all possible resilience areas was obtained. This item pool was administered to a college sample of 419. The analysis yielded a 50-item RS, which was administered to a new college sample of 112 participants. This second sample also received the Rosenba...

  10. Overview of classical test theory and item response theory for the quantitative assessment of items in developing patient-reported outcomes measures.

    Science.gov (United States)

    Cappelleri, Joseph C; Jason Lundy, J; Hays, Ron D

    2014-05-01

    The US Food and Drug Administration's guidance for industry document on patient-reported outcomes (PRO) defines content validity as "the extent to which the instrument measures the concept of interest" (FDA, 2009, p. 12). According to Strauss and Smith (2009), construct validity "is now generally viewed as a unifying form of validity for psychological measurements, subsuming both content and criterion validity" (p. 7). Hence, both qualitative and quantitative information are essential in evaluating the validity of measures. We review classical test theory and item response theory (IRT) approaches to evaluating PRO measures, including frequency of responses to each category of the items in a multi-item scale, the distribution of scale scores, floor and ceiling effects, the relationship between item response options and the total score, and the extent to which hypothesized "difficulty" (severity) order of items is represented by observed responses. If a researcher has few qualitative data and wants to get preliminary information about the content validity of the instrument, then descriptive assessments using classical test theory should be the first step. As the sample size grows during subsequent stages of instrument development, confidence in the numerical estimates from Rasch and other IRT models (as well as those of classical test theory) would also grow. Classical test theory and IRT can be useful in providing a quantitative assessment of items and scales during the content-validity phase of PRO-measure development. Depending on the particular type of measure and the specific circumstances, the classical test theory and/or the IRT should be considered to help maximize the content validity of PRO measures. Copyright © 2014 Elsevier HS Journals, Inc. All rights reserved.
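
    The descriptive classical-test-theory checks listed in this abstract (response frequencies per category, floor and ceiling effects) can be sketched in a few lines. The following is only an illustrative sketch, not the authors' procedure: the function names and the sample data are hypothetical, and the floor and ceiling are taken here to mean the lowest and highest possible scale scores.

```python
import numpy as np

def floor_ceiling(scale_scores, min_score, max_score):
    """Return the proportions of respondents at the lowest and highest possible score."""
    s = np.asarray(scale_scores)
    return float(np.mean(s == min_score)), float(np.mean(s == max_score))

def category_frequencies(item_responses, categories):
    """Count how often each response category of a single item was chosen."""
    r = np.asarray(item_responses)
    return {c: int(np.sum(r == c)) for c in categories}

# Hypothetical scale scores on a 0-10 scale and responses to one 4-category item.
floor, ceiling = floor_ceiling([0, 0, 5, 10], min_score=0, max_score=10)
counts = category_frequencies([1, 2, 2, 3], categories=[1, 2, 3, 4])
print(floor, ceiling, counts)
```

    A category that is never chosen, or a large share of respondents at either extreme, is the kind of pattern these descriptive checks are meant to surface early.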

  11. Applying Item Response Theory methods to design a learning progression-based science assessment

    Science.gov (United States)

    Chen, Jing

    Learning progressions are used to describe how students' understanding of a topic progresses over time and to classify the progress of students into steps or levels. This study applies Item Response Theory (IRT) based methods to investigate how to design learning progression-based science assessments. The research questions of this study are: (1) how to use items in different formats to classify students into levels on the learning progression, (2) how to design a test to give good information about students' progress through the learning progression of a particular construct and (3) what characteristics of test items support their use for assessing students' levels. Data used for this study were collected from 1500 elementary and secondary school students during 2009-2010. The written assessment was developed in several formats such as the Constructed Response (CR) items, Ordered Multiple Choice (OMC) and Multiple True or False (MTF) items. The following are the main findings from this study. The OMC, MTF and CR items might measure different components of the construct. A single construct explained most of the variance in students' performances. However, additional dimensions in terms of item format can explain a certain amount of the variance in student performance. So additional dimensions need to be considered when we want to capture the differences in students' performances on different types of items targeting the understanding of the same underlying progression. Items in each item format need to be improved in certain ways to classify students more accurately into the learning progression levels. This study establishes some general steps that can be followed to design other learning progression-based tests as well. For example, first, the boundaries between levels on the IRT scale can be defined by using the means of the item thresholds across a set of good items. 
Second, items in multiple formats can be selected to achieve the information criterion at all

  12. Item-focussed Trees for the Identification of Items in Differential Item Functioning.

    Science.gov (United States)

    Tutz, Gerhard; Berger, Moritz

    2016-09-01

    A novel method for the identification of differential item functioning (DIF) by means of recursive partitioning techniques is proposed. We assume an extension of the Rasch model that allows for DIF being induced by an arbitrary number of covariates for each item. Recursive partitioning on the item level results in one tree for each item and leads to simultaneous selection of items and variables that induce DIF. For each item, it is possible to detect groups of subjects with different item difficulties, defined by combinations of characteristics that are not pre-specified. The way a DIF item is determined by covariates is visualized in a small tree and therefore easily accessible. An algorithm is proposed that is based on permutation tests. Various simulation studies, including the comparison with traditional approaches to identify items with DIF, show the applicability and the competitive performance of the method. Two applications illustrate the usefulness and the advantages of the new method.
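
    To make the notion of DIF in a Rasch model concrete, the sketch below computes the response probability for an item whose difficulty is shifted for one group of respondents. It is a simplified illustration under stated assumptions, not the tree-based algorithm of the paper: a single binary grouping stands in for the data-driven splits, and the names `shift` and `in_focal_group` are hypothetical.

```python
import math

def rasch_probability(theta, beta):
    """Probability of a positive response under the Rasch model: logistic(theta - beta)."""
    return 1.0 / (1.0 + math.exp(-(theta - beta)))

def dif_item_probability(theta, beta, shift, in_focal_group):
    """Rasch probability when the item difficulty is beta + shift for the focal group only."""
    effective_beta = beta + (shift if in_focal_group else 0.0)
    return rasch_probability(theta, effective_beta)

# Two persons with the same ability but different group membership (hypothetical values).
p_reference = dif_item_probability(theta=0.0, beta=0.0, shift=0.8, in_focal_group=False)
p_focal = dif_item_probability(theta=0.0, beta=0.0, shift=0.8, in_focal_group=True)
print(p_reference, p_focal)
```

    With equal ability, the two probabilities differ only because the item is harder for the focal group, which is what an item-level split on a covariate would indicate.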

  13. MMPI-2 Item Endorsements in Dissociative Identity Disorder vs. Simulators.

    Science.gov (United States)

    Brand, Bethany L; Chasson, Gregory S; Palermo, Cori A; Donato, Frank M; Rhodes, Kyle P; Voorhees, Emily F

    2016-03-01

    Elevated scores on some MMPI-2 (Minnesota Multiphasic Personality Inventory-2) validity scales are common among patients with dissociative identity disorder (DID), which raises questions about the validity of their responses. Such patients show elevated scores on atypical answers (F), F-psychopathology (Fp), atypical answers in the second half of the test (FB), schizophrenia (Sc), and depression (D) scales, with Fp showing the greatest utility in distinguishing them from coached and uncoached DID simulators. In the current study, we investigated the items on the MMPI-2 F, Fp, FB, Sc, and D scales that were most and least commonly endorsed by participants with DID in our 2014 study and compared these responses with those of coached and uncoached DID simulators. The comparisons revealed that patients with DID most frequently endorsed items related to dissociation, trauma, depression, fearfulness, conflict within family, and self-destructiveness. The coached group more successfully imitated item endorsements of the DID group than did the uncoached group. However, both simulating groups, especially the uncoached group, frequently endorsed items that were uncommonly endorsed by the DID group. The uncoached group endorsed items consistent with popular media portrayals of people with DID being violent, delusional, and unlawful. These results suggest that item endorsement patterns can provide useful information to clinicians making determinations about whether an individual is presenting with DID or feigning. © 2016 American Academy of Psychiatry and the Law.

  14. Optimizing incomplete sample designs for item response model parameters

    NARCIS (Netherlands)

    van der Linden, Willem J.

    Several models for optimizing incomplete sample designs with respect to information on the item parameters are presented. The following cases are considered: (1) known ability parameters; (2) unknown ability parameters; (3) item sets with multiple ability scales; and (4) response models with

  15. Using Item Response Theory to Describe the Nonverbal Literacy Assessment (NVLA)

    Science.gov (United States)

    Fleming, Danielle; Wilson, Mark; Ahlgrim-Delzell, Lynn

    2018-01-01

    The Nonverbal Literacy Assessment (NVLA) is a literacy assessment designed for students with significant intellectual disabilities. The 218-item test was initially examined using confirmatory factor analysis. This method showed that the test worked as expected, but the items loaded onto a single factor. This article uses item response theory to…

  16. CTTITEM: SAS macro and SPSS syntax for classical item analysis.

    Science.gov (United States)

    Lei, Pui-Wa; Wu, Qiong

    2007-08-01

    This article describes the functions of a SAS macro and an SPSS syntax that produce common statistics for conventional item analysis including Cronbach's alpha, item difficulty index (p-value or item mean), and item discrimination indices (D-index, point biserial and biserial correlations for dichotomous items and item-total correlation for polytomous items). These programs represent an improvement over the existing SAS and SPSS item analysis routines in terms of completeness and user-friendliness. To promote routine evaluations of item qualities in instrument development of any scale, the programs are available at no charge for interested users. The program codes along with a brief user's manual that contains instructions and examples are downloadable from suen.ed.psu.edu/-pwlei/plei.htm.
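
    The statistics named in this abstract are simple to compute. The sketch below is a minimal Python illustration of three of them (Cronbach's alpha, the item difficulty index as an item mean, and the corrected item-total correlation); it is not the CTTITEM macro or syntax, and the function names and response data are hypothetical.

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]
    item_variances = scores.var(axis=0, ddof=1)
    total_variance = scores.sum(axis=1).var(ddof=1)
    return k / (k - 1) * (1 - item_variances.sum() / total_variance)

def item_difficulty(scores):
    """Item mean; for 0/1 items this is the proportion answering correctly."""
    return np.asarray(scores, dtype=float).mean(axis=0)

def corrected_item_total(scores):
    """Correlation of each item with the total score of the remaining items."""
    scores = np.asarray(scores, dtype=float)
    total = scores.sum(axis=1)
    return np.array([
        np.corrcoef(scores[:, j], total - scores[:, j])[0, 1]
        for j in range(scores.shape[1])
    ])

# Hypothetical dichotomous responses: 6 respondents, 4 items.
responses = np.array([
    [1, 1, 1, 0],
    [1, 0, 1, 0],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [1, 1, 0, 1],
])
print(cronbach_alpha(responses))
print(item_difficulty(responses))
print(corrected_item_total(responses))
```

    Removing the item from the total before correlating avoids inflating the discrimination index, which matters most for short scales.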

  17. Spare Items validation

    International Nuclear Information System (INIS)

    Fernandez Carratala, L.

    1998-01-01

    It is increasingly difficult to purchase safety-related spare items with manufacturer certifications that maintain the original qualification of the destination equipment. The main reasons are, on top of the logical evolution of technology applied to newly manufactured components, the discontinuation of nuclear-specific production lines and the evolution of manufacturers' quality systems, originally based on nuclear codes and standards, toward conventional industry standards. To face this problem, different dedication processes have been implemented for many years to verify whether a commercial-grade element is acceptable for use in safety-related applications. In the same way, due to our particular position regarding spare part supplies, which come mainly from markets other than the American one, C.N. Trillo has developed a methodology called Spare Items Validation. This methodology, which is originally based on dedication processes, is not a single process but a group of coordinated processes involving engineering, quality and management activities. These are performed on the spare item itself, its design control, its fabrication and its supply to allow its use in destinations with specific requirements. The scope of application is not limited to safety-related items but also covers components of complex design, high cost or relevance to plant reliability. The implementation at C.N. Trillo has been carried out mainly by merging, modifying and making the most of processes and activities that were already being performed in the company. (Author)

  18. The 12-item World Health Organization Disability Assessment Schedule II (WHO-DAS II): a nonparametric item response analysis

    Directory of Open Access Journals (Sweden)

    Fernandez Ana

    2010-05-01

    Background: Previous studies have analyzed the psychometric properties of the World Health Organization Disability Assessment Schedule II (WHO-DAS II) using classical omnibus measures of scale quality. These analyses are sample dependent and do not model item responses as a function of the underlying trait level. The main objective of this study was to examine the effectiveness of the WHO-DAS II items and their options in discriminating between changes in the underlying disability level by means of item response analyses. We also explored differential item functioning (DIF) in men and women. Methods: The participants were 3615 adult general practice patients from 17 regions of Spain, with a first diagnosed major depressive episode. The 12-item WHO-DAS II was administered by the general practitioners during the consultation. We used a non-parametric item response method (Kernel-Smoothing) implemented with the TestGraf software to examine the effectiveness of each item (item characteristic curves) and their options (option characteristic curves) in discriminating between changes in the underlying disability level. We examined composite DIF to know whether women had a higher probability than men of endorsing each item. Results: Item response analyses indicated that the twelve items forming the WHO-DAS II perform very well. All items were determined to provide good discrimination across varying standardized levels of the trait. The items also had option characteristic curves that showed good discrimination, given that each increasing option became more likely than the previous as a function of increasing trait level. No gender-related DIF was found on any of the items. Conclusions: All WHO-DAS II items were very good at assessing overall disability. Our results supported the appropriateness of the weights assigned to response option categories and showed an absence of gender differences in item functioning.
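
    The idea behind a kernel-smoothed option characteristic curve can be sketched as a weighted running proportion: at each trait level, respondents with nearby trait estimates contribute most to the estimated probability of choosing an option. The code below is a generic Nadaraya-Watson smoother with a Gaussian kernel, offered as an assumption-laden illustration and not as TestGraf's implementation; the function name, the bandwidth value and the data are hypothetical.

```python
import numpy as np

def kernel_smoothed_curve(trait, endorsed, grid, bandwidth=0.5):
    """Estimate P(option chosen | trait level) at each grid point by kernel smoothing."""
    trait = np.asarray(trait, dtype=float)
    endorsed = np.asarray(endorsed, dtype=float)
    grid = np.asarray(grid, dtype=float)
    # Gaussian weights: rows index grid points, columns index respondents.
    z = (grid[:, None] - trait[None, :]) / bandwidth
    weights = np.exp(-0.5 * z ** 2)
    return (weights * endorsed).sum(axis=1) / weights.sum(axis=1)

# Hypothetical standardized trait estimates and 0/1 indicators for one option.
trait = [-2.0, -1.0, 0.0, 1.0, 2.0]
endorsed = [0, 0, 0, 1, 1]
curve = kernel_smoothed_curve(trait, endorsed, grid=[-2.0, 0.0, 2.0])
print(curve)
```

    A curve that rises steadily with the trait level, as in this toy example, corresponds to what the abstract describes as an option with good discrimination.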

  19. Item Response Theory Applied to Factors Affecting the Patient Journey Towards Hearing Rehabilitation

    Science.gov (United States)

    Chenault, Michelene; Berger, Martijn; Kremer, Bernd; Anteunis, Lucien

    2016-01-01

    To develop a tool for use in hearing screening and to evaluate the patient journey towards hearing rehabilitation, responses to three hearing aid rehabilitation questionnaire scales (aid stigma, pressure, and aid unwanted, addressing respectively hearing aid stigma, experienced pressure from others, and perceived hearing aid benefit) were evaluated with item response theory. The sample comprised 212 persons aged 55 years or more: 63 hearing aid users, and 64 persons with and 85 persons without hearing impairment according to guidelines for hearing aid reimbursement in the Netherlands. Bias was investigated relative to hearing aid use and hearing impairment within the differential test functioning framework. Items compromising model fit or demonstrating differential item functioning were dropped. The aid stigma scale was reduced from 6 to 4 items, the pressure scale from 7 to 4, and the aid unwanted scale from 5 to 4. This procedure resulted in bias-free scales ready for screening purposes and application to further understand the help-seeking process of the hearing impaired. PMID:28028428

  20. A single-item self-report medication adherence question predicts hospitalisation and death in patients with heart failure.

    Science.gov (United States)

    Wu, Jia-Rong; DeWalt, Darren A; Baker, David W; Schillinger, Dean; Ruo, Bernice; Bibbins-Domingo, Kristen; Macabasco-O'Connell, Aurelia; Holmes, George M; Broucksou, Kimberly A; Erman, Brian; Hawk, Victoria; Cene, Crystal W; Jones, Christine DeLong; Pignone, Michael

    2014-09-01

    To determine whether a single-item self-report medication adherence question predicts hospitalisation and death in patients with heart failure. Poor medication adherence is associated with increased morbidity and mortality. Having a simple means of identifying suboptimal medication adherence could help identify at-risk patients for interventions. We performed a prospective cohort study in 592 participants with heart failure within a four-site randomised trial. Self-report medication adherence was assessed at baseline using a single-item question: 'Over the past seven days, how many times did you miss a dose of any of your heart medication?' Participants who reported no missing doses were defined as fully adherent, and those missing more than one dose were considered less than fully adherent. The primary outcome was combined all-cause hospitalisation or death over one year and the secondary endpoint was heart failure hospitalisation. Outcomes were assessed with blinded chart reviews, and heart failure outcomes were determined by a blinded adjudication committee. We used negative binomial regression to examine the relationship between medication adherence and outcomes. Fifty-two percent of participants were male, mean age was 61 years, and 31% were of New York Heart Association class III/IV at enrolment; 72% of participants reported full adherence to their heart medicine at baseline. Participants with full medication adherence had a lower rate of all-cause hospitalisation and death (0·71 events/year) compared with those with any nonadherence (0·86 events/year): adjusted-for-site incidence rate ratio was 0·83, fully adjusted incidence rate ratio 0·68. Incidence rate ratios were similar for heart failure hospitalisations. A single medication adherence question at baseline predicts hospitalisation and death over one year in heart failure patients. Medication adherence is associated with all-cause and heart failure-related hospitalisation and death in heart

  1. Validity and Reliability of the 8-Item Work Limitations Questionnaire.

    Science.gov (United States)

    Walker, Timothy J; Tullar, Jessica M; Diamond, Pamela M; Kohl, Harold W; Amick, Benjamin C

    2017-12-01

    Purpose To evaluate factorial validity, scale reliability, test-retest reliability, convergent validity, and discriminant validity of the 8-item Work Limitations Questionnaire (WLQ) among employees from a public university system. Methods A secondary analysis using de-identified data from employees who completed an annual Health Assessment between the years 2009-2015 tested research aims. Confirmatory factor analysis (CFA) (n = 10,165) tested the latent structure of the 8-item WLQ. Scale reliability was determined using a CFA-based approach while test-retest reliability was determined using the intraclass correlation coefficient. Convergent/discriminant validity was tested by evaluating relations between the 8-item WLQ with health/performance variables for convergent validity (health-related work performance, number of chronic conditions, and general health) and demographic variables for discriminant validity (gender and institution type). Results A 1-factor model with three correlated residuals demonstrated excellent model fit (CFI = 0.99, TLI = 0.99, RMSEA = 0.03, and SRMR = 0.01). The scale reliability was acceptable (0.69, 95% CI 0.68-0.70) and the test-retest reliability was very good (ICC = 0.78). Low-to-moderate associations were observed between the 8-item WLQ and the health/performance variables while weak associations were observed between the demographic variables. Conclusions The 8-item WLQ demonstrated sufficient reliability and validity among employees from a public university system. Results suggest the 8-item WLQ is a usable alternative for studies when the more comprehensive 25-item WLQ is not available.

  2. Psychometric properties of the Triarchic Psychopathy Measure: An item response theory approach.

    Science.gov (United States)

    Shou, Yiyun; Sellbom, Martin; Xu, Jing

    2018-05-01

    There is cumulative evidence for the cross-cultural validity of the Triarchic Psychopathy Measure (TriPM; Patrick, 2010) among non-Western populations. Recent studies using correlational and regression analyses show promising construct validity of the TriPM in Chinese samples. However, little is known about the efficiency of items in TriPM in assessing the proposed latent traits. The current study evaluated the psychometric properties of the Chinese TriPM at the item level using item response theory analyses. It also examined the measurement invariance of the TriPM between the Chinese and the U.S. student samples by applying differential item functioning analyses under the item response theory framework. The results supported the unidimensional nature of the Disinhibition and Meanness scales. Both scales had a greater level of precision in the respective underlying constructs at the positive ends. The two scales, however, had several items that were weakly associated with their respective latent traits in the Chinese student sample. Boldness, on the other hand, was found to be multidimensional, and reflected a more normally distributed range of variation. The examination of measurement bias via differential item functioning analyses revealed that a number of items of the TriPM were not equivalent across the Chinese and the U.S. Some modification and adaptation of items might be considered for improving the precision of the TriPM for Chinese participants. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  3. Negative Affect Impairs Associative Memory but Not Item Memory

    Science.gov (United States)

    Bisby, James A.; Burgess, Neil

    2014-01-01

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine…

  4. An Investigation of Item Type in a Standards-Based Assessment.

    Directory of Open Access Journals (Sweden)

    Liz Hollingworth

    2007-12-01

    Large-scale state assessment programs use both multiple-choice and open-ended items on tests for accountability purposes. Certainly, there is an intuitive belief among some educators and policy makers that open-ended items measure something different than multiple-choice items. This study examined two item formats in custom-built, standards-based tests of achievement in Reading and Mathematics at grades 3-8. In this paper, we raise questions about the value of including open-ended items, given scoring costs, time constraints, and the higher probability of missing data from test-takers.

  5. Methodology for the development and calibration of the SCI-QOL item banks.

    Science.gov (United States)

    Tulsky, David S; Kisala, Pamela A; Victorson, David; Choi, Seung W; Gershon, Richard; Heinemann, Allen W; Cella, David

    2015-05-01

    To develop a comprehensive, psychometrically sound, and conceptually grounded patient reported outcomes (PRO) measurement system for individuals with spinal cord injury (SCI). Individual interviews (n=44) and focus groups (n=65 individuals with SCI and n=42 SCI clinicians) were used to select key domains for inclusion and to develop PRO items. Verbatim items from other cutting-edge measurement systems (i.e. PROMIS, Neuro-QOL) were included to facilitate linkage and cross-population comparison. Items were field tested in a large sample of individuals with traumatic SCI (n=877). Dimensionality was assessed with confirmatory factor analysis. Local item dependence and differential item functioning were assessed, and items were calibrated using the item response theory (IRT) graded response model. Finally, computer adaptive tests (CATs) and short forms were administered in a new sample (n=245) to assess test-retest reliability and stability. A calibration sample of 877 individuals with traumatic SCI across five SCI Model Systems sites and one Department of Veterans Affairs medical center completed SCI-QOL items in interview format. We developed 14 unidimensional calibrated item banks and 3 calibrated scales across physical, emotional, and social health domains. When combined with the five Spinal Cord Injury-Functional Index physical function banks, the final SCI-QOL system consists of 22 IRT-calibrated item banks/scales. Item banks may be administered as CATs or short forms. Scales may be administered in a fixed-length format only. The SCI-QOL measurement system provides SCI researchers and clinicians with a comprehensive, relevant and psychometrically robust system for measurement of physical-medical, physical-functional, emotional, and social outcomes. All SCI-QOL instruments are freely available on Assessment Center(SM).
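
    For readers unfamiliar with the graded response model named in this abstract, the category probabilities can be sketched as below. This is a generic textbook form under assumed parameters, not the SCI-QOL calibration: the function name, the discrimination value `a`, and the thresholds are hypothetical.

```python
import math

def grm_category_probs(theta, a, thresholds):
    """Graded response model: P(X = k | theta) for k = 0..K.

    theta      -- latent trait level of the respondent
    a          -- item discrimination (hypothetical value in the example)
    thresholds -- increasing category thresholds b_1 < ... < b_K
    """
    # Cumulative curves P(X >= k); by definition P(X >= 0) = 1 and P(X >= K+1) = 0.
    cumulative = (
        [1.0]
        + [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
        + [0.0]
    )
    # Each category probability is the difference of adjacent cumulative curves.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]

# A four-category item evaluated at an average trait level.
probs = grm_category_probs(theta=0.0, a=1.5, thresholds=[-1.0, 0.0, 1.0])
```

    Because the thresholds are increasing, every category probability is positive and the probabilities sum to one at any trait level.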

  6. A more general model for testing measurement invariance and differential item functioning.

    Science.gov (United States)

    Bauer, Daniel J

    2017-09-01

    The evaluation of measurement invariance is an important step in establishing the validity and comparability of measurements across individuals. Most commonly, measurement invariance has been examined using one of two primary latent variable modeling approaches: the multiple groups model or the multiple-indicator multiple-cause (MIMIC) model. Both approaches offer opportunities to detect differential item functioning within multi-item scales, and thereby to test measurement invariance, but both approaches also have significant limitations. The multiple groups model allows one to examine the invariance of all model parameters but only across levels of a single categorical individual difference variable (e.g., ethnicity). In contrast, the MIMIC model permits both categorical and continuous individual difference variables (e.g., sex and age) but permits only a subset of the model parameters to vary as a function of these characteristics. The current article argues that moderated nonlinear factor analysis (MNLFA) constitutes an alternative, more flexible model for evaluating measurement invariance and differential item functioning. We show that the MNLFA subsumes and combines the strengths of the multiple group and MIMIC models, allowing for a full and simultaneous assessment of measurement invariance and differential item functioning across multiple categorical and/or continuous individual difference variables. The relationships between the MNLFA model and the multiple groups and MIMIC models are shown mathematically and via an empirical demonstration. (PsycINFO Database Record (c) 2017 APA, all rights reserved).

  7. Rats Remember Items in Context Using Episodic Memory.

    Science.gov (United States)

    Panoz-Brown, Danielle; Corbin, Hannah E; Dalecki, Stefan J; Gentry, Meredith; Brotheridge, Sydney; Sluka, Christina M; Wu, Jie-En; Crystal, Jonathon D

    2016-10-24

    Vivid episodic memories in people have been characterized as the replay of unique events in sequential order [1-3]. Animal models of episodic memory have successfully documented episodic memory of a single event (e.g., [4-8]). However, a fundamental feature of episodic memory in people is that it involves multiple events, and notably, episodic memory impairments in human diseases are not limited to a single event. Critically, it is not known whether animals remember many unique events using episodic memory. Here, we show that rats remember many unique events and the contexts in which the events occurred using episodic memory. We used an olfactory memory assessment in which new (but not old) odors were rewarded using 32 items. Rats were presented with 16 odors in one context and the same odors in a second context. To attain high accuracy, the rats needed to remember item in context because each odor was rewarded as a new item in each context. The demands on item-in-context memory were varied by assessing memory with 2, 3, 5, or 15 unpredictable transitions between contexts, and item-in-context memory survived a 45 min retention interval challenge. When the memory of item in context was put in conflict with non-episodic familiarity cues, rats relied on item in context using episodic memory. Our findings suggest that rats remember multiple unique events and the contexts in which these events occurred using episodic memory and support the view that rats may be used to model fundamental aspects of human cognition. Copyright © 2016 Elsevier Ltd. All rights reserved.

  8. [Social anxiety and self-esteem: Hungarian validation of the "Brief Fear of Negative Evaluation Scale - Straightforward Items"].

    Science.gov (United States)

    Perczel-Forintos, Dóra; Kresznerits, Szilvia

    2017-06-01

    Although social anxiety disorder (SAD) is the third most frequent emotional disorder with 13-15% prevalence rate, it remains unrecognized very often. Social phobia is associated with low self-esteem, high self-criticism and fear of negative evaluation by others. It shows high comorbidity with depression, alcoholism, drug addiction and eating disorders. To adapt the widely used "Fear of Negative Evaluation" (FNE) social phobia questionnaire. Anxiety and mood disorder patients (n = 255) completed the Fear of Negative Evaluation Scale (30, 12 and 8 item-versions) as well as social cognition, anxiety and self-esteem questionnaires. All three versions of the FNE have strong internal validity (α>0.83) and moderate significant correlation with low self-esteem, negative social cognitions and anxiety. The short 8-item BFNE-S has the strongest discriminative value in differentiating patients with social phobia and with other emotional disorders. The Hungarian version of the BFNE-S is an effective tool for the quick recognition of social phobia. Orv Hetil. 2017; 158(22): 843-850.

  9. Validity and reliability of the Cohen 10-item Perceived Stress Scale in patients with chronic headache: Persian version.

    Science.gov (United States)

    Khalili, Robabe; Sirati Nir, Masoud; Ebadi, Abbas; Tavallai, Abbas; Habibi, Mehdi

    2017-04-01

    The Cohen Perceived Stress Scale is being used widely in various countries. The present study evaluated the validity and reliability of the Cohen 10-item Perceived Stress Scale (PSS-10) in assessing tension headache, migraine, and stress-related diseases in Iran. This study is a methodological and cross-sectional descriptive investigation of 100 patients with chronic headache admitted to the pain clinic of Baqiyatallah Educational and Therapeutic Center. Convenience sampling was used for subject selection. PSS psychometric properties were evaluated in two stages. First, the standard scale was translated. Then, the face validity, content, and construct of the translated version were determined. The average age of participants was 38 years with a standard deviation (SD) of 13.2. As for stress levels, 12% were within the normal range, 36% had an intermediate level, and 52% had a high level of stress. The face validity and scale content were remarkable, and the KMO coefficient was 0.82. Bartlett's test yielded 0.327 which was statistically significant (pstress and chronic headache. Copyright © 2017 Elsevier B.V. All rights reserved.

  10. Subjective Happiness of Lebanese College Youth in Lebanon: Factorial Structure and Invariance of the Arabic Subjective Happiness Scale

    Science.gov (United States)

    Moghnie, Lamia; Kazarian, Shahe S.

    2012-01-01

    The present study evaluated the subjective happiness of Lebanese college youth using a multi-item rather than a single-item subjective happiness measure. An Arabic translation of the Subjective Happiness Scale (SHS) was administered to 273 Lebanese college youth from state- and private-run higher institutions of learning, as was the Arabic Adult…

  11. Does the Assessment of Recovery Capital scale reflect a single or multiple domains?

    Directory of Open Access Journals (Sweden)

    Arndt S

    2017-07-01

    Stephan Arndt (1–3), Ethan Sahker (1,4), Suzy Hedden (1). Affiliations: (1) Iowa Consortium for Substance Abuse Research and Evaluation; (2) Department of Psychiatry, Carver College of Medicine; (3) Department of Biostatistics, College of Public Health; (4) Department of Psychological and Quantitative Foundations, Counseling Psychology Program, College of Education, University of Iowa, Iowa City, IA, USA. Objective: The goal of this study was to determine whether the 50-item Assessment of Recovery Capital scale represents a single general measure or whether multiple domains might be psychometrically useful for research or clinical applications. Methods: Data are from a cross-sectional de-identified existing program evaluation information data set with 1,138 clients entering substance use disorder treatment. Principal components and iterated factor analysis were used on the domain scores. Multiple group factor analysis provided a quasi-confirmatory factor analysis. Results: The solution accounted for 75.24% of the total variance, suggesting that 10 factors provide a reasonably good fit. However, Tucker’s congruence coefficients between the factor structure and defining weights (0.41–0.52) suggested a poor fit to the hypothesized 10-domain structure. Principal components of the 10-domain scores yielded one factor whose eigenvalue was greater than one (5.93), accounting for 75.8% of the common variance. A few domains had perceptible but small unique variance components suggesting that a few of the domains may warrant enrichment. Conclusion: Our findings suggest that there is one general factor, with a caveat. Using the 10 measures inflates the chance for Type I errors. Using one general measure avoids this issue, is simple to interpret, and could reduce the number of items. However, those seeking to maximally predict later recovery success may need to use the full instrument and all 10 domains. Keywords: social support, psychometrics, quality of life
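
    Tucker's congruence coefficient, which this abstract uses to judge fit to the hypothesized structure, is the cosine between two loading vectors. A minimal sketch follows; the function name and the example vectors are hypothetical and are not taken from the study.

```python
import math

def tucker_congruence(x, y):
    """Tucker's congruence coefficient: sum(x*y) / sqrt(sum(x^2) * sum(y^2))."""
    numerator = sum(a * b for a, b in zip(x, y))
    denominator = math.sqrt(sum(a * a for a in x) * sum(b * b for b in y))
    return numerator / denominator

# Proportional loading patterns are perfectly congruent; orthogonal ones are not.
same_shape = tucker_congruence([0.2, 0.4, 0.6], [0.1, 0.2, 0.3])
unrelated = tucker_congruence([1.0, 0.0], [0.0, 1.0])
```

    The coefficient ignores overall scale, so two factors with the same pattern of loadings score close to 1 even when one set of loadings is uniformly smaller; values in the range the abstract reports sit well below that.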

  12. Development of six PROMIS pediatrics proxy-report item banks.

    Science.gov (United States)

    Irwin, Debra E; Gross, Heather E; Stucky, Brian D; Thissen, David; DeWitt, Esi Morgan; Lai, Jin Shei; Amtmann, Dagmar; Khastou, Leyla; Varni, James W; DeWalt, Darren A

    2012-02-22

    Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO) among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS) pediatric proxy-report item banks. The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact). Caregivers (n = 25) of children between the ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads). Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432). In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52. Item content was well understood by proxies and did not require item revisions, but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%), married (70%), Caucasian (64%) and had at least a high school education (94%). Approximately 50% had children with a chronic health condition, primarily asthma, which was diagnosed or treated within 6

  13. A validation study using a modified version of Postural Assessment Scale for Stroke Patients: Postural Stroke Study in Gothenburg (POSTGOT)

    Directory of Open Access Journals (Sweden)

    Danielsson Anna

    2011-10-01

    Abstract Background A modified version of Postural Assessment Scale for Stroke Patients (PASS) was created with some changes in the description of the items and clarifications in the manual (e.g. much help was defined as support from 2 persons). The aim of this validation study was to assess intrarater and interrater reliability using this modified version of PASS, at a stroke unit, for patients in the acute phase after their first event of stroke. Methods In the intrarater reliability study 114 patients and in the interrater reliability study 15 patients were examined twice with the test within one to 24 hours in the first week after stroke. Spearman's rank correlation, Kappa coefficients, Percentage Agreement and the newer rank-invariant methods (Relative Position, Relative Concentration and Relative rank Variance) were used for the statistical analysis. Results For the intrarater reliability, Spearman's rank correlations were 0.88-0.98 and κ values were 0.70-0.93 for the individual items. Small, statistically significant, differences were found for two items regarding Relative Position and for one item regarding Relative Concentration. There was no Relative rank Variance for any single item. For the interrater reliability, Spearman's rank correlations were 0.77-0.99 for individual items. For some items there was a possible, even if not proved, reliability problem regarding Relative Position and Relative Concentration. There was no Relative rank Variance for the single items, except for a small Relative rank Variance for one item. Conclusions The high intrarater and interrater reliability shown for the modified Postural Assessment Scale for Stroke Patients, the Swedish version of Postural Assessment Scale for Stroke Patients, with traditional and newer statistical analyses, particularly for assessments performed by the same rater, support the use of the Swedish version of Postural Assessment Scale for Stroke Patients, in the acute stage after stroke both

  14. Practical Guide to Conducting an Item Response Theory Analysis

    Science.gov (United States)

    Toland, Michael D.

    2014-01-01

    Item response theory (IRT) is a psychometric technique used in the development, evaluation, improvement, and scoring of multi-item scales. This pedagogical article provides the necessary information needed to understand how to conduct, interpret, and report results from two commonly used ordered polytomous IRT models (Samejima's graded…

  15. Satisfaction with Life Scale (SLS-6): First validation study in Parkinson's disease population.

    Science.gov (United States)

    Ambrosio, Leire; Portillo, Mari Carmen; Rodriguez-Blazquez, Carmen; Martínez-Castrillo, Juan Carlos; Rodriguez-Violante, Mayela; Serrano-Dueñas, Marcos; Campos-Arillo, Víctor; Garretto, Nelida Susana; Arakaki, Tomoko; Álvarez, Mario; Pedroso-Ibáñez, Ivonne; Carvajal, Ana; Martinez-Martin, Pablo

    2016-04-01

    To explore the psychometric attributes of a new Satisfaction with Life Scale (SLS-6) in a wide Spanish-speaking population with Parkinson's disease (PD). This was an international, cross-sectional study. Several rater-based and patient-reported outcomes measures for evaluation of PD (e.g., Scales for Outcomes in Parkinson's Disease-Motor) and other constructs (e.g., Duke-UNC Functional Social Support Questionnaire, Scale for Living with Chronic Illness) were applied together with the SLS-6. Acceptability, scaling assumptions, reliability, precision, and construct validity were tested. The study included 324 patients from five countries, with age (mean ± standard deviation) 66.67 ± 10.68 years. None of the SLS-6 items had missing values and all acceptability parameters fulfilled the standard criteria. Scaling assumptions allowed the calculation of a summary index from items 2 to 6, complementary to the global evaluation (item 1). For these five items, Cronbach's alpha was 0.85; the corrected item-total correlation 0.53-0.73; inter-item correlation, 0.45-0.70, with an item homogeneity index of 0.55. The standard error of measurement, based on Cronbach's alpha for a single observation, was 3.48. SLS-6 correlations were moderate to strong (rs ≥ 0.35) with the patient-reported outcomes and weak to moderate with the rater-based assessments used in the study. The SLS-6 total score was significantly different according to PD severity levels established according to Hoehn and Yahr staging, Clinical Impression of Severity Index, and Patient-Based Global Impression of Severity scale. The results suggest that SLS-6 is an easy, feasible, acceptable, consistent, precise and valid measure to evaluate satisfaction with life in PD patients. Copyright © 2016 Elsevier Ltd. All rights reserved.
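
    The reliability statistics quoted in this abstract follow standard formulas, sketched below under stated assumptions. The function names and the toy scores are hypothetical, sample variances are used, and the standard error of measurement is computed as SD × sqrt(1 − α), which is the usual alpha-based form; the study's own computation may differ in detail.

```python
import math
from statistics import variance

def cronbach_alpha(items):
    """Cronbach's alpha from a list of per-item score lists (one score per respondent)."""
    k = len(items)
    # Total score of each respondent across all items.
    totals = [sum(scores) for scores in zip(*items)]
    item_variance_sum = sum(variance(column) for column in items)
    return k / (k - 1) * (1 - item_variance_sum / variance(totals))

def standard_error_of_measurement(sd_total, alpha):
    """SEM = SD of the total score times sqrt(1 - reliability)."""
    return sd_total * math.sqrt(1 - alpha)

# Toy data (hypothetical): three items answered by five respondents.
items = [
    [1, 2, 3, 4, 5],
    [2, 2, 3, 5, 5],
    [1, 3, 3, 4, 4],
]
alpha = cronbach_alpha(items)
sem = standard_error_of_measurement(sd_total=10.0, alpha=0.75)
```

    Higher alpha shrinks the standard error of measurement for a given spread of total scores, which is why the two figures are usually reported together.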

  16. Translated Scales in French: Social Anxiety and Taijin Kyofusho

    OpenAIRE

    Lacroix, Franca; Zhou, Biru

    2015-01-01

    This document contains 6 in-house translated scales (in French) that are related to social anxiety and Taijin Kyofusho. These translated scales are: Self Construal Scale (30 items), Brief Fear of Negative Evaluation Scale (12 items), Social Interaction Anxiety Scale (20 items), Social Anxiety - Causing Discomfort to Others (26 items), Taijin Kyofusho Scale (31 items), Modified version of Taijin Kyofusho Questionnaire (40 items).; This document contains the French translation of 6 scales of...

  17. Using item response theory to address vulnerabilities in FFQ.

    Science.gov (United States)

    Kazman, Josh B; Scott, Jonathan M; Deuster, Patricia A

    2017-09-01

    The limitations for self-reporting of dietary patterns are widely recognised as a major vulnerability of FFQ and the dietary screeners/scales derived from FFQ. Such instruments can yield inconsistent results to produce questionable interpretations. The present article discusses the value of psychometric approaches and standards in addressing these drawbacks for instruments used to estimate dietary habits and nutrient intake. We argue that a FFQ or screener that treats diet as a 'latent construct' can be optimised for both internal consistency and the value of the research results. Latent constructs, a foundation for item response theory (IRT)-based scales (e.g. Patient Reported Outcomes Measurement Information System) are typically introduced in the design stage of an instrument to elicit critical factors that cannot be observed or measured directly. We propose an iterative approach that uses such modelling to refine FFQ and similar instruments. To that end, we illustrate the benefits of psychometric modelling by using items and data from a sample of 12 370 Soldiers who completed the 2012 US Army Global Assessment Tool (GAT). We used factor analysis to build the scale incorporating five out of eleven survey items. An IRT-driven assessment of response category properties indicates likely problems in the ordering or wording of several response categories. Group comparisons, examined with differential item functioning (DIF), provided evidence of scale validity across each Army sub-population (sex, service component and officer status). Such an approach holds promise for future FFQ.

  18. Preliminary data concerning the reliability and psychometric properties of the Greek translation of the 20-item Subjective Well-Being Under Neuroleptic Treatment Scale (SWN-20)

    Directory of Open Access Journals (Sweden)

    Arapidis Konstantinos

    2009-01-01

    Abstract Background The 20-item Subjective Well-Being Under Neuroleptic Treatment Scale (SWN-20) is a self-report scale developed in order to assess the well-being of patients receiving antipsychotic medication independent of the improvement in their psychotic symptoms. The current study reports on the reliability and the psychometric properties of the Greek translation of the SWN-20. Methods A total of 100 inpatients or outpatients with schizophrenia (79 males and 21 females, aged 42.6 ± 11.35 years old) from 3 different facilities were assessed with the Positive and Negative Symptoms Scale (PANSS), the Calgary Depression Scale and the Simpson-Angus Scale, and completed the SWN-20. The statistical analysis included the calculation of Pearson product moment correlation coefficient, the Cronbach α and factor analysis with Varimax normalised rotation. Results The SWN-20 had an α value equal to 0.79 and all the items were equal. The factor analysis revealed the presence of seven factors explaining 66% of total variance. The correlation matrix revealed a moderate relationship of the SWN-20 and its factors with the PANSS-Negative (PANSS-N), PANSS-General Psychopathology (PANSS-G), the Simpson-Angus and the Calgary scales, and no relationship to age, education and income class. Discussion The Greek translation of the SWN-20 is reliable, with psychometric properties close to the original scale.

  19. Item response theory applied to factors affecting the patient journey towards hearing rehabilitation

    Directory of Open Access Journals (Sweden)

    Michelene Chenault

    2016-11-01

    To develop a tool for use in hearing screening and to evaluate the patient journey towards hearing rehabilitation, responses to three hearing aid rehabilitation questionnaire scales (aid stigma, pressure, and aid unwanted, addressing respectively hearing aid stigma, experienced pressure from others, and perceived hearing aid benefit) were evaluated with item response theory. The sample comprised 212 persons aged 55 years or more: 63 hearing aid users, and 64 persons with and 85 persons without hearing impairment according to guidelines for hearing aid reimbursement in the Netherlands. Bias was investigated relative to hearing aid use and hearing impairment within the differential test functioning framework. Items compromising model fit or demonstrating differential item functioning were dropped. The aid stigma scale was reduced from 6 to 4 items, the pressure scale from 7 to 4, and the aid unwanted scale from 5 to 4. This procedure resulted in bias-free scales ready for screening purposes and application to further understand the help-seeking process of the hearing impaired.

  20. A comment on Watson, Deary, and Austin and Watson, Roberts, Gow, and Deary : How to investigate whether personality items form a hierarchical scale?

    NARCIS (Netherlands)

    Meijer, Rob R.

    I comment on two recent papers by Watson et al. (2007, 2008), who investigated whether personality items form a hierarchical scale. I argue that the methods they used are inappropriate and discuss alternative methods presented in the literature. (C) 2009 Elsevier Ltd. All rights reserved.

  1. Validation of the Chinese version 10-item Perceived Efficacy in Patient-Physician Interactions scale in patients with osteoarthritis

    Directory of Open Access Journals (Sweden)

    Zhao HW

    2016-10-01

    Huiwen Zhao,1 Wen Luo,1 Rose C Maly,2 Jun Liu,1 Junyi Lee,1 Yaning Cui1. 1Joint Department, The 2nd Ward of Joint Surgery, Tianjin Hospital, Tianjin, the People’s Republic of China; 2Department of Family Medicine, David Geffen School of Medicine, University of California Los Angeles, Los Angeles, CA, USA. Objectives: This study aimed to assess the reliability and validity of the Chinese version of the 10-item Perceived Efficacy in Patient–Physician Interaction (PEPPI-10) scale in hospitalized patients with severe knee osteoarthritis in the People’s Republic of China. Methods: Between January and March 2015, the Chinese versions of PEPPI, self-efficacy for exercise scale, osteoporosis self-efficacy scale, and modified fall efficacy scale were applied to assess 110 severe knee osteoarthritis patients who were hospitalized in the second ward of the department of arthroplasty surgery of Tianjin Hospital. Results: The Chinese version of the PEPPI-10 scale had a high coefficient of internal consistency (Cronbach’s α coefficient, 0.907). The score of the Chinese version of PEPPI was weakly correlated with the scores of the Chinese versions of self-efficacy for exercise scale, osteoporosis self-efficacy scale, and modified fall efficacy scale. Conclusion: The Chinese version of the PEPPI-10 scale exhibits sufficient internal consistency and convergent validity in hospitalized patients with severe knee osteoarthritis in the People’s Republic of China. Keywords: assessment of osteoarthritis, patient–physician communication, self-efficacy, instrument validation

  2. Rasch Measurement Analysis of a 25-Item Version of the Mueller/McCloskey Nurse Job Satisfaction Scale in a Sample of Nurses in Lebanon and Qatar

    Directory of Open Access Journals (Sweden)

    Michael Clinton

    2015-06-01

    The Mueller/McCloskey Nurse Job Satisfaction Scale (MMSS) is widely used, but its psychometric characteristics have not been sufficiently validated for use in Middle Eastern countries. The objective of our methodological study was to determine the psychometric suitability of a 25-item version of the MMSS (MMSS-25) for use in middle-income and high-income Middle Eastern countries. A total of 1,322 registered nurses, 859 in Lebanon and 463 in Qatar, completed the MMSS-25 as part of a cross-sectional multinational investigation of nursing shortages in the region. We used the Rasch rating scale model to investigate the psychometric performance of the MMSS-25. We identified possible item bias among MMSS-25 items. We conducted confirmatory factor analyses (CFA) to compare the fit to our data of five factor structures reported in the literature. We concluded that irrespective of administration in English or Arabic, the MMSS-25 is not sufficiently productive of measurement for use in the region. A core set of 13 items (MMSS-13, Cronbach’s α = .82) loading on five dimensions eliminates redundant MMSS items and is suitable for initial screening of nurses’ satisfaction. Of the five factor structures we examined, the MMSS-13 was the only close fit to our data (comparative fit index = 0.951; Tucker–Lewis index = 0.931; root mean square error of approximation = 0.051; p value = .401). The MMSS-13 has psychometric characteristics superior to MMSS-25, but additional items are required to meet the research-specific objectives of future studies of nurses’ job satisfaction in Middle Eastern countries.

  3. Uncertainties in the Item Parameter Estimates and Robust Automated Test Assembly

    Science.gov (United States)

    Veldkamp, Bernard P.; Matteucci, Mariagiulia; de Jong, Martijn G.

    2013-01-01

    Item response theory parameters have to be estimated, and because of the estimation process, they do have uncertainty in them. In most large-scale testing programs, the parameters are stored in item banks, and automated test assembly algorithms are applied to assemble operational test forms. These algorithms treat item parameters as fixed values,…

  4. Disparity between General Symptom Relief and Remission Criteria in the Positive and Negative Syndrome Scale (PANSS): A Post-treatment Bifactor Item Response Theory Model.

    Science.gov (United States)

    Anderson, Ariana E; Reise, Steven P; Marder, Stephen R; Mansolf, Maxwell; Han, Carol; Bilder, Robert M

    2017-12-01

    Objective: Total scale scores derived by summing ratings from the 30-item PANSS are commonly used in clinical trial research to measure overall symptom severity, and percentage reductions in the total scores are sometimes used to document the efficacy of treatment. Acknowledging that some patients may have substantial changes in PANSS total scores but still be sufficiently symptomatic to warrant diagnosis, ratings on a subset of 8 items, referred to here as the "Remission set," are sometimes used to determine if patients' symptoms no longer satisfy diagnostic criteria. An unanswered question remains: is the goal of treatment better conceptualized as reduction in overall symptom severity, or reduction in symptoms below the threshold for diagnosis? We evaluated the psychometric properties of PANSS total scores, to assess whether having low symptom severity post-treatment is equivalent to attaining Remission. Design: We applied a bifactor item response theory (IRT) model to post-treatment PANSS ratings of 3,647 subjects diagnosed with schizophrenia assessed at the termination of 11 clinical trials. The bifactor model specified one general dimension to reflect overall symptom severity, and five domain-specific dimensions. We assessed how PANSS item discrimination and information parameters varied across the range of overall symptom severity (θ), with a special focus on low levels of symptoms (i.e., θ … expected PANSS item score of 1.83, a rating between "Absent" and "Minimal" for a PANSS symptom. Results: The application of the bifactor IRT model revealed: (1) 88% of total score variation was attributable to variation in general symptom severity, and only 8% reflected secondary domain factors. This implies that a general factor may provide a good indicator of symptom severity, and that interpretation is not overly complicated by multidimensionality; (2) Post-treatment, 534 individuals (about 15% of the whole sample) scored in the "Relief" range of general symptom

  5. Item validity vs. item discrimination index: a redundancy?

    Science.gov (United States)

    Panjaitan, R. L.; Irawati, R.; Sujana, A.; Hanifah, N.; Djuanda, D.

    2018-03-01

    In the literature on evaluation and test analysis, it is common to find item validity and the item discrimination index (D) calculated with a different formula for each. Other sources, however, state that the item discrimination index can be obtained by calculating the correlation between an examinee's score on a particular item and the examinee's score on the overall test, which is the same concept as item validity. Some research reports, especially undergraduate theses, tend to include both item validity and the item discrimination index in the instrument analysis. These concepts may overlap, since both reflect how well the test measures the examinees' ability. In this paper, examples of data-processing results for item validity and the item discrimination index are compared. The paper discusses whether one of the two indices can represent both, or whether it is better to present both calculations in simple test analyses, especially in undergraduate theses that include test analyses.
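
    The two quantities contrasted in this abstract can be put side by side in a short sketch. The code below is illustrative only and is not taken from the paper: it assumes the common upper-lower group definition of D (with a 27% group fraction, a conventional choice that is an assumption here) and an uncorrected item-total Pearson correlation as "item validity". The function names and the ten-examinee data set are hypothetical.

```python
from statistics import mean, pstdev


def discrimination_index(item, total, fraction=0.27):
    """Upper-lower D: proportion correct in the top group minus the bottom group.

    `fraction` is the share of examinees in each extreme group (assumed 27%).
    """
    n = len(item)
    k = max(1, round(n * fraction))
    # Rank examinees by total test score, lowest first.
    order = sorted(range(n), key=lambda i: total[i])
    lower = [item[i] for i in order[:k]]
    upper = [item[i] for i in order[-k:]]
    return mean(upper) - mean(lower)


def item_total_correlation(item, total):
    """Pearson correlation between item score and total score.

    For a 0/1 item this is the point-biserial correlation. The total
    here still includes the item itself (uncorrected).
    """
    mx, my = mean(item), mean(total)
    cov = mean((x - mx) * (y - my) for x, y in zip(item, total))
    return cov / (pstdev(item) * pstdev(total))


# Hypothetical data: one dichotomous item and total scores for 10 examinees.
item_scores = [0, 1, 0, 0, 1, 0, 1, 1, 0, 1]
total_scores = [2, 3, 4, 5, 5, 6, 7, 8, 9, 10]

D = discrimination_index(item_scores, total_scores)
r = item_total_correlation(item_scores, total_scores)
print(f"D = {D:.3f}, item-total r = {r:.3f}")
```

    In this toy data both indices are positive but differ in size, because D uses only the extreme groups while the correlation uses every examinee.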

  6. Role of optometry school in single day large scale school vision testing

    Science.gov (United States)

    Anuradha, N; Ramani, Krishnakumar

    2015-01-01

    Background: School vision testing aims at identification and management of refractive errors. Large-scale school vision testing using conventional methods is time-consuming and demands a lot of chair time from the eye care professionals. A new strategy involving a school of optometry in single day large scale school vision testing is discussed. Aim: The aim was to describe a new approach of performing vision testing of school children on a large scale in a single day. Materials and Methods: A single day vision testing strategy was implemented wherein 123 members (20 teams comprising optometry students and headed by optometrists) conducted vision testing for children in 51 schools. School vision testing included basic vision screening, refraction, frame measurements, frame choice and referrals for other ocular problems. Results: A total of 12448 children were screened, among whom 420 (3.37%) were identified to have refractive errors. 28 (1.26%) children belonged to the primary, 163 to middle (9.80%), 129 (4.67%) to secondary and 100 (1.73%) to the higher secondary levels of education respectively. 265 (2.12%) children were referred for further evaluation. Conclusion: Single day large scale school vision testing can be adopted by schools of optometry to reach a higher number of children within a short span. PMID:25709271

  7. Reliability and validity of the Italian version of the 14-item Resilience Scale

    Directory of Open Access Journals (Sweden)

    Callegari C

    2016-10-01

    Camilla Callegari,1 Lorenza Bertù,2 Melissa Lucano,1 Marta Ielmini,1 Elena Braggio,1 Simone Vender1. 1Department of Clinical and Experimental Medicine – Psychiatric Division, 2Department of Clinical and Experimental Medicine, Centre for Research EPIMED – Epidemiology and Preventive Medicine, University of Insubria, Varese, Italy. Background: In recent years resilience has gained clinical relevance in sociological, psychological, and medical disciplines, and a lot of scales measuring resilience have been developed and have been utilized in the western countries. The aim of the study was to assess the psychometric properties of the Italian version of the 14-item Resilience Scale (RS-14), by describing its validity and reliability. As agreed with the authors of the original English version of the RS-14, it was translated into Italian. Then the standard procedure for back-translation was followed. Methods: In total, 150 participants among the nursing and professional education students of the University of Insubria of Varese and health workers of the “ASST dei Sette Laghi-Ospedale di Circolo” of Varese were enrolled. The responses to the questionnaires were collected only from the students and the health workers between the ages of 18 and 65 years who gave their consent to participate in the study from April to September 2015. A subsample of 26 students and health workers was retested on the RS-14, 5 weeks after the first assessment. The questionnaires were handed out to 214 people, and 150 sets of questionnaires (70%) were returned, of which eight were subsequently removed because >60% of the answers were missing. In order to ensure anonymity, every completed questionnaire was identified only via a code. Results: No significant differences were found between the mean values of the resilience scores between women (76.1) and men (76.3), with unpaired t-test = –0.08 and P=0.93. Similarly, no difference between resilience scores were found between

  8. Single-stage micro-scale solvent extraction in parallel microbore tubes using MDIMJ

    International Nuclear Information System (INIS)

    Darekar, Mayur; Singh, K.K.; Joshi, J.M.; Mukhopadhyay, S.; Shenoy, K.T.

    2016-01-01

    Single-stage micro-scale solvent extraction of U(VI) from simulated lean streams is explored using a micro-scale contactor comprising an MDIMJ (Monoblock Distributor with Integrated Microfluidic Junction) and PTFE microbore tubes. 30% (v/v) TBP in dodecane has been used as the extracting phase. The objective of the study is to demonstrate the numbering-up approach for scale-up of micro-scale extraction using an indigenously conceptualized and fabricated MDIMJ. First, the performance of the MDIMJ for equal flow distribution is tested. Then the effects of inlet flow rate and O/A ratio on stage efficiency and percentage extraction are studied. The experiments show that it is easy to scale up single-stage micro-scale solvent extraction by using the MDIMJ for the numbering-up approach. The maximum capacity tested is 4.8 LPH. With O/A = 2/1, more than 90% extraction is achieved in a very short contact time of less than 3 s. The study thus demonstrates the possibility of process intensification and easy scale-up of micro-scale solvent extraction.

  9. The therapeutic factor inventory-8: Using item response theory to create a brief scale for continuous process monitoring for group psychotherapy.

    Science.gov (United States)

    Tasca, Giorgio A; Cabrera, Christine; Kristjansson, Elizabeth; MacNair-Semands, Rebecca; Joyce, Anthony S; Ogrodniczuk, John S

    2016-01-01

    We tested a very brief version of the 23-item Therapeutic Factors Inventory-Short Form (TFI-S), and describe the use of Item Response Theory (IRT) for the purpose of developing short and reliable scales for group psychotherapy. Group therapy patients (N = 578) completed the TFI-S on one occasion, and their data were used for the IRT analysis. Of those, 304 completed the TFI-S and other measures on more than one occasion to assess sensitivity to change, concurrent, and predictive validity of the brief version. Results suggest that the new TFI-8 is a brief, reliable, and valid measure of a higher-order group therapeutic factor. The TFI-8 may be used for continuous process measurement and feedback to improve the functioning of therapy groups.

  10. A note on monotonicity of item response functions for ordered polytomous item response theory models.

    Science.gov (United States)

    Kang, Hyeon-Ah; Su, Ya-Hui; Chang, Hua-Hua

    2018-03-08

    A monotone relationship between a true score (τ) and a latent trait level (θ) has been a key assumption for many psychometric applications. The monotonicity property in dichotomous response models is evident as a result of a transformation via a test characteristic curve. Monotonicity in polytomous models, in contrast, is not immediately obvious because item response functions are determined by a set of response category curves, which are conceivably non-monotonic in θ. The purpose of the present note is to demonstrate strict monotonicity in ordered polytomous item response models. Five models that are widely used in operational assessments are considered for proof: the generalized partial credit model (Muraki, 1992, Applied Psychological Measurement, 16, 159), the nominal model (Bock, 1972, Psychometrika, 37, 29), the partial credit model (Masters, 1982, Psychometrika, 47, 147), the rating scale model (Andrich, 1978, Psychometrika, 43, 561), and the graded response model (Samejima, 1972, A general model for free-response data (Psychometric Monograph no. 18). Psychometric Society, Richmond). The study asserts that the item response functions in these models strictly increase in θ and thus there exists strict monotonicity between τ and θ under certain specified conditions. This conclusion validates the practice of customarily using τ in place of θ in applied settings and provides theoretical grounds for one-to-one transformations between the two scales. © 2018 The British Psychological Society.
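
    The monotonicity claim can be illustrated numerically for one of the listed models. The sketch below is a spot check, not the proof given in the note: it assumes Samejima's graded response model with positive discriminations and ordered thresholds, and it takes the true score τ to be the sum of expected item scores. The item parameters and function names are hypothetical.

```python
import math


def grm_expected_score(theta, a, thresholds):
    """Expected item score under the graded response model.

    Categories are scored 0..m, with m = len(thresholds). The cumulative
    curve P*(k) = 1 / (1 + exp(-a * (theta - b_k))) is the probability of
    responding in category k or higher.
    """
    cum = [1.0 / (1.0 + math.exp(-a * (theta - b))) for b in thresholds]
    bounds = [1.0] + cum + [0.0]
    # Category probability = difference of adjacent cumulative curves.
    probs = [bounds[k] - bounds[k + 1] for k in range(len(thresholds) + 1)]
    return sum(k * p for k, p in enumerate(probs))


def true_score(theta, items):
    """True score tau(theta): sum of expected item scores over the test."""
    return sum(grm_expected_score(theta, a, bs) for a, bs in items)


# Hypothetical three-item test: (discrimination, ordered thresholds).
items = [
    (1.2, [-1.0, 0.0, 1.5]),
    (0.8, [-0.5, 0.7]),
    (2.0, [-2.0, -0.3, 0.4, 1.1]),
]

grid = [x / 10 for x in range(-40, 41)]  # theta from -4.0 to 4.0
taus = [true_score(t, items) for t in grid]
is_strictly_increasing = all(t1 < t2 for t1, t2 in zip(taus, taus[1:]))
print("tau strictly increasing on grid:", is_strictly_increasing)
```

    Under these assumptions the expected item score equals the sum of the cumulative curves, each of which increases in θ when a > 0, which is why the check passes on the grid.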

  11. Development of abbreviated eight-item form of the Penn Verbal Reasoning Test.

    Science.gov (United States)

    Bilker, Warren B; Wierzbicki, Michael R; Brensinger, Colleen M; Gur, Raquel E; Gur, Ruben C

    2014-12-01

    The ability to reason with language is a highly valued cognitive capacity that correlates with IQ measures and is sensitive to damage in language areas. The Penn Verbal Reasoning Test (PVRT) is a 29-item computerized test for measuring abstract analogical reasoning abilities using language. The full test can take over half an hour to administer, which limits its applicability in large-scale studies. We previously described a procedure for abbreviating a clinical rating scale and a modified procedure for reducing tests with a large number of items. Here we describe the application of the modified method to reducing the number of items in the PVRT to a parsimonious subset of items that accurately predicts the total score. As in our previous reduction studies, a split sample is used for model fitting and validation, with cross-validation to verify results. We find that an 8-item scale predicts the total 29-item score well, achieving a correlation of .9145 for the reduced form for the model fitting sample and .8952 for the validation sample. The results indicate that a drastically abbreviated version, which cuts administration time by more than 70%, can be safely administered as a predictor of PVRT performance. © The Author(s) 2014.

  12. Development of Abbreviated Eight-Item Form of the Penn Verbal Reasoning Test

    Science.gov (United States)

    Bilker, Warren B.; Wierzbicki, Michael R.; Brensinger, Colleen M.; Gur, Raquel E.; Gur, Ruben C.

    2014-01-01

    The ability to reason with language is a highly valued cognitive capacity that correlates with IQ measures and is sensitive to damage in language areas. The Penn Verbal Reasoning Test (PVRT) is a 29-item computerized test for measuring abstract analogical reasoning abilities using language. The full test can take over half an hour to administer, which limits its applicability in large-scale studies. We previously described a procedure for abbreviating a clinical rating scale and a modified procedure for reducing tests with a large number of items. Here we describe the application of the modified method to reducing the number of items in the PVRT to a parsimonious subset of items that accurately predicts the total score. As in our previous reduction studies, a split sample is used for model fitting and validation, with cross-validation to verify results. We find that an 8-item scale predicts the total 29-item score well, achieving a correlation of .9145 for the reduced form for the model fitting sample and .8952 for the validation sample. The results indicate that a drastically abbreviated version, which cuts administration time by more than 70%, can be safely administered as a predictor of PVRT performance. PMID:24577310

  13. Rating scales in general practice depression

    DEFF Research Database (Denmark)

    Bech, Per; Paykel, Eugene; Sireling, Lester

    2015-01-01

    BACKGROUND: Our objective was to investigate to what extent the Clinical Interview for Depression (CID) used in the general practice setting covers clinically valid subscales (depression, anxiety, and apathy) which can measure outcome of antidepressant therapy as well as identifying subsyndromes...... within major depressive disorder. The CID was compared to the Hamilton Depression Rating Scale (HAM-D17). METHODS: 146 patients from a previous study in general practice with the CID were investigated. The item response theory model established by Rasch was used to investigate the scalability (a scale...... (approximately 20%) had an atypical depression. LIMITATIONS: The samples were derived from a single study and were all rated by a single rater. CONCLUSION: The CID contains subscales of depression, anxiety, and apathy with an acceptable scalability for use in general practice. A subsyndrome of atypical...

  14. Psychometric aspects of item mapping for criterion-referenced interpretation and bookmark standard setting.

    Science.gov (United States)

    Huynh, Huynh

    2010-01-01

    Locating an item on an achievement continuum (item mapping) is well-established in technical work for educational/psychological assessment. Applications of item mapping may be found in criterion-referenced (CR) testing (or scale anchoring, Beaton and Allen, 1992; Huynh, 1994, 1998a, 2000a, 2000b, 2006), computer-assisted testing, test form assembly, and in standard setting methods based on ordered test booklets. These methods include the bookmark standard setting originally used for the CTB/TerraNova tests (Lewis, Mitzel, Green, and Patz, 1999), the item descriptor process (Ferrara, Perie, and Johnson, 2002) and a similar process described by Wang (2003) for multiple-choice licensure and certification examinations. While item response theory (IRT) models such as the Rasch and two-parameter logistic (2PL) models traditionally place a binary item at its location, Huynh has argued in the cited papers that such mapping may not be appropriate in selecting items for CR interpretation and scale anchoring.
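
    For readers unfamiliar with item mapping, the following sketch shows one conventional way a binary item is located on the θ scale under the 2PL model: solve P(θ) = RP for a chosen response probability RP. It is a generic illustration, not Huynh's proposed procedure. The logistic form without a scaling constant, the RP values, and the function names are all assumptions.

```python
import math


def p_correct_2pl(theta, a, b):
    """2PL probability of a correct response (no 1.7 scaling constant assumed)."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


def mapped_location(a, b, rp=0.5):
    """Theta at which the probability of success equals `rp`.

    Solving 1 / (1 + exp(-a * (theta - b))) = rp for theta gives
    theta = b + ln(rp / (1 - rp)) / a.
    """
    return b + math.log(rp / (1.0 - rp)) / a


# Hypothetical item: discrimination a = 1.4, difficulty b = 0.3.
a, b = 1.4, 0.3
loc_half = mapped_location(a, b, rp=0.5)        # equals b itself
loc_two_thirds = mapped_location(a, b, rp=2 / 3)  # shifted above b
print(loc_half, loc_two_thirds)
```

    With RP = 0.5 the item maps to its difficulty b; a higher RP moves the mapped location upward by an amount that shrinks as the discrimination a grows.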

  15. Single and two-phase similarity analysis of a reduced-scale natural convection loop relative to a full-scale prototype

    International Nuclear Information System (INIS)

    Botelho, David A.; Faccini, Jose L.H.

    2002-01-01

    The main topic of this paper is a new device being considered to improve nuclear reactor safety by employing natural circulation. A scaled experiment used to demonstrate the performance of the device is also described. We also applied a similarity analysis method for single and two-phase natural convection loop flow to the IEN CCN experiment and to an APEX-like experiment to verify the degree of similarity relative to a full-scale prototype like the AP600. Most of the CCN similarity numbers that represent important single and two-phase similarity conditions are comparable to the APEX-like loop non-dimensional numbers calculated employing the same methodology. Despite the much smaller geometric, pressure, and power scales, we conclude that the IEN CCN has single and two-phase natural circulation similarity numbers that represent the full-scale prototype fairly well. Even lacking most complementary primary and safety systems, this IEN circuit provided valuable experience for developing human, experimental, and analytical resources, besides its utilization as a training tool. (author)

  16. Nanoscale heterostructures with molecular-scale single-crystal metal wires.

    Science.gov (United States)

    Kundu, Paromita; Halder, Aditi; Viswanath, B; Kundu, Dipan; Ramanath, Ganpati; Ravishankar, N

    2010-01-13

    Creating nanoscale heterostructures with molecular-scale ( … synthesis of nanoscale heterostructures with single-crystal molecular-scale Au nanowires attached to different nanostructure substrates. Our method involves the formation of Au nanoparticle seeds by the reduction of rocksalt AuCl nanocubes heterogeneously nucleated on the substrates and subsequent nanowire growth by oriented attachment of Au nanoparticles from the solution phase. Nanoscale heterostructures fabricated by such site-specific nucleation and growth are attractive for many applications including nanoelectronic device wiring, catalysis, and sensing.

  17. Development of six PROMIS pediatrics proxy-report item banks

    Directory of Open Access Journals (Sweden)

    Irwin Debra E

    2012-02-01

    Background: Pediatric self-report should be considered the standard for measuring patient reported outcomes (PRO) among children. However, circumstances exist when the child is too young, cognitively impaired, or too ill to complete a PRO instrument and a proxy-report is needed. This paper describes the development process including the proxy cognitive interviews and large-field-test survey methods and sample characteristics employed to produce item parameters for the Patient Reported Outcomes Measurement Information System (PROMIS) pediatric proxy-report item banks. Methods: The PROMIS pediatric self-report items were converted into proxy-report items before undergoing cognitive interviews. These items covered six domains (physical function, emotional distress, social peer relationships, fatigue, pain interference, and asthma impact). Caregivers (n = 25) of children between the ages of 5 and 17 years provided qualitative feedback on proxy-report items to assess any major issues with these items. From May 2008 to March 2009, the large-scale survey enrolled children ages 8-17 years to complete the self-report version and caregivers to complete the proxy-report version of the survey (n = 1548 dyads). Caregivers of children ages 5 to 7 years completed the proxy report survey (n = 432). In addition, caregivers completed other proxy instruments, PedsQL™ 4.0 Generic Core Scales Parent Proxy-Report version, PedsQL™ Asthma Module Parent Proxy-Report version, and KIDSCREEN Parent-Proxy-52. Results: Item content was well understood by proxies and did not require item revisions but some proxies clearly noted that determining an answer on behalf of their child was difficult for some items. Dyads and caregivers of children ages 5-17 years old were enrolled in the large-scale testing. The majority were female (85%), married (70%), Caucasian (64%) and had at least a high school education (94%). Approximately 50% had children with a chronic health condition, primarily

  18. When Is a New Scale not a New Scale? The Case of the Bergen Shopping Addiction Scale and the Compulsive Online Shopping Scale.

    Science.gov (United States)

    Griffiths, Mark D; Andreassen, Cecilie S; Pallesen, Ståle; Bilder, Robert M; Torsheim, Torbjørn; Aboujaoude, Elias

    2016-01-01

    Manchiraju et al. (International Journal of Mental Health and Addiction, 1-15, 2016) published the Compulsive Online Shopping Scale (COSS) in the International Journal of Mental Health and Addiction (IJMHA). To develop their measure of compulsive online shopping, Manchiraju and colleagues adapted items from the seven-item Bergen Shopping Addiction Scale (BSAS) and its original 28-item pool. Manchiraju et al. did not add or remove any of the original seven items, and did not substantially change the content of any of the 28 items on which the BSAS was based. They simply added the word "online" to each existing item. Given that the BSAS was specifically developed to take into account the different ways in which people now shop and to include both online and offline shopping, there does not seem to be a good rationale for developing an online version of the BSAS. It is argued that the COSS is not really an adaptation of the BSAS but an almost identical instrument based on the original 28-item pool.

  19. Determination of Validity and Reliability of the Mobbing Scale for Indoor Sports Referees

    Directory of Open Access Journals (Sweden)

    Serkan HACICAFEROĞLU

    2014-07-01

    The purpose of this study is to conduct validity and reliability analyses of the scale prepared for determining the mobbing of indoor sports referees who take active duties in different classes (basketball, handball and volleyball). In order to develop the scale, a trial scale of 31 Likert-type items was prepared and applied to referees; factor analysis, which converts many variables into a smaller number of meaningful and independent factors, was then used. According to the data obtained from the factor analysis, the scale has a structure of 14 items and a single component. It was determined that the total variance of the scale items was 43.721 and the factor loadings were between 0.48 and 0.76. The Cronbach alpha internal consistency coefficient was computed as 0.82. Based on the values obtained from the scale, it could be said that the Mobbing Scale for Indoor Sports Referees is valid and reliable.
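
    Several records in this listing report a Cronbach alpha coefficient. As a reminder of what that number is, here is a minimal sketch using the standard formula α = k/(k−1) · (1 − Σ item variances / variance of total scores). It is not the authors' analysis: the response matrix is hypothetical, and the use of sample variances is an assumption.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items score matrix.

    `responses` is a list of rows, one row of item scores per respondent.
    Sample variances (n - 1 denominator) are used throughout.
    """
    k = len(responses[0])  # number of items
    item_columns = list(zip(*responses))
    sum_item_var = sum(variance(col) for col in item_columns)
    total_var = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum_item_var / total_var)


# Hypothetical Likert-type data: 5 respondents, 3 items.
scores = [
    [2, 3, 3],
    [4, 4, 5],
    [1, 2, 2],
    [5, 5, 4],
    [3, 3, 4],
]

alpha = cronbach_alpha(scores)
print(f"alpha = {alpha:.3f}")
```

    Alpha rises when items covary strongly relative to their individual variances, so a high value indicates internal consistency rather than unidimensionality, which is why the abstract also reports a factor analysis.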

  20. Examining the Impact of Unscorable Item Responses on the Validity and Interpretability of MMPI-2/MMPI-2-RF Restructured Clinical (RC) Scale Scores

    Science.gov (United States)

    Dragon, Wendy R.; Ben-Porath, Yossef S.; Handel, Richard W.

    2012-01-01

    This article examined the impact of unscorable item responses on the psychometric validity and practical interpretability of scores on the Restructured Clinical (RC) Scales of the Minnesota Multiphasic Personality Inventory-2/Minnesota Multiphasic Personality Inventory-2-Restructured Form (MMPI-2/MMPI-2-RF). In analyses conducted with five…

  1. Developing an item bank to measure the coping strategies of people with hereditary retinal diseases.

    Science.gov (United States)

    Prem Senthil, Mallika; Khadka, Jyoti; De Roach, John; Lamey, Tina; McLaren, Terri; Campbell, Isabella; Fenwick, Eva K; Lamoureux, Ecosse L; Pesudovs, Konrad

    2018-05-05

    Our understanding of the coping strategies used by people with visual impairment to manage stress related to visual loss is limited. This study aims to develop a sophisticated coping instrument in the form of an item bank implemented via Computerised adaptive testing (CAT) for hereditary retinal diseases. Items on coping were extracted from qualitative interviews with patients which were supplemented by items from a literature review. A systematic multi-stage process of item refinement was carried out followed by expert panel discussion and cognitive interviews. The final coping item bank had 30 items. Rasch analysis was used to assess the psychometric properties. A CAT simulation was carried out to estimate an average number of items required to gain precise measurement of hereditary retinal disease-related coping. One hundred eighty-nine participants answered the coping item bank (median age = 58 years). The coping scale demonstrated good precision and targeting. The standardised residual loadings for items revealed six items grouped together. Removal of the six items reduced the precision of the main coping scale and worsened the variance explained by the measure. Therefore, the six items were retained within the main scale. Our CAT simulation indicated that, on average, less than 10 items are required to gain a precise measurement of coping. This is the first study to develop a psychometrically robust coping instrument for hereditary retinal diseases. CAT simulation indicated that on an average, only four and nine items were required to gain measurement at moderate and high precision, respectively.

  2. 12-Item Pruritus Severity Scale: Development and Validation of New Itch Severity Questionnaire

    Directory of Open Access Journals (Sweden)

    Adam Reich

    2017-01-01

    Introduction. A validated assessment of pruritus intensity is an important but still difficult clinical problem due to the subjective nature of this sensation. Objective. The aim of this study was the creation and validation of a new itch severity questionnaire assessing pruritus intensity. Material and Methods. A total of 148 patients with pruritic dermatoses were asked to assess pruritus intensity using the 12-Item Pruritus Severity Score (12-PSS) and the Visual Analogue Scale (VAS). Patients were also asked to complete the Dermatology Life Quality Index (DLQI) and the Hospital Anxiety and Depression Scale (HADS). Test-retest comparison of the 12-PSS was conducted in 102 subjects who completed the itch questionnaire twice with a 3- to 5-day interval. Results. We have created the 12-PSS assessing pruritus intensity (two questions), pruritus extent (one question) and duration (one question), influence of pruritus on concentration and patient psyche (four questions), and scratching as a response to pruritus stimuli (four questions). The maximum score was 22 points. The results showed strong consistency (Cronbach α coefficient 0.81). A significant correlation was observed with VAS (r=0.58, p<0.001) and quality of life level according to DLQI (r=0.53, p<0.001). Test-retest comparison in 102 subjects revealed a satisfactory reproducibility of achieved results (ICC = 0.72). Conclusions. The newly developed pruritus severity questionnaire may be used in daily clinical practice in the future.

  3. Cross-cultural adaptation and validation of the 12-item Multiple Sclerosis Walking Scale (MSWS-12) for the Brazilian population

    Directory of Open Access Journals (Sweden)

    Bruna E. M. Marangoni

    2012-12-01

    Gait impairment is reported by 85% of patients with multiple sclerosis (MS) as their main complaint. In 2003, Hobart et al. developed a scale for walking known as the 12-item Multiple Sclerosis Walking Scale (MSWS-12), which combines the perspectives of patients with psychometric methods. OBJECTIVE: This study aimed to cross-culturally adapt and validate the MSWS-12 for the Brazilian population with MS. METHODS: This study included 116 individuals diagnosed with MS, in accordance with McDonald's criteria. The steps of the adaptation process included translation, back-translation, review by an expert committee and pretesting. A test and retest of the MSWS-12/BR was performed for validation, with comparison against another scale (MSIS-29/BR) and another test (T25FW). RESULTS: The Brazilian version, MSWS-12/BR, was shown to be similar to the original. The results indicate that the MSWS-12/BR is a reliable and reproducible scale. CONCLUSIONS: The MSWS-12/BR has been adapted and validated, and it is a reliable tool for the Brazilian population.

  4. Using Procedure Based on Item Response Theory to Evaluate Classification Consistency Indices in the Practice of Large-Scale Assessment

    Directory of Open Access Journals (Sweden)

    Shanshan Zhang

    2017-09-01

    In spite of the growing interest in methods for evaluating classification consistency (CC) indices, few studies have applied these methods in the practice of large-scale educational assessment. In addition, few studies have considered the influence of practical factors, such as the examinee ability distribution, the cut score location, and the score scale, on the performance of CC indices. Using the newly developed Lee's procedure based on item response theory (IRT), the main purpose of this study is to investigate the performance of CC indices when practical factors are taken into consideration. A simulation study and an empirical study were conducted under comprehensive conditions. Results suggested that with a negatively skewed distribution, the CC indices were larger than with other distributions. Interactions occurred among ability distribution, cut score location, and score scale. Consequently, Lee's IRT procedure is reliable for use in large-scale educational assessment, but the reported indices should be treated with caution, as testing conditions may vary considerably.

  5. Cross-cultural measurement invariance in the satisfaction with food-related life scale in older adults from two developing countries.

    Science.gov (United States)

    Schnettler, Berta; Miranda-Zapata, Edgardo; Lobos, Germán; Lapo, María; Grunert, Klaus G; Adasme-Berríos, Cristian; Hueche, Clementina

    2017-05-30

    Nutrition is one of the major determinants of successful aging. The Satisfaction with Food-related Life (SWFL) scale measures a person's overall assessment regarding their food and eating habits. The SWFL scale has been used in older adult samples across different countries in Europe, Asia and America, however, there are no studies that have evaluated the cross-cultural measurement invariance of the scale in older adult samples. Therefore, we evaluated the measurement invariance of the SWFL scale across older adults from Chile and Ecuador. Stratified random sampling was used to recruit a sample of older adults of both genders from Chile (mean age = 71.38, SD = 6.48, range = 60-92) and from Ecuador (mean age = 73.70, SD = 7.45, range = 60-101). Participants reported their levels of satisfaction with food-related life by completing the SWFL scale, which consists of five items grouped into a single dimension. Confirmatory factor analysis (CFA) was used to examine cross-cultural measurement invariance of the SWFL scale. Results showed that the SWFL scale exhibited partial measurement invariance, with invariance of all factor loadings, invariance in all but one item's threshold (item 1) and invariance in all items' uniqueness (residuals), which leads us to conclude that there is a reasonable level of partial measurement invariance for the CFA model of the SWFL scale, when comparing the Chilean and Ecuadorian older adult samples. The lack of invariance in item 1 confirms previous studies with adults and emerging adults in Chile that suggest this item is culture-sensitive. We recommend revising the wording of the first item of the SWFL in order to relate the statement with the person's life. The SWFL scale shows partial measurement invariance across older adults from Chile and Ecuador. A 4-item version of the scale (excluding item 1) provides the basis for international comparisons of satisfaction with food-related life in older adults from developing

  6. Validating the 11-Item Revised University of California Los Angeles Scale to Assess Loneliness Among Older Adults: An Evaluation of Factor Structure and Other Measurement Properties.

    Science.gov (United States)

    Lee, Joonyup; Cagle, John G

    2017-11-01

    To examine the measurement properties and factor structure of the short version of the Revised University of California Los Angeles (R-UCLA) loneliness scale from the Health and Retirement Study (HRS). Based on data from 3,706 HRS participants aged 65 + who completed the 2012 wave of the HRS and its Psychosocial Supplement, the measurement properties and factorability of the R-UCLA were examined by conducting an exploratory factor analysis (EFA) and the confirmatory factor analysis (CFA) on randomly split halves. The average score for the 11-item loneliness scale was 16.4 (standard deviation: 4.5). An evaluation of the internal consistency produced a Cronbach's α of 0.87. Results from the EFA showed that two- and three-factor models were appropriate. However, based on the results of the CFA, only a two-factor model was determined to be suitable because there was a very high correlation between two factors identified in the three-factor model, available social connections and sense of belonging. This study provides important data on the properties of the 11-item R-UCLA scale by identifying a two-factor model of loneliness: feeling isolated and available social connections. Our findings suggest the 11-item R-UCLA has good factorability and internal reliability. Copyright © 2017 American Association for Geriatric Psychiatry. Published by Elsevier Inc. All rights reserved.

  7. The Reverse of Social Anxiety Is Not Always the Opposite: The Reverse-Scored Items of the Social Interaction Anxiety Scale Do Not Belong

    Science.gov (United States)

    Rodebaugh, Thomas L.; Woods, Carol M.; Heimberg, Richard G.

    2007-01-01

    Although well-used and empirically supported, the Social Interaction Anxiety Scale (SIAS) has a questionable factor structure and includes reverse-scored items with questionable utility. Here, using samples of undergraduates and a sample of clients with social anxiety disorder, we extend previous work that opened the question of whether the…

  8. [Instrument to measure adherence in hypertensive patients: contribution of Item Response Theory].

    Science.gov (United States)

    Rodrigues, Malvina Thaís Pacheco; Moreira, Thereza Maria Magalhaes; Vasconcelos, Alexandre Meira de; Andrade, Dalton Francisco de; Silva, Daniele Braz da; Barbetta, Pedro Alberto

    2013-06-01

    To analyze, by means of "Item Response Theory", an instrument to measure adherence to treatment for hypertension. Analytical study with 406 hypertensive patients with associated complications seen in primary care in Fortaleza, CE, Northeastern Brazil, in 2011, using "Item Response Theory". The stages were: dimensionality test, calibrating the items, processing data and creating a scale, analyzed using the graded response model. A study of the dimensionality of the instrument was conducted by analyzing the polychoric correlation matrix and by full-information factor analysis. Multilog software was used to calibrate items and estimate the scores. Items relating to drug therapy are the most directly related to adherence, while those relating to drug-free therapy need to be reworked because they have less psychometric information and low discrimination. The independence of items, the small number of levels in the scale and the low explained variance in the adjustment of the models are the main weaknesses of the instrument analyzed. "Item Response Theory" proved to be a relevant analysis technique because it evaluated respondents for adherence to treatment for hypertension, the level of difficulty of the items and their ability to discriminate between individuals with different levels of adherence, which generates a greater amount of information. According to the "Item Response Theory" analysis of its items, the instrument is limited in measuring adherence to hypertension treatment and needs adjustment. The proper formulation of the items is important in order to accurately measure the desired latent trait.

  9. Measuring pregnancy planning: A psychometric evaluation and comparison of two scales.

    Science.gov (United States)

    Drevin, Jennifer; Kristiansson, Per; Stern, Jenny; Rosenblad, Andreas

    2017-11-01

    To psychometrically test the London Measure of Unplanned Pregnancy and compare it with the Swedish Pregnancy Planning Scale. The incidence of unplanned pregnancies is an important indicator of reproductive health. The London Measure of Unplanned Pregnancy measures pregnancy planning by taking contraceptive use, timing, intention to become pregnant, desire for pregnancy, partner agreement, and pre-conceptual preparations into account. It has, however, previously not been psychometrically evaluated using confirmatory factor analysis. The Likert-scored single-item Swedish Pregnancy Planning Scale has been developed to measure the woman's own view of pregnancy planning level. Cross-sectional design. In 2012-2013, 5493 pregnant women living in Sweden were invited to participate in the Swedish Pregnancy Planning study, of whom 3327 (61%) agreed to participate and answered a questionnaire. A test-retest pilot study was conducted in 2011-2012. Thirty-two participants responded to the questionnaire on two occasions 14 days apart. Data were analysed using confirmatory factor analysis, Cohen's weighted kappa and Spearman's correlation. All items of the London Measure of Unplanned Pregnancy contributed to measuring pregnancy planning, but four items had low item-reliability. The London Measure of Unplanned Pregnancy and Swedish Pregnancy Planning Scale corresponded reasonably well with each other and both showed good test-retest reliability. The London Measure of Unplanned Pregnancy may benefit from item reduction and its usefulness may be questioned. The Swedish Pregnancy Planning Scale is time-efficient and shows acceptable reliability and construct validity, which makes it more useful for measuring pregnancy planning. © 2017 John Wiley & Sons Ltd.

  10. Discrepancies in Cornell Scale for Depression in Dementia (CSDD) items between residents and caregivers, and the CSDD's factor structure

    Directory of Open Access Journals (Sweden)

    Wongpakaran N

    2013-06-01

    Nahathai Wongpakaran (1), Tinakon Wongpakaran (1), Robert van Reekum (2, 3). (1) Department of Psychiatry, Chiang Mai University, Chiang Mai, Thailand; (2) Department of Psychiatry, (3) Institute of Medical Sciences, University of Toronto, Toronto, ON, Canada. Purpose: This validation study aims to examine Cornell Scale for Depression in Dementia (CSDD) items in terms of the agreement found between residents and caregivers, and also to compare alternative models of the Thai version of the CSDD. Patients and methods: A cross-sectional study was conducted of 84 elderly residents (46 women, 38 men, age range 60–94 years) in a long-term residential home setting in Thailand between March and June 2011. The selected residents went through a comprehensive geriatric assessment that included use of the Mini-Mental State Examination, Mini-International Neuropsychiatric Interview, and CSDD instruments. Intraclass correlation (ICC) was calculated in order to establish the level of agreement between the residents and caregivers, in light of the residents' cognitive status. Confirmatory factor analysis (CFA) was adopted to evaluate the alternative CSDD models. Results: The CSDD yielded a high internal consistency (Cronbach's alpha = 0.87) and moderate agreement between residents and caregivers (ICC = 0.55); however, agreement was stronger in cognitively impaired subjects (ICC = 0.71). CFA revealed that there was no difference between the four-factor model, in which factors A (mood-related signs) and E (ideational disturbance) were collapsed into a single factor, and the five-factor model as per the original theoretical construct. Both models were found to be similar, and displayed a poor fit. Conclusion: The CSDD demonstrated a moderate level of interrater agreement between residents and caregivers, and was more reliable when used with cognitively impaired residents. CFA indicated a poorly fitting model in this sample. Keywords: Cornell Scale for Depression in Dementia (CSDD), factor structure

  11. Three controversies over item disclosure in medical licensure examinations

    Directory of Open Access Journals (Sweden)

    Yoon Soo Park

    2015-09-01

    In response to views on the public's right to know, there is growing attention to item disclosure (the release of items, answer keys, and performance data to the public) in medical licensure examinations and its potential impact on the test's ability to measure competence and select qualified candidates. Recent debates on this issue have sparked legislative action internationally, including in South Korea, with prior discussions among North American countries dating back over three decades. The purpose of this study is to identify and analyze three issues associated with item disclosure in medical licensure examinations, namely (1) fairness and validity, (2) impact on passing levels, and (3) utility of item disclosure, by synthesizing existing literature in relation to standards in testing. Historically, the controversy over item disclosure has centered on fairness and validity. Proponents of item disclosure stress test takers' right to know, while opponents argue from a validity perspective. Item disclosure may bias item characteristics, such as difficulty and discrimination, and has consequences for setting passing levels. To date, there has been limited research on the utility of item disclosure for large-scale testing. These issues require ongoing and careful consideration.

  12. Identifying predictors of physics item difficulty: A linear regression approach

    Science.gov (United States)

    Mesic, Vanes; Muratovic, Hasnija

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinarily difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal physics knowledge.
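
    The "variance explained" figure in a regression of item difficulty on item features can be sketched as follows. This is a minimal illustration using ordinary least squares, assuming numeric predictor codes per item; the function name `fit_r_squared`, the predictor columns and all numbers are hypothetical and do not reproduce the study's variables or results.

```python
import numpy as np

def fit_r_squared(X, y):
    """Ordinary least squares with an intercept; returns (coefficients, R^2)."""
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    design = np.column_stack([np.ones(len(y)), X])   # prepend intercept column
    coef, *_ = np.linalg.lstsq(design, y, rcond=None)
    residuals = y - design @ coef
    ss_res = float(residuals @ residuals)            # unexplained variation
    ss_tot = float(((y - y.mean()) ** 2).sum())      # total variation
    return coef, 1.0 - ss_res / ss_tot

# Hypothetical item features: (complexity rating, requires formal knowledge 0/1).
X_toy = [[1, 0], [2, 1], [3, 0], [4, 1], [5, 0], [6, 1]]
# Hypothetical Rasch difficulties in logits; illustration only.
y_toy = [-1.0, -0.2, -0.5, 0.6, 0.1, 1.2]

coef, r2 = fit_r_squared(X_toy, y_toy)
print(coef, r2)
```

    An R² of 0.612 in this setup would correspond to the statement that 61.2% of the item difficulty variance is explained by the predictors.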

  13. Identifying predictors of physics item difficulty: A linear regression approach

    Directory of Open Access Journals (Sweden)

    Hasnija Muratovic

    2011-06-01

    Large-scale assessments of student achievement in physics are often approached with an intention to discriminate students based on the attained level of their physics competencies. Therefore, for purposes of test design, it is important that items display an acceptable discriminatory behavior. To that end, it is recommended to avoid extraordinarily difficult and very easy items. Knowing the factors that influence physics item difficulty makes it possible to model the item difficulty even before the first pilot study is conducted. Thus, by identifying predictors of physics item difficulty, we can improve the test-design process. Furthermore, we get additional qualitative feedback regarding the basic aspects of student cognitive achievement in physics that are directly responsible for the obtained, quantitative test results. In this study, we conducted a secondary analysis of data that came from two large-scale assessments of student physics achievement at the end of compulsory education in Bosnia and Herzegovina. Foremost, we explored the concept of “physics competence” and performed a content analysis of 123 physics items that were included within the above-mentioned assessments. Thereafter, an item database was created. Items were described by variables which reflect some basic cognitive aspects of physics competence. For each of the assessments, Rasch item difficulties were calculated in separate analyses. In order to make the item difficulties from different assessments comparable, a virtual test equating procedure had to be implemented. Finally, a regression model of physics item difficulty was created. It has been shown that 61.2% of item difficulty variance can be explained by factors which reflect the automaticity, complexity, and modality of the knowledge structure that is relevant for generating the most probable correct solution, as well as by the divergence of required thinking and interference effects between intuitive and formal

  14. Scaling of ion implanted Si:P single electron devices

    International Nuclear Information System (INIS)

    Escott, C C; Hudson, F E; Chan, V C; Petersson, K D; Clark, R G; Dzurak, A S

    2007-01-01

    We present a modelling study on the scaling prospects for phosphorus in silicon (Si:P) single electron devices using readily available commercial and free-to-use software. The devices comprise phosphorus ion implanted, metallically doped (n+) dots (size range 50-500 nm) with source and drain reservoirs. Modelling results are compared to measurements on fabricated devices and discussed in the context of scaling down to few-electron structures. Given current fabrication constraints, we find that devices with 70-75 donors per dot should be realizable. We comment on methods for further reducing this number.

  15. Scaling of ion implanted Si:P single electron devices

    Energy Technology Data Exchange (ETDEWEB)

    Escott, C C [Centre for Quantum Computer Technology, School of Electrical Engineering and Telecommunications, UNSW, Sydney, NSW 2052 (Australia); Hudson, F E [Centre for Quantum Computer Technology, School of Electrical Engineering and Telecommunications, UNSW, Sydney, NSW 2052 (Australia); Chan, V C [Centre for Quantum Computer Technology, School of Electrical Engineering and Telecommunications, UNSW, Sydney, NSW 2052 (Australia); Petersson, K D [Centre for Quantum Computer Technology, School of Electrical Engineering and Telecommunications, UNSW, Sydney, NSW 2052 (Australia); Clark, R G [Centre for Quantum Computer Technology, School of Physics, UNSW, Sydney, 2052 (Australia); Dzurak, A S [Centre for Quantum Computer Technology, School of Electrical Engineering and Telecommunications, UNSW, Sydney, NSW 2052 (Australia)

    2007-06-13

    We present a modelling study on the scaling prospects for phosphorus in silicon (Si:P) single electron devices using readily available commercial and free-to-use software. The devices comprise phosphorus ion implanted, metallically doped (n+) dots (size range 50-500 nm) with source and drain reservoirs. Modelling results are compared to measurements on fabricated devices and discussed in the context of scaling down to few-electron structures. Given current fabrication constraints, we find that devices with 70-75 donors per dot should be realizable. We comment on methods for further reducing this number.

  16. Psychometric properties of responses by clinicians and older adults to a 6-item Hebrew version of the Hamilton Depression Rating Scale (HAM-D6)

    DEFF Research Database (Denmark)

    Bachner, Yaacov G; O'Rourke, Norm; Goldfracht, Margalit

    2013-01-01

    The Hamilton Depression Rating Scale (HAM-D) is commonly used as a screening instrument, as a continuous measure of change in depressive symptoms over time, and as a means to compare the relative efficacy of treatments. Among several abridged versions, the 6-item HAM-D6 is used most widely in lar...... degree because of its good psychometric properties. The current study compares both self-report and clinician-rated versions of the Hebrew version of this scale....

  17. Child Development Program Evaluation Scale.

    Science.gov (United States)

    Fiene, Richard J.

    The Child Development Program Evaluation Scale (CDPES) is actually two scales in one, a licensing scale and a quality scale. Licensing predictor items have been found to predict overall compliance of child day care centers with state regulations in four states. Quality scale items have been found to predict the overall quality of child day care…

  18. An Evaluation of the Brief Symptom Inventory-18 Using Item Response Theory: Which Items Are Most Strongly Related to Psychological Distress?

    Science.gov (United States)

    Meijer, Rob R.; de Vries, Rivka M.; van Bruggen, Vincent

    2011-01-01

    The psychometric structure of the Brief Symptom Inventory-18 (BSI-18; Derogatis, 2001) was investigated using Mokken scaling and parametric item response theory. Data of 487 outpatients, 266 students, and 207 prisoners were analyzed. Results of the Mokken analysis indicated that the BSI-18 formed a strong Mokken scale for outpatients and…

  19. Using automatic item generation to create multiple-choice test items.

    Science.gov (United States)

    Gierl, Mark J; Lai, Hollis; Turner, Simon R

    2012-08-01

    Many tests of medical knowledge, from the undergraduate level to the level of certification and licensure, contain multiple-choice items. Although these are efficient in measuring examinees' knowledge and skills across diverse content areas, multiple-choice items are time-consuming and expensive to create. Changes in student assessment brought about by new forms of computer-based testing have created the demand for large numbers of multiple-choice items. Our current approaches to item development cannot meet this demand. We present a methodology for developing multiple-choice items based on automatic item generation (AIG) concepts and procedures. We describe a three-stage approach to AIG and we illustrate this approach by generating multiple-choice items for a medical licensure test in the content area of surgery. To generate multiple-choice items, our method requires a three-stage process. Firstly, a cognitive model is created by content specialists. Secondly, item models are developed using the content from the cognitive model. Thirdly, items are generated from the item models using computer software. Using this methodology, we generated 1248 multiple-choice items from one item model. Automatic item generation is a process that involves using models to generate items using computer technology. With our method, content specialists identify and structure the content for the test items, and computer technology systematically combines the content to generate new test items. By combining these outcomes, items can be generated automatically. © Blackwell Publishing Ltd 2012.
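
    The third stage described above, in which software systematically combines content from an item model, can be sketched as template expansion. This is a simplified illustration, not the authors' software: the stem template, the element names and all clinical values are hypothetical placeholders, and answer options are omitted.

```python
from itertools import product

# Hypothetical item model: a stem with slots, plus allowed values per slot.
STEM_TEMPLATE = (
    "A {age}-year-old patient presents with {symptom} after {event}. "
    "What is the most appropriate next step?"
)
ELEMENTS = {
    "age": [25, 60],
    "symptom": ["abdominal pain", "fever"],
    "event": ["surgery", "a fall"],
}

def generate_items(template, elements):
    """Fill the template with every combination of element values."""
    names = list(elements)
    stems = []
    for values in product(*(elements[name] for name in names)):
        stems.append(template.format(**dict(zip(names, values))))
    return stems

items = generate_items(STEM_TEMPLATE, ELEMENTS)
print(len(items))  # 2 * 2 * 2 combinations
```

    The number of generated items is the product of the slot sizes, which is how a single item model can yield a large pool; a practical system would also need constraints to exclude combinations that make no clinical sense.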

  20. Negative affect impairs associative memory but not item memory.

    Science.gov (United States)

    Bisby, James A; Burgess, Neil

    2013-12-17

    The formation of associations between items and their context has been proposed to rely on mechanisms distinct from those supporting memory for a single item. Although emotional experiences can profoundly affect memory, our understanding of how it interacts with different aspects of memory remains unclear. We performed three experiments to examine the effects of emotion on memory for items and their associations. By presenting neutral and negative items with background contexts, Experiment 1 demonstrated that item memory was facilitated by emotional affect, whereas memory for an associated context was reduced. In Experiment 2, arousal was manipulated independently of the memoranda, by a threat of shock, whereby encoding trials occurred under conditions of threat or safety. Memory for context was equally impaired by the presence of negative affect, whether induced by threat of shock or a negative item, relative to retrieval of the context of a neutral item in safety. In Experiment 3, participants were presented with neutral and negative items as paired associates, including all combinations of neutral and negative items. The results showed both above effects: compared to a neutral item, memory for the associate of a negative item (a second item here, context in Experiments 1 and 2) is impaired, whereas retrieval of the item itself is enhanced. Our findings suggest that negative affect impairs associative memory while recognition of a negative item is enhanced. They support dual-processing models in which negative affect or stress impairs hippocampal-dependent associative memory while the storage of negative sensory/perceptual representations is spared or even strengthened.

  1. Psychometric evaluation of the Chinese version of the Subjective Happiness Scale: evidence from the Hong Kong FAMILY Cohort.

    Science.gov (United States)

    Nan, Hairong; Ni, Michael Y; Lee, Paul H; Tam, Wilson W S; Lam, Tai Hing; Leung, Gabriel M; McDowell, Ian

    2014-08-01

    With China's rapid economic growth in the past few decades, there is currently an emerging focus on happiness. Cross-cultural validity studies have indicated that the four-item Subjective Happiness Scale (SHS) has high internal consistency and stable reliability. However, the psychometric characteristics of the SHS in broader Chinese community samples are unknown. We evaluated the factor structure and psychometric properties of the SHS in the Hong Kong general population. The Chinese SHS was derived using forward-backward translation. Of the Cantonese-speaking participants aged ≥15 years, 2,635 were randomly selected from the random sample component of the FAMILY Cohort, a territory-wide cohort study in Hong Kong. In addition to the SHS, a single-item overall happiness scale, the Patient Health Questionnaire-9 (PHQ-9), the Family Adaptation, Partnership, Growth, Affection, Resolve (APGAR) scale, and the Medical Outcomes Study 12-item short-form version 2 (SF-12) mental and physical health scales were administered. Exploratory and confirmatory factor analyses supported a single factor with high loadings for the four SHS items. Multiple group analyses indicated factor invariance across sex and age groups. Cronbach's alpha was 0.82, and 2-week test-retest reliability (n = 191) was 0.70. The SHS correlated significantly with single-item overall happiness (Spearman's rho [ρ] = 0.57), Family APGAR (ρ = 0.26), PHQ-9 (ρ = -0.34), and mental health-related quality of life (ρ = 0.40) but showed a lower correlation with physical health (ρ = 0.15). A regression model that included the PHQ-9 and Family APGAR scores explained 37% of the variance in SF-12 mental health scores; adding the SHS raised the variance explained to 41%. Our results support the reliability and validity of the SHS as a relevant component in the measurement battery for mental well-being in a Chinese general population.

  2. Understanding and quantifying cognitive complexity level in mathematical problem solving items

    Directory of Open Access Journals (Sweden)

    SUSAN E. EMBRETSON

    2008-09-01

    The linear logistic test model (LLTM; Fischer, 1973) has been applied to a wide variety of new tests. When the LLTM application involves item complexity variables that are both theoretically interesting and empirically supported, several advantages can result. These advantages include elaborating construct validity at the item level, defining variables for test design, predicting parameters of new items, item banking by sources of complexity and providing a basis for item design and item generation. However, despite the many advantages of applying LLTM to test items, it has been applied less often to understand the sources of complexity for large-scale operational test items. Instead, previously calibrated item parameters are modeled using regression techniques because raw item response data often cannot be made available. In the current study, both LLTM and regression modeling are applied to mathematical problem solving items from a widely used test. The findings from the two methods are compared and contrasted for their implications for continued development of ability and achievement tests based on mathematical problem solving items.

  3. Application of Item Response Theory to Modeling of Expanded Disability Status Scale in Multiple Sclerosis.

    Science.gov (United States)

    Novakovic, A M; Krekels, E H J; Munafo, A; Ueckert, S; Karlsson, M O

    2017-01-01

    In this study, we report the development of the first item response theory (IRT) model within a pharmacometrics framework to characterize the disease progression in multiple sclerosis (MS), as measured by the Expanded Disability Status Scale (EDSS). Data were collected quarterly from a 96-week phase III clinical study by a blinded rater, involving 104,206 item-level observations from 1319 patients with relapsing-remitting MS (RRMS), treated with placebo or cladribine. Observed scores for each EDSS item were modeled describing the probability of a given score as a function of patients' (unobserved) disability using a logistic model. Longitudinal data from placebo arms were used to describe the disease progression over time, and the model was then extended to cladribine arms to characterize the drug effect. Sensitivity with respect to patient disability was calculated as Fisher information for each EDSS item, and the items were ranked according to the amount of information they contained. The IRT model was able to describe baseline and longitudinal EDSS data on item and total level. The final model suggested that cladribine treatment significantly slows disease-progression rate, with a 20% decrease in disease-progression rate compared to placebo, irrespective of exposure, and effects an additional exposure-dependent reduction in disability progression. Four out of eight items contained 80% of information for the given range of disabilities. This study has illustrated that IRT modeling is specifically suitable for accurate quantification of disease status and description and prediction of disease progression in phase III studies on RRMS, by integrating EDSS item-level data in a meaningful manner.
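
    The idea of ranking items by Fisher information can be illustrated with a deliberately simplified sketch. It assumes dichotomous items under a two-parameter logistic (2PL) model, whereas the EDSS items are ordinal, so this is not the model used in the study; the item names and the parameters `a` (discrimination) and `b` (location) are hypothetical.

```python
import math

def p_endorse(theta, a, b):
    """2PL probability of the higher score given latent disability theta."""
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))

def item_information(theta, a, b):
    """Fisher information of a 2PL item at theta: a^2 * P * (1 - P)."""
    p = p_endorse(theta, a, b)
    return a * a * p * (1.0 - p)

# Hypothetical (a, b) parameters for three items; illustration only.
items = {
    "item_A": (2.0, 0.0),
    "item_B": (1.0, 0.0),
    "item_C": (1.5, 1.0),
}

theta = 0.0  # latent disability level at which items are compared
ranked = sorted(
    items,
    key=lambda name: item_information(theta, *items[name]),
    reverse=True,
)
print(ranked)
```

    Information peaks where theta equals the item location and grows with the square of the discrimination, so a ranking of this kind depends on the range of disability considered.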

  4. The Effects of Donepezil on 15-Item Geriatric Depression Scale Structure in Patients with Alzheimer Disease

    Directory of Open Access Journals (Sweden)

    Youngsoon Yang

    2016-09-01

    Background/Aims: In Alzheimer disease (AD), depression is among the most common accompanying neuropsychiatric symptoms and has different clinical manifestations when compared with early-life depression. In patients with drug-naïve AD, we tried to explore the structure of the 15-item Geriatric Depression Scale (GDS15) and the effect of donepezil on these substructures. Methods: GDS15, cognitive function, and activities of daily living function tests were administered to 412 patients with probable AD who had not been medicated before visiting the hospital. Using principal component analysis, three factors were identified. The patients with AD who received only donepezil were retrospectively analyzed and we compared the change of cognition and GDS15 subgroup after donepezil medication. Results: Our study identified three factors and revealed that the GDS15 may be comprised of a heterogeneous scale. The Barthel index was significantly correlated with factor 1 (positively) and factor 2 (negatively). The Korean version of the MMSE (K-MMSE) was significantly correlated with factor 2 and factor 3. Compared to the baseline state, K-MMSE and GDS15 showed significant improvement after taking donepezil. Among GDS15 subgroups, factor 2 and factor 3 showed significant improvement after donepezil treatment. Conclusions: These results suggest that the GDS15 may be comprised of a heterogeneous scale and donepezil differentially affects the GDS15 subgroup in AD.

  5. The factor structure of the Social Interaction Anxiety Scale and the Social Phobia Scale.

    Science.gov (United States)

    Heidenreich, Thomas; Schermelleh-Engel, Karin; Schramm, Elisabeth; Hofmann, Stefan G; Stangier, Ulrich

    2011-05-01

    The Social Interaction Anxiety Scale (SIAS) and the Social Phobia Scale (SPS) are two compendium measures that have become some of the most popular self-report scales of social anxiety. Despite their popularity, it remains unclear whether it is necessary to maintain two separate scales of social anxiety. The primary objective of the present study was to examine the factor analytic structure of both measures to determine the factorial validity of each scale. For this purpose, we administered both scales to 577 patients at the beginning of outpatient treatment. Analyzing both scales simultaneously, a CFA with two correlated factors showed a better fit to the data than a single factor model. An additional EFA with an oblique rotation on all 40 items using the WLSMV estimator further supported the two factor solution. These results suggest that the SIAS and SPS measure similar, but not identical facets of social anxiety. Thus, our findings provide support to retain the SIAS and SPS as two separate scales. Copyright © 2011 Elsevier Ltd. All rights reserved.

  6. Synthesis of Large-Scale Single-Crystalline Monolayer WS2 Using a Semi-Sealed Method

    Directory of Open Access Journals (Sweden)

    Feifei Lan

    2018-02-01

    As a two-dimensional semiconductor, WS2 has attracted great attention due to its rich physical properties and potential applications. However, it is still difficult to synthesize monolayer single-crystalline WS2 at larger scale. Here, we report the growth of large-scale triangular single-crystalline WS2 with a semi-sealed installation by chemical vapor deposition (CVD). Through this method, triangular single-crystalline WS2 with an average length of more than 300 µm was obtained. The largest one was about 405 µm in length. WS2 triangles with different sizes and thicknesses were analyzed by optical microscope and atomic force microscope (AFM). Their optical properties were evaluated by Raman and photoluminescence (PL) spectra. This report paves the way to fabricating large-scale single-crystalline monolayer WS2, which is useful for the growth of high-quality WS2 and its potential applications in the future.

  7. Development of a short form Social Interaction Anxiety (SIAS) and Social Phobia Scale (SPS) using nonparametric item response theory: the SIAS-6 and the SPS-6.

    Science.gov (United States)

    Peters, Lorna; Sunderland, Matthew; Andrews, Gavin; Rapee, Ronald M; Mattick, Richard P

    2012-03-01

    Shortened forms of the Social Interaction Anxiety Scale (SIAS) and the Social Phobia Scale (SPS) were developed using nonparametric item response theory methods. Using data from socially phobic participants enrolled in 5 treatment trials (N = 456), 2 six-item scales (the SIAS-6 and the SPS-6) were developed. The validity of the scores on the SIAS-6 and the SPS-6 was then tested using traditional methods for their convergent validity in an independent clinical sample and a student sample, as well as for their sensitivity to change and diagnostic sensitivity in the clinical sample. The scores on the SIAS-6 and the SPS-6 correlated as well as the scores on the original SIAS and SPS, with scores on measures of related constructs, discriminated well between those with and without a diagnosis of social phobia, providing cutoffs for diagnosis and were as sensitive to measuring change associated with treatment as were the SIAS and SPS. Together, the SIAS-6 and the SPS-6 appear to be an efficient method of measuring symptoms of social phobia and provide a brief screening tool.

  8. Sources of interference in item and associative recognition memory.

    Science.gov (United States)

    Osth, Adam F; Dennis, Simon

    2015-04-01

    A powerful theoretical framework for exploring recognition memory is the global matching framework, in which a cue's memory strength reflects the similarity of the retrieval cues being matched against the contents of memory simultaneously. Contributions at retrieval can be categorized as matches and mismatches to the item and context cues, including the self match (match on item and context), item noise (match on context, mismatch on item), context noise (match on item, mismatch on context), and background noise (mismatch on item and context). We present a model that directly parameterizes the matches and mismatches to the item and context cues, which enables estimation of the magnitude of each interference contribution (item noise, context noise, and background noise). The model was fit within a hierarchical Bayesian framework to 10 recognition memory datasets that use manipulations of strength, list length, list strength, word frequency, study-test delay, and stimulus class in item and associative recognition. Estimates of the model parameters revealed at most a small contribution of item noise that varies by stimulus class, with virtually no item noise for single words and scenes. Despite the unpopularity of background noise in recognition memory models, background noise estimates dominated at retrieval across nearly all stimulus classes with the exception of high frequency words, which exhibited equivalent levels of context noise and background noise. These parameter estimates suggest that the majority of interference in recognition memory stems from experiences acquired before the learning episode. (c) 2015 APA, all rights reserved.

  9. Response pattern of depressive symptoms among college students: What lies behind items of the Beck Depression Inventory-II?

    Science.gov (United States)

    de Sá Junior, Antonio Reis; de Andrade, Arthur Guerra; Andrade, Laura Helena; Gorenstein, Clarice; Wang, Yuan-Pang

    2018-07-01

    This study examines the response pattern of depressive symptoms in a nationwide student sample, through item analyses of a rating scale by both classical test theory (CTT) and item response theory (IRT). The 21-item Beck Depression Inventory-II (BDI-II) was administered to 12,711 college students. First, the psychometric properties of the scale were described. Thereafter, the endorsement probability of depressive symptom in each scale item was analyzed through CTT and IRT. Graphical plots depicted the endorsement probability of scale items and intensity of depression. Three items of different difficulty level were compared through CTT and IRT approach. Four in five students reported the presence of depressive symptoms. The BDI-II items presented good reliability and were distributed along the symptomatic continuum of depression. Similarly, in both CTT and IRT approaches, the item 'changes in sleep' was easily endorsed, 'loss of interest' moderately and 'suicidal thoughts' hardly. Graphical representation of BDI-II of both methods showed much equivalence in terms of item discrimination and item difficulty. The item characteristic curve of the IRT method provided informative evaluation of item performance. The inventory was applied only in college students. Depressive symptoms were frequent psychopathological manifestations among college students. The performance of the BDI-II items indicated convergent results from both methods of analysis. While the CTT was easy to understand and to apply, the IRT was more complex to understand and to implement. Comprehensive assessment of the functioning of each BDI-II item might be helpful in efficient detection of depressive conditions in college students. Copyright © 2018 Elsevier B.V. All rights reserved.
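
    The item characteristic curves mentioned above can be illustrated with a two-parameter logistic (2PL) function. This is a simplified sketch: BDI-II items are scored 0 to 3, so a binary "symptom endorsed" response is an assumption made here, and the discrimination and difficulty values are hypothetical. Only the ordering of the three items (easily, moderately, and hardly endorsed) follows the abstract.

```python
import numpy as np

def icc_2pl(theta, a, b):
    """2PL item characteristic curve: probability of endorsing an item
    at latent severity theta, with discrimination a and difficulty b."""
    return 1.0 / (1.0 + np.exp(-a * (np.asarray(theta, dtype=float) - b)))

# Hypothetical (a, b) parameters; higher b means the item is endorsed
# only at higher levels of depression severity.
items = {
    "changes in sleep": (1.2, -1.0),
    "loss of interest": (1.2, 0.5),
    "suicidal thoughts": (1.2, 2.5),
}

theta = np.linspace(-3.0, 3.0, 7)
curves = {name: icc_2pl(theta, a, b) for name, (a, b) in items.items()}
```

    With equal discriminations the curves never cross, so at every severity level the "easy" item has the highest endorsement probability and the "hard" item the lowest.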

  10. Validation of the 17-item Hamilton Depression Rating Scale definition of response for adults with major depressive disorder using equipercentile linking to Clinical Global Impression scale ratings: analysis of Pharmacogenomic Research Network Antidepressant Medication Pharmacogenomic Study (PGRN-AMPS) data.

    Science.gov (United States)

    Bobo, William V; Angleró, Gabriela C; Jenkins, Gregory; Hall-Flavin, Daniel K; Weinshilboum, Richard; Biernacka, Joanna M

    2016-05-01

    The study aimed to define thresholds of clinically significant change in 17-item Hamilton Depression Rating Scale (HDRS-17) scores using the Clinical Global Impression-Improvement (CGI-I) Scale as a gold standard. We conducted a secondary analysis of individual patient data from the Pharmacogenomic Research Network Antidepressant Medication Pharmacogenomic Study, an 8-week, single-arm clinical trial of citalopram or escitalopram treatment of adults with major depression. We used equipercentile linking to identify levels of absolute and percent change in HDRS-17 scores that equated with scores on the CGI-I at 4 and 8 weeks. Additional analyses equated changes in the HDRS-7 and Bech-6 scale scores with CGI-I scores. A CGI-I score of 2 (much improved) corresponded to an absolute decrease (improvement) in HDRS-17 total score of 11 points and a percent decrease of 50-57%, from baseline values. Similar results were observed for percent change in HDRS-7 and Bech-6 scores. Larger absolute (but not percent) decreases in HDRS-17 scores equated with CGI-I scores of 2 in persons with higher baseline depression severity. Our results support the consensus definition of response based on HDRS-17 scores (>50% decrease from baseline). A similar definition of response may apply to the HDRS-7 and Bech-6. Copyright © 2016 John Wiley & Sons, Ltd.
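
    Equipercentile linking, as used above, maps a score on one scale to the score on another scale that has the same percentile rank. The sketch below is a bare-bones version under stated assumptions: it uses mid-percentile ranks and NumPy's default linear interpolation, applies no smoothing, and the function name and example data are hypothetical rather than taken from the study.

```python
import numpy as np

def equipercentile_link(x_scores, y_scores, x_value):
    """Return the value on scale Y whose percentile rank matches
    the percentile rank of x_value on scale X."""
    x_scores = np.asarray(x_scores, dtype=float)
    y_scores = np.asarray(y_scores, dtype=float)
    # Mid-percentile rank: share of scores below, plus half the share of ties.
    below = np.mean(x_scores < x_value)
    equal = np.mean(x_scores == x_value)
    rank = 100.0 * (below + 0.5 * equal)
    return float(np.percentile(y_scores, rank))

# Hypothetical example: Y is simply X doubled, so linked values double too.
x = np.arange(0, 101)
y = 2 * x
linked = equipercentile_link(x, y, 50)
```

    In practice the two score distributions come from the same patients at the same visit, and published applications usually smooth the distributions before linking.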

  11. Development of new physical activity and sedentary behavior change self-efficacy questionnaires using item response modeling

    Directory of Open Access Journals (Sweden)

    Venditti Elizabeth

    2009-03-01

    Background: Theoretically, increased levels of physical activity self-efficacy (PASE) should lead to increased physical activity, but few studies have reported this effect among youth. This failure may be at least partially attributable to measurement limitations. In this study, Item Response Modeling (IRM) was used to develop new physical activity and sedentary behavior change self-efficacy scales. The validity of the new scales was compared with accelerometer assessments of physical activity and sedentary behavior. Methods: New PASE and sedentary behavior change (TV viewing, computer video game use, and telephone use) self-efficacy items were developed. The scales were completed by 714 6th grade students in seven US cities. A limited number of participants (83) also wore an accelerometer for five days and provided at least 3 full days of complete data. The new scales were analyzed using Classical Test Theory (CTT) and IRM; a reduced set of items was produced with IRM and correlated with accelerometer counts per minute and minutes of sedentary, light and moderate to vigorous activity per day after school. Results: The PASE items discriminated between high and low levels of PASE. Full and reduced scales were weakly correlated (r = 0.18) with accelerometer counts per minute after school for boys, with comparable associations for girls. Weaker correlations were observed between PASE and minutes of moderate to vigorous activity (r = 0.09 – 0.11). The uni-dimensionality of the sedentary scales was established by both exploratory factor analysis and the fit of items to the underlying variable and reliability was assessed across the length of the underlying variable with some limitations. The reduced sedentary behavior scales had poor reliability. The full scales were moderately correlated with light intensity physical activity after school (r = 0.17 to 0.33) and sedentary behavior (r = -0.29 to -0.12) among the boys, but not for girls. Conclusion: New…

  12. Leadership: validation of a self-report scale.

    Science.gov (United States)

    Dussault, Marc; Frenette, Eric; Fernet, Claude

    2013-04-01

    The aim of this paper was to propose and test the factor structure of a new self-report questionnaire on leadership. A sample of 373 school principals in the Province of Quebec, Canada completed the initial 46-item version of the questionnaire. In order to obtain a questionnaire of minimal length, a four-step procedure was retained. First, items analysis was performed using Classical Test Theory. Second, Rasch analysis was used to identify non-fitting or overlapping items. Third, a confirmatory factor analysis (CFA) using structural equation modelling was performed on the 21 remaining items to verify the factor structure of the scale. Results show that the model with a single third-order dimension (leadership), two second-order dimensions (transactional and transformational leadership), and one first-order dimension (laissez-faire leadership) provides a good fit to the data. Finally, invariance of factor structure was assessed with a second sample of 222 vice-principals in the Province of Quebec, Canada. This model is in agreement with the theoretical model developed by Bass (1985), upon which the questionnaire is based.

  13. [Development of meaning in life scale II].

    Science.gov (United States)

    Choi, Soon-Ock; Kim, Sook-Nam; Shin, Kyung-Il; Lee, Jong-Ji

    2005-08-01

    The purpose of this study was to develop a meaning of life scale with high validity and reliability. A conceptual framework composed of 4 phases of meanings of life was identified, and 49 preliminary items on a 4-point scale were developed through content validation. A reliability and validity test of the 49 items was conducted on 564 adults. On the basis of the internal consistency of the 49 items, 1 item was deleted. To verify the 48 items, factor analysis, a reliability test, and LISREL were used. Through exploratory factor analysis of the 48 items, 8 factors were extracted. These factors were labeled 'self-awareness and self-acceptance', 'hope', 'responsibility awareness', 'love experience', 'self transcendence', 'relation experience', 'self contentedness', and 'commitment'. Through LISREL analysis of the 48 items, 2 items were excluded and finally 46 items remained. Cronbach's alpha of the 46 items was .94. The correlation coefficient with the Self-esteem scale was .79. Based on these results, the researchers recommend the following: exploratory studies of the variables related to the meaning of life are needed to establish the criterion validity of this scale, and studies of the meaning of life in different groups and subjects are needed for reverification.
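
    The reliability coefficient reported above, Cronbach's alpha, is computed from the item variances and the variance of the total score. The following is a minimal sketch of the standard formula; the function name and the toy response matrix are hypothetical and unrelated to the study's data.

```python
import numpy as np

def cronbach_alpha(scores):
    """Cronbach's alpha for a (respondents x items) score matrix."""
    scores = np.asarray(scores, dtype=float)
    k = scores.shape[1]                          # number of items
    item_vars = scores.var(axis=0, ddof=1)       # sample variance of each item
    total_var = scores.sum(axis=1).var(ddof=1)   # variance of the total score
    return (k / (k - 1)) * (1.0 - item_vars.sum() / total_var)

# Hypothetical toy data: 4 respondents answering 2 items.
toy = [[1, 2],
       [2, 1],
       [3, 4],
       [4, 3]]
alpha = cronbach_alpha(toy)
```

    Alpha rises with the number of items as well as with their intercorrelation, which is one reason long scales such as a 46-item instrument often show values above .90.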

  14. Effects of memantine on cognition in patients with moderate to severe Alzheimer's disease: post-hoc analyses of ADAS-cog and SIB total and single-item scores from six randomized, double-blind, placebo-controlled studies.

    Science.gov (United States)

    Mecocci, Patrizia; Bladström, Anna; Stender, Karina

    2009-05-01

    The post-hoc analyses reported here evaluate the specific effects of memantine treatment on ADAS-cog single-items or SIB subscales for patients with moderate to severe AD. Data from six multicentre, randomised, placebo-controlled, parallel-group, double-blind, 6-month studies were used as the basis for these post-hoc analyses. All patients with a Mini-Mental State Examination (MMSE) score of less than 20 were included. Analyses of patients with moderate AD (MMSE: 10-19), evaluated with the Alzheimer's disease Assessment Scale (ADAS-cog) and analyses of patients with moderate to severe AD (MMSE: 3-14), evaluated using the Severe Impairment Battery (SIB), were performed separately. The mean change from baseline showed a significant benefit of memantine treatment on both the ADAS-cog (p ADAS-cog single-item analyses showed significant benefits of memantine treatment, compared to placebo, for mean change from baseline for commands (p < 0.001), ideational praxis (p < 0.05), orientation (p < 0.01), comprehension (p < 0.05), and remembering test instructions (p < 0.05) for observed cases (OC). The SIB subscale analyses showed significant benefits of memantine, compared to placebo, for mean change from baseline for language (p < 0.05), memory (p < 0.05), orientation (p < 0.01), praxis (p < 0.001), and visuospatial ability (p < 0.01) for OC. Memantine shows significant benefits on overall cognitive abilities as well as on specific key cognitive domains for patients with moderate to severe AD. (c) 2009 John Wiley & Sons, Ltd.

  15. Item Response Theory analysis of Fagerström Test for Cigarette Dependence.

    Science.gov (United States)

    Svicher, Andrea; Cosci, Fiammetta; Giannini, Marco; Pistelli, Francesco; Fagerström, Karl

    2018-02-01

    The Fagerström Test for Cigarette Dependence (FTCD) and the Heaviness of Smoking Index (HSI) are the gold standard measures to assess cigarette dependence. However, FTCD reliability and factor structure have been questioned and HSI psychometric properties are in need of further investigation. The present study examined the psychometric properties of the FTCD and the HSI via Item Response Theory. The study was a secondary analysis of data collected in 862 Italian daily smokers. Confirmatory factor analysis was run to evaluate the dimensionality of FTCD. A Graded Response Model was applied to FTCD and HSI to verify the fit to the data. Both item and test functioning were analyzed and item statistics, Test Information Function, and scale reliabilities were calculated. Mokken Scale Analysis was applied to estimate homogeneity and Loevinger's coefficients were calculated. The FTCD showed unidimensionality and homogeneity for most of the items and for the total score. It also showed high sensitivity and good reliability from medium to high levels of cigarette dependence, although problems related to some items (i.e., items 3 and 5) were evident. HSI had good homogeneity, adequate item functioning, and high reliability from medium to high levels of cigarette dependence. Significant Differential Item Functioning was found for items 1, 4, 5 of the FTCD and for both items of HSI. HSI seems highly recommended in clinical settings addressed to heavy smokers while FTCD would be better used in smokers with a level of cigarette dependence ranging between low and high. Copyright © 2017 Elsevier Ltd. All rights reserved.

  16. The Stanford Leisure-Time Activity Categorical Item (L-Cat): a single categorical item sensitive to physical activity changes in overweight/obese women.

    Science.gov (United States)

    Kiernan, M; Schoffman, D E; Lee, K; Brown, S D; Fair, J M; Perri, M G; Haskell, W L

    2013-12-01

    Physical activity is essential for chronic disease prevention, yet Cat) is a single item comprising six descriptive categories ranging from inactive to very active. This novel methodological approach assesses national activity recommendations as well as multiple clinically relevant categories below and above the recommendations, and incorporates critical methodological principles that enhance psychometrics (reliability, validity and sensitivity to change). We evaluated the L-Cat's psychometrics among 267 overweight/obese women who were asked to meet the national activity recommendations in a randomized behavioral weight-loss trial. The L-Cat had excellent test-retest reliability (κ=0.64, PCat category at 6 months was associated with 1059 more daily pedometer steps (95% CI 712-1407, β=0.38, PCat categories differentiated from each other in a dose-response gradient for steps and weight loss (PsCat was sensitive to change in response to the trial's activity component. Women increased one L-Cat category at 6 months (M=1.0±1.4, PCat categories at 6 months lost more weight than those who did not (M=-4.6%, 95% CI -6.7 to -2.5, PCat has timely potential for clinical use such as tracking activity changes via electronic medical records, especially among overweight/obese populations who are unable or unlikely to reach national recommendations.

  17. Industrial-scale separation of high-purity single-chirality single-wall carbon nanotubes for biological imaging

    Science.gov (United States)

    Yomogida, Yohei; Tanaka, Takeshi; Zhang, Minfang; Yudasaka, Masako; Wei, Xiaojun; Kataura, Hiromichi

    2016-01-01

    Single-chirality, single-wall carbon nanotubes are desired due to their inherent physical properties and performance characteristics. Here, we demonstrate a chromatographic separation method based on a newly discovered chirality-selective affinity between carbon nanotubes and a gel containing a mixture of the surfactants. In this system, two different selectivities are found: chiral-angle selectivity and diameter selectivity. Since the chirality of nanotubes is determined by the chiral angle and diameter, combining these independent selectivities leads to high-resolution single-chirality separation with milligram-scale throughput and high purity. Furthermore, we present efficient vascular imaging of mice using separated single-chirality (9,4) nanotubes. Due to efficient absorption and emission, blood vessels can be recognized even with the use of ∼100-fold lower injected dose than the reported value for pristine nanotubes. Thus, 1 day of separation provides material for up to 15,000 imaging experiments, which is acceptable for industrial use. PMID:27350127

  18. Interpreting the Third International Mathematics and Science Study (TIMSS) achievement scales using scale anchoring

    Science.gov (United States)

    Kelly, Dana L.

    1999-11-01

    The scale anchoring method was used to analyze and describe the TIMSS primary and middle school (Populations 1 and 2) mathematics and science achievement scales. Scale anchoring is a way of attaching meaning to a scale by describing what students know and can do at specific points on the scale. Student achievement was scrutinized at four points on the TIMSS primary and middle school achievement scales: the 25th, 50th, 75th, and 90th international percentiles for fourth and eighth grades. The scale anchoring method was adapted for the TIMSS data and items that students scoring at each of the four scale points were likely to answer correctly (with a 65 percent probability) were identified. The items were assembled in binders organized by anchor level and content area. Two ten-member panels of subject-matter specialists were convened to scrutinize the items, draft descriptions of student proficiency at the four scale points, and identify example TIMSS items to illustrate performance at each level. Following the panel meetings, the descriptions were refined through an iterative review process. The result is a content-referenced interpretation of the TIMSS scales through which TIMSS achievement results can be better communicated and understood.

  19. The Role of Item Models in Automatic Item Generation

    Science.gov (United States)

    Gierl, Mark J.; Lai, Hollis

    2012-01-01

    Automatic item generation represents a relatively new but rapidly evolving research area where cognitive and psychometric theories are used to produce tests that include items generated using computer technology. Automatic item generation requires two steps. First, test development specialists create item models, which are comparable to templates…

  20. Validity and Reliability of Trichotomous Achievement Goal Scale

    Science.gov (United States)

    Ilker, Gokce Erturan; Arslan, Yunus; Demirhan, Giyasettin

    2011-01-01

    The Trichotomous Achievement Goal Scale was developed by Agbuga and Xiang (2008) by including selected items from the scales of Duda and Nicholls (1992), Elliot (1999), and Elliot and Church (1997) and adapting them into Turkish. The scale consists of 18 items, and students rated each item on a 7-point Likert scale. To ascertain the validity and…

  1. Selection of multiple cued items is possible during visual short-term memory maintenance.

    Science.gov (United States)

    Matsukura, Michi; Vecera, Shaun P

    2015-07-01

    Recent neuroimaging studies suggest that maintenance of a selected object feature held in visual short-term/working memory (VSTM/VWM) is supported by the same neural mechanisms that encode the sensory information. If VSTM operates by retaining "reasonable copies" of scenes constructed during sensory processing (Serences, Ester, Vogel, & Awh, 2009, p. 207, the sensory recruitment hypothesis), then attention should be able to select multiple items represented in VSTM as long as the number of these attended items does not exceed the typical VSTM capacity. It is well known that attention can select at least two noncontiguous locations at the same time during sensory processing. However, empirical reports from the studies that examined this possibility are inconsistent. In the present study, we demonstrate that (1) attention can indeed select more than a single item during VSTM maintenance when observers are asked to recognize a set of items in the manner that these items were originally attended, and (2) attention can select multiple cued items regardless of whether these items are perceptually organized into a single group (contiguous locations) or not (noncontiguous locations). The results also replicate and extend the recent finding that selective attention that operates during VSTM maintenance is sensitive to the observers' goal and motivation to use the cueing information.

  2. Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.

    Science.gov (United States)

    Haberman, Shelby J; Sinharay, Sandip; Chon, Kyong Hee

    2013-07-01

    Residual analysis (e.g. Hambleton & Swaminathan, Item response theory: principles and applications, Kluwer Academic, Boston, 1985; Hambleton, Swaminathan, & Rogers, Fundamentals of item response theory, Sage, Newbury Park, 1991) is a popular method to assess fit of item response theory (IRT) models. We suggest a form of residual analysis that may be applied to assess item fit for unidimensional IRT models. The residual analysis consists of a comparison of the maximum-likelihood estimate of the item characteristic curve with an alternative ratio estimate of the item characteristic curve. The large sample distribution of the residual is proved to be standardized normal when the IRT model fits the data. We compare the performance of our suggested residual to the standardized residual of Hambleton et al. (Fundamentals of item response theory, Sage, Newbury Park, 1991) in a detailed simulation study. We then calculate our suggested residuals using data from an operational test. The residuals appear to be useful in assessing the item fit for unidimensional IRT models.

  3. Using Reversed MFCC and IT-EM for Automatic Speaker Verification

    Directory of Open Access Journals (Sweden)

    Sheeraz Memon

    2012-01-01

    This paper proposes a text-independent automatic speaker verification system using IMFCC (Inverse/Reverse Mel Frequency Coefficients) and IT-EM (Information Theoretic Expectation Maximization). For speaker verification, feature extraction using the Mel scale has been widely applied and has established better results. The IMFCC is based on the inverse Mel scale. The IMFCC effectively captures information available at the high-frequency formants, which is ignored by the MFCC. In this paper, the fusion of MFCC and IMFCC at input level is proposed. GMMs (Gaussian Mixture Models) based on EM (Expectation Maximization) have been widely used for classification in text-independent verification. However, EM suffers from convergence issues. In this paper we use our proposed IT-EM, which has faster convergence, to train speaker models. IT-EM uses information theory principles such as PDE (Parzen Density Estimation) and the KL (Kullback-Leibler) divergence measure. IT-EM adapts the weights, means and covariances, like EM. However, the IT-EM process is not performed on feature vector sets but on a set of centroids obtained using an IT (Information Theoretic) metric. The IT-EM process simultaneously reduces the divergence measure between PDE estimates of the feature distribution within a given class and the centroid distribution within the same class. Feature-level fusion and IT-EM are tested on the task of speaker verification using NIST2001 and NIST2004. The experimental evaluation validates that MFCC/IMFCC gives better results than the conventional delta/MFCC feature set. The MFCC/IMFCC feature vector is also much smaller than the delta MFCC, thus reducing the computational burden as well. The IT-EM method also showed faster convergence than the conventional EM method, and thus it leads to higher speaker recognition scores.

  4. Normal Theory Two-Stage ML Estimator When Data Are Missing at the Item Level.

    Science.gov (United States)

    Savalei, Victoria; Rhemtulla, Mijke

    2017-08-01

    In many modeling contexts, the variables in the model are linear composites of the raw items measured for each participant; for instance, regression and path analysis models rely on scale scores, and structural equation models often use parcels as indicators of latent constructs. Currently, no analytic estimation method exists to appropriately handle missing data at the item level. Item-level multiple imputation (MI), however, can handle such missing data straightforwardly. In this article, we develop an analytic approach for dealing with item-level missing data-that is, one that obtains a unique set of parameter estimates directly from the incomplete data set and does not require imputations. The proposed approach is a variant of the two-stage maximum likelihood (TSML) methodology, and it is the analytic equivalent of item-level MI. We compare the new TSML approach to three existing alternatives for handling item-level missing data: scale-level full information maximum likelihood, available-case maximum likelihood, and item-level MI. We find that the TSML approach is the best analytic approach, and its performance is similar to item-level MI. We recommend its implementation in popular software and its further study.

  5. The Major Depressive Disorder Hierarchy: Rasch Analysis of 6 items of the Hamilton Depression Scale Covering the Continuum of Depressive Syndrome.

    Directory of Open Access Journals (Sweden)

    Lucas Primo de Carvalho Alves

    Melancholic features of depression (MFD) seem to be a unidimensional group of signs and symptoms. However, little importance has been given to evaluating which features are related to a more severe disorder, that is, which MFD appear only in the most depressed patients. We aim to demonstrate how each MFD is related to the severity of major depressive disorder. We evaluated both the Hamilton Depression Rating Scale (HDRS-17) and its 6-item melancholic subscale (HAM-D6) in 291 depressed inpatients using Rasch analysis, which computes the severity of each MFD. Overall measures of model fit were mean (±SD) of item and person residuals = 0 (±1); low χ2 value; p>0.01. For the HDRS-17 model fit, mean (±SD) of item residuals = 0.35 (±1.4); mean (±SD) of person residuals = -0.15 (±1.09); χ2 = 309.74; p<0.00001. For the HAM-D6 model fit, mean (±SD) of item residuals = 0.5 (±0.86); mean (±SD) of person residuals = 0.15 (±0.91); χ2 = 56.13; p = 0.196. MFD ordered by increasing severity were depressed mood, work and activities, somatic symptoms, psychic anxiety, guilt feelings, and psychomotor retardation. Depressed mood is a less severe MFD, while guilt feelings and psychomotor retardation are more severe MFD in a psychiatric hospitalization. Understanding depression as a continuum of symptoms can improve the understanding of the disorder and may improve its treatment prospects.

  6. Item information and discrimination functions for trinary PCM items

    NARCIS (Netherlands)

    Akkermans, Wies; Muraki, Eiji

    1997-01-01

    For trinary partial credit items the shape of the item information and the item discrimination function is examined in relation to the item parameters. In particular, it is shown that these functions are unimodal if δ2 – δ1 < 4 ln 2 and bimodal otherwise. The locations and values of the maxima are
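
    The unimodal/bimodal threshold stated in this abstract can be checked numerically. The sketch below is an illustration, not the authors' derivation: it assumes the standard partial credit model with unit discrimination, for which item information equals the conditional variance of the item score, and it counts local maxima on a grid. The function names and grid settings are hypothetical choices.

```python
import math

def pcm_probs(theta, d1, d2):
    """Category probabilities (scores 0, 1, 2) for a trinary partial credit item."""
    # Cumulative sums of (theta - delta_j) are the unnormalised log-weights.
    logits = [0.0, theta - d1, 2.0 * theta - d1 - d2]
    top = max(logits)                       # subtract the max for numerical stability
    weights = [math.exp(v - top) for v in logits]
    total = sum(weights)
    return [w / total for w in weights]

def pcm_information(theta, d1, d2):
    """Item information, taken here as the variance of the item score at theta."""
    probs = pcm_probs(theta, d1, d2)
    mean = sum(k * p for k, p in enumerate(probs))
    return sum((k - mean) ** 2 * p for k, p in enumerate(probs))

def count_modes(d1, d2, lo=-8.0, hi=8.0, steps=1601):
    """Count local maxima of the information function on an evenly spaced grid."""
    grid = [lo + (hi - lo) * i / (steps - 1) for i in range(steps)]
    info = [pcm_information(t, d1, d2) for t in grid]
    return sum(1 for i in range(1, steps - 1)
               if info[i - 1] < info[i] >= info[i + 1])

threshold = 4 * math.log(2)                 # about 2.77
modes_below = count_modes(-1.0, 1.0)        # delta2 - delta1 = 2, below the threshold
modes_above = count_modes(-2.0, 2.0)        # delta2 - delta1 = 4, above the threshold
```

    With these two parameter pairs the grid search finds one maximum below the threshold and two above it, in line with the stated condition.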

  7. [A Brief Homophobia Scale in Medical Students From Two Universities: Results of A Refinement Process].

    Science.gov (United States)

    Campo-Arias, Adalberto; Herazo, Edwin; Oviedo, Heidi Celina

    The process of evaluating measurement scales is an ongoing procedure that requires revisions and adaptations according to the characteristics of the participants. The seven-item Homophobia Scale (EHF-7) has shown acceptable performance in medical students attending two universities in Colombia. However, some items performed poorly and could be removed, improving the psychometric findings for the retained items. To review the psychometric functioning and refine the content of EHF-7 among medical students from two Colombian universities. A group of 667 students from the first to tenth semester participated in the research. Their ages were between 18 and 34 years (mean, 20.9±2.7), and 60.6% were female. Cronbach alpha (α) and McDonald omega (Ω) were calculated as indicators of reliability, and, to refine the scale, exploratory (EFA) and confirmatory (CFA) factor analyses were performed. EHF-7 showed α=.793 and Ω=.796 and a main factor that explained 45.2% of the total variance. EFA and CFA suggested the suppression of three items. The four-item version (EHF-4) reached α=.770 and Ω=.775, with a single factor that accounted for 59.7% of the total variance. CFA showed better indexes (χ2=3.622; df=1; P=.057; Root-mean-square error of approximation (RMSEA)=.063, 90% CI, .000-.130; Comparative Fit Index (CFI)=.998; Tucker-Lewis Index (TLI)=.991). EHF-4 shows high internal consistency and a single dimension that explains more than 50% of the total variance. Further studies are needed to confirm these observations, which should be regarded as preliminary. Copyright © 2016 Asociación Colombiana de Psiquiatría. Publicado por Elsevier España. All rights reserved.
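
    The reported RMSEA can be related to the reported chi-square, degrees of freedom and sample size. The sketch below assumes the common point-estimate formula with N - 1 in the denominator; the abstract does not say which formulation or software the authors used, so this is only a plausibility check, and the function name is hypothetical.

```python
import math

def rmsea(chi2, df, n):
    """RMSEA point estimate, assuming the sqrt(max(chi2 - df, 0) / (df * (n - 1))) form."""
    return math.sqrt(max(chi2 - df, 0.0) / (df * (n - 1)))

# Values quoted in the abstract: chi-square 3.622, df 1, 667 participants.
value = rmsea(3.622, 1, 667)
print(round(value, 3))  # 0.063
```

    Under that assumption the computed value agrees with the RMSEA of .063 given in the abstract.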

  8. The (mis)measurement of the Dark Triad Dirty Dozen: exploitation at the core of the scale.

    Science.gov (United States)

    Kajonius, Petri J; Persson, Björn N; Rosenberg, Patricia; Garcia, Danilo

    2016-01-01

    Background. The dark side of human character has been conceptualized in the Dark Triad Model: Machiavellianism, psychopathy, and narcissism. These three dark traits are often measured using a single long instrument for each of the traits. Nevertheless, there is a need for short and valid personality measures in psychological research. As an independent research group, we replicated the factor structure, convergent validity and item response for one of the most recent and widely used short measures to operationalize these malevolent traits, namely, Jonason's Dark Triad Dirty Dozen. We aimed to expand the understanding of what the Dirty Dozen really captures because of the mixed results on construct validity in previous research. Method. We used the largest sample to date to respond to the Dirty Dozen (N = 3,698). We firstly investigated the factor structure using Confirmatory Factor Analysis and an exploratory distribution analysis of the items in the Dirty Dozen. Secondly, using a sub-sample (n = 500) and correlation analyses, we investigated the convergent validity of the Dirty Dozen dark traits with Machiavellianism measured by the Mach-IV, psychopathy measured by Eysenck's Personality Questionnaire Revised, narcissism measured by the Narcissism Personality Inventory, and both neuroticism and extraversion from Eysenck's questionnaire. Finally, besides these Classical Test Theory analyses, we analyzed the responses for each Dirty Dozen item using Item Response Theory (IRT). Results. The results confirmed previous findings of a bi-factor model fit: one latent core dark trait and three dark traits. All three Dirty Dozen traits had a striking bi-modal distribution, which might indicate unconcealed social undesirability with the items. The three Dirty Dozen traits did converge too, although not strongly, with the contiguous single Dark Triad scales (r between .41 and .49). The probabilities of filling out steps on the Dirty Dozen narcissism-items were much higher than on the

  9. Development and validation of the Attitudes Towards Police Legitimacy Scale.

    Science.gov (United States)

    Reynolds, Joshua J; Estrada-Reynolds, Victoria; Nunez, Narina

    2018-04-01

    Although there is a substantial body of work examining attitudes towards the police, no measure has been developed to consistently capture citizens' beliefs regarding police legitimacy. Given that police conduct has garnered a great deal of attention, particularly in the last few years, the current research sought to develop a scale measuring perceptions of police legitimacy. Across multiple studies, items were created and the scale's factor structure explored (Study 1 and Study 2), the factor structure was confirmed (Study 3a), and the predictive validity of the scale was tested (Studies 3b-3d). Results provided evidence for a reliable and valid 34-item scale with a single-factor solution that predicted multiple outcomes, including justification of a police shooting (Study 3b) and resource allocation to a police charity (Study 3c), as well as correlations with self-reported criminal activity, right-wing authoritarianism, and social dominance orientation (Study 3d). We hope this scale will be useful in the study of police legitimacy, expanding the current literature, and improving police-community relations. (PsycINFO Database Record (c) 2018 APA, all rights reserved).

  10. Developing a Scale to Measure Content Knowledge and Pedagogy Content Knowledge of In-Service Elementary Teachers on Fractions

    Science.gov (United States)

    Kazemi, Farhad; Rafiepour, Abolfazl

    2018-01-01

    The main purpose of this study was to develop a scale for measuring content knowledge (CK) and pedagogy content knowledge (PCK) of in-service elementary teachers on mathematical fractions. Another aim of this study was to consider whether CK and PCK are separate from each other, or are in a single body. Therefore, a scale containing 22 items about…

  11. Comparison of Single and Multi-Scale Method for Leaf and Wood Points Classification from Terrestrial Laser Scanning Data

    Science.gov (United States)

    Wei, Hongqiang; Zhou, Guiyun; Zhou, Junjie

    2018-04-01

    The classification of leaf and wood points is an essential preprocessing step for extracting inventory measurements and canopy characterization of trees from terrestrial laser scanning (TLS) data. The geometry-based approach is one of the most widely used classification methods. In the geometry-based method, it is common practice to extract salient features at a single scale before the features are used for classification. It remains unclear how the scale(s) used affect the classification accuracy and efficiency. To assess the scale effect on the classification accuracy and efficiency, we extracted single-scale and multi-scale salient features from the point clouds of two oak trees of different sizes and classified the points as leaf or wood. Our experimental results show that the balanced accuracy of the multi-scale method is higher than the average balanced accuracy of the single-scale method by about 10 % for both trees. The average speed-up ratio of the single-scale classifiers over the multi-scale classifier is higher than 30 for each tree.
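
    Balanced accuracy, the metric this abstract reports, is the mean of the per-class recalls, which matters when leaf points greatly outnumber wood points. The sketch below is a generic illustration with made-up labels, not the authors' code or data; the function name and the toy label lists are hypothetical.

```python
def balanced_accuracy(y_true, y_pred, labels=("leaf", "wood")):
    """Mean of per-class recall over the given class labels."""
    recalls = []
    for label in labels:
        indices = [i for i, y in enumerate(y_true) if y == label]
        if not indices:
            raise ValueError(f"no true samples for class {label!r}")
        hits = sum(1 for i in indices if y_pred[i] == label)
        recalls.append(hits / len(indices))
    return sum(recalls) / len(recalls)

# Toy example: 8 leaf points and 2 wood points, classifier predicts "leaf" everywhere.
truth = ["leaf"] * 8 + ["wood"] * 2
guess = ["leaf"] * 10
plain = sum(t == g for t, g in zip(truth, guess)) / len(truth)  # plain accuracy
balanced = balanced_accuracy(truth, guess)                      # mean class recall
```

    In this toy case plain accuracy is 0.8 while balanced accuracy is 0.5, which shows why the balanced form is the more informative summary for imbalanced point clouds.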

  12. Differential item functioning of the UWES-17 in South Africa

    Directory of Open Access Journals (Sweden)

    Leanne Goliath-Yarde

    2011-11-01

    Research purpose: This study assesses the Differential Item Functioning (DIF) of the Utrecht Work Engagement Scale (UWES-17) for different South African cultural groups in a South African company. Motivation for the study: Organisations are using the UWES-17 more and more in South Africa to assess work engagement. Therefore, research evidence from psychologists or assessment practitioners on its DIF across different cultural groups is necessary. Research design, approach and method: The researchers conducted a Secondary Data Analysis (SDA) on the UWES-17 sample (n = 2429) that they obtained from a cross-sectional survey undertaken in a South African Information and Communication Technology (ICT) sector company (n = 24 134). Quantitative item data on the UWES-17 scale enabled the authors to address the research question. Main findings: The researchers found uniform and/or non-uniform DIF on five of the vigour items, four of the dedication items and two of the absorption items. This also showed possible Differential Test Functioning (DTF) on the vigour and dedication dimensions. Practical/managerial implications: Based on the DIF, the researchers suggested that organisations should not use the UWES-17 comparatively for different cultural groups or employment decisions in South Africa. Contribution/value add: The study provides evidence on DIF and possible DTF for the UWES-17. However, it also raises questions about possible interaction effects that need further investigation.

  13. Work environment impact scale: testing the psychometric properties of the Swedish version.

    Science.gov (United States)

    Ekbladh, Elin; Fan, Chia-Wei; Sandqvist, Jan; Hemmingsson, Helena; Taylor, Renée

    2014-01-01

    The Work Environment Impact Scale (WEIS) is an assessment that focuses on the fit between a person and his or her work environment. It is based on Kielhofner's Model of Human Occupation and designed to gather information on how clients experience their work environment. The aim of this study was to examine the psychometric properties of the Swedish version of the WEIS assessment instrument. In total, 95 ratings on the 17-item WEIS were obtained from a sample of clients with experience of sick leave due to different medical conditions. Rasch analysis was used to analyze the data. Overall, the WEIS items together cohered to form a single construct of increasingly challenging work environmental factors. The hierarchical ordering of the items along the continuum followed a logical and expected pattern, and the participants were validly measured by the scale. The three occupational therapists serving as raters validly used the scale, but demonstrated a relatively high rater separation index, indicating differences in rater severity. The findings provide evidence that the Swedish version of the WEIS is a psychometrically sound assessment across diagnoses and occupations, which can provide valuable information about experiences of work environment challenges.

  14. [Development and psychometric validation of the Brief Smartphone Addiction Scale (BSAS) with schoolchildren].

    Science.gov (United States)

    Csibi, Sándor; Demetrovics, Zsolt; Szabó, Attila

    2016-01-01

    Smartphone use among children increases continuously. A growing range of stimulating applications may trigger the risk of addiction. The aim of this study was to develop a brief, easy-to-use and score tool for screening children at risk for smartphone addiction. A 6-item agree-disagree Likert scale (6-point range), was developed on the basis of the 'components' model of addiction (Griffiths, 2005). The brief tool was administered to 441 Hungarian speaking schoolchildren (mean age=13.4 years, SD=2.22) along with the 26-item Smartphone Addiction Inventory (SPAI; Lin et al, 2014). Principal components analysis yielded a single component for the 6-item tool, which accounted for 52.38% of the total variance. The internal reliability of the scale was good (Cronbach's alpha=0.82). Content validity was confirmed by statistically significant differences between heavy and light users (p smartphone addiction inventory appears to be a valid and reliable tool for screening for mobile phone addiction among schoolchildren.

  15. Development and psychometric evaluation of the Thirst Distress Scale for patients with heart failure.

    Science.gov (United States)

    Waldréus, Nana; Jaarsma, Tiny; van der Wal, Martje Hl; Kato, Naoko P

    2018-03-01

    Patients with heart failure can experience thirst distress. However, there is no instrument to measure this in patients with heart failure. The aim of the present study was to develop the Thirst Distress Scale for patients with Heart Failure (TDS-HF) and to evaluate psychometric properties of the scale. The TDS-HF was developed to measure thirst distress in patients with heart failure. Face and content validity was confirmed using expert panels including patients and healthcare professionals. Data on the TDS-HF was collected from patients with heart failure at outpatient heart failure clinics and hospitals in Sweden, the Netherlands and Japan. Psychometric properties were evaluated using data from 256 heart failure patients (age 72±11 years). Concurrent validity of the scale was assessed using a thirst intensity visual analogue scale. Patients did not have any difficulties answering the questions, and time taken to answer the questions was about five minutes. Factor analysis of the scale showed one factor. After psychometric testing, one item was deleted. For the eight item TDS-HF, a single factor explained 61% of the variance and Cronbach's alpha was 0.90. The eight item TDS-HF was significantly associated with the thirst intensity score ( r=0.55, pfailure.

  16. [Development of a cell phone addiction scale for Korean adolescents].

    Science.gov (United States)

    Koo, Hyun Young

    2009-12-01

    This study was done to develop a cell phone addiction scale for Korean adolescents. The process included construction of a conceptual framework, generation of initial items, verification of content validity, selection of secondary items, preliminary study, and extraction of final items. The participants were 577 adolescents in two middle schools and three high schools. Item analysis, factor analysis, criterion related validity, and internal consistency were used to analyze the data. Twenty items were selected for the final scale, and categorized into 3 factors explaining 55.45% of total variance. The factors were labeled as withdrawal/tolerance (7 items), life dysfunction (6 items), and compulsion/persistence (7 items). The scores for the scale were significantly correlated with self-control, impulsiveness, and cell phone use. Cronbach's alpha coefficient for the 20 items was .92. Scale scores identified students as cell phone addicted, heavy users, or average users. The above findings indicate that the cell phone addiction scale has good validity and reliability when used with Korean adolescents.
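
    Several records in this list report internal consistency as Cronbach's alpha. As a reminder of what that coefficient computes, the sketch below uses the standard formula alpha = k/(k-1) * (1 - sum of item variances / variance of total scores). The response matrices are invented for illustration and are unrelated to any study listed here.

```python
from statistics import variance

def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical responses: identical items give alpha = 1, noisier items give less.
identical = [[1, 1, 1], [2, 2, 2], [4, 4, 4], [5, 5, 5]]
noisy = [[1, 2, 1], [2, 2, 3], [4, 3, 4], [5, 5, 4]]
alpha_identical = cronbach_alpha(identical)
alpha_noisy = cronbach_alpha(noisy)
```

    Alpha rises with the number of items as well as with their intercorrelation, so a value such as the .92 reported for this 20-item scale is not directly comparable with values from much shorter scales.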

  17. Refining a self-assessment of informatics competency scale using Mokken scaling analysis.

    Science.gov (United States)

    Yoon, Sunmoo; Shaffer, Jonathan A; Bakken, Suzanne

    2015-01-01

    Healthcare environments are increasingly implementing health information technology (HIT) and those from various professions must be competent to use HIT in meaningful ways. In addition, HIT has been shown to enable interprofessional approaches to health care. The purpose of this article is to describe the refinement of the Self-Assessment of Nursing Informatics Competencies Scale (SANICS) using analytic techniques based upon item response theory (IRT) and discuss its relevance to interprofessional education and practice. In a sample of 604 nursing students, the 93-item version of SANICS was examined using non-parametric IRT. The iterative modeling procedure included 31 steps comprising: (1) assessing scalability, (2) assessing monotonicity, (3) assessing invariant item ordering, and (4) expert input. SANICS was reduced to an 18-item hierarchical scale with excellent reliability. Fundamental skills for team functioning and shared decision making among team members (e.g. "using monitoring systems appropriately," "describing general systems to support clinical care") had the highest level of difficulty, and "demonstrating basic technology skills" had the lowest difficulty level. Most items reflect informatics competencies relevant to all health professionals. Further, the approaches can be applied to construct a new hierarchical scale or refine an existing scale related to informatics attitudes or competencies for various health professions.

  18. Use of differential item functioning analysis to assess the equivalence of translations of a questionnaire

    NARCIS (Netherlands)

    Petersen, Morten Aa; Groenvold, Mogens; Bjorner, Jakob B.; Aaronson, Neil; Conroy, Thierry; Cull, Ann; Fayers, Peter; Hjermstad, Marianne; Sprangers, Mirjam; Sullivan, Marianne

    2003-01-01

    In cross-national comparisons based on questionnaires, accurate translations are necessary to obtain valid results. Differential item functioning (DIF) analysis can be used to test whether translations of items in multi-item scales are equivalent to the original. In data from 10,815 respondents

  19. Evaluating an Automated Number Series Item Generator Using Linear Logistic Test Models

    Directory of Open Access Journals (Sweden)

    Bao Sheng Loe

    2018-04-01

    This study investigates the item properties of a newly developed Automatic Number Series Item Generator (ANSIG). The foundation of the ANSIG is based on five hypothesised cognitive operators. Thirteen item models were developed using the numGen R package and eleven were evaluated in this study. The 16-item ICAR (International Cognitive Ability Resource) short form ability test was used to evaluate construct validity. The Rasch Model and two Linear Logistic Test Models (LLTM) were employed to estimate and predict the item parameters. Results indicate that a single factor determines the performance on tests composed of items generated by the ANSIG. Under the LLTM approach, all the cognitive operators were significant predictors of item difficulty. Moderate to high correlations were evident between the number series items and the ICAR test scores, with high correlation found for the ICAR Letter-Numeric-Series type items, suggesting adequate nomothetic span. Extended cognitive research is, nevertheless, essential for the automatic generation of an item pool with predictable psychometric properties.

  20. Using Linear Equating to Map PROMIS(®) Global Health Items and the PROMIS-29 V2.0 Profile Measure to the Health Utilities Index Mark 3.

    Science.gov (United States)

    Hays, Ron D; Revicki, Dennis A; Feeny, David; Fayers, Peter; Spritzer, Karen L; Cella, David

    2016-10-01

    Preference-based health-related quality of life (HR-QOL) scores are useful as outcome measures in clinical studies, for monitoring the health of populations, and for estimating quality-adjusted life-years. This was a secondary analysis of data collected in an internet survey as part of the Patient-Reported Outcomes Measurement Information System (PROMIS(®)) project. To estimate Health Utilities Index Mark 3 (HUI-3) preference scores, we used the ten PROMIS(®) global health items, the PROMIS-29 V2.0 single pain intensity item and seven multi-item scales (physical functioning, fatigue, pain interference, depressive symptoms, anxiety, ability to participate in social roles and activities, sleep disturbance), and the PROMIS-29 V2.0 items. Linear regression analyses were used to identify significant predictors, followed by simple linear equating to avoid regression to the mean. The regression models explained 48 % (global health items), 61 % (PROMIS-29 V2.0 scales), and 64 % (PROMIS-29 V2.0 items) of the variance in the HUI-3 preference score. Linear equated scores were similar to observed scores, although differences tended to be larger for older study participants. HUI-3 preference scores can be estimated from the PROMIS(®) global health items or PROMIS-29 V2.0. The estimated HUI-3 scores from the PROMIS(®) health measures can be used for economic applications and as a measure of overall HR-QOL in research.
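
    The abstract contrasts regression prediction with simple linear equating. Linear equating maps a score x onto the target scale as mean_Y + (sd_Y / sd_X) * (x - mean_X), so the equated scores keep the target's mean and spread, whereas regression predictions are pulled toward the mean. The sketch below illustrates this with invented numbers; it is not the authors' procedure or data, and all names are hypothetical.

```python
from statistics import mean, pstdev

def linear_equate(x_scores, y_scores):
    """Return a function that places an X score on the Y scale by linear equating."""
    mean_x, mean_y = mean(x_scores), mean(y_scores)
    sd_x, sd_y = pstdev(x_scores), pstdev(y_scores)
    if sd_x == 0:
        raise ValueError("X scores have zero variance")
    slope = sd_y / sd_x
    return lambda x: mean_y + slope * (x - mean_x)

# Invented values: regression-predicted scores are less spread out than observed ones.
predicted = [0.55, 0.60, 0.70, 0.75, 0.80]
observed = [0.30, 0.55, 0.70, 0.85, 1.00]
to_observed_scale = linear_equate(predicted, observed)
equated = [to_observed_scale(x) for x in predicted]
```

    After equating, the mean and standard deviation of the equated scores match those of the observed scores, which is the property that counteracts regression to the mean.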

  1. Scaling Irrational Beliefs in the General Attitude and Belief Scale

    Directory of Open Access Journals (Sweden)

    Lindsay R. Owings

    2013-04-01

    Accurate measurement of key constructs is essential to the continued development of Rational-Emotive Behavior Therapy (REBT). The General Attitude and Belief Scale (GABS), a contemporary inventory of rational and irrational beliefs based on current REBT theory, is one of the most valid and widely used instruments available, and recent research has continued to improve its psychometric standing. In this study of 544 students, item response theory (IRT) methods were used (a) to identify the most informative item in each irrational subscale of the GABS, (b) to determine the level of irrationality represented by each of those items, and (c) to suggest a condensed form of the GABS for further study with clinical populations. Administering only the most psychometrically informative items to clients could result in economies of time and effort. Further research based on the scaling of items could clarify the specific patterns of irrational beliefs associated with particular clinical syndromes.

  2. Item level diagnostics and model - data fit in item response theory ...

    African Journals Online (AJOL)

    Item response theory (IRT) is a framework for modeling and analyzing item response data. Item-level modeling gives IRT advantages over classical test theory. The fit of an item score pattern to an item response theory (IRT) model is a necessary condition that must be assessed for further use of item and models that best fit ...

  3. Negative Symptom Dimensions of the Positive and Negative Syndrome Scale Across Geographical Regions: Implications for Social, Linguistic, and Cultural Consistency.

    Science.gov (United States)

    Khan, Anzalee; Liharska, Lora; Harvey, Philip D; Atkins, Alexandra; Ulshen, Daniel; Keefe, Richard S E

    2017-12-01

    Objective: Recognizing the discrete dimensions that underlie negative symptoms in schizophrenia and how these dimensions are understood across localities might result in better understanding and treatment of these symptoms. To this end, the objectives of this study were to 1) identify the Positive and Negative Syndrome Scale negative symptom dimensions of expressive deficits and experiential deficits and 2) analyze performance on these dimensions over 15 geographical regions to determine whether the items defining them manifest similar reliability across these regions. Design: Data were obtained for the baseline Positive and Negative Syndrome Scale visits of 6,889 subjects across 15 geographical regions. Using confirmatory factor analysis, we examined whether a two-factor negative symptom structure that is found in schizophrenia (experiential deficits and expressive deficits) would be replicated in our sample, and using differential item functioning, we tested the degree to which specific items from each negative symptom subfactor performed across geographical regions in comparison with the United States. Results: The two-factor negative symptom solution was replicated in this sample. Most geographical regions showed moderate-to-large differential item functioning for Positive and Negative Syndrome Scale expressive deficit items, especially N3 Poor Rapport, as compared with Positive and Negative Syndrome Scale experiential deficit items, showing that these items might be interpreted or scored differently in different regions. Across countries, except for India, the differential item functioning values did not favor raters in the United States. Conclusion: These results suggest that the Positive and Negative Syndrome Scale negative symptom factor can be better represented by a two-factor model than by a single-factor model. Additionally, the results show significant differences in responses to items representing the Positive and Negative Syndrome Scale expressive

  4. Linguistic Simplification of Mathematics Items: Effects for Language Minority Students in Germany

    Science.gov (United States)

    Haag, Nicole; Heppt, Birgit; Roppelt, Alexander; Stanat, Petra

    2015-01-01

    In large-scale assessment studies, language minority students typically obtain lower test scores in mathematics than native speakers. Although this performance difference was related to the linguistic complexity of test items in some studies, other studies did not find linguistically demanding math items to be disproportionally more difficult for…

  5. Beyond the Shadow of a Trait: Understanding Discounting through Item-Level Analysis of Personality Scales

    Science.gov (United States)

    Charlton, Shawn R.; Gossett, Bradley D.; Charlton, Veda A.

    2011-01-01

    Temporal discounting, the loss in perceived value associated with delayed outcomes, correlates with a number of personality measures, suggesting that an item-level analysis of trait measures might provide a more detailed understanding of discounting. The current report details two studies that investigate the utility of such an item-level…

  6. Atomic-scale structure of single-layer MoS2 nanoclusters

    DEFF Research Database (Denmark)

    Helveg, S.; Lauritsen, J. V.; Lægsgaard, E.

    2000-01-01

    We have studied using scanning tunneling microscopy (STM) the atomic-scale realm of molybdenum disulfide (MoS2) nanoclusters, which are of interest as a model system in hydrodesulfurization catalysis. The STM gives the first real space images of the shape and edge structure of single-layer MoS2...

  7. Preliminary Study of Single-Phase Natural Circulation for Lab-scaled Molten Salt Application

    Energy Technology Data Exchange (ETDEWEB)

    Shin, Yukyung; Kang, Sarah; Kim, In Guk; Seo, Seok Bin; Bang, In Cheol [UNIST, Ulsan (Korea, Republic of); Park, Seong Dae [Korea Atomic Energy Research Institute, Daejeon (Korea, Republic of)

    2015-10-15

    Advanced reactors such as the MSR (FHR), VHTR and AHTR utilize molten salt as a coolant for efficiency and safety; compared with liquid metal, it has advantages in higher heat capacity, lower pumping power and scale. It is therefore increasingly necessary to study the characteristics of molten salt. However, requirements such as a high operating temperature, a large-scale facility and the prevention of solidification make the necessary experimental conditions difficult to satisfy. A simulant fluid was therefore used with a scaling method for a lab-scale experiment. A scaled experiment enables the simulant fluid to reproduce the fluid mechanics and heat transfer behavior of molten salt at a lower operating temperature and a reduced scale. In this paper, as a proof test of the scaled experiment, a simplified single-phase natural circulation loop was designed at lab scale and applied to the passive safety system of an advanced reactor in which molten salt is considered the major coolant of the system. For application to the improved safety system, the prototype was based on the primary loop of the test-scale DRACS, the main passive safety system in the FHR, developed at the OSU. As a preliminary experiment, single-phase natural circulation under low power was performed. DOWTHERM A and DOWTHERM RP were selected as simulant candidates. A feasibility study with the simulant was then conducted based on the scaling law for heat transfer characteristics and geometric parameters. Additionally, simulations with the MARS code and ANSYS-CFX under the same natural circulation conditions were carried out as verification. For accurate code simulation, thermo-physical properties of DOWTHERM A and RP were developed and implemented in the MARS code. In this study, a single-phase natural circulation experiment was performed with the simulant oil, DOWTHERM RP, based on the passive safety system of the FHR. The feasibility of a similarity experiment for molten salt with an oil simulant was confirmed by the scaling method. In addition, simulation with two

  8. Comparing short forms of the Social Interaction Anxiety Scale and the Social Phobia Scale.

    Science.gov (United States)

    Carleton, R Nicholas; Thibodeau, Michel A; Weeks, Justin W; Teale Sapach, Michelle J N; McEvoy, Peter M; Horswill, Samantha C; Heimberg, Richard G

    2014-12-01

    The Social Interaction Anxiety Scale (SIAS) and the Social Phobia Scale (SPS; Mattick & Clarke, 1998) are companion scales developed to measure anxiety in social interaction and performance situations, respectively. The measures have strong discriminant and convergent validity; however, their factor structures remain debated, and furthermore, the combined administration length (i.e., 39 items) can be prohibitive for some settings. There have been 4 attempts to assess the factor structures of the scales and reduce the item content: the 14-item Social Interaction Phobia Scale (SIPS; Carleton et al., 2009), the 12-item SIAS-6/SPS-6 (Peters, Sunderland, Andrews, Rapee, & Mattick, 2012), the 21-item abbreviated SIAS/SPS (ASIAS/ASPS; Kupper & Denollet, 2012), and the 12-item Readability SIAS and SPS (RSIAS/RSPS; Fergus, Valentiner, McGrath, Gier-Lonsway, & Kim, 2012). The current study compared the short forms on (a) factor structure, (b) ability to distinguish between clinical and non-clinical populations, (c) sensitivity to change following therapy, and (d) convergent validity with related measures. Participants included 3,607 undergraduate students (55% women) and 283 patients with social anxiety disorder (43% women). Results of confirmatory factor analyses, sensitivity analyses, and correlation analyses support the robust utility of items in the SIPS and the SPS-6 and SIAS-6 relative to the other short forms; furthermore, the SIPS and the SPS-6 and SIAS-6 were also supported by convergent validity analyses within the undergraduate sample. The RSIAS/RSPS and the ASIAS/ASPS were least supported, based on the current results and the principle of parsimony. Accordingly, researchers and clinicians should consider carefully which of the short forms will best suit their needs. (c) 2014 APA, all rights reserved.

  9. Development of the Abbreviated Masculine Gender Role Stress Scale.

    Science.gov (United States)

    Swartout, Kevin M; Parrott, Dominic J; Cohn, Amy M; Hagman, Brett T; Gallagher, Kathryn E

    2015-06-01

    Data gathered from 6 independent samples (n = 1,729) that assessed men's masculine gender role stress in college and community males were aggregated and used to determine the reliability and validity of an abbreviated version of the Masculine Gender Role Stress (MGRS) Scale. The 15 items with the highest item-to-total scale correlations were used to create an abbreviated MGRS Scale. Psychometric properties of each of the 15 items were examined with item response theory (IRT) analysis, using the discrimination and threshold parameters. IRT results showed that the abbreviated scale may hold promise in capturing the same amount of information as the full 40-item scale. Relative to the 40-item scale, the total score of the abbreviated MGRS Scale demonstrated comparable convergent validity using the measurement domains of masculine identity, hypermasculinity, trait anger, anger expression, and alcohol involvement. An abbreviated MGRS Scale may be recommended for use in clinical practice and research settings to reduce cost, time, and patient/participant burden. Additionally, IRT analyses identified items with higher discrimination and threshold parameters that may be used to screen for problematic gender role stress in men who may be seen in routine clinical or medical practice. (c) 2015 APA, all rights reserved.

  10. Large-scale single-chirality separation of single-wall carbon nanotubes by simple gel chromatography

    Science.gov (United States)

    Liu, Huaping; Nishide, Daisuke; Tanaka, Takeshi; Kataura, Hiromichi

    2011-01-01

    Monostructured single-wall carbon nanotubes (SWCNTs) are important in both scientific research and electronic and biomedical applications; however, the bulk separation of SWCNTs into populations of single-chirality nanotubes remains challenging. Here we report a simple and effective method for the large-scale chirality separation of SWCNTs using a single-surfactant multicolumn gel chromatography method utilizing one surfactant and a series of vertically connected gel columns. This method is based on the structure-dependent interaction strength of SWCNTs with an allyl dextran-based gel. Overloading an SWCNT dispersion on the top column results in the adsorption sites of the column becoming fully occupied by the nanotubes that exhibit the strongest interaction with the gel. The unbound nanotubes flow through to the next column, and the nanotubes with the second strongest interaction with the gel are adsorbed in this stage. In this manner, 13 different (n, m) species were separated. Metallic SWCNTs were finally collected as unbound nanotubes because they exhibited the lowest interaction with the gel. PMID:21556063

  11. A single gene (yes) controls pigmentation of eyes and scales in Heliothis virescens

    Directory of Open Access Journals (Sweden)

    Thomas M. Brown

    2001-02-01

    Full Text Available A yellow-eyed mutant was discovered in a strain of Heliothis virescens, the tobacco budworm, that already exhibited a mutation for yellow scale, y. We investigated the inheritance of these visible mutations as candidate markers for transgenesis. Yellow eye was controlled by a single, recessive, autosomal factor, the same type of inheritance previously known for y. Presence of the recombinant mutants with yellow scales with wild type eyes in test crosses indicated independent segregation of genes for these traits. The recombinant class with wild type scales and yellow eyes was completely absent and there was a corresponding increase of the double mutant parental class having yellow scales and yellow eyes. These results indicated that a single factor for yellow eye also controls yellow scales independently of y. This gene was named yes, for yellow eye and scale. We hypothesize that yes controls both eye and scale color through a deficiency in transport of pigment precursors in both the ommochrome and melanin pathways. The unlinked gene y likely controls an enzyme affecting the melanin pathway only. Both y and yes segregated independently of AceIn, acetylcholinesterase insensitivity, and sodium channel hscp, which are genes related to insecticide resistance.

  12. Inventory control in multi-item production systems

    NARCIS (Netherlands)

    Bruin, J.

    2010-01-01

    This thesis focusses on the analysis and construction of control policies in multi-item production systems. In such systems, multiple items can be made to stock, but they have to share the finite capacity of a single machine. This machine can only produce one unit at a time and if it is set-up for

  13. HIV/AIDS knowledge among men who have sex with men: applying the item response theory.

    Science.gov (United States)

    Gomes, Raquel Regina de Freitas Magalhães; Batista, José Rodrigues; Ceccato, Maria das Graças Braga; Kerr, Lígia Regina Franco Sansigolo; Guimarães, Mark Drew Crosland

    2014-04-01

    To evaluate the level of HIV/AIDS knowledge among men who have sex with men in Brazil using the latent trait model estimated by Item Response Theory. Multicenter, cross-sectional study, carried out in ten Brazilian cities between 2008 and 2009. Adult men who have sex with men were recruited (n = 3,746) through Respondent Driven Sampling. HIV/AIDS knowledge was ascertained through ten statements by face-to-face interview and latent scores were obtained through two-parameter logistic modeling (difficulty and discrimination) using Item Response Theory. Differential item functioning was used to examine each item characteristic curve by age and schooling. Overall, the HIV/AIDS knowledge scores using Item Response Theory did not exceed 6.0 (scale 0-10), with mean and median values of 5.0 (SD = 0.9) and 5.3, respectively, with 40.7% of the sample with knowledge levels below the average. Some beliefs still exist in this population regarding the transmission of the virus by insect bites, by using public restrooms, and by sharing utensils during meals. With regard to the difficulty and discrimination parameters, eight items were located below the mean of the scale and were considered very easy, and four items presented very low discrimination parameter (items contributed to the inaccuracy of the measurement of knowledge among those with median level and above. Item Response Theory analysis, which focuses on the individual properties of each item, allows measures to be obtained that do not vary or depend on the questionnaire, which provides better ascertainment and accuracy of knowledge scores. Valid and reliable scales are essential for monitoring HIV/AIDS knowledge among the men who have sex with men population over time and in different geographic regions, and this psychometric model brings this advantage.
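The two-parameter logistic model named in the abstract above can be illustrated with a minimal sketch. The function name `icc_2pl` and the parameter values are assumptions chosen for illustration; they are not taken from the study.

```python
import math


def icc_2pl(theta, a, b):
    """Item characteristic curve of a two-parameter logistic (2PL) model.

    theta: latent trait level of the respondent (e.g. knowledge)
    a:     discrimination parameter of the item
    b:     difficulty parameter of the item
    Returns the modelled probability of a correct (or endorsed) response.
    """
    return 1.0 / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical items: one easy and weakly discriminating, one harder and sharper.
easy_flat = {"a": 0.4, "b": -1.5}
hard_sharp = {"a": 2.0, "b": 0.5}

for theta in (-2.0, 0.0, 2.0):
    p_easy = icc_2pl(theta, **easy_flat)
    p_hard = icc_2pl(theta, **hard_sharp)
    print(f"theta={theta:+.1f}  easy/flat={p_easy:.2f}  hard/sharp={p_hard:.2f}")
```

An item with low difficulty is answered correctly by most respondents, and an item with low discrimination separates respondents poorly because its curve rises slowly across the trait range.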

  14. Development of a Facebook Addiction Scale.

    Science.gov (United States)

    Andreassen, Cecilie Schou; Torsheim, Torbjørn; Brunborg, Geir Scott; Pallesen, Ståle

    2012-04-01

    The Bergen Facebook Addiction Scale (BFAS), initially a pool of 18 items, three reflecting each of the six core elements of addiction (salience, mood modification, tolerance, withdrawal, conflict, and relapse), was constructed and administered to 423 students together with several other standardized self-report scales (Addictive Tendencies Scale, Online Sociability Scale, Facebook Attitude Scale, NEO-FFI, BIS/BAS scales, and Sleep questions). Within each of the six addiction elements, the item with the highest corrected item-total correlation was retained in the final scale. The factor structure of the scale was good (RMSEA = .046, CFI = .99) and coefficient alpha was .83. The 3-week test-retest reliability coefficient was .82. The scores converged with scores for other scales of Facebook activity. Also, they were positively related to Neuroticism and Extraversion, and negatively related to Conscientiousness. High scores on the new scale were associated with delayed bedtimes and rising times.
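The item-selection rule described above (keep the item with the highest corrected item-total correlation) can be sketched as follows. This is an illustration only: the helper names and the tiny response matrix are hypothetical, and the sketch assumes the "corrected" total is the sum of all other items in the pool, which the abstract does not spell out.

```python
from statistics import mean


def pearson(x, y):
    """Pearson correlation between two equally long lists of numbers."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def corrected_item_total(responses):
    """Correlate each item with the total score of the remaining items.

    responses: list of respondent rows, each a list of item scores.
    """
    n_items = len(responses[0])
    correlations = []
    for j in range(n_items):
        item = [row[j] for row in responses]
        rest = [sum(row) - row[j] for row in responses]  # total without item j
        correlations.append(pearson(item, rest))
    return correlations


def best_item(responses, candidate_indices):
    """Index of the candidate item with the highest corrected correlation."""
    r = corrected_item_total(responses)
    return max(candidate_indices, key=lambda j: r[j])


# Hypothetical responses: 5 respondents x 3 items (not data from the study).
example_responses = [
    [1, 2, 5],
    [2, 2, 1],
    [3, 4, 4],
    [4, 4, 2],
    [5, 5, 3],
]
print(corrected_item_total(example_responses))
print(best_item(example_responses, [0, 1, 2]))
```

Removing the item from the total before correlating avoids inflating the correlation with the item's own variance.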

  15. The 10-item Remembered Relationship with Parents (RRP10) scale: two-factor model and association with adult depressive symptoms.

    Science.gov (United States)

    Denollet, Johan; Smolderen, Kim G E; van den Broek, Krista C; Pedersen, Susanne S

    2007-06-01

    Dysfunctional parenting styles are associated with poor mental and physical health. The 10-item Remembered Relationship with Parents (RRP(10)) scale retrospectively assesses Alienation (dysfunctional communication and intimacy) and Control (overprotection by parents), with an emphasis on deficiencies in empathic parenting. We examined the 2-factor structure of the RRP(10) and its relationship with adult depression. A total of 664 respondents from the general population (48% men, mean age 54.6 ± 14.2 years) completed the RRP(10), Parental Bonding Instrument (PBI), and Beck Depression Inventory. The Alienation and Control dimensions of the RRP(10) displayed a sound factor structure, good internal consistency (Cronbach's alpha = 0.83-0.86), and convergent validity against the PBI scales. No significant gender differences were found on the RRP(10) scales. Stratifying by RRP(10) dimensions showed that respondents high in Alienation and Control, for both father (33.3% vs. 14.5%, pparental Alienation and Control. High Alienation and Control were independently related to increased risk of depressive symptoms. Given the brevity of the RRP(10), it can easily be used in epidemiological/clinical research on the link between the remembered relationship with parents and mental/physical health.

  16. The (mis)measurement of the Dark Triad Dirty Dozen: exploitation at the core of the scale

    Directory of Open Access Journals (Sweden)

    Petri J. Kajonius

    2016-03-01

    Full Text Available Background. The dark side of human character has been conceptualized in the Dark Triad Model: Machiavellianism, psychopathy, and narcissism. These three dark traits are often measured using single long instruments for each one of the traits. Nevertheless, there is a necessity of short and valid personality measures in psychological research. As an independent research group, we replicated the factor structure, convergent validity and item response for one of the most recent and widely used short measures to operationalize these malevolent traits, namely, Jonason’s Dark Triad Dirty Dozen. We aimed to expand the understanding of what the Dirty Dozen really captures because of the mixed results on construct validity in previous research. Method. We used the largest sample to date to respond to the Dirty Dozen (N = 3,698). We firstly investigated the factor structure using Confirmatory Factor Analysis and an exploratory distribution analysis of the items in the Dirty Dozen. Secondly, using a sub-sample (n = 500) and correlation analyses, we investigated the Dirty Dozen dark traits' convergent validity to Machiavellianism measured by the Mach-IV, psychopathy measured by Eysenck’s Personality Questionnaire Revised, narcissism using the Narcissism Personality Inventory, and both neuroticism and extraversion from the Eysenck’s questionnaire. Finally, besides these Classical Test Theory analyses, we analyzed the responses for each Dirty Dozen item using Item Response Theory (IRT). Results. The results confirmed previous findings of a bi-factor model fit: one latent core dark trait and three dark traits. All three Dirty Dozen traits had a striking bi-modal distribution, which might indicate unconcealed social undesirability with the items. The three Dirty Dozen traits did converge too, although not strongly, with the contiguous single Dark Triad scales (r between .41 and .49). The probabilities of filling out steps on the Dirty Dozen narcissism-items were

  17. Cross-cultural adaptation and validation of the Danish consensus version of the 10-item Perceived Stress Scale

    DEFF Research Database (Denmark)

    Eskildsen, Anita; Dalgaard, Vita Ligaya; Nielsen, Kent Jacob

    2015-01-01

    OBJECTIVES: The aims of the present study were to (i) cross-culturally adapt a Danish consensus version of the 10-item Perceived Stress Scale (PSS-10) and (ii) evaluate its psychometric properties in terms of agreement, reliability, validity, responsiveness, and interpretability among patients with work-related stress complaints. METHODS: A consensus-building process was performed involving the authors of the three previous Danish translations and the consensus version was back-translated into English and pilot-tested. Psychometric properties of the final version were examined in a sample of 64 patients with work-related stress complaints. RESULTS: The face validity, reliability, and internal consistency of the Danish consensus version of the PSS-10 were satisfactory, and convergent construct validity was confirmed. Receiver operating characteristic (ROC) curves of the change scores showed...

  18. Detecting treatment effects with combinations of the ADAS-cog items in patients with mild and moderate Alzheimer's disease.

    Science.gov (United States)

    Ihl, Ralf; Ferris, Steven; Robert, Philippe; Winblad, Bengt; Gauthier, Serge; Tennigkeit, Frank

    2012-01-01

    When complex cognitive functions are measured with multi-item scales like the Alzheimer's Disease Assessment Scale - cognitive subscale (ADAS-cog), valuable information may be lost when the ADAS-cog item results are combined into a total score. We hypothesized that an analysis of the results of different ADAS-cog item combinations may reveal drug treatment effects in distinct cognitive domains and/or enhance the sensitivity to detect such treatment effects. Here, we present a novel approach called 'subsetting analysis' for assessment of drug treatment effects with multi-item scales, like the ADAS-cog. The subsetting approach is a mathematical algorithm designed to select and group scale items in a subset detecting drug treatment effects in a particular study population. The approach was applied in a post-hoc analysis of ADAS-cog results from two randomized, placebo-controlled and double-blind clinical trials with memantine in mild to moderate Alzheimer's disease (AD). The subsetting analysis of the ADAS-cog combined database aimed at selecting the scale items showing no worsening at study end compared to baseline due to memantine treatment in patients with mild AD (Mini-Mental State Examination [MMSE] > 19). Two ADAS-cog subsets were finally revealed by the analysis: a subset of five ADAS-cog items, identified as most sensitive to memantine effects in mild AD patients, and a subset of six ADAS-cog items shown to detect significant memantine effects in moderate AD patients. The subsetting approach of analyzing ADAS-cog data is a powerful alternative for gaining information about drug effects on cognitive performance in mild and moderate AD patients. Copyright © 2011 John Wiley & Sons, Ltd.

  19. Ordinal-To-Interval Scale Conversion Tables and National Items for the New Zealand Version of the WHOQOL-BREF.

    Directory of Open Access Journals (Sweden)

    Christian U Krägeloh

    Full Text Available The World Health Organisation Quality of Life (WHOQOL) questionnaires are widely used around the world and can claim strong cross-cultural validity due to their development in collaboration with international field centres. To enhance conceptual equivalence of quality of life across cultures, optional national items are often developed for use alongside the core instrument. The present study outlines the development of national items for the New Zealand WHOQOL-BREF. Focus groups with members of the community as well as health experts discussed what constitutes quality of life in their opinion. Based on themes extracted of aspects not contained in the existing WHOQOL instrument, 46 candidate items were generated and subsequently rated for their importance by a random sample of 585 individuals from the general population. Applying importance criteria reduced these items to 24, which were then sent to another large random sample (n = 808) to be rated alongside the existing WHOQOL-BREF. A final set of five items met the criteria for national items. Confirmatory factor analysis identified four national items as belonging to the psychological domain of quality of life, and one item to the social domain. Rasch analysis validated these results and generated ordinal-to-interval conversion algorithms to allow use of parametric statistics for domain scores with and without national items.

  20. Item Response Theory at Subject- and Group-Level. Research Report 90-1.

    Science.gov (United States)

    Tobi, Hilde

    This paper reviews the literature about item response models for the subject level and aggregated level (group level). Group-level item response models (IRMs) are used in the United States in large-scale assessment programs such as the National Assessment of Educational Progress and the California Assessment Program. In the Netherlands, these…

  1. Measuring Narcissism within Add Health: The Development and Validation of a New Scale

    Science.gov (United States)

    Davis, Mark S.; Brunell, Amy B.

    2012-01-01

    This study reports the development of a measure of narcissism within the National Longitudinal Study of Adolescent Health (Add Health) data set. In Study 1, items were selected from Wave III to form the Add Health Narcissism Scale (AHNS). These were factor analyzed, yielding a single factor comprised of five subscales. We correlated the AHNS and…

  2. Delving Deep into Multiscale Pedestrian Detection via Single Scale Feature Maps

    Directory of Open Access Journals (Sweden)

    Xinchuan Fu

    2018-04-01

    Full Text Available The standard pipeline in pedestrian detection slides a pedestrian model over an image feature pyramid to detect pedestrians at different scales. In this pipeline, feature pyramid construction is time-consuming and becomes the bottleneck for fast detection. Recently, a method called multiresolution filtered channels (MRFC) was proposed, which uses only single-scale feature maps to achieve fast detection. However, MRFC has two shortcomings that limit its accuracy. One is that the receptive field correspondence across different scales is weak. The other is that the features used are not scale invariant. In this paper, two solutions are proposed to address these two shortcomings. Specifically, scale-aware pooling is proposed to improve receptive field correspondence, and a soft decision tree is proposed to relieve the scale variance problem. When coupled with an efficient sliding window classification strategy, our detector achieves fast detection speed together with state-of-the-art accuracy.

  3. Comparison of the psychometric properties of two balance scales in children with cerebral palsy.

    Science.gov (United States)

    Jeon, Yong-Jin; Kim, Gyoung-Mo

    2016-12-01

    [Purpose] The purpose of this study was to compare item difficulty between the Pediatric Balance Scale and the Fullerton Advanced Balance Scale for children with cerebral palsy. [Subjects and Methods] Forty children with cerebral palsy (male=17, female=23) voluntarily participated in the study. Item difficulty was expressed in the Rasch analysis using a logit value, with a higher value indicative of increasing item difficulty. [Results] Among the 24 items of the combined Pediatric Balance Scale and Fullerton Advanced Balance Scale, the most difficult item was "Walk with head turns", whereas the easiest item was "Sitting with back unsupported and feet supported on the floor". Among the 14 items of the Pediatric Balance Scale, 9 items (items 1, 2, 3, 4, 5, 6, 7, 11, and 12) had negative logit values, whereas for the Fullerton Advanced Balance Scale, only 1 item (item 1) had a negative logit value. [Conclusion] The Fullerton Advanced Balance Scale is a more appropriate tool to assess balance ability than the Pediatric Balance Scale in a group of higher functioning children with cerebral palsy.

  4. Screening for HIV-related PTSD: sensitivity and specificity of the 17-item Posttraumatic Stress Diagnostic Scale (PDS) in identifying HIV-related PTSD among a South African sample.

    Science.gov (United States)

    Martin, L; Fincham, D; Kagee, A

    2009-11-01

    The identification of HIV-positive patients who exhibit criteria for Posttraumatic Stress Disorder (PTSD) and related trauma symptomatology is of clinical importance in the maintenance of their overall wellbeing. This study assessed the sensitivity and specificity of the 17-item Posttraumatic Stress Diagnostic Scale (PDS), a self-report instrument, in the detection of HIV-related PTSD. An adapted version of the PTSD module of the Composite International Diagnostic Interview (CIDI) served as the gold standard. A total of 85 HIV-positive patients diagnosed with HIV within the year preceding data collection were recruited by means of convenience sampling from three HIV clinics within primary health care facilities in the Boland region of South Africa. A significant association was found between the 17-item PDS and the adapted PTSD module of the CIDI. A ROC curve analysis indicated that the 17-item PDS correctly discriminated between PTSD caseness and non-caseness 74.9% of the time. Moreover, a PDS cut-off point of ≥ 15 yielded adequate sensitivity (68%) and 1-specificity (65%). The 17-item PDS demonstrated a PPV of 76.0% and a NPV of 56.7%. The 17-item PDS can be used as a brief screening measure for the detection of HIV-related PTSD among HIV-positive patients in South Africa.
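The screening statistics reported above (sensitivity, specificity, PPV, NPV) all follow from a 2x2 table of screening result against gold standard. The sketch below shows the standard definitions; the function name and the counts are hypothetical and are not the study's data.

```python
def screening_metrics(tp, fp, fn, tn):
    """Standard screening statistics from a 2x2 confusion table.

    tp: screen positive, gold standard positive
    fp: screen positive, gold standard negative
    fn: screen negative, gold standard positive
    tn: screen negative, gold standard negative
    """
    return {
        "sensitivity": tp / (tp + fn),  # share of true cases detected
        "specificity": tn / (tn + fp),  # share of non-cases correctly cleared
        "ppv": tp / (tp + fp),          # chance a positive screen is a true case
        "npv": tn / (tn + fn),          # chance a negative screen is a non-case
    }


# Hypothetical counts for illustration only.
example = screening_metrics(tp=30, fp=10, fn=20, tn=40)
for name, value in example.items():
    print(f"{name}: {value:.3f}")
```

Sensitivity and specificity describe the instrument at a given cut-off, whereas PPV and NPV also depend on how common the condition is in the sample.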

  5. Do people with and without medical conditions respond similarly to the short health anxiety inventory? An assessment of differential item functioning using item response theory.

    Science.gov (United States)

    LeBouthillier, Daniel M; Thibodeau, Michel A; Alberts, Nicole M; Hadjistavropoulos, Heather D; Asmundson, Gordon J G

    2015-04-01

    Individuals with medical conditions are likely to have elevated health anxiety; however, research has not demonstrated how medical status impacts response patterns on health anxiety measures. Measurement bias can undermine the validity of a questionnaire by overestimating or underestimating scores in groups of individuals. We investigated whether the Short Health Anxiety Inventory (SHAI), a widely-used measure of health anxiety, exhibits medical condition-based bias on item and subscale levels, and whether the SHAI subscales adequately assess the health anxiety continuum. Data were from 963 individuals with diabetes, breast cancer, or multiple sclerosis, and 372 healthy individuals. Mantel-Haenszel tests and item characteristic curves were used to classify the severity of item-level differential item functioning in all three medical groups compared to the healthy group. Test characteristic curves were used to assess scale-level differential item functioning and whether the SHAI subscales adequately assess the health anxiety continuum. Nine out of 14 items exhibited differential item functioning. Two items exhibited differential item functioning in all medical groups compared to the healthy group. In both Thought Intrusion and Fear of Illness subscales, differential item functioning was associated with mildly deflated scores in medical groups with very high levels of the latent traits. Fear of Illness items poorly discriminated between individuals with low and very low levels of the latent trait. While individuals with medical conditions may respond differentially to some items, clinicians and researchers can confidently use the SHAI with a variety of medical populations without concern of significant bias. Copyright © 2015 Elsevier Inc. All rights reserved.
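The Mantel-Haenszel approach mentioned above compares two groups on one item after matching respondents on overall score. A minimal sketch of the Mantel-Haenszel common odds ratio follows; the function name, the stratum layout, and the counts are assumptions for illustration, and the sketch omits the significance test and the severity classification used in the study.

```python
def mantel_haenszel_odds_ratio(strata):
    """Mantel-Haenszel common odds ratio across matched strata.

    strata: iterable of (a, b, c, d) counts per score stratum, where
      a = reference group, item endorsed     b = reference group, not endorsed
      c = focal group, item endorsed         d = focal group, not endorsed
    A value near 1 suggests no differential item functioning on this item.
    """
    numerator = 0.0
    denominator = 0.0
    for a, b, c, d in strata:
        n = a + b + c + d
        if n == 0:
            continue  # skip empty strata
        numerator += a * d / n
        denominator += b * c / n
    return numerator / denominator


# Hypothetical counts for three total-score strata (not data from the study).
example_strata = [
    (20, 30, 15, 35),
    (30, 20, 25, 25),
    (40, 10, 35, 15),
]
print(round(mantel_haenszel_odds_ratio(example_strata), 3))
```

Stratifying by total score is what separates a genuine group difference in the trait from an item that behaves differently for respondents at the same trait level.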

  6. Reliability and validity of the visual analogue scale for disability in patients with chronic musculoskeletal pain

    NARCIS (Netherlands)

    Boonstra, Anne M.; Schiphorst Preuper, Henrica R.; Reneman, Michiel F.; Posthumus, Jitze B.; Stewart, Roy E.

    The objective of the study was to determine the reliability and concurrent validity of a visual analogue scale (VAS) for disability as a single-item instrument measuring disability in chronic pain patients. For the reliability study a test-retest design and for the validity study a cross-sectional

  7. Single Image Super-Resolution Based on Multi-Scale Competitive Convolutional Neural Network.

    Science.gov (United States)

    Du, Xiaofeng; Qu, Xiaobo; He, Yifan; Guo, Di

    2018-03-06

    Deep convolutional neural networks (CNNs) are successful in single-image super-resolution. Traditional CNNs are limited in their ability to exploit multi-scale contextual information for image reconstruction because of the fixed convolutional kernel in their building modules. To restore various scales of image details, we enhance the multi-scale inference capability of CNNs by introducing competition among multi-scale convolutional filters, and build up a shallow network under limited computational resources. The proposed network has the following two advantages: (1) the multi-scale convolutional kernel provides the multi-context for image super-resolution, and (2) the maximum competitive strategy adaptively chooses the optimal scale of information for image reconstruction. Our experimental results on image super-resolution show that the proposed network outperforms the state-of-the-art methods.

  8. The 7-item generalized anxiety disorder scale as a tool for measuring generalized anxiety in multiple sclerosis.

    Science.gov (United States)

    Terrill, Alexandra L; Hartoonian, Narineh; Beier, Meghan; Salem, Rana; Alschuler, Kevin

    2015-01-01

    Generalized anxiety disorder (GAD) is common in multiple sclerosis (MS) but understudied. Reliable and valid measures are needed to advance clinical care and expand research in this area. The objectives of this study were to examine the psychometric properties of the 7-item Generalized Anxiety Disorder Scale (GAD-7) in individuals with MS and to analyze correlates of GAD. Participants (N = 513) completed the anxiety module of the Patient Health Questionnaire (GAD-7). To evaluate psychometric properties of the GAD-7, the sample was randomly split to conduct exploratory and confirmatory factor analyses. Based on the exploratory factor analysis, a one-factor structure was specified for the confirmatory factor analysis, which showed excellent global fit to the data (χ²(12) = 15.17, P = .23, comparative fit index = 0.99, root mean square error of approximation = 0.03, standardized root mean square residual = 0.03). The Cronbach alpha (0.75) indicated acceptable internal consistency for the scale. Furthermore, the GAD-7 was highly correlated with the Hospital Anxiety and Depression Scale-Anxiety (r = 0.70). Age and duration of MS were both negatively associated with GAD. Higher GAD-7 scores were observed in women and individuals with secondary progressive MS. Individuals with higher GAD-7 scores also endorsed more depressive symptoms. These findings support the reliability and internal validity of the GAD-7 for use in MS. Correlational analyses revealed important relationships with demographics, disease course, and depressive symptoms, which suggest the need for further anxiety research.
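Cronbach's alpha, used above as the index of internal consistency, can be computed from item variances and the variance of the total score. The sketch below uses the standard formula with sample variances; the function name and the small response matrix are hypothetical and unrelated to the study's data.

```python
from statistics import variance


def cronbach_alpha(responses):
    """Cronbach's alpha for a respondents-by-items score matrix.

    responses: list of respondent rows, each a list of item scores.
    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total score))
    """
    k = len(responses[0])
    item_variances = [variance([row[j] for row in responses]) for j in range(k)]
    total_variance = variance([sum(row) for row in responses])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Hypothetical responses: 6 respondents x 4 items scored 0-3.
example_responses = [
    [0, 1, 0, 1],
    [1, 1, 2, 1],
    [2, 2, 1, 2],
    [3, 2, 3, 3],
    [1, 0, 1, 1],
    [2, 3, 2, 2],
]
print(round(cronbach_alpha(example_responses), 3))
```

Alpha rises when items covary strongly relative to their individual variances, which is why it is read as evidence that the items measure a common construct.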

  9. Exploratory factor analysis of the 12-item Functional Assessment of Chronic Illness Therapy-Spiritual Well-Being Scale in people newly diagnosed with advanced cancer.

    Science.gov (United States)

    Bai, Mei; Dixon, Jane K

    2014-01-01

    The purpose of this study was to reexamine the factor pattern of the 12-item Functional Assessment of Chronic Illness Therapy-Spiritual Well-Being Scale (FACIT-Sp-12) using exploratory factor analysis in people newly diagnosed with advanced cancer. Principal components analysis (PCA) and 3 common factor analysis methods were used to explore the factor pattern of the FACIT-Sp-12. Factorial validity was assessed in association with quality of life (QOL). Principal factor analysis (PFA), iterative PFA, and maximum likelihood suggested retrieving 3 factors: Peace, Meaning, and Faith. Both Peace and Meaning positively related to QOL, whereas only Peace uniquely contributed to QOL. This study supported the 3-factor model of the FACIT-Sp-12. Suggestions for revision of items and further validation of the identified factor pattern were provided.

  10. A Bifactor Multidimensional Item Response Theory Model for Differential Item Functioning Analysis on Testlet-Based Items

    Science.gov (United States)

    Fukuhara, Hirotaka; Kamata, Akihito

    2011-01-01

    A differential item functioning (DIF) detection method for testlet-based data was proposed and evaluated in this study. The proposed DIF model is an extension of a bifactor multidimensional item response theory (MIRT) model for testlets. Unlike traditional item response theory (IRT) DIF models, the proposed model takes testlet effects into…

  11. Heart rate detection from single-foot plantar bioimpedance measurements in a weighing scale.

    Science.gov (United States)

    Diaz, Delia H; Casas, Oscar; Pallas-Areny, Ramon

    2010-01-01

    Electronic bathroom scales are an easy-to-use, affordable means of measuring physiological parameters in addition to body weight. They have been proposed to obtain the ballistocardiogram (BCG) and derive from it the heart rate, cardiac output and systolic blood pressure. Therefore, weighing scales may suit intermittent monitoring in e-health and patient screening. Scales intended for bioelectrical impedance analysis (BIA) have also been proposed to estimate the heart rate by amplifying the pulsatile impedance component superimposed on the basal impedance. However, electronic weighing scales cannot easily obtain the BCG from people who have a single leg, and bioimpedance measurements between both feet are not recommended for people wearing a pacemaker or other electronic implants, or for pregnant women. We propose a method to detect the heart rate (HR) from bioimpedance measured in a single foot while standing on a bathroom weighing scale intended for BIA. The electrodes built into the weighing scale are used to apply a 50 kHz voltage between the outer electrode pair and to measure the drop in voltage across the inner electrode pair. The agreement with the HR simultaneously obtained from the ECG is excellent. We have also compared the drop in voltage across the waist and the thorax with that obtained when measuring bioimpedance between both feet to compare the possible risk of the proposed method to that of existing BIA scales.

  12. Gender-Based Differential Item Performance in Mathematics Achievement Items.

    Science.gov (United States)

    Doolittle, Allen E.; Cleary, T. Anne

    1987-01-01

    Eight randomly equivalent samples of high school seniors were each given a unique form of the ACT Assessment Mathematics Usage Test (ACTM). Signed measures of differential item performance (DIP) were obtained for each item in the eight ACTM forms. DIP estimates were analyzed and a significant item category effect was found. (Author/LMO)

  13. Psychometric evaluation of the shortened resilience scale among Alzheimer's caregivers.

    Science.gov (United States)

    Wilks, Scott E

    2008-01-01

    The purpose of this study was to evaluate psychometric properties of the shortened Resilience Scale (15-item version RS15) among a sample of Alzheimer's caregivers. Self-reported data were collected from 229 participants at 2 Alzheimer's caregiver conferences. RS15 principal axis factoring indicated a single-dimensional solution with all items loaded. Reliability was strong. Convergent validity for the RS15 was suggested through its correlations with stress, family support, and friend support. Odds ratios showed significant likelihoods of high resilience given low stress and high social support. The results confirmed the RS15 to be a psychometrically sound measure that can be used to appraise the efficacy of adaptability among Alzheimer's caregivers.

  14. Electrochemistry of single molecules and biomolecules, molecular scale nanostructures, and low-dimensional systems

    DEFF Research Database (Denmark)

    Nazmutdinov, Renat R.; Zinkicheva, Tamara T.

    2018-01-01

    Electrochemistry at ultra-small scales, where even the single molecule or biomolecule can be characterized and manipulated, is on the way to a consolidated status. At the same time molecular electrochemistry is expanding into other areas of sophisticated nano- and molecular scale systems includin...... molecular scale metal and semiconductor nanoparticles (NPs) and other nanostructures, e.g. nanotubes, “nanoflowers” etc.. The new structures offer both new electronic properties and highly confined novel charge transfer environments....

  15. Towards a single empirical correlation to predict kLa across scales and processes

    DEFF Research Database (Denmark)

    Quintanilla Hernandez, Daniela Alejandra; Gernaey, Krist; Albæk, Mads O.

    Mathematical models are increasingly used in fermentation. Nevertheless, one of the major limitations of these models is that the parameters they include are process specific, e.g. the volumetric mass transfer coefficient (kLa). Oxygen transfer was studied in order to establish a single equation...... different calculations of the average shear rate. The experimental kLa value was determined with the direct method; however, eight variations of its calculation were evaluated. Several simple correlations were fitted to the measured kLa data. The standard empirical equation was found to be best...... scales using on ‐ line viscosity measurements. A single correlation for all processes and all scales could not be established...

  16. Clinical and psychometric validation of the psychotic depression assessment scale

    DEFF Research Database (Denmark)

    Østergaard, Søren D; Pedersen, Christina H; Uggerby, Peter

    2015-01-01

    BACKGROUND: Recent studies have indicated that the 11-item Psychotic Depression Assessment Scale (PDAS), consisting of the 6-item melancholia subscale (HAM-D6) of the Hamilton Depression Rating Scale and 5 psychosis items from the Brief Psychiatric Rating Scale (BPRS), is a valid measure for the ...

  17. Estimating reliability coefficients with heterogeneous item weightings using Stata: A factor based approach

    NARCIS (Netherlands)

    Boermans, M.A.; Kattenberg, M.A.C.

    2011-01-01

    We show how to estimate a Cronbach's alpha reliability coefficient in Stata after running a principal component or factor analysis. Alpha evaluates to what extent items measure the same underlying content when the items are combined into a scale or used for latent variable. Stata allows for testing
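    The reliability coefficient named in this record can be illustrated with a minimal sketch. It computes the standard, equally weighted Cronbach's alpha from raw item scores; it is not the factor-based, heterogeneously weighted estimator the authors describe, and it uses Python rather than Stata. The function name `cronbach_alpha` and the toy data are assumptions made for illustration only.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Standard (equally weighted) Cronbach's alpha.

    scores: list of respondent rows, each a list of k item scores.
    Hypothetical helper for illustration; not the paper's Stata routine.
    """
    k = len(scores[0])
    if k < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item (column) across respondents.
    item_variances = [variance(column) for column in zip(*scores)]
    # Sample variance of the summed scale score.
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Toy data: three respondents, three items that move together perfectly.
consistent = [[1, 2, 3], [2, 3, 4], [3, 4, 5]]
# Toy data: two items that are uncorrelated across four respondents.
unrelated = [[1, 1], [1, 2], [2, 1], [2, 2]]
print(cronbach_alpha(consistent), cronbach_alpha(unrelated))
```

    In this sketch, items that rise and fall together give an alpha of 1, while uncorrelated items give an alpha of 0, which matches the reading of alpha as the extent to which items measure the same underlying content.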

  18. Cleaning and disinfection of patient care items, in relation to small animals.

    Science.gov (United States)

    Weese, J Scott

    2015-03-01

    Patient care involves several medical and surgical items, including those that come into contact with sterile or other high-risk body sites and items that have been used on other patients. These situations create a risk for infection if items are contaminated, and the implications can range from single infections to large outbreaks. To minimize the risk, proper equipment cleaning, disinfection/sterilization, storage, and monitoring practices are required. Risks posed by different items; the required level of cleaning, disinfection, or sterilization; the methods that are available and appropriate; and how to ensure efficacy, must be considered when designing and implementing an infection control program. Copyright © 2015 Elsevier Inc. All rights reserved.

  19. Irrational Delay Revisited: Examining Five Procrastination Scales in a Global Sample.

    Science.gov (United States)

    Svartdal, Frode; Steel, Piers

    2017-01-01

    Scales attempting to measure procrastination focus on different facets of the phenomenon, yet they share a common understanding of procrastination as an unnecessary, unwanted, and disadvantageous delay. The present paper examines in a global sample (N = 4,169) five different procrastination scales - Decisional Procrastination Scale (DPS), Irrational Procrastination Scale (IPS), Pure Procrastination Scale (PPS), Adult Inventory of Procrastination Scale (AIP), and General Procrastination Scale (GPS), focusing on factor structures and item functioning using Confirmatory Factor Analysis and Item Response Theory. The results indicated that the PPS (12 items selected from DPS, AIP, and GPS) measures different facets of procrastination even better than the three scales it is based on. An even shorter version of the PPS (5 items focusing on irrational delay) corresponds well to the nine-item IPS. Both scales demonstrate good psychometric properties and appear to be better measures of core procrastination attributes than alternative procrastination scales.

  20. The multi-dimensional model of Māori identity and cultural engagement: item response theory analysis of scale properties.

    Science.gov (United States)

    Sibley, Chris G; Houkamau, Carla A

    2013-01-01

    We argue that there is a need for culture-specific measures of identity that delineate the factors that most make sense for specific cultural groups. One such measure, recently developed specifically for Māori peoples, is the Multi-Dimensional Model of Māori Identity and Cultural Engagement (MMM-ICE). Māori are the indigenous peoples of New Zealand. The MMM-ICE is a 6-factor measure that assesses the following aspects of identity and cultural engagement as Māori: (a) group membership evaluation, (b) socio-political consciousness, (c) cultural efficacy and active identity engagement, (d) spirituality, (e) interdependent self-concept, and (f) authenticity beliefs. This article examines the scale properties of the MMM-ICE using item response theory (IRT) analysis in a sample of 492 Māori. The MMM-ICE subscales showed reasonably even levels of measurement precision across the latent trait range. Analysis of age (cohort) effects further indicated that most aspects of Māori identification tended to be higher among older Māori, and these cohort effects were similar for both men and women. This study provides novel support for the reliability and measurement precision of the MMM-ICE. The study also provides a first step in exploring change and stability in Māori identity across the life span. A copy of the scale, along with recommendations for scale scoring, is included.

  1. Boundary curves of individual items in the distribution of total depressive symptom scores approximate an exponential pattern in a general population.

    Science.gov (United States)

    Tomitaka, Shinichiro; Kawasaki, Yohei; Ide, Kazuki; Akutagawa, Maiko; Yamada, Hiroshi; Furukawa, Toshiaki A; Ono, Yutaka

    2016-01-01

    Previously, we proposed a model for ordinal scale scoring in which individual thresholds for each item constitute a distribution by each item. This led us to hypothesize that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores follow a common mathematical model, which is expressed as the product of the frequency of the total depressive symptom scores and the probability of the cumulative distribution function of each item threshold. To verify this hypothesis, we investigated the boundary curves of the distribution of total depressive symptom scores in a general population. Data collected from 21,040 subjects who had completed the Center for Epidemiologic Studies Depression Scale (CES-D) questionnaire as part of a national Japanese survey were analyzed. The CES-D consists of 20 items (16 negative items and four positive items). The boundary curves of adjacent item scores in the distribution of total depressive symptom scores for the 16 negative items were analyzed using log-normal scales and curve fitting. The boundary curves of adjacent item scores for a given symptom approximated a common linear pattern on a log-normal scale. Curve fitting showed that an exponential fit had a markedly higher coefficient of determination than either linear or quadratic fits. With negative affect items, the gap between the total score curve and boundary curve continuously increased with increasing total depressive symptom scores on a log-normal scale, whereas the boundary curves of positive affect items, which are not considered manifest variables of the latent trait, did not exhibit such increases in this gap. The results of the present study support the hypothesis that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores commonly follow the predicted mathematical model, which was verified to approximate an exponential mathematical pattern.

  2. Boundary curves of individual items in the distribution of total depressive symptom scores approximate an exponential pattern in a general population

    Directory of Open Access Journals (Sweden)

    Shinichiro Tomitaka

    2016-10-01

    Background: Previously, we proposed a model for ordinal scale scoring in which individual thresholds for each item constitute a distribution by each item. This led us to hypothesize that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores follow a common mathematical model, which is expressed as the product of the frequency of the total depressive symptom scores and the probability of the cumulative distribution function of each item threshold. To verify this hypothesis, we investigated the boundary curves of the distribution of total depressive symptom scores in a general population. Methods: Data collected from 21,040 subjects who had completed the Center for Epidemiologic Studies Depression Scale (CES-D) questionnaire as part of a national Japanese survey were analyzed. The CES-D consists of 20 items (16 negative items and four positive items). The boundary curves of adjacent item scores in the distribution of total depressive symptom scores for the 16 negative items were analyzed using log-normal scales and curve fitting. Results: The boundary curves of adjacent item scores for a given symptom approximated a common linear pattern on a log-normal scale. Curve fitting showed that an exponential fit had a markedly higher coefficient of determination than either linear or quadratic fits. With negative affect items, the gap between the total score curve and boundary curve continuously increased with increasing total depressive symptom scores on a log-normal scale, whereas the boundary curves of positive affect items, which are not considered manifest variables of the latent trait, did not exhibit such increases in this gap. Discussion: The results of the present study support the hypothesis that the boundary curves of each depressive symptom score in the distribution of total depressive symptom scores commonly follow the predicted mathematical model, which was verified to approximate an

  3. The impact of item order on ratings of cancer risk perception.

    Science.gov (United States)

    Taylor, Kathryn L; Shelby, Rebecca A; Schwartz, Marc D; Ackerman, Josh; LaSalle, V Holland; Gelmann, Edward P; McGuire, Colleen

    2002-07-01

    Although perceived risk is central to most theories of health behavior, there is little consensus on its measurement with regard to item wording, response set, or the number of items to include. In a methodological assessment of perceived risk, we assessed the impact of changing the order of three commonly used perceived risk items: quantitative personal risk, quantitative population risk, and comparative risk. Participants were 432 men and women enrolled in an ancillary study of the Prostate, Lung, Colorectal, and Ovarian Cancer Screening Trial. Three groups of consecutively enrolled participants responded to the three items in one of three question orders. Results indicated that item order was related to the perceived risk ratings of both ovarian (P Perceptions of risk were significantly lower when the comparative rating was made first. The findings suggest that compelling participants to consider their own risk relative to the risk of others results in lower ratings of perceived risk. Although the use of multiple items may provide more information than when only a single method is used, different conclusions may be reached depending on the context in which an item is assessed.

  4. An evaluation of the brief symptom inventory-18 using item response theory: which items are most strongly related to psychological distress?

    NARCIS (Netherlands)

    Meijer, R.R.; de Vries, Rivka M.; van Bruggen, Vincent

    2011-01-01

    The psychometric structure of the Brief Symptom Inventory–18 (BSI-18; Derogatis, 2001) was investigated using Mokken scaling and parametric item response theory. Data of 487 outpatients, 266 students, and 207 prisoners were analyzed. Results of the Mokken analysis indicated that the BSI-18 formed a

  5. An Evaluation of the Brief Symptom Inventory-18 Using Item Response Theory : Which Items Are Most Strongly Related to Psychological Distress?

    NARCIS (Netherlands)

    Meijer, Rob R.; de Vries, Rivka M.; van Bruggen, Vincent

    The psychometric structure of the Brief Symptom Inventory-18 (BSI-18; Derogatis, 2001) was investigated using Mokken scaling and parametric item response theory. Data of 487 outpatients, 266 students, and 207 prisoners were analyzed. Results of the Mokken analysis indicated that the BSI-18 formed a

  6. Scale Pretesting

    Science.gov (United States)

    Howard, Matt C.

    2018-01-01

    Scale pretests analyze the suitability of individual scale items for further analysis, whether through judging their face validity, wording concerns, and/or other aspects. The current article reviews scale pretests, separated by qualitative and quantitative methods, in order to identify the differences, similarities, and even existence of the…

  7. Item difficulty of multiple choice tests dependant on different item response formats – An experiment in fundamental research on psychological assessment

    Directory of Open Access Journals (Sweden)

    KLAUS D. KUBINGER

    2007-12-01

    Multiple choice response formats are problematic because an item is often scored as solved simply because the test-taker is a lucky guesser. Instead of applying pertinent IRT models which take guessing effects into account, a pragmatic approach of re-conceptualizing multiple choice response formats to reduce the chance of lucky guessing is considered. This paper compares the free response format with two different multiple choice formats. A common multiple choice format with a single correct response option and five distractors (“1 of 6”) is used, as well as a multiple choice format with five response options, of which any number of the five is correct and the item is only scored as mastered if all the correct response options and none of the wrong ones are marked (“x of 5”). An experiment was designed, using pairs of items with exactly the same content but different response formats. 173 test-takers were randomly assigned to two test booklets of 150 items altogether. Rasch model analyses adduced a fitting item pool, after the deletion of 39 items. The resulting item difficulty parameters were used for the comparison of the different formats. The multiple choice format “1 of 6” differs significantly from “x of 5”, with a relative effect of 1.63, while the multiple choice format “x of 5” does not significantly differ from the free response format. Therefore, the lower degree of difficulty of items with the “1 of 6” multiple choice format is an indicator of relevant guessing effects. In contrast, the “x of 5” multiple choice format can be seen as an appropriate substitute for the free response format.
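    The difference in guessing chances between the two formats can be made concrete with a short calculation. The sketch below assumes a test-taker who guesses blindly and picks every admissible response pattern with equal probability; that assumption, the function names, and the choice of whether the empty pattern counts as admissible are illustrative and do not come from the record.

```python
from fractions import Fraction


def guess_prob_single_best(n_options):
    """Chance of a blind guess succeeding when exactly one option is correct."""
    return Fraction(1, n_options)


def guess_prob_any_subset(n_options, allow_empty=False):
    """Chance of a blind guess marking exactly the right subset of options.

    Assumes every marking pattern is equally likely. By default the empty
    pattern (nothing marked) is excluded, i.e. at least one option is
    assumed to be correct; this is an assumption, not stated in the abstract.
    """
    patterns = 2 ** n_options - (0 if allow_empty else 1)
    return Fraction(1, patterns)


print(guess_prob_single_best(6))   # "1 of 6" format
print(guess_prob_any_subset(5))    # "x of 5" format, empty pattern excluded
```

    Under these assumptions a blind guess succeeds with probability 1/6 in the “1 of 6” format but only 1/31 in the “x of 5” format (1/32 if marking nothing is also admissible), which is in line with the reported finding that “1 of 6” items were easier.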

  8. Rasch Analysis of the Fullerton Advanced Balance (FAB) Scale

    Science.gov (United States)

    Fiedler, Roger C.; Rose, Debra J.

    2011-01-01

    ABSTRACT Purpose: This cross-sectional study explores the psychometric properties and dimensionality of the Fullerton Advanced Balance (FAB) Scale, a multi-item balance test for higher-functioning older adults. Methods: Participants (n=480) were community-dwelling adults able to ambulate independently. Data gathering consisted of survey and balance performance assessment. Psychometric properties were assessed using Rasch analysis. Results: Mean age of participants was 76.4 (SD=7.1) years. Mean FAB Scale scores were 24.7/40 (SD=7.5). Analyses for scale dimensionality showed that 9 of the 10 items fit a unidimensional measure of balance. Item 10 (Reactive Postural Control) did not fit the model. The reliability of the scale to separate persons was 0.81 out of 1.00; the reliability of the scale to separate items in terms of their difficulty was 0.99 out of 1.00. Cronbach's alpha for a 10-item model was 0.805. Items of differing difficulties formed a useful ordinal hierarchy for scaling patterns of expected balance ability scoring for a normative population. Conclusion: The FAB Scale appears to be a reliable and valid tool to assess balance function in higher-functioning older adults. The test was found to discriminate among participants of varying balance abilities. Further exploration of concurrent validity of Rasch-generated expected item scoring patterns should be undertaken to determine the test's diagnostic and prescriptive utility. PMID:22210989

  9. Rasch Analysis of the Fullerton Advanced Balance (FAB) Scale.

    Science.gov (United States)

    Klein, Penelope J; Fiedler, Roger C; Rose, Debra J

    2011-01-01

    This cross-sectional study explores the psychometric properties and dimensionality of the Fullerton Advanced Balance (FAB) Scale, a multi-item balance test for higher-functioning older adults. Participants (n=480) were community-dwelling adults able to ambulate independently. Data gathering consisted of survey and balance performance assessment. Psychometric properties were assessed using Rasch analysis. Mean age of participants was 76.4 (SD=7.1) years. Mean FAB Scale scores were 24.7/40 (SD=7.5). Analyses for scale dimensionality showed that 9 of the 10 items fit a unidimensional measure of balance. Item 10 (Reactive Postural Control) did not fit the model. The reliability of the scale to separate persons was 0.81 out of 1.00; the reliability of the scale to separate items in terms of their difficulty was 0.99 out of 1.00. Cronbach's alpha for a 10-item model was 0.805. Items of differing difficulties formed a useful ordinal hierarchy for scaling patterns of expected balance ability scoring for a normative population. The FAB Scale appears to be a reliable and valid tool to assess balance function in higher-functioning older adults. The test was found to discriminate among participants of varying balance abilities. Further exploration of concurrent validity of Rasch-generated expected item scoring patterns should be undertaken to determine the test's diagnostic and prescriptive utility.

  10. Evaluation of the Short Parkinson's Evaluation Scale: a new friendly scale for the evaluation of Parkinson's disease in clinical drug trials.

    Science.gov (United States)

    Rabey, J M; Bass, H; Bonuccelli, U; Brooks, D; Klotz, P; Korczyn, A D; Kraus, P; Martinez-Martin, P; Morrish, P; Van Sauten, W; Van Hilten, B

    1997-08-01

    The extensive use of the Unified Parkinson's Disease Rating Scale (UPDRS) has revealed low interrater reliability in some items and redundancy in others. In view of these shortcomings, we have structured a new scale that includes a zero- to three-point scale for each item in the evaluation of PD. The mental axis includes memory, thought disorders, and depression. Activities of daily living (ADL) includes eight items: speech, eating, feeding, dressing, hygiene, handwriting, walking, and turning in bed. The motor examination includes eight items: speech, tremor, rest and posture, rigidity, finger tapping, arising from chair, gait, and postural stability. Complications of therapy were also included: dyskinesias, dystonia, motor fluctuations, and freezing episodes, collected by history. In addition, a global scoring for motor fluctuations that should complement the Hoehn and Yahr Scale was incorporated. In this report, we present a statistical analysis of the ADL, motor evaluation, and complications of therapy sections. Concerning the interrater reliability, mean Kendall's W values were >0.9 for most of the items in the Short Parkinson's Evaluation Scale (SPES). Kendall's W <0.8 (motor evaluation) was found for two items of the SPES and nine items of the UPDRS. The mean interrater reliability for both scales across all seven centers (seven Kendall's W for seven centers) (Mann-Whitney test) showed no statistical differences between the scales. Spearman's correlations between items of both scales were significant. Factor analysis of the SPES and UPDRS data revealed a four-factor solution that explained approximately 60% of the data. All participating centers found the SPES easier to apply and quicker to complete, when compared with the UPDRS. The results obtained strongly favor the introduction of SPES for clinical practice.
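    Kendall's W, the interrater statistic reported in this record, can be sketched for the simplest case. The code below assumes each rater supplies a complete ranking of the same subjects with no tied ranks; ratings on a zero- to three-point scale would normally contain ties and need a tie correction that is not shown here. The function name `kendalls_w` and the toy rankings are illustrative assumptions.

```python
def kendalls_w(ranks):
    """Kendall's coefficient of concordance for untied rankings.

    ranks: list of m rater rows, each a ranking (1..n) of the same n subjects.
    Hypothetical helper; no tie correction is applied.
    """
    m = len(ranks)
    n = len(ranks[0])
    # Sum of the ranks each subject received across raters.
    rank_sums = [sum(column) for column in zip(*ranks)]
    mean_rank_sum = m * (n + 1) / 2
    # Sum of squared deviations of the rank sums from their mean.
    s = sum((total - mean_rank_sum) ** 2 for total in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))


# Three raters in complete agreement, then two raters in exact opposition.
print(kendalls_w([[1, 2, 3], [1, 2, 3], [1, 2, 3]]))
print(kendalls_w([[1, 2, 3], [3, 2, 1]]))
```

    W ranges from 0 (no agreement among raters) to 1 (complete agreement), so the values above 0.9 reported for most SPES items indicate near-complete agreement.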

  11. Further Investigating Method Effects Associated with Negatively Worded Items on Self-Report Surveys

    Science.gov (United States)

    DiStefano, Christine; Motl, Robert W.

    2006-01-01

    This article used multitrait-multimethod methodology and covariance modeling for an investigation of the presence and correlates of method effects associated with negatively worded items on the Rosenberg Self-Esteem (RSE) scale (Rosenberg, 1989) using a sample of 757 adults. Results showed that method effects associated with negative item phrasing…

  12. Reducing calories, fat, saturated fat, and sodium in restaurant menu items: Effects on consumer acceptance.

    Science.gov (United States)

    Patel, Anjali A; Lopez, Nanette V; Lawless, Harry T; Njike, Valentine; Beleche, Mariana; Katz, David L

    2016-12-01

    To assess consumer acceptance of reductions of calories, fat, saturated fat, and sodium to current restaurant recipes. Twenty-four menu items, from six restaurant chains, were slightly modified and moderately modified by reducing targeted ingredients. Restaurant customers (n = 1,838) were recruited for a taste test and were blinded to the recipe version as well as the purpose of the study. Overall consumer acceptance was measured using a 9-point hedonic (like/dislike) scale, likelihood to purchase scale, Just-About-Right (JAR) 5-point scale, penalty analysis, and alienation analysis. Overall, modified recipes of 19 menu items were scored similar to (or better than) their respective current versions. Eleven menu items were found to be acceptable in the slightly modified recipe version, and eight menu items were found to be acceptable in the moderately modified recipe version. Acceptable ingredient modifications resulted in a reduction of up to 26% in calories and a reduction of up to 31% in sodium per serving. The majority of restaurant menu items with small reductions of calories, fat, saturated fat, and sodium were acceptable. Given the frequency of eating foods away from home, these reductions could be effective in creating dietary improvements for restaurant diners. © 2016 The Obesity Society.

  13. [Drug Addiction Self-Help Recovery scale (DASH-scale): an approach to the measurement of recovery from drug addiction in self-help program among drug addicts].

    Science.gov (United States)

    Shimane, Takuya; Misago, Chizuru

    2004-12-01

    The purpose of the study was to develop a scale for measuring recovery in self-help programs for drug addicts. Our study sites were fourteen self-help groups for drug addicts called "DARC: Drug Addiction Rehabilitation Center". DARC activities were based on Narcotics Anonymous-type self-help programs. The 25-item DASH-scale questionnaire was developed using data obtained through in-depth interviews with DARC staff. A cross-sectional study among recovering addicts participating in DARC activities was carried out from January 2004 to February 2004. A total of 164 subjects responded to our questionnaire. Factor analysis was carried out and items with weaker or split loadings were removed. Factor analysis of the DASH-scale results produced a surprisingly clean four-factor solution. Nineteen items were left to form the final DASH-scale: regular life-style (6 items), acceptance of drug addiction (5 items), sympathy with members (5 items), and reborn (3 items). The internal consistency (Cronbach's alpha) of these scales was very high (0.87). Low but significant concurrent correlations were observed between the DASH-scale and the Rosenberg Self-Esteem Scale (0.22) and the Purpose in Life Test (0.35). Discriminant validity of the DASH-scale was supported by a significant increase in scores with the length of exposure to the self-help program. The evidence suggests that the DASH-scale can measure recovery in self-help programs.

  14. Development of emotional stability scale

    Directory of Open Access Journals (Sweden)

    M Chaturvedi

    2010-01-01

    Background: Emotional stability remains the central theme in personality studies. The concept of stable emotional behavior at any level is that which reflects the fruits of normal emotional development. The study aims at development of an emotional stability scale. Materials and Methods: Based on available literature the components of emotional stability were identified and 250 items were developed, covering each component. Two-stage elimination of items was carried out, i.e. through judges' opinions and item analysis. Results: Fifty items with highest 't' values covering 5 dimensions of emotional stability, viz. pessimism vs. optimism, anxiety vs. calm, aggression vs. tolerance, dependence vs. autonomy, and apathy vs. empathy, were retained in the final scale. Reliability as checked by Cronbach's alpha was .81 and by split half method it was .79. Content validity and construct validity were checked. Norms are given in the form of cumulative percentages. Conclusion: Based on the psychometric principles a 50 item, self-administered 5 point Likert type rating scale was developed for measurement of emotional stability.

  15. A Generalized Logistic Regression Procedure to Detect Differential Item Functioning among Multiple Groups

    Science.gov (United States)

    Magis, David; Raiche, Gilles; Beland, Sebastien; Gerard, Paul

    2011-01-01

    We present an extension of the logistic regression procedure to identify dichotomous differential item functioning (DIF) in the presence of more than two groups of respondents. Starting from the usual framework of a single focal group, we propose a general approach to estimate the item response functions in each group and to test for the presence…

  16. Simulating Biomass Fast Pyrolysis at the Single Particle Scale

    Energy Technology Data Exchange (ETDEWEB)

    Ciesielski, Peter [National Renewable Energy Laboratory (NREL)]; Wiggins, Gavin [ORNL]; Daw, C Stuart [ORNL]; Jakes, Joseph E. [U.S. Forest Service, Forest Products Laboratory, Madison, Wisconsin, USA]

    2017-07-01

    Simulating fast pyrolysis at the scale of single particles allows for the investigation of the impacts of feedstock-specific parameters such as particle size, shape, and species of origin. For this reason particle-scale modeling has emerged as an important tool for understanding how variations in feedstock properties affect the outcomes of pyrolysis processes. The origins of feedstock properties are largely dictated by the composition and hierarchical structure of biomass, from the microstructural porosity to the external morphology of milled particles. These properties may be accounted for in simulations of fast pyrolysis by several different computational approaches depending on the level of structural and chemical complexity included in the model. The predictive utility of particle-scale simulations of fast pyrolysis can still be enhanced substantially by advancements in several areas. Most notably, considerable progress would be facilitated by the development of pyrolysis kinetic schemes that are decoupled from transport phenomena, predict product evolution from whole-biomass with increased chemical speciation, and are still tractable with present-day computational resources.

  17. The Internet Gaming Disorder Scale.

    Science.gov (United States)

    Lemmens, Jeroen S; Valkenburg, Patti M; Gentile, Douglas A

    2015-06-01

    Recently, the American Psychiatric Association included Internet gaming disorder (IGD) in the appendix of the 5th edition of the Diagnostic and Statistical Manual of Mental Disorders (DSM-5). The main aim of the current study was to test the reliability and validity of 4 survey instruments to measure IGD on the basis of the 9 criteria from the DSM-5: a long (27-item) and short (9-item) polytomous scale and a long (27-item) and short (9-item) dichotomous scale. The psychometric properties of these scales were tested among a representative sample of 2,444 Dutch adolescents and adults, ages 13-40 years. Confirmatory factor analyses demonstrated that the structural validity (i.e., the dimensional structure) of all scales was satisfactory. Both types of assessment (polytomous and dichotomous) were also reliable (i.e., internally consistent) and showed good criterion-related validity, as indicated by positive correlations with time spent playing games, loneliness, and aggression and negative correlations with self-esteem, prosocial behavior, and life satisfaction. The dichotomous 9-item IGD scale showed solid psychometric properties and was the most practical scale for diagnostic purposes. Latent class analysis of this dichotomous scale indicated that 3 groups could be discerned: normal gamers, risky gamers, and disordered gamers. On the basis of the number of people in this last group, the prevalence of IGD among 13- through 40-year-olds in the Netherlands is approximately 4%. If the DSM-5 threshold for diagnosis (experiencing 5 or more criteria) is applied, the prevalence of disordered gamers is more than 5%. (c) 2015 APA, all rights reserved.

  18. Derivation of the acceptance and self-worth adjustment scale.

    Science.gov (United States)

    Tabrett, Daryl R; Latham, Keziah

    2010-11-01

    The original 55-item Nottingham Adjustment Scale (NAS) is a first generation self-report instrument constructed using classical test theory to evaluate adjustment to vision loss. This study assesses the function of the NAS using Rasch analysis in a sample of adults with visual impairment and presents a revised second-generation instrument. Ninety-nine subjects with established vision loss (median onset 5 years) were administered the NAS. Rasch analysis was performed to: (1) determine optimum response scale function, (2) aid item reduction, (3) determine reliability indices and item targeting, (4) assess unidimensionality using Rasch-based principal component analysis, (5) assess differential item functioning (notable defined as >1.0 logit), and (6) formulate person measures to correlate with Geriatric Depression Scale scores and distance visual acuity to indicate convergent and discriminant validity, respectively. Response categories exhibited underutilization, which when repaired improved response scale functioning and ordered structural calibrations. Misfitting items were removed iteratively until all items had mean-square infit and outfit values of 0.70 to 1.30. However, principal component analysis confirmed insufficient unidimensionality (two contrasts identified, eigenvalues 2.4 and 2.3). Removal of these contrasts and two further iterations restored unidimensionality. Despite item mistargeting (1.58 logits), the revised 19-item instrument demonstrated good person (0.85) and item (0.96) reliability coefficients, good convergent and discriminant validity, and no systematic differential item functioning. The resultant 19-item instrument was termed the Acceptance and Self-Worth Adjustment Scale (AS-WAS). In those with established vision loss, the 19-item Acceptance and Self-Worth Adjustment Scale is a reliable and valid instrument that estimates the level of adjustment concerned with acceptance, attitudes, self-esteem, self-efficacy, and locus of control. An

  19. Longitudinal and dynamic measurement invariance of the FACIT-Fatigue scale: an application of the measurement model of derivatives to ECOG-ACRIN study E2805.

    Science.gov (United States)

    Estabrook, Ryne; Cella, David; Zhao, Fengmin; Manola, Judith; DiPaola, Robert S; Wagner, Lynne I; Haas, Naomi B

    2018-03-05

    While quality of life measures may be used to assess meaningful change and group differences, their scaling and validation often rely on a single occasion of measurement. Using the 13-item FACIT-Fatigue questionnaire at three timepoints, this study tests whether individual items change together in ways consistent with a general fatigue factor. The measurement model of derivatives (MMOD) is a novel method for measurement evaluation that directly assesses whether a given factor structure accurately describes how individual test items change over time. MMOD transforms item-level longitudinal data into a set of orthogonal change scores, each one representing either a within-person longitudinal mean or a different type of longitudinal change. These change scores are then factor analyzed and tested for invariance. This approach is applied to the FACIT-Fatigue scale in a sample of patients with renal cell carcinoma treated on ECOG-ACRIN Cancer Research Group (ECOG-ACRIN) study 2805. Analyses revealed strong evidence of unidimensionality, and apparent factorial invariance using traditional techniques. MMOD revealed a small but statistically significant difference in factor structure ([Formula: see text], [Formula: see text]), where factor loadings were weaker and more variable for measuring longitudinal change. The differences in factor structure were not large enough to substantially affect scale usage in this application, but they do reveal some variability across items in the FACIT-Fatigue in their ability to detect change. Future applications should consider differential sensitivity of individual items in multi-item scales, and perhaps even capitalize upon these differences by selecting items that are more sensitive to change.

  20. Construct Validity of the Dutch Version of the 12-Item Partners in Health Scale: Measuring Patient Self-Management Behaviour and Knowledge in Patients with Chronic Obstructive Pulmonary Disease

    NARCIS (Netherlands)

    Lenferink, Anke; Effing, T.W.; Harvey, Peter; Battersby, Malcolm; Frith, Peter; van Beurden, Wendy; van der Palen, Jacobus Adrianus Maria

    2016-01-01

    Objective The 12-item Partners in Health scale (PIH) was developed in Australia to measure self-management behaviour and knowledge in patients with chronic diseases, and has undergone several changes. Our aim was to assess the construct validity and reliability of the latest PIH version in Dutch

  1. Development of the Attributed Dignity Scale.

    Science.gov (United States)

    Jacelon, Cynthia S; Dixon, Jane; Knafl, Kathleen A

    2009-07-01

    A sequential, multi-method approach to instrument development beginning with concept analysis, followed by (a) item generation from qualitative data, (b) review of items by expert and lay person panels, (c) cognitive appraisal interviews, (d) pilot testing, and (e) evaluating construct validity was used to develop a measure of attributed dignity in older adults. The resulting positively scored, 23-item scale has three dimensions: Self-Value, Behavioral Respect-Self, and Behavioral Respect-Others. Item-total correlations in the pilot study ranged from 0.39 to 0.85. Correlations between the Attributed Dignity Scale (ADS) and both Rosenberg's Self-Esteem Scale (0.17) and Crowne and Marlowe's Social Desirability Scale (0.36) were modest and in the expected direction, indicating attributed dignity is a related but independent concept. Next steps include testing the ADS with a larger sample to complete factor analysis, test-retest stability, and further study of the relationships between attributed dignity and other concepts.

  2. 100-point scale evaluating job satisfaction and the results of the 12-item General Health Questionnaire in occupational workers.

    Science.gov (United States)

    Kawada, Tomoyuki; Yamada, Natsuki

    2012-01-01

    Job satisfaction is an important factor in the occupational lives of workers. In this study, the relationship between a one-dimensional scale of job satisfaction and psychological wellbeing was evaluated. A total of 1,742 workers (1,191 men and 551 women) participated. A 100-point scale evaluating job satisfaction (0 [extremely dissatisfied] to 100 [extremely satisfied]) and the General Health Questionnaire, 12-item version (GHQ-12), evaluating psychological wellbeing were used. A multiple regression analysis was then used, controlling for gender and age. The change in the GHQ-12 and job satisfaction scores after a two-year interval was also evaluated. The mean age for the subjects was 42.2 years for the men and 36.2 years for the women. The GHQ-12 and job satisfaction scores were significantly correlated in each generation. The partial correlation coefficients between the changes in the two variables, controlling for age, were -0.395 for men and -0.435 for women (p<...). ... job satisfaction score was associated with the GHQ-12 results (p<...). ... job satisfaction was significantly associated with psychological wellbeing as judged using the GHQ-12.

  3. Defining the minimal detectable change in scores on the eight-item Morisky Medication Adherence Scale.

    Science.gov (United States)

    Muntner, Paul; Joyce, Cara; Holt, Elizabeth; He, Jiang; Morisky, Donald; Webber, Larry S; Krousel-Wood, Marie

    2011-05-01

    Self-report scales are used to assess medication adherence. Data on how to discriminate change in self-reported adherence over time from random variability are limited. To determine the minimal detectable change for scores on the 8-item Morisky Medication Adherence Scale (MMAS-8). The MMAS-8 was administered twice, using a standard telephone script, with administration separated by 14-22 days, to 210 participants taking antihypertensive medication in the CoSMO (Cohort Study of Medication Adherence among Older Adults). MMAS-8 scores were calculated and participants were grouped into previously defined categories (<6, 6 to <8, and 8 for low, medium, and high adherence). The mean (SD) age of participants was 78.1 (5.8) years, 43.8% were black, and 68.1% were women. Overall, 8.1% (17/210), 16.2% (34/210), and 51.0% (107/210) of participants had low, medium, and high MMAS-8 scores, respectively, at both survey administrations (overall agreement 75.2%; 158/210). The weighted κ statistic was 0.63 (95% CI 0.53 to 0.72). The intraclass correlation coefficient was 0.78. The within-person standard error of the mean for change in MMAS-8 scores was 0.81, which equated to a minimal detectable change of 1.98 points. Only 4.3% (9/210) of the participants had a change in MMAS-8 of 2 or more points between survey administrations. Within-person changes in MMAS-8 scores of 2 or more points over time may represent a real change in antihypertensive medication adherence.
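
    The agreement figures in this abstract can be re-derived from the reported counts. The snippet below is only an arithmetic check; the variable and function names are illustrative and do not come from the study.

```python
# Counts reported in the abstract: participants placed in the same adherence
# category at both survey administrations, out of 210 participants in total.
same_category = {"low": 17, "medium": 34, "high": 107}
n_total = 210


def percent(count, total):
    """Return count/total as a percentage rounded to one decimal place."""
    return round(100 * count / total, 1)


# Percentage of all participants who stayed in each category.
per_category = {name: percent(n, n_total) for name, n in same_category.items()}

# Overall agreement is the share of participants whose category did not change.
n_agree = sum(same_category.values())
overall_agreement = percent(n_agree, n_total)

print(per_category)        # {'low': 8.1, 'medium': 16.2, 'high': 51.0}
print(n_agree)             # 158
print(overall_agreement)   # 75.2
```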

  4. Validity of the Neuromuscular Recovery Scale: a measurement model approach.

    Science.gov (United States)

    Velozo, Craig; Moorhouse, Michael; Ardolino, Elizabeth; Lorenz, Doug; Suter, Sarah; Basso, D Michele; Behrman, Andrea L

    2015-08-01

    To determine how well the Neuromuscular Recovery Scale (NRS) items fit the Rasch, 1-parameter, partial-credit measurement model. Confirmatory factor analysis (CFA) and principal components analysis (PCA) of residuals were used to determine dimensionality. The Rasch, 1-parameter, partial-credit rating scale model was used to determine rating scale structure, person/item fit, point-measure item correlations, item discrimination, and measurement precision. Seven NeuroRecovery Network clinical sites. Outpatients (N=188) with spinal cord injury. Not applicable. NRS. While the NRS met 1 of 3 CFA criteria, the PCA revealed that the Rasch measurement dimension explained 76.9% of the variance. Ten of 11 items and 91% of the patients fit the Rasch model, with 9 of 11 items showing high discrimination. Sixty-nine percent of the ratings met criteria. The items showed a logical item-difficulty order, with Stand retraining as the easiest item and Walking as the most challenging item. The NRS showed no ceiling or floor effects and separated the sample into almost 5 statistically distinct strata; individuals with an American Spinal Injury Association Impairment Scale (AIS) D classification showed the most ability, and those with an AIS A classification showed the least ability. Items not meeting the rating scale criteria appear to be related to the low frequency counts. The NRS met many of the Rasch model criteria for construct validity. Copyright © 2015 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

  5. Reliability and construct validity of the Spanish version of the 6-item CTS symptoms scale for outcomes assessment in carpal tunnel syndrome.

    Science.gov (United States)

    Rosales, Roberto S; Martin-Hidalgo, Yolanda; Reboso-Morales, Luis; Atroshi, Isam

    2016-03-03

    The purpose of this study was to assess the reliability and construct validity of the Spanish version of the 6-item carpal tunnel syndrome (CTS) symptoms scale (CTS-6). In this cross-sectional study, 40 patients diagnosed with CTS based on clinical and neurophysiologic criteria completed the standard Spanish versions of the CTS-6 and the disabilities of the arm, shoulder and hand (QuickDASH) scales on two occasions with a 1-week interval. Internal-consistency reliability was assessed with the Cronbach alpha coefficient and test-retest reliability with the intraclass correlation coefficient, two-way random effects model and absolute agreement definition (ICC2,1). Cross-sectional precision was analyzed with the Standard Error of the Measurement (SEM). Longitudinal precision for test-retest reliability coefficient was assessed with the Standard Error of the Measurement difference (SEMdiff) and the Minimal Detectable Change at 95% confidence level (MDC95). For assessing construct validity it was hypothesized that the CTS-6 would have a strong positive correlation with the QuickDASH, analyzed with the Pearson correlation coefficient (r). The standard Spanish version of the CTS-6 presented a Cronbach alpha of 0.81 with a SEM of 0.3. Test-retest reliability showed an ICC of 0.85 with a SEMdiff of 0.36 and a MDC95 of 0.7. The correlation between CTS-6 and the QuickDASH was concordant with the a priori formulated construct hypothesis (r = 0.69). Conclusions: The standard Spanish version of the 6-item CTS symptoms scale showed good internal consistency, test-retest reliability and construct validity for outcomes assessment in CTS. The CTS-6 will be useful to clinicians and researchers in Spanish speaking parts of the world. The use of standardized outcome measures across countries also will facilitate comparison of research results in carpal tunnel syndrome.
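
    The abstract does not spell out how MDC95 was computed. A minimal sketch, assuming the commonly used relation MDC95 = 1.96 x SEMdiff (an assumption, not a formula quoted from the paper), reproduces the reported value from the reported standard error of the difference. The function name is illustrative.

```python
# Two-sided 95% quantile of the standard normal distribution.
Z_95 = 1.96


def mdc95(sem_diff):
    """Minimal detectable change at the 95% confidence level.

    Assumes the conventional definition MDC95 = 1.96 * SEMdiff; the abstract
    itself does not state which formula the authors used.
    """
    return Z_95 * sem_diff


reported_sem_diff = 0.36  # standard error of the difference from the abstract
print(round(mdc95(reported_sem_diff), 1))  # 0.7
```

    Under this assumption, 1.96 x 0.36 = 0.7056, which rounds to the 0.7 given in the abstract.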

  6. The Media and Technology Usage and Attitudes Scale: An empirical investigation

    Science.gov (United States)

    Rosen, L.D.; Whaling, K.; Carrier, L.M.; Cheever, N.A.; Rokkum, J.

    2015-01-01

    Current approaches to measuring people’s everyday usage of technology-based media and other computer-related activities have proved to be problematic as they use varied outcome measures, fail to measure behavior in a broad range of technology-related domains and do not take into account recently developed types of technology including smartphones. In the present study, a wide variety of items, covering a range of up-to-date technology and media usage behaviors. Sixty-six items concerning technology and media usage, along with 18 additional items assessing attitudes toward technology, were administered to two independent samples of individuals, comprising 942 participants. Factor analyses were used to create 11 usage subscales representing smartphone usage, general social media usage, Internet searching, e-mailing, media sharing, text messaging, video gaming, online friendships, Facebook friendships, phone calling, and watching television in addition to four attitude-based subscales: positive attitudes, negative attitudes, technological anxiety/dependence, and attitudes toward task-switching. All subscales showed strong reliabilities and relationships between the subscales and pre-existing measures of daily media usage and Internet addiction were as predicted. Given the reliability and validity results, the new Media and Technology Usage and Attitudes Scale was suggested as a method of measuring media and technology involvement across a variety of types of research studies either as a single 60-item scale or any subset of the 15 subscales. PMID:25722534

  7. The Media and Technology Usage and Attitudes Scale: An empirical investigation.

    Science.gov (United States)

    Rosen, L D; Whaling, K; Carrier, L M; Cheever, N A; Rokkum, J

    2013-11-01

    Current approaches to measuring people's everyday usage of technology-based media and other computer-related activities have proved to be problematic as they use varied outcome measures, fail to measure behavior in a broad range of technology-related domains and do not take into account recently developed types of technology including smartphones. In the present study, a wide variety of items, covering a range of up-to-date technology and media usage behaviors. Sixty-six items concerning technology and media usage, along with 18 additional items assessing attitudes toward technology, were administered to two independent samples of individuals, comprising 942 participants. Factor analyses were used to create 11 usage subscales representing smartphone usage, general social media usage, Internet searching, e-mailing, media sharing, text messaging, video gaming, online friendships, Facebook friendships, phone calling, and watching television in addition to four attitude-based subscales: positive attitudes, negative attitudes, technological anxiety/dependence, and attitudes toward task-switching. All subscales showed strong reliabilities and relationships between the subscales and pre-existing measures of daily media usage and Internet addiction were as predicted. Given the reliability and validity results, the new Media and Technology Usage and Attitudes Scale was suggested as a method of measuring media and technology involvement across a variety of types of research studies either as a single 60-item scale or any subset of the 15 subscales.

  8. A cross-cultural investigation into the dimensional structure and stability of the Barriers to Research and Utilization Scale (BARRIERS Scale).

    Science.gov (United States)

    Williams, Brett; Brown, Ted; Costello, Shane

    2015-10-24

    It is important that scales exhibit strong measurement properties including those related to the investigation of issues that impact evidence-based practice. The validity of the Barriers to Research Utilization Scale (BARRIERS Scale) has recently been questioned in a systematic review. This study investigated the dimensional structure and stability of the 28 item BARRIERS Scale when completed by three groups of participants from three different cross-cultural environments. Data from the BARRIERS Scale completed by 696 occupational therapists from Australia (n = 137), Taiwan (n = 413), and the United Kingdom (n = 144) were analysed using principal components analysis, followed by Procrustes Transformation. Poorly fitting items were identified by low communalities, cross-loading, and theoretically inconsistent primary loadings, and were systematically removed until good fit was achieved. The cross-cultural stability of the component structure of the BARRIERS Scale was examined. A four component, 19 item version of the BARRIERS Scale emerged that demonstrated an improved dimensional fit and stability across the three participant groups. The resulting four components were consistent with the BARRIERS Scale as originally conceptualised. Findings from the study suggest that the four component, 19 item version of the BARRIERS Scale is a robust and valid measure for identifying barriers to research utilization for occupational therapists in paediatric health care settings across Australia, United Kingdom, and Taiwan. The four component 19 item version of the BARRIERS Scale exhibited good dimensional structure, internal consistency, and stability.

  9. Evaluating Change in Behavioral Preferences: Multidimensional Scaling Single-Ideal Point Model

    Science.gov (United States)

    Ding, Cody

    2016-01-01

    The purpose of the article is to propose a multidimensional scaling single-ideal point model as a method to evaluate changes in individuals' preferences under the explicit methodological framework of behavioral preference assessment. One example is used to illustrate the approach for a clear idea of what this approach can accomplish.

  10. [Psychometric properties of the Activities Daily Life Scale (ADL)].

    Science.gov (United States)

    Boyer, L; Murcia, A; Belzeaux, R; Loundou, A; Azorin, J-M; Chabannes, J-M; Dassa, D; Naudin, J; Samuelian, J-C; Lancon, C

    2010-10-01

    Deficits in social functioning are an important core feature of mental health. Recently in France, the Activities Daily Life (ADL) scale has been proposed by the French authorities to assess social functioning for all hospitalized patients in a psychiatric ward. The perspective is to use this scale in the financing and organization of mental health services in France. The ADL scale is a 6-item (dressing/undressing, walking/mobility, eating/drinking, using toilets, behaviour, relationships/communication) heteroquestionnaire completed by a health care professional at the beginning of each hospitalization, assessing functioning of patients suffering from mental health diseases. However, limited consensus exists on this scale. The psychometric properties of the ADL scale have not been assessed. There is a pressing need for detailed examination of its performance. The aim of this study was to explore ADL psychometric properties in a sample of hospitalized patients in a psychiatric ward. We retrospectively analyzed data for all episodes of care delivered to hospitalized patients in a psychiatric ward in our French Public Hospital from January 1, 2008 to June 30, 2008. The study involved retrospective review of administrative and medical databases. The following data were collected: age, gender, diagnoses based on the International Classification of Diseases - 10th version, ADL scale and Assessment of Social Self-Sufficiency scale (ASSS). The psychometric properties were examined using construct validity, reliability, external validity, reproducibility and sensitivity to change. Data analysis was performed using SPSS 15.0 and WINSTEP software. A total of 1066 patients completed the ADL scale. Among them, 49.7% were male, mean age was 36.5 ± 10.8, and 83.5% were single. Schizophrenia, schizotypal and delusional disorders (40.0%), mood disorders (27.9%) and mental and behavioural disorders due to psychoactive substance use (12%) were the most common diagnoses. Factor

  11. The measurement of tritium in Canadian food items

    International Nuclear Information System (INIS)

    Brown, R.M.

    1995-03-01

    Food items locally grown near Perth, Ontario and grocery store produce and locally grown items from the Pickering-Ajax area in the vicinity of the Pickering Nuclear Generating Station (PNGS) have been analyzed for free water tritium (HTO) and organically bound tritium (OBT). The technique of measuring ³He ingrowth in samples by mass spectrometry has been used because of its sensitivity and freedom from opportunity for contamination during processing and measurement. Concentrations observed at each site were of the order expected on the basis of known levels of tritium in the local atmosphere and precipitation. There was considerable variation between different materials and limited correlation between materials of a single type. (author). 10 refs., 8 tabs., 4 figs

  12. Short forms of the Social Interaction Anxiety Scale and the Social Phobia Scale.

    Science.gov (United States)

    Fergus, Thomas A; Valentiner, David P; McGrath, Patrick B; Gier-Lonsway, Stephanie L; Kim, Hyun-Soo

    2012-01-01

    Mattick and Clarke's (1998) Social Interaction Anxiety Scale (SIAS) and Social Phobia Scale (SPS) are commonly used self-report measures that assess 2 dimensions of social anxiety. Given the need for short, readable measures, this research proposes short forms of both scales. Item-level analyses of readability characteristics of the SIAS and SPS items led to the selection of 6 items from each scale for use in the short forms. The SIAS and SPS short forms had reading levels at approximately the 6th and 5th grade level, respectively. Results using nonclinical (Study 1: N = 469) and clinical (Study 2: N = 145) samples identified these short forms as being factorially sound, possessing adequate internal consistency, and having strong convergence with their full-length counterparts. Moreover, these short forms showed convergence with other measures of social anxiety, showed divergence from measures assessing related constructs, and predicted concurrent interpersonal functioning. Recommendations for the use of these short forms are discussed.

  13. A New Extension of the Binomial Error Model for Responses to Items of Varying Difficulty in Educational Testing and Attitude Surveys.

    Directory of Open Access Journals (Sweden)

    James A Wiley

    We put forward a new item response model which is an extension of the binomial error model first introduced by Keats and Lord. Like the binomial error model, the basic latent variable can be interpreted as a probability of responding in a certain way to an arbitrarily specified item. For a set of dichotomous items, this model gives predictions that are similar to other single parameter IRT models (such as the Rasch model) but has certain advantages in more complex cases. The first is that in specifying a flexible two-parameter Beta distribution for the latent variable, it is easy to formulate models for randomized experiments in which there is no reason to believe that either the latent variable or its distribution vary over randomly composed experimental groups. Second, the elementary response function is such that extensions to more complex cases (e.g., polychotomous responses, unfolding scales) are straightforward. Third, the probability metric of the latent trait allows tractable extensions to cover a wide variety of stochastic response processes.
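
    As background for the abstract above, the sketch below shows only the classical starting point: a binomial error model whose latent success probability follows a two-parameter Beta distribution, which gives a beta-binomial distribution for the total score. It assumes equally difficult items and is not the authors' extension to items of varying difficulty; the parameter values and names are illustrative.

```python
from math import comb, exp, lgamma


def log_beta(a, b):
    """Natural log of the Beta function B(a, b)."""
    return lgamma(a) + lgamma(b) - lgamma(a + b)


def beta_binomial_pmf(k, n, a, b):
    """P(total score = k) on n dichotomous items.

    The latent probability p of a positive response follows Beta(a, b), and
    given p the score is Binomial(n, p). Integrating p out gives
    C(n, k) * B(k + a, n - k + b) / B(a, b).
    """
    return comb(n, k) * exp(log_beta(k + a, n - k + b) - log_beta(a, b))


# Illustrative values only: 10 items, latent probability ~ Beta(2, 3).
n_items, a, b = 10, 2.0, 3.0
pmf = [beta_binomial_pmf(k, n_items, a, b) for k in range(n_items + 1)]
mean_score = sum(k * p for k, p in enumerate(pmf))

print(round(sum(pmf), 6))     # probabilities sum to 1
print(round(mean_score, 6))   # equals n * a / (a + b) = 4
```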

  14. Development of an item bank for computerized adaptive test (CAT) measurement of pain

    DEFF Research Database (Denmark)

    Petersen, Morten Aa.; Aaronson, Neil K; Chie, Wei-Chu

    2016-01-01

    PURPOSE: Patient-reported outcomes should ideally be adapted to the individual patient while maintaining comparability of scores across patients. This is achievable using computerized adaptive testing (CAT). The aim here was to develop an item bank for CAT measurement of the pain domain as measured...... were obtained from 1103 cancer patients from five countries. Psychometric evaluations showed that 16 items could be retained in a unidimensional item bank. Evaluations indicated that use of the CAT measure may reduce sample size requirements by 15-25% compared to using the QLQ-C30 pain scale....... CONCLUSIONS: We have established an item bank of 16 items suitable for CAT measurement of pain. While being backward compatible with the QLQ-C30, the new item bank will significantly improve measurement precision of pain. We recommend initiating CAT measurement by screening for pain using the two original QLQ...

  15. The Adaptation of Acceptance of Couple Violence Scale into Turkish: Validity and Reliability Studies

    Directory of Open Access Journals (Sweden)

    Özcan SEZER

    2008-01-01

    This study investigates the validity and reliability of the Turkish adaptation of the Acceptance of Couple Violence Scale (ACVS). The research data were obtained from 474 (M = 243, F = 231) high school students who were attending the 1st, 2nd and 3rd classes and came from middle socio-economic levels in Malatya. The Acceptance of Couple Violence Scale has 11 Likert-type items with a 4-point response format. The construct validity of the ACVS was examined using exploratory factor analysis with varimax rotation. A single independent factor with an eigenvalue over 1.00 was found. This factor explained 44% of total variance. To test concurrent validity, correlations between scores on the ACVS and the Aggressiveness Questionnaire were calculated. There was a significant relationship between scores on the two scales (r = .61). The Cronbach alpha coefficient of the scale was .87; the test-retest correlation coefficient was r = .80. Item-total correlation coefficients vary between .52 and .71. Findings show that the ACVS can be used with an acceptable level of validity and reliability for high school students.

  16. Test-retest reliability of selected items of Health Behaviour in School-aged Children (HBSC) survey questionnaire in Beijing, China

    Directory of Open Access Journals (Sweden)

    Liu Yang

    2010-08-01

    Background: Children's health and health behaviour are essential for their development and it is important to obtain abundant and accurate information to understand young people's health and health behaviour. The Health Behaviour in School-aged Children (HBSC) study is among the first large-scale international surveys on adolescent health through self-report questionnaires. So far, more than 40 countries in Europe and North America have been involved in the HBSC study. The purpose of this study is to assess the test-retest reliability of selected items in the Chinese version of the HBSC survey questionnaire in a sample of adolescents in Beijing, China. Methods: A sample of 95 male and female students aged 11 or 15 years old participated in a test and retest with a three-week interval. Student identity numbers of respondents were utilized to permit matching of test-retest questionnaires. 23 items concerning physical activity, sedentary behaviour, sleep and substance use were evaluated by using the percentage of response shifts and the single measure Intraclass Correlation Coefficients (ICC) with 95% confidence interval (CI) for all respondents and stratified by gender and age. Items on substance use were only evaluated for school children aged 15 years old. Results: The percentage of no response shift between test and retest varied from 32% for the item on computer use at weekends to 92% for the three items on smoking. Of all the 23 items evaluated, 6 items (26%) showed a moderate reliability, 12 items (52%) displayed a substantial reliability and 4 items (17%) indicated almost perfect reliability. No gender and age group difference of the test-retest reliability was found except for a few items on sedentary behaviour. Conclusions: The overall findings of this study suggest that most selected indicators in the HBSC survey questionnaire have satisfactory test-retest reliability for the students in Beijing. Further test-retest studies in a large

  17. Developing economic order quantity model for non-instantaneous deteriorating items in vendor-managed inventory (VMI) system

    Science.gov (United States)

    Tat, Roya; Allah Taleizadeh, Ata; Esmaeili, Maryam

    2015-05-01

    This paper develops an economic order quantity model for non-instantaneous deteriorating items with and without shortages to investigate the performance of the vendor-managed inventory (VMI) system. This model is developed for a two-level supply chain consisting of a single supplier and single retailer with a single non-instantaneous deteriorating item. A numerical example and sensitivity analysis are provided to illustrate how increasing or reducing the related parameters changes the optimal values of the decision variables of the two proposed models. The results show that VMI works better and incurs lower cost in all conditions.

  18. Statistical analysis of error rate of large-scale single flux quantum logic circuit by considering fluctuation of timing parameters

    International Nuclear Information System (INIS)

    Yamanashi, Yuki; Masubuchi, Kota; Yoshikawa, Nobuyuki

    2016-01-01

    The relationship between the timing margin and the error rate of the large-scale single flux quantum logic circuits is quantitatively investigated to establish a timing design guideline. We observed that the fluctuation in the set-up/hold time of single flux quantum logic gates caused by thermal noise is the most probable origin of the logical error of the large-scale single flux quantum circuit. The appropriate timing margin for stable operation of the large-scale logic circuit is discussed by taking into account the fluctuation of setup/hold time and the timing jitter in the single flux quantum circuits. As a case study, the dependence of the error rate of the 1-million-bit single flux quantum shift register on the timing margin is statistically analyzed. The result indicates that adjustment of timing margin and the bias voltage is important for stable operation of a large-scale SFQ logic circuit.

  19. Statistical analysis of error rate of large-scale single flux quantum logic circuit by considering fluctuation of timing parameters

    Energy Technology Data Exchange (ETDEWEB)

    Yamanashi, Yuki, E-mail: yamanasi@ynu.ac.jp [Department of Electrical and Computer Engineering, Yokohama National University, Tokiwadai 79-5, Hodogaya-ku, Yokohama 240-8501 (Japan); Masubuchi, Kota; Yoshikawa, Nobuyuki [Department of Electrical and Computer Engineering, Yokohama National University, Tokiwadai 79-5, Hodogaya-ku, Yokohama 240-8501 (Japan)

    2016-11-15

    The relationship between the timing margin and the error rate of the large-scale single flux quantum logic circuits is quantitatively investigated to establish a timing design guideline. We observed that the fluctuation in the set-up/hold time of single flux quantum logic gates caused by thermal noise is the most probable origin of the logical error of the large-scale single flux quantum circuit. The appropriate timing margin for stable operation of the large-scale logic circuit is discussed by taking into account the fluctuation of setup/hold time and the timing jitter in the single flux quantum circuits. As a case study, the dependence of the error rate of the 1-million-bit single flux quantum shift register on the timing margin is statistically analyzed. The result indicates that adjustment of timing margin and the bias voltage is important for stable operation of a large-scale SFQ logic circuit.

  20. A new Integrated Negative Symptom structure of the Positive and Negative Syndrome Scale (PANSS) in schizophrenia using item response analysis.

    Science.gov (United States)

    Khan, Anzalee; Lindenmayer, Jean-Pierre; Opler, Mark; Yavorsky, Christian; Rothman, Brian; Lucic, Luka

    2013-10-01

    Debate persists with regard to how best to categorize the syndromal dimension of negative symptoms in schizophrenia. The aim was to first review published Principal Components Analysis (PCA) of the PANSS, and extract items most frequently included in the negative domain, and secondly, to examine the quality of items using Item Response Theory (IRT) to select items that best represent a measurable dimension (or dimensions) of negative symptoms. First, 22 factor analyses and PCA met were included. Second, using a large dataset (n=7187) of participants in clinical trials with chronic schizophrenia, we extracted items loading on one or more PCA. Third, items not loading with a value of ≥ 0.5, or loading on more than one component with values of ≥ 0.5 were discarded. Fourth, resulting items were included in a non-parametric IRT and retained based on Option Characteristic Curves (OCCs) and Item Characteristic Curves (ICCs). 15 items loaded on a negative domain in at least one study, with Emotional Withdrawal loading on all studies. Non-parametric IRT retained nine items as an Integrated Negative Factor: Emotional Withdrawal, Blunted Affect, Passive/Apathetic Social Withdrawal, Poor Rapport, Lack of Spontaneity/Conversation Flow, Active Social Avoidance, Disturbance of Volition, Stereotyped Thinking and Difficulty in Abstract Thinking. This is the first study to use a psychometric IRT process to arrive at a set of negative symptom items. Future steps will include further examination of these nine items in terms of their stability, sensitivity to change, and correlations with functional and cognitive outcomes. © 2013 Elsevier B.V. All rights reserved.

  1. Reliability and known-group validity of the Arabic version of the 8-item Morisky Medication Adherence Scale among type 2 diabetes mellitus patients.

    Science.gov (United States)

    Ashur, S T; Shamsuddin, K; Shah, S A; Bosseri, S; Morisky, D E

    2015-12-13

    No validation study has previously been made for the Arabic version of the 8-item Morisky Medication Adherence Scale (MMAS-8(©)) as a measure for medication adherence in diabetes. This study in 2013 tested the reliability and validity of the Arabic MMAS-8 for type 2 diabetes mellitus patients attending a referral centre in Tripoli, Libya. A convenience sample of 103 patients self-completed the questionnaire. Reliability was tested using Cronbach alpha, average inter-item correlation and Spearman-Brown coefficient. Known-group validity was tested by comparing MMAS-8 scores of patients grouped by glycaemic control. The Arabic version showed adequate internal consistency (α = 0.70) and moderate split-half reliability (r = 0.65). Known-group validity was supported as a significant association was found between medication adherence and glycaemic control, with a moderate effect size (ϕc = 0.34). The Arabic version displayed good psychometric properties and could support diabetes research and practice in Arab countries.
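
    For readers unfamiliar with the reliability statistics named above, the following is a minimal sketch of Cronbach's alpha and the Spearman-Brown step-up formula using only the Python standard library. The response matrix is hypothetical and is not data from the study; the function names are chosen for illustration.

```python
from statistics import variance


def cronbach_alpha(rows):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(rows[0])
    item_vars = [variance([r[j] for r in rows]) for j in range(k)]
    total_var = variance([sum(r) for r in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)


def spearman_brown(r_half):
    """Step up a correlation between two half-tests to full-test reliability."""
    return 2 * r_half / (1 + r_half)


# Hypothetical responses: 4 respondents x 3 items.
responses = [
    [1, 1, 0],
    [1, 0, 1],
    [0, 0, 0],
    [1, 1, 1],
]
print(round(cronbach_alpha(responses), 3))
print(round(spearman_brown(0.5), 3))
```

    Sample variances are used throughout; because alpha depends only on a ratio of variances, population variances would give the same value.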

  2. Poisson and negative binomial item count techniques for surveys with sensitive question.

    Science.gov (United States)

    Tian, Guo-Liang; Tang, Man-Lai; Wu, Qin; Liu, Yin

    2017-04-01

    Although the item count technique is useful in surveys with sensitive questions, the privacy of respondents who possess the sensitive characteristic of interest may not be well protected due to a defect in its original design. In this article, we propose two new survey designs (namely the Poisson item count technique and the negative binomial item count technique) which replace the several independent Bernoulli random variables required by the original item count technique with a single Poisson or negative binomial random variable, respectively. The proposed models not only provide a closed-form variance estimate and a confidence interval within [0, 1] for the sensitive proportion, but also simplify the survey design of the original item count technique. Most importantly, the new designs do not leak respondents' privacy. Empirical results show that the proposed techniques perform satisfactorily in the sense that they yield accurate parameter estimates and confidence intervals.
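
    The abstract does not spell out the estimator. One simple way to see how a single count variable can carry the sensitive answer is a difference-in-means sketch. It assumes that a control group reports only a non-sensitive count X, while a treatment group reports X + Y, where Y is 1 if the respondent has the sensitive characteristic and 0 otherwise. The set-up, the function name, the clamping to [0, 1], and the data are all assumptions for illustration and should not be read as the authors' published estimator.

```python
from statistics import mean


def estimate_sensitive_proportion(control_counts, treatment_counts):
    """Moment-style estimate: E[X + Y] - E[X] = P(Y = 1).

    Assumption: both groups share the same distribution for the
    non-sensitive count X (for example, a Poisson count).
    The raw difference is clamped to [0, 1] so it is a valid proportion.
    """
    raw = mean(treatment_counts) - mean(control_counts)
    return min(1.0, max(0.0, raw))


# Hypothetical reported counts.
control = [2, 3, 1, 2]
treatment = [3, 3, 2, 2]
print(estimate_sensitive_proportion(control, treatment))
```

    No individual answer reveals Y in this set-up, because any reported total is compatible with either value of Y.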

  3. Scaling of the steady state and stability behaviour of single and two-phase natural circulation systems

    International Nuclear Information System (INIS)

    Vijayan, P.K.; Nayak, A.K.; Bade, M.H.; Kumar, N.; Saha, D.; Sinha, R.K.

    2002-01-01

    Scaling methods for both single-phase and two-phase natural circulation systems have been presented. For single-phase systems, simulation of the steady state flow can be achieved by preserving just one nondimensional parameter. For uniform diameter two-phase systems also, it is possible to simulate the steady state behaviour with just one non-dimensional parameter. Simulation of the stability behaviour requires geometric similarity in addition to the similarity of the physical parameters appearing in the governing equations. The scaling laws proposed have been tested with experimental data in case of single-phase natural circulation. (author)

  4. Abusive Supervision Scale Development in Indonesia

    Directory of Open Access Journals (Sweden)

    Fenika Wulani

    2014-02-01

    The purpose of this study was to develop a scale of abusive supervision in Indonesia. The study was conducted with a different context and scale development method from Tepper’s (2000) abusive supervision scale. The abusive supervision scale from Tepper (2000) was developed in the U.S., which has a cultural orientation of low power distance. The current study was conducted in Indonesia, which has a high power distance. This study used interview procedures to obtain information about supervisors' abusive behavior, and the items were also assessed by experts. The results of this study indicated that abusive supervision was a 3-dimensional construct: anger-active abuse (6 items), humiliation-active abuse (4 items), and passive abuse (15 items). These scales have internal reliabilities of 0.947, 0.922, and 0.845, respectively.

  5. Hippocampal damage equally impairs memory for single items and memory for conjunctions.

    Science.gov (United States)

    Stark, Craig E L; Squire, Larry R

    2003-01-01

    single-item and associative memory.

  6. Validity and reliability of the Spanish version of the 10-item CD-RISC in patients with fibromyalgia

    Science.gov (United States)

    2014-01-01

    Background: No resilience scale has been validated in Spanish patients with fibromyalgia. The aim of this study was to evaluate the validity and reliability of the 10-item CD-RISC in a sample of Spanish patients with fibromyalgia. Methods: Design: observational prospective multicenter study. Sample: patients with diagnoses of fibromyalgia recruited from primary care settings (N = 208). Instruments: in addition to sociodemographic data, the following questionnaires were administered: Pain Visual Analogue Scale (PVAS), the 10-item Connor-Davidson Resilience scale (10-item CD-RISC), the Fibromyalgia Impact Questionnaire (FIQ), the Hospital Anxiety and Depression Scale (HADS), the Pain Catastrophizing Scale (PCS), the Chronic Pain Acceptance Questionnaire (CPAQ), and the Mindful Attention Awareness Scale (MAAS). Results: Regarding construct validity, the factor solution in the Principal Component Analysis (PCA) was considered adequate, as the KMO test had a value of 0.91 and Bartlett's test of sphericity was significant (χ2 = 852.8; df = 45; p […] fibromyalgia, acceptable psychometric properties, with a high level of reliability and validity. PMID:24484847

  7. Retrieval of very large numbers of items in the Web of Science: an exercise to develop accurate search strategies

    NARCIS (Netherlands)

    Arencibia-Jorge, R.; Leydesdorff, L.; Chinchilla-Rodríguez, Z.; Rousseau, R.; Paris, S.W.

    2009-01-01

    The Web of Science interface counts at most 100,000 retrieved items from a single query. If the query results in a dataset containing more than 100,000 items the number of retrieved items is indicated as >100,000. The problem studied here is how to find the exact number of items in a query that

  8. Differential item functioning analysis of the Vanderbilt Expertise Test for cars.

    Science.gov (United States)

    Lee, Woo-Yeol; Cho, Sun-Joo; McGugin, Rankin W; Van Gulick, Ana Beth; Gauthier, Isabel

    2015-01-01

    The Vanderbilt Expertise Test for cars (VETcar) is a test of visual learning for contemporary car models. We used item response theory to assess the VETcar and in particular used differential item functioning (DIF) analysis to ask if the test functions the same way in laboratory versus online settings and for different groups based on age and gender. An exploratory factor analysis found evidence of multidimensionality in the VETcar, although a single dimension was deemed sufficient to capture the recognition ability measured by the test. We selected a unidimensional three-parameter logistic item response model to examine item characteristics and subject abilities. The VETcar had satisfactory internal consistency. A substantial number of items showed DIF at a medium effect size for test setting and for age group, whereas gender DIF was negligible. Because online subjects were on average older than those tested in the lab, we focused on the age groups to conduct a multigroup item response theory analysis. This revealed that most items on the test favored the younger group. DIF could be more the rule than the exception when measuring performance with familiar object categories, therefore posing a challenge for the measurement of either domain-general visual abilities or category-specific knowledge.
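
    As background for the model named above, the three-parameter logistic (3PL) item response function can be sketched as follows. The parameter values are hypothetical and are not estimates from the VETcar; the textbook form without the optional 1.7 scaling constant is assumed.

```python
import math


def p_3pl(theta, a, b, c):
    """Probability of a correct response under the 3PL model.

    theta: subject ability
    a: item discrimination
    b: item difficulty
    c: lower asymptote (pseudo-guessing parameter)
    """
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item: moderate discrimination, average difficulty, 20% guessing.
for theta in (-2.0, 0.0, 2.0):
    print(theta, round(p_3pl(theta, a=1.0, b=0.0, c=0.2), 3))
```

    When ability equals difficulty, the probability sits halfway between the guessing floor c and 1, which is why the middle line of the output is 0.6.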

  9. The construct validity of the Major Depression Inventory: A Rasch analysis of a self-rating scale in primary care.

    Science.gov (United States)

    Nielsen, Marie Germund; Ørnbøl, Eva; Vestergaard, Mogens; Bech, Per; Christensen, Kaj Sparle

    2017-06-01

    We aimed to assess the measurement properties of the ten-item Major Depression Inventory when used on clinical suspicion in general practice by performing a Rasch analysis. General practitioners asked consecutive persons to respond to the web-based Major Depression Inventory on clinical suspicion of depression. We included 22 practices and 245 persons. Rasch analysis was performed using RUMM2030 software. The Rasch model fit suggests that all items contribute to a single underlying trait (defined as internal construct validity). Mokken analysis was used to test dimensionality and scalability. Our Rasch analysis showed misfit concerning the sleep and appetite items (items 9 and 10). The response categories were disordered for eight items. After modifying the original six-point to a four-point scoring system for all items, we achieved ordered response categories for all ten items. The person separation reliability was acceptable (0.82) for the initial model. Dimensionality testing did not support combining the ten items to create a total score. The scale appeared to be well targeted to this clinical sample. No significant differential item functioning was observed for gender, age, work status and education. The Rasch and Mokken analyses revealed two dimensions, but the Major Depression Inventory showed fit to one scale if items 9 and 10 were excluded. Our study indicated scalability problems in the current version of the Major Depression Inventory. The conducted analysis revealed better statistical fit when items 9 and 10 were excluded. Copyright © 2017 Elsevier Inc. All rights reserved.

  10. Differential item functioning (DIF) analyses of health-related quality of life instruments using logistic regression

    DEFF Research Database (Denmark)

    Scott, Neil W.; Fayers, Peter M.; Aaronson, Neil K.

    2010-01-01

    Differential item functioning (DIF) methods can be used to determine whether different subgroups respond differently to particular items within a health-related quality of life (HRQoL) subscale, after allowing for overall subgroup differences in that scale. This article reviews issues that arise...

  11. Modeling the World Health Organization Disability Assessment Schedule II using non-parametric item response models.

    Science.gov (United States)

    Galindo-Garre, Francisca; Hidalgo, María Dolores; Guilera, Georgina; Pino, Oscar; Rojo, J Emilio; Gómez-Benito, Juana

    2015-03-01

    The World Health Organization Disability Assessment Schedule II (WHO-DAS II) is a multidimensional instrument developed for measuring disability. It comprises six domains (getting around, self-care, getting along with others, life activities and participation in society). The main purpose of this paper is the evaluation of the psychometric properties for each domain of the WHO-DAS II with parametric and non-parametric Item Response Theory (IRT) models. A secondary objective is to assess whether the WHO-DAS II items within each domain form a hierarchy of invariantly ordered severity indicators of disability. A sample of 352 patients with a schizophrenia spectrum disorder is used in this study. The 36-item WHO-DAS II was administered during the consultation. Partial Credit and Mokken scale models are used to study the psychometric properties of the questionnaire. The psychometric properties of the WHO-DAS II scale are satisfactory for all the domains. However, we identify a few items that do not discriminate satisfactorily between different levels of disability and cannot be invariantly ordered in the scale. In conclusion, the WHO-DAS II can be used to assess overall disability in patients with schizophrenia, but some domains are too general to assess functionality in these patients because they contain items that are not applicable to this pathology. Copyright © 2014 John Wiley & Sons, Ltd.

  12. Factor structure and internal consistency of the 12-item General Health Questionnaire (GHQ-12) and the Subjective Vitality Scale (VS), and the relationship between them: a study from France

    Directory of Open Access Journals (Sweden)

    Ismaïl Amany

    2009-03-01

    Abstract Background: The objectives of this study were to test the factor structure and internal consistency of the 12-item General Health Questionnaire (GHQ-12) and the Subjective Vitality Scale (VS) in elderly French people, and to test the relationship between these two questionnaires. Methods: Using a standard 'forward-backward' translation procedure, the English language versions of the two instruments (i.e. the 12-item General Health Questionnaire and the Subjective Vitality Scale) were translated into French. A sample of adults aged 58–72 years then completed both questionnaires. Internal consistency was assessed by Cronbach's alpha coefficient. The factor structures of the two instruments were extracted by confirmatory factor analysis (CFA). Finally, the relationship between the two instruments was assessed by correlation analysis. Results: In all, 217 elderly adults participated in the study. The mean age of the respondents was 61.7 (SD = 6.2) years. The mean GHQ-12 score was 17.4 (SD = 8.0), and analysis showed satisfactory internal consistency (Cronbach's alpha coefficient = 0.78). The mean VS score was 22.4 (SD = 7.4) and its internal consistency was found to be good (Cronbach's alpha coefficient = 0.83). While CFA showed that the VS was uni-dimensional, analysis for the GHQ-12 demonstrated a good fit not only to the two-factor model (positive vs. negative items) but also to a three-factor model. As expected, there was a strong and significant negative correlation between the GHQ-12 and the VS (r = -0.71, P […]). Conclusion: The results showed that the French versions of the 12-item General Health Questionnaire (GHQ-12) and the Subjective Vitality Scale (VS) are reliable measures of psychological distress and vitality. They also confirm a significant negative correlation between these two instruments, lending support to their convergent validity in an elderly French population. The findings indicate that both measures have good structural

  13. Agreeableness scale: short version: item selection and psychometric properties (Escala fatorial de socialização: versão reduzida: seleção de itens e propriedades psicométricas)

    Directory of Open Access Journals (Sweden)

    Maiana Farias Oliveira Nunes

    2010-01-01

    This study aimed to select items from the Agreeableness Factor Scale (Escala Fatorial de Socialização, EFS) to obtain a short version of the test that keeps adequate psychometric properties. The sample comprised 1,100 participants. Items were selected using a qualitative strategy, which focused on item content without explicit clinical descriptions, and a quantitative analysis based on the Rasch model. Based on these criteria, the scale was reduced from 70 to 28 items. To check the psychometric properties of the short version, both versions were compared using Rasch indices and by reanalyzing validity studies conducted with the original scale. The short version kept good psychometric properties, which suggests that it can be used when assessment time is restricted.

  14. A Confirmatory Factor Analysis of Reilly's Role Overload Scale

    Science.gov (United States)

    Thiagarajan, Palaniappan; Chakrabarty, Subhra; Taylor, Ronald D.

    2006-01-01

    In 1982, Reilly developed a 13-item scale to measure role overload. This scale has been widely used, but most studies did not assess the unidimensionality of the scale. Given the significance of unidimensionality in scale development, the current study reports a confirmatory factor analysis of the 13-item scale in two samples. Based on the…

  15. Elders Health Empowerment Scale

    Science.gov (United States)

    2014-01-01

    Introduction: Empowerment refers to patient skills that allow them to become primary decision-makers in control of daily self-management of health problems. Important as the concept is, particularly for elders with chronic diseases, few available instruments have been validated for use with Spanish-speaking people. Objective: Translate and adapt the Health Empowerment Scale (HES) for a sample of Spanish-speaking older adults and perform its psychometric validation. Methods: The HES was adapted based on the Diabetes Empowerment Scale-Short Form. Where "diabetes" was mentioned in the original tool, it was replaced with "health" terms to cover all kinds of conditions that could affect health empowerment. Statistical and psychometric analyses were conducted on 648 urban-dwelling seniors. Results: The HES had an acceptable internal consistency with a Cronbach's α of 0.89. Convergent validity was supported by significant Pearson correlations between the HES total and item scores and the General Self Efficacy Scale (r = 0.77), Swedish Rheumatic Disease Empowerment Scale (r = 0.69) and Making Decisions Empowerment Scale (r = 0.70). Construct validity was evaluated using item analysis, a split-half test and corrected item-to-total correlation coefficients, with good internal consistency (α > 0.8). Content validity was supported by Scale and Item Content Validity Indices of 0.98 and 1.0, respectively. Conclusions: The HES had acceptable face validity and reliability coefficients which, added to its ease of administration and users' unbiased comprehension, could make it a suitable tool for evaluating empowerment-based medical education programs for elderly outpatients. PMID:25767307

  16. The impact of ordinate scaling on the visual analysis of single-case data.

    Science.gov (United States)

    Dart, Evan H; Radley, Keith C

    2017-08-01

    Visual analysis is the primary method for detecting the presence of treatment effects in graphically displayed single-case data and it is often referred to as the "gold standard." Although researchers have developed standards for the application of visual analysis (e.g., Horner et al., 2005), over- and underestimation of effect size magnitude is not uncommon among analysts. Several characteristics have been identified as potential contributors to these errors; however, researchers have largely focused on characteristics of the data itself (e.g., autocorrelation), paying less attention to characteristics of the graphic display which are largely in control of the analyst (e.g., ordinate scaling). The current study investigated the impact that differences in ordinate scaling, a graphic display characteristic, had on experts' accuracy in judgments regarding the magnitude of effect present in single-case percentage data. 32 participants were asked to evaluate eight ABAB data sets (2 each presenting null, small, moderate, and large effects) along with three iterations of each (32 graphs in total) in which only the ordinate scale was manipulated. Results suggest that raters are less accurate in their detection of treatment effects as the ordinate scale is constricted. Additionally, raters were more likely to overestimate the size of a treatment effect when the ordinate scale was constricted. Copyright © 2017 Society for the Study of School Psychology. Published by Elsevier Ltd. All rights reserved.

  17. Measuring everyday functional competence using the Rasch assessment of everyday activity limitations (REAL) item bank.

    Science.gov (United States)

    Oude Voshaar, Martijn A H; Ten Klooster, Peter M; Vonkeman, Harald E; van de Laar, Mart A F J

    2017-11-01

    Traditional patient-reported physical function instruments often poorly differentiate patients with mild-to-moderate disability. We describe the development and psychometric evaluation of a generic item bank for measuring everyday activity limitations in outpatient populations. Seventy-two items generated from patient interviews and mapped to the International Classification of Functioning, Disability and Health (ICF) domestic life chapter were administered to 1128 adults representative of the Dutch population. The partial credit model was fitted to the item responses and evaluated with respect to its assumptions, model fit, and differential item functioning (DIF). Measurement performance of a computerized adaptive testing (CAT) algorithm was compared with the SF-36 physical functioning scale (PF-10). A final bank of 41 items was developed. All items demonstrated acceptable fit to the partial credit model and measurement invariance across age, sex, and educational level. Five- and ten-item CAT simulations were shown to have high measurement precision, which exceeded that of SF-36 physical functioning scale across the physical function continuum. Floor effects were absent for a 10-item empirical CAT simulation, and ceiling effects were low (13.5%) compared with SF-36 physical functioning (38.1%). CAT also discriminated better than SF-36 physical functioning between age groups, number of chronic conditions, and respondents with or without rheumatic conditions. The Rasch assessment of everyday activity limitations (REAL) item bank will hopefully prove a useful instrument for assessing everyday activity limitations. T-scores obtained using derived measures can be used to benchmark physical function outcomes against the general Dutch adult population.
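
    For orientation, the category probabilities of the partial credit model mentioned above can be sketched as follows. The threshold values are hypothetical and are not parameters from the REAL item bank; the function name is chosen for illustration.

```python
import math


def pcm_probs(theta, thresholds):
    """Category probabilities under the partial credit model.

    theta: person location on the latent scale
    thresholds: step difficulties delta_1..delta_M for an item with M + 1 categories
    Returns a list of M + 1 probabilities for categories 0..M.
    """
    # Cumulative sums of (theta - delta_j); category 0 has an empty sum of 0.
    cumulative = [0.0]
    for delta in thresholds:
        cumulative.append(cumulative[-1] + (theta - delta))
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(cumulative)
    weights = [math.exp(c - top) for c in cumulative]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical 4-category item with three step difficulties.
print([round(p, 3) for p in pcm_probs(0.5, [-1.0, 0.0, 1.5])])
```

    A computerized adaptive test would evaluate such probabilities repeatedly while choosing the next item, which is one reason to keep the computation numerically stable.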

  18. Validation of the MOS Social Support Survey 6-item (MOS-SSS-6) measure with two large population-based samples of Australian women.

    Science.gov (United States)

    Holden, Libby; Lee, Christina; Hockey, Richard; Ware, Robert S; Dobson, Annette J

    2014-12-01

    This study aimed to validate a 6-item 1-factor global measure of social support developed from the Medical Outcomes Study Social Support Survey (MOS-SSS) for use in large epidemiological studies. Data were obtained from two large population-based samples of participants in the Australian Longitudinal Study on Women's Health. The two cohorts were aged 53-58 and 28-33 years at data collection (N = 10,616 and 8,977, respectively). Items selected for the 6-item 1-factor measure were derived from the factor structure obtained from unpublished work using an earlier wave of data from one of these cohorts. Descriptive statistics, including polychoric correlations, were used to describe the abbreviated scale. Cronbach's alpha was used to assess internal consistency and confirmatory factor analysis to assess scale validity. Concurrent validity was assessed using correlations between the new 6-item version and established 19-item version, and other concurrent variables. In both cohorts, the new 6-item 1-factor measure showed strong internal consistency and scale reliability. It had excellent goodness-of-fit indices, similar to those of the established 19-item measure. Both versions correlated similarly with concurrent measures. The 6-item 1-factor MOS-SSS measures global functional social support with fewer items than the established 19-item measure.

  19. Cardiac Depression Scale: Mokken scaling in heart failure patients

    Directory of Open Access Journals (Sweden)

    Ski Chantal F

    2012-11-01

    Abstract Background: There is a high prevalence of depression in patients with heart failure (HF) that is associated with worsening prognosis. Using a reliable and valid instrument to measure depression in this population is therefore essential. We validated the Cardiac Depression Scale (CDS) in heart failure patients using a model of ordinal unidimensional measurement known as Mokken scaling. Findings: We administered the CDS in face-to-face interviews to 603 patients with HF. Data were analysed using Mokken scale analysis. Items of the CDS formed a statistically significant unidimensional Mokken scale of low strength (H […] 0.8). Conclusions: The CDS has a hierarchy of items which can be interpreted in terms of the increasingly serious effects of depression occurring as a result of HF. Identifying an appropriate instrument to measure depression in patients with HF allows for early identification and better medical management.

  20. FABRICATING EMPLOYEE ENGAGEMENT FOR ORGANIZATIONAL EFFECTIVENESS: FRAMING ITEMS FOR THE 'CAR-PER-ET-WELL' SCALE

    Directory of Open Access Journals (Sweden)

    Dr. Manodip Ray Chauduri

    2017-04-01

    Organizations thrive on people. Organizational excellence revolves around the degree and extent of human involvement at work. Having a committed workforce ensures satisfaction, consummation and fulfillment in the minds of employees. A satisfied worker is a happy worker and can prove to be most productive, prolific and industrious in his work and in the execution of his responsibilities. With a brief introduction to the concept of employee engagement, the paper, through a detailed literature survey, outlines various aspects of employee engagement, underlying employee career prospects, the significance of an ethical framework and the significance of employee well-being in the organizational domain. The objective of the paper is to reach an understanding of certain identifiable areas within the field of employee engagement, viz. career, performance, ethics and wellness. These issues are quite pertinent for the competitive survival of organizations in the current turbulent business climate. A scale, "car-per-et-well", has been developed for item analysis in the present study. The study attempts to establish the relevance of employee engagement for organizational accomplishment.

  1. Comparison of the Fullerton Advanced Balance Scale, Mini-BESTest, and Berg Balance Scale to Predict Falls in Parkinson Disease.

    Science.gov (United States)

    Schlenstedt, Christian; Brombacher, Stephanie; Hartwigsen, Gesa; Weisser, Burkhard; Möller, Bettina; Deuschl, Günther

    2016-04-01

    The correct identification of patients with Parkinson disease (PD) at risk for falling is important to initiate appropriate treatment early. This study compared the Fullerton Advanced Balance (FAB) scale with the Mini-Balance Evaluation Systems Test (Mini-BESTest) and Berg Balance Scale (BBS) to identify individuals with PD at risk for falls and to analyze which of the items of the scales best predict future falls. This was a prospective study to assess predictive criterion-related validity. The study was conducted at a university hospital in an urban community. Eighty-five patients with idiopathic PD (Hoehn and Yahr stages: 1-4) participated in the study. Measures were number of falls (assessed prospectively over 6 months), FAB scale, Mini-BESTest, BBS, and Unified Parkinson's Disease Rating Scale. The FAB scale, Mini-BESTest, and BBS showed similar accuracy to predict future falls, with values for area under the curve (AUC) of the receiver operating characteristic (ROC) curve of 0.68, 0.65, and 0.69, respectively. A model combining the items "tandem stance," "rise to toes," "one-leg stance," "compensatory stepping backward," "turning," and "placing alternate foot on stool" had an AUC of 0.84 of the ROC curve. There was a dropout rate of 19/85 participants. The FAB scale, Mini-BESTest, and BBS provide moderate capacity to predict "fallers" (people with one or more falls) from "nonfallers." Only some items of the 3 scales contribute to the detection of future falls. Clinicians should particularly focus on the item "tandem stance" along with the items "one-leg stance," "rise to toes," "compensatory stepping backward," "turning 360°," and "placing foot on stool" when analyzing postural control deficits related to fall risk. Future research should analyze whether balance training including the aforementioned items is effective in reducing fall risk. © 2016 American Physical Therapy Association.
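
    The AUC values quoted above can be read as the probability that a randomly chosen faller receives a higher risk score than a randomly chosen nonfaller. A minimal sketch of that pairwise computation follows. The scores are hypothetical, and the orientation is an assumption: higher values must indicate higher risk, so a balance scale on which lower scores mean worse balance would need to be negated first.

```python
def auc(pos_scores, neg_scores):
    """Area under the ROC curve via pairwise comparison.

    pos_scores: risk scores of the positive group (e.g. fallers)
    neg_scores: risk scores of the negative group (e.g. nonfallers)
    Ties between a positive and a negative score count as half a win.
    """
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


# Hypothetical risk scores (higher = greater predicted fall risk).
fallers = [2, 3]
nonfallers = [1, 2]
print(auc(fallers, nonfallers))  # 0.875
```

    An AUC of 0.5 corresponds to chance-level discrimination and 1.0 to perfect separation, which puts the reported single-scale values of 0.65 to 0.69 in the moderate range.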

  2. The Role of Content and Context in PISA Interest Scales: A study of the embedded interest items in the PISA 2006 science assessment

    Science.gov (United States)

    Drechsel, Barbara; Carstensen, Claus; Prenzel, Manfred

    2011-01-01

    This paper focuses on interest in science as one of the attitudinal aspects of scientific literacy. Large-scale data from the Programme for International Student Assessment (PISA) 2006 are analysed in order to describe student interest more precisely. So far the analyses have provided a general indicator of interest, aggregated over all contexts and contents in the science test. With its innovative approach, PISA embeds interest items within the cognitive test unit and its contents and contexts. The main difference from conventional interest measures is that in most questionnaires, a relatively small number of interest items cover broad fields of contents and contexts. The science units represent a number of systematically differentiated scientific contexts and contents. The units' stimulus texts allow for concrete descriptions of relevant content aspects, applications, and contexts. In the analyses, multidimensional item response models are applied in order to disentangle student interest. The results indicate that multidimensional models fit the data. A two-dimensional model separating interest into two different knowledge of science dimensions described in the PISA science framework is further analysed with respect to gender, performance differences, and country. The findings give a comprehensive description of students' interest in science. The paper deals with methodological problems and describes requirements of the test construction for further assessments. The results are discussed with regard to their significance for science education.

  3. Perception that "everything requires a lot of effort": transcultural SCL-25 item validation.

    Science.gov (United States)

    Moreau, Nicolas; Hassan, Ghayda; Rousseau, Cécile; Chenguiti, Khalid

    2009-09-01

    This brief report illustrates how the migration context can affect specific item validity of mental health measures. The SCL-25 was administered to 432 recently settled immigrants (220 Haitian and 212 Arabs). We performed descriptive analyses, as well as Infit and Outfit statistics analyses using WINSTEPS Rasch Measurement Software based on Item Response Theory. The participants' comments about the item You feel everything requires a lot of effort in the SCL-25 were also qualitatively analyzed. Results revealed that the item You feel everything requires a lot of effort is an outlier and does not adjust in an expected and valid fashion with its cluster items, as it is over-endorsed by Haitian and Arab healthy participants. Our study thus shows that, in transcultural mental health research, the cultural and migratory contexts may interact and significantly influence the meaning of some symptom items and consequently, the validity of symptom scales.

  4. Anti-control of chaos of single time-scale brushless DC motor.

    Science.gov (United States)

    Ge, Zheng-Ming; Chang, Ching-Ming; Chen, Yen-Sheng

    2006-09-15

    Anti-control of chaos of single time-scale brushless DC motors is studied in this paper. In order to analyse a variety of periodic and chaotic phenomena, we employ several numerical techniques such as phase portraits, bifurcation diagrams and Lyapunov exponents. Anti-control of chaos can be achieved by adding an external constant term or an external periodic term.

  5. The Long-Term Conditions Questionnaire: conceptual framework and item development.

    Science.gov (United States)

    Peters, Michele; Potter, Caroline M; Kelly, Laura; Hunter, Cheryl; Gibbons, Elizabeth; Jenkinson, Crispin; Coulter, Angela; Forder, Julien; Towers, Ann-Marie; A'Court, Christine; Fitzpatrick, Ray

    2016-01-01

    To identify the main issues of importance when living with long-term conditions to refine a conceptual framework for informing the item development of a patient-reported outcome measure for long-term conditions. Semi-structured qualitative interviews (n=48) were conducted with people living with at least one long-term condition. Participants were recruited through primary care. The interviews were transcribed verbatim and analyzed by thematic analysis. The analysis served to refine the conceptual framework, based on reviews of the literature and stakeholder consultations, for developing candidate items for a new measure for long-term conditions. Three main organizing concepts were identified: impact of long-term conditions, experience of services and support, and self-care. The findings helped to refine a conceptual framework, leading to the development of 23 items that represent issues of importance in long-term conditions. The 23 candidate items formed the first draft of the measure, currently named the Long-Term Conditions Questionnaire. The aim of this study was to refine the conceptual framework and develop items for a patient-reported outcome measure for long-term conditions, including single and multiple morbidities and physical and mental health conditions. Qualitative interviews identified the key themes for assessing outcomes in long-term conditions, and these underpinned the development of the initial draft of the measure. These initial items will undergo cognitive testing to refine the items prior to further validation in a survey.

  6. Assessing Health Status in Inflammatory Bowel Disease using a Novel Single-Item Numeric Rating Scale

    Science.gov (United States)

    Surti, Bijal; Spiegel, Brennan; Ippoliti, Andrew; Vasiliauskas, Eric; Simpson, Peter; Shih, David; Targan, Stephan; McGovern, Dermot; Melmed, Gil Y.

    2014-01-01

    Background: Current instruments used to measure disease activity and health-related quality of life (HRQOL) in patients with Crohn’s disease (CD) and ulcerative colitis (UC) are often cumbersome, time-consuming, and expensive; although used in clinical trials, they are not convenient for clinical practice. A numeric rating scale (NRS) is a quick, inexpensive, and convenient patient-reported outcome (PRO) that can capture the patient’s overall perception of health. Aims: To assess the validity, reliability, and responsiveness of an NRS and evaluate its use in clinical practice in patients with CD and UC. Methods: We prospectively evaluated patient-reported NRS scores and measured correlations between NRS and a range of severity measures, including physician-reported NRS, Crohn’s disease activity index (CDAI), Harvey-Bradshaw index (HBI), inflammatory bowel disease questionnaire (IBDQ), and C-reactive protein (CRP) in patients with CD. Subsequently, we evaluated the correlation between the NRS and standard measures of health status (HBI or simple colitis clinical activity index [SCCAI]) and laboratory tests (sedimentation rate [ESR], CRP, and fecal calprotectin) in patients with CD and UC. Results: The patient-reported NRS showed excellent correlation with CDAI (R²=0.59, p<0.0001), IBDQ (R²=0.66, p<0.0001), and HBI (R²=0.32, p<0.0001) in patients with CD. The NRS showed poor, but statistically significant correlation with SCCAI (R²=0.25, p<0.0001) in patients with UC. The NRS did not correlate with CRP, ESR, or calprotectin. The NRS was reliable and responsive to change. Conclusions: The NRS is a valid, reliable, and responsive measure that may be useful to evaluate patients with CD and possibly UC. PMID:23250673

  7. The relationship between early changes in the HAMD-17 anxiety/somatization factor items and treatment outcome among depressed outpatients.

    Science.gov (United States)

    Farabaugh, Amy; Mischoulon, David; Fava, Maurizio; Wu, Shirley L; Mascarini, Alessandra; Tossani, Eliana; Alpert, Jonathan E

    2005-03-01

    The 17-item Hamilton Rating Scale for Depression (HAMD-17) Anxiety/Somatization factor includes six items: Anxiety (psychic), Anxiety (somatic), Somatic Symptoms (gastrointestinal), Somatic Symptoms (general), Hypochondriasis and Insight. This study examines the relationship between early changes (defined as those observed between baseline and week 1) in these HAMD-17 Anxiety/Somatization Factor items and treatment outcome among major depressive disorder (MDD) patients who participated in a study comparing the antidepressant efficacy of a standardized extract of hypericum with both placebo and fluoxetine. Following a 1-week, single-blind washout, patients with MDD diagnosed by the Structured Clinical Interview for DSM-IV (SCID) were randomized to 12 weeks of double-blind treatment with hypericum extract (900 mg/day), fluoxetine (20 mg/day) or placebo. The relationship between early changes in HAMD-17 anxiety/somatization factor items and treatment outcome was assessed separately for patients who received study treatment (hypericum or fluoxetine) versus placebo with a logistic regression method. One hundred and thirty-five patients (female 57%, mean age=37.3+/-11.0 years; mean baseline HAMD-17=19.7+/-3.2) were randomized to double-blind treatment and were included in the intent-to-treat (ITT) analyses. After adjusting for baseline HAMD-17 scores and for multiple comparisons with the Bonferroni correction, patients who remitted (HAMD-17 score Somatic Symptoms (General) scores than non-remitters. No other significant differences in early changes were noted for the remaining items between remitters versus non-remitters who received active treatment. For patients treated with placebo, early change was not predictive of remission for any of the items after Bonferroni correction. In conclusion, the presence of early improvement on the HAMD-17 item concerning fatigue and general somatic symptoms is significantly predictive of achieving remission at endpoint with

  8. Control Algorithms for Large-scale Single-axis Photovoltaic Trackers

    Directory of Open Access Journals (Sweden)

    Dorian Schneider

    2012-01-01

    The electrical yield of large-scale photovoltaic power plants can be greatly improved by employing solar trackers. While fixed-tilt superstructures are stationary and immobile, trackers move the PV-module plane in order to optimize its alignment to the sun. This paper introduces control algorithms for single-axis trackers (SAT), including a discussion of optimal alignment and backtracking. The results are used to simulate and compare the electrical yield of fixed-tilt and SAT systems. The proposed algorithms have been field tested, and are in operation in solar parks worldwide.
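
    The abstract does not give the paper's formulas, so the following is only a minimal sketch of the widely used geometric backtracking correction for flat terrain, not the author's algorithm. It assumes the ideal (true-tracking) rotation angle is already known; the function name `backtracking_angle` and the ground coverage ratio parameter `gcr` (module width divided by row pitch) are illustrative assumptions.

```python
import math


def backtracking_angle(true_tracking_deg: float, gcr: float) -> float:
    """Return a backtracked rotation angle in degrees (0 = horizontal).

    true_tracking_deg: ideal rotation that points the module plane at the
        sun, assumed to lie in [-90, 90] degrees (hypothetical input).
    gcr: ground coverage ratio, module width / row pitch, in (0, 1].
    """
    if not 0.0 < gcr <= 1.0:
        raise ValueError("gcr must be in (0, 1]")
    if abs(true_tracking_deg) > 90.0:
        raise ValueError("true_tracking_deg must be in [-90, 90]")

    ratio = math.cos(math.radians(true_tracking_deg)) / gcr
    if ratio >= 1.0:
        # Rows do not shade each other, so keep the ideal angle.
        return true_tracking_deg

    # Rotate back toward horizontal until the shadow of one row just
    # reaches the foot of the next: cos(theta_t - theta_b) = cos(theta_t) / gcr.
    correction = math.degrees(math.acos(max(ratio, 0.0)))
    return true_tracking_deg - math.copysign(correction, true_tracking_deg)
```

    Under these assumptions, a steep ideal angle such as 80 degrees with `gcr = 0.4` is reduced to roughly 16 degrees, while a moderate angle such as 30 degrees is returned unchanged.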

  9. Prevalence of item level negative symptoms in first episode psychosis diagnoses.

    LENUS (Irish Health Repository)

    Lyne, John

    2012-03-01

    The relevance of negative symptoms across the diagnostic spectrum of the psychoses remains uncertain. The purpose of this study was to report on prevalence of item and subscale level negative symptoms across the first episode psychosis (FEP) diagnostic spectrum in an epidemiological sample, and to ascertain whether items and subscales were more prevalent in a schizophrenia spectrum diagnoses group compared to an 'all other psychotic diagnoses' group. We measured negative symptoms in 330 patients presenting with FEP using the Scale for Assessment of Negative Symptoms (SANS), and ascertained diagnosis using the Structured Clinical Interview for DSM IV. Prevalence of SANS items and subscales was tabulated across all psychotic diagnoses, and logistic regression analysis determined which items and subscales were predictive of schizophrenia spectrum diagnoses. SANS items were most prevalent in schizophrenia spectrum conditions but frequently presented in other FEP diagnoses, particularly substance induced psychotic disorder and Major Depressive Disorder. Brief psychotic disorder and bipolar disorders had low levels of negative symptoms. SANS items and subscales which significantly predicted schizophrenia spectrum diagnoses were also frequently present in some of the other psychotic diagnoses. Conclusions: SANS items have high prevalence in FEP, and while commonest in schizophrenia spectrum conditions are not restricted to this diagnostic subgroup.

  10. Profiling medical school learning environments in Malaysia: a validation study of the Johns Hopkins Learning Environment Scale

    Directory of Open Access Journals (Sweden)

    Sean Tackett

    2015-07-01

    Purpose: While a strong learning environment is critical to medical student education, the assessment of medical school learning environments has confounded researchers. Our goal was to assess the validity and utility of the Johns Hopkins Learning Environment Scale (JHLES) for preclinical students at three Malaysian medical schools with distinct educational and institutional models. Two schools were new international partnerships, and the third was a school-leaver program established without international partnership. Methods: First- and second-year students responded anonymously to surveys at the end of the academic year. The surveys included the JHLES, a 28-item survey using five-point Likert scale response options, the Dundee Ready Educational Environment Measure (DREEM), the most widely used method to assess learning environments internationally, a personal growth scale, and single-item global learning environment assessment variables. Results: The overall response rate was 369/429 (86%). After adjusting for the medical school year, gender, and ethnicity of the respondents, the JHLES detected differences across institutions in four out of seven domains (57%), with each school having a unique domain profile. The DREEM detected differences in one out of five categories (20%). The JHLES was more strongly correlated than the DREEM to two thirds of the single-item variables and the personal growth scale. The JHLES showed high internal reliability for the total score (α=0.92) and the seven domains (α, 0.56-0.85). Conclusion: The JHLES detected variation between learning environment domains across three educational settings, thereby creating unique learning environment profiles. Interpretation of these profiles may allow schools to understand how they are currently supporting trainees and identify areas needing attention.

  11. Test-retest reliability at the item level and total score level of the Norwegian version of the Spinal Cord Injury Falls Concern Scale (SCI-FCS).

    Science.gov (United States)

    Roaldsen, Kirsti Skavberg; Måøy, Åsa Blad; Jørgensen, Vivien; Stanghelle, Johan Kvalvik

    2016-05-01

    Translation of the Spinal Cord Injury Falls Concern Scale (SCI-FCS), and investigation of test-retest reliability on item-level and total-score-level. Translation, adaptation and test-retest study. A specialized rehabilitation setting in Norway. Fifty-four wheelchair users with a spinal cord injury. The median age of the cohort was 49 years, and the median number of years after injury was 13. Interventions/measurements: The SCI-FCS was translated and back-translated according to guidelines. Individuals answered the SCI-FCS twice over the course of one week. We investigated item-level test-retest reliability using Svensson's rank-based statistical method for disagreement analysis of paired ordinal data. At the total-score level, we assessed relative reliability with intraclass correlation coefficients (ICC2.1), absolute reliability (measurement error) with the standard error of measurement (SEM) and the smallest detectable change (SDC), and internal consistency with Cronbach's alpha. All items showed satisfactory percentage agreement (≥69%) between test and retest. There were small but non-negligible systematic disagreements among three items; we found an 11-13% higher chance for a lower second score. There was no disagreement due to random variance. The test-retest agreement (ICC2.1) was excellent (0.83). The SEM was 2.6 (12%), and the SDC was 7.1 (32%). The Cronbach's alpha was high (0.88). The Norwegian SCI-FCS is highly reliable for wheelchair users with chronic spinal cord injuries.
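
    The abstract does not state which formulas the authors used for the SEM and SDC. As a sketch under that caveat, the conventional definitions are SEM = SD × √(1 − ICC) and SDC = 1.96 × √2 × SEM at the 95% level. The function names below are illustrative.

```python
import math


def sem_from_sd(sd: float, icc: float) -> float:
    """Standard error of measurement from the score SD and the ICC."""
    if not 0.0 <= icc <= 1.0:
        raise ValueError("icc must be in [0, 1]")
    return sd * math.sqrt(1.0 - icc)


def sdc_from_sem(sem: float, z: float = 1.96) -> float:
    """Smallest detectable change for an individual (95% level by default).

    The sqrt(2) factor reflects that a change score combines the
    measurement error of two administrations.
    """
    return z * math.sqrt(2.0) * sem
```

    Applying `sdc_from_sem` to the reported SEM of 2.6 gives about 7.2, close to the reported SDC of 7.1; the small gap would be consistent with the SEM having been rounded before publication, although that is an assumption.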

  12. Evaluation of item candidates for a diabetic retinopathy quality of life item bank.

    Science.gov (United States)

    Fenwick, Eva K; Pesudovs, Konrad; Khadka, Jyoti; Rees, Gwyn; Wong, Tien Y; Lamoureux, Ecosse L

    2013-09-01

    We are developing an item bank assessing the impact of diabetic retinopathy (DR) on quality of life (QoL) using a rigorous multi-staged process combining qualitative and quantitative methods. We describe here the first two qualitative phases: content development and item evaluation. After a comprehensive literature review, items were generated from four sources: (1) 34 previously validated patient-reported outcome measures; (2) five published qualitative articles; (3) eight focus groups and 18 semi-structured interviews with 57 DR patients; and (4) seven semi-structured interviews with diabetes or ophthalmic experts. Items were then evaluated during 3 stages, namely binning (grouping) and winnowing (reduction) based on key criteria and panel consensus; development of item stems and response options; and pre-testing of items via cognitive interviews with patients. The content development phase yielded 1,165 unique items across 7 QoL domains. After 3 sessions of binning and winnowing, items were reduced to a minimally representative set (n = 312) across 9 domains of QoL: visual symptoms; ocular surface symptoms; activity limitation; mobility; emotional; health concerns; social; convenience; and economic. After 8 cognitive interviews, 42 items were amended resulting in a final set of 314 items. We have employed a systematic approach to develop items for a DR-specific QoL item bank. The psychometric properties of the nine QoL subscales will be assessed using Rasch analysis. The resulting validated item bank will allow clinicians and researchers to better understand the QoL impact of DR and DR therapies from the patient's perspective.

  13. Criterion validity of the Short Mood and Feelings Questionnaire and one- and two-item depression screens in young adolescents

    Directory of Open Access Journals (Sweden)

    McCauley Elizabeth

    2010-02-01

    Background: The use of short screening questionnaires may be a promising option for identifying children at risk for depression in a community setting. The objective of this study was to assess the validity of the Short Mood and Feelings Questionnaire (SMFQ) and one- and two-item screening instruments for depressive disorders in a school-based sample of young adolescents. Methods: Participants were 521 sixth-grade students attending public middle schools. Child and parent versions of the SMFQ were administered to evaluate the child's depressive symptoms. The presence of any depressive disorder during the previous month was assessed using the Diagnostic Interview Schedule for Children (DISC) as the criterion standard. First, we assessed the diagnostic accuracy of child, parent, and combined scores of the full 13-item SMFQ by calculating the area under the receiver operating characteristic curve (AUC), sensitivity and specificity. The same approach was then used to evaluate the accuracy of a two-item scale consisting of only depressed mood and anhedonia items, and a single depressed mood item. Results: The combined child + parent SMFQ score showed the highest accuracy (AUC = 0.86). Diagnostic accuracy was lower for child (AUC = 0.73) and parent (AUC = 0.74) SMFQ versions. Corresponding versions of one- and two-item screens had lower AUC estimates, but the combined versions of the brief screens each still showed moderate accuracy. Furthermore, child and combined versions of the two-item screen demonstrated higher sensitivity (although lower specificity) than either the one-item screen or the full SMFQ. Conclusions: Under conditions where parents accompany children to screening settings (e.g. primary care), use of a child + parent version of the SMFQ is recommended. However, when parents are not available, and the cost of a false positive result is minimal, then a one- or two-item screen may be useful for initial identification of at-risk youth.
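
    To make the accuracy measures named above concrete, here is a minimal sketch of how sensitivity, specificity, and AUC can be computed from screening scores and a criterion diagnosis. The function names, the cutoff, and the toy data are invented for illustration and are not taken from the study.

```python
def sensitivity_specificity(scores, has_disorder, cutoff):
    """Screen positive when score >= cutoff; compare with the criterion."""
    tp = fn = tn = fp = 0
    for score, diseased in zip(scores, has_disorder):
        positive = score >= cutoff
        if diseased and positive:
            tp += 1
        elif diseased:
            fn += 1
        elif positive:
            fp += 1
        else:
            tn += 1
    return tp / (tp + fn), tn / (tn + fp)


def auc(scores, has_disorder):
    """AUC as the probability that a randomly chosen case scores higher
    than a randomly chosen non-case (ties count one half)."""
    cases = [s for s, d in zip(scores, has_disorder) if d]
    controls = [s for s, d in zip(scores, has_disorder) if not d]
    wins = sum(
        1.0 if c > n else 0.5 if c == n else 0.0
        for c in cases
        for n in controls
    )
    return wins / (len(cases) * len(controls))


# Invented toy data: questionnaire scores and criterion diagnoses.
toy_scores = [2, 9, 4, 12, 7, 3, 10, 5]
toy_diagnosis = [False, True, False, True, False, False, True, True]
```

    Lowering the cutoff trades specificity for sensitivity, which is the pattern the abstract reports for the two-item screen relative to the full questionnaire.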

  14. Islam and Environmental Consciousness: A New Scale Development.

    Science.gov (United States)

    Emari, Hossein; Vazifehdoust, Hossein; Nikoomaram, Hashem

    2017-04-01

    This research proposed a new construct, Islamic environmental consciousness (IEC), and developed a measurement scale to support this construct. Churchill's (J Mark Res 16(1):64-73, 1979) paradigm, adapted by Negra and Mzoughi (Internet Res 22(4):426-442, 2012), was utilized. A total of 32 items were generated based on the verses of the Qur'an from nine interviews with teachers in an Islamic seminary. This set of items was reduced to 19 after dropping redundant or non-representative items. In a pilot study, factor analysis of the 19-item scale yielded a two-factor structure scale of seven items with a reliability ranging from 0.7 to 0.8. The Islamic environmental consciousness scale (IECS) was statistically confirmed and validated in a subsequent investigation. The proposed measurement scale warrants further exploratory study. Future research should assess the IECS's validity across different Muslim countries, locales, and various Islamic schools of thought and practice. IEC is proposed as a new construct that focuses primarily on the Qur'an and seeks to achieve acceptance by both Sunni and Shia denominations. In this study, both cognitive attitudes and behavioral aspects were considered in the design of the IECS.

  15. Forced-Choice Assessment of Work-Related Maladaptive Personality Traits: Preliminary Evidence From an Application of Thurstonian Item Response Modeling.

    Science.gov (United States)

    Guenole, Nigel; Brown, Anna A; Cooper, Andrew J

    2018-06-01

    This article describes an investigation of whether Thurstonian item response modeling is a viable method for assessment of maladaptive traits. Forced-choice responses from 420 working adults to a broad-range personality inventory assessing six maladaptive traits were considered. The Thurstonian item response model's fit to the forced-choice data was adequate, while the fit of a counterpart item response model to responses to the same items but arranged in a single-stimulus design was poor. Monotrait heteromethod correlations indicated corresponding traits in the two formats overlapped substantially, although they did not measure equivalent constructs. A better goodness of fit and higher factor loadings for the Thurstonian item response model, coupled with a clearer conceptual alignment to the theoretical trait definitions, suggested that the single-stimulus item responses were influenced by biases that the independent clusters measurement model did not account for. Researchers may wish to consider forced-choice designs and appropriate item response modeling techniques such as Thurstonian item response modeling for personality questionnaire applications in industrial psychology, especially when assessing maladaptive traits. We recommend further investigation of this approach in actual selection situations and with different assessment instruments.

  16. Selection of material balance areas and item control areas

    International Nuclear Information System (INIS)

    1975-04-01

    Section 70.58, "Fundamental Nuclear Material Controls," of 10 CFR Part 70, "Special Nuclear Material," requires certain licensees authorized to possess more than one effective kilogram of special nuclear material to establish Material Balance Areas (MBAs) or Item Control Areas (ICAs) for the physical and administrative control of nuclear materials. This section requires that: (1) each MBA be an identifiable physical area such that the quantity of nuclear material being moved into or out of the MBA is represented by a measured value; (2) the number of MBAs be sufficient to localize nuclear material losses or thefts and identify the mechanisms; (3) the custody of all nuclear material within an MBA or ICA be the responsibility of a single designated individual; and (4) ICAs be established according to the same criteria as MBAs except that control into and out of such areas would be by item identity and count for previously determined special nuclear material quantities, the validity of which must be ensured by tamper-safing unless the items are sealed sources. This guide describes bases acceptable to the NRC staff for the selection of material balance areas and item control areas. (U.S.)

  17. Using Patient Health Questionnaire-9 item parameters of a common metric resulted in similar depression scores compared to independent item response theory model reestimation.

    Science.gov (United States)

    Liegl, Gregor; Wahl, Inka; Berghöfer, Anne; Nolte, Sandra; Pieh, Christoph; Rose, Matthias; Fischer, Felix

    2016-03-01

    To investigate the validity of a common depression metric in independent samples. We applied a common metrics approach based on item-response theory for measuring depression to four German-speaking samples that completed the Patient Health Questionnaire (PHQ-9). We compared the PHQ item parameters reported for this common metric to reestimated item parameters derived from fitting a generalized partial credit model solely to the PHQ-9 items. We calibrated the new model on the same scale as the common metric using two approaches (estimation with shifted prior and Stocking-Lord linking). By fitting a mixed-effects model and using Bland-Altman plots, we investigated the agreement between latent depression scores resulting from the different estimation models. We found different item parameters across samples and estimation methods. Although differences in latent depression scores between different estimation methods were statistically significant, these were clinically irrelevant. Our findings provide evidence that it is possible to estimate latent depression scores by using the item parameters from a common metric instead of reestimating and linking a model. The use of common metric parameters is simple, for example, using a Web application (http://www.common-metrics.org) and offers a long-term perspective to improve the comparability of patient-reported outcome measures.

  18. Sharing medicine: the candidacy of medicines and other household items for sharing, Dominican Republic.

    Directory of Open Access Journals (Sweden)

    Michael N Dohn

    People share medicines and problems can result from this behavior. Successful interventions to change sharing behavior will require understanding people's motives and purposes for sharing medicines. Better information about how medicines fit into the gifting and reciprocity system could be useful in designing interventions to modify medicine sharing behavior. However, it is uncertain how people situate medicines among other items that might be shared. This investigation is a descriptive study of how people sort medicines and other shareable items. This study in the Dominican Republic examined how a convenience sample (31 people) sorted medicines and rated their shareability in relation to other common household items. We used non-metric multidimensional scaling to produce association maps in which the distances between items offer a visual representation of the collective opinion of the participants regarding the relationships among the items. In addition, from a pile sort constrained by four categories of whether sharing or loaning the item was acceptable (on a scale from not shareable to very shareable), we assessed the degree to which the participants rated the medicines as shareable compared to other items. Participants consistently grouped medicines together in all pile sort activities; yet, medicines were mixed with other items when rated by their candidacy to be shared. Compared to the other items, participants had more variability of opinion as to whether medicines should be shared. People think of medicines as a distinct group, suggesting that interventions might be designed to apply to medicines as a group. People's differing opinions as to whether it was appropriate to share medicines imply a degree of uncertainty or ambiguity that health promotion interventions might exploit to alter attitudes and behaviors. These findings have implications for the design of health promotion interventions to impact medicine sharing behavior.

  19. Cultural adaptation and validation of Stroke Impact Scale 3.0 version in Uganda: A small-scale study.

    Science.gov (United States)

    Kamwesiga, Julius T; von Koch, Lena; Kottorp, Anders; Guidetti, Susanne

    2016-01-01

    Knowledge is scarce about the impact of stroke in Uganda, and culturally adapted, psychometrically tested patient-reported outcome measures are lacking. The Stroke Impact Scale 3.0 is recommended, but it has not been culturally adapted and validated in Uganda. To culturally adapt and determine the psychometric properties of the Stroke Impact Scale 3.0 in the Ugandan context on a small scale. The Stroke Impact Scale 3.0 was culturally adapted to form Stroke Impact Scale 3.0 Uganda (in English) by involving 25 participants in three different expert committees. Subsequently, translation of Stroke Impact Scale 3.0 Uganda from English to Luganda was done in accordance with guidelines. The first language in Uganda is English and Luganda is the main spoken language in Kampala city and its surroundings. Stroke Impact Scale 3.0 Uganda (both in English and Luganda) was then tested psychometrically by applying a Rasch model on data collected from 95 participants with stroke. Overall, 10 of 59 (17%) items in the eight domains of the Stroke Impact Scale 3.0 were culturally adapted. The majority were 6 of 10 items in the domain Activities of Daily Living, 2 of 9 items in the domain Mobility, and 2 of 5 items in the domain Hand function. Only in two domains did all items demonstrate acceptable goodness of fit to the Rasch model. There were also more than 5% person misfits in the domains Participation and Emotion, while the Communication, Mobility, and Hand function domains had the lowest proportions of person misfits. The reliability coefficient was equal to or larger than 0.90 in all domains except the Emotion domain, which was below the set criterion of 0.80 (0.75). The cultural adaptation and translation of Stroke Impact Scale 3.0 Uganda provides initial evidence of validity of the Stroke Impact Scale 3.0 when used in this context. The results provide support for several aspects of validity and precision but also point out issues for further adaptation and improvement.

  20. Item response theory analysis of the Pain Self-Efficacy Questionnaire.

    Science.gov (United States)

    Costa, Daniel S J; Asghari, Ali; Nicholas, Michael K

    2017-01-01

    The Pain Self-Efficacy Questionnaire (PSEQ) is a 10-item instrument designed to assess the extent to which a person in pain believes s/he is able to accomplish various activities despite their pain. There is strong evidence for the validity and reliability of both the full-length PSEQ and a 2-item version. The purpose of this study is to further examine the properties of the PSEQ using an item response theory (IRT) approach. We used the two-parameter graded response model to examine the category probability curves, and location and discrimination parameters of the 10 PSEQ items. In item response theory, responses to a set of items are assumed to be probabilistically determined by a latent (unobserved) variable. In the graded-response model specifically, item response threshold (the value of the latent variable for which adjacent response categories are equally likely) and discrimination parameters are estimated for each item. Participants were 1511 mixed, chronic pain patients attending for initial assessment at a tertiary pain management centre. All items except item 7 ('I can cope with my pain without medication') performed well in IRT analysis, and the category probability curves suggested that participants used the 7-point response scale consistently. Items 6 ('I can still do many of the things I enjoy doing, such as hobbies or leisure activity, despite pain'), 8 ('I can still accomplish most of my goals in life, despite the pain') and 9 ('I can live a normal lifestyle, despite the pain') captured higher levels of the latent variable with greater precision. The results from this IRT analysis add to the body of evidence based on classical test theory illustrating the strong psychometric properties of the PSEQ. Despite the relatively poor performance of Item 7, its clinical utility warrants its retention in the questionnaire. The strong psychometric properties of the PSEQ support its use as an effective tool for assessing self-efficacy in people with pain
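
    As a sketch of the model family named above, the following computes category probabilities under the graded response model in Samejima's standard formulation, where each threshold is the latent value at which the probability of responding in that category or higher equals 0.5. The function name and the parameter values in the comment are hypothetical; they are not estimates from the study.

```python
import math


def grm_category_probs(theta, discrimination, thresholds):
    """Category probabilities for one item under the graded response model.

    theta: latent trait value of the respondent.
    discrimination: item slope (a > 0).
    thresholds: increasing list of K - 1 thresholds for K ordered categories.
    """
    if list(thresholds) != sorted(thresholds):
        raise ValueError("thresholds must be in increasing order")

    # Cumulative probabilities P(X >= k), bracketed by 1 (lowest category
    # or above) and 0 (above the highest category).
    cumulative = [1.0]
    for b in thresholds:
        cumulative.append(1.0 / (1.0 + math.exp(-discrimination * (theta - b))))
    cumulative.append(0.0)

    # Each category probability is the difference of adjacent cumulatives.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(cumulative) - 1)]


# Hypothetical item with a 7-point response scale (six thresholds), e.g.:
# grm_category_probs(0.0, 1.5, [-2.0, -1.0, -0.5, 0.0, 0.5, 1.0])
```

    Plotting these probabilities against `theta` for each category gives the category probability curves mentioned in the abstract; a larger `discrimination` makes the curves steeper, which corresponds to greater measurement precision near the thresholds.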