rasch psychometric approach: Topics by WorldWideScience.org

Sample records for rasch psychometric approach

Rasch analysis suggested three unidimensional domains for Affiliate Stigma Scale: additional psychometric evaluation.

Science.gov (United States)

Chang, Chih-Cheng; Su, Jian-An; Tsai, Ching-Shu; Yen, Cheng-Fang; Liu, Jiun-Horng; Lin, Chung-Ying

2015-06-01

To examine the psychometrics of the Affiliate Stigma Scale using rigorous psychometric analysis: classical test theory (CTT) (traditional) and Rasch analysis (modern). Differential item functioning (DIF) items were also tested using Rasch analysis. Caregivers of relatives with mental illness (n = 453; mean age: 53.29 ± 13.50 years) were recruited from southern Taiwan. Each participant filled out four questionnaires: Affiliate Stigma Scale, Rosenberg Self-Esteem Scale, Beck Anxiety Inventory, and one background information sheet. CTT analyses showed that the Affiliate Stigma Scale had satisfactory internal consistency (α = 0.85-0.94) and concurrent validity (Rosenberg Self-Esteem Scale: r = -0.52 to -0.46; Beck Anxiety Inventory: r = 0.27-0.34). Rasch analyses supported the unidimensionality of three domains in the Affiliate Stigma Scale and indicated four DIF items (affect domain: 1; cognitive domain: 3) across gender. Our findings, based on rigorous statistical analysis, verified the psychometrics of the Affiliate Stigma Scale and reported its DIF items. We conclude that the three domains of the Affiliate Stigma Scale can be separately used and are suitable for measuring the affiliate stigma of caregivers of relatives with mental illness. Copyright © 2015 Elsevier Inc. All rights reserved.
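The internal consistency figures quoted above (α) are Cronbach's alpha values. As a minimal sketch of how such a coefficient is computed from raw item scores, assuming complete data and using invented toy responses (the function name `cronbach_alpha` and the data are illustrative, not taken from the study):

```python
def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    n_items = len(scores[0])

    def variance(values):
        # Sample variance (n - 1 in the denominator).
        mean = sum(values) / len(values)
        return sum((v - mean) ** 2 for v in values) / (len(values) - 1)

    # Variance of each item across respondents.
    item_variances = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Variance of the summed scale score across respondents.
    total_variance = variance([sum(row) for row in scores])
    return (n_items / (n_items - 1)) * (1 - sum(item_variances) / total_variance)


# Invented toy data: 4 respondents x 3 items (hypothetical, for illustration only).
toy_scores = [[1, 2, 2], [2, 3, 3], [3, 3, 4], [4, 5, 5]]
alpha_toy = cronbach_alpha(toy_scores)
print(round(alpha_toy, 3))
```

Alpha rises when items covary strongly relative to their individual variances, which is why it is reported per domain rather than for unrelated items pooled together.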
Higher Education End-of-Course Evaluations: Assessing the Psychometric Properties Utilizing Exploratory Factor Analysis and Rasch Modeling Approaches

Directory of Open Access Journals (Sweden)

Kelly D. Bradley

2016-07-01

This paper offers a critical assessment of the psychometric properties of a standard higher education end-of-course evaluation. Using both exploratory factor analysis (EFA) and Rasch modeling, the authors investigate (a) an overall assessment of dimensionality using EFA, (b) a secondary assessment of dimensionality using a principal components analysis (PCA) of the residuals when the items are fit to the Rasch model, and (c) an assessment of item-level properties using item-level statistics provided when the items are fit to the Rasch model. The results support the usage of the scale as a supplement to high-stakes decision making such as tenure. However, the lack of precise targeting of item difficulty to person ability combined with the low person separation index renders rank-ordering professors according to minuscule differences in overall subscale scores a highly questionable practice.
Modern psychometrics for assessing achievement goal orientation: a Rasch analysis.

Science.gov (United States)

Muis, Krista R; Winne, Philip H; Edwards, Ordene V

2009-09-01

A program of research is needed that assesses the psychometric properties of instruments designed to quantify students' achievement goal orientations to clarify inconsistencies across previous studies and to provide a stronger basis for future research. We conducted traditional psychometric and modern Rasch-model analyses of the Achievement Goals Questionnaire (AGQ, Elliot & McGregor, 2001) and the Patterns of Adaptive Learning Scale (PALS, Midgley et al., 2000) to provide an in-depth analysis of the two most popular instruments in educational psychology. For Study 1, 217 undergraduate students enrolled in educational psychology courses participated. Thirty-four were male and 181 were female (two did not respond). Participants completed the AGQ in the context of their educational psychology class. For Study 2, 126 undergraduate students enrolled in educational psychology courses participated. Thirty were male and 95 were female (one did not respond). Participants completed the PALS in the context of their educational psychology class. Traditional psychometric assessments of the AGQ and PALS replicated previous studies. For both, reliability estimates ranged from good to very good for raw subscale scores and fit for the models of goal orientations were good. Based on traditional psychometrics, the AGQ and PALS are valid and reliable indicators of achievement goals. Rasch analyses revealed that estimates of reliability for items were very good but respondent ability estimates varied from poor to good for both the AGQ and PALS. These findings indicate that items validly and reliably reflect a group's aggregate goal orientation, but using either instrument to characterize an individual's goal orientation is hazardous.
A psychometric revision of the European American Values Scale for Asian Americans using the Rasch model

OpenAIRE

Hong, S; Kim, Bryan S.K.; Wolfe, M M

2005-01-01

The 18-item European American Values Scale for Asian Americans (M. M. Wolfe, P. H. Yang, E. C. Wong, & D. R. Atkinson, 2001) was revised on the basis of results from a psychometric analysis using the Rasch Model (G. Rasch, 1960). The results led to the establishment of the 25-item European American Values Scale for Asian Americans-Revised.
Development and validation of the Brazilian version of the Attitudes to Aging Questionnaire (AAQ): An example of merging classical psychometric theory and the Rasch measurement model

Directory of Open Access Journals (Sweden)

Trentini Clarissa M

2008-01-01

Background: Aging has determined a demographic shift in the world, which is considered a major societal achievement, and a challenge. Aging is primarily a subjective experience, shaped by factors such as gender and culture. There is a lack of instruments to assess attitudes to aging adequately. In addition, there is no instrument developed or validated in developing region contexts, so that the particularities of aging in these areas are not included in the measures available. This paper aims to develop and validate a reliable attitude to aging instrument by combining the classical psychometric approach and Rasch analysis. Methods: The pilot study and field trial are described in detail. Statistical analysis included classic psychometric theory (EFA and CFA) and the Rasch measurement model. The latter was applied to examine unidimensionality, response scale and item fit. Results: The sample was composed of 424 Brazilian older adults, which was compared to an international sample (n = 5238). The final instrument shows excellent psychometric performance (discriminant validity, confirmatory factor analysis and Rasch fit statistics). Rasch analysis indicated that modifications in the response scale and item deletions improved the initial solution derived from the classic approach. Conclusion: The combination of classic and modern psychometric theories in a complementary way is fruitful for development and validation of instruments. The construction of a reliable Brazilian Attitudes to Aging Questionnaire is important for assessing cultural specificities of aging in a transcultural perspective and can be applied in international cross-cultural investigations running less risk of cultural bias.
Psychometric validation of the Persian Bergen Social Media Addiction Scale using classic test theory and Rasch models.

Science.gov (United States)

Lin, Chung-Ying; Broström, Anders; Nilsen, Per; Griffiths, Mark D; Pakpour, Amir H

2017-12-01

Background and aims: The Bergen Social Media Addiction Scale (BSMAS) is a six-item self-report scale that is a brief and effective psychometric instrument for assessing at-risk social media addiction on the Internet. However, its psychometric properties in Persian have never been examined and no studies have applied Rasch analysis for the psychometric testing. This study aimed to verify the construct validity of the Persian BSMAS using confirmatory factor analysis (CFA) and Rasch models among 2,676 Iranian adolescents. Methods: In addition to construct validity, measurement invariance in CFA and differential item functioning (DIF) in Rasch analysis across gender were tested for in the Persian BSMAS. Results: Both CFA [comparative fit index (CFI) = 0.993; Tucker-Lewis index (TLI) = 0.989; root mean square error of approximation (RMSEA) = 0.057; standardized root mean square residual (SRMR) = 0.039] and Rasch (infit MnSq = 0.88-1.28; outfit MnSq = 0.86-1.22) confirmed the unidimensionality of the BSMAS. Moreover, measurement invariance was supported in multigroup CFA including metric invariance (ΔCFI = -0.001; ΔSRMR = 0.003; ΔRMSEA = -0.005) and scalar invariance (ΔCFI = -0.002; ΔSRMR = 0.005; ΔRMSEA = 0.001) across gender. No item displayed DIF (DIF contrast = -0.48 to 0.24) in Rasch across gender. Conclusions: Given that the Persian BSMAS was unidimensional, it is concluded that the instrument can be used to assess how an adolescent is addicted to social media on the Internet. Moreover, users of the instrument may comfortably compare the sum scores of the BSMAS across gender.
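The infit and outfit mean-square (MnSq) statistics reported above summarize how closely observed responses match Rasch model expectations, with values near 1 indicating good fit. The following is a minimal sketch for a single dichotomous item, assuming person abilities and the item difficulty are already estimated. The BSMAS itself uses polytomous items, so this is a simplified illustration rather than the study's procedure, and the names `rasch_prob` and `item_fit` are hypothetical.

```python
import math


def rasch_prob(theta, b):
    """Probability of endorsing a dichotomous item under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_fit(responses, thetas, b):
    """Return (infit, outfit) mean squares for one dichotomous item.

    responses: 0/1 response per person; thetas: person abilities in logits;
    b: item difficulty in logits.
    """
    squared_residuals = []
    variances = []
    for x, theta in zip(responses, thetas):
        p = rasch_prob(theta, b)
        squared_residuals.append((x - p) ** 2)
        variances.append(p * (1.0 - p))
    # Outfit: unweighted mean of squared standardized residuals.
    outfit = sum(r / v for r, v in zip(squared_residuals, variances)) / len(responses)
    # Infit: information-weighted version, less sensitive to off-target persons.
    infit = sum(squared_residuals) / sum(variances)
    return infit, outfit


# Invented example: four persons located exactly at the item difficulty.
infit_demo, outfit_demo = item_fit([1, 0, 1, 0], [0.0, 0.0, 0.0, 0.0], 0.0)
print(infit_demo, outfit_demo)
```

Because outfit weights every person equally, a few unexpected responses from persons far from the item's difficulty inflate it more than infit, which is one reason both are usually reported together.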
Using the Rasch Measurement Model in Psychometric Analysis of the Family Effectiveness Measure

Science.gov (United States)

McCreary, Linda L.; Conrad, Karen M.; Conrad, Kendon J.; Scott, Christy K; Funk, Rodney R.; Dennis, Michael L.

2013-01-01

Background: Valid assessment of family functioning can play a vital role in optimizing client outcomes. Because family functioning is influenced by family structure, socioeconomic context, and culture, existing measures of family functioning--primarily developed with nuclear, middle class European American families--may not be valid assessments of families in diverse populations. The Family Effectiveness Measure was developed to address this limitation. Objectives: To test the Family Effectiveness Measure with data from a primarily low-income African American convenience sample, using the Rasch measurement model. Method: A sample of 607 adult women completed the measure. Rasch analysis was used to assess unidimensionality, response category functioning, item fit, person reliability, differential item functioning by race and parental status, and item hierarchy. Criterion-related validity was tested using correlations with five other variables related to family functioning. Results: The Family Effectiveness Measure measures two separate constructs: The effective family functioning construct was a psychometrically sound measure of the target construct that was more efficient due to the deletion of 22 items. The ineffective family functioning construct consisted of 16 of those deleted items but was not as strong psychometrically. Items in both constructs evidenced no differential item functioning by race. Criterion-related validity was supported for both. Discussion: In contrast to the prevailing conceptualization that family functioning is a single construct, assessed by positively and negatively worded items, use of the Rasch analysis suggested the existence of two constructs. While the effective family functioning is a strong and efficient measure of family functioning, the ineffective family functioning will require additional item development and psychometric testing. PMID: 23636342
Loglinear Rasch model tests

NARCIS (Netherlands)

Kelderman, Hendrikus

1984-01-01

Existing statistical tests for the fit of the Rasch model have been criticized, because they are only sensitive to specific violations of its assumptions. Contingency table methods using loglinear models have been used to test various psychometric models. In this paper, the assumptions of the Rasch
Educational Leadership Effectiveness: A Rasch Analysis

Science.gov (United States)

Sinnema, Claire; Ludlow, Larry; Robinson, Viviane

2016-01-01

Purpose: The purposes of this paper are, first, to establish the psychometric properties of the ELP tool, and, second, to test, using a Rasch item response theory analysis, the hypothesized progression of challenge presented by the items included in the tool. Design/Methodology/Approach: Data were collected at two time points through a survey of…
A psychometric evaluation of the Swedish version of the Research Utilization Questionnaire using a Rasch measurement model.

Science.gov (United States)

Lundberg, Veronica; Boström, Anne-Marie; Malinowsky, Camilla

2017-07-30

Evidence-based practice and research utilisation have become commonly used concepts in health care. The Research Utilization Questionnaire (RUQ) has been recognised as a widely used instrument measuring the perception of research utilisation among nursing staff in clinical practice. However, few studies have analysed the psychometric properties of the RUQ. The aim of this study was to examine the psychometric properties of the Swedish version of the three subscales in RUQ using a Rasch measurement model. This study has a cross-sectional design using a sample of 163 staff (response rate 81%) working in one nursing home in Sweden. Data were collected using the Swedish version of RUQ in 2012. The three subscales (Attitudes towards research, Availability of and support for research use, and Use of research findings in clinical practice) were investigated. Data were analysed using a Rasch measurement model. The results indicate presence of multidimensionality in all subscales. Moreover, internal scale validity and person response validity also provide some less satisfactory results, especially for the subscale Use of research findings. Overall, there seems to be a problem with the negatively worded statements. The findings suggest that clarification and refining of items, including additional psychometric evaluation of the RUQ, are needed before using the instrument in clinical practice and research studies among staff in nursing homes. © 2017 Nordic College of Caring Science.
Modern psychometric approaches to analysis of scales for health-related quality of life

DEFF Research Database (Denmark)

Bjorner, Jakob Bue; Bech, Per

2016-01-01

In recent years, much effort has been invested in the development of new instruments for assessment of health-related quality of life (HRQOL). For many new instruments, modern psychometric methods, such as item response theory (IRT) models, have been used, either as supplemental to classical....... The models include Rasch models (Rasch 1980; Fischer and Molenaar 1995), other IRT models (Samejima 1969; van der Linden and Hambleton 1997), and factor analytic models for categorical data (Muthén 1984). “Modern” psychometric methods have actually a rather long history within psychiatric research (both...
Rasch analysis on OSCE data: An illustrative example.

Science.gov (United States)

Tor, E; Steketee, C

2011-01-01

The Objective Structured Clinical Examination (OSCE) is a widely used tool for the assessment of clinical competence in health professional education. The goal of the OSCE is to make reproducible decisions on pass/fail status as well as students' levels of clinical competence according to their demonstrated abilities based on the scores. This paper explores the use of the polytomous Rasch model in evaluating the psychometric properties of OSCE scores through a case study. The authors analysed an OSCE data set (comprised of 11 stations) for 80 fourth year medical students based on the polytomous Rasch model in an effort to answer two research questions: (1) Do the clinical tasks assessed in the 11 OSCE stations map on to a common underlying construct, namely clinical competence? (2) What other insights can Rasch analysis offer in terms of scaling, item analysis and instrument validation over and above the conventional analysis based on classical test theory? The OSCE data set has demonstrated a sufficient degree of fit to the Rasch model (χ² = 17.060, df = 22, p = 0.76), indicating that the 11 OSCE station scores have sufficient psychometric properties to form a measure for a common underlying construct, i.e. clinical competence. Individual OSCE station scores with good fit to the Rasch model (p > 0.1 for all χ² statistics) further corroborated the characteristic of unidimensionality of the OSCE scale for clinical competence. A Person Separation Index (PSI) of 0.704 indicates sufficient level of reliability for the OSCE scores. Other useful findings from the Rasch analysis that provide insights, over and above the analysis based on classical test theory, are also exemplified using the data set. The polytomous Rasch model provides a useful and supplementary approach to the calibration and analysis of OSCE examination data.
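The Person Separation Index mentioned above is a reliability-type coefficient: the share of variance in person estimates that is not attributable to measurement error. The abstract does not state how the PSI was computed, so the following is a sketch of one common formulation, assuming person estimates in logits and their standard errors are available (the function name and the numbers are invented for illustration):

```python
def person_separation_index(estimates, standard_errors):
    """One common PSI formulation: (observed variance - error variance) / observed variance."""
    n = len(estimates)
    mean = sum(estimates) / n
    # Sample variance of the person location estimates.
    observed_variance = sum((e - mean) ** 2 for e in estimates) / (n - 1)
    # Mean squared standard error as the error variance.
    error_variance = sum(se ** 2 for se in standard_errors) / n
    return (observed_variance - error_variance) / observed_variance


# Invented example: three persons with equal standard errors of 0.5 logits.
psi_demo = person_separation_index([-1.0, 0.0, 1.0], [0.5, 0.5, 0.5])
print(psi_demo)
```

Under this formulation the index falls when standard errors are large relative to the spread of person estimates, which is typical of short tests or poorly targeted items.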
Rasch model analysis of the Depression, Anxiety and Stress Scales (DASS).

Science.gov (United States)

Shea, Tracey L; Tennant, Alan; Pallant, Julie F

2009-05-09

There is a growing awareness of the need for easily administered, psychometrically sound screening tools to identify individuals with elevated levels of psychological distress. Although support has been found for the psychometric properties of the Depression, Anxiety and Stress Scales (DASS) using classical test theory approaches it has not been subjected to Rasch analysis. The aim of this study was to use Rasch analysis to assess the psychometric properties of the DASS-21 scales, using two different administration modes. The DASS-21 was administered to 420 participants with half the sample responding to a web-based version and the other half completing a traditional pencil-and-paper version. Conformity of DASS-21 scales to a Rasch partial credit model was assessed using the RUMM2020 software. To achieve adequate model fit it was necessary to remove one item from each of the DASS-21 subscales. The reduced scales showed adequate internal consistency reliability, unidimensionality and freedom from differential item functioning for sex, age and mode of administration. Analysis of all DASS-21 items combined did not support its use as a measure of general psychological distress. A scale combining the anxiety and stress items showed satisfactory fit to the Rasch model after removal of three items. The results provide support for the measurement properties, internal consistency reliability, and unidimensionality of three slightly modified DASS-21 scales, across two different administration methods. The further use of Rasch analysis on the DASS-21 in larger and broader samples is recommended to confirm the findings of the current study.
Rasch model analysis of the Depression, Anxiety and Stress Scales (DASS)

Science.gov (United States)

Shea, Tracey L; Tennant, Alan; Pallant, Julie F

2009-01-01

Background: There is a growing awareness of the need for easily administered, psychometrically sound screening tools to identify individuals with elevated levels of psychological distress. Although support has been found for the psychometric properties of the Depression, Anxiety and Stress Scales (DASS) using classical test theory approaches it has not been subjected to Rasch analysis. The aim of this study was to use Rasch analysis to assess the psychometric properties of the DASS-21 scales, using two different administration modes. Methods: The DASS-21 was administered to 420 participants with half the sample responding to a web-based version and the other half completing a traditional pencil-and-paper version. Conformity of DASS-21 scales to a Rasch partial credit model was assessed using the RUMM2020 software. Results: To achieve adequate model fit it was necessary to remove one item from each of the DASS-21 subscales. The reduced scales showed adequate internal consistency reliability, unidimensionality and freedom from differential item functioning for sex, age and mode of administration. Analysis of all DASS-21 items combined did not support its use as a measure of general psychological distress. A scale combining the anxiety and stress items showed satisfactory fit to the Rasch model after removal of three items. Conclusion: The results provide support for the measurement properties, internal consistency reliability, and unidimensionality of three slightly modified DASS-21 scales, across two different administration methods. The further use of Rasch analysis on the DASS-21 in larger and broader samples is recommended to confirm the findings of the current study. PMID: 19426512
Psychometric properties of the Oswestry disability index: Rasch analysis of responses in a work-disabled population.

Science.gov (United States)

Lochhead, Lois E; MacMillan, Peter D

2013-01-01

The Oswestry disability index (ODI) is the most widely used measure of perceived disability for low back conditions. It has been adopted without adaptation in functional capacity evaluation (FCE). Rigorous testing of the ODI with modern psychometric methods, in this setting, is warranted. The aim was to determine the psychometric properties of the ODI in FCE (unidimensionality, differential item functioning, and item coverage) and to identify poorly functioning items, allowing for improvement of these items and recalibration of the scale. Rasch analysis, specifically Masters' partial credit model, was conducted on data from 133 work-disabled individuals presenting for FCE in northern British Columbia, Canada. All items had one poorly functioning option. Items were rescaled from six categories to five, improving the psychometric properties of the ODI as a unidimensional (disability due to back pain) scale. Item difficulty range is sufficient for a population with mild to severe disability. Although two of the ten ODI items functioned marginally unsatisfactorily in the unrevised state, the 5-option revised ODI appears superior. Use in clinical settings across a broad spectrum of disability levels could help establish its psychometric properties. Health professionals should be aware that the ODI may perform differently depending on client population.
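Rescaling an item from six response categories to five, as described above, amounts to merging a poorly functioning category into a neighbour and renumbering the rest. The abstract does not say which category was merged for each item, so the sketch below uses a hypothetical choice (category 3 merged downward) on the usual 0-5 item scoring; the function name is illustrative only.

```python
def collapse_category(score, poor_category):
    """Merge `poor_category` into the category just below it and close the gap.

    Scores at or above `poor_category` shift down by one, so a 0-5 item
    becomes a 0-4 item.
    """
    if poor_category < 1:
        raise ValueError("the lowest category has no category below it to merge into")
    return score - 1 if score >= poor_category else score


# Hypothetical example: category 3 is treated as the poorly functioning option.
recoded = [collapse_category(s, 3) for s in range(6)]
print(recoded)  # [0, 1, 2, 2, 3, 4]
```

In practice the category to merge would be chosen per item from the threshold ordering and category probability curves, then the model refitted to confirm that thresholds are ordered.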
The Cervical Dystonia Impact Profile (CDIP-58): Can a Rasch developed patient reported outcome measure satisfy traditional psychometric criteria?

Directory of Open Access Journals (Sweden)

Bhatia Kailash P

2008-08-01

Background: The United States Food and Drug Administration (FDA) are currently producing guidelines for the scientific adequacy of patient reported outcome measures (PROMs) in clinical trials, which will have implications for the selection of scales used in future clinical trials. In this study, we examine how the Cervical Dystonia Impact Profile (CDIP-58), a rigorous Rasch measurement developed neurologic PROM, stands up to traditional psychometric criteria for three reasons: 1) provide traditional psychometric evidence for the CDIP-58 in line with proposed FDA guidelines; 2) enable researchers and clinicians to compare it with existing dystonia PROMs; and 3) help researchers and clinicians bridge the knowledge gap between old and new methods of reliability and validity testing. Methods: We evaluated traditional psychometric properties of data quality, scaling assumptions, targeting, reliability and validity in a group of 391 people with CD. The main outcome measures used were the CDIP-58, Medical Outcome Study Short Form-36, the 28-item General Health Questionnaire, and Hospital Anxiety and Depression Scale. Results: A total of 391 people returned completed questionnaires (corrected response rate 87%). Analyses showed: 1) data quality was high (low missing data ≤ 4%, subscale scores could be computed for > 96% of the sample); 2) item groupings passed tests for scaling assumptions; 3) good targeting (except for the Sleep subscale, ceiling effect = 27%); 4) good reliability (Cronbach's alpha ≥ 0.92, test-retest intraclass correlations ≥ 0.83); and 5) validity was supported. Conclusion: This study has shown that new psychometric methods can produce a PROM that stands up to traditional criteria and supports the clinical advantages of Rasch analysis.
Rasch analysis of the Multiple Sclerosis Impact Scale (MSIS-29)

Science.gov (United States)

Ramp, Melina; Khan, Fary; Misajon, Rose Anne; Pallant, Julie F

2009-01-01

Background: Multiple Sclerosis (MS) is a degenerative neurological disease that causes impairments, including spasticity, pain, fatigue, and bladder dysfunction, which negatively impact on quality of life. The Multiple Sclerosis Impact Scale (MSIS-29) is a disease-specific health-related quality of life (HRQoL) instrument, developed using the patient's perspective on disease impact. It consists of two subscales assessing the physical (MSIS-29-PHYS) and psychological (MSIS-29-PSYCH) impact of MS. Although previous studies have found support for the psychometric properties of the MSIS-29 using traditional methods of scale evaluation, the scale has not been subjected to a detailed Rasch analysis. Therefore, the objective of this study was to use Rasch analysis to assess the internal validity of the scale, and its response format, item fit, targeting, internal consistency and dimensionality. Methods: Ninety-two persons with definite MS residing in the community were recruited from a tertiary hospital database. Patients completed the MSIS-29 as part of a larger study. Rasch analysis was undertaken to assess the psychometric properties of the MSIS-29. Results: Rasch analysis showed overall support for the psychometric properties of the two MSIS-29 subscales, however it was necessary to reduce the response format of the MSIS-29-PHYS to a 3-point response scale. Both subscales were unidimensional, had good internal consistency, and were free from item bias for sex and age. Dimensionality testing indicated it was not appropriate to combine the two subscales to form a total MSIS score. Conclusion: In this first study to use Rasch analysis to fully assess the psychometric properties of the MSIS-29 support was found for the two subscales but not for the use of the total scale. Further use of Rasch analysis on the MSIS-29 in larger and broader samples is recommended to confirm these findings. PMID: 19545445
Psychometrics evaluation of Charcot-Marie-Tooth Neuropathy Score (CMTNSv2) second version, using Rasch analysis.

Science.gov (United States)

Sadjadi, Reza; Reilly, Mary M; Shy, Michael E; Pareyson, Davide; Laura, Matilde; Murphy, Sinead; Feely, Shawna M E; Grider, Tiffany; Bacon, Chelsea; Piscosquito, Giuseppe; Calabrese, Daniela; Burns, Ted M

2014-09-01

Charcot-Marie-Tooth Neuropathy Score second version (CMTNSv2) is a validated clinical outcome measure developed for use in clinical trials to monitor disease impairment and progression in affected CMT patients. Currently, all items of CMTNSv2 have identical contribution to the total score. We used Rasch analysis to further explore psychometric properties of CMTNSv2, and in particular, category response functioning, and their weight on the overall disease progression. Weighted category responses represent a more accurate estimate of actual values measuring disease severity and therefore could potentially be used in improving the current version. © 2014 Peripheral Nerve Society.

Psychometric properties of the Zarit Caregiver Burden Interview administered to caregivers to patients with Duchenne muscular dystrophy: a Rasch analysis.

Science.gov (United States)

Landfeldt, Erik; Mayhew, Anna; Straub, Volker; Bushby, Katharine; Lochmüller, Hanns; Lindgren, Peter

2017-12-18

To explore the psychometric properties of the full 22-item English (UK and US) version of the Zarit Caregiver Burden Interview administered to caregivers to patients with Duchenne muscular dystrophy. Caregivers to patients with Duchenne muscular dystrophy from the United Kingdom and the United States, recruited through the TREAT-NMD network, completed the Zarit Caregiver Burden Interview online. The psychometric properties of the Zarit Caregiver Burden Interview were examined using Rasch analysis. A total of 475 caregivers completed the Zarit Caregiver Burden Interview. Model misfit was identified for 9 of 22 items (mean item fit residual 0.061, SD: 2.736) and 13 of 22 items displayed disordered thresholds. The overall item-trait interaction chi-square value was 499 (198 degrees of freedom, p [...] Interview fails to fully operationalize a quantitative conceptualization of caregiver burden among caregivers to patients with Duchenne muscular dystrophy from the United Kingdom and the United States. Further research is needed to understand the psychometric properties of the Zarit Caregiver Burden Interview in other populations and settings. Implications for Rehabilitation: Duchenne muscular dystrophy is a terminal disease characterized by progressive muscle degeneration resulting in substantial disability and a significant burden on family caregivers. The Zarit Caregiver Burden Interview is one of the most widely applied measures of caregiver burden. Our Rasch analysis suggests that the Zarit Caregiver Burden Interview is not fit for purpose to measure burden in UK and US caregivers to patients with Duchenne muscular dystrophy. Clinicians and decision-makers should interpret Zarit Caregiver Burden Interview data from these populations with caution.
Rasch Analysis of the Locus-of-Hope Scale. Brief Report

Science.gov (United States)

Gadiana, Leny G.; David, Adonis P.

2015-01-01

The Locus-of-Hope Scale (LHS) was developed as a measure of the locus-of-hope dimensions (Bernardo, 2010). The present study adds to the emerging literature on locus-of-hope by assessing the psychometric properties of the LHS using Rasch analysis. The results from the Rasch analyses of the four subscales of LHS provided evidence on the…
Measuring Mindfulness: A Rasch Analysis of the Freiburg Mindfulness Inventory

Directory of Open Access Journals (Sweden)

Siobhan Lynch

2011-12-01

Full Text Available The objective of the study was to assess the psychometric properties of the Freiburg Mindfulness Inventory (FMI-14) using a Rasch model approach in a cross-sectional design. The scale was administered to N = 130 British patients with different psychosomatic conditions. The scale failed to show clear one-factoriality and item 13 did not fit the Rasch model. A two-factorial solution without item 13, however, appeared to fit well. The scale seemed to work equally well in different subgroups such as patients with or without mindfulness practice. However, some limitations of the validity of both the one-factorial and the two-factorial version of the scale were observed. Sizeable floor and ceiling effects limit the diagnostic use of the instrument. In summary, the study demonstrates that the two-factorial version of the FMI-13 shows acceptable approximation to Rasch requirements, but is in need of further improvement. The one-factorial solution did not fit well, and cannot be recommended for further use.
Enhancing measurement in science education research through Rasch analysis: Rationale and properties

Directory of Open Access Journals (Sweden)

Jørgen Sjaastad

2014-10-01

Full Text Available This article presents the basic rationale of Rasch theory and seven core properties of Rasch modeling: analyses of test targeting, person separation, person fit, item fit, differential item functioning, functioning of response categories and tests of unidimensionality. Illustrative examples are provided consecutively, drawing on Rasch analysis of data from a survey where students in the 9th grade responded to questions regarding their mathematics competence. The relationship between Rasch theory and classical test theory is commented on. Rasch theory provides science and mathematics education researchers with valuable tools to evaluate the psychometric quality of tests and questionnaires and support the development of these.
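For readers unfamiliar with the model behind these analyses, the following is a minimal sketch of the dichotomous Rasch model, in which the log-odds of a positive response equal the person location minus the item location. The function name and the example values are illustrative assumptions and are not taken from the article.

```python
import math


def rasch_probability(theta: float, b: float) -> float:
    """Probability of a positive response under the dichotomous Rasch model.

    theta: person location (ability), in logits
    b:     item location (difficulty), in logits
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))


# When person and item locations coincide, the probability is one half.
p_equal = rasch_probability(0.0, 0.0)

# Hypothetical person one logit above the item: the log-odds recover theta - b.
p_above = rasch_probability(1.5, 0.5)
log_odds = math.log(p_above / (1.0 - p_above))

print(p_equal)
print(log_odds)
```

Because only the difference theta - b enters the formula, persons and items are placed on one common logit scale, which is what makes analyses such as test targeting possible.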
Scale construction utilising the Rasch unidimensional measurement model: A measurement of adolescent attitudes towards abortion.

Science.gov (United States)

Hendriks, Jacqueline; Fyfe, Sue; Styles, Irene; Skinner, S Rachel; Merriman, Gareth

2012-01-01

Measurement scales seeking to quantify latent traits, like attitudes, are often developed using traditional psychometric approaches. Application of the Rasch unidimensional measurement model may complement or replace these techniques, as the model can be used to construct scales and check their psychometric properties. If data fit the model, then a scale with invariant measurement properties, including interval-level scores, will have been developed. This paper highlights the unique properties of the Rasch model. Items developed to measure adolescent attitudes towards abortion are used to exemplify the process. Ten attitude and intention items relating to abortion were answered by 406 adolescents aged 12 to 19 years, as part of the "Teen Relationships Study". The sampling framework captured a range of sexual and pregnancy experiences. Items were assessed for fit to the Rasch model including checks for Differential Item Functioning (DIF) by gender, sexual experience or pregnancy experience. Rasch analysis of the original dataset initially demonstrated that some items did not fit the model. Rescoring of one item (B5) and removal of another (L31) resulted in fit, as shown by a non-significant item-trait interaction total chi-square and a mean log residual fit statistic for items of -0.05 (SD=1.43). No DIF existed for the revised scale. However, items did not distinguish as well amongst persons with the most intense attitudes as they did for other persons. A person separation index of 0.82 indicated good reliability. Application of the Rasch model produced a valid and reliable scale measuring adolescent attitudes towards abortion, with stable measurement properties. The Rasch process provided an extensive range of diagnostic information concerning item and person fit, enabling changes to be made to scale items. This example shows the value of the Rasch model in developing scales for both social science and health disciplines.
A gentle introduction to Rasch measurement models for metrologists

International Nuclear Information System (INIS)

Mari, Luca; Wilson, Mark

2013-01-01

The talk introduces the basics of Rasch models by systematically interpreting them in the conceptual and lexical framework of the International Vocabulary of Metrology, third edition (VIM3). An admittedly simple example of physical measurement highlights the analogies between physical transducers and tests, as they can be understood as measuring instruments of Rasch models and psychometrics in general. From the talk natural scientists and engineers might learn something of Rasch models, as a specifically relevant case of social measurement, and social scientists might re-interpret something of their knowledge of measurement in the light of the current physical measurement models.
Rasch analysis on OSCE data : An illustrative example

Directory of Open Access Journals (Sweden)

Tor E

2011-06-01

Full Text Available Background: The Objective Structured Clinical Examination (OSCE) is a widely used tool for the assessment of clinical competence in health professional education. The goal of the OSCE is to make reproducible decisions on pass/fail status as well as students’ levels of clinical competence according to their demonstrated abilities based on the scores. This paper explores the use of the polytomous Rasch model in evaluating the psychometric properties of OSCE scores through a case study. Method: The authors analysed an OSCE data set (comprised of 11 stations) for 80 fourth year medical students based on the polytomous Rasch model in an effort to answer two research questions: (1) Do the clinical tasks assessed in the 11 OSCE stations map on to a common underlying construct, namely clinical competence? (2) What other insights can Rasch analysis offer in terms of scaling, item analysis and instrument validation over and above the conventional analysis based on classical test theory? Results: The OSCE data set has demonstrated a sufficient degree of fit to the Rasch model (χ2 = 17.060, DF = 22, p = 0.76), indicating that the 11 OSCE station scores have sufficient psychometric properties to form a measure for a common underlying construct, i.e. clinical competence. Individual OSCE station scores with good fit to the Rasch model (p > 0.1 for all χ2 statistics) further corroborated the characteristic of unidimensionality of the OSCE scale for clinical competence. A Person Separation Index (PSI) of 0.704 indicates sufficient level of reliability for the OSCE scores. Other useful findings from the Rasch analysis that provide insights, over and above the analysis based on classical test theory, are also exemplified using the data set. Conclusion: The polytomous Rasch model provides a useful and supplementary approach to the calibration and analysis of OSCE examination data.
Examination of a Social-Networking Site Activities Scale (SNSAS) Using Rasch Analysis

Science.gov (United States)

Alhaythami, Hassan; Karpinski, Aryn; Kirschner, Paul; Bolden, Edward

2017-01-01

This study examined the psychometric properties of a social-networking site (SNS) activities scale (SNSAS) using Rasch Analysis. Items were also examined with Rasch Principal Components Analysis (PCA) and Differential Item Functioning (DIF) across groups of university students (i.e., males and females from the United States [US] and Europe; N =…
Measuring students' perceptions of plagiarism: modification and Rasch validation of a plagiarism attitude scale.

Science.gov (United States)

Howard, Steven J; Ehrich, John F; Walton, Russell

2014-01-01

Plagiarism is a significant area of concern in higher education, given university students' high self-reported rates of plagiarism. However, research remains inconsistent in prevalence estimates and suggested precursors of plagiarism. This may be a function of the unclear psychometric properties of the measurement tools adopted. To investigate this, we modified an existing plagiarism scale (to broaden its scope), established its psychometric properties using traditional (EFA, Cronbach's alpha) and modern (Rasch analysis) survey evaluation approaches, and examined results of well-functioning items. Results indicated that traditional and modern psychometric approaches differed in their recommendations. Further, responses indicated that although most respondents acknowledged the seriousness of plagiarism, these attitudes were neither unanimous nor consistent across the range of issues assessed. This study thus provides rigorous psychometric testing of a plagiarism attitude scale and baseline data from which to begin a discussion of contextual, personal, and external factors that influence students' plagiarism attitudes.
Psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale: A Rasch rating scale analysis and confirmatory factor analysis.

Science.gov (United States)

Pilatti, Angelina; Lozano, Oscar M; Cyders, Melissa A

2015-12-01

The present study was aimed at determining the psychometric properties of the Spanish version of the UPPS-P Impulsive Behavior Scale in a sample of college students. Participants were 318 college students (36.2% men; mean age = 20.9 years, SD = 6.4 years). The psychometric properties of this Spanish version were analyzed using the Rasch model, and the factor structure was examined using confirmatory factor analysis. The verification of the global fit of the data showed adequate indexes for persons and items. The reliability estimates were high for both items and persons. Differential item functioning across gender was found for 23 items, which likely reflects known differences in impulsivity levels between men and women. The factor structure of the Spanish version of the UPPS-P replicates previous work with the original UPPS-P Scale. Overall, results suggest that test scores from the Spanish version of the UPPS-P show adequate psychometric properties to accurately assess the multidimensional model of impulsivity, which represents the most exhaustive measure of this construct. (c) 2015 APA, all rights reserved.
Psychometric properties of the International Classification of Functioning, Disability and Health set for spinal cord injury nursing based on Rasch analysis.

Science.gov (United States)

Li, Kun; Yan, Tiebin; You, Liming; Xie, Sumei; Li, Yun; Tang, Jie; Wang, Yingmin; Gao, Yan

2018-02-01

To examine the psychometric properties of the International Classification of Functioning, Disability and Health (ICF) set for spinal cord injury nursing (ICF-SCIN) using Rasch analysis. A total of 140 spinal cord injury patients were recruited between December 2013 and March 2014 through convenience sampling. Nurses used the components body functions (BF), body structures (BS), and activities and participation (AP) of the ICF-SCIN to rate the patients' functioning. Rasch analysis was performed using RUMM 2030 software. In each component, categories were rescored from 01234 to 01112 because of reversed thresholds. Nine testlets were created to overcome local dependency. Four categories which fit the Rasch model poorly were deleted. After modification, the components BF, BS, and AP showed good fit to the Rasch model with a Bonferroni-adjusted significance level (χ2 = 86.29, p = 0.006; χ2 = 22.44, p = 0.130; χ2 = 39.92, p = 0.159). The person separation indices (PSIs) for the three components were 0.80, 0.54, and 0.97, respectively. No differential item functioning (DIF) was detected across age, gender, or educational level. The fit properties of the ICF set were satisfactory after modifications. The ICF-SCIN has the potential as a nursing assessment instrument for measuring the functioning of patients with spinal cord injury. Implications for rehabilitation The International Classification of Functioning, Disability and Health (ICF) set for spinal cord injury nursing contains a group of categories which can reflect the functioning of spinal cord injury patients from the perspective of nurses. The components body functions (BF), body structures (BS), and activities and participation (AP) of the ICF set for spinal cord injury achieved the fit to the Rasch model through rescoring, generating testlets, and deleting categories with poor fit. The ICF set for spinal cord injury nursing (ICF-SCIN) has the potential to be used as a
Measuring situational avoidance in older drivers: An application of Rasch analysis.

Science.gov (United States)

Davis, Jessica; Conlon, Elizabeth; Ownsworth, Tamara; Morrissey, Shirley

2016-02-01

Situational avoidance is a form of driving self-regulation at the strategic level of driving behaviour. It has typically been defined as the purposeful avoidance of driving situations perceived as challenging or potentially hazardous. To date, assessment of the psychometric properties of existing scales that measure situational avoidance has been sparse. This study examined the contribution of Rasch analysis to the situational avoidance construct. Three hundred and ninety-nine Australian drivers (M=66.75, SD=10.14, range: 48-91 years) completed the Situational Avoidance Questionnaire (SAQ). Following removal of the item Parallel Parking, the scale conformed to a Rasch model, showing good person separation, sufficient reliability, little disordering of thresholds, and no evidence of differential item functioning by age or gender. The residuals were independent supporting the assumption of unidimensionality and in conforming to a Rasch model, SAQ items were found to be hierarchical or cumulative. Increased avoidance was associated with factors known to be related to driving self-regulation more broadly, including older age, female gender, reduced driving space and frequency, reporting a change in driving in the past five years and poorer indices of health (i.e., self-rated mood, vision and cognitive function). Overall, these results support the use of the SAQ as a psychometrically sound measure of situational avoidance. Application of Rasch analysis to this area of research advances understanding of the driving self-regulation construct and its practice by drivers in baby boomer and older adult generations. Copyright © 2015 Elsevier Ltd. All rights reserved.
Developing a Measure of Therapist Adherence to Contingency Management: An Application of the Many-Facet Rasch Model

Science.gov (United States)

Chapman, Jason E.; Sheidow, Ashli J.; Henggeler, Scott W.; Halliday-Boykins, Colleen A.; Cunningham, Phillippe B.

2008-01-01

A unique application of the Many-Facet Rasch Model (MFRM) is introduced as the preferred method for evaluating the psychometric properties of a measure of therapist adherence to Contingency Management (CM) treatment of adolescent substance use. The utility of psychometric methods based in Classical Test Theory was limited by complexities of the…
A Rasch Analysis of the Junior Metacognitive Awareness Inventory with Singapore Students

Science.gov (United States)

Ning, Hoi Kwan

2018-01-01

The psychometric properties of the 2 versions of the Junior Metacognitive Awareness Inventory were examined with Singapore student samples. Other than 2 misfitting items and an underutilized response scale, Rasch analysis demonstrated that the instruments have good measurement precision, and no differential item functioning was detected across…
With hiccups and bumps: the development of a Rasch-based instrument to measure elementary students' understanding of the nature of science.

Science.gov (United States)

Peoples, Shelagh M; O'Dwyer, Laura M; Shields, Katherine A; Wang, Yang

2013-01-01

This research describes the development process, psychometric analyses and part validation study of a theoretically-grounded Rasch-based instrument, the Nature of Science Instrument-Elementary (NOSI-E). The NOSI-E was designed to measure elementary students' understanding of the Nature of Science (NOS). Evidence is provided for three of the six validity aspects (content, substantive and generalizability) needed to support the construct validity of the NOSI-E. A future article will examine the structural and external validity aspects. Rasch modeling proved especially productive in scale improvement efforts. The instrument, designed for large-scale assessment use, is conceptualized using five construct domains. Data from 741 elementary students were used to pilot the Rasch scale, with continuous improvements made over three successive administrations. The psychometric properties of the NOSI-E instrument are consistent with the basic assumptions of Rasch measurement, namely that the items are well-fitting and invariant. Items from each of the five domains (Empirical, Theory-Laden, Certainty, Inventive, and Socially and Culturally Embedded) are spread along the scale's continuum and appear to overlap well. Most importantly, the scale seems appropriately calibrated and responsive for elementary school-aged children, the target age group. As a result, the NOSI-E should prove beneficial for science education research. As the United States' science education reform efforts move toward students' learning science through engaging in authentic scientific practices (NRC, 2011), it will be important to assess whether this new approach to teaching science is effective. The NOSI-E can be used as one measure of whether this reform effort has an impact.
Comparison of CTT and Rasch-based approaches for the analysis of longitudinal Patient Reported Outcomes.

Science.gov (United States)

Blanchin, Myriam; Hardouin, Jean-Benoit; Le Neel, Tanguy; Kubis, Gildas; Blanchard, Claire; Mirallié, Eric; Sébille, Véronique

2011-04-15

Health sciences frequently deal with Patient Reported Outcomes (PRO) data for the evaluation of concepts, in particular health-related quality of life, which cannot be directly measured and are often called latent variables. Two approaches are commonly used for the analysis of such data: Classical Test Theory (CTT) and Item Response Theory (IRT). Longitudinal data are often collected to analyze the evolution of an outcome over time. The most adequate strategy to analyze longitudinal latent variables, which can be either based on CTT or IRT models, remains to be identified. This strategy must take into account the latent characteristic of what PROs are intended to measure as well as the specificity of longitudinal designs. A simple and widely used IRT model is the Rasch model. The purpose of our study was to compare CTT and Rasch-based approaches to analyze longitudinal PRO data regarding type I error, power, and time effect estimation bias. Four methods were compared: the Score and Mixed models (SM) method based on the CTT approach, the Rasch and Mixed models (RM), the Plausible Values (PV), and the Longitudinal Rasch model (LRM) methods all based on the Rasch model. All methods have shown comparable results in terms of type I error, all close to 5 per cent. LRM and SM methods presented comparable power and unbiased time effect estimations, whereas RM and PV methods showed low power and biased time effect estimations. This suggests that RM and PV methods should be avoided to analyze longitudinal latent variables. Copyright © 2010 John Wiley & Sons, Ltd.
Measuring Math Anxiety (in Spanish) with the Rasch Rating Scale Model.

Science.gov (United States)

Prieto, Gerardo; Delgado, Ana R

2007-01-01

Two successive studies probed the psychometric properties of a Math Anxiety questionnaire (in Spanish) by means of the Rasch Rating Scale Model. Participants were 411 and 216 Spanish adolescents. Convergent validity was examined by correlating the scale with both the Fennema and Sherman Attitude Scale and a math achievement test. The results show that the scores are psychometrically appropriate, and replicate those reported in meta-analyses: medium-sized negative correlations with achievement and with attitudes toward mathematics, as well as moderate sex-related differences (with girls presenting higher anxiety levels than boys).
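As background to the Rating Scale Model named in this record, here is a minimal sketch of how Andrich's rating scale model assigns probabilities to the ordered response categories of a single item. The function name, the threshold values, and the person and item locations are hypothetical and are not taken from the study.

```python
import math


def rsm_category_probs(theta, delta, taus):
    """Category probabilities for one item under the Andrich rating scale model.

    theta: person location, in logits
    delta: item location, in logits
    taus:  thresholds tau_1..tau_m, shared by all items of the scale
    Returns a list of m + 1 probabilities for categories 0..m.
    """
    # Category k has numerator exp(sum_{j<=k} (theta - delta - tau_j));
    # category 0 corresponds to the empty sum, i.e. exp(0).
    logits = [0.0]
    for tau in taus:
        logits.append(logits[-1] + (theta - delta - tau))
    peak = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(v - peak) for v in logits]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical four-category item with symmetric thresholds.
probs = rsm_category_probs(theta=0.0, delta=0.0, taus=[-1.0, 0.0, 1.0])
print(probs)
```

The log-odds of responding in category k rather than k - 1 equal theta - delta - tau_k, so the thresholds are shared across items and only the item location shifts from item to item.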
Improving the psychometric properties of the Mooney problem ...

African Journals Online (AJOL)

This study aims to examine the psychometric characteristics of Mooney Problem Checklist (MPCL) items using the Rasch measurement model framework in the context of polytechnics. The MPCL with eleven dimensions was administered to 252 respondents who were selected from seven polytechnic institutions in Malaysia ...
Analysis of the Professional Choice Self-Efficacy Scale Using the Rasch-Andrich Rating Scale Model

Science.gov (United States)

Ambiel, Rodolfo A. M.; Noronha, Ana Paula Porto; de Francisco Carvalho, Lucas

2015-01-01

The aim of this research was to analyze the psychometric properties of the professional choice self-efficacy scale (PCSES), using the Rasch-Andrich rating scale model. The PCSES assesses four factors: self-appraisal, gathering occupational information, practical professional information search and future planning. Participants were 883 Brazilian…
Classical test theory and Rasch analysis validation of the Upper Limb Functional Index in subjects with upper limb musculoskeletal disorders.

Science.gov (United States)

Bravini, Elisabetta; Franchignoni, Franco; Giordano, Andrea; Sartorio, Francesco; Ferriero, Giorgio; Vercelli, Stefano; Foti, Calogero

2015-01-01

To perform a comprehensive analysis of the psychometric properties and dimensionality of the Upper Limb Functional Index (ULFI) using both classical test theory and Rasch analysis (RA). Prospective, single-group observational design. Freestanding rehabilitation center. Convenience sample of Italian-speaking subjects with upper limb musculoskeletal disorders (N=174). Not applicable. The Italian version of the ULFI. Data were analyzed using parallel analysis, exploratory factor analysis, and RA for evaluating dimensionality, functioning of rating scale categories, item fit, hierarchy of item difficulties, and reliability indices. Parallel analysis revealed 2 factors explaining 32.5% and 10.7% of the response variance. RA confirmed the failure of the unidimensionality assumption, and 6 items out of the 25 misfitted the Rasch model. When the analysis was rerun excluding the misfitting items, the scale showed acceptable fit values, loading meaningfully to a single factor. Item separation reliability and person separation reliability were .98 and .89, respectively. Cronbach alpha was .92. RA revealed weakness of the scale concerning dimensionality and internal construct validity. However, a set of 19 ULFI items defined through the statistical process demonstrated a unidimensional structure, good psychometric properties, and clinical meaningfulness. These findings represent a useful starting point for further analyses of the tool (based on modern psychometric approaches and confirmatory factor analysis) in larger samples, including different patient populations and nationalities. Copyright © 2015 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.

Rasch Analysis of the Fullerton Advanced Balance (FAB) Scale

Science.gov (United States)

Fiedler, Roger C.; Rose, Debra J.

2011-01-01

ABSTRACT Purpose: This cross-sectional study explores the psychometric properties and dimensionality of the Fullerton Advanced Balance (FAB) Scale, a multi-item balance test for higher-functioning older adults. Methods: Participants (n=480) were community-dwelling adults able to ambulate independently. Data gathering consisted of survey and balance performance assessment. Psychometric properties were assessed using Rasch analysis. Results: Mean age of participants was 76.4 (SD=7.1) years. Mean FAB Scale scores were 24.7/40 (SD=7.5). Analyses for scale dimensionality showed that 9 of the 10 items fit a unidimensional measure of balance. Item 10 (Reactive Postural Control) did not fit the model. The reliability of the scale to separate persons was 0.81 out of 1.00; the reliability of the scale to separate items in terms of their difficulty was 0.99 out of 1.00. Cronbach's alpha for a 10-item model was 0.805. Items of differing difficulties formed a useful ordinal hierarchy for scaling patterns of expected balance ability scoring for a normative population. Conclusion: The FAB Scale appears to be a reliable and valid tool to assess balance function in higher-functioning older adults. The test was found to discriminate among participants of varying balance abilities. Further exploration of concurrent validity of Rasch-generated expected item scoring patterns should be undertaken to determine the test's diagnostic and prescriptive utility. PMID:22210989
Rasch Analysis of the Fullerton Advanced Balance (FAB) Scale.

Science.gov (United States)

Klein, Penelope J; Fiedler, Roger C; Rose, Debra J

2011-01-01

This cross-sectional study explores the psychometric properties and dimensionality of the Fullerton Advanced Balance (FAB) Scale, a multi-item balance test for higher-functioning older adults. Participants (n=480) were community-dwelling adults able to ambulate independently. Data gathering consisted of survey and balance performance assessment. Psychometric properties were assessed using Rasch analysis. Mean age of participants was 76.4 (SD=7.1) years. Mean FAB Scale scores were 24.7/40 (SD=7.5). Analyses for scale dimensionality showed that 9 of the 10 items fit a unidimensional measure of balance. Item 10 (Reactive Postural Control) did not fit the model. The reliability of the scale to separate persons was 0.81 out of 1.00; the reliability of the scale to separate items in terms of their difficulty was 0.99 out of 1.00. Cronbach's alpha for a 10-item model was 0.805. Items of differing difficulties formed a useful ordinal hierarchy for scaling patterns of expected balance ability scoring for a normative population. The FAB Scale appears to be a reliable and valid tool to assess balance function in higher-functioning older adults. The test was found to discriminate among participants of varying balance abilities. Further exploration of concurrent validity of Rasch-generated expected item scoring patterns should be undertaken to determine the test's diagnostic and prescriptive utility.
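Several records in this listing report Cronbach's alpha alongside Rasch-based reliability. The following is a minimal sketch of the standard alpha formula. The function name and the toy response matrix are hypothetical and are not data from any of the studies.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of respondents, each a list of item scores of equal length.
    """
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the total (sum) score.
    total_var = variance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)


# Hypothetical responses of four people to three items.
toy_scores = [[2, 3, 3], [4, 4, 5], [1, 2, 2], [3, 3, 4]]
print(cronbach_alpha(toy_scores))
```

Alpha is computed from raw scores, whereas the person separation reliability reported in the Rasch analyses is computed from estimated person locations and their standard errors, so the two coefficients need not coincide.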
Rasch-family models are more valuable than score-based approaches for analysing longitudinal patient-reported outcomes with missing data.

Science.gov (United States)

de Bock, Élodie; Hardouin, Jean-Benoit; Blanchin, Myriam; Le Neel, Tanguy; Kubis, Gildas; Bonnaud-Antignac, Angélique; Dantan, Étienne; Sébille, Véronique

2016-10-01

The objective was to compare classical test theory and Rasch-family models derived from item response theory for the analysis of longitudinal patient-reported outcomes data with possibly informative intermittent missing items. A simulation study was performed in order to assess and compare the performance of classical test theory and Rasch model in terms of bias, control of the type I error and power of the test of time effect. The type I error was controlled for classical test theory and Rasch model whether data were complete or some items were missing. Both methods were unbiased and displayed similar power with complete data. When items were missing, Rasch model remained unbiased and displayed higher power than classical test theory. Rasch model performed better than the classical test theory approach regarding the analysis of longitudinal patient-reported outcomes with possibly informative intermittent missing items mainly for power. This study highlights the interest of Rasch-based models in clinical research and epidemiology for the analysis of incomplete patient-reported outcomes data. © The Author(s) 2013.
The development and psychometric validation of the Ethical Awareness Scale.

Science.gov (United States)

Milliken, Aimee; Ludlow, Larry; DeSanto-Madeya, Susan; Grace, Pamela

2018-04-19

To develop and psychometrically assess the Ethical Awareness Scale using Rasch measurement principles and a Rasch item response theory model. Critical care nurses must be equipped to provide good (ethical) patient care. This requires ethical awareness, which involves recognizing the ethical implications of all nursing actions. Ethical awareness is imperative in successfully addressing patient needs. Evidence suggests that the ethical import of everyday issues may often go unnoticed by nurses in practice. Assessing nurses' ethical awareness is a necessary first step in preparing nurses to identify and manage ethical issues in the highly dynamic critical care environment. A cross-sectional design was used in two phases of instrument development. Using Rasch principles, an item bank representing nursing actions was developed (33 items). Content validity testing was performed. Eighteen items were selected for face validity testing. Two rounds of operational testing were performed with critical care nurses in Boston between February-April 2017. A Rasch analysis suggests sufficient item invariance across samples and sufficient construct validity. The analysis further demonstrates a progression of items uniformly along a hierarchical continuum; items that match respondent ability levels; response categories that are sufficiently used; and adequate internal consistency. Mean ethical awareness scores were in the low/moderate range. The results suggest the Ethical Awareness Scale is a psychometrically sound, reliable and valid measure of ethical awareness in critical care nurses. © 2018 John Wiley & Sons Ltd.
The Swedish version of the Acceptance of Chronic Health Conditions Scale for people with multiple sclerosis: Translation, cultural adaptation and psychometric properties.

Science.gov (United States)

Forslin, Mia; Kottorp, Anders; Kierkegaard, Marie; Johansson, Sverker

2016-11-11

To translate and culturally adapt the Acceptance of Chronic Health Conditions (ACHC) Scale for people with multiple sclerosis into Swedish, and to analyse the psychometric properties of the Swedish version. Ten people with multiple sclerosis participated in translation and cultural adaptation of the ACHC Scale; 148 people with multiple sclerosis were included in evaluation of the psychometric properties of the scale. Translation and cultural adaptation were carried out through translation and back-translation, by expert committee evaluation and pre-test with cognitive interviews in people with multiple sclerosis. The psychometric properties of the Swedish version were evaluated using Rasch analysis. The Swedish version of the ACHC Scale was an acceptable equivalent to the original version. Seven of the original 10 items fitted the Rasch model and demonstrated ability to separate between groups. A 5-item version, including 2 items and 3 super-items, demonstrated better psychometric properties, but lower ability to separate between groups. The Swedish version of the ACHC Scale with the original 10 items did not fit the Rasch model. Two solutions, either with 7 items (ACHC-7) or with 2 items and 3 super-items (ACHC-5), demonstrated acceptable psychometric properties. Use of the ACHC-5 Scale with super-items is recommended, since this solution adjusts for local dependency among items.
A psychometric revision of the Asian values scale using the Rasch model

OpenAIRE

Kim, Bryan S. K.; Hong, Sehee

2004-01-01

The 36-item Asian Values Scale (B. S. K. Kim, D. R. Atkinson, & P. H. Yang, 1999) was revised on the basis of G. Rasch's (1960) model and data from 618 Asian Americans. The results led to the establishment of a 25-item measure named the Asian Values Scale-Revised.
Validation of the brief version of the Recovery Self-Assessment (RSA-B) using Rasch measurement theory.

Science.gov (United States)

Barbic, Skye P; Kidd, Sean A; Davidson, Larry; McKenzie, Kwame; O'Connell, Maria J

2015-12-01

In psychiatry, the recovery paradigm is increasingly identified as the overarching framework for service provision. Currently, the Recovery Self-Assessment (RSA), a 36-item rating scale, is commonly used to assess the uptake of a recovery orientation in clinical services. However, the consumer version of the RSA has been found challenging to complete because of length and the reading level required. In response to this feedback, a brief 12-item version of the RSA was developed (RSA-B). This article describes the development of the modified instrument and the application of traditional psychometric analysis and Rasch Measurement Theory to test the psychometric properties of the RSA-B. Data from a multisite study of adults with serious mental illnesses (n = 1256) who were followed by assertive community treatment teams were examined for reliability, clinical meaning, targeting, response categories, model fit, dependency, and raw interval-level measurement. Analyses were performed using the Rasch Unidimensional Measurement Model (RUMM 2030). Adequate fit to the Rasch model was observed (χ2 = 112.46, df = 90, p = .06) and internal consistency was good (r = .86). However, Rasch analysis revealed limitations of the 12-item version, with items covering only 39% of the targeted theoretical continuum, 2 misfitting items, and strong evidence for the 5 option response categories not working as intended. This study revealed areas for improvement in the shortened version of the 12-item RSA-B. A revisit of the conceptual model and original 36-item rating scale is encouraged to select items that will help practitioners and researchers measure the full range of recovery orientation. (c) 2015 APA, all rights reserved.
Unified Balance Scale: an activity-based, bed to community, and aetiology-independent measure of balance calibrated with Rasch analysis.

Science.gov (United States)

La Porta, Fabio; Franceschini, Marco; Caselli, Serena; Cavallini, Paola; Susassi, Sonia; Tennant, Alan

2011-04-01

To build a new activity-based, "bed to community", aetiology-independent measure of balance within the neurological rehabilitation setting by merging some existing scales. Balance scales were selected using a conceptual framework and subsequently administered to a convenience sample of adult patients with balance problems due to different neurological aetiologies. Data were then processed using classical psychometric analyses and Rasch analysis in order to construct a new balance measurement tool. The Berg Balance Scale, the Tinetti Scales and the Fullerton Advanced Balance Scale were selected and administered to a sample of patients, giving 302 observations. Classical psychometric analyses (item and scale analysis; confirmatory factor analysis) were undertaken on the pooled 40-item set with confirmation of unidimensionality. The subsequent Rasch analysis allowed the identification of a 27-item set satisfying the Rasch Model's requirements for fundamental measurement, with further confirmation of unidimensionality by post-hoc confirmatory factor analysis. The new scale (Unified Balance Scale) holds proven measurement properties and may be a candidate tool for "bed to community" balance measurement for patients with balance problems within the neuro-rehabilitation setting. Future studies are warranted to explore further its external validity and other clinical properties, as well as to improve its usability.
Construct validity of the Heart Failure Screening Tool (Heart-FaST) to identify heart failure patients at risk of poor self-care: Rasch analysis.

Science.gov (United States)

Reynolds, Nicholas A; Ski, Chantal F; McEvedy, Samantha M; Thompson, David R; Cameron, Jan

2018-02-14

The aim of this study was to psychometrically evaluate the Heart Failure Screening Tool (Heart-FaST) via: (1) examination of internal construct validity; (2) testing of scale function in accordance with design; and (3) recommendation for change/s, if items are not well adjusted, to improve psychometric credential. Self-care is vital to the management of heart failure. The Heart-FaST may provide a prospective assessment of risk, regarding the likelihood that patients with heart failure will engage in self-care. Psychometric validation of the Heart-FaST using Rasch analysis. The Heart-FaST was administered to 135 patients (median age = 68, IQR = 59-78 years; 105 males) enrolled in a multidisciplinary heart failure management program. The Heart-FaST is a nurse-administered tool for screening patients with HF at risk of poor self-care. A Rasch analysis of responses was conducted which tested data against Rasch model expectations, including whether items serve as unbiased, non-redundant indicators of risk and measure a single construct and that rating scales operate as intended. The results showed that data met Rasch model expectations after rescoring or deleting items due to poor discrimination, disordered thresholds, differential item functioning, or response dependence. There was no evidence of multidimensionality which supports the use of total scores from Heart-FaST as indicators of risk. Aggregate scores from this modified screening tool rank heart failure patients according to their "risk of poor self-care" demonstrating that the Heart-FaST items constitute a meaningful scale to identify heart failure patients at risk of poor engagement in heart failure self-care. © 2018 John Wiley & Sons Ltd.
Item-Level Psychometrics of the Glasgow Outcome Scale: Extended Structured Interviews.

Science.gov (United States)

Hong, Ickpyo; Li, Chih-Ying; Velozo, Craig A

2016-04-01

The Glasgow Outcome Scale-Extended (GOSE) structured interview captures critical components of activities and participation, including home, shopping, work, leisure, and family/friend relationships. Eighty-nine community dwelling adults with mild-moderate traumatic brain injury (TBI) were recruited (average = 2.7 years post injury). Nine of the 19 items were used for the psychometric analysis. Factor analysis and item-level psychometrics were investigated using the Rasch partial-credit model. Although the principal components analysis of residuals suggests that a single measurement factor dominates the measure, the instrument did not meet the factor analysis criteria. Five items met the rating scale criteria. Eight items fit the Rasch model. The instrument demonstrated low person reliability (0.63), low person strata (2.07), and a slight ceiling effect. The GOSE demonstrated limitations in precisely measuring activities/participation for individuals after TBI. Future studies should examine the impact of the low precision of the GOSE on effect size. © The Author(s) 2016.
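The reported person strata can be related to the reported person reliability. The sketch below assumes the conventional definitions used in Rasch reporting (separation G = sqrt(R / (1 - R)) and strata = (4G + 1) / 3); the abstract does not state which formula the authors used, and the function names are illustrative only.

```python
import math


def separation_index(reliability: float) -> float:
    """Separation G = sqrt(R / (1 - R)) for a reliability R in [0, 1)."""
    if not 0.0 <= reliability < 1.0:
        raise ValueError("reliability must lie in [0, 1)")
    return math.sqrt(reliability / (1.0 - reliability))


def strata(reliability: float) -> float:
    """Number of statistically distinct levels, (4G + 1) / 3 (assumed formula)."""
    return (4.0 * separation_index(reliability) + 1.0) / 3.0


# Person reliability of 0.63, as reported in the abstract above.
print(round(strata(0.63), 2))  # → 2.07
```

Under these assumptions a reliability of 0.63 reproduces the strata value of 2.07, i.e. the instrument distinguishes roughly two levels of respondents.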
Developing a Measure of Therapist Adherence to Contingency Management: An Application of the Many-Facet Rasch Model.

Science.gov (United States)

Chapman, Jason E; Sheidow, Ashli J; Henggeler, Scott W; Halliday-Boykins, Colleen; Cunningham, Phillippe B

2008-06-01

A unique application of the Many-Facet Rasch Model (MFRM) is introduced as the preferred method for evaluating the psychometric properties of a measure of therapist adherence to Contingency Management (CM) treatment of adolescent substance use. The utility of psychometric methods based in Classical Test Theory was limited by complexities of the data, including: (a) ratings provided by multiple informants (i.e., youth, caregivers, and therapists), (b) data from separate research studies, (c) repeated measurements, (d) multiple versions of the questionnaire, and (e) missing data. Two dimensions of CM adherence were supported: adherence to Cognitive Behavioral components and adherence to Monitoring components. The rating scale performed differently for items in these subscales, and of 11 items evaluated, eight were found to perform well. The MFRM is presented as a highly flexible approach that can be used to overcome the limitations of traditional methods in the development of adherence measures for evidence-based practices.
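For orientation, the general form of a many-facet Rasch model with one added rater facet can be written as below. This is the textbook form, not necessarily the exact parameterisation used in the study; the mapping of symbols to therapists, items, and informants is an assumption made here for illustration.

```latex
\[
\ln\!\left(\frac{P_{nijk}}{P_{nij(k-1)}}\right) = B_n - D_i - C_j - F_k
\]
% B_n : measure of the rated object n (assumed here: a therapist's adherence)
% D_i : difficulty of item i
% C_j : severity of rater j (assumed here: youth, caregiver, or therapist informant)
% F_k : threshold between rating categories k-1 and k
% P_{nijk} : probability that rater j assigns category k to object n on item i
```

Each additional source of variation listed in the abstract (study, occasion, questionnaire version) could in principle enter as a further subtracted facet term.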
Scale construction utilising the Rasch unidimensional measurement model: A measurement of adolescent attitudes towards abortion

Directory of Open Access Journals (Sweden)

Jacqueline Hendriks

2012-05-01

Full Text Available Background: Measurement scales seeking to quantify latent traits, like attitudes, are often developed using traditional psychometric approaches. Application of the Rasch unidimensional measurement model may complement or replace these techniques, as the model can be used to construct scales and check their psychometric properties. If data fit the model, then a scale with invariant measurement properties, including interval-level scores, will have been developed. Aims: This paper highlights the unique properties of the Rasch model. Items developed to measure adolescent attitudes towards abortion are used to exemplify the process. Method: Ten attitude and intention items relating to abortion were answered by 406 adolescents aged 12 to 19 years, as part of the “Teen Relationships Study”. The sampling framework captured a range of sexual and pregnancy experiences. Items were assessed for fit to the Rasch model, including checks for Differential Item Functioning (DIF) by gender, sexual experience or pregnancy experience. Results: Rasch analysis of the original dataset initially demonstrated that some items did not fit the model. Rescoring of one item (B5) and removal of another (L31) resulted in fit, as shown by a non-significant item-trait interaction total chi-square and a mean log residual fit statistic for items of -0.05 (SD=1.43). No DIF existed for the revised scale. However, items did not distinguish as well amongst persons with the most intense attitudes as they did for other persons. A person separation index of 0.82 indicated good reliability. Conclusion: Application of the Rasch model produced a valid and reliable scale measuring adolescent attitudes towards abortion, with stable measurement properties. The Rasch process provided an extensive range of diagnostic information concerning item and person fit, enabling changes to be made to scale items. This example shows the value of the Rasch model in developing scales for both social science and health disciplines.
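The invariance property mentioned in this abstract can be illustrated with the simplest (dichotomous) Rasch model. The sketch below is not the polytomous model the authors would have fitted to rating-scale items; the function names and the example values are hypothetical.

```python
import math


def rasch_probability(theta: float, b: float) -> float:
    """P(endorse) for person location theta and item location b (both in logits)."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def log_odds(p: float) -> float:
    """Logit transform of a probability."""
    return math.log(p / (1.0 - p))


# Two hypothetical items and two hypothetical persons.
easy_item, hard_item = -0.5, 1.0
for theta in (-1.0, 2.0):
    gap = log_odds(rasch_probability(theta, easy_item)) - log_odds(
        rasch_probability(theta, hard_item)
    )
    # The gap equals hard_item - easy_item for every person location.
    print(theta, round(gap, 6))
```

The comparison between the two items comes out the same whichever person is used, which is the sense in which a fitting scale has invariant, interval-level properties.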
Factors associated with knowledge of diabetes in patients with type 2 diabetes using the Diabetes Knowledge Test validated with Rasch analysis.

Directory of Open Access Journals (Sweden)

Eva K Fenwick

Full Text Available OBJECTIVE: In patients with Type 2 diabetes, to determine the factors associated with diabetes knowledge, derived from Rasch analysis, and compare results with a traditional raw scoring method. RESEARCH DESIGN & METHODS: Participants in this cross-sectional study underwent a comprehensive clinical and biochemical assessment. Diabetes knowledge (main outcome) was assessed using the Diabetes Knowledge Test (DKT), which was psychometrically validated using Rasch analysis. The relationship between diabetes knowledge and risk factors identified during univariate analyses was examined using multivariable linear regression. The results using raw and Rasch-transformed methods were descriptively compared. RESULTS: 181 patients (mean age±standard deviation = 66.97±9.17 years; 113 (62%) male) were included. Using Rasch-derived DKT scores, those with greater education (β = 1.14; CI: 0.25,2.04, p = 0.013); had seen an ophthalmologist (β = 1.65; CI: 0.63,2.66, p = 0.002), and spoke English at home (β = 1.37; CI: 0.43,2.31, p = 0.005) had significantly better diabetes knowledge than those with less education, had not seen an ophthalmologist and spoke a language other than English, respectively. Patients who were members of the National Diabetes Service Scheme (NDSS) and had seen a diabetes educator also had better diabetes knowledge than their counterparts. Higher HbA1c level was independently associated with worse diabetes knowledge. Using raw measures, access to an ophthalmologist and NDSS membership were not independently associated with diabetes knowledge. CONCLUSIONS: Sociodemographic, clinical and service use factors were independently associated with diabetes knowledge based on both raw scores and Rasch-derived scores, which supports the implementation of targeted interventions to improve patients' knowledge. Choice of psychometric analytical method can affect study outcomes and should be considered during intervention
Factors Associated with Knowledge of Diabetes in Patients with Type 2 Diabetes Using the Diabetes Knowledge Test Validated with Rasch Analysis

Science.gov (United States)

Fenwick, Eva K.; Xie, Jing; Rees, Gwyn; Finger, Robert P.; Lamoureux, Ecosse L.

2013-01-01

Objective In patients with Type 2 diabetes, to determine the factors associated with diabetes knowledge, derived from Rasch analysis, and compare results with a traditional raw scoring method. Research Design & Methods Participants in this cross-sectional study underwent a comprehensive clinical and biochemical assessment. Diabetes knowledge (main outcome) was assessed using the Diabetes Knowledge Test (DKT) which was psychometrically validated using Rasch analysis. The relationship between diabetes knowledge and risk factors identified during univariate analyses was examined using multivariable linear regression. The results using raw and Rasch-transformed methods were descriptively compared. Results 181 patients (mean age±standard deviation = 66.97±9.17 years; 113 (62%) male) were included. Using Rasch-derived DKT scores, those with greater education (β = 1.14; CI: 0.25,2.04, p = 0.013); had seen an ophthalmologist (β = 1.65; CI: 0.63,2.66, p = 0.002), and spoke English at home (β = 1.37; CI: 0.43,2.31, p = 0.005) had significantly better diabetes knowledge than those with less education, had not seen an ophthalmologist and spoke a language other than English, respectively. Patients who were members of the National Diabetes Service Scheme (NDSS) and had seen a diabetes educator also had better diabetes knowledge than their counterparts. Higher HbA1c level was independently associated with worse diabetes knowledge. Using raw measures, access to an ophthalmologist and NDSS membership were not independently associated with diabetes knowledge. Conclusions Sociodemographic, clinical and service use factors were independently associated with diabetes knowledge based on both raw scores and Rasch-derived scores, which supports the implementation of targeted interventions to improve patients' knowledge. Choice of psychometric analytical method can affect study outcomes and should be considered during intervention development. PMID:24312484
Using Rasch Analysis to Evaluate the Reliability and Validity of the Swallowing Quality of Life Questionnaire: An Item Response Theory Approach.

Science.gov (United States)

Cordier, Reinie; Speyer, Renée; Schindler, Antonio; Michou, Emilia; Heijnen, Bas Joris; Baijens, Laura; Karaduman, Ayşe; Swan, Katina; Clavé, Pere; Joosten, Annette Veronica

2018-02-01

The Swallowing Quality of Life questionnaire (SWAL-QOL) is widely used clinically and in research to evaluate quality of life related to swallowing difficulties. It has been described as a valid and reliable tool, but was developed and tested using classic test theory. This study describes the reliability and validity of the SWAL-QOL using item response theory (IRT; Rasch analysis). SWAL-QOL data were gathered from 507 participants at risk of oropharyngeal dysphagia (OD) across four European countries. OD was confirmed in 75.7% of participants via videofluoroscopy and/or fiberoptic endoscopic evaluation, or a clinical diagnosis based on meeting selected criteria. Patients with esophageal dysphagia were excluded. Data were analysed using Rasch analysis. Item and person reliability was good for all the items combined. However, person reliability was poor for 8 subscales and item reliability was poor for one subscale. Eight subscales exhibited poor person separation and two exhibited poor item separation. Overall item and person fit statistics were acceptable. However, at an individual item fit level results indicated unpredictable item responses for 28 items, and item redundancy for 10 items. The item-person dimensionality map confirmed these findings. Results from the overall Rasch model fit and Principal Component Analysis were suggestive of a second dimension. For all the items combined, none of the item categories were 'category', 'threshold' or 'step' disordered; however, all subscales demonstrated category disordered functioning. Findings suggest an urgent need to further investigate the underlying structure of the SWAL-QOL and its psychometric characteristics using IRT.
Psychometric Properties of the Fatigue Severity Scale in Polio Survivors

Science.gov (United States)

Burger, Helena; Franchignoni, Franco; Puzic, Natasa; Giordano, Andrea

2010-01-01

The objective of this study was to evaluate by means of classical test theory and Rasch analysis the scaling characteristics and psychometric properties of the Fatigue Severity Scale (FSS) in polio survivors. A questionnaire, consisting of five general questions (sex, age, age at time of acute polio, sequelae of polio, and new symptoms), the FSS,…
Construct validity of the psychological general well being index (PGWBI) in a sample of patients undergoing treatment for stress-related exhaustion: a Rasch analysis

Directory of Open Access Journals (Sweden)

Lundgren-Nilsson Åsa

2013-01-01

Full Text Available Abstract Purpose The Psychological General Well Being Index (PGWBI) is a widely used scale across many conditions. Over time issues have been raised about the dimensional structure of the scale, and it has not yet been subjected to scrutiny by modern psychometric approaches. The current study thus evaluates the PGWBI with Rasch- and factor analysis. Methods Consecutive patients recruited to a tertiary stress clinic were administered the PGWBI as part of routine clinical assessment at baseline and three months. Data from the scale was subjected to Factor Analyses and to Rasch analysis. In both cases adjustments for local independence violations were allowed. Results 179 patients were recruited, with a mean age of 43 years, and of whom 70% were female. An initial Confirmatory Factor Analysis (CFA) with baseline data failed, but the modification indices also indicated considerable levels of local dependency requiring errors to be correlated. An EFA highlighted positive and negative effect domains. Rasch analysis confirmed that fit of data to the model was influenced by local dependency, and that in practice if the items from the six underlying domains were treated as six ‘super’ items, the scale was shown to measure one dominant construct of well being. An interval scale transformation was therefore possible. A significant improvement in well-being was observed over a three month period. Conclusion The PGWBI scale has satisfactory internal construct validity when tested with modern psychometric techniques, using data obtained from patients treated for stress-related exhaustion. The instrument has qualities that make it suitable also for monitoring well-being during interventions for stress-related exhaustion/clinical burnout.
Measurement of change in health status with Rasch models.

Science.gov (United States)

Anselmi, Pasquale; Vidotto, Giulio; Bettinardi, Ornella; Bertolotti, Giorgio

2015-02-07

The traditional approach to the measurement of change presents important drawbacks (no information at individual level, ordinal scores, variance of the measurement instrument across time points), which Rasch models overcome. The article aims to illustrate the features of the measurement of change with Rasch models. To illustrate the measurement of change using Rasch models, the quantitative data of a longitudinal study of heart-surgery patients (N = 98) were used. The scale "Perception of Positive Change" was used as an example of measurement instrument. All patients underwent cardiac rehabilitation, individual psychological intervention, and educational intervention. Nineteen patients also attended progressive muscle relaxation group trainings. The scale was administered before and after the interventions. Three Rasch approaches were used. Two separate analyses were run on the data from the two time points to test the invariance of the instrument. An analysis was run on the stacked data from both time points to measure change in a common frame of reference. Results of the latter analysis were compared with those of an analysis that removed the influence of local dependency on patient measures. Statistics t, χ(2) and F were used for comparing the patient and item measures estimated in the Rasch analyses (a-priori α = .05). Infit, Outfit, R and item Strata were used for investigating Rasch model fit, reliability, and validity of the instrument. Data of all 98 patients were included in the analyses. The instrument was reliable, valid, and substantively unidimensional (Infit, Outfit … instrument occurred across the two time points, which prevented the use of the two separate analyses to unambiguously measure change. Local dependency had a negligible effect on patient measures (p ≥ .8674). Thirteen patients improved, whereas 3 worsened. The patients who attended the relaxation group trainings did not report greater improvement than those who did not (p…
Construct Validity of the Holistic Complementary and Alternative Medicines Questionnaire (HCAMQ)—An Investigation Using Modern Psychometric Approaches

Directory of Open Access Journals (Sweden)

Paula Kersten

2011-01-01

Full Text Available The scientific basis of efficacy studies of complementary medicine requires the availability of validated measures. The Holistic Complementary and Alternative Medicine Questionnaire (HCAMQ) is one such measure. This article aimed to examine its construct validity, using a modern psychometric approach. The HCAMQ was completed by 221 patients (mean age 66.8, SD 8.29, 58% females) with chronic stable pain predominantly from a single joint (hip or knee) of mechanical origin, waiting for a hip (40%) or knee (60%) joint replacement, on enrolment in a study investigating the effects of acupuncture and placebo controls. The HCAMQ contains a Holistic Health (HH) Subscale (five items) and a CAM subscale (six items). Validity of the subscales was tested using Cronbach's alpha, factor analysis, Mokken scaling and Rasch analysis, which did not support the original two-factor structure of the scale. A five-item HH subscale and a four-item CAM subscale (worded in a negative direction) fitted the Rasch model and were unidimensional (χ2=8.44, P=0.39, PSI=0.69 versus χ2=17.33, P=0.03, PSI=0.77). Two CAM items (worded in the positive direction) had significant misfit. In conclusion, we have shown that the original two-factor structure of the HCAMQ could not be supported but that two valid shortened subscales can be used, one for HH Beliefs (four-item HH), and the other for CAM Beliefs (four-item CAM). It is recommended that consideration is given to rewording the two discarded positively worded CAM questions to enhance construct validity.
The Reliability and Validity of the Power-Load-Margin Inventory: A Rasch Analysis.

Science.gov (United States)

Hardigan, Patrick C; Cohen, Stanley R; Hagen, Kathleen P

2015-01-01

Margin is a function of the relationship of stress to strength. The greater the margin, the more likely students are able to successfully navigate academic structures. This study examined the psychometric properties of a newly created instrument designed to measure margin - the Power-Load-Margin Inventory (PLMI). The PLMI was created using eight domains: (A) Student's aptitude and ability, (B) Course structure, (C) External motivation, (D) Student health, (E) Instructor style, (F) Internal motivation, (G) Life opportunities, and (H) University support structure. A three-point response scale was used to measure the domains: (1) stress, (2) neither stress nor strength, and (3) strength. The PLMI was administered to 586 medical, dental, and pharmacy students. A Rasch rating scale model was used to examine the psychometric properties of the PLMI. The PLMI demonstrated acceptable psychometric properties for use with pharmacy, dental, and medical students. The PLMI's primary weakness was with the subscales' reliability. We attribute this to the small number of items per subscale.

Psychometric Properties of the COPD-Specific Beliefs About Medicine Questionnaire in an Outpatient Population

DEFF Research Database (Denmark)

Topp, Marie; Vestbo, Jørgen; Mortensen, Erik Lykke

2016-01-01

a Danish respiratory outpatient clinic. The Rasch model was used to evaluate psychometric characteristics of the BMQ-COPD and to obtain necessity and concerns scales fulfilling criteria of unidimensionality and overall fit, and with all items showing individual item fit with no local dependencies...
Rasch Analysis of Lebanese Nurses’ Responses to the EIS Questionnaire

Directory of Open Access Journals (Sweden)

Michael Clinton

2014-08-01

Full Text Available This study examined the psychometric characteristics of a 32-item modified version of the Ethical Issues Scale (EIS). Data were collected from 59 registered nurses at the American University of Beirut Medical Centre (AUBMC). Data were analyzed using WINSTEPS Rasch analysis software. The four-category EIS rating scale needs modification for future studies in Lebanon. All EIS scale items need rewording prior to translation into Arabic to avoid confusion among Lebanese nurses. Principal component analysis (PCA) of residuals indicated the possible presence of additional dimensions. Additional EIS items are needed to improve targeting.
Comparison of formula and number-right scoring in undergraduate medical training: a Rasch model analysis.

Science.gov (United States)

Cecilio-Fernandes, Dario; Medema, Harro; Collares, Carlos Fernando; Schuwirth, Lambert; Cohen-Schotanus, Janke; Tio, René A

2017-11-09

Progress testing is an assessment tool used to periodically assess all students at the end-of-curriculum level. Because students cannot know everything, it is important that they recognize their lack of knowledge. For that reason, the formula-scoring method has usually been used. However, where partial knowledge needs to be taken into account, the number-right scoring method is used. Research comparing both methods has yielded conflicting results. As far as we know, in all these studies, Classical Test Theory or Generalizability Theory was used to analyze the data. In contrast to these studies, we will explore the use of the Rasch model to compare both methods. A 2 × 2 crossover design was used in a study where 298 students from four medical schools participated. A sample of 200 previously used questions from the progress tests was selected. The data were analyzed using the Rasch model, which provides fit parameters, reliability coefficients, and response option analysis. The fit parameters were in the optimal interval ranging from 0.50 to 1.50, and the means were around 1.00. The person and item reliability coefficients were higher in the number-right condition than in the formula-scoring condition. The response option analysis showed that the majority of dysfunctional items emerged in the formula-scoring condition. The findings of this study support the use of number-right scoring over formula scoring. Rasch model analyses showed that tests with number-right scoring have better psychometric properties than formula scoring. However, choosing the appropriate scoring method should depend not only on psychometric properties but also on self-directed test-taking strategies and metacognitive skills.
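The two scoring rules compared in this abstract can be sketched as follows. The sketch assumes the classical correction-for-guessing form of formula scoring (wrong answers penalised by 1/(k - 1) for k answer options, "don't know" answers scored zero); the abstract does not give the exact penalty used, and the example counts and option number are hypothetical.

```python
def number_right_score(n_correct: int) -> float:
    """Number-right scoring: only correct answers count."""
    return float(n_correct)


def formula_score(n_correct: int, n_wrong: int, n_options: int) -> float:
    """Formula scoring (assumed form): correct minus wrong / (options - 1).

    Omitted or "don't know" answers contribute nothing.
    """
    if n_options < 2:
        raise ValueError("an item needs at least two answer options")
    return n_correct - n_wrong / (n_options - 1)


# Hypothetical student on a 200-question test with 5 options per question:
# 120 correct, 40 wrong, 40 left as "don't know".
print(number_right_score(120))    # → 120.0
print(formula_score(120, 40, 5))  # → 110.0
```

Under formula scoring a pure random guesser expects a score of zero, which is why it rewards students for recognising what they do not know.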
Graphical Rasch models

DEFF Research Database (Denmark)

Kreiner, Svend; Christensen, Karl Bang

Rasch models; Partial Credit models; Rating Scale models; Item bias; Differential item functioning; Local independence; Graphical models
Construct Validity of the Holistic Complementary and Alternative Medicines Questionnaire (HCAMQ)—An Investigation Using Modern Psychometric Approaches

Science.gov (United States)

Kersten, Paula; White, P. J.; Tennant, A.

2011-01-01

The scientific basis of efficacy studies of complementary medicine requires the availability of validated measures. The Holistic Complementary and Alternative Medicine Questionnaire (HCAMQ) is one such measure. This article aimed to examine its construct validity, using a modern psychometric approach. The HCAMQ was completed by 221 patients (mean age 66.8, SD 8.29, 58% females) with chronic stable pain predominantly from a single joint (hip or knee) of mechanical origin, waiting for a hip (40%) or knee (60%) joint replacement, on enrolment in a study investigating the effects of acupuncture and placebo controls. The HCAMQ contains a Holistic Health (HH) Subscale (five items) and a CAM subscale (six items). Validity of the subscales was tested using Cronbach's alpha, factor analysis, Mokken scaling and Rasch analysis, which did not support the original two-factor structure of the scale. A five-item HH subscale and a four-item CAM subscale (worded in a negative direction) fitted the Rasch model and were unidimensional (χ2 = 8.44, P = 0.39, PSI = 0.69 versus χ2 = 17.33, P = 0.03, PSI = 0.77). Two CAM items (worded in the positive direction) had significant misfit. In conclusion, we have shown that the original two-factor structure of the HCAMQ could not be supported but that two valid shortened subscales can be used, one for HH Beliefs (four-item HH), and the other for CAM Beliefs (four-item CAM). It is recommended that consideration is given to rewording the two discarded positively worded CAM questions to enhance construct validity. PMID:19793835
Using and Developing Measurement Instruments in Science Education: A Rasch Modeling Approach. Science & Engineering Education Sources

Science.gov (United States)

Liu, Xiufeng

2010-01-01

This book meets a demand in the science education community for a comprehensive and introductory measurement book in science education. It describes measurement instruments reported in refereed science education research journals, and introduces the Rasch modeling approach to developing measurement instruments in common science assessment domains,…
Factor and Rasch analysis of the Fonseca anamnestic index for the diagnosis of myogenous temporomandibular disorder.

Science.gov (United States)

Rodrigues-Bigaton, Delaine; de Castro, Ester M; Pires, Paulo F

Rasch analysis has been used in recent studies to test the psychometric properties of a questionnaire. The conditions for use of the Rasch model are one-dimensionality (assessed via prior factor analysis) and local independence (the probability of getting a particular item right or wrong should not be conditioned upon success or failure in another). To evaluate the dimensionality and the psychometric properties of the Fonseca anamnestic index (FAI), such as the fit of the data to the model, the degree of difficulty of the items, and the ability to respond in patients with myogenous temporomandibular disorder (TMD). The sample consisted of 94 women with myogenous TMD, diagnosed by the Research Diagnostic Criteria for Temporomandibular Disorders (RDC/TMD), who answered the FAI. For the factor analysis, we applied the Kaiser-Meyer-Olkin test, Bartlett's sphericity, Spearman's correlation, and the determinant of the correlation matrix. For extraction of the factors/dimensions, an eigenvalue >1.0 was used, followed by oblique oblimin rotation. The Rasch analysis was conducted on the dimension that showed the highest proportion of variance explained. Adequate sample "n" and FAI multidimensionality were observed. Dimension 1 (primary) consisted of items 1, 2, 3, 6, and 7. All items of dimension 1 showed adequate fit to the model, being observed according to the degree of difficulty (from most difficult to easiest), respectively, items 2, 1, 3, 6, and 7. The FAI presented multidimensionality with its main dimension consisting of five reliable items with adequate fit to the composition of its structure. Copyright © 2017 Associação Brasileira de Pesquisa e Pós-Graduação em Fisioterapia. Publicado por Elsevier Editora Ltda. All rights reserved.
Rasch validation of the Chinese parent-child interaction scale (CPCIS).

Science.gov (United States)

Ip, Patrick; Tso, Winnie; Rao, Nirmala; Ho, Frederick Ka Wing; Chan, Ko Ling; Fu, King Wa; Li, Sophia Ling; Goh, Winnie; Wong, Wilfred Hing-Sang; Chow, Chun Bong

2018-03-15

Proper parent-child interaction is crucial for child development, but an assessment tool in Chinese is currently lacking. This study aimed to develop and validate a parent-reported parent-child interaction scale for Chinese preschool children. The Chinese parent-child interaction scale (CPCIS) was designed by an expert panel based on the literature and clinical observations in the Chinese context. The initial CPCIS had 14 parent-child interactive activity items. Psychometric properties of the CPCIS were examined using the Rasch model and confirmatory factor analysis (CFA). Convergent validity was investigated by the associations between CPCIS and family income, maternal education level, and children's school readiness. The study recruited 567 Chinese parent-child pairs from diverse socioeconomic backgrounds, who completed the CPCIS. Six out of the 14 items in the initial CPCIS were dropped due to suboptimal fit values. The refined 8-item CPCIS was shown to be valid and reliable by Rasch models and CFA. The person separation reliability and Cronbach's α of the CPCIS were 0.81 and 0.82, respectively. The CPCIS scores were positively associated with family's socioeconomic status (η² = 0.05, P parent-child interactions in Chinese families.
Application of Rasch analysis to the parent adherence report questionnaire in juvenile idiopathic arthritis.

Science.gov (United States)

Toupin April, Karine; Higgins, Johanne; Ehrmann Feldman, Debbie

2016-07-28

Adherence to treatment in children with juvenile idiopathic arthritis (JIA) is associated with better outcomes. Assessing patient adherence in JIA, as well as attitudes and beliefs about prescribed treatments, is important for the clinician in order to optimize patient management. The objective of the current study was to evaluate the psychometric properties of the Parent (proxy-report) Adherence Report Questionnaire (PARQ), which assesses beliefs and behaviors related to adherence to treatments prescribed for JIA. A Rasch analysis was conducted on data collected with parents of children with JIA from two studies in which the PARQ was used as a measure of adherence. The PARQ showed preliminary evidence of multidimensionality with two factors, accounting for 38% and 27% of the variance respectively. The PARQ in its original version does not adhere to expectations of the Rasch model. A transformed version of the PARQ obtained by deletion of the general adherence scale and modification of visual analog scales into 5-point Likert scales improved fit to the model and showed preliminary evidence of unidimensionality. The PARQ was transformed based on the results of the Rasch analysis. The transformed version of the PARQ shows preliminary evidence of unidimensionality and may allow computation of a total score, although further testing is needed to verify these findings.
Causal Rasch models

Science.gov (United States)

Stenner, A. Jackson; Fisher, William P.; Stone, Mark H.; Burdick, Donald S.

2013-01-01

Rasch's unidimensional models for measurement show how to connect object measures (e.g., reader abilities), measurement mechanisms (e.g., machine-generated cloze reading items), and observational outcomes (e.g., counts correct on reading instruments). Substantive theory shows what interventions or manipulations to the measurement mechanism can be traded off against a change to the object measure to hold the observed outcome constant. A Rasch model integrated with a substantive theory dictates the form and substance of permissible interventions. Rasch analysis, absent construct theory and an associated specification equation, is a black box in which understanding may be more illusory than not. Finally, the quantitative hypothesis can be tested by comparing theory-based trade-off relations with observed trade-off relations. Only quantitative variables (as measured) support such trade-offs. Note that to test the quantitative hypothesis requires more than manipulation of the algebraic equivalencies in the Rasch model or descriptively fitting data to the model. A causal Rasch model involves experimental intervention/manipulation on either reader ability or text complexity or a conjoint intervention on both simultaneously to yield a successful prediction of the resultant observed outcome (count correct). We conjecture that when this type of manipulation is introduced for individual reader text encounters and model predictions are consistent with observations, the quantitative hypothesis is sustained. PMID:23986726
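The trade-off described in this abstract can be illustrated numerically. The sketch below is not the authors' specification equation. It assumes the plain dichotomous Rasch form, and the ability value, item difficulties, and the function names `p_correct` and `expected_count` are hypothetical. It shows that raising reader ability and every item difficulty by the same amount leaves the expected count correct unchanged.

```python
import math

def p_correct(ability, difficulty):
    """Dichotomous Rasch probability of a correct response (logit metric)."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

def expected_count(ability, difficulties):
    """Expected count correct: sum of the item success probabilities."""
    return sum(p_correct(ability, d) for d in difficulties)

# Hypothetical item difficulties and reader ability, in logits.
difficulties = [-1.0, 0.0, 0.5, 1.5]
ability = 0.8

# Baseline encounter between this reader and this text.
baseline = expected_count(ability, difficulties)

# Conjoint intervention: ability and text difficulty both raised by 0.6 logits.
shift = 0.6
shifted = expected_count(ability + shift, [d + shift for d in difficulties])

print(round(baseline, 6), round(shifted, 6))  # the two values agree
```

Because only the difference between ability and difficulty enters the model, any equal shift on both sides cancels. A causal test in the abstract's sense would compare such predicted counts with counts observed after a real intervention.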
Rasch analysis of the 23-item version of the Roland Morris Disability Questionnaire

DEFF Research Database (Denmark)

Kent, Peter; Grotle, Margreth; Dunn, Kate M

2015-01-01

OBJECTIVE: To determine the psychometric properties of the 23-item version of the Roland Morris Disability Questionnaire (RMDQ-23) and to quantify their stability across 2 cultures/languages and 2 types of care-settings. METHODS: Rasch analysis of data from 1,000 patients with low back pain from...... clinical characteristics (such as age, gender, pain intensity, pain duration and care setting), depending on the country. CONCLUSION: As similar results have been found for the RMDQ-24, we believe it is timely to reconsider whether: (i) the RMDQ should be reconstructed using an item-response theory...
Using Rasch Measurement To Investigate the Cross-form Equivalence and Clinical Utility of Spanish and English Versions of a Diabetes Questionnaire: A Pilot Study.

Science.gov (United States)

Gerber, Ben; Smith, Everett V., Jr.; Girotti, Mariela; Pelaez, Lourdes; Lawless, Kimberly; Smolin, Louanne; Brodsky, Irwin; Eiser, Arnold

2002-01-01

Used Rasch measurement to study the psychometric properties of data obtained from a newly developed Diabetes Questionnaire designed to measure diabetes knowledge, attitudes, and self-care. Responses of 26 diabetes patients to the English version of the questionnaire and 24 patients to the Spanish version support the cross-form equivalence and…
Further psychometric evaluation and revision of the Mayo-Portland Adaptability Inventory in a national sample.

Science.gov (United States)

Malec, James F; Kragness, Miriam; Evans, Randall W; Finlay, Karen L; Kent, Ann; Lezak, Muriel D

2003-01-01

To evaluate the internal consistency of the Mayo-Portland Adaptability Inventory (MPAI), further refine the instrument, and provide reference data based on a large, geographically diverse sample of persons with acquired brain injury (ABI). 386 persons, most with moderate to severe ABI. Outpatient, community-based, and residential rehabilitation facilities for persons with ABI located in the United States: West, Midwest, and Southeast. Rasch, item cluster, principal components, and traditional psychometric analyses for internal consistency of MPAI data and subscales. With rescoring of rating scales for 4 items, a 29-item version of the MPAI showed satisfactory internal consistency by Rasch (Person Reliability = .88; Item Reliability = .99) and traditional psychometric indicators (Cronbach's alpha = .89). Three rationally derived subscales for Ability, Activity, and Participation demonstrated psychometric properties that were equivalent to subscales derived empirically through item cluster and factor analyses. For the 3 subscales, Person Reliability ranged from .78 to .79; Item Reliability, from .98 to .99; and Cronbach's alpha, from .76 to .83. Subscales correlated moderately (Pearson r = .49-.65) with each other and strongly with the overall scale (Pearson r = .82-.86). Outcome after ABI is represented by the unitary dimension described by the MPAI. MPAI subscales further define regions of this dimension that may be useful for evaluation of clinical cases and program evaluation.
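As a reminder of what the traditional indicator reported above measures, here is a minimal sketch of Cronbach's alpha computed from a persons-by-items score table. The ratings are invented for illustration and are unrelated to the MPAI data, and the function name `cronbach_alpha` is an assumption of this sketch.

```python
from statistics import variance

def cronbach_alpha(rows):
    """Cronbach's alpha for a complete persons-by-items table of scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance(total scores))
    """
    k = len(rows[0])
    # Sample variance of each item (column) across persons.
    item_vars = [variance([row[j] for row in rows]) for j in range(k)]
    # Sample variance of the persons' total scores.
    total_var = variance([sum(row) for row in rows])
    return k / (k - 1) * (1 - sum(item_vars) / total_var)

# Hypothetical ratings: 5 persons (rows) by 4 items (columns).
ratings = [
    [0, 1, 1, 2],
    [1, 1, 2, 2],
    [2, 3, 3, 4],
    [3, 3, 4, 4],
    [1, 2, 2, 3],
]
alpha = cronbach_alpha(ratings)
print(round(alpha, 3))
```

Alpha summarizes the internal consistency of raw scores. The Rasch person reliability in the abstract is computed from estimated measures and their standard errors, so the two indices need not coincide.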
Imputation by the mean score should be avoided when validating a Patient Reported Outcomes questionnaire by a Rasch model in presence of informative missing data

LENUS (Irish Health Repository)

Hardouin, Jean-Benoit

2011-07-14

Background: More and more clinical scales consisting of patients' responses to a set of items (Patient Reported Outcomes, PRO) are validated with models based on Item Response Theory, and more specifically with a Rasch model. Missing data are frequent in validation samples. The aim of this paper is to compare sixteen methods for handling missing data (mainly based on simple imputation) in the context of psychometric validation of a PRO by a Rasch model. The main indexes used for validation by a Rasch model are compared. Methods: A simulation study was performed that considered several cases, notably whether or not the missing values were informative and the rate of missing data. Results: Several imputation methods bias the psychometric indexes (generally, the imputation methods artificially improve the psychometric qualities of the scale). In particular, this is the case for the method based on the Personal Mean Score (PMS), which is the most commonly used imputation method in practice. Conclusions: Several imputation methods should be avoided, in particular PMS imputation. In general, it is important to use an imputation method that considers both the ability of the patient (measured, for example, by his/her score) and the difficulty of the item (measured, for example, by its rate of favourable responses). Another recommendation is to always consider adding a random process to the imputation method, because such a process reduces the bias. Lastly, analysis without imputation of the missing data (available case analysis) is an interesting alternative to simple imputation in this context.
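The contrast drawn in the conclusions can be sketched for dichotomous items. The code below is only an illustration under stated assumptions and does not reproduce any of the sixteen methods compared in the paper. `person_mean_impute` fills a gap from the person's own mean alone, while `two_way_random_impute` (a hypothetical name) combines the person mean and the item mean and then draws the value at random. Missing responses are coded as `None`, and every person and item is assumed to have at least one observed response.

```python
import random

def person_mean_impute(row):
    """Fill each missing response with the person's rounded mean score."""
    observed = [x for x in row if x is not None]
    fill = 1 if sum(observed) / len(observed) >= 0.5 else 0
    return [fill if x is None else x for x in row]

def two_way_random_impute(data, rng):
    """Fill gaps with a random draw whose probability reflects both the
    person's mean score and the item's rate of favourable responses."""
    flat = [x for row in data for x in row if x is not None]
    overall = sum(flat) / len(flat)
    item_means = []
    for j in range(len(data[0])):
        col = [row[j] for row in data if row[j] is not None]
        item_means.append(sum(col) / len(col))
    completed = []
    for row in data:
        seen = [x for x in row if x is not None]
        person_mean = sum(seen) / len(seen)
        new_row = []
        for j, x in enumerate(row):
            if x is None:
                # Person effect plus item effect, clipped to a valid probability.
                p = min(1.0, max(0.0, person_mean + item_means[j] - overall))
                x = 1 if rng.random() < p else 0
            new_row.append(x)
        completed.append(new_row)
    return completed

# Hypothetical responses: 4 persons by 4 items, None marks a missing answer.
data = [
    [1, 1, None, 0],
    [1, 0, 0, None],
    [None, 1, 1, 1],
    [0, 0, 0, 0],
]
pms = [person_mean_impute(row) for row in data]
two_way = two_way_random_impute(data, random.Random(0))
```

The person-mean fill ignores how hard the missing item is, which is one plausible reason it can flatter the scale. The random draw avoids filling every gap with the most likely value.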
Examination of an eHealth literacy scale and a health literacy scale in a population with moderate to high cardiovascular risk: Rasch analyses.

Directory of Open Access Journals (Sweden)

Sarah S Richtering

Electronic health (eHealth) strategies are evolving, making it important to have valid scales to assess eHealth and health literacy. Item response theory methods, such as the Rasch measurement model, are increasingly used for the psychometric evaluation of scales. This paper aims to examine the internal construct validity of an eHealth and health literacy scale using Rasch analysis in a population with moderate to high cardiovascular disease risk. The first 397 participants of the CONNECT study completed the electronic health Literacy Scale (eHEALS) and the Health Literacy Questionnaire (HLQ). Overall Rasch model fit as well as five key psychometric properties were analysed: unidimensionality, response thresholds, targeting, differential item functioning and internal consistency. The eHEALS had good overall model fit (χ2 = 54.8, p = 0.06), ordered response thresholds, reasonable targeting and good internal consistency (person separation index (PSI) 0.90). It did, however, appear to measure two constructs of eHealth literacy. The HLQ subscales (except subscale 5) did not fit the Rasch model (χ2: 18.18-60.60, p: 0.00-0.58) and had suboptimal targeting for most subscales. Subscales 6 to 9 displayed disordered thresholds indicating participants had difficulty distinguishing between response options. All subscales did, nonetheless, demonstrate moderate to good internal consistency (PSI: 0.62-0.82). Rasch analyses demonstrated that the eHEALS has good measures of internal construct validity although it appears to capture different aspects of eHealth literacy (e.g. using eHealth and understanding eHealth). Whilst further studies are required to confirm this finding, it may be necessary for these constructs of the eHEALS to be scored separately. The nine HLQ subscales were shown to measure a single construct of health literacy. However, participants' scores may not represent their actual level of ability, as distinction between response categories was unclear for
Fitting polytomous Rasch models in SAS

DEFF Research Database (Denmark)

Christensen, Karl Bang

2006-01-01

The item parameters of a polytomous Rasch model can be estimated using marginal and conditional approaches. This paper describes how this can be done in SAS (V8.2) for three item parameter estimation procedures: marginal maximum likelihood estimation, conditional maximum likelihood estimation, an...
Evaluation of the Edinburgh Post Natal Depression Scale using Rasch analysis

Directory of Open Access Journals (Sweden)

Tennant Alan

2006-06-01

Background: The Edinburgh Postnatal Depression Scale (EPDS) is a 10 item self-rating post-natal depression scale which has seen widespread use in epidemiological and clinical studies. Concern has been raised over the validity of the EPDS as a single summed scale, with suggestions that it measures two separate aspects, one of depressive feelings, the other of anxiety. Methods: As part of a larger cross-sectional study conducted in Melbourne, Australia, a community sample (324 women, ranging in age from 18 to 44 years: mean = 32 yrs, SD = 4.6) was obtained by inviting primiparous women to participate voluntarily in this study. Data from the EPDS were fitted to the Rasch measurement model and tested for appropriate category ordering, for item bias through Differential Item Functioning (DIF) analysis, and for unidimensionality through tests of the assumption of local independence. Results: Rasch analysis of the data from the ten item scale initially demonstrated a lack of fit to the model with a significant Item-Trait Interaction total chi-square (chi Square = 82.8, df = 40; p Conclusion: The results of this study suggest that EPDS, in its original 10 item form, is not a viable scale for the unidimensional measurement of depression. Rasch analysis suggests that a revised eight item version (EPDS-8) would provide a more psychometrically robust scale. The revised cut points of 7/8 and 9/10 for the EPDS-8 show high levels of agreement with the original case identification for the EPDS-10.
Predicting responses from Rasch measures.

Science.gov (United States)

Linacre, John M

2010-01-01

There is a growing family of Rasch models for polytomous observations. Selecting a suitable model for an existing dataset, estimating its parameters and evaluating its fit is now routine. Problems arise when the model parameters are to be estimated from the current data, but used to predict future data. In particular, ambiguities in the nature of the current data, or overfit of the model to the current dataset, may mean that better fit to the current data may lead to worse fit to future data. The predictive power of several Rasch and Rasch-related models are discussed in the context of the Netflix Prize. Rasch-related models are proposed based on Singular Value Decomposition (SVD) and Boltzmann Machines.
Polytomous Rasch Models in Counseling Assessment

Science.gov (United States)

Willse, John T.

2017-01-01

This article provides a brief introduction to the Rasch model. Motivation for using Rasch analyses is provided. Important Rasch model concepts and key aspects of result interpretation are introduced, with major points reinforced using a simulation demonstration. Concrete guidelines are provided regarding sample size and the evaluation of items.

What Is Embodiment? A Psychometric Approach

Science.gov (United States)

Longo, Matthew R.; Schuur, Friederike; Kammers, Marjolein P. M.; Tsakiris, Manos; Haggard, Patrick

2008-01-01

What is it like to have a body? The present study takes a psychometric approach to this question. We collected structured introspective reports of the rubber hand illusion, to systematically investigate the structure of bodily self-consciousness. Participants observed a rubber hand that was stroked either synchronously or asynchronously with their…
Brazilian WHOQOL-OLD Module version: a Rasch analysis of a new instrument Versão em português do Módulo WHOQOL-OLD: análise de Rasch de um novo instrumento

Directory of Open Access Journals (Sweden)

Eduardo Chachamovich

2008-04-01

OBJECTIVE: To evaluate the Brazilian version of WHOQOL-OLD Module and to test potential changes to the instrument to increase its psychometric adequacy. METHODS: A total of 424 older adults living in a city in Southern Brazil completed the WHOQOL-OLD instrument, in 2005. Rasch analysis was used to explore the psychometric performance of the scale, as implemented by the RUMM2020 software. Item-trait interaction, threshold disorders, presence of differential item functioning and item fit were analyzed. RESULTS: Two ("death and dying" and "sensory abilities") out of six domains showed inadequate item-trait interactions. Rescoring the response scale and deleting the most misperforming items led to scale improvement. The evaluation of domains and items individually showed that the "intimacy" domain does perform well in contrast to the findings using the classical approach. In addition, the "sensory abilities" domain does not derive an interval measure in its current format. CONCLUSIONS: Unidimensionality and local independence were seen in all domains. Changes in the response scale and deletion of problematic items improved the scale's performance.
Assessing social isolation in motor neurone disease: a Rasch analysis of the MND Social Withdrawal Scale.

Science.gov (United States)

Gibbons, Chris J; Thornton, Everard W; Ealing, John; Shaw, Pamela J; Talbot, Kevin; Tennant, Alan; Young, Carolyn A

2013-11-15

Social withdrawal is described as the condition in which an individual experiences a desire to make social contact, but is unable to satisfy that desire. It is an important issue for patients with motor neurone disease who are likely to experience severe physical impairment. This study aims to reassess the psychometric and scaling properties of the MND Social Withdrawal Scale (MND-SWS) domains and examine the feasibility of a summary scale, by applying scale data to the Rasch model. The MND Social Withdrawal Scale was administered to 298 patients with a diagnosis of MND, alongside the Hospital Anxiety and Depression Scale. The factor structure of the MND Social Withdrawal Scale was assessed using confirmatory factor analysis. Model fit, category threshold analysis, differential item functioning (DIF), dimensionality and local dependency were evaluated. Factor analysis confirmed the suitability of the four-factor solution suggested by the original authors. Mokken scale analysis suggested the removal of item five. Rasch analysis removed a further three items, from the Community (one item) and Emotional (two items) withdrawal subscales. Following item reduction, each scale exhibited excellent fit to the Rasch model. A 14-item Summary scale was shown to fit the Rasch model after subtesting the items into three subtests corresponding to the Community, Family and Emotional subscales, indicating that items from these three subscales could be summed together to create a total measure for social withdrawal. Removal of four items from the Social Withdrawal Scale led to a four-factor solution and a 14-item hierarchical Summary scale that were all unidimensional, free of DIF and well fitted to the Rasch model. The scale is reliable and allows clinicians and researchers to measure social withdrawal in MND along a unidimensional construct. © 2013. Published by Elsevier B.V. All rights reserved.
FIM measurement properties and Rasch model details.

Science.gov (United States)

Wright, B D; Linacre, J M; Smith, R M; Heinemann, A W; Granger, C V

1997-12-01

To summarize, we take issue with the criticisms of Dickson & Köhler for two main reasons: 1. Rasch analysis provides a model from which to approach the analysis of the FIM, an ordinal scale, as an interval scale. The existence of examples of items or individuals which do not fit the model does not disprove the overall efficacy of the model; and 2. the principal components analysis of FIM motor items as presented by Dickson & Köhler tends to undermine rather than support their argument. Their own analyses produce a single major factor explaining between 58.5 and 67.1% of the variance, depending upon the sample, with secondary factors explaining much less variance. Finally, analysis of item response, or latent trait, is a powerful method for understanding the meaning of a measure. However, it presumes that item scores are accurate. Another concern is that Dickson & Köhler do not address the issue of reliability of scoring the FIM items on which they report, a critical point in comparing results. The Uniform Data System for Medical Rehabilitation (UDSMR) expends extensive effort in the training of clinicians of subscribing facilities to score items accurately. This is followed up with a credentialing process. Phase 1 involves the testing of individual clinicians who are submitting data to determine if they have achieved mastery over the use of the FIM instrument. Phase 2 involves examining the data for outlying values. When Dickson & Köhler investigate more carefully the application of the Rasch model to their FIM data, they will discover that the results presented in their paper support rather than contradict their application of the Rasch model! This paper is typical of supposed refutations of Rasch model applications. Dickson & Köhler will find that idiosyncrasies in their data and misunderstandings of the Rasch model are the only basis for a claim to have disproven the relevance of the model to FIM data. The Rasch model is a mathematical theorem (like
Causal Rasch models

Directory of Open Access Journals (Sweden)

A. Jackson Stenner

2013-08-01

Rasch’s unidimensional models for measurement show how to connect object measures (e.g., reader abilities), measurement mechanisms (e.g., machine-generated cloze reading items), and observational outcomes (e.g., counts correct on reading instruments). Substantive theory shows what interventions or manipulations to the measurement mechanism can be traded off against a change to the object measure to hold the observed outcome constant. A Rasch model integrated with a substantive theory dictates the form and substance of permissible interventions. Rasch analysis, absent construct theory and an associated specification equation, is a black box in which understanding may be more illusory than not. Finally, the quantitative hypothesis can be tested by comparing theory-based trade-off relations with observed trade-off relations. Only quantitative variables (as measured) support such trade-offs. Note that to test the quantitative hypothesis requires more than manipulation of the algebraic equivalencies in the Rasch model or descriptively fitting data to the model. A causal Rasch model involves experimental intervention/manipulation on either reader ability or text complexity or a conjoint intervention on both simultaneously to yield a successful prediction of the resultant observed outcome (count correct). We conjecture that when this type of manipulation is introduced for individual reader text encounters and model predictions are consistent with observations, the quantitative hypothesis is sustained.
Rasch models suggested the satisfactory psychometric properties of the World Health Organization Quality of Life-Brief among lung cancer patients.

Science.gov (United States)

Lin, Chung-Ying; Yang, Szu-Chun; Lai, Wu-Wei; Su, Wu-Chou; Wang, Jung-Der

2017-03-01

The study examined whether the items of the World Health Organization Quality of Life-Brief questionnaire can assess its four underlying domains (Physical, Psychological, Social, and Environment) in a sample of lung cancer patients. All patients (n = 1150) were recruited from a medical center in Tainan, and each participant completed the World Health Organization Quality of Life-Brief. Several Rasch rating scale models were used to examine the data-model fit, and Rasch analyses corroborated that each domain of the World Health Organization Quality of Life-Brief could be unidimensional. Although three items were found to have a poor fit, all the other items fit the unidimensionality with ordered thresholds.
Sample Size and Statistical Conclusions from Tests of Fit to the Rasch Model According to the Rasch Unidimensional Measurement Model (Rumm) Program in Health Outcome Measurement.

Science.gov (United States)

Hagell, Peter; Westergren, Albert

Sample size is a major factor in statistical null hypothesis testing, which is the basis for many approaches to testing Rasch model fit. Few sample size recommendations for testing fit to the Rasch model concern the Rasch Unidimensional Measurement Models (RUMM) software, which features chi-square and ANOVA/F-ratio based fit statistics, including Bonferroni and algebraic sample size adjustments. This paper explores the occurrence of Type I errors with RUMM fit statistics, and the effects of algebraic sample size adjustments. Simulated data fitting the Rasch model, for 25-item dichotomous scales and sample sizes ranging from N = 50 to N = 2500, were analysed with and without algebraically adjusted sample sizes. Results suggest the occurrence of Type I errors with N less than or equal to 500, and that Bonferroni correction as well as downward algebraic sample size adjustment are useful to avoid such errors, whereas upward adjustment of smaller samples falsely signals misfit. Our observations suggest that sample sizes around N = 250 to N = 500 may provide a good balance for the statistical interpretation of the RUMM fit statistics studied here with respect to Type I errors and under the assumption of Rasch model fit within the examined frame of reference (i.e., about 25 item parameters well targeted to the sample).
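The Bonferroni correction mentioned above is simple to state: with 25 item-level fit tests, each p-value is compared with alpha divided by 25 rather than with alpha. The sketch below uses invented p-values and a hypothetical helper name, `bonferroni_flags`. It does not reproduce RUMM's own computations, and the algebraic sample size adjustment is not shown.

```python
def bonferroni_flags(p_values, alpha=0.05):
    """Flag tests that remain significant after Bonferroni correction.

    Each p-value is compared with alpha divided by the number of tests.
    """
    threshold = alpha / len(p_values)
    return [p < threshold for p in p_values]

# Hypothetical item-fit p-values for a 25-item scale.
item_p = [0.030, 0.001, 0.200, 0.048, 0.0004] + [0.5] * 20

unadjusted = [p < 0.05 for p in item_p]
adjusted = bonferroni_flags(item_p)

# Items flagged as misfitting: uncorrected count, then Bonferroni-corrected count.
print(sum(unadjusted), sum(adjusted))  # prints "4 2"
```

With 25 tests the per-item threshold drops to 0.002, so two of the four items flagged at 0.05 no longer count as misfitting. That is the direction in which the correction guards against Type I errors.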
Gender fairness in self-efficacy? A Rasch-based validity study of the General Academic Self-efficacy scale (GASE)

DEFF Research Database (Denmark)

Nielsen, Tine; Vang, Maria Louison; Dammeyer, Jesper

2018-01-01

Studies have reported gender differences in academic self-efficacy. However, how and if academic self-efficacy questionnaires are gender-biased has not been psychometrically investigated. The psychometric properties of a general version of The Physics Self-Efficacy Questionnaire – the General...... Academic Self-Efficacy Scale (GASE) – were analyzed using Rasch measurement models, with data from 1018 Danish university students (psychology and technical), focusing on gender invariance and the sufficiency of the score. The short 4-item GASE scale was found to be essentially objective and construct...... valid and satisfactorily reliable, though differential item functioning was found relative to gender and academic discipline, and can be used to assess students’ general academic self-efficacy. Research on gender and self-efficacy needs to take gender into account and equate scores appropriately...
Examining Teacher Grades Using Rasch Measurement Theory

Science.gov (United States)

Randall, Jennifer; Engelhard, George, Jr.

2009-01-01

In this study, we present an approach to questionnaire design within educational research based on Guttman's mapping sentences and Many-Facet Rasch Measurement Theory. We designed a 54-item questionnaire using Guttman's mapping sentences to examine the grading practices of teachers. Each item in the questionnaire represented a unique student…
Psychometric evaluation of Persian Nomophobia Questionnaire: Differential item functioning and measurement invariance across gender.

Science.gov (United States)

Lin, Chung-Ying; Griffiths, Mark D; Pakpour, Amir H

2018-03-01

Background and aims: Research examining problematic mobile phone use has increased markedly over the past 5 years and has been related to "no mobile phone phobia" (so-called nomophobia). The 20-item Nomophobia Questionnaire (NMP-Q) is the only instrument that assesses nomophobia with an underlying theoretical structure and robust psychometric testing. This study aimed to confirm the construct validity of the Persian NMP-Q using Rasch and confirmatory factor analysis (CFA) models. Methods: After ensuring the linguistic validity, Rasch models were used to examine the unidimensionality of each Persian NMP-Q factor among 3,216 Iranian adolescents and CFAs were used to confirm its four-factor structure. Differential item functioning (DIF) and multigroup CFA were used to examine whether males and females interpreted the NMP-Q similarly, including item content and NMP-Q structure. Results: Each factor was unidimensional according to the Rasch findings, and the four-factor structure was supported by CFA. Two items did not quite fit the Rasch models (Item 14: "I would be nervous because I could not know if someone had tried to get a hold of me;" Item 9: "If I could not check my smartphone for a while, I would feel a desire to check it"). No DIF items were found across gender and measurement invariance was supported in multigroup CFA across gender. Conclusions: Due to the satisfactory psychometric properties, it is concluded that the Persian NMP-Q can be used to assess nomophobia among adolescents. Moreover, NMP-Q users may compare its scores between genders in the knowledge that there are no score differences contributed by different understandings of NMP-Q items.
Gendered language attitudes: exploring language as a gendered construct using Rasch measurement theory.

Science.gov (United States)

Knisely, Kris A; Wind, Stefanie A

2015-01-01

Gendered language attitudes (GLAs) are gender-based perceptions of language varieties based on connections between gender-related and linguistic characteristics of individuals, including the perception of language varieties as possessing degrees of masculinity and femininity. This study combines substantive theory about language learning and gender with a model based on Rasch measurement theory to explore the psychometric properties of a new measure of GLAs. Findings suggest that GLAs form a unidimensional construct and that the items can be used to describe differences among students in terms of the strength of their GLAs. Implications for research, theory, and practice are discussed. Special emphasis is given to the teaching and learning of languages.
Rasch analysis of the Rosenberg Self-Esteem Scale with African Americans.

Science.gov (United States)

Chao, Ruth Chu-Lien; Vidacovich, Courtney; Green, Kathy E

2017-03-01

Effectively diagnosing African Americans' self-esteem has posed an unresolved challenge. To address this assessment issue, we conducted exploratory factor analysis and Rasch analysis to assess the psychometric characteristics of the Rosenberg Self-Esteem Scale (RSES, Rosenberg, 1965) for African American college students. The dimensional structure of the RSES was first identified with the first subsample (i.e., calibration subsample) and then held up under cross-validation with a second subsample (i.e., validation subsample). Exploratory factor analysis and Rasch analysis both supported unidimensionality of the measure, with that finding replicated for a random split of the sample. Response scale use was generally appropriate, items were endorsed at a high level reflecting high levels of self-esteem, and person separation and reliability of person separation were adequate, and reflected results similar to those found in prior research. However, as some categories were infrequently used, we also collapsed scale points and found a slight improvement in scale and item indices. No differential item functioning was found by sex or having received professional assistance versus not; there were no mean score differences by age group, marital status, or year in college. Two items were seen as problematic. Implications for theory and research on multicultural mental health are discussed. (PsycINFO Database Record (c) 2017 APA, all rights reserved).
Item-level psychometrics and predictors of performance for Spanish/English bilingual speakers on an object and action naming battery.

Science.gov (United States)

Edmonds, Lisa A; Donovan, Neila J

2012-04-01

There is a pressing need for psychometrically sound naming materials for Spanish/English bilingual adults. To address this need, in this study the authors examined the psychometric properties of An Object and Action Naming Battery (An O&A Battery; Druks & Masterson, 2000) in bilingual speakers. Ninety-one Spanish/English bilinguals named O&A Battery items in English and Spanish. Responses underwent a Rasch analysis. Using correlation and regression analyses, the authors evaluated the effect of psycholinguistic (e.g., imageability) and participant (e.g., proficiency ratings) variables on accuracy. Rasch analysis determined unidimensionality across English and Spanish nouns and verbs and robust item-level psychometric properties, evidence for content validity. Few items did not fit the model, there were no ceiling or floor effects after uninformative and misfit items were removed, and items reflected a range of difficulty. Reliability coefficients were high, and the number of statistically different ability levels provided indices of sensitivity. Regression analyses revealed significant correlations between psycholinguistic variables and accuracy, providing preliminary construct validity. The participant variables that contributed most to accuracy were proficiency ratings and time of language use. Results suggest adequate content and construct validity of O&A items retained in the analysis for Spanish/English bilingual adults and support future efforts to evaluate naming in older bilinguals and persons with bilingual aphasia.
Quantifying Human Response: Linking metrological and psychometric characterisations of Man as a Measurement Instrument

International Nuclear Information System (INIS)

Pendrill, L R; Fisher, William P Jr

2013-01-01

A better understanding of how to characterise human response is essential to improved person-centred care and other situations where human factors are crucial. Challenges to introducing classical metrological concepts such as measurement uncertainty and traceability when characterising Man as a Measurement Instrument include the failure of many statistical tools when applied to ordinal measurement scales and a lack of metrological references in, for instance, healthcare. The present work attempts to link metrological and psychometric (Rasch) characterisation of Man as a Measurement Instrument in a study of elementary tasks, such as counting dots, where one knows independently the expected value because the measurement object (collection of dots) is prepared in advance. The analysis is compared and contrasted with recent approaches to this problem by others, for instance using signal error fidelity
Psychometric properties of the painDETECT questionnaire in rheumatoid arthritis, psoriatic arthritis and spondyloarthritis

DEFF Research Database (Denmark)

Rifbjerg-Madsen, Signe; Wæhrens, Eva Elisabet Ejlersen; Danneskiold-Samsøe, Bente

2017-01-01

that can identify underlying pain mechanisms are needed. The painDETECT questionnaire (PDQ) was originally designed to differentiate between pain phenotypes. The objectives were to evaluate the psychometric properties of the PDQ in patients with inflammatory arthritis by applying Rasch analysis...... and to explore the reliability of pain classification by test-retest. METHODS: For the Rasch analysis 900 questionnaires from patients with RA, PsA and SpA (300 per diagnosis) were extracted from 'the DANBIO painDETECT study'. The analysis was directed at the seven items assessing somatosensory symptoms...... and included: 1) the performance of the six-category Likert scale; 2) whether a unidimensional construct was defined; 3) the reliability and precision of estimates. Another group of 30 patients diagnosed with RA, PsA or SpA participated in a test-retest study. Intraclass Correlation Coefficients (ICC...
From Rasch scores to regression

DEFF Research Database (Denmark)

Christensen, Karl Bang

2006-01-01

Rasch models provide a framework for measurement and modelling latent variables. Having measured a latent variable in a population a comparison of groups will often be of interest. For this purpose the use of observed raw scores will often be inadequate because these lack interval scale properties....... This paper compares two approaches to group comparison: linear regression models using estimated person locations as outcome variables and latent regression models based on the distribution of the score....
Achievement Testing with the Wechsler Quicktest: An Examination of Its Psychometric Properties and Applied Utility with a Greek-Cypriot Sample

Science.gov (United States)

Vrachimi-Souroulla, Andry; Panayiotou, Georgia; Kokkinos, Constantinos M.; Lamprianou, Iasonas

2011-01-01

The study aimed to field-test a Greek version of the Wechsler Quicktest and to examine its psychometric properties. The Quicktest was individually administered to 208 students, aged 5-14 years, along with a reading test. Based on the Rasch analysis, data for the Quicktest subtests showed acceptable fit to the model. Also, correlations were found…
Escala fatorial de socialização: versão reduzida: seleção de itens e propriedades psicométricas Agreeableness scale: short version: item selection and psychometric properties

Directory of Open Access Journals (Sweden)

Maiana Farias Oliveira Nunes

2010-01-01

Full Text Available O objetivo desse estudo foi selecionar itens da Escala Fatorial de Socialização (EFS) para a obtenção de uma versão reduzida, que mantivesse propriedades psicométricas adequadas. Baseou-se em uma amostra de 1.100 sujeitos. Para a seleção de itens, realizou-se análise qualitativa, buscando aqueles sem conteúdo clínico explícito e uma análise quantitativa, pelo modelo de Rasch. Tais critérios permitiram reduzir a EFS de 70 para 28 itens. As características psicométricas da versão reduzida foram verificadas pela comparação entre versões por Rasch e pela reanálise dos dados de estudos de validade realizados com a EFS. A versão reduzida manteve características psicométricas adequadas, o que sugere a possibilidade de utilização dessa versão da EFS em situações de avaliação com tempo restrito. This study aimed at selecting items from the Agreeableness Factor Scale for obtaining a short version of this test that could keep adequate psychometric properties. One thousand one hundred participants composed the sample. Items were selected using a qualitative strategy, which focused on item content that was not related to clinical descriptions and a quantitative analysis based on Rasch's model. The scale was reduced from 70 to 28 items, based on these criteria. In order to check the psychometric properties of the short version, both versions were compared by Rasch indices and by reanalyzing validity studies conducted with the original scale. The short version kept good psychometric properties, which suggests the possibility of using it when there is time restriction.
Analysis of Local Dependence and Multidimensionality in Graphical Loglinear Rasch Models

DEFF Research Database (Denmark)

Kreiner, Svend; Christensen, Karl Bang

2004-01-01

Local independence; Multidimensionality; Differential item functioning; Uniform local dependence and DIF; Graphical Rasch models; Loglinear Rasch model
Rasch-modeling the Portuguese SOCRATES in a clinical sample.

Science.gov (United States)

Lopes, Paulo; Prieto, Gerardo; Delgado, Ana R; Gamito, Pedro; Trigo, Hélder

2010-06-01

The Stages of Change Readiness and Treatment Eagerness Scale (SOCRATES) assesses motivation for treatment in the drug-dependent population. The development of adequate measures of motivation is needed in order to properly understand the role of this construct in rehabilitation. This study probed the psychometric properties of the SOCRATES in the Portuguese population by means of the Rasch Rating Scale Model, which allows the conjoint measurement of items and persons. The participants were 166 substance abusers under treatment for their addiction. Results show that the functioning of the five response categories is not optimal; our re-analysis indicates that a three-category system is the most appropriate one. By using this response category system, both model fit and estimation accuracy are improved. The discussion takes into account other factors such as item format and content in order to make suggestions for the development of better motivation-for-treatment scales. (PsycINFO Database Record (c) 2010 APA, all rights reserved).
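The Rating Scale Model named in the abstract above can be sketched briefly. This is a minimal illustration, not the authors' analysis: the function name `rsm_category_probabilities` and the threshold values are hypothetical, chosen only to show how a person location, an item location, and a set of thresholds shared across items combine into response-category probabilities for a three-category item.

```python
import math

def rsm_category_probabilities(theta, b, taus):
    """Category probabilities under the Andrich Rating Scale Model.

    theta: person location (logits)
    b:     item location (logits)
    taus:  category thresholds shared by all items
           (length = number of categories - 1)
    """
    # Category 0 has an empty sum (0.0); category k accumulates
    # (theta - b - tau_j) for j = 1..k.
    exponents = [0.0]
    for tau in taus:
        exponents.append(exponents[-1] + (theta - b - tau))
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(exponents)
    weights = [math.exp(e - top) for e in exponents]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical three-category item with illustrative thresholds.
probs = rsm_category_probabilities(theta=0.0, b=0.0, taus=[-1.0, 1.0])
print([round(p, 3) for p in probs])
```

With ordered thresholds, each category is the most probable response over some range of person locations; disordered thresholds are one common motivation for collapsing categories, as the study did when moving from five categories to three.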

Rasch Measurement in Language Research: Creating the Foreign Language Classroom Anxiety Inventory

Directory of Open Access Journals (Sweden)

Miranda J. Walker

2014-11-01

Full Text Available The purpose of this study was to construct a new scale for measuring foreign language classroom anxiety (FLCA). It began with the creation of an extended item pool generated by qualitative methods. Subsequent Rasch and semantic analyses led to the final 18-item Foreign Language Classroom Anxiety Inventory (FLCAI). In comparison with the Foreign Language Classroom Anxiety Scale (FLCAS), the FLCAI demonstrated more convincing evidence of unidimensionality and the optimal 5-point Likert scale functioned better. The FLCAI, while 55% the length of the FLCAS and thus more practical for classroom practitioners to administer and analyse, maintains its psychometric properties and covers a wider range on the construct continuum, thus improving the degree of validity of the instrument. Finally, test anxiety was shown to be a component of FLCA.
Using classical test theory, item response theory, and Rasch measurement theory to evaluate patient-reported outcome measures: a comparison of worked examples.

Science.gov (United States)

Petrillo, Jennifer; Cano, Stefan J; McLeod, Lori D; Coon, Cheryl D

2015-01-01

To provide comparisons and a worked example of item- and scale-level evaluations based on three psychometric methods used in patient-reported outcome development, namely classical test theory (CTT), item response theory (IRT), and Rasch measurement theory (RMT), in an analysis of the National Eye Institute Visual Functioning Questionnaire (VFQ-25). Baseline VFQ-25 data from 240 participants with diabetic macular edema from a randomized, double-masked, multicenter clinical trial were used to evaluate the VFQ at the total score level. CTT, RMT, and IRT evaluations were conducted, and results were assessed in a head-to-head comparison. Results were similar across the three methods, with IRT and RMT providing more detailed diagnostic information on how to improve the scale. CTT led to the identification of two problematic items that threaten the validity of the overall scale score, sets of redundant items, and skewed response categories. IRT and RMT additionally identified poor fit for one item, many locally dependent items, poor targeting, and disordering of over half the response categories. Selection of a psychometric approach depends on many factors. Researchers should justify their evaluation method and consider the intended audience. If the instrument is being developed for descriptive purposes and on a restricted budget, a cursory examination of the CTT-based psychometric properties may be all that is possible. In a high-stakes situation, such as the development of a patient-reported outcome instrument for consideration in pharmaceutical labeling, however, a thorough psychometric evaluation including IRT or RMT should be considered, with final item-level decisions made on the basis of both quantitative and qualitative results. Copyright © 2015. Published by Elsevier Inc.
Psychometrics and latent structure of the IDS and QIDS with young adult students.

Science.gov (United States)

González, David Andrés; Boals, Adriel; Jenkins, Sharon Rae; Schuler, Eric R; Taylor, Daniel

2013-07-01

Students and young adults have high rates of suicide and depression, and thus are a population of interest. To date, there is no normative psychometric information on the IDS and QIDS in these populations. Furthermore, there is equivocal evidence on the factor structure and subscales of the IDS. Two samples of young adult students (ns=475 and 1681) were given multiple measures to test the psychometrics and dimensionality of the IDS and QIDS. The IDS, its subscales, and QIDS had acceptable internal consistencies (αs=.79-.90) and favorable convergent and divergent validity correlations. A three-factor structure and two Rasch-derived subscales best fit the IDS. The samples were collected from one university, which may influence generalizability. The IDS and QIDS are desirable measures of depressive symptoms when studying young adult students. Copyright © 2013 Elsevier B.V. All rights reserved.
Power analysis on the time effect for the longitudinal Rasch model.

Science.gov (United States)

Feddag, M L; Blanchin, M; Hardouin, J B; Sebille, V

2014-01-01

Statistics literature in the social, behavioral, and biomedical sciences typically stresses the importance of power analysis. Patient Reported Outcomes (PRO) such as quality of life and other perceived health measures (pain, fatigue, stress,...) are increasingly used as important health outcomes in clinical trials or in epidemiological studies. They cannot be directly observed or measured like other clinical or biological data, and they are often collected through questionnaires with binary or polytomous items. The Rasch model is a well-known item response theory (IRT) model for binary data. The article proposes an approach to evaluate the statistical power of the time effect for the longitudinal Rasch model with two time points. The performance of this method is compared to that obtained in a simulation study. Finally, the proposed approach is illustrated on one subscale of the SF-36 questionnaire.
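For readers unfamiliar with the binary Rasch model referred to above, a minimal sketch follows. It shows only the standard dichotomous response function, not the longitudinal power calculation proposed in the article; the function name `rasch_probability` and the example values are illustrative assumptions.

```python
import math

def rasch_probability(theta, b):
    """Probability of a positive (correct/endorsed) response under the
    dichotomous Rasch model, given person location theta and item
    location b, both in logits."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

# A person located exactly at the item's difficulty has a 50% chance
# of a positive response; the chance rises as theta exceeds b.
print(rasch_probability(0.0, 0.0))
print(rasch_probability(1.0, 0.0))
```

Because the probability depends only on the difference `theta - b`, a shift in the mean person location between two time points (a time effect) moves every item's response probability in the same direction.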
Factor structure and item level psychometrics of the Social Problem Solving Inventory-Revised: Short Form in traumatic brain injury.

Science.gov (United States)

Li, Chih-Ying; Waid-Ebbs, Julia; Velozo, Craig A; Heaton, Shelley C

2016-01-01

Social problem-solving deficits characterise individuals with traumatic brain injury (TBI), and poor social problem solving interferes with daily functioning and productive lifestyles. Therefore, it is of vital importance to use the appropriate instrument to identify deficits in social problem solving for individuals with TBI. This study investigates factor structure and item-level psychometrics of the Social Problem Solving Inventory-Revised: Short Form (SPSI-R:S), for adults with moderate and severe TBI. Secondary analysis of 90 adults with moderate and severe TBI who completed the SPSI-R:S was performed. An exploratory factor analysis (EFA), principal components analysis (PCA) and Rasch analysis examined the factor structure and item-level psychometrics of the SPSI-R:S. The EFA showed three dominant factors, with positively worded items represented as the most definite factor. The other two factors are negative problem-solving orientation and skills; and negative problem-solving emotion. Rasch analyses confirmed the three factors are each unidimensional constructs. It was concluded that the total score interpretability of the SPSI-R:S may be challenging due to the multidimensional structure of the total measure. Instead, we propose using three separate SPSI-R:S subscores to measure social problem solving for the TBI population.
Factor Structure and Item Level Psychometrics of the Social Problem Solving Inventory Revised-Short Form in Traumatic Brain Injury

Science.gov (United States)

Li, Chih-Ying; Waid-Ebbs, Julia; Velozo, Craig A.; Heaton, Shelley C.

2016-01-01

Primary Objective: Social problem solving deficits characterize individuals with traumatic brain injury (TBI). Poor social problem solving interferes with daily functioning and productive lifestyles. Therefore, it is of vital importance to use the appropriate instrument to identify deficits in social problem solving for individuals with TBI. This study investigates factor structure and item-level psychometrics of the Social Problem Solving Inventory-Revised Short Form (SPSI-R:S), for adults with moderate and severe TBI. Research Design: Secondary analysis of 90 adults with moderate and severe TBI who completed the SPSI-R:S. Methods and Procedures: An exploratory factor analysis (EFA), principal components analysis (PCA) and Rasch analysis examined the factor structure and item-level psychometrics of the SPSI-R:S. Main Outcomes and Results: The EFA showed three dominant factors, with positively worded items represented as the most definite factor. The other two factors are negative problem solving orientation and skills; and negative problem solving emotion. Rasch analyses confirmed the three factors are each unidimensional constructs. Conclusions: The total score interpretability of the SPSI-R:S may be challenging due to the multidimensional structure of the total measure. Instead, we propose using three separate SPSI-R:S subscores to measure social problem solving for the TBI population. PMID:26052731
Using item response theory to explore the psychometric properties of extended matching questions examination in undergraduate medical education

Directory of Open Access Journals (Sweden)

Lawton Gemma

2005-03-01

Full Text Available Abstract Background: As assessment has been shown to direct learning, it is critical that the examinations developed to test clinical competence in medical undergraduates are valid and reliable. The use of extended matching questions (EMQ) has been advocated to overcome some of the criticisms of using multiple-choice questions to test factual and applied knowledge. Methods: We analysed the results from the Extended Matching Questions Examination taken by 4th year undergraduate medical students in the academic year 2001 to 2002. Rasch analysis was used to examine whether the set of questions used in the examination mapped on to a unidimensional scale, the degree of difficulty of questions within and between the various medical and surgical specialties and the pattern of responses within individual questions to assess the impact of the distractor options. Results: Analysis of a subset of items and of the full examination demonstrated internal construct validity and the absence of bias on the majority of questions. Three main patterns of response selection were identified. Conclusion: Modern psychometric methods based upon the work of Rasch provide a useful approach to the calibration and analysis of EMQ undergraduate medical assessments. The approach allows for a formal test of the unidimensionality of the questions and thus the validity of the summed score. Given the metric calibration which follows fit to the model, it also allows for the establishment of item banks to facilitate continuity and equity in exam standards.
Measuring the Quality of Life of Visually Impaired Children: First Stage Psychometric Evaluation of the Novel VQoL_CYP Instrument.

Directory of Open Access Journals (Sweden)

Valerija Tadić

Full Text Available To report piloting and initial validation of the VQoL_CYP, a novel age-appropriate vision-related quality of life (VQoL) instrument for self-reporting by children with visual impairment (VI). Participants were a random patient sample of children with VI aged 10-15 years. 69 patients, drawn from patient databases at Great Ormond Street Hospital and Moorfields Eye Hospital, United Kingdom, participated in piloting of the draft 47-item VQoL instrument, which enabled preliminary item reduction. Subsequent administration of the instrument, alongside functional vision (FV) and generic health-related quality of life (HRQoL) self-report measures, to 101 children with VI comprising a nationally representative sample enabled further item reduction and evaluation of psychometric properties using Rasch analysis. Construct validity was assessed through Pearson correlation coefficients. Item reduction through piloting (8 items removed for skewness and individual item response pattern) and validation (1 item removed for skewness and 3 for misfit in Rasch) produced a 35-item scale, with fit values within acceptable limits, no notable differential item functioning, good measurement precision, ordered response categories and acceptable targeting in Rasch. The VQoL_CYP showed good construct validity, correlating strongly with HRQoL scores, moderately with FV scores but not with acuity. Robust child-appropriate self-report VQoL measures for children with VI are necessary for understanding the broader impacts of living with a visual disability, distinguishing these from limited functioning per se. Future planned use in larger patient samples will allow further psychometric development of the VQoL_CYP as an adjunct to objective outcomes assessment.
Cross-cultural adaptation and analysis of the psychometric properties of the Balance Evaluation Systems Test and MiniBESTest in the elderly and individuals with Parkinson's disease: application of the Rasch model

Directory of Open Access Journals (Sweden)

Angelica C. Maia

2013-06-01

Full Text Available BACKGROUND: Older adults and individuals with neurological problems such as Parkinson's disease (PD) exhibit balance deficits that might impair their mobility and independence. The assessment of balance must be useful in identifying the presence of instability and orienting interventions. OBJECTIVE: To translate and perform a cross-cultural adaptation of the Balance Evaluation Systems Test (BESTest) and MiniBESTest to Brazilian Portuguese and analyze their psychometric properties. METHOD: The tests were translated and adapted to Portuguese according to a standard method and then subjected to a test-retest reliability assessment (10 older adults; 10 individuals with PD). The psychometric properties were assessed by the Rasch model (35 older adults; 35 individuals with PD). RESULTS: The reliability coefficient of the tests relative to the items and subjects varied from 0.91 to 0.98, which is indicative of the stability and reproducibility of the measures. In the BESTest, the person (4.19) and item (5.36) separation index established six balance ability levels and seven levels of difficulty, respectively. In the MiniBESTest, the person (3.16) and item (6.41) separation index established four balance ability levels and nine levels of difficulty, respectively. Two items in the BESTest did not fit with the model expectations, but the construct validity was not compromised. No item in the MiniBESTest was erratic. CONCLUSIONS: The results corroborate the diagnostic and screening functions of the BESTest and MiniBESTest, respectively, and indicate that the Brazilian versions exhibit adequate reliability, construct validity, response stability, and capacity to distinguish among various balance ability levels in older adults and individuals with PD.
Mayo-Portland adaptability inventory: comparing psychometrics in cerebrovascular accident to traumatic brain injury.

Science.gov (United States)

Malec, James F; Kean, Jacob; Altman, Irwin M; Swick, Shannon

2012-12-01

(1) To evaluate the measurement reliability and construct validity of the Mayo-Portland Adaptability Inventory, 4th revision (MPAI-4) in a sample consisting exclusively of patients with cerebrovascular accident (CVA) using single parameter (Rasch) item-response methods; (2) to examine the differential item functioning (DIF) by sex within the CVA population; and (3) to examine DIF and differential test functioning (DTF) across traumatic brain injury (TBI) and CVA samples. Retrospective psychometric analysis of rating scale data. Home- and community-based brain injury rehabilitation program. Individuals post-CVA (n=861) and individuals with TBI (n=603). Not applicable. MPAI-4. Item data on admission to community-based rehabilitation were submitted to Rasch, DIF, and DTF analyses. The final calibration in the CVA sample revealed satisfactory reliability/separation for persons (.91/3.16) and items (1.00/23.64). DIF showed that items for pain, anger, audition, and memory were associated with higher levels of disability for CVA than TBI patients, whereas self-care, mobility, and use of hands indicated greater overall disability for TBI patients. DTF analyses showed a high degree of association between the 2 sets of items (R=.92; R²=.85) and, at most, a 3.7 point difference in raw scores. The MPAI-4 demonstrates satisfactory psychometric properties for use with individuals with CVA applying for interdisciplinary posthospital rehabilitation. DIF reveals clinically meaningful differences between CVA and TBI groups that should be considered in results at the item and subscale level. Copyright © 2012 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
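The paired reliability/separation figures reported above are linked by a standard relationship in Rasch measurement: separation G and reliability R satisfy R = G² / (1 + G²). The sketch below uses illustrative function names (not taken from the study) to check that the reported separations are consistent with the reported reliabilities after rounding to two decimals.

```python
import math

def reliability_from_separation(g):
    """Convert a Rasch separation index G to a reliability coefficient."""
    return g * g / (1.0 + g * g)

def separation_from_reliability(r):
    """Convert a reliability coefficient R (0 <= R < 1) to separation G."""
    return math.sqrt(r / (1.0 - r))

# Values reported in the abstract: persons (.91/3.16), items (1.00/23.64).
print(round(reliability_from_separation(3.16), 2))   # 0.91
print(round(reliability_from_separation(23.64), 2))  # 1.0
```

Going in the other direction from a rounded reliability is less precise: a reliability of .91 maps to a separation of roughly 3.18, which is why software typically reports both statistics.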
Physics Metacognition Inventory Part II: Confirmatory factor analysis and Rasch analysis

Science.gov (United States)

Taasoobshirazi, Gita; Bailey, MarLynn; Farley, John

2015-11-01

The Physics Metacognition Inventory was developed to measure physics students' metacognition for problem solving. In one of our earlier studies, an exploratory factor analysis provided evidence of preliminary construct validity, revealing six components of students' metacognition when solving physics problems including knowledge of cognition, planning, monitoring, evaluation, debugging, and information management. The college students' scores on the inventory were found to be reliable and related to students' physics motivation and physics grade. However, the results of the exploratory factor analysis indicated that the questionnaire could be revised to improve its construct validity. The goal of this study was to revise the questionnaire and establish its construct validity through a confirmatory factor analysis. In addition, a Rasch analysis was applied to the data to better understand the psychometric properties of the inventory and to further evaluate the construct validity. Results indicated that the final, revised inventory is a valid, reliable, and efficient tool for assessing student metacognition for physics problem solving.
Psychometric Properties of the Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite Scale.

Science.gov (United States)

Barnett, Carolina; Merkies, Ingemar S J; Katzberg, Hans; Bril, Vera

2015-09-02

The Quantitative Myasthenia Gravis Score and the Myasthenia Gravis Composite are two commonly used outcome measures in Myasthenia Gravis. So far, their measurement properties have not been compared, so we aimed to study their psychometric properties using the Rasch model. 251 patients with stable myasthenia gravis were assessed with both scales, and 211 patients returned for a second assessment. We studied fit to the Rasch model at the first visit, and compared item fit, thresholds, differential item functioning, local dependence, person separation index, and tests for unidimensionality. We also assessed test-retest reliability and estimated the Minimal Detectable Change. Neither scale fit the Rasch model (X2p Myasthenia Gravis Composite had lower discrimination properties than the Quantitative Myasthenia Gravis Scale (Person Separation Index: 0.14 and 0.7). There was local dependence in both scales, as well as differential item functioning for ocular and generalized disease. Disordered thresholds were found in 6 (60%) items of the Myasthenia Gravis Composite and in 4 (31%) of the Quantitative Myasthenia Gravis Score. Both tools had adequate test-retest reliability (ICCs >0.8). The minimally detectable change was 4.9 points for the Myasthenia Gravis Composite and 4.3 points for the Quantitative Myasthenia Gravis Score. Neither scale fulfilled Rasch model expectations. The Quantitative Myasthenia Gravis Score has higher discrimination than the Myasthenia Gravis Composite. Both tools have items with disordered thresholds, differential item functioning and local dependency. There was evidence of multidimensionality in the QMGS. The minimal detectable change values are higher than previous studies on the minimal significant change. These findings might inform future modifications of these tools.
Rasch Analysis: A Primer for School Psychology Researchers and Practitioners

Science.gov (United States)

Boone, William J.; Noltemeyer, Amity

2017-01-01

In order to progress as a field, school psychology research must be informed by effective measurement techniques. One approach to address the need for careful measurement is Rasch analysis. This technique can (a) facilitate the development of instruments that provide useful data, (b) provide data that can be used confidently for both descriptive…
Some Improved Diagnostics for Failure of The Rasch Model.

Science.gov (United States)

Molenaar, Ivo W.

1983-01-01

Goodness of fit tests for the Rasch model are typically large-sample, global measures. This paper offers suggestions for small-sample exploratory techniques for examining the fit of item data to the Rasch model. (Author/JKS)
Thorndike, Thurstone and Rasch: A Comparison of Their Approaches to Item-Invariant Measurement.

Science.gov (United States)

Engelhard, George, Jr.

The methods used by E. L. Thorndike, L. L. Thurstone, and G. Rasch to address issues related to item-invariant measurement and the scoring of individual performance are compared. The analyses highlight the close connection among the three methods, and suggest that progress in measurement theory reflects the movement from essentially ad hoc methods…
Evaluation of the Edinburgh Post Natal Depression Scale using Rasch analysis

Science.gov (United States)

Pallant, Julie F; Miller, Renée L; Tennant, Alan

2006-01-01

Background: The Edinburgh Postnatal Depression Scale (EPDS) is a 10-item self-rating post-natal depression scale which has seen widespread use in epidemiological and clinical studies. Concern has been raised over the validity of the EPDS as a single summed scale, with suggestions that it measures two separate aspects, one of depressive feelings, the other of anxiety. Methods: As part of a larger cross-sectional study conducted in Melbourne, Australia, a community sample (324 women, ranging in age from 18 to 44 years: mean = 32 yrs, SD = 4.6) was obtained by inviting primiparous women to participate voluntarily in this study. Data from the EPDS were fitted to the Rasch measurement model and tested for appropriate category ordering, for item bias through Differential Item Functioning (DIF) analysis, and for unidimensionality through tests of the assumption of local independence. Results: Rasch analysis of the data from the ten-item scale initially demonstrated a lack of fit to the model with a significant Item-Trait Interaction total chi-square (χ² = 82.8, df = 40; p < .001). Removal of two items (items 7 and 8) resulted in a non-significant Item-Trait Interaction total chi-square with a residual mean value for items of -0.467 with a standard deviation of 0.850, showing fit to the model. No DIF existed in the final 8-item scale (EPDS-8) and all items showed fit to model expectations. Principal Components Analysis of the residuals supported the local independence assumption, and unidimensionality of the revised EPDS-8 scale. Revised cut points were identified for EPDS-8 to maintain the case identification of the original scale. Conclusion: The results of this study suggest that EPDS, in its original 10-item form, is not a viable scale for the unidimensional measurement of depression. Rasch analysis suggests that a revised eight-item version (EPDS-8) would provide a more psychometrically robust scale. The revised cut points of 7/8 and 9/10 for the EPDS-8 show high
Addressing the targeting range of the ABILHAND-56 in relapsing-remitting multiple sclerosis: A mixed methods psychometric study.

Science.gov (United States)

Cleanthous, Sophie; Strzok, Sara; Pompilus, Farrah; Cano, Stefan; Marquis, Patrick; Cohan, Stanley; Goldman, Myla D; Kresa-Reahl, Kiren; Petrillo, Jennifer; Castrillo-Viguera, Carmen; Cadavid, Diego; Chen, Shih-Yin

2018-01-01

ABILHAND, a manual ability patient-reported outcome instrument originally developed for stroke patients, has been used in multiple sclerosis clinical trials; however, psychometric analyses indicated the measure's limited measurement range and precision in higher-functioning multiple sclerosis patients. The purpose of this study was to identify candidate items to expand the measurement range of the ABILHAND-56, thus improving its ability to detect differences in manual ability in higher-functioning multiple sclerosis patients. A step-wise mixed methods design strategy was used, comprising two waves of patient interviews, a combination of qualitative (concept elicitation and cognitive debriefing) and quantitative (Rasch measurement theory) analytic techniques, and consultation interviews with three clinical neurologists specializing in multiple sclerosis. Original ABILHAND was well understood in this context of use. Eighty-two new manual ability concepts were identified. Draft supplementary items were generated and refined with patient and neurologist input. Rasch measurement theory psychometric analysis indicated supplementary items improved targeting to higher-functioning multiple sclerosis patients and measurement precision. The final pool of Early Multiple Sclerosis Manual Ability items comprises 20 items. The synthesis of qualitative and quantitative methods used in this study improves the ABILHAND content validity to more effectively identify manual ability changes in early multiple sclerosis and potentially help determine treatment effect in higher-functioning patients in clinical trials.
Emotional vitality in caregivers: application of Rasch Measurement Theory with secondary data to development and test a new measure.

Science.gov (United States)

Barbic, Skye P; Bartlett, Susan J; Mayo, Nancy E

2015-07-01

To describe the practical steps in identifying items and evaluating scoring strategies for a new measure of emotional vitality in informal caregivers of individuals who have experienced a significant health event. The psychometric properties of responses to selected items from validated health-related quality of life and other psychosocial questionnaires administered four times over a one-year period were evaluated using Rasch Measurement Theory. Community. A total of 409 individuals providing informal care at home to older adults who had experienced a recent stroke. Rasch Measurement Theory was used to test the ordering of response option thresholds, fit, spread of the item locations, residual correlations, person separation index, and stability across time. Based on a theoretical framework developed in earlier work, we identified 22 candidate items from a pool of relevant psychosocial measures available. Of these, additional evaluation resulted in 19 items that could be used to assess the five core domains. The overall model fit was reasonable (χ² = 202.26, df = 117, p = 0.06), stable across time, with borderline evidence of multidimensionality (10%). Items and people covered a continuum ranging from -3.7 to +2.7 logits, reflecting coverage of the measurement continuum, with a person separation index of 0.85. Mean fit of caregivers was lower than expected (-1.31 ± 1.10 logits). Established methods from the Rasch Measurement Theory were applied to develop a prototype measure of emotional vitality that is acceptable, reliable, and can be used to obtain an interval level score for use in future research and clinical settings. © The Author(s) 2014.
Validity study of the Beck Anxiety Inventory (Portuguese version) by the Rasch Rating Scale model

Directory of Open Access Journals (Sweden)

Sónia Quintão

2013-01-01

Our objective was to conduct a validation study of the Portuguese version of the Beck Anxiety Inventory (BAI) by means of the Rasch Rating Scale Model, and then compare it with the most used scales of anxiety in Portugal. The sample consisted of 1,160 adults (427 men and 733 women), aged 18-82 years (M = 33.39; SD = 11.85). The instruments were the Beck Anxiety Inventory, the State-Trait Anxiety Inventory, and the Zung Self-Rating Anxiety Scale. The Beck Anxiety Inventory's four-category system, the data-model fit, and person reliability were found to be adequate. The measure can be considered unidimensional. Gender and age-related differences were not a threat to validity. The BAI correlated significantly with other anxiety measures. In conclusion, the BAI shows good psychometric quality.
Comparison of proficiency in an anesthesiology course across distinct medical student cohorts: Psychometric approaches to test equating

Directory of Open Access Journals (Sweden)

Shu-Wei Liao

2014-03-01

Conclusion: Although both the chained linear equating method and Rasch analysis can be readily applied to practical test-equating issues in medical education, Rasch analysis exhibited more versatility in test parameter estimation and item bank development for clinical curriculums.

Rasch Analysis of the Premature Ejaculation Diagnostic Tool (PEDT) and the International Index of Erectile Function (IIEF) in an Iranian Sample of Prostate Cancer Patients.

Directory of Open Access Journals (Sweden)

Chung-Ying Lin

Male sexual dysfunction is an increasing problem across a variety of general and clinical populations, such as cancer populations, especially among prostate cancer patients, who tend to receive treatments that often result in erectile dysfunction (ED) and/or premature ejaculation (PE). Therefore, in order to diagnose ED and PE in these populations, adequate and efficient instruments such as the International Index of Erectile Function 5-item version (IIEF-5) and the Premature Ejaculation Diagnostic Tool (PEDT) are needed. However, since this is an important topic, additional evidence of the psychometric properties of the IIEF-5 and the PEDT in such samples is required. Thus the aim of the present study was to use Rasch models to investigate the construct validity, local dependency, score order, and differential item functioning (DIF) of both questionnaires in a sample of prostate cancer patients. Prostate cancer patients (n = 1058, mean ± SD age = 64.07 ± 6.84 years) who visited urology clinics were invited to fill out the IIEF-5 and the PEDT. Construct validity was examined using infit and outfit mean square (MnSq), and local dependency using correlations between each pair of Rasch residual scores. Score order was investigated using step and average measures of difficulty, and DIF using DIF contrast. All IIEF-5 and PEDT items had acceptable infit and outfit MnSq. Step measures revealed that all but two items had disordered categories in terms of scores 1 to 3. Only one local dependency was found, and no items displayed DIF across age, educational level, and help seeking. The results showed that both the IIEF-5 and the PEDT had sound psychometric properties in the Rasch analyses, although some score disordering could be detected in both instruments. The absence of DIF items in both instruments suggests that using them to compare ED and PE across age and educational level is adequate.
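The infit and outfit mean squares mentioned in this abstract can be sketched for the simplest case. The example below assumes the dichotomous Rasch model and already-estimated person and item parameters; the instruments in the study are polytomous, so this is a simplified illustration rather than the authors' procedure, and all function names are hypothetical.

```python
import math


def rasch_prob(theta, b):
    """Probability of a positive response under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_fit(responses, thetas, difficulties):
    """Return (infit, outfit) mean squares, one value per item.

    responses[n][i] is the 0/1 response of person n to item i.
    Outfit is the unweighted mean of squared standardized residuals;
    infit weights each squared residual by its model variance.
    """
    infit, outfit = [], []
    for i, b in enumerate(difficulties):
        sq_resid_sum = 0.0  # sum of squared raw residuals
        var_sum = 0.0       # sum of model variances
        z2_sum = 0.0        # sum of squared standardized residuals
        for n, theta in enumerate(thetas):
            p = rasch_prob(theta, b)
            var = p * (1.0 - p)
            resid = responses[n][i] - p
            sq_resid_sum += resid ** 2
            var_sum += var
            z2_sum += resid ** 2 / var
        infit.append(sq_resid_sum / var_sum)
        outfit.append(z2_sum / len(thetas))
    return infit, outfit


# Hypothetical data: three persons, two items.
infit, outfit = item_fit(
    responses=[[1, 0], [1, 1], [0, 0]],
    thetas=[0.5, 1.5, -1.0],
    difficulties=[-0.5, 1.0],
)
print(infit, outfit)
```

Values near 1 indicate responses about as noisy as the model expects. Outfit is more sensitive to unexpected responses from persons located far from the item, because those residuals are divided by small variances.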
Rasch analysis of the Mini-Mental Adjustment to Cancer Scale (mini-MAC) among a heterogeneous sample of long-term cancer survivors: a cross-sectional study.

Science.gov (United States)

Zucca, Alison; Lambert, Sylvie D; Boyes, Allison W; Pallant, Julie F

2012-05-20

The mini-Mental Adjustment to Cancer Scale (mini-MAC) is a well-recognised, popular measure of coping in psycho-oncology and assesses five cancer-specific coping strategies. It has been suggested that these five subscales could be grouped to form the over-arching adaptive and maladaptive coping subscales to facilitate the interpretation and clinical application of the scale. Despite the popularity of the mini-MAC, few studies have examined its psychometric properties among long-term cancer survivors, and further validation of the mini-MAC is needed to substantiate its use with the growing population of survivors. Therefore, this study examined the psychometric properties and dimensionality of the mini-MAC in a sample of long-term cancer survivors using Rasch analysis. RUMM 2030 was used to analyse the mini-MAC data (n=851). Separate Rasch analyses were conducted for each of the original mini-MAC subscales as well as the over-arching adaptive and maladaptive coping subscales to examine summary and individual model fit statistics, person separation index (PSI), response format, local dependency, targeting, item bias (or differential item functioning, DIF), and dimensionality. For the fighting spirit, fatalism, and helplessness-hopelessness subscales, a revised three-point response format seemed more optimal than the original four-point response. To achieve model fit, items were deleted from four of the five subscales - Anxious Preoccupation items 7, 25, and 29; Cognitive Avoidance items 11 and 17; Fighting Spirit item 18; and Helplessness-Hopelessness items 16 and 20. For those subscales with sufficient items, analyses supported unidimensionality. Combining items to form the adaptive and maladaptive subscales was partially supported. The original five subscales required item deletion and/or rescaling to improve goodness of fit to the Rasch model. While evidence was found for overarching subscales of adaptive and maladaptive coping, extensive modifications were
Rasch analysis of the Mini-Mental Adjustment to Cancer Scale (mini-MAC) among a heterogeneous sample of long-term cancer survivors: A cross-sectional study

Directory of Open Access Journals (Sweden)

Zucca Alison

2012-05-01

Background: The mini-Mental Adjustment to Cancer Scale (mini-MAC) is a well-recognised, popular measure of coping in psycho-oncology and assesses five cancer-specific coping strategies. It has been suggested that these five subscales could be grouped to form the over-arching adaptive and maladaptive coping subscales to facilitate the interpretation and clinical application of the scale. Despite the popularity of the mini-MAC, few studies have examined its psychometric properties among long-term cancer survivors, and further validation of the mini-MAC is needed to substantiate its use with the growing population of survivors. Therefore, this study examined the psychometric properties and dimensionality of the mini-MAC in a sample of long-term cancer survivors using Rasch analysis. Methods: RUMM 2030 was used to analyse the mini-MAC data (n=851). Separate Rasch analyses were conducted for each of the original mini-MAC subscales as well as the over-arching adaptive and maladaptive coping subscales to examine summary and individual model fit statistics, person separation index (PSI), response format, local dependency, targeting, item bias (or differential item functioning, DIF), and dimensionality. Results: For the fighting spirit, fatalism, and helplessness-hopelessness subscales, a revised three-point response format seemed more optimal than the original four-point response. To achieve model fit, items were deleted from four of the five subscales – Anxious Preoccupation items 7, 25, and 29; Cognitive Avoidance items 11 and 17; Fighting Spirit item 18; and Helplessness-Hopelessness items 16 and 20. For those subscales with sufficient items, analyses supported unidimensionality. Combining items to form the adaptive and maladaptive subscales was partially supported. Conclusions: The original five subscales required item deletion and/or rescaling to improve goodness of fit to the Rasch model. While evidence was found for overarching subscales of
Self-rating of daily time management in children: psychometric properties of the Time-S.

Science.gov (United States)

Sköld, Annika; Janeslätt, Gunnel Kristina

2017-05-01

Impaired ability to manage time has been shown in several diagnoses common in childhood. Impaired ability involves activities and participation domain (daily time management, DTM) and body function and structure domain (time-processing ability, TPA). DTM needs to be evaluated from an individual's own perspective. To date, there has been a lack of self-rating instruments for children that focus on DTM. The aim of this study is to describe psychometric properties of Time-S when used in children aged 10-17 years with a diagnosis of ADHD, Autism, CP or mild ID. Further, to test whether TPA correlates with self-rated DTM. Eighty-three children aged 10-17 years participated in the study. Rasch analysis was used to assess psychometric properties. Correlation analysis was performed between Time-S and a measure of TPA. The 21 items of the Time-S questionnaire fit into a unitary construct measuring self-perceived daily management of an individual's time. A non-significant, small correlation was found between TPA and DTM. The results indicate good psychometric properties for the questionnaire. The questionnaire is potentially useful in intervention planning and evaluation.
Improving Measurement of Trait Competitiveness: A Rasch Analysis of the Revised Competitiveness Index With Samples From New Zealand and US University Students.

Science.gov (United States)

Krägeloh, Christian U; Medvedev, Oleg N; Hill, Erin M; Webster, Craig S; Booth, Roger J; Henning, Marcus A

2018-01-01

Measuring competitiveness is necessary to fully understand variables affecting student learning. The 14-item Revised Competitiveness Index has become a widely used measure to assess trait competitiveness. The current study reports on a Rasch analysis to investigate the psychometric properties of the Revised Competitiveness Index and to improve its precision for international comparisons. Students were recruited from medical studies at a university in New Zealand, undergraduate health sciences courses at another New Zealand university, and a psychology undergraduate class at a university in the United States. Rasch model parameter estimates were affected by local dependency and item misfit. Best fit to the Rasch model (χ²(20) = 15.86, p = .73, person separation index = .95) was obtained for the Enjoyment of Competition subscale after combining locally dependent items into a subtest and discarding the highly misfitting Item 9. The only modifications required to obtain a suitable fit (χ²(25) = 25.81, p = .42, person separation index = .77) for the Contentiousness subscale were a subtest to combine two locally dependent items and splitting this subtest by country to deal with differential item functioning. The results support reliability and internal construct validity of the modified Revised Competitiveness Index. Precision of the measure may be enhanced using the ordinal-to-interval conversion algorithms presented here, allowing the use of parametric statistics without breaking fundamental statistical assumptions.
Examining the Psychometric Quality of Multiple-Choice Assessment Items using Mokken Scale Analysis.

Science.gov (United States)

Wind, Stefanie A

The concept of invariant measurement is typically associated with Rasch measurement theory (Engelhard, 2013). Concerned with the appropriateness of the parametric transformation upon which the Rasch model is based, Mokken (1971) proposed a nonparametric procedure for evaluating the quality of social science measurement that is theoretically and empirically related to the Rasch model. Mokken's nonparametric procedure can be used to evaluate the quality of dichotomous and polytomous items in terms of the requirements for invariant measurement. Despite these potential benefits, the use of Mokken scaling to examine the properties of multiple-choice (MC) items in education has not yet been fully explored. A nonparametric approach to evaluating MC items is promising in that this approach facilitates the evaluation of assessments in terms of invariant measurement without imposing potentially inappropriate transformations. Using Rasch-based indices of measurement quality as a frame of reference, data from an eighth-grade physical science assessment are used to illustrate and explore Mokken-based techniques for evaluating the quality of MC items. Implications for research and practice are discussed.
On the Construct Validity of the Academic Motivation Scale: a CFA and Rasch Analysis approach

DEFF Research Database (Denmark)

Andersen, Martin Stolpe; Nielsen, Tine

subscales measuring Extrinsic Motivation (EM) and one scale measuring Amotivation (AM), each with 4 items. The AMS was translated into Danish and data were collected from psychology students (N = 607) at two Danish universities in 6 different study terms. The construct validity of the seven scales was first...... investigated using confirmatory factor analysis with mixed results of some acceptable and some non-acceptable fit indices for the model. Secondly, Rasch analyses were conducted for each of the seven subscales, using the partial credit model (PCM) and graphical loglinear Rasch models (GLLRM). This resulted...... in fit to the PCM in the case of IM to Accomplish (retaining three out of four items), and fit to GLLRMs in two cases: 1) IM to Know, with evidence of local dependence between all four items; 2) AM (retaining three out of four items), with evidence of gender-based differential item functioning, which...
Propriedades psicométricas da versão brasileira da escala de qualidade de vida específica para acidente vascular encefálico: aplicação do modelo Rasch / Psychometric properties of the Brazilian version of the Stroke Specific Quality of Life Scale: application of the Rasch model

Directory of Open Access Journals (Sweden)

RCM Lima

2008-04-01

BACKGROUND: Stroke results in important deficits, which reduce individuals' quality of life (QOL). Specific QOL measurements are necessary to understand and quantify the impact of this pathological condition. OBJECTIVE: The aim of this study was to make a transcultural adaptation of the Stroke Specific Quality of Life Scale (SSQOL) into Brazilian Portuguese and to assess its psychometric properties. METHODS: The SSQOL was translated and adapted in accordance with standardized procedures and was subjected to test-retest reliability analysis with 10 hemiplegic subjects. The psychometric properties were investigated using Rasch analysis on 50 hemiplegics. RESULTS: Reliability coefficients of 0.92 were found for both items and individuals. The separation index was 3.34 for the hemiplegic subjects and 3.36 for the items; that is, subjects were separated into at least three QOL levels and items into three QOL levels (low, medium, and high). Of the 49 items, four did not fit the model, which compromises the construct validity of the instrument, although the erratic pattern of these items is justifiable in the sample examined. CONCLUSIONS: The instrument proved clinically useful in the population evaluated. Further studies in populations with other characteristics are already under way.
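The separation index (3.34) and reliability coefficient (0.92) reported in this abstract are linked by a standard Rasch relationship, R = G² / (1 + G²), where G is the separation index. The sketch below shows the conversion in both directions; the function names are hypothetical, and the abstract does not state how the authors' software computed these values.

```python
import math


def reliability_from_separation(g):
    """Reliability R implied by a separation index G: R = G^2 / (1 + G^2)."""
    return g * g / (1.0 + g * g)


def separation_from_reliability(r):
    """Separation index G implied by a reliability R: G = sqrt(R / (1 - R))."""
    if not 0.0 <= r < 1.0:
        raise ValueError("reliability must lie in [0, 1)")
    return math.sqrt(r / (1.0 - r))


# Person separation reported in the abstract.
print(round(reliability_from_separation(3.34), 2))
```

Rounded to two decimals, a separation of 3.34 corresponds to a reliability of 0.92, which agrees with the coefficient the abstract reports.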
Psychometrics of the self-report safe driving behavior measure for older adults.

Science.gov (United States)

Classen, Sherrilene; Wen, Pey-Shan; Velozo, Craig A; Bédard, Michel; Winter, Sandra M; Brumback, Babette; Lanford, Desiree N

2012-01-01

We investigated the psychometric properties of the 68-item Safe Driving Behavior Measure (SDBM) with 80 older drivers, 80 caregivers, and 2 evaluators from two sites. Using Rasch analysis, we examined unidimensionality and local dependence; rating scale; item- and person-level psychometrics; and item hierarchy of older drivers, caregivers, and driving evaluators who had completed the SDBM. The evidence suggested the SDBM is unidimensional, but pairs of items showed local dependency. Across the three rater groups, the data showed good person (≥3.4) and item (≥3.6) separation as well as good person (≥.93) and item reliability (≥.92). Cronbach's α was ≥.96, and few items were misfitting. Some of the items did not follow the hypothesized order of item difficulty. The SDBM classified the older drivers into six ability levels, but to fully calibrate the instrument it must be refined in terms of its items (e.g., item exclusion) and then tested among participants of lesser ability. Copyright © 2012 by the American Occupational Therapy Association, Inc.
Validating Quantitative Measurement Using Qualitative Data: Combining Rasch Scaling and Latent Semantic Analysis in Psychiatry

Science.gov (United States)

Lange, Rense

2015-02-01

An extension of concurrent validity is proposed that uses qualitative data for the purpose of validating quantitative measures. The approach relies on Latent Semantic Analysis (LSA) which places verbal (written) statements in a high dimensional semantic space. Using data from a medical / psychiatric domain as a case study - Near Death Experiences, or NDE - we established concurrent validity by connecting NDErs qualitative (written) experiential accounts with their locations on a Rasch scalable measure of NDE intensity. Concurrent validity received strong empirical support since the variance in the Rasch measures could be predicted reliably from the coordinates of their accounts in the LSA derived semantic space (R2 = 0.33). These coordinates also predicted NDErs age with considerable precision (R2 = 0.25). Both estimates are probably artificially low due to the small available data samples (n = 588). It appears that Rasch scalability of NDE intensity is a prerequisite for these findings, as each intensity level is associated (at least probabilistically) with a well- defined pattern of item endorsements.
Methodologically Sound: Evaluating the Psychometric Approach to the Assessment of Human Life History [Reply to Copping, Campbell, and Muncer, 2014]

Science.gov (United States)

Cabeza de Baca, Tomás; Black, Candace Jasmine; García, Rafael Antonio; Fernandes, Heitor Barcellos Ferreira; Wolf, Pedro Sofio Abril; Woodley of Menie, Michael Anthony

2016-01-01

Copping, Campbell, and Muncer (2014) have recently published an article critical of the psychometric approach to the assessment of life history (LH) strategy. Their purported goal was testing for the convergent validation and examining the psychometric structure of the High-K Strategy Scale (HKSS). As much of the literature on the psychometrics of human LH during the past decade or so has emanated from our research laboratory and those of close collaborators, we have prepared this detailed response. Our response is organized into four main sections: (1) A review of psychometric methods for the assessment of human LH strategy, expounding upon the essence of our approach; (2) our theoretical/conceptual concerns regarding the critique, addressing the broader issues raised by the critique regarding the latent and hierarchical structure of LH strategy; (3) our statistical/methodological concerns regarding the critique, examining the validity and persuasiveness of the empirical case made specifically against the HKSS; and (4) our recommendations for future research that we think might be helpful in closing the gap between the psychometric and biometric approaches to measurement in this area. Clearly stating our theoretical positions, describing our existing body of work, and acknowledging their limitations should assist future researchers in planning and implementing more informed and prudent empirical research that will synthesize the psychometric approach to the assessment of LH strategy with complementary methods. PMID:25844774
Methodologically Sound: Evaluating the Psychometric Approach to the Assessment of Human Life History [Reply to Copping, Campbell, and Muncer, 2014]

Directory of Open Access Journals (Sweden)

Aurelio José Figueredo

2015-04-01

Copping, Campbell, and Muncer (2014) have recently published an article critical of the psychometric approach to the assessment of life history (LH) strategy. Their purported goal was testing for the convergent validation and examining the psychometric structure of the High-K Strategy Scale (HKSS). As much of the literature on the psychometrics of human LH during the past decade or so has emanated from our research laboratory and those of close collaborators, we have prepared this detailed response. Our response is organized into four main sections: (1) A review of psychometric methods for the assessment of human LH strategy, expounding upon the essence of our approach; (2) our theoretical/conceptual concerns regarding the critique, addressing the broader issues raised by the critique regarding the latent and hierarchical structure of LH strategy; (3) our statistical/methodological concerns regarding the critique, examining the validity and persuasiveness of the empirical case made specifically against the HKSS; and (4) our recommendations for future research that we think might be helpful in closing the gap between the psychometric and biometric approaches to measurement in this area. Clearly stating our theoretical positions, describing our existing body of work, and acknowledging their limitations should assist future researchers in planning and implementing more informed and prudent empirical research that will synthesize the psychometric approach to the assessment of LH strategy with complementary methods.
Understanding Rasch Measurement: Partial Credit Model and Pivot Anchoring.

Science.gov (United States)

Bode, Rita K.

2001-01-01

Describes the Rasch measurement partial credit model, what it is, how it differs from other Rasch models, and when and how to use it. Also describes the calibration of instruments with increasingly complex items. Explains pivot anchoring and illustrates its use and describes the effect of pivot anchoring on step calibrations, item hierarchy, and…
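For readers unfamiliar with the partial credit model named in this record, the following minimal sketch computes its category probabilities for one item, using the standard form in which the log-odds of scoring k rather than k − 1 is θ − δ_k. The function name and example values are illustrative assumptions and are not taken from the article.

```python
import math


def pcm_probabilities(theta, thresholds):
    """Category probabilities for one item under the partial credit model.

    theta: person location in logits.
    thresholds: step difficulties delta_1..delta_M for an item scored 0..M.
    Returns a list of M + 1 probabilities that sums to 1.
    """
    # Cumulative sums of (theta - delta_j); category 0 has an empty sum of 0.
    logits = [0.0]
    for delta in thresholds:
        logits.append(logits[-1] + (theta - delta))
    # Subtract the maximum before exponentiating for numerical stability.
    top = max(logits)
    weights = [math.exp(v - top) for v in logits]
    total = sum(weights)
    return [w / total for w in weights]


# Hypothetical three-category item (scores 0, 1, 2) with two step difficulties.
print(pcm_probabilities(theta=0.5, thresholds=[-1.0, 1.0]))
```

With a single threshold the function reduces to the dichotomous Rasch model, which is one way to see how the partial credit model generalizes it. Each item may have its own set of thresholds, which is what distinguishes it from the rating scale model.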
Psychometric assessment of the patient activation measure short form (PAM-13) in rural settings.

Science.gov (United States)

Hung, Man; Carter, Marjorie; Hayden, Candace; Dzierzon, Rhonda; Morales, Jose; Snow, Laverne; Butler, Jorie; Bateman, Kim; Samore, Matthew

2013-04-01

The patient activation measure short form (PAM-13) assesses patients' self-reported health management skills, knowledge, confidence, and motivation. We used item response theory to evaluate the psychometric properties of the PAM-13 utilized in rural settings. A Rasch partial credit model analysis was conducted on the PAM-13 instrument using a sample of 812 rural patients recruited by providers and our research staff. Specifically, we examined dimensionality, item fit, quality of measures, category response curves, and item differential functioning. Convergent and divergent validities were also examined. The PAM-13 instrument has excellent convergent and divergent validities. It is fairly unidimensional, and all items fit the Rasch model well. It has relatively high person and item reliability indices. The majority of the items were free of item differential functioning. There were, however, some issues with ceiling effects. Additionally, there was a lack of responses for category one across all items. The PAM-13 performs well in some areas, but not all. In general, more items need to be added to cover the upper end of the trait. The four response categories of the PAM-13 should be collapsed into three.
A Psychometric Approach to Theory-Based Behavior Change Intervention Development: Example From the Colorado Meaning-Activity Project.

Science.gov (United States)

Masters, Kevin S; Ross, Kaile M; Hooker, Stephanie A; Wooldridge, Jennalee L

2018-05-18

There has been a notable disconnect between theories of behavior change and behavior change interventions. Because few interventions are both explicitly and adequately theory-based, investigators cannot assess the impact of theory on intervention effectiveness. Theory-based interventions, designed to deliberately engage the theory's proposed mechanisms of change, are needed to adequately test theories. Thus, systematic approaches to theory-based intervention development are needed. This article will introduce and discuss the psychometric method of developing theory-based interventions. The psychometric approach to intervention development utilizes basic psychometric principles at each step of the intervention development process in order to build a theoretically driven intervention to, subsequently, be tested in process (mechanism) and outcome studies. Five stages of intervention development are presented as follows: (i) Choice of theory; (ii) Identification and characterization of key concepts and expected relations; (iii) Intervention construction; (iv) Initial testing and revision; and (v) Empirical testing of the intervention. Examples of this approach from the Colorado Meaning-Activity Project (COMAP) are presented. Based on self-determination theory integrated with meaning or purpose, and utilizing a motivational interviewing approach, the COMAP intervention is individually based with an initial interview followed by smart phone-delivered interventions for increasing daily activity. The psychometric approach to intervention development is one method to ensure careful consideration of theory in all steps of intervention development. This structured approach supports developing a research culture that endorses deliberate and systematic operationalization of theory into behavior change intervention from the outset of intervention development.
The Social Provisions Scale: psychometric properties of the SPS-10 among participants in nature-based services.

Science.gov (United States)

Steigen, Anne Mari; Bergh, Daniel

2018-02-05

This article analyses the psychometric properties of the 10-item version of the Social Provisions Scale. The Social Provisions Scale was analysed by means of the polytomous Rasch model, applied to data on 93 young adults (16-30 years) out of school or work, participating in different nature-based services, due to mental or drug-related problems. The psychometric analysis concludes that the original scale has difficulties related to targeting and construct validity. In order to improve the psychometric properties, the scale was modified to include eight items measuring functional support. The modification was based on theoretical and statistical considerations. After modification, the scale not only showed satisfactory psychometric properties but also clarified uncertainties regarding the construct validity of the measure. However, further analyses of larger samples are required. Implications for Rehabilitation: Social support is important for a variety of rehabilitation outcomes and for different patient groups in the rehabilitation context, including people with mental health or drug-related problems. The Social Provisions Scale may be used as a screening tool to assess social support of participants in rehabilitation, and the scale may also be an important instrument in rehabilitation research. There might be issues measuring structural support using the 10-item version of the Social Provisions Scale, but it seemed to work well as an 8-item scale measuring functional support.
Psychometric validation of the Persian nine-item Internet Gaming Disorder Scale - Short Form: Does gender and hours spent online gaming affect the interpretations of item descriptions?

Science.gov (United States)

Wu, Tzu-Yi; Lin, Chung-Ying; Årestedt, Kristofer; Griffiths, Mark D; Broström, Anders; Pakpour, Amir H

2017-06-01

Background and aims: The nine-item Internet Gaming Disorder Scale - Short Form (IGDS-SF9) is a brief and effective instrument for evaluating Internet Gaming Disorder (IGD) severity. Although its scores show promising psychometric properties, less is known about whether different groups of gamers interpret the items similarly. This study aimed to verify the construct validity of the Persian IGDS-SF9 and examine the scores in relation to gender and hours spent online gaming among 2,363 Iranian adolescents. Methods: Confirmatory factor analysis (CFA) and Rasch analysis were used to examine the construct validity of the IGDS-SF9. The effects of gender and time spent online gaming per week were investigated by multigroup CFA and Rasch differential item functioning (DIF). Results: The unidimensionality of the IGDS-SF9 was supported in both CFA and Rasch. However, Item 4 (fail to control or cease gaming activities) displayed DIF (DIF contrast = 0.55) slightly over the recommended cutoff in Rasch but was invariant in multigroup CFA across gender. Items 4 (DIF contrast = -0.67) and 9 (jeopardize or lose an important thing because of gaming activity; DIF contrast = 0.61) displayed DIF in Rasch and were non-invariant in multigroup CFA across time spent online gaming. Conclusions: Given that the Persian IGDS-SF9 was unidimensional, it is concluded that the instrument can be used to assess IGD severity. However, users of the instrument are cautioned concerning the comparisons of the sum scores of the IGDS-SF9 across gender and across adolescents spending different amounts of time online gaming.
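As a reading aid for the DIF contrasts quoted in this abstract: a DIF contrast is the difference, in logits, between an item's difficulty estimated separately in two groups. The sketch below is illustrative only. The function name, the per-group difficulty values, and the 0.5-logit cutoff are assumptions, not figures from the study (the abstract says only that 0.55 is slightly over the recommended cutoff).

```python
def dif_contrasts(difficulty_a, difficulty_b, cutoff=0.5):
    """Return {item: (contrast, flagged)} for items estimated in both groups.

    contrast = difficulty in group A minus difficulty in group B (logits).
    flagged  = True when |contrast| exceeds the (assumed) cutoff.
    """
    result = {}
    for item in difficulty_a:
        if item in difficulty_b:
            contrast = difficulty_a[item] - difficulty_b[item]
            result[item] = (contrast, abs(contrast) > cutoff)
    return result


# Hypothetical item difficulties (logits) for two groups; not study data.
group_a = {"item4": 0.80, "item9": -0.10}
group_b = {"item4": 0.25, "item9": -0.05}
flags = dif_contrasts(group_a, group_b)
```

With these made-up values, item4 has a contrast of about 0.55 and is flagged, while item9 is not.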
Assessment of time management skills: psychometric properties of the Swedish version.

Science.gov (United States)

Janeslätt, Gunnel Kristina; Holmqvist, Kajsa Lidström; White, Suzanne; Holmefur, Marie

2018-05-01

Persons with impaired time management skills are often in need of occupational therapy. Valid and reliable instruments to assess time management and organizational skills are needed for the evaluation of intervention. The purpose of this study was to evaluate the psychometric properties of a Swedish version of the Assessment of Time Management Skills (ATMS-S) for persons with and without impaired time management skills. A total of 238 persons participated in the study, of whom 94 had self-reported impaired time management skills due to mental disorders such as schizophrenic spectrum or neurodevelopmental disorders such as attention deficit/hyperactivity disorder (ADHD), autism spectrum disorder (ASD) and mild intellectual disabilities, and 144 persons had no reported impaired time management skills. Rasch analysis was used to analyze data. Three subscales were detected: the time management subscale with 11 items, the organization & planning subscale with 11 items, and the subscale of regulation of emotions with 5 items, with excellent to acceptable psychometric properties. The conclusions were that: ATMS-S is a valid instrument for self-rating of time management, organization & planning and for the regulation of emotions. ATMS-S can be useful for persons with mental disorders including mild neurodevelopmental disorders.
Monte Carlo tests of the Rasch model based on scalability coefficients

DEFF Research Database (Denmark)

Christensen, Karl Bang; Kreiner, Svend

2010-01-01

For item responses fitting the Rasch model, the assumptions underlying the Mokken model of double monotonicity are met. This makes non-parametric item response theory a natural starting-point for Rasch item analysis. This paper studies scalability coefficients based on Loevinger's H coefficient that summarizes the number of Guttman errors in the data matrix. These coefficients are shown to yield efficient tests of the Rasch model using p-values computed using Markov chain Monte Carlo methods. The power of the tests of unequal item discrimination, and their ability to distinguish between local dependence...
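To make the idea of a scalability coefficient built on Guttman errors concrete, here is a minimal sketch for dichotomous (0/1) items. It computes the scale-level Loevinger's H as one minus the ratio of observed Guttman errors to the number expected under independence. The function name and the toy data are assumptions for illustration. This is not the authors' procedure, and the Monte Carlo p-values they describe are not implemented here.

```python
from itertools import combinations


def loevinger_h(data):
    """Scale-level Loevinger's H for a 0/1 person-by-item matrix (list of rows).

    Assumes at least one item pair has a non-zero expected error count;
    otherwise the ratio below is undefined.
    """
    n_persons = len(data)
    n_items = len(data[0])
    # Proportion of positive responses per item (higher = easier item).
    p = [sum(row[i] for row in data) / n_persons for i in range(n_items)]
    observed = 0.0
    expected = 0.0
    for i, j in combinations(range(n_items), 2):
        # Order the pair so that `easy` has the higher proportion positive.
        easy, hard = (i, j) if p[i] >= p[j] else (j, i)
        # Guttman error: fails the easier item but passes the harder one.
        observed += sum(1 for row in data if row[easy] == 0 and row[hard] == 1)
        # Expected count of such errors if the two items were independent.
        expected += n_persons * (1 - p[easy]) * p[hard]
    return 1 - observed / expected


# Toy data (hypothetical): a perfect Guttman pattern has no errors, so H = 1.
perfect = [[1, 1, 1], [1, 1, 0], [1, 0, 0], [0, 0, 0]]
h_perfect = loevinger_h(perfect)
```

Adding response patterns that contradict the item ordering lowers H towards 0.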
Funding Medical Research Projects: Taking into Account Referees' Severity and Consistency through Many-Faceted Rasch Modeling of Projects' Scores.

Science.gov (United States)

Tesio, Luigi; Simone, Anna; Grzeda, Mariuzs T; Ponzio, Michela; Dati, Gabriele; Zaratin, Paola; Perucca, Laura; Battaglia, Mario A

2015-01-01

The funding policy of research projects often relies on scores assigned by a panel of experts (referees). The non-linear nature of raw scores and the severity and inconsistency of individual raters may generate unfair numeric project rankings. Rasch measurement (many-facets version, MFRM) provides a valid alternative to scoring. MFRM was applied to the scores achieved by 75 research projects on multiple sclerosis sent in response to a previous annual call by FISM-Italian Foundation for Multiple Sclerosis. This made it possible to simulate, a posteriori, the impact of MFRM on the funding scenario. The applications were each scored by 2 to 4 independent referees (total = 131) on a 10-item, 0-3 rating scale called FISM-ProQual-P. The rotation plan assured "connection" of all pairs of projects through at least 1 shared referee. The questionnaire satisfactorily fulfilled the stringent criteria of Rasch measurement for psychometric quality (unidimensionality, reliability and data-model fit). Arbitrarily, 2 acceptability thresholds were set at a raw score of 21/30 and at the equivalent Rasch measure of 61.5/100, respectively. When the cut-off was switched from score to measure, 8 out of 18 acceptable projects had to be rejected, while 15 rejected projects became eligible for funding. Some referees, of various severity, were grossly inconsistent (z-std fit indexes less than -1.9 or greater than 1.9). The FISM-ProQual-P questionnaire seems a valid and reliable scale. MFRM may help the decision-making process for allocating funds to MS research projects but also in other fields. In repeated assessment exercises it can help the selection of reliable referees. Their severity can be steadily calibrated, thus obviating the need to connect them with other referees assessing the same projects.

Was Pre-Modern Man a Child? The Quintessence of the Psychometric and Developmental Approaches

Science.gov (United States)

Oesterdiekhoff, Georg W.

2012-01-01

The essay integrates the psychometric intelligence approach with the cognitive-developmental approach or the stage theory erected by Piaget and his disciples. The latter led to Piagetian Cross-Cultural Psychology and the accumulation of an immense body of data. It shows that different IQ levels are indicative of the peculiar stages of cognitive…
The psychometric properties of the Emotional Quotient Inventory 2.0 in South Africa

Directory of Open Access Journals (Sweden)

Casper J.J. van Zyl

2014-11-01

Research purpose: The purpose of this study is to examine the psychometric properties of the Emotional Quotient Inventory (EQ-i 2.0) in South Africa. Item response and classical test theory methods are employed to investigate its item functioning and factor structure. Motivation for the study: Although there has been some scientific research published on the EQ-i in South Africa, there has been no research on the revised version, the EQ-i 2.0. In addition, criticism has been levied against the estimation of internal consistency reliability in the field of emotional intelligence. This study aims to fill these gaps in the literature. Research design, approach and method: This study followed a quantitative, non-experimental, cross-sectional design using secondary data. The sample comprised 1144 working adults (570 men and 574 women). The data were collected through an online platform as part of the standardisation process in South Africa. Main findings: Results from Rasch analysis showed that almost all the items fit the model. Cronbach’s alpha and McDonald’s omega estimates revealed satisfactory reliabilities. Confirmatory factor analysis at the composite level revealed acceptable fit with the exception of the total EQ model. Practical/managerial implications: This study supports the claim of reliability and validity of the EQ-i 2.0 in the South African context. Contribution/value-add: The study contributes significantly to the international body of evidence regarding the psychometric properties of the EQ-i 2.0 and provides supporting evidence for the appropriate use of this assessment in South Africa.
Estimating the Multilevel Rasch Model: With the lme4 Package

Directory of Open Access Journals (Sweden)

Harold Doran

2007-02-01

Traditional Rasch estimation of the item and student parameters via marginal maximum likelihood, joint maximum likelihood or conditional maximum likelihood assumes that individuals in clustered settings are uncorrelated and that items within a test that share a grouping structure are also uncorrelated. These assumptions are often violated, particularly in educational testing situations, in which students are grouped into classrooms and many test items share a common grouping structure, such as a content strand or a reading passage. Consequently, one possible approach is to explicitly recognize the clustered nature of the data and directly incorporate random effects to account for the various dependencies. This article demonstrates how the multilevel Rasch model can be estimated using the functions in R for mixed-effects models with crossed or partially crossed random effects. We demonstrate how to model the following hierarchical data structures: (a) individuals clustered in similar settings (e.g., classrooms, schools), (b) items nested within a particular group (such as a content strand or a reading passage), and (c) how to estimate a teacher × content strand interaction.
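One way to read the random-effects formulation described in this abstract is as a logistic mixed model. The notation below is an assumed sketch, not the article's own: person p in classroom c(p) answers item i, and both the symbols and the sign convention (difficulty subtracted from ability) are choices made here for illustration.

```latex
% Ordinary Rasch model written as a logistic model with a random person effect
\operatorname{logit}\Pr(y_{pi} = 1) = \theta_p - \delta_i,
\qquad \theta_p \sim N(0, \sigma_\theta^2)

% Adding a random classroom effect to capture clustering of persons
\operatorname{logit}\Pr(y_{pi} = 1) = \theta_p + u_{c(p)} - \delta_i,
\qquad u_c \sim N(0, \sigma_u^2)
```

Under this reading, the variance of the classroom effect measures how much of the variation in ability lies between classrooms rather than between individuals.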
Work environment impact scale: testing the psychometric properties of the Swedish version.

Science.gov (United States)

Ekbladh, Elin; Fan, Chia-Wei; Sandqvist, Jan; Hemmingsson, Helena; Taylor, Renée

2014-01-01

The Work Environment Impact Scale (WEIS) is an assessment that focuses on the fit between a person and his or her work environment. It is based on Kielhofner's Model of Human Occupation and designed to gather information on how clients experience their work environment. The aim of this study was to examine the psychometric properties of the Swedish version of the WEIS assessment instrument. In total, 95 ratings on the 17-item WEIS were obtained from a sample of clients with experience of sick leave due to different medical conditions. Rasch analysis was used to analyze the data. Overall, the WEIS items together cohered to form a single construct of increasingly challenging work environmental factors. The hierarchical ordering of the items along the continuum followed a logical and expected pattern, and the participants were validly measured by the scale. The three occupational therapists serving as raters validly used the scale, but demonstrated a relatively high rater separation index, indicating differences in rater severity. The findings provide evidence that the Swedish version of the WEIS is a psychometrically sound assessment across diagnoses and occupations, which can provide valuable information about experiences of work environment challenges.
Rasch family models in e-learning: analyzing architectural sketching with a digital pen.

Science.gov (United States)

Scalise, Kathleen; Cheng, Nancy Yen-Wen; Oskui, Nargas

2009-01-01

Since architecture students studying design drawing are usually assessed qualitatively on the basis of their final products, the challenges and stages of their learning have remained masked. To clarify the challenges in design drawing, we have been using the BEAR Assessment System and Rasch family models to measure levels of understanding for individuals and groups, in order to correct pedagogical assumptions and tune teaching materials. This chapter discusses the analysis of 81 drawings created by architectural students to solve a space layout problem, collected and analyzed with digital pen-and-paper technology. The approach allows us to map developmental performance criteria and perceive achievement overlaps in learning domains assumed separate, and then re-conceptualize a three-part framework to represent learning in architectural drawing. Results and measurement evidence from the assessment and Rasch modeling are discussed.
Measurement properties of the CLOX Executive Clock Drawing Task in an inpatient stroke rehabilitation setting.

Science.gov (United States)

Zuverza-Chavarria, Virginia; Tsanadis, John

2011-05-01

The goal of this study was to explore the psychometric properties of the CLOX Executive Clock Drawing Task (Royall, Cordes, & Polk, 1998) in persons who had sustained a stroke and were receiving inpatient rehabilitation. Rasch modeling was utilized to examine the psychometric properties of the CLOX. Separate analyses were conducted for the free draw (CLOX 1) and copy (CLOX 2) portions of the measure to investigate each presentation mode independently. The sample consisted of 66 inpatient adults who had sustained a stroke. CLOX 1 met most Rasch model expectations for item fit, unidimensionality, test reliability, and sample targeting. CLOX 2 was less psychometrically sound and contained two items with significant misfit. CLOX 2 demonstrated a significant ceiling effect that resulted in poor sample targeting. CLOX 1 is a psychometrically sound screening instrument for assessing persons with stroke receiving inpatient rehabilitation. In addition to the psychometric weaknesses of CLOX 2, its interpretive yield is minimal and clinicians may consider omitting it. Recommendations are made for using the Rasch item-person maps in clinical practice.
Psychometric validation of the Persian nine-item Internet Gaming Disorder Scale – Short Form: Does gender and hours spent online gaming affect the interpretations of item descriptions?

Science.gov (United States)

Wu, Tzu-Yi; Lin, Chung-Ying; Årestedt, Kristofer; Griffiths, Mark D.; Broström, Anders; Pakpour, Amir H.

2017-01-01

Background and aims: The nine-item Internet Gaming Disorder Scale – Short Form (IGDS-SF9) is a brief and effective instrument for evaluating Internet Gaming Disorder (IGD) severity. Although its scores show promising psychometric properties, less is known about whether different groups of gamers interpret the items similarly. This study aimed to verify the construct validity of the Persian IGDS-SF9 and examine the scores in relation to gender and hours spent online gaming among 2,363 Iranian adolescents. Methods: Confirmatory factor analysis (CFA) and Rasch analysis were used to examine the construct validity of the IGDS-SF9. The effects of gender and time spent online gaming per week were investigated by multigroup CFA and Rasch differential item functioning (DIF). Results: The unidimensionality of the IGDS-SF9 was supported in both CFA and Rasch. However, Item 4 (fail to control or cease gaming activities) displayed DIF (DIF contrast = 0.55) slightly over the recommended cutoff in Rasch but was invariant in multigroup CFA across gender. Items 4 (DIF contrast = −0.67) and 9 (jeopardize or lose an important thing because of gaming activity; DIF contrast = 0.61) displayed DIF in Rasch and were non-invariant in multigroup CFA across time spent online gaming. Conclusions: Given that the Persian IGDS-SF9 was unidimensional, it is concluded that the instrument can be used to assess IGD severity. However, users of the instrument are cautioned concerning the comparisons of the sum scores of the IGDS-SF9 across gender and across adolescents spending different amounts of time online gaming. PMID: 28571474
Improving the Individual Work Performance Questionnaire using Rasch analysis.

OpenAIRE

Koopmans, L.; Bernaards, C.M.; Hildebrandt, V.H.; Buuren, S. van; Beek, A.J. van der; Vet, H.C.W. de

2014-01-01

Recently, the Individual Work Performance Questionnaire (IWPQ) version 0.2 was developed using Rasch analysis. The goal of the current study was to improve targeting of the IWPQ scales by including additional items. The IWPQ 0.2 (original) and 0.3 (including additional items) were examined using Rasch analysis. Additional items that showed misfit or did not improve targeting were removed from the IWPQ 0.3, resulting in a final IWPQ 1.0. Subsequently, the scales showed good model fit and relia...
Detecting Aberrant Response Patterns in the Rasch Model. Rapport 87-3.

Science.gov (United States)

Kogut, Jan

In this paper, the detection of response patterns aberrant from the Rasch model is considered. For this purpose, a new person fit index, recently developed by I. W. Molenaar (1987) and an iterative estimation procedure are used in a simulation study of Rasch model data mixed with aberrant data. Three kinds of aberrant response behavior are…
A discussion of the limitations of the psychometric and cultural theory approaches to risk perception

International Nuclear Information System (INIS)

Sjoeberg, L.

1996-01-01

Risk perception has traditionally been conceived as a cognitive phenomenon, basically a question of information processing. The very term perception suggests that information processing is involved and of crucial importance. Kahneman and Tversky suggested that the use of 'heuristics' in the intuitive estimation of probabilities accounts for biased probability perception, hence claiming to explain risk perception as well. The psychometric approach of Slovic et al., a further step in the cognitive tradition, conceives of perceived risk as a function of general properties of a hazard. However, the psychometric approach is shown here to explain only about 20% of the variance of perceived risk, even less of risk acceptability. Its claim to explanatory power is based on a statistical illusion: mean values were investigated and accounted for, across hazards. A currently popular alternative to the psychometric tradition, Cultural Theory, is even less successful and explains only about 5% of the variance of perceived risk. The claims of this approach were also based on a statistical illusion: 'significant' results were reported and interpreted as being of substantial importance. The present paper presents a new approach: attitude to the risk generating technology, general sensitivity to risks and specific risk explained well over 60% of the variance of perceived risk of nuclear waste, in a study of extensive data from a representative sample of the Swedish population. The attitude component functioning as an explanatory factor of perceived risk, rather than as a consequence of perceived risk, suggests strongly that perceived risk is something other than cognition. Implications for risk communication are discussed.
Measuring health-related problem solving among African Americans with multiple chronic conditions: application of Rasch analysis.

Science.gov (United States)

Fitzpatrick, Stephanie L; Hill-Briggs, Felicia

2015-10-01

Identification of patients with poor chronic disease self-management skills can facilitate treatment planning, determine effectiveness of interventions, and reduce disease complications. This paper describes the use of a Rasch model, the Rating Scale Model, to examine psychometric properties of the 50-item Health Problem-Solving Scale (HPSS) among 320 African American patients with high risk for cardiovascular disease. Items on the positive/effective HPSS subscales targeted patients at low, moderate, and high levels of positive/effective problem solving, whereas items on the negative/ineffective problem solving subscales mostly targeted those at moderate or high levels of ineffective problem solving. Validity was examined by correlating factor scores on the measure with clinical and behavioral measures. Items on the HPSS show promise in the ability to assess health-related problem solving among high risk patients. However, further revisions of the scale are needed to increase its usability and validity with large, diverse patient populations in the future.
Oswestry Disability Index: a psychometric analysis with 1,610 patients.

Science.gov (United States)

Brodke, Darrel S; Goz, Vadim; Lawrence, Brandon D; Spiker, W Ryan; Neese, Ashley; Hung, Man

2017-03-01

One-fourth of the adult US population has or will experience back pain and has undergone one of a myriad of treatments. Understanding the outcomes of these many treatments from pharmacologic to surgical, from manipulation to modality, allows for a better understanding and value-driven decision making. Patient-reported outcome measures are the current standard and include general and disease-specific measures. The Oswestry Disability Index (ODI) is the most commonly used disease-specific patient-reported outcome tool to measure functional disability related to back pain. Few studies have evaluated its psychometric properties in a large patient sample using a modern tool such as the Rasch analysis model. This study aims to identify the benefits and deficiencies of the ODI as an outcome tool for assessing patients with back pain. This study aimed to investigate the psychometric properties, performance, and applicability of the ODI in patients with back pain who visited a university-based outpatient clinic. This study used a secondary analysis-assessment of diagnostic tool on consecutive patients. The sample comprised 1,610 patients visiting an academic spine center. The ODI was the outcome measure. Detailed Rasch analysis of the ODI was performed. Standard descriptive statistics were also assessed. The ODI performed well overall. It demonstrated suboptimal unidimensionality (ie, unexplained variance after accounting for the first dimension) of 8.3%. Person reliability was good, at 0.85, and item reliability was excellent, at 1.00. The overall item fit for the ODI was good with an outfit mean square of 1.02. The ODI had a floor effect of 29.9% and ceiling effect of 3.9%. The raw score to measure correlation of the ODI was excellent, at 0.944. The ODI performed relatively well overall, with some problematic findings. It had good person and item reliability, although it did not demonstrate strong evidence of unidimensionality. The ODI has moderately poor coverage, with a
Guessing and the Rasch Model

Science.gov (United States)

Holster, Trevor A.; Lake, J.

2016-01-01

Stewart questioned Beglar's use of Rasch analysis of the Vocabulary Size Test (VST) and advocated the use of 3-parameter logistic item response theory (3PLIRT) on the basis that it models a non-zero lower asymptote for items, often called a "guessing" parameter. In support of this theory, Stewart presented fit statistics derived from…
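The difference between the two models contrasted in this abstract can be seen directly in their item response functions. The sketch below uses the standard textbook forms. The function names and the example parameter values (a four-option item with a lower asymptote of 0.25) are assumptions for illustration, not values from the VST analyses.

```python
import math


def rasch_prob(theta, b):
    """Rasch model: probability of a correct response given ability theta and difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def three_pl_prob(theta, a, b, c):
    """3PL model: discrimination a, difficulty b, lower asymptote ("guessing") c."""
    return c + (1.0 - c) / (1.0 + math.exp(-a * (theta - b)))


# Hypothetical item: difficulty 0 logits, discrimination 1, guessing 0.25.
low_ability = -4.0
p_rasch = rasch_prob(low_ability, b=0.0)
p_3pl = three_pl_prob(low_ability, a=1.0, b=0.0, c=0.25)
```

For a very low-ability person the Rasch probability approaches 0, whereas the 3PL probability approaches the lower asymptote c. With a = 1 and c = 0 the 3PL function reduces to the Rasch function.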
Item Information in the Rasch Model

NARCIS (Netherlands)

Engelen, Ron J.H.; van der Linden, Willem J.; Oosterloo, Sebe J.

1988-01-01

Fisher's information measure for the item difficulty parameter in the Rasch model and its marginal and conditional formulations are investigated. It is shown that expected item information in the unconditional model equals information in the marginal model, provided the assumption of sampling
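As background for this entry: in the Rasch model, the Fisher information that a single response carries about the item difficulty, for a person of known ability, is P(1 − P), where P is the probability of a correct response. The sketch below computes that quantity and averages it over a sample of abilities. The function names and the ability values are assumptions, and the averaging is a simple illustration rather than the paper's marginal or conditional derivation.

```python
import math


def rasch_prob(theta, b):
    """Probability of a correct response under the Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_information(theta, b):
    """Fisher information of one response about b (equivalently theta): P * (1 - P)."""
    p = rasch_prob(theta, b)
    return p * (1.0 - p)


def mean_item_information(thetas, b):
    """Average single-response information over a sample of abilities."""
    return sum(item_information(t, b) for t in thetas) / len(thetas)


# Hypothetical ability sample (logits); not data from the paper.
abilities = [-2.0, -1.0, 0.0, 1.0, 2.0]
info_at_match = item_information(0.0, 0.0)
info_average = mean_item_information(abilities, 0.0)
```

Information peaks at 0.25 when ability equals difficulty and falls off symmetrically, so items located far from the sample's abilities are estimated less precisely.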
Reliability in the Rasch Model

Czech Academy of Sciences Publication Activity Database

Martinková, Patrícia; Zvára, K.

2007-01-01

Vol. 43, No. 3 (2007), pp. 315-326. ISSN 0023-5954. R&D Projects: GA MŠk(CZ) 1M06014. Institutional research plan: CEZ:AV0Z10300504. Keywords: Cronbach's alpha; Rasch model; reliability. Subject RIV: BB - Applied Statistics, Operational Research. Impact factor: 0.552 (2007). http://dml.cz/handle/10338.dmlcz/135776
Psychometric properties of the Mini-Mental State Examination in patients with acquired brain injury in Turkey.

Science.gov (United States)

Elhan, Atilla H; Kutlay, Sehim; Küçükdeveci, Ayse A; Cotuk, Cigdem; Oztürk, Gülsah; Tesio, Luigi; Tennant, Alan

2005-09-01

To evaluate the psychometric properties of Mini-Mental State Examination (MMSE) in patients with acquired brain injury in Turkey. A total of 207 patients with acquired brain injury were assessed. Reliability was tested by internal consistency and the person separation index; internal construct validity by Rasch analysis; external construct validity by correlation with cognitive disability; and cross-cultural validity by differential item functioning analysis compared with Italian MMSE data. Reliability was adequate with a Cronbach's alpha of 0.75 and person separation index of 0.76. After collapsing some categories, and adjustment for differential item functioning, internal construct validity was supported by fit of the data to Rasch model. Differential item functioning for culture was found in 2 items and after adjustment, data could be pooled between Turkey and Italy. External construct validity was supported by expected associations. The Turkish version of the Mini-Mental State Examination can be used as a cognitive screening tool in acquired brain injury. Cross-cultural validity between Italy and Turkey is supported, given appropriate adjustment for differential item functioning. However, shortfalls in reliability at the individual level, as well as the presence of differential item functioning suggest that a better instrument should be developed to screen for cognitive deficits following acquired brain injury.
Screening for depressed mood in an adolescent psychiatric context by brief self-assessment scales -- testing psychometric validity of WHO-5 and BDI-6 indices by latent trait analyses

DEFF Research Database (Denmark)

Blom, Eva Henje; Bech, Per; Högberg, Göran

2012-01-01

of two such scales, which may be used in a two-step screening procedure, the WHO-Five Well-being Index (WHO-5) and the six-item version of Beck's Depression Inventory (BDI-6). METHOD: 66 adolescent psychiatric patients with a clinical diagnosis of major depressive disorder (MDD), 60 girls and 6 boys, aged 14-18 years, mean age 16.8 years, completed the WHO-5 scale as well as the BDI-6. Statistical validity was tested by Mokken and Rasch analyses. RESULTS: The correlation between WHO-5 and BDI-6 was -0.49 (p=0.0001). Mokken analyses showed a coefficient of homogeneity for the WHO-5 of 0.52 and for the BDI-6 of 0.46. Rasch analysis also accepted unidimensionality when testing males versus females (p > 0.05). CONCLUSIONS: The WHO-5 is psychometrically valid in an adolescent psychiatric context including both genders to assess the wellness dimension and applicable as a first step in screening for MDD...
Psychometric properties of the Hebrew translation of the patient activation measure (PAM-13).

Science.gov (United States)

Magnezi, Racheli; Glasser, Saralee

2014-01-01

"Patient activation" reflects involvement in managing one's health. This cross-sectional study assessed the psychometric properties of the Hebrew translation (PAM-H) of the PAM-13. A nationally representative sample of 203 Hebrew-speaking Israeli adults answered the PAM-H, PHQ-9 depression scale, SF-12, and Self-efficacy Scale via telephone. Mean PAM-H scores were 70.7±15.4. Rasch analysis indicated that the PAM-H is a good measure of activation. There were no differences in PAM-H scores based on gender, age or education. Subjects with chronic disease scored lower than those without. Scores correlated with the Self-efficacy Scale (0.47), Total SF-12 (0.39) and PHQ-9 (-0.35, P [...] PAM-H score of those who scored below 10 (72.1±14.8) on the PHQ-9 (not depressed) compared to those scoring ≥10 (i.e. probable depression) (59.2±15.8; t 3.75; P = 0.001). The PAM-H psychometric properties indicate its usefulness with the Hebrew-speaking Israeli population. PAM-H can be useful for assessing programs aimed at effecting changes in patient compliance, health behaviors, etc. Researchers in Israel should use a single translation of the PAM-13 so that findings can be compared, increasing understanding of patient activation.
Diagnosis of students' ability in a statistical course based on Rasch probabilistic outcome

Science.gov (United States)

Mahmud, Zamalia; Ramli, Wan Syahira Wan; Sapri, Shamsiah; Ahmad, Sanizah

2017-06-01

Measuring students' ability and performance is important in assessing how well students have learned and mastered the statistical courses. Any improvement in learning will depend on the student's approaches to learning, which are relevant to some factors of learning, namely assessment methods carrying out tasks consisting of quizzes, tests, assignment and final examination. This study has attempted an alternative approach to measure students' ability in an undergraduate statistical course based on the Rasch probabilistic model. Firstly, this study aims to explore the learning outcome patterns of students in a statistics course (Applied Probability and Statistics) based on an Entrance-Exit survey. This is followed by investigating students' perceived learning ability based on four Course Learning Outcomes (CLOs) and students' actual learning ability based on their final examination scores. Rasch analysis revealed that students perceived themselves as lacking the ability to understand about 95% of the statistics concepts at the beginning of the class, but eventually they had a good understanding at the end of the 14-week class. In terms of students' performance in their final examination, their ability in understanding the topics varies at different probability values given the ability of the students and difficulty of the questions. The majority found the probability and counting rules topic to be the most difficult to learn.
USING RASCH ANALYSIS TO EXPLORE WHAT STUDENTS LEARN ABOUT PROBABILITY CONCEPTS

Directory of Open Access Journals (Sweden)

Zamalia Mahmud

2015-01-01

Students’ understanding of probability concepts has been investigated from various perspectives. This study set out to investigate the perceived understanding of probability concepts of forty-four students from the STAT131 Understanding Uncertainty and Variation course at the University of Wollongong, NSW. Rasch measurement, which is based on a probabilistic model, was used to identify concepts that students find easy, moderate and difficult to understand. Data were captured from the e-learning Moodle platform where students provided their responses through an on-line quiz. As illustrated in the Rasch map, 96% of the students could understand sample space, simple events, mutually exclusive events and tree diagrams, while 67% of the students found concepts of conditional and independent events rather easy to understand. Keywords: Perceived Understanding, Probability Concepts, Rasch Measurement Model. DOI: dx.doi.org/10.22342/jme.61.1

Measuring Health-Related Quality of Life in Strabismus: A Modification of the Adult Strabismus-20 (AS-20) Questionnaire Using Rasch Analysis.

Directory of Open Access Journals (Sweden)

Vijaya K Gothwal

Full Text Available To evaluate the psychometric properties of the Adult Strabismus-20 (AS-20), a health-related quality of life (HRQoL) questionnaire, in adults with strabismus, and if flawed, to revise the AS-20 and its subscales, creating valid measurement scales. 584 adults (mean age, 27.5 years) with strabismus were recruited from an outpatient clinic at a South Indian tertiary eye care centre and were administered the AS-20 questionnaire. The AS-20 was translated and back translated into two Indian languages. The AS-20 and its two 10-item subscales - 'psychosocial' and 'function' - were assessed separately for fit to the Rasch model, including an assessment of the rating scale, unidimensionality (by principal components analysis), measurement precision (by person separation reliability, PSR), targeting, and differential item functioning (DIF; notable > 1.0 logits). Response categories were not used as intended and therefore required re-organization, reducing their number from 5 to 3. The AS-20 had adequate measurement precision (PSR = 0.87) but lacked unidimensionality; however, deletion of the six multi-dimensionality causing items and an additional three misfitting items resulted in an 11-item unidimensional questionnaire (AS-11). Two items failed to satisfy the model expectations in the 'psychosocial' subscale and were deleted, resulting in an 8-item unidimensional scale with adequate PSR (0.81) and targeting (0.23 logits). One item misfit in the 'function' subscale and was deleted, resulting in a 9-item Rasch-revised unidimensional subscale with acceptable PSR (0.80) and targeting (0.97 logits). None of the items displayed notable DIF by age, gender and level of education. The AS-11 and its two Rasch-revised subscales - 8-item psychosocial and 9-item function subscale - may be more appropriate than the original AS-20 and its two 10-item subscales for use as unidimensional measures of HRQoL in adults with strabismus in India. Further work is required to establish the validity of the
Evaluation of the psychometric properties of the Nighttime Symptoms of COPD Instrument

Directory of Open Access Journals (Sweden)

Mocarski M

2015-03-01

Full Text Available Michelle Mocarski,1 Erica Zaiser,2 Dylan Trundell,2 Barry J Make,3 Asha Hareendran2 1Forest Research Institute, Inc., an affiliate of Actavis, Inc., Jersey City, NJ, USA; 2Evidera, London, UK; 3National Jewish Health, Denver, CO, USA. Background: Nighttime symptoms can negatively impact the quality of life of patients with chronic obstructive pulmonary disease (COPD). The Nighttime Symptoms of COPD Instrument (NiSCI) was designed to measure the occurrence and severity of nighttime symptoms in patients with COPD, the impact of symptoms on nighttime awakenings, and rescue medication use. The objective of this study was to explore item reduction, inform scoring recommendations, and evaluate the psychometric properties of the NiSCI. Methods: COPD patients participating in a Phase III clinical trial completed the NiSCI daily. Item analyses were conducted using weekly mean and single day scores. Descriptive statistics (including percentage of respondents at floor/ceiling and inter-item correlations), factor analyses, and Rasch model analyses were conducted to examine item performance and scoring. Test–retest reliability was assessed for the final instrument using the intraclass correlation coefficient (ICC). Correlations with assessments conducted during study visits were used to evaluate convergent and known-groups validity. Results: Data from 1,663 COPD patients aged 40–93 years were analyzed. Item analyses supported the generation of four scores. A one-factor structure was confirmed with factor analysis and Rasch analysis for the symptom severity score. Test–retest reliability was confirmed for the six-item symptom severity (ICC, 0.85), number of nighttime awakenings (ICC, 0.82), and rescue medication (ICC, 0.68) scores. Convergent validity was supported by significant correlations between the NiSCI, St George’s Respiratory Questionnaire, and Exacerbations of Chronic Obstructive Pulmonary Disease Tool-Respiratory Symptoms scores. Conclusion: The
Psychometric Principles in Measurement for Geoscience Education Research: A Climate Change Example

Science.gov (United States)

Libarkin, J. C.; Gold, A. U.; Harris, S. E.; McNeal, K.; Bowles, R.

2015-12-01

Understanding learning in geoscience classrooms requires that we use valid and reliable instruments aligned with intended learning outcomes. Nearly one hundred instruments assessing conceptual understanding in undergraduate science and engineering classrooms (often called concept inventories) have been published and are actively being used to investigate learning. The techniques used to develop these instruments vary widely, often with little attention to psychometric principles of measurement. This paper will discuss the importance of using psychometric principles to design, evaluate, and revise research instruments, with particular attention to the validity and reliability steps that must be undertaken to ensure that research instruments are providing meaningful measurement. An example from a climate change inventory developed by the authors will be used to exemplify the importance of validity and reliability, including the value of item response theory for instrument development. A 24-item instrument was developed based on published items, conceptions research, and instructor experience. Rasch analysis of over 1000 responses provided evidence for the removal of 5 items for misfit and one item for potential bias as measured via differential item functioning. The resulting 18-item instrument can be considered a valid and reliable measure based on pre- and post-implementation metrics. Consideration of the relationship between respondent demographics and concept inventory scores provides unique insight into the relationship between gender, religiosity, values and climate change understanding.
Validation of VARK learning modalities questionnaire using Rasch analysis

Science.gov (United States)

Fitkov-Norris, E. D.; Yeghiazarian, A.

2015-02-01

This article discusses the application of Rasch analysis to assess the internal validity of a four sub-scale VARK (Visual, Auditory, Read/Write and Kinaesthetic) learning styles instrument. The results from the analysis show that the Rasch model fits the majority of the VARK questionnaire data and the sample data support the internal validity of the four sub-constructs at 1% level of significance for all but one item. While this suggests that the instrument could potentially be used as a predictor for a person's learning preference orientation, further analysis is necessary to confirm the invariability of the instrument across different user groups across factors such as gender, age, educational and cultural background.
Creating a brief rating scale for the assessment of learning disabilities using reliability and true score estimates of the scale's items based on the Rasch model.

Science.gov (United States)

Sideridis, Georgios; Padeliadu, Susana

2013-01-01

The purpose of the present studies was to provide the means to create brief versions of instruments that can aid the diagnosis and classification of students with learning disabilities and comorbid disorders (e.g., attention-deficit/hyperactivity disorder). A sample of 1,108 students with and without a diagnosis of learning disabilities took part in study 1. Using information from modern theory methods (i.e., the Rasch model), a scale was created that included fewer than one third of the original battery items designed to assess reading skills. This best item synthesis was then evaluated for its predictive and criterion validity with a valid external reading battery (study 2). Using a sample of 232 students with and without learning disabilities, results indicated that the brief version of the scale was equally effective as the original scale in predicting reading achievement. Analysis of the content of the brief scale indicated that the best item synthesis involved items from cognition, motivation, strategy use, and advanced reading skills. It is suggested that multiple psychometric criteria be employed in evaluating the psychometric adequacy of scales used for the assessment and identification of learning disabilities and comorbid disorders.
Rasch Measurement Analysis of a 25-Item Version of the Mueller/McCloskey Nurse Job Satisfaction Scale in a Sample of Nurses in Lebanon and Qatar

Directory of Open Access Journals (Sweden)

Michael Clinton

2015-06-01

Full Text Available The Mueller/McCloskey Nurse Job Satisfaction Scale (MMSS) is widely used, but its psychometric characteristics have not been sufficiently validated for use in Middle Eastern countries. The objective of our methodological study was to determine the psychometric suitability of a 25-item version of the MMSS (MMSS-25) for use in middle-income and high-income Middle Eastern countries. A total of 1,322 registered nurses, 859 in Lebanon and 463 in Qatar, completed the MMSS-25 as part of a cross-sectional multinational investigation of nursing shortages in the region. We used the Rasch rating scale model to investigate the psychometric performance of the MMSS-25. We identified possible item bias among MMSS-25 items. We conducted confirmatory factor analyses (CFA) to compare the fit to our data of five factor structures reported in the literature. We concluded that irrespective of administration in English or Arabic, the MMSS-25 is not sufficiently productive of measurement for use in the region. A core set of 13 items (MMSS-13, Cronbach’s α = .82) loading on five dimensions eliminates redundant MMSS items and is suitable for initial screening of nurses’ satisfaction. Of the five factor structures we examined, the MMSS-13 was the only close fit to our data (comparative fit index = 0.951; Tucker–Lewis index = 0.931; root mean square error of approximation = 0.051; p value = .401). The MMSS-13 has psychometric characteristics superior to MMSS-25, but additional items are required to meet the research-specific objectives of future studies of nurses’ job satisfaction in Middle Eastern countries.
Predicting the need for institutional care shortly after admission to rehabilitation: Rasch analysis and predictive validity of the BRASS Index.

Science.gov (United States)

Panella, L; La Porta, F; Caselli, S; Marchisio, S; Tennant, A

2012-09-01

Effective discharge planning is increasingly recognised as a critical component of hospital-based Rehabilitation. The BRASS index is a risk screening tool for identification, shortly after hospital admission, of patients who are at risk of post-discharge problems. To evaluate the internal construct validity and reliability of the Blaylock Risk Assessment Screening Score (BRASS) within the rehabilitation setting. Observational prospective study. Rehabilitation ward of an Italian district hospital. One hundred and four consecutively admitted patients. Using classical psychometric methods and Rasch analysis (RA), the internal construct validity and reliability of the BRASS were examined. Also, external and predictive validity of the Rasch-modified BRASS (RMB) score were determined. Reliability of the original BRASS was low (Cronbach's alpha=0.595) and factor analyses showed that it was clearly multidimensional. A RA, based on a reduced 7-BRASS item set (RMB), satisfied model's expectations. Reliability was 0.777. The RMB scores strongly correlated with the original BRASS (rho=0.952; P28 days (RR=7.6, 95%CI=1.8-31.9). This study demonstrated that the original BRASS was multidimensional and unreliable. However, the RMB holds adequate internal construct validity and is sufficiently reliable as a predictor of discharge problems for group, but not individual use. The application of tools and methods (such as the BRASS Index) developed under the biomedical paradigm in a Physical and Rehabilitation Medicine setting may have limitations. Further research is needed to develop, within the rehabilitation setting, a valid measuring tool of risk of post-discharge problems at the individual level.
A Comparison of Uniform DIF Effect Size Estimators under the MIMIC and Rasch Models

Science.gov (United States)

Jin, Ying; Myers, Nicholas D.; Ahn, Soyeon; Penfield, Randall D.

2013-01-01

The Rasch model, a member of a larger group of models within item response theory, is widely used in empirical studies. Detection of uniform differential item functioning (DIF) within the Rasch model typically employs null hypothesis testing with a concomitant consideration of effect size (e.g., signed area [SA]). Parametric equivalence between…
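The signed area mentioned above can be illustrated numerically. For two Rasch item characteristic curves, the area between the reference-group and focal-group curves reduces to the difference between the two item difficulties, which is a standard result for models with equal discrimination. The Python sketch below checks this by trapezoidal integration; the function names, integration limits and difficulty values are assumptions made for illustration and do not come from the study:

```python
import math

def icc(theta: float, difficulty: float) -> float:
    """Rasch item characteristic curve: P(correct) at ability theta."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))

def signed_area(b_reference: float, b_focal: float,
                lo: float = -30.0, hi: float = 30.0, steps: int = 6000) -> float:
    """Trapezoidal estimate of the area between reference and focal ICCs."""
    h = (hi - lo) / steps
    total = 0.0
    for k in range(steps + 1):
        theta = lo + k * h
        weight = 0.5 if k in (0, steps) else 1.0  # trapezoid end-point weights
        total += weight * (icc(theta, b_reference) - icc(theta, b_focal))
    return total * h

# Hypothetical item: 0.5 logits harder for the focal group.
print(signed_area(b_reference=0.0, b_focal=0.5))
```

A positive value means the item is harder for the focal group at every ability level, which is what "uniform" DIF refers to.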
The Rasch Poisson counts model for incomplete data : An application of the EM algorithm

NARCIS (Netherlands)

Jansen, G.G.H.

Rasch's Poisson counts model is a latent trait model for the situation in which K tests are administered to N examinees and the test score is a count [e.g., the repeated occurrence of some event, such as the number of items completed or the number of items answered (in)correctly]. The Rasch Poisson
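As a rough illustration of the model described above, the Python sketch below uses the commonly cited multiplicative form, in which the Poisson rate for a person on a test is the product of a person parameter and a test parameter. The abstract is truncated, so the parameter names and values here are assumptions and not the author's notation. The sketch also shows a characteristic property of this form: the expected share of a person's total count that falls on each test does not depend on the person parameter.

```python
import math

def expected_count(ability: float, easiness: float) -> float:
    """Poisson rate for one person on one test (multiplicative form)."""
    return ability * easiness

def poisson_pmf(count: int, rate: float) -> float:
    """Probability of observing `count` events at the given rate."""
    return math.exp(-rate) * rate ** count / math.factorial(count)

def conditional_shares(ability: float, easinesses: list[float]) -> list[float]:
    """Expected share of a person's total count falling on each test."""
    rates = [expected_count(ability, e) for e in easinesses]
    total = sum(rates)
    return [r / total for r in rates]

easinesses = [2.0, 5.0, 3.0]  # hypothetical test parameters
print(conditional_shares(0.5, easinesses))  # low-ability person
print(conditional_shares(4.0, easinesses))  # high-ability person, same shares
```

The EM treatment of incomplete data that the title refers to is not sketched here.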
A Multi-Marker Genetic Association Test Based on the Rasch Model Applied to Alzheimer's Disease.

Directory of Open Access Journals (Sweden)

Wenjia Wang

Full Text Available Results from Genome-Wide Association Studies (GWAS) have shown that the genetic basis of complex traits often includes many genetic variants with small to moderate effects whose identification remains a challenging problem. In this context, multi-marker analysis at the gene and pathway level can complement traditional point-wise approaches that treat the genetic markers individually. In this paper we propose a novel statistical approach for multi-marker analysis based on the Rasch model. The method summarizes the categorical genotypes of SNPs by a generalized logistic function into a genetic score that can be used for association analysis. Through different sets of simulations, the false-positive rate and power of the proposed approach are compared to those of a set of existing methods, and the approach shows good performance. The application of the Rasch model to the Alzheimer's Disease (AD) ADNI GWAS dataset also allows a coherent interpretation of the results. Our analysis supports the idea that APOE is a major susceptibility gene for AD. Among the top genes selected by the proposed method, several could be functionally linked to AD. In particular, a pathway analysis of these genes also highlights the metabolism of cholesterol, which is known to play a key role in AD pathogenesis. Interestingly, many of these top genes can be integrated in a hypothetical signalling network.
Towards the development of clinical measures for spinal cord injury based on the International Classification of Functioning, Disability and Health with Rasch analyses.

Science.gov (United States)

Ballert, Carolina S; Stucki, Gerold; Biering-Sørensen, Fin; Cieza, Alarcos

2014-09-01

To determine whether the International Classification of Functioning, Disability and Health (ICF) categories relevant to spinal cord injury (SCI) can be integrated in clinical measures and to obtain insights to guide their future operationalization. Specific aims are to find out whether the ICF categories relevant to SCI fit a Rasch model taking into consideration the dimensionality found in previous investigations, local item dependencies, or differential item functioning. All second-level ICF categories collected in the Development of ICF Core Sets for SCI project in specialized centers within 15 countries from 2006 through 2008. Secondary data analysis. Adults (N=1048) with SCI from the early postacute and long-term living context. Not applicable. Two unidimensional Rasch analyses: one for the ICF categories from body functions and body structures components and another for the ICF categories from the activities and participation component. Results support good reliability and targeting of the ICF categories in both dimensions. In each dimension, few ICF categories were subject to misfit. Local item dependency was observed between ICF categories of the same chapters. Group effects for age and sex were observed only to a small extent. The validity of ICF categories to develop measures of functioning in SCI for clinical practice and research is to some extent supported. Model adjustments were suggested to further improve their operationalization and psychometrics. Copyright © 2014 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
A Rasch and factor analysis of the Functional Assessment of Cancer Therapy-General (FACT-G)

Directory of Open Access Journals (Sweden)

Selby Peter J

2007-04-01

Full Text Available Background: Although the Functional Assessment of Cancer Therapy – General questionnaire (FACT-G) has been validated, few studies have explored the factor structure of the instrument, in particular using non-sample dependent measurement techniques, such as Rasch Models. Furthermore, few studies have explored the relationship between item fit to the Rasch Model and clinical utility. The aim of this study was to investigate the dimensionality and measurement properties of the FACT-G with Rasch Models and Factor analysis. Methods: A factor analysis and Rasch analysis (Partial Credit Model) were carried out on the FACT-G completed by a heterogeneous sample of cancer patients (n = 465). For the Rasch analysis, item fit (infit mean squares ≥ 1.30), dimensionality and item invariance were assessed. The impact of removing misfitting items on the clinical utility of the subscales and FACT-G total scale was also assessed. Results: The factor analysis demonstrated a four factor structure of the FACT-G which broadly corresponded to the four subscales of the instrument. Internal consistency for these four scales was very good (Cronbach's alpha 0.72 – 0.85). The Rasch analysis demonstrated that each of the subscales and the FACT-G total scale had misfitting items (infit mean square ≥ 1.30). All these scales with the exception of the Social & Family Well-being Scale (SFWB) were unidimensional. When misfitting items were removed, the effect sizes and the clinical utility of the instrument were maintained for the subscales and the total FACT-G scores. Conclusion: The results of the traditional factor analysis and Rasch analysis of the FACT-G broadly agreed. Caution should be exercised when utilising the Social & Family Well-being scale and further work is required to determine whether this scale is best represented by two factors. Additionally, removing misfitting items from scales should be performed alongside an assessment of the impact on clinical utility.
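The infit mean square used above as the misfit criterion is an information-weighted fit statistic: the sum of squared residuals divided by the sum of the model variances. The study fitted a Partial Credit Model; the Python sketch below uses the simpler dichotomous Rasch case purely for illustration, and the function names and data are hypothetical:

```python
import math

def rasch_probability(ability: float, difficulty: float) -> float:
    """P(correct) under the dichotomous Rasch model, in logits."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))

def infit_mean_square(responses: list[int], abilities: list[float],
                      difficulty: float) -> float:
    """Information-weighted fit for one item across persons."""
    squared_residuals = 0.0
    variances = 0.0
    for x, theta in zip(responses, abilities):
        p = rasch_probability(theta, difficulty)
        squared_residuals += (x - p) ** 2  # observed minus expected, squared
        variances += p * (1.0 - p)         # binomial variance of the response
    return squared_residuals / variances

# Hypothetical data: able persons fail and less able persons succeed,
# so the item should be flagged against the 1.30 cut-off.
abilities = [-2.0, -1.0, 1.0, 2.0]
responses = [1, 1, 0, 0]
value = infit_mean_square(responses, abilities, difficulty=0.0)
print(f"infit mean square = {value:.2f}, misfit = {value >= 1.30}")
```

Values near 1.0 indicate that responses are about as variable as the model expects; values well above 1.0 indicate unpredictable responses to the item.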
The Mayo-Portland Participation Index: A brief and psychometrically sound measure of brain injury outcome.

Science.gov (United States)

Malec, James F

2004-12-01

To evaluate the internal consistency, interrater agreement, concurrent validity, and floor and ceiling effects of the 8-item Participation Index (M2PI) of the Mayo-Portland Adaptability Inventory (MPAI). M2PI data derived from MPAIs completed independently by the people with acquired brain injury undergoing evaluation, their significant others, and rehabilitation staff were submitted to Rasch Facets analysis to determine the internal consistency of each independent rater group and of composite measures that combined rater groups. Correlations with the full-scale MPAI were examined to assess concurrent validity, as was interrater agreement. Outpatient rehabilitation in academic physical medicine and rehabilitation department. People with acquired brain injury (N=134) consecutively seen for evaluation, significant others, and evaluating staff. Not applicable. The MPAI and M2PI. The M2PI showed satisfactory internal consistency, concurrent validity, interrater agreement, and minimal floor and ceiling effects, although evidence of rater bias was also apparent. Composite indices showed more desirable psychometric properties than ratings by individual rater groups. The M2PI, particularly in composite indices and with attention to rater biases, provides an outcome measure with satisfactory psychometric qualities and the potential to represent the varying perspectives of people with acquired brain injury, significant others, and rehabilitation staff.
Analysis of letter name knowledge using Rasch measurement.

Science.gov (United States)

Bowles, Ryan P; Skibbe, Lori E; Justice, Laura M

2011-01-01

Letter name knowledge (LNK) is a key predictor of later reading ability and has been emphasized strongly in recent educational policy. Studies of LNK have implicitly treated it as a unidimensional construct with all letters equally relevant to its measurement. However, some empirical research suggests that contextual factors can affect the measurement of LNK. In this study, we analyze responses from 909 children on measures of LNK using the Rasch model and its extensions, and consider two contextual factors: the format of assessment and the own-name advantage, which states that children are more likely to know letters in their own first names. Results indicate that both contextual factors have important impacts on measurement and that LNK does not meet the requirements of Rasch measurement even when accounting for the contextual factors. These findings introduce philosophical concerns for measurement of constrained skills which have limited content for assessment.
Genes, Culture and Conservatism - A Psychometric-Genetic Approach

NARCIS (Netherlands)

Schwabe, Inga; Jonker, Willem; van den Berg, Stéphanie Martine

2016-01-01

The Wilson−Patterson conservatism scale was psychometrically evaluated using homogeneity analysis and item response theory models. Results showed that this scale actually measures two different aspects in people: on the one hand people vary in their agreement with either conservative or liberal
Genes, culture and conservatism : A psychometric-genetic approach

NARCIS (Netherlands)

Schwabe, I.; Jonker, Wilfried; Van Den Berg, Stéphanie M.

2016-01-01

The Wilson-Patterson conservatism scale was psychometrically evaluated using homogeneity analysis and item response theory models. Results showed that this scale actually measures two different aspects in people: on the one hand people vary in their agreement with either conservative or liberal
Sample Size Determination for Rasch Model Tests

Science.gov (United States)

Draxler, Clemens

2010-01-01

This paper is concerned with supplementing statistical tests for the Rasch model so that, in addition to the probability of the error of the first kind (Type I probability), the probability of the error of the second kind (Type II probability) can be controlled at a predetermined level by basing the test on the appropriate number of observations.…
Exploring the measurement properties of the osteopathy clinical teaching questionnaire using Rasch analysis.

Science.gov (United States)

Vaughan, Brett

2018-01-01

Clinical teaching evaluations are common in health profession education programs to ensure students are receiving a quality clinical education experience. Questionnaires students use to evaluate their clinical teachers have been developed in professions such as medicine and nursing. The development of a questionnaire that is specifically for the osteopathy on-campus, student-led clinic environment is warranted. Previous work developed the 30-item Osteopathy Clinical Teaching Questionnaire. The current study utilised Rasch analysis to investigate the construct validity of the Osteopathy Clinical Teaching Questionnaire and provide evidence for the validity argument through fit to the Rasch model. Senior osteopathy students at four institutions in Australia, New Zealand and the United Kingdom rated their clinical teachers using the Osteopathy Clinical Teaching Questionnaire. Three hundred and ninety-nine valid responses were received and the data were evaluated for fit to the Rasch model. Reliability estimations (Cronbach's alpha and McDonald's omega) were also evaluated for the final model. The initial analysis demonstrated the data did not fit the Rasch model. Accordingly, modifications to the questionnaire were made including removing items, removing person responses, and rescoring one item. The final model contained 12 items and fit to the Rasch model was adequate. Support for unidimensionality was demonstrated through both the Principal Components Analysis/t-test, and the Cronbach's alpha and McDonald's omega reliability estimates. Analysis of the questionnaire using McDonald's omega hierarchical supported a general factor (quality of clinical teaching in osteopathy). The evidence for unidimensionality and the presence of a general factor support the calculation of a total score for the questionnaire as a sufficient statistic. Further work is now required to investigate the reliability of the 12-item Osteopathy Clinical Teaching Questionnaire to provide evidence
Rasch analysis of the Dutch version of the Oxford elbow score

Directory of Open Access Journals (Sweden)

de Haan J

2011-08-01

Full Text Available Jeroen de Haan1, Niels Schep2, Wim Tuinebreijer2, Peter Patka2, Dennis den Hartog2 1Department of Surgery and Traumatology, Westfriesgasthuis, Hoorn, the Netherlands; 2Department of Surgery and Traumatology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands. Background: The Oxford elbow score (OES) is a patient-rated, 12-item questionnaire that measures quality of life in relation to elbow disorders. This English questionnaire has been proven to be a reliable and valid instrument. Recently, the OES has been translated into Dutch and examined for its reliability, validity, and responsiveness in a group of Dutch patients with elbow pathology. The aim of this study was to analyze the Dutch version of the OES (OES-DV) in combination with Rasch analysis or the one-parameter item response theory to examine the structure of the questionnaire. Methods: The OES-DV was administered to 103 patients (68 female, 35 male). The mean age of the patients was 44.3 ± 14.7 (range 15–75) years. Rasch analysis was performed using the Winsteps® Rasch Measurement Version 3.70.1.1 and a rating scale parameterization. Results: The person separation index, which is a measure of person reliability, was excellent (2.30). All the items of the OES had a reasonable mean square infit or outfit value between 0.6 and 1.7. The item thresholds were ordered, so the categories can function as intended. Principal component analysis of the residuals partly confirmed the multidimensionality of the English version of the OES. The OES distinguished 3.4 strata, which indicates that about three ranges can be differentiated. Conclusion: Rasch analysis of the OES-DV showed that the data fit the stringent Rasch model. The multidimensionality of the English version of the OES was partly confirmed, and the four items of the function and three items of the pain domain were recognized as separate domains. The category rating scale of the OES-DV works well. The OES can
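The reported 3.4 strata are consistent with the commonly used conversion from a separation index G to the number of statistically distinct strata, (4G + 1) / 3. The abstract does not state which formula was applied, so the Python sketch below is an assumption that happens to reproduce the reported figure; the reliability conversion shown alongside it is the usual companion formula, and both function names are illustrative:

```python
def strata_from_separation(separation: float) -> float:
    """Number of statistically distinct levels implied by a separation index."""
    return (4.0 * separation + 1.0) / 3.0

def reliability_from_separation(separation: float) -> float:
    """Reliability coefficient implied by a separation index."""
    return separation ** 2 / (1.0 + separation ** 2)

g = 2.30  # person separation index reported in the abstract
print(f"strata = {strata_from_separation(g):.1f}")
print(f"reliability = {reliability_from_separation(g):.2f}")
```

With G = 2.30 the strata formula gives (9.2 + 1) / 3 = 3.4, matching the value in the Results.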
Psychometric evaluation of the Sibling Cancer Needs Instrument (SCNI): an instrument to assess the psychosocial unmet needs of young people who are siblings of cancer patients.

Science.gov (United States)

Patterson, P; McDonald, F E J; Butow, P; White, K J; Costa, D S J; Millar, B; Bell, M L; Wakefield, C E; Cohn, R J

2014-03-01

The current study sought to establish the psychometric properties of the revised Sibling Cancer Needs Instrument (SCNI) when completed by young people who have a brother or sister with cancer. The participants were 106 young people aged between 12 and 24 who had a living brother or sister diagnosed with any type or stage of cancer in the last 5 years. They were recruited from multiple settings. The initial step in determining the dimensional structure of the questionnaire was exploratory factor analysis and further assessment followed using Rasch analysis. Construct validity and test-retest reliability (n = 17) were also assessed. The final SCNI has 45 items and seven domains: information; practical assistance; "time out" and recreation; feelings; support (friends and other young people); understanding from my family; and sibling relationship. There was a reasonable spread of responses across the scale for every item. Rasch analysis results suggested that overall, respondents used the scale consistently. Support for construct validity was provided by the correlations between psychological distress and the SCNI domains. The internal consistency was good to excellent; Cronbach's alphas ranged from 0.78 to 0.94. The test-retest reliability of the overall measure is 0.88. The SCNI is the first measure of psychosocial unmet needs which has been developed for young people who have a brother or sister with cancer. The sound psychometric properties allow the instrument to be used with confidence. The measure will provide a substantial clinical benefit in highlighting the unmet needs of this population to assist with the prioritisation of targeted supportive care services and evaluating the impact of interventions targeted at siblings.
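Cronbach's alpha, reported above as the internal-consistency measure, compares the sum of the item variances with the variance of the total score. A minimal Python sketch using only the standard library; the function name and the response matrix are hypothetical and are not data from the study:

```python
from statistics import variance

def cronbach_alpha(scores: list[list[float]]) -> float:
    """Cronbach's alpha for a respondents-by-items score matrix."""
    n_items = len(scores[0])
    # Sample variance of each item across respondents.
    item_variances = [variance([row[j] for row in scores]) for j in range(n_items)]
    # Sample variance of each respondent's total score.
    total_variance = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1.0 - sum(item_variances) / total_variance)

# Hypothetical responses from five respondents to four items.
responses = [
    [3, 4, 3, 4],
    [2, 2, 3, 2],
    [4, 4, 4, 5],
    [1, 2, 1, 2],
    [3, 3, 4, 3],
]
print(f"alpha = {cronbach_alpha(responses):.2f}")
```

Alpha approaches 1 when items rise and fall together across respondents, and drops toward 0 when item scores are unrelated.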

Teachers' Checklist on Reading-Related Behavioral Characteristics of Chinese Primary Students: A Rasch Measurement Model Analysis

Science.gov (United States)

Chan, David W.; Ho, Connie Suk-han; Chung, Kevin K. H.; Tsang, Suk-man; Lee, Suk-han

2010-01-01

Data of item responses to the Hong Kong Specific Learning Difficulties Behaviour Checklist from 673 Chinese primary grade students were analyzed using the dichotomous Rasch measurement model. Rasch scaling suggested that the data fit the model adequately with a latent dimension of global dyslexic dysfunctioning. Estimates of item attributes and…
Rasch-built Overall Disability Scale (R-ODS) for immune-mediated peripheral neuropathies.

Science.gov (United States)

van Nes, S I; Vanhoutte, E K; van Doorn, P A; Hermans, M; Bakkers, M; Kuitwaard, K; Faber, C G; Merkies, I S J

2011-01-25

To develop a patient-based, linearly weighted scale that captures activity and social participation limitations in patients with Guillain-Barré syndrome (GBS), chronic inflammatory demyelinating polyradiculoneuropathy (CIDP), and gammopathy-related polyneuropathy (MGUSP). A preliminary Rasch-built Overall Disability Scale (R-ODS) containing 146 activity and participation items was constructed, based on the WHO International Classification of Functioning, Disability and Health, literature search, and patient interviews. The preliminary R-ODS was assessed twice (interval: 2-4 weeks; test-retest reliability studies) in 294 patients who experienced GBS in the past (n = 174) or currently have stable CIDP (n = 80) or MGUSP (n = 40). Data were analyzed using the Rasch unidimensional measurement model (RUMM2020). The preliminary R-ODS did not meet the Rasch model expectations. Based on disordered thresholds, misfit statistics, item bias, and local dependency, items were systematically removed to improve the model fit, regularly controlling the class intervals and model statistics. Finally, we succeeded in constructing a 24-item scale that fulfilled all Rasch requirements. "Reading a newspaper/book" and "eating" were the 2 easiest items; "standing for hours" and "running" were the most difficult ones. Good validity and reliability were obtained. The R-ODS is a linearly weighted scale that specifically captures activity and social participation limitations in patients with GBS, CIDP, and MGUSP. Compared to the Overall Disability Sum Score, the R-ODS represents a wider range of item difficulties, thereby better targeting patients with different ability levels. If responsive, the R-ODS will be valuable for future clinical trials and follow-up studies in these conditions.
Rasch Analysis of Professional Behavior in Medical Education

Science.gov (United States)

Lange, R.; Verhulst, S. J.; Roberts, N. K.; Dorsey, J. K.

2015-01-01

The use of students' "consumer feedback" to assess faculty behavior and improve the process of medical education is a significant challenge. We used quantitative Rasch measurement to analyze pre-categorized student comments listed by 385 graduating medical students. We found that students differed little with respect to the number of…
A Cross-Cultural Validation of Stage Development: A Rasch Re-Analysis of Longitudinal Socio-Moral Reasoning Data

Science.gov (United States)

Boom, Jan; Wouters, Hans; Keller, Monika

2007-01-01

Kohlberg's characterization of moral development as displaying an invariant hierarchical order of structurally consistent stages is losing ground. However, by applying Rasch analysis, Dawson recently gave new interpretation and support to his characterization of stage development. Using Rasch models, we replicated and strengthened her findings in…
Factors Influencing Singapore Students' Choice of Physics as a Tertiary Field of Study: A Rasch analysis

Science.gov (United States)

Oon, Pey-Tee; Subramaniam, R.

2013-01-01

Asian students often perform well in international science and mathematics assessments. Their attitude toward technical subjects, such as physics, remains curious for many. The present study examines Singapore school students' views on various aspects of physics according to whether they intend to choose physics as an advanced field of study. A sample of 1076 physics students, from 16 secondary schools and junior colleges, participated in this study. The students were categorized as physics choosers or non-choosers according to their indicated intention, as sought in the survey, to study or not to study physics as a major subject at university after their leaving level examinations. Rasch-anchored analysis was employed to interpret the results; the use of Rasch analysis has helped to overcome significantly the psychometric limitations inherent in the treatment of Likert scale type of data using traditional analysis. As expected, the image of physics as a difficult subject surfaced in the samples used in our study. The students recognized unequivocally the utilitarian value of physics: physics is said to enhance career options and is necessary for technological progress to occur in a country. They also showed high interest in school physics; this is so even for students who are not keen to study physics in the future, a finding which is at variance with other studies reported from Western countries. School physics is seen to be relevant, and physics teachers are viewed as being able to foster students' interest in physics. Laboratory work, enrichment activities, and physics textbooks were reported to be important in order to encourage students to like physics. Though the physics choosers showed greater intention in physics, they were generally not inclined to pursue physics-related careers after graduation. Parents and peers at school, on the other hand, are perceived to display unenthusiastic attitudes toward physics. Possible reasons for these are discussed along
Rasch model based analysis of the Force Concept Inventory

Directory of Open Access Journals (Sweden)

Maja Planinic

2010-03-01

The Force Concept Inventory (FCI) is an important diagnostic instrument which is widely used in the field of physics education research. It is therefore very important to evaluate and monitor its functioning using different tools for statistical analysis. One such tool is the stochastic Rasch model, which enables construction of linear measures for persons and items from raw test scores and which can provide important insight into the structure and functioning of the test (how item difficulties are distributed within the test, how well the items fit the model, and how well the items work together to define the underlying construct). The data for the Rasch analysis come from large-scale research conducted in 2006-07, which investigated Croatian high school students’ conceptual understanding of mechanics on a representative sample of 1676 students (age 17–18 years). The instrument used in the research was the FCI. The average FCI score for the whole sample was found to be (27.7±0.4)%, indicating that most of the students were still non-Newtonians at the end of high school, despite the fact that physics is a compulsory subject in Croatian schools. The large set of obtained data was analyzed with the Rasch measurement computer software WINSTEPS 3.66. Since the FCI is routinely used as pretest and post-test on two very different types of population (non-Newtonian and predominantly Newtonian), an additional predominantly Newtonian sample (N=141, average FCI score of 64.5%) of first year students enrolled in an introductory physics course at the University of Zagreb was also analyzed. The Rasch model based analysis suggests that the FCI has succeeded in defining a sufficiently unidimensional construct for each population. The analysis of fit of data to the model found no grossly misfitting items which would degrade measurement. Some items with larger misfit and items with significantly different difficulties in the two samples of students do require further
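The phrase "linear measures for persons and items from raw test scores" can be illustrated with a minimal sketch of the dichotomous Rasch model. This is not the WINSTEPS procedure used in the study; the function names are hypothetical, and the log-odds transform shown is only a rough first-pass approximation to a calibrated measure.

```python
import math


def rasch_probability(ability: float, difficulty: float) -> float:
    """Probability of a correct response under the dichotomous Rasch model.

    Both arguments are in logits; only their difference matters.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def raw_score_to_logit(score: int, max_score: int) -> float:
    """Log-odds of a raw score (hypothetical helper, not a full calibration).

    Extreme scores (0 or max_score) have no finite logit, so they are rejected.
    """
    if not 0 < score < max_score:
        raise ValueError("extreme scores have no finite logit")
    return math.log(score / (max_score - score))
```

When ability equals difficulty the model gives a success probability of 0.5, which is why persons and items can be placed on one common logit scale.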
Investigating Psychometric Isomorphism for Traditional and Performance-Based Assessment

Science.gov (United States)

Fay, Derek M.; Levy, Roy; Mehta, Vandhana

2018-01-01

A common practice in educational assessment is to construct multiple forms of an assessment that consists of tasks with similar psychometric properties. This study utilizes a Bayesian multilevel item response model and descriptive graphical representations to evaluate the psychometric similarity of variations of the same task. These approaches for…
TESTING THE ASSUMPTIONS AND INTERPRETING THE RESULTS OF THE RASCH MODEL USING LOG-LINEAR PROCEDURES IN SPSS

NARCIS (Netherlands)

TENVERGERT, E; GILLESPIE, M; KINGMA, J

This paper shows how to use the log-linear subroutine of SPSS to fit the Rasch model. It also shows how to fit less restrictive models obtained by relaxing specific assumptions of the Rasch model. Conditional maximum likelihood estimation was achieved by including dummy variables for the total
Comparative study of middle school students' attitudes towards science: Rasch analysis of entire TIMSS 2011 attitudinal data for England, Singapore and the U.S.A. as well as psychometric properties of attitudes scale

Science.gov (United States)

Pey Tee, Oon; Subramaniam, R.

2018-02-01

We report here on a comparative study of middle school students' attitudes towards science involving three countries: England, Singapore and the U.S.A. Complete attitudinal data sets from TIMSS (Trends in International Mathematics and Science Study) 2011 were used, thus giving a very large sample size (N = 20,246), compared to other studies in the journal literature. The Rasch model was used to analyse the data, and the findings have shed some useful light on not only how the Western and Asian students responded on a comparative basis in the various scales related to attitudes but also on the validity, reliability, and unidimensionality of the attitudes instrument used in TIMSS 2011. There may be a need for TIMSS test developers to consider doing away with negatively phrased items in the attitudes instrument and phrasing these positively as the Rasch framework shows that response bias is associated with these statements.
Psychometric Validation of the BODY-Q in Danish Patients Undergoing Weight Loss and Body Contouring Surgery

DEFF Research Database (Denmark)

Poulsen, Lotte; Klassen, Anne; Rose, Michael

2017-01-01

study aims to psychometrically validate the BODY-Q for use in Danish patients. Methods: The process consisted of 3 stages: translation and linguistic validation, field-test, and data analysis. The translation was performed in accordance with the International Society for Pharmacoeconomics and Outcomes...... assessments with an overall response rate at 76%. Cronbach α values were ≥ 0.90, and person separation index values were in general high. The Rasch Measurement Theory analysis provided broad support for the reliability and validity of the Danish version of the BODY-Q scales. Item fit was outside the criteria...... for 34 of 138 items, and of these, 21 had a significant chi-square P value after Bonferroni adjustment. Most items (128 of 138) had ordered thresholds, indicating that response options worked as intended. Conclusion: The Danish version of the BODY-Q is a reliable and valid patient-reported outcome...
Rasch-built Overall Disability Scale for patients with chemotherapy-induced peripheral neuropathy (CIPN-R-ODS).

Science.gov (United States)

Binda, D; Vanhoutte, E K; Cavaletti, G; Cornblath, D R; Postma, T J; Frigeni, B; Alberti, P; Bruna, J; Velasco, R; Argyriou, A A; Kalofonos, H P; Psimaras, D; Ricard, D; Pace, A; Galiè, E; Briani, C; Dalla Torre, C; Lalisang, R I; Boogerd, W; Brandsma, D; Koeppen, S; Hense, J; Storey, D; Kerrigan, S; Schenone, A; Fabbri, S; Rossi, E; Valsecchi, M G; Faber, C G; Merkies, I S J; Galimberti, S; Lanzani, F; Mattavelli, L; Piatti, M L; Bidoli, P; Cazzaniga, M; Cortinovis, D; Lucchetta, M; Campagnolo, M; Bakkers, M; Brouwer, B; Boogerd, W; Grant, R; Reni, L; Piras, B; Pessino, A; Padua, L; Granata, G; Leandri, M; Ghignotti, I; Plasmati, R; Pastorelli, F; Heimans, J J; Eurelings, M; Meijer, R J; Grisold, W; Lindeck Pozza, E; Mazzeo, A; Toscano, A; Russo, M; Tomasello, C; Altavilla, G; Penas Prado, M; Dominguez Gonzalez, C; Dorsey, S G

2013-09-01

Chemotherapy-induced peripheral neuropathy (CIPN) is a common neurological side-effect of cancer treatment and may lead to declines in patients' daily functioning and quality of life. To date, there are no modern clinimetrically well-evaluated outcome measures available to assess disability in CIPN patients. The objective of the study was to develop an interval-weighted scale to capture activity limitations and participation restrictions in CIPN patients using the Rasch methodology and to determine its validity and reliability properties. A preliminary Rasch-built Overall Disability Scale (pre-R-ODS) comprising 146 items was assessed twice (interval: 2-3 weeks; test-retest reliability) in 281 CIPN patients with a stable clinical condition. The obtained data were subjected to Rasch analyses to determine whether model expectations would be met, and if necessary, adaptations were made to obtain proper model fit (internal validity). External validity was obtained by correlating the CIPN-R-ODS with the National Cancer Institute-Common Toxicity Criteria (NCI-CTC) neuropathy scales and the Pain-Intensity Numeric-Rating-Scale (PI-NRS). The preliminary R-ODS did not meet the Rasch model's expectations. Items displaying misfit statistics, disordered thresholds, item bias or local dependency were systematically removed. The final CIPN-R-ODS consisting of 28 items fulfilled all the model's expectations with proper validity and reliability, and was unidimensional. The final CIPN-R-ODS is a Rasch-built disease-specific, interval measure suitable to detect disability in CIPN patients and bypasses the shortcomings of classical test theory ordinal-based measures. Its use is recommended in future clinical trials in CIPN. Copyright © 2013 Elsevier Ltd. All rights reserved.
Metrology of human-based and other qualitative measurements

Science.gov (United States)

Pendrill, Leslie; Petersson, Niclas

2016-09-01

The metrology of human-based and other qualitative measurements is in its infancy—concepts such as traceability and uncertainty are as yet poorly developed. This paper reviews how a measurement system analysis approach, particularly invoking as performance metric the ability of a probe (such as a human being) acting as a measurement instrument to make a successful decision, can enable a more general metrological treatment of qualitative observations. Measures based on human observations are typically qualitative, not only in sectors, such as health care, services and safety, where the human factor is obvious, but also in customer perception of traditional products of all kinds. A principal challenge is that the usual tools of statistics normally employed for expressing measurement accuracy and uncertainty will probably not work reliably if relations between distances on different portions of scales are not fully known, as is typical of ordinal or other qualitative measurements. A key enabling insight is to connect the treatment of decision risks associated with measurement uncertainty to generalized linear modelling (GLM). Handling qualitative observations in this way unites information theory, the perceptive identification and choice paradigms of psychophysics. The Rasch invariant measure psychometric GLM approach in particular enables a proper treatment of ordinal data; a clear separation of probe and item attribute estimates; simple expressions for instrument sensitivity; etc. Examples include two aspects of the care of breast cancer patients, from diagnosis to rehabilitation. The Rasch approach leads in turn to opportunities of establishing metrological references for quality assurance of qualitative measurements. In psychometrics, one could imagine a certified reference for knowledge challenge, for example, a particular concept in understanding physics or for product quality of a certain health care service. Multivariate methods, such as Principal Component
Improving the Individual Work Performance Questionnaire using Rasch analysis.

NARCIS (Netherlands)

Koopmans, L.; Bernaards, C.M.; Hildebrandt, V.H.; Buuren, S. van; Beek, A.J. van der; Vet, H.C.W. de

2014-01-01

Recently, the Individual Work Performance Questionnaire (IWPQ) version 0.2 was developed using Rasch analysis. The goal of the current study was to improve targeting of the IWPQ scales by including additional items. The IWPQ 0.2 (original) and 0.3 (including additional items) were examined using
USING RASCH ANALYSIS TO EXPLORE WHAT STUDENTS LEARN ABOUT PROBABILITY CONCEPTS

Directory of Open Access Journals (Sweden)

Zamalia Mahmud

2015-01-01

Students’ understanding of probability concepts has been investigated from various perspectives. This study set out to investigate the perceived understanding of probability concepts of forty-four students from the STAT131 Understanding Uncertainty and Variation course at the University of Wollongong, NSW. Rasch measurement, which is based on a probabilistic model, was used to identify concepts that students find easy, moderate and difficult to understand. Data were captured from the e-learning Moodle platform, where students provided their responses through an on-line quiz. As illustrated in the Rasch map, 96% of the students could understand sample space, simple events, mutually exclusive events and tree diagrams, while 67% of the students found the concepts of conditional and independent events rather easy to understand.
An Analysis of Cross Racial Identity Scale Scores Using Classical Test Theory and Rasch Item Response Models

Science.gov (United States)

Sussman, Joshua; Beaujean, A. Alexander; Worrell, Frank C.; Watson, Stevie

2013-01-01

Item response models (IRMs) were used to analyze Cross Racial Identity Scale (CRIS) scores. Rasch analysis scores were compared with classical test theory (CTT) scores. The partial credit model demonstrated a high goodness of fit and correlations between Rasch and CTT scores ranged from 0.91 to 0.99. CRIS scores are supported by both methods.…
Rasch scaling of the Oswestry Disability Index and the Roland-Morris Disability Questionnaire

DEFF Research Database (Denmark)

Lauridsen, Henrik Hein; Hartvigsen, Jan

Questionnaire (RMQ) and the Oswestry Disability Index (ODI), however, only few studies have tested these questionnaires using Rasch analysis. This study used Rasch scaling to test the construct validity of the Danish versions of the RMQ (23-item Patrick version) and the ODI (version 2.1a) in a heterogeneous...... on an ordinal scale into interval scaling in addition to optimising the fit of instrument items to the target population. In low back pain research the two most commonly used and well-validated questionnaires to assess functional status in patients with low back pain are the Roland-Morris Disability...
Cross-cultural validity of the Spanish version of PHQ-9 among pregnant Peruvian women: a Rasch item response theory analysis.

Science.gov (United States)

Zhong, Qiuyue; Gelaye, Bizu; Fann, Jesse R; Sanchez, Sixto E; Williams, Michelle A

2014-04-01

We sought to evaluate the validity of the Spanish language version of the patient health questionnaire-9 (PHQ-9) depression scale in a large sample of pregnant Peruvian women using Rasch item response theory (IRT) approaches. We further sought to examine the appropriateness of the response formats, reliability and potential differential item functioning (DIF) by maternal age, educational attainment and employment status. This cross-sectional study was conducted among 1520 pregnant women in Lima, Peru. A structured interview was used to collect information on demographic characteristics and PHQ-9 items. Data from the PHQ-9 were fitted to the Rasch IRT model and tested for appropriate category ordering, the assumptions of unidimensionality and local independence, item fit, reliability and presence of DIF. The Spanish language version of PHQ-9 demonstrated unidimensionality, local independence, and acceptable fit for the Rasch IRT model. However, we detected disordered response categories for the original four response categories. After collapsing "more than half the days" and "nearly every day", the response categories ordered properly and the PHQ-9 fit the Rasch IRT model. The PHQ-9 had moderate internal consistency (person separation index, PSI=0.72). Additionally, the items of PHQ-9 were free of DIF with regard to age, educational attainment, and employment status. The Spanish language version of the PHQ-9 was shown to have item properties of an effective screening instrument. Collapsing rating scale categories and reconstructing three-point Likert scale for all items improved the fit of the instrument. Future studies are warranted to establish new cutoff scores and criterion validity of the three-point Likert scale response options for the Spanish language version of the PHQ-9. Copyright © 2014 Elsevier B.V. All rights reserved.
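The category-collapsing step described above can be sketched as a simple recode. This assumes the usual 0 to 3 coding of PHQ-9 responses, with 2 for "more than half the days" and 3 for "nearly every day"; the mapping name and helper functions are hypothetical and are not taken from the study.

```python
from collections import Counter

# Assumed coding: 0..3 in the original four-category format.
# The two highest categories are merged into one, giving a three-point scale.
COLLAPSE_TOP_TWO = {0: 0, 1: 1, 2: 2, 3: 2}


def collapse_categories(responses, mapping=COLLAPSE_TOP_TWO):
    """Recode every response of every person with the given category mapping."""
    return [[mapping[r] for r in person] for person in responses]


def category_counts(responses):
    """Count how often each category is used across all persons and items."""
    return Counter(r for person in responses for r in person)
```

After recoding, the Rasch model would be refitted to the three-category data to check whether the thresholds are now ordered; that refit is outside the scope of this sketch.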
Quantitative Reasoning in Environmental Science: Rasch Measurement to Support QR Assessment

Directory of Open Access Journals (Sweden)

Robert L. Mayes

2015-07-01

The ability of middle and high school students to reason quantitatively within the context of environmental science was investigated. A quantitative reasoning (QR) learning progression, with associated QR assessments in the content areas of biodiversity, water, and carbon, was developed based on three QR progress variables: quantification act, quantitative interpretation, and quantitative modeling. Diagnostic instruments were developed specifically for the progress variable quantitative interpretation (QI), each consisting of 96 Likert-scale items. Each content version of the instrument focused on three scale levels (macro scale, micro scale, and landscape scale) and four elements of QI identified in prior research (trend, translation, prediction, and revision). The QI assessments were completed by 362 students in grades 6 to 12 in three U.S. states. Rasch (1960/1980) measurement was used to determine item and person measures for the QI instruments, both to examine validity and reliability characteristics of the instrument administration and to inform the evolution of the learning progression. Rasch methods allowed identification of several QI instrument revisions, including modifying specific items, reducing the number of items to avoid cognitive fatigue, reconsidering proposed item difficulty levels, and reducing the Likert scale to 4 levels. Rasch diagnostics also indicated favorable levels of instrument reliability and appropriate targeting of item abilities to student abilities for the majority of participants. A revised QI instrument is available for STEM researchers and educators.

Psychometric evaluation of the Korean Version of the Self-Efficacy for Exercise Scale for older adults.

Science.gov (United States)

Choi, Mona; Ahn, Sangwoo; Jung, Dukyoo

2015-01-01

We evaluated the psychometric properties of the Korean version of the Self-Efficacy for Exercise Scale (SEE-K). The SEE-K consists of nine items and was translated into Korean using the forward-backward translation method. We administered it to 212 community-dwelling older adults along with measures of outcome expectation for exercise, quality of life, and physical activity. The validity was determined using confirmatory factor analysis and Rasch analysis with INFIT and OUTFIT statistics, which showed acceptable model fit. The concurrent validity was confirmed according to positive correlations between the SEE-K, outcome expectation for exercise, and quality of life. Furthermore, the high physical activity group had higher SEE-K scores. Finally, the reliability of the SEE-K was deemed acceptable based on Cronbach's alpha, coefficients of determination, and person and item separation indices with reliability. Thus, the SEE-K appears to have satisfactory validity and reliability among older adults in South Korea. Copyright © 2015 Elsevier Inc. All rights reserved.
Exploring Variability Sources in Student Evaluation of Teaching via Many-Facet Rasch Model

Directory of Open Access Journals (Sweden)

Bengü BÖRKAN

2017-03-01

Evaluating the quality of teaching is important in nearly every higher education institution. The most common way of assessing teaching effectiveness is through students. Student Evaluation of Teaching (SET) is used to gather information about students’ experiences with a course and the instructor’s performance at some point in the semester. SET can be considered a type of rater-mediated performance assessment in which students are the raters and instructors are the examinees. When performance assessment becomes a rater-mediated assessment process, extra measures need to be taken in order to create more reliable and fair assessment practices. The study has two main purposes: (a) to examine the extent to which the facets (instructor, student, and rating items) contribute to instructors’ score variance and (b) to examine the students’ judging behavior in order to detect any potential source of bias in student evaluation of teaching by using the Many-Facet Rasch Model. The data set includes 1,235 students’ responses from 254 courses. The results show that (a) students differ greatly in severity when rating instructors, (b) students were fairly consistent in their ratings, (c) students, at both the group and the individual level, tend to display a halo effect in their ratings, (d) students are clustered at the highest two categories of the scale, and (e) the variation in item measures is fairly low. The findings have practical implications for SET practices by improving the psychometric quality of measurement.
Rasch Calibration of Perceived Weights of Different Sports Games

Science.gov (United States)

Kang, Sang-Jo; Kang, Minsoo

2006-01-01

In many countries, an athlete's performance at sporting competitions is often used as part of the selection criteria for entry into college. These criteria could be biased depending upon the procedures utilized by the authorities in a particular country. The purpose of this study was to calibrate, by using the Rasch rating scale model, the…
Emotional Intelligence and Nurse Recruitment: Rasch and confirmatory factor analysis of the trait emotional intelligence questionnaire short form.

Science.gov (United States)

Snowden, Austyn; Watson, Roger; Stenhouse, Rosie; Hale, Claire

2015-12-01

To examine the construct validity of the Trait Emotional Intelligence Questionnaire Short form. Emotional intelligence involves the identification and regulation of our own emotions and the emotions of others. It is therefore a potentially useful construct in the investigation of recruitment and retention in nursing and many questionnaires have been constructed to measure it. Secondary analysis of existing dataset of responses to Trait Emotional Intelligence Questionnaire Short form using concurrent application of Rasch analysis and confirmatory factor analysis. First year undergraduate nursing and computing students completed Trait Emotional Intelligence Questionnaire-Short Form in September 2013. Responses were analysed by synthesising results of Rasch analysis and confirmatory factor analysis. Participants (N = 938) completed Trait Emotional Intelligence Questionnaire Short form. Rasch analysis showed the majority of the Trait Emotional Intelligence Questionnaire-Short Form items made a unique contribution to the latent trait of emotional intelligence. Five items did not fit the model and differential item functioning (gender) accounted for this misfit. Confirmatory factor analysis revealed a four-factor structure consisting of: self-confidence, empathy, uncertainty and social connection. All five misfitting items from the Rasch analysis belonged to the 'social connection' factor. The concurrent use of Rasch and factor analysis allowed for novel interpretation of Trait Emotional Intelligence Questionnaire Short form. Much of the response variation in Trait Emotional Intelligence Questionnaire Short form can be accounted for by the social connection factor. Implications for practice are discussed. © 2015 John Wiley & Sons Ltd.
Measuring Instrument Constructs of Return Factors for Green Office Building Investments Variables Using Rasch Measurement Model

Directory of Open Access Journals (Sweden)

Isa Mona

2016-01-01

This paper is a preliminary study on rationalising green office building investments in Malaysia. The aim of this paper is to introduce the application of Rasch measurement model analysis to determine the validity and reliability of each construct in the questionnaire. To achieve this objective, a questionnaire survey consisting of 6 sections was developed, and a total of 106 responses were received from various investors who own and lease office buildings in Kuala Lumpur. Rasch measurement analysis is used for quality control of the item constructs in the instrument by measuring the specific objectivity within the same dimension, reducing ambiguous measures, and providing a realistic estimation of precision and implicit quality. The Rasch analysis consists of the summary statistics, item unidimensionality and item measures. The results show that item and respondent (person) reliability are 0.91 and 0.95, respectively.
Obtaining Content Weights for Test Specifications from Job Analysis Task Surveys: An Application of the Many-Facets Rasch Model

Science.gov (United States)

Wang, Ning; Stahl, John

2012-01-01

This article discusses the use of the Many-Facets Rasch Model, via the FACETS computer program (Linacre, 2006a), to scale job/practice analysis survey data as well as to combine multiple rating scales into single composite weights representing the tasks' relative importance. Results from the Many-Facets Rasch Model are compared with those…
Rasch Validation of a Measure of Reform-Oriented Science Teaching Practices

Science.gov (United States)

You, Hye Sun

2016-06-01

Growing evidence from recent curriculum documents and previous research suggests that reform-oriented science teaching practices promote students' conceptual understanding, levels of achievement, and motivation to learn, especially when students are actively engaged in constructing their ideas through scientific inquiries. However, it is difficult to identify to what extent science teachers engage students in reform-oriented teaching practices (RTPs) in their science classrooms. In order to exactly diagnose the current status of science teachers' implementation of the RTPs, a valid and reliable instrument tool is needed. The principles of validity and reliability are fundamental cornerstones in developing a robust measurement tool. As such, this study was motivated by the desire to point out the limitations of the existing statistical and psychometric analyses and to further examine the validation of the RTP survey instrument. This paper thus aims at calibrating the items of the RTPs for science teachers using the Rasch model. The survey instrument scale was adapted from the 2012 National Survey of Science and Mathematics Education (NSSME) data. A total of 3701 science teachers from 1403 schools from across the USA participated in the NSSME survey. After calibrating the RTP items and persons on the same scale, the RTP instrument well represented the population of US science teachers. Model-data fit determined by Infit and Outfit statistics was within an appropriate range (0.5-1.5), supporting the unidimensional structure of the RTPs. The ordered category thresholds and the probability of the thresholds showed that the five-point rating scale functioned well. The results of this study support the use of the RTP measure from the 2012 NSSME in assessing usage of RTPs.
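The Infit and Outfit statistics mentioned above are mean-square summaries of residuals between observed and model-expected responses. The sketch below computes them for a single dichotomous item, which is a simplifying assumption: the study itself used a five-point rating scale, and the function names here are hypothetical. The 0.5 to 1.5 acceptance range is the one quoted in the abstract.

```python
import math


def expected_score(ability: float, difficulty: float) -> float:
    """Model-expected score for a dichotomous Rasch item."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def item_fit(responses, abilities, difficulty):
    """Return (infit, outfit) mean squares for one dichotomous item.

    Outfit is the unweighted mean of squared standardized residuals;
    infit weights each squared residual by its model variance.
    """
    squared_residuals = []
    variances = []
    for x, theta in zip(responses, abilities):
        p = expected_score(theta, difficulty)
        squared_residuals.append((x - p) ** 2)
        variances.append(p * (1.0 - p))
    infit = sum(squared_residuals) / sum(variances)
    outfit = sum(r / v for r, v in zip(squared_residuals, variances)) / len(variances)
    return infit, outfit


def within_range(value: float, low: float = 0.5, high: float = 1.5) -> bool:
    """Check a mean square against the range quoted in the abstract."""
    return low <= value <= high
```

Values near 1.0 indicate that the observed variation matches what the model predicts; values well above 1.0 suggest noise, and values well below 1.0 suggest overly predictable responses.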
Assessment of upper limb capacity in children with unilateral cerebral palsy: construct validity of a Rasch-reduced Modified House Classification

NARCIS (Netherlands)

Geerdink, Yvonne; Lindeboom, Robert; de Wolf, Sander; Steenbergen, Bert; Geurts, Alexander C. H.; Aarts, Pauline

2014-01-01

The aim of this study was to test and improve the unidimensionality and item hierarchy of the Modified House Classification (MHC) for the assessment of upper limb capacity in children with unilateral cerebral palsy (CP) using Rasch analysis. The construct validity of the Rasch-reduced item set was
Rasch Measurement of Collaborative Problem Solving in an Online Environment.

Science.gov (United States)

Harding, Susan-Marie E; Griffin, Patrick E

2016-01-01

This paper describes an approach to the assessment of human to human collaborative problem solving using a set of online interactive tasks completed by student dyads. Within the dyad, roles were nominated as either A or B and students selected their own roles. The question as to whether role selection affected individual student performance measures is addressed. Process stream data was captured from 3402 students in six countries who explored the problem space by clicking, dragging the mouse, moving the cursor and collaborating with their partner through a chat box window. Process stream data were explored to identify behavioural indicators that represented elements of a conceptual framework. These indicative behaviours were coded into a series of dichotomous items. These items represented actions and chats performed by students. The frequency of occurrence was used as a proxy measure of item difficulty. Then given a measure of item difficulty, student ability could be estimated using the difficulty estimates of the range of items demonstrated by the student. The Rasch simple logistic model was used to review the indicators to identify those that were consistent with the assumptions of the model and were invariant across national samples, language, curriculum and age of the student. The data were analysed using a one and two dimension, one parameter model. Rasch separation reliability, fit to the model, distribution of students and items on the underpinning construct, estimates for each country and the effect of role differences are reported. This study provides evidence that collaborative problem solving can be assessed in an online environment involving human to human interaction using behavioural indicators shown to have a consistent relationship between the estimate of student ability, and the probability of demonstrating the behaviour.
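The relationship this record describes between student ability, item difficulty, and the probability of demonstrating a behaviour can be sketched as follows. This is an illustration of the standard dichotomous Rasch (simple logistic) model only; the function names are hypothetical, and the log-odds transform of occurrence frequency is one common rough proxy for difficulty, not necessarily the procedure the authors used.

```python
import math


def rasch_probability(theta, b):
    """Probability that a person with ability theta shows a behaviour of difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def difficulty_from_frequency(p):
    """Rough difficulty proxy (in logits) from the proportion p of students showing a behaviour.

    Rare behaviours (small p) map to high difficulty; common ones map to low difficulty.
    """
    if not 0.0 < p < 1.0:
        raise ValueError("p must lie strictly between 0 and 1")
    return math.log((1.0 - p) / p)
```

Under this model the probability depends only on the difference between ability and difficulty, so a student whose ability equals an item's difficulty has a 50% chance of demonstrating that behaviour.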
Four theorems on the psychometric function.

Science.gov (United States)

May, Keith A; Solomon, Joshua A

2013-01-01

In a 2-alternative forced-choice (2AFC) discrimination task, observers choose which of two stimuli has the higher value. The psychometric function for this task gives the probability of a correct response for a given stimulus difference, Δx. This paper proves four theorems about the psychometric function. Assuming the observer applies a transducer and adds noise, Theorem 1 derives a convenient general expression for the psychometric function. Discrimination data are often fitted with a Weibull function. Theorem 2 proves that the Weibull "slope" parameter, β, can be approximated by β_Noise × β_Transducer, where β_Noise is the β of the Weibull function that fits best to the cumulative noise distribution, and β_Transducer depends on the transducer. We derive general expressions for β_Noise and β_Transducer, from which we derive expressions for specific cases. One case that follows naturally from our general analysis is Pelli's finding that, when d' ∝ (Δx)^b, β ≈ β_Noise × b. We also consider two limiting cases. Theorem 3 proves that, as sensitivity improves, 2AFC performance will usually approach that for a linear transducer, whatever the actual transducer; we show that this does not apply at signal levels where the transducer gradient is zero, which explains why it does not apply to contrast detection. Theorem 4 proves that, when the exponent of a power-function transducer approaches zero, 2AFC performance approaches that of a logarithmic transducer. We show that the power-function exponents of 0.4-0.5 fitted to suprathreshold contrast discrimination data are close enough to zero for the fitted psychometric function to be practically indistinguishable from that of a log transducer. Finally, Weibull β reflects the shape of the noise distribution, and we used our results to assess the recent claim that internal noise has higher kurtosis than a Gaussian. Our analysis of β for contrast discrimination suggests that, if internal noise is stimulus
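As a concrete reference for the Weibull fit mentioned in this record, here is a minimal sketch of the Weibull psychometric function in its usual 2AFC form, where performance rises from chance (0.5) toward 1. The parameter name `alpha` (threshold) and the function name are assumptions for illustration; the paper's own notation and derivations are not reproduced here.

```python
import math


def weibull_2afc(delta_x, alpha, beta):
    """Probability correct in 2AFC for stimulus difference delta_x.

    alpha: threshold parameter (same units as delta_x)
    beta:  Weibull "slope" parameter
    """
    if delta_x < 0 or alpha <= 0 or beta <= 0:
        raise ValueError("delta_x must be >= 0; alpha and beta must be > 0")
    return 1.0 - 0.5 * math.exp(-((delta_x / alpha) ** beta))
```

With this parameterization, performance at `delta_x == alpha` equals 1 - 0.5/e (about 82% correct) regardless of `beta`, so `beta` changes only how steeply the curve passes through that point.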
Four theorems on the psychometric function.

Directory of Open Access Journals (Sweden)

Keith A May

In a 2-alternative forced-choice (2AFC) discrimination task, observers choose which of two stimuli has the higher value. The psychometric function for this task gives the probability of a correct response for a given stimulus difference, Δx. This paper proves four theorems about the psychometric function. Assuming the observer applies a transducer and adds noise, Theorem 1 derives a convenient general expression for the psychometric function. Discrimination data are often fitted with a Weibull function. Theorem 2 proves that the Weibull "slope" parameter, β, can be approximated by β_Noise × β_Transducer, where β_Noise is the β of the Weibull function that fits best to the cumulative noise distribution, and β_Transducer depends on the transducer. We derive general expressions for β_Noise and β_Transducer, from which we derive expressions for specific cases. One case that follows naturally from our general analysis is Pelli's finding that, when d' ∝ (Δx)^b, β ≈ β_Noise × b. We also consider two limiting cases. Theorem 3 proves that, as sensitivity improves, 2AFC performance will usually approach that for a linear transducer, whatever the actual transducer; we show that this does not apply at signal levels where the transducer gradient is zero, which explains why it does not apply to contrast detection. Theorem 4 proves that, when the exponent of a power-function transducer approaches zero, 2AFC performance approaches that of a logarithmic transducer. We show that the power-function exponents of 0.4-0.5 fitted to suprathreshold contrast discrimination data are close enough to zero for the fitted psychometric function to be practically indistinguishable from that of a log transducer. Finally, Weibull β reflects the shape of the noise distribution, and we used our results to assess the recent claim that internal noise has higher kurtosis than a Gaussian. Our analysis of β for contrast discrimination suggests that, if internal noise is
Affective stress responses during leisure time: Validity evaluation of a modified version of the Stress-Energy Questionnaire.

Science.gov (United States)

Hadžibajramović, Emina; Ahlborg, Gunnar; Håkansson, Carita; Lundgren-Nilsson, Åsa; Grimby-Ekman, Anna

2015-12-01

Psychosocial stress at work is one of the most important factors behind increasing sick-leave rates. In addition to work stressors, it is important to account for non-work-related stressors when assessing stress responses. In this study, a modified version of the Stress-Energy Questionnaire (SEQ), the SEQ during leisure time (SEQ-LT) was introduced for assessing the affective stress response during leisure time. The aim of this study was to investigate the internal construct validity of the SEQ-LT. A second aim was to define the cut-off points for the scales, which could indicate high and low levels of leisure-time stress and energy, respectively. Internal construct validity of the SEQ-LT was evaluated using a Rasch analysis. We examined the unidimensionality and other psychometric properties of the scale by the fit to the Rasch model. A criterion-based approach was used for classification into high and low stress/energy levels. The psychometric properties of the stress and energy scales of the SEQ-LT were satisfactory, having accommodated for local dependency. The cut-off point for low stress was proposed to be in the interval between 2.45 and 3.02 on the Rasch metric score; while for high stress, it was between 3.65 and 3.90. The suggested cut-off points for the low and high energy levels were values between 1.73-1.97 and 2.66-3.08, respectively. The stress and energy scale of the SEQ-LT satisfied the measurement criteria defined by the Rasch analysis and it provided a useful tool for non-work-related assessment of stress responses. We provide guidelines on how to interpret the scale values. © 2015 the Nordic Societies of Public Health.
Decay of Iconic Memory Traces Is Related to Psychometric Intelligence: A Fixed-Links Modeling Approach

Science.gov (United States)

Miller, Robert; Rammsayer, Thomas H.; Schweizer, Karl; Troche, Stefan J.

2010-01-01

Several memory processes have been examined regarding their relation to psychometric intelligence with the exception of sensory memory. This study examined the relation between decay of iconic memory traces, measured with a partial-report task, and psychometric intelligence, assessed with the Berlin Intelligence Structure test, in 111…
Rasch scaling paranormal belief and experience: structure and semantics of Thalbourne's Australian Sheep-Goat Scale.

Science.gov (United States)

Lange, Rense; Thalbourne, Michael A

2002-12-01

Research on the relation between demographic variables and paranormal belief remains controversial given the possible semantic distortions introduced by item and test level biases. We illustrate how Rasch scaling can be used to detect such biases and to quantify their effects, using the Australian Sheep-Goat Scale as a substantive example. Based on data from 1,822 respondents, this test was Rasch scalable, reliable, and unbiased at the test level. Consistent with other research in which unbiased measures of paranormal belief were used, extremely weak age and sex effects were found (partial η² = .005 and .012, respectively).
Evaluation of the psychometric properties of the Nighttime Symptoms of COPD Instrument.

Science.gov (United States)

Mocarski, Michelle; Zaiser, Erica; Trundell, Dylan; Make, Barry J; Hareendran, Asha

2015-01-01

Nighttime symptoms can negatively impact the quality of life of patients with chronic obstructive pulmonary disease (COPD). The Nighttime Symptoms of COPD Instrument (NiSCI) was designed to measure the occurrence and severity of nighttime symptoms in patients with COPD, the impact of symptoms on nighttime awakenings, and rescue medication use. The objective of this study was to explore item reduction, inform scoring recommendations, and evaluate the psychometric properties of the NiSCI. COPD patients participating in a Phase III clinical trial completed the NiSCI daily. Item analyses were conducted using weekly mean and single day scores. Descriptive statistics (including percentage of respondents at floor/ceiling and inter-item correlations), factor analyses, and Rasch model analyses were conducted to examine item performance and scoring. Test-retest reliability was assessed for the final instrument using the intraclass correlation coefficient (ICC). Correlations with assessments conducted during study visits were used to evaluate convergent and known-groups validity. Data from 1,663 COPD patients aged 40-93 years were analyzed. Item analyses supported the generation of four scores. A one-factor structure was confirmed with factor analysis and Rasch analysis for the symptom severity score. Test-retest reliability was confirmed for the six-item symptom severity (ICC, 0.85), number of nighttime awakenings (ICC, 0.82), and rescue medication (ICC, 0.68) scores. Convergent validity was supported by significant correlations between the NiSCI, St George's Respiratory Questionnaire, and Exacerbations of Chronic Obstructive Pulmonary Disease Tool-Respiratory Symptoms scores. The results suggest that the NiSCI can be used to determine the severity of nighttime COPD symptoms, the number of nighttime awakenings due to COPD symptoms, and the nighttime use of rescue medication. The NiSCI is a reliable and valid instrument to evaluate these concepts in COPD patients in clinical
The construct validity of the Major Depression Inventory: A Rasch analysis of a self-rating scale in primary care.

Science.gov (United States)

Nielsen, Marie Germund; Ørnbøl, Eva; Vestergaard, Mogens; Bech, Per; Christensen, Kaj Sparle

2017-06-01

We aimed to assess the measurement properties of the ten-item Major Depression Inventory when used on clinical suspicion in general practice by performing a Rasch analysis. General practitioners asked consecutive persons to respond to the web-based Major Depression Inventory on clinical suspicion of depression. We included 22 practices and 245 persons. Rasch analysis was performed using RUMM2030 software. The Rasch model fit suggests that all items contribute to a single underlying trait (defined as internal construct validity). Mokken analysis was used to test dimensionality and scalability. Our Rasch analysis showed misfit concerning the sleep and appetite items (items 9 and 10). The response categories were disordered for eight items. After modifying the original six-point to a four-point scoring system for all items, we achieved ordered response categories for all ten items. The person separation reliability was acceptable (0.82) for the initial model. Dimensionality testing did not support combining the ten items to create a total score. The scale appeared to be well targeted to this clinical sample. No significant differential item functioning was observed for gender, age, work status and education. The Rasch and Mokken analyses revealed two dimensions, but the Major Depression Inventory showed fit to one scale if items 9 and 10 were excluded. Our study indicated scalability problems in the current version of the Major Depression Inventory. The conducted analysis revealed better statistical fit when items 9 and 10 were excluded. Copyright © 2017 Elsevier Inc. All rights reserved.
Analysis of High School German Textbooks through Rasch Measurement Model

Science.gov (United States)

Batdi, Veli; Elaldi, Senel

2016-01-01

The purpose of the present study is to analyze German teacher trainers' views on high school German textbooks through the Rasch measurement model. A survey research design was employed, and the study group consisted of a total of 21 teacher trainers, three from each region and selected randomly from provinces which are located in seven regions and…
Exploring students’ perceived and actual ability in solving statistical problems based on Rasch measurement tools

Science.gov (United States)

Azila Che Musa, Nor; Mahmud, Zamalia; Baharun, Norhayati

2017-09-01

One of the important skills required of any student learning statistics is knowing how to solve statistical problems correctly using appropriate statistical methods. This enables students to arrive at a conclusion and to contribute to decisions that matter to society. In this study, a group of 22 students majoring in statistics at UiTM Shah Alam were given problems on hypothesis testing, which required them to solve the problems using the confidence interval, traditional, and p-value approaches. Hypothesis testing is one of the techniques used in solving real problems and is listed as one of the difficult concepts for students to grasp. The objectives of this study are to explore students’ perceived and actual ability in solving statistical problems and to determine which items in statistical problem solving students find difficult to grasp. Students’ perceived and actual ability were measured with instruments developed from the respective topics. Rasch measurement tools such as the Wright map and item measures for fit statistics were used to accomplish the objectives. Data were collected and analysed using Winsteps 3.90 software, which is based on the Rasch measurement model. The results showed that students perceived themselves as moderately competent in solving the statistical problems using the confidence interval and p-value approaches even though their actual performance showed otherwise. Item measures for fit statistics also showed that the maximum estimated measures were found on two problems. These measures indicate that none of the students answered these problems correctly, for reasons that include their lack of understanding of confidence intervals and probability values.
Psychometric properties of the Communication Confidence Rating Scale for Aphasia (CCRSA): phase 1.

Science.gov (United States)

Cherney, Leora R; Babbitt, Edna M; Semik, Patrick; Heinemann, Allen W

2011-01-01

Confidence is a construct that has not been explored previously in aphasia research. We developed the Communication Confidence Rating Scale for Aphasia (CCRSA) to assess confidence in communicating in a variety of activities and evaluated its psychometric properties using rating scale (Rasch) analysis. The CCRSA was administered to 21 individuals with aphasia before and after participation in a computer-based language therapy study. Person reliability of the 8-item CCRSA was .77. The 5-category rating scale demonstrated monotonic increases in average measures from low to high ratings. However, one item ("I follow news, sports, stories on TV/movies") misfit the construct defined by the other items (mean square infit = 1.69, item-measure correlation = .41). Deleting this item improved reliability to .79; the 7 remaining items demonstrated excellent fit to the underlying construct, although there was a modest ceiling effect in this sample. Pre- to posttreatment changes on the 7-item CCRSA measure were statistically significant using a paired samples t test. Findings support the reliability and sensitivity of the CCRSA in assessing participants' self-report of communication confidence. Further evaluation of communication confidence is required with larger and more diverse samples.
%lrasch_mml: A SAS Macro for Marginal Maximum Likelihood Estimation in Longitudinal Polytomous Rasch Models

Directory of Open Access Journals (Sweden)

Maja Olsbjerg

2015-10-01

Item response theory models are often applied when a number of items are used to measure a unidimensional latent variable. Originally proposed and used within educational research, they are also used when the focus is on physical functioning or psychological wellbeing. Modern applications often need more general models, typically models for multidimensional latent variables or longitudinal models for repeated measurements. This paper describes a SAS macro that fits two-dimensional polytomous Rasch models using a specification of the model that is sufficiently flexible to accommodate longitudinal Rasch models. The macro estimates item parameters using marginal maximum likelihood estimation. A graphical presentation of item characteristic curves is included.

R in Psychometrics and Psychometrics in R

OpenAIRE

Leeuw, Jan de

2006-01-01

In psychometrics, and in the closely related fields of quantitative methods for the social and educational sciences, R is not yet used very often. Traditional mainframe packages such as SAS and SPSS are still dominant at the user level, Stata has made inroads at the teaching level, and Matlab is quite prominent at the research level. In this paper we define the most visible techniques in the psychometrics area, we give an overview of what is available in R, and we discuss what is m...
A psychometric approach to supervisory competency assessment

Directory of Open Access Journals (Sweden)

A. Vorster

2003-10-01

The primary purpose of this study was to evaluate the possibility of using a psychometric approach for assessing supervisory competencies relevant to the mining and refining environment. The competency questionnaire was developed using supervisory roles and registered supervisory unit standards from the United Kingdom (UK), as no registered unit standards exist in South Africa. Twenty-four supervisors from three departments (Production, Engineering and Laboratory) were evaluated by 125 raters: by themselves and also by their managers, peers, customers and subordinates. Based on difference scores derived from the Importance and Performance scales, a single factor was extracted with an internal reliability of 0.965. No statistically significant differences were found (ANOVAs, t-tests and F-statistics) between groups based on biographical variables or between rater groups. The findings and their implications are further discussed.
A note on contemporary psychometrics.

Science.gov (United States)

Vitoratou, Silia; Pickles, Andrew

2017-12-01

Psychometrics provide the mathematical underpinnings for psychological assessment. From the late 19th century, a plethora of methodological research achievements equipped researchers and clinicians with efficient tools whose practical value becomes more evident in the era of the internet and big data. Nowadays, powerful probabilistic models exist for most types of data and research questions. As the usability of the psychometric scales is better comprehended, there is an increased interest in applied research outcomes. Paradoxically, while the interest in applications for psychometric scales increases, publishing research on the development and/or evaluation of those scales per se, is not welcomed by many relevant journals. This special issue in psychometrics is therefore a great opportunity to briefly review the main ideas and methods used in psychometrics, and to discuss the challenges in contemporary applied psychometrics.
Rasch Analysis of the Adult Strabismus Quality of Life Questionnaire (AS-20) among Chinese Adult Patients with Strabismus.

Directory of Open Access Journals (Sweden)

Zonghua Wang

The impact of strabismus on visual function, self-image, self-esteem, and social interactions decreases health-related quality of life (HRQoL). The purpose of this study was to evaluate and refine the adult strabismus quality of life questionnaire (AS-20) by using Rasch analysis among Chinese adult patients with strabismus. We evaluated the fit of the AS-20 to the Rasch model in a Chinese population by assessing unidimensionality, infit and outfit, person and item separation index and reliability, response ordering, targeting and differential item functioning (DIF). The overall AS-20 did not demonstrate unidimensionality; however, unidimensionality was achieved separately in the two Rasch-revised subscales: the psychosocial subscale (11 items) and the function subscale (9 items). Good targeting, optimal item infit and outfit, and no notable local dependence were found for each of the subscales. The rating scale was appropriate for the psychosocial subscale, but a reduction to four response categories was required for the function subscale. No significant DIF was revealed for any demographic or clinical factor (e.g., age, gender, and strabismus type). Rasch analysis demonstrated the AS-20 to be a rigorous instrument for measuring health-related quality of life in Chinese strabismus patients, provided some revisions are made to the subscale construct and response options.
Spurious Latent Class Problem in the Mixed Rasch Model: A Comparison of Three Maximum Likelihood Estimation Methods under Different Ability Distributions

Science.gov (United States)

Sen, Sedat

2018-01-01

Recent research has shown that over-extraction of latent classes can be observed in the Bayesian estimation of the mixed Rasch model when the distribution of ability is non-normal. This study examined the effect of non-normal ability distributions on the number of latent classes in the mixed Rasch model when estimated with maximum likelihood…
Parental Health Attributions of Childhood Health and Illness: Development of the Pediatric Cultural Health Attributions Questionnaire (Pedi-CHAQ).

Science.gov (United States)

Vaughn, Lisa M; McLinden, Daniel J; Shellmer, Diana; Baker, Raymond C

2011-01-01

The causes attributed to childhood health and illness across cultures (cultural health attributions) are key factors that are now more frequently identified as affecting the health outcomes of children. Research suggests that the causes attributed to an event such as illness affect subsequent motivation, emotional response, decision making, and behavior. To date, there is no measure of health attributions appropriate for use with parents of pediatric patients. Using the Many-Facets approach to Rasch analysis, this study assesses the psychometrics of a newly developed instrument, the Pediatric Cultural Health Attributions Questionnaire (Pedi-CHAQ), a measure designed to assess the cultural health attributions of parents in diverse communities. Results suggest acceptable Rasch model statistics of fit and reliability for the Pedi-CHAQ. A shortened version of the questionnaire was developed as a result of this study and next steps are discussed.
Comparison of scoring approaches for the NEI VFQ-25 in low vision.

Science.gov (United States)

Dougherty, Bradley E; Bullimore, Mark A

2010-08-01

The aim of this study was to evaluate different approaches to scoring the National Eye Institute Visual Functioning Questionnaire-25 (NEI VFQ-25) in patients with low vision including scoring by the standard method, by Rasch analysis, and by use of an algorithm created by Massof to approximate Rasch person measure. Subscale validity and use of a 7-item short form instrument proposed by Ryan et al. were also investigated. NEI VFQ-25 data from 50 patients with low vision were analyzed using the standard method of summing Likert-type scores and calculating an overall average, Rasch analysis using Winsteps software, and the Massof algorithm in Excel. Correlations between scores were calculated. Rasch person separation reliability and other indicators were calculated to determine the validity of the subscales and of the 7-item instrument. Scores calculated using all three methods were highly correlated, but evidence of floor and ceiling effects was found with the standard scoring method. None of the subscales investigated proved valid. The 7-item instrument showed acceptable person separation reliability and good targeting and item performance. Although standard scores and Rasch scores are highly correlated, Rasch analysis has the advantages of eliminating floor and ceiling effects and producing interval-scaled data. The Massof algorithm for approximation of the Rasch person measure performed well in this group of low-vision patients. The validity of the VFQ-25 subscales should be reconsidered.
Nonparametric tests for equality of psychometric functions.

Science.gov (United States)

García-Pérez, Miguel A; Núñez-Antón, Vicente

2017-12-07

Many empirical studies measure psychometric functions (curves describing how observers' performance varies with stimulus magnitude) because these functions capture the effects of experimental conditions. To assess these effects, parametric curves are often fitted to the data and comparisons are carried out by testing for equality of mean parameter estimates across conditions. This approach is parametric and, thus, vulnerable to violations of the implied assumptions. Furthermore, testing for equality of means of parameters may be misleading: Psychometric functions may vary meaningfully across conditions on an observer-by-observer basis with no effect on the mean values of the estimated parameters. Alternative approaches to assess equality of psychometric functions per se are thus needed. This paper compares three nonparametric tests that are applicable in all situations of interest: The existing generalized Mantel-Haenszel test, a generalization of the Berry-Mielke test that was developed here, and a split variant of the generalized Mantel-Haenszel test also developed here. Their statistical properties (accuracy and power) are studied via simulation and the results show that all tests are indistinguishable as to accuracy but they differ non-uniformly as to power. Empirical use of the tests is illustrated via analyses of published data sets and practical recommendations are given. The computer code in MATLAB and R to conduct these tests is available as Electronic Supplemental Material.
Multi-faceted Rasch measurement and bias patterns in EFL writing performance assessment.

Science.gov (United States)

He, Tung-Hsien; Gou, Wen Johnny; Chien, Ya-Chen; Chen, I-Shan Jenny; Chang, Shan-Mao

2013-04-01

This study applied multi-faceted Rasch measurement to examine rater bias in the assessment of essays written by college students learning English as a foreign language. Four raters who had received different academic training from four distinctive disciplines applied a six-category rating scale to analytically rate essays on an argumentative topic and on a descriptive topic. FACETS, a Rasch computer program, was utilized to pinpoint bias patterns by analyzing the rater-topic, rater-category, and topic-category interactions. Results showed: argumentative essays were rated more severely than were descriptive essays; the linguistics-major rater was the most lenient rater, while the literature-major rater was the severest one; and the category of language use received the severest ratings, whereas content was given the most lenient ratings. The severity hierarchies for raters, essay topics, and rating categories suggested that raters' academic training and their perceptions of the importance of categories were associated with their bias patterns. Implications for rater training are discussed.
Revalidating the Arabic Scale for Teachers' Ratings of Basic Education Gifted Students' Characteristics Using Rasch Modeling

Directory of Open Access Journals (Sweden)

Salah Eldin Farah Atallah Bakheit

2013-12-01

The Arabic scale for teachers' ratings of basic education gifted students' characteristics is one of the most common Arabic measures used for initial identification of gifted students in some Arabic countries. One of the shortcomings of this scale is that it is based on the classical theory of measurement. This study sought to revalidate the scale in the light of Rasch modeling, which rests upon the modern theory of measurement, and to develop different criteria for interpreting the levels of individuals' traits. The scale was administered to 830 Basic Education students in Khartoum (ages ranged from 7 to 12 years). Two groups of students participated in the study: a calibration sample (N = 250) and a standardization sample (N = 580). The statistical treatments were performed using the PSAW 18 and RUMM 2020 programs according to Rasch's unidimensional model. Six of the scale's items were deleted for not conforming to Rasch modeling. This left the scale with 31 items. In addition, new criteria for the scale were developed by obtaining the t-scores and special education scores that match the various ratings of the individuals' ability.
Future of Psychometrics: Ask What Psychometrics Can Do for Psychology

Science.gov (United States)

Sijtsma, Klaas

2012-01-01

I address two issues that were inspired by my work on the Dutch Committee on Tests and Testing (COTAN). The first issue is the understanding of problems test constructors and researchers using tests have of psychometric knowledge. I argue that this understanding is important for a field, like psychometrics, for which the dissemination of…
Psychometrics in action, science as practice.

Science.gov (United States)

Pearce, Jacob

2017-07-27

Practitioners in health sciences education and assessment regularly use a range of psychometric techniques to analyse data, evaluate models, and make crucial progression decisions regarding student learning. However, a recent editorial entitled "Is Psychometrics Science?" highlighted some core epistemological and practical problems in psychometrics, and brought its legitimacy into question. This paper attempts to address these issues by applying some key ideas from history and philosophy of science (HPS) discourse. I present some of the conceptual developments in HPS that have bearing on the psychometrics debate. Next, by shifting the focus onto what constitutes the practice of science, I discuss psychometrics in action. Some incorrectly conceptualize science as an assemblage of truths, rather than an assemblage of tools and goals. Psychometrics, however, seems to be an assemblage of methods and techniques. Psychometrics in action represents a range of practices using specific tools in specific contexts. This does not render the practice of psychometrics meaningless or futile. Engaging in debates about whether or not we should regard psychometrics as 'scientific' is, however, a fruitless enterprise. The key question and focus should be whether, on what grounds, and in what contexts, the existing methods and techniques used by psychometricians can be justified or criticized.
Evaluation of Neuropsychiatric Function in Phenylketonuria: Psychometric Properties of the ADHD Rating Scale-IV and Adult ADHD Self-Report Scale Inattention Subscale in Phenylketonuria.

Science.gov (United States)

Wyrwich, Kathleen W; Auguste, Priscilla; Yu, Ren; Zhang, Charlie; Dewees, Benjamin; Winslow, Barbara; Yu, Shui; Merilainen, Markus; Prasad, Suyash

2015-06-01

Previous qualitative research among adults and parents of children with phenylketonuria (PKU) has identified inattention as an important psychiatric aspect of this condition. The parent-reported ADHD Rating Scale-IV (ADHD RS-IV) and the Adult ADHD Self-Report Scale (ASRS) have been validated for measuring inattention symptoms in persons with attention-deficit/hyperactivity disorder (ADHD); however, their psychometric attributes for measuring PKU-related inattention have not been established. The primary objective of this investigation was to demonstrate the reliability, validity, and responsiveness of the ADHD RS-IV and ASRS inattention symptoms subscales in a randomized controlled trial of patients with PKU aged 8 years or older. A post hoc analysis investigated the psychometric properties (Rasch model fit, reliability, construct validity, and responsiveness) of the ADHD RS-IV and ASRS inattention subscales using data from a phase 3b, double-blind, placebo-controlled clinical trial in those with PKU aged 8 years or older. The Rasch results revealed good model fit, and reliability analyses revealed strong internal consistency reliability (α ≥ 0.87) and reproducibility (intraclass correlation coefficient ≥ 0.87) for both measures. Both inattention measures demonstrated the ability to discriminate between known groups (P < 0.001) created by the Clinical Global Impression-Severity scale. Correlations between the ADHD RS-IV and the ASRS with the Clinical Global Impression-Severity scale and the age-appropriate Behavior Rating Inventory of Executive Function Working Memory subscale were consistently moderate to strong (r ≥ 0.56). Similarly, results of the change score correlations were of moderate magnitude (r ≥ 0.43) for both measures when compared with changes over time in Behavior Rating Inventory of Executive Function Working Memory subscales. These findings of reliability, validity, and responsiveness of both the ADHD RS-IV and the ASRS inattention scales
A developmental screening tool for toddlers with multiple domains based on Rasch analysis.

Science.gov (United States)

Hwang, Ai-Wen; Chou, Yeh-Tai; Hsieh, Ching-Lin; Hsieh, Wu-Shiun; Liao, Hua-Fang; Wong, Alice May-Kuen

2015-01-01

Using multidomain developmental screening tools is a feasible method for pediatric health care professionals to identify children at risk of developmental problems in multiple domains simultaneously. The purpose of this study was to develop a Rasch-based tool for Multidimensional Screening in Child Development (MuSiC) for children aged 0-3 years. The MuSiC was developed by constructing an item bank based on three commonly used screening tools and validating it against developmental status (at risk for delay or not) on five developmental domains. Parents of a convenience sample of 632 children (aged 3-35.5 months) with and without developmental delays responded to items from the three screening tools funded by health authorities in Taiwan. The item bank was determined by item fit in Rasch analysis for each of the five developmental domains (cognitive skills, language skills, gross motor skills, fine motor skills, and socioadaptive skills). Children's performance scores in logits derived from Rasch analysis were validated against developmental status for each domain using the area under receiver operating characteristic curves. MuSiC, a 75-item developmental screening tool for five domains, was derived. The diagnostic validity of all five domains was acceptable for all stages of development, except for the infant stage (≤11 months and 15 days). MuSiC can be applied simultaneously to well-child care visits as a universal screening tool for children aged 1-3 years on multiple domains. Items with sound validity for infants need to be further developed. Copyright © 2014. Published by Elsevier B.V.
Combining choice experiments with psychometric scales to assess the social acceptability of wind energy projects: A latent class approach

International Nuclear Information System (INIS)

Strazzera, Elisabetta; Mura, Marina; Contu, Davide

2012-01-01

A choice experiment exercise is combined with psychometric scales in order: (1) to identify factors that explain support/opposition toward a wind energy development project; and (2) to assess (monetary) trade-offs between attributes of the project. A Latent Class estimator is fitted to the data, and different utility parameters are estimated, conditional on class allocation. It is found that the probability of class membership depends on specific psychometric variables. Visual impacts on valued sites are an important factor of opposition toward a project, and this effect is magnified when identity values are attached to the specific site, so much that no trade-off would be acceptable for a class of individuals characterized by strong place attachment. Conversely, other classes of individuals are willing to accept compensations, in form of private and/or public benefits. The distribution of benefits in the territory, and preservation of the option value related to the possible development of an archeological site, are important for a class of individuals concerned with the sustainability of the local economy. - Highlights: ► A Choice Experiment approach is used to assess acceptability of a wind farm project. ► Psychometric variables are used to model heterogeneity in a Latent Class model. ► No trade-off would be acceptable for a class of individuals. ► Another class of individuals is interested in private benefits. ► Other classes are interested in public benefits and sustainability of the development.
Calibration of a Chemistry Test Using the Rasch Model

Directory of Open Access Journals (Sweden)

Nancy Coromoto Martín Guaregua

2011-11-01

Full Text Available The Rasch model was used to calibrate a general chemistry test for the purpose of analyzing the advantages and information the model provides. The sample was composed of 219 college freshmen. Of the 12 questions used, good fit was achieved in 10. The evaluation shows that although there are items of variable difficulty, there are gaps on the scale; in order to make the test complete, it will be necessary to design new items to fill in these gaps.
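The calibration described in the record above rests on the dichotomous Rasch model, in which the probability of a correct answer is a logistic function of person ability minus item difficulty, both in logits. The following is a minimal sketch, not the study's procedure: the function names, the gap threshold, and the item difficulties are illustrative assumptions. It shows the response probability and one simple way to flag gaps between adjacent item difficulties of the kind the abstract mentions.

```python
import math


def rasch_probability(theta, b):
    """P(correct) under the dichotomous Rasch model.

    theta: person ability in logits; b: item difficulty in logits.
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def difficulty_gaps(difficulties, min_gap=1.0):
    """Return (lower, upper) pairs of adjacent item difficulties
    that are more than min_gap logits apart (min_gap is an assumed cutoff)."""
    ordered = sorted(difficulties)
    return [(lo, hi) for lo, hi in zip(ordered, ordered[1:]) if hi - lo > min_gap]


# Hypothetical item difficulties, for illustration only.
item_difficulties = [-2.0, -1.5, 0.0, 0.4, 2.1]
gaps = difficulty_gaps(item_difficulties)
```

Each flagged pair marks a stretch of the scale where no item is targeted, which is where new items would be written.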
Measuring the impact of health problems among adults with limited mobility in Thailand: further validation of the Perceived Impact of Problem Profile

Directory of Open Access Journals (Sweden)

Manderson Lenore

2008-01-01

Full Text Available Abstract Background The Perceived Impact of Problem Profile (PIPP) was developed to provide a tool for measuring the impact of a health condition from the individual's perspective, using the ICF model as a framework. One of the aims of the ICF is to enable the comparison of data across countries; however, relatively little is known about the subjective experience of disability in middle and low-income countries. The aim of this study was to assess the validity of the Perceived Impact of Problem Profile (PIPP) for use among adults with a disability in Thailand using Rasch analysis. Methods A total of 210 adults with mobility impairment from the urban, rural and remote areas of northeast Thailand completed the PIPP, which contains 23 items assessing both impact and distress across five key domains (Self-care, Mobility, Participation, Relationships, and Psychological Well-being). Rasch analysis, using RUMM2020, was conducted to assess the internal validity and psychometric properties of the PIPP Impact subscales. Validation of the PIPP Impact scales was conducted by comparing scores across the different response levels of the EQ5D items. Results Rasch analysis indicated that participants did not clearly differentiate between 'impact' and 'distress,' the two aspects assessed by the PIPP. Further analyses were therefore limited to the PIPP Impact subscales. These showed adequate psychometric properties, demonstrating fit to the Rasch model and good person separation reliability. Preliminary validity testing using the EQ5D items provided support for the PIPP Impact subscales. Conclusion The results provide further support for the psychometric properties of the PIPP Impact scales and indicate that it is a suitable tool for use among adults with a locomotor disability in Thailand. Further research is needed to validate the PIPP across different cultural contexts and health conditions and to assess the usefulness of separate Impact and Distress subscales.
Measuring the impact of health problems among adults with limited mobility in Thailand: further validation of the Perceived Impact of Problem Profile

Science.gov (United States)

Misajon, RoseAnne; Pallant, Julie F; Manderson, Lenore; Chirawatkul, Siriporn

2008-01-01

Background The Perceived Impact of Problem Profile (PIPP) was developed to provide a tool for measuring the impact of a health condition from the individual's perspective, using the ICF model as a framework. One of the aims of the ICF is to enable the comparison of data across countries; however, relatively little is known about the subjective experience of disability in middle and low-income countries. The aim of this study was to assess the validity of the Perceived Impact of Problem Profile (PIPP) for use among adults with a disability in Thailand using Rasch analysis. Methods A total of 210 adults with mobility impairment from the urban, rural and remote areas of northeast Thailand completed the PIPP, which contains 23 items assessing both impact and distress across five key domains (Self-care, Mobility, Participation, Relationships, and Psychological Well-being). Rasch analysis, using RUMM2020, was conducted to assess the internal validity and psychometric properties of the PIPP Impact subscales. Validation of the PIPP Impact scales was conducted by comparing scores across the different response levels of the EQ5D items. Results Rasch analysis indicated that participants did not clearly differentiate between 'impact' and 'distress,' the two aspects assessed by the PIPP. Further analyses were therefore limited to the PIPP Impact subscales. These showed adequate psychometric properties, demonstrating fit to the Rasch model and good person separation reliability. Preliminary validity testing using the EQ5D items provided support for the PIPP Impact subscales. Conclusion The results provide further support for the psychometric properties of the PIPP Impact scales and indicate that it is a suitable tool for use among adults with a locomotor disability in Thailand. Further research is needed to validate the PIPP across different cultural contexts and health conditions and to assess the usefulness of separate Impact and Distress subscales. PMID:18208616
Development and Psychometric Evaluation of the School Bullying Scales: A Rasch Measurement Approach

Science.gov (United States)

Cheng, Ying-Yao; Chen, Li-Ming; Liu, Kun-Shia; Chen, Yi-Ling

2011-01-01

The study aims to develop three school bullying scales--the Bully Scale, the Victim Scale, and the Witness Scale--to assess secondary school students' bullying behaviors, including physical bullying, verbal bullying, relational bullying, and cyber bullying. The items of the three scales were developed from viewpoints of bullies, victims, and…
Nature, nurture, and item response theory: a psychometric approach to behaviour genetics

NARCIS (Netherlands)

Schwabe, Inga

2016-01-01

This dissertation discusses a number of psychometric issues that require special attention in the analysis of genetically-informative data, such as data on twins. These include heterogeneous measurement error, scaling and scale transformation, and harmonization of phenotypes. It is shown how

Theatre needs fresh dramaturgy to survive / Jane Rasch; relayed by Eva-Liisa Linder

Index Scriptorium Estoniae

Rasch, Jane

2009-01-01

Danish theatre scholar Jane Rasch taught at the playwriting course "From Ideas to a Play Script", held 10-14 August 2009 at the Viljandi Culture Academy. Also covers the training of playwrights in Denmark and other Nordic countries
Quantifying Local, Response Dependence between Two Polytomous Items Using the Rasch Model

Science.gov (United States)

Andrich, David; Humphry, Stephen M.; Marais, Ida

2012-01-01

Models of modern test theory imply statistical independence among responses, generally referred to as "local independence." One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation as a process in the dichotomous Rasch model,…
Measuring everyday functional competence using the Rasch assessment of everyday activity limitations (REAL) item bank.

Science.gov (United States)

Oude Voshaar, Martijn A H; Ten Klooster, Peter M; Vonkeman, Harald E; van de Laar, Mart A F J

2017-11-01

Traditional patient-reported physical function instruments often poorly differentiate patients with mild-to-moderate disability. We describe the development and psychometric evaluation of a generic item bank for measuring everyday activity limitations in outpatient populations. Seventy-two items generated from patient interviews and mapped to the International Classification of Functioning, Disability and Health (ICF) domestic life chapter were administered to 1128 adults representative of the Dutch population. The partial credit model was fitted to the item responses and evaluated with respect to its assumptions, model fit, and differential item functioning (DIF). Measurement performance of a computerized adaptive testing (CAT) algorithm was compared with the SF-36 physical functioning scale (PF-10). A final bank of 41 items was developed. All items demonstrated acceptable fit to the partial credit model and measurement invariance across age, sex, and educational level. Five- and ten-item CAT simulations were shown to have high measurement precision, which exceeded that of SF-36 physical functioning scale across the physical function continuum. Floor effects were absent for a 10-item empirical CAT simulation, and ceiling effects were low (13.5%) compared with SF-36 physical functioning (38.1%). CAT also discriminated better than SF-36 physical functioning between age groups, number of chronic conditions, and respondents with or without rheumatic conditions. The Rasch assessment of everyday activity limitations (REAL) item bank will hopefully prove a useful instrument for assessing everyday activity limitations. T-scores obtained using derived measures can be used to benchmark physical function outcomes against the general Dutch adult population.
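The computerized adaptive testing (CAT) simulations in the record above rely on picking, at each step, the item that is most informative at the respondent's current ability estimate. The sketch below illustrates that selection rule under a simplifying assumption: it uses the dichotomous Rasch model, where item information is p(1 - p), rather than the partial credit model fitted in the study. The item names and difficulties are hypothetical.

```python
import math


def rasch_probability(theta, b):
    """P(endorse) under the dichotomous Rasch model (logit scale)."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_information(theta, b):
    """Fisher information of a dichotomous Rasch item at ability theta."""
    p = rasch_probability(theta, b)
    return p * (1.0 - p)


def next_item(theta, bank, administered):
    """Pick the not-yet-administered item with maximum information at theta.

    bank: dict mapping item name -> difficulty in logits.
    administered: set of item names already given.
    """
    candidates = {name: b for name, b in bank.items() if name not in administered}
    return max(candidates, key=lambda name: item_information(theta, candidates[name]))


# Hypothetical three-item bank, for illustration only.
bank = {"light_housework": -1.0, "shopping": 0.2, "heavy_lifting": 1.5}
first = next_item(0.0, bank, set())
second = next_item(0.0, bank, {first})
```

Under the Rasch model this rule always selects the remaining item whose difficulty is closest to the current ability estimate; a full CAT would also re-estimate ability after each response and apply a stopping rule.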
Dependence module of the MINI plus adapted for sugar dependence: psychometric properties

Directory of Open Access Journals (Sweden)

Marco Aurélio Camargo da Rosa

2013-01-01

Full Text Available This study aimed to analyze the factorial structure and the scale of measurement of the dependence items of the Diagnostic and Statistical Manual of Mental Disorders (DSM-IV) adapted for sugar consumption, in order to verify whether the structural characteristics can be applied to sugar dependence. The questionnaire was applied to a sample of 500 subjects in Brazil (67% female; mean age: 38 years; 43% from weight control clinics; 63% with normal BMI). An exploratory factor analysis was performed to determine the factorial structure and unidimensionality, and a Rasch model analysis to verify unidimensionality and item distribution. The model with the best fit was unidimensional. All items had good fit to the Rasch model, with a reliability of .99, infit between .86 and 1.14, and outfit from .71 to 1.20. The items of the MINI Plus adapted for sugar dependence presented good psychometric properties, suggesting that the dependence criteria of DSM-IV support the verification of the construct for sugar addiction.
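The infit and outfit values quoted in the record above are mean-square fit statistics, for which values near 1 indicate responses about as variable as the Rasch model expects. The sketch below shows how they are conventionally computed for one dichotomous item. It assumes person abilities and the item difficulty are already known, whereas in practice they are estimates; the function names and data are illustrative and do not come from the study.

```python
import math


def rasch_probability(theta, b):
    """P(endorse) under the dichotomous Rasch model (logit scale)."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_fit(responses, thetas, b):
    """Return (infit, outfit) mean squares for one dichotomous item.

    responses: 0/1 answers, one per person.
    thetas: person abilities in logits (assumed known here).
    b: item difficulty in logits (assumed known here).
    """
    sum_sq_resid = 0.0   # sum of squared raw residuals (x - p)^2
    sum_variance = 0.0   # sum of model variances p(1 - p)
    sum_z_sq = 0.0       # sum of squared standardized residuals
    for x, theta in zip(responses, thetas):
        p = rasch_probability(theta, b)
        var = p * (1.0 - p)
        sq = (x - p) ** 2
        sum_sq_resid += sq
        sum_variance += var
        sum_z_sq += sq / var
    infit = sum_sq_resid / sum_variance   # information-weighted mean square
    outfit = sum_z_sq / len(responses)    # unweighted mean square
    return infit, outfit


# Hypothetical data: four persons whose ability equals the item difficulty.
infit, outfit = item_fit([1, 0, 1, 0], [0.0, 0.0, 0.0, 0.0], 0.0)
```

Outfit weights every person equally and so reacts strongly to unexpected responses from persons far from the item's difficulty, while infit weights by model variance and is more sensitive to misfit among well-targeted persons.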
A comparison of Rasch item-fit and Cronbach's alpha item reduction analysis for the development of a Quality of Life scale for children and adolescents.

Science.gov (United States)

Erhart, M; Hagquist, C; Auquier, P; Rajmil, L; Power, M; Ravens-Sieberer, U

2010-07-01

This study compares item reduction analysis based on classical test theory (maximizing Cronbach's alpha - approach A), with analysis based on the Rasch Partial Credit Model item-fit (approach B), as applied to children and adolescents' health-related quality of life (HRQoL) items. The reliability and structural, cross-cultural and known-group validity of the measures were examined. Within the European KIDSCREEN project, 3019 children and adolescents (8-18 years) from seven European countries answered 19 HRQoL items of the Physical Well-being dimension of a preliminary KIDSCREEN instrument. The Cronbach's alpha and corrected item total correlation (approach A) were compared with infit mean squares and the Q-index item-fit derived according to a partial credit model (approach B). Cross-cultural differential item functioning (DIF ordinal logistic regression approach), structural validity (confirmatory factor analysis and residual correlation) and relative validity (RV) for socio-demographic and health-related factors were calculated for approaches (A) and (B). Approach (A) led to the retention of 13 items, compared with 11 items with approach (B). The item overlap was 69% for (A) and 78% for (B). The correlation coefficient of the summated ratings was 0.93. The Cronbach's alpha was similar for both versions [0.86 (A); 0.85 (B)]. Both approaches selected some items that are not strictly unidimensional and items displaying DIF. RV ratios favoured (A) with regard to socio-demographic aspects. Approach (B) was superior in RV with regard to health-related aspects. Both types of item reduction analysis should be accompanied by additional analyses. Neither of the two approaches was universally superior with regard to cultural, structural and known-group validity. However, the results support the usability of the Rasch method for developing new HRQoL measures for children and adolescents.
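Approach A in the record above selects items by maximizing Cronbach's alpha. As a reminder of what that statistic measures, here is a minimal sketch of the standard formula, alpha = k/(k-1) × (1 − Σ item variances / variance of total score). The function name and the toy data are illustrative assumptions, and the sketch does not reproduce the study's item-reduction procedure.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a respondents-by-items score matrix.

    scores: list of rows, one per respondent, each a list of k item scores.
    Uses sample variances (n - 1 denominator) throughout.
    """
    k = len(scores[0])
    item_variances = [variance([row[i] for row in scores]) for i in range(k)]
    total_variance = variance([sum(row) for row in scores])
    return k / (k - 1) * (1 - sum(item_variances) / total_variance)


# Toy data: two items that rank three respondents identically.
alpha = cronbach_alpha([[1, 1], [2, 2], [3, 3]])
```

An alpha-based reduction would recompute this value with each candidate item removed and drop the items whose removal raises it, which is why it tends to favour highly intercorrelated items regardless of their fit to a measurement model.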
Accounting for Local Dependence with the Rasch Model: The Paradox of Information Increase.

Science.gov (United States)

Andrich, David

Test theories imply statistical, local independence. Where local independence is violated, models of modern test theory that account for it have been proposed. One violation of local independence occurs when the response to one item governs the response to a subsequent item. Expanding on a formulation of this kind of violation between two items in the dichotomous Rasch model, this paper derives three related implications. First, it formalises how the polytomous Rasch model for an item constituted by summing the scores of the dependent items absorbs the dependence in its threshold structure. Second, it shows that as a consequence the unit when the dependence is accounted for is not the same as if the items had no response dependence. Third, it explains the paradox, known, but not explained in the literature, that the greater the dependence of the constituent items the greater the apparent information in the constituted polytomous item when it should provide less information.
Measuring parental stress in mothers of infants: A Rasch-based construct validity study

DEFF Research Database (Denmark)

Nielsen, Tine; Pontoppidan, Maiken; Kristensen, Ingeborg Hedegaard

of the Danish language version of the PSS in a community sample of 1110 mothers of children aged 0 to 12 months employing the Rasch family of IRT models, and emphasizing the issues of unidimensionality and equal item functioning (no DIF) relative to the age and educational levels of the mothers. No adequate fit......) were found each to fit so-called graphical loglinear Rasch models: The parental stress subscale fit a model adjusted for local response dependence between some item pairs, as well as DIF for one item relative to mothers’ level of education and DIF for another item relative to age and educational level...... of the mothers. The parental satisfaction subscale fit a model adjusted only for local response dependence. The findings are in line with the original interpretation of the PSS. We recommend that the scoring of the PSS is changed to reflect the two subscales and the dichotomization of response categories...
Psychometric properties of the Functional Social Support Questionnaire and the Loneliness Scale in non-institutionalized older adults in Spain

Directory of Open Access Journals (Sweden)

Alba Ayala

2012-08-01

Full Text Available Objectives: To examine the psychometric properties of the Duke-UNC Functional Social Support Questionnaire (DUFSS) and the De Jong-Gierveld Loneliness Scale in a sample of non-institutionalized older adults. Methods: The sample consisted of 1,106 non-institutionalized older adults included in a national survey on quality of life. Both scales were analyzed according to classical test theory (acceptability, internal consistency, internal validity, convergent validity, discriminative validity, and precision) and Rasch analysis. Results: Mean scores ± standard deviation were 44.95 ± 8.9 for the DUFSS and 1.92 ± 1.83 for the Loneliness Scale. Cronbach's α was 0.94 for the DUFSS and 0.77 for the Loneliness Scale. Factor analysis showed two factors in both scales (explained variance: 73.8% for the DUFSS and 67.7% for the Loneliness Scale). The correlation coefficient between the two instruments was -0.59. Rasch analysis of the DUFSS identified two dimensions with good fit to the model, whereas the Loneliness Scale data did not fit the model well. Conclusions: The DUFSS questionnaire, with some modifications, meets the assumptions of the Rasch model and provides linear measures. However, further Rasch analysis studies of the Loneliness Scale are needed. According to classical test theory, the DUFSS has good internal consistency for comparing individuals, and the Loneliness Scale has good internal consistency for comparing groups. Both scales show satisfactory construct validity.
Dimensionality and predictive validity of the HAM-Nat, a test of natural sciences for medical school admission.

Science.gov (United States)

Hissbach, Johanna C; Klusmann, Dietrich; Hampe, Wolfgang

2011-10-14

Knowledge in natural sciences generally predicts study performance in the first two years of the medical curriculum. In order to reduce delay and dropout in the preclinical years, Hamburg Medical School decided to develop a natural science test (HAM-Nat) for student selection. In the present study, two different approaches to scale construction are presented: a unidimensional scale and a scale composed of three subject specific dimensions. Their psychometric properties and relations to academic success are compared. 334 first year medical students of the 2006 cohort responded to 52 multiple choice items from biology, physics, and chemistry. For the construction of scales we generated two random subsamples, one for development and one for validation. In the development sample, unidimensional item sets were extracted from the item pool by means of weighted least squares (WLS) factor analysis, and subsequently fitted to the Rasch model. In the validation sample, the scales were subjected to confirmatory factor analysis and, again, Rasch modelling. The outcome measure was academic success after two years. Although the correlational structure within the item set is weak, a unidimensional scale could be fitted to the Rasch model. However, psychometric properties of this scale deteriorated in the validation sample. A model with three highly correlated subject specific factors performed better. All summary scales predicted academic success with an odds ratio of about 2.0. Prediction was independent of high school grades and there was a slight tendency for prediction to be better in females than in males. A model separating biology, physics, and chemistry into different Rasch scales seems to be more suitable for item bank development than a unidimensional model, even when these scales are highly correlated and enter into a global score. When such a combination scale is used to select the upper quartile of applicants, the proportion of successful completion of the curriculum
Dimensionality and predictive validity of the HAM-Nat, a test of natural sciences for medical school admission

Directory of Open Access Journals (Sweden)

Hissbach Johanna C

2011-10-01

Full Text Available Abstract Background Knowledge in natural sciences generally predicts study performance in the first two years of the medical curriculum. In order to reduce delay and dropout in the preclinical years, Hamburg Medical School decided to develop a natural science test (HAM-Nat) for student selection. In the present study, two different approaches to scale construction are presented: a unidimensional scale and a scale composed of three subject specific dimensions. Their psychometric properties and relations to academic success are compared. Methods 334 first year medical students of the 2006 cohort responded to 52 multiple choice items from biology, physics, and chemistry. For the construction of scales we generated two random subsamples, one for development and one for validation. In the development sample, unidimensional item sets were extracted from the item pool by means of weighted least squares (WLS) factor analysis, and subsequently fitted to the Rasch model. In the validation sample, the scales were subjected to confirmatory factor analysis and, again, Rasch modelling. The outcome measure was academic success after two years. Results Although the correlational structure within the item set is weak, a unidimensional scale could be fitted to the Rasch model. However, psychometric properties of this scale deteriorated in the validation sample. A model with three highly correlated subject specific factors performed better. All summary scales predicted academic success with an odds ratio of about 2.0. Prediction was independent of high school grades and there was a slight tendency for prediction to be better in females than in males. Conclusions A model separating biology, physics, and chemistry into different Rasch scales seems to be more suitable for item bank development than a unidimensional model, even when these scales are highly correlated and enter into a global score. When such a combination scale is used to select the upper quartile of
A Protocol for Advanced Psychometric Assessment of Surveys

Science.gov (United States)

Squires, Janet E.; Hayduk, Leslie; Hutchinson, Alison M.; Cranley, Lisa A.; Gierl, Mark; Cummings, Greta G.; Norton, Peter G.; Estabrooks, Carole A.

2013-01-01

Background and Purpose. In this paper, we present a protocol for advanced psychometric assessments of surveys based on the Standards for Educational and Psychological Testing. We use the Alberta Context Tool (ACT) as an exemplar survey to which this protocol can be applied. Methods. Data mapping, acceptability, reliability, and validity are addressed. Acceptability is assessed with missing data frequencies and the time required to complete the survey. Reliability is assessed with internal consistency coefficients and information functions. A unitary approach to validity consisting of accumulating evidence based on instrument content, response processes, internal structure, and relations to other variables is taken. We also address assessing performance of survey data when aggregated to higher levels (e.g., nursing unit). Discussion. In this paper we present a protocol for advanced psychometric assessment of survey data using the Alberta Context Tool (ACT) as an exemplar survey; application of the protocol to the ACT survey is underway. Psychometric assessment of any survey is essential to obtaining reliable and valid research findings. This protocol can be adapted for use with any nursing survey. PMID:23401759
Psychometric Evaluation of the Student Authorship Questionnaire: A Confirmatory Factor Analysis Approach

Science.gov (United States)

Ballantine, Joan; Guo, Xin; Larres, Patricia

2015-01-01

This research provides new insights into the measurement of students' authorial identity and its potential for minimising the incidence of unintentional plagiarism by providing evidence about the psychometric properties of the Student Authorship Questionnaire (SAQ). Exploratory and confirmatory factor analyses (EFA and CFA) are employed to…
Análise de Rasch aplicada a questionário sobre consumo de tabaco em escolares adolescentes = Rasch analysis applied to a questionnaire on tobacco use among adolescent students

Directory of Open Access Journals (Sweden)

Santos, Wendel Mombaque dos

2014-01-01

Conclusions: The use of the Rasch method made it possible to verify each participant's exposure to the different conditions of tobacco exposure, and showed that the questionnaire assessing tobacco exposure should give each question a different weight.
Psychometric properties of a new short version of the State-Trait Anxiety Inventory (STAI) for the assessment of anxiety in the elderly.

Science.gov (United States)

Fernández-Blázquez, M A; Ávila-Villanueva, M; López-Pina, J A; Zea-Sevilla, M A; Frades-Payo, B

2015-01-01

Anxiety has negative effects on the cognitive performance and psychosocial adjustment of elderly people. Given the high prevalence of anxiety symptoms in patients suffering from cognitive impairment, it has been suggested that these symptoms may be an early marker of dementia. The State-Trait Anxiety Inventory (STAI) is one of the most widely used scales for evaluating anxiety in elderly people. However, inasmuch as the STAI may be difficult to apply to older people, having a short form of it would be desirable. The participants comprised 489 community-dwelling individuals aged 68 years and over. All of them were volunteers in a longitudinal study for early detection of Alzheimer's Disease (Proyecto Vallecas). The full sample was divided into two homogeneous subgroups: Group A, used to reduce the number of items and response options, and Group B, used to determine the psychometric properties of the new short form (STAIr). A dichotomous Rasch model was used to obtain the STAIr. No statistically significant differences for STAIr scores were found with respect to sociodemographic variables. Psychometric properties and normative data were obtained for the new short version. The STAIr is composed of 13 items and the data fit the model well. Since it is short and easy to apply to elderly people, the STAIr will be very useful in clinical and research settings. Copyright © 2013 Sociedad Española de Neurología. Published by Elsevier España, S.L.U. All rights reserved.
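The dichotomous Rasch model mentioned above gives the probability of endorsing an item as a logistic function of the difference between a person's trait level and the item's difficulty. The sketch below illustrates that formula only; the function names and the example difficulties are hypothetical and are not STAIr estimates.

```python
import math

def rasch_probability(theta, difficulty):
    """P(X = 1) under the dichotomous Rasch model.

    theta: person location in logits; difficulty: item location in logits.
    """
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))

def expected_score(theta, difficulties):
    """Expected raw score on a set of dichotomous items."""
    return sum(rasch_probability(theta, b) for b in difficulties)

# Hypothetical item difficulties (logits), for illustration only.
example_difficulties = [-1.0, 0.0, 1.0]
score_at_zero = expected_score(0.0, example_difficulties)
```

When the person location equals the item difficulty, the endorsement probability is exactly 0.5, which is what makes item difficulties and person measures directly comparable on one logit scale.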
Psychometric Evaluation of the Hypogonadism Impact of Symptoms Questionnaire Short Form (HIS-Q-SF).

Science.gov (United States)

Gelhorn, Heather L; Roberts, Laurie J; Khandelwal, Nikhil; Revicki, Dennis A; DeRogatis, Leonard R; Dobs, Adrian; Hepp, Zsolt; Miller, Michael G

2017-08-01

The Hypogonadism Impact of Symptoms Questionnaire Short Form (HIS-Q-SF) is a patient-reported outcome measurement designed to evaluate the symptoms of hypogonadism. The HIS-Q-SF is an abbreviated version including 17 items from the original 28-item HIS-Q. The aim was to conduct item analyses and reduction, evaluate the psychometric properties of the HIS-Q-SF, and provide guidance on score interpretation. A 12-week observational longitudinal study of hypogonadal men was conducted as part of the original HIS-Q psychometric evaluation. Participants completed the original HIS-Q every 2 weeks. Blood samples were collected to evaluate testosterone levels. Participants completed the Aging Male's Symptoms Scale, the International Index of Erectile Function, the Short Form-12, and the PROMIS Sexual Activity, Satisfaction with Sex Life, Sleep Disturbance, and Applied Cognition Scales (baseline and weeks 6 and 12). Clinicians completed the Clinical Global Impression of Severity and Change scales and a clinical form. Item performance was evaluated using descriptive statistics and Rasch analyses. Reliability (internal consistency and test-retest), validity (concurrent and known groups), and responsiveness were assessed. One hundred seventy-seven men participated (mean age = 54.1 years, range = 23-83). Similar to the full HIS-Q, the final abbreviated HIS-Q-SF instrument includes five domains (sexual, energy, sleep, cognition, and mood) with two sexual subdomains (libido and sexual function). For key domains, test-retest reliability was very good, and construct validity was good for all domains. Known-groups validity was demonstrated for all domain scores, subdomain scores, and total score based on the Clinical Global Impression-Severity. All domains and subdomains were responsive to change based on patient-rated anchor questions. The HIS-Q-SF could be a useful tool in clinical practice, epidemiologic studies, and other academic research settings. Careful consideration was given to the
Development of a Patient-Reported Palliative Care-Specific Health Classification System: The POS-E.

Science.gov (United States)

Dzingina, Mendwas; Higginson, Irene J; McCrone, Paul; Murtagh, Fliss E M

2017-06-01

Generic preference-based measures are commonly used to estimate quality-adjusted life-years (QALYs) to inform resource-allocation decisions. However, concerns have been raised that generic measures may be inappropriate in palliative care. Our objective was to derive a health-state classification system that is amenable to valuation from the ten-item Palliative Care Outcome Scale (POS), a widely used patient-reported outcome measure in palliative care. The dimensional structure of the original POS was assessed using factor analysis. Item performance was assessed, using Rasch analysis and psychometric criteria, to enable the selection of items that represent the dimensions covered by the POS. Data from six studies of patients receiving palliative care were combined (N = 1011) and randomly split into two halves for development and validation. Analysis was undertaken on the development data, and results were validated by repeating the analysis with the validation dataset. Following Rasch and factor analyses, a classification system of seven items was derived. Each item had two to three levels. Rasch threshold map helped identify a set of 14 plausible health states that can be used for the valuation of the instrument to derive a preference-based index. Combining factor analysis and Rasch analysis with psychometric criteria provides a valid method of constructing a classification system for a palliative care-specific preference-based measure. The next stage is to obtain preference weights so the measure can be used in economic evaluations in palliative care.
A longitudinal evaluation of the Center for Epidemiologic Studies-Depression scale (CES-D) in a Rheumatoid Arthritis Population using Rasch Analysis

Science.gov (United States)

Covic, Tanya; Pallant, Julie F; Conaghan, Philip G; Tennant, Alan

2007-01-01

Background: The aim of this study was to test the internal validity of the total Center for Epidemiologic Studies-Depression (CES-D) scale using Rasch analysis in a rheumatoid arthritis (RA) population. Methods: CES-D was administered to 157 patients with RA over three time points within a 12 month period. Rasch analysis was applied using RUMM2020 software to assess the overall fit of the model, the response scale used, individual item fit, differential item functioning (DIF) and person separation. Results: Pooled data across three time points was shown to fit the Rasch model with removal of seven items from the original 20-item CES-D scale. It was necessary to rescore the response format from four to three categories in order to improve the scale's fit. Two items demonstrated some DIF for age and gender but were retained within the 13-item CES-D scale. A new cut point for depression score of 9 was found to correspond to the original cut point score of 16 in the full CES-D scale. Conclusion: This Rasch analysis of the CES-D in a longstanding RA cohort resulted in the construction of a modified 13-item scale with good internal validity. Further validation of the modified scale is recommended particularly in relation to the new cut point for depression. PMID:17629902
A longitudinal evaluation of the Center for Epidemiologic Studies-Depression scale (CES-D) in a Rheumatoid Arthritis Population using Rasch Analysis

Directory of Open Access Journals (Sweden)

Tennant Alan

2007-07-01

Background: The aim of this study was to test the internal validity of the total Center for Epidemiologic Studies-Depression (CES-D) scale using Rasch analysis in a rheumatoid arthritis (RA) population. Methods: CES-D was administered to 157 patients with RA over three time points within a 12 month period. Rasch analysis was applied using RUMM2020 software to assess the overall fit of the model, the response scale used, individual item fit, differential item functioning (DIF) and person separation. Results: Pooled data across three time points was shown to fit the Rasch model with removal of seven items from the original 20-item CES-D scale. It was necessary to rescore the response format from four to three categories in order to improve the scale's fit. Two items demonstrated some DIF for age and gender but were retained within the 13-item CES-D scale. A new cut point for depression score of 9 was found to correspond to the original cut point score of 16 in the full CES-D scale. Conclusion: This Rasch analysis of the CES-D in a longstanding RA cohort resulted in the construction of a modified 13-item scale with good internal validity. Further validation of the modified scale is recommended particularly in relation to the new cut point for depression.
Dimensionality of the Knee Numeric-Entity Evaluation Score (KNEES-ACL)

DEFF Research Database (Denmark)

Comins, J D; Krogsgaard, M R; Kreiner, Svend

2013-01-01

The benefit of anterior cruciate ligament (ACL) reconstruction has been questioned based on patient-reported outcome measures (PROMs). Valid interpretation of such results requires confirmation of the psychometric properties of the PROM. Rasch analysis is the gold standard for validation of PROMs...
Historical Views of Invariance: Evidence from the Measurement Theories of Thorndike, Thurstone, and Rasch.

Science.gov (United States)

Engelhard, George, Jr.

1992-01-01

A historical perspective is provided of the concept of invariance in measurement theory, describing sample-invariant item calibration and item-invariant measurement of individuals. Invariance as a key measurement concept is illustrated through the measurement theories of E. L. Thorndike, L. L. Thurstone, and G. Rasch. (SLD)

The Psychometric Toolbox: An Excel Package for Use in Measurement and Psychometrics Courses

Science.gov (United States)

Ferrando, Pere J.; Masip-Cabrera, Antoni; Navarro-González, David; Lorenzo-Seva, Urbano

2017-01-01

The Psychometric Toolbox (PT) is a user-friendly, non-commercial package mainly intended to be used for instructional purposes in introductory courses of educational and psychological measurement, psychometrics and statistics. The PT package is organized in six separate modules or sub-programs: Data preprocessor (descriptive analyses and data…
A Rasch and confirmatory factor analysis of the General Health Questionnaire (GHQ-12)

Directory of Open Access Journals (Sweden)

Velikova Galina

2010-04-01

Background: The General Health Questionnaire (GHQ-12) was designed as a short questionnaire to assess psychiatric morbidity. Despite the fact that studies have suggested a number of competing multidimensional factor structures, it continues to be largely used as a unidimensional instrument. This may have an impact on the identification of psychiatric morbidity in target populations. The aim of this study was to explore the dimensionality of the GHQ-12 and to evaluate a number of alternative models for the instrument. Methods: The data were drawn from a large heterogeneous sample of cancer patients. The Partial Credit Model (Rasch) was applied to the 12-item GHQ. Item misfit (infit mean square ≥ 1.3) was identified, misfitting items were removed, and unidimensionality and differential item functioning (age, gender, and treatment aims) were assessed. The factor structures of the various alternative models proposed in the literature were explored and optimum model fit evaluated using Confirmatory Factor Analysis. Results: The Rasch analysis of the 12-item GHQ identified six misfitting items. Removal of these items produced a six-item instrument which was not unidimensional. The Rasch analysis of an 8-item GHQ demonstrated two unidimensional structures corresponding to Anxiety/Depression and Social Dysfunction. No significant differential item functioning was observed by age, gender and treatment aims for the six- and eight-item GHQ. Two models competed for best fit from the confirmatory factor analysis, namely the GHQ-8 and Hankin's (2008) unidimensional model; however, the GHQ-8 produced the best overall fit statistics. Conclusions: The results are consistent with the evidence that the GHQ-12 is a multi-dimensional instrument. Use of the summated scores for the GHQ-12 could potentially lead to an incorrect assessment of patients' psychiatric morbidity. Further evaluation of the GHQ-12 with different target populations is warranted.
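The misfit criterion above (infit mean square ≥ 1.3) can be illustrated with a small sketch. The study fitted a Partial Credit Model to polytomous items; to keep the example short, the code below assumes dichotomous items and person measures that are already known, which is a simplification and not the authors' procedure. All names and data are hypothetical.

```python
import math

MISFIT_CUTOFF = 1.3  # infit mean square threshold quoted in the abstract

def rasch_p(theta, b):
    """Endorsement probability under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))

def item_fit(responses, thetas, difficulty):
    """Return (infit, outfit) mean squares for one dichotomous item.

    responses: 0/1 answers; thetas: person measures in logits (assumed
    known here); difficulty: item location in logits.
    """
    sq_resid, variances = [], []
    for x, theta in zip(responses, thetas):
        p = rasch_p(theta, difficulty)
        sq_resid.append((x - p) ** 2)   # squared residual
        variances.append(p * (1 - p))   # model variance of the response
    # Infit: information-weighted mean square.
    infit = sum(sq_resid) / sum(variances)
    # Outfit: unweighted mean of squared standardized residuals.
    outfit = sum(r / v for r, v in zip(sq_resid, variances)) / len(sq_resid)
    return infit, outfit

def is_misfitting(infit):
    """Apply the infit cutoff used in the abstract."""
    return infit >= MISFIT_CUTOFF
```

Values near 1.0 indicate responses about as noisy as the model expects; values well above 1.0 indicate more unexpected responses than the model predicts, which is what the cutoff flags.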
El modelo de Rasch en dirección de operaciones = The Rasch model in operations management

Directory of Open Access Journals (Sweden)

Lidia Sanchez

2012-11-01

For decades, the field of Operations Management has stressed the need to bring the academic and professional worlds closer together, calling for empirical studies that offer practical solutions to practitioners. As a result, two key developments have occurred in recent years: a convergence between research topics and the topics of interest to companies, and an increase in the number of empirical studies carried out. Another very important factor in contributing practical knowledge to the discipline, however, is the tool or methodology applied. The development of new tools, or the application of tools already established in other fields, is therefore a topic of interest. This is where the Rasch Methodology becomes important. The technique has traditionally been used in disciplines such as Psychology and Medicine. For some years now, however, it has begun to be used in other areas of knowledge, including Business Administration and Management. Its application to the specific area of Operations is nonetheless scarce, and the possibilities for future development and research are therefore numerous. The Rasch Methodology, useful for designing and exploiting surveys, rests on three principles (unidimensionality, additivity and invariance) and yields objective measures from the analysis of categorical variables. Among its many applications we highlight the following: analysis of overall viability and reliability, analysis of the unidimensionality of the construct, analysis of the questionnaire's scales, prioritization (ordering) of items and/or subjects, SWOT analysis… It is therefore a very rich methodology with a multitude of possibilities for application in the discipline. Given its incipient development in this area of knowledge, the objective of this
Psychometric Properties of the Theory of Mind Assessment Scale in a Sample of Adolescents and Adults.

Science.gov (United States)

Bosco, Francesca M; Gabbatore, Ilaria; Tirassa, Maurizio; Testa, Silvia

2016-01-01

This research aimed to evaluate the psychometric properties of the Theory of Mind Assessment Scale (Th.o.m.a.s.). Th.o.m.a.s. is a semi-structured interview meant to evaluate a person's Theory of Mind (ToM). It is composed of several questions organized in four scales, each focusing on one of the areas of knowledge in which such faculty may manifest itself: Scale A (I-Me) investigates first-order first-person ToM; Scale B (Other-Self) investigates third-person ToM from an allocentric perspective; Scale C (I-Other) again investigates third-person ToM, but from an egocentric perspective; and Scale D (Other-Me) investigates second-order ToM. The psychometric properties of Th.o.m.a.s. were evaluated in a sample of 156 healthy persons: 80 preadolescents and adolescents (aged 11-17 years, 42 females) and 76 adults (aged from 20 to 67 years, 35 females). Th.o.m.a.s. scores show good inter-rater agreement and internal consistency; the scores increase with age. Evidence of criterion validity was found as Scale B scores were correlated with those of an independent instrument for the evaluation of ToM, the Strange Stories task. Confirmatory factor analysis (CFA) showed good fit of the four-factor theoretical model to the data, although the four factors were highly correlated. For each of the four scales, Rasch analyses showed that, with few exceptions, items fitted the Partial Credit model and their functioning was invariant for gender and age. The results of this study, along with those of previous research with clinical samples, show that Th.o.m.a.s. is a promising instrument to assess ToM in different populations.
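The Partial Credit model referred to above extends the dichotomous Rasch model to items scored in ordered categories, with one step difficulty per transition between adjacent categories. The following is a minimal sketch of the category probabilities under the standard formulation; the function name and thresholds are hypothetical and are not Th.o.m.a.s. estimates.

```python
import math

def pcm_probabilities(theta, thresholds):
    """Category probabilities for one item under the Partial Credit model.

    theta: person location in logits.
    thresholds: step difficulties delta_1..delta_m in logits.
    Returns m + 1 probabilities, for categories 0..m.
    """
    # Cumulative sums of (theta - delta_j); category 0 has an empty sum.
    cumulative = [0.0]
    for delta in thresholds:
        cumulative.append(cumulative[-1] + (theta - delta))
    # Subtract the maximum before exponentiating for numerical stability.
    max_c = max(cumulative)
    weights = [math.exp(c - max_c) for c in cumulative]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical three-category item with step difficulties at -0.5 and 0.5.
example = pcm_probabilities(0.0, [-0.5, 0.5])
```

With a single threshold the expression reduces to the dichotomous Rasch model, so the two models can share the same person scale.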
Using the Rasch model as an objective and probabilistic technique to integrate different soil properties

Science.gov (United States)

Rebollo, Francisco J.; Jesús Moral García, Francisco

2016-04-01

Soil apparent electrical conductivity (ECa) is one of the simplest, least expensive soil measurements that integrates many soil properties affecting crop productivity, including, for instance, soil texture, water content, and cation exchange capacity. The ECa measurements obtained with a 3100 Veris sensor, operating in both shallow (0-30 cm), ECs, and deep (0-90 cm), ECd, mode, can be used as additional, essential information to be included in a probabilistic model, the Rasch model, with the aim of quantifying the overall soil fertility potential in an agricultural field. This quantification should integrate the main soil physical and chemical properties, with different units. In this work, the formulation of the Rasch model integrates 11 soil properties (clay, silt and sand content, organic matter -OM-, pH, total nitrogen -TN-, available phosphorus -AP- and potassium -AK-, cation exchange capacity -CEC-, ECd, and ECs) measured at 70 locations in a field. The main outputs of the model include a ranking of all soil samples according to their relative fertility potential and the unexpected behaviours of some soil samples and properties. In the case study, the considered soil variables fit the model reasonably, having an important influence on soil fertility, except pH, probably due to its homogeneity in the field. Moreover, ECd and ECs are the most influential properties on soil fertility, whereas AP and AK are the least influential. The use of the Rasch model to estimate soil fertility potential (always in a relative way, taking into account the characteristics of the studied soil) constitutes a new application of great practical importance, making it possible to rationally determine locations in a field where high soil fertility potential exists and to identify those soil samples or properties which have any anomaly; this information can be necessary to conduct site-specific treatments, leading to a more cost-effective and sustainable field
On the Psychometric Study of Human Life History Strategies.

Science.gov (United States)

Richardson, George B; Sanning, Blair K; Lai, Mark H C; Copping, Lee T; Hardesty, Patrick H; Kruger, Daniel J

2017-01-01

This article attends to recent discussions of validity in psychometric research on human life history strategy (LHS), provides a constructive critique of the extant literature, and describes strategies for improving construct validity. To place the psychometric study of human LHS on more solid ground, our review indicates that researchers should (a) use approaches to psychometric modeling that are consistent with their philosophies of measurement, (b) confirm the dimensionality of life history indicators, and (c) establish measurement invariance for at least a subset of indicators. Because we see confirming the dimensionality of life history indicators as the next step toward placing the psychometrics of human LHS on more solid ground, we use nationally representative data and structural equation modeling to test the structure of middle adult life history indicators. We found statistically independent mating competition and Super-K dimensions and the effects of parental harshness and childhood unpredictability on Super-K were consistent with past research. However, childhood socioeconomic status had a moderate positive effect on mating competition and no effect on Super-K, while unpredictability did not predict mating competition. We conclude that human LHS is more complex than previously suggested-there does not seem to be a single dimension of human LHS among Western adults and the effects of environmental components seem to vary between mating competition and Super-K.
On the Psychometric Study of Human Life History Strategies

Directory of Open Access Journals (Sweden)

George B. Richardson

2017-02-01

This article attends to recent discussions of validity in psychometric research on human life history strategy (LHS), provides a constructive critique of the extant literature, and describes strategies for improving construct validity. To place the psychometric study of human LHS on more solid ground, our review indicates that researchers should (a) use approaches to psychometric modeling that are consistent with their philosophies of measurement, (b) confirm the dimensionality of life history indicators, and (c) establish measurement invariance for at least a subset of indicators. Because we see confirming the dimensionality of life history indicators as the next step toward placing the psychometrics of human LHS on more solid ground, we use nationally representative data and structural equation modeling to test the structure of middle adult life history indicators. We found statistically independent mating competition and Super-K dimensions and the effects of parental harshness and childhood unpredictability on Super-K were consistent with past research. However, childhood socioeconomic status had a moderate positive effect on mating competition and no effect on Super-K, while unpredictability did not predict mating competition. We conclude that human LHS is more complex than previously suggested: there does not seem to be a single dimension of human LHS among Western adults and the effects of environmental components seem to vary between mating competition and Super-K.
Reconsidering the psychometrics of quality of life assessment in light of response shift and appraisal

Directory of Open Access Journals (Sweden)

Schwartz Carolyn E

2004-03-01

The increasing evidence for response shift phenomena in quality of life (QOL) assessment points to the necessity to reconsider both the measurement model and the application of psychometric analyses. The proposed psychometric model posits that the QOL true score is always contingent upon parameters of the appraisal process. This new model calls into question existing methods for establishing the reliability and validity of QOL assessment tools and suggests several new approaches for describing the psychometric properties of these scales. Recommendations for integrating the assessment of appraisal into QOL research and clinical practice are discussed.
Reconsidering the psychometrics of quality of life assessment in light of response shift and appraisal

Science.gov (United States)

Schwartz, Carolyn E; Rapkin, Bruce D

2004-01-01

The increasing evidence for response shift phenomena in quality of life (QOL) assessment points to the necessity to reconsider both the measurement model and the application of psychometric analyses. The proposed psychometric model posits that the QOL true score is always contingent upon parameters of the appraisal process. This new model calls into question existing methods for establishing the reliability and validity of QOL assessment tools and suggests several new approaches for describing the psychometric properties of these scales. Recommendations for integrating the assessment of appraisal into QOL research and clinical practice are discussed. PMID:15038830
The measurement of place attachment: validity and generalizability of a psychometric approach

Science.gov (United States)

Daniel R. Williams; Jerry J. Vaske

2003-01-01

To enhance land managers' ability to address deeper landscape meanings and place-specific symbolic values in natural resource decision making, this study evaluated the psychometric properties of a place attachment measure designed to capture the extent of emotions and feelings people have for places. Building on previous measurement efforts, this study examined the...
Genes, Culture and Conservatism-A Psychometric-Genetic Approach.

Science.gov (United States)

Schwabe, Inga; Jonker, Wilfried; van den Berg, Stéphanie M

2016-07-01

The Wilson-Patterson conservatism scale was psychometrically evaluated using homogeneity analysis and item response theory models. Results showed that this scale actually measures two different aspects in people: on the one hand people vary in their agreement with either conservative or liberal catch-phrases and on the other hand people vary in their use of the "?" response category of the scale. A 9-item subscale was constructed, consisting of items that seemed to measure liberalism, and this subscale was subsequently used in a biometric analysis including genotype-environment interaction, correcting for non-homogeneous measurement error. Biometric results showed significant genetic and shared environmental influences, and significant genotype-environment interaction effects, suggesting that individuals with a genetic predisposition for conservatism show more non-shared variance but less shared variance than individuals with a genetic predisposition for liberalism.
RhinAsthma patient perspective: A Rasch validation study.

Science.gov (United States)

Molinengo, Giorgia; Baiardini, Ilaria; Braido, Fulvio; Loera, Barbara

2018-02-01

In daily practice, Health-Related Quality of Life (HRQoL) tools are useful for supplementing clinical data with the patient's perspective. To encourage their use by clinicians, the availability of tools that can quickly provide valid results is crucial. A new HRQoL tool has been proposed for patients with asthma and rhinitis: the RhinAsthma Patient Perspective-RAPP. The aim of this study was to evaluate the psychometric robustness of the RAPP using the Item Response Theory (IRT) approach, to evaluate the scalability of items and test whether or not patients use the items response scale correctly. 155 patients (53.5% women, mean age 39.1, range 16-76) were recruited during a multicenter study. RAPP metric properties were investigated using IRT models. Differential item functioning (DIF) was used for gender, age, and asthma control test (ACT). The RAPP adequately fitted the Rating Scale model, demonstrating the equality of the rating scale structure for all items. All statistics on items were satisfactory. The RAPP had adequate internal reliability and showed good ability to discriminate among different groups of participants. DIF analysis indicated that there were no differential item functioning issues for gender. One item showed a DIF by age and four items by ACT. The psychometric evaluation performed using IRT models demonstrated that the RAPP met all the criteria to be considered a reliable and valid method of measurement. From a clinical perspective, this will allow physicians to confidently interpret scores as good indicators of Quality of Life of patients with asthma.
Linguistic validation of stigmatisation degree, self-esteem and knowledge questionnaire among asthma patients using Rasch analysis.

Science.gov (United States)

Ahmad, Sohail; Ismail, Ahmad Izuanuddin; Khan, Tahir Mehmood; Akram, Waqas; Mohd Zim, Mohd Arif; Ismail, Nahlah Elkudssiah

2017-04-01

The stigmatisation degree, self-esteem and knowledge either directly or indirectly influence the control and self-management of asthma. To date, there is no valid and reliable instrument that can assess these key issues collectively. The main aim of this study was to test the reliability and validity of the newly devised and translated "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" among adult asthma patients using the Rasch measurement model. This cross-sectional study recruited thirty adult asthma patients from two respiratory specialist clinics in Selangor, Malaysia. The newly devised self-administered questionnaire was adapted from relevant publications and translated into the Malay language using international standard translation guidelines. Content and face validation was done. The data were extracted and analysed for real item reliability and construct validation using the Rasch model. The translated "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" showed high real item reliability values of 0.90, 0.86 and 0.89 for stigmatisation degree, self-esteem, and knowledge of asthma, respectively. Furthermore, all values of point measure correlation (PTMEA Corr) analysis were within the acceptable specified range of the Rasch model. Infit/outfit mean square values and Z standard (ZSTD) values of each item verified the construct validity and suggested retaining all the items in the questionnaire. The reliability analyses and output tables of item measures for construct validation proved the translated Malaysian version of "Stigmatisation Degree, Self-Esteem and Knowledge Questionnaire" as a valid and highly reliable questionnaire.
Psychometrics

NARCIS (Netherlands)

Borsboom, D.; Molenaar, D.; Wright, J.D.

2015-01-01

Psychometrics is a scientific discipline concerned with the construction of measurement models for psychological data. In these models, a theoretical construct (e.g., intelligence) is systematically coordinated with observables (e.g., IQ scores). This is often done through latent variable models,
Latent trait standardization of the benzodiazepine dependence self-report questionnaire using the Rasch scaling model.

NARCIS (Netherlands)

Kan, C.C.; Ven, A.H.G.S. van der; Breteler, M.H.M.; Zitman, F.G.

2001-01-01

The aim of the present study was to obtain standardized scores that correspond with the raw scores on the four Rasch scales of the Benzodiazepine Dependence-Self Report Questionnaire (Bendep-SRQ). The eligible normative group for standardization of the Bendep-SRQ scales consisted of 217 general
Latent Trait Standardization of the Benzodiazepine Dependence Self-Report Questionnaire using the Rasch Scaling Model

NARCIS (Netherlands)

Kan, C.C.; Ven, A.H.G.S. van der; Breteler, M.H.M.; Zitman, F.G.

2001-01-01

The aim of the present study was to obtain standardized scores that correspond with the raw scores on the four Rasch scales of the Benzodiazepine Dependence-Self Report Questionnaire (Bendep-SRQ). The eligible normative group for standardization of the Bendep-SRQ scales consisted of 217 general
A psychometric validation analysis of Eysenck’s Neuroticism and Extraversion Scales in a sample of first time depressed patients

DEFF Research Database (Denmark)

Møller, Stine Bjerrum; Bech, Per; Kessing, Lars Vedel

2015-01-01

Eysenck and Eysenck identified the two-factor structure of personality, namely neuroticism and extraversion which has been widely used in clinical psychiatry, and generated much research on the psychometric properties of the scales. Using a classical psychometric approach the neuroticism...... and extraversion scales have shown robust psychometric properties. The present study used both classical psychometric and item response theory (IRT) analyses to evaluate the neuroticism and extraversion scales and improve scalability of the instrument neuroticism and extraversion. A first time depressed sample...... symptoms related to interpersonal sensitivity were identified. For the extraversion scale a shorter and psychometrically more robust version was identified together with a short introversion scale. Clinically discriminant validity was analysed using correlations. The correlation between depression (Ham...
Catquest-9SF questionnaire: validation of Malay and Chinese-language versions using Rasch analysis.

Science.gov (United States)

Adnan, Tassha Hilda; Mohamed Apandi, Mokhlisoh; Kamaruddin, Haireen; Salowi, Mohamad Aziz; Law, Kian Boon; Haniff, Jamaiyah; Goh, Pik Pin

2018-01-05

The Catquest questionnaire was originally developed in Swedish to measure patients' self-assessed visual function and to evaluate the benefit of cataract surgery. Rasch analysis led to the creation of the nine-item short form, Catquest-9SF, which has been translated into and validated in English. The aim of this study was to evaluate the Malay and Chinese (Mandarin) translations of the Catquest-9SF for measuring patient-reported visual function in the cataract population in Malaysia. The English version of the Catquest-9SF was translated and back-translated into Malay and Chinese. The Malay and Chinese versions were self-administered by 236 and 202 pre-operative patients, respectively, drawn from a cataract surgery waiting list. The translated Catquest-9SF data and its four response options were assessed for fit to the Rasch model. The Catquest-9SF performed well in both translated versions, fulfilling all criteria for valid measurement as demonstrated by Rasch analysis. Both versions had ordered response thresholds, with good person separation (Malay 2.84; Chinese 2.59) and patient separation reliability (Malay 0.89; Chinese 0.87). Targeting was 0.30 and -0.11 logits in the Malay and Chinese versions respectively, indicating that item difficulty was well suited to the visual abilities of the patients. All items fit a single overall construct (Malay infit range 0.85-1.26, outfit range 0.73-1.13; Chinese infit range 0.80-1.51, outfit range 0.71-1.36), and the scale was unidimensional by principal components analysis and free of differential item functioning (DIF). These results support the good overall functioning of the Catquest-9SF in patients with cataract. The Malay and Chinese versions of the questionnaire are reliable and valid for measuring visual disability outcomes in the Malaysian cataract population.
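The person separation and separation reliability figures in the abstract above are two views of the same quantity. A minimal sketch, assuming the conventional Rasch relation R = G^2 / (1 + G^2) between a separation index G and its reliability R (the abstract does not state the formula, and the function names are illustrative):

```python
def reliability_from_separation(g: float) -> float:
    """Convert a separation index G into a separation reliability.

    Uses the conventional relation R = G^2 / (1 + G^2).
    """
    if g < 0:
        raise ValueError("separation index must be non-negative")
    return g * g / (1.0 + g * g)


def separation_from_reliability(r: float) -> float:
    """Invert the relation: G = sqrt(R / (1 - R)), defined for 0 <= R < 1."""
    if not 0.0 <= r < 1.0:
        raise ValueError("reliability must lie in [0, 1)")
    return (r / (1.0 - r)) ** 0.5


if __name__ == "__main__":
    # Person separation values reported for the two translated versions.
    for label, g in (("Malay", 2.84), ("Chinese", 2.59)):
        r = reliability_from_separation(g)
        print(f"{label}: G = {g:.2f} -> R = {r:.2f}")
```

Under this assumption the reported separations of 2.84 and 2.59 correspond to reliabilities of about 0.89 and 0.87, which matches the values quoted in the abstract.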
An introduction to Item Response Theory and Rasch Analysis of the Eating Assessment Tool (EAT-10).

Science.gov (United States)

Kean, Jacob; Brodke, Darrel S; Biber, Joshua; Gross, Paul

2018-03-01

Item response theory has its origins in educational measurement and is now commonly applied in health-related measurement of latent traits, such as function and symptoms. This application is due in large part to gains in the precision of measurement attributable to item response theory and corresponding decreases in response burden, study costs, and study duration. The purpose of this paper is twofold: introduce basic concepts of item response theory and demonstrate this analytic approach in a worked example, a Rasch model (1PL) analysis of the Eating Assessment Tool (EAT-10), a commonly used measure for oropharyngeal dysphagia. The results of the analysis were largely concordant with previous studies of the EAT-10 and illustrate for brain impairment clinicians and researchers how IRT analysis can yield greater precision of measurement.
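For readers new to the model named in the abstract above, the following is a minimal sketch of the dichotomous Rasch (1PL) response function, in which the probability of endorsing an item depends only on the difference between person ability and item difficulty on the logit scale. It is an illustration only: the EAT-10 uses graded response options, so an actual analysis would use a polytomous extension, and the function names here are assumptions.

```python
import math


def rasch_probability(theta: float, b: float) -> float:
    """P(X = 1 | theta, b) for the dichotomous Rasch model.

    theta: person ability in logits; b: item difficulty in logits.
    """
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_information(theta: float, b: float) -> float:
    """Fisher information of one Rasch item at ability theta: p * (1 - p)."""
    p = rasch_probability(theta, b)
    return p * (1.0 - p)


if __name__ == "__main__":
    # A person whose ability equals the item difficulty has a 50% chance
    # of endorsing the item, and the item is most informative there.
    for theta in (-2.0, 0.0, 2.0):
        p = rasch_probability(theta, 0.0)
        info = item_information(theta, 0.0)
        print(f"theta = {theta:+.1f}: p = {p:.3f}, information = {info:.3f}")
```

Item information peaks where ability matches difficulty, which is one way to read the gains in precision that the abstract attributes to item response theory.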
Development of a foot impact scale for rheumatoid arthritis.

Science.gov (United States)

Helliwell, Philip; Reay, Naomi; Gilworth, Gill; Redmond, Anthony; Slade, Anita; Tennant, Alan; Woodburn, James

2005-06-15

To develop a new foot impact scale to assess foot status in rheumatoid arthritis (RA) using established qualitative methodology and the latest item response techniques (Rasch analysis). Foot problems in RA were explored by conducting qualitative interviews that were then used to generate items for a new foot impact scale. Further validation was undertaken following postal surveys and Rasch analysis. Analysis of the first postal survey (n = 192 responses) produced a 63-item binary response, 4-subscale instrument. The 4 subscales covered the domains impairment, activities, participation, and footwear. Following test-retest postal surveys and additional analysis, the instrument was reduced to a 2 subscale, 51-item questionnaire covering the domains of impairments/shoes and activities/participation. Initial results of these subscales indicate good psychometric properties, external validity, and test-retest reliability. A foot impact scale to assess the impact of RA and to measure the effect of interventions has been developed. The 2 scales comprising the instrument demonstrate good psychometric properties.

Mixing Interviews and Rasch Modeling: Demonstrating a Procedure Used to Develop an Instrument That Measures Trust

Science.gov (United States)

David, Shannon L.; Hitchcock, John H.; Ragan, Brian; Brooks, Gordon; Starkey, Chad

2018-01-01

Developing psychometrically sound instruments can be difficult, especially if little is known about the constructs of interest. When constructs of interest are unclear, a mixed methods approach can be useful. Qualitative inquiry can be used to explore a construct's meaning in a way that informs item writing and allows the strengths of one analysis…
Psychometric characteristics of health-related quality-of-life questionnaires in oropharyngeal dysphagia.

Science.gov (United States)

Timmerman, Angelique A; Speyer, Renée; Heijnen, Bas J; Klijn-Zwijnenberg, Iris R

2014-04-01

Dysphagia can have severe consequences for the patient's health, influencing health-related quality of life (HRQoL). Sound psychometric properties of HRQoL questionnaires are a precondition for assessing the impact of dysphagia. These properties are the focus of this study, which results in recommendations for the appropriate use of these questionnaires in both clinical practice and research contexts. We performed a systematic review starting with a search for and retrieval of all full-text articles on the development of HRQoL questionnaires related to oropharyngeal dysphagia and/or their psychometric validation from the electronic databases PubMed and Embase published up to June 2011. Psychometric properties were judged according to quality criteria proposed for health status questionnaires. Eight questionnaires were included in this study. Four are aimed solely at HRQoL in oropharyngeal dysphagia: the deglutition handicap index (DHI), dysphagia handicap index (DHI'), M.D. Anderson Dysphagia Inventory (MDADI), and SWAL-QOL, while the EDGQ, EORTC QLQ-STO 22, EORTC QLQ-OG 25 and EORTC QLQ-H&N35 focus on other primary diseases resulting in dysphagia. The psychometric properties of the DHI, DHI', MDADI, and SWAL-QOL were evaluated. For appropriate applicability of HRQoL questionnaires, strong scores on the psychometric criteria face validity, criterion validity, and interpretability are prerequisites. The SWAL-QOL has the strongest ratings for these criteria, while the DHI' is the easiest to apply given its 25 items and the use of a uniform scoring format. For optimal use of HRQoL questionnaires in diverse settings, it is necessary to combine psychometric and utility approaches.
Saving energy in 1-D : tailoring energy-saving advice using a Rasch-based energy recommender system

NARCIS (Netherlands)

Starke, Alain; Willemsen, Martijn; Snijders, Chris; Ge, Mouhzi; Ricci, Francesco

2015-01-01

Although there are numerous possibilities to save energy, conservation initiatives often do not tailor their content to the consumer. By considering energy conservation as a one-dimensional construct, where different behaviors have different execution difficulties, we have set out a Rasch-based
Development of Rasch-based item banks for the assessment of work performance in patients with musculoskeletal diseases.

Science.gov (United States)

Mueller, Evelyn A; Bengel, Juergen; Wirtz, Markus A

2013-12-01

This study aimed to develop a self-description assessment instrument to measure work performance in patients with musculoskeletal diseases. In terms of the International Classification of Functioning, Disability and Health (ICF), work performance is defined as the degree of meeting the work demands (activities) at the actual workplace (environment). To account for the fact that work performance depends on the work demands of the job, we strived to develop item banks that allow a flexible use of item subgroups depending on the specific work demands of the patients' jobs. Item development included the collection of work tasks from literature and content validation through expert surveys and patient interviews. The resulting 122 items were answered by 621 patients with musculoskeletal diseases. Exploratory factor analysis to ascertain dimensionality and Rasch analysis (partial credit model) for each of the resulting dimensions were performed. Exploratory factor analysis resulted in four dimensions, and subsequent Rasch analysis led to the following item banks: 'impaired productivity' (15 items), 'impaired cognitive performance' (18), 'impaired coping with stress' (13) and 'impaired physical performance' (low physical workload 20 items, high physical workload 10 items). The item banks exhibited person separation indices (reliability) between 0.89 and 0.96. The assessment of work performance adds the activities component to the more commonly employed participation component of the ICF-model. The four item banks can be adapted to specific jobs where necessary without losing comparability of person measures, as the item banks are based on Rasch analysis.
Improving the psychometric properties of dot-probe attention measures using response-based computation.

Science.gov (United States)

Evans, Travis C; Britton, Jennifer C

2018-09-01

Abnormal threat-related attention in anxiety disorders is most commonly assessed and modified using the dot-probe paradigm; however, poor psychometric properties of reaction-time measures may contribute to inconsistencies across studies. Typically, standard attention measures are derived using average reaction-times obtained in experimentally-defined conditions. However, current approaches based on experimentally-defined conditions are limited. In this study, the psychometric properties of a novel response-based computation approach to analyze dot-probe data are compared to standard measures of attention. 148 adults (19.19 ± 1.42 years, 84 women) completed a standardized dot-probe task including threatening and neutral faces. We generated both standard and response-based measures of attention bias, attentional orientation, and attentional disengagement. We compared overall internal consistency, number of trials necessary to reach internal consistency, test-retest reliability (n = 72), and criterion validity obtained using each approach. Compared to standard attention measures, response-based measures demonstrated uniformly high levels of internal consistency with relatively few trials and varying improvements in test-retest reliability. Additionally, response-based measures demonstrated specific evidence of anxiety-related associations above and beyond both standard attention measures and other confounds. Future studies are necessary to validate this approach in clinical samples. Response-based attention measures demonstrate superior psychometric properties compared to standard attention measures, which may improve the detection of anxiety-related associations and treatment-related changes in clinical samples. Copyright © 2018 Elsevier Ltd. All rights reserved.
Villa Marie Nursing Home, Grange, Templemore Road, Roscrea, Tipperary.

LENUS (Irish Health Repository)

Hardouin, Jean-Benoit

2011-07-14

Background: More and more clinical scales consisting of patients' responses to a set of items (Patient Reported Outcomes, PRO) are validated with models based on Item Response Theory, and more specifically with a Rasch model. Missing data are frequent in the validation sample. The aim of this paper is to compare sixteen methods for handling missing data (mainly based on simple imputation) in the context of psychometric validation of PRO by a Rasch model. The main indexes used for validation by a Rasch model are compared. Methods: A simulation study was performed that covered several cases, notably whether or not the missing values were informative, and the rate of missing data. Results: Several imputation methods bias the psychometric indexes (generally, the imputation methods artificially improve the psychometric qualities of the scale). In particular, this is the case for the method based on the Personal Mean Score (PMS), which is the imputation method most commonly used in practice. Conclusions: Several imputation methods should be avoided, in particular PMS imputation. More generally, it is important to use an imputation method that considers both the ability of the patient (measured, for example, by his/her score) and the difficulty of the item (measured, for example, by its rate of favourable responses). Another recommendation is always to consider adding a random process to the imputation method, because such a process reduces the bias. Finally, analysis without imputation of the missing data (available case analysis) is an interesting alternative to simple imputation in this context.
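The contrast drawn in the abstract above can be sketched for dichotomous items. The first function is a plain Personal Mean Score fill. The second is a hypothetical "two-way" style fill that combines the person's observed rate with the item's favourable-response rate and then draws the value at random. Both are illustrations under stated assumptions, not a reproduction of any of the sixteen methods compared in the paper; the function names and the additive combination rule are assumptions.

```python
import random
from typing import List, Optional

Row = List[Optional[int]]  # one person's 0/1 responses; None marks a missing answer


def person_mean_impute(data: List[Row]) -> List[List[int]]:
    """Fill each missing answer with the person's own observed mean, rounded to 0/1."""
    filled = []
    for row in data:
        observed = [x for x in row if x is not None]
        mean = sum(observed) / len(observed) if observed else 0.0
        fill = 1 if mean >= 0.5 else 0
        filled.append([fill if x is None else x for x in row])
    return filled


def two_way_random_impute(data: List[Row], rng: random.Random) -> List[List[int]]:
    """Draw each missing answer from a Bernoulli whose probability reflects
    both the person (observed rate) and the item (favourable-response rate)."""
    observed_all = [x for row in data for x in row if x is not None]
    if not observed_all:
        raise ValueError("no observed responses to impute from")
    overall = sum(observed_all) / len(observed_all)

    # Favourable-response rate of each item, a crude proxy for its difficulty.
    item_rate = []
    for j in range(len(data[0])):
        column = [row[j] for row in data if row[j] is not None]
        item_rate.append(sum(column) / len(column) if column else overall)

    filled = []
    for row in data:
        observed = [x for x in row if x is not None]
        person_rate = sum(observed) / len(observed) if observed else overall
        new_row = []
        for j, x in enumerate(row):
            if x is None:
                # Assumed additive rule: person rate + item rate - overall rate.
                p = min(1.0, max(0.0, person_rate + item_rate[j] - overall))
                new_row.append(1 if rng.random() < p else 0)
            else:
                new_row.append(x)
        filled.append(new_row)
    return filled


if __name__ == "__main__":
    sample = [[1, 1, None], [0, None, 0], [1, 0, 1]]
    print(person_mean_impute(sample))
    print(two_way_random_impute(sample, random.Random(0)))
```

The deterministic fill always reproduces the person's dominant response, which is one plausible reason such a method could make a scale look more consistent than it is; the random draw keeps some of the response variability.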
The patient satisfaction questionnaire of EUprimecare project: measurement properties.

Science.gov (United States)

Cimas, Marta; Ayala, Alba; García-Pérez, Sonia; Sarria-Santamera, Antonio; Forjaz, Maria João

2016-06-01

The measurement of patient satisfaction is considered an essential outcome indicator to evaluate health care quality. Patient satisfaction is considered a multi-dimensional construct, which would include a variety of domains. Although a large number of studies have proposed scales to measure patient satisfaction, there is a lack of psychometric information on them. This study aims to describe the psychometric properties of the Primary Care Satisfaction Scale (PCSS) of the EUprimecare project. A cross-sectional survey of patient satisfaction with primary care was carried out by telephone interview. Primary care services of Estonia, Finland, Germany, Hungary, Lithuania, Italy and Spain. A total of 3020 adult patients aged 18-65 years old attending primary care services. Classic psychometric properties were analysed and Rasch analysis was used to assess the following measurement properties: fit to the Rasch model; uni-dimensionality; reliability; differential item functioning (DIF) by gender, age, civil status, area of residency and country; local independency; adequacy of response scale; and scale targeting. To achieve good fit to the Rasch model, the original response scales of three items (1, 2 and 6) were rescored and Item 3 (waiting time in the room) was removed. The scale was uni-dimensional and Person Separation Index was 0.79, indicating a good reliability. All items were free from bias. PCSS linear measure displayed satisfactory convergent validity with overall satisfaction with primary care. PCSS, as a reliable and valid scale, could be used to measure patient satisfaction in primary care in Europe. © The Author 2016. Published by Oxford University Press in association with the International Society for Quality in Health Care; all rights reserved.
Reliability and validity of the Turkish version of the Rapid Estimate of Adult Literacy in Dentistry (TREALD-30).

Science.gov (United States)

Peker, Kadriye; Köse, Taha Emre; Güray, Beliz; Uysal, Ömer; Erdem, Tamer Lütfi

2017-04-01

To culturally adapt the Turkish version of Rapid Estimate of Adult Literacy in Dentistry (TREALD-30) for Turkish-speaking adult dental patients and to evaluate its psychometric properties. After translation and cross-cultural adaptation, TREALD-30 was tested in a sample of 127 adult patients who attended a dental school clinic in Istanbul. Data were collected through clinical examinations and self-completed questionnaires, including TREALD-30, the Oral Health Impact Profile (OHIP), the Rapid Estimate of Adult Literacy in Medicine (REALM), two health literacy screening questions, and socio-behavioral characteristics. Psychometric properties were examined using Classical Test Theory (CTT) and Rasch analysis. Internal consistency (Cronbach's alpha = 0.91) and test-retest reliability (intraclass correlation coefficient = 0.99) were satisfactory for TREALD-30. It exhibited good convergent and predictive validity. Monthly family income, years of education, dental flossing, health literacy, and health literacy skills were found to be stronger predictors of patients' oral health literacy (OHL). Confirmatory factor analysis (CFA) confirmed a two-factor model. The Rasch model explained 37.9% of the total variance in this dataset. In addition, TREALD-30 had eleven misfitting items, which indicated evidence of multidimensionality. The reliability indices provided in Rasch analysis (person separation reliability = 0.91 and expected-a-posteriori/plausible reliability = 0.94) indicated that TREALD-30 had acceptable reliability. TREALD-30 showed satisfactory psychometric properties. It may be used to identify patients with low OHL. Socio-demographic factors, oral health behaviors and health literacy skills should be taken into account when planning future studies to assess OHL in both clinical and community settings.
Rasch Analysis of the Activities-specific Balance Confidence (ABC) Scale in Older Adults Seeking Outpatient Rehabilitation Services.

Science.gov (United States)

Wang, Ying-Chih; Sindhu, Bhagwant; Lehman, Leigh; Li, Xiaoyan; Yen, Sheng-Che; Kapellusch, Jay

2018-03-30

Study Design: Cross-sectional study of 5,012 older patients seeking outpatient rehabilitation therapy in 123 clinics. Background: The Activities-Specific Balance Confidence (ABC) Scale measures confidence in performing various ambulatory activities without falling or experiencing a sense of unsteadiness. Objectives: Our purposes were to (1) examine the ABC Scale (0-100) using Rasch analysis, (2) assess statistically reliable change, and (3) develop a functional staging to guide clinical interpretation of the patient's improvement. Methods: We examined rating scale structure, item difficulty hierarchy, item fit, person-item match, separation index, differential item functioning (DIF), test precision, and unidimensionality. Additionally, we estimated the minimal detectable change (MDC) and developed a functional staging. Results: The item 'walking outside on icy sidewalks' was the most difficult, while 'reach for a small can off a shelf at eye level' was the easiest. Overall, the average patient ability estimate of 56.2 (20.3) was slightly higher than the average item difficulty estimate of 45.9 (7.8). With a separation index of 3.65, the ABC items can differentiate persons into 5.2 statistically distinct strata. Most ABC items were free of DIF. For example, 'walk outside on icy sidewalks' was easier for patients who were underweight. Results supported unidimensionality of the ABC Scale, with the first factor explaining 77% of the total variance. The estimated MDC was 15 points. We provided an example of functional staging application. Conclusion: Results supported sound psychometric properties and clinical usage of the ABC Scale for older adults seeking outpatient rehabilitation therapy. Level of Evidence: 2c. J Orthop Sports Phys Ther, Epub 30 Mar 2018. doi:10.2519/jospt.2018.8023.
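The step from a separation index of 3.65 to 5.2 strata in the abstract above is consistent with the commonly used strata formula H = (4G + 1) / 3. The abstract does not name the formula, so the sketch below is an assumption about how the figure was obtained; the function name is illustrative.

```python
def distinct_strata(separation: float) -> float:
    """Number of statistically distinct strata, H = (4G + 1) / 3,
    for a separation index G."""
    if separation < 0:
        raise ValueError("separation index must be non-negative")
    return (4.0 * separation + 1.0) / 3.0


if __name__ == "__main__":
    g = 3.65  # separation index reported for the ABC Scale
    print(f"G = {g} -> about {distinct_strata(g):.1f} strata")
```

With G = 3.65 the formula gives (14.6 + 1) / 3 = 5.2, the value reported in the abstract.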
Organizational Culture Influence On Total Productive Maintenance (TPM) and Operational Performance Using RASCH Model Analysis

Directory of Open Access Journals (Sweden)

Mohd Norhasni Mohd Asaad

2014-02-01

Abstract. Market globalization, competitive products and services, and severe economic crises are the most critical factors that influence the success of manufacturing companies in the global market. It is therefore critical for manufacturing companies to be efficient in production, and lean tools may be used to achieve that. The most frequently used is Total Preventive Maintenance (TPM). Although many studies have been conducted in relation to TPM, there is limited research investigating its effects on operational performance. Moreover, the results of those studies were not consistent: TPM practice may have a positive or a negative impact on operational performance. One reason is the culture of the organization, which influences the implementation of TPM and operational performance. This study therefore attempts to investigate the influence of organizational culture on TPM implementation and operational performance. The Rasch model is used in this study because of its ability to interpret and analyze the ability of respondents in performing the difficult items. Online questionnaires were distributed to 63 randomly selected automotive companies located in the Northern Region of Malaysia. The results revealed that organizational culture influenced the successful implementation of TPM and operational performance. Implementing TPM in an outstanding organizational culture can therefore improve operational performance. Keywords: Total Preventive Maintenance (TPM), Lean manufacturing, Operational performance, Organizational culture, Rasch model. doi:10.12695/ajtm.2013.6.2.2. How to cite this article: Mohd Asaad, M.N. and Yusoff, R.Z. (2013). Organizational Culture Influence On Total Productive Maintenance (TPM) and Operational Performance Using RASCH Model Analysis. The Asian Journal of Technology Management 6 (2): 72-81. Print ISSN: 1978-6956; Online ISSN: 2089-791X. doi:10.12695/ajtm
Rasch Analysis of the Bruininks-Oseretsky Test of Motor Proficiency--Second Edition in Intellectual Disabilities

Science.gov (United States)

Wuang, Yee-Pay; Lin, Yueh-Hsien; Su, Chwen-Yng

2009-01-01

The Bruininks-Oseretsky Test of Motor Proficiency-Second Edition (BOT-2) is widely used to assess motor skills for both clinical and research purposes; however, its validity has not been adequately assessed in intellectual disabilities (ID). This study used partial credit Rasch model to examine the measurement properties of the BOT-2 among 446…
Using the Mixture Rasch Model to Explore Knowledge Resources Students Invoke in Mathematic and Science Assessments

Science.gov (United States)

Zhang, Danhui; Orrill, Chandra; Campbell, Todd

2015-01-01

The purpose of this study was to investigate whether mixture Rasch models followed by qualitative item-by-item analysis of selected Programme for International Student Assessment (PISA) mathematics and science items offered insight into knowledge students invoke in mathematics and science separately and combined. The researchers administered an…
Development of Patient-reported Outcomes Measure of Pharmaceutical Therapy for Quality of Life (PROMPT-QoL): A novel instrument for medication management.

Science.gov (United States)

Sakthong, Phantipa; Suksanga, Phattrapa; Sakulbumrungsil, Rungpetch; Winit-Watjana, Win

2015-01-01

Medicines can affect a patient's health-related quality of life (HRQoL), but there exists no standardized HRQoL measure for medication management. To develop the new HRQoL instrument "Patient-reported Outcomes Measure of Pharmaceutical Therapy for Quality of Life" (PROMPT-QoL), and to evaluate its content validity and preliminary psychometrics using a Rasch model. The PROMPT-QoL questionnaire was developed through concept review, item generation, cognitive interviews, and initial psychometric evaluation. Its first draft was initially tested in Round-1 interviews of 120 adult outpatients who had been taking their medicines continuously for at least three months. The final draft with 43 items was then constructed and checked by 10 physicians and 5 pharmacists for questionnaire importance and content validity. Round-2 interviews in six patient groups of 10 patients each were conducted to elicit patients' understanding of the questionnaire and to assess preliminary psychometrics using Rasch analysis, including fit statistics and person and item reliabilities. The 43-item PROMPT-QoL comprised 10 domains: General Attitude toward Medication Use, Medicine Information, Disease Information, Medicine Effectiveness, Impacts of Medicines and Side-effects, Psychological Impacts of Medication Use, Convenience, Availability and Accessibility, Therapeutic Relationship with Healthcare Providers, and Overall QoL. Based on the patient interviews and expert review, the questionnaire was considered important, useful, and comprehensive. All items and domains yielded content validity indexes above the acceptable values of 0.80 and 0.90, respectively. In Round 2, thirty-nine problems identified in Group 1 were reduced to two issues in Group 6 after amendments. The Rasch analysis revealed that eight items misfit the model and that two domains were reliable in both person and item terms (Medicine Information and Psychological Impacts of Medication Use). The newly developed PROMPT-QoL has favorable content
Computational Psychometrics for the Measurement of Collaborative Problem Solving Skills

Science.gov (United States)

Polyak, Stephen T.; von Davier, Alina A.; Peterschmidt, Kurt

2017-01-01

This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD) and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses. PMID:29238314
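The real-time analysis described in the abstract above, updating skill-level probabilities with Bayes' rule each time a tagged response option is selected, can be sketched as follows. The three skill levels, the option names and every number in the conditional probability table are hypothetical placeholders, not values from the study.

```python
from typing import Dict, Iterable

Belief = Dict[str, float]  # probability of each skill level

# Hypothetical conditional probability table: P(selected option | skill level).
# For each skill level the probabilities over the three options sum to 1.
CPT: Dict[str, Belief] = {
    "option_a": {"low": 0.6, "medium": 0.3, "high": 0.1},
    "option_b": {"low": 0.3, "medium": 0.4, "high": 0.3},
    "option_c": {"low": 0.1, "medium": 0.3, "high": 0.6},
}


def bayes_update(prior: Belief, likelihood: Belief) -> Belief:
    """Posterior is proportional to prior times likelihood, then normalised."""
    unnormalised = {level: prior[level] * likelihood[level] for level in prior}
    total = sum(unnormalised.values())
    if total == 0.0:
        raise ValueError("evidence has zero probability under the prior")
    return {level: value / total for level, value in unnormalised.items()}


def trace_evidence(prior: Belief, selections: Iterable[str]) -> Belief:
    """Apply one Bayes update per selected dialog option, in order."""
    belief = dict(prior)
    for option in selections:
        belief = bayes_update(belief, CPT[option])
    return belief


if __name__ == "__main__":
    # Start from limited knowledge: a uniform prior over the skill levels.
    uniform = {"low": 1 / 3, "medium": 1 / 3, "high": 1 / 3}
    posterior = trace_evidence(uniform, ["option_c", "option_b", "option_c"])
    for level, probability in posterior.items():
        print(f"{level}: {probability:.3f}")
```

Each update uses the previous posterior as the new prior, so the belief can be refreshed after every conversation event rather than only at the end of a session.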
Computational Psychometrics for the Measurement of Collaborative Problem Solving Skills

Directory of Open Access Journals (Sweden)

Stephen T. Polyak

2017-11-01

This paper describes a psychometrically-based approach to the measurement of collaborative problem solving skills, by mining and classifying behavioral data both in real-time and in post-game analyses. The data were collected from a sample of middle school children who interacted with a game-like, online simulation of collaborative problem solving tasks. In this simulation, a user is required to collaborate with a virtual agent to solve a series of tasks within a first-person maze environment. The tasks were developed following the psychometric principles of Evidence Centered Design (ECD) and are aligned with the Holistic Framework developed by ACT. The analyses presented in this paper are an application of an emerging discipline called computational psychometrics which is growing out of traditional psychometrics and incorporates techniques from educational data mining, machine learning and other computer/cognitive science fields. In the real-time analysis, our aim was to start with limited knowledge of skill mastery, and then demonstrate a form of continuous Bayesian evidence tracing that updates sub-skill level probabilities as new conversation flow event evidence is presented. This is performed using Bayes' rule and conversation item conditional probability tables. The items are polytomous and each response option has been tagged with a skill at a performance level. In our post-game analysis, our goal was to discover unique gameplay profiles by performing a cluster analysis of user's sub-skill performance scores based on their patterns of selected dialog responses.
Measuring Engagement in Later Life Activities: Rasch-Based Scenario Scales for Work, Caregiving, Informal Helping, and Volunteering

Science.gov (United States)

Ludlow, Larry H.; Matz-Costa, Christina; Johnson, Clair; Brown, Melissa; Besen, Elyssa; James, Jacquelyn B.

2014-01-01

The development of Rasch-based "comparative engagement scenarios" based on Guttman's facet theory and sentence mapping procedures is described. The scenario scales measuring engagement in work, caregiving, informal helping, and volunteering illuminate the lived experiences of role involvement among older adults and offer multiple…
Measuring Attending Behavior and Short-Term Memory with Knox's Cube Test.

Science.gov (United States)

Stone, Mark H.; Wright, Benjamin D.

1983-01-01

A new revision was developed using Rasch psychometric techniques to build a Knox's Cube Test (KCT) variable and item bank using the tapping series from all previous editions. The report forms developed give a clear picture of the subject's performance set in a context that is both normative and criterion. (Author/BW)
Differences between Mothers' and Fathers' Ratings of Family Functioning with the Family Assessment Device: The Validity of Combined Parent Scores

Science.gov (United States)

Cooke, Dawson; Marais, Ida; Cavanagh, Robert; Kendall, Garth; Priddis, Lynn

2015-01-01

The psychometric properties of the General Functioning subscale of the McMaster Family Assessment Device were examined using the Rasch Model (N = 237 couples). Mothers' and fathers' ratings of the General Functioning subscale of the McMaster Family Assessment Device are recommended, provided these are analyzed separately. More than a quarter of…
Author Details

African Journals Online (AJOL)

Matore, M. E. E. M. Vol 9, No 6S (2017) - Articles: Improving the psychometric properties of the Mooney problem checklist by using Rasch measurement model. ISSN: 1112-9867.

Development and psychometric evaluation of the breast size satisfaction scale.

Science.gov (United States)

Pahlevan Sharif, Saeed

2017-10-09

Purpose: The purpose of this paper is to develop and evaluate psychometrically an instrument named the Breast Size Satisfaction Scale (BSSS) to assess breast size satisfaction. Design/methodology/approach: The present scale was developed using a set of 16 computer-generated 3D images of breasts to overcome some of the limitations of existing instruments. The images were presented to participants and they were asked to select the figure that most accurately depicted their actual breast size and the figure that most closely represented their ideal breast size. Breast size satisfaction was computed by subtracting the absolute value of the difference between ideal and actual perceived size from 16, such that higher values indicate greater breast size satisfaction. Findings: Study 1 (n=65 female undergraduate students) showed good test-retest reliability and study 2 (n=1,000 Iranian women, aged 18 years and above) provided support for convergent validity using a nomological network approach. Originality/value: The BSSS demonstrated good psychometric properties and thus can be used in future studies to assess breast size satisfaction among women.
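The scoring rule stated in the abstract above (16 minus the absolute difference between ideal and actual perceived size) is simple enough to write out. In the sketch below the function name and the assumption that the images are numbered 1 to 16 are ours; the abstract does not say how the images are indexed.

```python
def bsss_score(actual, ideal, n_images=16):
    """Breast size satisfaction as described in the abstract: n_images - |ideal - actual|.

    Assumes (hypothetically) that the images are numbered 1..n_images.
    """
    for value in (actual, ideal):
        if not 1 <= value <= n_images:
            raise ValueError("image index must be between 1 and n_images")
    return n_images - abs(ideal - actual)


# A participant whose actual and ideal figures coincide gets the maximum score.
same = bsss_score(actual=8, ideal=8)
# A participant whose ideal is seven figures away from her actual size scores lower.
far = bsss_score(actual=3, ideal=10)
```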
Development of a Performance-Based Measure of Executive Functions in Patients with Schizophrenia.

Directory of Open Access Journals (Sweden)

En-Chi Chiu

A performance-based measure for assessing executive functions (EF) is useful to understand patients' real life performance of EF. This study aimed to develop a performance-based measure of executive functions (PEF) based on the Lezak model and to examine psychometric properties (i.e., unidimensionality and reliability) of the PEF using Rasch analysis in patients with schizophrenia. We developed the PEF in three phases: (1) designing the preliminary version of PEF; (2) consultation with experts, cognitive interviews with patients, and pilot tests on patients to revise the preliminary PEF; (3) establishment of the final version of the PEF and examination of unidimensionality and Rasch reliability. Two hundred patients were assessed using the revised PEF. After deleting items which did not satisfy the Rasch model's expectations, the final version of the PEF contained 1 practice item and 13 test items for assessing the four domains of EF (i.e., volition, planning, purposive action, and effective performance). For unidimensional and multidimensional Rasch analyses, the 4 domains showed good reliability (i.e., 0.77-0.85 and 0.87-0.90, respectively). Our results showed that the PEF had satisfactory unidimensionality and Rasch reliability. Therefore, clinicians and researchers could use the PEF to assess the four domains of EF in patients with schizophrenia.
Generalized Network Psychometrics : Combining Network and Latent Variable Models

NARCIS (Netherlands)

Epskamp, S.; Rhemtulla, M.; Borsboom, D.

2017-01-01

We introduce the network model as a formal psychometric model, conceptualizing the covariance between psychometric indicators as resulting from pairwise interactions between observable variables in a network structure. This contrasts with standard psychometric models, in which the covariance between
Item and response-category functioning of the Persian version of the KIDSCREEN-27: Rasch partial credit model

Directory of Open Access Journals (Sweden)

Jafari Peyman

2012-10-01

Background: The purpose of the study was to determine whether the Persian version of the KIDSCREEN-27 has the optimal number of response categories to measure health-related quality of life (HRQoL) in children and adolescents. Moreover, we aimed to determine if all the items contributed adequately to their own domain. Findings: The Persian version of the KIDSCREEN-27 was completed by 1083 school children and 1070 of their parents. The Rasch partial credit model (PCM) was used to investigate item statistics and ordering of response categories. The PCM showed that no item was misfitting. The PCM also revealed that successive response categories for all items were located in the expected order except for category 1 in self- and proxy-reports. Conclusions: Although Rasch analysis confirms that all the items belong to their own underlying construct, response categories should be reorganized and evaluated in further studies, especially in children with chronic conditions.
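For readers unfamiliar with the partial credit model mentioned above, the following sketch shows how category probabilities follow from a person location and a set of item thresholds, and one common way to check whether thresholds are ordered. The threshold values are hypothetical and the ordering check is a generic diagnostic, not necessarily the procedure the study used.

```python
import math


def pcm_probs(theta, thresholds):
    """Category probabilities under the partial credit model.

    theta:      person location (logits)
    thresholds: step difficulties delta_1..delta_M for one item (logits)
    Returns probabilities for categories 0..M.
    """
    # Cumulative sums of (theta - delta_j); category 0 has an empty sum of 0.
    cumulative = [0.0]
    for delta in thresholds:
        cumulative.append(cumulative[-1] + (theta - delta))
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(cumulative)
    weights = [math.exp(c - peak) for c in cumulative]
    total = sum(weights)
    return [w / total for w in weights]


def thresholds_ordered(thresholds):
    """True when each threshold is strictly higher than the previous one."""
    return all(a < b for a, b in zip(thresholds, thresholds[1:]))


# Hypothetical four-category item with ordered thresholds.
probs = pcm_probs(theta=0.0, thresholds=[-1.0, 0.0, 1.0])
# Hypothetical item whose second threshold falls below its first.
disordered = thresholds_ordered([0.5, -0.2, 1.0])
```

When thresholds are disordered, at least one category is never the most probable response at any person location, which is one reason analysts consider collapsing or reorganizing categories.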
Assessing Pre-Service Physics Teachers’ Energy Literacy: An Application of Rasch measurement

Science.gov (United States)

Yusup, M.; Setiawan, A.; Rustaman, N. Y.; Kaniawati, I.

2017-09-01

This paper aims to present a summary of pre-service physics teachers’ responses on an energy literacy assessment. A total of 123 pre-service physics teachers in their first through third year of education participated. Data were analyzed using Rasch modeling. Research findings indicate that pre-service physics teachers showed a low self-system toward energy conservation. They also still lacked metacognitive and cognitive competencies. These findings provide information for the future development of curriculum, teaching and learning that can improve pre-service physics teachers’ energy literacy.
Loosening Psychometric Constraints on Educational Assessments

Science.gov (United States)

Kane, Michael T.

2017-01-01

In response to an argument by Baird, Andrich, Hopfenbeck and Stobart (2017), Michael Kane states that there needs to be a better fit between educational assessment and learning theory. In line with this goal, Kane will examine how psychometric constraints might be loosened by relaxing some psychometric "rules" in some assessment…
Sensitivity of Mantel Haenszel Model and Rasch Model as Viewed From Sample Size

OpenAIRE

ALWI, IDRUS

2011-01-01

The aim of this research is to compare the sensitivity of the Mantel-Haenszel and Rasch model methods for detecting differential item functioning (DIF) as a function of sample size. The two DIF methods were compared using simulated binary item response data sets of varying sample size (200 and 400 examinees), with DIF detection based on gender difference. These test conditions were replication 4 tim...
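As an illustration of the Mantel-Haenszel side of the comparison above, here is a minimal sketch of the common odds ratio and its conversion to the ETS delta scale (MH D-DIF = -2.35 ln alpha). The function name, the group labels "ref" and "focal", and the tiny data set are hypothetical; the study's own simulation design is not reproduced.

```python
import math
from collections import defaultdict


def mantel_haenszel_dif(item, matching_score, group):
    """Mantel-Haenszel common odds ratio for one dichotomous item.

    item:           list of 0/1 responses to the studied item
    matching_score: list of matching scores (e.g. total test score)
    group:          list of "ref" or "focal" labels
    Returns (alpha_MH, MH D-DIF), where MH D-DIF = -2.35 * ln(alpha_MH).
    """
    # One 2x2 table per score level: [ref correct, ref wrong, focal correct, focal wrong]
    tables = defaultdict(lambda: [0, 0, 0, 0])
    for x, s, g in zip(item, matching_score, group):
        index = (0 if g == "ref" else 2) + (0 if x == 1 else 1)
        tables[s][index] += 1

    numerator = denominator = 0.0
    for a, b, c, d in tables.values():
        n = a + b + c + d
        numerator += a * d / n
        denominator += b * c / n
    if numerator == 0 or denominator == 0:
        raise ValueError("odds ratio undefined for these data")
    alpha = numerator / denominator
    return alpha, -2.35 * math.log(alpha)


# Hypothetical single score level: 30 of 40 reference and 20 of 40 focal examinees correct.
item = [1] * 30 + [0] * 10 + [1] * 20 + [0] * 20
score = [5] * 80
group = ["ref"] * 40 + ["focal"] * 40
alpha, d_dif = mantel_haenszel_dif(item, score, group)
```

An alpha above 1 (negative MH D-DIF) means the item favors the reference group among examinees matched on the score.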
Psychometric Properties of the Inventário Dimensional Clínico da Personalidade (IDCP) using the Rating Scale Model

Directory of Open Access Journals (Sweden)

Lucas de Francisco Carvalho

2014-08-01

The aim of this study was to evaluate the performance of the Dimensional Clinical Personality Inventory (DCPI) using Rasch-based person and item analysis. A total of 1281 participants between 18 and 90 years of age (M=26.64; SD=8.94) were recruited, including 431 men (33.6%). Of the total sample, 127 (9.9%) were patients diagnosed with axis I and/or axis II disorders according to DSM-IV-TR. Results indicated that the IDCP scales performed reasonably well, and the usefulness of the analyses presented demonstrates the Rasch model’s applicability for clinical applications. Among the important tools offered by the Rasch model, we explore the use of the person-item map, which visually presents the intuitively understandable psychological construct along the dimensional scale of the instrument.
Assessing Validity of Measurement in Learning Disabilities Using Hierarchical Generalized Linear Modeling: The Roles of Anxiety and Motivation

Science.gov (United States)

Sideridis, Georgios D.

2016-01-01

The purpose of the present studies was to test the hypothesis that the psychometric characteristics of ability scales may be significantly distorted if one accounts for emotional factors during test taking. Specifically, the present studies evaluate the effects of anxiety and motivation on the item difficulties of the Rasch model. In Study 1, the…
Author Details

African Journals Online (AJOL)

Mohamad, M. Vol 9, No 6S (2017) - Articles: Improving the psychometric properties of the Mooney problem checklist by using Rasch measurement model. ISSN: 1112-9867.
78th Annual Meeting of the Psychometric Society

CERN Document Server

Bolt, Daniel; Ark, L; Wang, Wen-Chung

2015-01-01

The 78th Annual Meeting of the Psychometric Society (IMPS) builds on the Psychometric Society's mission to share quantitative methods relevant to psychology. The chapters of this volume present cutting-edge work in the field. Topics include studies of item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Additional psychometric topics relate to structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis, among others. The papers in this volume will be especially useful for researchers in the social sciences who use quantitative methods. Prior knowledge of statistical methods is recommended. The 78th annual meeting took place in Arnhem, The Netherlands between July 22nd and 26th, 2013. The previous volume to showcase work from the Psychometric Society’s Meeting is New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 201...
Rasch Analysis of the Fullerton Advanced Balance (FAB) Scale

OpenAIRE

Klein, Penelope J.; Fiedler, Roger C.; Rose, Debra J.

2011-01-01

Purpose: This cross-sectional study explores the psychometric properties and dimensionality of the Fullerton Advanced Balance (FAB) Scale, a multi-item balance test for higher-functioning older adults.
Improving the measurement of health-related quality of life in adolescent with idiopathic scoliosis: the SRS-7, a Rasch-developed short form of the SRS-22 questionnaire.

Science.gov (United States)

Caronni, Antonio; Zaina, Fabio; Negrini, Stefano

2014-04-01

The Scoliosis Research Society-22 (SRS-22) questionnaire was developed to evaluate health-related quality of life (HRQL) in adolescent idiopathic scoliosis (AIS) patients. Rasch analysis (RA) is a statistical procedure which turns questionnaire ordinal scores into interval measures. Measures from Rasch-compatible questionnaires can be used, similar to body temperature or blood pressure, to quantify disease severity progression and treatment efficacy. The purpose of the current work is to present a Rasch analysis of the SRS-22 questionnaire and to develop an SRS-22 Rasch-approved short form. 300 SRS-22 were randomly collected from 2447 consecutive IS adolescents at their first evaluation (229 females; 13.9 ± 1.9 years; 26.9 ± 14.7 Cobb°) in a scoliosis outpatient clinic. RA showed both disordered thresholds and overall misfit of the SRS-22. Sixteen items were re-scored and two misfitting items (6 and 14) removed to obtain a Rasch-compatible questionnaire. Participants' HRQL measured too high with the rearranged questionnaire, indicating a severe SRS-22 ceiling effect. RA also highlighted SRS-22 multidimensionality, with pain/function not merging with self-image/mental health items. Item 3 showed differential item functioning (DIF) for both curve and hump amplitude. A 7-item questionnaire (SRS-7) was prepared by selecting single items from the original SRS-22. SRS-7 showed fit to the model, unidimensionality and no DIF. Compared with the SRS-22, the short form scale shows better targeting of the participants' population. RA shows that SRS-22 has poor clinimetric properties; moreover, when used with AIS at first evaluation, SRS-22 is affected by a severe ceiling effect. SRS-7, an SRS-22 7-item short form questionnaire, provides an HRQL interval measure better tailored to these participants. Copyright © 2014 Elsevier Ltd. All rights reserved.
Another Look at the PART-O Using the Traumatic Brain Injury Model Systems National Database: Scoring to Optimize Psychometrics.

Science.gov (United States)

Malec, James F; Whiteneck, Gale G; Bogner, Jennifer A

2016-02-01

To integrate previous approaches to scoring the Participation Assessment with Recombined Tools-Objective (PART-O) in a unidimensional scale. Retrospective analysis of PART-O data from the Traumatic Brain Injury Model Systems. Community. Data from individuals (N=469) selected randomly from participants who completed 1-year follow-up in the Traumatic Brain Injury Model Systems were used in Rasch model development. The model was subsequently tested on data from additional random samples of similar size at 1-, 2-, 5-, 10-, and >15-year follow-ups. Not applicable. PART-O. After combining items for productivity and social interaction, the initial analysis at 1-year follow-up indicated relatively good fit to the Rasch model (person reliability=.80) but also suggested item misfit and that the 0-to-5 scale used for most items did not consistently show clear separation between rating levels. Reducing item rating scales to 3 levels (except combined and dichotomous items) resolved these issues and demonstrated good item level discrimination, fit, and person reliability (.81), with no evidence of multidimensionality. These results replicated in analyses at each additional follow-up period. Modifications to item scoring for the PART-O resulted in a unidimensional parametric equivalent measure that addresses previous concerns about competing item relations, and it fit the Rasch model consistently across follow-up periods. The person-item map shows a progression toward greater community participation from solitary and dyadic activities, such as leaving the house and having a friend through social and productivity activities, to group activities with others who share interests or beliefs. Copyright © 2016 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
Identifying potential misfit items in cognitive process of learning engineering mathematics based on Rasch model

International Nuclear Information System (INIS)

Ataei, Sh; Mahmud, Z; Khalid, M N

2014-01-01

Student learning outcomes clarify what students should know and be able to demonstrate after completing their course. One of the issues in the process of teaching and learning is therefore how to assess students' learning. This paper describes an application of the dichotomous Rasch measurement model in measuring the cognitive process of engineering students' learning of mathematics. This study provides insights into the cognitive ability of 54 engineering students in learning Calculus III, based on Bloom's Taxonomy, across 31 items. The results indicate that some of the examination questions are either too difficult or too easy for the majority of the students. The analysis yields fit statistics that can identify data departing from the Rasch theoretical model. The study identified some potential misfit items based on ZSTD, and misfitting items were removed when the outfit MNSQ was above 1.3 or below 0.7. Therefore, it is recommended that these items be reviewed or revised to better match the range of students' ability in the respective course.
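The outfit mean-square criterion cited in the abstract above can be sketched for a single dichotomous item. This is a generic illustration that assumes person abilities and the item difficulty have already been estimated (in practice they come from Rasch software); the function names and the response data are hypothetical, and only the 0.7 and 1.3 cutoffs are taken from the abstract.

```python
import math


def rasch_p(theta, b):
    """Probability of a correct response under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def outfit_mnsq(responses, thetas, b):
    """Outfit mean-square for one item: the mean of squared standardized residuals."""
    squared = []
    for x, theta in zip(responses, thetas):
        p = rasch_p(theta, b)
        squared.append((x - p) ** 2 / (p * (1.0 - p)))
    return sum(squared) / len(squared)


def flag_misfit(mnsq, low=0.7, high=1.3):
    """Flag an item whose outfit MNSQ falls outside the cutoffs quoted in the abstract."""
    return mnsq < low or mnsq > high


# Hypothetical data: responses that follow the model too deterministically (overfit).
thetas = [-2.0, -1.0, 1.0, 2.0]
responses = [0, 0, 1, 1]
mnsq = outfit_mnsq(responses, thetas, b=0.0)
flagged = flag_misfit(mnsq)
```

Values near 1 indicate responses about as noisy as the model expects; values well above 1 indicate unexpected responses, and values well below 1 indicate responses that are more predictable than the model expects.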
An independent psychometric evaluation of the PROMS measure of music perception skills

NARCIS (Netherlands)

Kunert, R.; Willems, R.M.; Hagoort, P.

2016-01-01

The Profile of Music Perception Skills (PROMS) is a recently developed measure of perceptual music skills which has been shown to have promising psychometric properties. In this paper we extend the evaluation of its brief version to three kinds of validity using an individual difference approach.
Students' approaches to learning in a clinical practicum: A psychometric evaluation based on item response theory.

Science.gov (United States)

Zhao, Yue; Kuan, Hoi Kei; Chung, Joyce O K; Chan, Cecilia K Y; Li, William H C

2018-07-01

The investigation of learning approaches in the clinical workplace context has remained an under-researched area. Despite the validation of learning approach instruments and their applications in various clinical contexts, little is known about the extent to which an individual item, that reflects a specific learning strategy and motive, effectively contributes to characterizing students' learning approaches. This study aimed to measure nursing students' approaches to learning in a clinical practicum using the Approaches to Learning at Work Questionnaire (ALWQ). Survey research design was used in the study. A sample of year 3 nursing students (n = 208) who undertook a 6-week clinical practicum course participated in the study. Factor analyses were conducted, followed by an item response theory analysis, including model assumption evaluation (unidimensionality and local independence), item calibration and goodness-of-fit assessment. Two subscales, deep and surface, were derived. Findings suggested that: (a) items measuring the deep motive from intrinsic interest and deep strategies of relating new ideas to similar situations, and that of concept mapping served as the strongest discriminating indicators; (b) the surface strategy of memorizing facts and details without an overall picture exhibited the highest discriminating power among all surface items; and, (c) both subscales appeared to be informative in assessing a broad range of the corresponding latent trait. The 21-item ALWQ derived from this study presented an efficient, internally consistent and precise measure. Findings provided a useful psychometric evaluation of the ALWQ in the clinical practicum context, added evidence to the utility of the ALWQ for nursing education practice and research, and echoed the discussions from previous studies on the role of the contextual factors in influencing student choices of different learning strategies. They provided insights for clinical educators to measure
A measure of family eating habits: initial psychometric properties using the profile pattern approach (PPA).

Science.gov (United States)

Klempel, Natalie; Kim, Se-Kang; Wilson, Monique; Annunziato, Rachel A

2013-01-01

Although it seems likely that family characteristics and eating habits are a major factor in the development of eating behaviors, there are no self-report measures that examine how individuals view their family's eating habits. Seventy-one women ages 18-22 were recruited from a private university in a large northeastern city and asked to complete a short questionnaire packet consisting of demographic questions, the newly developed Family Eating Habits Questionnaire (FEHQ) and the Eating Inventory (EI). Internal consistency and test-retest reliability of the FEHQ was established. Significant associations were found between the FEHQ and the EI, indicating convergent validity for the FEHQ. Further validation was conducted using a novel statistical technique, the profile pattern approach (PPA). The results of the present study are limited by the restricted sample characteristic of a university setting. However, our findings show that the family eating habits' measure appears psychometrically sound. A future aim will be to continue validating this instrument in other samples, particularly to determine its predictive value. Copyright © 2012 Elsevier Ltd. All rights reserved.
Inter-regional metric disadvantages when comparing countries’ happiness on a global scale. A Rasch based consequential validity analysis

Directory of Open Access Journals (Sweden)

Diego Fernando Rojas-Gualdrón

2017-07-01

Measurement confounding due to socioeconomic differences between world regions may bias the estimations of countries’ happiness and global inequality. Potential implications of this bias have not been researched. In this study, the consequential validity of the Happy Planet Index, 2012 as an indicator of global inequality is evaluated from the Rasch measurement perspective. Differential Item Functioning by world region and bias in the estimated magnitude of inequalities were analyzed. The recalculated measure showed a good fit to Rasch model assumptions. The original index underestimated relative inequalities between world regions by 20%. DIF had no effect on relative measures but affected absolute measures by overestimating world average happiness and underestimating its variance. These findings suggest measurement confounding by unmeasured characteristics. Metric disadvantages must be adjusted to make fair comparisons. Public policy decisions based on biased estimations could have relevant negative consequences on people’s health and well-being by not focusing efforts on real vulnerable populations.
A comparison between patients with epiphora and cataract of the activity limitations they experience in daily life due to their visual disability.

Science.gov (United States)

Bohman, Elin; Wyon, Maria; Lundström, Mats; Dafgård Kopp, Eva

2018-02-01

The objective of this study was to compare patients with epiphora and cataract in terms of the activity limitations they experience in daily life due to their visual disability and to validate the use of the Catquest-9SF questionnaire for epiphora patients. Seventy-two consecutively encountered adult patients with confirmed lacrimal obstruction and listed for dacryocystorhinostomy (DCR) or lacrimal intubation at the St. Erik Eye Hospital, Stockholm, Sweden, completed the Catquest-9SF questionnaire, which measures activity limitations in daily life due to visual disability. The psychometric qualities of the Catquest-9SF results obtained from this group of patients were evaluated by Rasch analysis. Rasch analysis was further employed to convert the ordinal raw data to a Rasch score for comparison with the preoperative scores of patients registered in the Swedish National Cataract Register (NCR) during March 2013. The Catquest-9SF exhibited good psychometric qualities when investigating epiphora patients, with the exception of a misfit for Item 4, the item regarding facial recognition. On the Rasch scale (-5.43 = no activity limitations to +5.01 = severe activity limitations), the mean score for epiphora patients was -0.82 while for patients listed for 1st eye and 2nd eye cataract surgery it was -0.17 and -0.76, respectively. An equivalence test confirmed that the reported visual disability of epiphora patients was not significantly different from visual disability reported by patients waiting for 2nd eye cataract surgery. The Catquest-9SF is a valid measure of visual disability in patients with epiphora. Epiphora patients experience visual disability to the same degree as patients awaiting 2nd eye cataract surgery. © 2017 Acta Ophthalmologica Scandinavica Foundation. Published by John Wiley & Sons Ltd.

Evaluation properties of the French version of the OUT-PATSAT35 satisfaction with care questionnaire according to classical and item response theory analyses.

Science.gov (United States)

Panouillères, M; Anota, A; Nguyen, T V; Brédart, A; Bosset, J F; Monnier, A; Mercier, M; Hardouin, J B

2014-09-01

The present study investigates the properties of the French version of the OUT-PATSAT35 questionnaire, which evaluates the outpatients' satisfaction with care in oncology using classical analysis (CTT) and item response theory (IRT). This cross-sectional multicenter study includes 692 patients who completed the questionnaire at the end of their ambulatory treatment. CTT analyses tested the main psychometric properties (convergent and divergent validity, and internal consistency). IRT analyses were conducted separately for each OUT-PATSAT35 domain (the doctors, the nurses or the radiation therapists and the services/organization) by models from the Rasch family. We examined the fit of the data to the model expectations and tested whether the model assumptions of unidimensionality, monotonicity and local independence were respected. A total of 605 (87.4%) respondents were analyzed with a mean age of 64 years (range 29-88). Internal consistency for all scales separately and for the three main domains was good (Cronbach's α 0.74-0.98). IRT analyses were performed with the partial credit model. No disordered thresholds of polytomous items were found. Each domain showed high reliability but fitted poorly to the Rasch models. Three items in particular, the item about "promptness" in the doctors' domain and the items about "accessibility" and "environment" in the services/organization domain, presented the greatest lack of fit. A correct fit of the Rasch model can be obtained by dropping these items. Most of the local dependence concerned items about "information provided" in each domain. A major deviation from unidimensionality was found in the nurses' domain. CTT showed good psychometric properties of the OUT-PATSAT35. However, the Rasch analysis revealed some misfitting and redundant items. Taking the above problems into consideration, it could be interesting to refine the questionnaire in a future study.
Trial-dependent psychometric functions accounting for perceptual learning in 2-AFC discrimination tasks.

Science.gov (United States)

Kattner, Florian; Cochrane, Aaron; Green, C Shawn

2017-09-01

The majority of theoretical models of learning consider learning to be a continuous function of experience. However, most perceptual learning studies use thresholds estimated by fitting psychometric functions to independent blocks, sometimes then fitting a parametric function to these block-wise estimated thresholds. Critically, such approaches tend to violate the basic principle that learning is continuous through time (e.g., by aggregating trials into large "blocks" for analysis that each assume stationarity, then fitting learning functions to these aggregated blocks). To address this discrepancy between base theory and analysis practice, here we instead propose fitting a parametric function to thresholds from each individual trial. In particular, we implemented a dynamic psychometric function whose parameters were allowed to change continuously with each trial, thus parameterizing nonstationarity. We fit the resulting continuous time parametric model to data from two different perceptual learning tasks. In nearly every case, the quality of the fits derived from the continuous time parametric model outperformed the fits derived from a nonparametric approach wherein separate psychometric functions were fit to blocks of trials. Because such a continuous trial-dependent model of perceptual learning also offers a number of additional advantages (e.g., the ability to extrapolate beyond the observed data; the ability to estimate performance on individual critical trials), we suggest that this technique would be a useful addition to each psychophysicist's analysis toolkit.
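The idea of a psychometric function whose threshold changes on every trial can be sketched as follows. The specific forms used here (an exponential decay of the threshold over trials and a Weibull-type 2-AFC function with a 0.5 guess rate) are assumptions made for illustration and are not taken from the paper; all parameter names are hypothetical.

```python
import math


def threshold_at(trial, start, asymptote, tau):
    """Assumed learning curve: threshold decays exponentially from start to asymptote."""
    return asymptote + (start - asymptote) * math.exp(-trial / tau)


def p_correct(stimulus, threshold, slope=2.0):
    """Assumed 2-AFC Weibull-type psychometric function with a 0.5 guess rate."""
    return 0.5 + 0.5 * (1.0 - math.exp(-((stimulus / threshold) ** slope)))


def neg_log_likelihood(trials, start, asymptote, tau, slope=2.0):
    """Negative log-likelihood of trial-by-trial data under the continuous model.

    trials: list of (stimulus_intensity, correct) pairs in presentation order,
            with correct given as True/False.
    """
    eps = 1e-9  # keeps log() finite when p reaches 0.5-free extremes such as 1.0
    nll = 0.0
    for t, (stimulus, correct) in enumerate(trials):
        p = p_correct(stimulus, threshold_at(t, start, asymptote, tau), slope)
        p = min(max(p, eps), 1.0 - eps)
        nll -= math.log(p) if correct else math.log(1.0 - p)
    return nll


# Hypothetical data: four trials at a fixed stimulus intensity.
data = [(4.0, False), (4.0, True), (4.0, True), (4.0, True)]
nll = neg_log_likelihood(data, start=10.0, asymptote=2.0, tau=50.0)
```

Fitting would then amount to minimizing this quantity over start, asymptote, tau and slope with a numerical optimizer, so that every trial contributes to a single learning curve instead of being pooled into blocks.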
What is the best measure for assessing diabetes distress? A comparison of the Problem Areas in Diabetes and Diabetes Distress Scale

DEFF Research Database (Denmark)

Fenwick, Eva K.; Rees, Gwyn; Holmes-Truscott, Elizabeth

2018-01-01

This study used Rasch analysis to examine the psychometric validity of the Diabetes Distress Scale and the Problem Areas in Diabetes scale to assess diabetes distress in 3338 adults with diabetes (1609 completed the Problem Areas in Diabetes scale (n = 675 type 1 diabetes; n = 934 type 2 diabetes......) and 1705 completed the Diabetes Distress Scale (n = 693 type 1 diabetes; n = 1012 type 2 diabetes)). While criterion and convergent validity were good, Rasch analysis revealed suboptimal precision and targeting, and item misfit. Unresolvable multidimensionality within the Diabetes Distress Scale suggests...... a total score should be avoided, while suboptimal precision suggests that the Physician-related and Interpersonal distress subscales should be used cautiously....
Validation of the Danish version of the McGill Ingestive Skills Assessment using classical test theory and the Rasch model

DEFF Research Database (Denmark)

Hansen, Tina; Lambert, Heather C; Faber, Jens

2012-01-01

Purpose: The study aimed to validate the Danish version of the Canadian "McGill Ingestive Skills Assessment" (MISA-DK) for measuring dysphagia in frail elders. Method: One-hundred and ten consecutive older medical patients were recruited to the study. Reliability was assessed by internal...... consistency (Cronbach's alpha). External construct validity (convergent and known-groups validity) was evaluated against theoretical constructs assessing the complex concept of ingestive skills. Internal construct validity was tested using Rasch analysis. Results: High internal consistency reliability...... with Cronbach's alpha of 0.77-0.95 was evident. External construct validity was supported by expected high correlations with most of the constructs related to ingestive skills (r(s) = 0.53 to r(s) = 0.66). The MISA-DK discriminated significantly between known-groups. Fit to the Rasch model (x(2) (df) = 12 (12...
An application of dichotomous and polytomous Rasch models for scoring energy insecurity

International Nuclear Information System (INIS)

Murray, Anthony G.; Mills, Bradford F.

2012-01-01

Household food security in the United States has been extensively researched and a number of indexes have been generated. However, household energy security has been largely ignored even though low-income households spend almost equal income shares on food and energy. This paper uses Rasch models and household responses to energy security questions in the 2005 Residential Energy Consumption Survey to generate an energy insecurity index that is consistent with those found in the food insecurity literature. The analysis yields several important findings for the generation of policy relevant household energy insecurity indexes. Questions that indicate reduction of basic expenditures, such as food, clothing, and shelter, are easiest for households to affirm implying low exposure to energy insecurity. Conversely, questions that concern households leaving the residence due to extreme temperatures consistently imply high exposure to energy insecurity. Households that score in the top decile of the energy insecurity index are more likely to be headed by single females, be younger, and have a Black household head. Rasch models also identify flaws within the survey. In particular, the scope of the questions is quite broad and a refinement of the survey questions to focus on specific attributes of energy insecurity would likely improve future energy security indexes. - Highlights: ► A novel household energy insecurity index is generated for low-income U.S. families. ► Severely energy insecure households have unique characteristics. ► Energy insecure households are more likely to participate in LIHEAP. ► RECS survey questions should be modified for an improved energy insecurity index.
Using the Rasch measurement model to design a report writing assessment instrument.

Science.gov (United States)

Carlson, Wayne R

2013-01-01

This paper describes how the Rasch measurement model was used to develop an assessment instrument designed to measure student ability to write law enforcement incident and investigative reports. The ability to write reports is a requirement of all law enforcement recruits in the state of Michigan and is a part of the state's mandatory basic training curriculum, which is promulgated by the Michigan Commission on Law Enforcement Standards (MCOLES). Recently, MCOLES conducted research to modernize its training and testing in the area of report writing. A structured validation process was used, which included: a) an examination of the job tasks of a patrol officer, b) input from content experts, c) a review of the professional research, and d) the creation of an instrument to measure student competency. The Rasch model addressed several measurement principles that were central to construct validity, which were particularly useful for assessing student performances. Based on the results of the report writing validation project, the state established a legitimate connectivity between the report writing standard and the essential job functions of a patrol officer in Michigan. The project also produced an authentic instrument for measuring minimum levels of report writing competency, which generated results that are valid for inferences of student ability. Ultimately, the state of Michigan must ensure the safety of its citizens by licensing only those patrol officers who possess a minimum level of core competency. Maintaining the validity and reliability of both the training and testing processes can ensure that the system for producing such candidates functions as intended.
Controlling response dependence in the measurement of change using the Rasch model.

Science.gov (United States)

Andrich, David

2017-01-01

The advantages of using person location estimates from the Rasch model over raw scores for the measurement of change using a common test include the linearization of scores and the automatic handling of statistical properties of repeated measurements. However, the application of the model requires that the responses to the items are statistically independent in the sense that the specific responses to the items on the first time of testing do not affect the responses at a second time. This requirement implies that the responses to the items at both times of assessment are governed only by the invariant location parameters of the items at the two times of testing and the location parameters of each person each time. A specific form of dependence that is pertinent when the same items are used is when the observed response to an item at the second time of testing is affected by the response to the same item at the first time, a form of dependence which has been referred to as response dependence. This paper presents the logic of applying the Rasch model to quantify, control and remove the effect of response dependence in the measurement of change when the same items are used on two occasions. The logic is illustrated with four sets of simulation studies with dichotomous items and with a small example of real data. It is shown that the presence of response dependence can reduce the evidence of change, a reduction which may impact interpretations at the individual, research, and policy levels.
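The independence requirement described in the abstract above can be illustrated with a minimal sketch of the standard dichotomous Rasch model. The function names `rasch_prob` and `joint_prob_independent` are hypothetical and chosen for illustration only; this is not the procedure developed in the paper, which goes further and quantifies the dependence itself.

```python
import math


def rasch_prob(beta: float, delta: float) -> float:
    """Probability of a positive (score 1) response under the dichotomous
    Rasch model, for person location beta and item location delta."""
    return 1.0 / (1.0 + math.exp(-(beta - delta)))


def joint_prob_independent(beta_t1: float, beta_t2: float, delta: float,
                           x1: int, x2: int) -> float:
    """Joint probability of responses x1 and x2 (each 0 or 1) to the same
    item at two times, ASSUMING no response dependence: the item location
    delta is invariant and each response depends only on the person's
    location at that time."""
    p1 = rasch_prob(beta_t1, delta)
    p2 = rasch_prob(beta_t2, delta)
    return (p1 if x1 == 1 else 1.0 - p1) * (p2 if x2 == 1 else 1.0 - p2)
```

Under response dependence, the probability at the second time would also depend on the first response, so the simple product above would no longer describe the data.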
Bayesian psychometric scaling

NARCIS (Netherlands)

Fox, Gerardus J.A.; van den Berg, Stéphanie Martine; Veldkamp, Bernard P.; Irwing, P.; Booth, T.; Hughes, D.

2015-01-01

In educational and psychological studies, psychometric methods are involved in the measurement of constructs, and in constructing and validating measurement instruments. Assessment results are typically used to measure student proficiency levels and test characteristics. Recently, Bayesian item
Rasch Analyses of Very Low Food Security among Households and Children in the Three City Study.

Science.gov (United States)

Moffitt, Robert A; Ribar, David C

2016-04-01

The longitudinal Three City Study of low-income families with children measures food hardships using fewer questions and some different questions from the standard U.S. instrument for measuring food security, the Household Food Security Survey Module (HFSSM) in the Current Population Survey (CPS). We utilize a Rasch measurement model to identify thresholds of very low food security among households and very low food security among children in the Three City Study that are comparable to thresholds from the HFSSM. We also use the Three City Study to empirically investigate the determinants of food insecurity and of these specific food insecurity outcomes, estimating a multivariate behavioral Rasch model that is adapted to address longitudinal data. The estimation results indicate that participation in the Supplemental Nutrition Assistance Program and the Temporary Assistance for Needy Families program reduce food insecurity, while poverty and disability among caregivers increase it. Besides its longitudinal structure, the Three City Study measures many more characteristics about households than the CPS. Our estimates reveal that financial assistance through social networks and a household's own financial assets reduce food insecurity, while its outstanding loans increase insecurity.
Rasch Analyses of Very Low Food Security among Households and Children in the Three City Study*

Science.gov (United States)

Moffitt, Robert A.; Ribar, David C.

2017-01-01

The longitudinal Three City Study of low-income families with children measures food hardships using fewer questions and some different questions from the standard U.S. instrument for measuring food security, the Household Food Security Survey Module (HFSSM) in the Current Population Survey (CPS). We utilize a Rasch measurement model to identify thresholds of very low food security among households and very low food security among children in the Three City Study that are comparable to thresholds from the HFSSM. We also use the Three City Study to empirically investigate the determinants of food insecurity and of these specific food insecurity outcomes, estimating a multivariate behavioral Rasch model that is adapted to address longitudinal data. The estimation results indicate that participation in the Supplemental Nutrition Assistance Program and the Temporary Assistance for Needy Families program reduce food insecurity, while poverty and disability among caregivers increase it. Besides its longitudinal structure, the Three City Study measures many more characteristics about households than the CPS. Our estimates reveal that financial assistance through social networks and a household's own financial assets reduce food insecurity, while its outstanding loans increase insecurity. PMID:29187764
Psychometric properties of the Danish MCMI-I translation

DEFF Research Database (Denmark)

Mortensen, E L; Simonsen, E

1990-01-01

A translation of the MCMI-I has been in use in Denmark for some years. An untested assumption in the interpretation of the pattern of test results is that the psychometric characteristics of the Danish and American versions are similar. The purpose of this study was to evaluate the psychometric properties of the questionnaire by using traditional psychometric analysis techniques on the results of a sample consisting of 423 patients and 179 normal controls. Coefficient alpha was calculated for the 20 clinical subscales of the test and the Danish results were strikingly similar to the original coefficients reported by Millon. Furthermore, factor analysis of the subscales showed a factor structure very similar to American findings, and it is concluded that the psychometric properties of the Danish MCMI are not significantly different from the original.
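Coefficient alpha, as used in the abstract above, can be sketched from its textbook definition: alpha = k/(k-1) × (1 − sum of item variances / variance of total scores). The function name `cronbach_alpha` and the input layout are assumptions made for illustration; the study's own data and software are not given in the record.

```python
from statistics import variance


def cronbach_alpha(items: list[list[float]]) -> float:
    """Coefficient (Cronbach's) alpha.

    `items` holds one list per item, each containing that item's scores
    for the same respondents in the same order (hypothetical layout).
    """
    k = len(items)
    if k < 2:
        raise ValueError("alpha needs at least two items")
    # Total score per respondent, summed across items.
    totals = [sum(scores) for scores in zip(*items)]
    item_var_sum = sum(variance(scores) for scores in items)
    return (k / (k - 1)) * (1.0 - item_var_sum / variance(totals))
```

For example, `cronbach_alpha([[1, 2, 3], [2, 3, 5], [1, 3, 4]])` returns the alpha for three items answered by three respondents.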
Clinical usefulness of the clock drawing test applying rasch analysis in predicting of cognitive impairment.

Science.gov (United States)

Yoo, Doo Han; Lee, Jae Shin

2016-07-01

[Purpose] This study examined the clinical usefulness of the clock drawing test applying Rasch analysis for predicting the level of cognitive impairment. [Subjects and Methods] A total of 187 stroke patients with cognitive impairment were enrolled in this study. The 187 patients were evaluated with the clock drawing test developed through Rasch analysis, along with the mini-mental state examination as a cognitive evaluation tool. An analysis of variance was performed to examine the significance of the mini-mental state examination and the clock drawing test according to the general characteristics of the subjects. Receiver operating characteristic analysis was performed to determine the cutoff point for cognitive impairment and to calculate the sensitivity and specificity values. [Results] Comparison of the clock drawing test with the mini-mental state examination showed significant differences according to gender, age, education, and affected side. A total CDT score of 10.5, which was selected as the cutoff point to identify cognitive impairment, showed sensitivity, specificity, Youden index, positive predictive, and negative predictive values of 86.4%, 91.5%, 0.8, 95%, and 88.2%, respectively. [Conclusion] The clock drawing test is believed to be useful in assessments and interventions based on its excellent ability to identify cognitive disorders.
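As a small worked check of the figures in the abstract above, the Youden index is conventionally defined as sensitivity + specificity − 1. The sketch below uses a hypothetical helper name, `youden_index`, and only the sensitivity and specificity reported in the abstract; it does not reproduce the study's ROC analysis.

```python
def youden_index(sensitivity: float, specificity: float) -> float:
    """Youden's J statistic for a single cutoff; inputs are proportions in [0, 1]."""
    return sensitivity + specificity - 1.0


# Values reported in the abstract: 86.4% sensitivity, 91.5% specificity.
j = youden_index(0.864, 0.915)
print(f"Youden index: {j:.3f}")
```

The result, about 0.78, is consistent with the rounded value of 0.8 given in the abstract.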
79th Annual Meeting of the Psychometric Society

CERN Document Server

Bolt, Daniel; Wang, Wen-Chung; Douglas, Jeffrey; Chow, Sy-Miin

2015-01-01

These research articles from the 79th Annual Meeting of the Psychometric Society (IMPS) cover timely quantitative psychology topics, including new methods in item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Topics within general quantitative methodology include structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis. These methods will appeal, in particular, to researchers in the social sciences. The 79th annual meeting took place in Madison, WI between July 21st and 25th, 2014. Previous volumes to showcase work from the Psychometric Society’s Meeting are New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 2013) and Quantitative Psychology Research: The 78th Annual Meeting of the Psychometric Society (Springer, 2015).
80th Annual Meeting of the Psychometric Society

CERN Document Server

Bolt, Daniel; Wang, Wen-Chung; Douglas, Jeffrey; Wiberg, Marie

2016-01-01

The research articles in this volume cover timely quantitative psychology topics, including new methods in item response theory, computerized adaptive testing, cognitive diagnostic modeling, and psychological scaling. Topics within general quantitative methodology include structural equation modeling, factor analysis, causal modeling, mediation, missing data methods, and longitudinal data analysis. These methods will appeal, in particular, to researchers in the social sciences. The 80th annual meeting took place in Beijing, China, between the 12th and 16th of July, 2015. Previous volumes to showcase work from the Psychometric Society’s Meeting are New Developments in Quantitative Psychology: Presentations from the 77th Annual Psychometric Society Meeting (Springer, 2013), Quantitative Psychology Research: The 78th Annual Meeting of the Psychometric Society (Springer, 2015), and Quantitative Psychology Research: The 79th Annual Meeting of the Psychometric Society, Wisconsin, USA, 2014 (Springer, 2015).
A comparison of three methods of assessing differential item functioning (DIF) in the Hospital Anxiety Depression Scale: ordinal logistic regression, Rasch analysis and the Mantel chi-square procedure.

Science.gov (United States)

Cameron, Isobel M; Scott, Neil W; Adler, Mats; Reid, Ian C

2014-12-01

It is important for clinical practice and research that measurement scales of well-being and quality of life exhibit only minimal differential item functioning (DIF). DIF occurs where different groups of people endorse items in a scale to different extents after being matched by the intended scale attribute. We investigate the equivalence or otherwise of common methods of assessing DIF. Three methods of measuring age- and sex-related DIF (ordinal logistic regression, Rasch analysis and the Mantel χ² procedure) were applied to Hospital Anxiety Depression Scale (HADS) data pertaining to a sample of 1,068 patients consulting primary care practitioners. Three items were flagged by all three approaches as having either age- or sex-related DIF with a consistent direction of effect; a further three items identified did not meet stricter criteria for important DIF using at least one method. When applying strict criteria for significant DIF, ordinal logistic regression was slightly less sensitive. Ordinal logistic regression, Rasch analysis and contingency table methods yielded consistent results when identifying DIF in the HADS depression and HADS anxiety scales. Regardless of methods applied, investigators should use a combination of statistical significance, magnitude of the DIF effect and investigator judgement when interpreting the results.
Protocol: validation of the INCODE barometer to measure the innovation competence through the Rasch Measurement Theory

Directory of Open Access Journals (Sweden)

Lidia Sanchez

2017-06-01

This communication presents a protocol setting out the phases that must be followed to validate the INCODE barometer, which is used to measure innovation competence, with Rasch Measurement Theory. Five phases are stated: dimensionality analysis, individual reliability and validity analysis of items and persons, global reliability and validity analysis, and category analysis.
81st Annual Meeting of the Psychometric Society

CERN Document Server

Wiberg, Marie; Culpepper, Steven; Douglas, Jeffrey; Wang, Wen-Chung

2017-01-01

This proceedings volume compiles and expands on selected and peer reviewed presentations given at the 81st Annual Meeting of the Psychometric Society (IMPS), organized by the University of North Carolina at Greensboro, and held in Asheville, North Carolina, July 11th to 17th, 2016. IMPS is one of the largest international meetings focusing on quantitative measurement in psychology, education, and the social sciences, both in terms of participants and number of presentations. The meeting built on the Psychometric Society's mission to share quantitative methods relevant to psychology, addressing a diverse set of psychometric topics including item response theory, factor analysis, structural equation modeling, time series analysis, mediation analysis, cognitive diagnostic models, and multi-level models. Selected presenters were invited to revise and expand their contributions and to have them peer reviewed and published in this proceedings volume. Previous volumes to showcase work from the Psychometric Society’s...
Self-report measure of financial exploitation of older adults.

Science.gov (United States)

Conrad, Kendon J; Iris, Madelyn; Ridings, John W; Langley, Kate; Wilber, Kathleen H

2010-12-01

This study was designed to improve the measurement of financial exploitation (FE) by testing psychometric properties of the older adult financial exploitation measure (OAFEM), a client self-report instrument. Rasch item response theory and traditional validation approaches were used. Questionnaires were administered by 22 adult protective services investigators from 7 agencies in Illinois to 227 substantiated abuse clients. Analyses included tests for dimensionality, model fit, and additional construct validation. Results from the OAFEM were also compared with the substantiation decision of abuse and with investigators' assessments of FE using a staff report version. Hypotheses were generated to test hypothesized relationships. The OAFEM, including the original 79-, 54-, and 30-item measures, met stringent Rasch analysis fit and unidimensionality criteria and had high internal consistency and item reliability. The validation results were supportive, while leading to reconsideration of aspects of the hypothesized theoretical hierarchy. Thresholds were suggested to demonstrate levels of severity. The measure is now available to aid in the assessment of FE of older adults by both clinicians and researchers. Theoretical refinements developed using the empirically generated item hierarchy may help to improve assessment and intervention.
A psychometric appraisal of the DREEM

Directory of Open Access Journals (Sweden)

Hammond Sean M

2012-01-01

Abstract Background The quality of the Educational environment is a key determinant of a student centred curriculum. Evaluation of the educational environment is an important component of programme appraisal. In order to conduct such evaluation use of a comprehensive, valid and reliable instrument is essential. One of the most widely used contemporary tools for evaluation of the learning environment is the Dundee Ready Education Environment Measure (DREEM). Apart from the initial psychometric evaluation of the DREEM, few published studies report its psychometric properties in detail. The aim of this study was to examine the psychometric quality of the DREEM measure in the context of medical education in Ireland and to explore the construct validity of the device. Methods 239 final year medical students were asked to complete the DREEM inventory. Anonymised responses were entered into a database. Data analysis was performed using PASW 18 and confirmatory factor analysis performed. Results Whilst the total DREEM score had an acceptable level of internal consistency (alpha 0.89), subscale analysis shows that two subscales had sub-optimal internal consistency. Multiple group confirmatory factor analysis (using Fleming's indices) shows an overall fit of 0.76, representing a weak but acceptable level of fit. 17 of the 50 items manifest fit indices less than 0.70. We sought the best fitting oblique solution to the 5-subscale structure, which showed large correlations, suggesting that the independence of the separate scales is open to question. Conclusions There has perhaps been an inadequate focus on establishing and maintaining the psychometric credentials of the DREEM. The present study highlights two concerns. Firstly, the internal consistency of the 5 scales is quite variable and, in our sample, appears rather low. Secondly, the construct validity is not well supported. We suggest that users of the DREEM will provide basic psychometric appraisal of the
A psychometric appraisal of the DREEM

LENUS (Irish Health Repository)

Hammond, Sean M

2012-01-12

Abstract Background The quality of the Educational environment is a key determinant of a student centred curriculum. Evaluation of the educational environment is an important component of programme appraisal. In order to conduct such evaluation use of a comprehensive, valid and reliable instrument is essential. One of the most widely used contemporary tools for evaluation of the learning environment is the Dundee Ready Education Environment Measure (DREEM). Apart from the initial psychometric evaluation of the DREEM, few published studies report its psychometric properties in detail. The aim of this study was to examine the psychometric quality of the DREEM measure in the context of medical education in Ireland and to explore the construct validity of the device. Methods 239 final year medical students were asked to complete the DREEM inventory. Anonymised responses were entered into a database. Data analysis was performed using PASW 18 and confirmatory factor analysis performed. Results Whilst the total DREEM score had an acceptable level of internal consistency (alpha 0.89), subscale analysis shows that two subscales had sub-optimal internal consistency. Multiple group confirmatory factor analysis (using Fleming's indices) shows an overall fit of 0.76, representing a weak but acceptable level of fit. 17 of the 50 items manifest fit indices less than 0.70. We sought the best fitting oblique solution to the 5-subscale structure, which showed large correlations, suggesting that the independence of the separate scales is open to question. Conclusions There has perhaps been an inadequate focus on establishing and maintaining the psychometric credentials of the DREEM. The present study highlights two concerns. Firstly, the internal consistency of the 5 scales is quite variable and, in our sample, appears rather low. Secondly, the construct validity is not well supported. We suggest that users of the DREEM will provide basic psychometric appraisal of the device in future

Applied psychometrics in clinical psychiatry: the pharmacopsychometric triangle

DEFF Research Database (Denmark)

Bech, P

2009-01-01

OBJECTIVE: To consider applied psychometrics in psychiatry as a discipline focusing on pharmacopsychology rather than psychopharmacology as illustrated by the pharmacopsychometric triangle. METHOD: The pharmacopsychological dimensions of clinically valid effects of drugs (antianxiety, antidepress...... psychometrics in psychiatry have been found to cover a pharmacopsychometric triangle illustrating the measurements of wanted and unwanted effects of pharmacotherapeutic drugs as well as health-related quality of life.
The Interpersonal Relationship Inventory: continued psychometric evaluation.

Science.gov (United States)

Tilden, V P; Hirsch, A M; Nelson, C A

1994-01-01

For norm-referenced measures to be useful in social-behavioral research, investigators who develop measures face several psychometric challenges, including: (a) adequate domain specification; (b) adequate initial evidence of reliability and validity; and (c) ongoing evidence of psychometric quality. The Interpersonal Relationship Inventory (IPRI) was developed in response to gaps in measurement of social relationships, and contributed scales for reciprocity and conflict to a measure of social support. For the IPRI, the first two points were addressed during the period of instrument development. The measure now has been in use for 4 years. This article reports evidence addressing the third challenge: ongoing evidence of psychometric quality. Findings from 19 studies using the IPRI provide compelling evidence for internal consistency reliability and construct validity of the scales.
Validation of the malaysian versions of parents and children health survey for asthma by using rasch-model.

Science.gov (United States)

Hussein, Maryam Se; Akram, Waqas; Mamat, Mohd Nor; Majeed, Abu Bakar Abdul; Ismail, Nahlah Elkudssiah Binti

2015-04-01

In recent years, health-related quality of life (HRQOL) has become an important outcome measure in epidemiologic studies and clinical trials. For patients with asthma there are many instruments, but most of them have been developed in English. With the increase in research projects, researchers working in other languages have two options: either to develop a new measure or to translate an already developed measure. The Children Health Survey for Asthma was developed by the American Academy of Paediatrics and has two versions, one for the parents (CHSA) and the other for the child (CHSA-C). However, there is no Malay version of the CHSA or the CHSA-C. The aim of this study was to translate and determine the validity and reliability of the Malaysian versions of the Parent and Children Health Survey for Asthma. Questionnaires were translated into Bahasa Melayu using previously established guidelines, and data from 180 respondents (asthmatic children and their parents) were analysed using the Rasch model, an approach that has been increasingly used in the health field and that explores the performance of each item rather than the total set score. The internal consistency was high for the parent questionnaire (CHSA) (reliability score for persons = 0.88 and for items = 0.97), and good for the child questionnaire (CHSA-C) (reliability score for persons = 0.83 and for items = 0.94). This study also shows that all items in both questionnaires (CHSA and CHSA-C) fit the Rasch model. This study produced questionnaires that are conceptually equivalent to the original, easy to understand for the children and their parents, and good in terms of internal consistency. Because the questionnaire has two versions, one for the child and the other for the parents, they could be used in clinical practice to measure the effect of asthma on the child and their families. This current research had translated two instruments to another language (Bahasa Melayu) and evaluated their reliability and
A new psychometric questionnaire for reporting of somatosensory percepts

Science.gov (United States)

Kim, L. H.; McLeod, R. S.; Kiss, Z. H. T.

2018-02-01

Objective. There have been remarkable advances over the past decade in neural prostheses to restore lost motor function. However, restoration of somatosensory feedback, which is essential for fine motor control and user acceptance, has lagged behind. With an increasing interest in using electrical stimulation to restore somatosensory sensations within the peripheral (PNS) and central nervous systems (CNS), it is critical to characterize the percepts evoked by electrical stimulation in a standardized manner with a validated psychometric questionnaire. This will allow comparison of results from applications at various nervous system levels in multiple settings. Approach. We compiled a summary of published reports of somatosensory percepts that were elicited by electrical stimulation in humans and used these to develop a new psychometric questionnaire. Results. This new questionnaire was able to characterize subjective evoked sensations with good test-retest reliability (Spearman’s correlation coefficients ranging 0.716 ⩽ ρ ⩽ 1.000, p ⩽ 0.005) in 13 subjects receiving stimulation through neural implants in both the CNS and PNS. Furthermore, the new questionnaire captured more descriptors (M = 2.65, SD = 0.91) that would have been missed by being categorized as ‘other sensations’, using a previous questionnaire (M = 1.40, SD = 0.77, t(12) = -10.24, p psychometric questionnaire will aid in establishing consistency and standardization of reporting in future studies of somatosensory neural prostheses.
Use of Rasch Analysis to Evaluate and Refine the Community Balance and Mobility Scale for Use in Ambulatory Community-Dwelling Adults Following Stroke

Science.gov (United States)

Pollock, Courtney L.; Brouwer, Brenda; Garland, S. Jayne

2016-01-01

Background The Community Balance and Mobility Scale (CB&M) is increasingly used to evaluate walking balance following stroke. Objective This study applied Rasch analysis to evaluate and refine the CB&M for use in ambulatory community-dwelling adults following stroke. Methods The CB&M content was linked to task demands and motor skill classifications. Rasch analysis was used to evaluate internal construct validity (structural validity) and refine the CB&M for use with ambulatory community-dwelling adults following stroke. The CB&M data were collected at 3 time points: at discharge from inpatient rehabilitation and at 6 and 12 months postdischarge (N=238). Rasch analysis evaluated scale dimensionality, item and person fit, item response bias, scoring hierarchy, and targeting. Disordered scoring hierarchy was resolved by collapsing scoring categories. Highly correlated and “misfitting” items were removed. Sensitivity to change was evaluated with standardized response means (SRMs) and one-way repeated-measures analysis of variance. Results The CB&M was primarily linked to closed body transport task demands. Significant item-trait interaction, disordered scoring hierarchies, and multidimensionality were found. Scoring categories were collapsed in 15/19 items, and 5 misfitting items were removed. The resulting stroke-specific 14-item unidimensional CB&M (CB&MStroke) fit Rasch model expectations, with no item response bias, acceptable targeting (13% floor effects and 0% ceiling effects), and moderate-to-strong sensitivity to change at 6 months postdischarge (SRM=0.63; 95% confidence interval=−1.523, −0.142) and 12 months postdischarge (SRM=0.73; 95% confidence interval=−2.318, −0.760). Limitations Findings are limited to a modest-sized sample of individuals with mild-to-moderate balance impairment following stroke. Conclusions The CB&MStroke shows promise as a clinical scale for measuring change in walking balance in ambulatory community-dwelling adults
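The standardized response mean (SRM) mentioned in the abstract above is conventionally the mean change score divided by the standard deviation of the change scores. The sketch below uses a hypothetical function name, `standardized_response_mean`, and made-up scores; it does not use the study's data or its confidence-interval method.

```python
from statistics import mean, stdev


def standardized_response_mean(baseline: list[float],
                               follow_up: list[float]) -> float:
    """SRM = mean(change) / sample SD(change), for paired scores
    from the same people in the same order."""
    if len(baseline) != len(follow_up):
        raise ValueError("baseline and follow_up must be paired")
    changes = [after - before for before, after in zip(baseline, follow_up)]
    return mean(changes) / stdev(changes)


# Hypothetical scores for five people at discharge and at follow-up.
discharge = [40.0, 52.0, 47.0, 60.0, 55.0]
six_months = [46.0, 55.0, 47.0, 68.0, 58.0]
print(round(standardized_response_mean(discharge, six_months), 2))
```

Because the denominator is the variability of change rather than of baseline scores, the SRM describes how consistently scores shift across people, not only how large the average shift is.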
Tree-Based Global Model Tests for Polytomous Rasch Models

Science.gov (United States)

Komboz, Basil; Strobl, Carolin; Zeileis, Achim

2018-01-01

Psychometric measurement models are only valid if measurement invariance holds between test takers of different groups. Global model tests, such as the well-established likelihood ratio (LR) test, are sensitive to violations of measurement invariance, such as differential item functioning and differential step functioning. However, these…
Current psychometric and methodological issues in the measurement of overgeneral autobiographical memory.

Science.gov (United States)

Griffith, James W; Sumner, Jennifer A; Raes, Filip; Barnhofer, Thorsten; Debeer, Elise; Hermans, Dirk

2012-12-01

Autobiographical memory is a multifaceted construct that is related to psychopathology and other difficulties in functioning. Across many studies, a variety of methods have been used to study autobiographical memory. The relationship between overgeneral autobiographical memory (OGM) and psychopathology has been of particular interest, and many studies of this cognitive phenomenon rely on the Autobiographical Memory Test (AMT) to assess it. In this paper, we examine several methodological approaches to studying autobiographical memory, and focus primarily on methodological and psychometric considerations in OGM research. We pay particular attention to what is known about the reliability, validity, and methodological variations of the AMT. The AMT has adequate psychometric properties, but there is great variability in methodology across studies that use it. Methodological recommendations and suggestions for future studies are presented. Copyright © 2011 Elsevier Ltd. All rights reserved.
Evaluating the Psychometric Quality of Social Skills Measures: A Systematic Review.

Science.gov (United States)

Cordier, Reinie; Speyer, Renée; Chen, Yu-Wei; Wilkes-Gillan, Sarah; Brown, Ted; Bourke-Taylor, Helen; Doma, Kenji; Leicht, Anthony

2015-01-01

Impairments in social functioning are associated with an array of adverse outcomes. Social skills measures are commonly used by health professionals to assess and plan the treatment of social skills difficulties. There is a need to comprehensively evaluate the quality of psychometric properties reported across these measures to guide assessment and treatment planning. To conduct a systematic review of the literature on the psychometric properties of social skills and behaviours measures for both children and adults. A systematic search was performed using four electronic databases: CINAHL, PsycINFO, Embase and Pubmed; the Health and Psychosocial Instruments database; and grey literature using PsycExtra and Google Scholar. The psychometric properties of the social skills measures were evaluated against the COSMIN taxonomy of measurement properties using pre-set psychometric criteria. Thirty-Six studies and nine manuals were included to assess the psychometric properties of thirteen social skills measures that met the inclusion criteria. Most measures obtained excellent overall methodological quality scores for internal consistency and reliability. However, eight measures did not report measurement error, nine measures did not report cross-cultural validity and eleven measures did not report criterion validity. The overall quality of the psychometric properties of most measures was satisfactory. The SSBS-2, HCSBS and PKBS-2 were the three measures with the most robust evidence of sound psychometric quality in at least seven of the eight psychometric properties that were appraised. A universal working definition of social functioning as an overarching construct is recommended. There is a need for ongoing research in the area of the psychometric properties of social skills and behaviours instruments.
Evaluating the Psychometric Quality of Social Skills Measures: A Systematic Review

Science.gov (United States)

Brown, Ted; Bourke-Taylor, Helen; Doma, Kenji; Leicht, Anthony

2015-01-01

Introduction Impairments in social functioning are associated with an array of adverse outcomes. Social skills measures are commonly used by health professionals to assess and plan the treatment of social skills difficulties. There is a need to comprehensively evaluate the quality of psychometric properties reported across these measures to guide assessment and treatment planning. Objective To conduct a systematic review of the literature on the psychometric properties of social skills and behaviours measures for both children and adults. Methods A systematic search was performed using four electronic databases: CINAHL, PsycINFO, Embase and Pubmed; the Health and Psychosocial Instruments database; and grey literature using PsycExtra and Google Scholar. The psychometric properties of the social skills measures were evaluated against the COSMIN taxonomy of measurement properties using pre-set psychometric criteria. Results Thirty-Six studies and nine manuals were included to assess the psychometric properties of thirteen social skills measures that met the inclusion criteria. Most measures obtained excellent overall methodological quality scores for internal consistency and reliability. However, eight measures did not report measurement error, nine measures did not report cross-cultural validity and eleven measures did not report criterion validity. Conclusions The overall quality of the psychometric properties of most measures was satisfactory. The SSBS-2, HCSBS and PKBS-2 were the three measures with the most robust evidence of sound psychometric quality in at least seven of the eight psychometric properties that were appraised. A universal working definition of social functioning as an overarching construct is recommended. There is a need for ongoing research in the area of the psychometric properties of social skills and behaviours instruments. PMID:26151362
Evaluating the Psychometric Quality of Social Skills Measures: A Systematic Review.

Directory of Open Access Journals (Sweden)

Reinie Cordier

Full Text Available Impairments in social functioning are associated with an array of adverse outcomes. Social skills measures are commonly used by health professionals to assess and plan the treatment of social skills difficulties. There is a need to comprehensively evaluate the quality of psychometric properties reported across these measures to guide assessment and treatment planning. To conduct a systematic review of the literature on the psychometric properties of social skills and behaviours measures for both children and adults. A systematic search was performed using four electronic databases: CINAHL, PsycINFO, Embase and PubMed; the Health and Psychosocial Instruments database; and grey literature using PsycExtra and Google Scholar. The psychometric properties of the social skills measures were evaluated against the COSMIN taxonomy of measurement properties using pre-set psychometric criteria. Thirty-six studies and nine manuals were included to assess the psychometric properties of thirteen social skills measures that met the inclusion criteria. Most measures obtained excellent overall methodological quality scores for internal consistency and reliability. However, eight measures did not report measurement error, nine measures did not report cross-cultural validity and eleven measures did not report criterion validity. The overall quality of the psychometric properties of most measures was satisfactory. The SSBS-2, HCSBS and PKBS-2 were the three measures with the most robust evidence of sound psychometric quality in at least seven of the eight psychometric properties that were appraised. A universal working definition of social functioning as an overarching construct is recommended. There is a need for ongoing research in the area of the psychometric properties of social skills and behaviours instruments.
A measure of early physical functioning (EPF) post-stroke.

Science.gov (United States)

Finch, Lois E; Higgins, Johanne; Wood-Dauphinee, Sharon; Mayo, Nancy E

2008-07-01

To develop a comprehensive measure of Early Physical Functioning (EPF) post-stroke quantified through Rasch analysis and conceptualized using the International Classification of Functioning, Disability and Health (ICF). An observational cohort study. A cohort of 262 subjects (mean age 71.6 (standard deviation 12.5) years) hospitalized post-acute stroke. Functional assessments were made within 3 days of stroke with items from valid and reliable indices commonly utilized to evaluate stroke survivors. Information on important variables was also collected. Principal component and Rasch analysis confirmed the factor structure and dimensionality of the measure. Rasch analysis combined items across ICF components to develop the measure. Items were deleted iteratively; those retained fit the model and were related to the construct. Reliability and validity were assessed. A 38-item unidimensional measure of the EPF met all Rasch model requirements. The item difficulty matched the person ability (mean person measure: -0.31; standard error 0.37 logits), and reliability of the person-item hierarchy was excellent at 0.97. Initial validity was adequate. The 38-item EPF measure was developed. It expands the range of assessment post-acute stroke; it covers a broad spectrum of difficulty with good initial psychometric properties that, once revalidated, can assist in planning and evaluating early interventions.
Psychometric properties of the Cumulated Ambulation Score

DEFF Research Database (Denmark)

Ferriero, Giorgio; Kristensen, Morten T; Invernizzi, Marco

2018-01-01

INTRODUCTION: In the geriatric population, independent mobility is a key factor in determining readiness for discharge following acute hospitalization. The Cumulated Ambulation Score (CAS) is a potentially valuable score that allows day-to-day measurements of basic mobility. The CAS was developed...... and validated in older patients with hip fracture as an early postoperative predictor of short-term outcome, but it is also used to assess geriatric in-patients with acute medical illness. Despite the fast-accumulating literature on the CAS, to date no systematic review synthesizing its psychometric properties....... Of 49 studies identified, 17 examined the psychometric properties of the CAS. EVIDENCE SYNTHESIS: Most papers dealt with patients after hip fracture surgery, and only 4 studies assessed the CAS psychometric characteristics also in geriatric in-patients with acute medical illness. Two versions of CAS...
Meeting the requirements of both classroom-based and systemic assessment of mathematics proficiency: The potential of Rasch measurement theory

Directory of Open Access Journals (Sweden)

Tim Dunne

2012-11-01

Full Text Available The challenges inherent in assessing mathematical proficiency depend on a number of factors, amongst which are an explicit view of what constitutes mathematical proficiency, an understanding of how children learn and the purpose and function of teaching. All of these factors impact on the choice of approach to assessment. In this article we distinguish between two broad types of assessment, classroom-based and systemic assessment. We argue that the process of assessment informed by Rasch measurement theory (RMT) can potentially support the demands of both classroom-based and systemic assessment, particularly if a developmental approach to learning is adopted, and an underlying model of developing mathematical proficiency is explicit in the assessment instruments and their supporting material. An example of a mathematics instrument and its analysis which illustrates this approach is presented. We note that the role of assessment in the 21st century is potentially powerful. This influential role can only be justified if the assessments are of high quality and can be selected to match suitable moments in learning progress and the teaching process. Users of assessment data must have sufficient knowledge and insight to interpret the resulting numbers validly, and have sufficient discernment to make considered educational inferences from the data for teaching and learning responses.
Psychometric Properties of Virtual Reality Vignette Performance Measures: A Novel Approach for Assessing Adolescents' Social Competency Skills

Science.gov (United States)

Paschall, Mallie J.; Fishbein, Diana H.; Hubal, Robert C.; Eldreth, Diana

2005-01-01

This study examined the psychometric properties of performance measures for three novel, interactive virtual reality vignette exercises developed to assess social competency skills of at-risk adolescents. Performance data were collected from 117 African-American male 15-17 year olds. Data for 18 performance measures were obtained, based on…
A Comparison between Discrimination Indices and Item-Response Theory Using the Rasch Model in a Clinical Course Written Examination of a Medical School.

Science.gov (United States)

Park, Jong Cook; Kim, Kwang Sig

2012-03-01

The reliability of a test is determined by the characteristics of each item. Item analysis is achieved by classical test theory and item response theory. The purpose of the study was to compare the discrimination indices with item response theory using the Rasch model. Thirty-one 4th-year medical school students participated in the clinical course written examination, which included 22 A-type items and 3 R-type items. Point biserial correlation coefficient (C(pbs)) was compared to method of extreme group (D), biserial correlation coefficient (C(bs)), item-total correlation coefficient (C(it)), and corrected item-total correlation coefficient (C(cit)). The Rasch model was applied to estimate item difficulty and examinee's ability and to calculate item fit statistics using joint maximum likelihood. The explanatory power (r2) of C(pbs) decreased in the following order: C(cit) (1.00), C(it) (0.99), C(bs) (0.94), and D (0.45). The ranges of difficulty logit and standard error and ability logit and standard error were -0.82 to 0.80 and 0.37 to 0.76, -3.69 to 3.19 and 0.45 to 1.03, respectively. Items 9 and 23 have outfit ≥ 1.3. Students 1, 5, 7, 18, 26, 30, and 32 have fit ≥ 1.3. C(pbs), C(cit), and C(it) are good discrimination parameters. The Rasch model can estimate the item difficulty parameter and the examinee's ability parameter with standard error. The fit statistics can identify bad items and unpredictable examinee responses.
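As a rough illustration of the classical discrimination indices compared in the record above, the following Python sketch computes an item-total correlation, a corrected item-total correlation, and an extreme-group index D for one item of a small 0/1 response matrix. The abstract does not give the authors' exact formulas, so the function names, the 27% extreme-group fraction, and the toy data are assumptions made only for this example.

```python
import math

def pearson(x, y):
    """Plain Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)

def discrimination_indices(responses, item, group_fraction=0.27):
    """Classical discrimination indices for one dichotomous item.

    responses: list of rows, one row of 0/1 scores per examinee.
    group_fraction: share of examinees in each extreme group
                    (0.27 is an assumed, commonly used value).
    """
    item_scores = [row[item] for row in responses]
    totals = [sum(row) for row in responses]
    # Total score with the item itself removed (the "corrected" total).
    rest = [t - s for t, s in zip(totals, item_scores)]

    # Rank examinees by total score and take the bottom and top groups.
    order = sorted(range(len(responses)), key=lambda i: totals[i])
    k = max(1, round(group_fraction * len(responses)))
    low, high = order[:k], order[-k:]
    d = (sum(item_scores[i] for i in high) - sum(item_scores[i] for i in low)) / k

    return {
        "item_total": pearson(item_scores, totals),
        "corrected_item_total": pearson(item_scores, rest),
        "extreme_group_D": d,
    }

# Hypothetical toy data: 6 examinees, 4 items.
toy = [
    [1, 1, 1, 1],
    [1, 1, 1, 0],
    [1, 1, 0, 0],
    [1, 0, 0, 0],
    [0, 1, 0, 0],
    [0, 0, 0, 0],
]
indices = discrimination_indices(toy, item=0)
```

Removing the item from the total before correlating avoids counting the item on both sides of the correlation, which is why the corrected index is usually the smaller of the two.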
Measuring Patient-Reported Outcomes: Key Metrics in Reconstructive Surgery.

Science.gov (United States)

Voineskos, Sophocles H; Nelson, Jonas A; Klassen, Anne F; Pusic, Andrea L

2018-01-29

Satisfaction and improved quality of life are among the most important outcomes for patients undergoing plastic and reconstructive surgery for a variety of diseases and conditions. Patient-reported outcome measures (PROMs) are essential tools for evaluating the benefits of newly developed surgical techniques. Modern PROMs are being developed with new psychometric approaches, such as Rasch Measurement Theory, and their measurement properties (validity, reliability, responsiveness) are rigorously tested. These advances have resulted in the availability of PROMs that provide clinically meaningful data and effectively measure functional as well as psychosocial outcomes. This article guides the reader through the steps of creating a PROM and highlights the potential research and clinical uses of such instruments. Limitations of PROMs and anticipated future directions in this field are discussed.
Quality of life in the Danish general population--normative data and validity of WHOQOL-BREF using Rasch and item response theory models

DEFF Research Database (Denmark)

Noerholm, V; Groenvold, M; Watt, T

2004-01-01

BACKGROUND: The main objective of this study was to investigate the construct validity of the WHOQOL-BREF by use of Rasch and Item Response Theory models and to examine the stability of the model across high/low scoring individuals, gender, education, and depressive illness. Furthermore......, the objective of the study was to estimate the reference data for the quality of life questionnaire WHOQOL-BREF in the general Danish population and in subgroups defined by age, gender, and education. METHODS: Mail-out-mail-back questionnaires were sent to a randomly selected sample of the Danish general...... population. The response rate was 68.5%, and the sample reported here contained 1101 respondents: 578 women and 519 men (four respondents did not indicate their genders). RESULTS: Each of the four domains of the WHOQOL-BREF scale fitted a two-parameter IRT model, but did not fit the Rasch model. Due...
Development and psychometric evaluation of a new team effectiveness scale for all types of community adult mental health teams: a mixed-methods approach.

Science.gov (United States)

El Ansari, Walid; Lyubovnikova, Joanne; Middleton, Hugh; Dawson, Jeremy F; Naylor, Paul B; West, Michael A

2016-05-01

Defining 'effectiveness' in the context of community mental health teams (CMHTs) has become increasingly difficult under the current pattern of provision required in National Health Service mental health services in England. The aim of this study was to establish the characteristics of multi-professional team working effectiveness in adult CMHTs to develop a new measure of CMHT effectiveness. The study was conducted between May and November 2010 and comprised two stages. Stage 1 used a formative evaluative approach based on the Productivity Measurement and Enhancement System to develop the scale with multiple stakeholder groups over a series of qualitative workshops held in various locations across England. Stage 2 analysed responses from a cross-sectional survey of 1500 members in 135 CMHTs from 11 Mental Health Trusts in England to determine the scale's psychometric properties. Based on an analysis of its structural validity and reliability, the resultant 20-item scale demonstrated good psychometric properties and captured one overall latent factor of CMHT effectiveness comprising seven dimensions: improved service user well-being, creative problem-solving, continuous care, inter-team working, respect between professionals, engagement with carers and therapeutic relationships with service users. The scale will be of significant value to CMHTs and healthcare commissioners both nationally and internationally for monitoring, evaluating and improving team functioning in practice. © 2015 John Wiley & Sons Ltd.
[The methodological assessment and qualitative evaluation of psychometric performance tests based on the example of modern tests that assess reading and spelling skills].

Science.gov (United States)

Galuschka, Katharina; Rothe, Josefine; Schulte-Körne, Gerd

2015-09-01

This article looks at a means of objectively evaluating the quality of psychometric tests. This approach enables users to evaluate psychometric tests based on their methodological characteristics, in order to decide which instrument should be used. Reading and spelling assessment tools serve as examples. The paper also provides a review of German psychometric tests for the assessment of reading and spelling skills. This method facilitates the identification of psychometric tests of high methodological quality which can be used for the assessment of reading and spelling skills. Reading performance should ideally be assessed with the following instruments: ELFE 1-6, LGVT 6-12, LESEN 6-7, LESEN 8-9, or WLLP-R. The tests to be used for the evaluation of spelling skills are DERET 1-2+, DERET 3-4+, WRT 1+, WRT 2+, WRT 3+, WRT 4+ or HSP 1-10.
Review of the Psychometric Evidence of the Perceived Stress Scale

Directory of Open Access Journals (Sweden)

Eun-Hyun Lee, RN, PhD

2012-12-01

Conclusion: Overall, the PSS is an easy-to-use questionnaire with established acceptable psychometric properties. However, future studies should evaluate these psychometric properties in greater depth, and validate the scale using diverse populations.

Using the Many-Faceted Rasch Model to Evaluate Standard Setting Judgments: An Illustration with the Advanced Placement Environmental Science Exam

Science.gov (United States)

Kaliski, Pamela K.; Wind, Stefanie A.; Engelhard, George, Jr.; Morgan, Deanna L.; Plake, Barbara S.; Reshetar, Rosemary A.

2013-01-01

The many-faceted Rasch (MFR) model has been used to evaluate the quality of ratings on constructed response assessments; however, it can also be used to evaluate the quality of judgments from panel-based standard setting procedures. The current study illustrates the use of the MFR model for examining the quality of ratings obtained from a standard…
Measurement of Online Student Engagement: Utilization of Continuous Online Student Behavior Indicators as Items in a Partial Credit Rasch Model

Science.gov (United States)

Anderson, Elizabeth

2017-01-01

Student engagement has been shown to be essential to the development of research-based best practices for K-12 education. It has been defined and measured in numerous ways. The purpose of this research study was to develop a measure of online student engagement for grades 3 through 8 using a partial credit Rasch model and validate the measure…
[Evaluation of Scientific Research Assignments with the Many-Facet Rasch Measurement Model]

Directory of Open Access Journals (Sweden)

Ramazan BAŞTÜRK

2010-06-01

Full Text Available The purpose of this study is to investigate the usefulness of the many-facet Rasch model (MFRM) in evaluating the quality of performance related to preparing research assignments in higher education. The Rasch model utilizes item response theory, stating that the probability of a correct response to a test item/task depends largely on a single parameter, the ability of the person. MFRM extends this one-parameter model to other facets, for example, rater severity, rating scale format, and task difficulty levels. This paper specifically investigated research preparation ability in terms of item/task difficulty and rater severity/leniency. Fourth-year counseling students prepared research assignments during the autumn semester of the 2009-2010 school year in the "Research Methods in Education" course. Six judges evaluated each student's assignment using the "Research Assignment Evaluation Rubric". The results of this study demonstrated that the MFRM is a powerful tool for handling polytomous data in performance and peer assessment in higher education.
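The facets described in the record above can be made concrete with a small Python sketch of a rating-scale form of the many-facet Rasch model, in which the log-odds of category k over category k-1 equal person ability minus task difficulty minus rater severity minus a category threshold. This is a minimal sketch under stated assumptions, not the estimation procedure used in the study: the function names and all parameter values are hypothetical, and real analyses estimate these parameters from data with dedicated software.

```python
import math

def mfrm_category_probs(ability, task_difficulty, rater_severity, thresholds):
    """Category probabilities for one person-task-rater combination.

    Assumed model (all values in logits):
        log(P_k / P_{k-1}) = ability - task_difficulty - rater_severity - thresholds[k-1]
    Returns probabilities for categories 0..len(thresholds).
    """
    eta = ability - task_difficulty - rater_severity
    # Cumulative log-numerators; category 0 is the reference with value 0.
    log_numerators = [0.0]
    for tau in thresholds:
        log_numerators.append(log_numerators[-1] + eta - tau)
    # Subtract the maximum before exponentiating for numerical stability.
    peak = max(log_numerators)
    weights = [math.exp(v - peak) for v in log_numerators]
    total = sum(weights)
    return [w / total for w in weights]

def expected_score(probs):
    """Expected rating given category probabilities."""
    return sum(k * p for k, p in enumerate(probs))

# Hypothetical values: one student, one rubric item, two raters, 4 categories.
thresholds = [-1.0, 0.0, 1.0]
lenient = mfrm_category_probs(0.5, 0.0, -0.5, thresholds)
severe = mfrm_category_probs(0.5, 0.0, 0.8, thresholds)
```

With a single threshold fixed at 0 and rater severity 0, the function reduces to the dichotomous Rasch model, where the probability of success is exp(ability - difficulty) / (1 + exp(ability - difficulty)). Holding the student and task fixed, the more severe rater yields a lower expected rating, which is the effect the rater facet is meant to capture.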
Psychometric Research in Reading.

Science.gov (United States)

Davis, Frederick B.

This review of psychometric research in reading analyzes the factors which seem related to reading comprehension skills. Experimental analysis of reading comprehension by L. E. Thorndike revealed two major components: knowledge of word meanings and verbal reasoning abilities. Subsequent analysis of experimental studies of reading comprehension…
Dimensions of personality pathology in adolescents: Psychometric properties of the DAPP-BQ-A

OpenAIRE

Tromp, N.B.; Koot, H.M.

2008-01-01

This study aimed to contribute to the dimensional approach to personality pathology by addressing the applicability of a personality pathology questionnaire, originally developed for adults, in adolescent samples. The psychometric properties of the Dimensional Assessment of Personality Pathology-Basic Questionnaire for Adolescents (DAPP-BQ-A) were studied in two samples including 170 adolescents referred for mental health services and 1,628 nonreferred adolescents, respectively. Factor analysi...
Self-reported competency--validation of the Norwegian version of the patient competency rating scale for traumatic brain injury.

Science.gov (United States)

Sveen, Unni; Andelic, Nada; Bautz-Holter, Erik; Røe, Cecilie

2015-01-01

To evaluate the psychometric properties of the Norwegian version of the Patient Competency Rating Scale (PCRS) in patients with traumatic brain injury (TBI) at 12 months post-injury. Demographic and injury-related data were registered upon admission to the hospital in 148 TBI patients with mild, moderate, or severe TBI. At 12 months post-injury, competency in activities and global functioning were measured using the PCRS patient version and the Glasgow Outcome Scale-Extended (GOSE). Descriptive reliability statistics, factor analysis and Rasch modeling were applied to explore the psychometric properties of the PCRS. External validity was evaluated using the GOSE. The PCRS can be divided into three subscales that reflect interpersonal/emotional, cognitive, and activities of daily living competency. The three-factor solution explained 56.6% of the variance in functioning. The internal consistency was very good, with a Cronbach's α of 0.95. Item 30, "controlling my laughter", did not load above 0.40 on any factors and did not fit the Rasch model. The external validity of the subscales was acceptable, with correlations between 0.50 and 0.52 with the GOSE. The Norwegian version of the PCRS is reliable, has an acceptable construct and external validity, and can be recommended for use during the later phases of TBI.
[Design and validation of an oral health questionnaire for preoperative anaesthetic evaluation].

Science.gov (United States)

Ruíz-López Del Prado, Gema; Blaya-Nováková, Vendula; Saz-Parkinson, Zuleika; Álvarez-Montero, Óscar Luis; Ayala, Alba; Muñoz-Moreno, Maria Fe; Forjaz, Maria João

Dental injuries incurred during endotracheal intubation are more frequent in patients with previous oral pathology. The study objectives were to develop an oral health questionnaire for preanaesthesia evaluation, easy to apply for personnel without special dental training; and establish a cut-off value for detecting persons with poor oral health. Validation study of a self-administered questionnaire, designed according to a literature review and an expert group's recommendations. The questionnaire was applied to a sample of patients evaluated in a preanaesthesia consultation. Rasch analysis of the questionnaire psychometric properties included viability, acceptability, content validity and reliability of the scale. The sample included 115 individuals, 50.4% men, with a median age of 58 years (range: 38-71). The final analysis of 11 items presented a Person Separation Index of 0.861 and good adjustment of data to the Rasch model. The scale was unidimensional and its items were not biased by sex, age or nationality. The oral health linear measure presented good construct validity. The cut-off value was set at 52 points. The questionnaire showed sufficient psychometric properties to be considered a reliable tool, valid for measuring the state of oral health in preoperative anaesthetic evaluations. Copyright © 2016 Sociedade Brasileira de Anestesiologia. Published by Elsevier Editora Ltda. All rights reserved.
Design and validation of an oral health questionnaire for preoperative anaesthetic evaluation

Directory of Open Access Journals (Sweden)

Gema Ruíz-López del Prado

Full Text Available Abstract Background and objectives: Dental injuries incurred during endotracheal intubation are more frequent in patients with previous oral pathology. The study objectives were to develop an oral health questionnaire for preanaesthesia evaluation, easy to apply for personnel without special dental training; and establish a cut-off value for detecting persons with poor oral health. Methods: Validation study of a self-administered questionnaire, designed according to a literature review and an expert group's recommendations. The questionnaire was applied to a sample of patients evaluated in a preanaesthesia consultation. Rasch analysis of the questionnaire psychometric properties included viability, acceptability, content validity and reliability of the scale. Results: The sample included 115 individuals, 50.4% men, with a median age of 58 years (range: 38-71). The final analysis of 11 items presented a Person Separation Index of 0.861 and good adjustment of data to the Rasch model. The scale was unidimensional and its items were not biased by sex, age or nationality. The oral health linear measure presented good construct validity. The cut-off value was set at 52 points. Conclusions: The questionnaire showed sufficient psychometric properties to be considered a reliable tool, valid for measuring the state of oral health in preoperative anaesthetic evaluations.
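The Person Separation Index reported in the record above is commonly defined as the share of observed variance in person measures that is not attributable to measurement error. The Python sketch below shows that calculation under stated assumptions: the function name is hypothetical, the person measures and standard errors are invented toy values rather than data from the study, and the sample variance uses an n - 1 denominator, which may differ from the convention of a given Rasch package.

```python
def person_separation_index(measures, standard_errors):
    """Separation reliability: (observed variance - error variance) / observed variance.

    measures: person location estimates in logits.
    standard_errors: standard error of each person estimate.
    """
    n = len(measures)
    mean = sum(measures) / n
    # Observed variance of the person estimates (n - 1 denominator, an assumption).
    observed_var = sum((m - mean) ** 2 for m in measures) / (n - 1)
    # Mean squared standard error approximates the error variance.
    error_var = sum(se ** 2 for se in standard_errors) / n
    return (observed_var - error_var) / observed_var

# Hypothetical toy values, not taken from the study.
toy_measures = [-2.0, -1.0, 0.0, 1.0, 2.0]
toy_errors = [0.5, 0.5, 0.5, 0.5, 0.5]
psi = person_separation_index(toy_measures, toy_errors)
```

Values near 1 indicate that the spread of person measures is large relative to their standard errors, so respondents can be reliably distinguished along the scale.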
[Career Planning Profile of Vocational High School Students Using Rasch Modeling by Gender]

Directory of Open Access Journals (Sweden)

Itsar Bolo Rangka

2017-08-01

Full Text Available This research aimed to (1) take an inventory of students' career planning, and (2) measure students' career planning by gender. Data were analyzed with the Rasch model for 45 students, with an actual power measurement of 0.9272652. The findings showed that (1) the career planning inventory fit the theoretical model, and (2) female students tended to have higher career planning than male students. In future use, measuring students' career planning with this inventory yields high measurement information only for students of mediocre ability. Further, researchers should consider eliminating item No. 12 from the inventory because it is biased toward the male gender.
Acceptance on Mobile Learning via SMS: A Rasch Model Analysis

Directory of Open Access Journals (Sweden)

Issham Ismail

2010-04-01

Full Text Available This study investigated whether mobile learning via Short Message Service (SMS-learning) is accepted by students enrolled in the distance learning academic programme at the Universiti Sains Malaysia. It explored the impact of perceived usefulness, perceived ease of use and usability of the system on acceptability. The survey used a questionnaire consisting of statements regarding the participants' demographics and their experiences in and perception of using mobile learning via SMS, and involved 105 students from management and sciences disciplines. Rasch model analysis was used to measure responses on a 5-point Likert scale. Results indicated that the usability of the system contributed to its effectiveness in assisting the students with their study. Respondents agreed that SMS-learning is easy, effective and useful in helping them study. However, the results also revealed a problem with mobile learning: less interaction with lecturers. The findings imply that students highly endorse this mode of communication and interaction.
Smoking habit and psychometric scores: a community study.

Science.gov (United States)

Waal-Manning, H J; de Hamel, F A

1978-09-13

During the Milton health survey subjects completed a psychometric inventory consisting of the 48 questions of the Middlesex Hospital questionnaire (MHQ) and 26 from the hostility and direction of hostility questionnaire (HDHQ) designed to examine nine psychological dimensions. The 1209 subjects were classified into smoking categories and the scores for each psychometric trait were calculated. Women scored higher than men and heavy smokers scored higher than "never smokers". The psychometric traits and the scores of the four smoking categories after correcting for age and Quetelet's index showed statistically significant differences by analysis of variance in respect of somatic anxiety and depression for both men and women; and free-floating anxiety, phobic anxiety, hysteria, acting out hostility, self criticism and guilt in women. For somatic anxiety the increase in score almost exactly paralleled the increasing quantity of tobacco consumed.
The importance of statistical modelling in clinical research : Comparing multidimensional Rasch-, structural equation and linear regression models for analyzing the depression of relatives of psychiatric patients.

Science.gov (United States)

Alexandrowicz, Rainer W; Jahn, Rebecca; Friedrich, Fabian; Unger, Anne

2016-06-01

Various studies have shown that caregiving relatives of schizophrenic patients are at risk of suffering from depression. These studies differ with respect to the applied statistical methods, which could influence the findings. Therefore, the present study analyzes to what extent different methods may cause differing results. Using one data set, it contrasts the results of three different modelling approaches: Rasch Modelling (RM), Structural Equation Modelling (SEM), and Linear Regression Modelling (LRM). The results of the three models varied considerably, reflecting the different assumptions of the respective models. Latent trait models (i.e., RM and SEM) generally provide more convincing results by correcting for measurement error, and the RM specifically proves superior because it treats ordered categorical data most adequately.
Using Rasch models to develop and validate an environmental thinking learning progression

Science.gov (United States)

Hashimoto-Martell, Erin A.

Environmental understanding is highly relevant in today's global society. Social, economic, and political structures are connected to the state of environmental degradation and exploitation, and disproportionately affect those in poor or urban communities (Brulle & Pellow, 2006; Executive Order No. 12898, 1994). Environmental education must challenge the way we live, and our social and ecological quality of life, with the goal of responsible action. The development of a learning progression in environmental thinking, along with a corresponding assessment, could provide a tool that could be used across environmental education programs to help evaluate and guide programmatic decisions. This study sought to determine if a scale could be constructed that allowed individuals to be ordered along a continuum of environmental thinking. First, I developed the Environmental Thinking Learning Progression, a scale of environmental thinking from novice to advanced, based on the current available research and literature. The scale consisted of four subscales, each measuring a different aspect of environmental thinking: place consciousness, human connection, agency, and science concepts. Second, a measurement instrument was developed, so that the data appropriately fit the model using Rasch analysis. A Rasch analysis of the data placed respondents along a continuum, given the range of item difficulty for each subscale. Across three iterations of instrument revision and data collection, findings indicated that the items were ordered in a hierarchical way that corresponded to the construct of environmental thinking. Comparisons between groups showed that the average score of respondents who had participated in environmental education programs was significantly higher than those who had not. A comparison between males and females showed no significant difference in average measure, however, there were varied significant differences between how racial/ethnic groups performed. Overall
Accounting for standard errors of vision-specific latent trait in regression models.

Science.gov (United States)

Wong, Wan Ling; Li, Xiang; Li, Jialiang; Wong, Tien Yin; Cheng, Ching-Yu; Lamoureux, Ecosse L

2014-07-11

To demonstrate the effectiveness of Hierarchical Bayesian (HB) approach in a modeling framework for association effects that accounts for SEs of vision-specific latent traits assessed using Rasch analysis. A systematic literature review was conducted in four major ophthalmic journals to evaluate Rasch analysis performed on vision-specific instruments. The HB approach was used to synthesize the Rasch model and multiple linear regression model for the assessment of the association effects related to vision-specific latent traits. The effectiveness of this novel HB one-stage "joint-analysis" approach allows all model parameters to be estimated simultaneously and was compared with the frequently used two-stage "separate-analysis" approach in our simulation study (Rasch analysis followed by traditional statistical analyses without adjustment for SE of latent trait). Sixty-six reviewed articles performed evaluation and validation of vision-specific instruments using Rasch analysis, and 86.4% (n = 57) performed further statistical analyses on the Rasch-scaled data using traditional statistical methods; none took into consideration SEs of the estimated Rasch-scaled scores. The two models on real data differed for effect size estimations and the identification of "independent risk factors." Simulation results showed that our proposed HB one-stage "joint-analysis" approach produces greater accuracy (average of 5-fold decrease in bias) with comparable power and precision in estimation of associations when compared with the frequently used two-stage "separate-analysis" procedure despite accounting for greater uncertainty due to the latent trait. Patient-reported data, using Rasch analysis techniques, do not take into account the SE of latent trait in association analyses. The HB one-stage "joint-analysis" is a better approach, producing accurate effect size estimations and information about the independent association of exposure variables with vision-specific latent traits
Testing Psychometrics of Healthcare Empowerment Questionnaires ...

African Journals Online (AJOL)

Testing Psychometrics of Healthcare Empowerment Questionnaires (HCEQ) among Iranian ... translation and backtranslation procedures, pilot testing, and getting views of expert panel.
Psychometric Evaluation and Discussions of English Language Learners' Listening Comprehension

Science.gov (United States)

Seo, Daeryong; Taherbhai, Husein; Frantz, Roger

2016-01-01

The importance of listening in the context of English language acquisition is gaining acceptance, but its unique attributes in language performance, while substantively and qualitatively justifiable, are generally not psychometrically defined. This article psychometrically supports listening as a distinct domain among the three other domains of…
The Alliance Negotiation Scale: A psychometric investigation.

Science.gov (United States)

Doran, Jennifer M; Safran, Jeremy D; Muran, J Christopher

2016-08-01

This study investigates the utility and psychometric properties of a new measure of psychotherapy process, the Alliance Negotiation Scale (ANS; Doran, Safran, Waizmann, Bolger, & Muran, 2012). The ANS was designed to operationalize the theoretical construct of negotiation (Safran & Muran, 2000), and to extend our current understanding of the working alliance concept (Bordin, 1979). The ANS was also intended to improve upon existing measures such as the Working Alliance Inventory (WAI; Horvath & Greenberg, 1986, 1989) and its short form (WAI-S; Tracey & Kokotovic, 1989) by expanding the emphasis on negative therapy process. The present study investigates the psychometric validity of the ANS test scores and interpretation, including confirming its original factor structure and evaluating its internal consistency and construct validity. Construct validity was examined through the ANS' convergence and divergence with several existing scales that measure theoretically related constructs. The results bolster and extend previous findings about the psychometric integrity of the ANS, and begin to illuminate the relationship between negotiation and other important variables in psychotherapy research. (PsycINFO Database Record (c) 2016 APA, all rights reserved).
Patient self-report section of the ASES questionnaire: a Spanish validation study using classical test theory and the Rasch model.

Science.gov (United States)

Vrotsou, Kalliopi; Cuéllar, Ricardo; Silió, Félix; Rodriguez, Miguel Ángel; Garay, Daniel; Busto, Gorka; Trancho, Ziortza; Escobar, Antonio

2016-10-18

The aim of the current study was to validate a Spanish version of the self-report section of the American Shoulder and Elbow Surgeons questionnaire (ASES-p). Shoulder pathology patients were recruited and followed for up to 6 months after treatment. The ASES-p, Constant, SF-36 and Barthel scales were completed before and after treatment. Reliability was tested with Cronbach's alpha, and convergent validity with Spearman's correlation coefficients. Confirmatory factor analysis (CFA) and the Rasch model were implemented for assessing structural validity and unidimensionality of the scale. Models with and without the pain item were considered. Responsiveness to change was explored via standardised effect sizes. Results were acceptable for both tested models. Cronbach's alpha was 0.91, and total scale correlations with the Constant and physical SF-36 dimensions were >0.50. Factor loadings for CFA were >0.40. The Rasch model confirmed unidimensionality of the scale, even though item 10 "do usual sport" was suggested as non-informative. Finally, patients with improved post-treatment shoulder function and those receiving surgery had higher standardised effect sizes. The adapted Spanish ASES-p version is a valid and reliable tool for shoulder evaluation and its unidimensionality is supported by the data.
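As an illustration of the reliability statistic reported above, here is a minimal sketch of Cronbach's alpha computed from a respondents-by-items score matrix. The function name and the toy data are hypothetical; the study's own data are not reproduced here.

```python
from statistics import variance

def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of k item scores."""
    k = len(scores[0])
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(k)]
    # Sample variance of the total (summed) score.
    total_var = variance([sum(row) for row in scores])
    return k / (k - 1) * (1.0 - sum(item_vars) / total_var)

# Hypothetical responses of five people to four items scored 0-3.
toy = [
    [3, 2, 3, 3],
    [1, 1, 0, 1],
    [2, 2, 2, 3],
    [0, 1, 0, 0],
    [3, 3, 2, 3],
]
print(round(cronbach_alpha(toy), 3))
```

Alpha rises as the items covary more strongly relative to their individual variances, which is why it is read as internal consistency rather than as evidence of unidimensionality; the latter is what the CFA and Rasch analyses address.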
Risk perception and control, an integration of the psychometric research paradigm and social psychology

International Nuclear Information System (INIS)

Haugen, K.

1998-01-01

Full text of publication follows: This paper argues that perceptual control is an essential component in human risk evaluation. Control is seen as an integrative concept between the psychometric research paradigm and various psychological theories. The psychometric approach to the study of risk has mainly dealt with the intuitive judgements people make when they are asked to evaluate risky activities and technologies. It shows that people judge risk in relation to the possible consequences and probabilities related to an outcome; the former is more typical of the public and the latter is more often used by experts. The psychometric research tradition has concentrated on making human risk evaluations quantifiable and the reactions predictable. This paper also relates to possible practical implications of this strategy, namely that humans react heterogeneously to different kinds of threats due to perceived control. Theoretical ability to explain and elaborate perceptions of risk, as well as individual reactions, was the main criterion for the literature selection, which includes work on e.g. attribution theory, locus of control, and learned helplessness. Thus, the paper addresses available psychological views for a contribution to a developed theoretical framework for human risk evaluation. It seeks to compare and integrate the psychometric research tradition with social psychological theories. The way in which people find the informational basis for their risk judgements, either from others or from their own perceptions, is also discussed. Furthermore, the theories are related to the social and psychological reactions to the Chernobyl accident. The paper concludes that psychological theories can contribute to a more comprehensive framework for the understanding of human risk evaluation, leading to more coherent and integrative knowledge. (author)

Developing the Polish Educational Needs Assessment Tool (Pol-ENAT) in rheumatoid arthritis and systemic sclerosis: a cross-cultural validation study using Rasch analysis.

Science.gov (United States)

Sierakowska, Matylda; Sierakowski, Stanisław; Sierakowska, Justyna; Horton, Mike; Ndosi, Mwidimi

2015-03-01

To undertake cross-cultural adaptation and validation of the educational needs assessment tool (ENAT) for use with people with rheumatoid arthritis (RA) and systemic sclerosis (SSc) in Poland. The study involved two main phases: (1) cross-cultural adaptation of the ENAT from English into Polish and (2) cross-cultural validation of the Polish Educational Needs Assessment Tool (Pol-ENAT). The first phase followed an established process of cross-cultural adaptation of self-report measures. The second phase involved completion of the Pol-ENAT by patients and subjecting the data to Rasch analysis to assess the construct validity, unidimensionality, internal consistency and cross-cultural invariance. An adequate conceptual equivalence was achieved following the adaptation process. The dataset for validation comprised a total of 278 patients, 237 (85.3%) of whom were female. In each disease group (145 RA and 133 SSc), the 7 domains of the Pol-ENAT were found to fit the Rasch model, χ²(df) = 16.953 (14), p = 0.259 and 8.132 (14), p = 0.882 for RA and SSc, respectively. Internal consistency of the Pol-ENAT was high (patient separation index = 0.85 and 0.89 for SSc and RA, respectively), and unidimensionality was confirmed. Cross-cultural differential item functioning (DIF) was detected in some subscales, and DIF-adjusted conversion tables were calibrated to enable cross-cultural comparison of data between Poland and the UK. Using a standard process in cross-cultural adaptation, conceptual equivalence was achieved between the original (UK) ENAT and the adapted Pol-ENAT. Fit to the Rasch model confirmed that the construct validity, unidimensionality and internal consistency of the ENAT have been preserved.
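The reported fit statistics can be checked by hand. The sketch below recomputes the upper-tail p-values for the two chi-square values with 14 degrees of freedom, using the closed-form survival function that holds for even degrees of freedom. The function name is hypothetical, and this is a plain arithmetic check, not the software the authors used.

```python
import math

def chi2_sf_even_df(x, df):
    """Upper-tail probability of a chi-square statistic; valid for even df only."""
    if df <= 0 or df % 2:
        raise ValueError("closed form requires a positive even df")
    half = x / 2.0
    term, total = 1.0, 1.0
    # Sum of (x/2)^i / i! for i = 0 .. df/2 - 1, built up term by term.
    for i in range(1, df // 2):
        term *= half / i
        total += term
    return math.exp(-half) * total

p_ra = chi2_sf_even_df(16.953, 14)   # approximately 0.259
p_ssc = chi2_sf_even_df(8.132, 14)   # approximately 0.882
print(round(p_ra, 3), round(p_ssc, 3))
```

Both values agree with the p-values quoted in the abstract; in this item-trait interaction test a non-significant result is what indicates adequate fit to the Rasch model.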
A systematic review evaluating the psychometric properties of measures of social inclusion.

Science.gov (United States)

Cordier, Reinie; Milbourn, Ben; Martin, Robyn; Buchanan, Angus; Chung, Donna; Speyer, Renée

2017-01-01

Improving social inclusion opportunities for population health has been identified as a priority area for international policy. There is a need to comprehensively examine and evaluate the quality of psychometric properties of measures of social inclusion that are used to guide social policy and outcomes. To conduct a systematic review of the literature on all current measures of social inclusion for any population group, to evaluate the quality of the psychometric properties of identified measures, and to evaluate if they capture the construct of social inclusion. A systematic search was performed using five electronic databases (CINAHL, PsycINFO, Embase, ERIC and PubMed), and grey literature was sourced to identify measures of social inclusion. The psychometric properties of the social inclusion measures were evaluated against the COSMIN taxonomy of measurement properties using pre-set psychometric criteria. Of the 109 measures identified, twenty-five measures, involving twenty-five studies and one manual, met the inclusion criteria. The overall quality of the reviewed measures was variable, with the Social and Community Opportunities Profile-Short, Social Connectedness Scale and the Social Inclusion Scale demonstrating the strongest evidence for sound psychometric quality. The most common domain included in the measures was connectedness (21), followed by participation (19); the domain of citizenship was covered by the fewest measures (10). No single instrument measured all aspects within the three domains of social inclusion. Of the measures with sound psychometric evidence, the Social and Community Opportunities Profile-Short captured the construct of social inclusion best. The overall quality of the psychometric properties demonstrates that the current suite of available instruments for the measurement of social inclusion is promising but needs further refinement. There is a need for a universal working definition of social inclusion as an overarching
Evaluating the MBTI® Form M in a South African context

Directory of Open Access Journals (Sweden)

Casper J.J. van Zyl

2012-09-01

Research purpose: To investigate the reliability, validity and differential item functioning of the MBTI® Form M across groups in South Africa using Classical Test Theory (CTT) and Item Response Theory (IRT) methods. Motivation for the study: To add to the continual research and improvement of the MBTI® Form M through the investigation of its psychometric properties across groups in South Africa. Research design, approach and method: This study falls within the quantitative research paradigm. Classical test theory methods and Rasch analysis were used to evaluate the functioning of the MBTI Form M across gender and ethnic groups. A cross-sectional study was completed consisting of 10 705 South African respondents. Main findings: Excellent reliability was found for the instrument across groups in the sample. Good evidence for construct validity was found using exploratory factor analysis and confirmatory factor analysis. Some evidence for uniform bias was found across ethnic and gender groups and a few items reflected non-uniform DIF across gender groups only. The effect of uniform and non-uniform DIF did not appear to have major practical implications for the interpretation of the scales. Practical/managerial implications: The results provided evidence that supports the psychometric validity of the MBTI instrument in the South African context. Contribution/value-add: This study is the largest study to date regarding the psychometric functioning of the MBTI instrument in South Africa. It contributes to the evolution of the instrument in line with the legislative requirements concerning the use of psychometric tests in South Africa.
Preliminary study to evaluate the validity of the mini-mental state examination in a normal population in Turkey.

Science.gov (United States)

Küçükdeveci, Ayse A; Kutlay, Sehim; Elhan, Atilla H; Tennant, Alan

2005-03-01

Although the Mini-Mental State Examination (MMSE) is widely used in clinical practice, normative scores for a healthy population have not been documented in Turkey. The aim of this study was to validate the MMSE in a healthy population and to provide normal scores. Internal construct validity of the Turkish version of the MMSE among a preliminary sample of 406 normal people was assessed using the Rasch unidimensional measurement model. Scores of the normal sample varied according to age and education but not according to sex. The data derived from this sample showed poor fit to the Rasch model (mean item fit, -2.082, SD 3.022). Only four of 11 items met model expectations. There was also differential item functioning by education and age for most items. Thus the internal construct validity of the Turkish MMSE in a normative sample could not be demonstrated by Rasch analysis. The scale failed modern psychometric criteria for scalability. We therefore suggest that other large normative MMSE data sets be tested in terms of internal construct validity. If these findings are replicated, the validity of MMSE norms and their consequent use in clinical practice should be reconsidered.
Psychometric evaluation of the Danish version of Satisfaction with Daily Occupations (SDO)

DEFF Research Database (Denmark)

Eklund, Mona; Morville, Anne-Le

2014-01-01

AIMS: The Satisfaction with Daily Occupations (SDO) scale assesses satisfaction within the domains of work, leisure, domestic tasks, and self-care. The aim was to investigate the psychometric properties of the Danish version of the SDO when used with asylum seekers. METHODS: The participants were...... and criterion and concurrent validity. The findings regarding discriminant validity were somewhat inconclusive. The Danish SDO may be regarded as psychometrically sound but further psychometric testing is needed....
A Rasch analysis of patients' opinions of primary health care professionals' ethical behaviour with respect to communication issues.

Science.gov (United States)

González-de Paz, Luis; Kostov, Belchin; López-Pina, Jose A; Solans-Julián, Pilar; Navarro-Rubio, M Dolors; Sisó-Almirall, Antoni

2015-04-01

Patients' opinions are crucial in assessing the effectiveness of the ethical theories which underlie the care relationship between patients and primary health care professionals. To study the ethical behaviour of primary health care professionals with respect to communication issues according to patients' opinions. Cross-sectional study using a self-administered questionnaire in patients from a network of 15 urban primary health centres. Participants were patients attending the centres when the study was conducted. We used a Rasch analysis to verify the structure of the 17 questionnaire items, and to calculate interval-level measures for patients and items. We analysed differences according to patient subgroups using analysis of variance tests and differences between the endorsement of each item. We analysed 1013 (70.34%) of the questionnaires. Data fit to the Rasch model was achieved after collapsing two categories and eliminating five items. Items with the lowest degree of endorsement were related to the management of differences in conflictive situations between patients and health care professionals. We found significant differences (P communication skills were respected by family physicians and nurses. However, opinions on endorsement were lower when patients disagreed with health care professionals. The differences found between patient subgroups demonstrated the importance of trust and confidence between patients and professionals. © The Author 2014. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.
Development and Psychometric Validation of the Family Outcomes Survey-Revised

Science.gov (United States)

Bailey, Donald B., Jr.; Raspa, Melissa; Olmsted, Murrey G.; Novak, Scott P.; Sam, Ann M.; Humphreys, Betsy P.; Nelson, Robin; Robinson, Nyle; Guillen, Chelsea

2011-01-01

Few psychometrically valid scales exist to assess family outcomes and the helpfulness of early intervention. This article describes the development and psychometric properties of the Family Outcomes Survey-Revised. The revision was prompted by the need to (a) create a new format that would be easier for parents to understand, (b) revise and expand…
Psychometric evaluation of the Danish version of Satisfaction with Daily Occupations (SDO)

DEFF Research Database (Denmark)

Eklund, Mona; Morville, Anne-Le

2013-01-01

Aims: The Satisfaction with Daily Occupations (SDO) scale assesses satisfaction within the domains of work, leisure, domestic tasks, and self-care. The aim was to investigate the psychometric properties of the Danish version of the SDO when used with asylum seekers. Methods: The participants were...... and criterion and concurrent validity. The findings regarding discriminant validity were somewhat inconclusive. The Danish SDO may be regarded as psychometrically sound but further psychometric testing is needed. Key words: validity, reliability, health, Activity...
Validation of online psychometric instruments for common mental health disorders: a systematic review.

Science.gov (United States)

van Ballegooijen, Wouter; Riper, Heleen; Cuijpers, Pim; van Oppen, Patricia; Smit, Johannes H

2016-02-25

Online questionnaires for measuring common mental health disorders such as depression and anxiety disorders are increasingly used. The psychometrics of several pen-and-paper questionnaires have been re-examined for online use and new online instruments have been developed and tested for validity as well. This study aims to review and synthesise the literature on this subject and provide a framework for future research. We searched Medline and PsycINFO for psychometric studies on online instruments for common mental health disorders and extracted the psychometric data. Studies were coded and assessed for quality by independent raters. We included 56 studies on 62 online instruments. For common instruments such as the CES-D, MADRS-S and HADS there is mounting evidence for adequate psychometric properties. Further results are scattered over different instruments and different psychometric characteristics. Few studies included patient populations. We found at least one online measure for each of the included mental health disorders and symptoms. A small number of online questionnaires have been studied thoroughly. This study provides an overview of online instruments to refer to when choosing an instrument for assessing common mental health disorders online, and can structure future psychometric research.
Psychometric qualities of questionnaires for the assessment of otitis media impact.

Science.gov (United States)

Timmerman, A A; Meesters, C M G; Speyer, R; Anteunis, L J C

2007-12-01

The assessment of impact and evaluation of treatment effects in chronic otitis media (OM) calls for a much broader approach than just examining the presence of middle ear effusion or hearing loss. It is increasingly recognised that this condition may result in a compromised quality of life. Several studies have used proxy-completed questionnaires to objectify the illness experience associated with chronic OM. To review questionnaires which have been developed to describe the effects of chronic OM on the daily functioning of children. Psychometric properties have been evaluated, in addition to discriminative and evaluative qualities. A systematic review of publications pertaining to developed questionnaires related to chronic OM. Systematic literature searches of PubMed (1966-January 2007) and EMBASE (1989-January 2007) were conducted, supplemented by using free text words to identify publications after January 2005. The 15 included questionnaires were developed for children with recurrent or persistent OM, describing functional health status (FHS), while two questionnaires also evaluate the effect of tympanostomy tube insertion. The questionnaires generally cover six impact areas (physical symptoms, child development, educational performance, emotional/practical burden and general health status), with physical symptoms being the most prominent. The OM8-30, OMO-22 and OM-6 adequately reflect the multidimensional aspects of FHS in chronic OM. The OMO-22 and OM8-30 show the best psychometric properties for the discrimination of impact severity between children, while the OM-6 was found to have the best qualities for the evaluation of clinical change. Clinical applicability is crucial for the assessment of FHS in chronic OM, but requires a trade-off with necessary psychometric properties.
Intelligence for education: as described by Piaget and measured by psychometrics.

Science.gov (United States)

Shayer, Michael

2008-03-01

Two separate paths to the concept of intelligence are discussed: the psychometric path being concerned with the measurement of intelligence, involving the methodology of norm-referenced testing; the path followed by Piaget, and others, addresses from the start the related question of how intelligence can be described, and employs a criterion-referenced methodology. The achievements of psychometrics are briefly described, with an argument that they now remain important tools of what Kuhn called 'normal science'. The criterion-referenced approach of Piaget and others is described, with evidence from intervention studies that the Genevan descriptions of children-in-action have allowed the choice of contexts within which children can profitably be challenged to go further in their thinking. Hence, Genevan psychology is also now a part of the normal science with important uses, shown both in neo-Piagetian studies and further research stemming from Geneva. Discussion of the 'Flynn effect' sheds light on both paths, with problems still unresolved. The argument is then developed that the relevance of neuroscience needs to be discussed to try to decide in what ways it may provide useful insights into intelligence.
Validation of the mania scale of the Universidad Nacional de Colombia using Rasch analysis

Directory of Open Access Journals (Sweden)

Ricardo Sánchez

2011-03-01

Conclusions. In this first study of the mania scale using Rasch analysis, poor fit and redundancy were detected in some items. The manic syndrome is not fully assessed by the scale. The instrument could be improved by adding depressive symptoms.
Conducting Simulation Studies in Psychometrics

Science.gov (United States)

Feinberg, Richard A.; Rubright, Jonathan D.

2016-01-01

Simulation studies are fundamental to psychometric discourse and play a crucial role in operational and academic research. Yet, resources for psychometricians interested in conducting simulations are scarce. This Instructional Topics in Educational Measurement Series (ITEMS) module is meant to address this deficiency by providing a comprehensive…
Validity of the Neuromuscular Recovery Scale: a measurement model approach.

Science.gov (United States)

Velozo, Craig; Moorhouse, Michael; Ardolino, Elizabeth; Lorenz, Doug; Suter, Sarah; Basso, D Michele; Behrman, Andrea L

2015-08-01

Objective: To determine how well the Neuromuscular Recovery Scale (NRS) items fit the Rasch, 1-parameter, partial-credit measurement model. Design: Confirmatory factor analysis (CFA) and principal components analysis (PCA) of residuals were used to determine dimensionality. The Rasch, 1-parameter, partial-credit rating scale model was used to determine rating scale structure, person/item fit, point-measure item correlations, item discrimination, and measurement precision. Setting: Seven NeuroRecovery Network clinical sites. Participants: Outpatients (N=188) with spinal cord injury. Interventions: Not applicable. Main outcome measure: NRS. Results: While the NRS met 1 of 3 CFA criteria, the PCA revealed that the Rasch measurement dimension explained 76.9% of the variance. Ten of 11 items and 91% of the patients fit the Rasch model, with 9 of 11 items showing high discrimination. Sixty-nine percent of the ratings met criteria. The items showed a logical item-difficulty order, with Stand retraining as the easiest item and Walking as the most challenging item. The NRS showed no ceiling or floor effects and separated the sample into almost 5 statistically distinct strata; individuals with an American Spinal Injury Association Impairment Scale (AIS) D classification showed the most ability, and those with an AIS A classification showed the least ability. Items not meeting the rating scale criteria appear to be related to the low frequency counts. Conclusions: The NRS met many of the Rasch model criteria for construct validity. Copyright © 2015 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
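For readers unfamiliar with the partial-credit model named above, the following minimal sketch computes the category probabilities for a single polytomous item given a person measure and the item's step thresholds. The function name and the threshold values are hypothetical; nothing here is taken from the NRS calibration.

```python
import math

def pcm_probs(theta, thresholds):
    """Category probabilities 0..m for one item under the partial credit model."""
    # Cumulative sums of (theta - delta_j); category 0 has an empty sum of 0.
    logits = [0.0]
    for delta in thresholds:
        logits.append(logits[-1] + (theta - delta))
    peak = max(logits)  # subtract the maximum for numerical stability
    weights = [math.exp(v - peak) for v in logits]
    norm = sum(weights)
    return [w / norm for w in weights]

# Hypothetical item with four categories (three step thresholds, in logits).
probs = pcm_probs(theta=0.5, thresholds=[-1.0, 0.0, 1.5])
print([round(p, 3) for p in probs])
```

With a single threshold the function reduces to the dichotomous Rasch model, which is why the partial-credit model is described as a 1-parameter model: each step has a location but no separate discrimination parameter.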
Health-related quality of life in young men with testicular cancer: validation of the Cancer Assessment for Young Adults (CAYA).

Science.gov (United States)

Hoyt, Michael A; Cano, Stefan J; Saigal, Christopher S; Stanton, Annette L

2013-12-01

Patient-reported outcome instruments are needed to measure health-related quality of life (HRQOL) in young adults with cancer. The purpose of this project was to establish a conceptual model and measurement instrument for assessment of HRQOL in young men with testicular cancer. Patient interviews and a literature review were used to develop a conceptual framework of biopsychosocial domains of cancer-related quality of life and an initial pool of questionnaire items. Items were piloted and refined. Revised items were administered to a sample (N = 171) of young (ages 18-29) men with testicular cancer and repeated 4 weeks later. Rasch measurement methods guided item reduction and scale construction. Traditional psychometric analyses were also performed to allow for comparison with existing measures. The conceptual framework included seven biopsychosocial domains: physical, sexual, intrapersonal, cognitive-emotional, social-relational, educational-vocational-avocational, and spiritual to form independent scales of the resulting questionnaire, the Cancer Assessment for Young Adults-Testicular (CAYA-T). Each scale fulfilled Rasch and traditional psychometric criteria (i.e., person separation index, 0.34-0.82; Cronbach's alpha, 0.70-0.91; and an expected pattern of convergent and discriminant validity correlations). The CAYA-T can be used to assess HRQOL across a comprehensive set of domains as identified by young men with cancer. It passes strict psychometric criteria and has potential as a useful research and clinical tool. The CAYA-T has potential research and clinical value for addressing inter-related aspects of HRQOL in young adult men with cancer. The measure may assist with assessing and monitoring HRQOL across a range of domains and contributing to more comprehensive assessment of biopsychosocial needs of young adults.
Development and validation of a quality of life questionnaire for patients with colostomy or ileostomy

Directory of Open Access Journals (Sweden)

Juul Kristian

2005-10-01

Background: Quality of life of stoma patients is increasingly being addressed in clinical trials. However, the instruments used in the majority of these studies have not been validated specifically for stoma patients. The aim of this paper is to describe the development and validation of a quality-of-life instrument, "Stoma-QOL", specifically for patients with colostomy or ileostomy. Methods: Potential items were formulated in English on the basis of the results of a series of semi-structured interviews with 169 adult stoma patients. The process resulted in a preliminary 37-item version, which was translated into French, German, Spanish and Danish, and administered repeatedly to 182 patients with colostomy or ileostomy. A psychometric selection of items was performed through Rasch analysis. The measurement properties of the final questionnaire version were subsequently tested. Results: The 20 items in the final questionnaire covered four domains: sleep, sexual activity, relations to family and close friends, and social relations to other than family and close friends. These items were found to define a unidimensional variable according to Rasch specifications (Infit MNSQ 0.88 (p ... Conclusion: Given the adequacy of the metric properties of the Stoma-QOL suggested by the psychometric analyses, this study confirms the suitability of the instrument in clinical practice and in clinical research.
Linking Existing Instruments to Develop an Activity of Daily Living Item Bank.

Science.gov (United States)

Li, Chih-Ying; Romero, Sergio; Bonilha, Heather S; Simpson, Kit N; Simpson, Annie N; Hong, Ickpyo; Velozo, Craig A

2018-03-01

This study examined dimensionality and item-level psychometric properties of an item bank measuring activities of daily living (ADL) across inpatient rehabilitation facilities and community living centers. The common-person equating method was used in the retrospective veterans data set. This study examined dimensionality, model fit, local independence, and monotonicity using factor analyses and fit statistics, principal component analysis (PCA), and differential item functioning (DIF) using Rasch analysis. Following the elimination of invalid data, 371 veterans who completed both the Functional Independence Measure (FIM) and minimum data set (MDS) within 6 days were retained. The FIM-MDS item bank demonstrated good internal consistency (Cronbach's α = .98) and met three rating scale diagnostic criteria and three of the four model fit statistics (comparative fit index/Tucker-Lewis index = 0.98, root mean square error of approximation = 0.14, and standardized root mean square residual = 0.07). PCA of Rasch residuals showed the item bank explained 94.2% of the variance. The item bank covered the range of θ from -1.50 to 1.26 (item) and -3.57 to 4.21 (person), with person strata of 6.3. The findings indicated the ADL physical function item bank constructed from FIM and MDS measured a single latent trait with overall acceptable item-level psychometric properties, suggesting that it is an appropriate source for developing efficient test forms such as short forms and computerized adaptive tests.
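The "person strata" figure can be related to separation and reliability with a few lines of arithmetic. The sketch below assumes the commonly cited Rasch conventions that separation G = sqrt(R / (1 - R)) and strata H = (4G + 1) / 3; the abstract does not state how its strata value was computed, so the back-calculation is illustrative only, and the function names are hypothetical.

```python
import math

def separation_from_reliability(r):
    """Separation index G implied by a reliability coefficient R."""
    return math.sqrt(r / (1.0 - r))

def strata_from_separation(g):
    """Number of statistically distinct strata, H = (4G + 1) / 3."""
    return (4.0 * g + 1.0) / 3.0

def separation_from_strata(h):
    """Inverse of strata_from_separation."""
    return (3.0 * h - 1.0) / 4.0

# Back-calculate what a strata value of 6.3 would imply under these conventions.
g = separation_from_strata(6.3)
implied_reliability = g * g / (1.0 + g * g)
print(round(g, 3), round(implied_reliability, 3))
```

Under these assumptions a strata value above 6 corresponds to a person reliability above 0.95, which is in line with the high internal consistency reported for the item bank.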
A reliable and valid questionnaire was developed to measure computer vision syndrome at the workplace.

Science.gov (United States)

Seguí, María del Mar; Cabrero-García, Julio; Crespo, Ana; Verdú, José; Ronda, Elena

2015-06-01

To design and validate a questionnaire to measure visual symptoms related to exposure to computers in the workplace. Our computer vision syndrome questionnaire (CVS-Q) was based on a literature review and validated through discussion with experts and performance of a pretest, pilot test, and retest. Content validity was evaluated by occupational health, optometry, and ophthalmology experts. Rasch analysis was used in the psychometric evaluation of the questionnaire. Criterion validity was determined by calculating the sensitivity and specificity, receiver operator characteristic curve, and cutoff point. Test-retest repeatability was tested using the intraclass correlation coefficient (ICC) and concordance by Cohen's kappa (κ). The CVS-Q was developed with wide consensus among experts and was well accepted by the target group. It assesses the frequency and intensity of 16 symptoms using a single rating scale (symptom severity) that fits the Rasch rating scale model well. The questionnaire has sensitivity and specificity over 70% and achieved good test-retest repeatability both for the scores obtained [ICC = 0.802; 95% confidence interval (CI): 0.673, 0.884] and CVS classification (κ = 0.612; 95% CI: 0.384, 0.839). The CVS-Q has acceptable psychometric properties, making it a valid and reliable tool to control the visual health of computer workers, and can potentially be used in clinical trials and outcome research. Copyright © 2015 Elsevier Inc. All rights reserved.
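The criterion-validity and agreement statistics named in this abstract (sensitivity, specificity, Cohen's kappa) follow directly from cross-tabulated counts. The sketch below is a minimal illustration; the function names and any counts passed to them are hypothetical and are not taken from the CVS-Q study.

```python
def sensitivity_specificity(tp: int, fn: int, fp: int, tn: int) -> tuple[float, float]:
    """Return (sensitivity, specificity) from the four cells of a 2x2 table."""
    sensitivity = tp / (tp + fn)   # true positives among all actual positives
    specificity = tn / (tn + fp)   # true negatives among all actual negatives
    return sensitivity, specificity


def cohen_kappa(table: list[list[int]]) -> float:
    """Cohen's kappa for a square agreement table.

    table[i][j] is the number of cases placed in category i on the first
    occasion and category j on the second.
    """
    k = len(table)
    n = sum(sum(row) for row in table)
    observed = sum(table[i][i] for i in range(k)) / n
    row_totals = [sum(row) for row in table]
    col_totals = [sum(table[i][j] for i in range(k)) for j in range(k)]
    # Agreement expected by chance from the marginal totals.
    expected = sum(row_totals[i] * col_totals[i] for i in range(k)) / (n * n)
    return (observed - expected) / (1.0 - expected)
```

For example, `cohen_kappa([[40, 10], [20, 30]])` has observed agreement 0.70 and chance agreement 0.50, giving a kappa of 0.40.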
How mechanisms of perceptual decision-making affect the psychometric function.

Science.gov (United States)

Gold, Joshua I; Ding, Long

2013-04-01

Psychometric functions are often interpreted in the context of Signal Detection Theory, which emphasizes a distinction between sensory processing and non-sensory decision rules in the brain. This framework has helped to relate perceptual sensitivity to the "neurometric" sensitivity of sensory-driven neural activity. However, perceptual sensitivity, as interpreted via Signal Detection Theory, is based on not just how the brain represents relevant sensory information, but also how that information is read out to form the decision variable to which the decision rule is applied. Here we discuss recent advances in our understanding of this readout process and describe its effects on the psychometric function. In particular, we show that particular aspects of the readout process can have specific, identifiable effects on the threshold, slope, upper asymptote, time dependence, and choice dependence of psychometric functions. To illustrate these points, we emphasize studies of perceptual learning that have identified changes in the readout process that can lead to changes in these aspects of the psychometric function. We also discuss methods that have been used to distinguish contributions of the sensory representation versus its readout to psychophysical performance. Copyright © 2012 Elsevier Ltd. All rights reserved.
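The threshold, slope, and upper asymptote mentioned in this abstract can be made concrete with a standard parametric form. The sketch below assumes a logistic core with a guess rate and a lapse rate; this parameterization is a common convention, not necessarily the one used by the authors, and all parameter names are illustrative.

```python
import math


def psychometric(x: float, threshold: float, slope: float,
                 guess: float = 0.5, lapse: float = 0.0) -> float:
    """Probability of a correct response at stimulus strength x.

    threshold: stimulus strength at the midpoint of the logistic core
    slope:     steepness of the function around the threshold
    guess:     lower asymptote (0.5 for a two-alternative task)
    lapse:     1 - upper asymptote (stimulus-independent errors)
    """
    core = 1.0 / (1.0 + math.exp(-slope * (x - threshold)))
    return guess + (1.0 - guess - lapse) * core
```

With `guess=0.5` and `lapse=0.02`, the function passes through 0.74 at the threshold and saturates at 0.98, so a change in the lapse rate shows up as a change in the upper asymptote rather than in the threshold or slope.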

The Hamilton Depression Scale (HAM-D) and the Montgomery–Åsberg Depression Scale (MADRS)

DEFF Research Database (Denmark)

Bech, Per; Allerup, Peter; Larsen, Erik Roj

2014-01-01

The objective of this re-analysis of the European Genome-Based Therapeutic Drugs for Depression Study (GENDEP) was to psychometrically test the unidimensionality of the full Montgomery Åsberg Depression Rating Scale (MADRS10) and the Hamilton Depression Scale (HAM-D17) versus their respective...... subscales (MADRS5 and HAM-D6) containing the core symptoms of depression severity. Rasch analysis was applied using RUMM 2030 software to assess the overall fit for unidimensionality. Neither the MADRS10 nor the HAM-D17 was found to fit the Rasch model for unidimensionality. The HAM-D6 (containing the items...... of depressed mood, guilt, work and interests, psychomotor retardation, psychic anxiety, and somatic general) as well as the analogue MADRS5 were tested for unidimensionality by use of the RUMM 2030 programme, and only the HAM-D6 was accepted. When testing for invariance across rating weeks or centres, the RUMM...
A psychometric validation of the Short Alcohol Withdrawal Scale (SAWS)

DEFF Research Database (Denmark)

Elholm, Bjarne; Larsen, Klaus; Hornnes, Nete

2010-01-01

The study aimed to evaluate psychometrically a Danish translation of the Short Alcohol Withdrawal Scale (SAWS) in an outpatient setting in patients with Alcohol Dependence (AD) and Alcohol Withdrawal Symptoms/Syndrome (AWS)....
Psychometric Properties of Raw and Scale Scores on Mixed-Format Tests

Science.gov (United States)

Kolen, Michael J.; Lee, Won-Chan

2011-01-01

This paper illustrates that the psychometric properties of scores and scales that are used with mixed-format educational tests can impact the use and interpretation of the scores that are reported to examinees. Psychometric properties that include reliability and conditional standard errors of measurement are considered in this paper. The focus is…
The validity of a professional competence tool for physiotherapy students in simulation-based clinical education: a Rasch analysis.

Science.gov (United States)

Judd, Belinda K; Scanlan, Justin N; Alison, Jennifer A; Waters, Donna; Gordon, Christopher J

2016-08-05

Despite the recent widespread adoption of simulation in clinical education in physiotherapy, there is a lack of validated tools for assessment in this setting. The Assessment of Physiotherapy Practice (APP) is a comprehensive tool used in clinical placement settings in Australia to measure professional competence of physiotherapy students. The aim of the study was to evaluate the validity of the APP for student assessment in simulation settings. A total of 1260 APPs were collected, 971 from students in simulation and 289 from students in clinical placements. Rasch analysis was used to examine the construct validity of the APP tool in three different simulation assessment formats: longitudinal assessment over 1 week of simulation; longitudinal assessment over 2 weeks; and a short-form (25 min) assessment of a single simulation scenario. Comparisons with APPs from 5-week clinical placements in hospital and clinic-based settings were also conducted. The APP demonstrated acceptable fit to the expectations of the Rasch model for the 1-week and 2-week clinical simulations, exhibiting unidimensional properties that were able to distinguish different levels of student performance. For the short-form simulation, nine of the 20 items had more than 25% of scores recorded as 'not-assessed' by clinical educators, which reduced the suitability of the APP tool in this simulation format. The APP was a valid assessment tool when used in longitudinal simulation formats. A revised APP may be required for assessment in short-form simulation scenarios.
Psychometric properties of the Late-Life Function and Disability Instrument

DEFF Research Database (Denmark)

Beauchamp, Marla K; Schmidt, Catherine T; Pedersen, Mette M

2014-01-01

The choice of measure for use as a primary outcome in geriatric research is contingent upon the construct of interest and evidence for its psychometric properties. The Late-Life Function and Disability Instrument (LLFDI) has been widely used to assess functional limitations and disability...... in studies with older adults. The primary aim of this systematic review was to evaluate the current available evidence for the psychometric properties of the LLFDI....
Seeking a Balance between the Statistical and Scientific Elements in Psychometrics

Science.gov (United States)

Wilson, Mark

2013-01-01

In this paper, I will review some aspects of psychometric projects that I have been involved in, emphasizing the nature of the work of the psychometricians involved, especially the balance between the statistical and scientific elements of that work. The intent is to seek to understand where psychometrics, as a discipline, has been and where it…
Fitting a Mixture Rasch Model to English as a Foreign Language Listening Tests: The Role of Cognitive and Background Variables in Explaining Latent Differential Item Functioning

Science.gov (United States)

Aryadoust, Vahid

2015-01-01

The present study uses a mixture Rasch model to examine latent differential item functioning in English as a foreign language listening tests. Participants (n = 250) took a listening and lexico-grammatical test and completed the metacognitive awareness listening questionnaire comprising problem solving (PS), planning and evaluation (PE), mental…
Quantitative Psychology : the 82nd Annual Meeting of the Psychometric Society

CERN Document Server

Culpepper, Steven; Janssen, Rianne; González, Jorge; Molenaar, Dylan

2018-01-01

This proceedings book highlights the latest research and developments in psychometrics and statistics. Featuring contributions presented at the 82nd Annual Meeting of the Psychometric Society (IMPS), organized by the University of Zurich and held in Zurich, Switzerland from July 17 to 21, 2017, its 34 chapters address a diverse range of psychometric topics including item response theory, factor analysis, causal inference, Bayesian statistics, test equating, cognitive diagnostic models and multistage adaptive testing. The IMPS is one of the largest international meetings on quantitative measurement in psychology, education and the social sciences, attracting over 500 participants and 250 paper presentations from around the world every year. This book gathers the contributions of selected presenters, which were subsequently expanded and peer-reviewed.
Rasch Validation and Cross-validation of the Health of Nation Outcome Scales (HoNOS) for Monitoring of Psychiatric Disability in Traumatized Refugees in Western Psychiatric Care

DEFF Research Database (Denmark)

Palic, Sabina; Kappel, Michelle Lind; Makransky, Guido

2016-01-01

group. A revised 10-item HoNOS fit the Rasch model at pre-treatment, and also showed excellent fit within the cross-validation data. Culture, gender, and need for translation did not exert serious bias on the measure’s performance. The results establish good monitoring properties of the 10-item Ho...
Measuring the bright side of being blue: a new tool for assessing analytical rumination in depression.

Directory of Open Access Journals (Sweden)

Skye P Barbic

Full Text Available BACKGROUND: Diagnosis and management of depression occur frequently in the primary care setting. Current diagnostic and treatment management practices across clinical populations focus on eliminating signs and symptoms of depression. However, there is debate that some interventions may pathologize normal, adaptive responses to stressors. Analytical rumination (AR) is an example of an adaptive response of depression that is characterized by enhanced cognitive function to help an individual focus on, analyze, and solve problems. To date, research on AR has been hampered by the lack of theoretically derived and psychometrically sound instruments. This study developed and tested a clinically meaningful measure of AR. METHODS: Using expert panels and an extensive literature review, we developed a conceptual framework for AR and 22 candidate items. Items were field tested with 579 young adults, 140 of whom completed the items at a second time point. We used Rasch measurement methods to construct and test the item set, and traditional psychometric analyses to compare items to existing rating scales. RESULTS: Data were high quality (0.81; evidence for divergent validity. Evidence of misfit for 2 items suggested that a 20-item scale with 4-point response categories best captured the concept of AR, fitting the Rasch model (χ2 = 95.26; df = 76, p = 0.07), with high reliability (rp = 0.86), ordered response scale structure, and no item bias (gender, age, time). CONCLUSION: Our study provides evidence for a 20-item Analytical Rumination Questionnaire (ARQ) that can be used to quantify AR in adults who experience symptoms of depression. The ARQ is psychometrically robust and a clinically useful tool for the assessment and improvement of depression in the primary care setting. Future work is needed to establish the validity of this measure in people with major depression.
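The overall fit statistic reported in this abstract (χ2 = 95.26, df = 76, p = 0.07) can be sanity-checked without a statistics package. The sketch below uses the Wilson-Hilferty normal approximation to the chi-square upper tail; this is an approximation chosen for illustration, not the method used by the authors or by their Rasch software, and the function name is hypothetical.

```python
import math


def chi2_upper_tail(x: float, df: int) -> float:
    """Approximate P(X >= x) for a chi-square variable with df degrees of freedom.

    Uses the Wilson-Hilferty cube-root transformation, which maps the
    chi-square variable to an approximately standard normal z-score.
    """
    v = 2.0 / (9.0 * df)
    z = ((x / df) ** (1.0 / 3.0) - (1.0 - v)) / math.sqrt(v)
    # Upper tail of the standard normal distribution.
    return 0.5 * math.erfc(z / math.sqrt(2.0))


p_value = chi2_upper_tail(95.26, 76)
```

The approximation gives a p-value close to the reported 0.07, which is consistent with a non-significant item-trait interaction at the 0.05 level.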
Enhancing rigour in the validation of patient reported outcome measures (PROMs): bridging linguistic and psychometric testing

Directory of Open Access Journals (Sweden)

Roberts Gwerfyl

2012-06-01

Full Text Available Abstract Background A strong consensus exists for a systematic approach to linguistic validation of patient reported outcome measures (PROMs) and discrete methods for assessing their psychometric properties. Despite the need for robust evidence of the appropriateness of measures, transition from linguistic to psychometric validation is poorly documented or evidenced. This paper demonstrates the importance of linking linguistic and psychometric testing through a purposeful stage which bridges the gap between translation and large-scale validation. Findings Evidence is drawn from a study to develop a Welsh language version of the Beck Depression Inventory-II (BDI-II) and investigate its psychometric properties. The BDI-II was translated into Welsh then administered to Welsh-speaking university students (n = 115) and patients with depression (n = 37) concurrent with the English BDI-II, and alongside other established depression and quality of life measures. A Welsh version of the BDI-II was produced that, on administration, showed conceptual equivalence with the original measure; high internal consistency reliability (Cronbach’s alpha = 0.90; 0.96); item homogeneity; adequate correlation with the English BDI-II (r = 0.96; 0.94) and additional measures; and a two-factor structure with one overriding dimension. Nevertheless, in the student sample, the Welsh version showed a significantly lower overall mean than the English (p = 0.002), and significant differences in six mean item scores. This prompted a review and refinement of the translated measure. Conclusions Exploring potential sources of bias in translated measures represents a critical step in the translation-validation process, which until now has been largely underutilised. This paper offers important findings that inform advanced methods of cross-cultural validation of PROMs.
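Internal consistency figures such as the Cronbach's alpha values quoted in this abstract are computed from item and total-score variances. The sketch below shows the standard formula, alpha = k / (k - 1) × (1 - Σ item variances / total-score variance); the data layout and function name are assumptions made for illustration, and no BDI-II data are reproduced.

```python
from statistics import variance


def cronbach_alpha(item_scores: list[list[float]]) -> float:
    """Cronbach's alpha for a scale.

    item_scores[i][p] is the score of respondent p on item i, so each inner
    list holds one item's scores across all respondents.
    """
    k = len(item_scores)
    if k < 2:
        raise ValueError("at least two items are required")
    # Total score per respondent: sum across items.
    totals = [sum(person) for person in zip(*item_scores)]
    sum_item_variances = sum(variance(item) for item in item_scores)
    return k / (k - 1) * (1.0 - sum_item_variances / variance(totals))
```

As a quick check, two items with scores `[1, 2, 3, 4]` and `[2, 1, 4, 3]` give an alpha of 0.75, while two identical items give an alpha of 1.0.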
Evaluating the Effectiveness of Collaborative Computer-Intensive Projects in an Undergraduate Psychometrics Course

Science.gov (United States)

Barchard, Kimberly A.; Pace, Larry A.

2010-01-01

Undergraduate psychometrics classes often use computer-intensive active learning projects. However, little research has examined active learning or computer-intensive projects in psychometrics courses. We describe two computer-intensive collaborative learning projects used to teach the design and evaluation of psychological tests. Course…
The Drug Effects Questionnaire: Psychometric Support across Three Drug Types

Science.gov (United States)

Morean, Meghan E.; de Wit, Harriet; King, Andrea C.; Sofuoglu, Mehmet; Rueger, Sandra Y.; O’Malley, Stephanie S.

2013-01-01

Rationale The Drug Effects Questionnaire (DEQ) is widely used in studies of acute subjective response (SR) to a variety of substances, but the format of the DEQ varies widely across studies, and details of its psychometric properties are lacking. Thus, the field would benefit from demonstrating the reliability and validity of the DEQ for use across multiple substances. Objective The current study evaluated the psychometric properties of several variations of DEQ items, which assessed the extent to which participants (1) feel any substance effect(s), (2) feel high, (3) like the effects, (4) dislike the effects, and (5) want more of the substance using 100mm Visual Analog Scales. Methods DEQ data from three placebo-controlled studies were analyzed to examine SR to amphetamine, nicotine, and alcohol. We evaluated the internal structure of the DEQ for use with each substance as well as relationships between scale items, measures of similar constructs, and substance-related behaviors. Results Results provided preliminary psychometric support for items assessing each DEQ construct (FEEL, HIGH, DISLIKE, LIKE, and MORE). Conclusions Based on the study results, we identify several common limitations of extant variants of the DEQ and recommend an improved version of the measure. The simplicity and brevity of the DEQ combined with its promising psychometric properties support its use in future SR research across a variety of substances. PMID:23271193
Psychometric testing and Human Resource Management

Directory of Open Access Journals (Sweden)

R. P. van der Merwe

2002-09-01

Full Text Available This is a cumulative report on the findings of various exploratory studies that were conducted on the practice of psychometric testing in the Eastern Cape. Recent and ongoing developments in South African labour legislation, and especially the implications of the Employment Equity Act, highlight once again the importance of the validation of all instruments to be used for human assessment and selection purposes. Information was gathered to establish which psychometric tests are used, and for what purposes, in industry today. Biographical information on each organisation is supplied, including the number of employees. The role of psychometric testing in the selection procedure is discussed. The different tests used, as well as the test users, are also indicated. The findings of other, related research, as well as comments, recommendations and shortcomings, are discussed.
Psychometrics of Multiple Choice Questions with Non-Functioning Distracters: Implications to Medical Education.

Science.gov (United States)

Deepak, Kishore K; Al-Umran, Khalid Umran; Al-Sheikh, Mona H; Dkoli, B V; Al-Rubaish, Abdullah

2015-01-01

The functionality of distracters in a multiple choice question plays a very important role. We examined the frequency and impact of functioning and non-functioning distracters on psychometric properties of 5-option items in clinical disciplines. We analyzed item statistics of 1115 multiple choice questions from 15 summative assessments of undergraduate medical students and classified the items into five groups by their number of non-functioning distracters. We analyzed the effect of varying degrees of non-functionality, ranging from 0 to 4 non-functioning distracters, on test reliability, difficulty index, discrimination index and point biserial correlation. The non-functionality of distracters inversely affected the test reliability and quality of items in a predictable manner. The non-functioning distracters made the items easier and lowered the discrimination index significantly. Three non-functional distracters in a 5-option MCQ significantly affected all psychometric properties (p psychometrically as effective as 5-option items. Our study reveals that a multiple choice question with 3 functional options provides the lower limit of an item format that has adequate psychometric properties. Tests containing items with fewer functioning options have significantly lower reliability. Distracter function analysis and revision of non-functioning distracters can serve as important methods to improve the psychometrics and reliability of assessment.
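The item statistics discussed in this abstract can be illustrated with a small distracter analysis. The sketch below computes the difficulty index (proportion correct) and flags non-functioning distracters; the 5% selection threshold is a commonly used convention assumed here, since the abstract does not state the criterion the authors applied, and the function and parameter names are hypothetical.

```python
from collections import Counter


def item_analysis(choices: list[str], key: str, options: str = "ABCDE",
                  nfd_threshold: float = 0.05) -> tuple[float, list[str]]:
    """Return (difficulty index, list of non-functioning distracters).

    choices:       the option selected by each examinee for one item
    key:           the correct option
    nfd_threshold: a distracter chosen by fewer than this proportion of
                   examinees is treated as non-functioning (assumed cutoff)
    """
    n = len(choices)
    counts = Counter(choices)
    difficulty = counts[key] / n  # higher value means an easier item
    non_functioning = [
        opt for opt in options
        if opt != key and counts[opt] / n < nfd_threshold
    ]
    return difficulty, non_functioning
```

For 100 hypothetical examinees choosing A (the key) 60 times, B 25 times, C 12 times, D 3 times and E never, the difficulty index is 0.60 and options D and E are flagged, so the item effectively behaves as a 3-option question.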
The Body Appreciation Scale-2: item refinement and psychometric evaluation.

Science.gov (United States)

Tylka, Tracy L; Wood-Barcalow, Nichole L

2015-01-01

Considered a positive body image measure, the 13-item Body Appreciation Scale (BAS; Avalos, Tylka, & Wood-Barcalow, 2005) assesses individuals' acceptance of, favorable opinions toward, and respect for their bodies. While the BAS has accrued psychometric support, we improved it by rewording certain BAS items (to eliminate sex-specific versions and body dissatisfaction-based language) and developing additional items based on positive body image research. In three studies, we examined the reworded, newly developed, and retained items to determine their psychometric properties among college and online community (Amazon Mechanical Turk) samples of 820 women and 767 men. After exploratory factor analysis, we retained 10 items (five original BAS items). Confirmatory factor analysis upheld the BAS-2's unidimensionality and invariance across sex and sample type. Its internal consistency, test-retest reliability, and construct (convergent, incremental, and discriminant) validity were supported. The BAS-2 is a psychometrically sound positive body image measure applicable for research and clinical settings. Copyright © 2014 Elsevier Ltd. All rights reserved.
Psychometric Properties of the MMPI-2-RF Somatic Complaints (RC1) Scale

Science.gov (United States)

Thomas, Michael L.; Locke, Dona E. C.

2010-01-01

The MMPI-2 Restructured Form (MMPI-2-RF; Tellegen & Ben-Porath, 2008) was designed to be psychometrically superior to its MMPI-2 counterpart. However, the test has yet to be extensively evaluated in diverse clinical settings. The purpose of this study was to examine the psychometric properties of the MMPI-2-RF Somatic Complaints (RC1) scale in…
Rasch modeling of the Spanish self-report version of the Liebowitz Social Anxiety Scale for Children and Adolescents (LSAS-CA-SR)

Directory of Open Access Journals (Sweden)

José A. López-Pina

2008-01-01

Full Text Available The aim of this instrumental study was to analyze the unidimensional structure of the fear and avoidance subscales of the Spanish version of the LSAS-CA-SR social anxiety scale for children and adolescents under the Rasch family of models. The sample consisted of 454 primary and secondary school students (236 males and 218 females) aged between 10 and 17 years. The rating scale model was fitted to the data from both subscales. The fit statistics (weighted mean square and unweighted mean square) showed good fit of the items to the model, except for items 10 and 16 in the fear subscale and items 6, 7 and 21 in the avoidance subscale. In addition, dividing the overall sample into two random subsamples of 150 people showed that the rating scale model produced an invariant ordering of the item parameters and of the person parameters. This study thus supports the usefulness of the Rasch model and its family for determining unidimensionality in a psychological test.
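The weighted and unweighted mean-square fit statistics used in this study are usually called infit and outfit. The sketch below computes both for a single item under the dichotomous Rasch model, which is a simplification: the study itself fitted the rating scale model to polytomous responses, and the person and item parameters here are assumed to be already estimated.

```python
import math


def rasch_probability(theta: float, b: float) -> float:
    """Probability of endorsing a dichotomous item of difficulty b at ability theta."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_fit(responses: list[int], thetas: list[float], b: float) -> tuple[float, float]:
    """Return (infit, outfit) mean squares for one item.

    Outfit is the unweighted mean of squared standardized residuals;
    infit weights each squared residual by the model variance p * (1 - p).
    """
    squared_residuals = []
    variances = []
    for x, theta in zip(responses, thetas):
        p = rasch_probability(theta, b)
        squared_residuals.append((x - p) ** 2)
        variances.append(p * (1.0 - p))
    outfit = sum(r / v for r, v in zip(squared_residuals, variances)) / len(responses)
    infit = sum(squared_residuals) / sum(variances)
    return infit, outfit
```

Both statistics have an expected value of 1 when the data fit the model. An unexpected response from a person far from the item's difficulty inflates outfit more than infit, which is why the two are typically reported together.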
Development of a Microsoft Excel tool for one-parameter Rasch model of continuous items: an application to a safety attitude survey.

Science.gov (United States)

Chien, Tsair-Wei; Shao, Yang; Kuo, Shu-Chun

2017-01-10

Many continuous item responses (CIRs) are encountered in healthcare settings, but the probabilistic modeling of item response theory (IRT) has not been used to produce graphical presentations for interpreting CIR results. A computer module programmed to deal with CIRs is required. The aims were to present a computer module, validate it, and verify its usefulness in dealing with CIR data, and then to apply the model to real healthcare data to show how CIRs can be handled in healthcare settings, using a safety attitude survey as an example. Using Microsoft Excel VBA (Visual Basic for Applications), we designed a computer module that minimizes the residuals and calculates the model's expected scores according to person responses across items. A Wright map and a KIDMAP based on the Rasch model were used to interpret the results of the safety attitude survey. The author-made CIR module yielded OUTFIT mean square (MNSQ) and person measures equivalent to those yielded by the professional Rasch software Winsteps. The probabilistic modeling of the CIR module provides messages that are much more valuable to users and shows the advantage of CIR over classical test theory. Because of advances in computer technology, healthcare users who are familiar with MS Excel can easily apply the study's CIR module to continuous variables, which benefits comparisons of data with a logistic distribution and model fit statistics.
A call for policy guidance on psychometric testing in doping control in sport.

Science.gov (United States)

Petróczi, Andrea; Backhouse, Susan H; Barkoukis, Vassilis; Brand, Ralf; Elbe, Anne-Marie; Lazuras, Lambros; Lucidi, Fabio

2015-11-01

One of the fundamental challenges in anti-doping is identifying athletes who use, or are at risk of using, prohibited performance enhancing substances. The growing trend to employ a forensic approach to doping control aims to integrate information from social sciences (e.g., psychology of doping) into organised intelligence to protect clean sport. Beyond the foreseeable consequences of a positive identification as a doping user, this task is further complicated by the discrepancy between what constitutes a doping offence in the World Anti-Doping Code and how it is operationalized in doping research. Whilst psychology plays an important role in developing our understanding of doping behaviour in order to inform intervention and prevention, its contribution to the array of doping diagnostic tools is still in its infancy. In both research and forensic settings, we must acknowledge that (1) socially desirable responding confounds self-reported psychometric test results and (2) the cognitive complexity surrounding test performance means that response-time based measures and lie detector tests for revealing concealed life-events (e.g., doping use) are prone to produce false or non-interpretable outcomes in field settings. Differences in social-cognitive characteristics of doping behaviour that are tested at group level (doping users vs. non-users) cannot be extrapolated to individuals, nor can these psychometric measures be used for individual diagnostics. In this paper, we present a position statement calling for policy guidance on appropriate use of psychometric assessments in the pursuit of clean sport. We argue that, to date, both self-reported and response-time based psychometric tests for doping have been designed, tested and validated to explore how athletes feel and think about doping in order to develop a better understanding of doping behaviour, not to establish evidence for doping. A false 'positive' psychological profile for doping affects not only the individual

Development and Psychometric Evaluation of Scales: A Survey of Published Articles

Directory of Open Access Journals (Sweden)

Foroozan Atashzadeh-Shoorideh

2016-01-01

Full Text Available Background and purpose: Using valid and reliable instruments is an important way of collecting data in qualitative research. This paper reports a study conducted to examine the extent to which the psychometric properties of scales were evaluated in research papers published in the Journal of Advanced Nursing. Methods: In this study, the Journal of Advanced Nursing was chosen for systematic review. All articles published in this journal during 2007-2009 were collected, and articles related to instrument development were selected. Each article was reviewed in full to identify the methods used to assess instrument validity and reliability. Results: Of 980 articles published in the Journal of Advanced Nursing during 2007-2009, 41 (4.18%) were about research methodology. Of these, 12 articles (29.27%) were related to developing an instrument. Review of these 12 articles showed that some did not measure psychometric properties properly, so some of the developed scales still require other necessary types of validity to be assessed. In addition, reliability testing needs to be performed on each instrument used in a study before other statistical analyses are performed. All 12 articles measured and reported Cronbach’s alpha, but four of them did not measure test-retest reliability. Conclusions: Although researchers put great emphasis on methodology and statistical analysis, they pay less attention to the psychometric properties of their new instruments. The authors of this article hope to draw the attention of researchers to the importance of measuring the psychometric properties of new instruments. Keywords: PSYCHOMETRIC, SCALES, CRITICAL REVIEW
A Psychometric Review of Norm-Referenced Tests Used to Assess Phonological Error Patterns

Science.gov (United States)

Kirk, Celia; Vigeland, Laura

2014-01-01

Purpose: The authors provide a review of the psychometric properties of 6 norm-referenced tests designed to measure children's phonological error patterns. Three aspects of the tests' psychometric adequacy were evaluated: the normative sample, reliability, and validity. Method: The specific criteria used for determining the psychometric…
The emotion regulation questionnaire in women with cancer: A psychometric evaluation and an item response theory analysis.

Science.gov (United States)

Brandão, Tânia; Schulz, Marc S; Gross, James J; Matos, Paula Mena

2017-10-01

Emotion regulation is thought to play an important role in adaptation to cancer. However, the emotion regulation questionnaire (ERQ), a widely used instrument to assess emotion regulation, has not yet been validated in this context. This study addresses this gap by examining the psychometric properties of the ERQ in a sample of Portuguese women with cancer. The ERQ was administered to 204 women with cancer (mean age = 48.89 years, SD = 7.55). Confirmatory factor analysis and item response theory analysis were used to examine psychometric properties of the ERQ. Confirmatory factor analysis confirmed the 2-factor solution proposed by the original authors (expressive suppression and cognitive reappraisal). This solution was invariant across age and type of cancer. Item response theory analyses showed that all items were moderately to highly discriminant and that items are better suited for identifying moderate levels of expressive suppression and cognitive reappraisal. Support was found for the internal consistency and test-retest reliability of the ERQ. The pattern of relationships with emotional control, alexithymia, emotional self-efficacy, attachment, and quality of life provided evidence of the convergent and concurrent validity for both dimensions of the ERQ. Overall, the ERQ is a psychometrically sound approach for assessing emotion regulation strategies in the oncological context. Clinical implications are discussed. Copyright © 2016 John Wiley & Sons, Ltd.
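The finding that items are "better suited for identifying moderate levels" of each strategy reflects where item information peaks along the latent trait. The sketch below illustrates this with the two-parameter logistic (2PL) information function for a dichotomous item, I(θ) = a² P(θ)(1 - P(θ)); this is a simplification for illustration, since ERQ items are rated on a multi-point scale and the abstract does not name the IRT model used. The parameter values in the usage note are hypothetical.

```python
import math


def item_information_2pl(theta: float, a: float, b: float) -> float:
    """Fisher information of a 2PL item at trait level theta.

    a: discrimination parameter (steeper items carry more information)
    b: location parameter (information peaks where theta equals b)
    """
    p = 1.0 / (1.0 + math.exp(-a * (theta - b)))
    return a * a * p * (1.0 - p)
```

For an item with `a = 1.5` and `b = 0.0`, information is 0.5625 at θ = 0 and falls off toward both extremes, so such an item measures moderate trait levels most precisely.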
Psychometric properties of carer-reported outcome measures in palliative care: A systematic review

Science.gov (United States)

Michels, Charlotte TJ; Boulton, Mary; Adams, Astrid; Wee, Bee; Peters, Michele

2016-01-01

Background: Informal carers face many challenges in caring for patients with palliative care needs. Selecting suitable valid and reliable outcome measures to determine the impact of caring and carers’ outcomes is a common problem. Aim: To identify outcome measures used for informal carers looking after patients with palliative care needs, and to evaluate the measures’ psychometric properties. Design: A systematic review was conducted. The studies identified were evaluated by independent reviewers (C.T.J.M., M.B., M.P.). Data regarding study characteristics and psychometric properties of the measures were extracted and evaluated. Good psychometric properties indicate a high-quality measure. Data sources: The search was conducted, unrestricted to publication year, in the following electronic databases: Applied Social Sciences Index and Abstracts, Cumulative Index to Nursing and Allied Health Literature, The Cochrane Library, EMBASE, PubMed, PsycINFO, Social Sciences Citation Index and Sociological Abstracts. Results: Our systematic search revealed 4505 potential relevant studies, of which 112 studies met the inclusion criteria using 38 carer measures for informal carers of patients with palliative care needs. Psychometric properties were reported in only 46% (n = 52) of the studies, in relation to 24 measures. Where psychometric data were reported, the focus was mainly on internal consistency (n = 45, 87%), construct validity (n = 27, 52%) and/or reliability (n = 14, 27%). Of these 24 measures, only four (17%) had been formally validated in informal carers in palliative care. Conclusion: A broad range of outcome measures have been used for informal carers of patients with palliative care needs. Little formal psychometric testing has been undertaken. Further development and refinement of measures in this field is required. PMID:26407683
Study of the psychometric properties of the Memory Test of Recognition (TEM-R)

Directory of Open Access Journals (Sweden)

Fabián Javier Marín Rueda

2012-06-01

Full Text Available The purpose was to verify the psychometric qualities of the Memory Test of Recognition (TEM-R). An initial version of the TEM-R was first administered to 137 college students. Of the 64 initial items, 15 did not show any response frequency. On this basis the instrument was reconfigured, with the number of items fixed at 49. A second administration involved 531 college students. The results for internal structure showed adequate fit to the Rasch model, an absence of item bias in the analysis of differential item functioning, and an appropriate factor structure. Satisfactory reliability indexes were observed. Thus, the TEM-R presented adequate psychometric properties for use in the Brazilian context. Keywords: memory; psychological tests; psychometry; validity; reliability.
An antisymmetric psychometric function on a logarithmic scale

NARCIS (Netherlands)

Bergmann Tiest, W.M.; Kappers, A.M.L.

2011-01-01

This very brief report introduces a psychometric function, very suitable for psychophysical data that displays Weber-like behaviour, because it is antisymmetric on a logarithmic scale. © 2011 a Pion publication.
Psychometric properties of the Epworth Sleepiness Scale: A factor analysis and item-response theory approach.

Science.gov (United States)

Pilcher, June J; Switzer, Fred S; Munc, Alec; Donnelly, Janet; Jellen, Julia C; Lamm, Claus

2018-04-01

The purpose of this study is to examine the psychometric properties of the Epworth Sleepiness Scale (ESS) in two languages, German and English. Students from a university in Austria (N = 292; 55 males; mean age = 18.71 ± 1.71 years; 237 females; mean age = 18.24 ± 0.88 years) and a university in the US (N = 329; 128 males; mean age = 18.71 ± 0.88 years; 201 females; mean age = 21.59 ± 2.27 years) completed the ESS. An exploratory factor analysis was completed to examine dimensionality of the ESS. Item response theory (IRT) analyses were used to provide information about the response rates on the items on the ESS and provide differential item functioning (DIF) analyses to examine whether the items were interpreted differently between the two languages. The factor analyses suggest that the ESS measures two distinct sleepiness constructs. These constructs indicate that the ESS is probing sleepiness in settings requiring active versus passive responding. The IRT analyses found that overall, the items on the ESS perform well as a measure of sleepiness. However, Item 8 and to a lesser extent Item 6 were being interpreted differently by respondents in comparison to the other items. In addition, the DIF analyses showed that the responses between German and English were very similar, indicating that there are only minor measurement differences between the two language versions of the ESS. These findings suggest that the ESS provides a reliable measure of propensity to sleepiness; however, it does convey a two-factor approach to sleepiness. Researchers and clinicians can use the German and English versions of the ESS but may wish to exclude Item 8 when calculating a total sleepiness score.
The nutrition for sport knowledge questionnaire (NSKQ): development and validation using classical test theory and Rasch analysis.

Science.gov (United States)

Trakman, Gina Louise; Forsyth, Adrienne; Hoye, Russell; Belski, Regina

2017-01-01

Appropriate dietary intake can have a significant influence on athletic performance. There is a growing consensus on sports nutrition and professionals working with athletes often provide dietary education. However, due to the limitations of existing sports nutrition knowledge questionnaires, previous reports of athletes' nutrition knowledge may be inaccurate. An updated questionnaire has been developed based on a recent review of sports nutrition guidelines. The tool has been validated using a robust methodology that incorporates relevant techniques from classical test theory (CTT) and item response theory (IRT), namely, Rasch analysis. The final questionnaire has 89 questions and six sub-sections (weight management, macronutrients, micronutrients, sports nutrition, supplements, and alcohol). The content and face validity of the tool have been confirmed based on feedback from expert sports dietitians and university sports students, respectively. The internal reliability of the questionnaire as a whole is high (KR = 0.88), and most sub-sections achieved an acceptable internal reliability. Construct validity has been confirmed, with an independent t-test revealing a significant (p < 0.001) difference in knowledge scores of nutrition (64 ± 16%) and non-nutrition students (51 ± 19%). Test-retest reliability has been assured, with a strong correlation (r = 0.92, p < 0.001) between individuals' scores on two attempts of the test, 10 days to 2 weeks apart. Three of the sub-sections fit the Rasch Unidimensional Model. The final version of the questionnaire represents a significant improvement over previous tools. Each nutrition sub-section is unidimensional, and therefore researchers and practitioners can use these individually, as required. Use of the questionnaire will allow researchers to draw conclusions about the effectiveness of nutrition education programs, and differences in knowledge across athletes of varying ages, genders, and athletic
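The internal reliability figure quoted above (KR = 0.88) is presumably a Kuder-Richardson coefficient; the abstract does not say which formula was used. As a minimal sketch, assuming the common KR-20 formula for right/wrong (0/1) items, the calculation could look like the following. The function name `kr20` and the toy data are hypothetical and are not taken from the study.

```python
from typing import Sequence


def kr20(responses: Sequence[Sequence[int]]) -> float:
    """Kuder-Richardson formula 20 for dichotomous (0/1) item scores.

    `responses` holds one row per respondent and one column per item.
    Assumption: the population variance (divide by N) of the total
    scores is used; some texts use the sample variance instead.
    """
    n_people = len(responses)
    n_items = len(responses[0])

    # Proportion of respondents answering each item correctly.
    p = [sum(row[j] for row in responses) / n_people for j in range(n_items)]
    # Sum of item variances p * (1 - p).
    sum_pq = sum(pj * (1.0 - pj) for pj in p)

    # Variance of respondents' total scores.
    totals = [sum(row) for row in responses]
    mean_total = sum(totals) / n_people
    var_total = sum((t - mean_total) ** 2 for t in totals) / n_people

    return (n_items / (n_items - 1)) * (1.0 - sum_pq / var_total)


# Toy data: 4 respondents, 3 items (hypothetical, for illustration only).
toy = [
    [1, 1, 1],
    [1, 1, 0],
    [1, 0, 0],
    [0, 0, 0],
]
print(round(kr20(toy), 2))
```

Values closer to 1 indicate that the items rank respondents more consistently; the formula is undefined when every respondent obtains the same total score.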
The Mental Vulnerability Questionnaire: a psychometric evaluation

DEFF Research Database (Denmark)

Eplov, Lene Falgaard; Petersen, Janne; Jørgensen, Torben

2010-01-01

The Mental Vulnerability Questionnaire was originally a 22 item scale, later reduced to a 12 item scale. In population studies the 12 item scale has been a significant predictor of health and illness. The scale has not been psychometrically evaluated for more than 30 years, and the aim of the pre...... 0.30 for the 12 and the 22 item scales. All five Mental Vulnerability scales had positively skewed score distributions which were associated significantly with both SCL-90-R symptom scores and NEO-PI-R personality scales (primarily Neuroticism and Extraversion). Coefficient alpha was highest...
Utilizing Multifaceted Rasch Measurement through Facets to Evaluate Science Education Data Sets Composed of Judges, Respondents, and Rating Scale Items: An Exemplar Utilizing the Elementary Science Teaching Analysis Matrix Instrument

Science.gov (United States)

Boone, William J.; Townsend, J. Scott; Staver, John R.

2016-01-01

When collecting data, science education researchers frequently have multiple respondents evaluate multiple artifacts using multiple criteria. Herein, the authors introduce Multifaceted Rasch Measurement (MFRM) analysis and explain why MFRM must be used when "judges'" data are collected. The authors use data from elementary science…
The Free and Cued Selective Reminding Test: evidence of psychometric adequacy

Directory of Open Access Journals (Sweden)

Katja Ocepek-Welikson

2009-09-01

Full Text Available These analyses examine the psychometric properties of the Free and Cued Selective Reminding Test with Immediate Recall (FCSRT-IR). FCSRT-IR is a measure of memory under conditions that control attention and cognitive processing in order to obtain an assessment of memory unconfounded by normal age-related changes in cognition. FCSRT-IR performance has been associated with preclinical and early dementia in several longitudinal epidemiological studies. Factor and item response theory analyses were applied to FCSRT-IR data from patients at a geriatric primary care center who had independently established clinical diagnoses. The results provide supporting evidence for the psychometric adequacy of the FCSRT-IR in terms of reliability, essential (sufficient) unidimensionality, information across the continuum of memory disability/ability, and classification accuracy. The psychometric adequacy of the FCSRT-IR adds further validity to its use as a case finding strategy for dementia.
Psychometric Analyses of the Birthday Party

Science.gov (United States)

Lee, Young-Sun

2016-01-01

The present research focuses on the psychometric properties of the Birthday Party measure for ages 3-5. The Birthday Party was developed to provide a reliable, valid, and engaging measure of early mathematical content--Number and Operation, Shape, Space, and Pattern--that can be given in either a short or a long form to English and Spanish…
Emotional Considerations in Spasmodic Dysphonia: Psychometric Quantification.

Science.gov (United States)

Cannito, Michael P.

1991-01-01

This study examined emotional characteristics of 18 female spasmodic dysphonic subjects in comparison to matched normal controls across psychometric measures of depression, anxiety, and somatic complaints. Statistically significant differences were noted between groups for all measures and over half of the dysphonic subjects exhibited clinically…
Assessment of Minimal HE (with emphasis on computerized psychometric tests)

Science.gov (United States)

Kappus, Matthew R; Bajaj, Jasmohan S

2012-01-01

Synopsis: Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life and driving accidents. The detection of MHE requires specialized testing since it cannot, by definition, be diagnosed on standard clinical examination. Psychometric (paper-pencil or computerized or a combination) and neuro-physiological techniques are often used to test for MHE. Paper-pencil psychometric batteries like the Psychometric Hepatic Encephalopathy Score (PHES) have been validated in several countries but do not have US normative values. Computerized tests such as the inhibitory control test (ICT), cognitive drug research system and Scan test have proven useful to diagnose MHE and predict outcomes. The specificity and sensitivity of these tests are similar to the recommended gold standards. Neuro-physiological tests such as the EEG and its interpretations, evoked potentials and Critical Flicker Frequency (CFF) also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike and the testing strategies depend on the normative data available, patient comfort and local expertise. PMID:22321464
Psychometric properties and clinical usefulness of the Oswestry Disability Index.

Science.gov (United States)

Vianin, Michael

2008-12-01

Outcome measures with good reliability, validity, responsiveness, and low burden of administration are clinically useful. The Oswestry Disability Index (ODI) is one of the most commonly used outcome measures for individuals with low back pain. Psychometric properties of the ODI will determine the questionnaire's suitability as a useful clinical tool. A literature search of relevant databases on psychometric evaluation of the ODI was performed. The search was done using the key words disability evaluation, and low back pain, and questionnaires, and reproducibility of results, and the term Oswestry. Inclusion criterion was direct reference regarding psychometric property, interpretability, and burden being included in the abstract. Eight articles met the inclusion criterion. The ODI shows good construct validity; internal consistency is rated as acceptable; test-retest reliability and responsiveness have been shown to be high; and burden of administration is low. The ODI is a valid, reliable, and responsive condition-specific assessment tool that is suited for use in clinical practice. It is easy to administer and score, objectifies clients' complaints, and monitors effects of therapy.
Psychometric Evaluation of Two Appetite Questionnaires in Patients With Heart Failure.

Science.gov (United States)

Andreae, Christina; Strömberg, Anna; Sawatzky, Richard; Årestedt, Kristofer

2015-12-01

Decreased appetite in heart failure (HF) may lead to undernutrition which could negatively influence prognosis. Appetite is a complex clinical issue that is often best measured with the use of self-report instruments. However, there is a lack of self-rated appetite instruments. The Council on Nutrition Appetite Questionnaire (CNAQ) and the Simplified Nutritional Appetite Questionnaire (SNAQ) are validated instruments developed primarily for elderly people. Yet, the psychometric properties have not been evaluated in HF populations. The aim of the present study was to evaluate the psychometric properties of CNAQ and SNAQ in patients with HF. A total of 186 outpatients with reduced ejection fraction and New York Heart Association (NYHA) functional classifications II-IV were included (median age 72 y; 70% men). Data were collected with the use of a questionnaire that included the CNAQ and SNAQ. The psychometric evaluation included data quality, factor structure, construct validity, known-group validity, and internal consistency. Unidimensionality was supported by means of parallel analysis and confirmatory factor analyses (CFAs). The CFA results indicated sufficient model fit. Both construct validity and known-group validity were supported. Internal consistency reliability was acceptable, with ordinal coefficient alpha estimates of 0.82 for CNAQ and 0.77 for SNAQ. CNAQ and SNAQ demonstrated sound psychometric properties and can be used to measure appetite in patients with HF. Copyright © 2015 Elsevier Inc. All rights reserved.
The Psychometric Anatomy of Two Unidimensional Workload Scales

National Research Council Canada - National Science Library

George, Edward

2004-01-01

.... The more specific intent is to encourage reevaluation from a structured psychometric viewpoint. The end goal is to facilitate a uniformly higher standard of measurement quality in unidimensional scaling having complex scale step descriptors...
A psychometric assessment of the LPME scale for the South African skills development context

Directory of Open Access Journals (Sweden)

Maelekanyo Christopher Tshilongamulenzhe

2015-09-01

Full Text Available A thorough examination of psychometric properties of measurement scales is necessary to ensure that these scales comply with the existing scientific conventions. This article assesses the psychometric properties of the Learning Programme Management and Evaluation (LPME) scale. A quantitative, non-experimental cross-sectional survey design was used. Data were collected from a sample of 652 respondents comprising skills development practitioners and learners/apprentices. Data were analyzed using Winsteps, SPSS and AMOS computer software. The findings show that the LPME scale meets the psychometric expectations and complies with the established scientific conventions in terms of validity, reliability, fit and unidimensionality.
Psychometric properties of patient-reported outcome measures for hip arthroscopic surgery

DEFF Research Database (Denmark)

Kemp, Joanne L; Collins, Natalie J; Roos, Ewa M.

2013-01-01

Patient-reported outcomes (PROs) are considered the gold standard when evaluating outcomes in a surgical population. While the psychometric properties of some PROs have been tested, the properties of newer PROs in patients undergoing hip arthroscopic surgery remain somewhat unknown....
Comparison of the psychometric properties of two balance scales in children with cerebral palsy

OpenAIRE

Jeon, Yong-Jin; Kim, Gyoung-Mo

2016-01-01

[Purpose] The purpose of this study was to compare the item difficulty degree between the Pediatric Balance Scale and Fullerton Advanced Balance scale for children with cerebral palsy. [Subjects and Methods] Forty children with cerebral palsy (male=17, female=23) voluntarily participated in the study. Item difficulty was expressed in the Rasch analysis using a logit value, with a higher value indicative of increasing item difficulty. [Results] Among the 24 items of the combined Pediatric Bala...
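The logit scale mentioned in this abstract can be illustrated with the simplest member of the Rasch family. The sketch below assumes the dichotomous Rasch model (pass/fail items), which is a simplification: balance scales are usually scored in several ordered categories, and the abstract does not state which Rasch variant was fitted. The function name `rasch_probability` is hypothetical.

```python
import math


def rasch_probability(ability: float, difficulty: float) -> float:
    """Probability of success under the dichotomous Rasch model.

    Both arguments are in logits. Only the difference
    (ability - difficulty) matters, so a higher item difficulty
    lowers the success probability for the same person.
    """
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


# A person at 0 logits facing an easy item (-1 logit),
# a matched item (0 logits), and a hard item (+1 logit).
for item_difficulty in (-1.0, 0.0, 1.0):
    print(item_difficulty, round(rasch_probability(0.0, item_difficulty), 3))
```

When ability equals difficulty the success probability is exactly 0.5, which is why item difficulties and person abilities can be compared directly on the same logit scale.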

Psychometrics and Its Discontents: An Historical Perspective on the Discourse of the Measurement Tradition

Science.gov (United States)

Schoenherr, Jordan Richard; Hamstra, Stanley J.

2016-01-01

Psychometrics has recently undergone extensive criticism within the medical education literature. The use of quantitative measurement using psychometric instruments such as response scales is thought to emphasize a narrow range of relevant learner skills and competencies. Recent reviews and commentaries suggest that a paradigm shift might be…
Internal construct validity of the Shirom-Melamed Burnout Questionnaire (SMBQ)

Directory of Open Access Journals (Sweden)

Lundgren-Nilsson Åsa

2012-01-01

Full Text Available Background: Burnout is a mental condition defined as a result of continuous and long-term stress exposure, particularly related to psychosocial factors at work. This paper seeks to examine the psychometric properties of the Shirom-Melamed Burnout Questionnaire (SMBQ) for validation of use in a clinical setting. Methods: Data from both clinical (319) and general population (319) samples of health care and social insurance workers were included in the study. Data were analysed using both classical and modern test theory approaches, including Confirmatory Factor Analysis (CFA) and Rasch analysis. Results: Of the 638 people recruited into the study, 416 (65%) persons were working full or part time. Data from the SMBQ failed a CFA, and initially failed to satisfy Rasch model expectations. After the removal of 4 of the original items measuring tension, and accommodating local dependency in the data, model expectations were met. As such, the total score from the revised scale is a sufficient statistic for ascertaining burnout and an interval scale transformation is available. The scale as a whole was perfectly targeted to the joint sample. A cut point of 4.4 for severe burnout was chosen at the intersection of the distributions of the clinical and general population. Conclusion: A revised 18 item version of the SMBQ satisfies modern measurement standards. Using its cut point it offers the opportunity to identify potential clinical cases of burnout.
Original article The Symbiotic Bond Questionnaire – theoretical background and psychometric qualities

OpenAIRE

Aleksandra Lewandowska-Walter; Magdalena Błażek; Maria Kaźmierczak

2015-01-01

Background: The article describes the Symbiotic Bond Questionnaire (SBQ) – the theoretical background as well as its psychometric characteristics and psychological correlates. The items were created on the basis of the definition of symbiotic personality (Johnson, 1994a). Participants and procedure: For these initial survey development and cross-validation studies, the factor structure and psychometric properties of the SBQ were examined. To assess the SBQ’s reliability, the...
Dimensionality and scaling properties of the Patient Categorisation Tool in patients with complex rehabilitation needs following acquired brain injury

Directory of Open Access Journals (Sweden)

Richard J. Siegert

2018-03-01

Full Text Available Objective: To investigate the scaling properties of the Patient Categorisation Tool (PCAT) as an instrument to measure complexity of rehabilitation needs. Design: Psychometric analysis in a multicentre cohort from the UK national clinical database. Patients: A total of 8,222 patients admitted for specialist inpatient rehabilitation following acquired brain injury. Methods: Dimensionality was explored using principal components analysis with Varimax rotation, followed by Rasch analysis on a random sample of n = 500. Results: Principal components analysis identified 3 components explaining 50% of variance. The partial credit Rasch model was applied for the 17-item PCAT scale using a “super-items” methodology based on the principal components analysis results. Two out of 5 initially created super-items displayed signs of local dependency, which significantly affected the estimates. They were combined into a single super-item resulting in satisfactory model fit and unidimensionality. Differential item functioning (DIF) of 2 super-items was addressed by splitting between age groups (<65 and ≥65 years) to produce the best model fit (χ2/df = 54.72, p = 0.235) and reliability (Person Separation Index (PSI) = 0.79). Ordinal-to-interval conversion tables were produced. Conclusion: The PCAT has satisfied expectations of the unidimensional Rasch model in the current sample after minor modifications, and demonstrated acceptable reliability for individual assessment of rehabilitation complexity.
Measuring Life Satisfaction in Parkinson's Disease and Healthy Controls Using the Satisfaction With Life Scale.

Science.gov (United States)

Løvereide, Lise; Hagell, Peter

2016-01-01

The 5-item Satisfaction With Life Scale (SWLS) was designed to measure general life satisfaction (LS). Here we examined the psychometric properties of the SWLS in a cohort of persons with Parkinson's disease (PwPD) and age and gender matched individuals without PD. The SWLS was administered to PwPD and controls from the Norwegian ParkWest study at 5 and 7 years after the time of diagnosis. Data were analysed according to classical test theory (CTT) and Rasch measurement theory. CTT scaling assumptions for computation of a SWLS total score were met (corrected item-total correlations >0.58). The SWLS was reasonably well targeted to the sample and had good reliability (ordinal alpha, 0.92). The scale exhibited good fit to the Rasch model and successfully separated between 5 statistically distinct strata of people (levels of SWLS). The seven response categories did not work as intended and the scale may benefit from reduction to five response categories. There was no clinically significant differential item functioning. Separate analyses in PwPD and controls yielded very similar results to those from the pooled analysis. This study supports the SWLS as a valid instrument for measuring LS in PD and controls. However, Rasch analyses provided new insights into the performance and validity of the SWLS and identified areas for future revisions in order to further improve the scale.
Psychometric analysis of export market orientation measurement scale in Croatian SME exporters’ context

Directory of Open Access Journals (Sweden)

Dario Miočević

2009-07-01

Full Text Available Market orientation is a vital construct of the marketing concept. Although different conceptualization approaches to market orientation have been discussed in the literature so far, a common denominator is its interdependence with business performance. Increasing globalization trends affect both the markets’ convergence and competition. Consequently, focusing on market orientation within an international context is of utmost importance. Export market orientation (EMO) is a relatively new concept, which puts market orientation into the international context. Since export is a dominant international entry strategy in the Croatian economy, which comprises mostly SMEs, it is crucial to investigate the importance of the EMO in the Croatian SME context. Determining an appropriate measurement scale of the EMO to be applied in various national research contexts leading to generalization represents a challenge for marketing academicians. The paper aims to find out whether the EMO construct and measurement scale can be applied within the Croatian SME context. The authors have used exploratory and confirmatory factor analysis to determine the psychometric properties of the EMO scale. The results of the psychometric assessment of the EMO scale confirm its dimensionality, reliability, validity and applicability in the Croatian SME context. Results clearly indicate the necessity of pursuing EMO activities in order to achieve a high level of export performance.
Measuring everyday functional competence using the Rasch assessment of everyday activity limitations (REAL) item bank

NARCIS (Netherlands)

Oude Voshaar, Martijn A.H.; Ten Klooster, Peter M.; Vonkeman, Harald E.; van de Laar, Mart A.F.J.

2017-01-01

Objective: Traditional patient-reported physical function instruments often poorly differentiate patients with mild-to-moderate disability. We describe the development and psychometric evaluation of a generic item bank for measuring everyday activity limitations in outpatient populations. Study
Development of a Microsoft Excel tool for one-parameter Rasch model of continuous items: an application to a safety attitude survey

Directory of Open Access Journals (Sweden)

Tsair-Wei Chien

2017-01-01

Full Text Available Background: Many continuous item responses (CIRs) are encountered in healthcare settings, but no one uses item response theory’s (IRT) probabilistic modeling to produce graphical presentations for interpreting CIR results. A computer module that is programmed to deal with CIRs is required. The aims were to present a computer module, validate it, and verify its usefulness in dealing with CIR data, and then to apply the model to real healthcare data in order to show how the CIR module can be applied in healthcare settings, using a safety attitude survey as an example. Methods: Using Microsoft Excel VBA (Visual Basic for Applications), we designed a computer module that minimizes the residuals and calculates the model’s expected scores according to person responses across items. Rasch models based on a Wright map and on KIDMAP were demonstrated to interpret results of the safety attitude survey. Results: The author-made CIR module yielded OUTFIT mean square (MNSQ) and person measures equivalent to those yielded by the professional Rasch software Winsteps. The probabilistic modeling of the CIR module provides messages that are much more valuable to users and shows the CIR advantage over classic test theory. Conclusions: Because of advances in computer technology, healthcare users who are familiar with MS Excel can easily apply the study CIR module to deal with continuous variables to benefit comparisons of data with a logistic distribution and model fit statistics.
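The OUTFIT mean square mentioned in this abstract is, in its usual definition, the average squared standardized residual between observed and model-expected responses. The sketch below is not the authors' Excel VBA module and does not handle continuous items; it assumes the dichotomous Rasch model with already-estimated person and item measures, purely to show how the statistic is formed. All function names and the toy data are hypothetical.

```python
import math
from typing import List, Sequence


def expected_score(ability: float, difficulty: float) -> float:
    """Model-expected score (success probability) for a 0/1 Rasch item."""
    return 1.0 / (1.0 + math.exp(-(ability - difficulty)))


def item_outfit_mnsq(
    responses: Sequence[Sequence[int]],
    abilities: Sequence[float],
    difficulties: Sequence[float],
) -> List[float]:
    """Outfit mean square per item.

    For each item, squared residuals (observed - expected)^2 are divided
    by the model variance p * (1 - p) and averaged over persons.
    Values near 1 indicate responses about as noisy as the model expects.
    """
    outfits = []
    for i, difficulty in enumerate(difficulties):
        z_squared = []
        for n, ability in enumerate(abilities):
            p = expected_score(ability, difficulty)
            variance = p * (1.0 - p)
            z_squared.append((responses[n][i] - p) ** 2 / variance)
        outfits.append(sum(z_squared) / len(z_squared))
    return outfits


# Toy example: three persons, two items, measures in logits (hypothetical).
toy_responses = [[1, 0], [1, 1], [0, 0]]
toy_abilities = [0.0, 1.0, -1.0]
toy_difficulties = [-0.5, 0.5]
print(item_outfit_mnsq(toy_responses, toy_abilities, toy_difficulties))
```

Because each residual is divided by its model variance, a single unexpected response from a person far from the item's difficulty can inflate the statistic strongly, which is why outfit is described as outlier-sensitive.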
Psychometric Properties of Questionnaires on Functional Health Status in Oropharyngeal Dysphagia: A Systematic Literature Review

Science.gov (United States)

Speyer, Renée; Cordier, Reinie; Kertscher, Berit; Heijnen, Bas J

2014-01-01

Introduction. Questionnaires on Functional Health Status (FHS) are part of the assessment of oropharyngeal dysphagia. Objective. To conduct a systematic review of the literature on the psychometric properties of English-language FHS questionnaires in adults with oropharyngeal dysphagia. Methods. A systematic search was performed using the electronic databases Pubmed and Embase. The psychometric properties of the questionnaires were determined based on the COSMIN taxonomy of measurement properties and definitions for health-related patient-reported outcomes and the COSMIN checklist using preset psychometric criteria. Results. Three questionnaires were included: the Eating Assessment Tool (EAT-10), the Swallowing Outcome after Laryngectomy (SOAL), and the Self-report Symptom Inventory. The Sydney Swallow Questionnaire (SSQ) proved to be identical to the Modified Self-report Symptom Inventory. All FHS questionnaires obtained poor overall methodological quality scores for most measurement properties. Conclusions. The retrieved FHS questionnaires need psychometric reevaluation; if the overall methodological quality shows satisfactory improvement on most measurement properties, the use of the questionnaires in daily clinic and research can be justified. However, in case of insufficient validity and/or reliability scores, new FHS questionnaires need to be developed using and reporting on preestablished psychometric criteria as recommended in literature. PMID:24877095
Psychometric Evaluation of the Diabetes Symptom Checklist-Revised (DSC-R)-A Measure of Symptom Distress

NARCIS (Netherlands)

Arbuckle, R.A.; Humphrey, L.; Vardeva, K.; Arondekar, B.; Scott, J.A.; Snoek, F.J.

2009-01-01

Objective: To assess the psychometric validity, reliability, responsiveness, and minimal important differences of the Diabetes Symptoms Checklist-Revised (DSC-R), a widely used patient-reported outcome measure of diabetes symptom distress. Research Design and Methods: Psychometric validity of the
Clinimetrics and clinical psychometrics: macro- and micro-analysis.

Science.gov (United States)

Tomba, Elena; Bech, Per

2012-01-01

Clinimetrics was introduced three decades ago to specify the domain of clinical markers in clinical medicine (indexes or rating scales). In this perspective, clinical validity is the platform for selecting the various indexes or rating scales (macro-analysis). Psychometric validation of these indexes or rating scales is the measuring aspect (micro-analysis). Clinical judgment analysis by experienced psychiatrists is included in the macro-analysis and the item response theory models are especially preferred in the micro-analysis when using the total score as a sufficient statistic. Clinical assessment tools covering severity of illness scales, prognostic measures, issues of co-morbidity, longitudinal assessments, recovery, stressors, lifestyle, psychological well-being, and illness behavior have been identified. The constructive dialogue in clinimetrics between clinical judgment and psychometric validation procedures is outlined for generating developments of clinical practice in psychiatry. Copyright © 2012 S. Karger AG, Basel.
Weight Bias: A Systematic Review of Characteristics and Psychometric Properties of Self-Report Questionnaires.

Science.gov (United States)

Lacroix, Emilie; Alberga, Angela; Russell-Mathew, Shelly; McLaren, Lindsay; von Ranson, Kristin

2017-01-01

People living with overweight and obesity often experience weight-based stigmatization. Investigations of the prevalence and correlates of weight bias and evaluation of weight bias reduction interventions depend upon psychometrically-sound measurement. Our paper is the first to comprehensively evaluate the psychometric properties, use of people-first language within items, and suitability for use with various populations of available self-report measures of weight bias. We searched five electronic databases to identify English-language self-report questionnaires of weight bias. We rated each questionnaire's psychometric properties based on initial validation reports and subsequent use, and examined item language. Our systematic review identified 40 original self-report questionnaires. Most questionnaires were brief, demonstrated adequate internal consistency, and tapped key cognitive and affective dimensions of weight bias such as stereotypes and blaming. Current psychometric evidence is incomplete for many questionnaires, particularly with regard to the properties of test-retest reliability, sensitivity to change as well as discriminant and structural validity. Most questionnaires were developed prior to debate surrounding terminology preferences, and do not employ people-first language in the items administered to participants. We provide information and recommendations for clinicians and researchers in selecting psychometrically sound measures of weight bias for various purposes and populations, and discuss future directions to improve measurement of this construct. © 2017 The Author(s) Published by S. Karger GmbH, Freiburg.
My Vocational Situation (MVS): Case Example and Psychometric Review.

Science.gov (United States)

Nitsch, Kristian P; Pedersen, Jessica; Miliotto, Alexandra; Petersen, Brett; Robbins, Samantha; Garcia, Ana; Hoisington, Molly Ansel; The, Kimberly J; Smiley, Jill; Janikowski, Timothy

This case report provides an overview of the psychometric properties and clinical utility of the My Vocational Situation (MVS) instrument. The accompanying hypothetical case description illustrates how clinicians could use the MVS to evaluate vocational preferences and outcomes and how the MVS can be used to inform treatment planning and rehabilitation decision making. The information contained in this report is intended to familiarize clinicians with the administration and scoring of the MVS, the psychometric information necessary to interpret results obtained from the MVS, and how the results could be used to provide comprehensive, patient-centered care. It is important to note that the information provided represents only a sample of the available research literature on the MVS. Copyright © 2017 by the American Occupational Therapy Association, Inc.
Developing the Communicative Participation Item Bank: Rasch Analysis Results from a Spasmodic Dysphonia Sample

Science.gov (United States)

Baylor, Carolyn R.; Yorkston, Kathryn M.; Eadie, Tanya L.; Miller, Robert M.; Amtmann, Dagmar

2009-01-01

Purpose: The purpose of this study was to conduct the initial psychometric analyses of the Communicative Participation Item Bank--a new self-report instrument designed to measure the extent to which communication disorders interfere with communicative participation. This item bank is intended for community-dwelling adults across a range of…
Psychometric evaluation of ADAS-Cog and NTB for measuring drug response.

Science.gov (United States)

Karin, A; Hannesdottir, K; Jaeger, J; Annas, P; Segerdahl, M; Karlsson, P; Sjögren, N; von Rosen, T; Miller, F

2014-02-01

To conduct a psychometric analysis to determine the adequacy of instruments that measure cognition in Alzheimer's disease trials. Both the Alzheimer's Disease Assessment Scale - Cognition (ADAS-Cog) and the Neuropsychological Test Battery (NTB) are validated outcome measures for clinical trials in Alzheimer's disease and are approved also for regulatory purposes. However, it is not clear how comparable they are in measuring cognitive function. In fact, many recent trials in Alzheimer's disease patients have failed and it has been questioned if ADAS-Cog still is a sensitive measure. The present paper examines the psychometric properties of ADAS-Cog and NTB, based on a post hoc analysis of data from a clinical trial (NCT01024660), which was conducted by AstraZeneca, in mild-to-moderate Alzheimer's disease (AD) patients, with a Mini Mental State Examination (MMSE) Total score 16-24. Acceptability, reliability, different types of validity and ability to detect change were assessed using relevant statistical methods. Total scores of both tests, as well as separate domains of both tests, including the Wechsler Memory Scale (WMS), Rey Auditory Verbal Learning Test (RAVLT) and Delis-Kaplan Executive Function System (D-KEFS) Verbal Fluency Condition, were analyzed. Overall, NTB performed well, with acceptable reliability and ability to detect change, while ADAS-Cog had insufficient psychometric properties, including ceiling effects in 8 out of a total of 11 ADAS-Cog items in mild AD patients, as well as low test-retest reliability in some of the items. Based on a direct comparison on the same patient sample, we see advantages of the NTB compared with the ADAS-Cog for the evaluation of cognitive function in the population of mild-to-moderate AD patients. The results suggest that not all of ADAS-Cog items are relevant for both mild and moderate AD population. This validation study demonstrates satisfactory psychometric properties of the NTB, while ADAS-Cog was found to be
An exploratory sequential design to validate measures of moral emotions.

Science.gov (United States)

Márquez, Margarita G; Delgado, Ana R

2017-05-01

This paper presents an exploratory and sequential mixed methods approach in validating measures of knowledge of the moral emotions of contempt, anger and disgust. The sample comprised 60 participants in the qualitative phase when a measurement instrument was designed. Item stems, response options and correction keys were planned following the results obtained in a descriptive phenomenological analysis of the interviews. In the quantitative phase, the scale was used with a sample of 102 Spanish participants, and the results were analysed with the Rasch model. In the qualitative phase, salient themes included reasons, objects and action tendencies. In the quantitative phase, good psychometric properties were obtained. The model fit was adequate. However, some changes had to be made to the scale in order to improve the proportion of variance explained. Substantive and methodological implications of this mixed-methods study are discussed. Had the study used a single research method in isolation, aspects of the global understanding of contempt, anger and disgust would have been lost.
Internal construct validity of the Warwick-Edinburgh Mental Well-being Scale (WEMWBS): a Rasch analysis using data from the Scottish Health Education Population Survey

Directory of Open Access Journals (Sweden)

Platt Stephen

2009-02-01

Full Text Available Abstract Background The Warwick-Edinburgh Mental Well-Being Scale (WEMWBS) was developed to meet demand for instruments to measure mental well-being. It comprises 14 positively phrased Likert-style items and fulfils classic criteria for scale development. We report here the internal construct validity of WEMWBS from the perspective of the Rasch measurement model. Methods The model was applied to data collected from 779 respondents in Wave 12 (Autumn 2006) of the Scottish Health Education Population Survey. Respondents were aged 16–74 (average 41.9 yrs). Results Initial fit to model expectations was poor. The items 'I've been feeling good about myself', 'I've been interested in new things' and 'I've been feeling cheerful' all showed significant misfit to model expectations, and were deleted. This led to a marginal improvement in fit to the model. After further analysis, more items were deleted and a strict unidimensional seven item scale (the Short Warwick Edinburgh Mental Well-Being Scale (SWEMWBS)) was resolved. Many items deleted because of misfit with model expectations showed considerable bias for gender. Two retained items also demonstrated bias for gender but, at the scale level, cancelled out. One further retained item 'I've been feeling optimistic about the future' showed bias for age. The correlation between the 14 item and 7 item versions was 0.954. Given fit to the Rasch model, and strict unidimensionality, SWEMWBS provides an interval scale estimate of mental well-being. Conclusion A short 7 item version of WEMWBS was found to satisfy the strict unidimensionality expectations of the Rasch model, and be largely free of bias. This scale, SWEMWBS, provides a raw score-interval scale transformation for use in parametric procedures. In terms of face validity, SWEMWBS presents a more restricted view of mental well-being than the 14 item WEMWBS, with most items representing aspects of psychological and eudemonic well-being, and few covering
Psychometric properties of the Multidimensional Anxiety Scale for ...

African Journals Online (AJOL)

Aim: To determine the psychometric properties of the Multidimensional Anxiety Scale for Children (MASC) in Nairobi public secondary school children, Kenya. Method: Concurrent self-administration of the MASC and Children's Depression Inventory (CDI) to students in Nairobi public secondary schools. Results: The MASC ...
Psychometric evaluation of the Social Interaction Phobia Scale.

Science.gov (United States)

Reilly, Alison R; Carleton, R Nicholas; Weeks, Justin W

2012-01-01

The present study evaluated the psychometric properties of a novel measure of social anxiety symptoms, the Social Interaction Phobia Scale (SIPS), as a stand-alone item set, using an undergraduate sample (N=512). The 14-item SIPS has three subscales assessing Social Interaction Anxiety, Fear of Overt Evaluation, and Fear of Attracting Attention. Confirmatory factor analyses replicated the three-factor structure for the SIPS originally reported by Carleton et al. All SIPS scores demonstrated good internal consistency. The convergent validity of the SIPS was supported by strong and positive correlations between all SIPS scores and measures of social anxiety and fear of evaluation; the finding that the relationships between all SIPS scores and a social anxiety measure were stronger than relationships between all SIPS scores and measures of other constructs supported the discriminant validity of the SIPS. Results suggest that the SIPS possesses excellent psychometric properties.
Evaluating and Quantifying User and Carer Involvement in Mental Health Care Planning (EQUIP): Co-Development of a New Patient-Reported Outcome Measure.

Directory of Open Access Journals (Sweden)

Penny Bee

Full Text Available International and national health policy seeks to increase service user and carer involvement in mental health care planning, but suitable user-centred tools to assess the success of these initiatives are not yet available. The current study describes the development of a new reliable and valid, interval-scaled service-user and carer reported outcome measure for quantifying user/carer involvement in mental health care planning. Psychometric development reduced a 70-item item bank to a short form questionnaire using a combination of Classical Test, Mokken and Rasch Analyses. Test-retest reliability was calculated using t-tests of interval level scores between baseline and 2-4 week follow-up. Items were worded to be relevant to both service users and carers. Nine items were removed following cognitive debriefing with a service user and carer advisory group. An iterative process of item removal reduced the remaining 61 items to a final 14-item scale. The final scale has acceptable scalability (Ho = .69), reliability (alpha = .92), fit to the Rasch model (χ2(70) = 97.25, p = .02), and no differential item functioning or locally dependent items. Scores remained stable over the 4 week follow-up period, indicating good test-retest reliability. The 'Evaluating the Quality of User and Carer Involvement in Care Planning (EQUIP)' scale displays excellent psychometric properties and is capable of unidimensional linear measurement. The scale is short, user and carer-centred and will be of direct benefit to clinicians, services, auditors and researchers wishing to quantify levels of user and carer involvement in care planning.

Evaluation of the Psychometric Properties of the Mental Vulnerability Questionnaire in Undergraduate Students.

Science.gov (United States)

Sequeira, Carlos Alberto da Cruz; Barbosa, Elsa Natalina Mendes; Nogueira, Maria José Carvalho; Sampaio, Francisco Miguel Correia

2017-10-01

Translate, adapt the language, and assess the psychometric properties of the Mental Vulnerability Questionnaire (MVQ) in a Portuguese population sample of young adults. A psychometric validation study was performed. The sample comprised 166 undergraduate students. Factor analysis was applied to extract three indicators. The MVQ showed divergent validity with the Positive Mental Health Questionnaire (p Mental Health Inventory including five items (p mental vulnerability. © 2016 Wiley Periodicals, Inc.
Psychometric properties of Sternberg love scale | Askarpour ...

African Journals Online (AJOL)

Introduction: The aim of the study was to evaluate the psychometric indices of the Sternberg love scale in married men and women in Iranian society. Methods: The study is correlational (factor analysis). In this research, factor analysis was used, which is an exploratory and confirmatory technique to study the structure of a set of data, ...
Psychometric Consequences of Subpopulation Item Parameter Drift

Science.gov (United States)

Huggins-Manley, Anne Corinne

2017-01-01

This study defines subpopulation item parameter drift (SIPD) as a change in item parameters over time that is dependent on subpopulations of examinees, and hypothesizes that the presence of SIPD in anchor items is associated with bias and/or lack of invariance in three psychometric outcomes. Results show that SIPD in anchor items is associated…
Psychometric analysis of the Generalized Anxiety Disorder scale (GAD-7) in primary care using modern item response theory.

Science.gov (United States)

Jordan, Pascal; Shedden-Mora, Meike C; Löwe, Bernd

2017-01-01

The Generalized Anxiety Disorder scale (GAD-7) is one of the most frequently used diagnostic self-report scales for screening, diagnosis and severity assessment of anxiety disorder. Its psychometric properties from the view of the Item Response Theory paradigm have rarely been investigated. We aimed to close this gap by analyzing the GAD-7 within a large sample of primary care patients with respect to its psychometric properties and its implications for scoring using Item Response Theory. Robust, nonparametric statistics were used to check unidimensionality of the GAD-7. A graded response model was fitted using a Bayesian approach. The model fit was evaluated using posterior predictive p-values, item information functions were derived and optimal predictions of anxiety were calculated. The sample included N = 3404 primary care patients (60% female; mean age, 52.2; standard deviation 19.2). The analysis indicated no deviations of the GAD-7 scale from unidimensionality and a decent fit of a graded response model. The commonly suggested ultra-brief measure consisting of the first two items, the GAD-2, was supported by item information analysis. The first four items discriminated better than the last three items with respect to latent anxiety. The information provided by the first four items should be weighted more heavily. Moreover, estimates corresponding to low to moderate levels of anxiety show greater variability. The psychometric validity of the GAD-2 was supported by our analysis.
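To make the graded response model named in this abstract concrete, here is a minimal sketch of how the model turns cumulative logistic curves into category probabilities for one item. It is not the authors' Bayesian procedure. The function name, the discrimination `a` and the thresholds are illustrative assumptions, and none of the numbers come from the study.

```python
import math


def grm_category_probs(theta, a, thresholds):
    """Graded response model: probability of each ordered response category.

    theta      -- latent trait level (here, latent anxiety)
    a          -- item discrimination (hypothetical value)
    thresholds -- increasing list of K-1 threshold parameters (hypothetical)
    """
    # Cumulative probabilities P(X >= k), bracketed by 1 (lowest) and 0 (beyond highest).
    cumulative = [1.0]
    for b in thresholds:
        cumulative.append(1.0 / (1.0 + math.exp(-a * (theta - b))))
    cumulative.append(0.0)
    # Each category probability is the difference of adjacent cumulative curves.
    return [cumulative[k] - cumulative[k + 1] for k in range(len(thresholds) + 1)]


# A four-category item (responses scored 0-3) with made-up parameters.
probs = grm_category_probs(theta=0.0, a=1.5, thresholds=[-1.0, 0.5, 2.0])
print([round(p, 3) for p in probs])
```

A larger `a` makes the category curves steeper, which is the sense in which some items "discriminate better" than others with respect to the latent trait.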
Psychometric Properties of “Community Assessment of Psychic Experiences”: Review and Meta-analyses

Science.gov (United States)

Mark, Winifred; Toulopoulou, Timothea

2016-01-01

The Community Assessment of Psychic Experiences (CAPE) has been used extensively as a measurement for psychosis proneness in clinical and research settings. However, no prior review and meta-analysis have comprehensively examined psychometric properties (reliability and validity) of CAPE scores across different studies. To study CAPE’s internal reliability—ie, how well scale items correlate with one another—111 studies were reviewed. Of these, 18 reported unique internal reliability coefficients using data at hand, which were aggregated in a meta-analysis. Furthermore, to confirm the number and nature of factors tapped by CAPE, 17 factor analytic studies were reviewed and subjected to meta-analysis in cases of discrepancy. Results suggested that CAPE scores were psychometrically reliable—ie, scores obtained could be attributed to true score variance. Our review of factor analytic studies supported a 3-factor model for CAPE consisting of “Positive”, “Negative”, and “Depressive” subscales; and a tripartite structure for the Negative dimension consisting of “Social withdrawal”, “Affective flattening”, and “Avolition” subdimensions. Meta-analysis of factor analytic studies of the Positive dimension revealed a tridimensional structure consisting of “Bizarre experiences”, “Delusional ideations”, and “Perceptual anomalies”. Information on reliability and validity of CAPE scores is important for ensuring accurate measurement of the psychosis proneness phenotype, which in turn facilitates early detection and intervention for psychotic disorders. Apart from enhancing the understanding of psychometric properties of CAPE scores, our review revealed questionable reporting practices possibly reflecting insufficient understanding regarding the significance of psychometric properties. We recommend increased focus on psychometrics in psychology programmes and clinical journals. PMID:26150674
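Internal reliability of the kind aggregated in this review is most often reported as Cronbach's alpha. The following is a minimal sketch of that coefficient, assuming complete data with one row per respondent. The function name and the toy scores are illustrative and are not taken from any CAPE study.

```python
from statistics import variance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores."""
    n_items = len(scores[0])
    if n_items < 2 or len(scores) < 2:
        raise ValueError("need at least two items and two respondents")
    # Sample variance of each item across respondents.
    item_vars = [variance([row[i] for row in scores]) for i in range(n_items)]
    # Sample variance of the summed scale score.
    total_var = variance([sum(row) for row in scores])
    return n_items / (n_items - 1) * (1 - sum(item_vars) / total_var)


# Toy data (hypothetical): four respondents answering three items.
toy_scores = [[1, 2, 3], [2, 3, 4], [3, 4, 5], [4, 5, 6]]
alpha = cronbach_alpha(toy_scores)
print(round(alpha, 3))
```

In the toy data every item moves in lockstep with the others, so the items share all of their variance and alpha reaches its upper bound.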
Cross-cultural psychometric assessment of the VAGUS insight into psychosis scale - Spanish version.

Science.gov (United States)

de León, Patricia Ponce; Gerretsen, Philip; Shah, Parita; Saracco-Alvarez, Ricardo; Graff-Guerrero, Ariel; Fresán, Ana

2018-01-01

Impaired insight into illness, a core feature of schizophrenia with negative clinical implications, is a multidimensional phenomenon existing on a continuum. However, the degree to which illness perception in distinct cultures influences the appraisal of insight into illness in schizophrenia remains unclear. As such, we aimed to determine if the psychometric properties of the VAGUS insight into psychosis scale (www.vagusonline.com), which was originally assessed in English speaking Canadians, were similar in a sample of Latino Mexican Spanish speaking patients with schizophrenia. To accomplish this, the VAGUS - Self-Report (SR) version was translated from English to Spanish and psychometrically evaluated in 95 participants. The Spanish version of the VAGUS-SR was internally consistent (α = 0.713), and demonstrated good convergent and discriminant validity with the subscales of the Positive and Negative Syndrome Scale. Factor analysis identified two components of insight, congruent with two of the components of the English version of the VAGUS-SR. In conclusion, the VAGUS-SR is a brief, novel, and valid measure of insight into illness in schizophrenia, which demonstrated similar psychometric properties in two culturally and linguistically distinct samples with schizophrenia. Future studies should assess whether the VAGUS demonstrates similar psychometric properties in non-Western cultures. Copyright © 2017 Elsevier B.V. All rights reserved.
A Systematic Review of the Psychometric Properties of the Sexual Relationship Power Scale in HIV/AIDS Research

Science.gov (United States)

McMahon, James M.; Volpe, Ellen M.; Klostermann, Keith; Trabold, Nicole; Xue, Ying

2014-01-01

The Sexual Relationship Power Scale (SRPS) was developed over a decade ago to address the lack of reliable and valid measures of relationship power in social, behavioral and medical research. The SRPS and its two subscales (relationship control [RC], decision-making dominance [DMD]) have been used extensively in the field of HIV prevention and sexual risk behavior. We performed a systematic review of the psychometric properties of the SRPS and subscales as reported in the HIV/AIDS literature from 2000 to 2012. A total of 54 published articles were identified that reported reliability or construct validity estimates of the scales. Description of the psychometric properties of the SRPS and subscales are reported according to study population, and several cross-population trends were identified. In general, the SRPS and RC subscale exhibited sound psychometric properties across multiple study populations and research settings. By contrast, the DMD subscale had relatively weak psychometric properties, especially when used with specific populations and research settings. Factors that influenced the psychometric properties of the various scales and subscales included the study population, mean age of the sample, number of items retained in the scale, and modifications to the original scales. We conclude with recommendations for (a) the application and use of the SRPS and subscales, (b) reporting of psychometric properties of the scales in the literature, and (c) areas for future research. PMID:25331613
Psychometric evaluation of the shortened resilience scale among Alzheimer's caregivers.

Science.gov (United States)

Wilks, Scott E

2008-01-01

The purpose of this study was to evaluate psychometric properties of the shortened Resilience Scale (15-item version RS15) among a sample of Alzheimer's caregivers. Self-reported data were collected from 229 participants at 2 Alzheimer's caregiver conferences. RS15 principal axis factoring indicated a single-dimensional solution with all items loaded. Reliability was strong. Convergent validity for the RS15 was suggested through its correlations with stress, family support, and friend support. Odds ratios showed significant likelihoods of high resilience given low stress and high social support. The results confirmed the RS15 to be a psychometrically sound measure that can be used to appraise the efficacy of adaptability among Alzheimer's caregivers.
Work-nonwork interference: Preliminary results on the psychometric properties of a new instrument

Directory of Open Access Journals (Sweden)

Eileen Koekemoer

2010-11-01

Research purpose: The objectives of this study were to investigate the internal validity (construct, discriminant and convergent validity), reliability and external validity (relationship with theoretically relevant variables, including job characteristics, home characteristics, burnout, ill health and life satisfaction) of the instrument. Motivation for the study: Work-family interaction is a key topic receiving significant research attention. In order to facilitate comparison across work-family studies, the use of psychometrically sound instruments is of great importance. Research design, approach and method: A cross-sectional survey design was used for the target population of married employees with children working at a tertiary institution in the North West province (n = 366). In addition to the new instrument, job characteristics, home characteristics, burnout, ill health and life satisfaction were measured. Main findings: The results provided evidence for construct, discriminant and convergent validity, reliability and significant relations with external variables. Practical/managerial implications: The new instrument can be used by researchers and managers as a test under development to investigate the interference between work and different nonwork roles (i.e. parental role, spousal role, work role, domestic role) and specific relations with antecedents (e.g. job/home characteristics) and well-being (e.g. burnout, ill health and life satisfaction). Contribution/value-add: This study provides preliminary information on the psychometric properties of a new instrument that measures the interference between work and nonwork.
Development of Listening Comprehension Tests with Narrative and Expository Texts for Portuguese Students.

Science.gov (United States)

Santos, Sandra; Viana, Fernanda Leopoldina; Ribeiro, Iolanda; Prieto, Gerardo; Brandão, Sara; Cadime, Irene

2015-03-03

This investigation aimed to develop and collect psychometric data for two tests assessing listening comprehension of Portuguese students in primary school: the Test of Listening Comprehension of Narrative Texts (TLC-n) and the Test of Listening Comprehension of Expository Texts (TLC-e). Two studies were conducted. The purpose of study 1 was to construct four test forms for each of the two tests to assess first, second, third and fourth grade students of the primary school. The TLC-n was administered to 1042 students, and the TLC-e was administered to 848 students. The purpose of study 2 was to test the psychometric properties of new items for the TLC-n form for fourth graders, given that the results in study 1 indicated a severe lack of difficult items. The participants were 260 fourth graders. The data were analysed using the Rasch model. Thirty items were selected for each test form. The results provided support for the model assumptions: unidimensionality and local independence of the items. The reliability coefficients were higher than .70 for all test forms. The TLC-n and the TLC-e present good psychometric properties and represent an important contribution to the learning disabilities assessment field.
Using the Rasch model to place the scores of different tests on the same scale [Uso del modelo de Rasch para poner en la misma escala las puntuaciones de distintos tests]
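The "lack of difficult items" reported in study 1 can be illustrated with the dichotomous Rasch model, in which an item is most informative for persons whose ability is close to its difficulty. The sketch below is only an illustration of that general property. The function names and the ability and difficulty values (in logits) are assumptions, not figures from the TLC-n or TLC-e calibration.

```python
import math


def rasch_prob(theta, b):
    """Probability of a correct response for ability theta and item difficulty b."""
    return 1.0 / (1.0 + math.exp(-(theta - b)))


def item_information(theta, b):
    """Fisher information of a Rasch item at ability theta: p * (1 - p)."""
    p = rasch_prob(theta, b)
    return p * (1.0 - p)


# Hypothetical able student (theta = 2 logits) facing an easy and a hard item.
able_student = 2.0
info_easy = item_information(able_student, b=-1.0)
info_hard = item_information(able_student, b=2.0)
print(round(info_easy, 3), round(info_hard, 3))
```

When a form contains only easy items, each item contributes little information about able students, so their estimates are imprecise. Adding harder items improves the targeting.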

Directory of Open Access Journals (Sweden)

Gerardo Prieto-Adánez

2003-01-01

Full Text Available Test users need to place the scores of different instruments on a common scale in several practical situations, such as academic assessment, personnel selection, studies of change in a psychological or educational attribute, the construction of item banks, the cross-cultural validation of tests, and studies of differential item functioning. Placing the scores of different tests on a common scale is one of the main applications of the Rasch model. In this article, we show the process of equating two tests (design, data analysis and interpretation) using a set of common items as an anchor.
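One simple way to link two Rasch calibrations through common (anchor) items is a mean shift: the difference between the anchor items' mean difficulty in the two calibrations is added to every difficulty from the second calibration. The sketch below shows that generic technique under stated assumptions. It is not necessarily the procedure used in the article, and the function name, item labels and logit values are hypothetical.

```python
def link_to_reference_scale(anchor_ref, anchor_new, new_items):
    """Shift difficulties from a new calibration onto the reference scale.

    anchor_ref -- difficulties (logits) of the common items in the reference calibration
    anchor_new -- difficulties of the same items, same order, in the new calibration
    new_items  -- dict mapping item label to difficulty in the new calibration
    """
    if not anchor_ref or len(anchor_ref) != len(anchor_new):
        raise ValueError("anchor lists must be non-empty and of equal length")
    # The linking constant is the difference between the anchor means.
    shift = sum(anchor_ref) / len(anchor_ref) - sum(anchor_new) / len(anchor_new)
    linked = {label: b + shift for label, b in new_items.items()}
    return linked, shift


# Hypothetical values: three anchor items and two items unique to the new test.
linked, shift = link_to_reference_scale(
    anchor_ref=[-0.5, 0.0, 0.5],
    anchor_new=[-0.2, 0.3, 0.8],
    new_items={"new_item_1": 1.0, "new_item_2": -0.4},
)
print(round(shift, 3), {k: round(v, 3) for k, v in linked.items()})
```

A single additive constant is enough here because the Rasch model fixes all item discriminations to be equal, so two calibrations of the same items can differ only in the origin of the logit scale.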
Psychometric properties of the Triarchic Psychopathy Measure: An item response theory approach.

Science.gov (United States)

Shou, Yiyun; Sellbom, Martin; Xu, Jing

2018-05-01

There is cumulative evidence for the cross-cultural validity of the Triarchic Psychopathy Measure (TriPM; Patrick, 2010) among non-Western populations. Recent studies using correlational and regression analyses show promising construct validity of the TriPM in Chinese samples. However, little is known about the efficiency of items in TriPM in assessing the proposed latent traits. The current study evaluated the psychometric properties of the Chinese TriPM at the item level using item response theory analyses. It also examined the measurement invariance of the TriPM between the Chinese and the U.S. student samples by applying differential item functioning analyses under the item response theory framework. The results supported the unidimensional nature of the Disinhibition and Meanness scales. Both scales had a greater level of precision in the respective underlying constructs at the positive ends. The two scales, however, had several items that were weakly associated with their respective latent traits in the Chinese student sample. Boldness, on the other hand, was found to be multidimensional, and reflected a more normally distributed range of variation. The examination of measurement bias via differential item functioning analyses revealed that a number of items of the TriPM were not equivalent across the Chinese and the U.S. samples. Some modification and adaptation of items might be considered for improving the precision of the TriPM for Chinese participants. (PsycINFO Database Record (c) 2018 APA, all rights reserved).
The End-Stage Renal Disease Adherence Questionnaire (ESRD-AQ): testing the psychometric properties in patients receiving in-center hemodialysis.

OpenAIRE

Kim, Y; Evangelista, LS; Phillips, LR; Pavlish, C; Kopple, JD

2010-01-01

Reported treatment adherence rates of patients with end stage renal disease (ESRD) have been extremely varied due to lack of reliable and valid measurement tools. This study was conducted to develop and test an instrument to measure treatment adherence to hemodialysis (HD) attendance, medications, fluid restrictions, and diet prescription among patients with ESRD. This article describes the methodological approach used to develop and test the psychometric properties (such as reliability and v...
A Psychometric Review of Measures Assessing Discrimination Against Sexual Minorities.

Science.gov (United States)

Morrison, Todd G; Bishop, C J; Morrison, Melanie A; Parker-Taneo, Kandice

2016-08-01

Discrimination against sexual minorities is widespread and has deleterious consequences on victims' psychological and physical wellbeing. However, a review of the psychometric properties of instruments measuring lesbian, gay, and bisexual (LGB) discrimination has not been conducted. The results of this review, which involved evaluating 162 articles, reveal that most have suboptimal psychometric properties. Specifically, myriad scales possess questionable content validity as (1) items are not created in collaboration with sexual minorities; (2) measures possess a small number of items and, thus, may not sufficiently represent the domain of interest; and (3) scales are "adapted" from measures designed to examine race- and gender-based discrimination. Additional limitations include (1) summed scores are computed, often in the absence of scale score reliability metrics; (2) summed scores operate from the questionable assumption that diverse forms of discrimination are necessarily interrelated; (3) the dimensionality of instruments presumed to consist of subscales is seldom tested; (4) tests of criterion-related validity are routinely omitted; and (5) formal tests of measures' construct validity are seldom provided, necessitating that one infer validity based on the results obtained. The absence of "gold standard" measures, the attendant difficulty in formulating a coherent picture of this body of research, and suggestions for psychometric improvements are noted.
Psychometric validation of the Hopkins Symptom Checklist (SCL-90) subscales for depression, anxiety, and interpersonal sensitivity

DEFF Research Database (Denmark)

Bech, P; Bille, J; Møller, S B

2014-01-01

BACKGROUND: The psychometric validity of many subscales of the 90-item Hopkins Symptom Checklist (SCL-90) remains largely unknown. Therefore, the aim of the present study was to evaluate the psychometric properties of the "Hamilton-subscales" for depression (SCL-D16), anxiety (SCL-A14), their 6......-item core-measures (SCL-D6 and SCL-A6), the anxiety symptom scale (SCL-ASS8) and the interpersonal sensitivity scale (IPS5). METHODS: The psychometric properties of the SCL-D16, SCL-A14, SCL-D6, SCL-A6, SCL-ASS8, and the IPS5 were evaluated based on SCL-90 ratings from 850 day patients from a Danish...... SCL-90 subscales were identified. Using these scales it is possible to perform a psychometrically valid evaluation of psychiatric patients regarding the severity of depression (HAM-D6), specific anxiety (SCL-ASS8) and interpersonal sensitivity (IPS5)....
Normalization of the psychometric hepatic encephalopathy score for ...

African Journals Online (AJOL)

Aim: To construct normal values for the tests of the psychometric hepatic encephalopathy score (PHES) and evaluate the prevalence of minimal hepatic encephalopathy (MHE) among Turkish patients with liver cirrhosis. Materials and Methods: One hundred and eighty-five healthy subjects and sixty patients with liver ...
Evaluation of Psychometric Properties of the Malay Version ...

African Journals Online (AJOL)

Evaluation of Psychometric Properties of the Malay Version Perceived Stress Scale in Two Occupational Settings In Malaysia. ... Statistical analysis was carried out using statistical package for the social sciences version 16 (SPSS, Chicago, IL, USA) software. Results: Analysis yielded two factor structure of the Malay version ...
Developing and psychometric of an instrument for reproductive ...

African Journals Online (AJOL)

Background: Due to the socio-cultural characteristics of Iranian adult men and the lack of standardized questionnaires to assess their reproductive health in relation to sexually transmitted diseases and HIV/AIDS, this study aimed to develop and psychometrically evaluate a valid, relevant instrument. Method: A ...
Measuring the impact and distress of health problems from the individual's perspective: development of the Perceived Impact of Problem Profile (PIPP)

Science.gov (United States)

Pallant, Julie F; Misajon, RoseAnne; Bennett, Elizabeth; Manderson, Lenore

2006-01-01

Background The aim of this study was to develop and conduct preliminary validation of the Perceived Impact of Problem Profile (PIPP). Based on the biopsychosocial model of health and functioning, the PIPP was intended as a generic research and clinical measurement tool to assess the impact and distress of health conditions from the individuals' perspective. The ICF classification system was used to guide the structure of the PIPP with subscales included to assess impact on self-care, mobility, participation, relationships and psychological well-being. While the ICF focuses on the classification of objective health and health related status, the PIPP broadens this focus to address the individuals' subjective experience of their health condition. Methods An item pool of 23 items assessing both impact and distress on five key domains was generated. These were administered to 169 adults with mobility impairment. Rasch analysis using RUMM2020 was conducted to assess the psychometric properties of each set of items. Preliminary construct validation of the PIPP was performed using the EQ5D. Results For both the Impact and Distress scales of the PIPP, the five subscales (Self-care, Mobility, Participation, Relationships, and Psychological Well-being) showed adequate psychometric properties, demonstrating fit to the Rasch model. All subscales showed adequate person separation reliability and no evidence of differential item functioning for sex, age, educational level or rural vs urban residence. Preliminary validity testing using the EQ5D items provided support for the subscales. Conclusion This preliminary study, using a sample of adults with mobility impairment, provides support for the psychometric properties of the PIPP as a potential clinical and research measurement tool. The PIPP provides a brief, but comprehensive means to assess the key ICF components, focusing on the individuals' perspective of the impact and distress caused by their health condition. Further
Measuring the impact and distress of health problems from the individual's perspective: development of the Perceived Impact of Problem Profile (PIPP)

Directory of Open Access Journals (Sweden)

Bennett Elizabeth

2006-06-01

Full Text Available Abstract Background The aim of this study was to develop and conduct preliminary validation of the Perceived Impact of Problem Profile (PIPP). Based on the biopsychosocial model of health and functioning, the PIPP was intended as a generic research and clinical measurement tool to assess the impact and distress of health conditions from the individuals' perspective. The ICF classification system was used to guide the structure of the PIPP with subscales included to assess impact on self-care, mobility, participation, relationships and psychological well-being. While the ICF focuses on the classification of objective health and health related status, the PIPP broadens this focus to address the individuals' subjective experience of their health condition. Methods An item pool of 23 items assessing both impact and distress on five key domains was generated. These were administered to 169 adults with mobility impairment. Rasch analysis using RUMM2020 was conducted to assess the psychometric properties of each set of items. Preliminary construct validation of the PIPP was performed using the EQ5D. Results For both the Impact and Distress scales of the PIPP, the five subscales (Self-care, Mobility, Participation, Relationships, and Psychological Well-being) showed adequate psychometric properties, demonstrating fit to the Rasch model. All subscales showed adequate person separation reliability and no evidence of differential item functioning for sex, age, educational level or rural vs urban residence. Preliminary validity testing using the EQ5D items provided support for the subscales. Conclusion This preliminary study, using a sample of adults with mobility impairment, provides support for the psychometric properties of the PIPP as a potential clinical and research measurement tool. The PIPP provides a brief, but comprehensive means to assess the key ICF components, focusing on the individuals' perspective of the impact and distress caused by their

Cultural adaptation and validation of Stroke Impact Scale 3.0 version in Uganda: A small-scale study.

Science.gov (United States)

Kamwesiga, Julius T; von Koch, Lena; Kottorp, Anders; Guidetti, Susanne

2016-01-01

Knowledge is scarce about the impact of stroke in Uganda, and culturally adapted, psychometrically tested patient-reported outcome measures are lacking. The Stroke Impact Scale 3.0 is recommended, but it has not been culturally adapted and validated in Uganda. To culturally adapt and determine the psychometric properties of the Stroke Impact Scale 3.0 in the Ugandan context on a small scale. The Stroke Impact Scale 3.0 was culturally adapted to form Stroke Impact Scale 3.0 Uganda (in English) by involving 25 participants in three different expert committees. Subsequently, translation of Stroke Impact Scale 3.0 Uganda from English to Luganda was done in accordance with guidelines. The first language in Uganda is English, and Luganda is the main spoken language in Kampala city and its surroundings. Stroke Impact Scale 3.0 Uganda (in both English and Luganda) was then tested psychometrically by applying a Rasch model to data collected from 95 participants with stroke. Overall, 10 of 59 (17%) items in the eight domains of the Stroke Impact Scale 3.0 were culturally adapted. The majority were in the domain Activities of Daily Living (6 of its 10 items), with 2 of 9 items in the domain Mobility and 2 of 5 items in the domain Hand function. In only two domains did all items demonstrate acceptable goodness of fit to the Rasch model. There were also more than 5% person misfits in the domains Participation and Emotion, while the Communication, Mobility, and Hand function domains had the lowest proportions of person misfits. The reliability coefficient was equal to or larger than 0.90 in all domains except the Emotion domain, where it (0.75) was below the set criterion of 0.80. The cultural adaptation and translation of Stroke Impact Scale 3.0 Uganda provides initial evidence of validity of the Stroke Impact Scale 3.0 when used in this context. The results provide support for several aspects of validity and precision but also point out issues for further adaptation and improvement
Pharmacy students' opinions of direct-to-consumer advertising: a pilot study at one university.

Science.gov (United States)

Harrington, Amanda R; Desselle, Shane P; Apgar, David A; Hesselbacher, Elizabeth; Pié, Aaron; Quesnel, Aimee; Warholak, Terri L

2013-01-01

Direct-to-consumer advertisement (DTCA) of prescription medications has become an important informational source for health care consumers. As future health care professionals on the front line of potential communication and dispensing of products emerging from DTCA, it is important to elicit the attitudes of student-pharmacists. This study aims to (1) evaluate the validity of the DTCA attitudinal questionnaire using Rasch rating scale analysis and (2) investigate the attitudes of pharmacy students toward DTCA and determine whether these attitudes were associated with years of pharmacy education and demographic characteristics. This investigation used a cross-sectional print-based questionnaire to evaluate the attitudes of pharmacy students toward DTCA of prescription medications. The 16-item questionnaire included items addressing the attitudes of pharmacy students toward DTCA with respect to patients' knowledge of medications, pharmacists' interaction with patients, and overall consumer judgment of medical prescriptions. Analyses included Rasch analysis and a multiple linear regression. A total of 243 students submitted usable questionnaires (85% response rate). Item response categories were collapsed from 5 categories to 3, and 4 items were removed to achieve acceptable Rasch model fit. Pharmacy students demonstrated little difficulty in agreeing with the statements suggesting that DTCA helps patients take a more active role in health care and had the most difficulty in agreeing with items suggesting that DTCA may lead to inappropriate prescribing to satisfy patient requests. Students' overall support for DTCA was the only variable that predicted the questionnaire score (P<.001). In conclusion, the Rasch analysis evaluated the psychometric properties of the instrument and identified the necessity to adapt the questionnaire from previous iterations to adequately fit the student population. Future research should examine factors that contribute to the variance in
An evaluation of the HM prison service "thinking skills programme" using psychometric assessments.

Science.gov (United States)

Gobbett, Matthew J; Sellen, Joselyn L

2014-04-01

The most widely implemented offending behaviour programme in the United Kingdom was Enhanced Thinking Skills (ETS), a cognitive-behavioural group intervention that aimed to develop participants' general cognitive skills. A new offending behaviour programme has been developed to replace ETS: the Thinking Skills Programme (TSP). This study reports an evaluation of the effectiveness of TSP using psychometric assessments. Phasing of the two programmes created an opportunity to compare the two programmes consecutively. Forty participants, 20 from each programme, completed a range of psychometric measures to examine cognition, attitudes, and thinking styles. Analysis of pre- and post-programme psychometric results indicated that participants of TSP demonstrated improvements on 14 of the 15 scales, 9 of which were statistically significant. Effect sizes between pre- and post-programme results were generally greater for TSP than ETS, demonstrating that TSP had a more positive impact on the thinking styles and attitudes of participants than the ETS programme.
A psychometric evaluation of the digital logic concept inventory

Science.gov (United States)

Herman, Geoffrey L.; Zilles, Craig; Loui, Michael C.

2014-10-01

Concept inventories hold tremendous promise for promoting the rigorous evaluation of teaching methods that might remedy common student misconceptions and promote deep learning. The measurements from concept inventories can be trusted only if the concept inventories are evaluated both by expert feedback and statistical scrutiny (psychometric evaluation). Classical Test Theory and Item Response Theory provide two psychometric frameworks for evaluating the quality of assessment tools. We discuss how these theories can be applied to assessment tools generally and then apply them to the Digital Logic Concept Inventory (DLCI). We demonstrate that the DLCI is sufficiently reliable for research purposes when used in its entirety and as a post-course assessment of students' conceptual understanding of digital logic. The DLCI can also discriminate between students across a wide range of ability levels, providing the most information about weaker students' ability levels.
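As a rough illustration of the Classical Test Theory side of such an evaluation, the sketch below computes Cronbach's alpha, a common internal-consistency coefficient. The abstract does not state which reliability statistic was used for the DLCI, so the choice of alpha, the function name `cronbach_alpha`, and the tiny score matrix are all assumptions made for illustration only.

```python
from statistics import pvariance


def cronbach_alpha(scores):
    """Cronbach's alpha for a list of respondents, each a list of item scores.

    alpha = k / (k - 1) * (1 - sum(item variances) / variance of total scores)
    """
    k = len(scores[0])  # number of items
    item_variances = [pvariance([row[j] for row in scores]) for j in range(k)]
    total_variance = pvariance([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_variances) / total_variance)


# Hypothetical 0/1 responses (rows: students, columns: items); not DLCI data.
responses = [
    [1, 1, 1, 0],
    [1, 0, 1, 0],
    [0, 0, 1, 0],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
]
alpha = cronbach_alpha(responses)
```

For dichotomously scored items such as these, alpha coincides with the KR-20 coefficient. Population variances are used throughout; using sample variances instead gives the same ratio and therefore the same alpha.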
Motor assessment instruments and psychometric procedures: A systematic review

Directory of Open Access Journals (Sweden)

Pâmella de Medeiros

2017-03-01

Full Text Available Our objective was to identify the psychometric elements for an epistemological reflection through a systematic review of cross-cultural validation procedures for the TGMD-2, MABC-2, and KTK batteries. Two evaluators independently carried out searches, without year or language restrictions, in six databases: Web of Science, Science Direct, Lilacs, Scopus, Pubmed, and The Scientific Electronic Library Online - SciELO. The key words used were "MABC", "TGMD", and "KTK", each combined with the word "validity". A total of 734 articles were found, of which only 11 studies remained after the exclusion criteria were applied. The authors differed in the psychometric factors they took into account in cross-cultural validation, so the studies in this field lacked unanimity on validation criteria.
A biologically inspired psychometric function for accuracy of visual identification as a function of exposure duration

DEFF Research Database (Denmark)

Petersen, Anders; Andersen, Tobias

, all of these having a temporal offset included, as well as the ex-Gaussian, and finally a new psychometric function, motivated from single-neuron studies by (Albrecht, Geisler, Frazor & Crane, 2002). The new psychometric function stands out by having a nonmonotonous hazard rate which is initially...
Validity, Reliability, and the Questionable Role of Psychometrics in Plastic Surgery

Science.gov (United States)

2014-01-01

Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity. PMID:25289354
Validity, Reliability, and the Questionable Role of Psychometrics in Plastic Surgery

Directory of Open Access Journals (Sweden)

Eric Swanson, MD

2014-06-01

Full Text Available Summary: This report examines the meaning of validity and reliability and the role of psychometrics in plastic surgery. Study titles increasingly include the word “valid” to support the authors’ claims. Studies by other investigators may be labeled “not validated.” Validity simply refers to the ability of a device to measure what it intends to measure. Validity is not an intrinsic test property. It is a relative term most credibly assigned by the independent user. Similarly, the word “reliable” is subject to interpretation. In psychometrics, its meaning is synonymous with “reproducible.” The definitions of valid and reliable are analogous to accuracy and precision. Reliability (both the reliability of the data and the consistency of measurements) is a prerequisite for validity. Outcome measures in plastic surgery are intended to be surveys, not tests. The role of psychometric modeling in plastic surgery is unclear, and this discipline introduces difficult jargon that can discourage investigators. Standard statistical tests suffice. The unambiguous term “reproducible” is preferred when discussing data consistency. Study design and methodology are essential considerations when assessing a study’s validity.
Rasch measurement of self-regulated learning in an information and communication technology (ICT)-rich environment.

Science.gov (United States)

Njiru, Joseph N; Waugh, Russell F

2007-01-01

This report describes how a linear scale of self-regulated learning in an ICT-rich environment was created by analysing student data using the Rasch measurement model. A person convenience sample of university students in Western Australia (N = 409) was used. The stem-item sample was initially 41, answered in two perspectives ("I aim for this" and "I actually do this"), and reduced to 16 that fitted the measurement model to form a unidimensional scale. Items for motivation (extrinsic rewards, intrinsic rewards, and social rewards), academic goals (fear of performing poorly) (but not standards), self-learning beliefs (ability and interest), task management (strategies and time management) (but not cooperative learning), volition (action control) (but not environmental control), and self-evaluation (cognitive self-evaluation and metacognition) fitted the measurement model. The proportion of observed variance considered true was 0.90. A new instrument is proposed to handle the conceptually valid but non-fitting items. Characteristics of high self-regulated learners are measured.
Assessment of minimal hepatic encephalopathy (with emphasis on computerized psychometric tests).

Science.gov (United States)

Kappus, Matthew R; Bajaj, Jasmohan S

2012-02-01

Minimal hepatic encephalopathy (MHE) is associated with a high risk of development of overt hepatic encephalopathy, impaired quality of life, and driving accidents. The detection of MHE requires specialized testing because it cannot, by definition, be diagnosed on standard clinical examination. Psychometric and neurophysiologic techniques are often used to test for MHE. Paper-pencil psychometric batteries and computerized tests have proved useful in diagnosing MHE and predicting its outcomes. Neurophysiologic tests also provide useful information. The diagnosis of MHE is an important issue for clinicians and patients alike. Testing strategies depend on the normative data available, patient comfort, and local expertise. Copyright © 2012 Elsevier Inc. All rights reserved.
Development of a Computerized Adaptive Test of Children's Gross Motor Skills.

Science.gov (United States)

Huang, Chien-Yu; Tung, Li-Chen; Chou, Yeh-Tai; Wu, Hing-Man; Chen, Kuan-Lin; Hsieh, Ching-Lin

2018-03-01

To (1) develop a computerized adaptive test for gross motor skills (GM-CAT) as a diagnostic test and an outcome measure, using the gross motor skills subscale of the Comprehensive Developmental Inventory for Infants and Toddlers (CDIIT-GM) as the candidate item bank; and (2) examine the psychometric properties and the efficiency of the GM-CAT. Retrospective study. A developmental center of a medical center. Children with and without developmental delay (N=1738). Not applicable. The CDIIT-GM contains 56 universal items on gross motor skills assessing children's antigravity control, locomotion, and body movement coordination. The item bank of the GM-CAT had 44 items that met the dichotomous Rasch model's assumptions. High Rasch person reliabilities were found for each estimated gross motor skill for the GM-CAT (Rasch person reliabilities =.940-.995, SE=.68-2.43). For children aged 6 to 71 months, the GM-CAT had good concurrent validity (r values =.97-.98), adequate to excellent diagnostic accuracy (area under receiver operating characteristics curve =.80-.98), and moderate to large responsiveness (effect size =.65-5.82). The averages of items administered for the GM-CAT were 7 to 11, depending on the age group. The results of this study support the use of the GM-CAT as a diagnostic and outcome measure to estimate children's gross motor skills in both research and clinical settings. Copyright © 2017 American Congress of Rehabilitation Medicine. Published by Elsevier Inc. All rights reserved.
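To make the mechanics behind a Rasch-based adaptive test more concrete, here is a minimal sketch of the dichotomous Rasch model and one common item-selection rule, choosing the unused item with maximum Fisher information at the current ability estimate. The abstract does not say which selection rule the GM-CAT uses, so that rule, the function names, and the item difficulties below are assumptions for illustration and are not taken from the CDIIT-GM item bank.

```python
import math


def rasch_probability(theta, difficulty):
    """Probability of passing an item under the dichotomous Rasch model."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))


def item_information(theta, difficulty):
    """Fisher information of a Rasch item at ability theta: p * (1 - p)."""
    p = rasch_probability(theta, difficulty)
    return p * (1.0 - p)


def select_next_item(theta, difficulties, administered):
    """Index of the not-yet-administered item with maximum information."""
    candidates = [i for i in range(len(difficulties)) if i not in administered]
    return max(candidates, key=lambda i: item_information(theta, difficulties[i]))


# Hypothetical item difficulties in logits (illustrative only).
difficulties = [-2.0, -1.0, 0.0, 1.0, 2.0]
# Current ability estimate 0.8; the item at index 3 has already been given.
next_item = select_next_item(theta=0.8, difficulties=difficulties, administered={3})
```

Because Rasch item information peaks where difficulty equals ability, this rule always picks the remaining item whose difficulty lies closest to the current estimate, which is one reason an adaptive test can reach a stable estimate with far fewer items than the full bank.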
Scenes for Social Information Processing in Adolescence: Item and factor analytic procedures for psychometric appraisal.

Science.gov (United States)

Vagos, Paula; Rijo, Daniel; Santos, Isabel M

2016-04-01

Relatively little is known about measures used to investigate the validity and applications of social information processing theory. The Scenes for Social Information Processing in Adolescence includes items built using a participatory approach to evaluate the attribution of intent, emotion intensity, response evaluation, and response decision steps of social information processing. We evaluated a sample of 802 Portuguese adolescents (61.5% female; mean age = 16.44 years old) using this instrument. Item analysis and exploratory and confirmatory factor analytic procedures were used for psychometric examination. Two measures for attribution of intent were produced, including hostile and neutral; along with 3 emotion measures, focused on negative emotional states; 8 response evaluation measures; and 4 response decision measures, including prosocial and impaired social behavior. All of these measures achieved good internal consistency values and fit indicators. Boys seemed to favor and choose overt and relational aggression behaviors more often; girls conveyed higher levels of neutral attribution, sadness, and assertiveness and passiveness. The Scenes for Social Information Processing in Adolescence achieved adequate psychometric results and seems a valuable alternative for evaluating social information processing, even if it is essential to continue investigation into its internal and external validity. (c) 2016 APA, all rights reserved.
The validity of self-rating depression scales in patients with chronic widespread pain

DEFF Research Database (Denmark)

Amris, Kirstine; Omerovic, Emina; Danneskiold-Samsøe, Bente

2016-01-01

BACKGROUND: Assessment of depression in chronic pain patients by self-rating questionnaires developed and validated for use in normal and/or psychiatric populations is common. The aim of this study was to evaluate the psychometric properties of the Major Depression Inventory (MDI) in a sample of ...... and further aspects of validity, including fit of individual scale items to a unidimensional model indicating assessment of a single construct (depression), as a prerequisite for measurement. RESULTS: The Rasch analysis revealed substantial problems with the rating scale properties of the MDI and lack ...... core of pain-related somatic symptoms. Careful consideration when interpreting questionnaire-derived scores of depression implemented in research and routine clinical care of patients with chronic pain is warranted.
The Schizotypic Syndrome Questionnaire (SSQ): Psychometrics, validation and norms.

NARCIS (Netherlands)

van Kampen, D.

2006-01-01

This paper examines the psychometric properties (reliability and factor structure) and validity (relationship with various self-report measures and SPEM dysfunction) of the SSQ or Schizotypic Syndrome Questionnaire, a 108-item inventory for the measurement of 12 prodromal or schizotypic symptoms
Assessing leadership decision-making styles: psychometric properties of the Leadership Judgement Indicator

Directory of Open Access Journals (Sweden)

Faraci P

2013-10-01

Full Text Available Palmira Faraci (1), Michael Lock (2), Robert Wheeler (2). (1) Faculty of Human and Social Sciences, University of Enna “Kore”, Enna, Italy; (2) Formula 4 Leadership Limited, Nottingham, UK. Abstract: This study aimed to validate the Italian version of the Leadership Judgement Indicator, an unconventional instrument devoted to measurement of leaders' judgments and preferred styles, i.e., directive, consultative, consensual, or delegative, when dealing with a range of decision-making scenarios. After forward-translation and back-translation, its psychometric properties were estimated for 299 managers at various levels, who were asked to put themselves in the position of leader and to rate the appropriateness of certain ways of responding to challenge. Differences between several groups of managers, ranked in order of seniority, provided evidence for discriminant validity. Internal consistency was adequate. The findings show that the Italian adaptation of the Leadership Judgement Indicator has promising psychometric qualities, suggesting its suitability for use to improve outcomes in both organizational and selection settings. Keywords: Leadership Judgement Indicator, decision-making, situational test, scenarios, psychometric properties
Development and Validation of a Multimedia-based Assessment of Scientific Inquiry Abilities

Science.gov (United States)

Kuo, Che-Yu; Wu, Hsin-Kai; Jen, Tsung-Hau; Hsu, Ying-Shao

2015-09-01

The potential of computer-based assessments for capturing complex learning outcomes has been discussed; however, relatively little is understood about how to leverage such potential for summative and accountability purposes. The aim of this study is to develop and validate a multimedia-based assessment of scientific inquiry abilities (MASIA) to cover a more comprehensive construct of inquiry abilities and target secondary school students in different grades while this potential is leveraged. We implemented five steps derived from the construct modeling approach to design MASIA. During the implementation, multiple sources of evidence were collected in the steps of pilot testing and Rasch modeling to support the validity of MASIA. Particularly, through the participation of 1,066 8th and 11th graders, MASIA showed satisfactory psychometric properties to discriminate students with different levels of inquiry abilities in 101 items in 29 tasks when Rasch models were applied. Additionally, the Wright map indicated that MASIA offered accurate information about students' inquiry abilities because of the comparability of the distributions of student abilities and item difficulties. The analysis results also suggested that MASIA offered precise measures of inquiry abilities when the components (questioning, experimenting, analyzing, and explaining) were regarded as a coherent construct. Finally, the increased mean difficulty thresholds of item responses along with three performance levels across all sub-abilities supported the alignment between our scoring rubrics and our inquiry framework. Together with other sources of validity in the pilot testing, the results offered evidence to support the validity of MASIA.
Disruptive behavior scale for adolescents (DISBA): development and psychometric properties.

Science.gov (United States)

Karimy, Mahmood; Fakhri, Ahmad; Vali, Esmaeel; Vali, Farzaneh; Veiga, Feliciano H; Stein, L A R; Araban, Marzieh

2018-01-01

Growing evidence indicates that if disruptive behavior is left unidentified and untreated, a significant proportion of these problems will persist and may develop into problems linked with delinquency, substance abuse, and violence. Research is needed to develop valid and reliable measures of disruptive behavior to assist in recognizing it and in assessing the impact of treatments. The aim of this study was to develop and evaluate the psychometric properties of a scale for disruptive behavior in adolescents. Six hundred high school students (50% girls), aged 15-18 years, were selected through multistage random sampling. The psychometrics of the disruptive behavior scale for adolescents (DISBA) (Persian version) were assessed through content validity, exploratory factor analysis (EFA) using Varimax rotation, and confirmatory factor analysis (CFA). The reliability of this scale was assessed via internal consistency and test-retest reliability. EFA revealed four factors accounting for 59% of observed variance. The final 29-item scale contained four factors: (1) aggressive school behavior, (2) classroom defiant behavior, (3) unimportance of school, and (4) defiance to school authorities. Furthermore, CFA yielded an adequate Goodness of Fit Index (> 0.90). Test-retest and internal consistency reliabilities were acceptable at 0.85 and 0.89, respectively. The findings from this study suggest that the Iranian version of the DISBA questionnaire has content validity. Further studies are needed to evaluate stronger psychometric properties for DISBA.
The Youth Psychopathic Traits Inventory: Measurement Invariance and Psychometric Properties among Portuguese Youths

Directory of Open Access Journals (Sweden)

Pedro Pechorro

2016-08-01

Full Text Available The aim of the present study was to examine the psychometric properties of the Youth Psychopathic Traits Inventory (YPI) among a mixed-gender sample of 782 Portuguese youth (M = 15.87 years; SD = 1.72), in a school context. Confirmatory factor analysis revealed the expected three-factor first-order structure. Cross-gender measurement invariance and cross-sample measurement invariance using a forensic sample of institutionalized males were also confirmed. The Portuguese version of the YPI demonstrated generally adequate psychometric properties of internal consistency, mean inter-item correlation, convergent validity, discriminant validity, and criterion-related validity of statistically significant associations with conduct disorder symptoms, alcohol abuse, drug use, and unprotected sex. In terms of known-groups validity, males scored higher than females, and males from the school sample scored lower than institutionalized males. The use of the YPI among the Portuguese male and female youth population is psychometrically justified, and it can be a useful measure to identify adolescents with high levels of psychopathic traits.
Psychometric Properties of the Depression Anxiety and Stress Scale-21 in Older Primary Care Patients

OpenAIRE

Gloster, Andrew T.; Rhoades, Howard M.; Novy, Diane; Klotsche, Jens; Senior, Ashley; Kunik, Mark; Wilson, Nancy; Stanley, Melinda A.

2008-01-01

The Depression Anxiety Stress Scale (DASS) was designed to efficiently measure the core symptoms of anxiety and depression and has demonstrated positive psychometric properties in adult samples of anxiety and depression patients and student samples. Despite these findings, the psychometric properties of the DASS remain untested in older adults, for whom the identification of efficient measures of these constructs is especially important.
Psychometric Assessment of Stereoscopic Head-Mounted Displays

Science.gov (United States)

2016-06-29

Journal article; dates covered: Jan 2015 - Dec 2015. ...to render an immersive three-dimensional constructive environment. The purpose of this effort was to quantify the impact of aircrew vision on an...simulated tasks requiring precise depth discrimination. This work will provide an example validation method for future stereoscopic virtual immersive
